Saturday, December 29, 2012

Daily Bookmarks 20121229

waue0920
https://sites.google.com/site/waue0920/
python通过thrift来操作hbase
http://abloz.com/2012/06/01/python-operating-hbase-thrift-to.html
Flask-Login 0.1.3 : Python Package Index
http://pypi.python.org/pypi/Flask-Login
9.7. Regions
http://hbase.apache.org/book/regions.arch.html
flask-openid/example/example.py at master · mitsuhiko/flask-openid · GitHub
https://github.com/mitsuhiko/flask-openid/blob/master/example/example.py

大家用 git rebase 频繁吗? » 社区 | Ruby China
http://ruby-china.org/topics/112
"
Rei 2楼, 1年前 喜欢
#1楼 @huacnlee 我这因为一个新功能可能要开发2周,要两个人协作,所以需要经常push到代码库备份。而这两个人如果经常 pull 的话,就会出现很多
Merge branch 'aaa' of git.xxx.xxx/xxx/xxx into aaa
这样的 merge commit,这些 commit 在审阅代码的时候很干扰,如果 pull 得频繁就会变成波浪状"
构造干净的 Git 历史线索 - 话题 - CodeCampo
http://codecampo.com/topics/379#replies-4
《看日记学git》之随笔笔记 | Linux大棚
http://roclinux.cn/?p=1239

Maven入门--概念与实例 [转贴] - zhouyaoh - ITeye技术网站
http://zhouyaoh.iteye.com/blog/56961
分布式计算平台Hadoop 发展现状乱而稳定的解读 | MySQLOPS 数据库与运维自动化技术分享
http://www.mysqlops.com/2012/05/02/hadoop-apache.html

HBase 官方文档 0.95
http://abloz.com/hbase/book.html#datamodel
HTable (HBase 0.95-SNAPSHOT API)
http://hbase.apache.org/apidocs/index.html?org/apache/hadoop/hbase/client/HTable.html
Get (HBase 0.95-SNAPSHOT API)
http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/client/Get.html#setMaxVersions(int)
ResultScanner (HBase 0.95-SNAPSHOT API)
http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/client/ResultScanner.html
HTable (HBase 0.95-SNAPSHOT API)
http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/client/HTable.html#get%28org.apache.hadoop.hbase.client.Get%29
hbase java 读写实例
http://abloz.com/2012/06/07/hbase-the-java-to-read-and-write-instance.html
《看日记学git》之随笔笔记 | Linux大棚
http://roclinux.cn/?p=1239

















Friday, December 28, 2012

Daily Bookmarks 20121228


[教學] Git使用教學 Part 2 -- 控制版本 - tails板 - Disp BBS
http://disp.cc/b/546-3iGz
[教學] Git使用教學 Part 3 -- 多人作業 - tails板 - Disp BBS
http://disp.cc/b/546-3iGz#!546-3j0H



A successful Git branching model » nvie.com
http://nvie.com/posts/a-successful-git-branching-model/

Git 版本控制系統(2) 開 branch 分支和操作遠端 repo. | ihower { blogging }
http://ihower.tw/blog/archives/2620/

Git Tutorial 教學

http://www.slideshare.net/ihower/git-tutorial-13695342

On feature-branches and pull-requests - Alexey Zakhlestin's Blog
http://blog.milkfarmsoft.com/2011/03/on-feature-branches/


Visualized Git best practices for team: branch, merge, rebase
http://kentnguyen.com/development/visualized-git-practices-for-team/

Wetware » Pull A Git Branch from Remote ???
http://www.wetware.co.nz/2009/07/pull-a-git-branch-from-remote/

Git Study
http://josephj.com/entry.php?id=308
Tutorial on Git / 白話文Git教學 | Thoy's
http://thoy.blog.ntu.edu.tw/2011/05/01/tutorial-on-git-%E7%99%BD%E8%A9%B1%E6%96%87git%E6%95%99%E5%AD%B8/

How to Push a New Local Branch to a Remote Git Repository
http://www.mariopareja.com/blog/archive/2010/01/11/how-to-push-a-new-local-branch-to-a-remote.aspx










Thursday, December 27, 2012

Daily Bookmarks 20121227


HBase之 HTable线程不安全 - 天行健 - ITeye技术网站
http://punishzhou.iteye.com/blog/1288163
HBase scan的客户端分析 - 天行健 - ITeye技术网站
http://punishzhou.iteye.com/blog/1297015
HBase splitlog 过程 - 天行健 - ITeye技术网站
http://punishzhou.iteye.com/blog/1286217
Hbase的log管理(一) - 天行健 - ITeye技术网站
http://punishzhou.iteye.com/blog/1286230
HBase的get过程(一) - 天行健 - ITeye技术网站
http://punishzhou.iteye.com/blog/1258848
HBase的数据的update - 天行健 - ITeye技术网站
http://punishzhou.iteye.com/blog/1266341
How does Memstore in HBase update records? - Stack Overflow
http://stackoverflow.com/questions/11514010/how-does-memstore-in-hbase-update-records
Configuring HBase Memstore: What You Should Know « Sematext Blog
http://blog.sematext.com/2012/07/16/hbase-memstore-what-you-should-know/
Presentation: Intro to HBase Internals and Schema Design « Sematext Blog
http://blog.sematext.com/2012/07/09/intro-to-hbase-internals-and-schema-desig/

http://blog.xuite.net/misgarlic/weblogic/63779746-Java+Service+Wrapper

程式搖滾: 簡易的git操作使用
http://ozzysun.blogspot.tw/2012/09/git.html
Git remote add - Middle Earth
http://blog.zhangsen.org/2011/03/git-remote-add.html





Daily Bookmarks 20121226

套件(package)
http://caterpillar.onlyfun.net/Gossip/JavaGossip-V1/Package.htm
程式語言教學誌: Java 快速導覽 - 套件
http://pydoing.blogspot.tw/2011/01/java-package.html
如何理解和使用Java package包 - Fisher的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/zhenyusoso/article/details/6174834





Tuesday, December 25, 2012

Daily Bookmarks 20121225


Apache Pig 簡介 « Fenriswolf 程式筆記
http://fenriswolf.me/2012/03/22/apache-pig-%E7%B0%A1%E4%BB%8B/
Xen的Memory問題 - iT邦幫忙::IT知識分享社群
http://ithelp.ithome.com.tw/question/10058271
Java元數據介面 - 維基百科,自由的百科全書
http://zh.wikipedia.org/wiki/Java%E5%85%83%E6%95%B0%E6%8D%AE%E6%8E%A5%E5%8F%A3




小紅帽技術論壇 - IP Address range 的設計?
http://redhat.ecenter.idv.tw/bbs/showthread.php?postid=128955

Monday, December 24, 2012

Saturday, December 22, 2012

Daily Bookmarks 20121222

Building an Inverted Index with Hadoop and Pig « SquareCog's SquareBlog
http://squarecog.wordpress.com/2009/01/17/building-an-inverted-index-with-hadoop-and-pig/
Indexing and Searching on a Hadoop Distributed File System | Dr Dobb's
http://www.drdobbs.com/parallel/indexing-and-searching-on-a-hadoop-distr/226300241

Data Recipes: Bulk Indexing With ElasticSearch and Hadoop
http://thedatachef.blogspot.tw/2011/01/bulk-indexing-with-elasticsearch-and.html
Using Python and Hadoop streaming to build an inverted index « Panagiotis Matigakis
http://pmatigakis.wordpress.com/2011/09/14/using-python-and-hadoop-streaming-to-build-an-inverted-index/

Flume的优化
http://www.ipinyou.com.cn/technicalnews/201112/Technical_6.html
java.lang.OutOfMemoryError: Java heap space解决方法 - jq_diy的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/jq_diy/article/details/4154132

logstash | Everything is a Freaking DNS problem
http://www.krisbuytaert.be/blog/category/logstash

Flume日志收集 - 残夜 - 博客园
http://www.cnblogs.com/oubo/archive/2012/05/25/2517751.html
NCHCCloudCourse100204 – Cloud Computing
http://trac.nchc.org.tw/cloud/wiki/NCHCCloudCourse100204

Java的Package與Import機制之我的理解(初學者的心得)-WEBASP.NET
http://big5.webasp.net/article/23/22119.htm


http://big5.webasp.net/article/23/22119.htm

Friday, December 21, 2012

Daily Bookmarks 20121221



logstash - open source log management
http://logstash.net/docs/1.1.0/tutorials/getting-started-centralized

Git Magic - 章 2. 基本技巧
http://www-cs-students.stanford.edu/~blynn/gitmagic/intl/zh_tw/ch02.html
Git 情境劇 - 好麻煩部落格
http://gogojimmy.net/2012/02/29/git-scenario/
git remote merge - Google 搜尋
https://www.google.com.tw/search?q=git+remote+merge&hl=zh-TW&tbo=d&ei=WdLAUN-fIuHsmAXar4DoCA&start=10&sa=N&biw=794&bih=460
Git - 與遠端協同工作
http://git-scm.com/book/zh-tw/Git-%E5%9F%BA%E7%A4%8E-%E8%88%87%E9%81%A0%E7%AB%AF%E5%8D%94%E5%90%8C%E5%B7%A5%E4%BD%9C
Git教學:Git的遠端操作及利用Dropbox建立Server進行協同開發(Windows) @ mr.mu
http://www.mrmu.com.tw/2011/05/06/git-using-dropbox-as-server/

not the same cluster logstash - Google 搜尋
https://www.google.com.tw/search?hl=zh-TW&tbo=d&q=not+the+same+cluster+logstash&oq=not+the+same+cluster+logstash&gs_l=serp.3...1981.3385.0.3528.9.9.0.0.0.0.62.429.9.9.0...0.0...1c.1.tzki3GM-oQc
elasticsearch - guide - Zen Discovery
http://www.elasticsearch.org/guide/reference/modules/discovery/zen.html
logstash - open source log management
http://webcache.googleusercontent.com/search?q=cache:7yqOJEIPVJAJ:logstash.net/docs/1.1.0/tutorials/getting-started-centralized+&cd=3&hl=zh-TW&ct=clnk&gl=tw
logstash - open source log management
http://logstash.net/docs/1.1.0/tutorials/getting-started-centralized
logstash's internal message format · logstash/logstash Wiki
https://github.com/logstash/logstash/wiki/logstash%27s-internal-message-format

[#LOGSTASH-679] elasticsearch_http does not seem to work with external hosts - logstash.jira.com
https://logstash.jira.com/browse/LOGSTASH-679


Use multiple partitions for index data
http://docs.splunk.com/Documentation/Splunk/5.0.1/Indexer/Usemultiplepartitionsforindexdata

What Database Engine Splunk Uses? - Splunk Community
http://splunk-base.splunk.com/answers/32499/what-database-engine-splunk-uses

Hadoop Cluster – The Anatomy of Hadoop Pipeline Write |MPLSVPN- This blog is basically for MPLS Service Provider domain.
http://www.mplsvpn.info/2012/11/hadoop-cluster-anatomy-of-hadoop.html
MPLSVPN- This blog is basically for MPLS Service Provider domain.
http://www.mplsvpn.info/search/label/Hadoop

Bash scripting: ElasticSearch and Kibana init.d scripts | [ themattreid ]
http://themattreid.com/wordpress/2012/12/06/bash-scripting-elasticsearch-and-kibana-init-scripts/

Hive MetaStore建表与修改表分析
http://fatkun.com/2012/04/hive-metastore-create-table-and-alter-table.html
淘宝开放平台博客 » hadoop hive 十分钟教程
http://webcache.googleusercontent.com/search?q=cache:5nnGYoR3VucJ:rdc.taobao.com/team/top/tag/hadoop-hive-%E5%8D%81%E5%88%86%E9%92%9F%E6%95%99%E7%A8%8B/+&cd=4&hl=zh-TW&ct=clnk&lr=lang_zh-CN%7Clang_zh-TW&client=firefox-a
 PoweredBy - Hadoop Wiki
http://wiki.apache.org/hadoop/PoweredBy
使用 Linux 和 Hadoop 进行分布式计算
http://www.ibm.com/developerworks/cn/linux/l-hadoop/#resources
写好Hive 程序的五个提示 - 阿里集团数据平台 alidata.org
http://www.alidata.org/archives/622
Hive官方手册翻译(Getting Started) - 实践检验真理 - 51CTO技术博客
http://slaytanic.blog.51cto.com/2057708/939950
Hive源码解析—之—Hive的入口: - 阿里集团数据平台 alidata.org
http://www.alidata.org/archives/1019
Hive Installation - Cloudera Support
https://ccp.cloudera.com/display/CDHDOC/Hive+Installation




大數據估值泡沫?對沖基金做空Splunk - 華富財經Quamnet.com 港股 即時新聞
http://www.quamnet.com/newscontent.action?articleId=2449672
大数据估值泡沫?对冲基金做空Splunk(组图)-搜狐滚动
http://roll.sohu.com/20121018/n355136264.shtml

Splunk 中国 | Take the sh out of IT
http://www.netis.com.cn/splunk/

















Daily Bookmarks 20121220

CloudStack与OpenStack,谁将称王?_云计算频道_计世网
http://cloud.ccw.com.cn/htm2012/20120621_974095.shtml
同一篇报道,两种标题 - 李力(Ada Li) - ITeye技术网站
http://adali.iteye.com/blog/1735409
3 Programmer Biases which Prevent you from Building a Successful Startup
http://niallohiggins.com/2012/12/17/3-programmer-biases-which-prevent-you-from-building-a-successful-startup/
高效存储技术研究 - 刘爱贵的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/liuben/article/details/6064045
Twitter Engineering: Blobstore: Twitter’s in-house photo storage system
http://engineering.twitter.com/2012/12/blobstore-twitters-in-house-photo.html
Twitter的跨数据中心图片存储系统Blobstore-CSDN.NET
http://www.csdn.net/article/2012-12-18/2812933-Twitter-Blobstore

imss - Google 搜尋
https://www.google.com.tw/search?q=imss&aq=f&oq=imss&sugexp=chrome,mod=13&sourceid=chrome&ie=UTF-8

HyperTable – Cloud Computing
http://trac.nchc.org.tw/cloud/wiki/HyperTable
Hypertable应用实践:基于BigTable模型优化业务
http://www.programmer.com.cn/11319/

Hadoop Architecture – Hadoop Distributed File System - Part 1 |MPLSVPN- This blog is basically for MPLS Service Provider domain.

http://www.mplsvpn.info/2012/11/hadoop-architecture-hadoop-distributed.html
Hadoop Architecture – Types of Hadoop Nodes in Cluster - Part 2 |MPLSVPN- This blog is basically for MPLS Service Provider domain.
http://www.mplsvpn.info/2012/11/hadoop-architecture-types-of-hadoop.html

Diving Into ElasticSearch(8)Mapping&Schema | 旁门左道
http://log.medcl.net/item/2011/09/diving-into-elasticsearch-8-mapping-schema/
elasticsearch | 旁门左道
http://log.medcl.net/item/tag/elasticsearch/page/2/
elasticsearch - 空门 - 博客频道 - CSDN.NET
http://blog.csdn.net/lingdu_789/article/category/1247418
elasticsearch 配置详解 - 空门 - 博客频道 - CSDN.NET
http://blog.csdn.net/lingdu_789/article/details/8025246
ElasticSearch的Gateway及存储原理 | 旁门左道
http://log.medcl.net/item/2010/10/elasticsearch-the-gateway-and-storage/

如何使用 Example42 的 Puppet 模块 | vpsee.com
http://www.vpsee.com/2012/05/using-puppet-modules-from-example42/
Foreman, Puppet and OpenStack
http://www.cloudcomp.ch/2012/07/foreman-puppet-and-openstack/

HDFS里Datanode上block大小的设置问题_云朵网络_www.02521.com
http://02521.com/news/21722.html

Blobstore: Twitter’s in-house photo storage system - Geeklist - Geeklist
http://geekli.st/johnnyray/links/11279


Bash scripting: an improved init.d service for LogStash | [ themattreid ]
http://themattreid.com/wordpress/2012/12/06/fun-with-bash-an-improved-init-d-script-for-logstash/
[#LOGSTASH-693] logstash is unable to create index in elasticsearch (Failed to index an event) - logstash.jira.com
https://logstash.jira.com/browse/LOGSTASH-693?attachmentOrder=desc
org.elasticsearch.transport.netty.NettyTransport Source Code | www.massapi.com
http://www.massapi.com/source/elasticsearch-elasticsearch-3b987f9/modules/elasticsearch/src/main/java/org/elasticsearch/transport/netty/NettyTransport.java.html


Thursday, December 20, 2012

Daily Bookmarks 20121219


GlusterFS集群文件系统研究 - 刘爱贵的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/liuben/article/details/6284551
数据同步算法研究 - 刘爱贵的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/liuben/article/details/5793706

修改HBase表的TTL - MrTitan的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/mrtitan/article/details/8292041
HBase Shell - Ray.L.Zhang - 博客园
http://www.cnblogs.com/raylzhang/archive/2012/06/28/2567240.html









http://www.cnblogs.com/raylzhang/archive/2012/06/28/2567240.html

Monday, December 17, 2012

Daily Bookmarks 20121217



Trunk.ly CTO董洵谈架构
http://webcache.googleusercontent.com/search?q=cache:IX1V2XKKVQwJ:www.infoq.com/cn/articles/dx-on-architecture+&cd=1&hl=zh-TW&ct=clnk&gl=tw
HyperTable index - Google 搜尋
https://www.google.com.tw/search?q=HyperTable+index&hl=zh-TW&lr=lang_zh-CN%7Clang_zh-TW&tbo=d&tbs=lr:lang_1zh-CN%7Clang_1zh-TW&ei=Z9_OUPXhMcbAmQWguoCADQ&start=20&sa=N&biw=794&bih=460
HyperTable – Cloud Computing
http://trac.nchc.org.tw/cloud/wiki/HyperTable
Hypertable应用实践:基于BigTable模型优化业务
http://www.programmer.com.cn/11319/
index hypertable - Google 搜尋
https://www.google.com.tw/search?q=index+hypertable&hl=zh-TW&tbo=d&ei=Cs_OUPgrhpiZBbWfgIAP&start=20&sa=N&biw=794&bih=460
Secondary Indices Have Arrived! | Hypertable - Big Data. Big Performance
http://hypertable.com/blog/secondary_indices_have_arrived/

Sehrch.com: A Structured Search Engine Powered By Hypertable | Hypertable - Big Data. Big Performance
http://hypertable.com/blog/sehrchcom_a_structured_search_engine_powered_by_hypertable/
Setting up a local DBpedia mirror with Virtuoso | Jörn's Blog
http://joernhees.de/blog/2010/10/31/setting-up-a-local-dbpedia-mirror-with-virtuoso/

Case Studies | Hypertable - Big Data. Big Performance
http://hypertable.com/customers/case_studies/
Tim Bull - Searching for 'index'
http://timbull.com/?page=2&sort=&search=index
Tim Bull - Searching for 'week 74'
http://timbull.com/?search=week+74&sort=
Tim Bull - Filed under 'trunk.ly'
http://timbull.com/tag/trunkly

Hypertable应用实践:基于BigTable模型优化业务
http://www.programmer.com.cn/11319/


ElasticSearch (5) 分布式架构 - 空门 - 博客频道 - CSDN.NET
http://blog.csdn.net/lingdu_789/article/details/8025376


http://www.programmer.com.cn/11319/

Saturday, December 15, 2012

Daily Bookmarks 20121215

AppEngine Tuning #1 « point7
http://point7.wordpress.com/2011/09/04/appengine-tuning-1/
The Amazing Story Of AppEngine And The Two Orders Of Magnitude « point7
http://point7.wordpress.com/2011/09/03/the-amazing-story-of-appengine-and-the-two-orders-of-magnitude/
谁更胜一筹:技术解析 Google App Engine 和 Amazon EC2 - 简约设计の艺术
http://88250.b3log.org/ec2-vs-gae.html
GAE 调价对 Web 架构的将来揭示了什么? - 简约设计の艺术
http://88250.b3log.org/what-google-app-engine-price-changes-say-about-the-future-of.html

Customize Twitter Bootstrap To Not Look Bootstrap-y - Aj freelancer
http://antjanus.com/blog/web-design-tips/user-interface-usability/customize-twitter-bootstrap-into-themes/
Jetstrap - The Bootstrap Interface Builder
http://jetstrap.com/
How to Build a Website Using Twitter Bootstrap and SASS - Part 1
http://www.1stwebdesigner.com/css/build-website-using-twitter-bootstrap-sass-1/
Twitter Bootstrap 101: Introduction | Webdesigntuts+
http://webdesign.tutsplus.com/tutorials/complete-websites/twitter-bootstrap-101-introduction/

AppEngine Development: Todo: Rest
http://appenginedevelopment.blogspot.tw/2011/12/todo-rest.html

Tech Notes of Yi Wang: MPI-based MapReduce Implementation from google
http://cxwangyi.blogspot.tw/2010/05/mpi-based-mapreduce-implementation.html

HBase 官方文档
http://www.yankay.com/wp-content/hbase/book.html

HBase的概念和性能选项 - - ITeye技术网站
http://beijing-josh.iteye.com/blog/211230
HBase: HBase intra row scanning
http://hadoop-hbase.blogspot.tw/2012/01/hbase-intra-row-scanning.html
hbase介绍 - 阿里集团数据平台 alidata.org
http://www.alidata.org/archives/1509
Hadoop & MapReduce & HBase @ 第二十四個夏天後 :: 痞客邦 PIXNET ::
http://webcache.googleusercontent.com/search?q=cache:QJVO5yvSHBoJ:changyy.pixnet.net/blog/post/24293032-hadoop-%2526-mapreduce-%2526-hbase+&cd=8&hl=zh-TW&ct=clnk&client=firefox-a





























Thursday, December 13, 2012

Daily Bookmarks 20121213

ORATEA » 关于hbase的read操作的深入研究
http://www.oratea.net/?p=1230
5.8. Versions
http://hbase.apache.org/book/versions.html
hbase shell基础和常用命令详解三江小渡 | 三江小渡
http://blog.pureisle.net/archives/1887.html
» HBase中的时间维度 HBase.info -- All things about HBase
http://hbase.info/2011/07/20/hbase-time-dimension
Facebook messages实现解读
http://blog.bluedavy.com/?p=258
hbase运维
http://blog.bluedavy.com/?p=251
hbase基本概念和hbase shell常用命令用法 - flying5 - 博客园
http://www.cnblogs.com/flying5/archive/2011/09/15/2178064.html













Wednesday, December 12, 2012

Daily Bookmarks 20121212



宁静致远 » [翻译]为什么数据库使用B树
http://xiaoy.info/2010/11/20/200/why_disk_based_database_use_b_tree/
A fifteen minute introduction to Redis data types – Redis
http://redis.io/topics/data-types-intro




http://redis.io/topics/data-types-intro

Tuesday, December 11, 2012

Daily Bookmarks 20121211


淘寶的起源故事 | 淘迷
http://taoptt.com/%E6%B7%98%E5%AF%B6%E7%B6%B2%E7%9A%84%E8%B5%B7%E6%BA%90%E6%95%85%E4%BA%8B/
Dropbox Tech Blog » Blog Archive » Welcome Guido!
https://tech.dropbox.com/2012/12/welcome-guido/

子柳_新浪博客
http://blog.sina.com.cn/calvinzhaoc

淘宝技术发展之路 - Kearney's 軒 - 开源中国社区
http://my.oschina.net/kear/blog/83145





Puppet class inheritance confusion - Server Fault
http://serverfault.com/questions/197909/puppet-class-inheritance-confusion

puppet测试例子一枚_天下文章一大抄_百度空间
http://hi.baidu.com/newdreamllc/item/307ac30b3fd709066c904891
还记得《程序员的思维修炼》里面说的吗?这个世界没有典型,但教科书中为了给新手指导,只能提取出抽象的理想状况,否则就算是讲述早上如何喝咖啡都需要几千页。这下我明白为什么生产环境跟做实验区别这么大了,因为做实验相当于就是理想状况,排除了一切干扰,但在现实中没有这种理想环境。

HDFS的block Id与generation stamp - aaronwxb - 博客园
http://www.cnblogs.com/aaronwxb/archive/2012/09/16/2687587.html
HDFS源码学习(1)——NameNode主要数据结构 - 非纯种程序猿
http://jiangbo.me/blog/2012/10/18/hdfs-namenode-datastructure/
HDFS里Datanode上block大小的设置问题 - linux - Cisco网络技术
http://www.07net01.com/linux/HDFSliDatanodeshangblockdaxiaodeshezhiwenti_21005_1350959795.html

Book - Puppet中文wiki
http://puppet.wikidot.com/book
Discussion of the Puppet configuration management framework
http://comments.gmane.org/gmane.comp.sysutils.puppet.user/28445
CentOS 6 Puppet Install • [ How2CentOS ]
http://www.how2centos.com/centos-6-puppet-install/
在 CentOS 6.2 上安装 Puppet 配置管理工具 | vpsee.com
http://www.vpsee.com/2012/03/install-puppet-on-centos-6-2/

Python: Best way to add to sys.path relative to the current running script - Stack Overflow
http://stackoverflow.com/questions/8663076/python-best-way-to-add-to-sys-path-relative-to-the-current-running-script

HDFS原理分析:基本概念 - 51CTO.COM
http://os.51cto.com/art/201212/369564.htm


hdfs - Hadoop: Performance degradation when increasing block sizes? - Stack Overflow
http://stackoverflow.com/questions/8249430/hadoop-performance-degradation-when-increasing-block-sizes

HDFS 6:Why is a block in HDFS So Large?_Hadoop Beginer
http://myhadoop.diandian.com/post/2011-06-05/1747763
Hadoop TDG 2 -- HDFS - fxjwind - 博客园 與GFS對照
http://www.cnblogs.com/fxjwind/archive/2012/08/27/2658955.html








的Hadoop的TDG 2 - HDFS - fxjwind - 博客園

Monday, December 10, 2012

Daily Bookmarks 20121210


puppet file source - Google 搜尋
https://www.google.com.tw/search?q=puppet+file+source+&oq=puppet+file+source+&sugexp=chrome,mod=13&sourceid=chrome&ie=UTF-8
Type Reference — Documentation — Puppet Labs
http://docs.puppetlabs.com/references/latest/type.html#file
ruby on rails - how to source a file in puppet manifest from module - Stack Overflow
http://stackoverflow.com/questions/7216375/how-to-source-a-file-in-puppet-manifest-from-module
Plugins in Modules — Documentation — Puppet Labs
http://docs.puppetlabs.com/guides/plugins_in_modules.html
Language Guide — Documentation — Puppet Labs
http://docs.puppetlabs.com/guides/language_guide.html#classes
Language Guide — Documentation — Puppet Labs
http://docs.puppetlabs.com/guides/language_guide.html#defined-resource-types
Language Guide — Documentation — Puppet Labs
http://docs.puppetlabs.com/guides/language_guide.html#classes

在 CentOS 6.2 上安装 Cobbler | vpsee.com
http://www.vpsee.com/2012/03/install-cobbler-on-centos-6-2/

利用Cobbler批量布署CentOS - 聆听未来 - 51CTO技术博客
http://kerry.blog.51cto.com/172631/648430

Sathya Says – Creating Your Own YUM Repository
http://sathyasays.com/2008/06/22/creating-your-own-yum-repository/

hadoop的1TB排序terasort<转>_面朝大海,春暖花开!_百度空间
http://hi.baidu.com/dt_zhangwei/item/c2a80032c7dbc5ff96f88dbf
Terasort 的记录_移动Labs
http://labs.chinamobile.com/mblog/384_179193

Hadoop Tutorial Series, Issue #2: Getting Started With (Customized) Partitioning | My Blog by Philippe Adjiman
http://www.philippeadjiman.com/blog/2009/12/20/hadoop-tutorial-series-issue-2-getting-started-with-customized-partitioning/

Puppet管理Puppet » 陈沙克日志
http://webcache.googleusercontent.com/search?q=cache:ZQ19MOhwF1AJ:www.chenshake.com/management-of-puppet-puppet/+&cd=1&hl=zh-TW&ct=clnk&gl=tw&lr=lang_zh-CN%7Clang_zh-TW

Oozie简介
http://www.infoq.com/cn/articles/introductionOozie

Project ideas for Hadoop « Ganbatte…!
http://wildanm.wordpress.com/2009/10/15/project-ideas-for-hadoop/
Tracking Trends with Hadoop and Hive on EC2 | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2009/07/tracking-trends-with-hadoop-and-hive-on-ec2/
Grouping Related Trends with Hadoop and Hive | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2009/09/grouping-related-trends-with-hadoop-and-hive/


hadoop in china: Wiki
http://www.hadooper.cn/dct/page/65747
hadoop in china: DistributedLucene
http://www.hadooper.cn/dct/page/65785
DistributedSearch - Solr Wiki
http://wiki.apache.org/solr/DistributedSearch
hadoop in china: 怎样DebugReduce程序
http://www.hadooper.cn/dct/page/65772
Hadoop与大数据技术大会 2012
http://hbtc2012.hadooper.cn/
datawrangling/trendingtopics · GitHub
https://github.com/datawrangling/trendingtopics
















Sunday, December 09, 2012

Daily Bookmarks 20121209

The Puppet File Server — Documentation — Puppet Labs
http://docs.puppetlabs.com/guides/file_serving.html
puppet fileserver初试 » 独醒阁
http://lostme.org/?p=854
puppet class定义 » 独醒阁
http://lostme.org/?p=834


我心目中的理想log机制 » 独醒阁
http://lostme.org/?p=825

Puppet使用方法总结 | 董的博客
http://dongxicheng.org/cluster-managemant/puppet/
init.pp - puppet-manifest-share - share the puppet manifest - Google Project Hosting
http://code.google.com/p/puppet-manifest-share/source/browse/trunk/vmx-puppet/modules/ssh/manifests/init.pp

Simplest Puppet Install Pattern - Puppet - Puppet Labs
http://projects.puppetlabs.com/projects/1/wiki/simplest_puppet_install_pattern






Thursday, December 06, 2012

Daily Bookmarks 20121206



OCTO talks ! » My reading of Percolator architecture: a Google search engine component
http://blog.octo.com/en/my-reading-of-percolator-architecture-a-google-search-engine-component/
OCTO talks ! » Hadoop
http://blog.octo.com/en/tag/hadoop/
Quickstart - Apache Mahout - Apache Software Foundation
https://cwiki.apache.org/confluence/display/MAHOUT/Quickstart
2011.berlinbuzzwords.de/sites/2011.berlinbuzzwords.de/files/AndrzejBialecki-Buzzwords-2011_0.pdf
http://2011.berlinbuzzwords.de/sites/2011.berlinbuzzwords.de/files/AndrzejBialecki-Buzzwords-2011_0.pdf
Why Jetwick moved from Solr to ElasticSearch « Karussell
http://karussell.wordpress.com/2011/02/07/why-jetwick-moved-from-solr-to-elasticsearch/
mobz/elasticsearch-head
https://github.com/mobz/elasticsearch-head
ElasticSearch Head
http://mobz.github.com/elasticsearch-head/
ElasticSearch at berlinbuzzwords 2010
http://www.slideshare.net/elasticsearch/elasticsearch-at-berlinbuzzwords-2010#btnNext
elasticsearch - guide - Gateway
http://www.elasticsearch.org/guide/reference/modules/gateway/
talks/2011/wsnparis/07-search-facets.sh at master · kimchy/talks
https://github.com/kimchy/talks/blob/master/2011/wsnparis/07-search-facets.sh
lucene - Solr vs. ElasticSearch - Stack Overflow
http://stackoverflow.com/questions/10213009/solr-vs-elasticsearch
MapReduce Tutorial
http://hadoop.apache.org/docs/r1.1.0/mapred_tutorial.html
Developer Blog: Setting up a Hadoop cluster - Part 1: Manual Installation
http://gbif.blogspot.tw/2011/01/setting-up-hadoop-cluster-part-1-manual.html

puppet
Robert Harkers Home Page'
http://www.harker.com/people/robert.html
www.harker.com/puppet/ThePuppetShow-20100930.pdf
http://www.harker.com/puppet/ThePuppetShow-20100930.pdf

rpm jar
rpm jar - Google 搜尋
https://www.google.com.tw/search?q=rpm+jar&hl=zh-TW&tbo=d&ei=PiK8UMjvE-jqmAXE34EI&start=10&sa=N&biw=794&bih=435
java - What is the proper way to package a single jar file as rpm? - Stack Overflow
http://stackoverflow.com/questions/1508762/what-is-the-proper-way-to-package-a-single-jar-file-as-rpm
2008 October « Hanumant’s Java Workshop
http://javaworkshop.wordpress.com/2008/10/
Packaging Java JARs as RPM packages « DNA of the TUX
http://tuxdna.wordpress.com/2012/01/27/packaging-java-jars-as-rpm-packages/
Creating an RPM for a Java Application « Hanumant’s Java Workshop
http://javaworkshop.wordpress.com/2008/10/22/rolling-up-an-rpm-for-a-java-application/
Creating an RPM for a Java Application « Hanumant’s Java Workshop
http://javaworkshop.wordpress.com/2008/10/22/rolling-up-an-rpm-for-a-java-application/
java - What is the proper way to package a single jar file as rpm? - Stack Overflow
http://stackoverflow.com/questions/1508762/what-is-the-proper-way-to-package-a-single-jar-file-as-rpm
RPM Maven Plugin - Introduction
http://mojo.codehaus.org/rpm-maven-plugin/
RPM Maven Plugin - Introduction
http://mojo.codehaus.org/rpm-maven-plugin/
Tom's Blog: Building RPM Packages for Java from jar files
http://tmurray3.blogspot.tw/2010/11/building-rpm-packages-for-java-from-jar.html

TCloud - Home
http://tw.ishare2.trendmicro.com/sites/TCloud/SitePages/Home.aspx?RootFolder=%2Fsites%2FTCloud%2FShared%20Documents%2Fconference%2Fhbct2012%2FHadoop%E7%94%9F%E6%80%81%E7%B3%BB%E7%BB%9F&FolderCTID=0x012000406B303E863C5D40B3AD1C1326475184&View={926B8496-070F-4A88-B65B-0687B393C5CF}
BDP 1.5_Installation_Guide.docx
http://tw.ishare2.trendmicro.com/sites/TCloud/BDP1.5/_layouts/WordViewer.aspx?id=/sites/TCloud/BDP1.5/Shared%20Documents/Release%20Doc/BDP%201.5_Installation_Guide.docx&Source=http%3A%2F%2Ftw%2Eishare2%2Etrendmicro%2Ecom%2Fsites%2FTCloud%2FBDP1%2E5%2FShared%2520Documents%2FForms%2FAllItems%2Easpx%3FRootFolder%3D%252Fsites%252FTCloud%252FBDP1%252E5%252FShared%2520Documents%252FRelease%2520Doc%26FolderCTID%3D0x012000004655ECA789744AA53EDAA3A7C9FC29%26View%3D%7B7111AA67%2D9B06%2D431D%2DBA32%2D2E490A3171C9%7D&DefaultItemOpen=1

魚乾的筆記本: git push 二三事
http://webcache.googleusercontent.com/search?q=cache:HzgwyU5DbOwJ:kevyu.blogspot.com/2011/08/git-push.html+&cd=1&hl=zh-TW&ct=clnk&gl=tw

Linux installation
http://docs.splunk.com/Documentation/Splunk/3.4/Installation/LinuxInstallation

Apache Software Foundation Index: Project Management Committee
http://projects.apache.org/indexes/pmc.html
Apache Software Foundation Index: Project Management Committee
http://projects.apache.org/indexes/pmc.html#Apache%20HBase
Apache Software Foundation Index: Project Management Committee
http://projects.apache.org/indexes/pmc.html#Apache%20Lucene

Apache Lucene -
http://lucene.apache.org/whoweare.html
Hadoop Sorts a Petabyte « Free Search
http://cutting.wordpress.com/2009/05/12/hadoop-sorts-a-petabyte/
YDN Blog
http://developer.yahoo.com/blogs/ydn/
Sematext - Search & Big Data Analytics - Solr ElasticSearch Hadoop HBase Lucene Experts
http://www.sematext.com/
Grant’s Grunts: Lucene Edition - Thoughts on Apache Lucene, Mahout, Solr, Tika and Nutch
http://lucene.grantingersoll.com/
Welcome to Apache Nutch™
http://nutch.apache.org/
Accumulo at endgame
http://www.slideshare.net/jasontrost/accumulo-at-endgame

Text Indexing in Accumulo
http://www.slideshare.net/acordova00/text-indexing-in-accumulo
Introduction to Apache Accumulo
http://www.slideshare.net/jaredwinick/introduction-to-apache-accumulo
jaredwinick/Trendulo
https://github.com/jaredwinick/Trendulo

how to trace java code - Google 搜尋
https://www.google.com.tw/search?q=how+to+trace+java+code&hl=zh-TW&tbo=d&ei=HQi_UPDjOvHRmAXn6YE4&start=40&sa=N&biw=794&bih=460
JWorld@TW Java論壇 - [公告] trace opensource 者,皆可獲得秘密獎品
http://www.javaworld.com.tw/jute/post/view?bid=11&id=17493&sty=1&tpg=72&age=0

Wednesday, December 05, 2012

Daily Bookmarks 20121205


Git Server 簡易架設 - 使用SSH | 易春木
http://eeepage.info/git-server-ssh/
Git 和 Git Server 的兩三事 « http404error
http://notexist.wordpress.com/2009/09/09/git-server-%E7%9A%84%E5%85%A9%E4%B8%89%E4%BA%8B/
Git 版本控制 branch model 分支模組基本介紹 | 小惡魔 - 電腦技術 - 工作筆記 - AppleBOY
http://blog.wu-boy.com/2011/03/git-%E7%89%88%E6%9C%AC%E6%8E%A7%E5%88%B6-branch-model-%E5%88%86%E6%94%AF%E6%A8%A1%E7%B5%84%E5%9F%BA%E6%9C%AC%E4%BB%8B%E7%B4%B9/ causes/redis-centos
https://github.com/causes/redis-centos
在 XenServer 上用 ISO 安装虚拟机 | vpsee.com
http://www.vpsee.com/2010/07/making-the-iso-availabe-to-xenserver-hosts/
如何使用XenServer使用本地ISO镜像 - 虚拟人生 - 51CTO技术博客
http://virtuallife.blog.51cto.com/152152/754509
在 XenServer 上用 ISO 安装虚拟机 | vpsee.com
http://www.vpsee.com/2010/07/making-the-iso-availabe-to-xenserver-hosts/
在 CentOS 上安装和配置 NFS 4.0 | vpsee.com
http://www.vpsee.com/2010/06/install-nfs-4-0-on-centos/

「你有多想成功?!」How bad do you want it | 看YouTube學英文

「你有多想成功?!」How bad do you want it | 看YouTube學英文
http://blog.youtubelearn.com/how-bad-do-you-want-it/

Monday, December 03, 2012

Daily Bookmarks 20121203



PChome線上購物 - 24h 購物《Golla 14吋時尚電腦側背包(高雅紅) G1296》
http://shopping.pchome.com.tw/?mod=item&func=exhibit&IT_NO=DCAG4R-A64310130&SR_NO=DCAG3V&ROWNO=110

elasticsearch - tutorials - Setting up elasticsearch on Debian
http://www.elasticsearch.org/tutorials/2010/07/02/setting-up-elasticsearch-on-debian.html

ElasticSearch design - Google 搜尋
https://www.google.com.tw/search?q=ElasticSearch+design&hl=zh-TW&tbo=d&ei=28iyUJz4E-_4mAXysIA4&start=10&sa=N&biw=927&bih=508
ElasticSearch vs Solr - ElasticSearch Tutorial.com
http://www.elasticsearchtutorial.com/elasticsearch-vs-solr.html
2011.berlinbuzzwords.de/sites/2011.berlinbuzzwords.de/files/AndrzejBialecki-Buzzwords-2011_0.pdf
http://2011.berlinbuzzwords.de/sites/2011.berlinbuzzwords.de/files/AndrzejBialecki-Buzzwords-2011_0.pdf
elasticsearch performance - Google 搜尋
https://www.google.com.tw/search?hl=zh-TW&tbo=d&q=elasticsearch+performance&oq=ElasticSearch+per&gs_l=serp.3.0.0l3j0i30j0i10i30j0i30l2j0i8i30l3.1671.2190.0.3465.4.4.0.0.0.0.48.140.4.4.0...0.0...1c.1.fpFf3HSXzhs
Why Jetwick moved from Solr to ElasticSearch « Karussell
http://karussell.wordpress.com/2011/02/07/why-jetwick-moved-from-solr-to-elasticsearch/
ElasticSearch at berlinbuzzwords 2010
http://www.slideshare.net/elasticsearch/elasticsearch-at-berlinbuzzwords-2010#btnNext
elasticsearch - guide - Gateway
http://www.elasticsearch.org/guide/reference/modules/gateway/
elasticsearch - - Open Source, Distributed, RESTful, Search Engine , ElasticSearch 官方站点中文版(开源、分布式、RESTful的搜索引擎)
http://es-cn.medcl.net/
elasticsearch - concepts - Replication
http://es-cn.medcl.net/guide/concepts/scaling-lucene/replication/
elasticsearch - concepts - Building Blocks
http://es-cn.medcl.net/guide/concepts/scaling-lucene/building-blocks/index.html
elasticsearch - concepts - Partitioning
http://es-cn.medcl.net/guide/concepts/scaling-lucene/partitioning/
Elasticsearch, Elasticsearch | SlideShare
http://www.slideshare.net/elasticsearch
talks/2011/wsnparis/07-search-facets.sh at master · kimchy/talks
https://github.com/kimchy/talks/blob/master/2011/wsnparis/07-search-facets.sh
lucene - Solr vs. ElasticSearch - Stack Overflow
http://stackoverflow.com/questions/10213009/solr-vs-elasticsearch

percolation elasticsearch - Google 搜尋
https://www.google.com.tw/search?q=percolation+elasticsearch&hl=zh-TW&tbo=d&ei=506uUNXjGcTzmAWBtYHIDA&start=10&sa=N&biw=927&bih=508
elasticsearch - guide - Percolate API
http://www.elasticsearch.org/guide/reference/api/percolate.html
logstash/lib/logstash/inputs/file.rb at master · logstash/logstash
https://github.com/logstash/logstash/blob/master/lib/logstash/inputs/file.rb

IT牛人博客聚合 - Python与简单网络爬虫的编写 |编程牛人|技术牛人
http://www.udpwork.com/item/8551.html
写了个电影搜索引擎 – 十万电影资源分享哦 « Xiaoxia[PG]
http://xiaoxia.org/2012/10/12/movie-db/#comment-27184

MapReduce Tutorial
http://hadoop.apache.org/docs/r1.1.0/mapred_tutorial.html
Developer Blog: Setting up a Hadoop cluster - Part 1: Manual Installation
http://gbif.blogspot.tw/2011/01/setting-up-hadoop-cluster-part-1-manual.html

Developing Your Java Project with Eclipse
http://www.javahotchocolate.com/tutorials/use-eclipse.html
Setting up Eclipse
http://www.javahotchocolate.com/tutorials/setup-eclipse.html#mixing

elasticsearch - guide - Store
http://www.elasticsearch.org/guide/reference/index-modules/store.html
Elasticsearch Storage Optimization · logstash/logstash Wiki
https://github.com/logstash/logstash/wiki/Elasticsearch-Storage-Optimization
ElasticSearch 原理笔记_digiter
http://digiter.diandian.com/post/2012-11-07/40042796630
Talks by Shay Banon // Speaker Deck
https://speakerdeck.com/kimchy

puppet ntp module recipe - Google 搜尋
https://www.google.com.tw/search?hl=zh-TW&tbo=d&spell=1&q=puppet+ntp+module+recipe&sa=X&ei=JBG8UL2_CqqLmwW26YHYAw&ved=0CC8QvwUoAA&biw=794&bih=460
An Introduction to Puppet
http://www.harker.com/puppet/BayLISA100715.html
Robert Harker’s Puppet Files
http://www.harker.com/puppet/
www.harker.com/puppet/ThePuppetShow-20100930.pdf
http://www.harker.com/puppet/ThePuppetShow-20100930.pdf


大智若魚::人生處處是道場: [雲端計算]HBase vs Cassandra: 我們遷移系統的原因
http://oss-tw.blogspot.tw/2010/04/hbase-vs-cassandra.html
佳文分享:CAP定理 - 走向架构师之路 - 博客频道 - CSDN.NET
http://blog.csdn.net/cutesource/article/details/5621725
ArchitectureOverview - Cassandra Wiki
http://wiki.apache.org/cassandra/ArchitectureOverview
如何“打败”CAP定理
http://www.programmer.com.cn/9260/
Cassandra - 維基百科,自由的百科全書 CP
http://zh.wikipedia.org/wiki/CassandraCAP定理中的Partition不应该译成“分区” - 好工具站长分享平台
http://www.haogongju.net/art/1683552
java.think(): Characterizing Enterprise Systems using the CAP theorem
http://javathink.blogspot.tw/2010/01/characterizing-enterprise-systems-using.html
CAP理论与分布式数据库 -- 系统架构 -- IT技术博客大学习 -- 共学习 共进步!
http://blogread.cn/it/wap/article.php?id=1531&f=sa
How to beat the CAP theorem - thoughts from the red planet - thoughts from the red planet
http://nathanmarz.com/blog/how-to-beat-the-cap-theorem.html
如何“打败”CAP定理 - Winter Is Coming
http://blog.fangjian.me/posts/2011/10/18/how-to-beat-the-cap-theorem/
CAP Confusion: Problems with ‘partition tolerance’ | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2010/04/cap-confusion-problems-with-partition-tolerance/





















Sunday, December 02, 2012

Daily Bookmarks 20121202

《Django实战系列》 - 心内求法 - 博客园
http://www.cnblogs.com/holbrook/archive/2012/03/02/2357343.html
User authentication in Django | Django documentation | Django
https://docs.djangoproject.com/en/dev/topics/auth/
Django Series 1: A custom login page | Solutoire.com
http://solutoire.com/2009/02/26/django-series-1-a-custom-login-page/

Rex's blah blah blah » 一小時內搞懂 Git
http://people.debian.org.tw/~chihchun/2009/01/05/understand-git-in-one-hour/
Git rebase 和 merge 合併操作示範 on Vimeo
https://vimeo.com/46105298
Git rebase 和 merge 合併操作示範錄影 | ihower { blogging }
http://ihower.tw/blog/archives/6704/
Git 教育訓練課程投影片 (2012) | ihower { blogging }
http://ihower.tw/blog/archives/6696/
鵝從未在裏面: [學習資源] 大大們寫的 Git 教學投影片
http://samwhelp.blogspot.tw/2012/03/git.html
















Saturday, December 01, 2012

logstash - open source log management
http://logstash.net/docs/1.0.17/getting-started-centralized
Homepage of Zabbix :: An Enterprise-Class Open Source Distributed Monitoring Solution
http://wwww.zabbix.com/
Installing logstash on CentOS 5.7 - Bits & Bytes of Life
http://mylessteinhauser.com/blog/2011/10/25/install-logstash-centos/





Daily Bookmarks 20121201

Measure Anything, Measure Everything « Code as Craft
http://codeascraft.etsy.com/2011/02/15/measure-anything-measure-everything/
javascript - Alternatives to Highcharts for datasets with 10k data points? - Stack Overflow
http://stackoverflow.com/questions/8307884/alternatives-to-highcharts-for-datasets-with-10k-data-points
flot/flot
https://github.com/flot/flot

An implementation of the Soundex Algorithm in Python | Complete Coding
http://code.rkevin.com/2010/02/an-implementation-of-the-soundex-algorithm-in-python/
Using Fuzzy Matching to Search by Sound with Python - Doug Hellmann
http://www.doughellmann.com/articles/how-tos/phonetic-hashing/index.html
Soundex - Wikipedia, the free encyclopedia
http://en.wikipedia.org/wiki/Soundex

logstash - open source log management
http://logstash.net/docs/1.0.17/getting-started-centralized
Homepage of Zabbix :: An Enterprise-Class Open Source Distributed Monitoring Solution
http://wwww.zabbix.com/
Installing logstash on CentOS 5.7 - Bits & Bytes of Life
http://mylessteinhauser.com/blog/2011/10/25/install-logstash-centos/




我的日志 - TeamCola
https://pc.teamcola.com/home/#week/1353859200000
- iWish
http://iwish.mycolorway.com/create/
产品及解决方案-彩程设计-植根用户体验 聚焦移动互联
http://mycolorway.com/apps.html
TeamCola - iWish
http://iwish.mycolorway.com/tc/
How to Implement Phonetic Algorithm in Python on Names with Multiple Words - Stack Overflow
http://stackoverflow.com/questions/12478537/how-to-implement-phonetic-algorithm-in-python-on-names-with-multiple-words

Python | 彩程团队BLOG
http://blog.mycolorway.com/?s=python&x=0&y=0
用Python实现一个简单的WebSocket服务器 | 彩程团队BLOG
http://blog.mycolorway.com/2011/11/22/a-minimal-python-websocket-server/
基于jQuery的HTML逐帧播放器 | 彩程团队BLOG
http://blog.mycolorway.com/2009/09/21/html-frame-player/
彩程设计-植根用户体验,设计创新软件
http://mycolorway.com/


mahout-cf
http://www.slideshare.net/sscdotopen/mahoutcf#btnNext
Cloudera Hadoop RHEL/CentOS 6 Install Guide - Dakini's Bliss
http://dak1n1.com/blog/9-hadoop-el6-install
高度紀律 情感匱乏 蓋洛普:全球最「無感」國家...新加坡|國際新聞|中時電子報
http://news.chinatimes.com/world/110504/112012112400181.html
Kick Start Hadoop: Mahout Recommendations in Distributed mode with Hadoop Map Reduce
http://kickstarthadoop.blogspot.tw/2011/05/mahout-recommendations-in-distributed.html
Kick Start Hadoop: Mahout Recommendations with Data Sets containing Alpha Numeric Item Ids
http://kickstarthadoop.blogspot.tw/2011/05/mahout-recommendations-with-data-sets.html
Recommender Documentation - Apache Mahout - Apache Software Foundation
https://cwiki.apache.org/confluence/display/MAHOUT/Recommender+Documentation
Mahout Benchmarks
https://cwiki.apache.org/MAHOUT/mahout-benchmarks.html
Itembased Collaborative Filtering
https://cwiki.apache.org/MAHOUT/itembased-collaborative-filtering.html
Collaborative Filtering with ALS-WR
https://cwiki.apache.org/MAHOUT/collaborative-filtering-with-als-wr.html
比價撿便宜
http://w3.lbj.tw/BJ/?wd=1391
Learning — Variables, Conditionals, and Facts — Documentation — Puppet Labs
http://docs.puppetlabs.com/learning/variables.html#fnref:declarative
elasticsearch 權限 - Google 搜尋
https://www.google.com.tw/search?q=elasticsearch+%E6%AC%8A%E9%99%90&oq=elasticsearch+%E6%AC%8A%E9%99%90&sugexp=chrome,mod=13&sourceid=chrome&ie=UTF-8
elasticsearch | 旁门左道
http://log.medcl.net/item/tag/elasticsearch/
elasticsearch - guide - Building from Source
http://es-cn.medcl.net/guide/appendix/building-from-source.html
Puppet Tutorial for Linux: Powering up with Puppet | Bitfield Consulting
http://bitfieldconsulting.com/puppet-tutorial

logstash install pattern 自動偵測
Installing logstash on CentOS 5.7 - Bits & Bytes of Life
http://masteinhauser.github.com/blog/2011/10/25/install-logstash-centos/
Grabbing Full Java Stack Traces from Syslog-ng with Logstash – Tagged
http://blog.tagged.com/2012/05/grabbing-full-java-stack-traces-from-syslog-ng-with-logstash/
Testing Logstash grok filters | Blind Squirrel
http://blog.bealetech.com/content/testing-logstash-grok-filters
ruby - Getting timestamp of event from file name in logstash - Stack Overflow
http://stackoverflow.com/questions/12346590/getting-timestamp-of-event-from-file-name-in-logstash
logstash filter - Google 搜尋
https://www.google.com.tw/search?q=logstash+filter&aq=f&oq=logstash+filter&sugexp=chrome,mod=13&sourceid=chrome&ie=UTF-8
Jan-Piet Mens :: In where I begin to grok how to mutate a file with Logstash
http://jpmens.net/2012/08/09/i-grok-how-to-mutate-a-file-with-logstash/
grok pattern match predicates :: semicomplete.com - Jordan Sissel
http://www.semicomplete.com/blog/geekery/grok-pattern-predicates
About me: :: semicomplete.com - Jordan Sissel
http://www.semicomplete.com/about/
» Hacking your window manager with Python on Jkx@home » Blog Archive
http://www.larsen-b.com/Article/208.html
Ion - Gentoo Linux Wiki
http://en.gentoo-wiki.com/wiki/Ion

hadoop-CSDN.NET
http://www.csdn.net/article/tag/hadoop



logstash安裝模式自動偵測

Friday, November 30, 2012

Daily Bookmarks 20121130

sub-reality.org - sub quality blogging
http://sub-reality.org/2012/09/howto-setup-logstash-and-kibana-for-nginx-on-debian-squeeze/
Logstash Init script — Gist
https://gist.github.com/3623477
logstash-rpms/SOURCES at master · slojo404/logstash-rpms · GitHub
https://github.com/slojo404/logstash-rpms/tree/master/SOURCES

HIVE: Data Warehousing & Analytics on Hadoop
http://www.slideshare.net/zshao/hive-data-warehousing-analytics-on-hadoop-presentation#btnNext

Log rotation in CentOS Linux
http://abdussamad.com/archives/541-Log-rotation-in-CentOS-Linux.html
Apache Hadoop Log Files: Where to find them in CDH, and what info they contain | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2009/09/apache-hadoop-log-files-where-to-find-them-in-cdh-and-what-info-they-contain/
Migrating to CDH | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2010/08/migrating-to-cdh3/


[#SERENGETI-279] hadoop log disk full on master node after 28 jobs - SpringSource Issue Tracker gen terasort
https://issuetracker.springsource.com/browse/SERENGETI-279

log level
不重启集群调整Namenode、jobtracker等服务LOG level - 改善 - ITeye技术网站
http://heipark.iteye.com/blog/1333472

黃小瀞的學習筆記: [XenServer] 延長免費版 XenServer 的使用期限
http://fayesnote.blogspot.tw/2011/03/xenserver-xenserver.html












Thursday, November 29, 2012

Daily Bookmarks 20121129

mahout-cf
http://www.slideshare.net/sscdotopen/mahoutcf#btnNext
Algorithms - Apache Mahout - Apache Software Foundation 實作的演算法近況
https://cwiki.apache.org/confluence/display/MAHOUT/Algorithms
Itembased Collaborative Filtering - Apache Mahout - Apache Software Foundation
https://cwiki.apache.org/confluence/display/MAHOUT/Itembased+Collaborative+Filtering
Collaborative Filtering with ALS-WR - Apache Mahout - Apache Software Foundation
https://cwiki.apache.org/confluence/display/MAHOUT/Collaborative+Filtering+with+ALS-WR
Recommender Documentation - Apache Mahout - Apache Software Foundation
https://cwiki.apache.org/confluence/display/MAHOUT/Recommender+Documentation

Kick Start Hadoop: Evaluating Mahout based Recommender Implementations
http://kickstarthadoop.blogspot.tw/2011/05/evaluating-mahout-based-recommender.html

Puppet must to read
Puppet Tutorial for Linux: Powering up with Puppet | Bitfield Consulting
http://bitfieldconsulting.com/puppet-tutorial

logstash puppet
logstash « I'm Colin
http://imcol.in/tag/logstash/

chrome.tabs - Google Chrome
https://developer.chrome.com/extensions/tabs.html#type-Tab



See the links below bundled up as one link at:
http://tab.bz/1i8e

Running Hadoop On Ubuntu Linux (Multi-Node Cluster) @ Michael G. Noll
http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-multi-node-cluster/
Running Hadoop On Ubuntu Linux (Single-Node Cluster) @ Michael G. Noll
http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/#download-example-input-data
hadoop-examples-0.20.2-cdh3u5.jar wordcount - Google 搜尋
https://www.google.com.tw/search?q=hadoop-examples-0.20.2-cdh3u5.jar+wordcount&hl=zh-TW&tbo=d&ei=AxCtUPr2CqOemQX3_YHADA&start=10&sa=N&biw=927&bih=537
How to test and run a map reduce job on CDH ? - Grokbase
http://grokbase.com/p/cloudera/cdh-user/12afq10j2g/how-to-test-and-run-a-map-reduce-job-on-cdh
How to test and run a map reduce job on CDH ? - Grokbase
http://grokbase.com/p/cloudera/cdh-user/12acf18jdk/how-to-test-and-run-a-map-reduce-job-on-cdh
Hadoop map task list for job_201211211712_0001 on hc1
http://hc1:50030/jobtasks.jsp?jobid=job_201211211712_0001&type=map&pagenum=1
HDFS:/user/hdfs/gutenberg-output
http://hct5.:50075/browseDirectory.jsp?dir=/user/hdfs/gutenberg-output&namenodeInfoPort=50070
hc1 Hadoop Machine List
http://hc1:50030/machines.jsp?type=active





logstash « I'm Colin
http://imcol.in/tag/logstash/
Here comes Logstash for Ubuntu Cloud - Jorge's Stompbox
http://www.jorgecastro.org/2012/11/06/here-comes-logstash-for-ubuntu-cloud
More Ubuntu Juju - Logstash charms | Tech - PaulCz.NET
http://tech.paulcz.net/2012/11/more-ubuntu-juju-logstash-charms.html
A [Very] Beginner's guide to Canonical's Juju | Tech - PaulCz.NET
http://tech.paulcz.net/2012/10/a-very-beginners-guide-to-canonicals.html


Learning — Variables, Conditionals, and Facts — Documentation — Puppet Labs
http://docs.puppetlabs.com/learning/variables.html
Zero to puppet in one day - Finninday
http://finninday.net/wiki/index.php/Zero_to_puppet_in_one_day
Puppet - Gentoo Wiki
http://wiki.gentoo.org/wiki/Puppet
Basic Puppet Setup and Configuration – Linode Library
http://library.linode.com/application-stacks/puppet/installation#sph_configuring-puppet
Puppet安装与配置教程 | 南北漂
http://www.iforeach.com/archives/449.html
百分点科技 - 个性化推荐引擎 数据分析引擎
http://www.baifendian.com/
Dependency graphs in Puppet | Bitfield Consulting
http://bitfieldconsulting.com/puppet-dependency-graphs

Discussion of the Puppet configuration management framework
http://comments.gmane.org/gmane.comp.sysutils.puppet.user/28445
puppet测试例子一枚_天下文章一大抄_百度空间
http://hi.baidu.com/newdreamllc/item/307ac30b3fd709066c904891
centos puppet - Google 搜尋
https://www.google.com.tw/search?q=centos+puppet&aq=f&oq=centos+puppet&sugexp=chrome,mod=13&sourceid=chrome&ie=UTF-8
CentOS 6 Puppet Install • [ How2CentOS ]
http://www.how2centos.com/centos-6-puppet-install/
在 CentOS 6.2 上安装 Puppet 配置管理工具 | vpsee.com
http://www.vpsee.com/2012/03/install-puppet-on-centos-6-2/
hmc puppet server - Google 搜尋
https://www.google.com.tw/search?q=hmc+puppet+server&hl=zh-TW&prmd=imvns&ei=y-OoUKTvN8zxmAXFv4GgAg&start=10&sa=N&biw=928&bih=537
Hmc installation .
http://webcache.googleusercontent.com/search?q=cache:b_xpPdqZiA8J:www.slideshare.net/chebrian/hmc-installation+&cd=12&hl=zh-TW&ct=clnk&gl=tw
HMC:Puppet agent ping connect refused - saying的日志 - 网易博客
http://csshixian.blog.163.com/blog/static/182308522201292471714832/














http://comments.gmane.org/gmane.comp.sysutils.puppet.user/28445

Wednesday, November 28, 2012

Daily Bookmarks 20121128


Kick Start Hadoop: Mahout Recommendations in Distributed mode with Hadoop Map Reduce
http://kickstarthadoop.blogspot.tw/2011/05/mahout-recommendations-in-distributed.html
Kick Start Hadoop: Mahout Recommendations with Data Sets containing Alpha Numeric Item Ids
http://kickstarthadoop.blogspot.tw/2011/05/mahout-recommendations-with-data-sets.html
Kick Start Hadoop: Evaluating Mahout based Recommender Implementations
http://kickstarthadoop.blogspot.tw/2011/05/evaluating-mahout-based-recommender.html







http://kickstarthadoop.blogspot.tw/2011/05/evaluating-mahout-based-recommender.html

Tuesday, November 27, 2012

Daily Bookmarks 20121127

hadoop常见错误及解决办法! - 心如大海 - ITeye技术网站
http://p-x1984.iteye.com/blog/989577


This is puppet部署经验 - ITNosh .
http://www.itnosh.com/2012/02/puppet_deploy.html

Deploying Logstash with Puppet « « I'm Colin
http://imcol.in/2012/05/deploying-logstash-with-puppet/
How to set up Semantic Logging: part one with Logstash, Kibana, ElasticSearch and Puppet, – Jayway
http://www.jayway.com/2012/07/27/how-to-set-up-semantic-logging-part-one-with-logstash-kibana-elasticsearch-and-puppet/
haf/puppet-elasticsearch · GitHub
https://github.com/haf/puppet-elasticsearch
ubuntu 12.04 puppet部署Openstack » 陈沙克日志
http://www.chenshake.com/ubuntu-12-04-puppet-deployment-openstack/








Monday, November 26, 2012

Daily Bookmarks 20121126

about elastic search slide
Scaling massive elastic search clusters - Rafał Kuć - Sematext
http://www.slideshare.net/kucrafal/scaling-massive-elastic-search-clusters-rafa-ku-sematext#btnNext
Battle of the giants: Apache Solr vs ElasticSearch
http://www.slideshare.net/kucrafal/battle-of-the-giants-apache-solr-vs-elasticsearch#btnNext
ElasticSearch at berlinbuzzwords 2010
http://www.slideshare.net/elasticsearch/elasticsearch-at-berlinbuzzwords-2010#btnNext

使用elasticsearch加快過濾 - 控制Rhttp :/ / jontai.me/blog/2012/10/using-elasticsearch-to-speed-up-filtering的/
Using elasticsearch to Speed Up Filtering – Control+R
http://jontai.me/blog/2012/10/using-elasticsearch-to-speed-up-filtering/

elasticsearch - advanced features in practice precolator
http://www.slideshare.net/jsuchal/elasticsearch-advanced-features-in-practice#btnNext

Geeking with Greg: Jeff Dean keynote at WSDM 2009
http://glinden.blogspot.tw/2009/02/jeff-dean-keynote-at-wsdm-2009.html
elasticsearch - - Open Source, Distributed, RESTful, Search Engine , ElasticSearch 官方站点中文版(开源、分布式、RESTful的搜索引擎)
http://es-cn.medcl.net/
Elasticsearch Storage Optimization · logstash/logstash Wiki
https://github.com/logstash/logstash/wiki/Elasticsearch-Storage-Optimization

elasticsearch hash shard - Google 搜尋
https://www.google.com.tw/search?q=elasticsearch+hash+shard&hl=zh-TW&tbo=d&ei=7kKzULOkF8rFmQXGh4DgBw&start=10&sa=N&biw=927&bih=508
ElasticSearch 原理笔记_digiter
http://digiter.diandian.com/post/2012-11-07/40042796630
[elasticsearch] offline indexing and expected scaling performance - Grokbase
http://grokbase.com/t/gg/elasticsearch/12adcr7h52/offline-indexing-and-expected-scaling-performance
elasticsearch - concepts - Partitioning
http://es-cn.medcl.net/guide/concepts/scaling-lucene/partitioning/
















http://jontai.me/blog/2012/10/using-elasticsearch-to-speed-up-filtering/
http://glinden.blogspot.tw/2009/02/jeff-dean-keynote-at-wsdm-2009.html
http://es-cn.medcl.net/guide/concepts/scaling-lucene/partitioning/