Tuesday, December 17, 2013

Daily Bookmarks 20131217

Redirecting
http://api.justin.tv/api/stream/list.json?&language=zh-tw&limit=100
RESTful Authentication with Flask - miguelgrinberg.com
http://blog.miguelgrinberg.com/post/restful-authentication-with-flask
Write a Tumblelog Application with Flask and MongoEngine
http://docs.mongodb.org/ecosystem/tutorial/write-a-tumblelog-application-with-flask-mongoengine/
Flask patterns
http://www.slideshare.net/it-people/flask-patterns
Advanced Flask Patterns // Speaker Deck
https://speakerdeck.com/mitsuhiko/advanced-flask-patterns
The Application Context — Flask 0.10.1 documentation
http://flask.pocoo.org/docs/appcontext/
Introduction into Contexts — Flask-SQLAlchemy 0.16 documentation
http://pythonhosted.org/Flask-SQLAlchemy/contexts.html
Search — Flask 0.10.1 documentation
http://flask.pocoo.org/docs/search/?q=context&check_keywords=yes&area=default
The Application Context — Flask 0.10.1 documentation
http://flask.pocoo.org/docs/appcontext/?highlight=context
应用上下文 — Flask 0.10.1 文档
http://docs.torriacg.org/docs/flask/appcontext.html?highlight=%E5%BA%94%E7%94%A8%E4%B8%8A%E4%B8%8B%E6%96%87
Suffix Array Part 3 — Longest Common Substring (LCS) | roman10
http://www.roman10.net/suffix-array-part-3-longest-common-substring-lcs/
Nikita's blog: Fuzzy string search
http://ntz-develop.blogspot.tw/2011/03/fuzzy-string-search.html
suffix array python - Google 搜尋
https://www.google.com.tw/search?q=suffix+array+python&espv=210&es_sm=119&ei=FhOwUt7QHcXwkAWh9oC4Bg&start=30&sa=N&biw=1124&bih=591
读书:《编程珠玑》第十五章及后缀数组的Python实现和后缀树 | Silent Kogorou Mouri
http://pengwang.me/2013/04/27/%E8%AF%BB%E4%B9%A6%EF%BC%9A%E3%80%8A%E7%BC%96%E7%A8%8B%E7%8F%A0%E7%8E%91%E3%80%8B%E7%AC%AC%E5%8D%81%E4%BA%94%E7%AB%A0-%E5%8F%8A-%E5%90%8E%E7%BC%80%E6%95%B0%E7%BB%84%E7%9A%84python%E5%AE%9E%E7%8E%B0/
Suffix Arrays
http://algs4.cs.princeton.edu/63suffix/

如何衡量你的人生 資源 流程 - Google 搜尋
https://www.google.com.tw/search?espv=210&es_sm=119&q=%E5%A6%82%E4%BD%95%E8%A1%A1%E9%87%8F%E4%BD%A0%E7%9A%84%E4%BA%BA%E7%94%9F+%E8%B3%87%E6%BA%90+%E6%B5%81%E7%A8%8B&oq=%E5%A6%82%E4%BD%95%E8%A1%A1%E9%87%8F%E4%BD%A0%E7%9A%84%E4%BA%BA%E7%94%9F+%E8%B3%87%E6%BA%90+%E6%B5%81%E7%A8%8B&gs_l=serp.3...301081.301610.0.301941.3.3.0.0.0.0.68.168.3.3.0....0...1c.1j2.32.serp..2.1.53.T4eehiTzimE
pyvideo.org - Search: Flask patterns
http://pyvideo.org/search?models=videos.video&q=Flask+patterns
pyvideo.org - Advanced Flask Patterns
http://pyvideo.org/video/1269/advanced-flask-patterns
pyvideo.org - Armin Ronacher
http://pyvideo.org/speaker/238/armin-ronacher
context flask - Google 搜尋
https://www.google.com.tw/search?espv=210&es_sm=119&biw=1124&bih=591&q=context+flask&oq=context+flask&gs_l=serp.3..0i7i30l2j0i30j0i7i10i30j0i7i5i30j0i5i30l5.15061.15345.0.15663.3.3.0.0.0.0.64.169.3.3.0....0...1c.1.32.serp..0.3.167.6MwceIuEi1g
API — Flask 0.10.1 documentation
http://flask.pocoo.org/docs/api/#flask._app_ctx_stack
The Application Context — Flask 0.10-dev documentation
https://flask.readthedocs.org/en/latest/appcontext/
teardown_appcontext - Google 搜尋
https://www.google.com.tw/search?espv=210&es_sm=119&q=teardown_appcontext&oq=teardown_appcontext&gs_l=serp.3..0i10i19.2972.2972.0.3255.1.1.0.0.0.0.49.49.1.1.0....0...1c.1.32.serp..0.1.49.4LyQplnV6Vc
API — Flask 0.10.1 documentation
http://flask.pocoo.org/docs/api/
Quickstart — Flask 0.10.1 documentation
http://flask.pocoo.org/docs/quickstart/#context-locals

Monday, December 09, 2013

Daily Bookmark 20131209

High Scalability - High Scalability - How Google Serves Data from Multiple Datacenters
http://highscalability.com/blog/2009/8/24/how-google-serves-data-from-multiple-datacenters.html
HBase - Apache HBase (TM) Replication
http://hbase.apache.org/replication.html

HBase Replication Notes
https://gist.github.com/larsgeorge/825646


Kyoto Cabinet - Wikipedia, the free encyclopedia
http://en.wikipedia.org/wiki/Kyoto_Cabinet#Effective_implementation_of_hash_database
HASHDB:一个简单的KeyValue存储系统原型 - 刘爱贵的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/liuaigui/article/details/6670841

Disaster Recovery hortonworks - Google 搜尋
https://www.google.com.tw/search?espv=210&es_sm=119&q=Disaster+Recovery+hortonworks&spell=1&sa=X&ei=6TilUqj0I4epkgWs74CQDw&ved=0CC0QvwUoAA&biw=1122&bih=626
Hortonworks Blog
http://hortonworks.com/blog/
Cloudera, Hortonworks, MapR and now Intel and Greenplum? | OpenBI
http://www.openbi.com/content/cloudera-hortonworks-mapr-and-now-intel-and-greenplum
Online Apache HBase Backups with CopyTable | Cloudera Developer Blog
http://blog.cloudera.com/blog/2012/06/online-hbase-backups-with-copytable-2/

hbase的replication使用 - 蓝色时分 - ITeye技术网站
http://koven2049.iteye.com/blog/983633
add_peer hbase - Google 搜尋
https://www.google.com.tw/search?espv=210&es_sm=119&q=add_peer+hbase&oq=add_peer+hbase&gs_l=serp.3..0i30j0i8i30.1110.2632.0.2823.6.6.0.0.0.0.203.414.5j0j1.6.0....0...1c.1.32.serp..0.6.413.FQLZ_9bziFI
HBase Replication
http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/latest/CDH4-Installation-Guide/cdh4ig_topic_20_11.html#topic_20_11_4_unique_1
Apache HBase Replication: Operational Overview | Cloudera Developer Blog
http://blog.cloudera.com/blog/2012/08/hbase-replication-operational-overview/

Apache HBase Replication Overview | Cloudera Developer Blog
http://blog.cloudera.com/blog/2012/07/hbase-replication-overview-2/



Tuesday, December 03, 2013

Daily Bookmark 20131203

alignment algorithm python - Google 搜尋
https://www.google.com.tw/search?espv=210&es_sm=119&q=alignment+algorithm+python&oq=alignment+algorithm+&gs_l=serp.3.1.0i30l6j0i5i30l4.3418.3418.0.5033.1.1.0.0.0.0.49.49.1.1.0....0...1c.1.32.serp..0.1.49.JRLrGoxqobo
alevchuk/pairwise-alignment-in-python
https://github.com/alevchuk/pairwise-alignment-in-python
How to implement the Needleman–Wunsch alignment algorithm without using a single loop in Python | Echo.2
http://blogs.infoecho.net/echo/2011/04/10/how-to-implement-the-needleman%E2%80%93wunsch-alignment-algorithm-without-using-a-single-loop-in-python/
parsing - How to read specific part of large file in Python - Stack Overflow
http://stackoverflow.com/questions/15644859/how-to-read-specific-part-of-large-file-in-python
Saving a Python dict to a file using pickle « SaltyCrane Blog
http://www.saltycrane.com/blog/2008/01/saving-python-dict-to-file-using-pickle/

Exploring Lucene’s Indexing Code: Part 2 | SearchHub | Lucene/Solr Open Source Search
http://searchhub.org/2009/03/18/exploring-lucenes-indexing-code-part-2/
Processing Boolean queries
http://nlp.stanford.edu/IR-book/html/htmledition/processing-boolean-queries-1.html
Storing huge hash table in a file in Python - Stack Overflow
http://stackoverflow.com/questions/1354520/storing-huge-hash-table-in-a-file-in-python


dongweiming/flask_reveal
https://github.com/dongweiming/flask_reveal

Wednesday, October 23, 2013

Daily Bookmark 20131023

Indexing Files via Solr and Java MapReduce | Cloudera Blog
http://blog.cloudera.com/blog/2012/03/indexing-files-via-solr-and-java-mapreduce/
Scaling Solr Indexing with SolrCloud, Hadoop and Behemoth | Javalobby
http://java.dzone.com/articles/scaling-solr-indexing

DataTables - FixedColumns
http://datatables.net/extras/fixedcolumns/


rauth/examples at master · litl/rauth
https://github.com/litl/rauth/tree/master/examples
An example on how to use Oauth and Python to connect to twitter « Popdevelop – A developer team from Malmö, Sweden
http://popdevelop.com/2010/07/an-example-on-how-to-use-oauth-and-python-to-connect-to-twitter/
Rauth — rauth 0.6.2 documentation
https://rauth.readthedocs.org/en/latest/

mapreduce hbase
MR操作hbase的一点心得(含hbase表拷贝样例代码) - bluekeyv的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/kirayuan/article/details/7001278
MR操作hbase的一点心得(含hbase表拷贝样例代码) - 鹏鹏博客 - 博客频道 - CSDN.NET
http://blog.csdn.net/yuanpengs/article/details/7763210
hbase bulkload - bluekeyv的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/kirayuan/article/details/7441447
InterfaceAudience InterfaceStability - 知其然,知其所以然 - ITeye技术网站
http://x-rip.iteye.com/blog/1528572

Export.java hbase mapreduce - Google 搜尋
https://www.google.com.tw/search?espv=210&es_sm=119&q=Export.java+hbase+mapreduce&oq=Export.java+hbase+mapreduce&gs_l=serp.3...1077.7479.0.7606.28.26.1.0.0.0.572.3130.9j1j2j4j0j1.17.0....0...1c.1.29.serp..21.7.670.HdwQ3AHoKds

GrepCode: org.apache.hadoop.hbase.mapreduce.Export (.java) - Class - Source Code View
http://grepcode.com/file/repo1.maven.org/maven2/org.apache.hbase/hbase/0.90.4/org/apache/hadoop/hbase/mapreduce/Export.java#Export.createSubmittableJob%28org.apache.hadoop.hbase.mapreduce.Configuration%2Cjava.lang.String%5B%5D%29






Monday, October 14, 2013

Sunday, October 13, 2013

Daily Bookmark 20131012

python - Connecting to LinkedIn API with rauth - Stack Overflow
http://stackoverflow.com/questions/13103154/connecting-to-linkedin-api-with-rauth
OAuth for Python 總覽 - Google App Engine — Google Developers
https://developers.google.com/appengine/docs/python/oauth/overview
plurk api 2.0 (oAuth認證) 發噗&回噗 | Davidou's Blog
http://blog.davidou.org/archives/423
簡單上手撰寫你的噗浪機器人 | 噗浪通訊社 - 噗浪官方部落格(華文)
http://zh.blog.plurk.com/archives/1121
漫談OAuth認證協定與運作流程 @ 賽拉維的秋天 :: 痞客邦 PIXNET ::
http://cire.pixnet.net/blog/post/30810748-%E6%BC%AB%E8%AB%87oauth%E8%AA%8D%E8%AD%89%E5%8D%94%E5%AE%9A%E8%88%87%E9%81%8B%E4%BD%9C%E6%B5%81%E7%A8%8B
豆瓣 API OAuth认证
http://www.douban.com/service/apidoc/auth
新聞小幫手
http://newshelper.g0v.tw/
















Tuesday, September 17, 2013

Monday, August 19, 2013

Regression Analysis

回歸分析 - 維基百科,自由的百科全書
http://zh.wikipedia.org/wiki/%E8%BF%B4%E6%AD%B8%E5%88%86%E6%9E%90


Daily Bookmarks 20130819

minor compaction - Google 搜尋
https://www.google.com.tw/search?q=minor+compaction&lr=lang_zh-CN%7Clang_zh-TW&tbs=lr:lang_1zh-CN%7Clang_1zh-TW&ei=G7sRUpSyApCulQXeiYGoAQ&start=10&sa=N&biw=927&bih=522
第一留学网|留学|移民
http://ptsolmyr.com/index.php/2012/07/30/visualize-hbase-flush/
HBase性能深度分析
http://www.programmer.com.cn/7246/
可视化Flushes与Compactions | UC技术博客
http://tech.uc.cn/?p=56
使用搜索技术实现URL智能匹配 | UC技术博客
http://tech.uc.cn/?p=696
利用Simhash快速查找相似文档 | UC技术博客
http://tech.uc.cn/?p=1086
利用新词统计特征进行中文分词 | UC技术博客
http://tech.uc.cn/?p=1830
hbase权威指南: store file合并(compaction) - 竹叶青 的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/azhao_dn/article/details/8867036
HBase Administration, Performance Tuning | Packt Publishing
http://www.packtpub.com/article/hbase-basic-performance-tuning
详解HBase Compaction - NoSQLFan - 关注NoSQL相关技术、新闻
http://blog.nosqlfan.com/html/1080.html
Jugnu Life :-): How HBase minor compaction works
http://jugnu-life.blogspot.com/2013/01/how-hbase-minor-compaction-works.html
Jugnu Life :-): How HBase major compaction works
http://jugnu-life.blogspot.com/2013/01/how-hbase-major-compaction-works.html
hbase compaction | Binospace
http://www.binospace.com/index.php/in-depth-understanding-of-the-hbase-compaction/
【朗格科技】LevelDb日知录之八:Compaction - 朗格科技
http://www.samecity.com/blog/Article.asp?ItemID=129
《Hbase权威指南》深入学习hbase架构(4):文件压缩合并Compaction - 飞翔的荷兰人 - ITeye技术网站
http://flyingdutchman.iteye.com/blog/1846031
hbase权威指南: store file合并(compaction) | IT瘾
http://itindex.net/detail/44429-hbase-%E6%9D%83%E5%A8%81-store
Hbase运维碎碎念
http://www.slideshare.net/NinGoo/hbase-8433555
HBase在split和major compact的一些非通常情况下的触发条件 - 关注高并发、大数据,扎实做好技术 - 博客频道 - CSDN.NET
http://blog.csdn.net/yangbutao/article/details/8627120
hbase介绍-NoSQL技术-ChinaUnix.net
http://bbs.chinaunix.net/thread-4095320-1-1.html
9.7. Regions
http://hbase.apache.org/book/regions.arch.html
Jian's Blog: HBase Region Split
http://johnjianfang.blogspot.tw/2012/12/hbase-region-split.html
HBase 官方文档
http://www.yankay.com/wp-content/hbase/book.html#disable.splitting
hbase之宽表与窄表对split的影响 - yyj0531 - 51CTO技术博客
http://yaoyinjie.blog.51cto.com/3189782/654506
《Hbase权威指南》深入学习hbase架构(5):region splits - 飞翔的荷兰人 - ITeye技术网站
http://flyingdutchman.iteye.com/blog/1846141
手指甲「月牙」反映健康狀況 中醫教你怎麼看 - 阿波羅新聞網
http://tw.aboluowang.com/life/2011/1126/226860.html#.UhHDCGRNtcM
月牙可不是健康的“血条” | 健康朝九晚五主题站 | 果壳网 科技有意思
http://www.guokr.com/article/6130/
google adsense收入 - Google 搜尋
https://www.google.com.tw/search?q=google+adsense%E6%94%B6%E5%85%A5&oq=google+ads&aqs=chrome.3.0j69i57j0l2j69i60j69i62.6677j0&sourceid=chrome&ie=UTF-8
嚇死人但絕對值得你思考的網路高手賺錢收入 | 富朋友理財筆記
http://blog.17rich.com/google-adsense-top-earner.html
Google Adsense 點擊廣告收入(單日突破20美金)…再一次令我愣住! | 零成本-網路行銷與賺錢
http://zero-cost.ebookboxs.com/adsense-guang-gao-shou-yi-fen-xiang/google-adsense-dian-ji-guang-gao-shou-ru-dan-ri-tu-po-20-mei-jin-zai-yi-ci-ling-wo-leng-zhu/#more-766
連我自己都傻眼的,單日Adsense點擊廣告收益! | 零成本-網路行銷與賺錢
http://zero-cost.ebookboxs.com/adsense-guang-gao-shou-yi-fen-xiang/lian-wo-zi-ji-dou-sha-yan-de-dan-ri-adsense-dian-ji-guang-gao-shou-yi/

Tuesday, August 06, 2013

Thursday, August 01, 2013

Wednesday, July 31, 2013

Daily Bookmarks 20130731

两个或N个字符串最大公共子串算法 - tianmo2010的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/tianmo2010/article/details/7473717

Sunday, July 28, 2013

Daily Bookmarks 20130728

Mozilla Firefox 開始頁
http://www.renren.com/268217599
http://www.g.cn/
http://www.g.cn/
Facebook
https://www.facebook.com/
pymmesg - Google 搜尋
https://www.google.com.tw/search?q=pymmesg&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&gws_rd=cr
pymmseg - Google 搜尋
https://www.google.com.tw/search?client=firefox-a&hs=dz8&rls=org.mozilla:zh-TW:official&q=pymmseg&spell=1&sa=X&ei=NPfzUYHCOY6bkgWYroCgBg&ved=0CC4QvwUoAA&biw=1275&bih=725
pluskid/pymmseg-cpp
https://github.com/pluskid/pymmseg-cpp
pymmseg-cpp - Google 搜尋
https://www.google.com.tw/search?q=pymmseg-cpp&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&channel=rcs&gws_rd=cr
Python 中文分词:用纯python实现 / FMM 算法 / pymmseg-cpp / smallseg / judou 句读 / BECer-GAE
http://www.starming.com/index.php?action=plugin&v=wave&tpl=union&ac=viewgrouppost&gid=73&tid=13336
python 中文分词,安装 pymmseg - zhangxinrun的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/zhangxinrun/article/details/7525740
youngking/pymmseg
https://github.com/youngking/pymmseg
pymmseg-cpp - High performance Chinese word segmenting module for Python - Google Project Hosting
http://code.google.com/p/pymmseg-cpp/
改进Pymmseg分词功能 - frEefiS ' tHiNkinG
http://freefis.appspot.com/?p=111001
pymmseg-cpp/demos/use_custom_dict.py at master · shuge/pymmseg-cpp
https://github.com/shuge/pymmseg-cpp/blob/master/demos/use_custom_dict.py
python 中文分词,安装 pymmseg - python - ITeye技术网站
http://ipython.iteye.com/blog/1136931
使用pymmseg进行中文分词 - 地瓜日记 - 博客园
http://www.cnblogs.com/sweetpotato-diary/archive/2012/03/20/2408941.html
python下的两个分词工具 | 旁门左道
http://log.medcl.net/item/2011/03/python%E4%B8%8B%E7%9A%84%E5%88%86%E8%AF%8D%E5%BA%93/
longest common subsequence spam detect - Google 搜尋
https://www.google.com.tw/search?q=longest+common+subsequence+spam+detect&client=firefox-a&hs=UC9&rls=org.mozilla:zh-TW:official&ei=UfrzUYHVMciGkgXT8IGoDg&start=10&sa=N&biw=1275&bih=725
pymmseg-cpp pip - Google 搜尋
https://www.google.com.tw/search?client=firefox-a&hs=ico&rls=org.mozilla%3Azh-TW%3Aofficial&q=pymmseg-cpp+pip&oq=pymmseg-cpp+pip&gs_l=serp.3...1709.3855.0.4117.4.4.0.0.0.0.93.351.4.4.0....0...1c.1.22.serp..3.1.93.iL5Fw559ofk
http://autodaguo-python.googlecode.com/svn/trunk/mybot.txt
http://autodaguo-python.googlecode.com/svn/trunk/mybot.txt
python list modules - Google 搜尋
https://www.google.com.tw/search?q=python+list+modules&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&gws_rd=cr
Get a list of installed Python modules - Stack Overflow
http://stackoverflow.com/questions/739993/get-a-list-of-installed-python-modules
新酷音 dict - Google 搜尋
https://www.google.com.tw/search?q=%E6%96%B0%E9%85%B7%E9%9F%B3+dict&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&gws_rd=cr
Re: [閒聊] 新酷音可不可以不要有內建詞彙 - 看板 IME - 批踢踢實業坊
http://www.ptt.cc/bbs/IME/M.1241690936.A.43A.html
http://svn.openfoundry.org/libchewingdata/readme.html
http://svn.openfoundry.org/libchewingdata/readme.html
新酷音共享詞庫
http://hyperrate.com/thread.php?tid=21020
pymmseg-cpp 繁體 - Google 搜尋
https://www.google.com.tw/search?q=pymmseg-cpp+%E7%B9%81%E9%AB%94&client=firefox-a&hs=Kh9&rls=org.mozilla:zh-TW:official&ei=yQH0UfKDDMKrkAWaoIDoCw&start=10&sa=N&biw=1275&bih=725
中文分词实战与文言文分词的初步设想 | 京華煙云
http://www.yenching.org/2009/10/%e4%b8%ad%e6%96%87%e5%88%86%e8%af%8d%e5%ae%9e%e6%88%98%e4%b8%8e%e6%96%87%e8%a8%80%e6%96%87%e5%88%86%e8%af%8d%e7%9a%84%e5%88%9d%e6%ad%a5%e8%ae%be%e6%83%b3/
Free Mind » Blog Archive » RMMSeg: Ruby 实现中文分词
http://lifegoo.pluskid.org/?p=261
新酷音 字典 - Google 搜尋
https://www.google.com.tw/search?q=%E6%96%B0%E9%85%B7%E9%9F%B3+%E5%AD%97%E5%85%B8&client=firefox-a&hs=g8o&rls=org.mozilla:zh-TW:official&ei=ZgP0Ub6UEIWokQXIqYHgBQ&start=10&sa=N&biw=1275&bih=700
TWed2k - 心得教學區 - [發現]新酷音注音修改教學
http://058176049149.ctinets.com/viewthread.php?action=printable&tid=290870
新酷音詞庫及注音修改教學
http://chewing.csie.net/chewing_dict_edit.html
libchewing-data/utf-8/tsi.src at master · chewing/libchewing-data
https://github.com/chewing/libchewing-data/blob/master/utf-8/tsi.src
grep 非 打頭 - Google 搜尋
https://www.google.com.tw/search?q=grep+%E9%9D%9E+%E6%89%93%E9%A0%AD&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&gws_rd=cr
正則運算式之道 - just do it - 中國經濟網 經濟部落格
http://big5.ce.cn/gate/big5/blog.ce.cn/html/33/100933-55717.html
高鐵 - Yahoo!奇摩新聞搜尋結果
http://tw.news.search.yahoo.com/search;_ylt=A8tUwYGHB_RRyk8AoElr1gt.?p=%E9%AB%98%E9%90%B5&fr=ush-globalnews&fr2=piv-web
北高1,630元 高鐵最快10月調漲 - Yahoo!奇摩新聞
http://tw.news.yahoo.com/%E5%8C%97%E9%AB%981-630%E5%85%83-%E9%AB%98%E9%90%B5%E6%9C%80%E5%BF%AB10%E6%9C%88%E8%AA%BF%E6%BC%B2-213000245.html
新詞發現 最常共同子串 - Google 搜尋
https://www.google.com.tw/search?q=%E6%96%B0%E8%A9%9E%E7%99%BC%E7%8F%BE+%E6%9C%80%E5%B8%B8%E5%85%B1%E5%90%8C%E5%AD%90%E4%B8%B2&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&gws_rd=cr
基于大规模语料的新词发现算法
http://www.programmer.com.cn/12276/
LCS 新詞發現 - Google 搜尋
https://www.google.com.tw/search?q=LCS+%E6%96%B0%E8%A9%9E%E7%99%BC%E7%8F%BE&client=firefox-a&hs=LWp&rls=org.mozilla:zh-TW:official&ei=IAn0UayAHYnQkgWSkYEw&start=10&sa=N&biw=1275&bih=700
基于大规模语料的新词发现算法 - - 博客频道 - CSDN.NET
http://blog.csdn.net/qyee16/article/details/7741975
基于选择倾向性的词汇获取方法_百度文库
http://wenku.baidu.com/view/3d091d65783e0912a2162a24.html
Longest common subsequence 大規模 - Google 搜尋
https://www.google.com.tw/search?client=firefox-a&hs=JC&rls=org.mozilla%3Azh-TW%3Aofficial&channel=rcs&q=Longest+common+subsequence+%E5%A4%A7%E8%A6%8F%E6%A8%A1&oq=Longest+common+subsequence+%E5%A4%A7%E8%A6%8F%E6%A8%A1&gs_l=serp.3...1801.7287.0.7595.30.21.5.0.0.1.162.2057.15j6.21.0....0...1c.1.22.serp..25.5.210.1E4DjYTcN7o
http://sewm.pku.edu.cn/TianwangLiterature/Report/NCIS_TR_2007012.pdf
http://sewm.pku.edu.cn/TianwangLiterature/Report/NCIS_TR_2007012.pdf
抽取 公共子串 - Google 搜尋
https://www.google.com.tw/search?q=%E6%8A%BD%E5%8F%96+%E5%85%AC%E5%85%B1%E5%AD%90%E4%B8%B2&client=firefox-a&rls=org.mozilla:zh-TW:official&ei=Jwz0UbnUHpCmkgWB74GYDQ&start=60&sa=N&biw=1275&bih=700
求多个字符串的最大公共子串---后缀数组 - gdp5211314的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/gdp5211314/article/details/8362678
从diff到LCS(Longestcommonsubsequence),抽象之美-python-电脑编程网
http://biancheng.dnbcw.info/python/170358.html
[coreseek/sphinx学习笔记4]--搜索 - iLovePHP - 开源中国社区
http://my.oschina.net/wzwitblog/blog/109997
相似数据检测算法
http://www.douban.com/note/180296814/
Karp-Rabin - Google 搜尋
https://www.google.com.tw/search?q=Karp-Rabin&lr=lang_zh-CN%7Clang_zh-TW&client=firefox-a&hs=d1U&rls=org.mozilla:zh-TW:official&channel=rcs&tbs=lr:lang_1zh-CN%7Clang_1zh-TW&ei=sQv0UZbsN8flkAWXqIEo&start=10&sa=N&biw=1275&bih=700
Karp-Rabin algorithm
http://www-igm.univ-mlv.fr/~lecroq/string/node5.html
sequential extraction of common substrings - Google 搜尋
https://www.google.com.tw/search?q=sequential+extraction+of+common+substrings&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&channel=rcs&gws_rd=cr
基于统计的无词典的高频词抽取(二)——根据LCP数组计算词频 - 三度空间 - 博客园
http://www.cnblogs.com/three-zone/p/LCP.html
基于统计的无词典的高频词抽取(一)——后缀数组字典序排序 - 脚本百事通
http://www.csdn123.com/html/blogs/20130614/22454.htm
抽取 共子串 - Google 搜尋
https://www.google.com.tw/search?q=%E6%8A%BD%E5%8F%96+%E5%85%B1%E5%AD%90%E4%B8%B2&client=firefox-a&rls=org.mozilla:zh-TW:official&ei=eA_0Uf-JO8iXkwWW1ICABw&start=10&sa=N&biw=1275&bih=700
http://ir.dlut.edu.cn/ThesisList%5C2009%5C韩冰-大规模文本去重策略研究.pdf
http://ir.dlut.edu.cn/ThesisList%5C2009%5C%E9%9F%A9%E5%86%B0-%E5%A4%A7%E8%A7%84%E6%A8%A1%E6%96%87%E6%9C%AC%E5%8E%BB%E9%87%8D%E7%AD%96%E7%95%A5%E7%A0%94%E7%A9%B6.pdf

Wednesday, July 24, 2013

Daily Bookmarks 20130724

程式語言教學誌: Java 快速導覽 - 物件導向概念 泛型
http://pydoing.blogspot.tw/2010/12/java-generic.html
Generics
http://docs.oracle.com/javase/1.5.0/docs/guide/language/generics.html
Java Generics ? , E and T what is the difference? - Stack Overflow
http://stackoverflow.com/questions/6008241/java-generics-e-and-t-what-is-the-difference
Interfaces (The Java™ Tutorials > Learning the Java Language > Interfaces and Inheritance)
http://docs.oracle.com/javase/tutorial/java/IandI/createinterface.html


哪部电影让你看到了理想中的爱情? - 知乎
http://www.zhihu.com/question/20448308
AWS云搜索的使用:极简Java API
http://www.infoq.com/cn/articles/AmazonCloudSearch
HDFS namenode源码分析 | r6
http://www.r66r.net/?p=1093
快衝!LINE 熊大、兔兔、饅頭人貼圖免費下載中 @ :: ifans :: :: 痞客邦 PIXNET ::
http://ifans.pixnet.net/blog/post/153209316
聪明人都在绞尽脑汁让人点击广告,更聪明的人在做什么?收集和分析这些数据 |PingWest
http://pingwest.com/demo/adstage/

The Best of Tchaikovsky - YouTube
http://www.youtube.com/watch?v=7_WWz2DSnT8&list=PLcGkkXtask_fpbK9YXSzlJC4f0nGms1mI

The Best of Classical Music - YouTube
http://www.youtube.com/playlist?list=PLcGkkXtask_fpbK9YXSzlJC4f0nGms1mI
Lessons In Coding: The K&R Index of Blog Entries
http://lessonsincoding.blogspot.tw/p/c-programming-language-k.html
A Guide to Python Frameworks for Hadoop | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2013/01/a-guide-to-python-frameworks-for-hadoop/
Rethrick Construction
http://rethrick.com/#projects
Java Client调用ElasticSearch做全文搜索代码示例 - - ITeye技术网站
http://shuminghuang.iteye.com/blog/1732129
Getting started with ElasticSearch « Jai’s Weblog – Tech, Security & Fun…
http://jaibeermalik.wordpress.com/2013/03/15/getting-started-with-elasticsearch/
Elasticsearch源碼分析之一——使用Guice進行依賴注入與模塊化系統_人人IT網
http://rritw.com/a/bianchengyuyan/C__/20120920/226667.html
search - Beginner's guide to ElasticSearch - Stack Overflow
http://stackoverflow.com/questions/11593035/beginners-guide-to-elasticsearch
腾讯分析系统架构解析 -- 系统运维 -- IT技术博客大学习 -- 共学习 共进步!
http://blogread.cn/it/article/6440?f=wb

pros cons - Google 搜尋
https://www.google.com.tw/search?q=pros+cons&oq=pros+cons&aqs=chrome.0.69i57j69i65l2j69i61j69i59l2.2564j0&sourceid=chrome&ie=UTF-8
English of the day! -- Pros & Cons - [V!cT0R] - 無名小站
http://www.wretch.cc/blog/vicchen19/5909403
泛型與 Collection — Java Steps
http://javasteps.plweb.org/java_generic.html
java t generic - Google 搜尋
https://www.google.com.tw/search?q=java+t+generic&oq=java+T+ge&aqs=chrome.2.69i57j0l3j69i60l2.8155j0&sourceid=chrome&ie=UTF-8
Oracle Site Search - Secure Enterprise Search - Generics
http://search.oracle.com/search/search?start=1&search_p_main_operator=all&q=Generics
Introduction (The Java™ Tutorials > Bonus > Generics)
http://docs.oracle.com/javase/tutorial/extra/generics/intro.html
Defining Simple Generics (The Java™ Tutorials > Bonus > Generics)
http://docs.oracle.com/javase/tutorial/extra/generics/simple.html
Inheritance (The Java™ Tutorials > Learning the Java Language > Interfaces and Inheritance)
http://docs.oracle.com/javase/tutorial/java/IandI/subclasses.html
Creating Objects (The Java™ Tutorials > Learning the Java Language > Classes and Objects)
http://docs.oracle.com/javase/tutorial/java/javaOO/objectcreation.html
The Fine Print (The Java™ Tutorials > Bonus > Generics)
http://docs.oracle.com/javase/tutorial/extra/generics/fineprint.html

Friday, July 19, 2013

Daily Bookmarks 20130719


hadoop - Pass Python scripts for mapreduce to HBase - Stack Overflow
http://stackoverflow.com/questions/14241729/pass-python-scripts-for-mapreduce-to-hbase
kennethreitz/requests
https://github.com/kennethreitz/requests/

Bulk importing Data into HBase | Deerwalk Blog - Result.Reflect.Repeat.
http://www.deerwalk.com/blog/bulk-importing-data/
2. Using Pig to Bulk Load Data Into HBase - Hortonworks Data Platform
http://docs.hortonworks.com/HDPDocuments/HDP1/HDP-1.3.1/bk_user-guide/content/user-guide-hbase-import-2.html




29歲被開除?或留下來? - Cheers快樂工作人雜誌
http://www.cheers.com.tw/article/article.action?id=5029348

Wednesday, July 17, 2013

Daily Bookmarks 20130716

Google Dremel 原理 – 如何能3秒分析1PB | 我自然
http://www.yankay.com/google-dremel-rationale/
经典论文翻译导读之《Dremel: Interactive Analysis of WebScale Datasets》 - ImportNew
http://www.importnew.com/2617.html

数据科学与R语言: 重磅推荐:《机器学习之黑客帝国》
http://xccds1977.blogspot.tw/2012/03/blog-post.html
数据科学与R语言: 电影爱好者的R函数
http://xccds1977.blogspot.tw/2013/06/r.html

数据科学与R语言: Twitter的数据科学家是如何工作?
http://xccds1977.blogspot.tw/2012/03/twitter.html

PyCodersCN/issue12/machine-learning-for-hackers.rst at master · PyCodersCN/PyCodersCN
https://github.com/PyCodersCN/PyCodersCN/blob/master/issue12/machine-learning-for-hackers.rst
Unsupervised Learning — Clustering Analysis | 演衡學習筆記
http://c3h3notes.wordpress.com/2010/10/29/unsupervised-learning-clustering-analysis/



Friday, July 12, 2013

Daily Bookmarks 20130712

Inversion of Control Containers and the Dependency Injection pattern
http://martinfowler.com/articles/injection.html
parallel external merge sort - 碎碎唸
http://blog.yunglinho.com/blog/2013/03/19/parallel-external-merge-sort/
Dependency Injection in Scala - 碎碎唸
http://blog.yunglinho.com/blog/2012/04/22/dependency-injection-in-scala/
轻松学习Spring IoC容器和Dependency Injection模式 - JAVA涂鸦 - BlogJava
http://www.blogjava.net/rickhunter/articles/29015.html
Spring 學習筆記
http://openhome.cc/Gossip/SpringGossip/

python class - Google 搜尋
https://www.google.com.tw/search?q=python+class&oq=python+class&aqs=chrome.0.69i57j0l3j69i62l2.2145j1&sourceid=chrome&ie=UTF-8
定義類別
http://openhome.cc/Gossip/Python/Class.html
9. 類別(Classes)
http://larc.ee.nthu.edu.tw/~jcyeh/python/cdoc/tut/node11.html
5.5. Exploring UserDict: A Wrapper Class
http://www.diveintopython.net/object_oriented_framework/userdict.html
5.2. Importing Modules Using from module import
http://www.diveintopython.net/object_oriented_framework/importing_modules.html
Lesson 8 - Classes
http://www.sthurlow.com/python/lesson08/
Python Object Oriented
http://www.tutorialspoint.com/python/python_classes_objects.htm

Designing a RESTful API with Python and Flask - miguelgrinberg.com
http://blog.miguelgrinberg.com/post/designing-a-restful-api-with-python-and-flask
Flask-RESTful — Flask-RESTful 0.2.1 documentation
http://flask-restful.readthedocs.org/en/latest/index.html

MapReduce生成HFile入库到HBase - 石头儿 - 博客园
http://www.cnblogs.com/shitouer/archive/2013/02/20/hbase-hfile-bulk-load.html
【HBase工具】查看解析HFile - 我不是春晖 - ITeye技术网站
http://zjushch.iteye.com/blog/1676675
MapReduce生成HFile入库到HBase及源码分析三江小渡 | 三江小渡
http://blog.pureisle.net/archives/1950.html
用于大数据的并查集(基于HBase)的java类三江小渡 | 三江小渡
http://blog.pureisle.net/archives/2033.html








Sunday, July 07, 2013

Daily Bookmarks 20130707

Trie 的原理和实现 (python 实现) - ChenQi的个人空间 - 开源中国社区
http://my.oschina.net/u/158589/blog/61037
读书:《编程珠玑》第十五章及后缀数组的Python实现和后缀树 | Silent Kogorou Mouri
http://pengwang.me/2013/04/27/%e8%af%bb%e4%b9%a6%ef%bc%9a%e3%80%8a%e7%bc%96%e7%a8%8b%e7%8f%a0%e7%8e%91%e3%80%8b%e7%ac%ac%e5%8d%81%e4%ba%94%e7%ab%a0-%e5%8f%8a-%e5%90%8e%e7%bc%80%e6%95%b0%e7%bb%84%e7%9a%84python%e5%ae%9e%e7%8e%b0/
Trie树的Python实现 | Silent Kogorou Mouri
http://pengwang.me/2013/04/25/trie%E6%A0%91%E7%9A%84python%E5%AE%9E%E7%8E%B0/
Trie树的Python实现 | hbprotoss的博客
http://hbprotoss.github.io/posts/trieshu-de-pythonshi-xian.html
对Python中文分词模块结巴分词算法过程的理解和分析 | seanhuang 技术点滴
http://seanhuang.me/?p=542
- Django梦之队(DDTCMS官网)
http://ddtcms.com/blog/archive/2013/2/17/70/how-to-begin-to-study-the-chinese-word-segmentation/

Trie in Python | 我爱正则表达式
http://iregex.org/blog/trie-in-python.html
使用python代码实现三叉搜索树高效率”自动输入提示”功能
http://www.starming.com/index.php?action=plugin&v=wave&tpl=union&ac=viewgrouppost&gid=73&tid=17520


Attlin
http://www.attlin.com/
12、backbone实战:web在线聊天室(backbone+django+sqlite)(一)功能分析 | the5fire的技术博客
http://www.the5fire.com/12-backbone-webchat-1.html
说说我这个博客的架构 | the5fire的技术博客
http://www.the5fire.com/blog-architecture.html
7、backbone实例todos分析(一) | the5fire的技术博客
http://www.the5fire.com/7-backbone-todos-1.html









Friday, July 05, 2013

Daily Bookmarks 20130705

Understanding the Parallelism of a Storm Topology - Michael G. Noll
http://www.michael-noll.com/blog/2012/10/16/understanding-the-parallelism-of-a-storm-topology/
Record 格式
http://irc.ccu.edu.tw/tools/page/show_page.php?page_url=/Site/web/dir_517fb4349001b/article_517fb6d84d3b6.html
Efficiently Reading in and Iterating Through Large Files with Python ~ Optinalysis
http://www.nikhilgopal.com/2010/12/dealing-with-large-files-in-python.html

MogileFS 的介绍(MogileFS 系列1) 扶凯
http://www.php-oa.com/2010/09/26/perl-mogilefs-1.html
Data IAP Day 1
http://dataiap.github.io/dataiap/day4/
OReilly – Hadoop The Definitive Guide (06-2009) « Xu Fei's Blog
http://autofei.wordpress.com/2010/06/27/oreilly-hadoop-the-definitive-guide-06-2009/
Java Example Code using HBase Data Model Operations « Xu Fei's Blog
http://autofei.wordpress.com/2012/04/02/java-example-code-using-hbase-data-model-operations/


Wu Mamber (String Algorithms 2007)
http://www.slideshare.net/mailund/wu-mamber-string-algorithms-2007
Memory Dump | 基于后缀搜索的多模式匹配算法——Wu-Manber算法
https://memorycn.wordpress.com/2011/11/05/matching_algorithm_-_wu-manber_algorithm_based_on_the_the_suffix_search_of_multi-mode/


Pig Macro for TF-IDF Makes Topic Summarization 2 Lines of Pig | Hortonworks
http://hortonworks.com/blog/pig-macro-for-tf-idf-makes-topic-summarization-2-lines-of-pig/
TF-IDF in 2 lines of code with Pig Macros - Hadoop, Data, and Systems - Quora
http://hadoop-data-systems.quora.com/TF-IDF-in-2-lines-of-code-with-Pig-Macros
The Brotherhood of coders: Document similarity using Hadoop
http://coderscreed.blogspot.tw/2012/12/document-similarity-using-hadoop.html
TF-IDF in Hadoop Part 3: Documents in Corpus and TFIDF Computation | Marcello de Sales' Blog
http://marcellodesales.wordpress.com/2010/01/10/tf-idf-in-hadoop-part-3-documents-in-corpus-and-tfidf-computation/

Quickstart — Flask 0.10.1 documentation
http://flask.pocoo.org/docs/quickstart/
flask-tumblelog/tumblelog/admin.py at master · rozza/flask-tumblelog · GitHub
https://github.com/rozza/flask-tumblelog/blob/master/tumblelog/admin.py
Write a Tumblelog Application with Flask and MongoEngine — MongoDB Manual 2.4.5
http://docs.mongodb.org/manual/tutorial/write-a-tumblelog-application-with-flask-mongoengine/



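Enhanced edition of "Hadoop Data Analysis Platform", 8th session (5 weeks of content added), a nearly free reverse-charging online training_Hadoop and Distributed Data Processing_ITPUB Forum - it168's professional technical community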
http://www.itpub.net/thread-1629863-1-1.html

HBaseWD: Avoid RegionServer Hotspotting Despite Sequential Keys | Sematext Blog
http://blog.sematext.com/2012/04/09/hbasewd-avoid-regionserver-hotspotting-despite-writing-records-with-sequential-keys/
Myths About E-commerce Recommender Systems
http://www.infoq.com/cn/presentations/electricity-supplier-recommendation-system-thinking
Bit.ly Releases Forget-Table to Handle Non-Stationary Categorical Distributions
http://www.infoq.com/cn/news/2013/02/bitly-forget-table

Presentations
http://www.infoq.com/cn/presentations/60
How the Tencent Weibo Architecture Grew
http://www.infoq.com/cn/presentations/tencent-blog-structure-growup
Exploring Jingdong Cloud Storage Services and Applications
http://www.infoq.com/cn/presentations/jingdong-cloud-storage-services-applications-explore
Partition-Tolerance - Google Search
https://www.google.com.tw/search?q=Partition-Tolerance&source=lnt&tbs=lr:lang_1zh-CN%7Clang_1zh-TW&lr=lang_zh-CN%7Clang_zh-TW&sa=X&ei=QffMUePUL4avkgWO-4HICQ&ved=0CBYQpwUoAQ&biw=1264&bih=711
keyword tf idf - Google Search
https://www.google.com.tw/search?q=keyword+tf+idf&ei=7lzNUe20FsavkgXv64CwCw&start=10&sa=N&biw=1264&bih=711
Keyword Extraction Based on tf/idf for Chinese News Document
http://d.wanfangdata.com.cn/Periodical_whdxxb-e200705030.aspx
[Repost] A Beginner's Primer on TF-IDF for Keyword Extraction - 码农.KEN - cnblogs
http://www.cnblogs.com/ken-zhang/archive/2010/06/20/1761108.html
National Chiao Tung University OpenCourseWare (OCW)
http://ocw.nctu.edu.tw/course_detail_3.php?bgid=9&gid=0&nid=413&v1=82a09096121314b8298ca6a3259b732e24e5a073#.UdaPKz5NtcO



Daily Bookmarks 20130703_2


ALGORITHMIC ETUDES: Map-reduce for pairwise document similarity calculation
http://algoetudes.blogspot.tw/2012/12/map-reduce-for-pairwise-document.html
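[hadoop] A MapReduce implementation of k-means for clustering Chinese websites at scale (part 1) - lawrencesgj's column - Blog Channel - CSDN.NET
http://blog.csdn.net/lawrencesgj/article/details/8606532
[hadoop] A MapReduce implementation of k-means for clustering Chinese websites at scale (part 2) - lawrencesgj's column - Blog Channel - CSDN.NET
http://blog.csdn.net/lawrencesgj/article/details/8606570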








Saturday, June 29, 2013

Daily Bookmarks 20130629

Lessons Learned from Using HBase for Facebook Messages Storage | Binospace
http://www.binospace.com/index.php/hbase-zai-facebook-message-cun-chu-di-shi-yong-jing-yan-zong-jie/
[HBase]KeyValue and HFile create - 吊丝码农 - ITeye Tech Site
http://iwinit.iteye.com/blog/1827527
Th30z (Matteo Bertozzi Code): HBase I/O: HFile
http://th30z.blogspot.tw/2011/02/hbase-io-hfile.html
Getting the data distribution by parsing the HFile index structure_Hadoop and Distributed Data Processing_ITPUB Forum - it168's professional technical community
http://www.itpub.net/thread-1625291-1-1.html

Using HFile outside HBase at HUGUK #7 | Lanyrd
http://lanyrd.com/2010/huguk7/sxbw/


Methods for Fast URL Deduplication
http://www.360doc.com/content/08/1031/15/3500_1855560.shtml
An Introduction to and Comparison of Open-Source Web Crawlers - Bill's Blog
http://ibillxia.github.io/blog/2010/08/20/several-open-source-web-crawlers-comparing/
Web crawler design: the Bloom filter URL deduplication algorithm explained 02_cphmvp
http://cphmvp.diandian.com/post/2013-01-17/40046782422
A URL Deduplication System and Method for Distributed Web Crawlers - IP.com
http://ip.com/patfam/zh/47647145

Static cache: analyzing co-occurring terms in logs « Taobao Search Technology Blog
http://www.searchtb.com/2013/06/%e9%9d%99%e6%80%81cache%e4%b9%8blog%e5%85%b1%e7%8e%b0%e8%af%8d%e5%88%86%e6%9e%90.html?spm=0.0.0.0.efcrfI
From Di Renjie's Character Divination to Query Analysis at eTao: The Finale « Taobao Search Technology Blog
http://www.searchtb.com/2011/01/from-augur-to-etao-query-analysis.html?spm=0.0.0.0.iMCbQH
From Di Renjie's Character Divination to Query Analysis at eTao « Taobao Search Technology Blog
http://www.searchtb.com/2010/11/%e4%bb%8e%e7%8b%84%e4%bb%81%e6%9d%b0%e7%9a%84%e6%b5%8b%e5%ad%97%e5%8d%a0%e5%8d%9c%e5%88%b0%e4%b8%80%e6%b7%98%e7%bd%91%e7%9a%84query%e5%88%86%e6%9e%90.html?spm=0.0.0.0.iMCbQH


Friday, June 28, 2013

Daily Bookmarks 20130628

Building Web Apps in WebView | Android Developers
http://developer.android.com/guide/webapps/webview.html
Android Programming: A Simple Browser, WebView.
http://androidbiancheng.blogspot.tw/2010/01/webview.html
Weakapp's Memo: How to Use Android's WebView
http://weakapp0320.blogspot.tw/2013/04/android-webview-1.html
[Android] Passing Values from WebView to HTML - No 1105 - dotblogs
http://www.dotblogs.com.tw/joe11051105/archive/2013/04/14/101573.aspx
Using WebView in Android development (with a complete program) | App Development Notes
http://www.pocketdigi.com/20110216/176.html

Canned Platypus : Availability and Partition Tolerance
http://pl.atyp.us/wordpress/?p=2521
On Understanding the CAP Theorem Correctly
http://www.douban.com/group/topic/11765014/
About CAP - A Story @ MySQL DBA
http://www.orczhou.com/index.php/2010/05/all-about-cap-i-learn/
Brewer's CAP Theorem
http://www.julianbrowne.com/article/viewer/brewers-cap-theorem
Consistency | Xexex's Java and a Few Other Things
http://www.javaworld.com.tw/roller/ingramchen/entry/consistency
Why can't partition tolerance be sacrificed? Plus a plain-language explanation of partitions._Sina Qing Blog
http://qing.blog.sina.com.cn/tj/709d1dde33000bb7.html
Availability and Partition Tolerance - A computer guy's journal - NetEase Blog
http://hhw3.blog.163.com/blog/static/2690966201191442724418/
How to "Beat" the CAP Theorem
http://www.programmer.com.cn/9260/

[Repost] A Beginner's Primer on TF-IDF for Keyword Extraction - 码农.KEN - cnblogs
http://www.cnblogs.com/ken-zhang/archive/2010/06/20/1761108.html
[Share] Preventing duplicate Django form submissions with a decorator - 码农.KEN - cnblogs
http://www.cnblogs.com/ken-zhang/archive/2010/12/25/1916437.html


Friday, June 21, 2013

Taiwan Hadoop Forum • View topic - About the hadoop client
http://forum.hadoop.tw/viewtopic.php?t=18
Securing an Apache Hadoop Cluster Through a Gateway | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2008/12/securing-a-hadoop-cluster-through-a-gateway/

First experience with solr @ 不大會寫程式 :: Xuite Blog
http://blog.xuite.net/misgarlic/weblogic/30448629-solr+%E5%88%9D%E9%AB%94%E9%A9%97
Full article_A first look at the Solr full-text search server
http://newsletter.ascc.sinica.edu.tw/news/read_news.php?nid=2288


hadoop SecondNamenode explained in detail-qhw-ChinaUnix Blog
http://blog.chinaunix.net/uid-20577907-id-3524135.html

Thursday, June 20, 2013

Daily Bookmarks 20130620

Esse, of Something: n-gram, language, and other symbols
http://esse_tsyo.blogspot.tw/2010/10/n-gram.html
google n-gram
http://googleresearch.blogspot.tw/2006/08/all-our-n-gram-are-belong-to-you.html

2.4. Example Configurations
http://hbase.apache.org/book/example_config.html
HBase beginner's notes (4) -- installing and configuring a fully distributed HBase cluster - 林场 - cnblogs
http://www.cnblogs.com/ventlam/archive/2011/01/22/HBaseCluster.html
HBase import and export - _Deron_ - cnblogs
http://www.cnblogs.com/Deron/archive/2013/03/31/2981934.html

Things to watch for when writing MR jobs that run on HBase - Distributed Applications and Server Architecture column - Blog Channel - CSDN.NET
http://blog.csdn.net/chenyi8888/article/details/8646659

Jiankong.cn - a site offering website monitoring, remote server monitoring, and monitoring for SNMP, nginx, MySQL, and mail services
http://www.jiankong.cn/

Taobao Core Systems Team Blog | Beanstalkd, a high-performance distributed in-memory queue system
http://rdc.taobao.com/blog/cs/?p=1201

fxsjy/miniseg
https://github.com/fxsjy/miniseg
Yingzhitong (鹰之瞳)---automated network operations system---
https://www.yingzhitong.com/accounts/login/?next=/state/

dfs.datanode.failed.volumes.tolerated - Google Search
https://www.google.com.tw/search?q=dfs.datanode.failed.volumes.tolerated&ie=utf-8&oe=utf-8&aq=t&rls=org.mozilla:zh-TW:official&client=firefox-a
Hadoop parameter settings – hdfs-site.xml « Fenriswolf's Programming Notes
http://fenriswolf.me/2012/05/25/hadoop-%E5%8F%83%E6%95%B8%E8%A8%AD%E5%AE%9A-hdfs-site-xml/
What the hadoop configuration options mean (still being updated) - xiao晓 - cnblogs
http://www.cnblogs.com/serendipity/archive/2011/08/23/2151031.html


Thursday, June 06, 2013

Daily Bookmarks 20130606

Terminal Recording with script and scriptreplay command
http://sharadchhetri.com/2012/07/16/terminal-recording-script-scriptreplay-command/
Virtual Vocaloid Manager: How to log Linux terminal sessions
http://vocaloidmanager.blogspot.tw/2013/01/linux.html
How to use script and scriptreplay | OracleOnLinux
http://www.oracleonlinux.cn/2010/04/how-to-use-script-and-scriptreplay/
chunzi-blog-simple/chunzi-blog-posts/1171441019.html at master · chunzi/chunzi-blog-simple · GitHub
https://github.com/chunzi/chunzi-blog-simple/blob/master/chunzi-blog-posts/1171441019.html

How to Traverse a Directory Tree in Python - Guide to os.walk | Python Central
http://pythoncentral.org/how-to-traverse-a-directory-tree-in-python-guide-to-os-walk/
Python program to traverse directories and read file information - Stack Overflow
http://stackoverflow.com/questions/5421599/python-program-to-traverse-directories-and-read-file-information
filesystems - Directory listing in Python - Stack Overflow
http://stackoverflow.com/questions/120656/directory-listing-in-python

time - Timestamp Python - Stack Overflow
http://stackoverflow.com/questions/13890935/timestamp-python

Big data? Stop bluffing! Do we really need to blindly burn money chasing big data?-CSDN.NET
http://www.csdn.net/article/2013-05-14/2815268-most-data-isnt-big

Daily Bookmarks 20130530

CloudFront: Salesforce.com's Phoenix : SQL layer for your Hbase
http://cloudfront.blogspot.tw/2013/01/salesforcecoms-phoenix-sql-layer-for.html

Sorting in Hadoop Hive: Order by, Sort by, Distribute by, Cluster By - - ITeye Tech Site
http://metooxi.iteye.com/blog/1447621
alo.alt: Using Hive's HBase handler
http://mapredit.blogspot.tw/2012/12/using-hives-hbase-handler.html

hbase shell basics and common commands explained in detail 三江小渡 | 三江小渡
http://blog.pureisle.net/archives/1887.html


Daily Bookmarks 20130605

Writing shell scripts - Lesson 15: Errors and Signals and Traps (Oh My!) - Part 1
http://linuxcommand.org/wss0150.php


Tuesday, June 04, 2013

Daily Bookmarks 20130604

fcamel's Tech Notes: handling filenames that contain whitespace in shell scripts
http://fcamel-life.blogspot.tw/2011/08/shell-script.html
LinuxCommand.org: Tips, News And Rants: Using Configuration Files With Shell Scripts
http://lcorg.blogspot.tw/2010/06/using-configuration-files-with-shell.html


hadoop - Hive multiple insert goes wrong with the DISTINCT select statement - Stack Overflow
http://stackoverflow.com/questions/15173608/hive-multiple-insert-goes-wrong-with-the-distinct-select-statement

Ankit Jain's blog: Sqoop export and import commands
http://ankitasblogger.blogspot.tw/2012/01/sqoop-export-and-import-commands.html
Welcome to Kitsune’s documentation! — Kitsune master documentation
http://kitsune.readthedocs.org/en/latest/index.html#

HBase - Who needs a Master? : Apache HBase
https://blogs.apache.org/hbase/entry/hbase_who_needs_a_master
Hadoop HBase user's mailing list ()
http://comments.gmane.org/gmane.comp.java.hadoop.hbase.user/34592
database - How Row Key is designed in Hbase - Stack Overflow
http://stackoverflow.com/questions/16356491/how-row-key-is-designed-in-hbase
HBaseWD: Avoid RegionServer Hotspotting Despite Sequential Keys | Sematext Blog
http://blog.sematext.com/2012/04/09/hbasewd-avoid-regionserver-hotspotting-despite-writing-records-with-sequential-keys/
Introduction to hbase - Alibaba Group Data Platform alidata.org
http://www.alidata.org/archives/1509

Apache HBase Region Splitting and Merging | Hortonworks
http://hortonworks.com/blog/apache-hbase-region-splitting-and-merging/
Best Practices for Managing HBase in a High Write Environment | The AppFirst Blog
http://www.appfirst.com/blog/best-practices-for-managing-hbase-in-a-high-write-environment/
split - in HBase what will happen if a single row size exceeds region max size? - Stack Overflow
http://stackoverflow.com/questions/15828310/in-hbase-what-will-happen-if-a-single-row-size-exceeds-region-max-size
Some HBase tips - Change Dir - BlogJava good
http://www.blogjava.net/changedi/archive/2012/12/28/393577.html
Updating data in HBase - 天行健 - ITeye Tech Site
http://punishzhou.iteye.com/blog/1266341
The HBase get process (1) - 天行健 - ITeye Tech Site
http://punishzhou.iteye.com/blog/1258848



Lucene in 5 minutes - Lucene Tutorial.com
http://www.lucenetutorial.com/lucene-in-5-minutes.html
Salmon Run: Writing Lucene Records to SequenceFiles on HDFS
http://sujitpal.blogspot.tw/2012/03/writing-lucene-records-to-sequencefiles.html
hadoop - opening lucene index stored in hdfs - Stack Overflow
http://stackoverflow.com/questions/2763112/opening-lucene-index-stored-in-hdfs
Indexing and Searching on a Hadoop Distributed File System | Dr Dobb's
http://www.drdobbs.com/parallel/indexing-and-searching-on-a-hadoop-distr/226300241?pgno=1


Wednesday, May 29, 2013

Daily Bookmarks 20130529

Watch out for the performance of strptime in Python | 不沉之月
https://blog.lzhaohao.info/archive/performance-problem-with-strptime/
Hank to hanker - Learning Note: [Python] Time format conversion (strtime & strftime)
http://whhnote.blogspot.tw/2011/01/python-strtime-strftime.html

datetime - Iterating through a range of dates in Python - Stack Overflow
http://stackoverflow.com/questions/1060279/iterating-through-a-range-of-dates-in-python
LanguageManual UDF
https://cwiki.apache.org/Hive/languagemanual-udf.html
Passing arguments to a shell script
http://osr507doc.sco.com/en/OSUserG/_Passing_to_shell_script.html
Kick Start Hadoop: Include values during execution time in hive QL/ Dynamically substitute values in hive
http://kickstarthadoop.blogspot.tw/2011/10/include-values-during-execution-time-in.html
Hbase interact with shell
http://www.slideshare.net/shashwat2010/hbase-interact-with-shell
Integrating Hadoop Hive with Hbase - guisu, a programmer's life. - Blog Channel - CSDN.NET
http://blog.csdn.net/hguisu/article/details/7282050
HBase journey, part 2: interacting with HBase through the HBase Shell (reposted from: Taobao QA Team) - Lendfating's journal - NetEase Blog
http://lendfating.blog.163.com/blog/static/182074367201211193176286/

hive cannot find protobuf when creating an hbase table
http://abloz.com/2012/06/15/hive-execution-hbase-create-the-table-can-not-find-protobuf.html
HBase shell commands | Learn HBase
http://learnhbase.wordpress.com/2013/03/02/hbase-shell-commands/
Learning HBase Shell commands - 小学生 - ITeye Tech Site
http://smallboby.iteye.com/blog/1525735

Qcon Beijing: Do One Thing - 幸福收藏夹
http://sofish.de/2193
pytesser - OCR in Python using the Tesseract engine from Google - Google Project Hosting
http://code.google.com/p/pytesser/

Tuesday, May 28, 2013

Daily Bookmarks 20130528

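A complete record of installing hadoop, hbase, and hive - Seas_小庙's column - Blog Channel - CSDN.NET
http://blog.csdn.net/chengweipeng123/article/details/7174717
Notes on integrating hive with hbase - Energy Comes from Change! (Improvement) - ITeye Tech Site
http://heipark.iteye.com/blog/1150648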
View Source
http://dev.gbif.org/wiki/plugins/viewsource/viewpagesrc.action?pageId=2523151


HBase - Scans using filters from the Shell - General Dev - Confluence
http://dev.gbif.org/wiki/display/DEV/HBase+-+Scans+using+filters+from+the+Shell
Working with Mysql from Python - I am migle - ITeye Tech Site
http://migle.iteye.com/blog/573092
Tutorial: How to connect to MySQL with Python - Tutorials Blog
http://www.jeremymorgan.com/tutorials/python-tutorials/how-to-connect-to-mysql-with-python/


Querying content in hbase (1): hbase shell
http://abloz.com/2012/08/22/hbase-how-like-the-sql-like-query-value-as.html
Creating an external HBase table in hive
http://abloz.com/2012/07/19/create-the-hbase-an-external-table-in-the-hive.html

Upgrading the Hadoop Hive version « Hey! Linux.
http://heylinux.com/archives/2163.html

Pseudo-distributed installation and deployment of CDH4.2.1 and Impala [original hands-on notes] « Hey! Linux.
http://heylinux.com/archives/2456.html#more-2456
Integrating Impala with HBase - - ITeye Tech Site
http://yinhudongtian.iteye.com/blog/1758558

Architecture | Kiji Community - Build Real-Time Scalable Data Applications on Apache HBase
http://www.kiji.org/architecture


hive cannot find protobuf when creating an hbase table
http://abloz.com/2012/06/15/hive-execution-hbase-create-the-table-can-not-find-protobuf.html
Deploying Hive (including Hbase and Sqoop integration) - free9277 - ITeye Tech Site
http://free9277.iteye.com/blog/1847094
hive hbase exists table - Google Search
https://www.google.com.tw/search?q=hive+hbase+exists+table&spell=1&sa=X&ei=bSyjUd74BsrNkgXPw4CYCA&ved=0CC4QBSgA&biw=927&bih=537
Hive and HBase quickstart - Technical Translations - OSChina.NET
http://www.oschina.net/translate/hive-hbase-quickstart
Trend Micro CDC SPN Team | After a Region Server exits unexpectedly…
http://www.spnguru.com/2011/04/region-server%E6%84%8F%E5%A4%96%E9%80%80%E5%87%BA%E4%B9%8B%E5%90%8E/
Trend Micro CDC SPN Team | REST and authentication
http://www.spnguru.com/2011/10/rest_authentication/
5.5.2. RegionServer process down alert - Hortonworks Data Platform
http://docs.hortonworks.com/HDPDocuments/HDP1/HDP-1.2.3/bk_Monitoring_Hadoop_Book/content/monitor-chap3-6-5-2.html
HBase Administration, Performance Tuning | Packt Publishing
http://www.packtpub.com/article/hbase-basic-performance-tuning
hadoop+hive+hbase - 东杰书屋 - Blog Channel - CSDN.NET
http://blog.csdn.net/jiedushi/article/category/829246
Kick Start Hadoop: Hive Hbase integration/ Hive HbaseHandler : Common issues and resolution
http://kickstarthadoop.blogspot.tw/2012/05/hive-hbase-integration-common-issues.html
Three ways to add JAR files for custom udf, udaf, and udtf functions in hive - 东杰书屋 - Blog Channel - CSDN.NET
http://blog.csdn.net/jiedushi/article/details/8631895

hbase shell example - Google Search
https://www.google.com.tw/search?biw=927&bih=537&q=hbase+shell+example&oq=hbase+shell+e&gs_l=serp.3.0.0j0i30l9.1138.2321.0.3713.7.7.0.0.0.0.45.178.7.7.0...0.0.0..1c.1.12.serp.cN34gFAwGAA
hbase shell basics and common commands explained in detail 三江小渡 | 三江小渡
http://blog.pureisle.net/archives/1887.html
Hbase/Shell - Hadoop Wiki
http://wiki.apache.org/hadoop/Hbase/Shell
Learning HBase Shell commands - 小学生 - ITeye Tech Site
http://smallboby.iteye.com/blog/1525735
HBase shell commands | Learn HBase
http://learnhbase.wordpress.com/2013/03/02/hbase-shell-commands/

Quickly deploying a Hadoop big data environment with Ambari
http://www.uml.org.cn/sjjm/201305244.asp
alo.alt: Using Hive's HBase handler
http://mapredit.blogspot.tw/2012/12/using-hives-hbase-handler.html
Access HBase Data with Hive - Amazon Elastic MapReduce
http://docs.aws.amazon.com/ElasticMapReduce/latest/DeveloperGuide/emr-hbase-access-hive.html

Deploy code from Git using Puppet
http://livecipher.blogspot.tw/2013/01/deploy-code-from-git-using-puppet.html
git + fabric = awesome deploy team - Rumproarious
http://www.rumproarious.com/2010/09/01/git-fabric-awesome-deploy-team/
Easy Python Deployment with Fabric and Git at Mixpanel Engineering
http://code.mixpanel.com/2010/09/09/easy-python-deployment-with-fabric-and-git/
How I deploy my weekend projects with git and fabric - troebr
http://blog.troebr.net/post/37134036829/how-i-deploy-my-weekend-projects-with-git-and-fabric
Deploying Mezzanine: Fabric Git Vagrant Joy | BScientific
http://bscientific.org/blog/mezzanine-fabric-git-vagrant-joy/
Django / Python – Fabric Deployment Script and Example | Useful Stuff.
http://yuji.wordpress.com/2011/04/09/django-python-fabric-deployment-script-and-example/
How to use Fabric in a development environment ← Python For Beginners
http://www.pythonforbeginners.com/systems-programming/how-to-use-fabric-in-a-development-environment/
A Fabric function for git tagging | deployment, fabric, git, python | codeinthehole.com by David Winterbottom
http://codeinthehole.com/writing/a-fabric-function-for-git-tagging/
Python Deployment with Fabric
http://www.slideshare.net/andymccurdy/python-deployment-with-fabric


Liao Xuefeng's official website: Getting started with Spring
http://www.liaoxuefeng.com/article/000136177743223283f7b23bbd3ce64d91aec7b36d08524b58