Friday, July 05, 2013

Dairy Bookmarks 20130705

Understanding the Parallelism of a Storm Topology - Michael G. Noll
http://www.michael-noll.com/blog/2012/10/16/understanding-the-parallelism-of-a-storm-topology/
Record 格式
http://irc.ccu.edu.tw/tools/page/show_page.php?page_url=/Site/web/dir_517fb4349001b/article_517fb6d84d3b6.html
Efficiently Reading in and Iterating Through Large Files with Python ~ Optinalysis
http://www.nikhilgopal.com/2010/12/dealing-with-large-files-in-python.html

MogileFS 的介绍(MogileFS 系列1) 扶凯
http://www.php-oa.com/2010/09/26/perl-mogilefs-1.html
Data IAP Day 1
http://dataiap.github.io/dataiap/day4/
OReilly – Hadoop The Definitive Guide (06-2009) « Xu Fei's Blog
http://autofei.wordpress.com/2010/06/27/oreilly-hadoop-the-definitive-guide-06-2009/
Java Example Code using HBase Data Model Operations « Xu Fei's Blog
http://autofei.wordpress.com/2012/04/02/java-example-code-using-hbase-data-model-operations/


Wu Mamber (String Algorithms 2007)
http://www.slideshare.net/mailund/wu-mamber-string-algorithms-2007
Memory Dump | 基于后缀搜索的多模式匹配算法——Wu-Manber算法
https://memorycn.wordpress.com/2011/11/05/matching_algorithm_-_wu-manber_algorithm_based_on_the_the_suffix_search_of_multi-mode/


Pig Macro for TF-IDF Makes Topic Summarization 2 Lines of Pig | Hortonworks
http://hortonworks.com/blog/pig-macro-for-tf-idf-makes-topic-summarization-2-lines-of-pig/
(7) TF-IDF in 2 lines of code with Pig Macros - Hadoop, Data, and Systems - Quora
http://hadoop-data-systems.quora.com/TF-IDF-in-2-lines-of-code-with-Pig-Macros
The Brotherhood of coders: Document similarity using Hadoop
http://coderscreed.blogspot.tw/2012/12/document-similarity-using-hadoop.html
TF-IDF in Hadoop Part 3: Documents in Corpus and TFIDF Computation | Marcello de Sales' Blog
http://marcellodesales.wordpress.com/2010/01/10/tf-idf-in-hadoop-part-3-documents-in-corpus-and-tfidf-computation/

Quickstart — Flask 0.10.1 documentation
http://flask.pocoo.org/docs/quickstart/
flask-tumblelog/tumblelog/admin.py at master · rozza/flask-tumblelog · GitHub
https://github.com/rozza/flask-tumblelog/blob/master/tumblelog/admin.py
Write a Tumblelog Application with Flask and MongoEngine — MongoDB Manual 2.4.5
http://docs.mongodb.org/manual/tutorial/write-a-tumblelog-application-with-flask-mongoengine/



增强版《Hadoop数据分析平台》第八期(增加5周内容),约等于免费的逆向收费式网络培_Hadoop与分布式数据处理_ITPUB论坛-it168旗下专业技术社区
http://www.itpub.net/thread-1629863-1-1.html

HBaseWD: Avoid RegionServer Hotspotting Despite Sequential Keys | Sematext Blog
http://blog.sematext.com/2012/04/09/hbasewd-avoid-regionserver-hotspotting-despite-writing-records-with-sequential-keys/
电商推荐系统迷思
http://www.infoq.com/cn/presentations/electricity-supplier-recommendation-system-thinking
Bit.ly发布Forget-Table,解决非稳定类别分布问题
http://www.infoq.com/cn/news/2013/02/bitly-forget-table

演讲
http://www.infoq.com/cn/presentations/60
腾讯微博架构的成长过程
http://www.infoq.com/cn/presentations/tencent-blog-structure-growup
京东云存储服务和应用探索
http://www.infoq.com/cn/presentations/jingdong-cloud-storage-services-applications-explore
Partition-Tolerance - Google 搜尋
https://www.google.com.tw/search?q=Partition-Tolerance&source=lnt&tbs=lr:lang_1zh-CN%7Clang_1zh-TW&lr=lang_zh-CN%7Clang_zh-TW&sa=X&ei=QffMUePUL4avkgWO-4HICQ&ved=0CBYQpwUoAQ&biw=1264&bih=711
keyword tf idf - Google 搜尋
https://www.google.com.tw/search?q=keyword+tf+idf&ei=7lzNUe20FsavkgXv64CwCw&start=10&sa=N&biw=1264&bih=711
Keyword Extraction Based on tf/idf for Chinese News Document
http://d.wanfangdata.com.cn/Periodical_whdxxb-e200705030.aspx
【转】关键字提取算法之TF-IDF扫盲 - 码农.KEN - 博客园
http://www.cnblogs.com/ken-zhang/archive/2010/06/20/1761108.html
國立交通大學開放式課程(OpenCourseWare, OCW)
http://ocw.nctu.edu.tw/course_detail_3.php?bgid=9&gid=0&nid=413&v1=82a09096121314b8298ca6a3259b732e24e5a073#.UdaPKz5NtcO



No comments: