Thursday, October 27, 2011

Daily Bookmarks 20111027

ik-analyzer - java开源中文分词器 - Google Project Hosting
http://code.google.com/p/ik-analyzer/
mmseg4j - MMSEG for java lucene chinese analyzer, or for solr - Google Project Hosting
http://code.google.com/p/mmseg4j/
hadoop的1TB排序 - NoSQLFan - 关注NoSQL相关技术、新闻
http://blog.nosqlfan.com/html/417.html
MongoDB在盛大大数据量下的应用 - NoSQLFan - 关注NoSQL相关技术、新闻
http://blog.nosqlfan.com/html/3315.html
Twitter同步人人脚本(Updated at 2010-04-12) | 一阁Blog
http://yegle.net/2010/04/12/php-script-synchronizing-twitter-to-renren-updated-version/
说说MMSeg分词 - bqrm_521(小奎) - 博客园
http://www.cnblogs.com/bqrm/archive/2008/08/16/1269258.html
py-instantse - Python instant search module for quora-like website - Google Project Hosting
http://code.google.com/p/py-instantse/
Apache Lucy FAQ
http://incubator.apache.org/lucy/faq.html
利用Xapian构建自己的搜索引擎:Xapian简介 - O-O Sharp - 博客频道 - CSDN.NET
http://blog.csdn.net/visualcatsharp/article/details/4176083
[转载]大数据量,海量数据 处理方法总结(转载)_cheriec_新浪博客
http://blog.sina.com.cn/s/blog_4d3a41f40100ic9d.html

No comments: