Thursday, November 03, 2011

Daily Bookmarks 20111103

The stringlib Library
http://effbot.org/zone/stringlib.htm
python - improving Boyer-Moore string search - Stack Overflow
http://stackoverflow.com/questions/1106112/improving-boyer-moore-string-search
elastic search, another good Lucene-based NoSQL project | summersmile1984's personal site
http://summersmile1984.i-branding.me/2011/03/31/elastic-search%E5%8F%88%E4%B8%80%E4%B8%AA%E5%9F%BA%E4%BA%8Elucene%E7%9A%84nosql%E5%A5%BD%E9%A1%B9%E7%9B%AE/
[projects] Contents of /python/trunk/Objects/stringlib/fastsearch.h
http://svn.python.org/view/python/trunk/Objects/stringlib/fastsearch.h?revision=77470&view=markup
Lucid Imagination » Exploring Lucene’s Indexing Code: Part 2
http://www.lucidimagination.com/blog/2009/03/18/exploring-lucenes-indexing-code-part-2/
Delve inside the Lucene indexing mechanism
http://www.ibm.com/developerworks/library/wa-lucene/

How to Index PDF Documents with Lucene | kalani's Tech blog
http://kalanir.blogspot.com/2008/08/indexing-pdf-documents-with-lucene.html
elasticsearch - - Open Source, Distributed, RESTful, Search Engine
http://www.elasticsearch.org/
Study notes 4.3 - Document filtering: Use Naive Bayes_土老冒_Baidu Space
http://hi.baidu.com/idontknow1987/blog/item/f36adcc5e5e87da48326ac4b.html
A brief introduction to installing and using PyLucene | 非鱼观点 - Internet Observation
http://www.unfish.net/archives/269-20080118.html
绚丽也尘埃 » PyLucene in Action
http://www.fuzhijie.me/?p=273
SourceForge.net: Benchmarks - clucene
http://sourceforge.net/apps/mediawiki/clucene/index.php?title=Benchmarks
Django and Lupy
http://www.rkblog.rk.edu.pl/w/p/django-lupy/

Xapian performance comparision with Whoosh « Searching with Xapian
http://xapian.wordpress.com/2009/02/12/xapian-performance-comparision-with-whoosh/

xapwrap - a Xapian PHP wrapper with Chinese search support - Google Project Hosting
http://code.google.com/p/xapwrap/


Building an index with Xapian (Python version) - System Architecture - python.cn(news, jobs)
http://simple-is-better.com/news/619


Stemming Algorithm - 荡气回肠,奔流不息 - tayoto - Hexun Blog
http://tayoto.blog.hexun.com/38957815_d.html


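Online demo | Chinese word segmentation | PHP Chinese word segmentation - a free, open-source simple Chinese word segmentation system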
http://www.ftphp.com/scws/demo.php

About xunsearch - 迅搜 (xunsearch) - a free, open-source Chinese full-text search engine
http://www.xunsearch.com/about

Zongheng Search (纵横搜索)
http://discuz.qq.com/service/search

Chinese word segmentation « 神仙的仙居
http://xiezhenye.com/tag/%E4%B8%AD%E6%96%87%E5%88%86%E8%AF%8D

Python Chinese word segmentation: pure-Python implementations / FMM algorithm / pymmseg-cpp / smallseg / judou 句读 / BECer-GAE - Miscellaneous - python.cn(news, jobs)
http://simple-is-better.com/news/387





