Sunday, July 28, 2013

Daily Bookmarks 20130728

Mozilla Firefox 開始頁
http://www.renren.com/268217599
收件匣 (9,207) - peicheng5@gmail.com - Gmail
https://mail.google.com/mail/u/0/?shva=1#inbox
http://www.g.cn/
http://www.g.cn/
新分頁
about:newtab
Facebook
https://www.facebook.com/
pymmesg - Google 搜尋
https://www.google.com.tw/search?q=pymmesg&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&gws_rd=cr
pymmseg - Google 搜尋
https://www.google.com.tw/search?client=firefox-a&hs=dz8&rls=org.mozilla:zh-TW:official&q=pymmseg&spell=1&sa=X&ei=NPfzUYHCOY6bkgWYroCgBg&ved=0CC4QvwUoAA&biw=1275&bih=725
pluskid/pymmseg-cpp
https://github.com/pluskid/pymmseg-cpp
pymmseg-cpp - Google 搜尋
https://www.google.com.tw/search?q=pymmseg-cpp&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&channel=rcs&gws_rd=cr
Python 中文分词:用纯python实现 / FMM 算法 / pymmseg-cpp / smallseg / judou 句读 / BECer-GAE
http://www.starming.com/index.php?action=plugin&v=wave&tpl=union&ac=viewgrouppost&gid=73&tid=13336
python 中文分词,安装 pymmseg - zhangxinrun的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/zhangxinrun/article/details/7525740
youngking/pymmseg
https://github.com/youngking/pymmseg
pymmseg-cpp - High performance Chinese word segmenting module for Python - Google Project Hosting
http://code.google.com/p/pymmseg-cpp/
改进Pymmseg分词功能 - frEefiS ' tHiNkinG
http://freefis.appspot.com/?p=111001
pymmseg-cpp - High performance Chinese word segmenting module for Python - Google Project Hosting
http://code.google.com/p/pymmseg-cpp/
pymmseg-cpp/demos/use_custom_dict.py at master · shuge/pymmseg-cpp
https://github.com/shuge/pymmseg-cpp/blob/master/demos/use_custom_dict.py
python 中文分词,安装 pymmseg - python - ITeye技术网站
http://ipython.iteye.com/blog/1136931
使用pymmseg进行中文分词 - 地瓜日记 - 博客园
http://www.cnblogs.com/sweetpotato-diary/archive/2012/03/20/2408941.html
python下的两个分词工具 | 旁门左道
http://log.medcl.net/item/2011/03/python%E4%B8%8B%E7%9A%84%E5%88%86%E8%AF%8D%E5%BA%93/
longest common subsequence spam detect - Google 搜尋
https://www.google.com.tw/search?q=longest+common+subsequence+spam+detect&client=firefox-a&hs=UC9&rls=org.mozilla:zh-TW:official&ei=UfrzUYHVMciGkgXT8IGoDg&start=10&sa=N&biw=1275&bih=725
新分頁
about:newtab
pymmseg-cpp pip - Google 搜尋
https://www.google.com.tw/search?client=firefox-a&hs=ico&rls=org.mozilla%3Azh-TW%3Aofficial&q=pymmseg-cpp+pip&oq=pymmseg-cpp+pip&gs_l=serp.3...1709.3855.0.4117.4.4.0.0.0.0.93.351.4.4.0....0...1c.1.22.serp..3.1.93.iL5Fw559ofk
http://autodaguo-python.googlecode.com/svn/trunk/mybot.txt
http://autodaguo-python.googlecode.com/svn/trunk/mybot.txt
python list modules - Google 搜尋
https://www.google.com.tw/search?q=python+list+modules&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&gws_rd=cr
Get a list of installed Python modules - Stack Overflow
http://stackoverflow.com/questions/739993/get-a-list-of-installed-python-modules
新酷音 dict - Google 搜尋
https://www.google.com.tw/search?q=%E6%96%B0%E9%85%B7%E9%9F%B3+dict&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&gws_rd=cr
Re: [閒聊] 新酷音可不可以不要有內建詞彙 - 看板 IME - 批踢踢實業坊
http://www.ptt.cc/bbs/IME/M.1241690936.A.43A.html
http://svn.openfoundry.org/libchewingdata/readme.html
http://svn.openfoundry.org/libchewingdata/readme.html
新酷音共享詞庫
http://hyperrate.com/thread.php?tid=21020
pymmseg-cpp 繁體 - Google 搜尋
https://www.google.com.tw/search?q=pymmseg-cpp+%E7%B9%81%E9%AB%94&client=firefox-a&hs=Kh9&rls=org.mozilla:zh-TW:official&ei=yQH0UfKDDMKrkAWaoIDoCw&start=10&sa=N&biw=1275&bih=725
中文分词实战与文言文分词的初步设想 | 京華煙云
http://www.yenching.org/2009/10/%e4%b8%ad%e6%96%87%e5%88%86%e8%af%8d%e5%ae%9e%e6%88%98%e4%b8%8e%e6%96%87%e8%a8%80%e6%96%87%e5%88%86%e8%af%8d%e7%9a%84%e5%88%9d%e6%ad%a5%e8%ae%be%e6%83%b3/
Free Mind » Blog Archive » RMMSeg: Ruby 实现中文分词
http://lifegoo.pluskid.org/?p=261
新酷音 字典 - Google 搜尋
https://www.google.com.tw/search?q=%E6%96%B0%E9%85%B7%E9%9F%B3+%E5%AD%97%E5%85%B8&client=firefox-a&hs=g8o&rls=org.mozilla:zh-TW:official&ei=ZgP0Ub6UEIWokQXIqYHgBQ&start=10&sa=N&biw=1275&bih=700
TWed2k - 心得教學區 - [發現]新酷音注音修改教學
http://058176049149.ctinets.com/viewthread.php?action=printable&tid=290870
新酷音詞庫及注音修改教學
http://chewing.csie.net/chewing_dict_edit.html
新酷音詞庫及注音修改教學
http://chewing.csie.net/chewing_dict_edit.html
libchewing-data/utf-8/tsi.src at master · chewing/libchewing-data
https://github.com/chewing/libchewing-data/blob/master/utf-8/tsi.src
pluskid/pymmseg-cpp
https://github.com/pluskid/pymmseg-cpp
grep 非 打頭 - Google 搜尋
https://www.google.com.tw/search?q=grep+%E9%9D%9E+%E6%89%93%E9%A0%AD&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&gws_rd=cr
正則運算式之道 - just do it - 中國經濟網 經濟部落格
http://big5.ce.cn/gate/big5/blog.ce.cn/html/33/100933-55717.html
高鐵 - Yahoo!奇摩新聞搜尋結果
http://tw.news.search.yahoo.com/search;_ylt=A8tUwYGHB_RRyk8AoElr1gt.?p=%E9%AB%98%E9%90%B5&fr=ush-globalnews&fr2=piv-web
北高1,630元 高鐵最快10月調漲 - Yahoo!奇摩新聞
http://tw.news.yahoo.com/%E5%8C%97%E9%AB%981-630%E5%85%83-%E9%AB%98%E9%90%B5%E6%9C%80%E5%BF%AB10%E6%9C%88%E8%AA%BF%E6%BC%B2-213000245.html
新詞發現 最常共同子串 - Google 搜尋
https://www.google.com.tw/search?q=%E6%96%B0%E8%A9%9E%E7%99%BC%E7%8F%BE+%E6%9C%80%E5%B8%B8%E5%85%B1%E5%90%8C%E5%AD%90%E4%B8%B2&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&gws_rd=cr
基于大规模语料的新词发现算法
http://www.programmer.com.cn/12276/
LCS 新詞發現 - Google 搜尋
https://www.google.com.tw/search?q=LCS+%E6%96%B0%E8%A9%9E%E7%99%BC%E7%8F%BE&client=firefox-a&hs=LWp&rls=org.mozilla:zh-TW:official&ei=IAn0UayAHYnQkgWSkYEw&start=10&sa=N&biw=1275&bih=700
基于大规模语料的新词发现算法 - - 博客频道 - CSDN.NET
http://blog.csdn.net/qyee16/article/details/7741975
基于选择倾向性的词汇获取方法_百度文库
http://wenku.baidu.com/view/3d091d65783e0912a2162a24.html
Longest common subsequence 大規模 - Google 搜尋
https://www.google.com.tw/search?client=firefox-a&hs=JC&rls=org.mozilla%3Azh-TW%3Aofficial&channel=rcs&q=Longest+common+subsequence+%E5%A4%A7%E8%A6%8F%E6%A8%A1&oq=Longest+common+subsequence+%E5%A4%A7%E8%A6%8F%E6%A8%A1&gs_l=serp.3...1801.7287.0.7595.30.21.5.0.0.1.162.2057.15j6.21.0....0...1c.1.22.serp..25.5.210.1E4DjYTcN7o
基于大规模语料的新词发现算法 - - 博客频道 - CSDN.NET
http://blog.csdn.net/qyee16/article/details/7741975
http://sewm.pku.edu.cn/TianwangLiterature/Report/NCIS_TR_2007012.pdf
http://sewm.pku.edu.cn/TianwangLiterature/Report/NCIS_TR_2007012.pdf
抽取 公共子串 - Google 搜尋
https://www.google.com.tw/search?q=%E6%8A%BD%E5%8F%96+%E5%85%AC%E5%85%B1%E5%AD%90%E4%B8%B2&client=firefox-a&rls=org.mozilla:zh-TW:official&ei=Jwz0UbnUHpCmkgWB74GYDQ&start=60&sa=N&biw=1275&bih=700
求多个字符串的最大公共子串---后缀数组 - gdp5211314的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/gdp5211314/article/details/8362678
从diff到LCS(Longestcommonsubsequence),抽象之美-python-电脑编程网
http://biancheng.dnbcw.info/python/170358.html
[coreseek/sphinx学习笔记4]--搜索 - iLovePHP - 开源中国社区
http://my.oschina.net/wzwitblog/blog/109997
相似数据检测算法
http://www.douban.com/note/180296814/
Karp-Rabin - Google 搜尋
https://www.google.com.tw/search?q=Karp-Rabin&lr=lang_zh-CN%7Clang_zh-TW&client=firefox-a&hs=d1U&rls=org.mozilla:zh-TW:official&channel=rcs&tbs=lr:lang_1zh-CN%7Clang_1zh-TW&ei=sQv0UZbsN8flkAWXqIEo&start=10&sa=N&biw=1275&bih=700
Karp-Rabin algorithm
http://www-igm.univ-mlv.fr/~lecroq/string/node5.html
sequential extraction of common substrings - Google 搜尋
https://www.google.com.tw/search?q=sequential+extraction+of+common+substrings&ie=utf-8&oe=utf-8&rls=org.mozilla:zh-TW:official&client=firefox-a&channel=rcs&gws_rd=cr
基于统计的无词典的高频词抽取(二)——根据LCP数组计算词频 - 三度空间 - 博客园
http://www.cnblogs.com/three-zone/p/LCP.html
基于统计的无词典的高频词抽取(一)——后缀数组字典序排序 - 脚本百事通
http://www.csdn123.com/html/blogs/20130614/22454.htm
抽取 共子串 - Google 搜尋
https://www.google.com.tw/search?q=%E6%8A%BD%E5%8F%96+%E5%85%B1%E5%AD%90%E4%B8%B2&client=firefox-a&rls=org.mozilla:zh-TW:official&ei=eA_0Uf-JO8iXkwWW1ICABw&start=10&sa=N&biw=1275&bih=700
http://ir.dlut.edu.cn/ThesisList%5C2009%5C韩冰-大规模文本去重策略研究.pdf
http://ir.dlut.edu.cn/ThesisList%5C2009%5C%E9%9F%A9%E5%86%B0-%E5%A4%A7%E8%A7%84%E6%A8%A1%E6%96%87%E6%9C%AC%E5%8E%BB%E9%87%8D%E7%AD%96%E7%95%A5%E7%A0%94%E7%A9%B6.pdf