Tuesday, January 31, 2012

Daily Bookmarks 20120131

python | Robin nice blog
http://robin.sh/html/tag/python
LoveBridge内容抓取脚本开发完成 | Robin
http://robin.sh/html/802_lovebridge-grap.html
用python将文本转成图片 _ PlanABC – 怿飞’s Blog
http://www.planabc.net/2011/05/28/convert_text_intoimages_in_python/
[python]生成简单的包含文字的图片(验证码) « Mozillazg's Blog
http://mozillazg.wordpress.com/2011/08/27/python-identifying-code/
给你想要
http://keyinfo2u.net/
北京的春天(二) (20图:2008年4月初)_旅游攻略_艺龙旅游博客
http://trip.elong.com/u/4686131/b012j3q7.html
北京大学的初春景色 | 图有其表
http://www.tuyouqibiao.com/archives/2816.html
PHP获取来自搜索引擎入站的关键词
http://blog.summerfly.cn/PHP_get_search_keyword.html
搜索研发部官方博客 » Blog Archive » “分布式哈希”和“一致性哈希”的概念与算法实现
http://stblog.baidu-tech.com/?p=42
搜索研发部官方博客 » Blog Archive » 多模匹配算法与dictmatch实现
http://stblog.baidu-tech.com/?p=418
搜索研发部官方博客 » Blog Archive » 搜索背后的奥秘——浅谈语义主题计算
http://stblog.baidu-tech.com/?p=1190
搜索研发部官方博客 » Blog Archive » 超级负载均衡
http://stblog.baidu-tech.com/?p=845


e

Monday, January 30, 2012

Daily Bookmarks 20120129

solr 实现去掉重复的搜索结果,打SOLR-236_collapsing.patch补丁 - Bory.Chan
http://blog.chenlb.com/2009/04/apply-solr-collapsing-patch-remove-duplicate-result.html
解决Wordpress博客搜索结果出现的重复的标题标记|技术贵在折腾
http://www.budeyan.com/tech_notes/duplicate-title-tags/
Apache Solr 实现去掉重复的搜索结果 - johnnyhg - ITeye技术网站
http://johnnyhg.iteye.com/blog/1236001
搜尋引擎對重複性內容的4個盲點
http://skenyeh.blogspot.com/2011/03/search-engine-blind-spot-for-duplicate.html
SharePoint Search Duplicate Records, 重复搜索结果 - JohnsonWong - 博客园
http://www.cnblogs.com/johnsonwong/archive/2011/03/31/2001207.html
MOSS 2007 : Duplicate Search Results - Random thoughts on tips and tricks with SharePoint! By John Pradeep - Site Home - TechNet Blogs
http://blogs.technet.com/b/jpradeep/archive/2010/09/29/moss-2007-duplicate-search-results.aspx
Google+(baidu bing youdao) | “四核”搜索引擎 for Greasemonkey
http://userscripts.org/scripts/show/66903







e

Saturday, January 28, 2012

Daily Bookmarks 20120128

WordPress on dotCloud | Blue
http://www.kdblue.com/2011/07/wordpress-dotcloud/
試用 dotcloud - gugod's blog
http://gugod.org/2011/05/-dotcloud.html
Sina App Engine分布式网页抓取服务 – FetchURL - KICCP Blog
http://blog.kiccp.com/84.html
Dotcloud云平台安装wordpress博客 - Kai Blog - ITeye技术网站
http://w3kiccp.iteye.com/blog/1106094
用python和redis打造短网址服务 | Life is A Highway dotcloud
http://amazingjxq.com/?p=488
Dotcloud 实现无密码 ssh操作(scp等) | using ssh(scp…) under dotcloud without password | McKelvin's Blog
http://blog.mckelv.in/?p=740
帶著筆記學程式 » Dotcloud 中佈署 PHP+MySQL 應用
http://studio.zeuik.com/?p=1085
理解HTTP – 缓存 – 飞纯技术
http://blog.ftao.org/2009/11/01/understanding-http-cache/
海量文档查同或聚类问题 -- Locality Sensitive Hash 算法 - fxjwind - 博客园
http://www.cnblogs.com/fxjwind/archive/2011/07/05/2098642.html


e

Monday, January 23, 2012

Daily Bookmarks 20120123

BBMAO社会化搜索引擎_互动百科
http://www.hudong.com/wiki/BBMAO%E7%A4%BE%E4%BC%9A%E5%8C%96%E6%90%9C%E7%B4%A2%E5%BC%95%E6%93%8E
bbmao的神秘配方:打破中文聚類搜索的低迷_數字商業時代_雜誌頻道_新浪網-北美
http://magazine.sina.com/bg/commercialage/200704/20070410/11317349.html
子猴博客 » Carrot2:用胡萝卜来聚类
http://www.zihou.me/html/2011/01/27/2759.html
Oreilly.Python.Cookbook.2nd.edition.Jun.2005.eBook-LiB.pdf - stid - Python Cookbook - Personal Project for stid - Google Project Hosting
http://code.google.com/p/stid/downloads/detail?name=Oreilly.Python.Cookbook.2nd.edition.Jun.2005.eBook-LiB.pdf
Index of /static/books/python/
http://slav0nic.org.ua/static/books/python/
python多logging写日志 - 会说话的狗
http://www.quou.cn/archives/446
Lab 14 - Git Immersion - Brought to you by EdgeCase
http://gitimmersion.googol.im/lab_14.html#main_content
git 101 – git的物件模型 | Ricky's murmur...
http://jcliang.twgogo.org/267/git-101-git%E7%9A%84%E7%89%A9%E4%BB%B6%E6%A8%A1%E5%9E%8B



e

Saturday, January 21, 2012

Daily Bookmarks 20120121

Clustering Data using Python – Prashanth Ellina
http://blog.prashanthellina.com/2009/07/25/clustering-data-using-python/
Python implementation of the K-means clustering algorithm
http://pandoricweb.tumblr.com/post/8646701677/python-implementation-of-the-k-means-clustering
Data Mining in Python
http://www.stat.columbia.edu/~jakulin/orng/
Python Course: Text Classification in Python
http://www.python-course.eu/text_classification_python.php
k-means (python实现)算法-python编程 - Site name
http://cq-999.appspot.com/cms/show_article/140001.html
python 聚类算法_bicloud_新浪博客
http://blog.sina.com.cn/s/blog_61c463090100ljv0.html
漫谈 Clustering (1): k-means-Python
http://www.flatws.cn/article/program/python/2011-03-01/14898.html
mxdxm博客 - 聚类搜索引擎分类文章列表 - ITeye技术网站
http://webcache.googleusercontent.com/search?q=cache:4QV-Cqt8CBoJ:mxdxm.iteye.com/category/65826+&cd=27&hl=zh-TW&ct=clnk&client=firefox-a
聚类式搜索引擎(ClusterSE)的设计与实现 | @yongPassion 的个人博客
http://www.yongblog.com/archives/62.html
聚类搜索引擎的对象、功能及算法_相关资讯_成都SEO培训基地 good site
http://www.my-seo.com.cn/news/ecommerce/470.html
Python标准模块logging - fxjwind - 博客园
http://www.cnblogs.com/fxjwind/archive/2011/07/05/2098648.html






e

Wednesday, January 18, 2012

Daily Bookmarks 20120118

MIT OpenCourseWare | Electrical Engineering and Computer Science | 6.006 Introduction to Algorithms, Spring 2008 | Lecture Notes
http://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-006-introduction-to-algorithms-spring-2008/lecture-notes/
MIT OpenCourseWare | Electrical Engineering and Computer Science | 6.006 Introduction to Algorithms, Spring 2008 | Lecture Notes
http://ocw.mit.edu/ans7870/6/6.006/s08/lecturenotes/dd_dict.htm
(1) Lec1
http://www.slideshare.net/allufarp/lec1-8300566
Using Python to detect the most frequent words in a file
http://programmingzen.com/2008/03/18/use-python-to-detect-the-most-frequent-words-in-a-file/
Papers of interest in Entity Mat
http://astro.temple.edu/~joejupin/papers_of_interest_in_entity_mat.htm
Data Structures: Hashtables - Programming in Python
http://sites.google.com/site/usfcomputerscience/hashtables
Clustering Data using Python – Prashanth Ellina
http://blog.prashanthellina.com/2009/07/25/clustering-data-using-python/





e

Saturday, January 14, 2012

Daily Bookmarks 20120114

Counting Unique Words with Python « Purple Saguaro
http://yakinikuman.wordpress.com/2010/10/27/counting-unique-words-with-python/
information retrieval - Python script to find word frequencies of a given document - Stack Overflow
http://stackoverflow.com/questions/7480000/python-script-to-find-word-frequencies-of-a-given-document
Sorting a list by frequency of letter in python (decreasing order) - Stack Overflow
http://stackoverflow.com/questions/7961629/sorting-a-list-by-frequency-of-letter-in-python-decreasing-order
Word frequency count using python - Stack Overflow
http://stackoverflow.com/questions/4088265/word-frequency-count-using-python
Using Python to detect the most frequent words in a file
http://programmingzen.com/2008/03/18/use-python-to-detect-the-most-frequent-words-in-a-file/
Count word occurrences in a string - RefactorMyCode.com
http://refactormycode.com/codes/176-count-word-occurrences-in-a-string
语料库词频统计程序 - Tony-woo - 博客园
http://www.cnblogs.com/Tony-woo/archive/2007/11/13/958452.html
TOP k算法 - javaeye - ITeye技术网站
http://mxdxm.iteye.com/blog/1124935
怎样从10亿查询词找出出现频率最高的10个 | 董的博客
http://dongxicheng.org/big-data/select-ten-from-billions/
Redis源码研究—哈希表 | 董的博客
http://dongxicheng.org/nosql/redis-code-hashtable/
Building a 5 Star Rating System with jQuery, AJAX and PHP | Nettuts+
http://net.tutsplus.com/tutorials/html-css-techniques/building-a-5-star-rating-system-with-jquery-ajax-and-php/
fnotes - an open source project like Evernote ! - Google Project Hosting
http://code.google.com/p/fnotes/
3rgbcom - All source of 3rgb.com - Google Project Hosting
http://code.google.com/p/3rgbcom/
将web.py官方网站上的博客,加上分页和注册,登录程序。-七七巴巴黄页网
http://www.qy7788.com.cn/shiyongxinxi/shiyongxinxi33.html
orz-l
http://orz-l.com/
Git 版本控制系統 (1) | ihower { blogging }
http://ihower.tw/blog/archives/2591
jQuery liScroll - a jQuery News Ticker
http://www.gcmingati.net/wordpress/wp-content/lab/jquery/newsticker/jq-liscroll/scrollanimate.html
http://tw.nextmedia.com/2012president/web/SpryAssets/SpryCollapsiblePanel.js
jQuery Rater Star Plugin
http://www.raychou.com/labs/rater-star/
api.txt - fnotes - an open source project like Evernote ! - Google Project Hosting
http://code.google.com/p/fnotes/source/browse/doc/api.txt
Python and AJAX tutorial for beginners with web.py and jQuery « Kooneiform
http://kooneiform.wordpress.com/2010/02/28/python-and-ajax-for-beginners-with-webpy-and-jquery/
#128: debug mode Not supported Simplified Chinese - Issues - webpy/webpy - GitHub
https://github.com/webpy/webpy/issues/128#issuecomment-3273610
蛋疼脚本:Baidu直播贴终结者 | -猪之哀伤的Blog-
http://www.zlovezl.cn/articles/26/
NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context - Google 搜尋
https://www.google.com/search?q=NVRM%3A+os_schedule%3A+Attempted+to+yield+the+CPU+while+in+atomic+or+interrupt+context&ie=utf-8&oe=utf-8&aq=t&rls=org.mozilla:zh-TW:official&client=firefox-a
巨型網站的分散式架構設計 (雲端運算的基礎) | InspireGate 派克空間
http://inspire.twgg.org/c/internet/host-setting/mega-site-distributed-architecture-design-based-on-cloud-computing.html
關聯式資料庫 vs. Key-Value 資料庫 | InspireGate 派克空間
http://inspire.twgg.org/c/programming/other/relational-database-vs-key-value-database.html
Category » nosql « @ Zu
http://webcache.googleusercontent.com/search?q=cache:mDsQpaSE4OAJ:blog.donews.com/zuaa/archive/category/nosql+&cd=8&hl=zh-TW&ct=clnk&client=firefox-a
MongoDB的一些觀念 - 單純的資訊年代- 點部落
http://www.dotblogs.com.tw/sungnoone/archive/2012/01/11/65274.aspx
"Berkeley DB"数据库的优点和不足之处 - 井长 - Jason Yu
http://jasonyu.cn/post/254/
Berkeley DB并发数据存储编程 - 井长 - Jason Yu
http://jasonyu.cn/post/253/
Berkeley DB:网站数据缓存方案测试 » 超群.com的博客
http://www.fuchaoqun.com/2009/01/bdb-cache/
Berkeley DB 由浅入深【转自架构师杨建】@leeon | 分享未来 - 互联网技术
http://leeon.me/a/Berkeley-DB-note












e

Thursday, January 12, 2012

Daily Bookmarks 20120112

波哥的IT私房菜: 解決 /usr/bin/ld: cannot find -lxxx 問題
http://i-pogo.blogspot.com/2010/01/usrbinld-cannot-find-lxxx.html
[CentOS] /usr/bin/ld: skipping incompatible - Grokbase
http://grokbase.com/t/centos.org/centos/2008/03/centos-usr-bin-ld-skipping-incompatible/06jr6d5v6qz6qurgjoqrhi3zttna
linux - How to create a library from boost? - Stack Overflow
http://stackoverflow.com/questions/4277581/how-to-create-a-library-from-boost
mgrep.c - 一个关于多模匹配算法的实现� 源代码在线阅读 - HackChina.com
http://www.hackchina.com/r/124330/mgrep.c__html
這次的作業是用 Perl 寫一個程式 mgrep 去整理 mail folder, 程式需有
http://www.csie.nctu.edu.tw/~jjyang/course/sysadm/hw3.html
/trunk/kbs_bbs/libBBS/mgrep.c – KBS
http://trac.kcn.cn/kbs/browser/trunk/kbs_bbs/libBBS/mgrep.c?rev=10168
跨行搜索脚本:mgrep at A Geek’s Page
http://wangcong.org/blog/archives/333
vimgtd-在vim(gvim)中实现GTD时间管理!【本博原创插件】 | Vimer的程序世界
http://www.vimer.cn/2011/06/vimgtd-%e5%9c%a8vimgvim%e4%b8%ad%e5%ae%9e%e7%8e%b0gtd%ef%bc%81%e3%80%90%e6%9c%ac%e5%8d%9a%e5%8e%9f%e5%88%9b%e6%8f%92%e4%bb%b6%e3%80%91.html

第三十一讲:在Android中解析XML « { Android学习指南 }
http://android.yaohuiji.com/archives/935
节约内存:Instagram 的 Redis 实践 - 系统架构 - python.cn(news, jobs)
http://simple-is-better.com/news/764
石頭閒語:How to write a program ran in GDM screen - 樂多日誌
http://blog.roodo.com/rocksaying/archives/11625175.html
内存数据库简单实现_风之力量007_新浪博客 good
http://blog.sina.com.cn/s/blog_54384df80100erjv.html
内存数据库简单实现 - python版_风之力量007_新浪博客
http://blog.sina.com.cn/s/blog_54384df80100et1r.html
Python | 弱类型
http://troycheng.blogcn.com/articles/tag/python
关于内存数据库 - oldworm - C++博客 Good site !!!!
http://www.cppblog.com/oldworm/archive/2011/01/21/139015.html
淘宝核心系统团队博客 | VoltDB内存数据库分析
http://webcache.googleusercontent.com/search?q=cache:-90YtYi7JsMJ:rdc.taobao.com/blog/cs/%3Fp%3D1360+&cd=13&hl=zh-TW&ct=clnk&lr=lang_zh-CN%7Clang_zh-TW&client=firefox
python字符串匹配工具性能比较 | 弱类型
http://troycheng.blogcn.com/articles/python%e5%ad%97%e7%ac%a6%e4%b8%b2%e5%8c%b9%e9%85%8d%e5%b7%a5%e5%85%b7%e6%80%a7%e8%83%bd%e6%af%94%e8%be%83.html
提高 Python 程序的运行速度 - 杂项其他 - python.cn(news, jobs)
http://simple-is-better.com/news/729
基于Redis架构的短信平台系统 « 美味儿blog
http://blog.meiweier.com/2011/01/06/redis-based-sms-system.html
Neopythonic: Sorting a million 32-bit integers in 2MB of RAM using Python
http://neopythonic.blogspot.com/2008/10/sorting-million-32-bit-integers-in-2mb.html
py-instantse:一个问答网站的实时搜索功能后台实现 | 弱类型
http://troycheng.blogcn.com/articles/py-instantse%ef%bc%9a%e4%b8%80%e4%b8%aa%e9%97%ae%e7%ad%94%e7%bd%91%e7%ab%99%e7%9a%84%e5%ae%9e%e6%97%b6%e6%90%9c%e7%b4%a2%e5%8a%9f%e8%83%bd%e5%90%8e%e5%8f%b0%e5%ae%9e%e7%8e%b0.html
Python persistence « I gotta have my orange juice.
http://scottmoonen.com/2004/10/01/python-persistence/
YouTube 架构学习体会 - 淘米部落 - 博客园
http://www.cnblogs.com/tmywu/archive/2012/01/11/2319070.html
I am LAZY bones ? : 给python增加IPC模块
http://luy.li/2009/06/12/python_ipc/
深刻理解Linux进程间通信(IPC)_hi,Python,what can I do for u?_百度空间 good site
http://hi.baidu.com/wangruiqi2008/blog/item/06efd0ce42f48f0f92457e4d.html

碎碎念 :: TAGS::Vim进阶索引[7] :: August :: 2007
http://blah.blogsome.com/2007/08/04/vim_tut_tags/#tags_005fsec4
程式碼可以用tag方式: 將vim當作source insight 來使用 | 易春木
http://eeepage.info/tag-vim-source-insight/
Efficient python folding - Fold python code nicely and toggle with one keystroke : vim online
http://www.vim.org/scripts/script.php?script_id=1494
Python and vim: Make your own IDE | tail -f findings.out
http://dancingpenguinsoflight.com/2009/02/python-and-vim-make-your-own-ide/
配置vim Python IDE 开发环境_opwpo-ChinaUnix博客
http://blog.chinaunix.net/space.php?uid=25719044&do=blog&id=3026457
Tips: Vim as Python IDE « life.py
http://lifepy.wordpress.com/2010/10/17/tips-vim-as-python-ide/
笨狗又一窝 » nginx/php/检索折腾记 python 內存 daemon Good site
http://www.yewen.us/blog/2011/09/nginxphp%e6%a3%80%e7%b4%a2%e6%8a%98%e8%85%be%e8%ae%b0/
A simple unix/linux daemon in Python - Lone Wolves - Web, game, and open source development
http://www.jejik.com/articles/2007/02/a_simple_unix_linux_daemon_in_python/
在 Linux 上寫你的 Daemon @ XiaoA :: 痞客邦 PIXNET ::
http://moiamond.pixnet.net/blog/post/26253048-%E5%9C%A8-linux-%E4%B8%8A%E5%AF%AB%E4%BD%A0%E7%9A%84-daemon
Tom's Blog at McLaren Labs | … systematize, synthesize, explain …
http://www.tsheffler.com/blog/
在python中使用memcached --- Python-memcached - LemonLi - 博客园
http://www.cnblogs.com/pylemon/archive/2011/11/18/2253942.html
Windows下Memcached的安装和使用(Java) | Tanky Woo
http://www.wutianqi.com/?p=3046





e

Wednesday, January 11, 2012

Daily Bookmarks 20120111

PHP利用HTTP_X_FORWARDED_FOR抓取訪客ip @ 無呈現的網路筆記 :: 痞客邦 PIXNET ::
http://chenshin0719.pixnet.net/blog/post/12185952-php%E5%88%A9%E7%94%A8http_x_forwarded_for%E6%8A%93%E5%8F%96%E8%A8%AA%E5%AE%A2ip
使用HTTP_X_FORWARDED_FOR获取客户端IP的严重后果 - Kingthy - 博客园
http://www.cnblogs.com/kingthy/archive/2007/11/24/970783.html
伪造HTTP头X-Forwarded-For分析 | 87年
http://87year.info/2011/03/09/%E4%BC%AA%E9%80%A0http%E5%A4%B4x-forwarded-for%E6%9D%A5%E5%88%B7%E7%A5%A8/
用Python指定HTTPConnection的出口IP(specify outgoing ip) | observer专栏杂记
http://obmem.info/?p=249
使用python爬虫抓站的一些技巧总结:进阶篇 | observer专栏杂记
http://obmem.info/?p=753
python爬虫抓站技巧 - 泉水 - ITeye技术网站
http://chengjunflying.iteye.com/blog/1097913
urllib2模块学习笔记_隐去............._百度空间
http://hi.baidu.com/554151688/blog/item/945f67df3ab014d48c102918.html
pydanny: Python HTTP Requests for Humans
http://pydanny.blogspot.com/2011/05/python-http-requests-for-humans.html
【菜鸟求教】如何用python去除一个字符串中间的空格~~~ - 人人网,在这里谈论Python,Python的公共主页论坛帖子
http://page.renren.com/600772649/group/333672425
Get yourself into a Python cPickle | TechRepublic
http://www.techrepublic.com/article/get-yourself-into-a-python-cpickle/1052190
从HTML文件中抽取正文的简单方案 zz [Python俱乐部]
http://www.pythonclub.org/python-files/html-body
Manso Trick: simple strip HTML tags using Python » Codigo Manso
http://www.codigomanso.com/en/2010/09/truco-manso-eliminar-tags-html-en-python/
life is short - you need Python!: Strip HTML tags using Python
http://love-python.blogspot.com/2008/07/strip-html-tags-using-python.html








e

Tuesday, January 10, 2012

Daily Bookmarks 20120110

J2 Labs - Speed tests for *json and cPickle in Python
http://j2labs.tumblr.com/post/4262756632/speed-tests-for-json-and-cpickle-in-python
ZoomQuiet / bottle-simple-todo / wiki / HowThink — Bitbucket
https://bitbucket.org/ZoomQuiet/bottle-simple-todo/wiki/HowThink
Python之 pickle cPickle - dreaming架构师,深入理论,适当实践的一年 - ITeye技术网站
http://shuofenglxy.iteye.com/blog/930657
python模块之bsddb: bdb高性能嵌入式数据库 1.基础知识 - zhaowei的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/zhaoweikid/article/details/1665741
13.2 cPickle -- A faster pickle
http://docs.python.org/release/2.5/lib/module-cPickle.html
Python 列表(list)操作 [Python俱乐部]
http://www.pythonclub.org/python-basic/list
PyMOTW: pickle & cPickle — PyMOTW Document v1.6 documentation
http://pymotwcn.readthedocs.org/en/latest/documents/pickle.html
My Thoughts on NoSQL - Eric Florenzano's Blog
http://eflorenzano.com/blog/2009/07/21/my-thoughts-nosql/
Richard Jones | Anti-RDBMS: A list of distributed key-value stores
http://www.metabrew.com/article/anti-rdbms-a-list-of-distributed-key-value-stores
淘宝核心系统团队博客 | Tair
http://rdc.taobao.com/blog/cs/?cat=8
豆瓣网架构-国内python语言网站的王者之路_方法总比问题多,加油,契合 加减分_百度空间
http://hi.baidu.com/%CB%B9%CB%B9%C6%AF%D2%C6/blog/item/f52a6745be78a62e8694734f.html
海盗河马
http://www.i7xh.com/category/python/
[翻译]案例学习:仅使用Redis+PHP设计实现一个简单的Twitter - 后端那些事 - 微风实验室
http://dev.meettea.com/show-100-1.html
A case study: Design and implementation of a simple Twitter clone using only the Redis key-value store as database and PHP – Redis
http://redis.io/topics/twitter-clone
Python 操作BDB (1) - - ITeye技术网站
http://hammer-nail.iteye.com/blog/468022
[Python+fastcgi]第一次接触flup « Initiative
http://initiative.yo2.cn/archives/465391
Relational DB vs. Key-Value store - 智障大师 的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/historyasamirror/article/details/4135859
NoSQL数据库笔谈
http://sebug.net/paper/databases/nosql/Nosql.html
KeyValue DB之redis - blade2001的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/blade2001/article/details/5807744
memlink - Key-List Nosql System - Google Project Hosting
http://code.google.com/p/memlink/
InfoQ: 天涯新款key-list类型内存数据引擎——Memlink
http://www.infoq.com/cn/news/2010/11/tianya-memlink
Benchmark - memlink - memlink性能测试、与redis,mysql的性能测试对比 - Key-List Nosql System - Google Project Hosting
http://code.google.com/p/memlink/wiki/Benchmark

Cooper Maa: Google Chart API 教學
http://coopermaa2nd.blogspot.com/2011/01/google-chart-api.html
Google Chart API 中文版 - 开发者指南 - 中国asp之家
http://www.aspxhome.com/chm/google-chart-api-help/Google-Chart-API-Cn.htm#chtt
Google提供的各种统计图生成API - 闲暇时光~
http://autumn-sea.appspot.com/page/agphdXR1bW4tc2VhcgsLEgRCbG9nGOF2DA
google chart中如何显示中文 - 记录与PHP的PK经历
http://www.pkphp.com/2009/08/31/google-chart-show-chinese/
Google Chart API 學習筆記 - 電腦網路技術 - jsGears.com 技術論壇 - AJAX, JavaScript, jQuery, 網站開發, 前端效能優化 - Powered by Discuz!
http://jsgears.com/thread-180-1-1.html
開發人員指南 - Google Chart API - Google Code
http://code.google.com/intl/zh-TW/apis/chart/image/





e

Monday, January 09, 2012

Daily Bookmarks 20120109

pickleDB - a simple and extremely lightweight key-value database for Python. : Python
http://www.reddit.com/r/Python/comments/mkvm4/pickledb_a_simple_and_extremely_lightweight/
pickleDB - simple key-value database
http://packages.python.org/pickleDB/
dikeva - DiKeVa - Distributed Key - Value database written in Python - Google Project Hosting
http://code.google.com/p/dikeva/
NoSQL: Distributed and Scalable Non-Relational Database Systems | Linux Magazine
http://www.linux-mag.com/id/7579/
PersistenceTools - PythonInfo Wiki
http://wiki.python.org/moin/PersistenceTools
python模块之bsddb: bdb高性能嵌入式数据库 1.基础知识 - zhaowei的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/zhaoweikid/article/details/1665741
Caveats of Evaluating Databases - plok
http://jan.prima.de/plok/archives/176-Caveats-of-Evaluating-Databases.html
读取E680(i,g)的native.db文件可可熊的窝
http://cocobear.info/blog/2009/01/13/data-of-e680-native-db/

用python格式化css文件 - zhaowei的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/zhaoweikid/article/details/1663212
Sass - Syntactically Awesome Stylesheets
http://sass-lang.com/
Phrozn - Static Site Generator for PHP
http://www.phrozn.info/en/
Checko's Bookmarks
http://checko.soup.io/tag/Python
化整為零的次世代網頁開發標準: WSGI | 程式設計 遇上 小提琴
http://blog.ez2learn.com/2010/01/27/introduction-to-wsgi/
Joel on Software - 邊開火邊移動
http://chinesetrad.joelonsoftware.com/Articles/FireAndMotion.html
web.py
http://www.longtask.com/blog/?tag=web-py




e

Sunday, January 08, 2012

Daily Bookmarks 20120108

讯速分布式实时搜索引擎系统(ithunder)
http://ithunder.org/
Indexes Of /
http://books.ithunder.org/
分享个在research中用到wikipedia的好帮手 | Super.jiju's space
http://superjiju.wordpress.com/2010/01/28/%e5%88%86%e4%ba%ab%e4%b8%aa%e5%9c%a8research%e4%b8%ad%e7%94%a8%e5%88%b0wikipedia%e7%9a%84%e5%a5%bd%e5%b8%ae%e6%89%8b/
Fedora12(i386) 安装chrome浏览器 问题查找与解决 - 北方人 - 博客园
http://www.cnblogs.com/hunterfu/archive/2010/03/01/1675948.html
Record linkage - Wikipedia, the free encyclopedia
http://en.wikipedia.org/wiki/Record_linkage
entity deduplication python - Google 搜尋
http://www.google.com.tw/search?q=entity++deduplication+python&hl=zh-TW&prmd=imvns&ei=O6AJT4qBNYvnmAXd2sCTAQ&start=10&sa=N&biw=1235&bih=712
FirteX 首页
http://www.firtex.org/

商品匹配问题求解_百度知道
http://zhidao.baidu.com/question/307289081.html?seed=0
盒子比价网,数码产品比价,新蛋、京东、卓越、易迅、绿森、锐意、淘宝、飞虎、高鸿、网邻、库巴、华强、苏宁、国美、当当、一号店、欧酷、新七天
http://www.box-z.com/
博客_wycg1984的空间_百度空间
http://hi.baidu.com/wycg1984/blog/index/1
Identity resolution - Wikipedia, the free encyclopedia
http://en.wikipedia.org/wiki/Identity_resolution
基于 Apache Mahout 构建社会化推荐引擎【链接】 - 佘亮 - 博客园
http://www.cnblogs.com/wycg1984/archive/2010/04/27/1722407.html
佘亮 - 博客园
http://www.cnblogs.com/wycg1984/default.aspx?page=18
[totti's blog] 命名是件麻煩的事
http://totti-yang.blogspot.com/
Bayesian identity resolution | Larsblog
http://www.garshol.priv.no/blog/217.html
Febrl - Freely extensible biomedical record linkage
http://datamining.anu.edu.au/software/febrl/febrldoc/
Papers of interest in Entity Mat
http://astro.temple.edu/~joejupin/papers_of_interest_in_entity_mat.htm
Bayesian inference - Wikipedia, the free encyclopedia
http://en.wikipedia.org/wiki/Bayesian_inference
Duplicates « Another Word For It
http://tm.durusau.net/?cat=142

http://duke.googlecode.com/hg-history/0a73bb60cc9b701859e7aedcfab2c5c84c8a690a/tst.py






e

Saturday, January 07, 2012

Daily Bookmarks 20120106

Machine Learning :: Text feature extraction (tf-idf) – Part I | Pyevolve
http://pyevolve.sourceforge.net/wordpress/?p=1589
Gensim – Topic Modelling for Humans — gensim
http://radimrehurek.com/gensim/index.html
Latent semantic indexing - Wikipedia, the free encyclopedia
http://en.wikipedia.org/wiki/Latent_Semantic_Indexing
tf–idf - Wikipedia, the free encyclopedia
http://en.wikipedia.org/wiki/Tf%E2%80%93idf
similarity - How to implement "related articles?" - Stack Overflow
http://stackoverflow.com/questions/2363213/how-to-implement-related-articles

Map Reduce: A really simple introduction « Kaushik Sathupadi
http://ksat.me/map-reduce-a-really-simple-introduction-kloudo/
Dumb programming techniques to avoid... - Tim Bull
http://timbull.com/dumb-programming-techniques-to-avoid#comment
MapReduce « Programming Praxis
http://programmingpraxis.com/2009/10/06/mapreduce/
PeteSearch: MapReduce for Idiots
http://petewarden.typepad.com/searchbrowser/2010/01/mapreduce-for-idiots.html
PeteSearch: How to speed up massive data set analysis by eliminating disk seeks
http://petewarden.typepad.com/searchbrowser/2010/01/how-to-speed-up-massive-data-set-analysis-by-eliminating-disk-seeks.html
How MapReduce Works » StephenChan's Tech Space
http://blog.endlesscode.com/2010/06/24/how-mapreduce-works/#more-934
who | Disco Project
http://discoproject.org/who
分享网络2.0 - 我们只关注最具有Web2.0气质的早期创业项目
http://www.showeb20.com/

jQuery UI Bootstrap - 一个受Twitter Bootstrap启发的主题 - OPEN开发经验库
http://www.open-open.com/lib/view/open1325685779609.html
New Twitter Design with CSS and JQuery.
http://www.9lessons.info/2010/10/new-twitter-design-css-jquery.html
Twitter like Youtube Video URL Expanding using PHP | HostingCouponZ.com
http://hostingcouponz.com/twitter-like-ui-expand-url-using-php-and-jquery/
jQuery+PHP实现浏览更多内容-Helloweba-致力于WEB前端技术在中国的应用
http://www.helloweba.com/view-blog-130.html
页面内容排序插件jSort的使用-Helloweba-致力于WEB前端技术在中国的应用
http://www.helloweba.com/view-search-144.html
图片延迟加载技术-Lazyload的应用-Helloweba-致力于WEB前端技术在中国的应用
http://www.helloweba.com/view-blog-151.html
使用phpQuery轻松采集网页内容-Helloweba-致力于WEB前端技术在中国的应用
http://www.helloweba.com/view-search-133.html
jQuery+PHP+Mysql实现输入自动完成提示的功能-Helloweba-致力于WEB前端技术在中国的应用
http://www.helloweba.com/view-search-145.html
PHP实现时间轴函数-Helloweba-致力于WEB前端技术在中国的应用
http://www.helloweba.com/view-blog-60.html

e

Tuesday, January 03, 2012

Daily Bookmarks 20120103

python多线程读取文件 - 也就这样,
http://type.so/python/multi-thread-read-file.html
Python multiple threads accessing same file - Stack Overflow
http://stackoverflow.com/questions/2301458/python-multiple-threads-accessing-same-file
Scribe's N.E.W. Studio://Blogger: 用 Python 跑 Multi-threading 版 PNG 最佳化
http://blog.new-studio.org/2009/02/python-multi-threading-png-optimize.html
Simple Python: a job queue with threading | [ themattreid ]
http://themattreid.com/wordpress/2011/01/20/simple-python-a-job-queue-with-threading/
Multi-threading in Python | Artful Code good
http://www.artfulcode.net/articles/multi-threading-python/
queue - python -> multiprocessing module - Stack Overflow
http://stackoverflow.com/questions/3586723/python-multiprocessing-module
Python多线程学习(二、线程的同步) - Sam的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/lazy_tiger/article/details/3870145
tempfile – Create temporary filesystem resources. - Python Module of the Week
http://www.doughellmann.com/PyMOTW/tempfile/
Understanding Threading in Python LG #107
http://linuxgazette.net/107/pai.html
Python Performance Part 1 « Kurt Seifried Good thread
http://kurt.seifried.org/2010/05/31/python-performance-part-1/

here
Python threads synchronization: Locks, RLocks, Semaphores, Conditions, Events and Queues | Laurent Luce's Blog Good Good
http://www.laurentluce.com/posts/python-threads-synchronization-locks-rlocks-semaphores-conditions-events-and-queues/

Amazon.com Browse Node Lookup
http://www.browsenodeinfo.com/US/541966
e

Monday, January 02, 2012

Daily Bookmarks 20120102

Python写爬虫——抓取网页并解析HTML – 尘埃落定
http://www.lovelucy.info/python-crawl-pages.html
Simple Text Links to Amazon.com
http://www.a2sdeveloper.com/page-simple-text-links-to-amazoncom.html#node
Electric Duncan: Async Batching with Twisted: A Walkthrough
http://oubiwann.blogspot.com/2008/06/async-batching-with-twisted-walkthrough.html
使用python爬虫抓站的一些技巧总结:进阶篇 | observer专栏杂记
http://obmem.info/?p=753
Find Browse Nodes for Electronics at Amazon.com - FindBrowseNodes.com
http://www.findbrowsenodes.com/us/Electronics/493964
Optimizing C++ - the WWW version
http://www.steveheller.com/opt/mail.htm
Optimizing C++ - the WWW version Zensort: A Sorting Algorithm for Limited Memory
http://www.steveheller.com/opt/zensort.htm

Handling urllib2's timeout? - Python - Stack Overflow
http://stackoverflow.com/questions/2712524/handling-urllib2s-timeout-python
urllib2.urlopen超时的问题 | yanghao's blog
http://yanghao.org/blog/archives/81
8.10. Queue — A synchronized queue class — Python v2.7.2 documentation
http://docs.python.org/library/queue.html#Queue.Queue.join

Anonymous Surfing & Free Proxy List
http://proxy-list.org/en/index.php
紧急赶制的Python抓取爬虫 | Alex's blog Good
http://liyanyan.net/?p=401
Python写爬虫抓站的一些技巧 | 岭南六少 - 一朵在LAMP架构下挣扎的云
http://blog.chedushi.com/archives/1249
Examples — Eventlet v0.9.16.dev documentation
http://eventlet.net/doc/examples.html
smpctl - Google 搜尋
https://www.google.com/search?hl=zh-TW&client=firefox&hs=sVO&rls=org.mozilla:zh-TW:official&sa=X&ei=ibMBT8z2DMWNmQXz6KSjAg&ved=0CBgQvgUoAA&q=smpctl&nfpr=1&biw=1280&bih=876
sitecopy,一个用于山寨网站UI的脚本 - 代码发芽网
http://fayaa.com/code/view/15377/
Python写爬虫抓站的一些技巧 | 岭南六少 - 一朵在LAMP架构下挣扎的云
http://blog.chedushi.com/archives/1249
用Python实现常见排序算法 | 岭南六少 - 一朵在LAMP架构下挣扎的云
http://blog.chedushi.com/archives/2412
PHP & Python & Etc | Alex's blog
http://liyanyan.net/?cat=5
设置VIM为Python开发环境 | Alex's blog good
http://liyanyan.net/?p=1225
gentoo配置lighttpd | Alex's blog
http://liyanyan.net/?p=222
Python使用gzip模块 | Alex's blog
http://liyanyan.net/?p=20

Visual jQuery 1.2.6
http://visualjquery.com/
Queue – A thread-safe FIFO implementation - Python Module of the Week
http://www.doughellmann.com/PyMOTW/Queue/


e

Sunday, January 01, 2012

Daily Bookmarks 20120101

桶排序用于海量数据排序的实验。 - luuillu的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/luuillu/article/details/6544476
程序员编程艺术:第十章、如何给10^7个数据量的磁盘文件排序 - 结构之法 算法之道 - 博客频道 - CSDN.NET
http://blog.csdn.net/v_JULY_v/article/details/6451990
【排序结构6】 桶排序 - 爪哇人 - ITeye技术网站 good
http://hxraid.iteye.com/blog/647759
Java集合类(4) —— 介绍HashSet - 爪哇人 - ITeye技术网站 Good
http://hxraid.iteye.com/blog/448884
海量数据面试题整理 - 好工具站长分享平台
http://www.haogongju.net/art/1160312
从海量数据中找出中位数 (10G级别)
http://www.tengfei.biz/algorithm/32--10g.html
Bin Sort——桶排序_lucy_新浪博客
http://blog.sina.com.cn/s/blog_614316190100ei83.html

Daily Bookmarks 20111231

mincemeat.py: MapReduce on Python
http://remembersaurus.com/mincemeatpy/
Implementing MapReduce with multiprocessing - Python Module of the Week
http://www.doughellmann.com/PyMOTW/multiprocessing/mapreduce.html
Writing An Hadoop MapReduce Program In Python @ Michael G. Noll
http://www.michael-noll.com/tutorials/writing-an-hadoop-mapreduce-program-in-python/
Finding Similar Items with Amazon Elastic MapReduce, Python, and Hadoop Streaming : Articles & Tutorials : Amazon Web Services
http://aws.amazon.com/articles/2294
Outgoing: MapReduce
http://outgoing.typepad.com/outgoing/2005/04/mapreduce.html
Parallel MapReduce in Python in Ten Minutes « Cvet's Blog
http://mikecvet.wordpress.com/2010/07/02/parallel-mapreduce-in-python/
用Python来写MapReduce的实际应用程序 - 云计算架构师-解占辉 - 51CTO技术博客
http://jeffxie.blog.51cto.com/1365360/515668