Wednesday, May 29, 2013

Daily Bookmarks 20130529

注意Python中strptime的效率问题 | 不沉之月
https://blog.lzhaohao.info/archive/performance-problem-with-strptime/
Hank to hanker - Learning Note: [Python] 時間格式轉換(strtime & strftime)
http://whhnote.blogspot.tw/2011/01/python-strtime-strftime.html

datetime - Iterating through a range of dates in Python - Stack Overflow
http://stackoverflow.com/questions/1060279/iterating-through-a-range-of-dates-in-python
LanguageManual UDF
https://cwiki.apache.org/Hive/languagemanual-udf.html
Passing arguments to a shell script
http://osr507doc.sco.com/en/OSUserG/_Passing_to_shell_script.html
Kick Start Hadoop: Include values during execution time in hive QL/ Dynamically substitute values in hive
http://kickstarthadoop.blogspot.tw/2011/10/include-values-during-execution-time-in.html
Hbase interact with shell
http://www.slideshare.net/shashwat2010/hbase-interact-with-shell
Hadoop Hive与Hbase整合 - guisu,程序人生。 - 博客频道 - CSDN.NET
http://blog.csdn.net/hguisu/article/details/7282050
HBase之旅二:通过HBase Shell与HBase交互(转自:Taobao QA Team) - Lendfating的日志 - 网易博客
http://lendfating.blog.163.com/blog/static/182074367201211193176286/

hive 执行hbase创建表时找不到protobuf
http://abloz.com/2012/06/15/hive-execution-hbase-create-the-table-can-not-find-protobuf.html
HBase shell commands | Learn HBase
http://learnhbase.wordpress.com/2013/03/02/hbase-shell-commands/
HBase Shell命令学习 - 小学生 - ITeye技术网站
http://smallboby.iteye.com/blog/1525735

Qcon 北京:做一件事 - 幸福收藏夹
http://sofish.de/2193
pytesser - OCR in Python using the Tesseract engine from Google - Google Project Hosting
http://code.google.com/p/pytesser/

Tuesday, May 28, 2013

Daily Bookmarks 20130528

hadoop,hbase,hive安装全记录 - Seas_小庙的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/chengweipeng123/article/details/7174717
hive集成hbase笔记 - 能量源于改变!(改善) - ITeye技术网站
http://heipark.iteye.com/blog/1150648
View Source
http://dev.gbif.org/wiki/plugins/viewsource/viewpagesrc.action?pageId=2523151


HBase - Scans using filters from the Shell - General Dev - Confluence
http://dev.gbif.org/wiki/display/DEV/HBase+-+Scans+using+filters+from+the+Shell
用Python操作Mysql - I am migle - ITeye技术网站
http://migle.iteye.com/blog/573092
Tutorial: How to connect to MySQL with Python - Tutorials Blog
http://www.jeremymorgan.com/tutorials/python-tutorials/how-to-connect-to-mysql-with-python/


hbase的内容查询(1) hbase shell
http://abloz.com/2012/08/22/hbase-how-like-the-sql-like-query-value-as.html
在hive中创建HBase外部表
http://abloz.com/2012/07/19/create-the-hbase-an-external-table-in-the-hive.html

升级Hadoop Hive的版本 « Hey! Linux.
http://heylinux.com/archives/2163.html

伪分布式安装部署CDH4.2.1与Impala[原创实践] « Hey! Linux.
http://heylinux.com/archives/2456.html#more-2456
Impala整合HBase - - ITeye技术网站
http://yinhudongtian.iteye.com/blog/1758558

Architecture | Kiji Community - Build Real-Time Scalable Data Applications on Apache HBase
http://www.kiji.org/architecture


hive 执行hbase创建表时找不到protobuf
http://abloz.com/2012/06/15/hive-execution-hbase-create-the-table-can-not-find-protobuf.html
Hive部署(包括集成Hbase和Sqoop) - free9277 - ITeye技术网站
http://free9277.iteye.com/blog/1847094
hive hbase exists table - Google 搜尋
https://www.google.com.tw/search?q=hive+hbase+exists+table&spell=1&sa=X&ei=bSyjUd74BsrNkgXPw4CYCA&ved=0CC4QBSgA&biw=927&bih=537
Hive 和 HBase 的快速入门 - 技术翻译 - 开源中国 OSChina.NET
http://www.oschina.net/translate/hive-hbase-quickstart
Trend Micro CDC SPN Team | Region Server意外退出之后…
http://www.spnguru.com/2011/04/region-server%E6%84%8F%E5%A4%96%E9%80%80%E5%87%BA%E4%B9%8B%E5%90%8E/
Trend Micro CDC SPN Team | REST和认证
http://www.spnguru.com/2011/10/rest_authentication/
5.5.2. RegionServer process down alert - Hortonworks Data Platform
http://docs.hortonworks.com/HDPDocuments/HDP1/HDP-1.2.3/bk_Monitoring_Hadoop_Book/content/monitor-chap3-6-5-2.html
HBase Administration, Performance Tuning | Packt Publishing
http://www.packtpub.com/article/hbase-basic-performance-tuning
hadoop+hive+hbase - 东杰书屋 - 博客频道 - CSDN.NET
http://blog.csdn.net/jiedushi/article/category/829246
Kick Start Hadoop: Hive Hbase integration/ Hive HbaseHandler : Common issues and resolution
http://kickstarthadoop.blogspot.tw/2012/05/hive-hbase-integration-common-issues.html
hive中添加自定义udf udaf udtf等函数的jar文件的三种方法 - 东杰书屋 - 博客频道 - CSDN.NET
http://blog.csdn.net/jiedushi/article/details/8631895

hbase shell example - Google 搜尋
https://www.google.com.tw/search?biw=927&bih=537&q=hbase+shell+example&oq=hbase+shell+e&gs_l=serp.3.0.0j0i30l9.1138.2321.0.3713.7.7.0.0.0.0.45.178.7.7.0...0.0.0..1c.1.12.serp.cN34gFAwGAA
hbase shell基础和常用命令详解三江小渡 | 三江小渡
http://blog.pureisle.net/archives/1887.html
Hbase/Shell - Hadoop Wiki
http://wiki.apache.org/hadoop/Hbase/Shell
HBase Shell命令学习 - 小学生 - ITeye技术网站
http://smallboby.iteye.com/blog/1525735
HBase shell commands | Learn HBase
http://learnhbase.wordpress.com/2013/03/02/hbase-shell-commands/

使用Ambari快速部署Hadoop大数据环境
http://www.uml.org.cn/sjjm/201305244.asp
alo.alt: Using Hive's HBase handler
http://mapredit.blogspot.tw/2012/12/using-hives-hbase-handler.html
Access HBase Data with Hive - Amazon Elastic MapReduce
http://docs.aws.amazon.com/ElasticMapReduce/latest/DeveloperGuide/emr-hbase-access-hive.html

Deploy code from Git using Puppet
http://livecipher.blogspot.tw/2013/01/deploy-code-from-git-using-puppet.html
git + fabric = awesome deploy team - Rumproarious
http://www.rumproarious.com/2010/09/01/git-fabric-awesome-deploy-team/
Easy Python Deployment with Fabric and Git at Mixpanel Engineering
http://code.mixpanel.com/2010/09/09/easy-python-deployment-with-fabric-and-git/
How I deploy my weekend projects with git and fabric - troebr
http://blog.troebr.net/post/37134036829/how-i-deploy-my-weekend-projects-with-git-and-fabric
Deploying Mezzanine: Fabric Git Vagrant Joy | BScientific
http://bscientific.org/blog/mezzanine-fabric-git-vagrant-joy/
Django / Python – Fabric Deployment Script and Example | Useful Stuff.
http://yuji.wordpress.com/2011/04/09/django-python-fabric-deployment-script-and-example/
How to use Fabric in a development environment ← Python For Beginners
http://www.pythonforbeginners.com/systems-programming/how-to-use-fabric-in-a-development-environment/
A Fabric function for git tagging | deployment, fabric, git, python | codeinthehole.com by David Winterbottom
http://codeinthehole.com/writing/a-fabric-function-for-git-tagging/
Python Deployment with Fabric
http://www.slideshare.net/andymccurdy/python-deployment-with-fabric


廖雪峰的官方网站 Spring入门
http://www.liaoxuefeng.com/article/000136177743223283f7b23bbd3ce64d91aec7b36d08524b58



Thursday, May 23, 2013

Daily Bookmarks 20130523

Setting up a custom domain with Pages · GitHub Help
https://help.github.com/articles/setting-up-a-custom-domain-with-pages
New GitHub Pages domain: github.io
https://github.com/blog/1452-new-github-pages-domain-github-io


Configuration — Beaker 1.6.4 documentation
http://beaker.readthedocs.org/en/latest/configuration.html#cookie-domain-config
Using the Session — SQLAlchemy 0.8 Documentation
http://docs.sqlalchemy.org/en/rel_0_8/orm/session.html#simple-vertical-partitioning



浅谈CSRF攻击方式 - hyddd - 博客园
http://www.cnblogs.com/hyddd/archive/2009/04/09/1432744.html

老生常谈session,cookie的区别,安全性

http://blog.51yip.com/php/938.html
web集群时利用memcache来同步session
http://blog.51yip.com/php/931.html
web集群时session同步的3种方法
http://blog.51yip.com/server/922.html
PHP 实现多服务器共享SESSION 数据 | vicenteforever
http://www.vicenteforever.com/2011/11/php-web-share-session

redis session 存储 同步«海底苍鹰(tank)博客
http://blog.51yip.com/cache/1434.html


批量Load到HBase _人人IT網
http://rritw.com/a/JAVAbiancheng/JAVAzonghe/20130426/346816.html
MapReduce生成HFile入库到HBase | 石头儿
http://shitouer.cn/2013/02/hbase-hfile-bulk-load/
HBase数据迁移(2)- 使用bulk load 工具从TSV文件中导入数据 - ImportNew
http://www.importnew.com/3645.html


flask-login not sure how to make it work using sqlite3 | Verious
http://www.verious.com/qa/flask-login-not-sure-how-to-make-it-work-using-sqlite3/

Terse Words: Flask Extensions For Authorization with Examples
http://terse-words.blogspot.tw/2011/06/flask-extensions-for-authorization-with.html
flask-login not sure how to make it work using sqlite3 | Verious
http://www.verious.com/qa/flask-login-not-sure-how-to-make-it-work-using-sqlite3/
Write a Tumblelog Application with Flask and MongoEngine — MongoDB Manual 2.4.3
http://docs.mongodb.org/manual/tutorial/write-a-tumblelog-application-with-flask-mongoengine/
Python Flask MongoDB User Authentication | Verious
http://www.verious.com/qa/python-flask-mongo-db-user-authentication/
python - Flask user authentication - Stack Overflow
http://stackoverflow.com/questions/6972999/flask-user-authentication
Understanding Flask-Login Tokens | The Circuit Nerd Blog - Electronics, Programming, Nerd Stuff
http://blog.thecircuitnerd.com/flask-login-tokens/




Daily Bookmarks 20130522


Nginx 配置 SSL 证书 + HTTPS 站点小记 - 走点路博客
http://zou.lu/nginx-https-ssl-module/
nginx ssl的安装和配置
http://blog.51yip.com/apachenginx/974.html


[Oracle][Performance]善用Materialized View提高查詢效能#1 簡介 - RiCo技術農場- 點部落
http://www.dotblogs.com.tw/ricochen/archive/2009/09/21/10726.aspx


Tuesday, May 21, 2013

Daily Bookmarks 20130521


bulk-load装载hdfs数据到hbase小结 - 蓝色时分 - ITeye技术网站
http://koven2049.iteye.com/blog/982831
hbase的bulk load一个小改造 - 小泥巴的玩伴 - 博客频道 - CSDN.NET
http://blog.csdn.net/jingling_zy/article/details/7330420
hbase的bulk load一个小改造(续) - 小泥巴的玩伴 - 博客频道 - CSDN.NET
http://blog.csdn.net/jingling_zy/article/details/7462353

bulk load关于分隔符的问题 - 小泥巴的玩伴 - 博客频道 - CSDN.NET
http://blog.csdn.net/jingling_zy/article/details/7260978

MapReduce生成HFile入库到HBase及源码分析三江小渡 | 三江小渡
http://blog.pureisle.net/archives/1950.html

HBase 之HFileOutputFormat - 一路看
http://www.16kan.com/post/185734.html

MapReduce生成HFile入库到HBase | 石头儿
http://shitouer.cn/2013/02/hbase-hfile-bulk-load/




用于大数据的并查集(基于HBase)的java类三江小渡 | 三江小渡
http://blog.pureisle.net/archives/2033.html
Hadoop实战(Chuck Lam)三江小渡 | 三江小渡
http://blog.pureisle.net/archives/1710.html



hadoop和hbase节点添加和单独重启 - 一路看
http://www.16kan.com/post/185739.html
怎么停止和重新启用hadoop的DataNode - Everything can be distributed - ITeye技术网站
http://coderplay.iteye.com/blog/290767

hive SQL优化之distribute by和sort by - yyj0531 - 51CTO技术博客
http://yaoyinjie.blog.51cto.com/3189782/703873





NameNode优化笔记 (一) - Everything can be distributed - ITeye技术网站
http://coderplay.iteye.com/blog/868983



Monday, May 20, 2013

Daily Bookmarks 20130520


- Add note that I currently have no intention to rewrite lumberjack in G... · d2f9f74 · jordansissel/lumberjack
https://github.com/jordansissel/lumberjack/commit/d2f9f7438b42c8f8c7916a9f2c6bf8998d2dcb2b
Integrate Tornado in Django
http://geekscrap.com/2010/02/integrate-tornado-in-django/

mapreduce join - Google 搜尋
https://www.google.com.tw/search?q=mapreduce++join&client=firefox-a&rls=org.mozilla:zh-TW:official&source=lnt&tbs=lr:lang_1zh-CN%7Clang_1zh-TW&lr=lang_zh-CN%7Clang_zh-TW&sa=X&ei=cAKaUejfKMuOkgXA0IDQCA&ved=0CBcQpwUoAQ&biw=1159&bih=659
MapReduce中的两表join几种方案简介 - leejun_2005的个人页面 - 开源中国社区
http://my.oschina.net/leejun2005/blog/95186
MapReduce之Join操作(1) - 天光云影 - ITeye技术网站
http://bjyjtdj.iteye.com/blog/1453410
Real World Hadoop - Implementing a Left Outer Join in Map Reduce | Matthew Rathbone
http://blog.matthewrathbone.com/2013/02/09/real-world-hadoop-implementing-a-left-outer-join-in-hadoop-map-reduce.html
Hadoop Python Map-reduce | 疯狂的蚂蚁
http://www.crazyant.net/tag/hadoop-python-map-reduce/
Joins with Map Reduce | Source Open
http://chamibuddhika.wordpress.com/2012/02/26/joins-with-map-reduce/
peicheng / 13udnresys / source / alsox / — Bitbucket
https://bitbucket.org/peicheng/13udnresys/src/addbca54157d/alsox?at=master



策略: Google在数据挖掘中使用Canary Request来试探测Query的破坏性 - sabolasi - ITeye技术网站
http://sabolasi.iteye.com/blog/1246014

反向SSH打洞 , Reverse SSH Tunnel
http://jakson.idv.tw/joomla/index.php?option=com_content&view=article&id=204:ssh--reverse-ssh-tunnel&catid=67:linux-setting-ssh&Itemid=125
insert mysql mapreduce streaming api - Google 搜尋
https://www.google.com.tw/search?q=insert+mysql+mapreduce+streaming+api&ei=RleWUd_PK8aikQW1_oCYCQ&start=10&sa=N&biw=927&bih=537
Integrating MySQL and Hadoop – or – A different approach on using CSV files in MySQL | pero on anything
http://aprilmayjune.org/2010/09/05/integrating-mysql-and-hadoop-or-a-different-approach-on-using-csv-files-in-mysql/
BiggData: MapReduce and Hive by example
http://biggdata.blogspot.tw/2011/04/refreshing-trendingtopics-website-data.html
Hadoop MapReduce操作MySQL - J2EE企业应用 顾问/咨询 Java传教士 -H.E.'s Blog
http://www.javabloger.com/article/mapreduce-mysql.html
mahout recommendation - Google 搜尋
https://www.google.com.tw/search?q=mahout+recommendation&oq=mahout+re&gs_l=serp.3.3.35i39j0l9.7730.8976.0.12584.9.9.0.0.0.0.34.192.9.9.0...0.0...1c.1.14.serp.GQKKrJV1cno

Playing with the Mahout recommendation engine on a Hadoop cluster | Chimpler
http://chimpler.wordpress.com/2013/02/20/playing-with-the-mahout-recommendation-engine-on-a-hadoop-cluster/
Finding association rules with Mahout Frequent Pattern Mining | Chimpler
http://chimpler.wordpress.com/2013/05/02/finding-association-rules-with-mahout-frequent-pattern-mining/
Playing with the Mahout recommendation engine on a Hadoop cluster | Chimpler
http://chimpler.wordpress.com/2013/02/20/playing-with-the-mahout-recommendation-engine-on-a-hadoop-cluster/
Deploying Hadoop on EC2 with Whirr | Chimpler
http://chimpler.wordpress.com/2013/01/20/deploying-hadoop-on-ec2-with-whirr/


Does Mahout need to be installed on the Hadoop's master node? - Stack Overflow
http://stackoverflow.com/questions/11569344/does-mahout-need-to-be-installed-on-the-hadoops-master-node

Friday, May 17, 2013

Daily Bookmarks 20130517

Building Software Systems at Google and Lessons Learned_ifttt
http://ifttt.diandian.com/post/2012-03-07/15695072

hadoop - HBase as web app backend - Stack Overflow
http://stackoverflow.com/questions/13111820/hbase-as-web-app-backend
Big Data Scalable web sites using Hadoop and Hbase | Docner Software | Technical Profile
http://docner.com/en/art/big-data-scalable-web-sites-with-hbase.html

Usage case of HBase for real-time application
http://www.slideshare.net/udanax/usage-case-of-hbase-for-realtime-application

HBase利用bulk load批量导入数据 | OneCoder
http://www.coderli.com/hadoop-hbase-bulkloadMapReduce生成HFile入库到HBase | 石头儿
http://shitouer.cn/2013/02/hbase-hfile-bulk-load/
Best practice - Stream API into a FILE or MySQL or neither? - Google 網上論壇
https://groups.google.com/forum/?fromgroups#!topic/twitter-development-talk/U1yhKoPUAXI
Integrating MySQL and Hadoop – or – A different approach on using CSV files in MySQL | pero on anything
http://aprilmayjune.org/2010/09/05/integrating-mysql-and-hadoop-or-a-different-approach-on-using-csv-files-in-mysql/
MapReduce生成HFile入库到HBase | 石头儿
http://shitouer.cn/2013/02/hbase-hfile-bulk-load/
HBase数据迁移(2)- 使用bulk load 工具从TSV文件中导入数据 - ImportNew
http://www.importnew.com/3645.html
HBase数据迁移(1) - ImportNew
http://www.importnew.com/3226.html




bulk load - Google 搜尋
https://www.google.com.tw/search?q=bulk+load&lr=lang_zh-CN%7Clang_zh-TW&client=firefox-a&hs=wJO&rls=org.mozilla:zh-TW:official&tbs=lr:lang_1zh-CN%7Clang_1zh-TW&ei=9VCWUdP-LsnGkwW9u4HADw&start=50&sa=N&biw=1275&bih=725


What are SUCCESS and part-r-00000 files in hadoop - Stack Overflow
http://stackoverflow.com/questions/10666488/what-are-success-and-part-r-00000-files-in-hadoop
What’s New in Apache Hadoop 0.21 | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2010/08/what%e2%80%99s-new-in-apache-hadoop-0-21/









Thursday, May 16, 2013

Daily Bookmarks 20130516

Get output from scans in hbase shell - Stack Overflow
http://stackoverflow.com/questions/10035475/get-output-from-scans-in-hbase-shell
HBase/Pig/Python Quickstart on OSX
http://chase-seibert.github.io/blog/2013/02/01/getting-starting-with-hbase-and-pig.html
Hive with HBase Quickstart
http://chase-seibert.github.io/blog/2013/05/10/hive-hbase-quickstart.html
HBase Schema Introduction for Programmers
http://chase-seibert.github.io/blog/2013/04/26/hbase-schema-design.html
Using thrift python client with HBase | WhyNosql
http://whynosql.com/using-thrift-python-client-with-hbase/

[MySQL]left, right, inner, outer join 使用方法 | 小惡魔 - 電腦技術 - 工作筆記 - AppleBOY
http://blog.wu-boy.com/2009/01/mysqlleft-right-inner-outer-join-%E4%BD%BF%E7%94%A8%E6%96%B9%E6%B3%95/
用SQL合併資料表
http://www.study-area.org/coobila/tutorial_381.html

Big Data Beyond MapReduce: Google's Big Data Papers | Architects Zone
http://architects.dzone.com/articles/big-data-beyond-mapreduce
How to 'git rm' all deleted files shown by 'git status' | Tyler Frankenstein
http://www.tylerfrankenstein.com/code/how-git-rm-all-deleted-files-shown-git-status
















Daily Bookmarks 20130515

Hadoop实例:二度人脉与好友推荐 - intergret - 博客园
http://www.cnblogs.com/datalab/archive/2013/05/13/3075817.html
用Map/Reduce来做好友推荐 - 竹叶青 的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/azhao_dn/article/details/7642892
hive导出查询结果到本地文件 - 竹叶青 的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/azhao_dn/article/details/6921423
用Map/Reduce来做好友推荐(接上)_Come with me_百度空间
http://hi.baidu.com/ucjohnsonzhu/item/cf1cd337898d22493075a1b8
好友推荐 map reduce - Google 搜尋
https://www.google.com.tw/search?q=%E5%A5%BD%E5%8F%8B%E6%8E%A8%E8%8D%90+map+reduce&ei=9CCTUfSJJ4PjlAWVwYCYBQ&start=10&sa=N&biw=927&bih=537
还是以上次的分析最高气温的Map-Reduce为例 - Google 搜尋
https://www.google.com.tw/search?q=%E8%BF%98%E6%98%AF%E4%BB%A5%E4%B8%8A%E6%AC%A1%E7%9A%84%E5%88%86%E6%9E%90%E6%9C%80%E9%AB%98%E6%B0%94%E6%B8%A9%E7%9A%84Map-Reduce%E4%B8%BA%E4%BE%8B&oq=%E8%BF%98%E6%98%AF%E4%BB%A5%E4%B8%8A%E6%AC%A1%E7%9A%84%E5%88%86%E6%9E%90%E6%9C%80%E9%AB%98%E6%B0%94%E6%B8%A9%E7%9A%84Map-Reduce%E4%B8%BA%E4%BE%8B&gs_l=serp.3...7001.7001.0.7251.1.1.0.0.0.0.41.41.1.1.0...0.0...1c.1.12.serp.SX50GfvE1Ig
用Hadoop管理界面來分析Map-Reduce作業_StackDoc
http://rritw.com/a/fuwuqiruanjian/Apache/20120604/167703.html
用Hadoop管理界面来分析Map-Reduce作业 - 平行线的凝聚 - 51CTO技术博客
http://supercharles888.blog.51cto.com/609344/885536
Hadoop - 平行线的凝聚 - 51CTO技术博客
http://supercharles888.blog.51cto.com/609344/d-23/p-2
如何快速接手一個項目(內部項目或開源項目)_StackDoc
http://rritw.com/a/fuwuqiruanjian/Apache/20120602/167398.html
Hadoop 运行过程深入分析 - 平行线的凝聚 - 51CTO技术博客
http://supercharles888.blog.51cto.com/609344/878422
用Map/Reduce来做好友推荐 | 我自然
http://www.yankay.com/%e7%94%a8mapreduce%e6%9d%a5%e5%81%9a%e5%a5%bd%e5%8f%8b%e6%8e%a8%e8%8d%90/?utm_source=twitterfeed&utm_medium=twitter
[第4周作业帖]用MapReduce实现资源推荐 - Hadoop分布式数据分析平台-炼数成金-Dataguru专业数据分析社区
http://f.dataguru.cn/thread-26378-1-1.html
用Map/Reduce来做好友推荐 | 我自然
http://www.yankay.com/%E7%94%A8mapreduce%E6%9D%A5%E5%81%9A%E5%A5%BD%E5%8F%8B%E6%8E%A8%E8%8D%90/
实现用户推荐和资源二度推荐,没算法,纯体力活 - 开源中国 OSChina.NET
http://www.oschina.net/question/1092_68167?sort=time
xlvector – Recommender System - 基于python实现map-reduce并用来计算co-occurance (一)
http://xlvector.net/blog/?p=874
xlvector – Recommender System - 如果翻墙,可以更好的浏览这个blog
http://xlvector.net/blog/?paged=2
一个计算好友相似度的MapReduce实现(Python版) - GavinIsNobody的个人空间 - 开源中国社区
http://my.oschina.net/u/781842/blog/128979
好友 python map reduce - Google 搜尋
https://www.google.com.tw/search?q=%E5%A5%BD%E5%8F%8B+python+map+reduce&lr=lang_zh-CN%7Clang_zh-TW&tbs=lr:lang_1zh-CN%7Clang_1zh-TW&ei=KwqSUf32NM6fkQXW_YGYDg&start=10&sa=N&biw=927&bih=537
MapReduce与自然语言处理 | 我爱自然语言处理
http://www.52nlp.cn/mapreduce%E4%B8%8E%E8%87%AA%E7%84%B6%E8%AF%AD%E8%A8%80%E5%A4%84%E7%90%86
使用Python写Map-Reduce程序 - 绚丽也尘埃的日志 - 网易博客
http://blog.163.com/ecy_fu/blog/static/4445126201002191329467/
使用Python MrJob的MapReduce实现电影推荐系统 | 搜不狐
http://www.sobuhu.com/archives/567
Yelp/mrjob · GitHub
https://github.com/Yelp/mrjob
用Map/Reduce来做好友推荐 | 我自然
http://www.yankay.com/%E7%94%A8mapreduce%E6%9D%A5%E5%81%9A%E5%A5%BD%E5%8F%8B%E6%8E%A8%E8%8D%90/
用MapReduce 实现Inverted Index - Hadoop分布式数据分析平台-炼数成金-Dataguru专业数据分析社区
http://f.dataguru.cn/thread-77387-1-1.html



Saturday, May 11, 2013

Daily Bookmarks 20130509

Awk Introduction Tutorial – 7 Awk Print Examples
http://www.thegeekstuff.com/2010/01/awk-introduction-tutorial-7-awk-print-examples/

Daily Bookmarks 20130510

hbase表结构设计研究 - tudou@NorthWind - 博客园
http://www.cnblogs.com/ylqmf/archive/2012/05/17/2506410.html

Writing Python like a Jedi | RedKrieg's Blog
http://redkrieg.com/2012/12/11/writing-python-like-a-jedi/




HBase row key design
Schema - OpenTSDB - A Distributed, Scalable Monitoring System
http://opentsdb.net/schema.html
6.3. Rowkey Design
http://hbase.apache.org/book/rowkey.design.html#timeseries
6.11. Schema Design Case Studies
http://hbase.apache.org/book/schema.casestudies.html
6.3. Rowkey Design
http://hbase.apache.org/book/rowkey.design.html
v基于HBASE的并行计算架构之rowkey设计篇-南京云创存储
http://www.cstor.cn/textdetail_3239.html
Java操作Hbase进行建表、删表以及对数据进行增删改查,条件查询 - JavaCrazyer的技术博客 - ITeye技术网站
http://javacrazyer.iteye.com/blog/1186881












Wednesday, May 01, 2013

Dairy Bookmark 20130501


Hive实例:CSDN十大常用密码 | Intergret
http://www.datalab.sinaapp.com/?p=101

mahout使用KMeans算法 - Cody的笔记本 - 博客频道 - CSDN.NET
mahout基于hadoop的CF代码分析 - Cody的笔记本 - 博客频道 - CSDN.NET
http://blog.csdn.net/inte_sleeper/article/details/7650283#comments
EasyHadoop技术大学 培训视频_免费高速下载|百度云 网盘-分享无限制
http://pan.baidu.com/share/link?shareid=492484&uk=1124363056#dir/path=%2FEasyHadoop%E6%8A%80%E6%9C%AF%E5%A4%A7%E5%AD%A6%20%E5%9F%B9%E8%AE%AD%E8%A7%86%E9%A2%91
easyhadoop/EasyHadoopCentral/expect.py at master · xianglei/easyhadoop · GitHub
https://github.com/xianglei/easyhadoop/blob/master/EasyHadoopCentral/expect.py