Sunday, March 31, 2013

Daily Bookmarks 20130331

[#HDFS-4576] Webhdfs authentication issues - ASF JIRA
https://issues.apache.org/jira/browse/HDFS-4576
Randomly Distributed: Running Hadoop 1.0 on Distributed Infrastructures using SAGA
http://randomlydistributed.blogspot.tw/2011/01/running-hadoop-10-on-distributed.html
Randomly Distributed: WebHDFS-Py - A simple, lean HDFS Python Library
http://randomlydistributed.blogspot.tw/2011/12/webhdfs-py-simple-lean-hdfs-python.html


Installing hadoop development cluster on Windows and Eclipse -- Format the namenode
http://v-lad.org/Tutorials/Hadoop/12%20-%20format%20the%20namendoe.html
Configuring Eclipse for Apache Hadoop Development (a screencast) | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2009/04/configuring-eclipse-for-hadoop-development-a-screencast/

Hadoop Illuminated :: Hadoop Illuminated
http://hadoopilluminated.com/hadoop_book/index.html
Wu Peng Ta's BLOG: 窮苦人家用的Linux KVM 建置Hadoop完全分散環境
http://wupengta.blogspot.tw/2012/08/linux-kvm-hadoop.html






Friday, March 29, 2013

Dairy Bookmarks 20130328

JAX-RS入门 四: 注入 - 刘刚的空间 - ITeye技术网站
http://liugang594.iteye.com/blog/1496651
JAX-RS 从傻逼到牛叉 3:路径匹配 - 神奇好望角 The Magical Cape of Good Hope - BlogJava
http://www.blogjava.net/shinzey/archive/2011/10/09/360199.html
JAX-RS 从傻逼到牛叉 1:REST 基础知识 - 神奇好望角 The Magical Cape of Good Hope - BlogJava
http://www.blogjava.net/shinzey/archive/2011/09/16/358799.html
JAX-RS 从傻逼到牛叉 2:开发一个简单的服务 - 神奇好望角 The Magical Cape of Good Hope - BlogJava
http://www.blogjava.net/shinzey/archive/2011/09/20/359085.html
Chapter 4. @PathParam
http://docs.jboss.org/resteasy/docs/1.0.0.GA/userguide/html/_PathParam.html

JIRA workflow - Clojure Design - Clojure Development
http://dev.clojure.org/display/design/JIRA+workflow
一个简单的例子说明java中spring框架的依赖注入-gaobaolu-中国教育人博客
http://gaobaolu.blog.edu.cn/2011/625469.html











Wednesday, March 27, 2013

Dairy Bookmarks 20130327


JAX-RS @PathParam example
http://www.mkyong.com/webservices/jax-rs/jax-rs-pathparam-example/
Chapter 4. @PathParam
http://docs.jboss.org/resteasy/docs/1.0.0.GA/userguide/html/_PathParam.html
Injector | Guice
http://google-guice.googlecode.com/git/javadoc/com/google/inject/Injector.html


A simple way to create git repository on a server machine connecting via ssh | Ralf Wehner's Blog
http://rwehner.wordpress.com/2010/03/01/a-simple-way-to-create-git-repository-on-a-server-machine-connecting-via-ssh/
建立了一个私有的Git源 | I'm TualatriX
http://imtx.me/archives/1113.html

git clone --mirror vs. git clone --bare - Jason Meridth
http://blog.jasonmeridth.com/2012/03/30/git-clone-mirror-vs-git-clone-bare.html




Git学习笔记(十一) Git克隆 - 时间更替掉季节 - 博客频道 - CSDN.NET
http://blog.csdn.net/agul_/article/details/7843678

GIT: How do I update my bare repo? - Stack Overflow
http://stackoverflow.com/questions/3382679/git-how-do-i-update-my-bare-repo


java | Ralf Wehner's Blog
http://rwehner.wordpress.com/category/java/

究竟是什么让Redshift比Hive快10倍?!-CSDN.NET
http://www.csdn.net/article/2013-03-26/2814648-redshift-is-10x-faster-than-hive

Foursquare:使用MongoDB Replica Sets的三种架构 - NoSQLFan - 关注NoSQL相关技术、新闻
http://blog.nosqlfan.com/html/1750.html
MongoDB创始人Eliot Horowitz分析FourSquare宕机原因
http://www.infoq.com/cn/news/2010/10/eliot-analyze-outage-of-4sq
Foursquare 长达 11 小时的宕机
http://dbanotes.net/arch/foursquare_outage.html

技术人如何才不至于虚度一生?
http://dbanotes.net/jobs/tech_guys_life.html
















Dairy Bookmarks 20130326


Fluentd + HDFS: Instant Big Data Collection | Fluentd
http://docs.fluentd.org/articles/http-to-hdfs
Fluentd + Hadoop: Instant Big Data Collection | Treasure Data Blog
http://blog.treasure-data.com/post/35644151211/fluentd-hadoop-instant-big-data-collection
HTTPS cloning errors · github:help
https://help.github.com/articles/https-cloning-errors
Git错误non-fast-forward后的冲突解决 - chain - 努力がゆえに淋しく、孤独がゆえに強くなる - 博客频道 - CSDN.NET
http://blog.csdn.net/chain2012/article/details/7476493

TextMate Manual » Calling TextMate from Other Applications
http://manual.macromates.com/en/using_textmate_from_terminal.html
Python | 三十岁
http://blog.30c.org/category/develop/python











Monday, March 25, 2013

Dairy Bookmarks 20130325

guice源代码分析 - warm up - 编程思索 | Thoughts of Coding
http://tocspblog.appspot.com/?p=51001
guice源代码分析(一)injector.getInstance - 编程思索 | Thoughts of Coding
http://tocspblog.appspot.com/?p=54001

Contributing to Apache CloudStack as a Non-Committer
http://cloudstack.apache.org/develop/non-contributors.html

Two-Stage Guice Provider
http://jeantessier.com/SoftwareEngineering/TwoStageGuiceProvider.html
Dependency Injection - What is Scope?
http://www.javaranch.com/journal/2008/10/dependency-injection-what-is-scope.html
E-Office學園 • 檢視主題 - [分享] 什麼是 Inversion Of Control?
http://eoffice.im.fju.edu.tw/phpbb/viewtopic.php?t=6307







Wednesday, March 20, 2013

Dairy Bookmarks 20130320

 Maven中代理服务器的设定 - 看懂容易,学会难 - 博客频道 - CSDN.NET
http://blog.csdn.net/shrekmu/article/details/1568059
How to install nodejs on centos 6.3
http://dev-tricks.com/tutorialhow-to-install-nodejs-on-centos-6-3/
在 CentOS 上安裝 Node.JS 環境(Virtual Machine)
http://jacksctsai.blogspot.tw/2011/11/ms-virtual-server-run-nodejs-virtual.html
安装 JDK 和设置 JAVA_HOME (使用 Java CAPS 6 安装 GUI)
http://docs.oracle.com/cd/E19509-01/820-5483/inst_jdk_javahome_t/index.html
How to install Maven on CentOS - Linux FAQ
http://xmodulo.com/2012/05/how-to-install-maven-on-centos.html
oop - Python - why use "self" in a class? - Stack Overflow
http://stackoverflow.com/questions/475871/python-why-use-self-in-a-class
定義類別
http://caterpillar.onlyfun.net/Gossip/Python/Class.html
Python为什么要self - 征服Python
http://sjolzy.cn/Why-should-self-Python.html
Hive – Group By 的实现
http://fatkun.com/2013/01/hive-group-by.html
Hive – Distinct 的实现
http://fatkun.com/2013/01/hive-distinct.html
Hive内容包含有 会当成分割符
http://fatkun.com/2012/10/hive-seprator.html

Tuesday, March 19, 2013

Dairy Bookmarks 20130319


HBase适合做BI分析的数据源吗? demo
http://www.verydemo.com/demo_c152_i9888.html
HBase适合做BI分析的数据源吗?_服务器应用_Linux公社-Linux系统门户网站
http://www.linuxidc.com/Linux/2013-01/77676.htm
HBase适合做BI分析的数据源吗? - xhanfriend的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/xhanfriend/article/details/8494807




Hadoop HBase user's mailing list ()
http://comments.gmane.org/gmane.comp.java.hadoop.hbase.user/27875
福布斯:Hadoop——你不得不了解的大数据工具-阿里研究-小企业,大时代
http://www.aliresearch.com/?m-cms-q-view-id-70733.html

HBase性能优化2—使用Coprocessor进行RowCount统计 | Binospace
http://www.binospace.com/index.php/make-your-hbase-better-2/
Hadoop HBase user's mailing list ()
http://comments.gmane.org/gmane.comp.java.hadoop.hbase.user/27875


不简单的URL去重 - 智障大师 的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/historyasamirror/article/details/6746217
MongoDB让人失望 - 智障大师 的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/historyasamirror/article/details/6827769
我为什么要选择RabbitMQ
http://www.slideshare.net/mryufeng/rabbitmq-13949360
ftofficer|张聪的blog » [翻译] [RabbitMQ+Python入门经典] 兔子和兔子窝
http://blog.ftofficer.com/2010/03/translation-rabbitmq-python-rabbits-and-warrens/


Twilio Conference 2011: Jeff Lindsay, Distributed Systems with Gevent and ZeroMQ on Vimeo
https://vimeo.com/31167454
Homepage | Qubole
http://www.qubole.com/









Monday, March 11, 2013

Dairy Bookmarks 20130311


facebook permanent/longterm access key for python project - Stack Overflow
http://stackoverflow.com/questions/10705922/facebook-permanent-longterm-access-key-for-python-project

fbconsole/src/fbconsole.py at master · facebook/fbconsole · GitHub
https://github.com/facebook/fbconsole/blob/master/src/fbconsole.py

Hive for Beginners | Orzota Blog
http://orzota.com/blog/hive-for-beginners/

Login for Server-side Apps - Facebook 開發人員
http://developers.facebook.com/docs/howtos/login/server-side-login/

DataScientist » Impala/Hive现状分析与前景展望
http://yanbohappy.sinaapp.com/?p=220

eBay使用Hadoop和HBase成功构建下一代搜索
http://www.infoq.com/cn/news/2011/11/eBay_new_search?utm_source=infoq&utm_medium=related_content_link&utm_campaign=relatedContent_news_clk

数据处理技术变迁.pptx_微盘下载
http://vdisk.weibo.com/s/cyjvi/1347776019








http://vdisk.weibo.com/s/cyjvi/1347776019

Thursday, March 07, 2013

Dairy Bookmarks 20130307

舒の随想日记 » Hive Tips
http://blog.hesey.net/2012/04/hive-tips.html
Hive 随谈(二)– Hive 结构 - 阿里集团数据平台 alidata.org
http://www.alidata.org/archives/499



Monday, March 04, 2013

Diary Bookmarks 20130228


Getting Started with Hive | Facility9
http://facility9.com/2010/12/getting-started-with-hive/

Hadoop管理员的十个最佳实践(转) - ggjucheng - 博客园
http://www.cnblogs.com/ggjucheng/archive/2013/01/20/2868906.html
RECOVER PARTITIONS hive - Google 搜尋
https://www.google.com.tw/search?hl=zh-TW&q=RECOVER+PARTITIONS+hive&oq=RECOVER+PARTITIONS+hive&gs_l=serp.3..0i19j0i5i30i19j0i8i30i19l3.931.1817.0.1987.5.5.0.0.0.0.37.160.5.5.0.ernk_timediscountc..0.0...1.1.4.serp.T-xnAbYylaU
Getting Started with Hive | Facility9
http://facility9.com/2010/12/getting-started-with-hive/
sql - Find TOP 10 latest record for each BUYER_ID for yesterday's date - Stack Overflow
http://stackoverflow.com/questions/11405446/find-top-10-latest-record-for-each-buyer-id-for-yesterdays-date





hive - Search Amazon Web Services
http://aws.amazon.com/search?searchQuery=hive&searchPath=articles&x=-747&y=-95
RECOVER PARTITIONS hive - Google 搜尋
https://www.google.com.tw/search?hl=zh-TW&q=RECOVER+PARTITIONS+hive&oq=RECOVER+PARTITIONS+hive&gs_l=serp.3..0i19j0i5i30i19j0i8i30i19l3.931.1817.0.1987.5.5.0.0.0.0.37.160.5.5.0.ernk_timediscountc..0.0...1.1.4.serp.T-xnAbYylaU
LanguageManual DDL
https://cwiki.apache.org/Hive/languagemanual-ddl.html#LanguageManualDDL-Recoverpartitions
(6) Hive (computing): What are partitions in Hive? What does it mean to recover partitions? - Quora
http://www.quora.com/Hive-computing/What-are-partitions-in-Hive-What-does-it-mean-to-recover-partitions
Additional Features of Hive in Amazon EMR - Amazon Elastic MapReduce
http://docs.aws.amazon.com/ElasticMapReduce/latest/DeveloperGuide/emr-hive-additional-features.html#emr-hive-recovering-partitions
question about Hive 'recover partitions' on AWS S3
http://mail-archives.apache.org/mod_mbox/hive-user/201204.mbox/%3C556325346CA26341B6F0530E07F90D96016C64CD9500@GBGH-EXCH-CMS.sig.ads%3E
hadoop - Hive Table add partition to load all subdirectories - Stack Overflow
http://stackoverflow.com/questions/10996985/hive-table-add-partition-to-load-all-subdirectories
Using Hive on Amazon Elastic MapReduce with Karmasphere Analytics : Articles & Tutorials : Amazon Web Services
http://aws.amazon.com/articles/9475657313758110
hive - Search Amazon Web Services
http://aws.amazon.com/search?searchQuery=hive&searchPath=articles&x=-747&y=-95
Running Hive on Amazon Elastic MapReduce : Articles & Tutorials : Amazon Web Services
http://aws.amazon.com/articles/2857?_encoding=UTF8&jiveRedirect=1
hive tutorial - Google 搜尋
https://www.google.com.tw/search?hl=zh-TW&q=hive+tutorial&oq=Hive+totu&gs_l=serp.3.0.0i10i30.2638153.2643813.0.2645005.9.7.2.0.0.0.51.266.7.7.0.ernk_timediscountc..0.0...1.1.5.serp.cB_-glVv5-s
Getting Started with Hive | Facility9
http://facility9.com/2010/12/getting-started-with-hive/
All ChicagoRuby’s Videos on Vimeo
http://vimeo.com/chicagoruby/videos/all/sort:date
Hadoop Data Warehousing with Hive - Google 搜尋
https://www.google.com.tw/search?q=Hadoop+Data+Warehousing+with+Hive&hl=zh-TW&ei=odYoUZyPJ8HFlAWwqYDACw&start=10&sa=N&biw=794&bih=460
Introduction to Apache Hive
http://www.slideshare.net/tapankavasthi/introduction-to-apache-hive
Getting Started with Hive - Latest Documentation (version 2.x) - www.mapr.com
http://www.mapr.com/doc/display/MapR/Getting+Started+with+Hive
hive top n - Google 搜尋
https://www.google.com.tw/search?q=hive+top+n&hl=zh-TW&ei=4eooUZqxIsX6kAWpt4GYAw&start=10&sa=N&biw=794&bih=460
Extract Top N Records in Each Group in Hadoop/hive | Musings
http://www.findnwrite.com/musings/extract-top-n-records-in-each-group-in-hadoophive/
user defined functions - Hive getting top n records in group by query - Stack Overflow
http://stackoverflow.com/questions/9390698/hive-getting-top-n-records-in-group-by-query
hive中分组取前N个值的实现 - 逆域录 - ITeye技术网站
http://baiyunl.iteye.com/blog/1466343
Hive中SELECT TOP N的方法(order by与sort by)_刘健男_新浪博客
http://blog.sina.com.cn/s/blog_6ff05a2c0101eaxf.html
Massive data processing with Hive: US flight history analysis | Datasalt
http://www.datasalt.com/2011/05/massive-data-processing-with-hive-us-flight-history-analysis/
Facebook
http://www.facebook.com/home.php
hive top n - Google 搜尋
https://www.google.com.tw/search?q=hive+top+n&hl=zh-TW&ei=HcstUYPjDsmNmQWfvYCADA&start=20&sa=N&biw=794&bih=460
sql - Find TOP 10 latest record for each BUYER_ID for yesterday's date - Stack Overflow
http://stackoverflow.com/questions/11405446/find-top-10-latest-record-for-each-buyer-id-for-yesterdays-date
hive中分组取前N个值的实现 - ggjucheng - 博客园
http://www.cnblogs.com/ggjucheng/archive/2013/01/30/2868993.html
Hadoop管理员的十个最佳实践(转) - ggjucheng - 博客园
http://www.cnblogs.com/ggjucheng/archive/2013/01/20/2868906.html
改善 - ITeye技术网站
http://heipark.iteye.com/
Sentiment Analysis using Apache Hive « Xebia Blog
http://blog.xebia.com/2012/05/15/sentiment-analysis-using-apache-hive/
Alan Liu - 博客频道 - CSDN.NET
http://blog.csdn.net/liuzhoulong
百度技术沙龙:如何设计优良的日志分析系统 - Alan Liu - 博客频道 - CSDN.NET
http://blog.csdn.net/liuzhoulong/article/details/6991677
“结巴”分词:做最好的Python分词组件 - Alan Liu - 博客频道 - CSDN.NET
http://blog.csdn.net/liuzhoulong/article/details/8051676
hadoop和hive的实践应用(三)——hive的基本应用 - Alan Liu - 博客频道 - CSDN.NET
http://blog.csdn.net/liuzhoulong/article/details/6447075
hadoop和hive的实践应用(二)——基于Hadoop的数据仓库工具hive搭建 - Alan Liu - 博客频道 - CSDN.NET
http://blog.csdn.net/liuzhoulong/article/details/6441914








Dairy Bookmarks 20130304

轻松学习Spring IoC容器和Dependency Injection模式 - JAVA涂鸦 - BlogJava
http://www.blogjava.net/rickhunter/articles/29015.html
簡介JavaBean - JAVA EE - JavaWorld@TW
http://www.javaworld.com.tw/confluence/pages/viewpage.action?pageId=978
1.簡介 JavaBean - 國立中山大學程式諮詢網
https://sites.google.com/a/mis.nsysu.edu.tw/cheng-shi-zi-xun-wang/java-se-ji-chu-pian/7-shi-yongjavabean-yuan-jian/1-jian-jie-javabean

[教學]apache lucene-建立自己的Search Engine 6-mysql資料庫資料做index @ 聰明的生活 :: 痞客邦 PIXNET ::
http://catyku.pixnet.net/blog/post/22844946
[教學]把第一次給Eclipse @ 聰明的生活 :: 痞客邦 PIXNET ::
http://catyku.pixnet.net/blog/post/15421722
[教學]apache lucene-建立自己的Search Engine 3-刪除已建立索引資料 @ 聰明的生活 :: 痞客邦 PIXNET ::
http://catyku.pixnet.net/blog/post/22417532-%5B%E6%95%99%E5%AD%B8%5Dapache-lucene-%E5%BB%BA%E7%AB%8B%E8%87%AA%E5%B7%B1%E7%9A%84search-engine-3-%E5%88%AA%E9%99%A4

Lucene 4.0 的重大升级内容一览
http://www.starming.com/index.php?action=plugin&v=wave&tpl=union&ac=viewgrouppost&gid=33263&tid=15686&pg=3