Friday, November 30, 2012

Daily Bookmarks 20121130

sub-reality.org - sub quality blogging
http://sub-reality.org/2012/09/howto-setup-logstash-and-kibana-for-nginx-on-debian-squeeze/
Logstash Init script — Gist
https://gist.github.com/3623477
logstash-rpms/SOURCES at master · slojo404/logstash-rpms · GitHub
https://github.com/slojo404/logstash-rpms/tree/master/SOURCES

HIVE: Data Warehousing & Analytics on Hadoop
http://www.slideshare.net/zshao/hive-data-warehousing-analytics-on-hadoop-presentation#btnNext

Log rotation in CentOS Linux
http://abdussamad.com/archives/541-Log-rotation-in-CentOS-Linux.html
Apache Hadoop Log Files: Where to find them in CDH, and what info they contain | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2009/09/apache-hadoop-log-files-where-to-find-them-in-cdh-and-what-info-they-contain/
Migrating to CDH | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2010/08/migrating-to-cdh3/


[#SERENGETI-279] hadoop log disk full on master node after 28 jobs - SpringSource Issue Tracker gen terasort
https://issuetracker.springsource.com/browse/SERENGETI-279

log level
不重启集群调整Namenode、jobtracker等服务LOG level - 改善 - ITeye技术网站
http://heipark.iteye.com/blog/1333472

黃小瀞的學習筆記: [XenServer] 延長免費版 XenServer 的使用期限
http://fayesnote.blogspot.tw/2011/03/xenserver-xenserver.html

Thursday, November 29, 2012

Daily Bookmarks 20121129

mahout-cf
http://www.slideshare.net/sscdotopen/mahoutcf#btnNext
Algorithms - Apache Mahout - Apache Software Foundation (current status of the implemented algorithms)
https://cwiki.apache.org/confluence/display/MAHOUT/Algorithms
Itembased Collaborative Filtering - Apache Mahout - Apache Software Foundation
https://cwiki.apache.org/confluence/display/MAHOUT/Itembased+Collaborative+Filtering
Collaborative Filtering with ALS-WR - Apache Mahout - Apache Software Foundation
https://cwiki.apache.org/confluence/display/MAHOUT/Collaborative+Filtering+with+ALS-WR
Recommender Documentation - Apache Mahout - Apache Software Foundation
https://cwiki.apache.org/confluence/display/MAHOUT/Recommender+Documentation

Kick Start Hadoop: Evaluating Mahout based Recommender Implementations
http://kickstarthadoop.blogspot.tw/2011/05/evaluating-mahout-based-recommender.html

Puppet must-read
Puppet Tutorial for Linux: Powering up with Puppet | Bitfield Consulting
http://bitfieldconsulting.com/puppet-tutorial

logstash puppet
logstash « I'm Colin
http://imcol.in/tag/logstash/

chrome.tabs - Google Chrome
https://developer.chrome.com/extensions/tabs.html#type-Tab



See the links below bundled up as one link at:
http://tab.bz/1i8e

Running Hadoop On Ubuntu Linux (Multi-Node Cluster) @ Michael G. Noll
http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-multi-node-cluster/
Running Hadoop On Ubuntu Linux (Single-Node Cluster) @ Michael G. Noll
http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/#download-example-input-data
hadoop-examples-0.20.2-cdh3u5.jar wordcount - Google 搜尋
https://www.google.com.tw/search?q=hadoop-examples-0.20.2-cdh3u5.jar+wordcount&hl=zh-TW&tbo=d&ei=AxCtUPr2CqOemQX3_YHADA&start=10&sa=N&biw=927&bih=537
How to test and run a map reduce job on CDH ? - Grokbase
http://grokbase.com/p/cloudera/cdh-user/12afq10j2g/how-to-test-and-run-a-map-reduce-job-on-cdh
How to test and run a map reduce job on CDH ? - Grokbase
http://grokbase.com/p/cloudera/cdh-user/12acf18jdk/how-to-test-and-run-a-map-reduce-job-on-cdh
Hadoop map task list for job_201211211712_0001 on hc1
http://hc1:50030/jobtasks.jsp?jobid=job_201211211712_0001&type=map&pagenum=1
HDFS:/user/hdfs/gutenberg-output
http://hct5.:50075/browseDirectory.jsp?dir=/user/hdfs/gutenberg-output&namenodeInfoPort=50070
hc1 Hadoop Machine List
http://hc1:50030/machines.jsp?type=active

Here comes Logstash for Ubuntu Cloud - Jorge's Stompbox
http://www.jorgecastro.org/2012/11/06/here-comes-logstash-for-ubuntu-cloud
More Ubuntu Juju - Logstash charms | Tech - PaulCz.NET
http://tech.paulcz.net/2012/11/more-ubuntu-juju-logstash-charms.html
A [Very] Beginner's guide to Canonical's Juju | Tech - PaulCz.NET
http://tech.paulcz.net/2012/10/a-very-beginners-guide-to-canonicals.html


Learning — Variables, Conditionals, and Facts — Documentation — Puppet Labs
http://docs.puppetlabs.com/learning/variables.html
Zero to puppet in one day - Finninday
http://finninday.net/wiki/index.php/Zero_to_puppet_in_one_day
Puppet - Gentoo Wiki
http://wiki.gentoo.org/wiki/Puppet
Basic Puppet Setup and Configuration – Linode Library
http://library.linode.com/application-stacks/puppet/installation#sph_configuring-puppet
Puppet安装与配置教程 | 南北漂
http://www.iforeach.com/archives/449.html
百分点科技 - 个性化推荐引擎 数据分析引擎
http://www.baifendian.com/
Dependency graphs in Puppet | Bitfield Consulting
http://bitfieldconsulting.com/puppet-dependency-graphs

Discussion of the Puppet configuration management framework
http://comments.gmane.org/gmane.comp.sysutils.puppet.user/28445
puppet测试例子一枚_天下文章一大抄_百度空间
http://hi.baidu.com/newdreamllc/item/307ac30b3fd709066c904891
centos puppet - Google 搜尋
https://www.google.com.tw/search?q=centos+puppet&aq=f&oq=centos+puppet&sugexp=chrome,mod=13&sourceid=chrome&ie=UTF-8
CentOS 6 Puppet Install • [ How2CentOS ]
http://www.how2centos.com/centos-6-puppet-install/
在 CentOS 6.2 上安装 Puppet 配置管理工具 | vpsee.com
http://www.vpsee.com/2012/03/install-puppet-on-centos-6-2/
hmc puppet server - Google 搜尋
https://www.google.com.tw/search?q=hmc+puppet+server&hl=zh-TW&prmd=imvns&ei=y-OoUKTvN8zxmAXFv4GgAg&start=10&sa=N&biw=928&bih=537
Hmc installation .
http://webcache.googleusercontent.com/search?q=cache:b_xpPdqZiA8J:www.slideshare.net/chebrian/hmc-installation+&cd=12&hl=zh-TW&ct=clnk&gl=tw
HMC:Puppet agent ping connect refused - saying的日志 - 网易博客
http://csshixian.blog.163.com/blog/static/182308522201292471714832/

Wednesday, November 28, 2012

Daily Bookmarks 20121128


Kick Start Hadoop: Mahout Recommendations in Distributed mode with Hadoop Map Reduce
http://kickstarthadoop.blogspot.tw/2011/05/mahout-recommendations-in-distributed.html
Kick Start Hadoop: Mahout Recommendations with Data Sets containing Alpha Numeric Item Ids
http://kickstarthadoop.blogspot.tw/2011/05/mahout-recommendations-with-data-sets.html
Kick Start Hadoop: Evaluating Mahout based Recommender Implementations
http://kickstarthadoop.blogspot.tw/2011/05/evaluating-mahout-based-recommender.html

Tuesday, November 27, 2012

Daily Bookmarks 20121127

hadoop常见错误及解决办法! - 心如大海 - ITeye技术网站
http://p-x1984.iteye.com/blog/989577


This is puppet部署经验 - ITNosh .
http://www.itnosh.com/2012/02/puppet_deploy.html

Deploying Logstash with Puppet « « I'm Colin
http://imcol.in/2012/05/deploying-logstash-with-puppet/
How to set up Semantic Logging: part one with Logstash, Kibana, ElasticSearch and Puppet, – Jayway
http://www.jayway.com/2012/07/27/how-to-set-up-semantic-logging-part-one-with-logstash-kibana-elasticsearch-and-puppet/
haf/puppet-elasticsearch · GitHub
https://github.com/haf/puppet-elasticsearch
ubuntu 12.04 puppet部署Openstack » 陈沙克日志
http://www.chenshake.com/ubuntu-12-04-puppet-deployment-openstack/

Monday, November 26, 2012

Daily Bookmarks 20121126

Slides about Elasticsearch
Scaling massive elastic search clusters - Rafał Kuć - Sematext
http://www.slideshare.net/kucrafal/scaling-massive-elastic-search-clusters-rafa-ku-sematext#btnNext
Battle of the giants: Apache Solr vs ElasticSearch
http://www.slideshare.net/kucrafal/battle-of-the-giants-apache-solr-vs-elasticsearch#btnNext
ElasticSearch at berlinbuzzwords 2010
http://www.slideshare.net/elasticsearch/elasticsearch-at-berlinbuzzwords-2010#btnNext

Using elasticsearch to Speed Up Filtering – Control+R
http://jontai.me/blog/2012/10/using-elasticsearch-to-speed-up-filtering/

elasticsearch - advanced features in practice percolator
http://www.slideshare.net/jsuchal/elasticsearch-advanced-features-in-practice#btnNext

Geeking with Greg: Jeff Dean keynote at WSDM 2009
http://glinden.blogspot.tw/2009/02/jeff-dean-keynote-at-wsdm-2009.html
elasticsearch - - Open Source, Distributed, RESTful, Search Engine , ElasticSearch 官方站点中文版(开源、分布式、RESTful的搜索引擎)
http://es-cn.medcl.net/
Elasticsearch Storage Optimization · logstash/logstash Wiki
https://github.com/logstash/logstash/wiki/Elasticsearch-Storage-Optimization

elasticsearch hash shard - Google 搜尋
https://www.google.com.tw/search?q=elasticsearch+hash+shard&hl=zh-TW&tbo=d&ei=7kKzULOkF8rFmQXGh4DgBw&start=10&sa=N&biw=927&bih=508
ElasticSearch 原理笔记_digiter
http://digiter.diandian.com/post/2012-11-07/40042796630
[elasticsearch] offline indexing and expected scaling performance - Grokbase
http://grokbase.com/t/gg/elasticsearch/12adcr7h52/offline-indexing-and-expected-scaling-performance
elasticsearch - concepts - Partitioning
http://es-cn.medcl.net/guide/concepts/scaling-lucene/partitioning/

Friday, November 23, 2012

Daily Bookmarks 20121122


ElasticSearch vs Solr - ElasticSearch Tutorial.com
http://www.elasticsearchtutorial.com/elasticsearch-vs-solr.html
Scaling massive elastic search clusters - Rafał Kuć - Sematext
http://www.slideshare.net/kucrafal/scaling-massive-elastic-search-clusters-rafa-ku-sematext
Battle of the giants: Apache Solr vs ElasticSearch
http://www.slideshare.net/kucrafal/battle-of-the-giants-apache-solr-vs-elasticsearch#btnNext

Importing source code into Eclipse · OneBusAway/onebusaway Wiki
https://github.com/OneBusAway/onebusaway/wiki/Importing-source-code-into-Eclipse
How to setup Eclipse IDE - OpenbravoWiki
http://wiki.openbravo.com/wiki/How_to_setup_Eclipse_IDE#Import_into_Eclipse_IDE
Setting up Eclipse
http://www.javahotchocolate.com/tutorials/setup-eclipse.html
Developing Your Java Project with Eclipse
http://www.javahotchocolate.com/tutorials/use-eclipse.html


Building Single Page Web Apps With Sinatra: Part 2 | Nettuts+
http://net.tutsplus.com/tutorials/javascript-ajax/building-single-page-web-apps-with-sinatra-part-2/

Make Yahoo! Web Service REST calls with Python - Yahoo! Developer Network
http://developer.yahoo.com/python/python-rest.html#post
Learn REST: A Tutorial
http://rest.elkstein.org/

elasticsearch/elasticsearch-hadoop · GitHub
https://github.com/elasticsearch/elasticsearch-hadoop
ElasticSearch Users - Why is index not written to hdfs? 10 sec snapshot to HDFS
http://elasticsearch-users.115913.n3.nabble.com/Why-is-index-not-written-to-hdfs-td3986947.html

Hadoop 研究: Write a file to HDFS
http://kuanyuhadoop.blogspot.tw/2011/04/write-file-to-hdfs.html
Hadoop 研究: Read a file in HDFS
http://kuanyuhadoop.blogspot.tw/2011/04/read-file-in-hdfs.html
How to Read file from Hadoop using Java without command line - Stack Overflow
http://stackoverflow.com/questions/9565599/how-to-read-file-from-hadoop-using-java-without-command-line
Hadoop Tutorial - YDN
http://developer.yahoo.com/hadoop/tutorial/module2.html
Hadoop Tutorial - YDN Mapreduce
http://developer.yahoo.com/hadoop/tutorial/module4.html

read file hadoop - Google 搜尋
https://www.google.com.tw/search?q=read+file+hadoop&aq=f&oq=read+file+hadoop&sugexp=chrome,mod=13&sourceid=chrome&ie=UTF-8

Facebook重做MapReduce,Corona比YARN更胜一筹?-CSDN.NET
http://www.csdn.net/article/2012-11-13/2811845

puppet测试例子一枚_天下文章一大抄_百度空间
http://hi.baidu.com/newdreamllc/item/307ac30b3fd709066c904891

Cloudera Hadoop RHEL/CentOS 6 Install Guide - Dakini's Bliss
http://dak1n1.com/blog/9-hadoop-el6-install
HBase核心贡献者Ted Yu:参与开源比收入更重要-CSDN.NET
http://www.csdn.net/article/2012-11-12/2811819-HBase_core_contributor_TedYu

Wednesday, November 21, 2012

Sunday, November 18, 2012

Daily Bookmarks 20121118


Puppet安装与配置教程 | 南北漂
http://www.iforeach.com/archives/449.html
An Introduction to Puppet
http://www.harker.com/puppet/BayLISA100715.html

Install Puppet client and connect to your Puppet master - What's already forgotten. | What's already forgotten.
http://www.drivard.com/2011/12/05/install-puppet-client-and-connect-to-your-puppet-master/
Zero to puppet in one day - Finninday
http://finninday.net/wiki/index.php/Zero_to_puppet_in_one_day
puppet - puppetca never returns anything - Server Fault
http://serverfault.com/questions/235513/puppetca-never-returns-anything
Howto install puppet Master and client in ubuntu | Unixmen
http://www.unixmen.com/install-puppet-master-and-client-in-ubuntu/
張旭: Puppet
http://zx-1986.blogspot.tw/2010/11/puppet.html

Thursday, November 15, 2012

Daily Bookmarks 20121115

From Solr to elasticsearch | Government Digital Service
http://digital.cabinetoffice.gov.uk/2012/08/03/from-solr-to-elasticsearch/
elasticsearch - blog - Here Comes The Cloud
http://www.elasticsearch.org/blog/2010/05/11/here-comes-the-cloud.html
Why Jetwick moved from Solr to ElasticSearch « Karussell
http://karussell.wordpress.com/2011/02/07/why-jetwick-moved-from-solr-to-elasticsearch/
Realtime Search: Solr vs Elasticsearch
http://blog.socialcast.com/realtime-search-solr-vs-elasticsearch/
foursquare now uses Elastic Search (and on a related note: Slashem also works with Elastic Search)! | Foursquare Engineering Blog
http://engineering.foursquare.com/2012/08/09/foursquare-now-uses-elastic-search-and-on-a-related-note-slashem-also-works-with-elastic-search/
Solr vs. ElasticSearch: Part 1 – Overview « Sematext Blog
http://blog.sematext.com/2012/08/23/solr-vs-elasticsearch-part-1-overview/

Wednesday, November 14, 2012

Daily Bookmarks 20121114


试用logstash进行分布式事件收集 | walk walk koven
http://walkoven.com/?p=116
Disqus Comments
http://disqus.com/embed/comments/?f=tagged&t_i=1121%20http%3A%2F%2Fblog.tagged.com%2F%3Fp%3D1121&t_u=http%3A%2F%2Fblog.tagged.com%2F2012%2F05%2Fgrabbing-full-java-stack-traces-from-syslog-ng-with-logstash&t_t=Grabbing%20Full%20Java%20Stack%20Traces%20from%20Syslog-ng%20with%20Logstash&s_o=popular#1

Grabbing Full Java Stack Traces from Syslog-ng with Logstash – Tagged
http://blog.tagged.com/2012/05/grabbing-full-java-stack-traces-from-syslog-ng-with-logstash/
Linux系统调优读书笔记 | 三斗室
http://chenlinux.com/2012/04/30/reading-notes-about-linux-system/
trying for a logstash conf file that works with java's logback logger — Gist
https://gist.github.com/2782951
ZeroMQ的学习和研究 « 搜索技术博客-淘宝
http://www.searchtb.com/2012/08/zeromq-primer.html

bpaquet/node-logstash
https://github.com/bpaquet/node-logstash
Using Graylog2 for Environment-Wide Log Collection and Correlation | Avid Life Media Blog
http://blog.avidlifemedia.com/2012/06/06/using-graylog2-for-environment-wide-log-collection-and-correlation/
Centralized Logging - Jason Wilder's Blog
http://jasonwilder.com/blog/2012/01/03/centralized-logging/
云风的 BLOG: ZeroMQ 的模式
http://blog.codingnow.com/2011/02/zeromq_message_patterns.html

Monitor your Java application logs in 4 easy steps at Uptime & Performance Tips
http://blog.monitis.com/index.php/2012/08/07/monitor-your-java-application-logs-in-4-easy-steps/

Centralised Java Logging - Stack Overflow
http://stackoverflow.com/questions/11100760/centralised-java-logging

Simple metric aggregation and automated custom monitors with Monitis and StatsD at Uptime & Performance Tips
http://blog.monitis.com/index.php/2012/07/30/simple-metric-aggregation-and-automated-custom-monitors-with-monitis-and-statsd/
新世紀通訊函式庫 – ZeroMQ | 程式設計 遇上 小提琴
http://blog.ez2learn.com/2011/12/31/transport-lib-of-new-era-zeromq/
java - log4j: How to use SocketAppender? - Stack Overflow
http://stackoverflow.com/questions/11759196/log4j-how-to-use-socketappender

为什么我希望用C而不是C++来实现ZeroMQ - 51CTO.COM
http://developer.51cto.com/art/201205/337012.htm
Why should I have written ZeroMQ in C, not C++ (part I) - 250bpm
http://www.250bpm.com/blog:4

Trend Micro CDC SPN Team | splunk HBase log4j
http://www.spnguru.com/tag/splunk-hbase-log4j/
waue/2011/chukwa – Cloud Computing
http://trac.nchc.org.tw/cloud/wiki/waue/2011/chukwa

Follow Up On Our Downtime Last Week | Bitbucket Blog
http://blog.bitbucket.org/2012/01/12/follow-up-on-our-downtime-last-week/

ElasticSearch入门笔记 | 狂人居 use python example
http://www.qwolf.com/?p=1387
网页正文提取算法研究[非正则] | 狂人居
http://www.qwolf.com/?p=791
elasticsearch入門筆記(1) - tka's blog
http://blog.tka.lu/blog/2012/02/25/elasticsearchru-men-bi-ji-1/

My logging setup: rsyslog, logstash, and Graylog2 | pete's brain
http://petes-brain.com/2011/12/my-logging-setup-rsyslog-logstash-and-graylog2/

Tuesday, November 13, 2012

Daily Bookmarks 20121113


Is there any implementation of this string matching method in python? - Stack Overflow
http://stackoverflow.com/questions/5192273/is-there-any-implementation-of-this-string-matching-method-in-python
algorithm - String similarity: how exactly does Bitap work? - Stack Overflow
http://stackoverflow.com/questions/11317973/string-similarity-how-exactly-does-bitap-work

Monday, November 12, 2012

Daily Bookmarks 20121112


logstash/lib/logstash at v1.1.5 · logstash/logstash · GitHub
https://github.com/logstash/logstash/tree/v1.1.5/lib/logstash
logstash/lib/logstash/outputs/elasticsearch.rb at v1.1.5 · logstash/logstash
https://github.com/logstash/logstash/blob/v1.1.5/lib/logstash/outputs/elasticsearch.rb
Visualizing logdata with Logstash, statsd and Graphite - pkhamre.blog
http://blog.pkhamre.com/2012/07/05/visualizing-logdata-with-logstash-statsd-and-graphite/
logstash - open source log management
http://logstash.net/docs/1.1.5/tutorials/10-minute-walkthrough/
Using Logstash + Statsd + graphite – Part1 (Logstash) « beingasysadmin
http://beingasysadmin.wordpress.com/2012/09/10/using-logstash-statsd-graphite-part1/
The Future of Compass & ElasticSearch
http://www.kimchy.org/the_future_of_compass/
ElasticSearch vs. Solr #lucene « Karussell
http://karussell.wordpress.com/2011/05/12/elasticsearch-vs-solr-lucene/

(4) What are the main differences between ElasticSearch, Apache Solr and SolrCloud? - Quora
http://www.quora.com/What-are-the-main-differences-between-ElasticSearch-Apache-Solr-and-SolrCloud
Realtime Search: Solr vs Elasticsearch
http://blog.socialcast.com/realtime-search-solr-vs-elasticsearch/
HOWTO: rsyslog + elasticsearch - rsyslog wiki
http://wiki.rsyslog.com/index.php/HOWTO:_rsyslog_%2B_elasticsearch
Graylog2 - About
http://www.graylog2.org/about
8+ Splunk Alternatives | DevOpsANGLE
http://devopsangle.com/2012/04/19/8-splunk-alternatives/
elasticsearch/elasticsearch
https://github.com/elasticsearch/elasticsearch
logstash - open source log management
http://logstash.net/docs/1.1.2/tutorials/10-minute-walkthrough/


博客來書籍館>大難時代
http://www.books.com.tw/exep/prod/booksfile.php?item=0010563261
亚马逊(Amazon)面试_Observer1990的空间_百度空间
http://hi.baidu.com/observer1990/item/1b5d0c293fa9e7e450fd87b0
A recommendation webservice in 10 minutes | “I for one welcome our new computer overlords”
http://ssc.io/a-recommendation-webservice-in-10-minutes/
ElasticSearch 源码分析 环境入门 - wjboy49博客 - ITeye技术网站
http://wjboy49.iteye.com/blog/1602107
【Logstash系列】用rabbitmq和elasticsearch搭建分布式日志收集存储系统 | 三斗室
http://chenlinux.com/2012/06/01/dist-logstash-and-elasticsearch/
graylog2 logstash体验 | 旁门左道
http://log.medcl.net/item/2012/01/graylog2/
Using elasticsearch mappings appropriately to map as type IP, int, float, etc. | The Untergeek
http://untergeek.com/2012/10/12/using-elasticsearch-mappings-appropriately-to-map-as-type-ip-int-float-etc/
Using templates to improve elasticsearch caching (with logstash) | The Untergeek
http://untergeek.com/2012/09/20/using-templates-to-improve-elasticsearch-caching-with-logstash/
【Logstash系列】数据格式之json-event | 三斗室
http://chenlinux.com/2012/09/21/json-event-for-logstash/
Getting Apache to output JSON (for logstash) | The Untergeek
http://untergeek.com/2012/10/11/getting-apache-to-output-json-for-logstash/
Debugging java threads with top(1) and jstack. :: semicomplete.com - Jordan Sissel
http://www.semicomplete.com/blog/geekery/debugging-java-performance.html
example42/puppet-logstash
https://github.com/example42/puppet-logstash
Jan-Piet Mens :: In where I begin to grok how to mutate a file with Logstash
http://jpmens.net/2012/08/09/i-grok-how-to-mutate-a-file-with-logstash/


rashidkpc/Kibana
https://github.com/rashidkpc/Kibana
Jan-Piet Mens :: My Logstash and Graylog2 notes
http://jpmens.net/2012/08/06/my-logstash-and-graylog2-notes/
noahhl/batsd
https://github.com/noahhl/batsd#readme
Link: Our implementation of Statsd by Noah of 37signals
http://37signals.com/svn/posts/3185-link-our-implementation-of-statsd
Grok tutorial — Official Grok 1.5 documentation
http://grok.zope.org/doc/current/tutorial.html#showing-pages

Solr 使用 Log4j - Bory.Chan
http://blog.chenlb.com/2010/08/solr-with-log4j.html
IntegratingSolr - Solr Wiki
http://wiki.apache.org/solr/IntegratingSolr
FrontPage - Solr Wiki
http://wiki.apache.org/solr/FrontPage#Search_and_Indexing
Solr Facet引发思考 on the road | 淘宝网综合业务平台团队博客
http://rdc.taobao.com/team/jm/archives/2429
三五互联产品技术博客 » Tomcat中solr采用log4j输出日志
http://ptc.35.com/?p=370
Gizzard:Twitter开源的通用数据切分中间件 | 网站那些事 | 网站点兵
http://www.xiuwz.com/site/tech-ope-gizzard/#comment-10


9 open source projects tagged log-management
http://www.findbestopensource.com/tagged/log-management
教你如何迅速秒杀掉:99%的海量数据处理面试题 - 结构之法 算法之道 - 博客频道 - CSDN.NET
http://blog.csdn.net/v_july_v/article/details/7382693
海量字符串排序--hash - dmxmwkv的博客 - 我的搜狐
http://dmxmwkv.i.sohu.com/blog/view/220003542.htm
CS 367-3 - Sorting hash sort
http://pages.cs.wisc.edu/~siff/CS367/Notes/sorting.html
untitled overview search engine
https://docs.google.com/viewer?a=v&q=cache:24qkUuWbZncJ:www.ss.pku.edu.cn/mscourse/lecture/07.pdf+&hl=zh-TW&gl=tw&pid=bl&srcid=ADGEESjvuxD6XlyOwFR1fkdVuguee0oC323UXZVpTEN44UG-uC13mRoZgP6G3qCDxC0ay4C5PZW_BYl3n9t80ail8ywpiy2TAdy8VBfSVrSNXsKCjE0yQUtUJlFOQKJSc9xisM_Ee8et&sig=AHIEtbT0Zdsgw-lrjTFjfo-Cww308PR-ow

stefan.sofa-rockers.org » Designing and Testing PyZMQ Applications – Part 1
http://stefan.sofa-rockers.org/2012/02/01/designing-and-testing-pyzmq-applications-part-1/
Central Logging with Open Source Software | Hacker News
http://news.ycombinator.com/item?id=4122991
https://github.com/facebook/scribe

Friday, November 09, 2012

Daily Bookmarks 20121109

Hadoop and Mahout in Data Mining | Spring under the hood
http://krishnasblog.com/2012/06/24/hadoop-and-mahout-in-data-mining/
Implementing a recommender engine using Hadoop and Mahout | Spring under the hood
http://krishnasblog.com/2012/06/29/implementing-a-recommender-engine-using-hadoop-and-mahout/
java - How do I build/run this simple Mahout program without getting exceptions? - Stack Overflow
http://stackoverflow.com/questions/11479600/how-do-i-build-run-this-simple-mahout-program-without-getting-exceptions
mahout在hadoop下安装与测试过程 - bai071006201的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/bai071006201/article/details/7912680
Running Mahout on Hadoop Cluster - Stack Overflow
http://stackoverflow.com/questions/11405780/running-mahout-on-hadoop-cluster?rq=1
hadoop - How run mahout in action example ReutersToSparseVectors? - Stack Overflow
http://stackoverflow.com/questions/11447145/how-run-mahout-in-action-example-reuterstosparsevectors
Mahout on Elastic MapReduce
https://cwiki.apache.org/MAHOUT/mahout-on-elastic-mapreduce.html
mahout在eclipse下的开发环境 - backsnow - ITeye技术网站
http://backsnow.iteye.com/blog/1136257
Eclipse 下mahout的配置与使用 - 百步飞扬的日志 - 网易博客
http://x-chaowu2008.blog.163.com/blog/static/100130425201261010558596/
Eclipse 下mahout的配置与使用 (原文地址:http://blog.csdn.net/zhzhl202/archive/2011/04/11/6316570.aspx) - tienan_feng的日志 - 网易博客
http://blog.163.com/tienan_feng@126/blog/static/173379258201142205646514/
CDH4 Installation - Cloudera Support
https://ccp.cloudera.com/display/CDH4DOC/CDH4+Installation#CDH4Installation-Ubuntudownload
How to cluster Seinfeld episodes with Mahout « Trifork Blog / Trifork: Enterprise Java, Open Source, software solutions, Amsterdam
http://blog.trifork.nl/2011/04/04/how-to-cluster-seinfeld-episodes-with-mahout/

Mahout Installation - Cloudera Support
https://ccp.cloudera.com/display/CDH4DOC/Mahout+Installation#MahoutInstallation-InstallingMahout
April Showers Bring May Flowers_百度空间
http://webcache.googleusercontent.com/search?q=cache:tA3mO-dtwJYJ:hi.baidu.com/xucha00/archive/tag/hadoop+&cd=17&hl=zh-TW&ct=clnk&client=ubuntu
mahout转化成eclipse项目并运行示例_漫步云端_新浪博客
http://blog.sina.com.cn/s/blog_62a9902f0100mr4y.html
Mahout - 博涛的专栏 - 博客频道 - CSDN.NET
http://blog.csdn.net/BornHe/article/category/1116242


Cluster-based Recommendation with Mahout | Burcu Dogan
http://burcudogan.com/2012/03/25/cluster-based-recommendation-with-mahout/
推荐系统介绍——Mahout笔记之一 | Dora Blog
http://diaorui.net/?p=305
eclipse安装 最新版 m2eclipse插件 | 孙伟博客-sunwei.org
http://www.sunwei.org/archives/183
Recommendation with Apache Mahout in CDH3 | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2011/11/recommendation-with-apache-mahout-in-cdh3/
Flexible Collaborative Filtering In JAVA With Mahout Taste | My Blog by Philippe Adjiman
http://www.philippeadjiman.com/blog/2009/11/11/flexible-collaborative-filtering-in-java-with-mahout-taste/

Map/Reduce Tutorial
http://hadoop.apache.org/docs/r0.20.2/mapred_tutorial.html#DistributedCache
How to Include Third-Party Libraries in Your Map-Reduce Job | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2011/01/how-to-include-third-party-libraries-in-your-map-reduce-job/
java - Running Mahout from the command line (CLASSPATH) - Stack Overflow
http://stackoverflow.com/questions/3571486/running-mahout-from-the-command-line-classpath
java - How do you programatically 'relative import' a directory of jar files? - Stack Overflow
http://stackoverflow.com/questions/8937409/how-do-you-programatically-relative-import-a-directory-of-jar-files
exception while integrating mahout recommender engine in java web application - Stack Overflow
http://stackoverflow.com/questions/9323374/exception-while-integrating-mahout-recommender-engine-in-java-web-application

Thursday, November 08, 2012

Daily Bookmarks 20121108

How to Include Third-Party Libraries in Your Map-Reduce Job | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2011/01/how-to-include-third-party-libraries-in-your-map-reduce-job/
java - Running Mahout from the command line (CLASSPATH) - Stack Overflow
http://stackoverflow.com/questions/3571486/running-mahout-from-the-command-line-classpath

Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/mahout/cf/taste/model/DataModel - Google 搜尋
https://www.google.com/search?client=ubuntu&channel=fs&q=Exception+in+thread+%22main%22+java.lang.NoClassDefFoundError%3A+org%2Fapache%2Fmahout%2Fcf%2Ftaste%2Fmodel%2FDataModel&ie=utf-8&oe=utf-8

etsy/statsd
https://github.com/etsy/statsd#readme
Link: Our implementation of Statsd by Noah of 37signals
http://37signals.com/svn/posts/3185-link-our-implementation-of-statsd
Graphite - Scalable Realtime Graphing - Graphite
http://graphite.wikidot.com/start

Tuesday, November 06, 2012

Daily Bookmarks 20121106

Recommendation with Apache Mahout in CDH3 | Apache Hadoop for the Enterprise | Cloudera
http://blog.cloudera.com/blog/2011/11/recommendation-with-apache-mahout-in-cdh3/
Mahout Installation - Cloudera Support
https://ccp.cloudera.com/display/CDHDOC/Mahout+Installation

Eclipse 下mahout的配置与使用 - 百步飞扬的日志 - 网易博客
http://x-chaowu2008.blog.163.com/blog/static/100130425201261010558596/
Includes examples and a sample implementation.
Eclipse 下mahout的配置与使用 (原文地址:http://blog.csdn.net/zhzhl202/archive/2011/04/11/6316570.aspx) - tienan_feng的日志 - 网易博客
http://blog.163.com/tienan_feng@126/blog/static/173379258201142205646514/




Fast disk-based hashtables? - Stack Overflow
http://stackoverflow.com/questions/495161/fast-disk-based-hashtables
写一个分布式存储系统有多简单? (How easy to write a Distributed Storage System ?) | 呆鸥
http://www.dullgull.com/2012/03/%E5%86%99%E4%B8%80%E4%B8%AA%E5%88%86%E5%B8%83%E5%BC%8F%E6%96%87%E4%BB%B6%E7%B3%BB%E7%BB%9F%E6%9C%89%E5%A4%9A%E7%AE%80%E5%8D%95%EF%BC%9F/
Hash Tables: Implementation - Programming in Python
https://sites.google.com/site/usfcomputerscience/hash-tables-imp
Python基础篇
http://www.tsnc.edu.cn/tsnc_wgrj/doc/python/basic.htm
Hashing
https://docs.google.com/viewer?a=v&q=cache:Oh_7BU2xhkkJ:www.intelligence.tuc.gr/~petrakis/courses/datastructures/hashingdisk.pdf+&hl=zh-TW&gl=tw&pid=bl&srcid=ADGEEShd2nP5_a2_wfnQLrkjnvC_gyA45ybfOV_uHfDfLKpraanxEWyrGZbgtrIMsTN7emgUp39NEHA3yJ95QPKijsJlOHTYV82w9dR0zowS-2lyUTP9R96IF5rZo47VPI4S_LT1mYz5&sig=AHIEtbSfH6wQqxaiwgh8GyE8EWc2M50tmw












