Changes between Initial Version and Version 1 of jazz/12-03-27


Ignore:
Timestamp:
Mar 27, 2012, 10:09:23 AM (12 years ago)
Author:
jazz
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • jazz/12-03-27

    v1 v1  
     1= 2012-03-27 =
     2
     3== Hadoop ==
     4
     5 * 一些關於 Hadoop 的學習資訊:
     6  * MapReduce Framework
     7   * Great 1 hour video introduction: http://nosqltapes.com/video/understanding-mapreduce-with-mike-miller
     8   * Read the famous 2004 paper from Google that kicked off the MapReduce revolution. This is a very readable paper that can be digested in about 2 - 3 hours: http://research.google.com/archive/mapreduce.html
     9   * Here's a 33 minute video on what kinds of simple things you can do with MapReduce: http://www.cloudera.com/videos/mapreduce_algorithms
     10   * Google's MapReduce course: http://code.google.com/edu/parallel/mapreduce-tutorial.html
     11  * Beginner Hadoop
     12   * Excellent beginner's video on understanding Hadoop, MapReduce and HDFS: http://www.cloudera.com/protected/?resource=introduction-to-apache-mapreduce-and-hdfs
     13   * Understanding the Hadoop ecosystem: http://www.cloudera.com/protected/?resource=apache-hadoop-ecosystem
     14  * HDFS
     15   * An easy 2-3 hour read about Hadoop's distributed File System: http://www.aosabook.org/en/hdfs.html
     16  * Labs
     17   * Install VirtualBox on your laptop, get an Ubuntu Virtual Machine going and follow this excellent tutorial to install your first Hadoop node: http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/
     18   * Then use this to scale your cluster to multiple nodes: http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-multi-node-cluster/
     19   * Run a MapReduce job in Python on your cluster: http://www.michael-noll.com/tutorials/writing-an-hadoop-mapreduce-program-in-python/