Changes between Version 3 and Version 4 of Hinet130923/Lab11
- Timestamp:
- Sep 24, 2013, 12:20:48 PM (12 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
Hinet130923/Lab11
v3 v4 10 10 }}} 11 11 12 {{{ 13 #!text 14 請先連線至 nodeN.3du.me , N 為您的報名編號 15 }}} 16 12 17 == Sample 1 : WordCount == 13 18 14 19 * 如名稱,WordCount會對所有的字作字數統計,並且從a-z作排列[[BR]]WordCount example will count each word shown in documents and sorting from a to z. 15 20 {{{ 16 ~$ hadoop fs -put /etc/hadoop/conf lab 5_input17 ~$ hadoop fs -rmr lab 5_out218 ~$ hadoop jar hadoop-examples.jar wordcount lab 5_input lab5_out221 ~$ hadoop fs -put /etc/hadoop/conf lab11_input 22 ~$ hadoop fs -rmr lab11_out2 23 ~$ hadoop jar hadoop-examples.jar wordcount lab11_input lab11_out2 19 24 }}} 20 25 * 檢查輸出結果的方法同之前方法[[BR]]Let's check the computed result of '''wordcount''' from HDFS : 21 26 {{{ 22 $ hadoop fs -ls lab5_out223 $ hadoop fs -cat lab5_out2/part-r-0000027 ~$ hadoop fs -ls lab11_out2 28 ~$ hadoop fs -cat lab11_out2/part-r-00000 24 29 }}} 25 30 * 結果如下[[BR]]You should see results like this: … … 40 45 * grep 這個命令是擷取文件裡面特定的字元,在Hadoop example中此指令可以擷取文件中有此指定文字的字串,並作計數統計[[BR]]grep is a command to extract specific characters in documents. In hadoop examples, you can use this command to extract strings match the regular expression and count for matched strings. 41 46 {{{ 42 $ hadoop fs -ls lab5_input43 $ hadoop jar hadoop-examples.jar grep lab5_input lab5_out3 'dfs[a-z.]+'47 ~$ hadoop fs -ls lab11_input 48 ~$ hadoop jar hadoop-examples.jar grep lab11_input lab11_out3 'dfs[a-z.]+' 44 49 }}} 45 50 * 運作的畫面如下:[[BR]]You should see procedure like this: … … 52 57 * 接著查看結果[[BR]]Let's check the computed result of '''grep''' from HDFS : 53 58 {{{ 54 $ hadoop fs -ls lab5_out359 ~$ hadoop fs -ls lab11_out3 55 60 Found 2 items 56 drwx------ - hXXXX supergroup 0 2011-04-19 10:00 /user/hXXXX/lab 5_out1/_logs57 -rw-r--r-- 2 hXXXX supergroup 1146 2011-04-19 10:00 /user/hXXXX/lab 5_out1/part-0000058 $ hadoop fs -cat lab5_out3/part-0000061 drwx------ - hXXXX supergroup 0 2011-04-19 10:00 /user/hXXXX/lab11_out1/_logs 62 -rw-r--r-- 2 hXXXX supergroup 1146 2011-04-19 10:00 /user/hXXXX/lab11_out1/part-00000 63 ~$ hadoop fs -cat lab11_out3/part-00000 59 64 }}} 60 65 * 結果如下[[BR]]You should see results like this: