Changes between Version 3 and Version 4 of Hinet130923/Lab11


Ignore:
Timestamp:
Sep 24, 2013, 12:20:48 PM (11 years ago)
Author:
jazz
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • Hinet130923/Lab11

    v3 v4  
    1010}}}
    1111
     12{{{
     13#!text
     14請先連線至 nodeN.3du.me , N 為您的報名編號
     15}}}
     16
    1217== Sample 1 : WordCount ==
    1318 
    1419 * 如名稱,WordCount會對所有的字作字數統計,並且從a-z作排列[[BR]]WordCount example will count each word shown in documents and sorting from a to z.
    1520{{{
    16 ~$ hadoop fs -put /etc/hadoop/conf lab5_input
    17 ~$ hadoop fs -rmr lab5_out2
    18 ~$ hadoop jar hadoop-examples.jar wordcount lab5_input lab5_out2
     21~$ hadoop fs -put /etc/hadoop/conf lab11_input
     22~$ hadoop fs -rmr lab11_out2
     23~$ hadoop jar hadoop-examples.jar wordcount lab11_input lab11_out2
    1924}}}
    2025 * 檢查輸出結果的方法同之前方法[[BR]]Let's check the computed result of '''wordcount''' from HDFS :
    2126{{{
    22 $ hadoop fs -ls lab5_out2
    23 $ hadoop fs -cat lab5_out2/part-r-00000
     27~$ hadoop fs -ls lab11_out2
     28~$ hadoop fs -cat lab11_out2/part-r-00000
    2429}}}
    2530 * 結果如下[[BR]]You should see results like this:
     
    4045 * grep 這個命令是擷取文件裡面特定的字元,在Hadoop example中此指令可以擷取文件中有此指定文字的字串,並作計數統計[[BR]]grep is a command to extract specific characters in documents. In hadoop examples, you can use this command to extract strings match the regular expression and count for matched strings.
    4146{{{
    42 $ hadoop fs -ls lab5_input
    43 $ hadoop jar hadoop-examples.jar grep lab5_input lab5_out3 'dfs[a-z.]+'
     47~$ hadoop fs -ls lab11_input
     48~$ hadoop jar hadoop-examples.jar grep lab11_input lab11_out3 'dfs[a-z.]+'
    4449}}}
    4550 * 運作的畫面如下:[[BR]]You should see procedure like this: 
     
    5257 * 接著查看結果[[BR]]Let's check the computed result of '''grep''' from HDFS :
    5358{{{
    54 $ hadoop fs -ls lab5_out3
     59~$ hadoop fs -ls lab11_out3
    5560Found 2 items
    56 drwx------   - hXXXX supergroup          0 2011-04-19 10:00 /user/hXXXX/lab5_out1/_logs
    57 -rw-r--r--   2 hXXXX supergroup       1146 2011-04-19 10:00 /user/hXXXX/lab5_out1/part-00000
    58 $ hadoop fs -cat lab5_out3/part-00000
     61drwx------   - hXXXX supergroup          0 2011-04-19 10:00 /user/hXXXX/lab11_out1/_logs
     62-rw-r--r--   2 hXXXX supergroup       1146 2011-04-19 10:00 /user/hXXXX/lab11_out1/part-00000
     63~$ hadoop fs -cat lab11_out3/part-00000
    5964}}}
    6065 * 結果如下[[BR]]You should see results like this: