Changes between Version 5 and Version 6 of NTUOSS160412/HW1
- Timestamp:
- Apr 11, 2016, 5:54:18 PM (9 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
NTUOSS160412/HW1
v5 v6 1 = 作業一 = 1 ◢ <[wiki:NTUOSS160412/Lab7 實作七]> | <[wiki:NTUOSS160412 回課程大綱]> ▲ | > ◣ 2 2 3 * 題目:請參考 hadoop_labs/lab010 改成逆向索引(Reverse Index)。使 !ReverseIndex 執行之結果為「"關鍵字"\t"檔案名稱(用逗點隔開)"」型態。 3 = 作業 = 4 5 * 題目:請參考 hadoop_labs/lab009 改成逆向索引(Reverse Index)。使 !ReverseIndex 執行之結果為「"關鍵字"\t"檔案名稱(用逗點隔開)"」型態。 4 6 * 參考步驟:[[BR]]Here is the reference steps: 5 7 {{{ … … 8 10 ~$ echo "I like ntu Course, and we enjoy this course." > hw1_input/input2 9 11 ~$ hadoop fs -put hw1_input hw1_input 10 ~$ hadoop jar WordCount hw1_input hw1_out12 ~$ hadoop jar WordCount.jar hw1_input hw1_out 11 13 ~$ hadoop fs -cat hw1_out/part-00000 12 14 }}} 13 15 * 參考結果應該為:(路徑不限)[[BR]]The reference result should be as following:(no limitation for the format of "path") 14 16 {{{ 15 and input216 course input1,input2,input217 enjoy input218 i input1,input219 like input1,input220 ntu input1,input221 this input222 we input223 17 }}} 24 18 * 繳交期限:2016年5月2日(一) 上午 11:59 … … 30 24 * 試將出現次數統計加入結果,亦即參考結果如下:[[BR]]Try to add count of each file in the result, i.e. The reference result should be as following: 31 25 {{{ 32 and input2(1)33 cloud input1(1),input2(1)34 course input1(1),input2(2)35 enjoy input2(1)36 i input1(1),input2(1)37 like input1(1),input2(1)38 nctu input1(1),input2(1)39 this input2(1)40 we input2(1)41 26 }}} 42 27 * 配分比例: