close
Warning:
Can't synchronize with repository "(default)" (Unsupported version control system "svn": /usr/lib/python2.7/dist-packages/libsvn/_core.so: failed to map segment from shared object: Cannot allocate memory). Look in the Trac log for more information.
- Timestamp:
-
Apr 11, 2016, 5:51:52 PM (8 years ago)
- Author:
-
jazz
- Comment:
-
--
Legend:
- Unmodified
- Added
- Removed
- Modified
-
v4
|
v5
|
|
1 | 1 | = 作業一 = |
2 | 2 | |
3 | | * 題目:請參考 hadoop_labs/lab013 改成逆向索引(Reverse Index)。使 !ReverseIndex 執行之結果為「"關鍵字"\t"檔案名稱(用逗點隔開)"」型態。 |
4 | | * 參考: 以[wiki: 連結之執行方法],忽略句點(\.)與逗點(\,),並且忽略大小寫(case.sensitive=false), |
| 3 | * 題目:請參考 hadoop_labs/lab010 改成逆向索引(Reverse Index)。使 !ReverseIndex 執行之結果為「"關鍵字"\t"檔案名稱(用逗點隔開)"」型態。 |
5 | 4 | * 參考步驟:[[BR]]Here is the reference steps: |
6 | 5 | {{{ |
… |
… |
|
9 | 8 | ~$ echo "I like ntu Course, and we enjoy this course." > hw1_input/input2 |
10 | 9 | ~$ hadoop fs -put hw1_input hw1_input |
11 | | ~$ echo "\." > pattern.txt && echo "\," >> pattern.txt |
12 | | ~$ hadoop fs -put pattern.txt . |
13 | | ~$ hadoop jar WordCount -Dwordcount.case.sensitive=false hw1_input hw1_out -skip pattern.txt |
| 10 | ~$ hadoop jar WordCount hw1_input hw1_out |
14 | 11 | ~$ hadoop fs -cat hw1_out/part-00000 |
15 | 12 | }}} |