close
Warning:
Can't synchronize with repository "(default)" (Unsupported version control system "svn": /usr/lib/python2.7/dist-packages/libsvn/_fs.so: failed to map segment from shared object: Cannot allocate memory). Look in the Trac log for more information.
- Timestamp:
-
Jan 19, 2010, 4:30:40 PM (16 years ago)
- Author:
-
waue
- Comment:
-
--
Legend:
- Unmodified
- Added
- Removed
- Modified
-
|
v1
|
v2
|
|
| 2 | 2 | |
| 3 | 3 | == 前言 == |
| 4 | | 按照hadoop 0.20 官方網頁的 wordcount v2 . |
| 5 | | [[http://hadoop.apache.org/common/docs/r0.20.1/mapred_tutorial.html#Example%3A+WordCount+v1.0 ]] |
| | 4 | 按照hadoop 0.20 官方網頁的 wordcount v2 .[[BR]] |
| | 5 | [http://hadoop.apache.org/common/docs/r0.20.1/mapred_tutorial.html#Example%3A+WordCount+v1.0 WordCount] |
| 6 | 6 | |
| 7 | | 最需要給的地方是 ''' " extends MapReduceBase implements Mapper" ''' 原因是在hadoop 0.20時,mapreducebase 此class已經被deprecated, |
| 8 | | |
| | 7 | 最需要給的地方是 ''' " extends MapReduceBase implements Mapper" ''' 原因是在hadoop 0.20時,mapreducebase 此class已經被deprecated,[[BR]] |
| 9 | 8 | 因此應改寫如 ''' " extends Mapper" ''' |
| 10 | 9 | |
| 11 | | 然而最主要不能改變的原因是,程式中很重要的功能 [http://hadoop.apache.org/common/docs/r0.20.1/api/org/apache/hadoop/filecache/DistributedCache.html DistributedCache ] 以及 -Dwordcount.skip.patterns 等功能寫於 configure() 函數內。 此configure() 繼承自 MapReduceBase, |
| 12 | | |
| | 10 | 然而最主要不能改變的原因是,程式中很重要的功能 [http://hadoop.apache.org/common/docs/r0.20.1/api/org/apache/hadoop/filecache/DistributedCache.html DistributedCache ] 以及 -Dwordcount.skip.patterns 等功能寫於 configure() 函數內。 此configure() 繼承自 MapReduceBase,[[BR]] |
| 13 | 11 | 因此若整個程式改成hadoop 0.20 的 " extends Mapper" ''',則有些功能將不知是否能使用 |
| 14 | 12 | |
| … |
… |
|
| 154 | 152 | }}} |
| 155 | 153 | |
| 156 | | == 原始程式碼 == |
| | 154 | |