NutchEZ Install Test
Steps
- Place the install shell script and the *.tar.gz archive in the same directory
- Run install.sh
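The two steps above can be sketched as a small pre-flight check. This is only a sketch: `check_install_dir` is a hypothetical helper, not part of NutchEZ, and it assumes nothing beyond what the steps state (a file named install.sh and at least one *.tar.gz archive in the same directory).

```shell
#!/bin/sh
# Hypothetical helper (not shipped with NutchEZ): succeed only if DIR
# contains install.sh and at least one *.tar.gz archive.
check_install_dir() {
    dir="${1:-.}"

    if [ ! -f "$dir/install.sh" ]; then
        echo "install.sh not found in $dir" >&2
        return 1
    fi

    # If the glob matches nothing, $f holds the literal pattern and -f fails.
    for f in "$dir"/*.tar.gz; do
        if [ -f "$f" ]; then
            return 0
        fi
    done

    echo "no *.tar.gz archive found in $dir" >&2
    return 1
}
```

For example, `check_install_dir . && ./install.sh` starts the installer only when both files are present (assuming install.sh is executable).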
Post-installation checks
Path | Check item
/home/nutchuser/nutchez/source | client installer (check IP and hostname), client archive
/etc/hosts | duplicate entries for the same hostname must be commented out
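The /etc/hosts check in the table can be partly automated. The following is a minimal sketch, not part of NutchEZ: `count_host_entries` is a hypothetical helper that counts the uncommented lines of a hosts file that list a given hostname. A result greater than 1 suggests that an entry still needs to be commented out.

```shell
#!/bin/sh
# Hypothetical helper: count_host_entries HOSTNAME [HOSTS_FILE]
# Prints the number of active (uncommented) lines in HOSTS_FILE that
# list HOSTNAME. HOSTS_FILE defaults to /etc/hosts.
count_host_entries() {
    name="$1"
    file="${2:-/etc/hosts}"
    awk -v name="$name" '
        { sub(/#.*/, "") }                # drop whole-line and trailing comments
        {
            for (i = 2; i <= NF; i++)     # field 1 is the IP address
                if ($i == name) { n++; break }
        }
        END { print n + 0 }               # print 0 when nothing matched
    ' "$file"
}
```

For example, `count_host_entries "$(hostname)"` reports how many active entries the local machine's own hostname has in /etc/hosts.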
Tested on
Ubuntu 10.04
Ubuntu 9.10
Execution
2010/06/10
10/06/10 16:58:42 INFO mapred.JobClient: Task Id : attempt_201006091555_0003_r_000000_0, Status : FAILED
Shuffle Error: Exceeded MAX_FAILED_UNIQUE_FETCHES; bailing-out.
10/06/10 16:58:53 INFO mapred.JobClient: Task Id : attempt_201006091555_0003_r_000000_1, Status : FAILED
Shuffle Error: Exceeded MAX_FAILED_UNIQUE_FETCHES; bailing-out.
10/06/10 16:59:05 INFO mapred.JobClient: Task Id : attempt_201006091555_0003_r_000000_2, Status : FAILED
Shuffle Error: Exceeded MAX_FAILED_UNIQUE_FETCHES; bailing-out.
Exception in thread "main" java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1232)
at org.apache.nutch.crawl.Generator.generate(Generator.java:472)
at org.apache.nutch.crawl.Generator.generate(Generator.java:409)
at org.apache.nutch.crawl.Crawl.main(Crawl.java:116)
nutch crawl is error