Changes between Version 5 and Version 6 of waue/2011/0727
- Timestamp:
- Jul 27, 2011, 5:00:35 PM (14 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
waue/2011/0727
v5 v6 46 46 {{{ 47 47 #!text 48 <configuration> 48 49 <property> 49 50 <name>http.agent.name</name> 50 51 <value>waue_test</value> 51 52 </property> 53 <property> 54 <name>plugin.includes</name> 55 <value>protocol-http|urlfilter-regex|parse-(html|tika)|index-(basic|anchor)|scoring-opic|urlnormalizer-(pass|regex|basic)</value> 56 </property> 57 <property> 58 <name>http.robots.agents</name> 59 <value>nutch</value> 60 </property> 61 <property> 62 <name>http.agent.url</name> 63 <value>waue_test</value> 64 </property> 65 <property> 66 <name>http.agent.email</name> 67 <value>waue_test</value> 68 </property> 69 <property> 70 <name>http.agent.version</name> 71 <value>waue_test</value> 72 </property> 73 </configuration> 52 74 }}} 53 75 … … 74 96 75 97 {{{ 76 echo "http://lucene.apache.org/nutch/" >urls/url.txt98 mkdir urls ; echo "http://lucene.apache.org/nutch/" >urls/url.txt 77 99 bin/nutch crawl urls -dir crawl2 -depth 2 -topN 50 78 100 }}}