close
Warning:
Can't synchronize with repository "(default)" (Unsupported version control system "svn": /usr/lib/python2.7/dist-packages/libsvn/_repos.so: failed to map segment from shared object: Cannot allocate memory). Look in the Trac log for more information.
- Timestamp:
-
Apr 8, 2009, 5:08:13 PM (17 years ago)
- Author:
-
waue
- Comment:
-
--
Legend:
- Unmodified
- Added
- Removed
- Modified
-
|
v7
|
v8
|
|
| 61 | 61 | </property> |
| 62 | 62 | |
| | 63 | How can I find out/display the size and mime type of the hits that a search returns? |
| | 64 | <property> |
| | 65 | <name>plugin.includes</name> |
| | 66 | <value>...|index-more|...|query-more|...</value> |
| | 67 | ... |
| | 68 | </property> |
| | 69 | |
| | 70 | Nutch doesn't crawl relative URLs? Some pages are not indexed but my regex file and everything else is okay - what is going on? |
| | 71 | The crawl tool has a default limitation of 100 outlinks of one page that are being fetched. |
| | 72 | <property> |
| | 73 | <name>db.max.outlinks.per.page</name> |
| | 74 | <value>-1</value> |
| | 75 | <description> </description> |
| | 76 | </property> |
| 63 | 77 | |
| 64 | 78 | }}} |
| | 79 | |
| | 80 | * 上面有些有用的訊息,但不見得可以解決遇到的問題,至少可以看crawl.log ,看他都fetch , index了哪些網址 |