source: nutchez-0.1/tomcat/webapps/ROOT/WEB-INF/classes/suffix-urlfilter.txt @ 66

Last change on this file since 66 was 66, checked in by waue, 15 years ago

NutchEz - an easy way to nutch

File size: 630 bytes
Line 
1# config file for urlfilter-suffix plugin
2
3# case-insensitive, allow unknown suffixes
4+I
5# uncomment the line below to filter on url path
6#+P
7
8### prohibit these
9# pictures
10.gif
11.jpg
12.jpeg
13.bmp
14.png
15.tif
16.tiff
17.ico
18.eps
19.ps
20.wmf
21.fpx
22.cur
23.ani
24.img
25.lwf
26.pcd
27.psp
28.psd
29.tga
30.xbm
31.xpm
32
33# web-formats
34.css
35
36# archives/packages
37.arj
38.arc
39.7z
40.cab
41.lzw
42.lha
43.lzh
44.zip
45.gz
46.tar
47.tgz
48.sit
49.rpm
50.deb
51.pkg
52
53# audio/video
54.mid
55.midi
56.rmi
57.mpeg
58.mpg
59.mpe
60.mp3
61.mp2
62.aac
63.mov
64.fla
65.flv
66.ra
67.ram
68.rm
69.rmv
70.wma
71.wmv
72.wav
73.wave
74.ogg
75.avi
76.au
77.snd
78
79# executables
80.exe
81.com
82
83# windows links
84.lnk
85
86# typo3-extensions
87.t3x
88
89# disc-images
90.iso
91.bin
Note: See TracBrowser for help on using the repository browser.