NutchEZ Installation Procedure
Assumptions
- JAVA_HOME=/usr/lib/jvm/java-6-sun
- User: nutchuser
- Nutch source archive: /home/nutchuser/nutch-1.0.tar.gz
- Tomcat source archive: /home/nutchuser/apache-tomcat-6.0.18.tar.gz
- NutchEZ install path: /opt/nutchEZ
- Tomcat install path: /opt/nutchEZ/tomcat
Starting the installation
Prompt the user for the following information
- Admin e-mail
- DNS name
- Master IP (set by the program)
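The prompting step could be sketched as below. This is only a minimal illustration: the function names `ask` and `collect_settings`, the variable names, and the default values are hypothetical and are not taken from the NutchEZ installer. The Master IP is left out because it is set by the program rather than asked for.

```shell
#!/bin/sh
# Hypothetical helper: print a prompt, read one line from stdin,
# and fall back to the given default when the answer is empty.
ask() {
    # $1 = prompt text, $2 = default value
    printf '%s [%s]: ' "$1" "$2" >&2
    IFS= read -r answer || answer=""
    printf '%s\n' "${answer:-$2}"
}

# Hypothetical wrapper: collect the two values the installer asks for.
# The defaults are assumptions made for this sketch.
collect_settings() {
    ADMIN_EMAIL=$(ask "Admin e-mail" "nutchuser@localhost")
    MASTER_DNS=$(ask "DNS name" "$(uname -n)")
}
```

Calling `collect_settings` interactively leaves the answers in `ADMIN_EMAIL` and `MASTER_DNS`, which the later configuration steps can substitute into the config files.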
Install Nutch
Extract, rename the folder, and set the owner
- tar zxvf /home/nutchuser/nutch-1.0.tar.gz -C /opt
- mv /opt/nutch-1.0 /opt/nutchEZ
- chown -R nutchuser:nutchuser /opt/nutchEZ
Write the settings into the configuration files
- hadoop-env.sh
- hadoop-site.xml($MasterDNS)
- nutch-site.xml($Admin)
- slaves (the cluster client_install must modify this file)
- crawl-urlfilter.txt (crawl rules)
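As a rough sketch of how the collected values might be written into two of these files: the function names are hypothetical, the ports 9000 and 9001 are assumed values, and mapping `$Admin` to the `http.agent.email` property is an assumption. The source only states which file each variable goes into, not which properties are set.

```shell
#!/bin/sh
# Hypothetical helper: write a minimal hadoop-site.xml that points
# HDFS and the JobTracker at the master host.
# $1 = conf directory, $2 = master DNS name. Ports are assumptions.
write_hadoop_site() {
    cat > "$1/hadoop-site.xml" <<EOF
<?xml version="1.0"?>
<configuration>
  <property>
    <name>fs.default.name</name>
    <value>hdfs://$2:9000</value>
  </property>
  <property>
    <name>mapred.job.tracker</name>
    <value>$2:9001</value>
  </property>
</configuration>
EOF
}

# Hypothetical helper: write a minimal nutch-site.xml carrying the
# admin e-mail. $1 = conf directory, $2 = admin e-mail address.
# Using http.agent.email for this value is an assumption.
write_nutch_site() {
    cat > "$1/nutch-site.xml" <<EOF
<?xml version="1.0"?>
<configuration>
  <property>
    <name>http.agent.email</name>
    <value>$2</value>
  </property>
</configuration>
EOF
}
```

With the paths assumed in this document, the calls would look like `write_hadoop_site /opt/nutchEZ/conf "$MASTER_DNS"` and `write_nutch_site /opt/nutchEZ/conf "$ADMIN_EMAIL"`. Both functions overwrite any existing file of the same name.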
Start Nutch
Install Tomcat
Extract, rename the folder, and set the owner
- tar zxvf /home/nutchuser/apache-tomcat-6.0.18.tar.gz -C /opt/nutchEZ/
- mv /opt/nutchEZ/apache-tomcat-6.0.18 /opt/nutchEZ/tomcat
- chown -R nutchuser:nutchuser /opt/nutchEZ/
Environment setup
$ cd /opt/nutchEZ
$ mkdir web
$ cd web
$ jar -xvf ../nutch-1.0.war
$ rm ../nutch-1.0.war
$ mv /opt/nutchEZ/tomcat/webapps/ROOT /opt/nutchEZ/tomcat/webapps/ROOT-ori
$ cd /opt/nutchEZ
$ mv /opt/nutchEZ/web /opt/nutchEZ/tomcat/webapps/ROOT
$ mkdir /opt/nutchEZ/search
Edit the configuration files
/opt/nutchEZ/tomcat/conf/server.xml
/opt/nutchEZ/tomcat/webapps/ROOT/WEB-INF/classes/nutch-site.xml
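One plausible edit to the web application's nutch-site.xml is to point Nutch's `searcher.dir` property at the /opt/nutchEZ/search directory created earlier. The sketch below assumes that this is the intended change; the source does not say which properties are modified, and the function name `write_searcher_site` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: write a nutch-site.xml for the search webapp
# that sets searcher.dir (assumed to be the setting being changed).
# $1 = WEB-INF/classes directory, $2 = search directory (optional).
# Note: this overwrites an existing nutch-site.xml in that directory.
write_searcher_site() {
    classes_dir=$1
    search_dir=${2:-/opt/nutchEZ/search}
    cat > "$classes_dir/nutch-site.xml" <<EOF
<?xml version="1.0"?>
<configuration>
  <property>
    <name>searcher.dir</name>
    <value>$search_dir</value>
  </property>
</configuration>
EOF
}
```

For the layout in this document the call would be `write_searcher_site /opt/nutchEZ/tomcat/webapps/ROOT/WEB-INF/classes`.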
Start Tomcat
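Starting Tomcat could look roughly like this. The wrapper function `start_tomcat` is hypothetical; `bin/startup.sh` is the start script shipped with Tomcat, and the default paths come from the assumptions at the top of this document.

```shell
#!/bin/sh
# Hypothetical wrapper: check that Tomcat's startup script exists,
# then run it with JAVA_HOME set.
# $1 = Tomcat home (optional, defaults to the path assumed above).
start_tomcat() {
    tomcat_home=${1:-/opt/nutchEZ/tomcat}
    if [ ! -x "$tomcat_home/bin/startup.sh" ]; then
        echo "startup.sh not found or not executable under $tomcat_home/bin" >&2
        return 1
    fi
    # Fall back to the JAVA_HOME assumed in this document when unset.
    JAVA_HOME=${JAVA_HOME:-/usr/lib/jvm/java-6-sun} "$tomcat_home/bin/startup.sh"
}
```

Running `start_tomcat` as nutchuser with no argument starts the instance under /opt/nutchEZ/tomcat.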
Run phase