Version 1 (modified by jazz, 12 years ago)
Lab 13
Crawlzilla Installation in Practice
Download Crawlzilla
- Crawlzilla official website: http://crawlzilla.info
- Download the latest stable release, Crawlzilla 1.1.1
Install Crawlzilla
hadoop@debian:~$ lsb_release -a
No LSB modules are available.
Distributor ID: Debian
Description: Debian GNU/Linux 6.0.3 (squeeze)
Release: 6.0.3
Codename: squeeze
hadoop@debian:~$ wget -nv http://nchc.dl.sourceforge.net/project/crawlzilla/stable/Crawlzilla-1.0/Crawlzilla-1.1.1.tar.gz
hadoop@debian:~$ tar zxvf Crawlzilla-1.1.1.tar.gz
hadoop@debian:~$ cd Crawlzilla_Install/
hadoop@debian:~/Crawlzilla_Install$ ./install
System does not has Crawlzilla.
Identify is root
Your system information is: Debian , 6.0.3
Try to automatically install: sun-java6-jdk sun-java6-jre sun-java6-fonts dialog expect
debian will install some packages sun-java6-jdk sun-java6-jre sun-java6-fonts dialog expect
System has Sun Java 1.6 above version.
System has ssh.
System has ssh Server (sshd).
System has dialog.
Welcome to use Crawlzilla, this install program will create a new accunt and to assist you to setup the password of crawler.
Set password for crawler:
password:
keyin the password again:
password:
Master IP address is: 192.168.125.1
Master MAC address is: 08:00:27:04:6b:db
Please confirm the install infomation of above :1.Yes 2.No
1
Create crawler and change password.
unpack success!
Check and Set /etc/hosts finished.
Add Crawlzilla service to /etc/init.d
Crawlzilla will startup when booting
update-rc.d: using dependency based boot sequencing
Make the slave installation package
Formatting HDFS...
start up name node [Namenode] ...
starting namenode, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-namenode-debian.out
start up job node [JobTracker] ...
starting jobtracker, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-jobtracker-debian.out
starting datanode, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-datanode-debian.out
starting tasktracker, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-tasktracker-debian.out
Start up tomcat...
.....
Tomcat may not start, please use " crawlzilla " to start
Installed successfully!
You can visit the manage website :http://192.168.125.1:8080
Finish!!!
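After the installer finishes, the output above can be checked for two things: the management URL it reports, and a start-up line for each of the four Hadoop daemons. The following is a minimal sketch, assuming the terminal output has been copied into a text file with one message per line. The function names manage_url and missing_daemons are hypothetical helpers written for this lab and are not part of Crawlzilla.

```shell
#!/bin/sh
# Assumption: the installer's terminal output has been saved to a text file,
# one message per line, and that file is passed as the first argument.

# Print the management URL reported by the installer (first match only).
manage_url() {
    sed -n 's/^You can visit the manage website *: *\(http[^ ]*\).*$/\1/p' "$1" | head -n 1
}

# Print the name of every Hadoop daemon that has no "starting <daemon>," line
# in the saved output. Prints nothing when all four lines are present.
missing_daemons() {
    for daemon in namenode jobtracker datanode tasktracker; do
        grep -q "^starting $daemon," "$1" || echo "$daemon"
    done
}
```

For example, if the output was saved as install.log (a file name chosen only for this example), running `manage_url install.log` should print the URL to open in a browser, and `missing_daemons install.log` should print nothing when all four daemons were started. A missing "starting" line only shows that the message is absent from the saved output; the log files under /var/log/crawlzilla/hadoop-logs/ are the place to look for the actual cause.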