◢ <[wiki:III120211/Lab11 實作十一]> | <[wiki:III120211 回課程大綱]> ▲ | ◣ = 實作十二 Lab 12 = [[PageOutline]] {{{ #!html
抓抓龍安裝步驟示範
Crawlzilla installation in Practice
}}} == 下載抓抓龍 == * 抓抓龍官方網站: http://crawlzilla.info * 下載 [http://sourceforge.net/projects/crawlzilla/files/stable/Crawlzilla-1.0/Crawlzilla-1.1.1.tar.gz/download 最新穩定版 Crawlzilla 1.1.1 版本] == 安裝抓抓龍 == {{{ hadoop@debian:~$ lsb_release -a No LSB modules are available. Distributor ID: Debian Description: Debian GNU/Linux 6.0.3 (squeeze) Release: 6.0.3 Codename: squeeze hadoop@debian:~$ wget -nv http://nchc.dl.sourceforge.net/project/crawlzilla/stable/Crawlzilla-1.0/Crawlzilla-1.1.1.tar.gz hadoop@debian:~$ tar zxvf Crawlzilla-1.1.1.tar.gz hadoop@debian:~$ cd Crawlzilla_Install/ hadoop@debian:~/Crawlzilla_Install$ ./install System does not has Crawlzilla. Identify is root Your system information is: Debian , 6.0.3 Try to automatically install: sun-java6-jdk sun-java6-jre sun-java6-fonts dialog expect debian will install some packages sun-java6-jdk sun-java6-jre sun-java6-fonts dialog expect System has Sun Java 1.6 above version. System has ssh. System has ssh Server (sshd). System has dialog. Welcome to use Crawlzilla, this install program will create a new accunt and to assist you to setup the password of crawler. Set password for crawler: password: keyin the password again: password: Master IP address is: 192.168.125.1 Master MAC address is: 08:00:27:04:6b:db Please confirm the install infomation of above :1.Yes 2.No 1 Create crawler and change password. unpack success! Check and Set /etc/hosts finished. Add Crawlzilla service to /etc/init.d Crawlzilla will startup when booting update-rc.d: using dependency based boot sequencing Make the slave installation package Formatting HDFS... start up name node [Namenode] ... starting namenode, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-namenode-debian.out start up job node [JobTracker] ... starting jobtracker, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-jobtracker-debian.out starting datanode, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-datanode-debian.out starting tasktracker, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-tasktracker-debian.out Start up tomcat... ..... Tomcat may not start, please use " crawlzilla " to start Installed successfully! You can visit the manage website :http://192.168.125.1:8080 Finish!!! }}}