| 1 | [[PageOutline]] |
| 2 | |
| 3 | ◢ <[wiki:III121201/Lab11 實作十二]> | <[wiki:III121201 回課程大綱]> ▲ | ◣ |
| 4 | |
| 5 | = 實作十三 Lab 13 = |
| 6 | |
| 7 | {{{ |
| 8 | #!html |
| 9 | <div style="text-align: center;"><big style="font-weight: bold;"><big>抓抓龍安裝步驟示範<br/>Crawlzilla installation in Practice</big></big></div> |
| 10 | }}} |
| 11 | |
| 12 | == 下載抓抓龍 == |
| 13 | |
| 14 | * 抓抓龍官方網站: http://crawlzilla.info |
| 15 | * 下載 [http://sourceforge.net/projects/crawlzilla/files/stable/Crawlzilla-1.0/Crawlzilla-1.1.2.tar.gz/download 最新穩定版 Crawlzilla 1.1.2 版本] |
| 16 | |
| 17 | == 安裝抓抓龍 == |
| 18 | |
| 19 | {{{ |
| 20 | hadoop@debian:~$ lsb_release -a |
| 21 | No LSB modules are available. |
| 22 | Distributor ID: Debian |
| 23 | Description: Debian GNU/Linux 6.0.3 (squeeze) |
| 24 | Release: 6.0.3 |
| 25 | Codename: squeeze |
| 26 | hadoop@debian:~$ wget -nv http://nchc.dl.sourceforge.net/project/crawlzilla/stable/Crawlzilla-1.0/Crawlzilla-1.1.2.tar.gz |
| 27 | hadoop@debian:~$ tar zxvf Crawlzilla-1.1.2.tar.gz |
| 28 | hadoop@debian:~$ cd Crawlzilla_Install/ |
| 29 | hadoop@debian:~/Crawlzilla_Install$ ./install |
| 30 | |
| 31 | System does not has Crawlzilla. |
| 32 | Identify is root |
| 33 | Your system information is: |
| 34 | Debian , 6.0.3 |
| 35 | Try to automatically install: sun-java6-jdk sun-java6-jre sun-java6-fonts dialog expect |
| 36 | |
| 37 | debian will install some packages sun-java6-jdk sun-java6-jre sun-java6-fonts dialog expect |
| 38 | |
| 39 | System has Sun Java 1.6 above version. |
| 40 | System has ssh. |
| 41 | System has ssh Server (sshd). |
| 42 | System has dialog. |
| 43 | Welcome to use Crawlzilla, this install program will create a new accunt and to assist you to setup the password of crawler. |
| 44 | Set password for crawler: |
| 45 | password: |
| 46 | |
| 47 | keyin the password again: |
| 48 | password: |
| 49 | |
| 50 | Master IP address is: 192.168.125.1 |
| 51 | Master MAC address is: 08:00:27:04:6b:db |
| 52 | Please confirm the install infomation of above :1.Yes 2.No |
| 53 | 1 |
| 54 | Create crawler and change password. |
| 55 | |
| 56 | unpack success! |
| 57 | Check and Set /etc/hosts finished. |
| 58 | Add Crawlzilla service to /etc/init.d |
| 59 | Crawlzilla will startup when booting |
| 60 | update-rc.d: using dependency based boot sequencing |
| 61 | Make the slave installation package |
| 62 | Formatting HDFS... |
| 63 | start up name node [Namenode] ... |
| 64 | starting namenode, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-namenode-debian.out |
| 65 | start up job node [JobTracker] ... |
| 66 | starting jobtracker, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-jobtracker-debian.out |
| 67 | starting datanode, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-datanode-debian.out |
| 68 | starting tasktracker, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-tasktracker-debian.out |
| 69 | Start up tomcat... |
| 70 | ..... |
| 71 | Tomcat may not start, please use " crawlzilla " to start |
| 72 | Installed successfully! |
| 73 | You can visit the manage website :http://192.168.125.1:8080 |
| 74 | Finish!!! |
| 75 | }}} |