= Lab 13 =
[[PageOutline]]
{{{
#!html
<div style="text-align: center;"><big style="font-weight: bold;"><big>Crawlzilla Installation in Practice</big></big></div>
}}}

== Download Crawlzilla ==

 * Crawlzilla official website: http://crawlzilla.info
 * Download [http://sourceforge.net/projects/crawlzilla/files/stable/Crawlzilla-1.0/Crawlzilla-1.1.1.tar.gz/download the latest stable release, Crawlzilla 1.1.1]

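The download can also be checked from the command line before unpacking. The following is a minimal sketch, assuming `wget` and `tar` are installed; the helper name `archive_ok` is hypothetical and is not part of Crawlzilla. It only tests that the file is a readable gzip-compressed tar archive, which catches a truncated download or an HTML error page saved under the tarball's name.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with Crawlzilla): succeed only if the
# given file can be listed as a gzip-compressed tar archive.
archive_ok() {
    tar tzf "$1" >/dev/null 2>&1
}

# Usage (needs network access; the URL is the mirror used in this lab):
#   wget -nv http://nchc.dl.sourceforge.net/project/crawlzilla/stable/Crawlzilla-1.0/Crawlzilla-1.1.1.tar.gz
#   archive_ok Crawlzilla-1.1.1.tar.gz && tar zxvf Crawlzilla-1.1.1.tar.gz
```

If `archive_ok` fails, download the file again rather than running `tar zxvf` on it.
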
== Install Crawlzilla ==

The transcript below shows a complete installation on Debian GNU/Linux 6.0.3 (squeeze).

{{{
hadoop@debian:~$ lsb_release -a
No LSB modules are available.
Distributor ID: Debian
Description: Debian GNU/Linux 6.0.3 (squeeze)
Release: 6.0.3
Codename: squeeze
hadoop@debian:~$ wget -nv http://nchc.dl.sourceforge.net/project/crawlzilla/stable/Crawlzilla-1.0/Crawlzilla-1.1.1.tar.gz
hadoop@debian:~$ tar zxvf Crawlzilla-1.1.1.tar.gz
hadoop@debian:~$ cd Crawlzilla_Install/
hadoop@debian:~/Crawlzilla_Install$ ./install

System does not has Crawlzilla.
Identify is root
Your system information is:
Debian , 6.0.3
Try to automatically install: sun-java6-jdk sun-java6-jre sun-java6-fonts dialog expect

debian will install some packages sun-java6-jdk sun-java6-jre sun-java6-fonts dialog expect

System has Sun Java 1.6 above version.
System has ssh.
System has ssh Server (sshd).
System has dialog.
Welcome to use Crawlzilla, this install program will create a new accunt and to assist you to setup the password of crawler.
Set password for crawler:
password:

keyin the password again:
password:

Master IP address is: 192.168.125.1
Master MAC address is: 08:00:27:04:6b:db
Please confirm the install infomation of above :1.Yes 2.No
1
Create crawler and change password.

unpack success!
Check and Set /etc/hosts finished.
Add Crawlzilla service to /etc/init.d
Crawlzilla will startup when booting
update-rc.d: using dependency based boot sequencing
Make the slave installation package
Formatting HDFS...
start up name node [Namenode] ...
starting namenode, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-namenode-debian.out
start up job node [JobTracker] ...
starting jobtracker, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-jobtracker-debian.out
starting datanode, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-datanode-debian.out
starting tasktracker, logging to /var/log/crawlzilla/hadoop-logs/hadoop-crawler-tasktracker-debian.out
Start up tomcat...
.....
Tomcat may not start, please use " crawlzilla " to start
Installed successfully!
You can visit the manage website :http://192.168.125.1:8080
Finish!!!
}}}
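
The installer prints the address of the management website near the end of its output, and that address depends on the master IP it detected. The following is a minimal sketch for pulling the address out of a saved copy of the output. The function name `manage_url` and the file name `install.log` are hypothetical; the sketch assumes the message keeps the wording shown in the transcript above.

```shell
#!/bin/sh
# Hypothetical helper: read installer output on stdin and print the
# management URL from the "You can visit the manage website" line.
manage_url() {
    sed -n 's/^.*You can visit the manage website *: *\(http[^[:space:]]*\).*$/\1/p'
}

# Usage, assuming the installer output was saved to install.log:
#   url=$(manage_url < install.log)
#   curl -sI "$url"    # print the HTTP response headers if Tomcat is up
```

If the request fails, note that the installer itself warns that Tomcat may not have started and suggests running `crawlzilla` to start it.
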