Changes between Version 29 and Version 30 of waue/2009/nutch_install


Ignore:
Timestamp:
Apr 28, 2009, 1:17:31 PM (15 years ago)
Author:
waue
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • waue/2009/nutch_install

    v29 v30  
    1 
    21{{{
    32#!html
     
    1413   * 解決中文亂碼問題
    1514   * 搜尋引擎不只是找網頁內的資料,也能爬到網頁內的檔案(如pdf,msword)
    16    * 運行在多台node
     15   * 也可運行在多台node
    1716
    1817 = 環境 =
     
    6160  <property>
    6261    <name>fs.default.name</name>
    63     <value>hdfs://node01:9000/</value>
     62    <value>hdfs://localhost:9000/</value>
    6463    <description> </description>
    6564  </property>
    6665  <property>
    6766    <name>mapred.job.tracker</name>
    68     <value>node01:9001</value>
     67    <value>localhost:9001</value>
    6968    <description>  </description>
    7069  </property>
     
    8685
    8786= step 2 nutch下載與安裝 =
    88 
     87 == 2.0 設定環境變數 ==
     88{{{
     89$ sudo su -
     90# echo "export JAVA_HOME=/usr/lib/jvm/java-6-sun" >> /etc/bash.bashrc
     91# exit
     92# exit
     93}}}
    8994 == 2.1 下載 nutch 並解壓縮 ==
    9095 *  nutch 1.0 (2009/03/28 release )
     
    153158<property>
    154159  <name>http.agent.url</name>
    155   <value>node01</value>
     160  <value>localhost</value>
    156161  <description>A URL to advertise in the User-Agent header. </description>
    157162</property>
     
    227232}}}
    228233
    229 == 3.4 完全複製到node2 ==
    230 
     234== 3.4 環境若要設定成叢集才要做 ==
     235 * 若是單機版則不用處理此節
     236 * 完全複製到node2
    231237{{{
    232238$ ssh node02 "sudo chown hadooper:hadooper /opt"