◢ <[wiki:III140705/Lab20 實作二十]> | <[wiki:III140705 回課程大綱]> ▲ | <[wiki:III140705/Lab22 實作二十二] > ◣
= 實作二十一 Lab21 =
{{{
#!html
修改檔案輸入格式
KeyValueTextInputFormat
}}}
[[PageOutline]]
{{{
#!text
請先連線至 nodeN.3du.me , N 為您的報名編號
}}}
{{{
cd ~/hadoop_labs/lab012
ant
mkdir -p kv_input
printf "A\t1\n" > kv_input/input1
printf "B\t2\n" >> kv_input/input1
printf "C\t3\n" >> kv_input/input1
printf "A\t1\n" > kv_input/input2
printf "C\t2\n" >> kv_input/input2
printf "B\t1\n" >> kv_input/input2
hadoop fs -put kv_input kv_input
hadoop jar WordCount.jar kv_input kv_output
hadoop fs -ls kv_output
hadoop fs -cat kv_output/part-*
export HADOOP_CONF_DIR=~/hadoop/conf.local/
hadoop jar WordCount.jar kv_input kv_output
ls -al kv_output
cat kv_output/part-*
unset HADOOP_CONF_DIR
}}}
* Reference:
* http://hadoop.apache.org/docs/r1.0.4/api/org/apache/hadoop/mapreduce/lib/input/FileInputFormat.html
* http://hadoop.apache.org/docs/r1.0.4/api/org/apache/hadoop/mapreduce/lib/input/TextInputFormat.html
{{{
#!text
public class TextInputFormat
extends FileInputFormat
}}}
* http://hadoop.apache.org/docs/r1.0.4/api/org/apache/hadoop/mapreduce/lib/input/KeyValueTextInputFormat.html
{{{
#!text
public class KeyValueTextInputFormat
extends FileInputFormat
}}}
* http://hadoop.apache.org/docs/r1.0.4/api/org/apache/hadoop/mapreduce/lib/input/NLineInputFormat.html
{{{
#!text
public class NLineInputFormat
extends FileInputFormat
NLineInputFormat which splits N lines of input as one split.
}}}
== 實作習題 ==
<問題 1>