Changes between Version 6 and Version 7 of jazz/09-08-27


Ignore:
Timestamp:
Aug 28, 2009, 12:34:25 PM (15 years ago)
Author:
jazz
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • jazz/09-08-27

    v6 v7  
    8383  * [統計] 目前平均 file size 大概都落在 1Kbytes 以下。
    8484  * [[Image(wiki:jazz/09-08-27:hadoop_file_size_block_size.png,width=400)]]
     85  * [發現] 根據 Hadoop 論壇的討論,發現其實 block size 是由使用者自己(Hadoop Client)決定的。
     86{{{
     87jazz@hadoop:~$ hadoop fs -D dfs.block.size=4194304 -cp /user/jazz/input/hadoop-default.xml /user/jazz/input/hadoop-default.xml.4M
     88jazz@hadoop:~$ hadoop fs -stat "filesize=%b block_size=%o filename=%n replication=%r" /user/jazz/input/hadoop-default.xml
     89filesize=40673 block_size=67108864 filename=hadoop-default.xml replication=1
     90jazz@hadoop:~$ hadoop fs -stat "filesize=%b block_size=%o filename=%n replication=%r" /user/jazz/input/hadoop-default.xml.4M
     91filesize=40673 block_size=4194304 filename=hadoop-default.xml.4M replication=2
     92jazz@hadoop:~$ hadoop fs -stat "filesize=%b block_size=%o filename=%n replication=%r" /user/jazz/input/hadoop-default.xml.new
     93filesize=40673 block_size=4096 filename=hadoop-default.xml.new replication=1
     94}}}