Changes between Initial Version and Version 1 of jazz/CloudBurst


Ignore:
Timestamp:
Mar 18, 2011, 11:31:04 AM (13 years ago)
Author:
jazz
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • jazz/CloudBurst

    v1 v1  
     1= CloudBurst =
     2
     3 * [http://cloudburst-bio.sourceforge.net CloudBurst]
     4
     5== Installation and Test Procedure 安裝與測試步驟 ==
     6
     7 * 使用 [http://sourceforge.net/apps/mediawiki/cloudburst-bio/index.php?title=Sample_Results Sample Results] 的資料集
     8{{{
     9jazz@hadoop:~$ wget "http://downloads.sourceforge.net/project/cloudburst-bio/cloudburst/CloudBurst-1.0.1/CloudBurst-1.0.1.tgz?use_mirror=nchc"
     10jazz@hadoop:~$ tar zxvf CloudBurst-1.0.1.tgz
     11jazz@hadoop:~$ cd CloudBurst-1.0.1
     12jazz@hadoop:~/CloudBurst-1.0.1$ wget "http://downloads.sourceforge.net/project/cloudburst-bio/cloudburst-data/CloudBurst-sample-data/CloudBurst-small-sample.tgz?use_mirror=nchc"
     13jazz@hadoop:~/CloudBurst-1.0.1$ tar zxvf CloudBurst-small-sample.tgz
     14jazz@hadoop:~/CloudBurst-1.0.1$ hadoop fs -mkdir cloudburst
     15jazz@hadoop:~/CloudBurst-1.0.1$ hadoop fs -put CloudBurst-small-sample/100k.br cloudburst/
     16jazz@hadoop:~/CloudBurst-1.0.1$ hadoop fs -put CloudBurst-small-sample/s_suis.br cloudburst/
     17jazz@hadoop:~/CloudBurst-1.0.1$ hadoop fs -lsr
     18drwxr-xr-x   - jazz supergroup          0 2010-04-30 10:55 /user/jazz/cloudburst
     19-rw-r--r--   2 jazz supergroup    4493593 2010-04-30 10:55 /user/jazz/cloudburst/100k.br
     20-rw-r--r--   2 jazz supergroup     579773 2010-04-30 10:55 /user/jazz/cloudburst/s_suis.br
     21jazz@hadoop:~/CloudBurst-1.0.1$ hadoop jar CloudBurst.jar cloudburst/s_suis.br cloudburst/100k.br results 36 3 0 1 240 48 24 24 128 16 >& cloudburst.err
     22jazz@hadoop:~/CloudBurst-1.0.1$ tail -n 1 cloudburst.err
     23Total Running time:  102.68
     24jazz@hadoop:~/CloudBurst-1.0.1$ hadoop fs -get results .
     25jazz@hadoop:~/CloudBurst-1.0.1$ java -jar PrintAlignments.jar results | sort -nk4 > 100k.3.txt
     26Printing results
     27}}}
     28 * 在 hadoop.nchc.org.tw 20 台環境下,執行時間約 1 分 13 秒
     29 * 根據官方網站的說明,最後輸出的格式(100k.3.txt)是給 [http://genome.ucsc.edu UCSC Genome Browser] 用的格式。
     30
     31== Reference ==
     32
     33 * [http://developer.amazonwebservices.com/connect/entry.jspa?externalID=2272&categoryID=263 Amazon Elastic MapReduce > Sample Data Processing Applications > CloudBurst]