Changes between Version 5 and Version 6 of jazz/09-04-06


Ignore:
Timestamp:
Apr 6, 2009, 1:40:35 PM (16 years ago)
Author:
jazz
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • jazz/09-04-06

    v5 v6  
    11= 2009-04-06 =
     2
     3== Hadoop / MapReduce ==
     4
     5 * 關於 MapReduce:
     6   * [http://developer.amazonwebservices.com/connect/entry.jspa?externalID=2297&categoryID=265 Introduction to Amazon Elastic MapReduce]
     7   * [http://developer.amazonwebservices.com/connect/entry.jspa?externalID=2294&categoryID=265 Finding Similar Items with Amazon Elastic MapReduce, Python, and Hadoop Streaming] - 用 Hadoop 做相似度分析,可應用在生物資訊領域。
     8   * [http://hadoop.apache.org/core/docs/current/cn/mapred_tutorial.html Hadoop Map/Reduce 教程]
     9 * [http://wiki.apache.org/hadoop/AmazonEC2 Hadoop Wiki 上關於 Amazon EC2 的使用說明]
     10
     11 * [限制] Hadoop 0.18.3 不支援 Stream 下的數值排序 - [https://issues.apache.org/jira/browse/HADOOP-2302 Streaming should provide an option for numerical sort of keys]
    212
    313== Cloud Computing and Science ==
    414
    5  * 去年在 [wiki:jazz/09-01-10 eScience 2008] 看的一些演講錄影,現在論文也已經可以在 [http://ieeexplore.ieee.org/xpl/tocresult.jsp?isnumber=4736722&isYear=2008&count=180&page=1&ResultStart=25 IEEE Xplore] 上找到了。
     15 * 去年在 [wiki:jazz/09-01-10 eScience 2008] 看的一些演講錄影,現在論文也已經可以在 [http://ieeexplore.ieee.org/xpl/tocresult.jsp?isnumber=4736722 IEEE Xplore] 上找到了。
    616 * 這兩篇是講述將 !MapReduce 運用在生物資訊領域的實例。
    717   * [http://www.cs.umd.edu/Grad/scholarlypapers/papers/MichaelSchatz.pdf BlastReduce: High Performance Short Read Mapping with MapReduce]
    818   * [http://www.ece.rutgers.edu/~parashar/Classes/08-09/ece572/readings/cloudblast-escience-08.pdf CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications]
    9  *
     19 * 跟虛擬化、生物影像、生物資訊、Mircoarray有關:
    1020   * [http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4736753&isnumber=4736722 BioVLAB-Microarray: Microarray Data Analysis in Virtual Environment]
    1121 * 有幾篇則跟 MapReduce 有關,如: [http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4736768&isnumber=4736722 MapReduce for Data Intensive Scientific Analyses]
     
    1626   * [http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4736775&isnumber=4736722 Biocep, Towards a Federative, Collaborative, User-Centric, Grid-Enabled and Cloud-Ready Computational Open Platform]
    1727 * 此外,在 !SourceForge 上也有一些應用專案:
    18    * [http://apps.sourceforge.net/mediawiki/cloudburst-bio/ CloudBurst] - [http://developer.amazonwebservices.com/connect/entry.jspa?externalID=2272&categoryID=263 Amazon 上的介紹]
    19    *
    20  * 關於 MapReduce:
    21 
     28   * [http://apps.sourceforge.net/mediawiki/cloudburst-bio/ CloudBurst] - [http://developer.amazonwebservices.com/connect/entry.jspa?externalID=2272&categoryID=263 Amazon 上的 CloudBurst 專案介紹]
    2229 * 當然我最關切的是"[http://blog.wired.com/wiredscience/2008/12/massive-amounts.html Amazon 提供公共資料庫]"的舉動對整個學術生態所造成的影響。此舉跟國網中心提供科學資料庫的定位十分相似,雖然 Amazon 在台灣區推廣有本土化方面的阻撓,但是如果真的有心朝國際化發展的話,台灣的學生應該要多學著使用這些服務,去做更大型的運算才對。[http://news.ycombinator.com/item?id=543069 這邊的討論]提供了許多參考連結。