Context Navigation

Changes between Version 5 and Version 6 of jazz/12-06-02

Timestamp:: Jun 3, 2012, 12:37:09 PM (13 years ago)
Author:: jazz
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

jazz/12-06-02

-                      v5
+                      v6
 }}}
+ * 那有沒有方法可以增加 Kernel Page Size 呢？目前核心有 hugetlbpage 的支援。
+ * [參考] [http://www.mjmwired.net/kernel/Documentation/vm/hugetlbpage.txt Linux 核心文件 : hugetlbpage] - 讓 i386 架構支援 4K 與 4M (2M in PAE mode) page sizes 的模組。
  * 結論(3):'''除了加大 Block Size 之外，最常用的方式是啟動 read ahead （運用快取來提昇硬碟循序讀取的效率）'''
   * [參考四] [http://forums.opensuse.org/forums/english/get-technical-help-here/install-boot-login/437078-changing-pagesize-kernel.html#post2152312 Changing PAGESIZE in kernel]
 …
     XFS - SGI's second generation file system.[2]
 }}}
+== Hadoop & OCR ==
+ * 在 Hadoop 課程中會介紹到一個應用案例「紐約時報用一天的時間在 100 台 EC2 上透過 Hadoop 進行 TIFF 檔轉成 PDF 的 OCR 工作」。然而並沒有實際的程式碼或者範例可以看。今天看到一套 Open Source 可以在 Linux 上運作的 OCR 軟體，名為「[http://code.google.com/p/tesseract-ocr/ tesseract-ocr]」。想必在英文語系中，透過這一套軟體可以很輕易辦到英文的大量圖檔辨識。
+ * 專案首頁：http://code.google.com/p/tesseract-ocr/
+ * An OCR Engine that was developed at HP Labs between 1985 and 1995... and now at Google.
+ * Debian 套件：http://packages.debian.org/tesseract
+== ttysnoop ==
+ * [http://daydreamer.idv.tw/rewrite.php/read-64.html ttysnoop-沒有螺絲起子的日子]
+ * [http://www.linuxhelp.net/guides/ttysnoop/ DJG's ttysnoop Guide]
+ *