Changes between Version 20 and Version 21 of jazz/13-06-02
- Timestamp: Jun 2, 2013, 10:45:54 AM
jazz/13-06-02
v20  v21
23   23   - Census (? Index Size: 300GB)
24   24   - Sandbox VM - Windows (?) - pcap (network packet) / screenshot - 8GB/day, 3000 malware - stored in HDFS
25        - Similarity Search
     25   - Goal: '''Similarity Search'''
26   26   - Convert the logs into a Lucene index (?) via an MR job or Pig, then import it into Solr (Index Size: 6GB)
27   27   - Drawback: incremental index updates are not possible (this also depends on whether the incrementally updated data can be separated out (incremental data update (?)))