Version 2 (modified by rock, 15 years ago)

--

Parallel File Systems: File Systems for HPC Clusters

1. Key points

  1. The author divides cluster file systems into two categories:
    • DFS (Distributed File Systems)
      • use a single server and are not necessarily parallel, although they can give the user "parallel access" to a file system.
    • PFS (Parallel File Systems)
      • are "parallel" in that they utilize multiple data servers.
  1. DFS are network based (i.e., the actual storage hardware is not necessarily on the nodes) but are not necessarily parallel (i.e., there may not be multiple servers delivering the file system).
    • The author uses NFS as the example:
      • NFS:
        • The primary file system for clusters, and pretty much "plug and play" on most *nix systems.
        • It was the first popular file system that allowed distributed systems to share data.
      • NFSv3:
        • The most popular version of NFS, released around 1995.
        • It added several features, including support for 64-bit file sizes and offsets (so it can handle files larger than 4GB), asynchronous writes, and TCP as a transport layer.
      • NFSv4:
        • Released around 2003 with some improvements.
        • In particular, it added speed improvements and strong security (with compatibility for multiple security protocols), and it made NFS a stateful protocol.
      • NFS performance:
        • The good news for NFS and NAS is that many codes don’t require lots of I/O for good performance. These codes will run very well using NFS as the storage protocol even for large runs (100+ nodes or several hundred cores). NFS provides adequate performance until the input and output files for these codes become extremely large, or if the code is run across a very large number of processors (in the thousands).
        • NFS still lacks the performance and scalability required by many large clusters, but that is about to change.
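
The link between 64-bit offsets and the 4GB limit noted under NFSv3 comes down to simple arithmetic. The following is a minimal sketch, assuming the older limit corresponds to an unsigned 32-bit byte offset; the helper names `max_offset` and `fits` are hypothetical and are not part of any NFS API.

```python
# Illustrative arithmetic only: how many bytes an N-bit unsigned
# offset field can address. Assumes an unsigned offset; max_offset
# and fits are hypothetical helper names, not NFS calls.

GIB = 1 << 30  # bytes in one GiB


def max_offset(bits: int) -> int:
    """Largest byte offset representable in an unsigned field of `bits` bits."""
    return (1 << bits) - 1


def fits(file_size: int, bits: int) -> bool:
    """True if the last byte of a file of `file_size` bytes is addressable."""
    if file_size <= 0:
        return True
    return file_size - 1 <= max_offset(bits)


limit_32 = max_offset(32)
limit_64 = max_offset(64)

print((limit_32 + 1) // GIB)  # 4  (a 32-bit offset addresses 4 GiB)
print(fits(5 * GIB, 32))      # False
print(fits(5 * GIB, 64))      # True
```

With 32 bits, a 5 GiB file cannot be fully addressed, while 64 bits leaves ample headroom, which is consistent with the note that NFSv3 can handle files larger than 4GB.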

  • pNFS


Thoughts

  • This article is essentially a survey; it can be used for the literature review when writing a DFS paper.
