wiki:Problem

Version 13 (modified by rock, 16 years ago) (diff)

--

Problem

Kernel version: 2.6.23 (enable -DNFS4_CLUSTER)

  1. 遠端的機器在mmstartup -a 時,會一值卡在arbitrating的狀態,估計可能是gpfs連接模組(tracedev)的問題,也有可能是kernel過新,gpfs module無法配合。但是在local端時,則可以正常使用。
gpfs02:~# tsstatus
The mmfsd daemon is not ready to handle commands yet. Waiting for quorum.
gpfs02:~# mmgetstate
 Node number  Node name        GPFS state
------------------------------------------
       3      gpfs02           arbitrating


e1000 NIC module problem

  • Our NIC is e1000 GE in our 2.6.20 kernel. This NIC in our case will cause speed from 1000M down to 100M. You can downlaod new driver from intel web site or use our's compiler modules.
    sudo wget http://trac.nchc.org.tw/grid/attachment/wiki/Problem/e1000.ko
    cp e1000.ko /lib/module/`uname -r`/kernel/driver/net/e1000/
    sudo depmod -a
    


mmadddisk

  • 若要mmadddisk時,必須確認node在一開始以在mmcrcluster -N gpfs.nodes的檔案裡,若是之後透過指令增加的node,此node的硬碟無法擴增到原來的gpfs NSD。


Use ramdisk

gpfs-server:/home/ram_config# mmcrnsd -F gpfs.disks -v no mmcrnsd: Processing disk ram0 mmcrnsd: ram0 was not found in /proc/partitions. mmcrnsd: Failed while processing disk descriptor /dev/ram0:gpfs00::dataAndMetadata:: on node gpfs00. mmcrnsd: Processing disk ram1 mmcrnsd: ram1 was not found in /proc/partitions. mmcrnsd: Failed while processing disk descriptor /dev/ram1:gpfs00::dataAndMetadata:: on node gpfs00.



Attachments (2)

Download all attachments as: .zip