Version 7 (modified by rider, 16 years ago) (diff) |
---|
Xen GPU cluster
Hardware
Node | 9 nodes |
CPU | Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz |
Memory | 6GB |
Storage | 160GB |
Video Card | NVIDIA GeForce? 9800GT 1GB |
Software
OS | Ubuntu 8.04.2 with Kernel: 2.6.24-23-server x86_64 |
Part 1
1.1 - Basic Environment
# NVIDIA CUDA driver #
rock@cloud:~/nvidia/cuda$ wget http://developer.download.nvidia.com/compute/cuda/2_1/drivers/NVIDIA-Linux-x86_64-180.22-pkg2.run
# NVIDIA CUDA toolkit #
rock@cloud:~/nvidia/cuda$ wget http://developer.download.nvidia.com/compute/cuda/2_1/toolkit/cudatoolkit_2.1_linux64_ubuntu8.04.run
# NVIDIA CUDA SDK #
rock@cloud:~/nvidia/cuda$ wget http://developer.download.nvidia.com/compute/cuda/2_1/SDK/cuda-sdk-linux-2.10.1215.2015-3233425.run
rock@cloud:~$ sudo apt-get install autoconf automake build-essential gcc make libtool initramfs-tool libxi6 libxi-dev libxmu6 libxmu-dev linux-kernel-devel linux-headers-2.6.24-23-server
rock@cloud:~$ sudo ln -sf /usr/src/linux-headers-2.6.24-23 /usr/src/linux
rock@cloud:~/nvidia/cuda$ sudo sh NVIDIA-Linux-x86_64-180.22-pkg2.run
rock@cloud:~$ sudo mkdir /opt/cuda
rock@cloud:~/nvidia/cuda$ sudo sh cudatoolkit_2.1_linux64_ubuntu8.04.run
Enter install path (default /usr/local/cuda, '/cuda' will be appended): /opt
# Note:
* Please make sure your PATH includes /opt/cuda/bin * Please make sure your LD_LIBRARY_PATH includes /opt/cuda/lib * or add /opt/cuda/lib to /etc/ld.so.conf and run ldconfig as root * Please read the release notes in /opt/cuda/doc/ * To uninstall CUDA, delete /opt/cuda * Installation Complete
rock@cloud:~$ sudo mkdir /opt/NVIDIA_CUDA_SDK
rock@cloud:~/nvidia/cuda$ sudo sh cuda-sdk-linux-2.10.1215.2015-3233425.run
# Note:
{{{ Enter install path (default /usr/local/cuda, '/cuda' will be appended): /opt/cuda }}}
Configuring SDK Makefile (/opt/NVIDIA_CUDA_SDK/common/common.mk)... * Please make sure your PATH includes /opt/cuda/bin * Please make sure your LD_LIBRARY_PATH includes /opt/cuda/lib * To uninstall the NVIDIA CUDA SDK, please delete /opt/NVIDIA_CUDA_SDK
rock@cloud:~$ sudo vim /etc/profile
Add: export PATH=$PATH:/opt/cuda/bin
rock@cloud:~$ source /etc/profile
rock@cloud:~$ sudo vim /etc/ld.so.conf
Add: /opt/cuda/lib
rock@cloud:~$ sudo ldconfig
/opt/cuda/lib: libcublasemu.so.2 -> libcublasemu.so.2.1 libcufftemu.so.2 -> libcufftemu.so.2.1 libcublas.so.2 -> libcublas.so.2.1 libcudart.so.2 -> libcudart.so.2.1 libcufft.so.2 -> libcufft.so.2.1
1.2 NVIDIA Driver Solution
# Rock said that the unknown identification of the VGA device might be the "pciids" problem.
Sol1:
rock@cloud:~$ sudo update-pciids <older version>
Sol2:
rock@cloud:~$ wget http://pciids.sourceforge.net/v2.2/pci.ids <latest version>
rock@cloud:~$ sudo cp pci.ids /usr/share/misc/
rock@cloud:~$ sudo lspci -v -v (the Unknown device 82a0?)
01:00.0 VGA compatible controller: nVidia Corporation GeForce 9800 GT (rev a2) (prog-if 00 [VGA controller]) Subsystem: ASUSTeK Computer Inc. Unknown device 82a0 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 0, Cache Line Size: 64 bytes Interrupt: pin A routed to IRQ 16 Region 0: Memory at fd000000 (32-bit, non-prefetchable) [size=16M] Region 1: Memory at d0000000 (64-bit, prefetchable) [size=256M] Region 3: Memory at fa000000 (64-bit, non-prefetchable) [size=32M] Region 5: I/O ports at dc80 [size=128] Expansion ROM at fea00000 [disabled] [size=128K] Capabilities: [60] Power Management version 3 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [68] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 Enable- Address: 0000000000000000 Data: 0000 Capabilities: [78] Express Endpoint IRQ 0 Device: Supported: MaxPayload 128 bytes, PhantFunc 0, ExtTag+ Device: Latency L0s <512ns, L1 <4us Device: AtnBtn- AtnInd- PwrInd- Device: Errors: Correctable- Non-Fatal+ Fatal+ Unsupported- Device: RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+ Device: MaxPayload 128 bytes, MaxReadReq 512 bytes Link: Supported Speed 2.5Gb/s, Width x16, ASPM L0s L1, Port 0 Link: Latency L0s <512ns, L1 <1us Link: ASPM Disabled RCB 64 bytes CommClk+ ExtSynch- Link: Speed 2.5Gb/s, Width x16
rock@cloud:~$ less /var/log/Xorg.0.log | grep nVidia (unknown chipset (0x0605) rev 162)
(--) PCI:*(1:0:0) nVidia Corporation unknown chipset (0x0605) rev 162, Mem @ 0xfd000000/24, 0xd0000000/28, 0xfa000000/25, I/O @ 0xdc80/7, BIOS @ 0xfea00000/17 DeviceID(0x065) is unknown?
rock@cloud:~$ less /usr/share/misc/pci.ids | grep 9800
0601 GeForce 9800 GT 512 0604 GeForce 9800 GX2 0605 GeForce 9800 GT 0612 GeForce 9800 GTX 0613 GeForce 9800 GTX+ 0614 GeForce 9800 GT 0617 GeForce 9800M GTX 10de GeForce 9800M GTX
rock@cloud:~$ sudo Xorg -scanpci
Probing for PCI devices (Bus:Device:Function) (0:0:0) unknown card (0x1028/0x0211) using a Intel Corporation DRAM Controller (0:1:0) Intel Corporation PCI Express Root Port (0:3:0) unknown card (0x1028/0x0211) using a Intel Corporation MEI Controller (0:3:2) unknown card (0x1028/0x0211) using a Intel Corporation PT IDER Controller (0:3:3) unknown card (0x1028/0x0211) using a Intel Corporation Serial KT Controller (0:25:0) unknown card (0x1028/0x0211) using a Intel Corporation 82566DM-2 Gigabit Network Connection (0:26:0) unknown card (0x1028/0x0211) using a Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #4 (0:26:1) unknown card (0x1028/0x0211) using a Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #5 (0:26:7) unknown card (0x1028/0x0211) using a Intel Corporation 82801I (ICH9 Family) USB2 EHCI Controller #2 (0:27:0) unknown card (0x1028/0x0211) using a Intel Corporation 82801I (ICH9 Family) HD Audio Controller (0:28:0) Intel Corporation 82801I (ICH9 Family) PCI Express Port 1 (0:29:0) unknown card (0x1028/0x0211) using a Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #1 (0:29:1) unknown card (0x1028/0x0211) using a Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #2 (0:29:2) unknown card (0x1028/0x0211) using a Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #3 (0:29:7) unknown card (0x1028/0x0211) using a Intel Corporation 82801I (ICH9 Family) USB2 EHCI Controller #1 (0:30:0) Intel Corporation 82801 PCI Bridge (0:31:0) Intel Corporation LPC Interface Controller (0:31:2) unknown card (0x1028/0x0211) using a Intel Corporation 82801IR/IO/IH (ICH9R/DO/DH) 4 port SATA IDE Controller (0:31:3) unknown card (0x1028/0x0211) using a Intel Corporation 82801I (ICH9 Family) SMBus Controller (0:31:5) unknown card (0x1028/0x0211) using a Intel Corporation 82801I (ICH9 Family) 2 port SATA IDE Controller ---> (1:0:0) unknown card (0x1043/0x82a0) using an unknown chip (DeviceId 0x0605) from nVidia Corporation
Attachments (5)
-
2.6.22.9.config
(40.2 KB) -
added by rider 16 years ago.
2.6.22.9_Xen_VMGL
-
tightvnc-1.2.9-amd64support.patch
(1.5 KB) -
added by rider 16 years ago.
tightvnc-1.2.9_amd64_patch
-
config-2.6.28
(90.4 KB) -
added by rider 16 years ago.
2.6.28_Vanilla_DR
-
xorg.conf
(1.7 KB) -
added by rider 16 years ago.
XEN_NVIDIA_Dom0
-
Architecture.jpg
(76.4 KB) -
added by rider 15 years ago.
Xen_GPU_Arch
Download all attachments as: .zip