CUDA6.5/Ubuntu14.04
[
Front page
] [
New
|
List of pages
|
Search
|
Recent changes
]
Start:
[[labs.beatcraft.com]]~
[[CUDA]]~
#Contents
*Overview [#be483fbd]
>
This article explains how to install Ubuntu 14.04 to a PC...
Then, CUDA6.5 is installed to the Ubuntu PC.~
*Hardware Specifications [#e9a8a6d0]
>
This is the list of hardware specifications.~
- CPU: core i7 3770 (3.4GHz, 4-core/8-thread)~
- Memory: 32B (DDR3-12800 8GB x4)
- HDD: 1TB (SATA, 7200rpm)
- GPU: ETSK20-5GER (NVIDIA Tesla K20c, for CUDA)
- GPU: GF-GT730-LE1GHD/D5 (NVIDIA Geforce GT730, for Video)
>
In the article of [[CUDA5/CentOS6.4]], ''Primary Display'...
In this article, since this is not used as a headless Ubu...
*Installing Ubuntu 14.04 [#c57313e2]
> Applying the configuration below, Ubuntu 14.04 LTS Desk...
- Language: US
- Keyboard: English (or whatever you use)
- Partitions of HDD: Whole Region (Default setting)
- Network: DHCP
>
Select ''nomodeset'' at the boot option of Ubuntu install...
Do not use ''Nouveau'' an open source NVIDIA GPU driver f...
* Configuring Ubuntu 14.04 at its Post-Installation [#ae1...
>
After the installation of Ubuntu is completed, the config...
** Update Ubuntu 14.04 [#i4612129]
>
Installing the newest packages and applications, make Ubu...
$ sudo apt-get update
$ sudo apt dist-upgrade
After the update is completed, make sure that the system ...
** Making Nouveau Ineffective [#xdb14ad4]
>
To install and use only NVIDIA's GPU Driver, this is for ...
blacklist nouveau
options nouveau modeset = 0
>
Regenerate ''kernel initramfs'' for making the new config...
$ sudo update-initramfs -u
>
To reboot the system, make sure that Nouveau Driver is NO...
Also, please check ''nouveau'' is not included in the vid...
** Installing Packages [#m8a6056d]
>
As the installation of Ubuntu 14.04 is completed, the ins...
$ sudo apt-get install vim lv ssh naoutilus-open-termina...
* Installing CUDA 6.5 [#f84ce7b3]
>
Since CUDA 6.5, the installation of CUDA on Ubuntu become...
~
- Installing Package Manager
To install the package manager, please follow the directi...
~
- Downloading the packages from CUDA Download page
Please download the deb package (cuda-repo-ubuntu1404_6.5...
[[https://developer.nvidia.com/cuda-downloads]]~
Then, install the package as applying the command line be...
$ sudo dpkg -i cuda-repo-ubuntu1404_6.5-14_amd64.deb
Since This package only puts NVIDIA's repository on apt s...
$ sudo apt-get update
$ sudo apt-get install cuda
This is the end of installing CUDA 6.5.~
Restarting the system, GUI of Ubuntu desktop became finer...
* Configuring CUDA 6.5 at its Post-Installation [#v2155fbf]
** Configuring environment [#p2259550]
>
As CUDA 6.5 is installed under the directory of /usr/loca...
~
Add the command lines shown below to the end of .bashrc f...
$ export PATH=/usr/local/cuda-6.5/bin:$PATH
$ export LD_LIBRARY_PATH=/usr/local/cuda-6.5/lib64:$LD_L...
The change in the environment variable becomes effective ...
** Copying CUDA Samples [#udf970bf]
>
The samples of CUDA are placed under the directory of /us...
$ cuda-install-samples-6.5.sh ~
Applying the command line above, the samples of CUDA are ...
** Build and Execute Samples [#xeef0cbc]
>
Applying the command lines below, move to the directory w...
$ cd ~/NVIDIA_CUDA-6.5_Samples
$ make
>
Samples are built under the directory of NVIDIA_CUDA-6.5_...
$ cd bin/x86_64/linux/release
beat@tesla:~/NVIDIA_CUDA-6.5_Samples/bin/x86_64/linux/re...
alignedTypes cudaDecodeGL matrixMu...
asyncAPI cudaOpenMP matrixMu...
bandwidthTest cuHook matrixMu...
batchCUBLAS dct8x8 matrixMu...
bicubicTexture deviceQuery matrixMu...
bilateralFilter deviceQueryDrv MC_Estim...
bindlessTexture dwtHaar1D MC_Estim...
binomialOptions dxtc MC_Estim...
BlackScholes eigenvalues MC_Estim...
boxFilter fastWalshTransform MC_Singl...
boxFilterNPP FDTD3d mergeSor...
cdpAdvancedQuicksort fluidsGL Mersenne...
cdpBezierTessellation freeImageInteropNPP MonteCar...
cdpLUDecomposition FunctionPointers nbody ...
cdpQuadtree grabcutNPP newdelet...
cdpSimplePrint histEqualizationNPP NV12ToAR...
cdpSimpleQuicksort histogram oceanFFT...
clock HSOpticalFlow p2pBandw...
concurrentKernels imageDenoising particle...
conjugateGradient imageSegmentationNPP postProc...
conjugateGradientPrecond inlinePTX ptxjit ...
conjugateGradientUM interval quasiran...
convolutionFFT2D jpegNPP radixSor...
convolutionSeparable libcuhook.so.1 randomFo...
convolutionTexture lineOfSight recursiv...
cppIntegration Mandelbrot reductio...
cppOverload marchingCubes scalarPr...
>
- Running the Binaries
To follow the directions listed [[here>http://docs.nvidia...
>
beat@tesla:~/NVIDIA_CUDA-6.5_Samples/bin/x86_64/linux/re...
./deviceQuery Starting...
CUDA Device Query (Runtime API) version (CUDART static ...
Detected 2 CUDA Capable device(s)
Device 0: "Tesla K20c"
CUDA Driver Version / Runtime Version 6.5 / 6.5
CUDA Capability Major/Minor version number: 3.5
Total amount of global memory: 4800 MB...
(13) Multiprocessors, (192) CUDA Cores/MP: 2496 CU...
GPU Clock rate: 706 MHz...
Memory Clock rate: 2600 Mhz
Memory Bus Width: 320-bit
L2 Cache Size: 1310720...
Maximum Texture Dimension Size (x,y,z) 1D=(655...
Maximum Layered 1D Texture Size, (num) layers 1D=(163...
Maximum Layered 2D Texture Size, (num) layers 2D=(163...
Total amount of constant memory: 65536 b...
Total amount of shared memory per block: 49152 b...
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1...
Max dimension size of a grid size (x,y,z): (2147483...
Maximum memory pitch: 2147483...
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes wit...
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Enabled
Device supports Unified Addressing (UVA): Yes
Device PCI Bus ID / PCI location ID: 1 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetD...
Device 1: "GeForce GT 730"
CUDA Driver Version / Runtime Version 6.5 / 6.5
CUDA Capability Major/Minor version number: 3.5
Total amount of global memory: 1023 MB...
( 2) Multiprocessors, (192) CUDA Cores/MP: 384 CUD...
GPU Clock rate: 954 MHz...
Memory Clock rate: 2505 Mhz
Memory Bus Width: 64-bit
L2 Cache Size: 524288 ...
Maximum Texture Dimension Size (x,y,z) 1D=(655...
Maximum Layered 1D Texture Size, (num) layers 1D=(163...
Maximum Layered 2D Texture Size, (num) layers 2D=(163...
Total amount of constant memory: 65536 b...
Total amount of shared memory per block: 49152 b...
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1...
Max dimension size of a grid size (x,y,z): (2147483...
Maximum memory pitch: 2147483...
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes wit...
Run time limit on kernels: Yes
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Disabled
Device supports Unified Addressing (UVA): Yes
Device PCI Bus ID / PCI location ID: 2 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSet...
> Peer access from Tesla K20c (GPU0) -> GeForce GT 730 (...
> Peer access from GeForce GT 730 (GPU1) -> Tesla K20c (...
deviceQuery, CUDA Driver = CUDART, CUDA Driver Version =...
Result = PASS
>
To execute ''bandwidthTest'', obtains the results shown b...
>
beat@tesla:~/NVIDIA_CUDA-6.5_Samples/bin/x86_64/linux/re...
[CUDA Bandwidth Test] - Starting...
Running on...
Device 0: Tesla K20c
Quick Mode
Host to Device Bandwidth, 1 Device(s)
PINNED Memory Transfers
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 6577.3
Device to Host Bandwidth, 1 Device(s)
PINNED Memory Transfers
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 6545.8
Device to Device Bandwidth, 1 Device(s)
PINNED Memory Transfers
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 147234.3
Result = PASS
* Revision History [#e2d6f93d]
>
- 2015/02/16 This article is initially uploaded
End:
[[labs.beatcraft.com]]~
[[CUDA]]~
#Contents
*Overview [#be483fbd]
>
This article explains how to install Ubuntu 14.04 to a PC...
Then, CUDA6.5 is installed to the Ubuntu PC.~
*Hardware Specifications [#e9a8a6d0]
>
This is the list of hardware specifications.~
- CPU: core i7 3770 (3.4GHz, 4-core/8-thread)~
- Memory: 32B (DDR3-12800 8GB x4)
- HDD: 1TB (SATA, 7200rpm)
- GPU: ETSK20-5GER (NVIDIA Tesla K20c, for CUDA)
- GPU: GF-GT730-LE1GHD/D5 (NVIDIA Geforce GT730, for Video)
>
In the article of [[CUDA5/CentOS6.4]], ''Primary Display'...
In this article, since this is not used as a headless Ubu...
*Installing Ubuntu 14.04 [#c57313e2]
> Applying the configuration below, Ubuntu 14.04 LTS Desk...
- Language: US
- Keyboard: English (or whatever you use)
- Partitions of HDD: Whole Region (Default setting)
- Network: DHCP
>
Select ''nomodeset'' at the boot option of Ubuntu install...
Do not use ''Nouveau'' an open source NVIDIA GPU driver f...
* Configuring Ubuntu 14.04 at its Post-Installation [#ae1...
>
After the installation of Ubuntu is completed, the config...
** Update Ubuntu 14.04 [#i4612129]
>
Installing the newest packages and applications, make Ubu...
$ sudo apt-get update
$ sudo apt dist-upgrade
After the update is completed, make sure that the system ...
** Making Nouveau Ineffective [#xdb14ad4]
>
To install and use only NVIDIA's GPU Driver, this is for ...
blacklist nouveau
options nouveau modeset = 0
>
Regenerate ''kernel initramfs'' for making the new config...
$ sudo update-initramfs -u
>
To reboot the system, make sure that Nouveau Driver is NO...
Also, please check ''nouveau'' is not included in the vid...
** Installing Packages [#m8a6056d]
>
As the installation of Ubuntu 14.04 is completed, the ins...
$ sudo apt-get install vim lv ssh naoutilus-open-termina...
* Installing CUDA 6.5 [#f84ce7b3]
>
Since CUDA 6.5, the installation of CUDA on Ubuntu become...
~
- Installing Package Manager
To install the package manager, please follow the directi...
~
- Downloading the packages from CUDA Download page
Please download the deb package (cuda-repo-ubuntu1404_6.5...
[[https://developer.nvidia.com/cuda-downloads]]~
Then, install the package as applying the command line be...
$ sudo dpkg -i cuda-repo-ubuntu1404_6.5-14_amd64.deb
Since This package only puts NVIDIA's repository on apt s...
$ sudo apt-get update
$ sudo apt-get install cuda
This is the end of installing CUDA 6.5.~
Restarting the system, GUI of Ubuntu desktop became finer...
* Configuring CUDA 6.5 at its Post-Installation [#v2155fbf]
** Configuring environment [#p2259550]
>
As CUDA 6.5 is installed under the directory of /usr/loca...
~
Add the command lines shown below to the end of .bashrc f...
$ export PATH=/usr/local/cuda-6.5/bin:$PATH
$ export LD_LIBRARY_PATH=/usr/local/cuda-6.5/lib64:$LD_L...
The change in the environment variable becomes effective ...
** Copying CUDA Samples [#udf970bf]
>
The samples of CUDA are placed under the directory of /us...
$ cuda-install-samples-6.5.sh ~
Applying the command line above, the samples of CUDA are ...
** Build and Execute Samples [#xeef0cbc]
>
Applying the command lines below, move to the directory w...
$ cd ~/NVIDIA_CUDA-6.5_Samples
$ make
>
Samples are built under the directory of NVIDIA_CUDA-6.5_...
$ cd bin/x86_64/linux/release
beat@tesla:~/NVIDIA_CUDA-6.5_Samples/bin/x86_64/linux/re...
alignedTypes cudaDecodeGL matrixMu...
asyncAPI cudaOpenMP matrixMu...
bandwidthTest cuHook matrixMu...
batchCUBLAS dct8x8 matrixMu...
bicubicTexture deviceQuery matrixMu...
bilateralFilter deviceQueryDrv MC_Estim...
bindlessTexture dwtHaar1D MC_Estim...
binomialOptions dxtc MC_Estim...
BlackScholes eigenvalues MC_Estim...
boxFilter fastWalshTransform MC_Singl...
boxFilterNPP FDTD3d mergeSor...
cdpAdvancedQuicksort fluidsGL Mersenne...
cdpBezierTessellation freeImageInteropNPP MonteCar...
cdpLUDecomposition FunctionPointers nbody ...
cdpQuadtree grabcutNPP newdelet...
cdpSimplePrint histEqualizationNPP NV12ToAR...
cdpSimpleQuicksort histogram oceanFFT...
clock HSOpticalFlow p2pBandw...
concurrentKernels imageDenoising particle...
conjugateGradient imageSegmentationNPP postProc...
conjugateGradientPrecond inlinePTX ptxjit ...
conjugateGradientUM interval quasiran...
convolutionFFT2D jpegNPP radixSor...
convolutionSeparable libcuhook.so.1 randomFo...
convolutionTexture lineOfSight recursiv...
cppIntegration Mandelbrot reductio...
cppOverload marchingCubes scalarPr...
>
- Running the Binaries
To follow the directions listed [[here>http://docs.nvidia...
>
beat@tesla:~/NVIDIA_CUDA-6.5_Samples/bin/x86_64/linux/re...
./deviceQuery Starting...
CUDA Device Query (Runtime API) version (CUDART static ...
Detected 2 CUDA Capable device(s)
Device 0: "Tesla K20c"
CUDA Driver Version / Runtime Version 6.5 / 6.5
CUDA Capability Major/Minor version number: 3.5
Total amount of global memory: 4800 MB...
(13) Multiprocessors, (192) CUDA Cores/MP: 2496 CU...
GPU Clock rate: 706 MHz...
Memory Clock rate: 2600 Mhz
Memory Bus Width: 320-bit
L2 Cache Size: 1310720...
Maximum Texture Dimension Size (x,y,z) 1D=(655...
Maximum Layered 1D Texture Size, (num) layers 1D=(163...
Maximum Layered 2D Texture Size, (num) layers 2D=(163...
Total amount of constant memory: 65536 b...
Total amount of shared memory per block: 49152 b...
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1...
Max dimension size of a grid size (x,y,z): (2147483...
Maximum memory pitch: 2147483...
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes wit...
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Enabled
Device supports Unified Addressing (UVA): Yes
Device PCI Bus ID / PCI location ID: 1 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetD...
Device 1: "GeForce GT 730"
CUDA Driver Version / Runtime Version 6.5 / 6.5
CUDA Capability Major/Minor version number: 3.5
Total amount of global memory: 1023 MB...
( 2) Multiprocessors, (192) CUDA Cores/MP: 384 CUD...
GPU Clock rate: 954 MHz...
Memory Clock rate: 2505 Mhz
Memory Bus Width: 64-bit
L2 Cache Size: 524288 ...
Maximum Texture Dimension Size (x,y,z) 1D=(655...
Maximum Layered 1D Texture Size, (num) layers 1D=(163...
Maximum Layered 2D Texture Size, (num) layers 2D=(163...
Total amount of constant memory: 65536 b...
Total amount of shared memory per block: 49152 b...
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1...
Max dimension size of a grid size (x,y,z): (2147483...
Maximum memory pitch: 2147483...
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes wit...
Run time limit on kernels: Yes
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Disabled
Device supports Unified Addressing (UVA): Yes
Device PCI Bus ID / PCI location ID: 2 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSet...
> Peer access from Tesla K20c (GPU0) -> GeForce GT 730 (...
> Peer access from GeForce GT 730 (GPU1) -> Tesla K20c (...
deviceQuery, CUDA Driver = CUDART, CUDA Driver Version =...
Result = PASS
>
To execute ''bandwidthTest'', obtains the results shown b...
>
beat@tesla:~/NVIDIA_CUDA-6.5_Samples/bin/x86_64/linux/re...
[CUDA Bandwidth Test] - Starting...
Running on...
Device 0: Tesla K20c
Quick Mode
Host to Device Bandwidth, 1 Device(s)
PINNED Memory Transfers
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 6577.3
Device to Host Bandwidth, 1 Device(s)
PINNED Memory Transfers
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 6545.8
Device to Device Bandwidth, 1 Device(s)
PINNED Memory Transfers
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 147234.3
Result = PASS
* Revision History [#e2d6f93d]
>
- 2015/02/16 This article is initially uploaded
Page: