이번에 서버에 Tesla K40m 2개를 추가한후에 그래픽드라이버를 설치하는데 오류가 생겨서 일주일을 고생하다 질문을 드립니다. <div><br></div> <div>환경은</div> <div><p style="margin:0px;padding:0px;font-family:Tahoma, '굴림';color:#222222;">서버 : HP DL380p</p> <p style="margin:0px;padding:0px;font-family:Tahoma, '굴림';color:#222222;">OS : 우분투 14.04.1 LTS</p> <p style="margin:0px;padding:0px;font-family:Tahoma, '굴림';color:#222222;">그래픽카드 : Tesla K40m 두개입니다.</p> <p style="margin:0px;padding:0px;font-family:Tahoma, '굴림';color:#222222;"><br></p> <div style="margin:0px;padding:0px;color:#222222;font-family:Tahoma;line-height:normal;font-size:medium;"><span style="margin:0px;padding:0px;font-size:12px;">01:00.1 VGA compatible controller: Matrox Electronics Systems Ltd. MGA G200EH</span></div> <div style="margin:0px;padding:0px;color:#222222;font-family:Tahoma;line-height:normal;font-size:medium;"><span style="margin:0px;padding:0px;font-size:12px;">04:00.0 3D controller: NVIDIA Corporation GK110BGL [Tesla K40m] (rev a1)</span></div> <div style="margin:0px;padding:0px;color:#222222;font-family:Tahoma;line-height:normal;font-size:medium;"><span style="margin:0px;padding:0px;font-size:12px;">24:00.0 3D controller: NVIDIA Corporation GK110BGL [Tesla K40m] (rev a1)</span></div></div> <div style="margin:0px;padding:0px;color:#222222;font-family:Tahoma;line-height:normal;font-size:medium;"><span style="font-size:9pt;">lspci로 인식은 하고 있는거 같은데 설치 에러가 납니다.</span></div> <div style="margin:0px;padding:0px;color:#222222;font-family:Tahoma;line-height:normal;font-size:medium;"><span style="font-size:9pt;"><br></span></div> <div style="margin:0px;padding:0px;color:#222222;font-family:Tahoma;line-height:normal;font-size:medium;"><span style="font-size:9pt;">처음은 lightdm을 끄고 드라이버 설치파일의 실행권한을 쓰기권한을 주고 설치를했는데</span></div> <div style="margin:0px;padding:0px;color:#222222;"><span></span><div style="font-family:Tahoma;line-height:normal;font-size:9pt;text-align:left;"><img src="http://thimg.todayhumor.co.kr/upfile/201507/1436230657b5c4MEYULzMmgxqIB6WSwtRBErntUkW.png" width="675" height="400" alt="Image1.png" style="border:none;"></div> <div style="font-family:Tahoma;line-height:normal;font-size:9pt;text-align:left;"><br></div> <div style="font-family:Tahoma;line-height:normal;font-size:9pt;text-align:left;">이런 에러가 발생했습니다.</div> <div style="text-align:left;"> <div style="font-family:Tahoma;line-height:normal;color:#000000;"><font size="2">Kernel module load error: No such device</font></div> <div style="font-family:Tahoma;line-height:normal;color:#000000;"><font size="2"><br></font></div> <div style="font-family:Tahoma;line-height:normal;color:#000000;"><font size="2"> Kernel messages:<br>[ 2846.343666] [<ffffffffa031f2c4>] nvidia_init_module+0x2c4/0x78a [nvidia]<br>[ 2846.343695] [<ffffffffa031f79f>] ? nv_drm_init+0x15/0x15 [nvidia]<br>[ 2846.343723] [<ffffffffa031f825>] nvidia_frontend_init_module+0x86/0x861 [nvidia]<br>[ 2846.343727] [<ffffffff8100214a>] do_one_initcall+0xfa/0x1b0<br>[ 2846.343731] [<ffffffff81059903>] ? set_memory_nx+0x43/0x50<br>[ 2846.343736] [<ffffffff810e275d>] load_module+0x12dd/0x1b40<br>[ 2846.343739] [<ffffffff810de1e0>] ? store_uevent+0x40/0x40<br>[ 2846.343742] [<ffffffff810e3136>] SyS_finit_module+0x86/0xb0<br>[ 2846.343746] [<ffffffff81733d5d>] system_call_fastpath+0x1a/0x1f<br>[ 2846.343747] ---[ end trace 8d51a9b3ed0ff385 ]---<br>[ 2846.343788] NVRM: This PCI I/O region assigned to your NVIDIA device is invalid:<br>[ 2846.343788] NVRM: BAR1 is 0M @ 0x0 (PCI:0000:04:00.0)<br>[ 2846.343790] NVRM: The system BIOS may have misconfigured your GPU.<br>[ 2846.343794] nvidia: probe of 0000:04:00.0 failed with error -1<br>[ 2846.343846] NVRM: This PCI I/O region assigned to your NVIDIA device is invalid:<br>[ 2846.343846] NVRM: BAR1 is 0M @ 0x0 (PCI:0000:24:00.0)<br>[ 2846.343853] NVRM: The system BIOS may have misconfigured your GPU.<br>[ 2846.343867] nvidia: probe of 0000:24:00.0 failed with error -1<br>[ 2846.343889] Error: Driver 'nvlink' is already registered, aborting...<br>[ 2846.344317] NVRM: The NVIDIA probe routine failed for 2 device(s).<br>[ 2846.344319] NVRM: None of the NVIDIA graphics adapters were initialized!<br>[ 2846.344320] [drm] Module unloaded<br>[ 2846.344395] NVRM: NVIDIA init module failed!<br>[ 2846.344902] systemd-udevd[7863]: Failed to apply ACL on /dev/dri/card0: No such file or directory<br>[ 2846.346540] systemd-udevd[7862]: Failed to apply ACL on /dev/dri/card0: No such file or directory</font></div> <div style="font-family:Tahoma;line-height:normal;color:#000000;"><font size="2"><br></font></div> <div style="font-family:Tahoma;line-height:normal;color:#000000;"><font size="2">에러 원인을 찾아보니 </font><span style="color:#222222;font-size:9pt;">nouveau</span><span style="color:#222222;font-size:9pt;"> 충돌 문제라기에 블랙리스트에 추가한후에 해도 동일한 문제가 발생하기에</span></div> <div style="font-family:Tahoma;line-height:normal;color:#000000;"><span style="color:#222222;font-size:9pt;">nvidia-current를 설치하면 자동으로 잡아준다는 글을 보고 해보니</span></div> <div style="color:#000000;"><span style="color:#222222;font-size:9pt;"> </span><div style="font-family:Tahoma;line-height:normal;text-align:left;"><img src="http://thimg.todayhumor.co.kr/upfile/201507/1436230845hZYfPx8rstMP.png" width="675" height="400" alt="Image2.png" style="border:none;"></div> <div style="font-family:Tahoma;line-height:normal;text-align:left;">여기서 100프로가 된후</div> <div style="text-align:left;"> <div style="font-family:Tahoma;line-height:normal;text-align:left;"><img src="http://thimg.todayhumor.co.kr/upfile/201507/1436230867uehjlqzoLRZORoQLUuHXyFNVe9.png" width="675" height="400" alt="Image3.png" style="border:none;"></div> <div style="font-family:Tahoma;line-height:normal;text-align:left;">이런 에러가 뜹니다.</div> <div style="font-family:Tahoma;line-height:normal;text-align:left;"><br></div> <div style="font-family:Tahoma;line-height:normal;text-align:left;">그래서 nvidia 커널 모듈을 불러오는데 </div> <div style="font-family:Tahoma;line-height:normal;text-align:left;"><span style="font-family:Verdana, '굴림';line-height:18px;">modprobe: ERROR: could not insert 'nvidia_304': No such device</span></div> <div style="font-family:Tahoma;line-height:normal;text-align:left;"><span style="font-family:Verdana, '굴림';line-height:18px;">디바이스를 찾을수 없다는 에러 메시지가 나옵니다. 여기까지 진행하고 1주일을 글을 찾아보고 진행하고 OS를 다시 설치하고 진행해도 도저히</span></div> <div style="font-family:Tahoma;line-height:normal;text-align:left;"><span style="font-family:Verdana, '굴림';line-height:18px;">해결이 안되어 질문을 드립니다.</span></div> <div style="font-family:Tahoma;line-height:normal;text-align:left;"><span style="font-family:Verdana, '굴림';line-height:18px;"><br></span></div> <div style="font-family:Tahoma;line-height:normal;text-align:left;"><span style="font-family:Verdana, '굴림';line-height:18px;">답변을 주신다면 정말 감사하겠습니다.</span></div><br></div><br></div></div><br></div>
댓글 분란 또는 분쟁 때문에 전체 댓글이 블라인드 처리되었습니다.