GTX660 Problem

WB8ILI
WB8ILI
Joined: 20 Feb 05
Posts: 45
Credit: 923965877
RAC: 231693
Topic 196727

I purchased an EVGA GeForce GTX660 video card. This is my first attempt at using a GPU capable card, so I may be missing something.

System is Windows XP Home SP3, 2 GB Ram, AMD Phenom X4 945, BioStar A70L3G main board (Computer 5765580).

Every Einstein GPU workunit I try has the following issues:

1) The computer screen "locks up". About every 30 seconds it "wakes up" for an instant. The Boinc Manager screen then gets updated. If I have mouse-clicked somewhere, that click will "get processed". The CPU is still functioning OK because, for example, I can access the hard drive over my local network with quick response.

2) The Windows System Event log starts filling up with Error, Source=nv, Event=14

3) Eventually the whole computer completely locks up or occasionally gets the black screen of death, indicating an infinite loop in NV4_disp.

What I have tried:

1) Added CUDA_GRID_SIZE_COMPAT=1 as a System Variable.

2) Tried drivers 310.90, 310.70, 306.81, and 305.27.

SETI and Milkyway workunits are being processed without issues (no Windows Error Events, no screen lockup issues) with driver 310.90.

For now, I have aborted all the Einstein CUDA workunits.

Any ideas of things to try?

Richard Haselgrove
Richard Haselgrove
Joined: 10 Dec 05
Posts: 2139
Credit: 2752648592
RAC: 1490764

GTX660 Problem

The CUDA_GRID_SIZE_COMPAT=1 is a specific workround for CUDA 3.0 (and earlier) applications. We use a minimum of CUDA 3.2 here, so it shouldn't be necessary, unless you also want to run SETI on the card.

The rest of your issues sound more like power/heat problems, which I'll leave to the hardware experts.

WB8ILI
WB8ILI
Joined: 20 Feb 05
Posts: 45
Credit: 923965877
RAC: 231693

The GPU temp according to

The GPU temp according to SpeedFan is 69C when running SETI and Milkyway.

SpeedFan eports the 12V at 11.97 volts but I don't know where that sensor is relative to the 12 volts going into the GPU card. I updated to a 650 watt supply when I installed the card.

Mike Davis
Mike Davis
Joined: 3 Apr 05
Posts: 12
Credit: 3026924
RAC: 0

Does it do anything similar

Does it do anything similar on any other cuda applicable projects, or is it just einstein? also... i read something on google about motherboard bios updates fixing some driver probs in some cases like the below. Might be worth a try, you never know!
http://www.evga.com/forums/tm.aspx?m=363206&mpage=1

WB8ILI
WB8ILI
Joined: 20 Feb 05
Posts: 45
Credit: 923965877
RAC: 231693

I am running SETI and

I am running SETI and Milkyway workunits without any issues. I have been using this computer for the three projects (SETI, Milkyway, and Einstein). Right now I am just spreading my "GPU Credit" over two projects instead of three.

I will look into BIOS updates for my computer.

Nobody316
Nobody316
Joined: 14 Jan 13
Posts: 141
Credit: 2008126
RAC: 0

Just to ask and make sure...

Just to ask and make sure... are you overclocking not just gpu but ram or processor... also have you done a bios update... I have seen many problems with gpu's since I came from fah@home I have learned a very few tricks but first need all info I can get... Also if you can you need to get the temp down to 55C or less because heat can be an issue...

PC setup MSI-970A-G46 AMD FX-8350 8 core OC'd 4.45GHz 16GB ram PC3-10700 Geforce GTX 650Ti Windows 7 x64 Einstein@Home

WB8ILI
WB8ILI
Joined: 20 Feb 05
Posts: 45
Credit: 923965877
RAC: 231693

The GTX660 model I have is

The GTX660 model I have is not one of models that comes overclocked and I am not over clocking the one I have. I have the latest BIOS. I am not overclocking the CPU either.

I did increase the fan speed and now have the temp in the low 50C range.

I then tried some more Einstein GPU workunits but the screen locked up as described in the first message of this thread. I didn't wait for the black screen of death.

This is not huge problem as I am getting a lot of credit on SETI and Milkyway as those workunits run fine.

Nobody316
Nobody316
Joined: 14 Jan 13
Posts: 141
Credit: 2008126
RAC: 0

RE: The GTX660 model I have

Quote:

The GTX660 model I have is not one of models that comes overclocked and I am not over clocking the one I have. I have the latest BIOS. I am not overclocking the CPU either.

I did increase the fan speed and now have the temp in the low 50C range.

I then tried some more Einstein GPU workunits but the screen locked up as described in the first message of this thread. I didn't wait for the black screen of death.

This is not huge problem as I am getting a lot of credit on SETI and Milkyway as those workunits run fine.

Ok 1 more thing. The past few days I have had problems with computer freezing. I noted firefox had updated atleast once and maybe twice. I stop using firefox for the past 2 days now with out problems. If you are using firefox try something different. I changed over to google chrome for the time being. It's worth a shot. Good luck I will check back in about a week. Next monday by the latest. Let me know if this works for ya.

PC setup MSI-970A-G46 AMD FX-8350 8 core OC'd 4.45GHz 16GB ram PC3-10700 Geforce GTX 650Ti Windows 7 x64 Einstein@Home

WB8ILI
WB8ILI
Joined: 20 Feb 05
Posts: 45
Credit: 923965877
RAC: 231693

I have Firefox installed but

I have Firefox installed but am not using it when my "freeze" occurs. In fact, nothing (no user application) is running except BOINC and Einstein. I stopped my SETI and Milkyway WU's when trying to run Einstein WU's.

Neil Newell
Neil Newell
Joined: 20 Nov 12
Posts: 176
Credit: 169699457
RAC: 0

RE: Every Einstein GPU

Quote:


Every Einstein GPU workunit I try has the following issues:

1) The computer screen "locks up". About every 30 seconds it "wakes up" for an instant. The Boinc Manager screen then gets updated. If I have mouse-clicked somewhere, that click will "get processed". The CPU is still functioning OK because, for example, I can access the hard drive over my local network with quick response.

2) The Windows System Event log starts filling up with Error, Source=nv, Event=14

Interesting you can access it over the network, which points the finger at the video card driver. I'm not familiar with Windows, but presumably it's the same as linux where processing continues whether the manager is running or not. Have you tried leaving it running without the manager active, and if so what happens?

Unless someone here happens to know what it means, you may be better off asking about the meaning for the 'Event=14' error on the nvidia forums.

I know e@h is really hard on bandwidth to the GPU; maybe it's showing something up that doesn't occur with other WUs. If you want to keep experimenting, might be interesting to see if the problem still occurs if you set your preferences to only process GPU versions of applications, and also if it happens if you only process CPU versions.

Nobody316
Nobody316
Joined: 14 Jan 13
Posts: 141
Credit: 2008126
RAC: 0

RE: 2) The Windows System

Quote:
2) The Windows System Event log starts filling up with Error, Source=nv, Event=14

Ok the NV error is too generic to be sure.

Quote:
2) Tried drivers 310.90, 310.70, 306.81, and 305.27.

Try uninstalling your graphics with a tool for that purpose (gets all the files) like driver cleaner pro. and may need to use a Registry Cleaner as well. Then try only 1 driver. If all else fails you may need to go to the nvidia forums. I will keep searching for an answer but until then best of luck.

PC setup MSI-970A-G46 AMD FX-8350 8 core OC'd 4.45GHz 16GB ram PC3-10700 Geforce GTX 650Ti Windows 7 x64 Einstein@Home

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.