Errors with GravWave Search - GPU

halfempty
halfempty
Joined: 3 Apr 20
Posts: 14
Credit: 37595576
RAC: 0

I am going to change the

I am going to change the entry to only exclude einstein_O2MDF instead of all Einstein work from the 1060 3GB. I'll let you know if I mess anything up.

Betreger
Betreger
Joined: 25 Feb 05
Posts: 992
Credit: 1593222339
RAC: 778218

I don't think that is

I don't think that is necessary if I understand Bernd's post.

https://einsteinathome.org/goto/comment/176698
Zalster
Zalster
Joined: 26 Nov 13
Posts: 3117
Credit: 4050672230
RAC: 0

Betreger wrote:I don't think

Betreger wrote:

I don't think that is necessary if I understand Bernd's post.

https://einsteinathome.org/goto/comment/176698

 Did you read my reply to his statement?

https://einsteinathome.org/goto/comment/176723

Ian&Steve C.
Ian&Steve C.
Joined: 19 Jan 20
Posts: 3963
Credit: 47136032642
RAC: 65421160

how much ram is being used on

how much ram is being used on each GPU for these GW VelaJr tasks? as reported by nvidia-smi.

 

if it's more than 830MB or so, then that's probably the reason that the tasks don't work on 3GB and lower GPUs, since you can only use ~27% of the available memory for OpenCL on nvidia cards.

_________________________________________________________________________

halfempty
halfempty
Joined: 3 Apr 20
Posts: 14
Credit: 37595576
RAC: 0

Betreger wrote:I don't think

Betreger wrote:

I don't think that is necessary if I understand Bernd's post.

https://einsteinathome.org/goto/comment/176698

Bernd posted that the day after I had my last error and took the 3GB card out of Einstein processing. While the problem is probably fixed, Nvidia GPU BOINC processing isn't entirely clear cut, as Ian and Zalster pointed out. Right now I'm a little gun shy after 66 errors and a day in the penalty box. Next week I'll be brave and undo the exclusion when I'll have time to keep an eye on it.

mikey
mikey
Joined: 22 Jan 05
Posts: 12701
Credit: 1839103849
RAC: 3643

halfempty wrote:Just a quick

halfempty wrote:
Just a quick update to let everyone know that after I excluded the 1060 3GB from Einstein there have been no more errors. Every VelaJr1 task on the 1060 3GB failed, every one on the 1050Ti 4GB or the 1660 6GB succeeded. 

Have you tried running these task on the gpu?

Gamma-ray pulsar binary search #1 on GPUs v1.22 () windows_x86_64

Admittedly they are not the tasks you ran before but they seem to run better on my gpu's.

Eugene Stemple
Eugene Stemple
Joined: 9 Feb 11
Posts: 67
Credit: 378063297
RAC: 599600

VelaJr running in Nvidia 1060

VelaJr running in Nvidia 1060 6gb is showing (nvidia-smi) 1787 MiB memory in use.  One at a time, of course.  Several are in the validated tasks tables and none in the invalid lists so they appear to be flowing through normally.

(Can't find anything about video RAM used in stderr output.  Max card memory - yes; amount used - no.)

 

halfempty
halfempty
Joined: 3 Apr 20
Posts: 14
Credit: 37595576
RAC: 0

mikey wrote:Have you tried

mikey wrote:

Have you tried running these task on the gpu?

Gamma-ray pulsar binary search #1 on GPUs v1.22 () windows_x86_64

Admittedly they are not the tasks you ran before but they seem to run better on my gpu's.

Yes, I am currently running both. It was that specific type if task (VelaJr1) on the 3GB card that failed, but now that I have excluded the 3GB card from running GW O2 tasks I have no new errors.

Ian&Steve C.
Ian&Steve C.
Joined: 19 Jan 20
Posts: 3963
Credit: 47136032642
RAC: 65421160

Eugene Stemple wrote:VelaJr

Eugene Stemple wrote:

VelaJr running in Nvidia 1060 6gb is showing (nvidia-smi) 1787 MiB memory in use.  One at a time, of course.  Several are in the validated tasks tables and none in the invalid lists so they appear to be flowing through normally.

(Can't find anything about video RAM used in stderr output.  Max card memory - yes; amount used - no.)

 

at the bottom of the Nvidia-smi output you can see how much the app is using rather than the whole card. Some of the used memory will be taken up by the desktop manager. not usually more than a couple hundred MB though 

 

_________________________________________________________________________

D_S_Spence
D_S_Spence
Joined: 1 Dec 17
Posts: 1
Credit: 94557539
RAC: 12245

I am also getting the

I am also getting the CL_MEM_OBJECT_ALLOCATION_FAILURE errors, running the VelaJr tasks.  NVidia GeForce GTX 1050 with 2GB RAM.  This is the host: https://einsteinathome.org/host/12599623

The errors started at the same time as the last Windows Update (April 15), so I thought that might have something to do with it.  I guess not, though?

I have just updated the driver for my GPU, in case that caused it, but that didn't fix it.  I am now going to deselect this project for a while so that this computer can get back to work.

 

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.