Gravitational Wave search O2 Multi-Directional ("O2MD1")

Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5,870
Credit: 115,860,588,161
RAC: 35,539,347

Betreger wrote:
It seems odd they aren't being sent out again.

Why do you think that?  They will be sent out again, and if it's a problem with the validator rather than the result, it will most likely get 'fixed'.  The last result in Holmis' list has already been 'fixed', so it does look promising.

Cheers,
Gary.

Holmis
Joined: 4 Jan 05
Posts: 1,118
Credit: 1,055,935,564
RAC: 0

Seems all of them have been fixed now. Thanks for taking care of it!

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4,305
Credit: 248,839,217
RAC: 33,873

On the GPUs ("O2MDF") we have another chunk of the "V2" workunits. Based on previous experience these should run about twice as long as expected (e.g. like the "G2" ones). I doubled the credit and flops estimation to make up for that, hope that this helps.

BM
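
To illustrate why doubling the flops estimate matters: the client's runtime prediction scales roughly with the estimated operation count divided by the device speed, so doubling the estimate doubles the predicted runtime. The following is only a minimal sketch of that arithmetic; the function name and all numbers are hypothetical and not taken from the project's actual workunit settings.

```python
def estimated_runtime_seconds(fpops_est: float, device_flops: float) -> float:
    """Rough runtime prediction: estimated operation count / device speed."""
    if device_flops <= 0:
        raise ValueError("device_flops must be positive")
    return fpops_est / device_flops


# Hypothetical values, for illustration only.
old_fpops_est = 1.44e17   # assumed original flops estimate for one task
device_flops = 4.0e13     # assumed effective GPU speed in FLOP/s

old_runtime = estimated_runtime_seconds(old_fpops_est, device_flops)
new_runtime = estimated_runtime_seconds(2 * old_fpops_est, device_flops)

print(f"predicted runtime before doubling: {old_runtime / 3600:.2f} h")
print(f"predicted runtime after doubling:  {new_runtime / 3600:.2f} h")
```

Under the same assumption, doubling the credit per task keeps the credit per hour of runtime about where it was for the shorter tasks.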

Betreger
Joined: 25 Feb 05
Posts: 991
Credit: 1,553,348,477
RAC: 706,112

I'm getting a fair number of "Error while computing" on both boxes.

https://einsteinathome.org/workunit/441408802

Mike Hewson
Moderator
Joined: 1 Dec 05
Posts: 6,579
Credit: 306,922,254
RAC: 166,647

You have the dreaded CL_MEM_OBJECT_ALLOCATION_FAILURE on some of those Vela Junior tasks for your computer with a 2GB Nvidia card. See this thread for more information. 

Cheers, Mike.

I have made this letter longer than usual because I lack the time to make it shorter ...

... and my other CPU is a Ryzen 5950X :-) Blaise Pascal

Betreger
Joined: 25 Feb 05
Posts: 991
Credit: 1,553,348,477
RAC: 706,112

https://einsteinathome.org/workunit/447314688 I have 22 in a row of these. This was running S@H just fine and runs pulsars fine also. This card is a GTX 1060 3GB. Methinks it is bad data, not the host. Most fail in ~1 min.

Betreger
Joined: 25 Feb 05
Posts: 991
Credit: 1,553,348,477
RAC: 706,112

I rebooted the offending box and have now successfully completed 7 in a row, so the problem seems to have been the host. They are in a pending status, so time will tell.

Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5,870
Credit: 115,860,588,161
RAC: 35,539,347

Betreger wrote:
https://einsteinathome.org/workunit/447314688 I have 22 in a row of these. This was running S@H just fine and runs pulsars fine also. This card is a GTX 1060 3GB. Methinks it is bad data, not the host. Most fail in ~1 min.

Betreger wrote:
I rebooted the offending box and have now successfully completed 7 in a row, so the problem seems to have been the host. They are in a pending status, so time will tell.

Unless the problem is the immediate consequence of the release of a new or modified app that has just been announced here, Technical News is NOT the best forum to report new problems in longer running searches.  There is a Problems forum specifically for that purpose.  There is also no use in reporting a problem in multiple places.  You just create more work for the already overworked Devs in trying to keep up with all reports that are coming in.  You just encourage the 'me too' and the 'maybe it could be me too (but it's actually different)' reports to be in different places as well.

At the start of every day, I check the problems forum first and try to deal with any overnight problem reports, if I can.  When I checked your report, it must have been just before you rebooted because there were only failed tasks, and none in progress at the instant I looked.  It's always a good idea to try a reboot before declaring that a problem exists.

It is still quite possible that there really could be memory allocation issues with these higher frequency tasks, so if you see further examples, please report them in the Problems forum.

Cheers,
Gary.
