S5R2c WU's

Jim Howe
Jim Howe
Joined: 25 Mar 05
Posts: 18
Credit: 11707416
RAC: 0
Topic 192648

Four of my machines seem able to crunch the S5R2c WU's without the dreaded "Client error" code 11 SIGABRT.

Maybe all my other five machines have flakey memory, I don't know.

But now I am going to halt, for now, all but the four that seem to be hanging in there. I am wondering if this is being looked at as something in the Einstein app, or if this is just the way it is now.

When I get a chance I will run memtest on the machine here, but for now we're off to Shanghai.

...

DanNeely
DanNeely
Joined: 4 Sep 05
Posts: 1364
Credit: 3562358667
RAC: 0

S5R2c WU's

the current app appears to have some bugs in it, it's unlikely to be your hardware although why some machines are dieing constantly and others problem free isn't publicly known.

Ananas
Ananas
Joined: 22 Jan 05
Posts: 272
Credit: 2500681
RAC: 0

Bruce Allen has the same

Bruce Allen has the same problem (including some 4.18 results)
so I'm sure the project team will find out how to fix it.

As long as he has the same problems, you shouldn't try to fix
your machine as it is probably not broken.

Annika
Annika
Joined: 8 Aug 06
Posts: 720
Credit: 494410
RAC: 0

No, it probably isn't. One of

No, it probably isn't. One of my hosts ran into "SIGABRT" with the 4.18 app yesterday and crashed a WU after 50 hours, but it has also returned some perfectly valid results (and shown no problems in memtest) so I'm quite sure the box is working fine. It is an AMD host, though... I've heard the opinion that AMDs are more likely to get these problems, just like they tend to perform poorer (although that mostly affects Windows, and mine is a Linux host... interesting...) with the current science app. So, don't worry about your boxes, I'm sure there's nothing wrong with them. Maybe they all got unlucky with the datafile and received more "difficult" WUs. The project staff is working on it...

Bikeman (Heinz-Bernd Eggenstein)
Bikeman (Heinz-...
Moderator
Joined: 28 Aug 06
Posts: 3522
Credit: 737442253
RAC: 1304882

RE: the current app appears

Message 62907 in response to message 62904

Quote:
the current app appears to have some bugs in it, it's unlikely to be your hardware although why some machines are dieing constantly and others problem free isn't publicly known.

This problem is only affecting the Linux client, right? It is affecting AMD and Intel CPUs alike.

For my hosts I found that the version and/or presence of libGL (Open GL package) may also make a difference. After using LD_LIBRARY_PATH inside run_client to point to a fake libGL that doesn't load (after that the client disables grahics support but will carry on regardless), all my Linux boxes seem to do just fine.

Maybe someone with a box that is reliably failing on WUs wants to give it a try, if this can be confirmed we would be a big step ahead in resolving this problem.

Disclaimer: This will disable graphics for all of your BOINC clients, and some clients may even react adversely to this. So this is best tried out for a host that is running exclusively E@H and no other BOINC projects.

CU

BRM

Jim Howe
Jim Howe
Joined: 25 Mar 05
Posts: 18
Credit: 11707416
RAC: 0

I now see that with the 4.18

I now see that with the 4.18 version of the heirarchical search that things seem stable again. I also started up the 4.21 beta version on one machine and it produced two results with no problem.

When I see these problems I have an urge to debug, but in this case I have little clue what to debug.

...

Bikeman (Heinz-Bernd Eggenstein)
Bikeman (Heinz-...
Moderator
Joined: 28 Aug 06
Posts: 3522
Credit: 737442253
RAC: 1304882

RE: I now see that with the

Message 62909 in response to message 62908

Quote:

I now see that with the 4.18 version of the heirarchical search that things seem stable again. I also started up the 4.21 beta version on one machine and it produced two results with no problem.

When I see these problems I have an urge to debug, but in this case I have little clue what to debug.

...

4.18 was badly affected as well, but the beta (4.21) seems to fix this. Bernd said the beta will probably soon be released as regular download, so everything will be fine soon.

CU

BRM

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.