Linux box suddenly crashing wu's.

adrianxw
adrianxw
Joined: 21 Feb 05
Posts: 242
Credit: 322,654,862
RAC: 0
Topic 193304

All the wu's this machine has received since 1st November have exited part way through with a status of 11. Others are completing the wu's. I have set it to "No New Work" against Einstein for the time being.

The machine is an Intel P-IV HT 3.0GHz @ 3.0GHz with a big Zalman heatsink on it, running Suse 10, and BOINC 5.10.x, (20 I think). There have been no changes in configuration or software for months.

The machine is not being used for anything other then BOINC at the moment, and has run trouble free for months, in fact, I rarely look at it other then to make sure everything is still running. It is still crunching SIMAP, Rosetta, QMC and MCDN without error.

Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.

RandyC
RandyC
Joined: 18 Jan 05
Posts: 6,238
Credit: 111,139,797
RAC: 0

Linux box suddenly crashing wu's.

Silly question, but have you tried re-booting; clean out the fans/filters/dust, etc?

Is it Overclocked? Sometimes one project stresses the system more than others and puts out errors when the other projects don't.

Quote:

All the wu's this machine has received since 1st November have exited part way through with a status of 11. Others are completing the wu's. I have set it to "No New Work" against Einstein for the time being.

The machine is an Intel P-IV HT 3.0GHz @ 3.0GHz with a big Zalman heatsink on it, running Suse 10, and BOINC 5.10.x, (20 I think). There have been no changes in configuration or software for months.

The machine is not being used for anything other then BOINC at the moment, and has run trouble free for months, in fact, I rarely look at it other then to make sure everything is still running. It is still crunching SIMAP, Rosetta, QMC and MCDN without error.


Seti Classic Final Total: 11446 WU.

Annika
Annika
Joined: 8 Aug 06
Posts: 720
Credit: 494,410
RAC: 0

Yep, I remember a case like

Yep, I remember a case like that from the S5R2 run. Someone's box suddenly kept producing weird errors, only in Einstein and only since the beginning of the new science run. Turned out to be exactly what you described: The new kind of computation was more stressful on the hardware, the box was overclocked, and those two things didn't go well together.

adrianxw
adrianxw
Joined: 21 Feb 05
Posts: 242
Credit: 322,654,862
RAC: 0

RE: Intel P-IV HT 3.0GHz @

Quote:
Intel P-IV HT 3.0GHz @ 3.0GHz


Quote:
a big Zalman heatsink


... which gets an air duster about once a month, it is spotless.

Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.

th3
th3
Joined: 24 Aug 06
Posts: 208
Credit: 2,208,434
RAC: 0

Signal 11: RE: This App

Signal 11:

Quote:

This App was built with newer version of the BOINC library that I hope to fix some of the segfault client errors (exit status 11).

Try the beta app,
http://einsteinathome.org/node/193299

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.