My last six E@H workunits have all finished Client Error - Exit Status 10 (0xa).
They are all claiming the same amount of credit as the successfully completed Results for the same WU.
On one of them there are two othere completed results who have apparently been granted credit but my result hasn't. If this is going to happen for all the others I will have wasted the last 10 days of processing time (since LHC@H and S@H are not currently producing WUs E@H had exclusive use of my PC).
I can't find anything about the exit code so I can't try to fix it! If I'm not going to get credit for results that fail after running for 30+ hours each I might as well wait until S@H come back and let them have the time - E@H looks like a waste of time.
Copyright © 2024 Einstein@Home. All rights reserved.
Exit Status 10 (0xa)
)
Don't know if this is overly simplistic, but just for grins and giggles, try rebooting...
Edit: Poking around some on the web, I found TANPAKU had this issue in the past with a bug with the checkpointing in their application. The forum thread describing the issue is here
RE: RE: My last six E@H
)
As it happens I rebooted around 16:30(UTC) this afternoon. The machine has also been rebooted two or three times over the last ten days.
I'm going to bed now - will see how the latest WU gets on tomorrow (or Saturday!)
RE: My last six E@H
)
How do you run your BOINC client (as service, single installation? run always?)
I noticed that BOINC was restarted and could no longer read the checkpoint:
at 2007-05-07 14:22:53 Boinc is restarted and able to read the checkpoint file.
at the end Boinc seems to be started (a second time?) without being stopped and can't read the checkpoint file (file in use?)
[Edit] just noticed that Boinc took 12 minutes between 'Reading SFTs and setting up stacks' (at 20:26:22) and 'Found checkpoint - reading' (at 20:38:06).
What was going on there?
Udo
RE: How do you run your
)
Boinc runs as a service - run always.
I reset E@H yesterday evening and, despite there being nothing else running on the PC (I've been in bed and at work since then!) I just found that the Result that started last night says it has only managed 2 1/2 hours or CPU time and reached 5.7%. Watching it now it seems to be clocking up CPU time only slightly slower than real time. Also found 3 "... exited with zero status but no 'finished' file"
The other odd thing was that the machine was showing the E@H screen saver, time was correct when I woke it up (around 17:15) but display was frozen. It took 3 or 4 minutes before I could get any response (even to num lock/caps lock!) after which I got to the traditional WinXP "PC Locked - login prompt". According the the Boinc release notes, the screen saver should not work at all when running as a service. In addition, I use the WindowsXP Home login screen rather than the traditional windows version (easier for th efamily!) so something strange is going on.
History:
I stopped running Boinc (all projects) for several months but I recently installed the latest version Boinc Client 5.8.16. At that time I also changed to the "run as service" option so that Boinc would still run (without me being logged in) if the machine rebooted unexpectedly (kids games sometimes do that or kids forgetting I want it left on). I also recently attached to BAM.
Only other significant change was upgrading to latest Norton Internet Security (only choice was upgrade - renewal not available for version I was running before!) - this has significantly slowed the machine (using much more memory as well).
Not aware of anything unusual at that time.
While I have been typing thus - the E@H Result has kept clocking up CPU time at almost real time (only other thing going on is me typing into IE!) This is the way it used to work, so perhaps it has cured itself (got worried I was looking too hard ;-)
After you installed BOINC as
)
After you installed BOINC as a service, did you also use the work-around to re-enable the graphics and the screen saver?
And have you tried running without the screen saver? It's generally a resource hog. Even on computers with external (AGP, PCI, PCI-e) videocards.
To run without the screen saver, go to Display properties, screen saver tab, set the screen saver to None and OK out. Then just turn off the monitor, or use power standby options.
RE: After you installed
)
Wasn't aware of the work around - I wasn't too fussed about losing it anyway so I didn't look.
Now I have the info I may try it just so I can see the graphics occasionally (I quite like the LHC@H graphics)
I have just disabled the SCR - I should have done it before as I always though they looked quite expensive (I remember when the GL SCRs first came out in WinNT, if they popped up anything you might have left running on the system didn't get a look in!). Thanks for the prompt!
Graham
It's taken a while (slightly
)
It's taken a while (slightly over a month ;-)), but I finally know what exit code 10 is. If you run into this problem... just reset the project. It's a problem with the application's checkpointing capabilities. And it will affect the whole work unit you are crunching. Getting a new work unit seems to fix it for most people.
I know Bernd is trying to get this problem fixed ASAP though.