Hello all,
I have a question regarding the Einstein project within the BOINC manager I run. Each time it is resumed it restarts from 0%, no matter where it was suspended (when I turned off the PC for example or manually suspended it).
I also get the following message: "Task XXXXXX exited with zero status but no 'finished' file. If this happens repeatedly you may need to reset the project."
What can I do about this? How do I restart the project?
Will this solve the problem?
Copyright © 2024 Einstein@Home. All rights reserved.
Small issue/question...
)
Does it start from zero, but then in a few minutes pop back to a larger percentage? You have returned a few results, and they look good. The exit with 0 status is kind of a weird error, and is usually not a problem.
If you want to reset it, I would suggest you set the option "No New Work" when it finishes, do a final update, then detach and reattach. You will probably get a new ID, but this may help.
I recently came to understand
)
I recently came to understand that the restart points are set by the "Write to disk at most every" setting under "Your Account" / "General Preferences".
I had thought they were written in the code somewhere - maybe they are - I seem to recall a conversation about this sometime in Dec.
I may be off base here, but I think the info came from Eric Korpela, one of the top scientist cats over at SETI, in the last few days.
Jim
Those who don’t build must burn. It’s as old as history and juvenile delinquents.
Ray Bradbury - Fahrenheit 451
me again... For e.g. a
)
me again...
For e.g. a task is at 40% after 3hours. I turn off the PC and when I turn it on it restarts the task from 0% although the CPU time is 3h. It does not jump forward after a while...it keeps on going from scratch.
: If I change the settings for "Write to disk at most every" option will it have any effect on this? It is currently set to 60 sec. What value should it be to solve the issue...
10q
Seems that albert is unable
)
Seems that albert is unable to read the checkpoint after it exits with
"no heartbeat from core client" error message. The second time the checkpoint
is read successfully. The "no heartbeat" message can usually be ignored, maybe not in your case.
Try to reset the project after the current WU is finished and reported.
HTH
Michael
2006-06-14 23:01:28.0781 [normal]: Start of BOINC application 'projects/einstein.phys.uwm.edu/albert_4.37_windows_intelx86.exe'.
2006-06-14 23:01:28.0781 [normal]: Started search at lalDebugLevel = 0
2006-06-14 23:01:29.5468 [normal]: Checkpoint-file 'Fstat.out.ckp' not found.
2006-06-14 23:01:29.5468 [normal]: No usable checkpoint found, starting from beginning.
2006-06-14 23:09:21.8906 [normal]: Fstat file reached MaxFileSizeKB ==> compactifying ... done.
2006-06-15 20:54:39.8125 [normal]: Start of BOINC application 'projects/einstein.phys.uwm.edu/albert_4.37_windows_intelx86.exe'.
2006-06-15 20:54:39.8281 [normal]: Started search at lalDebugLevel = 0
2006-06-15 20:54:41.3437 [normal]: Found checkpoint-file 'Fstat.out.ckp'
2006-06-15 20:54:41.3906 [normal]: Trying to read Fstat-file into toplist ...
2006-06-15 20:54:44.5468 [normal]: Checksum Ok. Successfully read_toplist_from_fp()
2006-06-15 20:54:44.5468 [normal]: Resuming computation at (23268/109964945/2207927).
No heartbeat from core client for 31 sec - exiting
2006-06-16 17:22:07.5625 [normal]: Start of BOINC application 'projects/einstein.phys.uwm.edu/albert_4.37_windows_intelx86.exe'.
2006-06-16 17:22:07.5625 [normal]: Started search at lalDebugLevel = 0
2006-06-16 17:22:09.0312 [normal]: Found checkpoint-file 'Fstat.out.ckp'
Failed to read checkpoint-counters from 'Fstat.out.ckp'!
2006-06-16 17:22:09.0312 [normal]: No usable checkpoint found, starting from beginning.
2006-06-16 17:51:33.8437 [normal]: Fstat file reached MaxFileSizeKB ==> compactifying ... done.
2006-06-16 18:47:55.8281 [normal]: Start of BOINC application 'projects/einstein.phys.uwm.edu/albert_4.37_windows_intelx86.exe'.
2006-06-16 18:47:55.8281 [normal]: Started search at lalDebugLevel = 0
2006-06-16 18:47:57.2187 [normal]: Found checkpoint-file 'Fstat.out.ckp'
2006-06-16 18:47:57.2187 [normal]: Trying to read Fstat-file into toplist ...
2006-06-16 18:48:00.2187 [normal]: Checksum Ok. Successfully read_toplist_from_fp()
2006-06-16 18:48:00.2187 [normal]: Resuming computation at (23042/109577076/2200170).
2006-06-16 23:32:26.0937 [normal]: Search finished successfully.
Team Linux Users Everywhere
![](http://allprojectstats.com/su2082519h1--1-1.png)
10x... I will try this...the
)
10x...
I will try this...the problem is that now it's not only Einstein...
SETI is doing the same thing...:(
RE: 10x... I will try
)
I have a few questions, suggestions:
What Anti-Virus software are you running? If Norton, or I think there was another, have it NOT scan the BOINC folder and subdirectories.
Have you done a disk check or defrag lately, or do either of these automatically run? Projects can not be running when these happen, it causes bad things to happen.
What are all the projects you are running? Just Seti and Einstein? If so, set both to no new work, allow them to finish, and do a final update to make sure everything gets reported correctly. Then uninstall, and reinstall.
My antivirus is
)
My antivirus is BitDefender...and I did not scan lately...nor defrag.
I run Einstein, SETI and Rosetta.
I will try to reinstall...hope it works this way :)