Stuck Again

Bobby Conger
Joined: 6 Oct 19
Posts: 32
Credit: 5704389
RAC: 1801
Topic 222201

This task (h1_0896.35_O2C02Cl5In0__O2MD1C2_CasA_896.80Hz_5) got to 99% and stayed there for > 10 hours.  I aborted the task so others could continue.  Can I reload this and perhaps finish the task and if so, how?

 

Bobby


mikey
Joined: 22 Jan 05
Posts: 11969
Credit: 1833978798
RAC: 224610

Bobby Conger wrote:

This task (h1_0896.35_O2C02Cl5In0__O2MD1C2_CasA_896.80Hz_5) got to 99% and stayed there for > 10 hours.  I aborted the task so others could continue.  Can I reload this and perhaps finish the task and if so, how?

 

Bobby

Once you abort a task it is sent to someone else.

Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5850
Credit: 110034156707
RAC: 22397653

Bobby Conger wrote:
... Can I reload this and perhaps finish the task and if so, how?

Aborting a task kills it completely.  You can't 'reload' it.

For future reference, if something seems 'stuck' like this, you should just stop BOINC, wait a few seconds and then try restarting BOINC.  The purpose of that is to remove the task from memory and force BOINC to reload it from the last saved checkpoint (which will be on disk) and thereby give it a chance to get past where it was previously 'stuck'.

If that is still unsuccessful, you should shut down the computer and then restart it.  The purpose of that is to allow a fresh set of system libraries, etc, to be loaded, in case the problem is a system problem rather than an app problem.

In many cases (perhaps including this particular one) you may find the task can then be completed.  Here is the very end of what was returned to the project after you aborted the task.  You can check the whole thing for yourself by clicking on the Task ID link for the failed task - as shown on the website.

2020-04-26 16:53:22.7633 (9920) [normal]: Finished main analysis.
2020-04-26 16:53:22.7633 (9920) [normal]: Recalculating statistics for the final toplist...
2020-04-26 16:54:39.8643 (9920) [normal]: Finished recalculating toplist statistics.
2020-04-26 16:54:39.8663 (9920) [debug]: Writing output ... toplist2 ... toplist3 ...
</stderr_txt>

The calculations were completely finished and the final result statistics were being written to several files for return to the project.  You can see that in just over a minute, that stage (the 99% to 100% stage) had been completed and files were being written out.  Two files were written but there were more to come.  Something stalled in the writing of the next file.  Writing files to disk is a system process.  Rebooting the machine may very well have cleared whatever was causing that to get stuck.  It would certainly have been worth a try.
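If you want to check those timings yourself rather than reading them by eye, here is a minimal sketch in Python.  It only assumes the line layout visible in the excerpt above (timestamp, process ID in brackets, log level, message); the names LOG, LINE_RE and parse_log are made up for this illustration and are not part of BOINC or the science app.

```python
import re
from datetime import datetime

# The four stderr lines quoted above, copied verbatim.
LOG = """\
2020-04-26 16:53:22.7633 (9920) [normal]: Finished main analysis.
2020-04-26 16:53:22.7633 (9920) [normal]: Recalculating statistics for the final toplist...
2020-04-26 16:54:39.8643 (9920) [normal]: Finished recalculating toplist statistics.
2020-04-26 16:54:39.8663 (9920) [debug]: Writing output ... toplist2 ... toplist3 ...
"""

# Assumed layout: "<date> <time> (<pid>) [<level>]: <message>"
LINE_RE = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d+) \((\d+)\) \[(\w+)\]: (.*)$"
)


def parse_log(text):
    """Return a list of (timestamp, level, message) for each matching line."""
    entries = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            stamp = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S.%f")
            entries.append((stamp, match.group(3), match.group(4)))
    return entries


entries = parse_log(LOG)

# Time between "Recalculating statistics..." and "Finished recalculating...".
recalc_seconds = (entries[2][0] - entries[1][0]).total_seconds()

# The last thing the app logged before it stopped making progress.
last_message = entries[-1][2]

print(f"Recalculation took {recalc_seconds:.1f} s")
print(f"Last message before the stall: {last_message}")
```

The last message is the useful part: it tells you which stage the task had reached when it stopped, which is how you can tell a stall while writing output from a stall in the analysis itself.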

It's not easy to know the problem in advance of the results being returned.  You just have to remember to try the standard techniques to get past a blockage before resorting to the 'final solution' :-).

Cheers,
Gary.
