Work Unit not finishing

Rebirther
Joined: 4 Jan 05
Posts: 22
Credit: 31576
RAC: 0

H1_0059.9__0060.0_0.1_T11_Test02 is also unfinished.

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4275
Credit: 245488722
RAC: 11557

Just to give you an update on this issue:

1. We have identified the problem and are working on it. A new set of apps fixing this should be available in the next few days.

2. We still appreciate the uploaded process directories (slots). They will help us test these new apps before releasing them.

3. If you see such a "never ending result" apparently staying at 100% for hours, please do the following:

- report it to us, preferably in this thread
- if you can, zip (or tar.gz) the appropriate slots directory (and maybe the projects/einstein directory, too, but that's not so important) and make it available to us
- if you are only running E@H, just be patient - the WU will spend quite some time at the 100% mark, but will eventually finish
- if you are swapping between different projects and have set the app to be removed from memory when suspended, the Result will probably never finish. The reason is that the app does not write any checkpoints during this last stage, always gets suspended before completing it, and thus starts over at the beginning of this phase again and again.
If you have an (experimental) client that allows suspending individual projects (and aborting individual results), suspend all projects except E@H until this Result is finished. You may also abort this individual result if you don't want to affect the other projects, but you will then lose the CPU time spent on it.
If you are using a stock client (4.1x) without this possibility, well, there may be no other way than resetting the E@H project to get this Result out of the way, losing the CPU time you spent on it (and causing another 11MB data file download).
It may help to modify your preferences for a longer swap time and/or to keep the app in memory, but I'm not sure if it helps once the app is in this stage.
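
The restart loop described in the last point can be illustrated with a toy model in Python. This is only a sketch under stated assumptions, not the E@H app or the BOINC client: the function `run_final_stage` and its parameters (`stage_length`, `time_slice`, `checkpoint_every`, `max_slices`) are hypothetical names invented for the illustration, and work is counted in abstract "steps" rather than real CPU time.

```python
def run_final_stage(stage_length, time_slice, checkpoint_every=None, max_slices=1000):
    """Toy model of an app that is removed from memory at each project switch.

    stage_length     -- steps of work needed to complete the final stage (hypothetical unit)
    time_slice       -- steps the app gets before the client swaps to another project
    checkpoint_every -- write a checkpoint every N steps, or None for no checkpoints
    Returns the number of time slices needed, or None if the stage never completes
    within max_slices.
    """
    saved = 0  # progress that survives a suspension (the last checkpoint)
    for slice_no in range(1, max_slices + 1):
        progress = saved  # the app restarts from its last checkpoint
        for _ in range(time_slice):
            progress += 1
            if checkpoint_every and progress % checkpoint_every == 0:
                saved = progress
            if progress >= stage_length:
                return slice_no
        # Suspended and removed from memory here:
        # everything done since the last checkpoint is lost.
    return None


# No checkpoints, slice shorter than the stage: starts over every time.
print(run_final_stage(stage_length=100, time_slice=60))
# Same slice length, but with checkpoints: progress accumulates across slices.
print(run_final_stage(stage_length=100, time_slice=60, checkpoint_every=10))
# No checkpoints, but a slice long enough to cover the whole stage.
print(run_final_stage(stage_length=100, time_slice=120))
```

In this model the stage only completes without checkpoints when a single uninterrupted run is long enough to cover it, which is why running E@H alone, or using a longer swap time, can let the Result finish.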

Thanks a lot for your help!

BM

Rebirther
Joined: 4 Jan 05
Posts: 22
Credit: 31576
RAC: 0

OK Bernd, all my never-ending files are on Blizzard's ftp. The address is in the "Einstein ftp server - anybody?" thread.
Please download the files and then delete them. I hope you can solve the problem :)

littleBouncer
Joined: 22 Jan 05
Posts: 86
Credit: 12206010
RAC: 0

Sorry, I have to describe this in German to be clear!

Yesterday (29.01.05 06:00 UTC) I noticed that the WU "H1_0133.4__0133.7_0.1_T10_Test02_1_0" was only at 9% after 6 hours and showed a remaining time of 49 hours. I stopped the client (suspend/exit), waited about 2 minutes, made sure that no BOINC processes were still running, and restarted the client; I then let it crunch for about 10 minutes and repeated the shutdown and restart of the client once more. After some time the remaining time (originally 49 hours) began to drop rapidly. (It will then always be somewhat higher than "normal", but toward the end it gets closer and closer to the "actual" value.)
I was able to finish crunching this WU this way; "normally" this machine needs 10 hours and 20 minutes for an EAH WU, but for the one mentioned above it needed 14 hours 28 minutes. (For the 125 "claimed" credits I received just 40, which is still lenient, because the result could just as well have been judged "invalid".)
http://einsteinathome.org/workunit/313352 (the second one, with 52000 sec)

You see: how would I have said that in English?
The whole "story" is just for information; no replies needed!

Greetz from Switzerland
littleBouncer

Rebirther
Joined: 4 Jan 05
Posts: 22
Credit: 31576
RAC: 0

Message 876 in response to message 875

> Sorry, I have to describe this in German to be clear!
>
> Yesterday (29.01.05 06:00 UTC) I noticed that the WU
> "H1_0133.4__0133.7_0.1_T10_Test02_1_0" was only at 9% after 6 hours and
> showed a remaining time of 49 hours. I stopped the client (suspend/exit),
> waited about 2 minutes, made sure that no BOINC processes were still running,
> and restarted the client; I then let it crunch for about 10 minutes and
> repeated the shutdown and restart of the client once more. After some time
> the remaining time (originally 49 hours) began to drop rapidly. (It will then
> always be somewhat higher than "normal", but toward the end it gets closer
> and closer to the "actual" value.)
> I was able to finish crunching this WU this way; "normally" this machine
> needs 10 hours and 20 minutes for an EAH WU, but for the one mentioned above
> it needed 14 hours 28 minutes. (For the 125 "claimed" credits I received just
> 40, which is still lenient, because the result could just as well have been
> judged "invalid".)
> http://einsteinathome.org/workunit/313352 (the second one, with 52000 sec)
>
> You see: how would I have said that in English?
> The whole "story" is just for information; no replies needed!
>
> Greetz from Switzerland
> littleBouncer
>
Which version are you using? I haven't seen anything like that yet.

littleBouncer
Joined: 22 Jan 05
Posts: 86
Credit: 12206010
RAC: 0

Message 877 in response to message 876

> Which version are you using? I haven't seen anything like that yet.
>

CC 4.62 (you could also see that in the "stderr out", no criticism intended)
http://einsteinathome.org/workunit/313352
http://einsteinathome.org/task/1017103

greetz littleBouncer

Rebirther
Joined: 4 Jan 05
Posts: 22
Credit: 31576
RAC: 0

Message 878 in response to message 877

> > Which version are you using? I haven't seen anything like that yet.
> >
>
> CC 4.62 (you could also see that in the "stderr out", no criticism intended)
> http://einsteinathome.org/workunit/313352
> http://einsteinathome.org/task/1017103
>
> greetz littleBouncer

What is the difference between 4.62 and 4.19? 4.62 is about 10 MB in size and buggy!
@Honza, OK, no problem, sorry ;)

Honza
Joined: 10 Nov 04
Posts: 136
Credit: 3332354
RAC: 0

Guys, no offence, but those last messages could easily have been written in English, I guess.

[AF>Linux]Arnaud
Joined: 22 Jan 05
Posts: 19
Credit: 27188
RAC: 0

Hi,
Some of my E@H WUs are past their deadline.
I've tried to abort them with CC 4.60 and 4.62, but it didn't work: nothing happened.
I don't know how to erase them "by hand" (sorry, bad English).
Well, I'm going to crunch them without being credited, but I'd like to know where the E@H WUs are in the BOINC folder.
I don't see them in the Project/Einstein.. or in the slot folders.
Or do I have to edit the client_state.xml file because the WUs are hidden in the big 11MB file named H1xxx?
THX
Arnaud

S@NL - EJG
Joined: 18 Jan 05
Posts: 34
Credit: 93500
RAC: 0

At the moment (BOINC 4.62) you can only abort results that are running, not the ones you have yet to start crunching. Just wait until an overdue Einstein WU starts, then abort it. For me that worked. To speed things up you can suspend the other projects. :-)
