Problem resuming Einstein@home... (linux)

jprouane
jprouane
Joined: 26 Feb 05
Posts: 1
Credit: 278244
RAC: 0
Topic 188149

Hi there,
I've been running SETI and ClimatePrediction on linux for some time, and more recently attached the Einstein@home project. While SETI and ClimatePrediction work fine, Einsten@home would not resume normally... In fact, it does 'technically' resume, but it does not compute... Likewise, it does not stop using the usual 1 hour shift so that SETI and ClimatePrediction can't start...

Now if, when this problem occurs, I break (Ctrl+C) and restart , then Einstein@home restarts fine.

Below is a copy from the console where I run BOINC. Looking at it, it seems that there is something wrong with the 'deferring communication' thing...

Any clue ?

2005-02-28 07:01:28 [Einstein@Home] Resuming result H1_0913.9__0914.1_0.1_T00_Test02_2 using einstein version 4.80
2005-02-28 07:01:28 [SETI@home] Started upload of 06ja05aa.3784.5793.86072.151_0_0
2005-02-28 07:01:30 [SETI@home] Finished upload of 06ja05aa.3784.5793.86072.151_0_0
2005-02-28 07:01:30 [SETI@home] Throughput 31478 bytes/sec
2005-02-28 07:29:49 [Einstein@Home] Deferring communication with project for 13 hours, 59 minutes, and 59 seconds
2005-02-28 07:29:49 [Einstein@Home] Deferring communication with project for 13 hours, 59 minutes, and 59 seconds
2005-02-28 08:29:49 [Einstein@Home] Deferring communication with project for 12 hours, 59 minutes, and 59 seconds
2005-02-28 08:29:49 [Einstein@Home] Deferring communication with project for 12 hours, 59 minutes, and 59 seconds
2005-02-28 09:29:49 [Einstein@Home] Deferring communication with project for 11 hours, 59 minutes, and 59 seconds
2005-02-28 09:29:49 [Einstein@Home] Deferring communication with project for 11 hours, 59 minutes, and 59 seconds
2005-02-28 10:29:49 [Einstein@Home] Deferring communication with project for 10 hours, 59 minutes, and 59 seconds
2005-02-28 10:29:49 [Einstein@Home] Deferring communication with project for 10 hours, 59 minutes, and 59 seconds
2005-02-28 11:29:49 [Einstein@Home] Deferring communication with project for 9 hours, 59 minutes, and 59 seconds
2005-02-28 11:29:49 [Einstein@Home] Deferring communication with project for 9 hours, 59 minutes, and 59 seconds
2005-02-28 12:29:49 [Einstein@Home] Deferring communication with project for 8 hours, 59 minutes, and 59 seconds
2005-02-28 12:29:49 [Einstein@Home] Deferring communication with project for 8 hours, 59 minutes, and 59 seconds
2005-02-28 13:29:49 [Einstein@Home] Deferring communication with project for 7 hours, 59 minutes, and 59 seconds
2005-02-28 13:29:49 [Einstein@Home] Deferring communication with project for 7 hours, 59 minutes, and 59 seconds
2005-02-28 14:29:49 [Einstein@Home] Deferring communication with project for 6 hours, 59 minutes, and 59 seconds
2005-02-28 14:29:49 [Einstein@Home] Deferring communication with project for 6 hours, 59 minutes, and 59 seconds
2005-02-28 15:29:49 [Einstein@Home] Deferring communication with project for 5 hours, 59 minutes, and 59 seconds
2005-02-28 15:29:49 [Einstein@Home] Deferring communication with project for 5 hours, 59 minutes, and 59 seconds
2005-02-28 16:29:49 [Einstein@Home] Deferring communication with project for 4 hours, 59 minutes, and 59 seconds
2005-02-28 16:29:49 [Einstein@Home] Deferring communication with project for 4 hours, 59 minutes, and 59 seconds
2005-02-28 17:29:49 [Einstein@Home] Deferring communication with project for 3 hours, 59 minutes, and 59 seconds
2005-02-28 17:29:49 [Einstein@Home] Deferring communication with project for 3 hours, 59 minutes, and 59 seconds
2005-02-28 18:29:49 [Einstein@Home] Deferring communication with project for 2 hours, 59 minutes, and 59 seconds
2005-02-28 18:29:49 [Einstein@Home] Deferring communication with project for 2 hours, 59 minutes, and 59 seconds
2005-02-28 19:29:49 [Einstein@Home] Deferring communication with project for 1 hours, 59 minutes, and 59 seconds
2005-02-28 19:29:49 [Einstein@Home] Deferring communication with project for 1 hours, 59 minutes, and 59 seconds
2005-02-28 20:29:49 [Einstein@Home] Deferring communication with project for 59 minutes and 59 seconds
2005-02-28 20:29:49 [Einstein@Home] Deferring communication with project for 59 minutes and 59 seconds
2005-02-28 21:29:49 [---] May run out of work in 1.00 days; requesting more
2005-02-28 21:29:49 [SETI@home] Requesting 6692 seconds of work
2005-02-28 21:29:49 [SETI@home] Sending request to scheduler: http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
2005-02-28 21:29:59 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2005-02-28 21:29:59 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2005-02-28 21:29:59 [SETI@home] No schedulers responded
2005-02-28 21:29:59 [SETI@home] No schedulers responded
2005-02-28 21:29:59 [Einstein@Home] Sending request to scheduler: http://einstein.phys.uwm.edu/EinsteinAtHome_cgi/cgi
2005-02-28 21:30:00 [Einstein@Home] Scheduler RPC to http://einstein.phys.uwm.edu/EinsteinAtHome_cgi/cgi succeeded
2005-02-28 21:30:00 [Einstein@Home] Project prefs: no separate prefs for home; using your defaults
2005-02-28 21:30:00 [SETI@home] Deferring communication with project for 59 seconds
2005-02-28 21:30:00 [SETI@home] Deferring communication with project for 59 seconds
2005-02-28 22:03:50 [---] Received signal 2
2005-02-28 22:03:51 [---] Exit requested by user
[root@localhost boinc]# ./boinc_4.19_i686-pc-linux-gnu
2005-02-28 22:03:57 [---] Starting BOINC client version 4.19 for i686-pc-linux-gnu
2005-02-28 22:03:57 [SETI@home] Project prefs: no separate prefs for home; using your defaults
2005-02-28 22:03:57 [climateprediction.net] Project prefs: no separate prefs for home; using your defaults
2005-02-28 22:03:57 [Einstein@Home] Project prefs: no separate prefs for home; using your defaults
2005-02-28 22:03:57 [SETI@home] Host ID is 520601
2005-02-28 22:03:57 [climateprediction.net] Host ID is 98933
2005-02-28 22:03:57 [Einstein@Home] Host ID is 49528
2005-02-28 22:03:57 [---] General prefs: from SETI@home (last modified 2005-01-30 00:38:41)
2005-02-28 22:03:57 [---] General prefs: no separate prefs for home; using your defaults
2005-02-28 22:03:57 [climateprediction.net] Deferring computation for result 445g_000213929_0
2005-02-28 22:03:57 [Einstein@Home] Resuming computation for result H1_0913.9__0914.1_0.1_T00_Test02_2 using einstein version 4.80

Bruce Allen
Bruce Allen
Moderator
Joined: 15 Oct 04
Posts: 1125
Credit: 172127663
RAC: 0

Problem resuming Einstein@home... (linux)

> Hi there,
> I've been running SETI and ClimatePrediction on linux for some time, and more
> recently attached the Einstein@home project. While SETI and ClimatePrediction
> work fine, Einsten@home would not resume normally... In fact, it does
> 'technically' resume, but it does not compute... Likewise, it does not stop
> using the usual 1 hour shift so that SETI and ClimatePrediction can't
> start...
>
> Now if, when this problem occurs, I break (Ctrl+C) and restart , then
> Einstein@home restarts fine.
>
> Below is a copy from the console where I run BOINC. Looking at it, it seems
> that there is something wrong with the 'deferring communication' thing...
>
> Any clue ?

I don't think anything is wrong. The communication is being defered because your machine has no results to report, and the scheduler doesn't want to give you more work (yet) because it's afraid that you might not complete it by the deadline.

Why do you think your machine is not doing any computing?

Bruce

Director, Einstein@Home

Alexander Stein
Alexander Stein
Joined: 20 Feb 05
Posts: 1
Credit: 8398827
RAC: 194

Hi, I think there is a

Hi,

I think there is a problem with einstein version 4.80 under linux. I have the same problem running boinc with seti@home and einstein@home. Before I attached to einstein boinc worked well, but now boinc stops a few times a day switching between the two projects although both projects have work to do. This only happens when einstein@home is running. When I then type 'grep fraction_done client_state.xml' from time to time I don't see any changes anymore until I restart boinc.

For the first I will switch off the option 'leave in memory' to restart einstein@home more often.

Alexander

Darren
Darren
Joined: 18 Jan 05
Posts: 94
Credit: 69632
RAC: 0

> Why do you think your

Message 6324 in response to message 6322

> Why do you think your machine is not doing any computing?

I can assure you that 3 of my mandrake systems stop computing when it changes from seti back to einstein. It does not occur with absolutely every project change coming back to einstein, but it happens about 90% of the time. Thus far, though, it has never happened with any other project - only when coming back to einstein. My gentoo system changes back and forth with no problem, and one other mandrake system is currently attached only to einstein.

On my systems that share boinc, I have to constantly ctrl-c and restart after it shifts back to einstein. Shifting from einstein to another project causes no problems at all - only coming back to einstein causes it to stop.

There are a couple ways that I know it isn't running - first and foremost is that my cpu goes idle when it comes back to einstein. Secondly, there are no updates being made to the fstats files in the project slot, even though boinc reports the project is running - the "date modified" notation in nautilus continues to show the time einstein was last really running, before it went to another project and tried to come back to einstein.

Beyond that, on my system with 4.23, the work tab in boincmgr shows the status is running, but the cpu time and wu progress never change. (Note that the problem does not only occur in 4.23 - my other systems that do the same thing are running 4.19 - 4.23 only gives one more way of seeing it.) Also, when it shifts back to einstein it simply stays "assigned" to run einstein from that time forward (I assume this is because the debt is not changing, so it never comes time, debt wise, for another project to start back up).

Below is an excerpt from my log from this morning, with all the seti and pirates futile attempts to get data edited out. As you can see in the log, it switched from einstein to seti at 03:46 then back to einstein at 04:46 - and there it sat until I broke it and restarted it after I got up and saw the cpu usage at 100% idle.

2005-03-05 03:46:40 [Einstein@Home] Pausing result H1_1106.4__1106.5_0.1_T03_Test02_1 (left in memory)
2005-03-05 03:46:40 [SETI@home] Resuming result 06ja05aa.22917.29089.611064.92_2 using setiathome version 4.02
2005-03-05 04:46:40 [Einstein@Home] Resuming result H1_1106.4__1106.5_0.1_T03_Test02_1 using einstein version 4.80
2005-03-05 04:46:40 [SETI@home] Pausing result 06ja05aa.22917.29089.611064.92_2 (left in memory)
2005-03-05 08:57:29 [---] Received signal 2
2005-03-05 08:57:29 [---] Exit requested by user
[darren@platinum BOINC]$ ./run_client
2005-03-05 08:57:32 [---] Starting BOINC client version 4.23 for i686-pc-linux-gnu
2005-03-05 08:57:32 [Einstein@Home] Host location: home
2005-03-05 08:57:32 [Einstein@Home] Using your default project prefs
2005-03-05 08:57:32 [SETI@home] Using your default project prefs
2005-03-05 08:57:32 [ProteinPredictorAtHome] Host location: home
2005-03-05 08:57:32 [ProteinPredictorAtHome] Using your default project prefs
2005-03-05 08:57:32 [Pirates@Home] Host location: home
2005-03-05 08:57:32 [Pirates@Home] Using your default project prefs
2005-03-05 08:57:32 [climateprediction.net] Host location: home
2005-03-05 08:57:32 [climateprediction.net] Using your default project prefs
2005-03-05 08:57:32 [Einstein@Home] Host ID is 44709
2005-03-05 08:57:32 [SETI@home] Host ID is 616484
2005-03-05 08:57:32 [ProteinPredictorAtHome] Host ID is 65714
2005-03-05 08:57:32 [Pirates@Home] Host ID is 8767
2005-03-05 08:57:32 [climateprediction.net] Host ID is 118972
2005-03-05 08:57:32 [---] General prefs: from SETI@home (last modified 2005-02-28 10:48:02)
2005-03-05 08:57:32 [---] General prefs: no separate prefs for home; using your defaults
2005-03-05 08:57:32 [Einstein@Home] Resuming computation for result H1_1106.4__1106.5_0.1_T03_Test02_1 using einstein version 4.80

Darren

Bruce Allen
Bruce Allen
Moderator
Joined: 15 Oct 04
Posts: 1125
Credit: 172127663
RAC: 0

> > Why do you think your

Message 6325 in response to message 6324

> > Why do you think your machine is not doing any computing?
>
> I can assure you that 3 of my mandrake systems stop computing when it changes
> from seti back to einstein. It does not occur with absolutely every project
> change coming back to einstein, but it happens about 90% of the time. Thus
> far, though, it has never happened with any other project - only when coming
> back to einstein. My gentoo system changes back and forth with no problem,
> and one other mandrake system is currently attached only to einstein.

Could you please file a bug report with http://bbugs.axpr.net/

Director, Einstein@Home

Darren
Darren
Joined: 18 Jan 05
Posts: 94
Credit: 69632
RAC: 0

> > Could you please file a

Message 6326 in response to message 6325

>
> Could you please file a bug report with http://bbugs.axpr.net/
>

Done

Hermes
Hermes
Joined: 17 Feb 05
Posts: 4
Credit: 476289
RAC: 0

Any updates on this? This

Any updates on this?

This bug also occurs in Boinv v4.27 and v4.32.
I think it is more severe than a simple Annoyance. It prevents my computer from working on more than 1 Boinc project at the same time without me babysitting it.
When this occurs, einstein resumes eventually after I switch the computer from "Run always" to "Suspend" and back several times.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.