WTF?

Nuadormrac
Nuadormrac
Joined: 9 Feb 05
Posts: 76
Credit: 219184288
RAC: 166940
Topic 188402

Out of work, and got this message, which incidently I have never received before:

3/9/2005 4:00:54 AM|Einstein@Home|Requesting 86400.00 seconds of work
3/9/2005 4:00:54 AM|Einstein@Home|Sending request to scheduler: http://einstein.phys.uwm.edu/EinsteinAtHome_cgi/cgi
3/9/2005 4:00:55 AM|Einstein@Home|Scheduler RPC to http://einstein.phys.uwm.edu/EinsteinAtHome_cgi/cgi succeeded
3/9/2005 4:00:55 AM|Einstein@Home|Message from server: No work sent
3/9/2005 4:00:55 AM|Einstein@Home|Message from server: (daily quota of 3 WU reached)
3/9/2005 4:00:55 AM|Einstein@Home|No work from project
3/9/2005 4:00:55 AM|Einstein@Home|Deferring communication with project for 1 hours, 0 minutes, and 0 seconds

Looking further, it seems some WUs failed claiming insufficient space (totally not true, either in BOINC settings or otherwise), unless they mean on the server, but umm...kinda doubt it

3/9/2005 1:13:08 AM|Einstein@Home|Unrecoverable error for result H1_0849.4__0849.5_0.1_T12_Test02_11 (Not enough storage is available to process this command. (0x8) - exit code 8 (0x8))
3/9/2005 1:13:08 AM|Einstein@Home|Deferring communication with project for 1 minutes and 0 seconds
3/9/2005 1:13:08 AM|Einstein@Home|Computation for result H1_0849.4__0849.5_0.1_T12_Test02 finished
3/9/2005 1:13:08 AM|Einstein@Home|Starting result H1_0849.4__0849.5_0.1_T07_Test02_17 using einstein version 4.79
3/9/2005 1:14:08 AM||May run out of work in 1.00 days; requesting more
3/9/2005 1:14:08 AM|Einstein@Home|Requesting 7163.94 seconds of work
3/9/2005 1:14:08 AM|Einstein@Home|Sending request to scheduler: http://einstein.phys.uwm.edu/EinsteinAtHome_cgi/cgi
3/9/2005 1:14:09 AM|Einstein@Home|Scheduler RPC to http://einstein.phys.uwm.edu/EinsteinAtHome_cgi/cgi succeeded
3/9/2005 1:14:09 AM|Einstein@Home|Message from server: No work sent
3/9/2005 1:14:09 AM|Einstein@Home|Message from server: (daily quota of 4 WU reached)
3/9/2005 1:14:09 AM|Einstein@Home|No work from project
3/9/2005 1:14:09 AM|Einstein@Home|Deferring communication with project for 1 hours, 0 minutes, and 0 seconds
3/9/2005 1:14:24 AM|Einstein@Home|Unrecoverable error for result H1_0849.4__0849.5_0.1_T07_Test02_17 (Not enough storage is available to process this command. (0x8) - exit code 8 (0x8))
3/9/2005 1:14:24 AM|Einstein@Home|Deferring communication with project for 59 minutes and 44 seconds
3/9/2005 1:14:24 AM|Einstein@Home|Computation for result H1_0849.4__0849.5_0.1_T07_Test02 finished

Looking further, it claims insufficient disk space (if it means on my comp or set aside for BOINC, entirely untrue)

http://einsteinathome.org/task/1737194

for one example.

4.22
Not enough storage is available to process this command. (0x8) - exit code 8 (0x8)

Freq index range 1528919->1529461 not in 1528920 to 1530181 (file ....projectseinstein.phys.uwm.eduH1_0849.4 (block 901))


Validate state Invalid
Claimed credit 0.00150156850747507
Granted credit 0
application version 4.79

As to storage here locally, BOINC has 5 GB set aside

Disk and memory usage
Use no more than 5 GB disk space
Leave at least 2 GB disk space free
Use no more than 50% of total disk space
Write to disk at most every 60 seconds
Use no more than 75% of total virtual memory

But only about 32 MB are in use

D:\Program Files\BOINC\projects>dir /s
Volume in drive D has no label.
Volume Serial Number is 482F-F2F9

Directory of D:\Program Files\BOINC\projects

02/09/2005 11:01 PM .
02/09/2005 11:01 PM ..
03/08/2005 11:56 PM einstein.phys.uwm.edu
03/09/2005 03:57 AM predictor1.scripps.edu
03/09/2005 03:07 AM setiathome.berkeley.edu
0 File(s) 0 bytes

Directory of D:\Program Files\BOINC\projects\einstein.phys.uwm.edu

03/08/2005 11:56 PM .
03/08/2005 11:56 PM ..
02/09/2005 11:02 PM 239 Config_Test02
02/09/2005 11:02 PM 2,745,667 earth
02/11/2005 01:19 PM 1,712,128 einstein_4.79_windows_intelx86.exe
02/11/2005 01:20 PM 3,066,880 einstein_4.79_windows_intelx86.pdb
03/07/2005 06:06 PM 12,144,000 H1_0849.4
02/09/2005 11:02 PM 274,843 sun
6 File(s) 19,943,757 bytes

Directory of D:\Program Files\BOINC\projects\predictor1.scripps.edu

03/09/2005 03:57 AM .
03/09/2005 03:57 AM ..
01/17/2005 04:42 PM 1,097,728 mfoldB120_4.21_windows_intelx86.exe
03/02/2005 11:52 PM 1,097,728 mfoldB125_4.24_windows_intelx86.exe
01/27/2005 05:19 AM 122,905 monsster.dat
01/27/2005 05:19 AM 603,113 rebuild.dat
01/27/2005 05:19 AM 3,497,598 scwrl.dat
02/08/2005 02:30 PM 50,184 t0201E_1_92890_1_1
03/08/2005 05:46 PM 1,911 t0214E_1_93557.ini
03/08/2005 05:46 PM 144 t0214E_1_93557.inp
03/08/2005 05:46 PM 3 t0214E_1_93557.res
03/08/2005 05:46 PM 2,420 t0214E_1_93557.seq
03/09/2005 03:57 AM 62 t0214E_1_93557_2_0
03/09/2005 03:57 AM 33 t0214E_1_93557_2_1
03/09/2005 03:57 AM 33 t0214E_1_93557_2_2
03/08/2005 09:22 PM 1,911 t0214E_1_96000.ini
03/08/2005 09:22 PM 144 t0214E_1_96000.inp
03/08/2005 09:22 PM 3 t0214E_1_96000.res
03/08/2005 09:22 PM 2,420 t0214E_1_96000.seq
03/09/2005 01:12 AM 1,911 t0214E_1_98865.ini
03/09/2005 01:12 AM 144 t0214E_1_98865.inp
03/09/2005 01:12 AM 3 t0214E_1_98865.res
03/09/2005 01:12 AM 2,420 t0214E_1_98865.seq
03/09/2005 03:57 AM 1,911 t0214E_1_99732.ini
03/09/2005 03:57 AM 144 t0214E_1_99732.inp
03/09/2005 03:57 AM 3 t0214E_1_99732.res
03/09/2005 03:57 AM 2,420 t0214E_1_99732.seq
25 File(s) 6,487,296 bytes

Directory of D:\Program Files\BOINC\projects\setiathome.berkeley.edu

03/09/2005 03:07 AM .
03/09/2005 03:07 AM ..
03/08/2005 04:56 PM 361,972 07ja05aa.3636.6002.723592.98
03/08/2005 01:17 AM 361,947 07ja05aa.3636.834.23586.229
03/09/2005 03:07 AM 9,810 07ja05aa.3636.834.23586.229_0_0
11/21/2004 12:22 AM 7,446 better_banner.jpg
02/15/2005 04:51 AM 753,664 setiathome_4.09_windows_intelx86.exe
02/15/2005 04:51 AM 4,869,120 setiathome_4.09_windows_intelx86.pdb
6 File(s) 6,363,959 bytes

Total Files Listed:
37 File(s) 32,795,012 bytes
11 Dir(s) 18,026,459,136 bytes free

D:Program FilesBOINCprojects>

This is on a 37.6 GB hard drive, which only has a 1 GB c: drive, and the rest is this partition? Or does it talking about something else, other then hard drive storage? WTF?

Nuadormrac
Nuadormrac
Joined: 9 Feb 05
Posts: 76
Credit: 219184288
RAC: 166940

WTF?

BTW, I just looked at some of those WUs further...and it looks like something is going on server side, which the project admins might want to look into. In any case, my daily quota is still exceeded. What I noticed however, is that EVERYBODY who received these WUs had the same problem...

http://einsteinathome.org/workunit/440067

1598627 47861 3 Mar 2005 20:49:18 UTC 3 Mar 2005 20:50:25 UTC Over Client error Computing 0.92 0.00 ---
1598628 23842 3 Mar 2005 21:32:45 UTC 4 Mar 2005 0:07:56 UTC Over Client error Computing 0.68 0.00 ---
1598629 32178 3 Mar 2005 22:52:17 UTC 4 Mar 2005 12:48:23 UTC Over Client error Computing 0.00 0.00 ---
1598630 33037 4 Mar 2005 0:20:01 UTC 4 Mar 2005 2:52:22 UTC Over Client error Computing 0.55 0.00 ---
1609893 32062 4 Mar 2005 1:27:16 UTC 4 Mar 2005 4:14:34 UTC Over Client error Computing 0.61 0.00 ---
1615046 26608 4 Mar 2005 4:35:28 UTC 4 Mar 2005 5:39:39 UTC Over Client error Computing 0.52 0.00 ---
1619662 13021 4 Mar 2005 7:12:06 UTC 4 Mar 2005 8:13:14 UTC Over Client error Computing 0.52 0.00 ---
1621601 23840 4 Mar 2005 12:07:30 UTC 4 Mar 2005 12:08:47 UTC Over Client error Computing 0.00 0.00 ---
1623405 31985 4 Mar 2005 14:59:08 UTC 4 Mar 2005 20:59:30 UTC Over Client error Computing 0.88 0.00 ---
1627291 64650 6 Mar 2005 21:59:38 UTC 6 Mar 2005 22:00:44 UTC Over Client error Computing 0.47 0.00 ---
1633309 55797 7 Mar 2005 7:46:55 UTC 7 Mar 2005 7:48:19 UTC Over Client error Computing 0.61 0.00 ---
1634358 12051 8 Mar 2005 1:04:07 UTC 9 Mar 2005 7:18:21 UTC Over Client error Computing 0.67 0.00 ---
1647228 27019 8 Mar 2005 3:59:21 UTC 15 Mar 2005 3:59:21 UTC In Progress Unknown New --- --- ---
1720910 69045 9 Mar 2005 0:56:27 UTC 16 Mar 2005 0:56:27 UTC In Progress Unknown New --- --- ---
1737166 --- --- --- Unsent Unknown New --- --- ---
1810734 --- --- --- Unsent Unknown New --- --- ---

http://einsteinathome.org/workunit/427456
http://einsteinathome.org/workunit/434238
http://einsteinathome.org/workunit/458115

all show similarly... Looking at it, there are different CCs, I had 4.22, I saw someone with 4.19, etc... This last one, one person hasn't reported an error yet, but everyone else has...

http://einsteinathome.org/workunit/420482

1511006 31985 28 Feb 2005 22:53:17 UTC 28 Feb 2005 22:54:24 UTC Over Client error Computing 0.00 0.00 ---
1511007 23842 28 Feb 2005 23:09:31 UTC 1 Mar 2005 1:41:50 UTC Over Client error Computing 0.68 0.00 ---
1511008 33037 1 Mar 2005 0:17:58 UTC 1 Mar 2005 2:50:42 UTC Over Client error Computing 0.56 0.00 ---
1511009 26608 1 Mar 2005 0:48:25 UTC 1 Mar 2005 2:57:24 UTC Over Client error Computing 0.53 0.00 ---
1511010 32062 1 Mar 2005 3:04:28 UTC 1 Mar 2005 14:52:10 UTC Over Client error Computing 0.53 0.00 ---
1515491 47861 1 Mar 2005 7:43:54 UTC 1 Mar 2005 7:44:58 UTC Over Client error Computing 0.45 0.00 ---
1516644 23840 2 Mar 2005 2:32:12 UTC 2 Mar 2005 5:33:20 UTC Over Client error Computing 0.00 0.00 ---
1516738 32178 2 Mar 2005 3:20:40 UTC 2 Mar 2005 10:48:36 UTC Over Client error Computing 1.00 0.00 ---
1522847 13021 4 Mar 2005 2:39:03 UTC 4 Mar 2005 4:47:33 UTC Over Client error Computing 0.53 0.00 ---
1532314 60941 5 Mar 2005 7:46:27 UTC 5 Mar 2005 7:46:47 UTC Over Client error Computing 0.53 0.00 ---
1553004 61532 6 Mar 2005 0:00:55 UTC 6 Mar 2005 0:02:00 UTC Over Client error Computing 0.00 0.00 ---
1560300 63345 6 Mar 2005 5:34:53 UTC 6 Mar 2005 5:38:14 UTC Over Client error Computing 0.48 0.00 ---
1622201 63310 6 Mar 2005 21:28:58 UTC 6 Mar 2005 21:30:02 UTC Over Client error Computing 0.00 0.00 ---
1662219 64650 7 Mar 2005 0:00:55 UTC 14 Mar 2005 0:00:55 UTC In Progress Unknown New --- --- ---
1686512 55797 7 Mar 2005 7:48:20 UTC 7 Mar 2005 7:49:25 UTC Over Client error Computing 0.52 0.00 ---
1695408 27019 8 Mar 2005 4:06:31 UTC 8 Mar 2005 4:08:08 UTC Over Client error Computing 0.00 0.00 ---
1719965 69045 8 Mar 2005 23:54:13 UTC 8 Mar 2005 23:55:15 UTC Over Client error Computing 0.23 0.00 ---
1737194 12051 9 Mar 2005 8:11:03 UTC 9 Mar 2005 9:14:03 UTC Over Client error Computing 0.64 0.00 ---
1771748 --- --- --- Unsent Unknown New --- --- ---
1800505 --- --- --- Unsent Unknown New --- --- ---
1813629 --- --- --- Unsent Unknown New --- --- ---

Bruce Allen
Bruce Allen
Moderator
Joined: 15 Oct 04
Posts: 1119
Credit: 172127663
RAC: 0

> BTW, I just looked at some

Message 7701 in response to message 7700

> BTW, I just looked at some of those WUs further...and it looks like something
> is going on server side, which the project admins might want to look into. In
> any case, my daily quota is still exceeded. What I noticed however, is that
> EVERYBODY who received these WUs had the same problem...
>
> http://einsteinathome.org/workunit/440067

Yes, we've found a problem with our application, where it will fail on some WU for the file H1_0849.4 and H1_848,9. We'll be releasing a new app soon that fixes this.

I'm not so concerned about this bug, because these jobs fail as soon as they start. So they don't waste any significant CPU time on your system.

Cheers,
Bruce

Director, Einstein@Home

Nuadormrac
Nuadormrac
Joined: 9 Feb 05
Posts: 76
Credit: 219184288
RAC: 166940

I suppose that after a new

I suppose that after a new app is released, these WUs can be processed then. Actually, at first I was trying to figure out what might have suddenly gone wrong on my machine, until I noticed the results were unsuccesful for practically everyone...

But would they be a way to suspend the sending of these WUs, or make them less frequent...as the limits on WUs one is allowed to download in a day could result in people being unable to get work if one gets several of these WUs one after another?

Bruce Allen
Bruce Allen
Moderator
Joined: 15 Oct 04
Posts: 1119
Credit: 172127663
RAC: 0

> I suppose that after a new

Message 7703 in response to message 7702

> I suppose that after a new app is released, these WUs can be processed then.
> Actually, at first I was trying to figure out what might have suddenly gone
> wrong on my machine, until I noticed the results were unsuccesful for
> practically everyone...
>
> But would they be a way to suspend the sending of these WUs, or make them less
> frequent...as the limits on WUs one is allowed to download in a day could
> result in people being unable to get work if one gets several of these WUs one
> after another?

Good point. We're just about done with testing the new app, so please stand by. We should have this resolved in the next couple of days.

Bruce

Director, Einstein@Home

Nuadormrac
Nuadormrac
Joined: 9 Feb 05
Posts: 76
Credit: 219184288
RAC: 166940

thx BTW, should I and others

thx BTW, should I and others who are having this problem suspend E@H until the new app is released so we can start crunching when it's released? Or will those of us who ran into this problem be given somewhat of a concession (as it wasn't our computer in particular which had gone awry) so we can d/l the new app and WUs, or what should we do in the mean time?

Paul D. Buck
Paul D. Buck
Joined: 17 Jan 05
Posts: 754
Credit: 5385205
RAC: 0

Dipping my oar in where it

Dipping my oar in where it was not requested ...

As Bruce said, the WU fail, and they fail pretty early so that this is not a signivicant issue. So, to answer the question, just keep connected to the project and all will be well.

We are detecting issues with both the BOINC Manager and with the science applications at a good clip, this should be seen as a good thing ...

Just a short year ago, we were testing in the BOINC Beta test and my lord what problems ...

Nuadormrac
Nuadormrac
Joined: 9 Feb 05
Posts: 76
Credit: 219184288
RAC: 166940

OK, this is what I'm asking.

OK, this is what I'm asking. Yes I know they're failing early, and yes that's a good thing. But there are quoatas in place to prevent one from continuing to d/l WUs when one isn't able to process a given WU. This applies regardless of whether the reason is something up with the person's computer or the science app itself.

BTW, the quota that my computer is allowed to recieve has been decreased from 3 WUs a day, to just one, and it *might* have been higher then 3 at the beginning of this, but it was never 1 hitherto. Does seem to be falling. Every WU I've gotten over the past day and a half or so, has been one of these, and the servers are penalizing my comp for it... Yes it fails early, but it's causing a secondary problem with the servers quotas...

I'm also asking about suspending temporarily (something that BOINC CC 4.25 (and also every version of the dev branch which I used since 4.62) has, not detatching. It's also something I did when SETI's servers went down so my comp wasn't constantly sending out requests for WUs when it needed, and I knew there was no server (according to their status page) to receive the requests...

In essence, should I temporarily suspend, so that when a note is up that the new app is out, I can connect, receive the new science app (fix to this given problem), and get back to processing? Or when it's out will something be done server side, so those of us who are being denied WU d/ls because of this failure, can d/l the fixed app (aka the fix to the underlying problem) and continue from there? As it stands now, the server is decreasing my daily quota, and refusing to send work in an increasing fashion, as I'm gathering it is setup to do, for cases where the user's computer has gone bad (not exactly the cause in this case)...

Nuadormrac
Nuadormrac
Joined: 9 Feb 05
Posts: 76
Credit: 219184288
RAC: 166940

Anyone else running into

Anyone else running into this, a possible fix till the new app gets out:

- Stop BOINC
- clear out the E@H projects foulder (including the dat file) (btw, I did a project reset before doing this, it was suspended if that would make a difference)
- Reconnect, one should get a new series of WUs...

Comp crunching H1_093... without incident now...

lysdexia
lysdexia
Joined: 9 Mar 05
Posts: 97
Credit: 17013
RAC: 0

is set up folder

is set up
folder


"My other computer is a virus farm."

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.