Transient HTTP error (again)!

butts
Joined: 12 Dec 05
Posts: 9
Credit: 3977789
RAC: 0
Topic 196515

Please help. The following is the output that I'm getting (again):

05/09/2012 23:38:50 | | No config file found - using defaults
05/09/2012 23:38:51 | | Starting BOINC client version 7.0.28 for windows_intelx86
05/09/2012 23:38:51 | | log flags: file_xfer, sched_ops, task
05/09/2012 23:38:51 | | Libraries: libcurl/7.25.0 OpenSSL/1.0.1 zlib/1.2.6
05/09/2012 23:38:51 | | Data directory: I:\Documents and Settings\All Users\Application Data\BOINC
05/09/2012 23:38:51 | | Running under account butts
05/09/2012 23:38:51 | | Processor: 1 GenuineIntel Intel(R) Pentium(R) 4 CPU 2.40GHz [Family 15 Model 2 Stepping 9]
05/09/2012 23:38:51 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pbe
05/09/2012 23:38:51 | | OS: Microsoft Windows XP: Professional x86 Edition, Service Pack 3, (05.01.2600.00)
05/09/2012 23:38:51 | | Memory: 1.50 GB physical, 3.23 GB virtual
05/09/2012 23:38:51 | | Disk: 114.46 GB total, 91.70 GB free
05/09/2012 23:38:51 | | Local time is UTC +1 hours
05/09/2012 23:38:51 | | No usable GPUs found
05/09/2012 23:38:51 | Einstein@Home | URL http://einstein.phys.uwm.edu/; Computer ID 5428593; resource share 100
05/09/2012 23:38:51 | SETI@home | URL http://setiathome.berkeley.edu/; Computer ID 6691683; resource share 100
05/09/2012 23:38:51 | | General prefs: from http://climateprediction.net/ (last modified 12-Dec-2011 12:29:54)
05/09/2012 23:38:51 | | Host location: none
05/09/2012 23:38:51 | | General prefs: using your defaults
05/09/2012 23:38:51 | | Preferences:
05/09/2012 23:38:51 | | max memory usage when active: 767.65MB
05/09/2012 23:38:51 | | max memory usage when idle: 1381.77MB
05/09/2012 23:38:51 | | max disk usage: 57.23GB
05/09/2012 23:38:51 | | don't use GPU while active
05/09/2012 23:38:51 | | suspend work if non-BOINC CPU load exceeds 25 %
05/09/2012 23:38:51 | | max download rate: 1024000 bytes/sec
05/09/2012 23:38:51 | | max upload rate: 128000 bytes/sec
05/09/2012 23:38:51 | | (to change preferences, visit the web site of an attached project, or select Preferences in the Manager)
05/09/2012 23:38:51 | | Not using a proxy
05/09/2012 23:38:52 | Einstein@Home | Started upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_6
05/09/2012 23:38:52 | Einstein@Home | Started upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_7
05/09/2012 23:38:52 | Einstein@Home | Restarting task LATeah0850Z_416.0_320_0.0_0 using hsgamma_FGRP1 version 30 in slot 1
05/09/2012 23:39:05 | Einstein@Home | Temporarily failed upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_6: transient HTTP error
05/09/2012 23:39:05 | Einstein@Home | Backing off 7 min 50 sec on upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_6
05/09/2012 23:39:05 | Einstein@Home | Temporarily failed upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_7: transient HTTP error
05/09/2012 23:39:05 | Einstein@Home | Backing off 6 min 50 sec on upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_7
05/09/2012 23:40:51 | Einstein@Home | Started upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_5
05/09/2012 23:41:04 | Einstein@Home | Temporarily failed upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_5: transient HTTP error
05/09/2012 23:41:04 | Einstein@Home | Backing off 9 min 33 sec on upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_5
05/09/2012 23:52:35 | Einstein@Home | Started upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_4
05/09/2012 23:52:35 | Einstein@Home | Started upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_5
05/09/2012 23:52:49 | Einstein@Home | Temporarily failed upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_4: transient HTTP error
05/09/2012 23:52:49 | Einstein@Home | Backing off 8 min 49 sec on upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_4
05/09/2012 23:52:49 | Einstein@Home | Temporarily failed upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_5: transient HTTP error
05/09/2012 23:52:49 | Einstein@Home | Backing off 25 min 2 sec on upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_5

This has been going on now for over a week. Rebooted machine, no difference. All other uploads/downloads are working OK. Is there a problem with the receiving server?
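
For anyone wanting to see at a glance which files keep failing, a log like the one above can be tallied with a short script. This is only a rough sketch: the regular expression is an assumption based on the message format shown in this post, and `count_failed_uploads` and `SAMPLE_LOG` are names made up for the illustration, not part of BOINC.

```python
import re
from collections import Counter

# Assumed line format, taken from the log excerpt in this thread:
# <date> <time> | <project> | Temporarily failed upload of <file>: <reason>
FAILED_UPLOAD = re.compile(
    r"\|\s*(?P<project>[^|]+?)\s*\|\s*"
    r"Temporarily failed upload of (?P<file>\S+): (?P<reason>.+)$"
)

# A few lines copied from the log above, used as sample input.
SAMPLE_LOG = """\
05/09/2012 23:39:05 | Einstein@Home | Temporarily failed upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_6: transient HTTP error
05/09/2012 23:39:05 | Einstein@Home | Temporarily failed upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_7: transient HTTP error
05/09/2012 23:41:04 | Einstein@Home | Temporarily failed upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_5: transient HTTP error
05/09/2012 23:52:49 | Einstein@Home | Temporarily failed upload of p2030.20110117.G195.42-00.85.S.b3s0g0.00000_2576_1_5: transient HTTP error
"""


def count_failed_uploads(log_text):
    """Map (project, file name) to the number of failed upload attempts."""
    failures = Counter()
    for line in log_text.splitlines():
        match = FAILED_UPLOAD.search(line)
        if match:
            failures[(match.group("project"), match.group("file"))] += 1
    return failures


if __name__ == "__main__":
    for (project, name), count in count_failed_uploads(SAMPLE_LOG).most_common():
        print(f"{count} failed attempt(s): {project} {name}")
```

If the same few files show up again and again while other uploads go through, that points to something specific to those files rather than a general connection outage.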

_______
cheers,
Padraig.

Khangollo
Joined: 17 Feb 11
Posts: 42
Credit: 928047659
RAC: 0

I was getting similar errors at work when I was on a network with a bad/misconfigured transparent HTTP proxy.
I solved the problem by adding one option to BOINC's configuration file.
Create a file named cc_config.xml (Notepad is OK) in BOINC's data directory (I:\Documents and Settings\All Users\Application Data\BOINC) with the following contents:

<cc_config>
  <options>
    <http_1_0>1</http_1_0>
  </options>
</cc_config>

If you already have a cc_config.xml file, just add <http_1_0>1</http_1_0> to the <options> block. Then restart BOINC.
Maybe, just maybe, that'll help.
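
If you would rather not edit the file by hand, something like the following could add the option while keeping whatever is already in cc_config.xml. It is a minimal sketch, assuming the usual layout of a cc_config root element with an options block inside it; `set_cc_config_option` is a helper name invented here, and the demo writes to a temporary directory rather than the real BOINC data directory.

```python
import os
import tempfile
import xml.etree.ElementTree as ET


def set_cc_config_option(path, name, value):
    """Set <name>value</name> inside <options>, creating the file if needed."""
    if os.path.exists(path):
        tree = ET.parse(path)
        root = tree.getroot()
        if root.tag != "cc_config":
            raise ValueError(f"unexpected root element: {root.tag}")
    else:
        root = ET.Element("cc_config")
        tree = ET.ElementTree(root)

    # Reuse the existing <options> block if there is one.
    options = root.find("options")
    if options is None:
        options = ET.SubElement(root, "options")

    # Overwrite the option if present, otherwise append it.
    option = options.find(name)
    if option is None:
        option = ET.SubElement(options, name)
    option.text = str(value)

    tree.write(path)
    return path


if __name__ == "__main__":
    # Demo in a scratch directory; point `path` at the real data directory
    # only once you are happy with the result.
    demo_path = os.path.join(tempfile.mkdtemp(), "cc_config.xml")
    set_cc_config_option(demo_path, "http_1_0", 1)
    with open(demo_path) as handle:
        print(handle.read())
```

BOINC still needs a restart afterwards to pick up the change.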


Bikeman (Heinz-Bernd Eggenstein)
Moderator
Joined: 28 Aug 06
Posts: 3522
Credit: 692163872
RAC: 1552

Quote:

This has been going on now for over a week. Rebooted machine, no difference. All other uploads/downloads are working OK. Is there a problem with the receiving server?

Currently one of our servers is indeed almost saturated because of the high throughput of BRP4 results (which is a good thing).

The crucial point is whether the failures are just temporary or persistent at your end. BOINC will try again and again to download and upload files when needed, so if there are some failures from time to time, there is nothing for you to worry about: no work is lost, and eventually you get new work.

So you should perhaps check the "Transfers" View in BOINCManager: if the network problems are just a temporary thing, most of the time this view should be empty (no pending uploads or downloads). If files are piling up there over days, this would be a real problem.

We are working on expanding our resources at the AEI to cope with the increased data volume. Hopefully we can add a server next week.

Sorry for the inconvenience,

Cheers
HB

butts
Joined: 12 Dec 05
Posts: 9
Credit: 3977789
RAC: 0

Well, it appears that the new server has been added and is up and running, but I still have this upload problem for this set of files.

I did add a cc_config.xml file to set the communications protocol to http_1_0 but it did not make any difference.

The problem is persistent for the files in this workunit but other workunits for Einstein and Seti are not affected.

The new server only went online today, so perhaps there is a 'backlog' or something. The deadline for this workunit is tomorrow and it would be a pity to lose nearly 60 hours of computing time.

Padraig.

_______
cheers,
Padraig.

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4289
Credit: 245960431
RAC: 10443

FWIW this is certainly unrelated to the download server problems we had recently.

BM

butts
Joined: 12 Dec 05
Posts: 9
Credit: 3977789
RAC: 0

No, not a download problem, an upload problem!!
The workunit is a BRPSSE workunit. The GWS6 workunits are uploaded just fine when complete, but not these BRPSSE workunits. This is the second time this problem has happened recently, i.e. within the last 4 months or so. 120 hours of computing time down the drain! No previous problems.

_______
cheers,
Padraig.

Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5867
Credit: 112263953791
RAC: 35815773

Quote:
... an upload problem!!


I notice the following from the log snippet in your original message:-

05/09/2012 23:38:51 | | Preferences:
...
05/09/2012 23:38:51 | | suspend work if non-BOINC CPU load exceeds 25 %
05/09/2012 23:38:51 | | max download rate: 1024000 bytes/sec
05/09/2012 23:38:51 | | max upload rate: 128000 bytes/sec


There shouldn't be any problem with these but just for grins, could you set these three to unlimited values temporarily to see if it makes any difference whatsoever?

Your computers are hidden but I used the hostID from your log snippet to find some details. I was surprised to see that the details page shows an average upload rate of just 1.29 KB/sec. That seems rather low and maybe that is what is preventing your upload from succeeding?

Do you have a friend or relative with a different internet connection? Is it possible to attach to a different connection temporarily to see if the upload can then succeed? That should be a fairly definitive test.

Quote:
... 120 hours of computing time down the drain! No previous problems


A 2.4GHz P4 is probably unsuited to the stresses of the BRP4 app. Even so, it shouldn't take anything like 120 hours CPU time when FGRP tasks are taking around 24 hours on your machine. There seems to be something wrong there.

EDIT: The crunching won't be wasted if you can quickly hook up to a different connection and get the result uploaded. The task is past deadline but (when I looked) a replacement hadn't yet been sent. You still have time!

Cheers,
Gary.

butts
Joined: 12 Dec 05
Posts: 9
Credit: 3977789
RAC: 0

I did as Gary suggested and set the upload, download and CPU values to unlimited but it has made no difference.

The 120 hours (overestimated) is for 2 BRP workunits which failed to upload. I have stopped BRP workunits from being sent.

The low upload rate is due to what is laughingly called 'broadband' in rural Ireland. That's not going to improve anytime soon. It is adequate for other Einstein work, SETI, abc@home and climate prediction. All other completed work gets uploaded.

I'm still back to my original thought that the receiving server may have a problem?

Deadline is well past now!!

cheers,
Padraig.

_______
cheers,
Padraig.

Claggy
Joined: 29 Dec 06
Posts: 560
Credit: 2699403
RAC: 53

You could try configuring BOINC not to time out uploads & downloads too quickly, using the following parameters:

Quote:

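<http_transfer_timeout>seconds</http_transfer_timeout> abort HTTP transfers if idle for this many seconds; default 300. New in 6.12.27

<http_transfer_timeout_bps>bps</http_transfer_timeout_bps> an HTTP transfer is considered idle if its transfer rate is below this many bits per second. New in 6.12.27


<http_transfer_timeout>600</http_transfer_timeout>
<http_transfer_timeout_bps>1</http_transfer_timeout_bps>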

I've doubled the HTTP timeout to 600 seconds and set the idle threshold to 1 bps (I don't know the default); you could probably try 0 too.
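
To make it clearer how the two settings interact, here is a rough sketch of the rule as the quoted description reads: a transfer counts as idle while its rate is below the bps threshold, and it is aborted once it has been idle for the timeout. This is one reading of that description, not BOINC's actual code; `should_abort` and the one-sample-per-second input are assumptions made for the illustration. The default of 300 seconds comes from the quote above, and the 10 bps default is the figure reported further down this thread.

```python
def should_abort(rates_bps, timeout_s=300, min_bps=10):
    """Decide whether a transfer would be aborted as idle.

    rates_bps: transfer-rate samples in bits per second, one per second.
    Returns True if the rate stays below min_bps for timeout_s seconds in a row.
    """
    idle_seconds = 0
    for rate in rates_bps:
        if rate < min_bps:
            idle_seconds += 1
            if idle_seconds >= timeout_s:
                return True
        else:
            # Any second at or above the threshold resets the idle clock.
            idle_seconds = 0
    return False


if __name__ == "__main__":
    stalled = [0] * 400  # a connection that moves nothing for 400 seconds
    print(should_abort(stalled))                            # assumed default settings
    print(should_abort(stalled, timeout_s=600, min_bps=1))  # the settings suggested above
```

Under this reading, a slow but steadily moving upload never counts as idle, so raising the timeout only helps when the connection stalls completely for minutes at a time.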

Client configuration

Claggy

Jord
Joined: 26 Jan 05
Posts: 2952
Credit: 5779100
RAC: 0

Quote:
and set the timeout for bps to 1bps (i don't know the default)


Default is 10 bps.
