Anybody else having issues with getting some uploads unable to get an ack from the servers?
I have several tasks, a couple on each host that can't get an ack from the server after they have finished their uploads at 100%.
On the 7th retry with large backoffs. In the meantime every other finished task is getting uploaded and acknowledged normally.
http_xfer_debug is not that helpful and only showing transient errors for the stuck tasks.
Sun 29 Nov 2020 12:11:42 AM PST | Einstein@Home | Started upload of LATeah1066L30_500.0_0_0.0_33517050_1_1
Sun 29 Nov 2020 12:11:42 AM PST | Einstein@Home | Started upload of LATeah1066L30_508.0_0_0.0_1497258_0_0
Sun 29 Nov 2020 12:11:42 AM PST | Einstein@Home | Started upload of LATeah1066L30_508.0_0_0.0_1497258_0_1
Sun 29 Nov 2020 12:11:45 AM PST | | Project communication failed: attempting access to reference site
Sun 29 Nov 2020 12:11:45 AM PST | Einstein@Home | Temporarily failed upload of LATeah1066L30_500.0_0_0.0_33517050_1_1: transient HTTP error
Sun 29 Nov 2020 12:11:45 AM PST | Einstein@Home | Backing off 04:57:41 on upload of LATeah1066L30_500.0_0_0.0_33517050_1_1
Sun 29 Nov 2020 12:11:45 AM PST | Einstein@Home | Temporarily failed upload of LATeah1066L30_508.0_0_0.0_1497258_0_0: transient HTTP error
Sun 29 Nov 2020 12:11:45 AM PST | Einstein@Home | Backing off 00:06:52 on upload of LATeah1066L30_508.0_0_0.0_1497258_0_0
Sun 29 Nov 2020 12:11:45 AM PST | Einstein@Home | Temporarily failed upload of LATeah1066L30_508.0_0_0.0_1497258_0_1: transient HTTP error
Sun 29 Nov 2020 12:11:45 AM PST | Einstein@Home | Backing off 00:06:35 on upload of LATeah1066L30_508.0_0_0.0_1497258_0_1
Sun 29 Nov 2020 12:11:48 AM PST | | [http_xfer] [ID#0] HTTP: wrote 1804 bytes
Sun 29 Nov 2020 12:11:48 AM PST | | [http_xfer] [ID#0] HTTP: wrote 2584 bytes
Sun 29 Nov 2020 12:11:48 AM PST | | [http_xfer] [ID#0] HTTP: wrote 3091 bytes
Sun 29 Nov 2020 12:11:48 AM PST | | [http_xfer] [ID#0] HTTP: wrote 2996 bytes
Sun 29 Nov 2020 12:11:48 AM PST | | [http_xfer] [ID#0] HTTP: wrote 1561 bytes
Sun 29 Nov 2020 12:11:49 AM PST | | [http_xfer] [ID#0] HTTP: wrote 201 bytes
Sun 29 Nov 2020 12:11:49 AM PST | | [http_xfer] [ID#0] HTTP: wrote 1499 bytes
Sun 29 Nov 2020 12:11:49 AM PST | | Internet access OK - project servers may be temporarily down.
Weird, as I said all other uploads for Einstein and my other projects are fine. Anybody else having issues?
Copyright © 2024 Einstein@Home. All rights reserved.
Yes, I have a few of those as
)
Yes, I have a few of those as well on my two hosts (2 & 4 uploads hanging). They seem to go eventually as it is only the latest finished tasks that are hanging. As I am typing the other host succeeded in the file transfer but the other got two more.
On first look this morning
)
On first look this morning all three of my machines showed a small number of tasks in uploading status. The biggest backoff was three hours with retry counts as high as 5. But the small number of tasks indicated that, as you specified, these were just a few of the tasks. Then, as I was typing, most cleared, and with a forced retry, all cleared.
Perhaps the actual problem at the server end is now fixed?
Forced retries just increased
)
Forced retries just increased the backoffs. This morning, looks like all the uploads have cleared on both hosts.
Great!! . . . . another
)
Great!! . . . . another project with stalled out, backed off, uploads.
... that's "standard
)
... that's "standard procedure" over the weekend ...
Usually happening Satuday night/Sunday morning, depending on where you are situated.
Have a nice Sunday!
Same here for the last
)
Same here for the last several hours.
There is a new post under
)
There is a new post under "technical news" from Bernd about this "problem".
Get used to long backoffs and
)
Get used to long backoffs and upload issues here.
GPUGrid is out of work and I know many people run Einstein as their failover, 0 resource backup project when GPUGrid has issues.
Pages of stalled and backed off Einstein GR uploads on all my hosts.
no problems if you run
)
no problems if you run Gravitational Wave tasks
_________________________________________________________________________
Ian&Steve C. wrote:no
)
I am running GW CPU tasks. Are you saying that the GW GPU tasks also don't have an upload issue either?
I have 100's of GR GPU tasks unable to upload.
Tom M
edit>> ps. Just created a 0 resource profile of short tasks for GPUs in PrimeGrid. My GPUs are busy again.
A Proud member of the O.F.A. (Old Farts Association). Be well, do good work, and keep in touch.® (Garrison Keillor) I want some more patience. RIGHT NOW!