Not sending work because client is outdated

Horacio

Joined: 3 Oct 11

Posts: 205

Credit: 80557243

RAC: 0

17 May 2012 15:16:20 UTC

Topic 196331

(moderation:

)

Ive seen that message in the rpc logs...

Quote:

2012-05-17 13:49:21.9722 [PID=11301] [send] CPU: req 442.88 sec, 0.00 instances; est delay 0.00
...
2012-05-17 13:49:22.0310 [PID=11301] outdated client version 61060 < min core version 70027
...
2012-05-17 13:49:22.0351 [PID=11301] Not sending work because client is outdated
...
2012-05-17 13:49:22.0370 [PID=11301] Sending reply to [HOST#4232972]: 0 results, delay req 86400.00

I really dont want to upgrade, because the newer versions of Boinc have too long backoffs and as the ISPs on my city are rather bad it makes almost impossible to keep the hosts feeded... (It is worst for other projects but it happens even with Einstein that has really good download rates)...

I dont have any ATI GPU, so there is no need for the OpenCL support... Is there any other good reason to upgrade Boinc, besides the long delays between requests?

(By the way, AFAIK, 7.0.27 is still beta, requiring it as minimun version is a bit annoying... It is supossed that you have to upgrade it as soon as there is a new release... forcing every donor to do that is a bit too much...)

Richard Haselgrove

Joined: 10 Dec 05

Posts: 2143

Credit: 2954966585

RAC: 715530

Not sending work because client is outdated

17 May 2012 15:57:37 UTC

Message 109579

(moderation:

)

On the other hand, I got

Quote:

2012-05-17 15:32:28.7011 [PID=17251] Request: [USER#xxxxx] [HOST#3674581] [IP xxx.xxx.xxx.133] client 5.10.13
...
2012-05-17 15:32:28.7072 [PID=17251] [send] work_req_seconds: 278.62 secs
...
2012-05-17 15:32:28.7227 [PID=17251] [send] [HOST#3674581] Sending app_version hsgamma_FGRP1 2 23 ; 2.44 GFLOPS
2012-05-17 15:32:29.9553 [PID=17251] [send] est. duration for WU 123395878: unscaled 24595.34 scaled 27516.67
2012-05-17 15:32:29.9553 [PID=17251] [HOST#3674581] Sending [RESULT#288860872 LATeah2478A_96.0_27300_-1.9e-11_1] (est. dur. 27516.67 seconds)

which would imply that the v7.0.27 requirement doesn't apply in all cases. One would expect it to apply to ATI requests only, but yours says CPU. We might need to see a longer log extract to work out what's going on.

Gavin

Joined: 21 Sep 10

Posts: 191

Credit: 40644337738

RAC: 2

I have also been seeing this

17 May 2012 18:09:48 UTC

Message 109580

(moderation:

)

I have also been seeing this behaviour over the 2 to 3 few days or so.
Here is the full message log from the last time one of my hosts contacted the server:

2012-05-17 16:45:12.8508 [PID=22609] 2012-05-17 16:45:12.8546 [PID=22609] 2012-05-17 16:45:12.8546 [PID=22609] 2012-05-17 16:45:12.8546 [PID=22609] 2012-05-17 16:45:12.8546 [PID=22609] 2012-05-17 16:45:12.8546 [PID=22609] 2012-05-17 16:45:12.8546 [PID=22609] 2012-05-17 16:45:12.8546 [PID=22609] 2012-05-17 16:45:12.8546 [PID=22609] 2012-05-17 16:45:12.8595 [PID=22609] 2012-05-17 16:45:12.8596 [PID=22609] 2012-05-17 16:45:12.8741 [PID=22609] 2012-05-17 16:45:12.8746 [PID=22609] 2012-05-17 16:45:12.8746 [PID=22609] 2012-05-17 16:45:12.8747 [PID=22609] 2012-05-17 16:45:12.8747 [PID=22609] 2012-05-17 16:45:12.8747 [PID=22609] 2012-05-17 16:45:12.8747 [PID=22609] 2012-05-17 16:45:12.8747 [PID=22609] 2012-05-17 16:45:12.8747 [PID=22609] 2012-05-17 16:45:12.8747 [PID=22609] 2012-05-17 16:45:12.8747 [PID=22609] 2012-05-17 16:45:12.8748 [PID=22609] 2012-05-17 16:45:12.8797 [PID=22609] 2012-05-17 16:45:12.8798 [PID=22609] 2012-05-17 16:45:12.8798 [PID=22609] 2012-05-17 16:45:12.8798 [PID=22609] 2012-05-17 16:45:12.8816 [PID=22609] [debug] 2012-05-17 16:45:12.8816 [PID=22609] [debug] 2012-05-17 16:45:12.8816 [PID=22609] [debug] 2012-05-17 16:45:12.8816 [PID=22609] [debug] 2012-05-17 16:45:12.8816 [PID=22609] [debug] 2012-05-17 16:45:12.8816 [PID=22609] [debug] 2012-05-17 16:45:12.8817 [PID=22609] 2012-05-17 16:45:12.8819 [PID=22609] Request: [USER#xxxxx] [HOST#4876288] [IP xxx.xxx.xxx.133] client 6.10.58
[send] effective_ncpus 2 max_jobs_on_host_cpu 999999 max_jobs_on_host 999999
[send] effective_ngpus 1 max_jobs_on_host_gpu 999999
[send] Not using matchmaker scheduling; Not using EDF sim
[send] CPU: req 66.87 sec, 0.00 instances; est delay 0.00
[send] CUDA: req 0.00 sec, 0.00 instances; est delay 0.00
[send] work_req_seconds: 66.87 secs
[send] available disk 74.08 GB, work_buf_min 0
[send] active_frac 0.999881 on_frac 0.959359 DCF 1.503522
[send] [HOST#4876288] is reliable
[send] set_trust: random choice for error rate 0.000010: yes
[version] Checking plan class 'BRP4SSE'
[version] reading plan classes from file '../plan_class_spec.xml'
[version] parsed project prefs setting 'also_run_cpu' : true : 1.000000
[version] project prefs setting 'also_run_cpu' (1.000000) prevents using plan class.
outdated client version 61058 < min core version 70027
[version] Checking plan class 'BRP4cuda32'
[version] parsed project prefs setting 'gpu_util_brp' : true : 1.000000
[version] Don't need CUDA jobs, skipping version 125 for einsteinbinary_BRP4 (BRP4cuda32)
[version] Checking plan class 'BRP4cuda32nv301'
[version] parsed project prefs setting 'gpu_util_brp' : true : 1.000000
[version] driver version required min: -30100, supplied: 28558
[version] no app version available: APP#19 (einsteinbinary_BRP4) PLATFORM#2 (windows_intelx86) min_version 0
[send] stopping work search - no locality app selected
[send] stopping work search - no locality app selected
[send] stopping work search - no locality app selected
Not sending work because client is outdated
[HOST#4876288] MSG(high) No work sent
[HOST#4876288] MSG(high) No work is available for Gravitational Wave S6 GC search
[HOST#4876288] MSG(high) No work is available for Gravitational Wave S6 LineVeto search
[HOST#4876288] MSG(high) see scheduler log messages on http://einstein.phys.uwm.edu//host_sched_logs/4876/4876288
[HOST#4876288] MSG(high) No work available for the applications you have selected. Please check your preferences on the web site.
[HOST#4876288] MSG(high) (your BOINC client is old - please install current version)
Sending reply to [HOST#4876288]: 0 results, delay req 86400.00
Scheduler ran 0.037 seconds

I may be wrong, but from my observation these messages started to appear with the release of the AMD/ATI app. which shouldn't affect me as all my rigs have Nvidia cards.

All I can say, for sure, is that letting your work queue run down a bit then manually reporting the completed tasks gets you more work as would normally happen.

Also, from my observations, the 24 hour project backoff does not appear to happen until the host later (either automatically or manualy) contacts the server with 0 results to report as is shown in the above log.

Gavin.

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4312

Credit: 250378328

RAC: 35060

This looks like a serious bug

17 May 2012 18:40:37 UTC

Message 109581

(moderation:

)

This looks like a serious bug in the server code. Apparently after reading the plan class atiOpenCL (which isn't even logged) the minimum client version isn't reset. There is no Client version limit on CPU Apps.

This is a long weekend in Germany, and I'm not sure I'll come to fix it before Monday.

Horacio

Joined: 3 Oct 11

Posts: 205

Credit: 80557243

RAC: 0

RE: On the other hand, I

17 May 2012 18:42:51 UTC

Message 109582 in response to message 109579

(moderation:

)

Quote:

On the other hand, I got

Quote:
2012-05-17 15:32:28.7011 [PID=17251] Request: [USER#xxxxx] [HOST#3674581] [IP xxx.xxx.xxx.133] client 5.10.13
...
2012-05-17 15:32:28.7072 [PID=17251] [send] work_req_seconds: 278.62 secs
...
2012-05-17 15:32:28.7227 [PID=17251] [send] [HOST#3674581] Sending app_version hsgamma_FGRP1 2 23 ; 2.44 GFLOPS
2012-05-17 15:32:29.9553 [PID=17251] [send] est. duration for WU 123395878: unscaled 24595.34 scaled 27516.67
2012-05-17 15:32:29.9553 [PID=17251] [HOST#3674581] Sending [RESULT#288860872 LATeah2478A_96.0_27300_-1.9e-11_1] (est. dur. 27516.67 seconds)

which would imply that the v7.0.27 requirement doesn't apply in all cases. One would expect it to apply to ATI requests only, but yours says CPU. We might need to see a longer log extract to work out what's going on.

I dont have (and never had) an ATI GPU... The log was as the one Gavin posted, it asked just for CPU and not for GPU.

What Ive seen is that BOINC makes a couple of requests as ussual and then it just gets this long delay with that message. It happened on all my hosts and Ive not changed anything on them.
May be is as Gavin said that it happens when there is no tasks reported... I dont know...
And, yes, if I do a manual update it works even if Im still on delayed time...

EDIT: Thanks Bernd... If it is not intended, then there is not issue waiting until it can be fixed.

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4312

Credit: 250378328

RAC: 35060

This is another oddity of the

17 May 2012 18:47:38 UTC

Message 109583

(moderation:

)

This is another oddity of the "mixed (locality and non-locality) scheduling". If you happen to catch the locality scheduler first (which should happen in 60% of all cases) you are more likely to get CPU work.

inomyabcs

Joined: 11 Feb 05

Posts: 6

Credit: 16488057

RAC: 0

I had this same problem and I

19 May 2012 1:11:25 UTC

Message 109584

(moderation:

)

I had this same problem and I am not running any ATI GPUs at the moment. I did have the E@H preference for use an ATI GPU if available selected. I turned it off and was able to connect and download CPU/cuda GPU workunits again. Before, I had to manually update to receive workunits. This only happened when the ATI openCL update went live.

inomyabcs

Joined: 11 Feb 05

Posts: 6

Credit: 16488057

RAC: 0

An update from my last

19 May 2012 2:46:33 UTC

Message 109585 in response to message 109584

(moderation:

)

An update from my last message.

This only temporarily solved my problem. I am going to try to turn off the cuda GPU preference and see if that helps. I should know after my CPU run is done (5 hours from now) if it lets me download more CPU units. Sorry about any confusion. There is a server log returned in my event log for troubleshooting.

5/19/2012 10:19:19 AM | Einstein@Home | Requesting new tasks for NVIDIA
5/19/2012 10:19:19 AM | Einstein@Home | [http] HTTP_OP::init_post(): http://einstein.phys.uwm.edu/EinsteinAtHome_cgi/cgi
5/19/2012 10:19:19 AM | Einstein@Home | [http] HTTP_OP::libcurl_exec(): ca-bundle set
5/19/2012 10:19:19 AM | Einstein@Home | [http] [ID#1] Info: Connection #0 seems to be dead!
5/19/2012 10:19:19 AM | Einstein@Home | [http] [ID#1] Info: Closing connection #0
5/19/2012 10:19:19 AM | Einstein@Home | [http] [ID#1] Info: About to connect() to einstein.phys.uwm.edu port 80 (#0)
5/19/2012 10:19:19 AM | Einstein@Home | [http] [ID#1] Info: Trying 129.89.61.70...
5/19/2012 10:19:20 AM | Einstein@Home | [http] [ID#1] Info: Connected to einstein.phys.uwm.edu (129.89.61.70) port 80 (#0)
5/19/2012 10:19:20 AM | Einstein@Home | [http] [ID#1] Sent header to server: POST /EinsteinAtHome_cgi/cgi HTTP/1.1
5/19/2012 10:19:20 AM | Einstein@Home | [http] [ID#1] Sent header to server: User-Agent: BOINC client (windows_x86_64 7.0.25)
5/19/2012 10:19:20 AM | Einstein@Home | [http] [ID#1] Sent header to server: Host: einstein.phys.uwm.edu
5/19/2012 10:19:20 AM | Einstein@Home | [http] [ID#1] Sent header to server: Accept: */*
5/19/2012 10:19:20 AM | Einstein@Home | [http] [ID#1] Sent header to server: Accept-Encoding: deflate, gzip
5/19/2012 10:19:20 AM | Einstein@Home | [http] [ID#1] Sent header to server: Content-Type: application/x-www-form-urlencoded
5/19/2012 10:19:20 AM | Einstein@Home | [http] [ID#1] Sent header to server: Content-Length: 68924
5/19/2012 10:19:20 AM | Einstein@Home | [http] [ID#1] Sent header to server: Expect: 100-continue
5/19/2012 10:19:20 AM | Einstein@Home | [http] [ID#1] Sent header to server:
5/19/2012 10:19:20 AM | Einstein@Home | [http] [ID#1] Received header from server: HTTP/1.1 100 Continue
5/19/2012 10:19:23 AM | Einstein@Home | [http] [ID#1] Received header from server: HTTP/1.1 200 OK
5/19/2012 10:19:23 AM | Einstein@Home | [http] [ID#1] Received header from server: Date: Sat, 19 May 2012 01:20:59 GMT
5/19/2012 10:19:23 AM | Einstein@Home | [http] [ID#1] Received header from server: Server: Apache/2.2.3 (CentOS)
5/19/2012 10:19:23 AM | Einstein@Home | [http] [ID#1] Received header from server: Transfer-Encoding: chunked
5/19/2012 10:19:23 AM | Einstein@Home | [http] [ID#1] Received header from server: Content-Type: text/xml
5/19/2012 10:19:23 AM | Einstein@Home | [http] [ID#1] Received header from server:
5/19/2012 10:19:24 AM | Einstein@Home | [http] [ID#1] Info: Connection #0 to host einstein.phys.uwm.edu left intact
5/19/2012 10:19:25 AM | Einstein@Home | Scheduler request completed: got 0 new tasks
5/19/2012 10:19:25 AM | Einstein@Home | No work sent
5/19/2012 10:19:25 AM | Einstein@Home | see scheduler log messages on http://einstein.phys.uwm.edu//host_sched_logs/4206/4206337
5/19/2012 10:19:25 AM | Einstein@Home | (your BOINC client is old - please install current version)

inomyabcs

Joined: 11 Feb 05

Posts: 6

Credit: 16488057

RAC: 0

This is confirmed. The

19 May 2012 10:13:31 UTC

Message 109586 in response to message 109585

(moderation:

)

This is confirmed. The workaround is to disable requesting GPU workunits until the scheduling issue is resolved.

Horacio

Joined: 3 Oct 11

Posts: 205

Credit: 80557243

RAC: 0

RE: This is confirmed. The

19 May 2012 17:30:35 UTC

Message 109587 in response to message 109586

(moderation:

)

Quote:

This is confirmed. The workaround is to disable requesting GPU workunits until the scheduling issue is resolved.

Or, just do a manual update if you see that you're running low on WUs cached.
Or also, disable GPU just for a while, and when the cache is filled for CPU enable it again.
Or, just set the chache a bit higher and let it run as is... In the worst case your CPUs will be idle until the delay is over...

Completely disabling GPU work will affect Einstein RAC, way more, than having idle CPU cores...

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4312

Credit: 250378328

RAC: 35060

Unlike I suspected there is

21 May 2012 9:55:04 UTC

Message 109588

(moderation:

)

Unlike I suspected there is no (obvious bug in the server code. The only thing is that the log / notification message is somewhat misleading. If there is work for any application version which has a limit on the Core Client version that is not satisfied by your client, you will get this message. This is slightly stupid, because this check is made before checking whether your computer can actually run that type of work or even asked for it. But the 'outdated' Client is just the reason why you don't get that particular type of work, and certainly not the reason why you don't get any work at all.

Not sending work because client is outdated

Forums › Problems and Bug Reports

Comment viewing options

Forums › Problems and Bug Reports