log msg direction to server log

cliff
Joined: 15 Feb 12
Posts: 176
Credit: 283452444
RAC: 0
Topic 197752

Hi Folks,

12/10/2014 04:17:59 | Einstein@Home | Sending scheduler request: To fetch work.
12/10/2014 04:17:59 | Einstein@Home | Requesting new tasks for CPU
12/10/2014 04:18:01 | Einstein@Home | Scheduler request completed: got 0 new tasks
12/10/2014 04:18:01 | Einstein@Home | No work sent
12/10/2014 04:18:01 | Einstein@Home | see scheduler log messages on http://einstein5.aei.uni-hannover.de/EinsteinAtHome/host_sched_logs/11672/11672980

I went there but could not understand what I was supposed to be seeing.

Anyway, WUs for the CPU have dried up; there are just some WUs for the GPU, which I'm crunching 2 at a time.

Can anyone explain 'what' I was supposed to pick up from that server log?

Regards,

Cliff,

Been there, Done that, Still no damm T Shirt.

Richard Haselgrove
Joined: 10 Dec 05
Posts: 2143
Credit: 2951583590
RAC: 690895

Nothing visible at the moment. The log just shows you the most recent interaction between your computer and the server, and by bad luck they'd just had a little chat before I started typing this answer. Your computer asked for some GPU work, and received it. You perhaps need to wait until your computer next asks for CPU work, and have another look then.

You would be looking for the reason why no work was sent. No work available, perhaps? No disk space left? (Unlikely in your case: you have 'available disk 6.15 GB'.) Just look for any line which says 'no work sent', and post that (plus any lines that appear to be part of the same message) here.
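
For anyone who would rather not read the whole log by eye, that search can be roughly sketched in Python. It assumes the log page has been saved as plain text. The function name `find_no_work_lines` and the two-line context window are illustrative assumptions, not anything BOINC or the project provides.

```python
def find_no_work_lines(log_text, needle="no work sent", context=2):
    """Return each line containing `needle` plus the `context` lines after it.

    Matching is case-insensitive. Each hit is a list of lines, so lines
    that follow the match can be posted together with it.
    """
    lines = log_text.splitlines()
    hits = []
    for i, line in enumerate(lines):
        if needle.lower() in line.lower():
            hits.append(lines[i:i + 1 + context])
    return hits


if __name__ == "__main__":
    # Placeholder text only; this is NOT real scheduler log output.
    sample = "first line\nsomething: No work sent\nfollow-up A\nfollow-up B\nlast line"
    for hit in find_no_work_lines(sample):
        print("\n".join(hit))
```

Whether the lines after a match really belong to the same message still needs a human eye, so treat the context window as a starting point.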

cliff

Hi Richard,
Well, looking at the status page, it seems a few servers are down and not generating WUs.
I've since got some WUs and have added another NV card to my rig, so it's crunching away right now.
The CPU tasks are Gamma Ray 4 ones and are long duration, so it will be some time getting them done. I've therefore removed them from my options until I'm properly sorted out.

Regards,
Cliff


Holmis
Joined: 4 Jan 05
Posts: 1118
Credit: 1055935564
RAC: 0

Quote:
Hi Richard,
Well, looking at the status page, it seems a few servers are down and not generating WUs.


That's perfectly normal, as the work unit generators work with high and low limits: each one turns on when reaching the low limit and shuts down when reaching the high limit.
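
The high/low limit behaviour is a hysteresis switch, and a minimal Python sketch may make it clearer. This is not the project's actual generator code. The function name, the use of a count of unsent tasks as the measured quantity, and the limits of 100 and 500 are all assumptions for illustration.

```python
def generator_should_run(currently_running, unsent_tasks, low_limit, high_limit):
    """Hysteresis switch: start at or below the low limit, stop at or above the high limit."""
    if unsent_tasks <= low_limit:
        return True
    if unsent_tasks >= high_limit:
        return False
    # Between the limits, keep doing whatever it was already doing.
    return currently_running


if __name__ == "__main__":
    running = False
    # Made-up counts of unsent tasks over time.
    for unsent in [600, 400, 90, 300, 520, 300]:
        running = generator_should_run(running, unsent, low_limit=100, high_limit=500)
        print(unsent, "running" if running else "not running")
```

In this sketch the same count of 300 gives "running" while the queue is filling and "not running" while it is draining, which is why a stopped generator on the status page does not by itself mean anything is wrong.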

Quote:

I've since got some WUs and have added another NV card to my rig, so it's crunching away right now.
The CPU tasks are Gamma Ray 4 ones and are long duration, so it will be some time getting them done. I've therefore removed them from my options until I'm properly sorted out.

Regards,
Cliff


Well, you won't get any more CPU work then, as Gamma Ray #4 is the only CPU work available at this time. We're all waiting for the next gravitational wave search to be launched, but there's no news on that yet.

Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5872
Credit: 117329204776
RAC: 35837391

Quote:
... have added another NV card to my rig..


When you had one card only (presumably before the errors kicked in), the BRP4G tasks had run times around the hour mark with the CPU component being close to 20 mins. Those figures seem to indicate that the CPU is struggling to properly service the GPU. Now that you have 2 cards, and after the spate of errors, the BRP4G task run times have blown out further to around 4800 secs with CPU times around 1500 secs. Are you still running CPU tasks of any description on all CPU cores?

If all CPU cores are busy with CPU tasks, it would seem that you might get better overall performance if you freed up a couple of cores, using preferences, to allow better servicing of the GPUs. You should experiment and see what works best. It also could be that with two cards, each PCIe slot is only running at x8 and you have lost performance because of that. You should check your motherboard specs to see what happens with two cards inserted.

I suspect that an FX-8350 might have difficulty keeping up with the demands of two GTX970s when running the EAH GPU tasks.

Cheers,
Gary.

cliff

Hi Gary,
The errors weren't due to lack of CPU availability; they were due to a system crash for other reasons [finger trouble :-(]

I've run a checking routine and my rig will handle both cards and do both types of WU.

In fact it has done so for a while now. I've got my GPU fans running at a slightly higher speed with Afterburner to cover the increased workload, and my CPU is water cooled.

Regards,

Cliff,

Gary Roberts

Quote:
.. Errors weren't due to lack of CPU availability ...


I wasn't suggesting they were. It seemed likely that they occurred around the time of adding the second card so I was comparing crunch times before any errors to those after the errors had finished, just as a crude method for identifying the two situations. I was simply using that as a guide when looking at tasks in your tasks list.

Quote:
... my rig will handle both cards and do both types of WU..


Of course it will but are you getting the best 'return' from it? I asked the question about the number of cores running CPU tasks simply to alert you to the possibility of improving your overall performance. I'm not at all criticising the way you run your machine and I wasn't suggesting that anything was getting too hot. I was just trying to suggest things that many others have found to help in improving performance.

Cheers,
Gary.

cliff

Hi Gary,
Relax, I wasn't having a go at you, just saying my rig can handle it :-)

As for the CPU, it's 8-core with 4 cores free for the GPUs [in theory at any rate].

I could pull back the core count for CPU work a bit more, but not until I get rid of some long runners that I mistakenly asked for and got...

Just proves the old maxim of being careful what you ask for:-)

[edit] Just how does one get rid of 6 FGRP4-SSE2 WU without aborting them?

I've just done 2 of them at 15 hours plus :-/ and they all time out on the 26th of this month...

With 2 projects going and hourly swap overs I doubt I can process 6 of the things.

Regards,
Cliff


Holmis

Quote:

Just how does one get rid of 6 FGRP4-SSE2 WU without aborting them?

I've just done 2 of them at 15 hours plus :-/ and they all time out on the 26th of this month...

With 2 projects going and hourly swap overs I doubt I can process 6 of the things.

Regards,
Cliff

Let 'em run? =)

Make sure you have "Leave tasks in memory while suspended?" set to yes as the FGRP4 checkpoints are far apart. Setting it to yes enables the app to resume right where it was suspended and eliminates the need to reprocess from the last checkpoint.

If BOINC decides they are at risk of missing the deadline, it will switch to "High priority" and run the "in danger" tasks first; the imbalance in resource share will later be compensated by running more tasks from other projects.
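
A back-of-envelope way to judge whether the tasks are likely to be "in danger" is sketched below in Python. This is not BOINC's actual scheduling logic, just arithmetic. The function name, the idea of a fixed time share between projects, and the 10 days left are assumptions for illustration. The 6 tasks, roughly 15 hours each, 4 CPU cores and 2 projects come from the posts above.

```python
import math


def estimated_wall_hours(tasks, hours_per_task, cores, time_share):
    """Estimate wall-clock hours to finish `tasks` equal-length tasks.

    cores: CPU cores running these tasks at once.
    time_share: fraction of wall-clock time this project gets (0 < share <= 1).
    """
    if cores < 1 or not 0 < time_share <= 1:
        raise ValueError("need cores >= 1 and 0 < time_share <= 1")
    batches = math.ceil(tasks / cores)  # rounds of tasks run in parallel
    return batches * hours_per_task / time_share


if __name__ == "__main__":
    # 2 projects swapping hourly is treated here as a 0.5 time share (assumption).
    needed = estimated_wall_hours(tasks=6, hours_per_task=15, cores=4, time_share=0.5)
    hours_left = 10 * 24  # hypothetical: 10 days until the deadline
    print(f"need about {needed:.0f} h, have {hours_left} h")
```

With those inputs the estimate is 60 hours of wall-clock time. Since the tasks were reported as "15 hours plus", the real figure would be somewhat higher, but it gives a feel for how much slack there is before the deadline.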

cliff

Hi Holmis,
Ta for the advice.. I'll just let em run on then:-)
I'd better change the suspend setting tho..

Regards,
Cliff

