Why only one parallell GPU task?

archae86

Joined: 6 Dec 05

Posts: 3162

Credit: 7319478354

RAC: 2305088

shauge wrote:it did not start

29 Dec 2016 17:39:25 UTC

Message 153500 in response to message 153498

(moderation:

)

shauge wrote:

it did not start doing multiple apps immediately, even after forcing a config update. It may be because I had too much work for the other CPU only project I run. The new gpu apps require a whole CPU core pr. instance. After a day or two it started running multiple GPU apps.

Just a reminder, Einstein GPU multiplicity changes commanded by web site changes do not take effect until a new WU is actually downloaded. Once downloaded, they do take effect (including previously downloaded work and even already running work), though the "branding" does not update.

On the other hand, yes, if local BOINC on your host is running CPU tasks as High Priority (because of calculated deadline trouble), that can crowd out CPU support for GPU multiplicity and lead to temporary running at lower than requested multiplicity.

shauge

Joined: 21 Apr 07

Posts: 9

Credit: 544283916

RAC: 156279

archae86 wrote:shauge

29 Dec 2016 20:55:55 UTC

Message 153504 in response to message 153500

(moderation:

)

archae86 wrote:

shauge wrote:
it did not start doing multiple apps immediately, even after forcing a config update. It may be because I had too much work for the other CPU only project I run. The new gpu apps require a whole CPU core pr. instance. After a day or two it started running multiple GPU apps.

Just a reminder, Einstein GPU multiplicity changes commanded by web site changes do not take effect until a new WU is actually downloaded. Once downloaded, they do take effect (including previously downloaded work and even already running work), though the "branding" does not update.

Your interpretation of the information you get each time you make a change on the web site, is not completely correct. You can also do a force update on the client side. Besides, I did also make other changes at the same time that were in effect, so the local settings had been updated.

Gary Roberts

Moderator

Joined: 9 Feb 05

Posts: 5877

Credit: 118760223242

RAC: 21204189

shauge wrote:archae86

29 Dec 2016 22:16:09 UTC

Message 153514 in response to message 153504

(moderation:

)

shauge wrote:

archae86 wrote:
shauge wrote:
it did not start doing multiple apps immediately, even after forcing a config update. It may be because I had too much work for the other CPU only project I run. The new gpu apps require a whole CPU core pr. instance. After a day or two it started running multiple GPU apps.

Just a reminder, Einstein GPU multiplicity changes commanded by web site changes do not take effect until a new WU is actually downloaded. Once downloaded, they do take effect (including previously downloaded work and even already running work), though the "branding" does not update.

Your interpretation of the information you get each time you make a change on the web site, is not completely correct. You can also do a force update on the client side. Besides, I did also make other changes at the same time that were in effect, so the local settings had been updated.

He is correct - you are not understanding what he said. With this one setting (changing the number of concurrent GPU tasks through the GPU utilization factor on the website) it will not be reflected on your host just by clicking 'update' locally, no matter how many times you do it. The key point is that the new setting ONLY gets implemented locally AFTER a new GPU task is actually downloaded. So, to get the change to happen quickly, just increase the work cache size setting sufficiently to download a new GPU task. Then the change will occur immediately.

Cheers,
Gary.

Suffkoppus

Joined: 28 Oct 07

Posts: 47

Credit: 649648843

RAC: 0

Thanks for all the input. The

1 Jan 2017 9:47:32 UTC

Message 153624

(moderation:

)

Thanks for all the input. The fix for me was to clear up all cpu work downloaded to my problem host. It had with a cache set to 1, over 330 cpu tasks and only about 20 gpu tasks on my drive. After aborting the 330 cpu tasks it was all fine again. Somehow the cache also does not fill up again with too many cpu tasks. But that should have been the problem in the beginning.

At all: have a good crunching year 2017

Ace Casino

Joined: 25 Feb 05

Posts: 36

Credit: 1500444600

RAC: 584353

How come my 8 cpu machine is

2 Jan 2017 15:01:17 UTC

Message 153671

(moderation:

)

How come my 8 cpu machine is running 9 cpu's.

8 cores of Gamma-Ray SSE

1 core of Gamma-Ray Nividia GPU

The new Nividia GPU's take 1 core, so that's 9 running at once.

This is happening.

Edit: My Quad core machine is acting like you might think it should Running 2 Gamma GPU and 2 CPU...not 5. These new Gamma-Ray GPU's are supposed to take 1 cpu.

Mad_Max

Joined: 2 Jan 10

Posts: 163

Credit: 2243680395

RAC: 664324

Looks like the same problem

2 Jan 2017 16:43:24 UTC

Message 153678

(moderation:

)

Looks like the same problem same problem as Suffkoppus had have: too many CPU tasks in queue force client running in "high priority". So recommendation is a same too:

Mad_Max wrote:

Check if your BOINC client running E@H in "high priority mode" (on tasks tab). It may be cause if you right about 8 (not 7) Multi-Directed Continuous Gravitational Wave search CV CPU tasks running on 8 cores CPU.

If so - try to reduce work cache size in BOINC settings and(only after new cache size, 0.5-1 day max) abort some of GW WUs until BOINC stop running in ""high priority". Or just wait few days while it clear some of WU queue without downloading new ones.

Root of the problem in the wrong assessment of the duration of the work - overestimated for GPU jobs and underestimated for the CPU jobs. As a result, the client downloads too much CPU jobs and too little of the GPU jobs.

But this thing can not be corrected by the user, it should be the administration of the project change and tweak some settings. Small work cache and aborting some of CPU WUs is just a workaround.

DanNeely

Joined: 4 Sep 05

Posts: 1364

Credit: 3573236589

RAC: 713947

If you're a user with a fast

2 Jan 2017 23:16:27 UTC

Message 153695

(moderation:

)

If you're a user with a fast GPU, one thing you can do to reduce the mismatch in CPU vs GPU runtimes is to run several GPU tasks at once to slow the per GPU task runtime down. Ultimately, what is needed though is to update the boinc platform to use multiple Duration Correction Factors - ideally 1 per work unit type; but at an absolute minimum separate CPU and GPU values are needed. A lot of this probably could be done purely on the client, although (especially with a per-app implementation) server side support would make for nicer behavior (ie the devs could explicitly link science apps that should have a similar DCF together instead of requiring the client to guess relationships based on naming conventions).

Holmis

Joined: 4 Jan 05

Posts: 1118

Credit: 1055935564

RAC: 0

A fix for these DCF problems

3 Jan 2017 20:46:52 UTC

Message 153738 in response to message 153695

(moderation:

)

A fix for these DCF problems has already been developed with the CreditNew, but as that system has some other quite big problems the admins here has opted not to implement it until it's updated/rewritten to function much better.
CreditNew was tested on Albert a few years ago and some of the problems with the system was identified but hasn't been fixed yet.

Why only one parallell GPU task?

Forums › Cruncher's Corner

Comment viewing options

Forums › Cruncher's Corner