Fermi LAT Gamma-ray pulsar search #3 "FGRP3"

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4349

Credit: 253665212

RAC: 34951

Better now? (The problem

3 Mar 2014 18:19:32 UTC

Message 119815


Better now?

(The problem is that on the server side we can't see that part on the Client side)

rbpeake

Joined: 18 Jan 05

Posts: 266

Credit: 1185872797

RAC: 689326

RE: Better now? (The

3 Mar 2014 18:53:20 UTC

Message 119816 in response to message 119815


Quote:

Better now?

(The problem is that on the server side we can't see that part on the Client side)

BM

Yes, thank you! All is back to normal!

Sunny129

Joined: 5 Dec 05

Posts: 162

Credit: 160342159

RAC: 0

well i've been running these

4 Mar 2014 21:22:09 UTC

Message 119817


well i've been running these FGRP3 GPU tasks 6 at a time for a while now on my 7970 3GB GPU. utilization averages 58% +/- a 5% margin of error. this is in comparison to a ~55% utilization when running only 5 tasks at a time. so despite my inability to test more than 6 FGRP3 GPU tasks at a time due to VRAM limitations, i'm fairly confident that the law of diminishing returns is having its effect, and that i'm probably already near my GPU's limit for efficiency. in other words, i think the only way to increase GPU utilization at this point is to optimize the code (i.e. give the GPU more than just FFT calculations). i understand that this takes time and requires patience host-side.
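
To put rough numbers on the diminishing returns described above, here is a minimal Python sketch. It uses only the figures quoted in the post (about 55% utilization with 5 tasks, about 58% with 6, and a 5% margin); the helper name `marginal_gain` is made up for illustration, and comparing the gain against the margin is one possible reading of the data, not a measurement.

```python
def marginal_gain(util_before: float, util_after: float, extra_tasks: int = 1) -> float:
    """Utilization gained per additional concurrent task, in percentage points."""
    return (util_after - util_before) / extra_tasks


# Figures quoted in the post (percent GPU utilization).
util_5_tasks = 55.0
util_6_tasks = 58.0
margin = 5.0  # the quoted +/- 5% scatter

# Average contribution of each of the first five tasks.
avg_per_task_first_5 = util_5_tasks / 5

# Contribution of the sixth task alone.
gain_6th = marginal_gain(util_5_tasks, util_6_tasks)

# If the gain is no larger than the scatter, it cannot be told apart from noise.
within_noise = gain_6th <= margin

print(f"average gain per task (tasks 1-5): {avg_per_task_first_5:.1f} points")
print(f"gain from the 6th task:            {gain_6th:.1f} points")
print(f"6th-task gain within the margin:   {within_noise}")
```

The sixth task adds far less than each of the first five did on average, and the gain is smaller than the quoted scatter, which fits the conclusion that more concurrency is unlikely to help much.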

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4349

Credit: 253665212

RAC: 34951

There are CPU App versions of

5 Mar 2014 11:25:45 UTC

Message 119818


There are CPU App versions of FGRP3 (plan class is "FGRPSSE"). If you didn't get work for that, you probably have to set "Run CPU versions of applications for which GPU versions are available" to "yes" in your Einstein@Home preferences. Currently FGRP3 is the only application affected by this setting, as the BRP search has separate applications for GPU (BRP5, BRP4G) and CPU (BRP4).

astro-marwil

Joined: 28 May 05

Posts: 536

Credit: 692696543

RAC: 499457

Hallo BM! Thankyou for your

5 Mar 2014 12:50:23 UTC

Message 119819 in response to message 119818


Hello BM!
Thank you for your quick response.
The problem for me is that I then also get GPU work for FGRP3, and this GPU work runs much less efficiently on my system than BRP5; the difference is a factor of 2 to 3. So, with regard to Cobblestone earnings, it is more efficient for me to disable FGRP3 altogether, unless I can select CPU-only work for FGRP3.

Kind regards and happy crunching
Martin

Gary Roberts

Moderator

Joined: 9 Feb 05

Posts: 5888

Credit: 119673993985

RAC: 25254695

Maybe you might be interested

6 Mar 2014 4:25:32 UTC

Message 119820 in response to message 119819


Maybe you might be interested in these notes?

Cheers,
Gary.

Betreger

Joined: 25 Feb 05

Posts: 992

Credit: 1645618555

RAC: 654618

I appreciate your efforts,

7 Mar 2014 2:10:17 UTC

Message 119821 in response to message 119820


I appreciate your efforts, but as a 68-year-old pensioner who has not written code for over 30 years, your solution does not apply to me. I don't know anything about app_info.xml. What this thing needs, at least for me, is an option on the preferences page to run it on the CPU only ("FGRPSSE") and spare the GPU this little parasite. I don't mind running it on the CPU, but it hogs too many resources when run on the GPU.

astro-marwil

Joined: 28 May 05

Posts: 536

Credit: 692696543

RAC: 499457

Hallo Betreger! RE: it

7 Mar 2014 12:05:20 UTC

Message 119823 in response to message 119821


Hello Betreger!

Quote:

it hogs too many resources when run on the gpu.

You're right: on my PC these FGRPopencl-ati tasks require about 60% of one CPU core at relatively low GPU load. But even so this gives a speedup of 4.5; in other words, the FGRPopencl-ati tasks (running on CPU + GPU) take only 22.22% as long as FGRPSSE tasks (running on the CPU only). As the GPU load is relatively low (for me never higher than 60%, and that only from time to time for a few seconds), you can set the "GPU utilization factor of FGRP apps" in the Einstein@Home preferences of your account to a value lower than 1. If this factor is 0.5 and you have sufficient VRAM on your graphics card, 2 tasks will run on your GPU. If you enter 0.33, 3 tasks will run in parallel on the GPU, but for this you need at least 2 GB of VRAM on your GPU. So you no longer need these app_info files.
I hope this clarifies things a bit.
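
The two pieces of arithmetic in this post can be sketched in a few lines of Python. This is a simplified model, not the BOINC client's actual scheduling code: it assumes that tasks are started as long as the sum of their utilization factors does not exceed 1, and it ignores the VRAM limit mentioned above. The function names `tasks_per_gpu` and `runtime_fraction` are hypothetical.

```python
import math


def tasks_per_gpu(utilization_factor: float) -> int:
    """Tasks that fit on one GPU if each claims `utilization_factor` of it.

    Assumption: tasks are added while the summed factors stay at or below 1.
    """
    if not 0 < utilization_factor <= 1:
        raise ValueError("utilization factor must be in (0, 1]")
    # Small tolerance so that rounding in 1/f never drops a task.
    return math.floor(1.0 / utilization_factor + 1e-9)


def runtime_fraction(speedup: float) -> float:
    """Runtime of the faster app as a fraction of the slower app's runtime."""
    return 1.0 / speedup


for factor in (1.0, 0.5, 0.33):
    print(f"factor {factor}: {tasks_per_gpu(factor)} task(s) on the GPU")

# A speedup of 4.5 means the GPU task takes 1/4.5 of the CPU-only runtime.
print(f"runtime at speedup 4.5: {runtime_fraction(4.5) * 100:.2f}% of CPU-only")
```

A factor of 0.33 is used rather than 0.34 because three tasks then sum to 0.99, which still fits; with 0.34 the third task would push the sum above 1.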

Kind regards and happy crunching.
Martin

Betreger

Joined: 25 Feb 05

Posts: 992

Credit: 1645618555

RAC: 654618

Martin I am running a GPU

7 Mar 2014 17:19:06 UTC

Message 119824 in response to message 119823


Martin, I am running a GPU factor of 0.33, and each FGRP3 GPU task also requires a CPU core. So when I had 3 up at once, I only had 4 tasks running instead of 7 (that being 4 on the CPU and 3 on the GPU).
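
The task count above can be reproduced with a little arithmetic. A minimal Python sketch, assuming a 4-core CPU without Hyper-Threading (implied by "4 on the CPU") and one full CPU core reserved per FGRP3 GPU task; the function name `running_tasks` is hypothetical and this is not how the BOINC client is implemented:

```python
import math


def running_tasks(cpu_cores: int, gpu_tasks: int, cores_per_gpu_task: float):
    """Return (cpu_tasks, gpu_tasks, total) for a simple core-reservation model.

    Each GPU task reserves `cores_per_gpu_task` CPU cores; whatever is left
    over is filled with one CPU task per whole free core.
    """
    reserved = gpu_tasks * cores_per_gpu_task
    cpu_tasks = max(0, math.floor(cpu_cores - reserved))
    return cpu_tasks, gpu_tasks, cpu_tasks + gpu_tasks


# What was observed: every GPU task ties up a full core.
print(running_tasks(cpu_cores=4, gpu_tasks=3, cores_per_gpu_task=1.0))

# What was hoped for: GPU tasks that need (almost) no CPU.
print(running_tasks(cpu_cores=4, gpu_tasks=3, cores_per_gpu_task=0.0))
```

With a full core reserved per GPU task, only one core is left for CPU work, which gives the 4 running tasks reported instead of the expected 7.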

astro-marwil

Joined: 28 May 05

Posts: 536

Credit: 692696543

RAC: 499457

Hallo Betreger! RE: I

8 Mar 2014 10:32:15 UTC

Message 119825 in response to message 119824


Hello Betreger!

Quote:

I had 3 up at once I only had 4 tasks running instead of 7 that being 4 on the CPU and 3 on the GPU

Yes, that's right, since you and I both have an i5 processor, which doesn't support HT (Hyper-Threading). But it should still give higher overall performance, as the GPU tasks run much faster than the CPU-only tasks.

So much for the theory. In practice, as I observed for the first time yesterday, this isn't so. With the program "MSI Afterburner" one can see the GPU load over time. For the FGRP tasks it shows a very ragged trace, often with stretches of several seconds of very low or zero GPU load in between. So in theory another task of the same kind could easily be crunched during these gaps. But if both tasks are started at the same time and need the GPU in such a similar rhythm, they very often overload the GPU together instead of filling each other's gaps. It seems to help to suspend one of the tasks for a minute or so and then resume it. The best time delay has to be found by testing. Hopefully one of the programmers can give us some hints in this regard.
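
The effect of staggering two tasks can be illustrated with a toy model in Python. Everything here is an assumption made for illustration: each task is taken to need the GPU for 4 seconds out of every 10 in a perfectly regular rhythm, which real FGRP load traces certainly do not follow, and the model ignores that contention itself slows tasks down and shifts their rhythm. The function names are hypothetical.

```python
def gpu_demand(t: int, busy: int, period: int, offset: int = 0) -> bool:
    """True if a task started `offset` seconds late wants the GPU at second t."""
    return (t - offset) % period < busy


def contention_seconds(busy: int, period: int, offset: int) -> int:
    """Seconds per period in which both tasks want the GPU at the same time."""
    return sum(
        gpu_demand(t, busy, period) and gpu_demand(t, busy, period, offset)
        for t in range(period)
    )


BUSY, PERIOD = 4, 10  # invented numbers: 4 s of GPU work every 10 s

for offset in range(PERIOD):
    overlap = contention_seconds(BUSY, PERIOD, offset)
    print(f"start delay {offset} s: {overlap} s of contention per {PERIOD} s")

# First delay that gives the least contention in this toy model.
best = min(range(PERIOD), key=lambda d: contention_seconds(BUSY, PERIOD, d))
print(f"smallest delay with least contention: {best} s")
```

In this model a simultaneous start makes the two tasks collide during every busy phase, while a delay of at least one busy phase lets the second task run entirely in the first task's gaps. With real, irregular traces the best delay would still have to be found by experiment, as noted above.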

I hope my writing is understandable. If not, don't hesitate to contact me.

Kind regards and happy crunching
Martin
