Eric Kaiser wrote: I can ...
BTW, for BRP5 the current range of estimated runtimes is from 3.5 hrs up to 11 hrs.
RE: RE: I can observe
On all 11 GPUs I have running Einstein, the WU runtime variation for any given GPU is small (within 5%) for either BRP4 or BRP5 tasks.
Hmm, weird...
BRP4: This one had a runtime of 6,271 s and this one had a runtime of 1,292 s.
Yet the CPU time is only 231 s compared to 165 s.
The system runs cool (GPU 50°C, CPU 50°C) under full load and is stable, too.
I have no explanation for this wide range of runtimes.
Generally, the variation is not great if you provide more or less identical conditions. However, the database does not contain information on whether, how many, and which applications were running in parallel on the same GPU.
Since I now have a few more completed jobs, I can see that the computation time is almost exactly 10x longer on my Nvidia cards. I also think the scoring should include a bonus for the fact that the tasks take more time. Therefore, I think a little more than 5,000 per task would be fine. Of course, this is just my personal opinion.
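As a rough sketch of the arithmetic behind that suggestion (the BRP4 credit value and the bonus factor below are assumed placeholders, not project numbers), scaling the per-task credit by the runtime ratio plus a small long-task bonus lands in that region:

```python
# Illustrative credit scaling only -- the BRP4 credit and the bonus are assumptions.
brp4_credit = 500.0      # assumed credit per BRP4 task (placeholder)
runtime_ratio = 10.0     # "computation time is almost exactly 10x longer"
long_task_bonus = 0.05   # assumed 5% bonus for tying up the card much longer

brp5_credit = brp4_credit * runtime_ratio * (1.0 + long_task_bonus)
print(f"Suggested BRP5 credit per task: {brp5_credit:.0f}")  # 5250 -> "a little more than 5,000"
```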
Sebastian M. Bobrecki
In my case only 2 WUs from the same project run in parallel on my GPU. No WUs from other GPU projects.
When I switch to another GPU project I stop all other GPU projects. When Einstein is running, 10 CPU cores run WUs from CPU projects and 2 CPU cores support the Einstein GPU tasks.
So the conditions should be pretty much the same.
RE: ... So the conditions
Not necessarily. BRP requires a lot of data transfer between the GPU and RAM, so it's possible that some of the applications running on the CPU saturate the memory bandwidth to the point where it starts to make a difference.
OK. This might explain the variation in runtime, but it doesn't explain why the estimates for WUs that are still waiting to be executed also show this wide range.
I'm not sure whether it is worth digging deeper into this.
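For what it's worth, the client's runtime estimate is essentially the workunit's FLOP estimate divided by the app version's estimated speed, optionally scaled by the host's duration correction factor, so workunits sent out with a larger FLOP estimate show a proportionally larger estimate before they even start. A minimal sketch, with purely illustrative numbers:

```python
# Minimal sketch of how a BOINC-style runtime estimate comes about.
# estimated_seconds ~= workunit FLOP estimate / effective FLOPS of the app version,
# optionally scaled by the host's duration correction factor (DCF).

def estimated_runtime(rsc_fpops_est: float, app_flops: float, dcf: float = 1.0) -> float:
    return rsc_fpops_est / app_flops * dcf

# Two workunits with different FLOP estimates get very different estimates
# on the same GPU -- illustrative numbers only:
print(estimated_runtime(2.8e14, 1.75e11))  # ~1,600 s
print(estimated_runtime(1.1e15, 1.75e11))  # ~6,300 s
```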
RE: ... I'm not sure if it
I think it may be helpful to the crew in some way. After all, you probably know best what is happening on your computer(s).
RE: I think it may be in
Good point.
OK. Let me summarize some points that I have observed with BRP4 and BRP5:
Most of the BRP4 WUs have an estimated runtime of ca. 24 min. The effective runtime matches the estimate: ~1,600 sec +/- some 100 sec.
Sometimes individual WUs have a significantly longer estimate before they are started and during calculation. The effective runtime in these cases reflects the estimate, i.e. it is significantly higher than the rest (up to 6,200 sec).
BRP5 behaves the same as far as I can tell after 11 finished WUs (9 actually validated). The normal runtime seems to be around 13,000 sec for a single WU.
3 WUs had runtimes of ~25,000 sec, ~30,000 sec and ~41,000 sec. For these the estimate was also higher than normal. The estimate for a "normal" BRP5 WU is 3.5 hrs +/-.
I'm executing two WUs in parallel to reach ~90% GPU usage. I'm aware that the effective runtime of a single WU increases slightly during parallel execution. A single WU results in ~55% GPU usage.
If I remember correctly, without parallel processing a BRP4 WU took 20 min, i.e. 3 WU/hr. With parallel processing I achieve >= 4 WU/hr (see the sketch after this summary).
When 2 WUs are executing on the GPU, BOINC Manager restricts CPU WUs to 10 in parallel. That means 2 cores are used for the Einstein GPU tasks.
General settings in BOINC: 92% of CPU cores and 100% CPU time.
As far as I've seen, VRAM usage stays below 50% of the available capacity. Combined RAM usage of BOINC, Windows 7 Prof 64, Firefox, ... is around 6 GB. That means 58 GB remain free.
Hardware: i7-3930K @ 3.2 GHz, 64 GB RAM, Radeon HD 7850 @ 1050 MHz with 2 GB VRAM @ 1350 MHz. Under full load the temperature stays around 50°C, so I suppose there is no CPU clock throttling due to thermal problems.
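A quick sketch of the throughput arithmetic from the summary above (the function is just an illustration; the 1,200 s and 1,600 s runtimes are the rounded figures quoted there):

```python
# Throughput comparison: one BRP4 WU at a time vs. two running in parallel.
# Runtimes are the approximate values quoted in the summary above.

def wu_per_hour(runtime_sec: float, concurrent: int = 1) -> float:
    """Completed WUs per hour when `concurrent` WUs share the GPU,
    each taking `runtime_sec` wall-clock seconds."""
    return 3600.0 / runtime_sec * concurrent

print(wu_per_hour(1200))                 # single WU, ~20 min each -> 3.0 WU/hr
print(wu_per_hour(1600, concurrent=2))   # two at once, ~1,600 s each -> 4.5 WU/hr
```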
Here's a graph showing the effect of the BRP5 introduction at 4k/task on the daily credit for my 9 hosts with NVIDIA GPUs. Results for the individual hosts are plotted against the left axis; the topmost line is the total for all hosts, plotted against the right axis.
7 of the hosts are used exclusively on E@H; 2 are used on both E@H and A@H (75%/25%). Hosts with faster GPUs run 2 tasks at once (utilisation of 0.5); all others run one task per GPU.
It would be interesting to do a comparison against hosts with ATI and NVIDIA GTX 6xx GPUs, if anyone would like to mail over the job log file for an E@H-dedicated host.
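If anyone does want to send a job log over, something along these lines can pull the per-task elapsed times out of it for comparison. It assumes the usual BOINC client job_log_*.txt line format of a timestamp followed by space-separated key/value pairs (ue = estimated runtime, ct = CPU time, fe = FLOP estimate, nm = task name, et = elapsed time); the task-name prefixes are placeholders to adjust to whatever your BRP4/BRP5 result names start with:

```python
# Sketch: summarise elapsed times per task type from a BOINC job log file.
# Assumed line format: <unix_time> ue <est_s> ct <cpu_s> fe <flops_est> nm <task_name> et <elapsed_s> ...
import statistics
import sys

def parse_job_log(path):
    tasks = []
    with open(path) as fh:
        for line in fh:
            fields = line.split()
            if len(fields) < 3:
                continue
            # fields[0] is the completion timestamp; the rest alternate key, value
            kv = dict(zip(fields[1::2], fields[2::2]))
            if "nm" in kv and "et" in kv:
                tasks.append((kv["nm"], float(kv["et"])))
    return tasks

if __name__ == "__main__":
    tasks = parse_job_log(sys.argv[1])
    for prefix in ("p2030", "PM"):  # placeholder name prefixes for BRP4 / BRP5 tasks
        times = [et for name, et in tasks if name.startswith(prefix)]
        if times:
            print(f"{prefix}: {len(times)} tasks, "
                  f"mean {statistics.mean(times):.0f} s, "
                  f"stdev {statistics.pstdev(times):.0f} s")
```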