Did the host with "high power gpus" that you were using run a single task per gpu?
Sure. Running multiple tasks on the same GPU makes no sense for us, as our cluster scheduler can't handle that. And indeed we're using Linux and the CUDA version of the app.
Hm.... on my AMD Pro W7600 run times went up significantly. I run 3 tasks in parallel and am going to try 2 tasks in parallel, but I don't see a GPU utilisation problem: the card is drawing 124 watts, while with the old O3 tasks it was not going above 115 watts. CPU usage is significantly lower, as the system now consumes 202 instead of the previous 214 watts, and it shows in the CPU time per task:
GPU time (s) / CPU time (s) / Credits / Task
2,002 / 244 / 10,000 / All-Sky Gravitational Wave search on O3 v1.07 () windows_x86_64
8,025 / 124 / 4,000 / All-Sky Gravitational Wave search on O3 v1.07 () windows_x86_64
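For what it's worth, the two rows above imply a large drop in credit per GPU-second. A quick sketch, using only the figures copied from the table (the task labels are mine):

```python
# Credit rate per GPU-second for the two tasks listed above.
# GPU seconds and credits are copied from the table; the rest is arithmetic.
tasks = {
    "old O3 task": {"gpu_s": 2002, "credits": 10000},
    "new task":    {"gpu_s": 8025, "credits": 4000},
}

for name, t in tasks.items():
    rate = t["credits"] / t["gpu_s"]  # credits per GPU-second
    print(f"{name}: {rate:.2f} credits/GPU-second")
```

Roughly a factor of ten apart, which matches the credit discussion further down the thread.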
The total CPU recalc time has indeed dropped significantly. I was able to hand-time the CPU recalc steps on an old high-frequency work unit before they ran out. This was on the i9-14900KS. I ran both tests with the same number of tasks running, same loads, etc...
Old (high freq): mid-work-unit CPU recalc was 76 seconds and the end-of-work-unit CPU recalc was 69 seconds, for a total CPU time of 145 seconds.
New (low freq): total CPU recalc time = 32 seconds.
I am confident that one of our old Xeons would not see the same percentage reduction in time, but it is significantly less across the board.
Bernd, is the CPU recalc being done differently, or just simpler/smaller math to calculate? Or both? Just curious.
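The timings quoted above work out to roughly a 78% reduction in CPU recalc time. Pure arithmetic on the hand-timed figures, nothing else assumed:

```python
# Percentage reduction in CPU recalc time, old vs new work units,
# using the hand-timed figures quoted above.
old_s = 76 + 69   # mid + end-of-work-unit recalc, old high-freq WU (145 s total)
new_s = 32        # total recalc, new low-freq WU

reduction = (old_s - new_s) / old_s * 100
print(f"{reduction:.1f}% less CPU recalc time")
```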
I'm seeing run-times a bit differently.
Following is per host: HF run time, Bu run time, ratio (Bu/HF), concurrency, GPU, app version, MPS
ora: 2300, 4000, 1.74, x2, 3060ti, 1.07, n/a
tha: 1150, 3290, 2.86, x2, 3070, 1.14, mps 70%
tli: 1070, 2610, 2.44, x2, 3070ti, 1.14, mps 70%
del: 1100, 2630, 2.39, x2, 3070ti, 1.14, mps 70%
tcu: 1000, 2580, 2.58, x3, 3080ti, 1.14, mps 55%
Run times are eyeball estimates from 8 hrs ago. GPUs are running with the same config for both HF and Bu WUs.
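The ratio column can be reproduced directly from the two run times, and with the concurrency it also gives a rough throughput. A sketch using only the figures listed above (I'm assuming the run times are wall-clock seconds, which the post doesn't state explicitly):

```python
# Bu/HF run-time ratio per host, from the figures listed above.
# Tuples: (HF run time, Bu run time, concurrent tasks); times assumed to be seconds.
hosts = {
    "ora": (2300, 4000, 2),
    "tha": (1150, 3290, 2),
    "tli": (1070, 2610, 2),
    "del": (1100, 2630, 2),
    "tcu": (1000, 2580, 3),
}

for name, (hf, bu, conc) in hosts.items():
    ratio = bu / hf
    per_day = conc * 86400 / bu  # rough Bu WUs/day at that concurrency
    print(f"{name}: ratio {ratio:.2f}, ~{per_day:.0f} Bu WUs/day at x{conc}")
```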
With the last parameters, I was able to calculate around 450-500 WUs/day on the NV 4090 (with 2+2 WUs in parallel and offset for the CPU calculation).
Currently, with a longer runtime (and only 2 units in parallel make sense), it looks more like 100-110 WUs/day.
The estimate of 5-6 months for the planned 150-250 Hz therefore seems a little too optimistic to me.
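The throughput figures above follow from per-WU run time and the number of units in parallel. A minimal sketch; the per-WU run times here are hypothetical values back-calculated to land in the quoted ranges, not measurements:

```python
# WUs/day from per-WU run time and number of WUs in parallel:
# throughput = n_parallel * 86400 / runtime_seconds.
def wus_per_day(runtime_s: float, n_parallel: int) -> float:
    return n_parallel * 86400 / runtime_s

# Hypothetical per-WU run times chosen to match the quoted ranges:
print(wus_per_day(730, 4))    # ~473 WUs/day, like the earlier 2+2 setup
print(wus_per_day(1650, 2))   # ~105 WUs/day, like the current longer-runtime setup
```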
OK, the validator is running for the new tasks. I got 1000 credits per valid task.
Harri Liljeroos wrote: OK, the validator is running for the new tasks. I got 1000 credits per valid task.
Wondering why I got 4000 credits ?
The new tasks got me 4000 credits also.
The first WUs were issued with the credit of the high-freq run; there were a few in between (3000?); the last generated ones should give 4000.
BM
Here's the host:
https://einsteinathome.org/host/13157119