All-Sky Gravitational Wave Search on O3 data (O3ASHF1)

Speedy

Joined: 11 Aug 05

Posts: 41

Credit: 24993511

RAC: 9764

If we had Elon's help I think

19 Nov 2024 1:25:32 UTC

Message 229938 in response to message 229595

(moderation:

)

If we had Elon's help I think the project would be complete in a matter of days, leaving us nothing to do. I am currently working on 1680.00Hz

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4350

Credit: 253819986

RAC: 36029

We will conclude the current

2 Dec 2024 19:41:51 UTC

Message 230401

(moderation:

)

We will conclude the current high-frequency search to focus on a lower frequency range with a new parameter setup. The app will stay the same, GPU memory requirement shall be around 1800MB. We'll switch to the new workunits ASAP.

GWGeorge007

Joined: 8 Jan 18

Posts: 3205

Credit: 5243046723

RAC: 4419812

Bernd Machenschalk wrote: We

2 Dec 2024 21:44:04 UTC

Message 230405 in response to message 230401

(moderation:

)

Bernd Machenschalk wrote:

We will conclude the current high-frequency search to focus on a lower frequency range with a new parameter setup. The app will stay the same, GPU memory requirement shall be around 1800MB. We'll switch to the new workunits ASAP.

Thanks for the heads-up, Bernd. Much appreciated!

George

Proud member of the Old Farts Association

MAGIC Quantum M...

Joined: 18 Jan 05

Posts: 1972

Credit: 1536712534

RAC: 1860340

Thanks for the update,

3 Dec 2024 7:21:13 UTC

Message 230424 in response to message 230401

(moderation:

)

Thanks for the update, Bernd

Much appreciated!

Link

Joined: 15 Mar 20

Posts: 137

Credit: 13249818

RAC: 27918

Crunching O3 on the iGPU of

3 Dec 2024 13:55:30 UTC

Message 230435 in response to message 230401

(moderation:

)

Crunching O3 on the iGPU of my Ryzen 5700G...

Is it to be expected, that the new WUs need a lot more memory bandwidth? Even with all CPU tasks disabled I see DRAM Read Bandwidth in HWiNFO, that I've never seen before, basically it's nailed to ~38 Gbps, if I enable 4 CPU tasks I get even spikes over 40 Gbps. With the old tasks and 14 MCM WUs on the CPU cores it was somewhere around 32-34 Gbps with spikes to around 36 Gbps and the iGPU could complete a task in about 1 hour and 20-25 minutes, the estimate for my first new task is around 4-5 hours.

Is this the expected increase in runtime, or are the new tasks not very suitable for iGPUs with their limited memory bandwidth?

Ian&Steve C.

Joined: 19 Jan 20

Posts: 4158

Credit: 50247151547

RAC: 42307563

I also see much longer

3 Dec 2024 15:07:22 UTC

Message 230439

(moderation:

)

I also see much longer runtimes on the new tasks.

and it looks like they are no longer running in two stages, but rather one long stage.

Bernd, is that intended?

I suppose the validator isnt setup yet either

_________________________________________________________________________

Boca Raton Comm...

Joined: 4 Nov 15

Posts: 303

Credit: 11556499080

RAC: 14508788

Looks like work being sent

3 Dec 2024 17:36:03 UTC

Message 230443

(moderation:

)

Looks like work being sent out is paused?

I watched a few of the new work units come through- definitely take longer. Still saw a pause at 49.5% and then at 99.0%.

EDIT: Just watched one NOT pause at 49.5%

Perhaps the one that I saw pause at 49.5% was from the previous generation.

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4350

Credit: 253819986

RAC: 36029

The setup is quite different.

3 Dec 2024 17:32:06 UTC

Message 230444

(moderation:

)

The setup is quite different. We split the original WUs in half and processed these one after the other, because that reduced the memory consumption. For the new WUs this isn't necessary (and actually more difficult). Also too the "recalc" stage at the end (mostly done on the CPU) is much shorter compared to the total runtime. I'm not entirely sure whether the new WUs run slower or faster on a specific machine, by how much and what properties influence that. On our machines the runtimes of the old an new workunits are comparable. So far I don't have enough results for a statistical evaluation.

Boca Raton Comm...

Joined: 4 Nov 15

Posts: 303

Credit: 11556499080

RAC: 14508788

Bernd Machenschalk

3 Dec 2024 17:41:41 UTC

Message 230445 in response to message 230444

(moderation:

)

Bernd Machenschalk wrote:

The setup is quite different. We split the original WUs in half and processed these one after the other, because that reduced the memory consumption. For the new WUs this isn't necessary (and actually more difficult). Also too the "recalc" stage at the end (mostly done on the CPU) is much shorter compared to the total runtime. I'm not entirely sure whether the new WUs run slower or faster on a specific machine, by how much and what properties influence that. On our machines the runtimes of the old an new workunits are comparable. So far I don't have enough results for a statistical evaluation.

Thanks for the info. So, just to make sure I am understanding correctly, after the WU is initially loaded into the VRAM, there is no recalc step until the very end of the task?

Ian&Steve C.

Joined: 19 Jan 20

Posts: 4158

Credit: 50247151547

RAC: 42307563

Boca Raton Community HS

3 Dec 2024 17:50:14 UTC

Message 230446 in response to message 230445

(moderation:

)

Boca Raton Community HS wrote:

Bernd Machenschalk wrote:

The setup is quite different. We split the original WUs in half and processed these one after the other, because that reduced the memory consumption. For the new WUs this isn't necessary (and actually more difficult). Also too the "recalc" stage at the end (mostly done on the CPU) is much shorter compared to the total runtime. I'm not entirely sure whether the new WUs run slower or faster on a specific machine, by how much and what properties influence that. On our machines the runtimes of the old an new workunits are comparable. So far I don't have enough results for a statistical evaluation.

Thanks for the info. So, just to make sure I am understanding correctly, after the WU is initially loaded into the VRAM, there is no recalc step until the very end of the task?

yes that sounds like it, and that recalc step will be much faster than before.

I might need to revisit the 1.14 app again to see how that does.

FYI, you can differentiate the high frequency tasks from the low frequency ones in your task list by looking at the task name, high frequency have "HF" and freq numbers >1500Hz, and low freq have "Bu" in the name and freq values around 100-200Hz.

Bernd, are you guys abandoning the HF search altogether? or just refocusing on something else for now. can you remind us which frequency ranges have been searched so far and what ranges we can expect from the LF search?

_________________________________________________________________________

All-Sky Gravitational Wave Search on O3 data (O3ASHF1)

Forums › Technical News

Comment viewing options

Forums › Technical News