Multi-Directional Gravitational Wave Search on O3 data (O3MD1/F)

Mr P Hucker
Joined: 12 Aug 06
Posts: 819
Credit: 481,294,449
RAC: 14,954

Keith Myers wrote:

These beta tasks are messing up the project for the rest of us that have no interest in running them.

Nobody is able to get any other kind of work.

Admins, please shut the production down and clear the RTS buffers for the other types of work to be sent.

Surely a fair number of us (including myself) are happy to run beta tasks?

The only time I've avoided beta was a certain batch on a certain old GPU that crashed them all.

If beta testing is needed to make a better version of the other work, then it has to be done.

If this page takes an hour to load, reduce posts per page to 20 in your settings, then the tinpot 486 Einstein uses can handle it.

Keith Myers
Joined: 11 Feb 11
Posts: 4,666
Credit: 17,381,296,663
RAC: 6,885,895

Not when they impact all the other users who don't want to run beta applications.

The O3MD* work generators are running unthrottled and producing more than enough work for the few users who want to run beta tasks.

But they have overloaded the RTS buffers, and everybody else running Gamma-ray and BRP4/7 work gets nothing when requesting work, even though there is plenty of it in the Ready to Send categories.

The beta work is swamping the download servers and schedulers and preventing all the other work from being sent out.

I am down over a thousand tasks from my set cache levels on my 3-card hosts and still falling without replenishment. I will be out of work in just 8 hours.

 

Thyme Lawn
Joined: 13 Jun 15
Posts: 7
Credit: 12,137,755
RAC: 0

I've completed 8 CPU tasks on my hyper-threaded Windows 10 i7-6700K system, with a wide variation of CPU time.

"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer

Elphidieus
Joined: 20 Feb 05
Posts: 245
Credit: 20,603,702
RAC: 0

archae86 wrote:

Elphidieus wrote:

Curious... Why am I getting Multi-Directional Gravitational Wave WUs on my M1 MacBook Air when I did not select to receive them...?

Do you have the option:

Beta Settings: Run Test Applications? 

set to "Yes"

Do you have the option:

Other settings: Allow non-preferred apps?

set to "Yes"

 

 

Beta Settings: Run Test Applications = Yes, as long as they are NATIVE ARM apps, neither Intel nor legacy apps...

Allow non-preferred apps = Already No...

 

Looks like I have to turn Beta Settings OFF then... sad...

 

 

Thanks archae86...

mikey
Joined: 22 Jan 05
Posts: 11,786
Credit: 1,822,378,601
RAC: 314,558

archae86 wrote:

My two hosts which currently run O3 GPU work are building up very large pending counts.

The most heavily affected one:

https://einsteinathome.org/host/10659288/tasks/0/58

Has just 7 validations vs. 554 pending. Spot-checking through the pending list shows that nearly all tasks sent to my machine from December 3 until now show the second task required to form a quorum as unsent.

I wonder if some similarity/dissimilarity task dispatch rules are in effect which might orphan some machines?

While my second host running O3 initially had good success at getting quorum partners and validating, it now also has hundreds of pending tasks for which the required quorum partner task is unsent.

https://einsteinathome.org/host/10706295/tasks/2/58

These included tasks initially sent on December 7-8.

I'm seeing the same thing: lots of tasks sent out with no wingman, even though it says 'initial replication 2 tasks'.

Boca Raton Community HS
Joined: 4 Nov 15
Posts: 215
Credit: 8,377,778,744
RAC: 3,804,618

Each CPU task is requiring ~2 GB of RAM(!). I don't think I have ever seen tasks with such large memory requirements. Our systems are chewing away at them, but wow, very memory intensive.

Ian&Steve C.
Joined: 19 Jan 20
Posts: 3,642
Credit: 33,015,456,126
RAC: 9,333,127

Boca Raton Community HS wrote:

Each CPU task is requiring ~2 GB of ram.(!) I don't think I have ever seen tasks with such large memory requirements. Our systems are chewing away at them, but wow- very memory intensive. 

I'm not sure about currently, but I know at one time Rosetta@home was also using about 2GB per task.

 

GPUGRID's Python tasks, which are a hybrid CUDA/MT task, use ~10GB system ram, ~3GB VRAM, and 32+ cores for each task lol.

_________________________________________________________________________

Boca Raton Community HS
Joined: 4 Nov 15
Posts: 215
Credit: 8,377,778,744
RAC: 3,804,618

Ian&Steve C. wrote:

Boca Raton Community HS wrote:

Each CPU task is requiring ~2 GB of ram.(!) I don't think I have ever seen tasks with such large memory requirements. Our systems are chewing away at them, but wow- very memory intensive. 

I'm not sure about currently, but I know at one time Rosetta@home was also using about 2GB per task.

 

GPUGRID's Python tasks, which are a hybrid CUDA/MT task, use ~10GB system ram, ~3GB VRAM, and 32+ cores for each task lol.

 

True, I forgot about those GPUGRID tasks; those are something! I ran them for a while, but they really limited what else we could run at the same time.

Richard Haselgrove
Joined: 10 Dec 05
Posts: 2,139
Credit: 2,717,669,530
RAC: 1,407,032

See Task 1390224758.

I'm getting

XLAL Error - MAIN (/home/jenkins/workspace/workspace/EaH-GW-OpenCL-Testing/SLAVE/LIBC215/TARGET/linux-x86_64/EinsteinAtHome/source/lalsuite/lalapps/src/pulsar/GCT/HierarchSearchGCT.c:2240): Internal function call failed: Generic failure
2022-12-12 14:30:34.0385 (16431) [CRITICAL]: ERROR: MAIN() returned with error '-1'

when I try to run two O3MDF tasks together on the same GPU. One on its own is fine, as is one paired with a gamma-ray pulsar task on the same GPU. The device is a GTX 1660 Super with 6 GB, so it's not simply a case of 'twice 2 GB breaks the bank'. This is going to take some managing.

Ian&Steve C.
Joined: 19 Jan 20
Posts: 3,642
Credit: 33,015,456,126
RAC: 9,333,127

Richard Haselgrove wrote:

See Task 1390224758.

From your task:

OpenCL kernel failed with OpenCL error: CL_MEM_OBJECT_ALLOCATION_FAILURE

These O3MDF tasks use ~3,200 MB each, so running 2x on a 6 GB card won't work.

 

If you want to mix in gamma-ray work without ever running 2x O3MDF, try this in app_config.xml:

set GPU usage for O3MDF to 0.6
set GPU usage for gamma-ray to 0.4

This will allow you to run O3+GR (0.6 + 0.4 = 1.0) or GR+GR (0.4 + 0.4 = 0.8), but never O3+O3 (0.6 + 0.6 = 1.2 > 1.0).
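For anyone who hasn't written one before, a minimal app_config.xml sketch of that scheme could look like the following. The `<name>` values are assumptions (the gamma-ray GPU app is commonly `hsgamma_FGRPB1G`; the O3MDF name in particular should be checked against the `<name>` fields in your client_state.xml or the project's applications page):

```xml
<app_config>
  <app>
    <!-- O3MDF GW app: name is an assumption, verify in client_state.xml -->
    <name>einstein_O3MDF</name>
    <gpu_versions>
      <gpu_usage>0.6</gpu_usage>
      <cpu_usage>1.0</cpu_usage>
    </gpu_versions>
  </app>
  <app>
    <!-- Gamma-ray pulsar GPU app -->
    <name>hsgamma_FGRPB1G</name>
    <gpu_versions>
      <gpu_usage>0.4</gpu_usage>
      <cpu_usage>1.0</cpu_usage>
    </gpu_versions>
  </app>
</app_config>
```

BOINC sums the gpu_usage values of the tasks running on a GPU and only starts another task while the total stays at or below 1.0, which is what blocks the O3+O3 pairing here.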

_________________________________________________________________________
