CUDA application for the O3ASHF search

DF1DX
DF1DX
Joined: 14 Aug 10
Posts: 105
Credit: 3732632003
RAC: 3175138

I hesitate to say this, but

I hesitate to say this, but version 1.11 does not change anything significant compared to 1.08/1.10.

Host 12801270.

John
John
Joined: 17 Jan 18
Posts: 5
Credit: 2503850538
RAC: 15329006

Actually its worse than 1.08

Actually its worse than 1.08 (The recalc stage takes longer) for this host (https://einsteinathome.org/host/13160863) which negates the CUDA advantage. I'll try to create an app_info to target 1.08

 

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4307
Credit: 249639341
RAC: 34254

DF1DX wrote: I hesitate to

DF1DX wrote:

I hesitate to say this, but version 1.11 does not change anything significant compared to 1.08/1.10.

Host 12801270.

I know. 1.12 should do better.

BM

Jimbocous
Jimbocous
Joined: 31 Mar 17
Posts: 67
Credit: 1179473152
RAC: 926057

Just FYI,First 1.11

Just FYI,

1.11 https://einsteinathome.org/task/1587604598 and others crash on startup.

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)</message>
<stderr_txt>

1.12 https://einsteinathome.org/task/1587618661 and others crash at the 49.5 percent mark.

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<message>
process exited with code 255 (0xff, -1)</message>
<stderr_txt>

Host https://einsteinathome.org/host/12791541.


Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4307
Credit: 249639341
RAC: 34254

Hm. 1.12 does now do the

Hm. 1.12 does now do the "recalc" on the GPU, but it seems that this code doesn't work (yet).

BM

DF1DX
DF1DX
Joined: 14 Aug 10
Posts: 105
Credit: 3732632003
RAC: 3175138

Beta 1.13: Error while

Ian&Steve C.
Ian&Steve C.
Joined: 19 Jan 20
Posts: 3923
Credit: 45267572642
RAC: 63301754

yeah V1.13 error right

yeah V1.13 error right away.

XLAL Error - XLALFstatCheckSFTLengthMismatch (/home/boinc/lalsuite/EinsteinAtHome/source/lalsuite/lalpulsar/lib/ComputeFstat.c:1895): Length of input SFTs (1800 s) must be less than 1730.29 s for CW signal with
frequency = 1237, spin-down/up = -2.7e-09, binary asini = 0, period = 0, ecc = 0,
in order to stay below an Fstat-mismatch of 0.01.

_________________________________________________________________________

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4307
Credit: 249639341
RAC: 34254

Yep - new version underway.

Yep - new version underway.

BM

Ian&Steve C.
Ian&Steve C.
Joined: 19 Jan 20
Posts: 3923
Credit: 45267572642
RAC: 63301754

nice. 1.14 is running now.

nice. 1.14 is running now. need to check that it makes it past the 49.5% transition and to completion.

so all stages should be on GPU now? no long CPU-only portion?

_________________________________________________________________________

Ian&Steve C.
Ian&Steve C.
Joined: 19 Jan 20
Posts: 3923
Credit: 45267572642
RAC: 63301754

seems like it's

seems like it's working.

there was a clear change at 40% where the GPU memory bandwidth use dropped to almost nothing, and the VRAM use dropped below 1GB. then at 50%, the task restarted, reloaded the VRAM to ~2.4GB, then the task resumed with high GPU and memory bus utilization. I assume it will do the same thing at 90-100%. very nice.

_________________________________________________________________________

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.