If you look at just a single result, you can work out your problem.
A result picked at random shows a run time of 21,254 s with a CPU time of just 44 s. The GPU tasks are being completely starved of CPU support.
Your host shows as 4 cores. Is that 4C/4T or 2C/4T? In either case (it doesn't really matter) it looks like you are running CPU tasks on all available threads. For 27 Sep, I counted 11 CPU tasks returned with that particular return date. The average time for a CPU task is ~34.5 ksec, so a rough ballpark calculation says that 11 tasks would consume ~380 ksec. A full day is 86.4 ksec, so the number of active cores is 380/86.4 ≈ 4.4 > 4, and that is probably the problem.
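As a sanity check, the ballpark arithmetic above can be written out explicitly; the numbers (11 tasks in a day, ~34.5 ksec each) are the ones quoted in this post:

```python
# Ballpark estimate of how many cores the returned CPU tasks imply.
# Numbers taken from the post above: 11 tasks returned in one day,
# ~34.5 ksec of run time per CPU task.
tasks_per_day = 11
avg_task_secs = 34_500      # ~34.5 ksec per CPU task
day_secs = 86_400           # one full day in seconds

cpu_secs_consumed = tasks_per_day * avg_task_secs    # ~379.5 ksec
active_cores = cpu_secs_consumed / day_secs

print(f"estimated busy cores: {active_cores:.2f}")   # ~4.39, i.e. > 4
```

More than 4 cores' worth of CPU work per day on a 4-thread host leaves nothing over to feed the GPU tasks.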
Just change your prefs for that machine to allow BOINC to use 75% of the CPUs rather than all of them, and I'm guessing the GPU tasks will speed up enormously.
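If you prefer not to change the website preferences, the same limit can also be set locally with a `global_prefs_override.xml` file in the BOINC data directory. A sketch, assuming a standard BOINC client (the override takes effect after re-reading local prefs or restarting the client):

```
<global_preferences>
   <max_ncpus_pct>75.0</max_ncpus_pct>
</global_preferences>
```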
Update and new problems with this host.
Indeed, after limiting the CPU load to 75% (leaving one core free of CPU-only tasks), GPU tasks completed much faster. Binary Radio Pulsar Search (MeerKAT) v0.12 () windows_x86_64 calculation times improved from ~20,000 s to ~1,200 s, but 100% of them fail with “Validate error” or “Error while computing”.
So I had to disable GPU calculations on this computer until the application is updated.
It looks to me that the validation rates, in particular of the Linux 0.17 app version, got a lot better over time, without us changing anything on the project side. I see <1.5% invalids for this app version. Does anyone know why this is? Are Linux people with a lot of invalids just shying away from BRP7?
All of my invalids have come down to ~3% or so. Everyone on my team and I (mostly Linux users with Petri's app) have seen the same trend: validations have gotten a lot better. It looks to be the same story with everyone on the leaderboard, including many Linux users of both AMD and Nvidia.
Really? cuda55 for Windows and cuda102 for Linux. Why are Windows users getting dealt the lower hand with these applications?
Some of us cannot run Linux and need Windows only. Apps should be equal across platforms.
We did a comparison between CUDA 5.5 and CUDA 11 and found that for our application the performance gain depends on the card, of course, but is always <8%. Our threshold is 10% averaged across all cards; below that we judge the benefit not worth the additional effort of building, deploying and maintaining another app version. So for Windows we went with the more compatible CUDA version, to harvest the computing power of even older cards (of which we ourselves do have a few).
CUDA on Linux is a tricky thing, as the pre-compiled shared libraries from NVidia link to a specific version of libgcc, and thus must be linked with that version of gcc. However, for our application the gcc version the CPU code is compiled with is pretty particular in order to ensure good validation. I feel pretty lucky to have found a combination of OS, CUDA libs and gcc version that works reasonably well for us.
CUDA on Linux is a tricky thing, as the pre-compiled shared libraries from NVidia link to a specific version of libgcc, and thus must be linked with that version of gcc.
I know you said this before, but I'm not sure I understand exactly what you mean about specific versions that "must" be used. I've never had any issue building any CUDA apps with seemingly any version of gcc.
Our custom app is currently built with the latest CUDA 12.2 Update 2. Normally I build in my Ubuntu 18.04 VM, which by default has gcc 7.5.0; I've built everything from CUDA 9 to CUDA 12 apps in this environment. I only maintain this old 18.04 environment so that I don't have to deal with building on a latest release and someone trying to use the app on an older OS getting locked out by the glibc dependency. After your comments about using 7.3 for better validation, I simply installed gcc 7.3.0 into that VM (using a script I found on GitHub), then set some env vars to bring that version of gcc into my build environment (terminal session). The app built just fine, and does indeed seem to validate better. And the binaries were definitely different between the 7.5.0 and 7.3.0 builds.
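"Set some env vars" above can be sketched roughly as follows; the install prefix is a hypothetical example, and `-ccbin` is nvcc's flag for selecting the host compiler:

```shell
# Hypothetical prefix where gcc 7.3.0 was installed (example path only).
GCC_PREFIX=/usr/local/gcc-7.3.0

# Bring that toolchain to the front of this terminal session's environment.
export CC="$GCC_PREFIX/bin/gcc"
export CXX="$GCC_PREFIX/bin/g++"
export PATH="$GCC_PREFIX/bin:$PATH"

# nvcc can then be pinned to the same host compiler for the CPU-side code:
#   nvcc -ccbin "$CXX" -O2 -o brp7_app app.cu
echo "host compiler: $CC"
```

Anything built in that session picks up gcc 7.3.0 while the rest of the system keeps its default compiler.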
I can't complain because it's working well now. I just don't understand what you mean that certain versions of gcc "must" be used, as that doesn't seem to be the case. Maybe certain versions work better than others for validation, but there's nothing stopping the build from succeeding in my experience.
Thanks. Ubuntu 18.04 is the machine I built this App on; I'll try building an official App with gcc 7.3 there, thanks!
What I said was based on my experience trying to build a CUDA 5.5 app for Linux, which simply didn't work with any gcc > 4. But apparently NVidia removed that libgcc dependency in newer versions of CUDA. Thanks, I'll try that again.
Ah, you could be right about that with older CUDA versions. I did try to build an old CUDA version once (CUDA 8.0, I think) and could never get it to work. Building older CUDA apps seems like it was much more of a bear.
The cuda102 app you already have up for the Linux beta worked well too when I tested it.
The speed of cuda55 on Windows is the same as cuda102 on Linux, as far as I can tell, but cuda55 on Windows validates much better.
Not sure what there is to complain about there.
Haven't run any yet because I was going off of how poorly the old CUDA versions run on Moo compared to newer versions of CUDA.
Good to know the performance is the same between them here, then. Thanks.