FGRP - High invalid rate on Nvidia 4090?

DF1DX

Joined: 14 Aug 10

Posts: 105

Credit: 3743712003

RAC: 3467465

FYI: In the meantime i

1 May 2023 9:29:15 UTC

Message 211764

(moderation:

)

FYI:

In the meantime i have now calculated over 700 BRP7 WUs on the 4090.
About 11% of these are currently invalid.

From the remaining FGRPB WUs, about 12% are invalids. With the optimized AIO app from petri, there were up to 20 % invalids here.

So far, not a single error with GW-WUs.

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4307

Credit: 249728786

RAC: 34763

If the "invalid"s rate on

2 May 2023 8:14:14 UTC

Message 211820

(moderation:

)

If the "invalid"s rate on BRP7 is also higher on the 4090, I would also be interested in whether there's a difference in validation between the Windows (CUDA) and the Linux (OpenCL) app.

Both FGRP and BRP Apps are mainly FFT bound, and the FFT happens in a pretty early step. Thus the result of the FFT has a much higher impact on the overall result as e.g. in the GW app.

However, the different Apps use different libraries for the FFT:

* FGRP uses "clFFT", originally developed by AMD for their cards, now OpenSource on GitHub

* BRP CUDA (BRP7 Windows) uses cuFFT

* BRP OpenCL uses an own development based on an Apple OpenCL code example, which seems to be derived from an early cuFFT version

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4307

Credit: 249728786

RAC: 34763

Hm. The overall "invalid"

2 May 2023 8:34:44 UTC

Message 211821

(moderation:

)

Hm. The overall "invalid" rate of 4090s on BRP7 is <3%, which is even lower than the overall "invalid" average there (~3,4%). However, in the DB i currently only have 4090 results from hosts running Windows. Actually, from Linux I only have 5(!) valid results in total from BRP7.

Boca Raton Comm...

Joined: 4 Nov 15

Posts: 235

Credit: 9988105586

RAC: 20733902

Bernd Machenschalk wrote: If

2 May 2023 11:41:44 UTC

Message 211827 in response to message 211821

(moderation:

)

Bernd Machenschalk wrote:

If the "invalid"s rate on BRP7 is also higher on the 4090, I would also be interested in whether there's a difference in validation between the Windows (CUDA) and the Linux (OpenCL) app.

Both FGRP and BRP Apps are mainly FFT bound, and the FFT happens in a pretty early step. Thus the result of the FFT has a much higher impact on the overall result as e.g. in the GW app.

However, the different Apps use different libraries for the FFT:

* FGRP uses "clFFT", originally developed by AMD for their cards, now OpenSource on GitHub

* BRP CUDA (BRP7 Windows) uses cuFFT

* BRP OpenCL uses an own development based on an Apple OpenCL code example, which seems to be derived from an early cuFFT version

Looks like I have some learning to do about FFT! This is interesting that different versions are used.

Bernd Machenschalk wrote:

Hm. The overall "invalid" rate of 4090s on BRP7 is <3%, which is even lower than the overall "invalid" average there (~3,4%). However, in the DB i currently only have 4090 results from hosts running Windows. Actually, from Linux I only have 5(!) valid results in total from BRP7.

I am going to try and get one of the 4090 systems working on BRP7 this week.

Boca Raton Comm...

Joined: 4 Nov 15

Posts: 235

Credit: 9988105586

RAC: 20733902

Got it working on this host.

2 May 2023 15:57:06 UTC

Message 211839

(moderation:

)

Got it working on this host. It will crunch BRP7 full-time for the rest of the week to give us a good sample size (adding to DF1DX completed work units). It is finishing a BRP7 work unit in ~3:09.

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4307

Credit: 249728786

RAC: 34763

Bernd Machenschalk

3 May 2023 12:30:00 UTC

Message 211910 in response to message 211821

(moderation:

)

Bernd Machenschalk wrote:

Hm. The overall "invalid" rate of 4090s on BRP7 is <3%, which is even lower than the overall "invalid" average there (~3,4%). However, in the DB i currently only have 4090 results from hosts running Windows. Actually, from Linux I only have 5(!) valid results in total from BRP7.

The OS distinction/selection in my query was somewhat wrong. Actually on BRP7, Linux hosts have roughly 10% invalid results, while Windows hosts only have 0,5%. Judging from the above I'd guess that the problem lies in the OpenCL (compiler in the) driver, the CUDA version of BRP7 seems to work fine.

So if you are on Windows and want to avoid these invalid rates, my recommendation for now would be to restrict yourself (or your hosts) to run BRP7.

Here's the thing with the Linux CUDA version: we found that the gcc version used to build the CPU part of the application is crucial for validation (some data preparation is done beforehand on the CPU, and this needs to yield the exact same results). However I couldn't get the libgcc to link with the CUDA libraries, at least CUDA 5.5. I see if I can get this app to link with a newer CUDA version.

Ian&Steve C.

Joined: 19 Jan 20

Posts: 3925

Credit: 45422232642

RAC: 63224699

Petri has a Linux CUDA 11 and

3 May 2023 15:01:59 UTC

Message 211918

(moderation:

)

Petri has a Linux CUDA 11 and 12 version of BRP7 that validates well. At least on 30-series cards and earlier. Not sure about 40-series.

_________________________________________________________________________

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4307

Credit: 249728786

RAC: 34763

I published a BRP7 Linux app

10 May 2023 14:08:24 UTC

Message 212315

(moderation:

)

I published a BRP7 Linux app version (0.16) with CUDA 10.2. This was built on an Ubuntu 18.04 and my not run on other systems with older libc. It's Beta anyway. You may want to give it a try.

Boca Raton Comm...

Joined: 4 Nov 15

Posts: 235

Credit: 9988105586

RAC: 20733902

Bernd Machenschalk wrote: I

10 May 2023 15:06:30 UTC

Message 212320 in response to message 212315

(moderation:

)

Bernd Machenschalk wrote:

I published a BRP7 Linux app version (0.16) with CUDA 10.2. This was built on an Ubuntu 18.04 and my not run on other systems with older libc. It's Beta anyway. You may want to give it a try.

On it! The older version of the app gave us the following results (some pending):

Pending (85)
Valid (293)
Invalid (57)
Error (0)

I will enable beta apps and then run more of these on the 4090 for this week. Will it automatically receive the version 0.16 when it requests tasks?

DF1DX

Joined: 14 Aug 10

Posts: 105

Credit: 3743712003

RAC: 3467465

I only get the previous

11 May 2023 9:35:36 UTC

Message 212352

(moderation:

)

I only get the previous version 0.15 with OpenCL at the moment. Yes, beta is enabled.

FGRP - High invalid rate on Nvidia 4090?

Forums › Cruncher's Corner

Comment viewing options

Forums › Cruncher's Corner