Lots of validation errors

Ryan
Joined: 25 Nov 14
Posts: 37
Credit: 128091735
RAC: 15975
Topic 199890

Is someone more knowledgeable than me able to have a look at the results I have returned and work out why most are failing to validate?

https://einsteinathome.org/host/11700141/tasks

 

Thanks :)

archae86
Joined: 6 Dec 05
Posts: 3145
Credit: 7051824931
RAC: 1642016

If you search or look around on these forums you'll find lots of advice on this sort of case.  I suggest you review some of the many times Gary Roberts has addressed this sort of question from other users.

There is more than one possible cause, but a unifying theme is that quite likely something in your system is being clocked faster than it can reliably get the right answer to these WUs as it is currently operating.

One common reason, often top of the list to check, is a cooling deficiency: a failed fan, a buildup of dust or other material blocking air flow, a deteriorated interface material between your GPU and the heat sink, or a configuration error that means the fans are simply not trying very hard.

If you are knowingly overclocking, in your situation it would be wise to fully remove the overclocking and run a while to see if it changes the error rate. 

If you have ruled out fixable cooling trouble, I would personally recommend reducing the clock rate further, even if that means going below stock.

I base this recommendation on your comments and on the extremely high invalid rate shown in your task list.  I did not review your stderr files, which just possibly might reveal something useful, but in this case are not likely to.

Ryan
Joined: 25 Nov 14
Posts: 37
Credit: 128091735
RAC: 15975

The GPU is not overclocked and is watercooled, so I don't think it's that.

Tom*
Joined: 9 Oct 11
Posts: 54
Credit: 291796664
RAC: 1020726

As others have found out, multiple GPU tasks on a FIJI can cause invalids.

How many work units are you running in parallel? If it is more than one, try running just one at a time.

Ryan
Joined: 25 Nov 14
Posts: 37
Credit: 128091735
RAC: 15975

Only running one work unit at a time.

Christian Beer
Joined: 9 Feb 05
Posts: 595
Credit: 124378185
RAC: 307463

I checked the recent results your computer sent back to us. They contain a non-number in a column where there should be a number: I see #INF, which usually means the GPU did some wrong calculations. In the BRP4 case we see a power that is too high to be plausible, which also stems from the GPU having problems doing math. This started on August 10th, so if you did a driver update shortly before that, you should probably revert to the previous driver and see whether that fixes it.
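For anyone who wants to look for this kind of problem in their own output, here is a minimal sketch of the check described above. It assumes the result is plain text with whitespace-separated columns, which may not match the real result file layout; the function name `find_non_finite`, the sample rows, and the column index are all made up for illustration.

```python
import math

def find_non_finite(lines, column):
    """Return (line_number, raw_token) pairs where `column` is not a finite number.

    Assumption: each line holds whitespace-separated columns. This is a
    hypothetical layout, not the documented result file format.
    """
    bad = []
    for number, line in enumerate(lines, start=1):
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        # A missing column is treated as a bad value too.
        token = fields[column] if column < len(fields) else ""
        try:
            value = float(token)
        except ValueError:
            value = math.nan  # unparseable text such as "1.#INF"
        if not math.isfinite(value):
            bad.append((number, token))
    return bad

# Made-up sample rows; the last column stands in for the computed power.
sample = [
    "0.125 3.5 17.2",
    "0.250 3.5 1.#INF",
    "0.375 3.5 inf",
]
print(find_non_finite(sample, column=2))
```

Any row it reports has a value in that column that is not a usable number, which is the symptom described in the post above. It cannot tell you whether the cause is the driver, the clocks, or something else.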

Ryan
Joined: 25 Nov 14
Posts: 37
Credit: 128091735
RAC: 15975

OK, thanks. I think I did do a driver update then, so I'll try a full uninstall and roll back.

 
