I have reported massive error frequencies regarding this type of WUs on GPUs earlier.
Now again: It seems, these WUs generally pose an issue.
Just two examples:
http://einsteinathome.org/workunit/216250252
http://einsteinathome.org/workunit/216192690
Note that it is not GPU-dependent, meaning it affects Intel IGP, NVIDIA and also AMD/ATI.
Could someone from the project team please look into this to save us from producing hot air instead of useful results data?
Michael.
RNA World - A Distributed Supercomputer to Advance RNA Research
Copyright © 2024 Einstein@Home. All rights reserved.
Parkes PMPS XT on GPUs - a general problem?
)
I am unsure where you are seeing it affects NVIDIA or AMD/ATI. There is known bugs with some of the Intel drivers as I am sure someone will point out. The second one in you list is awaiting a good return and has crunched OK on an ATI card.
The problem in this case is
)
The problem in this case is the Intel driver
--> http://einsteinathome.org/host/6684045/tasks&offset=0&show_names=0&state=3&appid=29
RE: I am unsure where you
)
Well, the AMD/ATI card issue I had posted before on this board. I am currently re-trying with my AMD R9 290X, but the one WU I have yet completed and which you mentioned still has not been validated (pending).
The Intel issue apparently is a known problem (although I could not extract a solution from the posting below). I use a Dell system where I cannot install any other driver than those provided by Dell because I use other custom graphics hardware whose proper function might be affected by manually changing the drivers. Only a week ago the latest Dell system driver update allowed me to at least receive WUs for the Intel HD 4000 IGP from the Einstein@home project, because before this IGP was not recognized as an OpenCL capable device.
In case of NVIDIA, I made a mistake: Taking these data
http://einsteinathome.org/workunit/216192690
I clicked on this computer:
http://einsteinathome.org/host/10141451
which carries an NVIDIA GPU. So I concluded that WU had been computed using that GPU without taking a look into more detail. But today, inspired by your posting, by looking further into the log after realizing that it was an Intel i7 CPU system that also holds an Intel HD 4000 IGP:
http://einsteinathome.org/task/494397855
I figured that this WU was actually NOT computed on the NVIDIA GPU but instead on the Intel IGP thereby explaining the error.
As said above, this posting:
makes it difficult for me to understand how exactly to solve the problem.
Michael
RNA World - A Distributed Supercomputer to Advance RNA Research
RE: The Intel issue
)
I run the iGPU on a Dell platform (Optiplex 9020 with Haswell i5), and I can - eventually - use the generic Intel driver installers to switch between drivers at will. To get started, you may need to force one to install manually: Intel's 'Installation ReadMe' file is actually pretty good, if you follow the instructions exactly. I found the 'have disk' route the best way to get started.
I've got to rush out now, but if you search the boards for "10.18.10.3621" - the most reliable driver found so far - your should be able to find a download link. Or some kind soul might post it here.
RE: ... Or some kind soul
)
This message contains a link to the win64 zip file.
Richard's install method using the README file instructions.
Cheers,
Gary.
Now that a few of these WUs
)
Now that a few of these WUs processed singly by my AMD/ATI 290X board
http://einsteinathome.org/host/11761322/tasks&offset=0&show_names=1&state=0&appid=29
have been validated, it seems that the previous errors which I got with this same system were a result of parallel processing of GPU WUs.
Michael.
RNA World - A Distributed Supercomputer to Advance RNA Research
RE: it seems that the
)
Maybe there was a temperature problem with the GPU? Did you check that?
RE: ... it seems that the
)
I would think it's not just the parallel processing. I'm running 4x on lots of HD7850s and not seeing any issues at all. It's likely to be something else as well (like temperature, as suggested) that's really causing the problem.
Cheers,
Gary.
How do you limit temperature
)
How do you limit temperature on a GPU?
I know how to check the temperature of a Nvidia GPU through nvidia-settings or nvidia control panel programs, but not how to establish an upper limit.
There are some programs for CPUs, but for GPUs?
Thanks,
Yacob
RE: How do you limit
)
efmer's TThrottle program will do it.
Also, many of the standard GPU monitoring programs give you the possibility to specific the relationship between fan speed and GPU temperature. While not a means of imposing an absolute limit, you can influence matters this way.