Gamma-ray pulsar binary search #1 on GPUs

[AF>EDLS]GuL
Joined: 15 Feb 06
Posts: 15
Credit: 227794659
RAC: 0

Bernd Machenschalk wrote:
The 1.17 app has a minimum RAM requirement of ~750MB, 1.18 requires ~1GB.

Thanks, Bernd, for the clarification. :-)

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4312
Credit: 250424982
RAC: 35300

Sorry, there's a flaw in the 1.18 OpenCL code that makes the OSX ATI OpenCL driver (and only that driver) error out. Usually we find such errors ourselves, but this one is in a part of the code that requires double precision support, which our card doesn't have. I'll try to get this fixed and publish a new version later today. For now the 1.18 for OSX has been deprecated.

BM

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4312
Credit: 250424982
RAC: 35300

1.19 Beta released (OSX only)

BM

Kailee71
Joined: 22 Nov 16
Posts: 35
Credit: 42623563
RAC: 0

Bernd,

great news - many thanks!

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4312
Credit: 250424982
RAC: 35300

FWIW I just shuffled our graphics cards so that we now have a double-precision ATI card in our 10.9 Mac Pro, so we should be able to find such problems next time and fix them before release.

BM

fastbunny
Joined: 20 Apr 06
Posts: 22
Credit: 91424422
RAC: 0

Wow, processing time on a 7870 on Windows is cut by over 30%, even though a 7870 doesn't have very good double precision performance as far as I know. Great work!

Defender
Joined: 17 Jul 12
Posts: 19
Credit: 316032944
RAC: 77747

Matt_145 wrote:
1.18 is a large improvement on my GTX 1080. Running 3 tasks at a time, completion times went from around 35 minutes to around 23 minutes.

I can confirm this for my GTX 1070. Running 3 tasks at a time, completion times went from around 42 minutes to around 28 minutes.
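
For anyone who prefers to read these figures as throughput rather than per-task time, here is a minimal Python sketch. It assumes the quoted completion times are wall-clock minutes per task while 3 tasks run concurrently, and that the GPU stays fully loaded; the function name `tasks_per_hour` is made up for illustration.

```python
def tasks_per_hour(concurrent_tasks: int, minutes_per_task: float) -> float:
    """Steady-state throughput when several tasks run side by side.

    Assumption: every task takes `minutes_per_task` of wall-clock time
    and `concurrent_tasks` of them are always in flight.
    """
    return concurrent_tasks * 60.0 / minutes_per_task


# Approximate completion times quoted above: (1.17 minutes, 1.18 minutes).
runs = {
    "GTX 1080": (35, 23),
    "GTX 1070": (42, 28),
}

for gpu, (old_min, new_min) in runs.items():
    old_rate = tasks_per_hour(3, old_min)
    new_rate = tasks_per_hour(3, new_min)
    print(f"{gpu}: {old_rate:.1f} -> {new_rate:.1f} tasks/hour "
          f"({new_rate / old_rate:.2f}x)")
```

Since the posted times are only "around" values, the ratios are rough too, but both cards come out at about one and a half times the 1.17 throughput.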

Proud member of SETI.Germany

Kailee71
Joined: 22 Nov 16
Posts: 35
Credit: 42623563
RAC: 0

Bernd Machenschalk wrote:
1.19 Beta released (OSX only)

Hi Bernd,

sorry, bad news: these are bombing out on my machine too. Please see here: https://einsteinathome.org/host/12464084/tasks/error

Let us know when there's another new beta to try; I'll run the old 1.17s for now.

Cheerio,

Kailee.

Schwabenschaffer
Joined: 25 Apr 16
Posts: 2
Credit: 25503097
RAC: 0

When 1.18 first came out I also had a lot of failures.
I increased cooling and reduced the core clock from 100% to about 70% (R9 290).

At the moment, only 1 WU has failed in the last 14 hours (before I lowered the clock, 18 WUs of the 1.18 beta failed).

GPU-Z shows that GPU load increased from about 30% to 55%.
I have the logs if needed.
The system runs 24 hours a day.

I also have the feeling that lowering the clock and improving cooling is the best solution. Throttling always kicks in at 65°C.

(1.18) 1250 sec to complete a WU -> 100% clock speed
(1.18) 1310 sec to complete a WU -> 70% clock speed
Lowering the clock speed slows completion by about 5% in this case, but I get valid results :-)

For comparison:
(1.17) 2800 sec to complete a WU -> 100% clock speed

Always running 2 WUs at once.
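
The percentages behind these timings can be checked with a few lines of Python. This is only a sketch of the arithmetic, using the three per-WU times listed above; the helper name `percent_change` and the variable names are made up for illustration.

```python
def percent_change(before: float, after: float) -> float:
    """Relative change from `before` to `after`, in percent.

    Positive means the run got slower, negative means it got faster.
    """
    return (after - before) / before * 100.0


# Per-WU completion times in seconds, as posted (2 WUs at once).
t_117_full_clock = 2800.0   # 1.17 at 100% clock
t_118_full_clock = 1250.0   # 1.18 at 100% clock
t_118_reduced = 1310.0      # 1.18 at about 70% clock

# Cost of underclocking within 1.18.
slowdown = percent_change(t_118_full_clock, t_118_reduced)

# Net effect of 1.18 at reduced clock versus 1.17 at full clock.
net_change = percent_change(t_117_full_clock, t_118_reduced)

print(f"Underclocking costs {slowdown:.1f}% per WU")
print(f"1.18 at 70% clock vs 1.17 at 100%: {net_change:.1f}%")
```

Even with the clock reduced, 1.18 still finishes a WU in less than half the time 1.17 needed, so the underclock costs little compared with the gain from the new app.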

1.18 is a great improvement; thanks to Christophe for using his holidays for this. Let's fix the failures.

MarkHNC
Joined: 31 Aug 12
Posts: 37
Credit: 170965842
RAC: 0

My Xeon-based crunching-only machine has managed to get through 60 of the 1.18's, so I thought I'd examine performance so far. I'm also including figures for my GTX 650. The GTX 650 machine disables GPU computing while in use, and the machine got used a lot this past holiday weekend, so it hasn't had a chance to work through many 1.18's. Both machines show very similar performance differences.

Machine                                                 Version  Count   Avg Time  Difference
Xeon E5-2670v1 @ 2.6GHz, GTX 960 SSC (2GB) x 2 tasks    1.17       264   4,408.28
                                                        1.18        60   2,705.55  38.62%
Phenom II X4 965 @ 3.8 GHz, GTX 650 SC (2GB) x 1 task   1.17        22  18,881.89
                                                        1.18         4  11,164.91  38.10%
