CUDA App einsteinbinary 3.10 for Windows available for Beta Test

RandyC
RandyC
Joined: 18 Jan 05
Posts: 6602
Credit: 111139797
RAC: 0

Since we had a new binary for

Since we had a new binary for CUDA processing, I thought I'd run a few WUs to see if the situation had improved. Also running 3.05 WUs on the second cpu for the system.

Results:
Currently processing 1 CUDA WU and one cpu only WU. Est. time to completion for cpu WU is 7hrs, 1 min, 9 seconds (they're both almost complete). Est. time to completion for CUDA WU is 7hrs, 53 seconds.

Two previous CUDA WU were logged as taking 25,277.44/25,388.73 secs each.
Three 3.05 WUs that completed also took about the same time to process.

My opinion: the CUDA app (on an AMD X2 5200 system) is not viable presently because it takes both a cpu AND the gpu to process without any noticable improvement in throughput.

AND...if processing SETI on the same system, SETI will monopolize the gpu and not allow E@H to run.

Seti Classic Final Total: 11446 WU.

TJ
TJ
Joined: 11 Feb 05
Posts: 178
Credit: 21041858
RAC: 0

RE: If I understand

Message 94400 in response to message 94396

Quote:
If I understand correctly then the current run is a cuda assisted cpu run. The gpu is used for some calculations but not all. Right now your are mainly limited by the cpu. The gpu gives you just a boost of ca. 20-50%.
If your Q9300 is not overclocked then every 4GHz Core2 runs faster than your cpu with cuda help.
When there is a standalone app for the gpu then yours will fly ;)

The CPU is needed to feed the GPU from time to time and the GPU sends it back. So the CPU can be seen as a "handler" in the process. So for now there will be always a bit of help from the CPU (until they find other ways to handle).

Greetings from
TJ

TJ
TJ
Joined: 11 Feb 05
Posts: 178
Credit: 21041858
RAC: 0

RE: I have crunching a CUDA

Message 94401 in response to message 94393

Quote:

I have crunching a CUDA WU with a GTX 285 in 5,5 hours. My wingmen has crunched the same WU with a CPU in 6 hours.

This is not a big difference .......

Hello,

I run the einstein cuda-app on a i7 with a GTX285.
It takes about 4.7-5 hours to complete, while I have seen wingman with faster CPU's (non-cuda) and they take 7-8.3 hours to complete. So I see a difference.

Greetings from
TJ

rbpeake
rbpeake
Joined: 18 Jan 05
Posts: 266
Credit: 1129367797
RAC: 726012

RE: RE: I have crunching

Message 94402 in response to message 94401

Quote:
Quote:

I have crunching a CUDA WU with a GTX 285 in 5,5 hours. My wingmen has crunched the same WU with a CPU in 6 hours.

This is not a big difference .......

Hello,

I run the einstein cuda-app on a i7 with a GTX285.
It takes about 4.7-5 hours to complete, while I have seen wingman with faster CPU's (non-cuda) and they take 7-8.3 hours to complete. So I see a difference.


I crunched a 3.10 using my CPU only, and found that compared to using the GPU it was about 80% as fast. In other words, the GPU crunches 20% faster than CPU by itself.

As others have commented, the GPU has a lot of potential I believe to become much faster while using less of the CPU. :)

RandyC
RandyC
Joined: 18 Jan 05
Posts: 6602
Credit: 111139797
RAC: 0

RE: Since we had a new

Message 94403 in response to message 94399

Quote:

Since we had a new binary for CUDA processing, I thought I'd run a few WUs to see if the situation had improved. Also running 3.05 WUs on the second cpu for the system.

Results:
Currently processing 1 CUDA WU and one cpu only WU. Est. time to completion for cpu WU is 7hrs, 1 min, 9 seconds (they're both almost complete). Est. time to completion for CUDA WU is 7hrs, 53 seconds.

Two previous CUDA WU were logged as taking 25,277.44/25,388.73 secs each.
Three 3.05 WUs that completed also took about the same time to process.


I've changed my mind on the following...comparing apples/oranges.

Quote:

My opinion: the CUDA app (on an AMD X2 5200 system) is not viable presently because it takes both a cpu AND the gpu to process without any noticable improvement in throughput.


The above is false because I'm trying to compare E@H WUs to APB1 WUs. I tried searching around for some non-CUDA APB1 WUs, but they've all rolled out of my stats. The rest of this post is valid though.

Quote:

AND...if processing SETI on the same system, SETI will monopolize the gpu and not allow E@H to run.


Seti Classic Final Total: 11446 WU.

Gerry Rough
Gerry Rough
Joined: 1 Mar 05
Posts: 102
Credit: 1847066
RAC: 0

RE: I crunched a 3.10 using

Message 94404 in response to message 94402

Quote:

I crunched a 3.10 using my CPU only, and found that compared to using the GPU it was about 80% as fast. In other words, the GPU crunches 20% faster than CPU by itself.

As others have commented, the GPU has a lot of potential I believe to become much faster while using less of the CPU. :)

For a while there it seemed that the collective BOINC CUDA channeling was working. It seemed that the GPU was only using a small fraction of the CPU. But that seems to have dissipated for the time being with E@h. It seems we have a ways to go before we get to significant throughput increases of the magnitude we had hoped. Like others here though, I suspect it might take a while but we will eventually see some very significant improvements in throughput. It's slow going, but we'll get there. :-)


(Click for detailed stats)

Alex
Alex
Joined: 1 Mar 05
Posts: 451
Credit: 507044931
RAC: 111931

Hi, i downloaded all as

Hi,

i downloaded all as posteted, installed it and started crunching.
Threre are two things i want to report:
fist, the usage of the gpu is low, it reaches only 62 deg (cent.), other applications (gpugrid, if it does not hang, or Seti) heat it to 74 deg.
Three times it caused a complete system hang just when finishing the wu at the beginning of the upload.

Vista64, E8400, 6GB, GTX260 and a lot of troubles.

Regards,

Alexander

TJ
TJ
Joined: 11 Feb 05
Posts: 178
Credit: 21041858
RAC: 0

To the developers, The

To the developers,

The previous days the cuda-app. ran perfect and takes about 4 hours to complete on my i7 with a GTX285. The temperature of the GPU is very low and I like that. Good job.

Today BOINC has downloaded 6 cuda-wu’s, however one can be run at a time. So 7 of the CPU’s are doing now crunching for Einstein and that I do not like. It should be fine when they run 4 to 6 non cuda ABP and/or S5R5 WU’s. Can that be programmed?

Thanks and success.

Greetings from
TJ

samuel7
samuel7
Joined: 16 Feb 05
Posts: 34
Credit: 1579363
RAC: 0

This app has started fine on

This app has started fine on my 9800 GT. Thanks for developing an app compatible with 64-bit Vista/Win7! Other specs: Q9550, BOINC 6.6.36, Driver 190.38, v2.3 dll's.

Runtime projection is ~5 hours compared to 5h 45min with the 3.09 CPU app. So it's in effect slower since the GPU resource is used. First task 137298125.

Following a work request for both CPU and CUDA work I got both but also the 4-hour deferral. The scheduler log is not logical:

Quote:
2009-08-24 17:28:20.6143 [PID=24659] [HOST#1758399] Sending [RESULT#137355097 h1_0892.30_S5R4__928_S5R5a_1] (est. dur. 14987.39 seconds)
2009-08-24 17:28:20.6177 [PID=24659] [locality] in_send_results_for_file(h1_0892.30_S5R4, 1) prev_result.id=137355097
2009-08-24 17:28:20.6290 [PID=24659] [send] Didn't find anonymous platform app for einstein_S5R5


It sent S5R5 work despite not finding an app for it? Or maybe it wanted an S5R5 CUDA entry. My app_info includes 3.05 for S5R5, 3.09 for CPU ABP1 and 3.10 for CUDA ABP1.

Edit - link to scheduler log, spelling

Stranger7777
Stranger7777
Joined: 17 Mar 05
Posts: 436
Credit: 429535992
RAC: 76957

I've found how to run 4 S5R5

I've found how to run 4 S5R5 tasks while working on ABP1 on CUDA device.
It's simple. Just freeze all the ABP tasks, then restart BOINC. And after that unfreeze one of ABP tasks and have fun ;)
It seems to run smoothly (but before that I killed some WU's with old driver). Will keep eye on it and report after a while.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.