Extreme CUDA processing?

Pooh Bear 27

Joined: 20 Mar 05

Posts: 1376

Credit: 20312671

RAC: 0

RE: I haven't been keeping

20 Jan 2009 11:38:54 UTC

Message 90141 in response to message 90136

(moderation:

)

Quote:

I haven't been keeping up with this subject so I apologise if this has been asked before. Is there an ATI equivalent of NVidia's CUDA? If so, what, where and when? If not, why not? It must be in an FAQ somewhere but the internet is . . . big. Thanks.

Partially the reason there is a CUDA/NVidia one and not an ATI one is because NVidia put programmers on the BOINC code to help develop the CUDA application. The other video cards would have to be done on the side by the BOINC programmers who are already busy trying to get other things done.

mikey

Joined: 22 Jan 05

Posts: 12963

Credit: 1884531203

RAC: 14160

RE: RE: I haven't been

20 Jan 2009 19:27:12 UTC

Message 90142 in response to message 90141

(moderation:

)

Quote:

Quote:
I haven't been keeping up with this subject so I apologise if this has been asked before. Is there an ATI equivalent of NVidia's CUDA? If so, what, where and when? If not, why not? It must be in an FAQ somewhere but the internet is . . . big. Thanks.

Partially the reason there is a CUDA/NVidia one and not an ATI one is because NVidia put programmers on the BOINC code to help develop the CUDA application. The other video cards would have to be done on the side by the BOINC programmers who are already busy trying to get other things done.

Folding seems to have gotten it done, they are using both NVidia and ATI cards. Now I do not know if it is a CUDA app for the ATI, I do not have one of the cards.

Jord

Joined: 26 Jan 05

Posts: 2952

Credit: 5893653

RAC: 0

RE: Folding seems to have

20 Jan 2009 19:49:34 UTC

Message 90143 in response to message 90142

(moderation:

)

Quote:

Folding seems to have gotten it done, they are using both NVidia and ATI cards. Now I do not know if it is a CUDA app for the ATI, I do not have one of the cards.

Folding@Home made their own applications for ATI, they do not have official support from ATI, more like one ATI developer who likes to help them out in his free time.

BOINC and plenty of its projects got an offer from ATI to give hardware, but they couldn't give support in porting over the applications and ironing out the bugs. Projects would have to do that themselves. Nvidia did offer to port over the applications and help out with ironing out the bugs, so that's why they were chosen first.

Bikeman (Heinz-...

Moderator

Joined: 28 Aug 06

Posts: 3522

Credit: 837874016

RAC: 932941

I think NVidia is really

21 Jan 2009 13:17:14 UTC

Message 90144

(moderation:

)

I think NVidia is really seriously trying to get a foot in the door when it comes to High Performance Computing. BOINC projects would just be showcases and testing grounds for their technology aimed at research in industry and academia. AMD might miss a chance there.

CU
Bikeman

tullio

Joined: 22 Jan 05

Posts: 2118

Credit: 61407735

RAC: 0

The "Extreme CUDA

21 Jan 2009 16:10:08 UTC

Message 90145

(moderation:

)

The "Extreme CUDA processing?" thread appears in most BOINC projects with exactly the same wording. You may be right, in my Einstein mailbox I found an advertisement for a Tesla minisupercomputer produced by nVidia.
Tullio

David Saum

Joined: 9 Feb 05

Posts: 8

Credit: 1545618

RAC: 0

RE: RE: I too would bring

25 Jan 2009 20:18:04 UTC

Message 90146 in response to message 90135

(moderation:

)

Quote:

Quote:
I too would bring them over to Einstein if their was a way to do it while keeping the cpu's and the video cards crunching full time. I do not want a cpu crunching at only 10% just so I can crunch with a video card.

The problem at the moment is that the task has to be updated in videoRAM and this doesn't happen by magic, the CPU is used for that. So the present solution is to exclude one of your CPUs to only work with the GPU. That CPU or core is only partially used, the rest of it is free.

This is done because when the task in videoRAM has to be written back to the disk, exchanged for another task, you want this to happen as quickly as possible. The GPU can only run at full whack or not at all, there's no way yet to gradually increase its use. So if the GPU has to wait for the CPU to free up enough resources that it can send the data from the disk to the videoRAM, there's a good chance the task will time out and exit with an error.

Not a problem AFAIK. I run GPUGRID(cuda) and SETI(cuda) and FOLDING@HOME(gpu) all together on my AMD dual core(x2) 4200+ computers with 9600GSO gpus, plus I am attached to several cpu applications including EINSTEIN, COSMOLOGY and QMC on these computers at the same time, and BOINC always selects 2 of these cpu applications to run along with a GPUGRID or SETI gpu application.

The BOINC Tasks window displays the fraction of cpu used for the gpu applications, and it is generally about 0.05, so the remaining 0.95 fraction of the cpu core seems to be available for use by a cpu application. Since FOLDING is not a BOINC project, it always grabs 50% of the gpu time and some of a cpu core, but both BOINC and FOLDING seem to have no problem running together.

Note that SETI(cuda) hangs up on some wu, so it requires a lot of babysitting, while GPUGRID(cuda) seems to be a more mature application, similar to FOLDING@HOME(gpu). I hope that EINSTEIN(cuda), if it appears, does not have the kind of rough edges that SETI(cuda) does. If you want to try a BOINC CUDA application, I recommend you start with GPUGRID(cuda).

In summary, there may be some overhead penalty in running gpu and cpu applications on the same cpu core, but it does not seem to be large, and the gpu does not capture a cpu core, IMHO.

Paul D. Buck

Joined: 17 Jan 05

Posts: 754

Credit: 5385205

RAC: 0

The number displayed on the

27 Jan 2009 6:01:09 UTC

Message 90147

(moderation:

)

The number displayed on the BOINC Manager window is an estimate and it ranges from 0.02 to 0.05 and it not at all correct.

You have to look in the Task Manager or system monitor to see the actual CPU usage by the applications. SaH on my Q9300 was 1 to 3% which GPU Grid is running at a solid 22% ... and this is of the system as a whole ... meaning that, in essence, one whole CPU core is taken up managing the GPU processing ...

On my i7 it is a little better in that each GPU core (I have three running) takes 7% meaning I lose 21% of CPU power to run three GPU Grid tasks ... if this could be lowered the total system throughput could be increased ...

A new GPU Grid application has been promised though the best of them that I have seen had a load on the i7 of 3-4% which is also fairly high ... several of us have made suggestions as to how this might be changed, but, still waiting ...

David Saum

Joined: 9 Feb 05

Posts: 8

Credit: 1545618

RAC: 0

RE: The number displayed on

27 Jan 2009 15:14:50 UTC

Message 90148 in response to message 90147

(moderation:

)

Quote:

The number displayed on the BOINC Manager window is an estimate and it ranges from 0.02 to 0.05 and it not at all correct.

You have to look in the Task Manager or system monitor to see the actual CPU usage by the applications. SaH on my Q9300 was 1 to 3% which GPU Grid is running at a solid 22% ... and this is of the system as a whole ... meaning that, in essence, one whole CPU core is taken up managing the GPU processing ...

On my i7 it is a little better in that each GPU core (I have three running) takes 7% meaning I lose 21% of CPU power to run three GPU Grid tasks ... if this could be lowered the total system throughput could be increased ...

A new GPU Grid application has been promised though the best of them that I have seen had a load on the i7 of 3-4% which is also fairly high ... several of us have made suggestions as to how this might be changed, but, still waiting ...

Thanks, Paul. You are quite right. My task manager shows GPU Grid hogging almost 40% of one 4200+ core. Too bad it could not be as efficient as FAH which runs closer to 1%.

Paul D. Buck

Joined: 17 Jan 05

Posts: 754

Credit: 5385205

RAC: 0

RE: RE: The number

28 Jan 2009 19:50:44 UTC

Message 90149 in response to message 90148

(moderation:

)

Quote:

Quote:
The number displayed on the BOINC Manager window is an estimate and it ranges from 0.02 to 0.05 and it not at all correct.

You have to look in the Task Manager or system monitor to see the actual CPU usage by the applications. SaH on my Q9300 was 1 to 3% which GPU Grid is running at a solid 22% ... and this is of the system as a whole ... meaning that, in essence, one whole CPU core is taken up managing the GPU processing ...

On my i7 it is a little better in that each GPU core (I have three running) takes 7% meaning I lose 21% of CPU power to run three GPU Grid tasks ... if this could be lowered the total system throughput could be increased ...

A new GPU Grid application has been promised though the best of them that I have seen had a load on the i7 of 3-4% which is also fairly high ... several of us have made suggestions as to how this might be changed, but, still waiting ...

Thanks, Paul. You are quite right. My task manager shows GPU Grid hogging almost 40% of one 4200+ core. Too bad it could not be as efficient as FAH which runs closer to 1%.

Well, version 6.62 is running at under 1% on all systems. I have not gotten to the GPU Grid boards yet so not sure if they made the switch though all the tasks I have gotten on the i7 are now 6.62 tasks. The change of the application means that though the CPU usage is where I want it some are unhappy because the GPU efficiency may have been lowered by as much as 17% ...

The debate was raging last night ... :)

Now instead of hours on the GPU Grid tasks is now down to about 10 min per task ... much more tolerable ...

Gerry Rough

Joined: 1 Mar 05

Posts: 102

Credit: 1847066

RAC: 0

RE: Well, version 6.62 is

30 Jan 2009 16:14:19 UTC

Message 90150 in response to message 90149

(moderation:

)

Quote:

Well, version 6.62 is running at under 1% on all systems. I have not gotten to the GPU Grid boards yet so not sure if they made the switch though all the tasks I have gotten on the i7 are now 6.62 tasks. The change of the application means that though the CPU usage is where I want it some are unhappy because the GPU efficiency may have been lowered by as much as 17% ...

The debate was raging last night ... :)

Now instead of hours on the GPU Grid tasks is now down to about 10 min per task ... much more tolerable ...

Even though the GPU task is running at less than 1%, is there any significant dropoff in CPU efficiency in terms of increased time to finish the CPU task? I would think we are talking maybe 20% slower(?)

Additionally, would an icrease in L1 or L2 cache help with the overload problem earlier discussed? I have a pair of Q6600 quads with some nifty cache numbers, so I wonder if those who have lots of L2 cache might be at an advantage here.

(Click for detailed stats)

Extreme CUDA processing?

Forums › Cruncher's Corner

Comment viewing options

Forums › Cruncher's Corner