New Einstein@Home Radio Pulsar Search and NVIDIA GPU Code

Einstein@Home is beginning a new round of searching for radio pulsars in short-orbital-period binary systems.

This is accompanied by the release of a new application (called BRP3). The new application is particularly efficient on NVIDIA Graphics Processor Cards (up to a factor of 20 faster than the CPU-only application). In addition, when running on an NVIDIA GPU card, this new application makes very little use of the CPU (typically around 20% CPU use when the GPU is devoted to Einstein@Home).

The NVIDIA GPU application is initially available for Windows and Linux only. We hope to have a Macintosh version available soon. Due to limitations in the NVIDIA drivers, the Linux version still makes heavy use of the CPU. This will be fixed in Spring 2011, when a new version of the NVIDIA Driver is released. Many thanks to NVIDIA technical support for their assistance!

Because we have exhausted the backlog of data from the Arecibo Observatory, this new application is being shipped with data from the Parkes Multibeam Pulsar Survey (taken with the Parkes radio telescope in Australia). In the coming weeks we also expect to start using this new application on fresh Arecibo data taken with the latest 'Mock Spectrometer' back-end.

Questions, problems or bug reports related to this new application and search should be reported in this news item thread as a 'Comment'.

Bruce Allen
Director, Einstein@Home

Comments

Claggy
Joined: 29 Dec 06
Posts: 560
Credit: 2,557,091
RAC: 0

RE: We just installed

Quote:
We just installed Quadro 4000 cards in 4 8-core Mac Pros. Can't wait to see the GPU app and the data we can crunch. Hope the GPU app ships soon!

For BOINC to see CUDA GPUs on OS X you need to install the CUDA toolkit and CUDA driver (in that order);
I believe it's the CUDA 3.2 version that you need to install.

Claggy

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 3,985
Credit: 205,269,912
RAC: 35,347

RE: For Boinc to see Cuda

Quote:
For BOINC to see CUDA GPUs on OS X you need to install the CUDA toolkit and CUDA driver (in that order)

What do you need the CUDA Toolkit for?

BM


Claggy
Joined: 29 Dec 06
Posts: 560
Credit: 2,557,091
RAC: 0

RE: RE: For Boinc to see

Quote:
Quote:
For BOINC to see CUDA GPUs on OS X you need to install the CUDA toolkit and CUDA driver (in that order)

What do you need the CUDA Toolkit for?

BM


I was thinking of compatibility across different projects that have Mac CUDA apps
(Collatz's Mac CUDA app errors out without the toolkit).

Claggy

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 3,985
Credit: 205,269,912
RAC: 35,347

RE: collatz's MAC Cuda app

Quote:
Collatz's Mac CUDA app errors out without the toolkit

Too bad. They should distribute the libraries (cudart, cufft, whatever they use) with the application, like all other projects I know of do. It's legal as long as they distribute the EULA.txt with the libs.

BM


Torusoft Server
Joined: 2 Apr 11
Posts: 3
Credit: 510,257
RAC: 0

We just installed a Quadro

We just installed a Quadro 4000, the latest drivers, the CUDA drivers and the CUDA toolkit. Any word on whether Einstein@Home supports this card on OS X 10.6.7?

Chris

Albert Collver
Joined: 11 Apr 11
Posts: 1
Credit: 144,815
RAC: 0

I have a MacBook Pro with a

I have a MacBook Pro with a GeForce 9400M with 253 MB of RAM. I also have a Mac Mini with a GeForce 320M with 252 MB of RAM.

Both machines report the following error:

Wed Apr 13 13:51:05 2011 Einstein@Home Message from server: Your NVIDIA GPU has insufficient memory (need 300MB)

Anyone else receiving this message?

Jeroen
Joined: 25 Nov 05
Posts: 379
Credit: 738,562,205
RAC: 5,442

RE: I have a MacBook Pro

Quote:

I have a MacBook Pro with a GeForce 9400M with 253 MB of RAM. I also have a Mac Mini with a GeForce 320M with 252 MB of RAM.

Both machines report the following error:

Wed Apr 13 13:51:05 2011 Einstein@Home Message from server: Your NVIDIA GPU has insufficient memory (need 300MB)

Anyone else receiving this message?

The BRP3 GPU application requires at least 300 MB of GPU memory, and sometimes more. Unfortunately, your cards with 252-253 MB do not have enough memory to run the BRP3 GPU tasks.
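For illustration, the server-side cutoff described here amounts to a simple threshold check. This is a hypothetical sketch (the function name and structure are invented; only the 300 MB limit comes from this thread):

```python
# Hypothetical sketch of the scheduler's memory cutoff described above.
# Only the 300 MB threshold comes from this thread; the names are invented.
BRP3_MIN_GPU_MB = 300

def can_run_brp3(reported_gpu_mb: int) -> bool:
    """Return True if a GPU reporting this much memory may receive BRP3 tasks."""
    return reported_gpu_mb >= BRP3_MIN_GPU_MB

# The cards reported above fall just short of the cutoff:
print(can_run_brp3(253))  # GeForce 9400M -> False
print(can_run_brp3(252))  # GeForce 320M  -> False
```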

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 3,985
Credit: 205,269,912
RAC: 35,347

RE: We just installed a

Quote:
We just installed a Quadro 4000, the latest drivers, the CUDA drivers and the CUDA toolkit. Any word on whether Einstein@Home supports this card on OS X 10.6.7?

We don't support CUDA on 10.6.x until NVidia fixes the bug in the CUDA driver for that OS version (see here).

BM


induktio
Joined: 1 Oct 10
Posts: 15
Credit: 10,144,774
RAC: 0

RE: Another difference of

Quote:

Another difference of the Teslas is that they provide the full double-precision FP performance. Compared to the consumer cards this means:

Tesla: FP speed = 1/2 integer speed
GTX 580: FP speed = 1/8 integer speed

Not that you need it for Einstein, but if you are planning on using them for other projects as well...


I have to revisit this thread because I'm considering buying a GPU for this project. It won't be a Tesla card, though, because of the price tag. :P It's just very hard to compare the options because of the differences in architecture etc.

It seems both the Tesla and Quadro series cards have the full double-precision performance, while the GTX (consumer) cards are capped at 1/4 of the full speed. How about single precision, then: is it uncapped on all of these cards, and roughly comparable? And does Einstein@Home perform only single-precision calculations on GPUs? If so, the Quadro series cards would be much less useful.

Earlier it was said that "the 1.5GB version of the 580 can run four tasks at once", so one task would use at most 375MB of memory. For example, we could compare the Quadro 2000 (1 GB, 400 euros) and the Quadro 4000 (2 GB, 700 euros). The Quadro 2000 might be able to run only 2 tasks at once, while the 4000 would be guaranteed to run at least 5. In that sense the Quadro 2000 would be very poor value, am I right?
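The arithmetic in the post above can be checked directly. A quick sketch, assuming ~375 MB of GPU memory per task (derived from the "1.5 GB GTX 580 runs four tasks" remark) and the prices quoted in the post:

```python
# Rough cost-efficiency comparison using the numbers quoted in the thread.
# The ~375 MB per-task figure is inferred, not an official requirement.
PER_TASK_MB = 375

cards = {
    "Quadro 2000": {"mem_mb": 1024, "price_eur": 400},
    "Quadro 4000": {"mem_mb": 2048, "price_eur": 700},
}

for name, card in cards.items():
    tasks = card["mem_mb"] // PER_TASK_MB  # concurrent tasks that fit in memory
    print(f"{name}: {tasks} tasks, {card['price_eur'] / tasks:.0f} EUR/task")
# Quadro 2000: 2 tasks, 200 EUR/task
# Quadro 4000: 5 tasks, 140 EUR/task
```

So under these assumptions the Quadro 2000 does come out worse per concurrent task, as the post suggests.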

Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 879
Credit: 25,165,115
RAC: 0

RE: However, does

Quote:
However, does Einstein@Home perform only single precision calculations on GPUs?

We always strive to support all kinds of volunteer hardware and make use of it as well as we possibly can. Therefore we try hard to use only single precision, because all GPUs support it. Requiring double precision would severely reduce a) the number of usable GPUs and b) overall application performance. It might not be possible to stick to single precision at some point in the future, but for the time being it is sufficient.
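A quick way to see what single precision gives up (a sketch for illustration, not Einstein@Home code): rounding a value through IEEE-754 single precision keeps roughly 7 decimal digits, with a relative error bounded by 2**-23.

```python
import struct

def to_f32(x: float) -> float:
    # Round a Python float (IEEE-754 double) to single precision and back.
    return struct.unpack('f', struct.pack('f', x))[0]

x = 0.1
err = abs(to_f32(x) - x) / x
print(to_f32(x))       # 0.10000000149011612
print(err < 2 ** -23)  # True: relative error stays below the f32 epsilon bound
```

For many signal-processing pipelines that error is negligible, which is why a single-precision-only GPU can still do useful science here.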

Oliver

 

Einstein@Home Project

Mikie Tim T
Joined: 22 Jan 05
Posts: 105
Credit: 231,224,461
RAC: 218,799

RE: We always strive to

Quote:
We always strive to support all kinds of volunteer hardware and make use of it as well as we possibly can.

Now if we could just get an OpenCL or CAL app for ATI hardware, we would be getting a big boost in crunching! I'd throw my card at Einstein if I could, but right now, it's just doing conjecture crunching.

Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 879
Credit: 25,165,115
RAC: 0

RE: Now if we could just

Quote:
Now if we could just get an OpenCL or CAL app for ATI hardware, we would be getting a big boost in crunching!

Work in progress...

 

Einstein@Home Project

telegd
Joined: 17 Apr 07
Posts: 91
Credit: 10,212,522
RAC: 0

RE: Therefore we try hard

Quote:
Therefore we try hard to use only single precision, because all GPUs support it.


I very much appreciate being able to use a single-precision card at the moment.

Can I ask what might prompt the move to double-precision at E@H?

Thanks!

Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 879
Credit: 25,165,115
RAC: 0

RE: Can I ask what might

Quote:

Can I ask what might prompt the move to double-precision at E@H?

Future search codes might require double precision. But so far single precision is sufficient, and there are no new algorithms/codes on the horizon that would change that.

Oliver

 

Einstein@Home Project

Old man
Joined: 28 Mar 10
Posts: 4
Credit: 4,355,078
RAC: 0

RE: RE: Now if we could

Quote:
Quote:
Now if we could just get an OpenCL or CAL app for ATI hardware, we would be getting a big boost in crunching!

Work in progress...

Hi. When will the program be published?

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 3,985
Credit: 205,269,912
RAC: 35,347

RE: RE: RE: Now if we

Quote:
Quote:
Quote:
Now if we could just get an OpenCL or CAL app for ATI hardware, we would be getting a big boost in crunching!

Work in progress...

Hi. When will the program be published?

We'll publish an OpenCL app as soon as it's finished and tested to an extent that we can tell it does more good than harm.

Oliver is working full time on the application, and he is making good progress; however, a few problems still need to be solved. We are also actively working with the BOINC developers to get an OpenCL-aware BOINC client out the door, which doesn't exist yet either.

It seems that E@H is once again on the bleeding edge of development.

BM


Stranger7777
Joined: 17 Mar 05
Posts: 436
Credit: 375,580,893
RAC: 149,424

RE: We always strive to

Quote:
We always strive to support all kinds of volunteer hardware and make use of it as well as we possibly can.
Oliver

Then why don't you build applications for older machines?
Maybe they don't crunch very fast, but there is an enormous number of them around here and all over the world.

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 3,985
Credit: 205,269,912
RAC: 35,347

RE: Than why don't you

Quote:
Then why don't you build applications for older machines?
Maybe they don't crunch very fast, but there is an enormous number of them around here and all over the world.

What machines are you referring to? The GW application supports Linux and Windows machines back to the Pentium(TM) II, as well as PPC Macs that haven't been sold for six years or so.

The relation between the effort required (maintenance time, electricity) and the benefit compared to modern machines gets worse with every year a computer ages.

The source code is freely available, as is a build script that is designed to work on most OSes. If you want to support a machine that we don't have a stock app for, you are very welcome to 'roll your own'.

BM


Stranger7777
Joined: 17 Mar 05
Posts: 436
Credit: 375,580,893
RAC: 149,424

RE: RE: Than why don't

Quote:
Quote:
Then why don't you build applications for older machines?
Maybe they don't crunch very fast, but there is an enormous number of them around here and all over the world.

What machines are you referring to? The GW application supports Linux and Windows machines back to the Pentium(TM) II, as well as PPC Macs that haven't been sold for six years or so.

The relation between the effort required (maintenance time, electricity) and the benefit compared to modern machines gets worse with every year a computer ages.

The source code is freely available, as is a build script that is designed to work on most OSes. If you want to support a machine that we don't have a stock app for, you are very welcome to 'roll your own'.

BM


Yes, I'm talking about old P2s, P3s and P4s. Yes, the application is able to start there. But it consumes so much RAM that it becomes impossible to run E@H as a background task on a machine that otherwise still works well. The reasons to use these machines are:
1) They are still doing well and doing their job.
2) Any new machine bought to replace an old one will cost much more than it can save through the difference in electricity bills.
3) If I buy a new machine, I will be supporting manufacturers in consuming ever more of the Earth's far-from-endless resources (why are new computer cases not compatible with older ones? Why can't old power supplies be used with new motherboards with lower power consumption? etc.)

So there's no need to support a different app type, but there is a reason for the app to check how much memory it may use in its current run.

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 3,985
Credit: 205,269,912
RAC: 35,347

RE: So, there's no need to

Quote:
So there's no need to support a different app type, but there is a reason for the app to check how much memory it may use in its current run.

We're not wasting any memory in the app; in fact we try pretty hard to keep the memory requirements as small as possible. From the files that you downloaded (for the GW search) we pick only the few bins that we need. That requires a lot of I/O operations, but it keeps the memory requirements low.
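The bin-picking idea can be illustrated with a short sketch. The file layout here (fixed-width float32 bins) and all names are assumptions for illustration; this is not the actual GW data format:

```python
import struct

BIN_SIZE = 4  # bytes per bin, assuming one little-endian float32 value per bin

def read_bins(path, first_bin, count):
    """Read `count` consecutive bins starting at `first_bin`,
    seeking past everything else instead of loading the whole file."""
    with open(path, 'rb') as f:
        f.seek(first_bin * BIN_SIZE)   # skip unneeded bins at zero RAM cost
        data = f.read(count * BIN_SIZE)
        return list(struct.unpack(f'<{count}f', data))

# Demo: write 1000 bins, then pull out only bins 500..509.
with open('bins.dat', 'wb') as f:
    for i in range(1000):
        f.write(struct.pack('<f', float(i)))

print(read_bins('bins.dat', 500, 10))  # [500.0, 501.0, ... 509.0]
```

The trade-off Bernd describes is visible here: each slice costs a seek and a small read, but peak memory is bounded by the slice size, not the file size.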

The S6Bucket tasks should take around 100 MB. If your old machines don't have that much and you don't want to spend swap space on it, then maybe there's some other BOINC project that suits your machines better.

BM


Phil
Joined: 24 Feb 05
Posts: 176
Credit: 1,817,881
RAC: 0

RE: RE: We always strive

Quote:
Quote:
We always strive to support all kinds of volunteer hardware and make use of it as well as we possibly can.
Oliver

Then why don't you build applications for older machines?
Maybe they don't crunch very fast, but there is an enormous number of them around here and all over the world.

I found I was wingman to this machine recently, which rather amused me.

Stranger7777
Joined: 17 Mar 05
Posts: 436
Credit: 375,580,893
RAC: 149,424

Oh, I see, that new S6 search

Oh, I see that the new S6 search consumes less memory than the previous one. That again makes it possible to run it on those old machines, at least the ones with more than 256 MB of RAM. Thank you!!! And please try your best to keep memory consumption as low as possible.
BTW, using a swap file there is not useful because of the terrible speed of the old hard drives in those machines.

Stranger7777
Joined: 17 Mar 05
Posts: 436
Credit: 375,580,893
RAC: 149,424

RE: RE: RE: We always

Quote:
Quote:
Quote:
We always strive to support all kinds of volunteer hardware and make use of it as well as we possibly can.
Oliver

Then why don't you build applications for older machines?
Maybe they don't crunch very fast, but there is an enormous number of them around here and all over the world.

I found I was wingman to this machine recently which rather amused me.

But your wingman fails all the tasks it downloads, primarily because of a lack of memory.

Ver Greeneyes
Joined: 26 Mar 09
Posts: 140
Credit: 9,562,235
RAC: 0

RE: We're not wasting any

Quote:
We're not wasting any memory in the App, in fact we try pretty hard to keep memory requirements as small as possible. From the files that you downloaded (for the GW search) we're picking only the few bins that we need. Requires a lot of I/O operations, but keeps memory requirements low.


I would actually prefer to trade some RAM for reduced disk I/O (seeing as I have 8 GB of RAM and rarely use all of it). Would it be possible to add this as an 'Einstein@Home preference' on the website, or would that require significant refactoring of the application? I should mention that I'm talking about system RAM here, not GPU RAM.

tullio
Joined: 22 Jan 05
Posts: 2,049
Credit: 41,464,911
RAC: 19,582

I have 8 GB too on my 32-bit

I have 8 GB too on my 32-bit Linux system with PAE, but most of it is used as a disk cache, which already reduces disk I/O.
Tullio