Bikeman wrote: There are currently two problems with the LAT search that I'm aware of … and the second is a cross-platform validation problem between Macs and the other platforms.
Any news on that? I was quite excited this morning to see that my Mac had finally had a LAT result validated, but it turned out it had sided with another Mac against a Windows machine.
http://einsteinathome.org/workunit/103074788
NG
We are still tuning the validator, and I am working on a change to the application that should make it a little bit faster, with the side effect of also narrowing the differences between platforms.
BM
Would it help to make sure that all copies of a particular workunit are sent only to computers with the same operating system?
In BOINC this concept is called "homogeneous redundancy". A project can be configured to use it, but it then applies to all applications of the project.
BM
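(For anyone curious what that looks like in practice: the scheduler assigns each host to an equivalence class, e.g. by operating system, and only sends replicas of a workunit to hosts in the class already committed to it. A toy C++ sketch of the idea follows; the names and the grouping are made up for illustration, and this is not BOINC's actual code.)

    #include <string>

    enum class HrClass { Windows, Linux, Darwin, Other };

    // Map a host's OS name to its HR class (the grouping is illustrative).
    HrClass hr_class_of(const std::string& os_name) {
        if (os_name.find("Windows") != std::string::npos) return HrClass::Windows;
        if (os_name.find("Linux")   != std::string::npos) return HrClass::Linux;
        if (os_name.find("Darwin")  != std::string::npos) return HrClass::Darwin;
        return HrClass::Other;
    }

    // A workunit that already has results from one class is only
    // replicated to hosts of the same class, so results stay comparable.
    bool may_send(HrClass committed_class, const std::string& host_os) {
        return hr_class_of(host_os) == committed_class;
    }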
Yes, it would help to reduce the number of validation errors. But that way, different machines would produce different results that cannot be compared at all. And we still wouldn't know which OS produces wrong results. So there is no other way than to find the cause of the validation errors. But that needs some time, too.
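(Some background on why bit-for-bit comparison fails here: different compilers and FPUs may round intermediate floating-point values differently, so two correct results can disagree in the last bits. Validators therefore typically compare with a tolerance. A minimal C++ sketch below; the function name and tolerance value are illustrative, not the project's actual ones.)

    #include <algorithm>
    #include <cmath>

    // Compare two floating-point result values with a relative tolerance,
    // as a cross-platform validator must (bitwise equality is too strict
    // when compilers and FPUs round intermediates differently).
    // The tolerance here is an arbitrary example, not the project's value.
    bool values_agree(double a, double b, double rel_tol = 1e-8) {
        double scale = std::max({std::fabs(a), std::fabs(b), 1.0});
        return std::fabs(a - b) <= rel_tol * scale;
    }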
This should now be done with the new app version 23, shipped today.
BM
I validated a second gamma-ray unit on my Linux box against a Windows 7 machine, using version 22. Another one is running on version 23.
Tullio
I have had the 0.22 version running on a number of Linux hosts as well as on a single host running Win XP. The performance on Windows has been quite bad compared to that on Linux for the same hardware.
I have just finished converting all the above machines to the 0.23 version. The first results with the new version are now in, and the Linux hosts are showing a speedup of around 15% or so. The time on one quad core host has dropped from around 27K secs to less than 23K secs.
The improvement on Windows seems to be even better. That machine is a dual core and it was completing tasks in around 37K secs. It is now completing results in around 25K secs with the 0.23 version. Linux hosts of pretty much identical specs had been completing tasks in around 26K secs with the old version. One of these is on track to complete its first 0.23 result in around 22K secs, so there is still an advantage for Linux, although very much reduced from what it was.
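(Working those round figures through: the quad core Linux host went from ~27K to ~23K secs, a speedup of 1 - 23/27 ≈ 15%; the Windows dual core went from ~37K to ~25K secs, 1 - 25/37 ≈ 32%.)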
Thanks very much for the very useful speedup. I hope you also get the desired improvement in validation.
Cheers,
Gary.
Welcome back, Gary!
I suspect that most of your machines are powered by AMD CPUs? I did see larger speedups on our Intel machines here, as well as in the DB, in the runtime averaged over all hosts.
Anyway, preparing for "production" we increased the deadline (10 days) and are ramping up the FGRP1 share of the project (i.e. sending out more FGRP work).
BM
Indeed, welcome back, Gary.
I'm only running Windows on Intel here, but I'd noticed that very significant speedup with 0.23 - I can pull out figures if it helps, but I'd guess you have plenty available.
I'm still seeing FGRP1 taking ~30% longer than S6Bucket on all machines (I'm running SSE2 minimum, mainly Core2 or above, so can take advantage of the optimisation).
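(For anyone wondering what the SSE2 optimisation buys: SSE2 lets the application operate on two double-precision values per instruction. A toy illustration using the standard intrinsics follows; this is not the search application's actual inner loop.)

    #include <emmintrin.h>  // SSE2 intrinsics

    // Toy illustration of SSE2 vectorisation: add two arrays of doubles,
    // processing two elements per instruction. Not the app's actual code.
    void add_arrays_sse2(const double* a, const double* b, double* out, int n) {
        int i = 0;
        for (; i + 2 <= n; i += 2) {
            __m128d va = _mm_loadu_pd(a + i);  // load two doubles
            __m128d vb = _mm_loadu_pd(b + i);
            _mm_storeu_pd(out + i, _mm_add_pd(va, vb));
        }
        for (; i < n; ++i) out[i] = a[i] + b[i];  // scalar remainder
    }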
The extended deadline is welcome - 0.22 took over two days on my elderly P4 server, pushing it into EDF. Perhaps you might consider the credit ratio as part of the tuning process.
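(EDF = earliest deadline first: when a task's estimated remaining runtime leaves too little slack before its deadline, the BOINC client stops round-robin scheduling and runs that task first. A simplified sketch of the trigger condition; the fields and threshold are illustrative, not the client's actual logic.)

    #include <ctime>

    // Simplified sketch of the EDF trigger: if a task's estimated
    // remaining runtime leaves too little slack before its deadline,
    // schedule it first. Illustrative only, not the BOINC client's code.
    struct Task {
        double remaining_cpu_secs;  // estimated work left
        time_t deadline;            // report deadline
    };

    bool needs_edf(const Task& t, time_t now, double safety = 0.9) {
        double slack = difftime(t.deadline, now);  // seconds to deadline
        return t.remaining_cpu_secs > slack * safety;
    }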