Inefficient science app for MacOS X

DanNeely

Joined: 4 Sep 05

Posts: 1364

Credit: 3562358667

RAC: 0

the eah staff hand optimized

2 May 2006 23:54:19 UTC

Message 27994

The E@H staff got a ~2x speedup over the compiled PPC code by hand-optimizing; Akos has reached ~7x on x86. Maybe you should think about buying him a Mac. :) Seriously though, he wasn't able to find his way around the x86 Linux app, so I strongly suspect PPC asm will be far worse.

Elphidieus

Joined: 20 Feb 05

Posts: 245

Credit: 20603702

RAC: 0

RE: the eah staff hand

3 May 2006 5:30:08 UTC

Message 27995 in response to message 27994

Quote:

The E@H staff got a ~2x speedup over the compiled PPC code by hand-optimizing; Akos has reached ~7x on x86. Maybe you should think about buying him a Mac. :) Seriously though, he wasn't able to find his way around the x86 Linux app, so I strongly suspect PPC asm will be far worse.

Seriously, I'd think more about having him port his optimization to the x86-based Macs instead....

Akos Fekete

Joined: 13 Nov 05

Posts: 561

Credit: 4527270

RAC: 0

RE: the eah staff hand

3 May 2006 7:34:21 UTC

Message 27996 in response to message 27994

Quote:

The E@H staff got a ~2x speedup over the compiled PPC code by hand-optimizing; Akos has reached ~7x on x86. Maybe you should think about buying him a Mac. :) Seriously though, he wasn't able to find his way around the x86 Linux app, so I strongly suspect PPC asm will be far worse.

:) These processor-based systems aren't too different, so I'm sure the optimization is possible on any of them. I modified the Windows app because I use Win-x86 systems and it was the easiest way for me to have fun crunching. Of course, I have some other ideas (e.g. a cheap PCI-bus accelerator), but they would need lots of time and I don't have it. Pure software development gives "fun" much faster, especially on Win-x86. Sorry...

Let's crunch! ;-)

Elphidieus

Joined: 20 Feb 05

Posts: 245

Credit: 20603702

RAC: 0

RE: RE: the eah staff

3 May 2006 8:26:52 UTC

Message 27997 in response to message 27996

Quote:

Quote:
The E@H staff got a ~2x speedup over the compiled PPC code by hand-optimizing; Akos has reached ~7x on x86. Maybe you should think about buying him a Mac. :) Seriously though, he wasn't able to find his way around the x86 Linux app, so I strongly suspect PPC asm will be far worse.
:) These processor-based systems aren't too different, so I'm sure the optimization is possible on any of them. I modified the Windows app because I use Win-x86 systems and it was the easiest way for me to have fun crunching. Of course, I have some other ideas (e.g. a cheap PCI-bus accelerator), but they would need lots of time and I don't have it. Pure software development gives "fun" much faster, especially on Win-x86. Sorry...

Let's crunch! ;-)

How I wish there were dedicated Mac crunchers here at Einstein who possessed your prodigious wisdom, as there were with SETI@home at MacNN....

With the exception of Bernd Machenschalk, of course... I'm sure he's doing everything he can for the Mac community....

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4330

Credit: 251470477

RAC: 36230

Despite the problems I posted

3 May 2006 13:59:20 UTC

Message 27998

Despite the problems, I posted the new MacOS App for public beta testing, mainly to get some feedback. See this thread.

bloed_brot

Joined: 5 Apr 05

Posts: 70

Credit: 91124558

RAC: 0

RE: (snip)We've been

3 May 2006 15:11:10 UTC

Message 27999 in response to message 27986

Quote:

(snip)We've been working with experienced programmers from Apple to make the AltiVec version as fast as we can, and of all official Apps it makes the most of every clock cycle of a CPU.(snip)
BM

Thank you, Bernd, for keeping up the communication as an official member.

May I ask a question? For a user like me, it is difficult to understand why someone like Akos is not granted a look at the source code to improve it, given what he has managed to pull out of the hat. Could you elaborate a little on why the project board chooses to release speed improvements that are a country mile off Akos' work?

Is the project team lacking the technical expertise to follow Akos' methods? Is the project funding tied to a requirement that the code MUST NOT leak?

Please understand that I just find it VERY DIFFICULT to make sense of a decision (?) not to make the project more efficient by a factor of 7 and instead to stick with an optimisation of 2 at best.

I honestly do not want to assign blame here, but if someone comes forward and offers me the chance of getting my work done seven times more quickly, then I don't turn around and say no. That's why I'm confused. That's why I'm asking.

Regards
Soenke

:
your thoughts - the ways :: the knowledge - your space
:

Martin P.

Joined: 17 Feb 05

Posts: 162

Credit: 40156217

RAC: 0

RE: RE: (snip)We've been

9 May 2006 20:08:33 UTC

Message 28000 in response to message 27999

Quote:

Quote:
(snip)We've been working with experienced programmers from Apple to make the AltiVec version as fast as we can, and of all official Apps it makes the most of every clock cycle of a CPU.(snip)
BM

Thank you, Bernd, for keeping up the communication as an official member.

May I ask a question? For a user like me, it is difficult to understand why someone like Akos is not granted a look at the source code to improve it, given what he has managed to pull out of the hat. Could you elaborate a little on why the project board chooses to release speed improvements that are a country mile off Akos' work?

Is the project team lacking the technical expertise to follow Akos' methods? Is the project funding tied to a requirement that the code MUST NOT leak?

Please understand that I just find it VERY DIFFICULT to make sense of a decision (?) not to make the project more efficient by a factor of 7 and instead to stick with an optimisation of 2 at best.

I honestly do not want to assign blame here, but if someone comes forward and offers me the chance of getting my work done seven times more quickly, then I don't turn around and say no. That's why I'm confused. That's why I'm asking.

Regards
Soenke

Hi Soenke,

Unfortunately, I cannot find the post where someone offered the kind of help that Akos gave for x86. Could you point me to that post, please?

Martin P.

Joined: 17 Feb 05

Posts: 162

Credit: 40156217

RAC: 0

RE: My current vector code

23 May 2006 7:43:03 UTC

Message 28001 in response to message 27990

Quote:

My current vector code has either a minor bug or a numerical instability that shows up as occasional invalid results from the 4.55 Linux App. I want to have this fixed before I do anything else with that code, like using it for other platforms.

BM

Hi Bernd,

Any progress with the new science app?

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4330

Credit: 251470477

RAC: 36230

On Intel machines we

23 May 2006 14:10:54 UTC

Message 28002

On Intel machines we currently use assembly code that avoids this problem. I'm afraid I won't be able to do anything about these invalid results from PPC code for the last remaining Workunits of the S4 run (estimated at one month), but this shouldn't occur in the next run anymore. Code that includes measures to avoid these invalid results with the current workunits wouldn't run faster than the AltiVec code that's in the current official PPC Mac App.

Elphidieus

Joined: 20 Feb 05

Posts: 245

Credit: 20603702

RAC: 0

Will there be room for

23 May 2006 14:21:13 UTC

Message 28003

Will there be room for improvement on the PowerPC-based Mac App for the next run...?
