Optomized S5 SSE3

Akos Fekete
Akos Fekete
Joined: 13 Nov 05
Posts: 561
Credit: 4527270
RAC: 0

RE: If you check the Read

Message 38956 in response to message 38954

Quote:
If you check the Read Only thread you will see that Akos has released S5S0003.dat as a "stable" patch. Note the change from "T" to "S" which I assume means "Test" is now regarded as "Stable". It's not immediately clear if the "S" version is identical to the earlier "T" version or whether there were some additional tweaks. My guess is that it should be identical so if you have the "T" version then just keep running it until Akos releases S5T0005.dat :).

Oh, yes.
T -> Test version
S -> Stable version

S5S0003 has only one difference from S5T0003. It write the 'STABLE' to the stderr not the 'TEST' label.

The next test version will be S5T0301.dat. I think it's more logical...
Try to find it why... :-)

Ulrich Metzner
Ulrich Metzner
Joined: 22 Jan 05
Posts: 113
Credit: 963370
RAC: 0

Any chance for optimizations

Any chance for optimizations for other instruction sets, e. g. 3Dnow?
My old Athlon isn't capable of SSE but was very fast with the latest D41.15 ;)

Aloha, Uli

Akos Fekete
Akos Fekete
Joined: 13 Nov 05
Posts: 561
Credit: 4527270
RAC: 0

RE: Any chance for

Message 38958 in response to message 38957

Quote:
Any chance for optimizations for other instruction sets, e. g. 3Dnow?
My old Athlon isn't capable of SSE but was very fast with the latest D41.15 ;)

The optimization of the current app is very difficult, because the validator works with a very low tolerance. The 3DNow! hot loop give a bit different results so they all would be invalid. I would like to do the fastest app at moment. I think it means an about 10-15% faster application than the official, because the official app uses optimized routines too.

Pepperammi
Pepperammi
Joined: 20 Feb 05
Posts: 131
Credit: 437943
RAC: 0

RE: Oh, yes. T -> Test

Message 38959 in response to message 38956

Quote:

Oh, yes.
T -> Test version
S -> Stable version

S5S0003 has only one difference from S5T0003. It write the 'STABLE' to the stderr not the 'TEST' label.

The next test version will be S5T0301.dat. I think it's more logical...
Try to find it why... :-)


@akosf

Soon be returning another S5T0000 unit and yet another a few hours after. if they also come back as sucessfull and valid would you like me to contunue with the S5T0000 for a while longer or update to your 'stable' S5S0003. Or one the others? i know you want to be thorough with them all so i dont mind any.

Side note, seeing strange (BUT GOOD) behavior on HT machine with S5T0001. Basically looks to be as quick or slightly quicker doing two units than it was doing one on its own (no other projects) with standard app. Know more when they're finished in few hours

Akos Fekete
Akos Fekete
Joined: 13 Nov 05
Posts: 561
Credit: 4527270
RAC: 0

S5T0301 - eliminated

Message 38960 in response to message 38905

S5T0301

- eliminated double jumps
- reduced amount of FPU macro ops
- removed double loads on general purpose registers

- SSE3 truncation

So, use this patch only on SSE3 CPUs!
I think it will give the biggest speed-up ( +5% or more ).

_heinz
_heinz
Joined: 4 Jan 06
Posts: 79
Credit: 130476
RAC: 0

Pentium4 2,66GHz S5S0003 the

Pentium4 2,66GHz
S5S0003 the first hybrid result is ready, 50% with S5T0004 the rest with S5S0003
hybrid
other results will follow soon
britta

Yin Gang
Yin Gang
Joined: 23 Feb 05
Posts: 52
Credit: 120187750
RAC: 0

@Akos When I do patching

@Akos

When I do patching in cmd.exe, the patcher.com prompts "Invalid keyboard code specified" in the first line, is it OK?

Best regards,
Yin Gang

Welcome To Team China!

M. Schmitt
M. Schmitt
Joined: 27 Jun 05
Posts: 478
Credit: 15872262
RAC: 0

I got 2 valid results with

I got 2 valid results with long WUs and S5T003.dat so far.

34581346
34581722

A few minutes ago I switched to S5T0301 and in about one hour I will get 2 results with mixed S5T003/S5T0301.

Progress is obviously faster with S5T0301, while S5T003 was about 2% faster, but there is no compareble WU. So the 2% are the difference to to one of the fist S5-test WUs, which got a much lower credit by the server.

cu,
Michael

Pepperammi
Pepperammi
Joined: 20 Feb 05
Posts: 131
Credit: 437943
RAC: 0

first to return so can't tell

first to return so can't tell if valid yet. Had no probs.
Done with S5T0000
http://einsteinathome.org/task/34776031

Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5872
Credit: 117695565815
RAC: 35084267

@ Akos, Could you please

@ Akos,

Could you please have a look at this post and the problem it points to and let us know if the idea of restricting the new releases to a smaller group of testers is workable for your purposes. In this case your latest release is restricted to SSE3 capable machines so hopefully that will be a smaller potential audience anyway.

Thanks, and thanks very much for your work.

Cheers,
Gary.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.