Optimized S5 SSE3

Pepperammi
Joined: 20 Feb 05
Posts: 131
Credit: 437943
RAC: 0

RE: And Crunch3r dont

Message 39006 in response to message 39004

Quote:

And Crunch3r doesn't offer the newest one.


Probably because he can't download them from akosf in the first place. You don't have to use the very latest. They all need thorough testing :)

Athlonheizer
Joined: 3 Jun 06
Posts: 33
Credit: 513937
RAC: 0

Hybrid WU 9932852 0004 +

Hybrid WU 9932852 0004 + 0003
Hybrid WU 9932850 0003 + 0004

Both: Checked, but no consensus yet

Athlon

Stay tuned and keep crunching

Kratylos
Joined: 23 Nov 05
Posts: 28
Credit: 1669914
RAC: 0

Patch: S5T0301 Valid pure

Patch: S5T0301

Valid pure WU: 9883775

Pending WU: 9883776

Akos Fekete
Joined: 13 Nov 05
Posts: 561
Credit: 4527270
RAC: 0

S5T0304.dat - eliminated

Message 39009 in response to message 38988

S5T0304.dat

- eliminated double jumps
- reduced amount of FPU macro ops
- removed double loads on general purpose registers

- better SSE register usage

It has to work on SSE CPUs...

MRAO
Joined: 7 May 05
Posts: 33
Credit: 15770746
RAC: 0

RE: S5T0304.dat -

Message 39010 in response to message 39009

Quote:

S5T0304.dat

- eliminated double jumps
- reduced amount of FPU macro ops
- removed double loads on general purpose registers

- better SSE register usage

It has to work on SSE CPUs...


Akos, does this one work on *any* SSE CPU, and not just SSE3? Mike
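For anyone unsure whether a host counts as "SSE" or "SSE3", here is a minimal sketch of a check. It is not something Akos posted. It assumes a Linux host, where /proc/cpuinfo reports SSE as `sse` and SSE3 as `pni`; the function name `simd_support` is made up for illustration.

```python
def simd_support(cpuinfo_text):
    """Report SSE and SSE3 support from the text of /proc/cpuinfo.

    Assumption: Linux-style output, where the "flags" line lists CPU
    features and SSE3 appears under its old name "pni".
    """
    for line in cpuinfo_text.splitlines():
        if line.startswith("flags"):
            # Everything after the first colon is a space-separated flag list.
            flags = set(line.split(":", 1)[1].split())
            return {"sse": "sse" in flags, "sse3": "pni" in flags}
    # No flags line found: report nothing as supported.
    return {"sse": False, "sse3": False}


def local_simd_support(path="/proc/cpuinfo"):
    """Convenience wrapper that reads the file on the local machine."""
    with open(path) as handle:
        return simd_support(handle.read())
```

On a Linux box, `local_simd_support()` returning `{"sse": True, "sse3": False}` would point to the plain SSE patch rather than an SSE3-only one. Note that `ssse3` is a separate, later extension and is deliberately not treated as SSE3 here.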

Kratylos
Joined: 23 Nov 05
Posts: 28
Credit: 1669914
RAC: 0

RE: S5T0303.dat -

Message 39011 in response to message 38988

Quote:

S5T0303.dat

- eliminated double jumps
- reduced amount of FPU macro ops
- removed double loads on general purpose registers

- SSE3 truncation
- simpler address generations and some FPU optimizations
- better SSE register usage

So, use this patch only on SSE3 CPUs!

Akos, something is wrong with this file: it is 40 KB in size (?).

Pepperammi
Joined: 20 Feb 05
Posts: 131
Credit: 437943
RAC: 0

http://einstein.phys.uwm.edu/

http://einsteinathome.org/task/34862246
Mostly S5T0001, with the last few % on S5T0301.
Valid and credit granted.

Stick
Joined: 24 Feb 05
Posts: 790
Credit: 33132401
RAC: 1101

Same for this S5T0303 result

Message 39013 in response to message 38993

Same for this S5T0303 result - "Checked, but no consensus yet". And, in both cases, the comparison was against a "standard app". Will try S5T0304 next.

Quote:

This result using S5T0302 is another 10+% faster BUT it is listed as "Checked, but no consensus yet".

Quote:

This is my first result using S5T0301 - an improvement of 6 or 7% over S5T0003 (validated OK).

Quote:

This is my first full result with the S5T0003 patch. And this result was a hybrid - about 10% standard app/90% patched. It appears the patch is 10 to 12% faster than the standard app on my 2.4 GHz P4 w/Windows XP Pro.

Edit: This is my "Results for computer" (for comparison).
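The successive gains quoted in this post compound rather than add up. A rough sketch of the arithmetic follows. It assumes each percentage is a reduction in run time relative to the previous version, and it uses midpoints of the quoted ranges (11, 6.5 and 10); both are assumptions for illustration, not measured values, and `combined_time_ratio` is a made-up name.

```python
def combined_time_ratio(reductions_percent):
    """Multiply successive run-time reductions into one overall ratio.

    Each entry is the percentage by which one version cut the run time
    of the version before it (assumed interpretation of "x% faster").
    """
    ratio = 1.0
    for reduction in reductions_percent:
        ratio *= 1.0 - reduction / 100.0
    return ratio


# Assumed midpoints: standard -> S5T0003, S5T0003 -> S5T0301, S5T0301 -> S5T0302
ratio = combined_time_ratio([11, 6.5, 10])
print(f"remaining run time: {ratio:.3f} of the standard app")
print(f"overall reduction: {(1 - ratio) * 100:.1f}%")
```

Under those assumptions the three steps together come to roughly a quarter less run time than the standard app, a little less than the 27.5% a simple sum would suggest.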




Laser Jock
Joined: 9 Mar 05
Posts: 3
Credit: 6664015
RAC: 0

I have 4 valid results

I have 4 valid results (http://einsteinathome.org/workunit/10050987, http://einsteinathome.org/workunit/10063556, http://einsteinathome.org/workunit/10122747, and http://einsteinathome.org/workunit/10123498) using the S5T0301 patch and another one pending (http://einsteinathome.org/workunit/10048682). All of the results showed a ~12% improvement in speed over the standard app.

Crunchers For More Power
Joined: 3 Aug 05
Posts: 69
Credit: 1071273
RAC: 0

S5T0304 Valid :-) result

S5T0304 Valid :-)
result
