RE: C37 is really slower on
I have an AMD Athlon(tm) 64 Processor 3400+ (AuthenticAMD), and it seems that C37 is the same speed or maybe 1-2% slower than A36 on it.
I also run a P4; it now runs a unit in 12500 sec instead of 15900 sec with A36.
Anders n
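For anyone who wants to put the P4 numbers above into relative terms, here is a minimal sketch. It only uses the two run times quoted in the post; the function names are made up for illustration and have nothing to do with the albert app itself.

```python
def speedup(old_seconds, new_seconds):
    """How many times faster the new run is compared to the old one."""
    return old_seconds / new_seconds


def time_saved_percent(old_seconds, new_seconds):
    """Reduction in run time, as a percentage of the old run time."""
    return 100.0 * (old_seconds - new_seconds) / old_seconds


# Run times per unit on the P4, as quoted in the post above.
a36_seconds = 15900
c37_seconds = 12500

print(f"speedup: {speedup(a36_seconds, c37_seconds):.3f}x")                 # speedup: 1.272x
print(f"time saved: {time_saved_percent(a36_seconds, c37_seconds):.1f}%")   # time saved: 21.4%
```

So on that P4, C37 needs roughly a fifth less time per unit than A36, while on the Athlon 64 the difference is within a percent or two.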
RE: I'm working on it,
I'd like to test it when it's finished ;)
Thank you for your ambitious work.
CU HiNuN
RE: I'd like to test it if
I hope you know that SSE2 code won't be faster than optimised FPU code if your processor has just a 64-bit wide SSE engine.
C37 running about 5-10%
C37 running about 5-10% faster than A36 on my Pentium D 830 (times are now a bit less than half those on the original albert app).
RE: I hope, you know that a
Yes, it's clear. :)
RE: RE: I hope, you know
Do you perhaps have a Conroe?
RE: Perhaps do you have a
No, but I could test it on a Yonah notebook.
IMHO the Yonah processor also supports 128-bit SSE2, doesn't it?
RE: RE: Perhaps do you
Yonah supports the SSE instructions and its SSE registers are 128 bits wide, but the execution engine is only 64 bits wide. (The core handles each 128-bit instruction as 2 overlapped 64-bit operations, which adds some latency. That means about a 20% performance loss.)
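To illustrate why a 64-bit wide engine takes away most of the SSE2 advantage, here is a deliberately crude toy model. It is a sketch under loud assumptions, not a description of the real Yonah pipeline: it assumes one pass through the execution engine per cycle, ignores the overlap and latency effects mentioned above, and all function names are hypothetical.

```python
def packed_add_cycles(n_doubles, engine_bits, register_bits=128, element_bits=64):
    """Cycles to add n_doubles values with packed 128-bit operations.

    Toy assumption: the engine needs register_bits / engine_bits passes per
    packed operation, and each pass costs one cycle.
    """
    elements_per_op = register_bits // element_bits        # 2 doubles per 128-bit register
    passes_per_op = max(1, register_bits // engine_bits)   # 2 passes on a 64-bit engine
    n_ops = -(-n_doubles // elements_per_op)               # ceiling division
    return n_ops * passes_per_op


def scalar_add_cycles(n_doubles):
    """Cycles for one-value-at-a-time code, assuming one add per cycle."""
    return n_doubles


n = 1000
print(scalar_add_cycles(n))        # 1000
print(packed_add_cycles(n, 128))   # 500
print(packed_add_cycles(n, 64))    # 1000
```

In this simplified picture the packed code only wins when the engine is a full 128 bits wide; with a 64-bit engine it ends up level with scalar code before any extra latency is counted.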
RE: but the execution
So I was wrong to think that Yonah already supports "real" 128-bit instruction execution. Sorry. :-/
Ok, let's buy a Conroe ;)
RE: Ok, let's buy a Conroe
Yeah... :)
Well, I recoded the core of albert, but I cannot test it because I don't have an SSE2-capable machine. Where is my Conroe??? :)
Would you like to test it? S37a
This is an SSE2 version of C37.