I just built an AMD Phenom II X6 which does 6 Einstein GC WUs in about 18,000 sec (no GPU). The system has 8 GB of 1600 MHz DDR3 and a 1.5 TB Barracuda, for right around $700. Running full bore it draws 186 W (average over 10 days). It's running at stock 3.2 GHz.
The Binary Pulsar search takes right around 60,000 sec (CPU).
Not a fanboy of either manufacturer, but I will probably need another system in a month or so and I'm trying to decide between an i7 and another Phenom.
Thanks,
Joe
There are reports that Intel does better here at Einstein. That may be true, but I am not sure I would let it be my deciding factor.
System costs are not that much different. It all depends on how much you are into overclocking. The 2500K (200€) has no Hyper-Threading but can clock quite high, so its credits per day are about the same as a Phenom II X6's (~200€). The HT of the 2600K (300€) adds roughly 30% more crunching power.
The AMD mainboards seem a little less expensive, but this also depends on your preferences.
I wouldn't swap my system for a Phenom X6, or vice versa, because the crunching power is more than sufficient.
Mike, Hotze,
Thanks. E@H helps me justify a bit more processing power (money) but the critical factor for me is response time in my other apps.
I'm still trying to wrap my feeble brain around the common knowledge that HT increases throughput of these compute-bound tasks by 30%. I thought HT would only speed up context switches, but it's hard to believe total multitasking overhead is 30% in an environment with a reasonable number of tasks, so I must be missing something.
I'm trying to figure out how to turn off HT on my older Xeon box and my i5-powered laptop to see for myself. Any links? I'm running Ubuntu 10.04 on both.
Joe
Did you try the BIOS?
I can disable it for my i5 dual-core, or use only one core with HT.
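For the Ubuntu machines, HT can also be checked without rebooting into the BIOS, and a logical CPU can even be taken offline at runtime via sysfs (`echo 0 > /sys/devices/system/cpu/cpuN/online`, as root) for a quick A/B test. A minimal sketch, assuming the standard Linux `/proc/cpuinfo` fields `siblings` and `cpu cores` (the helper `ht_active` is mine, not a standard tool):

```python
import os

def ht_active(cpuinfo_text):
    """Return True if logical CPUs per package exceed physical cores,
    False if they match, None if the fields are missing."""
    siblings = cores = None
    for line in cpuinfo_text.splitlines():
        if ":" not in line:
            continue
        key, _, value = line.partition(":")
        key = key.strip()
        if key == "siblings":
            siblings = int(value)       # logical CPUs per package
        elif key == "cpu cores":
            cores = int(value)          # physical cores per package
        if siblings is not None and cores is not None:
            break
    if siblings is None or cores is None:
        return None
    return siblings > cores

if os.path.exists("/proc/cpuinfo"):
    with open("/proc/cpuinfo") as f:
        print("HT active:", ht_active(f.read()))
```

With HT disabled in the BIOS (or half the logical CPUs offlined), `siblings` drops to equal `cpu cores`.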
I did not. I was hoping for something to read first but I'll check it out when I get back home today.
Joe
Actually these speedups are observed in cases where the multitasking overhead is negligible.
It is true that one good way to think about HT is as extremely rapid context switching, but it is not a substitute for the explicit context switches visible to the OS. Instead, it takes a pair of tasks, each of which appears to the OS to run uninterrupted (and un-task-switched) on its own CPU for a nice long stretch, and has them trade active use of the CPU at an extremely fine grain.
So why does that help? Because the CPU has a number of separately scheduled and somewhat independent resources, commonly including, but not limited to, the floating-point unit, the main ALU, and external memory access. At any moment, one of these can be the unit the running thread is waiting on for data, which means the others sit idle. But with a full copy of processor state for the other thread almost instantly available, an idle unit may find that the other thread has data ready for it to process. In fact, though at any given moment only one of the two tasks is being "charged" CPU time, there are sufficient interlocks to allow some overlapped processing, so work for both tasks is often in flight in different parts of the machine at the same time.
It is actually more complicated than that, but the big picture is that when hyperthreading works well, the various computing resources in the machine spend a higher fraction of their time actually doing something useful. There is switch overhead, but it is so small that it is quite rare to find cases where HT actually slows things down. There was such a case here on Einstein a few years ago, where some late stage in the wonderful sequence of akosf optimizations created code which gave slightly lower throughput with HT enabled on my Gallatin host than with it turned off. I don't know whether that same code would also suffer on the rather different HT implementation of the Nehalem family--quite possibly not.
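The stall-filling idea can be illustrated with a toy model: two instruction streams share a single issue port, a memory op stalls its own stream for a few cycles, and the second stream issues during the first one's stalls. This is a cartoon with invented latencies, not a model of any real pipeline:

```python
# Toy SMT model: one issue port, "mem" ops stall their stream,
# the other stream fills the gaps. All latencies are invented.
MEM_LATENCY = 4  # extra cycles a stream is blocked after a memory op

def run(streams):
    """Cycles to retire all ops, issuing at most one op per cycle."""
    pos = [0] * len(streams)    # next op index per stream
    ready = [0] * len(streams)  # cycle at which each stream may issue
    cycle = 0
    while any(pos[i] < len(streams[i]) for i in range(len(streams))):
        for i in range(len(streams)):
            if pos[i] < len(streams[i]) and ready[i] <= cycle:
                op = streams[i][pos[i]]
                pos[i] += 1
                if op == "mem":
                    ready[i] = cycle + 1 + MEM_LATENCY
                break  # single issue port: one op per cycle
        cycle += 1
    return cycle

task = ["alu", "mem", "fpu", "mem", "alu", "fpu"] * 50

serial = run([task]) + run([task])   # run the two threads one after the other
smt = run([list(task), list(task)])  # interleave them on the shared port
print("serial:", serial, "smt:", smt)
```

With interleaving, total cycles come in well under the serial sum because the second stream issues during the first one's memory stalls; the floor is one issue per cycle.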
Thank you.
I'm beginning to understand.
Joe
Here are comparisons to a couple of other platforms, in addition to the OP's comparison to the always-popular overclocked Q6600.
Configurations are as follows -
Sandy Bridge: i7-2600K @ 3.5 GHz, 2 x 4GB DDR3 1333, Nvidia GT 240
Nehalem: i7-920 @ 3.5 GHz, 3 x 2GB DDR3 1410, ATI 4670
Lynnfield: i7-860 @ 2.8 GHz, 4 x 2GB DDR3 1333, ATI 4670
Notes: the Sandy Bridge was just assembled, without tuning, and is not currently running any GPU jobs (trouble loading Nvidia drivers); OS = Ubuntu 10.10. The other two platforms have been under-volted to optimize power draw; OS = Windows 7.
All systems are running with Hyper-Threading = ON and 8 jobs at the same time. I also limited the comparison to the current gravity-wave jobs.
First I set my i7-920 to 3.5 GHz for a clock-for-clock comparison:
i7-920: GW Job = 20,000 sec drawing 285 watts
i7-2600K: GW Job = 18,200 sec drawing 155 watts
Next, with both the i7-860 and the i7-2600K drawing 155 watts:
i7-860: GW Job = 28,300 sec
i7-2600K: GW Job = 18,200 sec
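The posted run times and wall-power figures fold into a rough throughput-per-watt number. A sketch using the numbers from this thread (it assumes every concurrent job takes the quoted wall time, which is only approximately true, and the watts are whole-system draw):

```python
SECONDS_PER_DAY = 86400

def jobs_per_day(concurrent_jobs, seconds_per_job):
    # With N jobs always in flight, N jobs finish every seconds_per_job.
    return concurrent_jobs * SECONDS_PER_DAY / seconds_per_job

# host: (concurrent jobs, sec/job, system watts) -- figures from the thread
hosts = {
    "Phenom II X6 (OP)":  (6, 18000, 186),
    "i7-2600K @ 3.5 GHz": (8, 18200, 155),
    "i7-920  @ 3.5 GHz":  (8, 20000, 285),
    "i7-860  @ 2.8 GHz":  (8, 28300, 155),
}

for name, (jobs, secs, watts) in hosts.items():
    jpd = jobs_per_day(jobs, secs)
    print(f"{name}: {jpd:.1f} jobs/day, {jpd / watts:.3f} jobs/day/watt")
```

On these figures the Sandy Bridge is roughly twice as efficient per watt as the Nehalem at the same clock.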
For those interested in under-volting this new chip, here are some power numbers. I was not overly aggressive with the under-volt, just a -0.1 V offset (this board uses offsets rather than setting an absolute voltage). It has been running stable for 1 week.
Configuration:
i7-2600K at stock 3.4 GHz
2 x 4GB DDR3-1333 at 1.5 volts
ASUS P8H67-M mATX motherboard (should have gotten a P67 board)
ATI 4670 with ATI drivers (GT-240 had HW failure, so removed)
620 watt 80+ Bronze PSU
Ubuntu 10.10
Running 8 gravity wave jobs at the same time (HT=ON) with v1.07 application.
Power = 128 watts, or about 3.07 kWh per day
Monthly electrical cost = 3.07 kWh/day * 30 days * $0.11/kWh ≈ $10.14
Can also confirm an idle power draw of 60 watts.
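The cost arithmetic generalizes to any measured draw. A small sketch (the $0.11/kWh rate is the poster's; substitute your own utility rate):

```python
def monthly_cost(watts, dollars_per_kwh, days=30):
    """Average wall draw in watts -> (kWh/day, cost per `days` days)."""
    kwh_per_day = watts * 24 / 1000
    return kwh_per_day, kwh_per_day * days * dollars_per_kwh

for draw, label in [(128, "crunching"), (60, "idle")]:
    kwh, cost = monthly_cost(draw, 0.11)
    print(f"{draw} W {label}: {kwh:.2f} kWh/day, ${cost:.2f}/month")
```

At these rates the box costs about three times as much per month crunching as it would sitting idle.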
As you have an H67 board, I suppose you could have used the onboard graphics on your 2600K and saved appreciable power.
Why not? Games...?
Any idea how much the 4670 is contributing to your power numbers?
While your supply has a high efficiency rating, your system is running so far down in its capacity range that you are probably giving up some efficiency compared with a comparably good but lower-capacity supply.