Interesting CPU behavior

Matt White
Matt White
Joined: 9 Jul 19
Posts: 116
Credit: 170,443,036
RAC: 444,909
Topic 219674

My DL360 server is equipped with 2 Xeon 6 core processors. After a few startup bumps, it has been running and crunching without issue since mid July. About a week ago, it appears that one of the processors began to get lazy. Looking at the temperature graph it was apparent that the second processor was not doing it's share of work.

Archae86 had mentioned a program called Process Lasso which give the user more precise control over CPU assignment and process control. Once installed and configured, the split CPU behavior was eliminated.

This program looks like strong medicine with a considerable amount of options, but, I would highly recommend installing it on any Windows box with multiple cores, especially a server.

A big thank you to Archae86 for mentioning it!

Clear skies,
Matt
JStateson
JStateson
Joined: 7 May 07
Posts: 112
Credit: 1,161,780,248
RAC: 878,238

Matt White wrote:My DL360

Matt White wrote:
My DL360 server is equipped with 2 Xeon 6 core processors. After a few startup bumps, it has been running and crunching without issue since mid July. About a week ago, it appears that one of the processors began to get lazy

I have a similar problem with a X8DTL-iF.  I keep it in the garage and had to add liquid cooling for both Xeons.  They are cooled in tandem so the one closes to the radiator is cooler.  I had to solve the overheating by reducing the speed of all cores, not just the one that runs hotter.

Thinking about this I got to wishing that the Boinc client would tag each work unit with the core much like the GPU is tagged.  Not sure if that is even possible as threads and cores can change as the OS sees fit unlike the GPUs.  It might be useful to permanently assign a hyperthread or more likely just the core.  One could tell if something nefarious was happening by comparing various completion times for cores. 

Matt White
Matt White
Joined: 9 Jul 19
Posts: 116
Credit: 170,443,036
RAC: 444,909

JStateson wrote:Thinking

JStateson wrote:
Thinking about this I got to wishing that the Boinc client would tag each work unit with the core much like the GPU is tagged.  Not sure if that is even possible as threads and cores can change as the OS sees fit unlike the GPUs.  It might be useful to permanently assign a hyperthread or more likely just the core.  One could tell if something nefarious was happening by comparing various completion times for cores. 

Process Lasso does just that. You can balance and assign tasks to a work unit forcing all the cores to load share. It will tell you what cores are assigned to each task, even if you use the auto assign procedure. 

It's a very nice tool.

Clear skies,
Matt
Matt White
Matt White
Joined: 9 Jul 19
Posts: 116
Credit: 170,443,036
RAC: 444,909

Just to clarify, the working

Just to clarify, the working processor wasn't overheating. The 2nd processor was not doing its share of the work. I generally keep the processor utilization at 60%, which allows one work unit for each physical core, plus two GPU tasks.

Clear skies,
Matt

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.