Einstein causes PC to restart, Milky Way works fine

Adam
Adam
Joined: 15 Oct 11
Posts: 12
Credit: 342316741
RAC: 396636
Topic 228732

I'm currently using BOINC 7.20.2 and running Einstein@home and Milkyway@home. My computer consists of:

 

Ryzen 5800X3D

MSI B550-A Pro

32GB DDR4 3600

Radeon 6800XT

Seasonic 750w 80+ titanium

Most recent Windows 10 update

Most recent BIOS update

Computer is custom water cooled

 

When running einstein@home tasks, the computer abruptly restarts within a few minutes. No blue screen, no warning, nothing. I attempt looking in Windows event viewer but searching the specific error doesn't get me anywhere and I don't know enough about Windows errors to narrow my search. It's always a "bug check" that causes a sudden reboot followed by lots of digits in parenthesis.

I can run Milky Way all day long with CPU and GPU tasks. If I disable GPU tasks and try to run einstein, it still restarts after a few minutes. If I change the CPU utilization setting to 25%, I can get more time but it always eventually restarts. Sometimes I'll get a Radeon driver pop up message after Windows loads saying it detected a black screen or crash and asks if I want to submit a crash report.

This computer is my VR gaming computer. I recently formatted the hard drive and reloaded Windows due to frequent crashes with Flight Simulator 2020, Half Life Alyx, and BOINC. After reloading the OS, drivers, games and everything else, I did lots of trial and error with Steam VR settings and all my VR games work perfectly now. FS2020 needed extra in-game tweaking but it also hasn't crashed yet. Einstein is the last hold out.

Some settings that could potentially impact BOINC stability: in BIOS I have "MSI Kombo Strike" activated for the 5800X3D. It is an overclocking feature specific to the 5800X3D alone. I also have smart access memory activated. The other name for it is resizable BAR. I have my Windows power plan set to "high performance". I don't think the computer is running out of juice or getting too hot. I think this is a software issue. Here are some screen shots of MSI Afterburner after running Milky Way for over an hour.

 

https://imgur.com/J6ziJFU

https://imgur.com/3lJzbMi

https://imgur.com/JZww5yB

 

If you think any info from event viewer would be helpful please let me know.

Gandolph1
Gandolph1
Joined: 20 Feb 05
Posts: 180
Credit: 389655826
RAC: 1269

Just a quick guess, but the

Just a quick guess, but the power supply would cause all of the symptoms you describe including crashing while gaming. 

 

Einstein may be drawing more resources than your other project causing the supply to trip.  I personally have had this happen with a Seasonic 850.  I had to move to the Seasonic 1300 watt to eliminate all random crashing.  While your 750 watt SHOULD be enough, with your system I wouldn't go any less than an 850 and quite frankly Id buy a 1000w, just cause.....

 

 

 

GWGeorge007
GWGeorge007
Joined: 8 Jan 18
Posts: 3070
Credit: 4974457686
RAC: 1431400

Adam wrote: When running

Adam wrote:

When running einstein@home tasks, the computer abruptly restarts within a few minutes. No blue screen, no warning, nothing. I attempt looking in Windows event viewer but searching the specific error doesn't get me anywhere and I don't know enough about Windows errors to narrow my search. It's always a "bug check" that causes a sudden reboot followed by lots of digits in parenthesis.

If you think any info from event viewer would be helpful please let me know.

Gandolph1 wrote:

Just a quick guess, but the power supply would cause all of the symptoms you describe including crashing while gaming. 

Einstein may be drawing more resources than your other project causing the supply to trip.  I personally have had this happen with a Seasonic 850.  I had to move to the Seasonic 1300 watt to eliminate all random crashing.  While your 750 watt SHOULD be enough, with your system I wouldn't go any less than an 850 and quite frankly Id buy a 1000w, just cause.....

I firmly agree with Greg (Gandolph1).  Get a 1000W PSU as a minimum, Platinum or Titanium rated.  1300W or even higher if you can afford it.  My preference would be either EVGA or Seasonic, with first choice being an EVGA 1300W Platinum.  It should fit in your computer case.

For your GPU a 750W PSU is rated as a MINIMUM, and that is from AMD.  That assumes that you don't do any overclocking, and are doing 'normal' things with your computer.  It does not include running BOINC projects 24/7.

George

Proud member of the Old Farts Association

Link
Link
Joined: 15 Mar 20
Posts: 121
Credit: 10360622
RAC: 54111

Adam wrote:Some settings that

Adam wrote:
Some settings that could potentially impact BOINC stability: in BIOS I have "MSI Kombo Strike" activated for the 5800X3D. It is an overclocking feature specific to the 5800X3D alone. I also have smart access memory activated. The other name for it is resizable BAR.

Of course you should first try to get your system at default speed to run stable and than see how (and if) you can overclock it. Not every system can be overclocked, specially not for running 24/7 at BOINC load.

 

Adam wrote:
I don't think the computer (...) getting too hot.

Don't think/try to guess, check.

But I agree with the two posts above, all you write sounds like not stable power delivery.

.

Adam
Adam
Joined: 15 Oct 11
Posts: 12
Credit: 342316741
RAC: 396636

Thank you everyone. Since my

Thank you everyone. Since my post I've tried turning off Kombo Strike and that didn't make a difference. The pictures I posted show that temps for the CPU and GPU are well below their thermal throttling points so that's why I've ruled out thermals.

Does anyone know the name of the tool I need to measure power draw from the wall? I'm able to see with software that the CPU is pulling around 100w and the GPU can pull as much as 265w. I've seen tech tubers like Linus and Steve use the tool but I'm not sure what it's name is.

I should mention I have a second very similar computer that I use for pancake gaming and einstein works fine on that one. I didn't mention it yet to try and minimize the already huge wall of text in my initial post. The second computer consists of:

Ryzen 5800X

MSI X570 Gaming Edge Wifi

32GB DDR4 3600

Same Radeon 6800XT

Same Seasonic 750w 80+ titanium

Same Most recent Windows 10 update

Older BIOS version

Older AMD graphics drivers from 1 year ago

Same custom water cooled setup

It's also using PCIe gen 3 instead of 4 due to the riser cable, no smart access memory, no kombo strike. I agree when computers abruptly restart that sounds like slam dunk power supply issue. I just want to be absolutely sure it's not software before dropping a good chunk of change on a kilowatt PSU especially when my other very similar computer with the same PSU is running einstein with no problems.

GWGeorge007
GWGeorge007
Joined: 8 Jan 18
Posts: 3070
Credit: 4974457686
RAC: 1431400

Adam wrote:Does anyone know

Adam wrote:

Does anyone know the name of the tool I need to measure power draw from the wall? I'm able to see with software that the CPU is pulling around 100w and the GPU can pull as much as 265w. I've seen tech tubers like Linus and Steve use the tool but I'm not sure what it's name is.

Kill-a-Watt meter is the most common, but there are many others depending on your budget and preference.

https://www.amazon.com/s?k=power+draw+meter&crid=2UP42WC9BKXS&sprefix=power+draw+meter%2Caps%2C106&ref=nb_sb_ss_ts-doa-p_1_16

Keep in mind that the useful limit of drawing power from the wall is about 75% of the rating of your power supply for drawing power 24/7 with overclocking.  The point that we're trying to make is the voltage spikes that come from your graphics card (GPU).  Some are higher than others, and some power supply units (PSU) will suppress power spikes better than others.

You'd be better served to not have a power supply with a rating below 1000W, regardless.  The higher the better, and also with a 80+ rating of Platinum or Titanium. 

.....[EDIT].....

BTW, your GPU can draw as much as 300W TDP, not 265W.

https://www.techpowerup.com/gpu-specs/radeon-rx-6800-xt.c3694

George

Proud member of the Old Farts Association

Keith Myers
Keith Myers
Joined: 11 Feb 11
Posts: 4968
Credit: 18767118998
RAC: 7123842

From the looks of it you are

From the looks of it you are dumping on the new GW tasks which put a much bigger load on both the cpu and gpu.  No issues with the much easier GR and Meerkat tasks.  And of course the Separation tasks at MW are a trifle for any gpu.

You need a larger power supply to handle the increased loading of the gpu from 0-90% and the big load on the cpu for the final 10% toplist calculation on the cpu for each task.

 

archae86
archae86
Joined: 6 Dec 05
Posts: 3157
Credit: 7229334881
RAC: 1147158

Kill-a-watt meters and the

Kill-a-watt meters and the like are great for figuring out your actual average power consumption from a cost of computing point of view.  They are useless for the problem at hand, as they don't have fine enough time resolution.

A modern GPU can and does pull transients of far higher than average power consumption.

Plenty of capacitors exist both on the power supply and inside the system so that the shorter-duration spikes may have little effect--but something sneaks through.  If ANYTHING in the system does not like the resulting transients, trouble may come.

More likely, if you have outright restarts, the power supply's internal protection circuitry detected a transient for which the combination of width (time duration) and height (amperage above a limit) exceed the actual triggering of protection.

If you want to instrument that, you might look into just using a low-value (extremely) resistor in series with the wall supply line and an oscilloscope. Good luck finding out what value you need to avoid, but at least you'd have the capability of comparing different scenarios.  If you peak inside a Kill-a-watt, you'll find it is measure voltage across a low-value resistor.

The one thing I'd exclude is that the Einstein software has some magic flaw, which mysteriously figures out how to activate on your system, but not on thousands of other user systems running the same code.

You are not the first, and won't be the last, to post here something on the lines of "everything else works, so it must be that Einstein is at fault".  They all forget the other side of the coin "everybody else's system does not have this problem, and yours does--so there has to be something different (and quite likely faulty) about your system".

Good luck.

 

 

 

Adam
Adam
Joined: 15 Oct 11
Posts: 12
Credit: 342316741
RAC: 396636

Keith Myers wrote: From the

Keith Myers wrote:

From the looks of it you are dumping on the new GW tasks which put a much bigger load on both the cpu and gpu.  No issues with the much easier GR and Meerkat tasks.  And of course the Separation tasks at MW are a trifle for any gpu.

 

How were you able to tell that? I'd love to know for future troubleshooting.

Thanks for the advice everyone. I was just hoping to exhaust my free options first (driver problem, Windows settings etc) since a Seasonic or EVGA 1000w+ titanium PSU are easily $300+.

I turned kombo boost back on and found it actually reduced my CPU power usage from ~100w to 88w while boosting clock speed so kudos to MSI for figuring out how to pull that off. I also rearranged the cables on my power supply since it's fully modular thinking maybe there was a faulty plug. Neither made a difference, computer restarted after just a few minutes of running Einstein. Sounds like it's not worth pursuing measuring the computer's power draw. I'll start thinking about my next move.

Link
Link
Joined: 15 Mar 20
Posts: 121
Credit: 10360622
RAC: 54111

Adam wrote:I was just hoping

Adam wrote:
I was just hoping to exhaust my free options first

One free option that comes to my mind is swapping those two "identical" power supplies. Maybe the other can handle this system better.

.

Adam
Adam
Joined: 15 Oct 11
Posts: 12
Credit: 342316741
RAC: 396636

True! I was originally

True! I was originally thinking about swapping the identical GPU's but that would require the time consuming process of draining and refilling the coolant. Swapping PSU's would take no time since they're modular, all the wires can stay put. Can't believe I didn't think of that ugh! Also I realized they aren't identical, the other one is a titanium 750 and this one is a platinum 750. Maybe that's all this one needs? I'll report back what happens, thank you.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.