System stability and efficiency

Keith Myers

Joined: 11 Feb 11

Posts: 4964

Credit: 18749968477

RAC: 7086364

MontanaDoug wrote: Some more

24 Dec 2020 22:56:57 UTC

Message 181909 in response to message 181905

(moderation:

)

MontanaDoug wrote:

Some more questions:

1. My recent change to try 3 jobs/GPU seems to have done something. I had been running consistently 2/GPU, now I'm running 3 on device 0, and 1 on device 1. Am I limited in GPU memory?? CPU memory??? device 0 is my 8GB unit, device 1 is 4GB.

2. "Running (0.9 CPUs + 0.5 AMD/ATI GPUs(device #))". Can you refer me to an explanation of this, like - why .9 CPUs and is that toggled in the preferences somewhere? I also thought that the 0.5 GPUs would change when I went to 3/GPU. but no????

Fun on Xmas eve!

Cheers, Doug

No, that is set by the scheduler in the server settings. The application developer set the 0.9 cpu usage for the application as natively delivered.

If you want to change that to your own preferences you need to write and run an app_info.xml for the application.

You can read about it here. Client/Application configuration document

MontanaDoug

Joined: 16 Dec 20

Posts: 17

Credit: 15418321

RAC: 0

MontanaDoug wrote: 1. My

25 Dec 2020 15:44:21 UTC

Message 181922 in response to message 181905

(moderation:

)

MontanaDoug wrote:

1. My recent change to try 3 jobs/GPU seems to have done something. I had been running consistently 2/GPU, now I'm running 3 on device 0, and 1 on device 1. Am I limited in GPU memory?? CPU memory??? device 0 is my 8GB unit, device 1 is 4GB.

Answered my question: I was CPU limited. I've increased my CPU utilization to 50% of the processors and now have both GPUs processing 3 tasks. My temps (GPU and CPU) are a bit higher than I'd like - may need the liquid methane cooling!

Tom M

Joined: 2 Feb 06

Posts: 6460

Credit: 9583050521

RAC: 7012568

MontanaDoug

26 Dec 2020 17:27:00 UTC

Message 181942 in response to message 181904

(moderation:

)

MontanaDoug wrote:

Annoyance!!

In BOINC Manger>Options>Computing preferences, It states that it is using web based preferences from WGC. I've tried many different ways of changing that to Einstein - NO LUCK!

Suggestions?

This is part of my system stability project. Progress is being accomplished (at a snail's pace)

Normally the "last" website you make changes on in "preferences" will (eventually) be the one listed as your "preference" website.

Since I normally use the local preferences on each machine it is a non-issue and gives me more system-specific control over each machine (I do have several).

Tom M

A Proud member of the O.F.A. (Old Farts Association). Be well, do good work, and keep in touch.® (Garrison Keillor) I want some more patience. RIGHT NOW!

Tom M

Joined: 2 Feb 06

Posts: 6460

Credit: 9583050521

RAC: 7012568

MontanaDoug

26 Dec 2020 17:33:27 UTC

Message 181943 in response to message 181922

(moderation:

)

MontanaDoug wrote:

MontanaDoug wrote:

1. My recent change to try 3 jobs/GPU seems to have done something. I had been running consistently 2/GPU, now I'm running 3 on device 0, and 1 on device 1. Am I limited in GPU memory?? CPU memory??? device 0 is my 8GB unit, device 1 is 4GB.

Answered my question: I was CPU limited. I've increased my CPU utilization to 50% of the processors and now have both GPUs processing 3 tasks. My temps (GPU and CPU) are a bit higher than I'd like - may need the liquid methane cooling!

There is the gross number of CPU threads you are using for the system (I run 2-4 threads idle depending on the core count of the CPU). And the CPU "amount" per GPU thread.

For a while, I was running 0.125 CPU threads per GPU thread. And not seeing any apparent loss in production/processing speed with up to 3 GPU threads on Radeon 5700's.

Then the processing times started extending. So I regressed to 1 cpu thread per gpu thread (on Gamma-Ray tasks).

When I want to squeeze more cpu production out of my cpus I will start playing with sub 1 cpu threads again.

Tom M

A Proud member of the O.F.A. (Old Farts Association). Be well, do good work, and keep in touch.® (Garrison Keillor) I want some more patience. RIGHT NOW!

Tom M

Joined: 2 Feb 06

Posts: 6460

Credit: 9583050521

RAC: 7012568

Is there a way to run e@h gpu

2 Jan 2021 15:51:24 UTC

Message 182074

(moderation:

)

Is there a way to run e@h gpu tasks and cpu tasks on the same machine under windows?

My last attempt caused my gpu to stop receiving tasks and a large over supply of cpu tasks.

I basically have to run a non-gpu project on my e@h gpu machine to utilize the CPUs at all for cpu tasks.

Tom M

A Proud member of the O.F.A. (Old Farts Association). Be well, do good work, and keep in touch.® (Garrison Keillor) I want some more patience. RIGHT NOW!

Keith Myers

Joined: 11 Feb 11

Posts: 4964

Credit: 18749968477

RAC: 7086364

Use a small cache and a

2 Jan 2021 16:58:08 UTC

Message 182076

(moderation:

)

Use a small cache and a max_concurrent on the cpu applications.

archae86

Joined: 6 Dec 05

Posts: 3157

Credit: 7225828263

RAC: 1055620

Tom M wrote:Is there a way to

2 Jan 2021 17:05:06 UTC

Message 182077 in response to message 182074

(moderation:

)

Tom M wrote:

Is there a way to run e@h gpu tasks and cpu tasks on the same machine under windows?

My last attempt caused my gpu to stop receiving tasks and a large over supply of cpu tasks.

I basically have to run a non-gpu project on my e@h gpu machine to utilize the CPUs at all for cpu tasks.

Yes, although I don't do it.

I think the secret to avoid the "large over supply" category of trouble is to use a really small queue requested size. Try 0.1 day + 0.0 day.

At Einstein, mixing task types (even mixing GW and GRP on the GPU) gives big fluctuations in task duration correction factor, thus big swings in prefetching. On the high end of the swing too much comes in, and later the machine goes to panic mode.

That may not be exactly what happened to you, but I think that keys to getting peaceful coexistence include very small cache size, and appropriate choice of "Use at most" nn% "of the processors". The last point is especially important if the actual number of processors is overstated (as some people do to raise their maximum daily task quota to feed highly capable GPUs).

[edit: ninja overlap with Keith]

MontanaDoug

Joined: 16 Dec 20

Posts: 17

Credit: 15418321

RAC: 0

Hi Tom Thanks for your

2 Jan 2021 17:22:33 UTC

Message 182079 in response to message 182074

(moderation:

)

Hi Tom

Thanks for your suggestions. I've been running smoothly for about a week now. The settings I'm using don't seem to impair any of my spreadsheet work ( or other tasks), so I'm running 24/7. I've given an answer to your question below, though I kinda feel like the amateur instructing the expert! Beware!!

Tom M wrote:

Is there a way to run e@h gpu tasks and cpu tasks on the same machine under windows?

My last attempt caused my gpu to stop receiving tasks and a large over supply of cpu tasks.

I basically have to run a non-gpu project on my e@h gpu machine to utilize the CPUs at all for cpu tasks.

Tom M

Win10 Up-to-date.

My Preferences: (In Einstein)

Running 4 GPU tasks (with 2 GPU's), 3 CPU tasks (all WGC) all at the same time

Use at most: 50.0 % of the processors
Use at most: 100.0 % of the CPU time
Page/swap file: use at most: 75.0 % of total
Memory: when computer is in use, use at most: 50.0% of total

Resource share: 50 (WCG 90)
Use CPU: NO
Use NVIDIA GPU: NO
Use AMD GPU: YES
Use INTEL GPU: NO

Run CPU versions of applications for which GPU versions are available: NO
Allow non-preferred apps: YES
GPU utilization factor of BRP apps: 1.00
GPU utilization factor of FGRP apps: 0.50
GPU utilization factor of GW apps: 0.50

I think I listed the operative factors. I'm running a bit warmer than I'd like (mid 60's max) on the GPU's and CPU's.

Doug

MontanaDoug

Joined: 16 Dec 20

Posts: 17

Credit: 15418321

RAC: 0

Hi Tom BOINC stored about

2 Jan 2021 17:40:40 UTC

Message 182080

(moderation:

)

Hi Tom

BOINC stored about 44 hrs of Einstein work (all GPU) and about 52 hrs of WGC CPU work using the suggested .1 and .25 "store at least" and "store up to" factors.

BOINC must think my system is 10x faster than it is!!!

Don't know if either of my answers is helpful!

Cheers, Doug

Tom M

Joined: 2 Feb 06

Posts: 6460

Credit: 9583050521

RAC: 7012568

MontanaDoug wrote:I've

5 Jan 2021 15:40:39 UTC

Message 182195 in response to message 182079

(moderation:

)

MontanaDoug wrote:

I've given an answer to your question below, though I kinda feel like the amateur instructing the expert! Beware!!

Tom M wrote:

Is there a way to run e@h gpu tasks and cpu tasks on the same machine under windows?

My last attempt caused my gpu to stop receiving tasks and a large over supply of cpu tasks.

I basically have to run a non-gpu project on my e@h gpu machine to utilize the CPUs at all for cpu tasks.

Tom M

Doug,

Your not "instructing" your offering your experience. Your experience is valuable to the community and me.

I have had reliable results combining non-gpu projects like World Community Grid or other projects like Prime Grid or MLC@Home with E@H gpu tasks.

I have had trouble, multiple times, with trying to combine gpu and cpu tasks under E@H.

The system under discussion is this one.

E@H has regularly given me too many CPU tasks and then refused to download any more GPU tasks because there are "too many runnable tasks".

Under Linux, under a limited release, privately available distribution of the Boinc Manager called "Pandora", you can limit the number of specific application cpu tasks downloaded.

Since I was previously having trouble with getting the AMD drivers under Linux 18/20 to install I switched to Windows10 where I was finally able to run 2 Rx 580's and 3 Rx 5700's on the same machine. Something I couldn't do under Linux. The problem with installing the Linux driver has been fixed but I am upto 8th place on the top 50 computer listings so I am reluctant to revert.

I am driving my GPUs with an AMD 2700x (8c/16t) that under standard usage I can run up to 14 threads without overcommitting (and slowing down) the system.

So far I have discovered the Rx 580 8GB GPU's will not run Gamma-Ray tasks for 3 tasks at a time. The GPU tasks on those cards start stalling.

Yesterday I took the Rx 580's offline (on a mining rack you simply unplug the data connections) and resumed running 3 threads per Rx 5700 to see if I could "sustain" the current RAC I have.

This morning I discovered I was about to (again) run out of GPU tasks. So my backup "0" resource project PrimeGrid started running GPU tasks in place of E@H because "it" couldn't download any more GPU tasks from E@H.

I have done two things. 1) PrimeGrid is now set to NNT (No New Tasks) and 2) I aborted about 500 Gravity Wave CPU tasks.

This caused Boinc Manager to ask for and get more Gamma Ray GPU tasks.

I am currently running without faking out the CPU count because so far the GPU task supply has been sufficient to keep my box fed. I have played with faking a higher count of CPU cores but didn't reduce the % of available CPU cores in proportion so experienced a flood of downloaded GW CPU tasks.

One of the questions I have is would I experience the same issues if I was processing Gamma-Ray tasks on the CPU instead of Gravity Wave tasks?

Since I can get it to "run" GPU tasks by manually aborting the oversupply of CPU tasks why do I "have a problem"? I am a big fan of "set and forget". Sometimes it can be days or a week before I look at my Boinc machines. If I have to manually intervene on a regular basis it means it is no longer a "set and forget" rig.

If I get aggravated enough I will simply switch back to a GR GPU-only profile like I have on this machine and run probably run World Community Grid tasks on the CPU. Or even go to a GPU only server.

Tom M

A Proud member of the O.F.A. (Old Farts Association). Be well, do good work, and keep in touch.® (Garrison Keillor) I want some more patience. RIGHT NOW!

System stability and efficiency

Forums › Cruncher's Corner

Comment viewing options

Forums › Cruncher's Corner