Daily task quota

Tom M

Joined: 2 Feb 06

Posts: 6871

Credit: 9782757097

RAC: 3119271

DanNeely wrote: Going the

7 Nov 2021 13:38:10 UTC

Message 190243 in response to message 190147

(moderation:

)

DanNeely wrote:

Going the fake GPU route I've actually had the opposite problem. There's a separate clientside limit on max tasks at a time, and my local backlog has become entirely filled with GPU tasks, leaving my CPUs running a backup project for the last few days.

I think I'm going to try switching to fake CPUs instead and see if skewing the notionally desired workload mix will help. If it doesn't help, or if it ends up skewing the problem hard the other direction, I guess I'll have to reduce my desired amount of local data. I don't really like doing so; but unless someone can point me to another setting to fiddle it doesn't look like I can get enough GPU work to have a few days on hand no matter what I do.

10/31/2021 6:40:50 PM | Einstein@Home | update requested by user
10/31/2021 6:40:50 PM | Einstein@Home | [sched_op] sched RPC pending: Requested by user
10/31/2021 6:40:50 PM | Einstein@Home | [sched_op] Starting scheduler request
10/31/2021 6:40:51 PM | Einstein@Home | Sending scheduler request: Requested by user.
10/31/2021 6:40:51 PM | Einstein@Home | Reporting 6 completed tasks
10/31/2021 6:40:51 PM | Einstein@Home | Not requesting tasks: too many runnable tasks
10/31/2021 6:40:51 PM | Einstein@Home | [sched_op] CPU work request: 0.00 seconds; 0.00 devices
10/31/2021 6:40:51 PM | Einstein@Home | [sched_op] NVIDIA GPU work request: 0.00 seconds; 0.00 devices
10/31/2021 6:40:54 PM | Einstein@Home | Scheduler request completed

When I end up with a message like "too many runnable tasks" the simplest way to "clear" things seems to be "reset" the project. I would try to make the changes that will keep you from getting "to many runnable tasks" first though.

I have been running days of work and additional days of work at 0.1 which might help.

Using the <ncpu>16</ncpu> (on an 4c/8 cpu) in the cc_config.xml file seems to be the most straight forward way to lift the daily task limit for me. I am not having that issue right now. But when I was playing with high multiples of gpus cards it certainly bit me.

Tom M

A Proud member of the O.F.A. (Old Farts Association).

Keith Myers

Joined: 11 Feb 11

Posts: 5057

Credit: 19246900929

RAC: 6668723

Why are you concerned about

7 Nov 2021 15:55:08 UTC

Message 190249

(moderation:

)

Why are you concerned about bunkering several days of work at Einstein?

They have very stable servers and have never had any outages that I can remember when they couldn't deliver work when asked.

I just operate on a report one/ask for one replacement schedule and my hosts always have work.

I keep just 30 tasks on hand at all times. No issues whatsoever.

DanNeely

Joined: 4 Sep 05

Posts: 1364

Credit: 3600358266

RAC: 749516

Keith Myers wrote: Why are

11 Nov 2021 15:46:25 UTC

Message 190336 in response to message 190249

(moderation:

)

Keith Myers wrote:

Why are you concerned about bunkering several days of work at Einstein?

They have very stable servers and have never had any outages that I can remember when they couldn't deliver work when asked.

I just operate on a report one/ask for one replacement schedule and my hosts always have work.

I keep just 30 tasks on hand at all times. No issues whatsoever.

My ISP can be slow to repair things if it doesn't affect a lot of customers. ex earlier this year someone (99% sure a neighbor) bumped a utility pole right were the connection between the coax coming down the pole and the coax going into my building were; breaking it badly enough I couldn't kludge up a temporary repair. I called them first thing Monday morning. It was Wednesday afternoon before they were able to send someone out to fix it.

Until my GPUs got fast enough to run into work unit limits I liked to keep a few days of work on hand to cover both stuff like that, and cases when the WU generators run dry just before a weekend. I still want to be able to cover shorter maintenance/repair outages even if I can't do that anymore.

Keith Myers

Joined: 11 Feb 11

Posts: 5057

Credit: 19246900929

RAC: 6668723

Solution for temp outages at

11 Nov 2021 16:05:03 UTC

Message 190338

(moderation:

)

Solution for temp outages at Einstein is to have some backup projects.

Ian&Steve C.

Joined: 19 Jan 20

Posts: 4152

Credit: 49925111567

RAC: 41231379

As this is the most recent

1 Feb 2022 15:28:55 UTC

Message 192303

(moderation:

)

As this is the most recent thread about this issue and with a relevant title, I figured I would post an update. I contacted Bernd and asked him to review and update host daily quota limits, and he has essentially doubled the limits.

Quote:

I doubled the "daily_result_quota" (per CPU) from 32 to 64, which should double the GPU task limit automatically, and "max_ncpus" as well (may need some minutes to propagate). AFAIK there is no adjustable limit for the number of GPUs of a host; if there is one, it's probably hardcoded (table size) in the client an the server as well and can't easily be changed.

BM

so new limits should be 64 tasks per CPU, up to 128 CPUs. and 512 tasks per GPU, up to 8 GPUs.

most people will not have to adjust any settings anymore unless someone is running a really fast GPU with low core count CPU.

_________________________________________________________________________

archae86

Joined: 6 Dec 05

Posts: 3165

Credit: 7388481687

RAC: 2034379

Ian&Steve C. wrote:so new

1 Feb 2022 20:11:24 UTC

Message 192309 in response to message 192303

(moderation:

)

Ian&Steve C. wrote:

so new limits should be 64 tasks per CPU, up to 128 CPUs. and 512 tasks per GPU, up to 8 GPUs.

most people will not have to adjust any settings anymore unless someone is running a really fast GPU with low core count CPU.

Good news. I have so far changed my falsified ncpus value in cc_config.xml on one machine from 24 to 0.

By the way, several people posting here have cited "-1" as the proper ncpus value to get the real detected number into use, but the BOINC online documentation at:

https://boinc.berkeley.edu/wiki/Client_configuration

specifies "0" for this purpose, so I tried that first. As it should have, after doing Options|read config files, followed by an update in BOINC, that got me the expected value of 4 displayed on my account web page.

Perhaps both "-1" and "0" work? Or perhaps "-1" was a mistaken advice?

Ian&Steve C.

Joined: 19 Jan 20

Posts: 4152

Credit: 49925111567

RAC: 41231379

The client probably just

1 Feb 2022 20:30:22 UTC

Message 192310

(moderation:

)

The client probably just ignores the -1 value and reads the number from the system. Personally I just remove that element from the cc_config file altogether which achieves the same end.

_________________________________________________________________________

Keith Myers

Joined: 11 Feb 11

Posts: 5057

Credit: 19246900929

RAC: 6668723

The default value is -1 from

1 Feb 2022 20:56:48 UTC

Message 192311

(moderation:

)

The default value is -1 from a virgin cc_config initialization. The value is pulled from the client source code parameter setting.

tictoc

Joined: 1 Jan 13

Posts: 47

Credit: 7740974286

RAC: 7207989

Ian&Steve C. wrote:As this

3 Feb 2022 1:33:35 UTC

Message 192358 in response to message 192303

(moderation:

)

Ian&Steve C. wrote:

As this is the most recent thread about this issue and with a relevant title, I figured I would post an update. I contacted Bernd and asked him to review and update host daily quota limits, and he has essentially doubled the limits.

Quote:

I doubled the "daily_result_quota" (per CPU) from 32 to 64, which should double the GPU task limit automatically, and "max_ncpus" as well (may need some minutes to propagate). AFAIK there is no adjustable limit for the number of GPUs of a host; if there is one, it's probably hardcoded (table size) in the client an the server as well and can't easily be changed.

BM

so new limits should be 64 tasks per CPU, up to 128 CPUs. and 512 tasks per GPU, up to 8 GPUs.

most people will not have to adjust any settings anymore unless someone is running a really fast GPU with low core count CPU.

Thanks for taking the initiative on this, and thanks to the admin for making the change.

On a side note, sorry to any of my wingmen who were caught waiting for validation due to two driver crashes that resulted in a few thousand of my tasks erroring out last week.

Daily task quota

Forums › Cruncher's Corner

Comment viewing options

Forums › Cruncher's Corner