Not enough CUDA jobs to keep busy

joe areeda
Joined: 13 Dec 10
Posts: 285
Credit: 320378898
RAC: 0
Topic 196533

I have 2 computers that seem to run out of GPU jobs, and I'm not sure whether it's a problem on my part or I'm hitting some sort of daily limit.

As far as I can tell, WUs are completing and being validated at a normal rate.

I did just bump these two to 3 tasks/GPU which makes me think about a limit.

I'd prefer to learn how to figure this out but if it takes privileges to view the details would someone please look at:

http://einsteinathome.org/host/5771544

and

http://einsteinathome.org/host/5860433

Thanks,
Joe

Holmis
Joined: 4 Jan 05
Posts: 1118
Credit: 1055935564
RAC: 0

More info on limits or daily quota can be found in this post by Bikeman:

http://einsteinathome.org/node/196400&nowrap=true#117984

archae86
Joined: 6 Dec 05
Posts: 3153
Credit: 7170184931
RAC: 657658

Do these hosts run work for other projects?

The project balancing activity of the schedulers can lead to non-request for work in situations which seem rather odd to me.

joe areeda
Joined: 13 Dec 10
Posts: 285
Credit: 320378898
RAC: 0

Quote:

Do these hosts run work for other projects?

The project balancing activity of the schedulers can lead to non-request for work in situations which seem rather odd to me.


No, actually e@h is my only project.

Joe

joe areeda
Joined: 13 Dec 10
Posts: 285
Credit: 320378898
RAC: 0

Quote:

More info on limits or daily quota can be found in this post by Bikeman:

http://einsteinathome.org/node/196400&nowrap=true#117984

Thank you for the link. I posted these computers in that thread; hopefully someone will take a look for me.

Joe

Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5870
Credit: 115492845495
RAC: 33766896

Please don't post the same problem in two different threads.

I saw the other one first and replied there. It would have been better to have only posted here with all responses in the one place.

Cheers,
Gary.

joe areeda
Joined: 13 Dec 10
Posts: 285
Credit: 320378898
RAC: 0

Quote:

Please don't post the same problem in two different threads.

I saw the other one first and replied there. It would have been better to have only posted here with all responses in the one place.


Sorry Gary,
I posted the question and was referred to the thread that was directly on point.

What would be the appropriate way to respond to Bernd's request to be notified if anyone else saw the issue after his change?

Joe

Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5870
Credit: 115492845495
RAC: 33766896

Quote:
I posted the question and was referred to the thread that was directly on point.


Being referred to another thread for the purposes of getting relevant information is hardly a demand for you to post there. Unfortunately, the link given to you didn't quite point to the best message for information about daily quotas. The later post by Bikeman gave the new formula being used (32 per CPU core + 160 per GPU) which in your case equates to 256+160 = 416 tasks per day - unless the quota has been diminished by errors. In the links to your hosts you supplied, you can actually see the daily CPU core limit for each machine (both at the max of 32) so there is no problem with errors. You can be confident you have the full quota and your problem must lie elsewhere.
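As a rough check of the arithmetic above, here is a minimal sketch of the quota formula as quoted (32 per CPU core + 160 per GPU). The function name `daily_quota` is made up for illustration, and the host shape of 8 CPU cores and 1 GPU is only an assumption inferred from the 256+160 figure; this is not the server's actual code.

```python
# Constants taken from the formula quoted in the post above.
TASKS_PER_CPU_CORE = 32
TASKS_PER_GPU = 160


def daily_quota(cpu_cores: int, gpus: int) -> int:
    """Hypothetical helper: full daily task quota for a host with no error penalty."""
    return cpu_cores * TASKS_PER_CPU_CORE + gpus * TASKS_PER_GPU


# Assumed host: 8 CPU cores and 1 GPU (inferred from 256 + 160, not confirmed).
print(daily_quota(cpu_cores=8, gpus=1))
```

If a host finishes far fewer tasks per day than this figure, the quota is unlikely to be what is holding back new work.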

So really, the other thread wasn't "directly on point" at all, for the purpose of working out why you weren't getting adequate work.

Quote:
What would be the appropriate way to respond to Bernd's request to be notified if anyone else saw the issue after his change?


First of all, be really sure that the quotas weren't behaving as intended. If you're not really sure, stick with the existing thread and wait for responses. If it becomes clear that the quotas really are screwed, send Bernd a PM, and post your analysis in the other thread too, if you wish. I'm sure Bernd would have no objection to receiving a PM about a genuine bug/problem with the server code.

Please be aware that there was no criticism intended in my earlier response - nor is there any in this one. I have a 'thing' about trying to keep related information in the one place so that others are likely to find it all without extra effort. I would be really pleased if (when you identify the problem) you post your final conclusions here. That way, anyone else having trouble getting CUDA tasks should find suggestions with an appropriate search.

Cheers,
Gary.

joe areeda
Joined: 13 Dec 10
Posts: 285
Credit: 320378898
RAC: 0

Thanks Gary!

I'm still trying to track this down.

I will post here if I find anything pertinent.

I do have a mixture of BOINC versions and am using all my locations, but these two machines are the only ones in the "school" location because they seem to be able to run 3 jobs per GPU.

Joe

joe areeda
Joined: 13 Dec 10
Posts: 285
Credit: 320378898
RAC: 0

I found it and it was an embarrassingly stupid mistake on my part.

I moved these two computers to a separate location so they could have their own project preferences because I wanted to try 3 CUDA tasks per GPU.

I'm not sure how this works exactly because I did get 3 tasks running, up from 2 in the previous location.

However, I completely missed the attribute "Use NVIDIA GPU" which was set to NO.

The only thing I can think of is that I had WUs in the queue and these got processed 3 at a time but the parameter stopped it from requesting new tasks.

After setting the project preference to use the GPU and updating the project, I'm back to heating my office at full blast.

Joe
