Discussion Thread for the Continuous GW Search known as O2MD1 (now O2MDF - GPUs only)

Tom M

Joined: 2 Feb 06

Posts: 6891

Credit: 9810299106

RAC: 3642095

I just got done running my GW

21 Apr 2020 19:35:33 UTC

Message 176946

(moderation:

)

I just got done running my GW gpu tasks down to zilch so I would have a chance to process my pulsar gpu tasks.

Is anyone sucessfully getting both GW's and Pulsar gpu tasks to run at the same time?

I tried using the concurrent_max to reduce the GW's below my gpu count and was unable to get the Pulsar tasks to start until I had run completely out of GW tasks.

Tom M

A Proud member of the O.F.A. (Old Farts Association).

Mr P Hucker

Joined: 12 Aug 06

Posts: 838

Credit: 523479121

RAC: 179941

Tom M wrote:I just got done

21 Apr 2020 21:08:39 UTC

Message 176949 in response to message 176946

(moderation:

)

Tom M wrote:

I just got done running my GW gpu tasks down to zilch so I would have a chance to process my pulsar gpu tasks.

Is anyone sucessfully getting both GW's and Pulsar gpu tasks to run at the same time?

I tried using the concurrent_max to reduce the GW's below my gpu count and was unable to get the Pulsar tasks to start until I had run completely out of GW tasks.

I have no problems getting them to run at once, apart from they're too slow. The gravity takes 2.5-3.5GB of GPU RAM depending on the frequency used. 4 of my cards only have 3GB so I don't bother with gravity. The one with 4GB will get filled up with gravity and gamma, so it slows right down having to use system RAM. So I've set the 3GB cards to gamma only and the 4GB card to run either 2 gamma or 1 gravity, by choice of the server.

As for your problem - could you paste your app_config.xml from the Einstein folder in here, and someone will tell you what's up. There are several limits in force and you may have overlooked one.

If this page takes an hour to load, reduce posts per page to 20 in your settings, then the tinpot 486 Einstein uses can handle it.

Ian&Steve C.

Joined: 19 Jan 20

Posts: 4158

Credit: 50208954882

RAC: 42406473

wow, I happened to catch a

21 Apr 2020 21:49:34 UTC

Message 176950

(moderation:

)

wow, I happened to catch a few VelaJr tasks using a LOT of GPU memory. 3200MB on a single task.

_________________________________________________________________________

Mr P Hucker

Joined: 12 Aug 06

Posts: 838

Credit: 523479121

RAC: 179941

Ian&Steve C. wrote:wow, I

21 Apr 2020 22:06:21 UTC

Message 176954 in response to message 176950

(moderation:

)

Ian&Steve C. wrote:

wow, I happened to catch a few VelaJr tasks using a LOT of GPU memory. 3200MB on a single task.

Half of mine use that much, which is why I've banned gravity from my 3GB cards, even after I get a CPU powerful enough to assist. STILL waiting on that heatsink bracket. The UK's Royal Mail system is horrendously slow, if only Ebay sellers would use the couriers, which are keeping up just fine.

If this page takes an hour to load, reduce posts per page to 20 in your settings, then the tinpot 486 Einstein uses can handle it.

Tom M

Joined: 2 Feb 06

Posts: 6891

Credit: 9810299106

RAC: 3642095

--edit--Quote:I tried

22 Apr 2020 0:19:21 UTC

Message 176957 in response to message 176949

(moderation:

)

--edit--

Quote:

I tried using the concurrent_max to reduce the GW's below my gpu count and was unable to get the Pulsar tasks to start until I had run completely out of GW tasks.

--edit---

Quote:

As for your problem - could you paste your app_config.xml from the Einstein folder in here, and someone will tell you what's up. There are several limits in force and you may have overlooked one.

<app_config><app>
<name>hsgamma_FGRPB1G</name>
</app>
<app>
<name>einstein_O2MD1</name>
</app>
<app>
<name>einstein_O2MDF</name>
<max_concurrent>2</max_concurrent>
</app>
<app>
<name>einstein_O2MD1</name>
<max_concurrent>2</max_concurrent>
</app>
</app_config>

This is comments from here down.

<app>
<name>hsgamma_FGRPB1G</name>
</app>
<app>
<name>einstein_O2MD1</name>
</app>

hsgamma_FGRP5', 'einstein_O2MDF', 'hsgamma_FGRPB1G', 'einstein_O2MD1'

<project_max_concurrent>24</project_max_concurrent>
<max_concurrent>7</max_concurrent>

A Proud member of the O.F.A. (Old Farts Association).

Zalster

Joined: 26 Nov 13

Posts: 3117

Credit: 4050672230

RAC: 0

If you want to prevent a GPU

22 Apr 2020 4:10:53 UTC

Message 176963

(moderation:

)

If you want to prevent a GPU from crunching GW, you have to exclude it from the cc_config.xml. You will have to add it to the xml, identify the GPU and identify the work unit and project. Trying to figure out the numbering sequence of the GPUs is the tricky part. This is an example of what it would look like. It's an older one, trying to find the one I used to use for Einstein but basically it's the same. This was for Seti when I split 2 of 4 GPUs to run seti and the other 2 to run Einstein.

<cc_config>
<options>
<exclude_gpu>
<url>setiweb.ssl.berkeley.edu/beta</url>
<device_num>0</device_num>
<app>setiathome_v8</app>
</exclude_gpu>
<exclude_gpu>
<url>setiweb.ssl.berkeley.edu/beta</url>
<device_num>2</device_num>
<app>setiathome_v8</app>
</exclude_gpu>
</options>
</cc_config>

Zalster

Joined: 26 Nov 13

Posts: 3117

Credit: 4050672230

RAC: 0

Tried to edit but Einstein

22 Apr 2020 4:14:30 UTC

Message 176964

(moderation:

)

Tried to edit but Einstein isn't having it. I was running Seti Beta on 2 of 4 and Einstein on the other 2.

Edit 2.. Found an old Einstein exclude cc_config.xml

<cc_config>
<options>
<exclude_gpu>
<url>http://einstein.phys.uwm.edu/</url>
<device_num>1</device_num>
<app>einsteinbinary_BRP4G</app>
</exclude_gpu>
<exclude_gpu>
<url>http://einstein.phys.uwm.edu/</url>
<device_num>0</device_num>
<app>einsteinbinary_BRP5</app>
</exclude_gpu>
<exclude_gpu>
<url>http://einstein.phys.uwm.edu/</url>
<device_num>2</device_num>
<app>einsteinbinary_BRP5</app>
</exclude_gpu>
</options>
</cc_config>

halfempty

Joined: 3 Apr 20

Posts: 14

Credit: 37595576

RAC: 0

Don't know if it will help,

22 Apr 2020 5:56:01 UTC

Message 176968

(moderation:

)

Don't know if it will help, but I had a similar issue. I have a system with 2 gpu's and had to exclude the GW app from the 1060 3GB to stop the errors. This is the change I made, and so far it's working.

In the <options> section of cc_config.xml in the /BOINC directory.

<exclude_gpu>
   <url>http://einstein.phys.uwm.edu/</url>
   <device_num>1</device_num>
   <app>einstein_O2MDF</app>
</exclude_gpu>

From the first few lines in the event log after startup to know which device number to exclude.

| CUDA: NVIDIA GPU 0: GeForce GTX 1660 SUPER (driver version 442.19 ...
| CUDA: NVIDIA GPU 1: GeForce GTX 1060 3GB (driver version 442.19 ...

Hope this helps.

Tom M

Joined: 2 Feb 06

Posts: 6891

Credit: 9810299106

RAC: 3642095

Thank you for your

22 Apr 2020 10:58:20 UTC

Message 176973

(moderation:

)

Thank you for your responses.

I may try that because I have two different groups of gpus so could segment them. However,

I was able to reduce the # of GW gpu tasks. I was NOT able to get the Pulsar tasks to run until all the GW tasks were gone.

As far as I can tell it was not an issue of all the Pulsar tasks had due dates later than the GW tasks. (Maybe?)

It was not a case of not enough cpu memory (32 GB) and I am running rtx 2060 Super and gtx 1660 Super. So it probably was not a case of gpu ram.

Plus "the night before" it was happily running both kinds of task. The next morning it refused to run the Pulsar tasks.

Its beginning to look like I will need to test this problem again. I would like to run 1 or 2 GW gpu tasks while mainly running the Pulsar tasks.

Tom M

A Proud member of the O.F.A. (Old Farts Association).

cecht

Joined: 7 Mar 18

Posts: 1621

Credit: 3035133538

RAC: 1459495

Tom M wrote:Thank you for

22 Apr 2020 12:54:40 UTC

Message 176977

(moderation:

)

Tom M wrote:

Thank you for your responses.

I may try that because I have two different groups of gpus so could segment them. However,

I was able to reduce the # of GW gpu tasks. I was NOT able to get the Pulsar tasks to run until all the GW tasks were gone.

As far as I can tell it was not an issue of all the Pulsar tasks had due dates later than the GW tasks. (Maybe?)

It was not a case of not enough cpu memory (32 GB) and I am running rtx 2060 Super and gtx 1660 Super. So it probably was not a case of gpu ram.

Plus "the night before" it was happily running both kinds of task. The next morning it refused to run the Pulsar tasks.

Its beginning to look like I will need to test this problem again. I would like to run 1 or 2 GW gpu tasks while mainly running the Pulsar tasks.

I too, tried that, but no luck, or partial luck. By excluding GW GPU tasks from one device in cc_config, I was able to have the excluded GPU run 3X pulsar tasks and the other GPU run 2x GW tasks, but only if I had my work queue already filled with both sets of tasks before setting the exclusion. As soon as pulsar tasks were all completed, the server did not reload them; only GW GPU tasks continued to download. I tried all combinations of exclude_gpu and gpu_usage in the two config files, but no luck. It seems that the E@H download server is interpreting <exclude_gpu> differently than we understand it to work. Is it a task priority issue?

Ideas are not fixed, nor should they be; we live in model-dependent reality.

Discussion Thread for the Continuous GW Search known as O2MD1 (now O2MDF - GPUs only)

Forums › Cruncher's Corner

Comment viewing options

Forums › Cruncher's Corner