Has anyone tried those 1.09 GPU apps yet? They seem to take a while to complete. I was parallel-running those and it took 9hrs per wu to complete with RTX2070.
I am seeing about 9 hours per WU (Gravitational Wave search O2 Multi-Directional v1.01() x86_64-pc-linux-gnu)on a Linux PC (AuthenticAMD AMD Ryzen 7 1700 Eight-Core Process () x86_64-pc-linux-gnuor [Family 23 Model 1 Stepping 1](16 processors) .
What is strange to me is that I have 4 pending waiting for validation. They have been in this state for about 3 days. No credits granted.
Has anyone tried those 1.09 GPU apps yet? They seem to take a while to complete. I was parallel-running those and it took 9hrs per wu to complete with RTX2070.
I'm sure now that O2MD1 v1.01 run times depend very much on the frequency that is included at the end part of a task name (xx.xxHz). The range has been at least from 21.xx to 77.xx Hz. Lower freq tasks run faster. If 21.xx Hz represents basic level of 100 % run time then 77.xx Hz tasks have run about 500 % as long. So there's a clear freq-runtime-curve, perhaps similar to what I remember user Archae86 has shown in the past with another app.
I wonder if there's going to be new batches of work now and then... so that the first tasks of a new O2MD1 batch would start again from low freq and then the freq band would shift up as more tasks were sent out.
O2MD1 v1.01 cpu tasks seem to run about 20 % faster than O2AS v1.01 cpu tasks.
I'm pulling that statement back. Can't make that kind of comparison. Most of the tasks that I've been watching so far have completed in about 9-12h for example. But I noticed now there are some black sheeps that manage to complete the race in under 2 hours on that same host X. Perhaps the frequency bands of these tasks do have a hefty connection with the run times, again.
edit: relative run time examples from host X
57 Hz ... 33k
41 Hz ... 26k
21 Hz ... 7k
Richie, can you post a link to your results so I can see them? Thanks
Richie, can you post a link to your results so I can see them? Thanks
Z
Hi! "X" is this host: https://einsteinathome.org/host/12761897
It's been crunching 3x O2MD1 v1.01 cpu + 1x O2AS v1.09 gpu tasks concurrently. Those run times that I mentioned can be found now at the 'pending'. Ps. Two tasks got 'error while computing' yesterday. It happened after I decided to reboot the host because there were some Windows updates pending for a reboot. After I had started Boinc again and allowed crunching... that instant load of all 4 tasks at once was somekind of a shock for the system. Screen was full of artifacts, really a complete mess for a couple of seconds. Then the first task errored out. Then the second one... until I managed to hit stop. I've seen that somewhere in the past, but maybe with another host. I knew that host wouldn't recover from that and would've just crashed all the other tasks too. But then I made another reboot to make sure the condition is reset. I decided to start only 2 tasks at first and rest of the tasks a few seconds later... to decrease the system stress at the moment when crunching starts. It worked well and all is good again. I haven't seen that earlier with tasks from another apps or another kind of mixed work that this host was doing lately... and I didn't try to reproduce that problem now. In that way it surprised me a bit. Do these tasks stress the system in a different way, I don't know.
edit: To be clear, it was AS20 gpu tasks that were crashing, not these O2MD1 tasks. I suspect that problem had just something to do with AMD gpu driver or system and didn't have anything particular to do with these applications.
I've got total 171 O2MD1 tasks pending now. 6 hosts have been running same sort of mixed set (O2MD1 cpu tasks + 1x AS20 gpu) and two hosts (1 windows, 1 linux) have been running O2MD1 tasks only.
I wish I had kept the entire scheduler connect log.
That shouldn't be necessary as the last contact is always available and you would expect to see some sort of continuing problem with the O2MD1 app or plan class being mentioned each time.
I've just looked through all the most recent scheduler logs for all your hosts and can't find any of them referring in any way, good, bad or indifferent, to O2MD1. Which particular host ID is the one you wish to use for the new search? If you let me know which one it is, I can look at the log without bothering you to post the whole deal, or even a link to it.
Gary, I still have not seen any log resembling what I first saw, but I do a similar log from Zalster currently. He is not getting any O2MD1 cpu tasks either.
Has anyone tried those 1.09
)
Has anyone tried those 1.09 GPU apps yet? They seem to take a while to complete. I was parallel-running those and it took 9hrs per wu to complete with RTX2070.
I am seeing about 9 hours per
)
I am seeing about 9 hours per WU (Gravitational Wave search O2 Multi-Directional v1.01 () x86_64-pc-linux-gnu) on a Linux PC (AuthenticAMD AMD Ryzen 7 1700 Eight-Core Process () x86_64-pc-linux-gnuor [Family 23 Model 1 Stepping 1] (16 processors) .
What is strange to me is that I have 4 pending waiting for validation. They have been in this state for about 3 days. No credits granted.
Eskomorko wrote:Has anyone
)
https://einsteinathome.org/content/discussion-thread-continuous-gw-search-known-o2md1?page=2#comment-173647
robl wrote:What is strange to
)
Nothing strange at all if you read Bernd's announcement in Tech News: https://einsteinathome.org/content/gravitational-wave-search-o2-multi-directional-o2md1
Holmis wrote:robl wrote:What
)
I'm sure now that O2MD1 v1.01
)
I'm sure now that O2MD1 v1.01 run times depend very much on the frequency that is included at the end part of a task name (xx.xxHz). The range has been at least from 21.xx to 77.xx Hz. Lower freq tasks run faster. If 21.xx Hz represents basic level of 100 % run time then 77.xx Hz tasks have run about 500 % as long. So there's a clear freq-runtime-curve, perhaps similar to what I remember user Archae86 has shown in the past with another app.
I wonder if there's going to be new batches of work now and then... so that the first tasks of a new O2MD1 batch would start again from low freq and then the freq band would shift up as more tasks were sent out.
Haven't seen any O2MD1 yet,
)
Haven't seen any O2MD1 yet, all I keep getting are the O2AS20-500.
Richie wrote:Richie
)
Richie, can you post a link to your results so I can see them? Thanks
Z
Zalster wrote:Richie
)
Hi! "X" is this host: https://einsteinathome.org/host/12761897
It's been crunching 3x O2MD1 v1.01 cpu + 1x O2AS v1.09 gpu tasks concurrently. Those run times that I mentioned can be found now at the 'pending'. Ps. Two tasks got 'error while computing' yesterday. It happened after I decided to reboot the host because there were some Windows updates pending for a reboot. After I had started Boinc again and allowed crunching... that instant load of all 4 tasks at once was somekind of a shock for the system. Screen was full of artifacts, really a complete mess for a couple of seconds. Then the first task errored out. Then the second one... until I managed to hit stop. I've seen that somewhere in the past, but maybe with another host. I knew that host wouldn't recover from that and would've just crashed all the other tasks too. But then I made another reboot to make sure the condition is reset. I decided to start only 2 tasks at first and rest of the tasks a few seconds later... to decrease the system stress at the moment when crunching starts. It worked well and all is good again. I haven't seen that earlier with tasks from another apps or another kind of mixed work that this host was doing lately... and I didn't try to reproduce that problem now. In that way it surprised me a bit. Do these tasks stress the system in a different way, I don't know.
edit: To be clear, it was AS20 gpu tasks that were crashing, not these O2MD1 tasks. I suspect that problem had just something to do with AMD gpu driver or system and didn't have anything particular to do with these applications.
I've got total 171 O2MD1 tasks pending now. 6 hosts have been running same sort of mixed set (O2MD1 cpu tasks + 1x AS20 gpu) and two hosts (1 windows, 1 linux) have been running O2MD1 tasks only.
Gary Roberts wrote:Keith
)
Gary, I still have not seen any log resembling what I first saw, but I do a similar log from Zalster currently. He is not getting any O2MD1 cpu tasks either.
https://einsteinathome.org/host/12789230/log
My log had even more stuff in it about locality scheduling and other scheduling restrictions and what apps were valid etc.