Please see tasks:
http://einsteinathome.org/task/306409726
http://einsteinathome.org/task/306410082
http://einsteinathome.org/task/306410025
Very little information in the results but I'm sure it's something silly on my part.
I'm building a department server based on Scientific Linux 6.3. Most of the components have been running for a month or so while I was travelling but the temps were higher than I liked and wanted more memory and disks.
So I got a full tower with lots of fans, moved everything over there added an SSD and velociraptor, upgraded ram to 32GB and reinstalled the OS.
I have to build boinc from sources for SL6 but it seems to be working for CPU jobs but not GPU jobs. It is the same GTX 550ti from the previous enclosure.
I had a similar problem with a different system that fixed itself. That one seemed to be a missing dynamic library but this one gives no details.
Any ideas?
Copyright © 2024 Einstein@Home. All rights reserved.
What is exit status 127?
)
Well to see if there is a problem with the GPU I downloaded the CUDA SDK and built the samples.
The deviceQuery program works and the bandwidth test works and is pretty interesting
Host to Device Bandwidth, 1 Device(s), Paged memory
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 6068.5
Device to Host Bandwidth, 1 Device(s), Paged memory
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 5783.6
Device to Device Bandwidth, 1 Device(s)
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 74495.1
[bandwidthTest] test results...
PASSED
I seem to have errored out of my limit of GPU tasks for the day so, I'll get some sleep and see if the things I needed to get the SDK to compile and run fixed the BRP problems.
Joe
Well even though I can run
)
Well even though I can run all the samples in the CUDA SDK I'm still getting an error 127 with no further information.
I also found out my other systems have crappy transfer rates which is probably why the 550ti is so close to the 560ti in credits. If I can get this system to work I'll try moving the 560ti over and see what happens.
Anybody have any clues on error 127?
Doing some Google searches,
)
Doing some Google searches, it appears it's a missing library.
You could do ldd on the einstein binaries, just to check if there's anything they need that you're missing.
Thanks Jord! It never
)
Thanks Jord!
It never occurred to me that this wasn't an application specific error.
You're right ldd shows
now to see if I can track down which packages have those libraries. I'll report back when I find them.
Thanks again!
Joe
libstdc++.so.6 => not
)
libstdc++.so.6 => not found
is the culprit. libcufft and libcudart are automatically downloaded from E@H and both .so files should be in E@H directory (in boinc data dir / projects).Look for compat-libstdc++ (32 bit, .i686) packages.
Well it seems the only
)
Well it seems the only library really missing is the 32bit version of libstdc++
Fixed with
yum install libstdc++.i686
The libcu* are in the projects directory and evidently the application is smart enough to load them explicitly:
At least BRP4cuda apps now start and run for a least 10 minutes.
Joe
RE: libstdc++.so.6 => not
)
That's what had me confused. I did have compat-libstdc++-296.i686 installed which is evidently enough to remove the "missing libstdc++" from the output but not good enough to let them run.
I'm learning, slowly.
Joe
that seems to do it. 2 WUs
)
that seems to do it.
2 WUs completed and validated:
http://einsteinathome.org/task/306575855
http://einsteinathome.org/task/306575854
My thanks to Jord and Khangollo for the help!
Joe