BRP6-opencl-ati with linux mesa opencl

Paul
Joined: 3 May 07
Posts: 121
Credit: 1654671150
RAC: 22777

Hey all,

So, somewhere around June 2016, BOINC GPU crunching on my system broke. It had been working from the time of my previous post on this list up until then. The breakage coincided with Mesa/LLVM changes and the move to Fedora 24.

It's almost a year later and BOINC/E@H + ATi/Radeon + MESA/OpenCL is still broken.  Has anyone been able to restore GPU crunching?  I even have a Radeon X, now.  I remember reading something about kernel 4.10 having important AMDGPU updates, but that might not be all that is necessary.  I'm getting Computation Error (SEGV) immediately.

26-Apr-2017 22:23:48 [---] OpenCL: AMD/ATI GPU 0: AMD POLARIS10 (DRM 3.9.0 / 4.10.10-200.fc25.x86_64, LLVM 3.9.1) (driver version 13.0.4, device version OpenCL 1.1 Mesa 13.0.4, 8190MB, 8190MB available, 3709 GFLOPS peak)
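When comparing setups across posts, it can help to pull the version fields out of the BOINC startup line above. The following is a minimal sketch, assuming the line always has the shape quoted here; the regex and the helper name `parse_opencl_line` are hypothetical and fitted only to this one line, not to any documented BOINC log format.

```python
import re

# Hypothetical pattern, fitted to the single log line quoted in this thread.
LOG_PATTERN = re.compile(
    r"OpenCL: (?P<vendor>\S+) GPU (?P<index>\d+): (?P<name>.+?) "
    r"\(DRM (?P<drm>[\d.]+) / (?P<kernel>\S+), LLVM (?P<llvm>[\d.]+)\) "
    r"\(driver version (?P<driver>[\d.]+), "
    r"device version (?P<device>[^,]+), "
)


def parse_opencl_line(line):
    """Return the version fields as a dict, or None if the line does not match."""
    match = LOG_PATTERN.search(line)
    return match.groupdict() if match else None


line = (
    "26-Apr-2017 22:23:48 [---] OpenCL: AMD/ATI GPU 0: AMD POLARIS10 "
    "(DRM 3.9.0 / 4.10.10-200.fc25.x86_64, LLVM 3.9.1) "
    "(driver version 13.0.4, device version OpenCL 1.1 Mesa 13.0.4, "
    "8190MB, 8190MB available, 3709 GFLOPS peak)"
)

info = parse_opencl_line(line)
for key in ("name", "kernel", "drm", "llvm", "device"):
    print(f"{key}: {info[key]}")
```

Posting these fields together (GPU name, kernel, DRM, LLVM, Mesa) makes it easier for others to tell whether they are on the same stack.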

AgentB
Joined: 17 Mar 12
Posts: 915
Credit: 513211304
RAC: 0

I don't use Mint, but their forums seem to suggest the AMDGPU-PRO driver is workable with some small effort.

Paul

Yeah, okay, thanks.  But that's not this thread.  I know people have talked about that here, but that's not what I'm asking.

Since I switched to radeon-si 1.5 years ago, my system has been a lot more stable.  Plus, the proprietary drivers are sort of a pain.  I used them for probably 14 or 15 years though, so, I realize I could go back to doing that.  But, after having everything working correctly on the OSS driver, it's even less enticing, now.

*** Error in `../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.12_x86_64-pc-linux-gnu__FGRPopencl-Beta-ati': free(): invalid pointer: 0x00007f2a7f58f9e0 ***
======= Backtrace: =========
/lib64/libc.so.6(+0x7570b)[0x7f2a82b4170b]
/lib64/libc.so.6(+0x7deaa)[0x7f2a82b49eaa]
/lib64/libc.so.6(cfree+0x4c)[0x7f2a82b4d40c]
../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.12_x86_64-pc-linux-gnu__FGRPopencl-Beta-ati(_ZNSt13runtime_errorD2Ev+0x58)[0x6a64a8]
/lib64/libMesaOpenCL.so.1(+0x1fda2)[0x7f2a82743da2]
../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.12_x86_64-pc-linux-gnu__FGRPopencl-Beta-ati[0x69883f]
/lib64/libMesaOpenCL.so.1(+0x9a2b5)[0x7f2a827be2b5]
/lib64/libMesaOpenCL.so.1(+0x4b0b3)[0x7f2a8276f0b3]
/lib64/libMesaOpenCL.so.1(+0x4b0e3)[0x7f2a8276f0e3]
/lib64/ld-linux-x86-64.so.2(+0xfdaa)[0x7f2a837fbdaa]
/lib64/ld-linux-x86-64.so.2(+0xfebb)[0x7f2a837fbebb]
/lib64/ld-linux-x86-64.so.2(+0x14ab1)[0x7f2a83800ab1]
/lib64/ld-linux-x86-64.so.2(+0xfc54)[0x7f2a837fbc54]
/lib64/ld-linux-x86-64.so.2(+0x14009)[0x7f2a83800009]
/lib64/libdl.so.2(+0xf09)[0x7f2a831a4f09]
/lib64/ld-linux-x86-64.so.2(+0xfc54)[0x7f2a837fbc54]
/lib64/libdl.so.2(+0x1591)[0x7f2a831a5591]
/lib64/libdl.so.2(dlopen+0x32)[0x7f2a831a4fa2]
/lib64/libOpenCL.so.1(+0x4d94)[0x7f2a835d0d94]
/lib64/libOpenCL.so.1(+0x5072)[0x7f2a835d1072]
/lib64/libOpenCL.so.1(+0x6627)[0x7f2a835d2627]
/lib64/libOpenCL.so.1(clGetPlatformIDs+0x144)[0x7f2a835d2b74]
../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.12_x86_64-pc-linux-gnu__FGRPopencl-Beta-ati(_Z24boinc_get_opencl_ids_auxPciiPP13_cl_device_idPP15_cl_platform_id+0x74)[0x5b9e54]
../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.12_x86_64-pc-linux-gnu__FGRPopencl-Beta-ati(_Z20boinc_get_opencl_idsPP13_cl_device_idPP15_cl_platform_id+0xe6)[0x5ba37a]
../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.12_x86_64-pc-linux-gnu__FGRPopencl-Beta-ati(eah_boinc_get_opencl_ids+0x26)[0x48b7c6]
../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.12_x86_64-pc-linux-gnu__FGRPopencl-Beta-ati(gen_fft_get_ctx+0x3a)[0x48d6ea]
../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.12_x86_64-pc-linux-gnu__FGRPopencl-Beta-ati(MAIN+0x15c)[0x47936c]
../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.12_x86_64-pc-linux-gnu__FGRPopencl-Beta-ati(main+0x61b)[0x46bf6b]
/lib64/libc.so.6(__libc_start_main+0xf1)[0x7f2a82aec731]
../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.12_x86_64-pc-linux-gnu__FGRPopencl-Beta-ati[0x46e3e9]

The backtrace is very similar to what it was 5 months ago, but now I'm using a completely different card and driver.  I think this means that the problem must be Mesa.  Agreed?  Does anyone know if this is being tracked somewhere or when Mesa will get a patch for this?
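To compare two backtraces like this more easily, one option is to collapse the frames into runs per library. The sketch below is only an illustration under stated assumptions: the frame regex and the helper name `module_runs` are hypothetical, fitted to the glibc-style `module(symbol+offset)[address]` lines quoted above, and `SAMPLE` is a subset of those frames, in their original order.

```python
import re
from itertools import groupby

# Hypothetical pattern for lines like: /lib64/libc.so.6(cfree+0x4c)[0x7f2a82b4d40c]
FRAME = re.compile(
    r"^(?P<module>[^(\[\s]+)(?:\((?P<symbol>[^)]*)\))?\[(?P<addr>0x[0-9a-f]+)\]$"
)

# A subset of the frames quoted above, in their original order.
SAMPLE = """\
/lib64/libc.so.6(cfree+0x4c)[0x7f2a82b4d40c]
/lib64/libMesaOpenCL.so.1(+0x1fda2)[0x7f2a82743da2]
/lib64/libMesaOpenCL.so.1(+0x9a2b5)[0x7f2a827be2b5]
/lib64/libdl.so.2(dlopen+0x32)[0x7f2a831a4fa2]
/lib64/libOpenCL.so.1(clGetPlatformIDs+0x144)[0x7f2a835d2b74]
"""


def module_runs(backtrace_text):
    """Collapse consecutive frames from the same module into (module, count) pairs."""
    modules = []
    for raw in backtrace_text.splitlines():
        match = FRAME.match(raw.strip())
        if match:
            # Keep only the file name so long project paths stay readable.
            modules.append(match.group("module").rsplit("/", 1)[-1])
    return [(name, len(list(group))) for name, group in groupby(modules)]


for module, count in module_runs(SAMPLE):
    print(f"{count:2d} frame(s) in {module}")
```

Run over the full trace, this shows at a glance that the crash happens while `clGetPlatformIDs` is loading `libMesaOpenCL.so.1`, before any science kernel runs.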

It looks like Mesa 17 just hit update repos this week, so I'm going to update. 

Paul

Okay, so now I have Mesa 17 and still get jobs failing with computation failure.  I'm thinking the problem lies in LLVM?  Does anyone agree?

LLVM 3.9.1

My AMDGPU version is xorg-x11-drv-AMDGPU 1.3, but I don't think I can do better than that right now.

Also, this error is for FGRP, not BRP6. Should I open a new thread about that, or is this good enough? I'm assuming this is a similar enough problem to the old BRP6 one that it's okay to keep it here.

mmonnin
Joined: 29 May 16
Posts: 291
Credit: 3232287015
RAC: 100812

Probably would be a good idea to have your own thread. I saw a new post here and immediately thought that BRP6G has been dead for a while.

Anyway, I set up an RX 580 in Ubuntu 16.04 with AMDGPU-Pro drivers for the Pentathlon:

https://einsteinathome.org/host/12530808/tasks

elhoir
Joined: 28 Sep 11
Posts: 6
Credit: 8423997
RAC: 0

Hello all,

I have a similar problem. Maybe it's related to this discussion, so let me explain here.

I'm using Ubuntu 16.04 64-bit with Mesa 17.0.7 and BOINC 7.6.33. Thankfully, I have been able to set up BOINC so that it detects my GPU as OpenCL capable using the open source Mesa drivers (my GPU is an AMD A6-5200 w/ HD 8400).

The thing is, *every* task ends with "compute error", and I'm very pissed off...

What can I do?

Thanks in advance!

Paul

Hi elhoir,

Thanks for posting.  That's my problem too!  I've been fighting it for over a year. :-(

This thread is for the BRP6 app, which is old. You are probably not running that app, so your errors are not a result of problems with it. As you can see, they told me to open a new thread for my problem. You can find out exactly which app caused the failure by checking your work unit reports online. There are several apps in the E@H set. You should do this anyway, since you'll surely be asked more about your error, and the error is most easily found in the work unit report online. Go to your E@H account and look for the "tasks" link.

Then, post to my thread for FGRPopencl1K-ati, as I suspect that is one of your apps that is failing, just like for me.

https://einsteinathome.org/content/fgrpopencl1k-ati-polaris10-amdgpu-llvm391-mesa-17-crash

elhoir

Hello PAUL,

I did it, thank you so much!
