BRP4 ATI error while computing: "output file...for task...absent"

count0
Joined: 11 Feb 05
Posts: 6
Credit: 10644960
RAC: 0
Topic 196874

Hi Everyone--I've been getting this error on all of my BRP runs. I've searched the forums and couldn't find a solution. Here's a snapshot from the event log for one of the jobs:

Einstein@Home 27.03.2013 13:08:35 Starting task p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1 using einsteinbinary_BRP4 version 134 (opencl-ati) in slot 13
Einstein@Home 27.03.2013 13:08:41 Computation for task p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1 finished
Einstein@Home 27.03.2013 13:08:41 Output file p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1_0 for task p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1 absent
Einstein@Home 27.03.2013 13:08:41 Output file p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1_1 for task p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1 absent
Einstein@Home 27.03.2013 13:08:41 Output file p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1_2 for task p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1 absent
Einstein@Home 27.03.2013 13:08:41 Output file p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1_3 for task p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1 absent
Einstein@Home 27.03.2013 13:08:41 Output file p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1_4 for task p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1 absent
Einstein@Home 27.03.2013 13:08:41 Output file p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1_5 for task p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1 absent
Einstein@Home 27.03.2013 13:08:41 Output file p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1_6 for task p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1 absent
Einstein@Home 27.03.2013 13:08:41 Output file p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1_7 for task p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1 absent

7.0.58

- exit code -1073740940 (0xc0000374)

Activated exception handling...
[13:08:22][6444][INFO ] Starting data processing...

wu: http://einstein.phys.uwm.edu/result....ltid=359363993
host: http://einsteinathome.org/host/6902352

What can I do to remedy this issue?

Thanks!
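A quick way to summarise a log like the one above is to measure how long the task ran and count the missing output files. The following is only a sketch: the line layout (project, date, time, message) is inferred from the excerpt in this post, and the helper name `summarize` is made up for illustration.

```python
import re
from datetime import datetime

TASK = "p2030.20130102.G204.56+00.86.C.b5s0g0.00000_952_1"

# Lines as posted above; the eight "absent" lines differ only in the file suffix.
LOG_LINES = [
    f"Einstein@Home 27.03.2013 13:08:35 Starting task {TASK} using "
    "einsteinbinary_BRP4 version 134 (opencl-ati) in slot 13",
    f"Einstein@Home 27.03.2013 13:08:41 Computation for task {TASK} finished",
] + [
    f"Einstein@Home 27.03.2013 13:08:41 Output file {TASK}_{i} for task {TASK} absent"
    for i in range(8)
]

# Assumed layout: "<project> <dd.mm.yyyy> <hh:mm:ss> <message>"
LINE_RE = re.compile(
    r"^(?P<project>\S+) (?P<stamp>\d{2}\.\d{2}\.\d{4} \d{2}:\d{2}:\d{2}) (?P<msg>.*)$"
)
ABSENT_RE = re.compile(r"^Output file (\S+) for task \S+ absent$")


def summarize(lines):
    """Return (runtime in seconds or None, list of absent output files)."""
    started = finished = None
    absent = []
    for line in lines:
        m = LINE_RE.match(line)
        if not m:
            continue  # skip anything that does not look like a log line
        when = datetime.strptime(m["stamp"], "%d.%m.%Y %H:%M:%S")
        msg = m["msg"]
        if msg.startswith("Starting task "):
            started = when
        elif msg.startswith("Computation for task "):
            finished = when
        else:
            hit = ABSENT_RE.match(msg)
            if hit:
                absent.append(hit.group(1))
    runtime = (finished - started).total_seconds() if started and finished else None
    return runtime, absent


runtime, absent = summarize(LOG_LINES)
print(f"runtime: {runtime} s, missing output files: {len(absent)}")
```

A task that "finishes" after a few seconds with every output file absent most likely crashed at startup rather than completing, which fits the exit code reported further down.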

mikey
Joined: 22 Jan 05
Posts: 12699
Credit: 1839102411
RAC: 3687

The only difference I see is that this PC is running a newer version of BOINC than the others. Downgrading BOINC MIGHT fix it but WILL wipe out ALL units you currently have on that PC!

count0
Joined: 11 Feb 05
Posts: 6
Credit: 10644960
RAC: 0

I don't think so. I tried several other versions of the BOINC client with the same result.

Does nobody know what the exit/error code means?

count0
Joined: 11 Feb 05
Posts: 6
Credit: 10644960
RAC: 0

OK, I found this in the Windows event log:

- EventData

einsteinbinary_BRP4_1.34_windows_x86_64__opencl-ati.exe
0.0.0.0
50f94203
ntdll.dll
6.1.7601.17725
4ec4aa8e
c0000374
00000000000c40f2
19c
01ce2ae3bec1730f
C:\ProgramData\BOINC\projects\einstein.phys.uwm.edu\einsteinbinary_BRP4_1.34_windows_x86_64__opencl-ati.exe
C:\Windows\SYSTEM32\ntdll.dll
fc7b53ff-96d6-11e2-b7b4-005056c00008

And now? :-)
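The unlabeled values above appear to follow the usual layout of a Windows "Application Error" event (faulting application, faulting module, exception code, fault offset, and so on). A small sketch that attaches names to them may make the record easier to read. The field names are my own labels, not official identifiers, and decoding the time stamp assumes it is the ordinary PE link time in seconds since 1970.

```python
from datetime import datetime, timezone

# Hypothetical labels, in the order the values appear in the post.
FIELD_NAMES = [
    "app_name", "app_version", "app_timestamp",
    "module_name", "module_version", "module_timestamp",
    "exception_code", "fault_offset", "process_id",
    "app_start_time", "app_path", "module_path", "report_id",
]

# Values copied verbatim from the EventData block above.
EVENT_DATA = [
    "einsteinbinary_BRP4_1.34_windows_x86_64__opencl-ati.exe",
    "0.0.0.0",
    "50f94203",
    "ntdll.dll",
    "6.1.7601.17725",
    "4ec4aa8e",
    "c0000374",
    "00000000000c40f2",
    "19c",
    "01ce2ae3bec1730f",
    r"C:\ProgramData\BOINC\projects\einstein.phys.uwm.edu"
    r"\einsteinbinary_BRP4_1.34_windows_x86_64__opencl-ati.exe",
    r"C:\Windows\SYSTEM32\ntdll.dll",
    "fc7b53ff-96d6-11e2-b7b4-005056c00008",
]

fields = dict(zip(FIELD_NAMES, EVENT_DATA))

# Assumption: the time stamp is a Unix time written in hex (PE link time).
app_built = datetime.fromtimestamp(int(fields["app_timestamp"], 16), tz=timezone.utc)

print("faulting module :", fields["module_name"], fields["module_version"])
print("exception code  :", "0x" + fields["exception_code"])
print("fault offset    :", hex(int(fields["fault_offset"], 16)))
print("process id      :", int(fields["process_id"], 16))
print("app link time   :", app_built.isoformat())
```

Read this way, the record says the crash was raised inside ntdll.dll, not in the science application's own code, with the same exception code (c0000374) as the task's exit code.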

Jord
Joined: 26 Jan 05
Posts: 2952
Credit: 5893653
RAC: 62

Quote:
Nobody knows what the exit/error code means?


Not offhand, no. You can compare the value (0xc0000374) against the Windows exception codes, though. In this case it translates to STATUS_HEAP_CORRUPTION, which isn't too helpful. But searching through some threads at Microsoft's help desk, I did find that it happens when an application tries to write to non-existent memory, or when the memory it tries to write to is corrupted.

So the only thing I can give you is that the memory on that 6x00 may be corrupt, broken or otherwise faulty. You can test this by putting another card in that machine, or by moving that card to another machine and seeing what it does there.
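For anyone wondering how the two numbers in "exit code -1073740940 (0xc0000374)" relate: the decimal value is the same 32-bit pattern read as a signed integer. A minimal sketch of the conversion follows; the lookup table is a tiny hand-picked subset of NTSTATUS codes, and `describe_exit_code` is a hypothetical helper name.

```python
# A few well-known NTSTATUS values; not a complete list.
NTSTATUS_NAMES = {
    0xC0000374: "STATUS_HEAP_CORRUPTION",
    0xC0000005: "STATUS_ACCESS_VIOLATION",
    0x80000003: "STATUS_BREAKPOINT",
}


def describe_exit_code(code: int) -> str:
    """Reinterpret a signed 32-bit exit code as an unsigned NTSTATUS value."""
    unsigned = code & 0xFFFFFFFF  # two's-complement view of the same 32 bits
    name = NTSTATUS_NAMES.get(unsigned, "unknown")
    return f"0x{unsigned:08x} ({name})"


print(describe_exit_code(-1073740940))  # 0xc0000374 (STATUS_HEAP_CORRUPTION)
```

Note that this status refers to the heap of the crashing process, so on its own it does not say whether the cause is faulty hardware, a driver problem or a bug in the application.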

Apropos Mikey,

Quote:
Downgrading Boinc MIGHT fix it but WILL wipe out ALL units you currently have on that pc!


Since when does up- or downgrading BOINC delete the data directory or affect any of the project's science applications? It doesn't.

Resetting the project does. Removing the project and adding it again does. But just trying out another BOINC version doesn't. :)

transient
Joined: 3 Jun 05
Posts: 62
Credit: 115835369
RAC: 0

Quote:

Apropos Mikey,
Quote:
Downgrading Boinc MIGHT fix it but WILL wipe out ALL units you currently have on that pc!

Since when does up- or downgrading BOINC delete the data directory or affect any of the project's science applications? It doesn't.

Resetting the project does. Removing the project and adding it again does. But just trying out another BOINC version doesn't. :)

Is it not true if you downgrade from version 7 to version 6? I seem to remember something like that.

Jord
Joined: 26 Jan 05
Posts: 2952
Credit: 5893653
RAC: 62

Quote:
Is it not true if you downgrade from version 7 to version 6? I seem to remember something like that.


Perhaps if you go back from BOINC 6 to BOINC 5, since 5 doesn't use a data directory (although it can, if you instruct it to). But both 6 and 7 use the data directory which stays in place when you update or downgrade BOINC.

Horacio
Joined: 3 Oct 11
Posts: 205
Credit: 80557243
RAC: 0

Quote:
Quote:
Is it not true if you downgrade from version 7 to version 6? I seem to remember something like that.

Perhaps if you go back from BOINC 6 to BOINC 5, since 5 doesn't use a data directory (although it can, if you instruct it to). But both 6 and 7 use the data directory which stays in place when you update or downgrade BOINC.


AFAIK, the client_state.xml file of BOINC 7 is not compatible with BOINC 6 and it was always advised to empty the cache before downgrading from 7 to 6... Has that changed?

Jord
Joined: 26 Jan 05
Posts: 2952
Credit: 5893653
RAC: 62

No, I forgot about REC being in 7 and debt being used in 6. But even then, I suppose that, since this project has "resend lost work" switched on, any work that is lost will be resent.

Still, we're digressing far away from the point made by Mikey, that downgrading BOINC from any 7 to any other 7 will lose the work in cache, which isn't correct.

count0
Joined: 11 Feb 05
Posts: 6
Credit: 10644960
RAC: 0

Does anybody know if there is a debug option for running einsteinbinary_BRP4_1.34_windows_x86_64__opencl-ati.exe, or can I start this program manually to perhaps see some more information about this error?

count0
Joined: 11 Feb 05
Posts: 6
Credit: 10644960
RAC: 0

OK, here is some WinDbg output:

Microsoft (R) Windows Debugger Version 6.2.9200.20512 AMD64
Copyright (c) Microsoft Corporation. All rights reserved.

CommandLine: C:\ProgramData\BOINC\projects\einstein.phys.uwm.edu\einsteinbinary_BRP4_1.34_windows_x86_64__opencl-ati.exe
Symbol search path is: SRV*f:\localsymbols*http://msdl.microsoft.com/download/symbols
Executable search path is:
ModLoad: 00000000`00400000 00000000`00f27000 image00000000`00400000
ModLoad: 00000000`77b20000 00000000`77cc9000 ntdll.dll
ModLoad: 00000000`77a00000 00000000`77b1f000 C:\Windows\system32\kernel32.dll
ModLoad: 000007fe`fe1e0000 000007fe`fe24b000 C:\Windows\system32\KERNELBASE.dll
ModLoad: 000007fe`f8e20000 000007fe`f8e33000 C:\Windows\system32\OpenCL.dll
ModLoad: 000007fe`ffa20000 000007fe`ffafb000 C:\Windows\system32\ADVAPI32.dll
ModLoad: 000007fe`fe440000 000007fe`fe4df000 C:\Windows\system32\msvcrt.dll
ModLoad: 000007fe`ff450000 000007fe`ff46f000 C:\Windows\SYSTEM32\sechost.dll
ModLoad: 000007fe`ff6b0000 000007fe`ff7dd000 C:\Windows\system32\RPCRT4.dll
ModLoad: 00000000`77cf0000 00000000`77cf7000 C:\Windows\system32\PSAPI.DLL
ModLoad: 00000000`77900000 00000000`779fa000 C:\Windows\system32\USER32.dll
ModLoad: 000007fe`ff890000 000007fe`ff8f7000 C:\Windows\system32\GDI32.dll
ModLoad: 000007fe`ff900000 000007fe`ff90e000 C:\Windows\system32\LPK.dll
ModLoad: 000007fe`ff380000 000007fe`ff449000 C:\Windows\system32\USP10.dll
ModLoad: 000007fe`fe350000 000007fe`fe39d000 C:\Windows\system32\WS2_32.dll
ModLoad: 000007fe`ff7e0000 000007fe`ff7e8000 C:\Windows\system32\NSI.dll
(a30.5c4): Break instruction exception - code 80000003 (first chance)
ntdll!LdrpDoDebuggerBreak+0x30:
00000000`77bccb60 cc int 3
0:000> .exr -1
ExceptionAddress: 0000000077bccb60 (ntdll!LdrpDoDebuggerBreak+0x0000000000000030)
ExceptionCode: 80000003 (Break instruction exception)
ExceptionFlags: 00000000
NumberParameters: 1
Parameter[0]: 0000000000000000

Microsoft (R) Windows Debugger Version 6.2.9200.20512 AMD64
Copyright (c) Microsoft Corporation. All rights reserved.

Loading Dump File [C:\localdumps\einsteinbinary_BRP4_1.34_windows_x86_64__opencl-ati.exe.4804.dmp]
User Mini Dump File with Full Memory: Only application data is available

Symbol search path is: SRV*f:\localsymbols*http://msdl.microsoft.com/download/symbols
Executable search path is:
Windows 7 Version 7601 (Service Pack 1) MP (8 procs) Free x64
Product: WinNt, suite: SingleUserTS
Machine Name:
Debug session time: Thu Mar 28 12:51:54.000 2013 (UTC + 1:00)
System Uptime: 0 days 0:52:32.183
Process Uptime: 0 days 0:00:06.000
.....................
Loading unloaded module list
...........
This dump file has an exception of interest stored in it.
The stored exception information can be accessed via .ecxr.
(12c4.1310): Unknown exception - code c0000374 (first/second chance not available)
ntdll!NtWaitForSingleObject+0xa:
00000000`77b7135a c3 ret
0:000> .exr -1
ExceptionAddress: 0000000077be40f2 (ntdll!RtlReportCriticalFailure+0x0000000000000062)
ExceptionCode: c0000374
ExceptionFlags: 00000001
NumberParameters: 1
Parameter[0]: 0000000077c5b450

0:000> .ecxr
rax=0000000077d51483 rbx=00000000c0000374 rcx=000000000020bcc0
rdx=0000000077c5b450 rsi=0000000000000000 rdi=0000000077c5b450
rip=0000000077be40f2 rsp=000000000020c2d0 rbp=0000000000000000
r8=4d7176a8e51d12ad r9=000000001c0b4e5e r10=0000000000000000
r11=0000000000000286 r12=0000000000000009 r13=000007feee5a8e20
r14=0000000000000000 r15=0000000000000001
iopl=0 nv up ei pl nz na pe nc
cs=0033 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00000202
ntdll!RtlReportCriticalFailure+0x62:
00000000`77be40f2 eb00 jmp ntdll!RtlReportCriticalFailure+0x64 (00000000`77be40f4)

Does that help?
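One thing the WinDbg output does allow is a consistency check against the earlier event log entry: the ntdll.dll load address plus the fault offset should give the exception address. A minimal sketch, with all values copied from the posts in this thread and assuming ntdll.dll was loaded at the same base address in both debugger sessions:

```python
# From the "ModLoad: 00000000`77b20000 00000000`77cc9000 ntdll.dll" line.
NTDLL_BASE = 0x77B20000
NTDLL_END = 0x77CC9000

# Fault offset from the event log entry (00000000000c40f2).
FAULT_OFFSET = 0x0C40F2

# ExceptionAddress reported by ".exr -1" on the dump file.
EXCEPTION_ADDRESS = 0x77BE40F2

fault_address = NTDLL_BASE + FAULT_OFFSET

print(f"base + offset     = 0x{fault_address:016x}")
print(f"exception address = 0x{EXCEPTION_ADDRESS:016x}")
print("match:", fault_address == EXCEPTION_ADDRESS)
print("inside ntdll image:", NTDLL_BASE <= fault_address < NTDLL_END)
```

Both sources point at the same spot, ntdll!RtlReportCriticalFailure+0x62. That is where the failure is reported after the corruption has been detected, not necessarily where the corruption happened, so the output confirms the symptom but does not identify the cause.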
