If you have an Albert crashing on 0xC0000005 ... survey

Jord
Joined: 26 Jan 05
Posts: 2952
Credit: 5893653
RAC: 3
Topic 190664

Please post in this thread.

Please leave behind the following data:

  • On which unit did it happen? (See in your Your Account, Results.)
  • Which operating system are you using?
  • What make & model of CPU do you have? Is it overclocked?
  • How much memory does your PC have?
  • Did you get the crash when you had graphics/screensaver on?
  • What make & model videocard do you use?
  • Is it updated to use the latest DirectX and drivers?
  • Do you have Microsoft .NET installed on your computer?
  • Are you using the latest BIOS/drivers for your motherboard?
  • Which BOINC version are you using? (if need be: And why?)
  • How is BOINC installed? As a single/shared user install or as a service?
  • Are you running multiple projects? If yes, list them and tell of the switch between projects time and if you leave the applications in memory when preempted.
  • When did the crash occur? Did you reboot, was it when changing projects, something else?
  • What do you think crashed the Albert unit?

For me it was:
- this is one of the 9 I had.
- Windows 2000 SP4.
- Intel Celeron 2.3GHz (stock speed).
- 512MB PC-133 RAM.
- I don't use the screensaver or the graphics.
- Ati 9600 Pro 128MB.
- Using the last pre-dotNET drivers 5.10 with DirectX 9.0c ..
- No.
- Yes to both.
- 5.3.6. Alpha (I am a BOINC Alpha tester)
- Service by standard. Now running it manually.
- Yes: Einstein, Seti, Seti Beta, LHC, Pirates, Primegrid.
- As far as I can find, it occured after I rebooted. BOINC was running as a service at the time.
- I think I had some kind of memory problem. In my case the result was trying to read from the top of the memory stack. Weird after a reboot, but not unthinkable.

----------------------------------------
I am posting this survey on my own. I hope to get to the bottom of it by running debugger programs with the help of Walt, Bruce and Bernd, but more information from you out there would help.

If a moderator could please sticky this post?

If the developers (or I or other helpers) will have additional questions we'll post them. Please keep this thread clear from any other errors.

tomyval
tomyval
Joined: 27 Jan 06
Posts: 1
Credit: 87
RAC: 0

If you have an Albert crashing on 0xC0000005 ... survey

This is the section prervios to the crash and right after.
2/1/2006 8:02:59 AM|Einstein@Home|Resuming result r1_1159.5__1319_S4R2a_1 using albert version 437
2/1/2006 8:02:59 AM|SETI@home|Starting result 25ja04ab.24934.25200.422146.1.135_1 using setiathome version 418
2/1/2006 8:20:08 AM|Einstein@Home|Unrecoverable error for result r1_1159.5__1319_S4R2a_1 ( - exit code -1073741819 (0xc0000005))
2/1/2006 8:20:08 AM||request_reschedule_cpus: process exited
2/1/2006 8:20:08 AM|Einstein@Home|Computation for result r1_1159.5__1319_S4R2a_1 finished
OS: Win XP Srv pk 2
AMD 2400 MP x 2
2 Gb of ecc memeory
I had the Bionc screensaver on. I've been running strickly BIONC on this system at night I kill all other no-esential srvs and programs. Itry to set it to a blank screen but I forgot last night.
Graphics: Matrox Parhelia 128, latest GPU bios and driver set.
DX: lastest release
.Net :Installed latest version as of 2/1/06
Bios: up to date.
The curreent release of BIONC being 5.2.13
BIONC is installed as a servive
I am running Seti and ClimatePredictions as well as the Pulsar study. The switch, happens quite smoothly as I can see. I have no data to suggest a problem at the moment within the swith of programs. I don't know if the code has a hard time with multithreaded environments, I would doubt it but just a thought. (as in multiple CPU's or virtual CPU's) I original took it as 1 badly coded module. Now, I've sen that many of the mod's sent out are crashing not only on my machine but many others as well. I woould think that the main codeing for this type implementation needs to be readdressed but that is and opinion of someone who has very little data to base any statements on.

Ian
Ian
Joined: 17 Jan 06
Posts: 5
Credit: 183977
RAC: 0

I shall answer these

I shall answer these questions best I can...

It happened on at least two results
I am using Windows XP with SP2
Celeron processor 2.6ghz Overclocked: Dont know
256 MB RAM
Not sure; the most recent one appeared to have done so: result 17292647
Not sure how to find out my videocard type
Should have the latest directX
Microsoft .net?? Never heard of it
Not sure if i have latest BIOS setup
Version 5.2.13
Single user setup
Not using multiple projects
The most recent one appeared to be when it went to the screensaver
What do I think caused the crash? I have no experience in computer programming so have no idea!

edit:graphic driver appears to be Intel(R)82845G/GL/GE/PE/GV graphics controller

Fer
Fer
Joined: 30 Mar 05
Posts: 4
Credit: 6206
RAC: 0

I post also in this thread my

I post also in this thread my stats, if they are helpful. I have opened a new thread for trying to fix ***UNHANDLED EXCEPTION***
http://einsteinathome.org/node/190775

[*] On which unit did it happen? (See in your Your Account, Results.)
4439946
[*] Which operating system are you using?
Windows 2000 SP4
[*] What make & model of CPU do you have? Is it overclocked?
Intel Pentium 4 2.8 GHz
[*] How much memory does your PC have?
1 GB
[*] Did you get the crash when you had graphics/screensaver on?
I suppose so, but not sure
[*] What make & model videocard do you use?
Intel 82865G (built-in videocard)
[*] Is it updated to use the latest DirectX and drivers?
Yes
[*] Do you have Microsoft .NET installed on your computer?
No
[*] Are you using the latest BIOS/drivers for your motherboard?
Yes
[*] Which BOINC version are you using? (if need be: And why?)
5.2.13
[*] How is BOINC installed? As a single/shared user install or as a service?
single user
[*] Are you running multiple projects? If yes, list them and tell of the switch between projects time and if you leave the applications in memory when preempted.
Einstein (300)
Predictor (100)
SETI (100)
[*] When did the crash occur? Did you reboot, was it when changing projects, something else?
I haven't got a work unit finished for almost 2 months, more than 10 work units went to the trash. So the answer to this question is quite complex: reboot or changing projects? I don't think so.
[*] What do you think crashed the Albert unit?
Obviously a not valid access to memory

Jord
Joined: 26 Jan 05
Posts: 2952
Credit: 5893653
RAC: 3

RE: 0xC0000005 errors, or

Quote:
0xC0000005 errors, or Access Violation, means that the program tried to access memory that either didn't belong to it, wasn't allocated, or was a write operation to write-protected memory. Could be anything that caused it, often its running past the end of a table or using a pointer that doesn't point to valid data.

That's what Walt wrote me in email. So we do now know what it means, but there's no solution to the problem yet.

Wilfred Nijman
Wilfred Nijman
Joined: 3 Jun 05
Posts: 4
Credit: 655654
RAC: 0

[*] On which unit did it

[*] On which unit did it happen? ALL of them.
[*] Which operating system are you using? XP
[*] What make & model of CPU do you have? Is it overclocked? Intel P4 3.0 Ghz, not overclocked
[*] How much memory does your PC have? 1 GB
[*] Did you get the crash when you had graphics/screensaver on? some yes some no
[*] What make & model videocard do you use? Intel 82945G
[*] Is it updated to use the latest DirectX and drivers? yep
[*] Do you have Microsoft .NET installed on your computer? yep
[*] Are you using the latest BIOS/drivers for your motherboard? yep
[*] Which BOINC version are you using? (if need be: And why?) 5.2.13
[*] How is BOINC installed? As a single/shared user install or as a service? shared user
[*] Are you running multiple projects? If yes, list them and tell of the switch between projects time and if you leave the applications in memory when preempted. SETI, all prefs left at default
[*] When did the crash occur? Did you reboot, was it when changing projects, something else? on average, 30 mins after computation start
[*] What do you think crashed the Albert unit? dunno, but somehow I get the feeling it happens when computation is (nearly) finished

Wilfred Nijman
Wilfred Nijman
Joined: 3 Jun 05
Posts: 4
Credit: 655654
RAC: 0

Here are the messages from

Message 24469 in response to message 24468

Here are the messages from the log, hope it helps. Stopped Einstein in the meanwhile, only running SETI now.

08/03/2006 15:19:41|Einstein@Home|Starting result r1_0899.0__145_S4R2a_3 using albert version 437
08/03/2006 16:33:25|Einstein@Home|Unrecoverable error for result r1_0899.0__145_S4R2a_3 ( - exit code -1073741819 (0xc0000005))

08/03/2006 17:34:37|Einstein@Home|Starting result r1_0899.0__225_S4R2a_3 using albert version 437
08/03/2006 18:04:09|Einstein@Home|Unrecoverable error for result r1_0899.0__225_S4R2a_3 ( - exit code -1073741819 (0xc0000005))

08/03/2006 18:05:18|Einstein@Home|Starting result r1_0899.0__223_S4R2a_4 using albert version 437
08/03/2006 18:34:48|Einstein@Home|Unrecoverable error for result r1_0899.0__223_S4R2a_4 ( - exit code -1073741819 (0xc0000005))

08/03/2006 19:36:01|Einstein@Home|Starting result r1_0899.0__222_S4R2a_3 using albert version 437
08/03/2006 20:05:57|Einstein@Home|Unrecoverable error for result r1_0899.0__222_S4R2a_3 ( - exit code -1073741819 (0xc0000005))

08/03/2006 20:07:16|Einstein@Home|Starting result z1_1328.5__2297_S4R2a_2 using albert version 437
08/03/2006 20:36:37|Einstein@Home|Unrecoverable error for result z1_1328.5__2297_S4R2a_2 ( - exit code -1073741819 (0xc0000005))

08/03/2006 21:37:45|Einstein@Home|Starting result z1_1328.5__2296_S4R2a_1 using albert version 437
08/03/2006 22:02:52|Einstein@Home|Unrecoverable error for result z1_1328.5__2296_S4R2a_1 ( - exit code -1073741819 (0xc0000005))

08/03/2006 22:04:01|Einstein@Home|Starting result z1_1328.5__2295_S4R2a_1 using albert version 437
08/03/2006 22:33:45|Einstein@Home|Unrecoverable error for result z1_1328.5__2295_S4R2a_1 ( - exit code -1073741819 (0xc0000005))

08/03/2006 23:34:53|Einstein@Home|Starting result z1_1328.5__2294_S4R2a_0 using albert version 437
09/03/2006 00:04:51|Einstein@Home|Unrecoverable error for result z1_1328.5__2294_S4R2a_0 ( - exit code -1073741819 (0xc0000005))

09/03/2006 00:06:09|Einstein@Home|Starting result z1_1328.5__2293_S4R2a_0 using albert version 437
09/03/2006 00:35:35|Einstein@Home|Unrecoverable error for result z1_1328.5__2293_S4R2a_0 ( - exit code -1073741819 (0xc0000005))

09/03/2006 01:36:43|Einstein@Home|Starting result z1_1328.5__2292_S4R2a_0 using albert version 437
09/03/2006 02:06:47|Einstein@Home|Unrecoverable error for result z1_1328.5__2292_S4R2a_0 ( - exit code -1073741819 (0xc0000005))

09/03/2006 02:08:02|Einstein@Home|Starting result z1_1328.5__2291_S4R2a_0 using albert version 437
09/03/2006 02:37:43|Einstein@Home|Unrecoverable error for result z1_1328.5__2291_S4R2a_0 ( - exit code -1073741819 (0xc0000005))

09/03/2006 03:39:00|Einstein@Home|Starting result z1_1328.5__2290_S4R2a_0 using albert version 437
09/03/2006 04:08:19|Einstein@Home|Unrecoverable error for result z1_1328.5__2290_S4R2a_0 ( - exit code -1073741819 (0xc0000005))

09/03/2006 04:09:38|Einstein@Home|Starting result z1_1328.5__2289_S4R2a_0 using albert version 437
09/03/2006 04:39:27|Einstein@Home|Unrecoverable error for result z1_1328.5__2289_S4R2a_0 ( - exit code -1073741819 (0xc0000005))

09/03/2006 05:40:47|Einstein@Home|Starting result z1_1328.5__2288_S4R2a_0 using albert version 437
09/03/2006 06:10:18|Einstein@Home|Unrecoverable error for result z1_1328.5__2288_S4R2a_0 ( - exit code -1073741819 (0xc0000005))

09/03/2006 06:11:33|Einstein@Home|Starting result z1_1328.5__2287_S4R2a_0 using albert version 437
09/03/2006 06:41:16|Einstein@Home|Unrecoverable error for result z1_1328.5__2287_S4R2a_0 ( - exit code -1073741819 (0xc0000005))

09/03/2006 07:42:30|Einstein@Home|Starting result z1_1328.5__2286_S4R2a_0 using albert version 437
09/03/2006 08:12:30|Einstein@Home|Unrecoverable error for result z1_1328.5__2286_S4R2a_0 ( - exit code -1073741819 (0xc0000005))

09/03/2006 08:13:45|Einstein@Home|Starting result z1_1328.5__2285_S4R2a_0 using albert version 437
09/03/2006 08:42:41|Einstein@Home|Unrecoverable error for result z1_1328.5__2285_S4R2a_0 ( - exit code -1073741819 (0xc0000005))

Michael Roycraft
Michael Roycraft
Joined: 10 Mar 05
Posts: 846
Credit: 157718
RAC: 0

Wilfred, I have a nagging

Wilfred,

I have a nagging hunch that your computer has a memory problem. In most of the errored WUs, it stopped at between 28.5 and 29.5 minutes into computation, probably at the same instruction.The time variance is close enough to be the result of other varying computer activity.

If you'd care to check this, you can ..

1) Download and run memtest86. In order to give it a good test, I suggest you run memtest while running another intensive program, such as Seti, at the same time.

or, ...

2) To prevent electrostatic damage to delicate parts - Be very careful to ground yourself to the computer case while touching anything inside the box. Turn your computer off, unplug it, open the case, and swap your two sticks of RAM, one into the other's slot, and vice versa. Then Button everything back up, restart, and run Einstein again.

Michael

microcraft
"The arc of history is long, but it bends toward justice" - MLK

Wilfred Nijman
Wilfred Nijman
Joined: 3 Jun 05
Posts: 4
Credit: 655654
RAC: 0

Michael, I see your point

Message 24471 in response to message 24470

Michael,

I see your point but I doubt it's a hardware problem... the 0xc0000005's first started coming on my old office box about 2 months ago, but now I'm running a brand new one ;-) and having mem probs on both seems unlikely (but possible); other prob is that these days our office boxes have no floppy drive, and for security reasons we can't boot them from USB devices either - so no memtest.

I'll try swapping the RAM one of these days, will keep you posted.

Wilfred

Wilfred Nijman
Wilfred Nijman
Joined: 3 Jun 05
Posts: 4
Credit: 655654
RAC: 0

I found a memory test that

Message 24472 in response to message 24471

I found a memory test that runs under XP (HCI MemTest 3.3), and let it run overnight.

Result: 3128% coverage, 0 errors.

M. Schmitt
M. Schmitt
Joined: 27 Jun 05
Posts: 478
Credit: 15872262
RAC: 0

I got this error too and

I got this error too and already postet about it here.

Ok, other details:
Workunit name: z1_1366.5__2054_S4R2a_0
Workunit: 7014283
OS: Win XP pro SP2
CPU: Athlon 64 4400+ @2.7GHz 50° C, stability tested with Prime95 stresstest. Memory is not overclocked. Case temperature 27° C.
Memory: 2 GB DDR (2 · 1GB OCZ platinum).
I never use the screensaver, boinc is running as a service.
Videocard: Asus 7800GT (not important)
DX: updated (not important)
.NET: Not jet, but:
Visual C++ 2005 Express Edition
Microsoft SQL Server 2005
Microsoft Platform SDK for Windows XP SP2
MB BIOS: Not changed (bought Jan 06)
BOINC v.: 5.2.13 BoincStudio 0.4b running as a service.
Projects: Just einstein@home
When did the crash occur?: The host was not used at that time.
What do you think crashed the Albert unit?: There might be a wrong address calculation, that refers to a forbidden part of the memory:

***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x0040AABD read attempt to address 0xFF7455C8

1: 04/20/06 14:32:20
1: e:\\einsteinathome\\cfs\\windows_build\\albert4.37\\cfslaldemod.c(941) +7 bytes (TestLALDemod)

Isn't that memory part close to the top of 4GB? There may be IO-areas for PCI-devices.

cu,
Michael

[edit] The app is akos's S40.04.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.