Sick AMD 3000+

W9FZ
W9FZ
Joined: 9 Mar 05
Posts: 20
Credit: 1015550
RAC: 0
Topic 190465

I've got an ASUS A7N8X board holding an AMD Athlon XP 3000+. This is the older basic version of the A7N8X. The purpose for the machine is as an Einstein@Home cruncher. I've been quite pleased with the A7N8X (several versions) as crunching platforms.

Installation of the machine went without problems. It's a sick puppy when it comes to crunching for E@H. It's speed through a work unit is respectable 21,000 seconds. But a high percentage of the results are "client error" and "validate error".

Host details (named Heinlein)

To see the errors, you can look

I've searched and found that "client errors" are supposed to be the chip overheating. Well, I'm not trying to overclock the chip at all. I'm trying to just run it stock. The BIOS did not give me much control. Initially I tried "aggressive" timings, but now I've shifted to "optimal" and it's actually gotten worse. I've removed the fan to inspect and carefully reseated the fan to assure that no lapped edge was catching and holding the fan away. It seems to have good contact. I can look at mobo temperature parameters via ASUSProbe and via the BIOS. Both show VERY reasonable temperatures--CPU like 100 degrees F.

The computer is living in a basement that is at 55 degrees which should be ideal for crunchers. And to repeat, I'm not overclocking at all. I've got the FSB set for 166 which should yield the 333 that this particular chip needs.

So, I'm looking for advice on potential causes and solutions for the frequent client error and validate errors on E@H.

Are some chips just "flaky"?? This was an ebay purchase and not guaranteed. Can some chips work in a flaky way like this?

I have another 3000+ chip coming that I will pop and drop in as a check of the mobo. And if I have to, I can buy another 3000+ to replace this cpu.

Or do these symptoms reek of mobo issues? I've got three other A7N8X boxes crunching quite well with 3000+'s sitting right next to this box. (No, I"ve not swapped the CPU's around--probably will for troubleshooting)

I look forward to your thoughts.

Bruce Richardson
Team Richwood Timber LLC

When the going gets weird, the weird turn PRO. -- Hunter S. Thompson

Stick
Stick
Joined: 24 Feb 05
Posts: 790
Credit: 33135866
RAC: 1084

Sick AMD 3000+

You have several Exit status -1073741819 (0xc0000005) errors. These are the standard Einstein 4.79 "graphics bug" errors. Read this message and thread. It could be that the other errors are related to this, too - but maybe not.

W9FZ
W9FZ
Joined: 9 Mar 05
Posts: 20
Credit: 1015550
RAC: 0

Hi there: I'll keep

Message 22827 in response to message 22826

Hi there:

I'll keep digging on the "graphics bug" thing. My memory is that I'm using a junker AGP 2.0 card in that one.

The box is actually heating my parents house :-) . The next time I visit, I will:
1. Uninstall Boinc 4.whatever and install the new one.
2. Update the graphics drivers.

Not quibbling, but I try to install BOINC as a service on all my machines so I've never seen the graphics.

If any of you have additional thoughts, keep 'em coming. Thanks.

Bruce Richardson
W9FZ

When the going gets weird, the weird turn PRO. -- Hunter S. Thompson

Paul D. Buck
Paul D. Buck
Joined: 17 Jan 05
Posts: 754
Credit: 5385205
RAC: 0

Don't use the Einstein@Home

Don't use the Einstein@Home graphics at all is likely to be the cure ... or to only run Albert work which is supposed to have fixed the known issues. The usual culprit is ATI cards.

As far as the heat, from what little I know of AMD chips that sounds high to me ... OF course, I don't know that much ...

J D K
J D K
Joined: 27 Aug 05
Posts: 86
Credit: 103878
RAC: 0

If you wish to do further

If you wish to do further stability test here are two programs to try;

Prime95 Torture test

Memtest86,

geonerd
geonerd
Joined: 8 Nov 04
Posts: 10
Credit: 370024
RAC: 0

You might also try manualy

You might also try manualy tweaking the memory timings. Set the the various delay values extremely conservatively and see if it helps.

(I've just fought a small battle with an old NF1 mobo and some generic 'Best Buy' (yea, sure....) memory. The RAM is specified at CAS2 at 133/266 (2,2,2,6), and reports these values to the motherboard. Unfortunately, it won't run stably unless I bump the CAS value to 2.5.)

-Greg

Stick
Stick
Joined: 24 Feb 05
Posts: 790
Credit: 33135866
RAC: 1084

RE: Hi there: I'll keep

Message 22831 in response to message 22827

Quote:

Hi there:

I'll keep digging on the "graphics bug" thing. My memory is that I'm using a junker AGP 2.0 card in that one.

The box is actually heating my parents house :-) . The next time I visit, I will:
1. Uninstall Boinc 4.whatever and install the new one.
2. Update the graphics drivers.

Not quibbling, but I try to install BOINC as a service on all my machines so I've never seen the graphics.

If any of you have additional thoughts, keep 'em coming. Thanks.

Bruce Richardson
W9FZ

It appears this host is now successfully processing Albert units. Since Albert has the same "graphics bug" fixes as the Beta application, I think you can safely assume that that the "bug" was your whole problem.

In case you were not "up to speed" on the "graphics bug" issue, there is a little more to it than ATI driver problems. That is, Einstein 4.79 was built to the BOINC library that was in effect when BOINC 4.19 was the standard. The BOINC library was changed by the time BOINC 4.45 was released. My guess is your problem was more related to the library issue than it was to your graphics card.

KWSN-GMC-Peeper of the Castle Anthrax
KWSN-GMC-Peeper...
Joined: 6 Oct 05
Posts: 62
Credit: 54755621
RAC: 0

Ah yes..good old ATI..the

Ah yes..good old ATI..the 'we'll have a new driver to fix these problems SOON, Honest, we really mean it this time' people.
Their motto should be 'we're so fast and cutting edge, we don't even work well'

ROFL. sorry..it's SO hard not to 'moon' ATI. I'll be good now.

W9FZ
W9FZ
Joined: 9 Mar 05
Posts: 20
Credit: 1015550
RAC: 0

Roger on all. Yes, I had

Message 22833 in response to message 22831

Roger on all. Yes, I had noted that it seemed to be humming along on the new WU's. I'm still away and have not visited the machine and not had a chance to effect the changes I intended to effect (new drivers and new BOINC front-end).

I think you are right, even if I had done those changes, it may not have perked right up.

And as I'm recalling this machine (named Heinlein), though I normally use AGP cards, I think I used a PCI vid card--which most likely was an ATI.

THAT explains why this machine (which is SO similar to Asimov and other hosts I've set up) kept "coughing".

Well, I'm glad the Albert WU's came along. Helped resolve a problem that was stumping me till the other postings in this thread started steering me towards potential root issues.

Bruce Richardson
Team Richwood Timber LLC

When the going gets weird, the weird turn PRO. -- Hunter S. Thompson

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.