Lots of Client errors

Pushkin
Pushkin
Joined: 12 Mar 07
Posts: 15
Credit: 33187685
RAC: 0
Topic 194676

Hello, after change of hardware and software configuration I receive a lot of Compute errors in Einstein's workunits (see here). Do you have an idea where could be the problem?
They appear on computer ID 2183839 (AMD Phenom(tm) II X4 945 running openSUSE 11.2 x86_64 (Kernel 2.6.31.5), original BOINC Client 6.10.19 (reports itself as 6.10.17; GPU computations disabled by this trick)). Thank you for your assistance.

Richard Haselgrove
Richard Haselgrove
Joined: 10 Dec 05
Posts: 2143
Credit: 2961429272
RAC: 693247

Lots of Client errors

Looks like a disk or disk access problem.

I found a

Input file l1_1112.10_S5R4 missing or invalid

and a

Error: couldn't parse symbol information for file name: ��'�D$��7#�D$���&�$������&�t$�$�7#�D$�M

Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5872
Credit: 117807251835
RAC: 34702878

Is your machine

Is your machine overclocked?

The errors look like what you can see if the overclocking is a bit too aggressive or if the machine is overheating (or both).

Cheers,
Gary.

Pushkin
Pushkin
Joined: 12 Mar 07
Posts: 15
Credit: 33187685
RAC: 0

RE: Looks like a disk or

Message 95988 in response to message 95986

Quote:

Looks like a disk or disk access problem.

I found a

Input file l1_1112.10_S5R4 missing or invalid

and a

Error: couldn't parse symbol information for file name: ��'�D$��7#�D$���&�$������&�t$�$�7#�D$�M

Actually it seems like that, but I don't want to believe it, since my computer behaves stable (I have no problems except KDE4 bugs). My Einstein is running on 1TB Seagate ST31000528AS disk, partition 6, formatted as ReiserFS 3.6. The only known bug related to disk behaviour is some strange compatibility problem of KDE4 with non-UTF8 symbols, but I don't think it has something to do with Einstein.

But, there is one more bug in my system I know about - my motherboard (ASUS M4A78T-E) utilizes AMD SB790 which has a bug with Kingston USB pens... maybe there are some more problem. Do you have some similar reports?

Pushkin
Pushkin
Joined: 12 Mar 07
Posts: 15
Credit: 33187685
RAC: 0

RE: Is your machine

Message 95989 in response to message 95987

Quote:

Is your machine overclocked?

The errors look like what you can see if the overclocking is a bit too aggressive or if the machine is overheating (or both).

No, my CPU is not overclocked, nor temperatures show any problem. My cores tick on 3GHz and the CPU temperature is 54 degrees C, which is, I think, quite OK.

Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5872
Credit: 117807251835
RAC: 34702878

RE: No, my CPU is not

Message 95990 in response to message 95989

Quote:
No, my CPU is not overclocked, nor temperatures show any problem.


OK, fine.

The next thing to check is memory. Is it possible to test your RAM with memtest86 or perhaps to try different memory sticks to see if the problem goes away? You could also try doing a surface scan of your HDD at the same time just in case there are any bad sectors.

Cheers,
Gary.

Pushkin
Pushkin
Joined: 12 Mar 07
Posts: 15
Credit: 33187685
RAC: 0

RE: RE: No, my CPU is not

Message 95991 in response to message 95990

Quote:
Quote:
No, my CPU is not overclocked, nor temperatures show any problem.

OK, fine.

The next thing to check is memory. Is it possible to test your RAM with memtest86 or perhaps to try different memory sticks to see if the problem goes away? You could also try doing a surface scan of your HDD at the same time just in case there are any bad sectors.

I dit both test and they are OK. I used Memtest86+ 4.00, it did two passes of memory tests and they are both without problem, but their duration is different - the first test took 35mins, but the second took approximately one hour, is it normal?

Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5872
Credit: 117807251835
RAC: 34702878

RE: I dit both test and

Message 95992 in response to message 95991

Quote:
I dit both test and they are OK. I used Memtest86+ 4.00, it did two passes of memory tests and they are both without problem, but their duration is different - the first test took 35mins, but the second took approximately one hour, is it normal?


When I want to test a machine, I have a boot CD that has a range of test utilities. The version of Memtest86 I've used is something like 3.8x if I remember correctly. My recollection is that each suite of tests that represents a pass takes a certain amount of time which is much the same for each pass. I don't know why you saw such a big difference for the second pass.

The CD contains a utility called HDTune which can do a thorough surface scan of the hard disk. Depending on the size of the disk it can take quite a while to complete. Did you do a proper surface scan of the disk? You may also be able to find a manufacturer supplied test tool if you want to exhaust all possibilities.

Cheers,
Gary.

Pushkin
Pushkin
Joined: 12 Mar 07
Posts: 15
Credit: 33187685
RAC: 0

RE: Did you do a proper

Message 95993 in response to message 95992

Quote:
Did you do a proper surface scan of the disk?

Unfortunately reserfsck does not offer such a choice, so I did only default filesystem check, which was OK. I'll try to find manufacturer's tool.

Richard Haselgrove
Richard Haselgrove
Joined: 10 Dec 05
Posts: 2143
Credit: 2961429272
RAC: 693247

RE: RE: Did you do a

Message 95994 in response to message 95993

Quote:
Quote:
Did you do a proper surface scan of the disk?

Unfortunately reserfsck does not offer such a choice, so I did only default filesystem check, which was OK. I'll try to find manufacturer's tool.


The Ultimate Boot CD, which I guess is what Gary was referring to, has a number of manufacter's test programs ready for use. It was recommended to me by the trade supplier where I get most of my kit.

mickydl*
mickydl*
Joined: 7 Oct 08
Posts: 39
Credit: 200374822
RAC: 0

RE: RE: Looks like a disk

Message 95995 in response to message 95988

Quote:
Quote:

Looks like a disk or disk access problem.

I found a

Input file l1_1112.10_S5R4 missing or invalid

and a

Error: couldn't parse symbol information for file name: ��'�D$��7#�D$���&�$������&�t$�$�7#�D$�M

Actually it seems like that, but I don't want to believe it, since my computer behaves stable (I have no problems except KDE4 bugs). My Einstein is running on 1TB Seagate ST31000528AS disk, partition 6, formatted as ReiserFS 3.6. The only known bug related to disk behaviour is some strange compatibility problem of KDE4 with non-UTF8 symbols, but I don't think it has something to do with Einstein.

But, there is one more bug in my system I know about - my motherboard (ASUS M4A78T-E) utilizes AMD SB790 which has a bug with Kingston USB pens... maybe there are some more problem. Do you have some similar reports?

Hello, just found this thread and wanted to let you know that I have the same problem and a similar configuration computer ID (Asus M3N78-EM, AMD Phenom(tm) II X4 940, 8GB Ram, Seagate ST3300831A formatted with ext3 and WD1001FALS-0 formatted with ext3, Suse 11.2).

Among others, I get the

Error: couldn't parse symbol information for file name: ��'�D$��7#�D$���&�$������&�t$�$�7#�D$�M

type of error most of the time.

Unfortunately, in a first attempt to fix the problem myself, I ended up with a configuration that doesn't allow me to do any work at all anymore. Upon stating the BOINC manager it doesn't show any of my unfinished WUs or any of the projects I am attached to. Also, I cannot attach to any new projects. The manager seems to try to connect to "localhost", probably fails and stays "disconnected". Installing BOINC in a different location does not fix the problem.

The second Problem is likely to be a configuration problem on my machine, but right now I don't have any clue where or what to look for. Any suggestions?

Thanks,
Michael

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.