Validate errors [pls post here] [CLOSED]

Logforme
Logforme
Joined: 13 Aug 10
Posts: 332
Credit: 1714373961
RAC: 0

RE: I recieved 5 validate

Message 83248 in response to message 83247

Quote:
I recieved 5 validate errors today so far:
Is 5 validate errors a day a cause for concern? Could I have a faulty CPU?

I've gotten 9 invalid WU's the last 2 days, on 2 separate machines.
All the invalid WU's are for binary pulsar search. The Global correlations WU's work fine.

Looks more like data or server problems than client side error.

Zap
Zap
Joined: 12 Feb 06
Posts: 15
Credit: 3900434
RAC: 0

Got one too with my CPU along

Message 83249 in response to message 83248

Got one too with my CPU along with my wingman also using cpu.
Never ever had a error before on any project.
It's an ABP WU showing a strange long predicted time to completion ( over 8 hours ) and took indeed more then 14k seconds opposed to less then 6k usually.

http://einsteinathome.org/workunit/83874456

Have two more of them to crunch but will put them on hold for the moment.

Pete Burgess
Pete Burgess
Joined: 7 Dec 05
Posts: 21
Credit: 318570870
RAC: 0

I also have 9 failures, all

I also have 9 failures, all sent to me on 29/30th Sep which have validate errors on every host they've been returned from, looks like a set of rogue workunits !

Zapp
Zapp
Joined: 27 Mar 10
Posts: 5
Credit: 1354983
RAC: 0

And two more (apparently

And two more (apparently these longer ABP WUs contain 10 sub-jobs instead of the usual 4).

http://einsteinathome.org/workunit/83947891
http://einsteinathome.org/workunit/83925708

Gundolf Jahn
Gundolf Jahn
Joined: 1 Mar 05
Posts: 1079
Credit: 341280
RAC: 0

RE: Got one too with my CPU

Message 83252 in response to message 83249

Quote:
Got one too with my CPU along with my wingman also using cpu.
Never ever had a error before on any project.


The status "Validate error" has nothing to do with the client. It simply means that the server can't find the result where it's looking for it.

Gruß,
Gundolf

Computer sind nicht alles im Leben. (Kleiner Scherz)

tsubasa_kato@hotmail.com
tsubasa_kato@ho...
Joined: 11 Nov 06
Posts: 1
Credit: 17912813
RAC: 45098

http://einstein.phys.uwm.edu/

Message 83253 in response to message 83252

http://einsteinathome.org/workunit/83976171

Seems like I'm getting Validate error.

I tried restarting the system to see if that will make new tasks computed correctly.

tear
tear
Joined: 12 Sep 10
Posts: 9
Credit: 9914974
RAC: 0

RE: I also have 9 failures,

Message 83254 in response to message 83250

Quote:
I also have 9 failures, all sent to me on 29/30th Sep which have validate errors on every host they've been returned from, looks like a set of rogue workunits !


Same here!

Shall I post the list?

And yes, computation of all (but one) lasted about three times longer than usual.

KSMarksPsych
KSMarksPsych
Moderator
Joined: 15 Oct 05
Posts: 2702
Credit: 4090227
RAC: 0

RE: RE: I also have 9

Message 83255 in response to message 83254

Quote:
Quote:
I also have 9 failures, all sent to me on 29/30th Sep which have validate errors on every host they've been returned from, looks like a set of rogue workunits !

Same here!

Shall I post the list?

And yes, computation of all (but one) lasted about three times longer than usual.

The size of the ABP tasks have been increased. The admins are aware of the problem.

Kathryn :o)

Einstein@Home Moderator

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4273
Credit: 245211663
RAC: 12910

Occasionally there have been

Message 83256 in response to message 83237

Occasionally there have been ABP workunits where all successful results ended up as a validate error. The problem with these workunits is actually in the original observatory data. Previously we canceled these workunits manually, but with increased processing rate this turned out to require too much work.

We have now tuned the ABP2 validator such that these type of error at least pass validation (and need to be dealt with in post processing). For testing I prepared one of these workunits for re-validation. If all goes well (i.e. the results pass the new validator), I'll do the same with all other such WUs I can find.

Note: this has nothing to do with the validate errors of the early larger ABP workunits reported recently. For this, refer to the news item.

BM

BM

KB2SYB
KB2SYB
Joined: 21 Feb 05
Posts: 4
Credit: 52935198
RAC: 18395

I am getting whole batches of

I am getting whole batches of workunits that are failing as soon as they start up. Cpu time is .06 or less. I now have over 100 of them for the past week. This does not sound like the same problem described in the recent post.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.