I recieved 5 validate errors today so far:
Is 5 validate errors a day a cause for concern? Could I have a faulty CPU?
I've gotten 9 invalid WU's the last 2 days, on 2 separate machines.
All the invalid WU's are for binary pulsar search. The Global correlations WU's work fine.
Looks more like data or server problems than client side error.
Got one too with my CPU along with my wingman also using cpu.
Never ever had a error before on any project.
It's an ABP WU showing a strange long predicted time to completion ( over 8 hours ) and took indeed more then 14k seconds opposed to less then 6k usually.
I also have 9 failures, all sent to me on 29/30th Sep which have validate errors on every host they've been returned from, looks like a set of rogue workunits !
I also have 9 failures, all sent to me on 29/30th Sep which have validate errors on every host they've been returned from, looks like a set of rogue workunits !
Same here!
Shall I post the list?
And yes, computation of all (but one) lasted about three times longer than usual.
I also have 9 failures, all sent to me on 29/30th Sep which have validate errors on every host they've been returned from, looks like a set of rogue workunits !
Same here!
Shall I post the list?
And yes, computation of all (but one) lasted about three times longer than usual.
The size of the ABP tasks have been increased. The admins are aware of the problem.
Occasionally there have been ABP workunits where all successful results ended up as a validate error. The problem with these workunits is actually in the original observatory data. Previously we canceled these workunits manually, but with increased processing rate this turned out to require too much work.
We have now tuned the ABP2 validator such that these type of error at least pass validation (and need to be dealt with in post processing). For testing I prepared one of these workunits for re-validation. If all goes well (i.e. the results pass the new validator), I'll do the same with all other such WUs I can find.
Note: this has nothing to do with the validate errors of the early larger ABP workunits reported recently. For this, refer to the news item.
I am getting whole batches of workunits that are failing as soon as they start up. Cpu time is .06 or less. I now have over 100 of them for the past week. This does not sound like the same problem described in the recent post.
RE: I recieved 5 validate
I've gotten 9 invalid WU's the last 2 days, on 2 separate machines.
All the invalid WU's are for binary pulsar search. The Global correlations WU's work fine.
Looks more like data or server problems than client side error.
Got one too with my CPU along
Got one too with my CPU along with my wingman also using cpu.
Never ever had a error before on any project.
It's an ABP WU showing a strange long predicted time to completion ( over 8 hours ) and took indeed more then 14k seconds opposed to less then 6k usually.
http://einsteinathome.org/workunit/83874456
Have two more of them to crunch but will put them on hold for the moment.
I also have 9 failures, all
I also have 9 failures, all sent to me on 29/30th Sep which have validate errors on every host they've been returned from, looks like a set of rogue workunits !
And two more (apparently
And two more (apparently these longer ABP WUs contain 10 sub-jobs instead of the usual 4).
http://einsteinathome.org/workunit/83947891
http://einsteinathome.org/workunit/83925708
RE: Got one too with my CPU
The status "Validate error" has nothing to do with the client. It simply means that the server can't find the result where it's looking for it.
Gruß,
Gundolf
Computer sind nicht alles im Leben. (Kleiner Scherz)
http://einstein.phys.uwm.edu/
http://einsteinathome.org/workunit/83976171
Seems like I'm getting Validate error.
I tried restarting the system to see if that will make new tasks computed correctly.
RE: I also have 9 failures,
Same here!
Shall I post the list?
And yes, computation of all (but one) lasted about three times longer than usual.
RE: RE: I also have 9
The size of the ABP tasks have been increased. The admins are aware of the problem.
Kathryn :o)
Einstein@Home Moderator
Occasionally there have been
Occasionally there have been ABP workunits where all successful results ended up as a validate error. The problem with these workunits is actually in the original observatory data. Previously we canceled these workunits manually, but with increased processing rate this turned out to require too much work.
We have now tuned the ABP2 validator such that these type of error at least pass validation (and need to be dealt with in post processing). For testing I prepared one of these workunits for re-validation. If all goes well (i.e. the results pass the new validator), I'll do the same with all other such WUs I can find.
Note: this has nothing to do with the validate errors of the early larger ABP workunits reported recently. For this, refer to the news item.
BM
BM
I am getting whole batches of
I am getting whole batches of workunits that are failing as soon as they start up. Cpu time is .06 or less. I now have over 100 of them for the past week. This does not sound like the same problem described in the recent post.