In the past we were informed

Colin Porter
Joined: 15 Feb 05
Posts: 21
Credit: 6,479,335
RAC: 0
Topic 192428

In the past we were informed about what was happening with Einstein.

Now, even though the project has been down for 5 or 6 days, all I've got is:

14/02/2007 20:13:19|Einstein@Home|Sending scheduler request: To fetch work
14/02/2007 20:13:19|Einstein@Home|Requesting 26 seconds of new work
14/02/2007 20:15:26|Einstein@Home|Scheduler request failed: HTTP internal server error
14/02/2007 20:15:26|Einstein@Home|Deferring communication for 27 min 55 sec
14/02/2007 20:15:26|Einstein@Home|Reason: scheduler request failed
14/02/2007 20:43:29|Einstein@Home|Sending scheduler request: To fetch work
14/02/2007 20:43:29|Einstein@Home|Requesting 26 seconds of new work
14/02/2007 20:43:32|Einstein@Home|Scheduler RPC succeeded
14/02/2007 20:43:32|Einstein@Home|Message from server: Server can't open database
14/02/2007 20:43:32|Einstein@Home|Deferring communication for 1 hr 0 min 0 sec
14/02/2007 20:43:32|Einstein@Home|Reason: project is down
14/02/2007 20:43:32|Einstein@Home|Deferring communication for 2 hr 34 min 37 sec
14/02/2007 20:43:32|Einstein@Home|Reason: project is down

What a shame!

Warning! This post contains atrocious spelling, and terrible grammar. Approach with extreme edginess.

It's not the speed, but the quality - Until I get a faster computer

BarryAZ
Joined: 8 May 05
Posts: 190
Credit: 121,445,222
RAC: 1,671,602

What I've seen over the past week or so is that the server seems to be connection limited -- I'd get a 'server is down' when reporting a result, and then if I retried even a couple of minutes later, the upload would succeed. Same for downloads.

Clearly there is a constraint we've not encountered before.

Also, as you note, for whatever reason, status information is not being provided.

Similarly, when other problems occur (not just the 'narrow pipe syndrome' I noted above), we are also left without information. At the moment, it seems there may be a database problem which is delaying validation. This seems to be something new for today (though I recall something like this about a week ago, prior to a temporary and unremarked-upon outage).

For me, the travails of the past two months have simply meant I reallocate CPU cycles to projects with higher current reliability. Sort of turnabout for me -- what drove me to add Einstein to my project list 21 months ago was reliability problems over on Seti. I've since added four other projects, and while Einstein has the largest share of total work done by me, it is now my 4th ranking project over the past two months.

Pooh Bear 27
Joined: 20 Mar 05
Posts: 1,376
Credit: 20,312,671
RAC: 0

There are other threads talking about this, and there have been some answers in some of them. Posting new threads on information we already have just confuses the issue more.

Part of the issue is a known BOINC database-side issue. The BOINC developers are trying to get this corrected, then sent down the pipe to the other projects, so they can implement the fixes, and get things more on track.

Please just try and be patient. We are just volunteers, with nothing to lose. We help a project out of the goodness of our hearts. The teams that run the projects are usually a skeleton crew, who all have other jobs (full time professors, etc.). So their time is also limited in what they can do.

There are other worthy projects to crunch for. No one forces you to stay on one project, and it's actually encouraged to do several to help multiple causes, and keep your machines busy, if one does have an outage.

Constantinos
Joined: 16 May 05
Posts: 39
Credit: 938,513
RAC: 0

RE: There are other threads

Message 60874 in response to message 60873

Quote:

There are other threads talking about this, and there have been some answers in some of them. Posting new threads on information we already have just confuses the issue more.

Part of the issue is a known BOINC database-side issue. The BOINC developers are trying to get this corrected, then sent down the pipe to the other projects, so they can implement the fixes, and get things more on track.

Please just try and be patient. We are just volunteers, with nothing to lose. We help a project out of the goodness of our hearts. The teams that run the projects are usually a skeleton crew, who all have other jobs (full time professors, etc.). So their time is also limited in what they can do.

There are other worthy projects to crunch for. No one forces you to stay on one project, and it's actually encouraged to do several to help multiple causes, and keep your machines busy, if one does have an outage.

You are absolutely right that we are just volunteers with goodness in our hearts (sic). That is exactly why we could be given a piece of information now and then! I am not leaving EAH despite the problems. After all, I like seeing my pending credits growing by the day (around 1600 by now).

Constantinos

Gravity increases significantly in Autumn, because apples fall in large numbers during that time!

EclipseHA
Joined: 19 Feb 05
Posts: 41
Credit: 10,540,182
RAC: 0

I think a KEY here is to see a post from one or more folks directly working on the project.

Not a post from a moderator who gleaned info from other places, or guesses by other posters.

My "pending" here is about 10 times what I ever saw before on this project.

Can't one of the actual project admins (not a forum moderator) give us a guess when the BOINC DB fixes might be installed? Are we talking a day, a week, or a month?

Moderators/guessers need not reply, and it's only an uninformed guess!

Pooh Bear 27
Joined: 20 Mar 05
Posts: 1,376
Credit: 20,312,671
RAC: 0

Matt L post

Not a guess, but an actual post by a BOINC developer explaining that there are database issues. If you read the whole thread, it does explain Einstein is part of this issue.

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4,071
Credit: 226,802,773
RAC: 29,820

Sorry folks, we're still having serious database problems. So far everything we are doing to solve them seems to have a temporary effect only, and the DB connections soon get unreliable again. We're working on this, but I don't have an estimation of when it will be solved.

BM

clownius
Joined: 16 Jun 06
Posts: 42
Credit: 2,164,665
RAC: 0

Thank you for the post. This was by far one of the most reliable projects in BOINC until recently. I hope you manage to get things sorted out, as B@A is chomping at the bit to do an AA here, but the last 2 votes have narrowly gone to other projects due to stability issues, so we have missed out on Einstein for our AAs.

P.S. The more info we get the better.

DanNeely
Joined: 4 Sep 05
Posts: 1,360
Credit: 3,058,786,858
RAC: 3,051,897

RE: Sorry folks, we're

Message 60879 in response to message 60877

Quote:

Sorry folks, we're still having serious database problems. So far everything we are doing to solve them seems to have a temporary effect only, and the DB connections soon get unreliable again. We're working on this, but I don't have an estimation of when it will be solved.

BM

If possible, could you post this on the front page as well? The more visible location might actually manage to stop the whining for more than a day or two.

Astro
Joined: 18 Jan 05
Posts: 257
Credit: 1,000,560
RAC: 0

I was thinking that the additional load from boincsynergy having Einstein as POTM (project of the month) wasn't helping Einstein. If the issue is a slow DB query, then reducing the load upon it might ease things up a bit. I'm setting my 7 hosts to NNW (no new work) and will crunch elsewhere until the back end can handle the additional load.

I realize my efforts alone will only help about 0.0001%, but it's something I can do.

cu later

Good luck Admins

tony

Pooh Bear 27
Joined: 20 Mar 05
Posts: 1,376
Credit: 20,312,671
RAC: 0

RE: Sorry folks, we're

Message 60881 in response to message 60877

Quote:

Sorry folks, we're still having serious database problems. So far everything we are doing to solve them seems to have a temporary effect only, and the DB connections soon get unreliable again. We're working on this, but I don't have an estimation of when it will be solved.

BM


Bernd,

Would it be beneficial or detrimental if we backed off crunching for a little bit? Suspending Einstein processing for a week or two will not hurt my computers, but would that help you and your team get things on track? Or would it have the reverse effect: things get back in order, then all of a sudden all these machines come back online and cause more issues?

I think that is the information all of us would like to know.
