If you read some of the other threads, there are some database problems at Einstein right now, affecting validating, uploads, downloads and reporting. Until these are fixed, we will have some issues.
The BOINC developers know of this issue, and it can affect other projects in the same way. Einstein hit the threshold of a certain part of the database, and a fix has been identified, being tested, and hopefully in production soon. Then Einstein will hopefully be back to it's fine old self.
My sense is that in addition to the perhaps related set of database problems, there may be some other issues going on specific to Einstein. One of the more obvious of these is where the web site itself is hosted -- it appears to be hosted on one of the same servers which handles at least some component of the database, so that the home page dies when that particular server needs to be boinc'd to restart.
In fact, it seems that there are some fairly real hardware constraints -- in that the SETI folks, even with their much larger user base and some of the same database issues (which caused them big time problems a year ago as well), seems to be more robust than here. Not sure what the solution to that will be.
One thing the SETI folks did was set up a separate 'tech board' which had regular participation from the admins. Here finding out what issues are pressing involves a fair amount of digging thru various threads to find out information. On the other hand, it seems at least some of the tech threads over in the SETI server have gotten hi-jacked by folks with other agendas sad to say.
But lets face it, Einstein is struggling right now -- and has been up against it for about two months. I certainly hope that database fixes/tweaks, whatever resolve matters. I'm also hoping the new work units will help as well -- I think when Einstein started sending out the short cycle workunits that made for more handling work for the server and hope that the new work units return to the larger size that were the standard last year when Einstein seem to hum right along.
For now though, I am REAL glad I have multiple projects to support.
Quote:
If you read some of the other threads, there are some database problems at Einstein right now, affecting validating, uploads, downloads and reporting. Until these are fixed, we will have some issues.
The BOINC developers know of this issue, and it can affect other projects in the same way. Einstein hit the threshold of a certain part of the database, and a fix has been identified, being tested, and hopefully in production soon. Then Einstein will hopefully be back to it's fine old self.
I'm reasonably happy with my existing set -- SETI, World Grid, and Rosetta have been excellent for 'normal cycle' work units, and I have BBC Climate, and Climate for 'long cycle' work units. I am sure there are other excellent projects. If Einstein doesn't recover at all and I find my detaching permanently from it, then I will look to a replacement. But at the moment, even with Einstein suspended, nearly all the workstations in my farm have at least three active projects to support.
Quote:
Quote:
For now though, I am REAL glad I have multiple projects to support.
Well, there seem to be more than a few users with the same problem, it would seem. Another user posted a thread with similar problems, and I was about to post my own one until I read this one and the other one: I have a lot of results backed up, even though the full quorum for all of them is done with the computation.
As a final comment to some people, please don't keep starting new threads with essentially the same complaint in perhaps a slightly different guise. We all know you are frustrated and we all support your right to express that frustration. But please not in umpteen different threads with pretty much the same winge over and over again. That creates its own level of frustration in others.
Rather than chastise folks who don't go spelunking the multiple boards and various topics on those multiple boards, and then look for a relatively recent message included in those multiple boards and various topics, it seems reasonable to expect a home page notice and summary (as happens on many other projects).
For that matter, the explanations offered regarding *suppositions* of the problems here strike me as incomplete. From what I can see, it is NOT a single issue, but a combination of issues, some are clearly the data base itself (though I'd note that the attribution of this problem to something that affects large projects more severely than others seems less accurate than one would expect given that the SETI project, which noted this problem, has managed to run far more reliably than Einstein over the past two months.). So, yes, there is a data base issue. But also, it seems there is a hardware issue (which may have been resolved). Further, linking one of the database servers to the home page host server means that periodically, the home page itself goes offline. That is a less than optimal approach and tends to discourage folks.
The thing is, we've not seen an update on the home page regarding problems in about a month. If you looked there, you'd think things were running fine. Since they clearly are not, and folks DO CARE, they post in one board or another, and either create a new topic, or post to one of several topics started or both. That isn't THEIR FAULT. If one expects posters to scan all the threads for news, then it is quite reasonable to expect admins to post a summary to reduce the overhead and 'uncertainty angst'.
Quote:
Anderl, Gildardo and others,
please search or browse the forum before you post!
Rather than chastise folks who don't go spelunking the multiple boards and various topics on those multiple boards, and then look for a relatively recent message included in those multiple boards and various topics, it seems reasonable to expect a home page notice and summary (as happens on many other projects).
For that matter, the explanations offered regarding *suppositions* of the problems here strike me as incomplete. From what I can see, it is NOT a single issue, but a combination of issues, some are clearly the data base itself (though I'd note that the attribution of this problem to something that affects large projects more severely than others seems less accurate than one would expect given that the SETI project, which noted this problem, has managed to run far more reliably than Einstein over the past two months.). So, yes, there is a data base issue. But also, it seems there is a hardware issue (which may have been resolved). Further, linking one of the database servers to the home page host server means that periodically, the home page itself goes offline. That is a less than optimal approach and tends to discourage folks.
The thing is, we've not seen an update on the home page regarding problems in about a month. If you looked there, you'd think things were running fine. Since they clearly are not, and folks DO CARE, they post in one board or another, and either create a new topic, or post to one of several topics started or both. That isn't THEIR FAULT. If one expects posters to scan all the threads for news, then it is quite reasonable to expect admins to post a summary to reduce the overhead and 'uncertainty angst'.
Quote:
Anderl, Gildardo and others,
please search or browse the forum before you post!
Barry,
the post I linked to above "Possible Answers to some of your Questions" is a sticky thread on top of the "Problems and Bug Reports" section of the forum. It should not be too demanding to read at least these top posts. BTW: If you read that post you knew why they don't post on the homepage.
Pending Credit
)
If you read some of the other threads, there are some database problems at Einstein right now, affecting validating, uploads, downloads and reporting. Until these are fixed, we will have some issues.
The BOINC developers know of this issue, and it can affect other projects in the same way. Einstein hit the threshold of a certain part of the database, and a fix has been identified, being tested, and hopefully in production soon. Then Einstein will hopefully be back to it's fine old self.
RE: Since over a week, all
)
Just hang tight - it'll all get sorted out soon enough.
Just imagine what your BOINCStats graph will look like when you get all that credit in a day!
;-) Thank's for your
)
;-)
Thank's for your responses
My sense is that in addition
)
My sense is that in addition to the perhaps related set of database problems, there may be some other issues going on specific to Einstein. One of the more obvious of these is where the web site itself is hosted -- it appears to be hosted on one of the same servers which handles at least some component of the database, so that the home page dies when that particular server needs to be boinc'd to restart.
In fact, it seems that there are some fairly real hardware constraints -- in that the SETI folks, even with their much larger user base and some of the same database issues (which caused them big time problems a year ago as well), seems to be more robust than here. Not sure what the solution to that will be.
One thing the SETI folks did was set up a separate 'tech board' which had regular participation from the admins. Here finding out what issues are pressing involves a fair amount of digging thru various threads to find out information. On the other hand, it seems at least some of the tech threads over in the SETI server have gotten hi-jacked by folks with other agendas sad to say.
But lets face it, Einstein is struggling right now -- and has been up against it for about two months. I certainly hope that database fixes/tweaks, whatever resolve matters. I'm also hoping the new work units will help as well -- I think when Einstein started sending out the short cycle workunits that made for more handling work for the server and hope that the new work units return to the larger size that were the standard last year when Einstein seem to hum right along.
For now though, I am REAL glad I have multiple projects to support.
RE: For now though, I am
)
QMC@HOME can give fast CPUs a good load.
Tullio
I'm reasonably happy with my
)
I'm reasonably happy with my existing set -- SETI, World Grid, and Rosetta have been excellent for 'normal cycle' work units, and I have BBC Climate, and Climate for 'long cycle' work units. I am sure there are other excellent projects. If Einstein doesn't recover at all and I find my detaching permanently from it, then I will look to a replacement. But at the moment, even with Einstein suspended, nearly all the workstations in my farm have at least three active projects to support.
Well, there seem to be more
)
Well, there seem to be more than a few users with the same problem, it would seem. Another user posted a thread with similar problems, and I was about to post my own one until I read this one and the other one: I have a lot of results backed up, even though the full quorum for all of them is done with the computation.
Anderl, Gildardo and
)
Anderl, Gildardo and others,
please search or browse the forum before you post!
One of the moderators posted this: Possible Answers to some of your Questions
So:
Rather than chastise folks
)
Rather than chastise folks who don't go spelunking the multiple boards and various topics on those multiple boards, and then look for a relatively recent message included in those multiple boards and various topics, it seems reasonable to expect a home page notice and summary (as happens on many other projects).
For that matter, the explanations offered regarding *suppositions* of the problems here strike me as incomplete. From what I can see, it is NOT a single issue, but a combination of issues, some are clearly the data base itself (though I'd note that the attribution of this problem to something that affects large projects more severely than others seems less accurate than one would expect given that the SETI project, which noted this problem, has managed to run far more reliably than Einstein over the past two months.). So, yes, there is a data base issue. But also, it seems there is a hardware issue (which may have been resolved). Further, linking one of the database servers to the home page host server means that periodically, the home page itself goes offline. That is a less than optimal approach and tends to discourage folks.
The thing is, we've not seen an update on the home page regarding problems in about a month. If you looked there, you'd think things were running fine. Since they clearly are not, and folks DO CARE, they post in one board or another, and either create a new topic, or post to one of several topics started or both. That isn't THEIR FAULT. If one expects posters to scan all the threads for news, then it is quite reasonable to expect admins to post a summary to reduce the overhead and 'uncertainty angst'.
RE: Rather than chastise
)
Barry,
the post I linked to above "Possible Answers to some of your Questions" is a sticky thread on top of the "Problems and Bug Reports" section of the forum. It should not be too demanding to read at least these top posts. BTW: If you read that post you knew why they don't post on the homepage.