S5R1 and beyond

ForumsCruncher's Corner

Bernd Machenschalk
Bernd Machenschalk
Joined: 15 Oct 04
Posts: 3633
Credit: 133682898
RAC: 106514
Topic 192312

This is a short status update. All of us have been quite busy, as you probably can imagine, trying to fix all kinds of problems, and we still are.

- Today we generated the last Workunit of S5R1. All that remains to do of that run is to crunch the remaining Workunits that are already in the database and for which no canonical result has been found yet.

[edit:] - There are probably only a small number of tasks remaining for every frequency band, which causes hosts to download a new datafile for almost each task. Dial-Up users may want to suspend the project for the next few days.

- A lot of problems we had recently, in particular the database problems, seem to have come mostly from the fact that near the end of S5R1 much more short Workunits were left, so they came in at a much higher rate than we expected. With the end of S5R1, things should be back to normal again.

- We are currently testing the setup for a new run that will look again into a smaller frequency range of the current S5R1 dataset with modified parameters (spindown and mismatch). We hope to start distributing this new workunits in the next days, so there should not be much of a gap to the S5R1 run. This run will last 2-3 months. It will consist of only one type of workunits that are a bit more than half as long as the S5R1 long ones have been.

I hope to have time to post some more info here as soon as it becomes available.

BM

BM

GreyCruncher
GreyCruncher
Joined: 2 Sep 06
Posts: 22
Credit: 28573232
RAC: 1034

S5R1 and beyond

Quote:

- A lot of problems we had recently, in paricular the database problems, seem to have come mostly from the fact that near the end of S5R1 much more short Workunits were left, so they came in at a much higher rate than we expected. With the end of S5R1, things should be back to normal again.

Many of us can't schedule the crunched files for more than 20 hours now. When do you expect to solve the problems?

All the problems were known since end of December. The information of the project officials is still disappointing for all cruncher.

EggZZ
EggZZ
Joined: 7 Feb 06
Posts: 2
Credit: 9102991
RAC: 16670

bye e@h... :( EggZZ

bye e@h...

:(

EggZZ

GreyCruncher
GreyCruncher
Joined: 2 Sep 06
Posts: 22
Credit: 28573232
RAC: 1034

RE: I hope to have time to

Quote:

I hope to have time to post some more info here as soon as it becomes available.
BM

My personal thought, someone of the officials should have the time to post some news on the project otherwise many chruncher will leave the project.

Annika_2
Annika
Joined: 8 Aug 06
Posts: 720
Credit: 494410
RAC: 0

Oh come on... chill out,

Oh come on... chill out, guys. You know all members of the project staff are giving their best; what more can they do? I didn't think information was so bad here. Don't forget all those months and months the problem has been running smoothly; it must have been one of the most stable projects around. Of course all those recent problems are frustrating, but as I said, they're all giving their best, so give them a break...

Nightbird
Nightbird
Joined: 17 Feb 05
Posts: 79
Credit: 561723
RAC: 0

RE: RE: I hope to have

Message 59477 in response to message 59475

Quote:
Quote:

I hope to have time to post some more info here as soon as it becomes available.
BM

My personal thought, someone of the officials should have the time to post some news on the project otherwise many chruncher will leave the project.


I will not leave E@Home, sure.
The recent "problems" are the first since many time now so some patience is needeed, that's all.

[

GreyCruncher
GreyCruncher
Joined: 2 Sep 06
Posts: 22
Credit: 28573232
RAC: 1034

What happens now? All WU

What happens now?
All WU uploaded, but report responses this (Example of one Host):

17.01.2007 00:39:23|Einstein@Home|Project is down
17.01.2007 01:01:05|Einstein@Home|Sending scheduler request to http://einstein.phys.uwm.edu/EinsteinAtHome_cgi/cgi
17.01.2007 01:01:05|Einstein@Home|Reason: Requested by user
17.01.2007 01:01:05|Einstein@Home|Reporting 15 tasks
17.01.2007 01:01:10|Einstein@Home|Scheduler request succeeded
17.01.2007 01:01:10|Einstein@Home|Message from server: Project is temporarily shut down for maintenance
17.01.2007 01:01:10|Einstein@Home|Project is down

For more than 24 Hours

Most of our Hosts report:
Communication deferred 167.00.00 hours :(

IMHO this is not a professionell work

Chris(one of the Borg Cube)

Winterknight
Winterknight
Joined: 4 Jun 05
Posts: 310
Credit: 1490160
RAC: 0

Bernd, Thanks for keeping us

Bernd,
Thanks for keeping us updated.

And we will remain Patient won't we.

Andy

MattDavis_2
MattDavis
Joined: 18 Jan 05
Posts: 68
Credit: 9886790
RAC: 14501

Calm down, people. The team

Calm down, people. The team is doing the best they can. Einstein has fun flawlessly FOREVER - it has problems for the first time and you jump down the team's throat?

F. Prefect
F. Prefect
Joined: 7 Nov 05
Posts: 135
Credit: 1016868
RAC: 0

RE: Calm down, people. The

Message 59481 in response to message 59480

Quote:
Calm down, people. The team is doing the best they can. Einstein has fun flawlessly FOREVER - it has problems for the first time and you jump down the team's throat?

In the past 5 years I have spent time at SETI, FAD (a completed project controlled from Oxford) and Einstein@home. I would have to say without hesitation that Einstien has had the lowest down time of the 3.

But the one thing that I can't understand is why someone at the project can't spare a couple of minutes, type a sentence or two advising when the project is expected to be back up and put it up on one of pages that can still be acessed. Keeping new users in the dark for hours on end when the project goes down is the best way to send them running to some other project as fast as they can d/l the the software. I guess I just don't understand the scientific mind.

F. Prefect

In the beginning the Universe was created. This has made a lot of people very angry and been widely regarded as a bad move.....Douglas Adams

Metod_ S56RKO
Metod, S56RKO
Joined: 11 Feb 05
Posts: 135
Credit: 286181719
RAC: 297527

RE: But the one thing that

Message 59482 in response to message 59481

Quote:
But the one thing that I can't understand is why someone at the project can't spare a couple of minutes, type a sentence or two advising when the project is expected to be back up and put it up on one of pages that can still be acessed. Keeping new users in the dark for hours on end when the project goes down is the best way to send them running to some other project as fast as they can d/l the the software. I guess I just don't understand the scientific mind.

Ford, you know how it goes: to get needed time, take expected time, multiply it by 2 and switch over to next larger time unit. With expected time of a couple of days, we really don't want to write down the result for those who don't know the law ;)

In the mean time, don't worry, happily crunch another project.

Metod ...

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.