BRP4 & FGRP1 download (server) problems

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4312
Credit: 250472527
RAC: 35169

We had some trouble with BRP4 workunit generation earlier today. The problem has been solved. It will take a few hours to build a buffer of unsent tasks, though. Currently all generated tasks are immediately sucked up by hungry clients.

BM

Edit: Sorry, the problem is only partially solved so far. We are still working on it.

BM

somanyroads
Joined: 6 Aug 08
Posts: 1
Credit: 541684044
RAC: 0

Einstein@home seems hugely popular. Why not give people a heads-up the few times there are problems with the equipment? It would save some of us a lot of troubleshooting time. Thanks.

The Xorcist
Joined: 16 Aug 11
Posts: 16
Credit: 464281554
RAC: 0

Is this project taking the same cosmic path as SETI@home?
Is this the distributed computing example of another black hole?

What a joke,
Justin Uva Donator

Grutte Pier [Wa Oars]~GP500
Joined: 18 May 09
Posts: 39
Credit: 6098013
RAC: 0

Seems the server thinks it has weekends off. I think not, sir!

Too bad there is no CUDA work.

Quote:

Is this project taking the same cosmic path as SETI@home?
Is this the distributed computing example of another black hole?

What a joke,
Justin Uva Donator

The goal of this project is realistic, so it's always better.

The Xorcist
Joined: 16 Aug 11
Posts: 16
Credit: 464281554
RAC: 0

Indeed,

If only there were realistic work units to crunch ;-)

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4312
Credit: 250472527
RAC: 35169

The download server has been working OK this weekend. We did have a problem with generating workunits for BRP4 (the other searches were unaffected). This problem has been solved for now, and we are generating and sending out BRP4 work again.

The workunit generation for BRP4 is a chain of various pieces of software running on a couple of machines. It had worked well and reliably since the end of September. The reboot of one of the machines on Friday morning then led to a chain of oddities and errors that resulted in no work being generated anymore.

A couple of errors in that chain still need some investigation in order to prevent this from happening again, but we won't do it today. It's Advent weekend after all, and most of the people involved (Ben, Carsten, Oliver, me) are spending these days with their families.

BM

telegd
Joined: 17 Apr 07
Posts: 91
Credit: 10212522
RAC: 0

Thanks very much for the update.

Quote:
So far it did work well and reliable since end of September.


Well, at least some of us understand that the occasional hardware issue has to be accepted gracefully. Thanks to all of you for your hard work.

MarkJ
Joined: 28 Feb 08
Posts: 437
Credit: 139002861
RAC: 0

Would it be an idea to have the FGRP work coming from a different download server? That could free up some bandwidth, relieve the BRP download server of some load, and remove the double point of failure (i.e. 2 download servers).

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4312
Credit: 250472527
RAC: 35169

The network / server load from FGRP1 is negligible. There is a single large file that should be downloaded only once per host for all workunits. The actual data files are just a few kB each and should also be used for many tasks.

What would make more sense would be to have two download servers for BRP4, each one fed by a single workunit generator. But currently we don't need that.

We are currently investigating different ways to encode (effectively compress) the BRP4 timeseries data, such that we need to ship fewer bytes per task. This should help both server and clients.
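
For readers wondering what "encoding" could mean in practice: nothing in this thread says which scheme is under investigation, so the following is only a generic sketch of one common idea, namely storing each sample in fewer bits before compressing. The synthetic signal, the 8-bit depth, and all function names are illustrative assumptions, not the project's data or code.

```python
import math
import struct
import zlib


def make_series(n=4096):
    """Synthetic smooth signal; a stand-in only, not real BRP4 data."""
    return [math.sin(i / 50.0) + 0.1 * math.sin(i / 7.0) for i in range(n)]


def encode_raw(samples):
    """Baseline: pack every sample as a 32-bit float, then zlib-compress."""
    return zlib.compress(struct.pack(f"<{len(samples)}f", *samples), 9)


def encode_quantized(samples):
    """Lossy sketch: map each sample to one of 256 levels (one byte), then zlib."""
    lo, hi = min(samples), max(samples)
    scale = 255.0 / (hi - lo) if hi > lo else 0.0
    codes = bytes(min(255, round((s - lo) * scale)) for s in samples)
    # The value range is stored as two doubles so the decoder can rescale.
    return struct.pack("<dd", lo, hi) + zlib.compress(codes, 9)


def decode_quantized(blob):
    """Invert encode_quantized, up to the quantization error."""
    lo, hi = struct.unpack("<dd", blob[:16])
    step = (hi - lo) / 255.0
    return [lo + code * step for code in zlib.decompress(blob[16:])]


if __name__ == "__main__":
    series = make_series()
    print("raw float32 + zlib:", len(encode_raw(series)), "bytes")
    print("8-bit quantized + zlib:", len(encode_quantized(series)), "bytes")
```

The trade-off is that the quantized form is lossy: each decoded sample can differ from the original by up to about half a quantization step, so whether such a scheme is acceptable depends on how much precision the search actually needs.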

BM
