All of the tasks I have been assigned over the past few days have had the wingman be an ATLAS node. I've noticed that both systems have problems, one with the "no heartbeat from core client" issue, the other with a checkpointing problem.
Copyright © 2024 Einstein@Home. All rights reserved.
ATLAS issues
)
We had some problems on ATLAS since the failure of the cooling unit on Sunday. Currently Einstein@home is not running on ATLAS and will be brought back up slowly during the next days.
BM
BM
RE: We had some problems on
)
Thanks Bernd... Merry Christmas...
How're the older
)
How're the older merlin/morgaine clusters doing now?
RE: How're the older
)
Steffen Grunewald
RE: RE: How're the older
)
that's not really what i was asking about. Last spring there was an article linked that said the oldedr array was starting to lose blades on a semi regular basis. I was wondering how many blades were still left and if the failures had spread to the second cluster yet.
RE: RE: RE: How're the
)
Hmmm... Dunno that, obviously, but it looks like that account is doing fine, whatever that means... :-)
I think I can remember the
)
I think I can remember the story about dying servers, and I think it affected the older Merlin cluster (> 5 years old), which is made from Dual Athlon MP systems (K7 architecture) in desktop cases. The newer Morgane Cluster is made from K8 Opterons in 19" cases IIRC and should not yet show that many signs of aging.
CU
Bikeman
Well, I'm just glad ATLAS got
)
Well, I'm just glad ATLAS got unbusy enough to come back and help clear out the template frequency my poor little T2400 had been trying to plow through virtually alone since the middle of October! :-D
It finally moved up 0.05 MHz with todays task download. ;-)
Of course, ATLAS probably is one of my wingmen (didn't look to see for sure, and you might not get many others after he pops into the picture) in this template as well. Most likely, I'll get left holding the bag when something more interesting/urgent comes along for him to do. :-)
Alinator
RE: I think I can remember
)
After the last power outage @AEI Potsdam a few weeks ago Merlin was no longer reactivated. It is actually dead now. I think it had less than 50 nodes left running of its original 180.
Morgane is running well, about half a dozen (of 615) nodes are down for hardware failures, that's all.
[edit]Looks like most of the failed nodes have been repaired, only one seems to be down. Learn more about the AEI clusters at gw.aei.mpg.de.[/edit]
BM
BM
Thanks Bernd.
)
Thanks Bernd.