[Sciserver-users] Failed compute node
Jonas Haase
jhaase at mpe.mpg.de
Mon Jan 22 10:24:00 CET 2024
Dear All
Unfortunately there was no way to recover the failed drive, so I had to reinitialize it (with higher safety settings this time, knock on wood).
That means the containers previously running on sciserver-comp5 were lost - they should already have disappeared from your lists in Compute.
I hope this has not caused any undue trouble.
The compute node is back online.
cheers
Jonas
> On 17. Jan 2024, at 12:33, Jonas Haase <jhaase at mpe.mpg.de> wrote:
>
> Dear Sciserver users
>
> Unfortunately we had an issue with the compute node sciserver-comp5, where docker and the individual container processes had become unresponsive and refused to shut down cleanly.
> As a last resort I rebooted the machine. It has come back up, but unfortunately the virtual disk which holds the container information has become corrupted in the process.
>
> I will attempt to see if I can fix the disk, but if that does not work out I will have to replace it, which will lead to the loss of the containers which have been running on that machine.
> Your data stored on the Storage and Temporary volumes remains unaffected.
>
> I have turned the node off for the moment. You can still start new containers in the SciServerMPE-Large domain, which should then run on sciserver-comp7 instead.
>
> My apologies for the inconvenience
> Jonas
>
> —
> Jonas Haase
> Max Planck Institute for Extraterrestrial Physics (MPE)
> Giessenbachstr. 1, 85748 Garching, Germany
> X2 366
> +49 89 30000 3706
>
>
> --
> Sciserver-users mailing list
> Sciserver-users at lists.mpe.mpg.de
> https://lists.mpe.mpg.de/cgi-bin/mailman/listinfo/sciserver-users
—
Jonas Haase
Max Planck Institute for Extraterrestrial Physics (MPE)
Giessenbachstr. 1, 85748 Garching, Germany
X2 366
+49 89 30000 3706