On November 4th 2021 around 10:00 Renkulab was subject to an outage that lasted around 20 minutes. During that time Renkulab was not accessible and all the user-sessions that were running at the time of the incident were lost.
The outage is the result of an automated process that acted upon merging some changes in the repository where the Renku team holds the configuration for all the clusters we manage. The automated process deleted the Renku deployment (for reasons that are still being investigated) thus rendering the service inaccessible.
The team immediately intervened to restore Renkulab and revert the rogue code and around 10:20 Renkulab was again available.
Precautions to prevent rogue code to disrupt our production cluster have already been taken.
The Renku team apologizes for any inconvenience experienced by our users.