Reason :
Due to performance misbehaviors on some of our application servers, new servers were instanciated and production services were moved to them.
New applications then all tried to start at the same time, causing disk congestion during application image extractions. Applications start up thus timed out and were rescheduled for restart by the application scheduler, which ended up disturbing the scheduler's global stability.
Action :