Incident Timeline (UTC)
- 14:10 UTCFirst anomaly detected: timeout ratio jumped from 0.7% baseline to 6.1% within a single check interval.
- 14:18 UTCConfirmed broad impact on model inference endpoints. Degraded
- 14:32 UTC5xx burst observed in two regions. Automatic fallback routing policy activated.
- 15:05 UTCProvider-side partial recovery announced. Error rates start trending downward.
- 15:42 UTCLatency returns near normal for most requests, but residual timeout pockets remain.
- 16:05 UTCStability threshold restored for 3 consecutive checks. Incident moved to resolved state.
