Postmortem -
Read details
Nov 26, 14:24 UTC
Resolved -
All metrics have returned to baseline. We are marking this incident as resolved. We will publish a Root Cause Analysis here shortly.
Nov 26, 13:46 UTC
Monitoring -
The Infrastructure Team has deployed the fix, and we are seeing improvements in availability. We are monitoring the situation until all metrics return to expected baselines.
Nov 26, 13:28 UTC
Identified -
We have identified a connectivity issue preventing communication to the inferencing backends. The Infrastructure Team is currently working on a fix. We will provide the next update by 13:45 UTC.
Nov 26, 13:15 UTC
Update -
We are updating the impact to outage. The AI Model Hub Team is currently investigating and is working to restore the service as quickly as possible. We will post the next update here 13:15 UTC latest.
Nov 26, 12:43 UTC
Update -
We are continuing to investigate this issue.
Nov 26, 12:40 UTC
Investigating -
We are currently investigating increased error rate in our AI Model Hub.
Nov 26, 12:40 UTC