Previous incidents
Elevated error rates from high usage
Resolved Sep 19 at 07:22pm CST
We've been monitoring the situation for quite a while now and it seems like session acquisition + launching is back to being stable. We'll follow up with an incident report with more details.
2 previous updates
Facing some blowback from earlier
Resolved Sep 16 at 07:55pm CST
Last artifact has been cleaned up! We should be smooth sailing from now on.
1 previous update
Elevated error rates due to some backpressure
Resolved Sep 16 at 02:48pm CST
Overview
The Steel Sessions API entered a degraded state with a higher error rate due to intermittent failures in the session acquisition and reservation logic. The switching logic responsible for managing session lifecycles exhibited unpredictable behavior, resulting in users being unable to establish or maintain browser sessions. This was made prevalent to us due to a large amount of requests/backpressure which revealed this specific issue.
Timeline
- 6:03 PM UTC - Initial increase...
1 previous update
Region Selection Failures
Resolved Sep 09 at 11:08am CST
This should be resolved.
1 previous update
Network issues in BOM
Resolved Sep 08 at 07:05am CST
We have pushed changes to reflect the situation in Mumbai and we're no longer routing there. All services should be back online.
1 previous update