Previous incidents
Seeing Higher Error Rates For Some Requests
Resolved Nov 21 at 01:18pm CST
Pushed a fix here, things should be smoother now.
1 previous update
US East Region Facing Unprecedented Traffic
Resolved Nov 19 at 05:38am CST
We've pushed a scaling fix and traffic is being balanced now. Will continue to monitor for any outliers.
1 previous update
Downstream Cloudflare Errors
Resolved Nov 18 at 10:10am CST
These specific issues are resolved.
1 previous update
Some of our servers in Washington are not handling requests well
Resolved Nov 17 at 04:01pm CST
We have pushed up some further mitigations and are working with our infra provider to ensure a complete resolution to this problem. We will be posting some more updates here when that is in place.
2 previous updates
Traffic in US East is timing out
Resolved Nov 11 at 04:20pm CST
We've pushed a fix here and have been monitoring it for the last 20 minutes or so -- it seems like traffic is properly routing for now. We'll share any additional updates here.
1 previous update
Live streaming issues
Resolved Oct 20 at 12:40pm CST
We have patched the majority of the issues here. There are still lingering network related issues here (especially networks where NAT traversal is limited).
We'll push some updates there soon.
1 previous update
Some sessions are not successfully being created/released under maintenance
Resolved Oct 17 at 01:26pm CST
Things are stabilizing and seem to be back on track across the board -- we're still keeping an eye on things and will notify if things change.
1 previous update
Elevated error rates from high usage
Resolved Sep 19 at 07:22pm CST
We've been monitoring the situation for quite a while now and it seems like session acquisition + launching is back to being stable. We'll follow up with an incident report with more details.
2 previous updates
Facing some blowback from earlier
Resolved Sep 16 at 07:55pm CST
Last artifact has been cleaned up! We should be smooth sailing from now on.
1 previous update
Elevated error rates due to some backpressure
Resolved Sep 16 at 02:48pm CST
Overview
The Steel Sessions API entered a degraded state with a higher error rate due to intermittent failures in the session acquisition and reservation logic. The switching logic responsible for managing session lifecycles exhibited unpredictable behavior, resulting in users being unable to establish or maintain browser sessions. This was made prevalent to us due to a large amount of requests/backpressure which revealed this specific issue.
Timeline
- 6:03 PM UTC - Initial increase...
1 previous update
Region Selection Failures
Resolved Sep 09 at 11:08am CST
This should be resolved.
1 previous update
Network issues in BOM
Resolved Sep 08 at 07:05am CST
We have pushed changes to reflect the situation in Mumbai and we're no longer routing there. All services should be back online.
1 previous update