AWS Outage: A Single Cloud Region Shouldn’t Take Down the World. But It Did.

TL;DR

A major AWS outage disrupted high-profile services like Amazon, Snapchat, and Disney+, affecting over 70 AWS services and causing widespread operational issues.

Key Points

Highlight key points with color coding based on sentiment (positive, neutral, negative).

The AWS outage was primarily due to an "operational issue" related to DNS resolution of the DynamoDB API endpoint in the US-EAST-1 region.

The outage affected over 70 AWS services, disrupting major websites and applications such as Amazon, Snapchat, Disney+, Reddit, and Canva.

AWS reported signs of recovery shortly after the incident, although some services continued to experience issues.

The incident underscores the risks associated with the heavy reliance on a few major cloud service providers.

Past similar outages included those that brought down Facebook, Instagram, and WhatsApp due to configuration errors, and Google services because of internal storage failures.

Key Numbers

Present key numerics and statistics in a minimalist format.

The number of affected AWS services

+3000

Reports made by Reddit users

+11 Million

Estimated reports by Downdetector

Stakeholder Relationships

An interactive diagram mapping entities directly or indirectly involved in this news. Drag nodes to rearrange them and see relationship details.

Organizations

Key entities and stakeholders, categorized for clarity: people, organizations, tools, events, regulatory bodies, and industries.

Amazon Web Services (AWS) Cloud Service Provider

AWS was responsible for managing and resolving the major outage that disrupted numerous high-profile websites and services.

Amazon E-commerce and Cloud Computing Company

Amazon was one of the high-profile companies affected by the AWS outage, impacting its service delivery.

Snapchat Social Media Platform

Snapchat experienced service disruptions due to the AWS outage, affecting its user access.

Disney+ Streaming Service

Disney+ was affected by the AWS outage, disrupting its streaming services.

Reddit Social News Aggregation and Discussion Website

Reddit faced service disruptions as a result of the AWS outage, impacting its platform availability.

Canva Graphic Design Platform

Canva was one of the services disrupted by the AWS outage, affecting its design tools availability.

Coinbase Cryptocurrency Exchange

Coinbase experienced significant disruptions due to the AWS outage, impacting its trading services.

Instacart Grocery Delivery & Pickup

Instacart experienced disruptions caused by the outage

Events

Key entities and stakeholders, categorized for clarity: people, organizations, tools, events, regulatory bodies, and industries.

AWS Outage Service Disruption Event

The AWS outage was a major event that disrupted over 70 AWS services, affecting numerous high-profile websites and platforms.

Timeline of Events

Timeline of key events and milestones.

2025-10-20 ~ 07:00AM UTC AWS reported operational issue

AWS reported an "operational issue" affecting multiple services and began working on recovery.

2025-10-20 ~ 10:00AM UTC AWS confirmed recovery of global services

AWS confirmed that global services and features relying on US-EAST-1 had recovered.

2025-10-20 ~ 10:00AM UTC Reddit experienced user-reported problems

Reddit experienced a spike in user-reported problems, despite other services recovering.

2025-10-20 10:00AM UTC AWS identified potential root cause

AWS identified a potential root cause for error rates in the US-EAST-1 Region.

2025-10-20 10:00AM UTC Ring doorbells connectivity issues reported

Ring doorbells were reported to be experiencing connectivity issues.

2025-10-20 Downdetector reported global issue reports

Downdetector reported over four million global issue reports, with more than 500 companies affected.

2025-10-20 ~ 10:00AM UTC AWS announced significant signs of recovery

AWS announced significant signs of recovery, with most requests succeeding.

2025-10-20 Post Office, HMRC and other companies reported issues with their services

2025-10-20 Reports of websites and apps down (downdetector)

Reports indicated that many websites and apps were down due to the AWS outage.

2025-10-20T ~ 11:30AM UTC AWS stated most services were recovering

AWS stated that most of its services were recovering.

2025-10-20T ~ 11:00AM UTC AWS mentioned DNS resolution issue

AWS mentioned the issue might be related to DNS resolution of the DynamoDB API endpoint in US-EAST-1.

2025-10-20T ~ 12:00AM UTC AWS announced underlying issue fixed

AWS announced the underlying issue was fixed, but some issues persisted.

2025-10-20 Reddit continued to face problems according to the platform users

Reddit continued to face problems as other services recovered.

2025-10-20 Latest report by Downdetector

+2,500 companies impacted

2025-10-20 5:03 PM UTC AWS reported ongoing mitigation efforts to restore network load balancer health and recover connectivity across most AWS services.

The company noted that Lambda continued to experience function invocation errors due to an internal subsystem affected by network load balancer health checks and confirmed that recovery steps were underway for that subsystem. Regarding EC2 instance launch failures, AWS stated it was validating a fix and planned to deploy it to the first Availability Zone once it was deemed safe.

2025-10-20 6:22 PM UTC AWS reported ongoing progress in mitigating launch failures for new EC2 instances, with an increase in successful launches and a decline in network connectivity issues within the US-EAST-1 region.

The company also noted significant improvements in Lambda invocation errors, particularly when creating new execution environments, including those for Lambda @Edge. AWS stated that a further update would be provided by 7:00 PM UTC.

2025-10-20 7:15 PM UTC AWS reported continued recovery across all services, noting that EC2 instance launches were succeeding across multiple Availability Zones in the US-EAST-1 region.

The company stated that some Lambda users might still experience intermittent function errors when making network requests to other services, due to lingering connectivity issues. To mitigate Lambda invocation errors, AWS had previously reduced the rate of SQS polling through Lambda Event Source Mappings and is now gradually increasing the polling rate as invocation success improves.

A significant outage of Amazon Web Services (AWS) disrupted numerous high-profile websites and services, including Amazon, Snapchat, Chime, Instacart, Disney+, Reddit, Roblox and Canva. The outage was attributed to an operational issue affecting over 70 AWS services, causing widespread disruptions in cloud-based games and crypto exchanges like Coinbase. AWS reported signs of recovery shortly after the incident, but some services, such as Reddit, continued to experience issues.

The outage also impacted government websites and various banking and exchange services, including Coinbase, Robinhood and Bank of Scotland, leading to declined card transactions and inaccessible online banking.

AWS identified the issue as related to DNS resolution in the US-EAST-1 region and worked on multiple paths to accelerate recovery. Despite significant recovery signs, some services faced delays due to a backlog of queued requests.

More than 2,500 companies were impacted by the outage. The incident highlighted the vulnerability of relying on a few major cloud service providers, as disruptions can have extensive ripple effects across numerous platforms.