Major Platform Outages Hit Netlify, Vercel, Cloudflare in Early May
What happened
Three major deployment platforms experienced significant service disruptions in late April and early May 2024. Netlify suffered Agent Runner failures affecting builds on newly created projects on May 5. Vercel encountered multiple incidents including workflow errors, runtime log access issues, degraded observability alerts, and delays in analytics data from April 19-20 and May 5-6. Cloudflare experienced R2 custom domain addition errors and delayed network analytics on May 5, with some incidents lasting several hours before resolution.
Business impact
Background
Netlify, Vercel, and Cloudflare serve as critical infrastructure for modern web deployment and content delivery, supporting thousands of enterprise websites. The concentration of incidents across multiple major platforms within a two-week window highlights the interconnected risks in cloud-native deployment strategies. Many enterprise teams now depend on these services for both staging and production environments, making simultaneous outages particularly disruptive.
What this means for your team
What to watch
Monitor the platforms' post-incident reports for root cause analysis and prevention measures. Track whether these providers implement additional redundancy or change their incident communication procedures following this cluster of outages.
Sources
-
Agent Runner failures on new projects
Netlify Status
-
Delays Loading Observability, Analytics and Speed Insights
Vercel Status
-
Unaccessible runtime logs in the dashboard
Vercel Status
-
Observability Alerts degraded
Vercel Status
-
Increased error rate in Workflows
Vercel Status
-
Errors adding R2 custom domains
Cloudflare Status
-
Delayed Network Analytics and Alerting
Cloudflare Status
-
Slower build start time in Workers Builds
Cloudflare Status
-
Workers API Issue
Cloudflare Status
-
Partial degradation for code scanning default setup and for code quality
GitHub Status