Outage in Datadog US1

Multiple products impacted with data delays

Major
October 20, 2025 - Started about 16 hours ago
Official incident page

Incident Report

We are investigating increased latency processing APM, RUM, Log Management and Profiling. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well. We are working on bringing new capacity online and the data will be backfilled once the service is fully operational again.
Components affected
Datadog US1 APM Datadog US1 Logs

One place to monitor all your cloud vendors. Get instant alerts when an outage is detected.

Try IsDown risk-free 14-day free trial · No credit card required
Latest Updates ( sorted recent to last )
MONITORING about 3 hours ago - at 10/21/2025 01:32AM

We are seeing recovery across all of our products, and live data and monitor evaluations have resumed for all affected products. Most historical data in Logs has been backfilled and we have a small number of ongoing backfills in Metrics and other products. We will continue to monitor the situation overnight, and our next update will be 09:00 UTC.

IDENTIFIED about 4 hours ago - at 10/21/2025 12:25AM

We are seeing recovery for APM.
We continue to see delays in processing that impact the following products: Distribution Metrics, RUM, CCM, and Product Analytics. As a result of this issue, some users may see only a subset of their data when querying those products or viewing pages that rely on telemetry from those products.

IDENTIFIED about 6 hours ago - at 10/20/2025 10:41PM

Logs data have been backfilled, and users should no longer see gaps in their historical logs. Log Archives and Log Forwarding were paused between 15:00 and 18:30 UTC, and we are working to re-forward any logs from that time period.

We continue to see delays in processing that impact the following products: Distribution Metrics, APM, RUM, CCM, and Product Analytics. As a result of this issue, some users may see only a subset of their data when querying those products or viewing pages that rely on telemetry from those products.

IDENTIFIED about 6 hours ago - at 10/20/2025 10:40PM

We are seeing recovery in Profiling.

Logs data submitted after 21:30 UTC should be processed normally. Users may see gaps in historical logs prior to 21:30 UTC while our backfill is in progress.

In addition to Log Management we continue to see delays in processing that impacts the following products: Distribution Metrics, APM, RUM, CCM and Product Analytics. As a result of this issue, some users may see only a subset of their data when querying those products or viewing pages that rely on telemetry from those products.

IDENTIFIED about 7 hours ago - at 10/20/2025 09:47PM

We are seeing recovery in AWS Metrics. Logs data submitted after 21:30 UTC should be processed normally. Users may see gaps in historical logs prior to 21:30 UTC while our backfill is in progress.
In addition to Log Management we continue to see delays in processing that impacts the following products: Distribution Metrics, APM, RUM, Profiling, CCM and Product Analytics. As a result of this issue, some users may see only a subset of their data when querying those products or viewing pages that rely on telemetry from those products.

IDENTIFIED about 8 hours ago - at 10/20/2025 08:14PM

We are seeing progress in telemetry data coming from AWS into Datadog. We are starting to see our capacity requests being fulfilled more slowly than usual. App Builder and Workflow Automation are seeing recovery.
Our processing is still delayed impacting multiple products - Distribution Metrics, APM, RUM, Log Management, Profiling, CCM and Product Analytics data is still delayed. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well.

IDENTIFIED about 10 hours ago - at 10/20/2025 07:01PM

We are seeing progress in telemetry data coming from AWS into Datadog. Also, we are starting to see our capacity requests being fulfilled.
Our processing is still delayed impacting multiple products - Distribution Metrics, APM, RUM, Log Management, Profiling, CCM and Product Analytics data is still delayed. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well.
App Builder and Workflow Automation are also experiencing elevated errors, as a result customers might not be to query applications and workflows might take longer to execute.

IDENTIFIED about 10 hours ago - at 10/20/2025 06:04PM

APM, RUM, Log Management, Profiling, CCM and Product Analytics data is still delayed. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well.
We are working on bringing new capacity online and for all products except RUM we expect the data will be backfilled once the service is fully operational again.
App Builder and Workflow Automation are also experiencing elevated errors, as a result customers might not be to query applications and workflows might take longer to execute.
Due to upstream provider issues, we are also continuing to see unavailability of telemetry data coming from AWS into Datadog.

IDENTIFIED about 11 hours ago - at 10/20/2025 05:05PM

APM, RUM, Log Management, Profiling, CCM and Product Analytics data is still delayed. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well.
We are working on bringing new capacity online and for all products except RUM we expect the data will be backfilled once the service is fully operational again.
App Builder and Workflow Automation are also experiencing elevated errors, as a result customers might not be to query applications and workflows might take longer to execute.
Due to upstream provider issues, we are also continuing to see unavailability of telemetry data coming from AWS into Datadog.

IDENTIFIED about 13 hours ago - at 10/20/2025 03:18PM

We are still seeing increased latency processing for those products and the associated monitors are delayed.
We are continuing to work on bringing new capacity online and will continue to provide updates on this issue.

IDENTIFIED about 14 hours ago - at 10/20/2025 02:07PM

We are investigating increased latency processing APM, RUM, Log Management and Profiling. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well.
Monitors using the impacted data are delayed.
We are working on bringing new capacity online and will provide an update once the service is fully operational again.

IDENTIFIED about 16 hours ago - at 10/20/2025 12:45PM

We are investigating increased latency processing APM, RUM, Log Management and Profiling. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well.
We are working on bringing new capacity online and the data will be backfilled once the service is fully operational again.

The Status Page Aggregator Built for IT Teams

With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.

Start free trial

No credit card required · Cancel anytime · 4522 services available

Integrations with Slack Microsoft Teams Google Chat Datadog PagerDuty Zapier Discord Webhook