Outage in Datadog US1

Multiple products impacted with data delays

Resolved Major
October 20, 2025 - Started 24 days ago - Lasted 2 days
Official incident page

Incident Report

We are investigating increased latency processing APM, RUM, Log Management and Profiling. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well. We are working on bringing new capacity online and the data will be backfilled once the service is fully operational again.

Need to monitor Datadog US1 outages?

One place to monitor all your cloud vendors. Get instant alerts when an outage is detected.

Latest Updates ( sorted recent to last )
RESOLVED 22 days ago - at 10/22/2025 02:40PM

Backfills for Metrics and Log Management data have completed. All systems are back to normal.

MONITORING 22 days ago - at 10/22/2025 09:10AM

We are making progress on outstanding backfills. Metrics and Logs backfills are still in progress.
For products still undergoing backfilling, queries that include data from the backfilled windows may appear incomplete for the affected subset of customers.
We will provide next update no later than Oct 22, 16:00 UTC.

MONITORING 23 days ago - at 10/21/2025 09:35PM

We are making progress on outstanding backfills. Cloud Cost Monitoring backfill is complete. Metrics and Logs backfills are still in progress.
For products still undergoing backfilling, queries that include data from the backfilled windows may appear incomplete for the affected subset of customers.
We will provide next update no later than Oct 22, 10:00 UTC.

MONITORING 23 days ago - at 10/21/2025 06:04PM

We are continuing the work on outstanding backfills which are not yet fully complete, during this process queries that include data from the backfilled windows may appear incomplete for the affected subset of customers and products.
We will resolve the incident when the backfills are complete or before Oct 21, 22:00 UTC.

MONITORING 23 days ago - at 10/21/2025 10:20AM

All products have been stable since the last update. We are continuing the work on outstanding backfills, during this process queries that include data from the backfilled windows may appear incomplete for the affected subset of customers and products.
We will resolve the incident when the backfills are complete or before Oct 21, 16:00 UTC.

MONITORING 23 days ago - at 10/21/2025 01:32AM

We are seeing recovery across all of our products, and live data and monitor evaluations have resumed for all affected products. Most historical data in Logs has been backfilled and we have a small number of ongoing backfills in Metrics and other products. We will continue to monitor the situation overnight, and our next update will be 09:00 UTC.

IDENTIFIED 24 days ago - at 10/21/2025 12:25AM

We are seeing recovery for APM.
We continue to see delays in processing that impact the following products: Distribution Metrics, RUM, CCM, and Product Analytics. As a result of this issue, some users may see only a subset of their data when querying those products or viewing pages that rely on telemetry from those products.

IDENTIFIED 24 days ago - at 10/20/2025 10:41PM

Logs data have been backfilled, and users should no longer see gaps in their historical logs. Log Archives and Log Forwarding were paused between 15:00 and 18:30 UTC, and we are working to re-forward any logs from that time period.

We continue to see delays in processing that impact the following products: Distribution Metrics, APM, RUM, CCM, and Product Analytics. As a result of this issue, some users may see only a subset of their data when querying those products or viewing pages that rely on telemetry from those products.

IDENTIFIED 24 days ago - at 10/20/2025 10:40PM

We are seeing recovery in Profiling.

Logs data submitted after 21:30 UTC should be processed normally. Users may see gaps in historical logs prior to 21:30 UTC while our backfill is in progress.

In addition to Log Management we continue to see delays in processing that impacts the following products: Distribution Metrics, APM, RUM, CCM and Product Analytics. As a result of this issue, some users may see only a subset of their data when querying those products or viewing pages that rely on telemetry from those products.

IDENTIFIED 24 days ago - at 10/20/2025 09:47PM

We are seeing recovery in AWS Metrics. Logs data submitted after 21:30 UTC should be processed normally. Users may see gaps in historical logs prior to 21:30 UTC while our backfill is in progress.
In addition to Log Management we continue to see delays in processing that impacts the following products: Distribution Metrics, APM, RUM, Profiling, CCM and Product Analytics. As a result of this issue, some users may see only a subset of their data when querying those products or viewing pages that rely on telemetry from those products.

IDENTIFIED 24 days ago - at 10/20/2025 08:14PM

We are seeing progress in telemetry data coming from AWS into Datadog. We are starting to see our capacity requests being fulfilled more slowly than usual. App Builder and Workflow Automation are seeing recovery.
Our processing is still delayed impacting multiple products - Distribution Metrics, APM, RUM, Log Management, Profiling, CCM and Product Analytics data is still delayed. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well.

IDENTIFIED 24 days ago - at 10/20/2025 07:01PM

We are seeing progress in telemetry data coming from AWS into Datadog. Also, we are starting to see our capacity requests being fulfilled.
Our processing is still delayed impacting multiple products - Distribution Metrics, APM, RUM, Log Management, Profiling, CCM and Product Analytics data is still delayed. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well.
App Builder and Workflow Automation are also experiencing elevated errors, as a result customers might not be to query applications and workflows might take longer to execute.

IDENTIFIED 24 days ago - at 10/20/2025 06:04PM

APM, RUM, Log Management, Profiling, CCM and Product Analytics data is still delayed. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well.
We are working on bringing new capacity online and for all products except RUM we expect the data will be backfilled once the service is fully operational again.
App Builder and Workflow Automation are also experiencing elevated errors, as a result customers might not be to query applications and workflows might take longer to execute.
Due to upstream provider issues, we are also continuing to see unavailability of telemetry data coming from AWS into Datadog.

IDENTIFIED 24 days ago - at 10/20/2025 05:05PM

APM, RUM, Log Management, Profiling, CCM and Product Analytics data is still delayed. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well.
We are working on bringing new capacity online and for all products except RUM we expect the data will be backfilled once the service is fully operational again.
App Builder and Workflow Automation are also experiencing elevated errors, as a result customers might not be to query applications and workflows might take longer to execute.
Due to upstream provider issues, we are also continuing to see unavailability of telemetry data coming from AWS into Datadog.

IDENTIFIED 24 days ago - at 10/20/2025 03:18PM

We are still seeing increased latency processing for those products and the associated monitors are delayed.
We are continuing to work on bringing new capacity online and will continue to provide updates on this issue.

IDENTIFIED 24 days ago - at 10/20/2025 02:07PM

We are investigating increased latency processing APM, RUM, Log Management and Profiling. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well.
Monitors using the impacted data are delayed.
We are working on bringing new capacity online and will provide an update once the service is fully operational again.

IDENTIFIED 24 days ago - at 10/20/2025 12:45PM

We are investigating increased latency processing APM, RUM, Log Management and Profiling. As a result of this issue, some users may see only a subset of their data when querying those different products, other product pages using the same underlying product data will be impacted as well.
We are working on bringing new capacity online and the data will be backfilled once the service is fully operational again.

The Status Page Aggregator Built for IT Teams

With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.

Start free trial

No credit card required · Cancel anytime · 4600 services available

Integrations with Slack Microsoft Teams Google Chat Datadog PagerDuty Zapier Discord Webhook