Outage in Netdata

Slow and failing Agent chart data responses

Resolved Minor
June 29, 2022 - Lasted about 23 hours
Components affected
Netdata Agent-Cloud Link (ACLK)

Details

Users with nightly versions of the Netdata Agent are experiencing slow responses between Cloud and Agent, resulting in failing or slow charts in their Cloud dashboards. We are investigating the issue.
Updates (most recent first)
MONITORING at 06/30/2022 08:03AM

For completeness, the affected versions are v1.35.0-84-nightly and v1.35.0-96-nightly. The latest, corrected version is v1.35.0-104-nightly.
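As a rough illustration, the version information in this update could be turned into a small check for an installed Agent. This is only a sketch: the function names are hypothetical, it assumes version strings follow the exact `vMAJOR.MINOR.PATCH-BUILD-nightly` pattern quoted in the update, and it treats only the two versions named there as affected, because the update does not say whether the builds in between were affected too.

```python
import re

# Versions named in the incident update; nothing else is assumed to be affected.
AFFECTED = {"v1.35.0-84-nightly", "v1.35.0-96-nightly"}
FIXED = "v1.35.0-104-nightly"

# Assumed format: vMAJOR.MINOR.PATCH-BUILD-nightly (as quoted in the update).
_NIGHTLY = re.compile(r"^v(\d+)\.(\d+)\.(\d+)-(\d+)-nightly$")


def parse_nightly(version):
    """Return (major, minor, patch, build) for a nightly version string."""
    match = _NIGHTLY.match(version.strip())
    if match is None:
        raise ValueError(f"not a recognised nightly version: {version!r}")
    return tuple(int(part) for part in match.groups())


def classify(version):
    """Classify a nightly version as 'affected', 'fixed', or 'unknown'."""
    version = version.strip()
    if version in AFFECTED:
        return "affected"
    if parse_nightly(version) >= parse_nightly(FIXED):
        return "fixed"
    # Older or in-between builds: the update gives no information about them.
    return "unknown"


if __name__ == "__main__":
    for v in ("v1.35.0-84-nightly", "v1.35.0-90-nightly", "v1.35.0-104-nightly"):
        print(v, "->", classify(v))
```

Builds that are neither listed as affected nor at or past the corrected version are reported as "unknown" rather than guessed at.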

MONITORING at 06/30/2022 07:20AM

The new nightly version of the Netdata Agent has been published and installed by a large portion of the agents that auto-update. We are monitoring the results.

IDENTIFIED at 06/29/2022 02:11PM

We have identified part of the cause of failing responses for alarm values. In yesterday's nightly build of the Agent, we enabled the use of the newer MQTT5 library by default. We will create another build to revert that. In the meantime, you can explicitly disable this library using the mqtt5 setting in your configuration, as described here: https://github.com/netdata/cloud-backend/issues/178. Additionally, the other latencies appear to be another instance of a known issue that causes responses with a small payload to be delayed. We are working on resolving this issue.

INVESTIGATING at 06/29/2022 08:54AM

Users with nightly versions of the Netdata Agent are experiencing slow responses between Cloud and Agent, resulting in failing or slow charts in their Cloud dashboards. We are investigating the issue.
