Deepgram's Voice Agent API experienced elevated error rates and latency when NVIDIA Llama Nemotron Super 49B was used as the managed LLM. The incident lasted 76.5 hours and affected users relying on this model for voice agent functionality. As a workaround, Deepgram recommended configuring multiple LLM providers to avoid downtime while the issue persisted.
We are seeing elevated error rates and latency when using NVIDIA Llama Nemotron Super 49B (llama-nemotron-super-49B) as the managed LLM in Voice Agent API. To avoid downtime, please define multiple LLM providers (https://developers.deepgram.com/docs/voice-agent-llm-models#using-multiple-llm-providers) in your Voice Agent configuration.
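The benefit of defining more than one provider can be illustrated with a minimal sketch of ordered fallback. This is not Deepgram's implementation or configuration schema (the linked documentation is the authority on how to declare providers); `complete_with_fallback`, `fake_call`, and the `backup-model` name are hypothetical and exist only to show the idea of trying the next provider when the first one fails.

```python
from typing import Callable, Sequence


class AllProvidersFailed(Exception):
    """Raised when every provider in the list has failed."""


def complete_with_fallback(
    providers: Sequence[str],
    call_provider: Callable[[str, str], str],
    prompt: str,
) -> tuple[str, str]:
    """Try each provider in order and return (provider_name, reply)."""
    errors: list[str] = []
    for name in providers:
        try:
            return name, call_provider(name, prompt)
        except Exception as exc:  # a real client would catch narrower errors
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors))


# Hypothetical stand-in for a real LLM client: the degraded model always errors.
def fake_call(name: str, prompt: str) -> str:
    if name == "llama-nemotron-super-49B":
        raise TimeoutError("elevated latency")
    return f"[{name}] reply to: {prompt}"


used, reply = complete_with_fallback(
    ["llama-nemotron-super-49B", "backup-model"], fake_call, "Hello"
)
print(used)
```

With a single provider in the list, the same failure would surface as `AllProvidersFailed`, which is the downtime the workaround is meant to avoid.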