Deepgram Models Comparison

Model    | Features                 | Keywords | Keyterms | Best For
Nova-3   | Latest, keyterms support | No       | Yes      | English calls, best accuracy
Nova-2   | Keywords support         | Yes      | No       | Multi-language, reliable
Nova     | Keywords support         | Yes      | No       | Balanced performance
Enhanced | Keywords support         | Yes      | No       | Legacy support
Base     | Keywords support         | Yes      | No       | Basic transcription
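
As a rough illustration of the keywords/keyterms split in the table above, the sketch below picks the vocabulary-hint parameter that matches the chosen model. The lowercase model identifiers and the query parameter names `keywords` and `keyterm` are assumptions about how the request is sent to Deepgram, and `build_listen_query` is a hypothetical helper, not part of any SDK; check them against your own integration.

```python
from urllib.parse import urlencode

# Assumed model identifiers, grouped by the hint type the table lists for each.
KEYTERM_MODELS = {"nova-3"}
KEYWORD_MODELS = {"nova-2", "nova", "enhanced", "base"}


def build_listen_query(model: str, terms: list[str]) -> str:
    """Build a query string that sends vocabulary hints via the
    parameter the chosen model supports (hypothetical helper)."""
    if model in KEYTERM_MODELS:
        hint_param = "keyterm"
    elif model in KEYWORD_MODELS:
        hint_param = "keywords"
    else:
        raise ValueError(f"unknown model: {model}")
    # One (name, value) pair per term, so each term is repeated as its own parameter.
    params = [("model", model)] + [(hint_param, term) for term in terms]
    return urlencode(params)


print(build_listen_query("nova-3", ["Acme Dental", "copay"]))
print(build_listen_query("nova-2", ["copay"]))
```

Keeping the model-to-parameter mapping in one place means switching models later does not silently drop your business terminology.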

Key Settings


Audio Denoising

When to Enable:
  • Noisy environments (restaurants, offices, outdoors)
  • Poor phone connections
  • Background music or chatter
Trade-offs:
  • Adds a small amount of latency (roughly 50-100 ms)
  • Improves transcription accuracy in noisy conditions
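
One way to make this trade-off explicit is a small decision helper, sketched below. `CallProfile`, `latency_budget_ms`, and `should_enable_denoising` are hypothetical names introduced for illustration; the 100 ms constant is simply the upper end of the latency range quoted above.

```python
from dataclasses import dataclass

# Worst-case latency cost of denoising, taken from the ~50-100 ms range above.
DENOISE_LATENCY_MS = 100


@dataclass
class CallProfile:
    noisy_environment: bool   # restaurants, offices, outdoors
    poor_connection: bool     # poor phone connections
    background_audio: bool    # background music or chatter
    latency_budget_ms: int    # hypothetical: extra latency you can tolerate


def should_enable_denoising(profile: CallProfile) -> bool:
    """Enable denoising only when noise is expected and the latency cost fits the budget."""
    noise_expected = (
        profile.noisy_environment
        or profile.poor_connection
        or profile.background_audio
    )
    return noise_expected and profile.latency_budget_ms >= DENOISE_LATENCY_MS


restaurant_line = CallProfile(True, False, True, latency_budget_ms=150)
print(should_enable_denoising(restaurant_line))
```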

Troubleshooting


Best Practices

  • Start with defaults and adjust based on testing
  • Test with real calls in your target environment
  • Use keywords (or keyterms on Nova-3) for your business-specific terminology
  • Enable denoising if you expect background noise
  • Monitor call quality and adjust timing as needed

How STT Works with Call Management

🔗 STT + Call Management = Natural Conversations

STT Settings control when Deepgram detects that speech has ended. Call Management Settings control how your AI responds to that detected speech. Both must work together for natural conversation flow!
The Flow:
  1. STT detects speech using your timing settings (silence threshold, VAD, etc.)
  2. Call Management decides response using interruption and timeout settings
  3. Result: natural conversation when the two are tuned together, awkward pauses or cut-offs when they are not
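
To make step 1 concrete, here is a minimal sketch of silence-based end-of-speech detection. It is not Deepgram's actual algorithm: `end_of_speech_ms` is a hypothetical function, the per-frame speech flags stand in for VAD output, and the frame size and thresholds are illustrative values in milliseconds.

```python
def end_of_speech_ms(vad_frames, frame_ms, min_silence_ms):
    """Return the time (ms) at which silence following speech first lasts
    min_silence_ms, or None if that never happens."""
    heard_speech = False
    silence_ms = 0
    for i, is_speech in enumerate(vad_frames):
        if is_speech:
            heard_speech = True
            silence_ms = 0            # any speech resets the silence run
        elif heard_speech:
            silence_ms += frame_ms
            if silence_ms >= min_silence_ms:
                return (i + 1) * frame_ms
    return None


# A caller speaks for 1 s, pauses 300 ms mid-sentence, speaks 500 ms, then stops.
frames = [True] * 10 + [False] * 3 + [True] * 5 + [False] * 10

# A short threshold fires during the mid-sentence pause (risk of cutting the caller off).
print(end_of_speech_ms(frames, frame_ms=100, min_silence_ms=200))
# A longer threshold waits for the real end of the utterance (at the cost of a longer pause).
print(end_of_speech_ms(frames, frame_ms=100, min_silence_ms=500))
```

The same audio produces a different end-of-speech time depending on the threshold, which is why the Call Management settings in step 2 need to be tuned alongside it.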
Key Relationships:
  • STT min_silence_duration (internal timeout) should be longer than Call Management interruption_cooldown
  • Lower STT endpointing (more responsive) works well with lower Call Management interruption_threshold
  • Higher STT timing settings pair well with patient Call Management idle_timeout
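
The first relationship is a hard ordering constraint, so it can be checked automatically before a configuration goes live. The sketch below assumes both settings are plain numbers in the same unit (milliseconds here) and are held in dictionaries keyed by the setting names above; `check_timing` is a hypothetical helper. The other two relationships are qualitative pairings and are left to testing with real calls.

```python
def check_timing(stt: dict, call_mgmt: dict) -> list[str]:
    """Return warnings for STT / Call Management timing settings that conflict.

    Assumes both values use the same unit (e.g. milliseconds).
    """
    warnings = []
    min_silence = stt["min_silence_duration"]
    cooldown = call_mgmt["interruption_cooldown"]
    # STT min_silence_duration should be longer than interruption_cooldown.
    if min_silence <= cooldown:
        warnings.append(
            f"min_silence_duration ({min_silence}) should be longer than "
            f"interruption_cooldown ({cooldown})"
        )
    return warnings


# Illustrative values only, not recommended defaults.
print(check_timing({"min_silence_duration": 300}, {"interruption_cooldown": 500}))
print(check_timing({"min_silence_duration": 800}, {"interruption_cooldown": 500}))
```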