Morak
Back to platforms
Voice Agent API
Developer
Speech-to-speech
Function calling
Infrastructure

Deepgram offers speech-to-text, text-to-speech, audio intelligence, and a Voice Agent API that combines STT, LLM orchestration, TTS, turn-taking, interruption handling, and function calling in a single real-time voice stack. It is a strong fit for developers and enterprises that want voice-agent infrastructure rather than a packaged receptionist.

Quick facts

Inbound calls

Yes

Outbound calls

Yes

Human handoff

Yes

Setup difficulty

Technical

Pricing model

$4.50/hour ($0.075/min) for Voice Agent API

Developer friendly

Yes

Main use cases

  • Customer Support
  • Appointment Scheduling
  • Lead Qualification
  • Order Taking

Supported industries

  • SaaS
  • Healthcare
  • Contact Centers
  • AI Products

Integrations

WebSocket API
Custom API
Function calling
Twilio
BYO LLM
BYO TTS

Voice and call capabilities

Inbound calls

Supported

Outbound calls

Supported

Human handoff

Supported

Data and permissions

Voice agents may talk to customers, access records, record calls, and trigger external actions.

Can read customer data
Can write or update external systems
Can send messages
Can trigger workflows
Requires API keys
Records calls
Human approval available
Review audio and transcript retention, API key handling, tool-call permissions, deployment mode, HIPAA/GDPR needs, regional data residency, and any self-hosted or VPC requirements.

Useful for

  • Developer teams
  • Real-time voice agents
  • Enterprise voice infrastructure
  • Teams wanting a unified voice stack

Not ideal for

  • Non-technical buyers wanting a turnkey receptionist
  • Teams needing built-in CRM workflows without engineering
  • Businesses that only need human answering

Implementation notes

Strong option when teams want a unified speech-to-speech API rather than stitching together STT, LLM, and TTS. Test latency, endpointing, interruption behavior, function calls, telephony integration, and fallback paths with real call audio.

Pricing model

Deepgram lists Voice Agent API pricing at $4.50/hour ($0.075/min) when using Deepgram's full stack, with built-in rate reductions when bringing your own models. Deepgram also offers managed, dedicated single-tenant, VPC, and self-hosted deployment options. Verify with vendor.