Skip to content

ConvoStack

A real-time voice agent platform powered by Deepgram for speech-to-text and text-to-speech. Built with a FastAPI backend handling voice stream processing and a React frontend for live conversation UI. Features low-latency bidirectional audio streaming, intelligent turn-taking, and LLM-driven conversational responses.

Key Metrics

Full-Stack

Architecture

end-to-end

Deepgram

Voice Engine

real-time STT/TTS

Highlights

  • Deepgram-powered real-time speech-to-text and text-to-speech pipeline
  • Low-latency bidirectional audio streaming via WebSocket
  • LLM-driven conversational logic with intelligent turn-taking

Technologies

PythonFastAPIDeepgramReactTypeScriptDocker