IBM and Deepgram have partnered to integrate Deepgram’s speech-to-text and text-to-speech technology into IBM’s watsonx Orchestrate platform.
This makes Deepgram IBM’s first voice partner, providing real-time transcription and voice features for enterprise clients.
The integration is designed to improve how companies handle complex audio environments, including background noise, accents, and natural conversation.
It also supports a wide range of languages and regional dialects, including multiple Arabic and Indian variants. Users will gain access to real-time captioning, natural-sounding voices, and options for custom tuning.
These tools can be applied across sectors such as healthcare and finance, supporting automated customer care, call analysis, and voice-driven data entry.
Scott Stephenson, Deepgram CEO and co-founder, said, “Voice is rapidly becoming the default interface between humans and technology, and enterprise deployments require a real-time platform that is accurate, low latency, and reliable at scale.
“By embedding Deepgram inside watsonx Orchestrate Agent Builder, IBM clients can build voice agents and voice-enabled workflows on top of a real-time foundation that has been developed and refined over more than a decade.”
Nick Holda, vice president of AI Technology Partnerships at IBM, added, “Our watsonx Orchestrate integration powered by Deepgram APIs introduces new speech recognition and transcription capabilities to IBM clients, refining and modernizing their operations.
“This collaboration aims to help enterprise organizations accelerate their AI initiatives and reinforces IBM’s open ecosystem, bringing choice and cutting-edge voice technology to partners and customers.”
The partnership is expected to strengthen IBM’s ability to provide flexible voice solutions to enterprise clients while expanding Deepgram’s reach to new customers through a trusted platform.
Deepgram provides real-time speech-to-text, text-to-speech, and full speech-to-speech features through cloud or on-premises APIs.
It has processed over 50,000 years of audio and transcribed more than one trillion words. IBM, on the other hand, provides hybrid cloud, AI, and consulting solutions to clients in over 175 countries.




