Another huge week: Krisp, OpenAI, Google, Groq, Observe, PlayAI and more
Voice AI weekly digest
Top Updates 💪
Krisp is using AI to give people American-sounding accents (The Verge)
OpenAI’s AI voice assistant is now better to chat with (TechCrunch)
Alibaba debuts AI model that processes video, audio on phones (Bloomberg)
Groq and PlayAI just made voice AI sound way more human (VentureBeat)
Google lets users generate AI podcasts from Gemini’s Deep Research (The Verge)
Observe.AI launches VoiceAI agents for automated, natural support (VentureBeat)
OpenAI let’s you adds speech to text apps in seconds (VentureBeat)
Pricing: OpenAI vs ElevenLabs vs DeepGram for TTS and STT (YouTube)
Otter’s new AI agent can speak up in meetings (The Verge)
Observe.AI acquires DubDub to boost voice tech in contact centers (CX Today)
OpenAI API now supports building voice agents (The New Stack)
AI-Media releases AI voice translation at NAB 2025 (GlobeNewswire)
NVIDIA launches G-Assist for voice-controlled PC optimization (Indian Express)
Lok Sabha to use AI live translation, transcription, chatbot support (NDTV Profit)
Forethought delivers human-like AI voice support (CXM)
Medallia adds 7 AI-powered capabilities (SmartCustomerService)
Octal IT builds AI chatbots and virtual assistants for businesses (AiThority)
ai|coustics raises €5M in Seed funding (Finsmes)
VELS boosts professional skills with voice AI simulations (TrendHunter)
Capture and organize your thoughts effortlessly with InstaNote (TrendHunter)
DesiVocal is an AI Voice-Over Marketplace for high-quality TTS (TrendHunter)
Lotte Innovate upgrades AI platform with custom voice feature (Pulse.mk.co)
Our Latest Article
Where AI Voice Agents Fail the Most Today
AI voice agents are everywhere—handling customer service calls, booking appointments, and assisting in day-to-day business. While their quality has been improving fast, one problem keeps getting in the way: they don’t know when to talk and when to listen.
Engineering Corner 😎
Pricing: OpenAI vs ElevenLabs vs DeepGram for TTS and STT (YouTube)
Try integrating Sesame with Vapi (X)
A deep dive into TTS technology for educators (Plain English)
Orpheus TTS: The next generation open-source TTS system (Dev.to)
Building an AI-powered language learning app at UofT (Dev.to)
SuperM2M: Supervised and mixture-to-mixture co-learning for speech enhancement and noise-robust ASR (ScienceDirect)
A LoRa-based Walkie-Talkie? Meet LILYGO T3-S3 MVSR LoRa voice communication kit (CNX Software)
The first real STT guide for kids’ voices on edge devices (ML Vanguards)