Looking forward to VapiCon this Thursday! It’s going to be fun 🔥
Top Updates 💪
OpenAI CEO Sam Altman says AI will take customer service jobs first (CX Today)
Google launches live AI voice search in Google Search (The Verge)
TTS market to reach $12.5B by 2031, CAGR 16.3% (EIN Presswire)
Microsoft aims to elevate customer service with HD voice features (CX Today)
NVIDIA backs ElevenLabs to enhance customer experience (CX Today)
Apple built its own ChatGPT-like app to test out new Siri AI revamp (Mashable)
Zoom brings AI live speech translation in-house (Slator)
Red Lobster taps SoundHound AI for phone ordering (Yahoo Finance)
Kenect launches Voice AI to boost dealership efficiency (Morningstar)
Chrome for Android turns webpages into podcasts (ChromeUnboxed)
Why voice AI is the most natural customer experience channel (CMSWire)
Prosper AI raises $5M to deliver voice AI agents for healthcare (Silicon Angle)
How to drive adoption of contact center voice AI (Customer Experience Dive)
Why deepfakes threaten enterprise communications (Pindrop)
8x8 recognized in 2025 Gartner UCaaS Magic Quadrant for 14th year (Benzinga)
Cloze introduces Maia: Voice-enabled AI assistant for real estate (GlobeNewswire)
Voice AI enhances interactions: Advances, ethics & future impact (WebProNews)
AI-Media’s Lexi Voice delivers accuracy in AI voice translation (TVBEurope)
Voice AI Podcast 🎙️
In case you missed the latest episode of Voice AI Podcast…
Fullband 2025: CX Research, AI Agents, and the Frontlines of Voice AI
In this special edition of the Future of Voice AI series, we welcome leading voices on the state of voice AI in CX:
- Nicole Kyle of CMP Research on CX market data and shifting priorities
- Kwindla Hultman Kramer of Daily on building and scaling voice AI agents
- Brent Stevenson of IntouchCX on AI adoption on the frontlines
Engineering Corner 😎
Wispr Flow vs SuperWhisper: Which fits your use case? (ClickUp)
Application of audio fingerprinting techniques for real-time scalable speech retrieval and speech clusterization (arXiv)
How ReSpeaker enables clear voice pickup (Seeed Studio Blog)
WhisperLiveKit: Real-time & local STT with speaker identification (GitHub)
No verifiable reward for prosody: Toward preference-guided prosody learning in TTS (arXiv)
Benchmarking the responsiveness of open-source TTS systems (MDPI)
Python audio transcription: Convert speech to text locally (PavlinBG)
ChatGPT Voice vs. Whisper AI: Key differences explained (ClickUp)
Building a voice agent to command all your apps (Composio)
BiRQ: Bi-level self-labeling random quantization for self-supervised speech recognition (arXiv)
Notely Voice: Revolutionizing note-taking with AI-powered transcription (DEV)
Voice clones can sound as real as human voices (BiometricUpdate)
VibeVoice AI podcast model may generate spontaneous singing (The Decoder)
AI Audio Avatar review: Revolutionary voice cloning tool (Vocal Media)
Top voice generators that can revolutionize business communication (BuzBlog)