CX Awards 2025 winners announced. Meta buys PlayAI. Otter's AI speaks in meetings and much more 🔥
Voice AI weekly digest
The CX Awards 2025 winners announcement (CX Today)
Top Updates 💪
Meta acquires voice AI startup PlayAI, continuing to add talent (Bloomberg)
Otter’s new AI agent can speak up in meetings (The Verge)
Exploring the future of voice AI with Mati Staniszewski (TechCrunch)
UN report urges stronger measures to detect AI‑driven deepfakes (Reuters)
Enhancing decision-making through conversational intelligence (UC Today)
Comparing conversational AI market leaders in the enterprise (UC Today)
Why fake AI calls impersonating US officials are ‘the new normal’ (CNN)
AiOla raises $25M for voice AI in aviation operations (Airport Technology)
Leion Hey2 brings live translation to eyewear (Kr-Asia)
SoundHound AI: Voice of the future or overvalued gambit? (AInvest)
Common examples of voice deepfake attacks (Pindrop)
CallMiner acquires VOCALLS (Demand Gen Report)
Announcing Saga: The voice OS for developers (Morningstar Business Wire)
Soul App launches revolutionary full-duplex voice model (KTLA)
Voices to launch voice data solution for responsible AI (Accesswire via LocalSYR)
Vochlea wants to reinvent Voice Memos with new AI app Dubnote (MusicRadar)
Character.AI's model creates interactive videos from image and audio (NewsBytes)
Voice AI Podcast 🎙️
In case you missed the latest episode of Voice AI Podcast…
The role of empathy in AI and CX | James Bednar (VP of Product and Innovation at TTEC)
In the Future of Voice AI series of interviews, I ask three questions to my guests:
- What problems do you currently see in Enterprise Voice AI?
- How does your company solve these problems?
- What solutions do you envision in the next 5 years?
Engineering Corner 😎
Large language-audio models and applications (YouTube)
Kyutai STT & TTS: A perfect local voice solution (Geeky Gadgets)
Build real-time conversational AI experiences using Amazon Nova Sonic and LiveKit (AWS Machine Learning Blog)
Building an AI-powered scientific meeting transcription platform with AWS (AWS Public Sector Blog)
VideoDB: AI agent infra for your meetings (VideoDB)
Voice agents: Easy to use, hard to build (Greylock)
Chrome extension that converts your voice into text (DEV)
Best AI voice generator for every situation (Shopify Blog)
Researcher develops 'SpeechSSM,' opening up possibilities for a 24‑hour AI voice assistant (TechXplore)
How to train a new voice for Piper with only a single phrase (Hackaday)
Spectrogram‑based deep learning approaches for deepfake audio detection (ResearchGate)
Joint beamforming and speaker-attributed ASR for real distant-microphone meeting transcription (arXiv)
Phoneme-aware hierarchical augmentation and semantic-aware SpecAugment for low-resource Cantonese speech recognition (MDPI)