Top Updates 💪
Meta acquires AI audio startup WaveForms (TechCrunch)
OpenAI is expanding GPT-5 with advanced voice features (Tom’s Guide)
Krisp launches even more natural-sounding Accent Conversion v3.7 (Krisp)
RingCentral and NiCE extend AI partnership (Telecom Reseller)
One UI 8 adds a new Galaxy AI feature, but there’s a catch (Android Authority)
8x8 boosts AI CX platform to increase engagement (MarTech Series)
Xiaomi releases open-source AI Voice Model MiDashengLM-7B (WinBuzzer)
Taking the bitter lesson to heart for speech-to-speech models (MetaVoice)
Xiaomi rolls out edge AI voice to 500M devices (FourWeekMBA)
Capacity secures $92m to boost AI support for contact centers (Telecom Reseller)
Krisp Launches Filipino Accent Conversion for AI Assistant (AITechPark)
RingCentral takes AIR everywhere (Telecom Reseller)
How Puzzel’s AI-driven platform turns talk into transformation (CX Today)
Elevating enterprise AI through true multimodal intelligence (SoundHound)
ReSpeaker XMOS XVF3800: AI-powered 4-mic array for clear voice (Seeed Studio)
Automakers race to add voice AI to connected vehicles (IEEE Spectrum)
Owll Translator app introduces AI voice cloning (GlobeNewswire via Fox4KC)
How deepfake vishing tricks people and avoids detection (Ars Technica)
SoundHound and Acrelec are taking drive-thrus into the future (InsiderMonkey)
Rifa AI raises $1.1M to expand voice AI solutions (Economic Times)
Hi, AI speaking: Hiring goes vernacular with voice AI (Analytics India Magazine)
Smart IVR vs traditional IVR – A comparison for decision-makers (TechBullion)
AT&T boosts Office@Hand with AI contact center & conversational AI (TMCnet)
How Speech Graphics is pioneering human-to-AI interactions (80 .lv)
Attention labs redefines voice and audio AI for everyday life (BigNewsNetwork)
Voice AI Podcast 🎙️
In case you missed the latest episode of Voice AI Podcast…
What to expect in 2025 | Jack Piunti (GTM Lead for Communications at ElevenLabs)
In the Future of Voice AI series of interviews, I ask three questions to my guests: - What problems do you currently see in Enterprise Voice AI? - How does your company solve these problems? - What solutions do you envision in the next 5 years?
Engineering Corner 😎
Kitten TTS : A TTS model that runs in your browser (X)
CapCut shows AI narrator video effect for creators (Eastern Mirror Nagaland)
Krisp Pro — The upgrade you didn’t know you needed (Medium)
Notiq — AI-powered private voice memos (BetaList)
They tested three TTS AI models to see which is best (ZDNet)
Toward low-latency end-to-end voice agents for telecommunications (arXiv)
C3: A bilingual benchmark for spoken dialogue models (arXiv)