Top Updates 💪
Scale AI launches the first real-world voice AI benchmark (VentureBeat)
NVIDIA has released Nemotron 3 VoiceChat speech to speech model (X)
Krisp launches MCP integration with Claud (LinkedIn)
Amazon Connect voice AI agents now supports 13 new languages (AWS)
Modulate launches Velma Transcribe: High-performance transcription for real-world conversations at 90% lower cost (Enterprise News)
Google News could soon give you a convenient new way to consume its audio briefings (Android Authority)
AI notetaking devices that record and transcribe your meetings (TechCrunch)
Krisp has been named a Palomarr Leader across Accent Conversion, Noise Cancellation, Voice Translation (LinkedIn)
Amazon Connect adds new generative TTS voices and expands regions (AWS)
Ringover launches enhanced AI assistant ask Empower 2.0 (AIThority)
WhatsApp upgrade — calls will sound completely different (Nokia Power User)
8x8 Engage launches globally for frontline teams (CMSWire)
Itel unveils Zeno AI Weaver voice recorder in India (Gadgets360)
AI voice cloning & synthesis are shaping the future of digital voices (TechTimes)
How businesses are replacing IVR with conversational AI (Social Media Explorer)
Bandicam launches AI feature to transcribe video to text on Mac (MarTech Series)
The mounting cost of voice fraud: revenue loss, broken trust (Retail Dive)
Robinhood’s startup fund invests $35M in Stripe and AI audio firm (The Block)
Ezra raises $3.2M in seed funding (FinSMEs)
WellSaid closes venture debt funding (FinSMEs)
Engineering Corner 😎
VoXtream2: Full-stream TTS with dynamic speaking-rate control (LinkedIn)
Adaptive AI voice layer for real-time communication (Dev)
Utterly: Transcribe speech privately on Apple devices, offline (BetaList)
MiniMax 2.7: GLM-5 at 1/3 cost SOTA open model (Smol AI News)
Best STT APIs to build an AI notetaker in 2026 (Hacker Noon)
PersonaOps: A voice-to-data intelligence system powered by Notion MCP (Dev)
Google AI releases WAXAL: Multilingual African speech dataset (MarktechPost)
WhisperWeb processed STT Directly within the browser (Trend Hunter)
Why building voice AI agents is still so hard (Dev)
OpenVoiceUI: AI voice agent app generates live canvas pages (Dev)
Vietnamese automatic speech recognition (TLDR Takara)
VoiceType AI transcribes, edits, and auto-formats your speech (Trend Hunter)
Speech synthesis API for TTS (Dev)

