Krisp is hiring!
Krisp’s Voice AI SDK team has three key openings: Sr Product Manager, Sr Solution Enginer and BD Manager. If you know exceptional people who might be a great fit, please share these roles with them. Thank you 🙏
Top Updates 💪
Meta unveils Omnilingual ASR supporting 1,600+ languages (VentureBeat)
Cisco acquires EzDubs (Webex Blog)
AppTek pioneers next-generation expressive TTS for AI dubbing (Slator)
ElevenLabs’ new AI marketplace lets brands use famous voices for ads (The Verge)
Kaltura acquires eSelf, founded by creator of Snap’s AI, in $27M deal (TechCrunch)
Willow voice keyboard lets cross-app dictation and editing on iOS (TechCrunch)
Beside raises $32M to build an AI receptionist for small businesses (Fortune)
ElevenLabs debuts Scribe v2, its fastest low-latency STT model (ElevenLabs)
Murf AI launches Falcon: The TTS API (MarTech Series)
1Mind launches AI platform for sales & customer engagement (SiliconAngle)
Broadcom unveils AI chipset for real-time on-device translation (Engadget)
Gemini Live’s new voice tricks make AI chats sound more human (Digital Trends)
Google leverages AI voice agents (ATT Currently)
Time launches new AI agent (Axios)
Dell launches Pro Plus Earbuds with AI noise cancellation and ANC (Gizmochina)
The startup claimed to use AI, but it was two founders taking notes (TechStartups)
AI speech model aiOla Drax outpaces OpenAI & Alibaba (Techzine)
ShunyaLabs.ai announces ZeroMed (TechBullion)
AI is resurrecting the voices of dead famous people (Vox)
Engineering Corner 😎
Building voice-enabled web applications using STT APIs (C# Corner)
How Decagon shipped real-time voice AI on Modal (Modal)
StepFun AI releases Step-Audio-EditX and Meet Gelato-30B-A3B, The GUI Model (AI Dev Signals)
Spatial noise-canceling technology that rapidly adapts to diverse noise throughout indoor spaces (NTT Group)
Qwen3 ASR: The speech recognition that actually works (C# Corner)
Audio converter AI review: Fast and accurate AI audio-to-text transcription (Dev)
How to build an agentic voice AI assistant that understands, reasons, plans and responds through autonomous multi-step intelligence (MarkTechPost)
Transvoicely: Bring every voice to every language (Transvoicely)
Building an audio transcription tool: A deep dive into WER metrics (Dev)
OS-Denseformer: A lightweight end-to-end noise-robust method for Chinese speech recognition (MDPI Applied Sciences)

