Updates from LiveKit, Google, ServiceNow, Nvidia and more this week 🔥

Voice AI weekly digest

Davit Baghdasaryan

Jan 26, 2026

The team at Coval has published a Voice AI 2025 report.

Top Updates 💪

LiveKit’s Series C: Towards the voice-driven era of computing (LiveKit)
Google snags team behind AI voice startup Hume AI (TechCrunch)
ServiceNow and OpenAI push AI past chatbots into real CX work (CX Today)
Voice AI just changed: How enterprise AI builders can benefit (VentureBeat)
VibeVoice-ASR: STT handling 60-minute audio in a single pass (MarkTechPost)
Deepfakes leveled up in 2025: Here’s what’s coming next (UBNow)
Krisp appoints Vimal Nair as CGO to lead India business expansion (Krisp Blog)
Adobe’s AI transforms PDFs into podcasts (WebProNews)
FlashLabs researchers release Chroma 1.0: A 4B real-time speech dialogue model with personalized voice cloning (MarkTechPost)
CareXM introduces AI voice agent to improve patient access (Business Wire)
Vodia integrates with ElevenLabs Voice AI platform (Telecom Reseller)
Litera brings agentic AI to iOS for Litera One platform (LawNext)
Medallia & Ada partner to turn insights into action (Customer Service Manager)
HiDock introduced live transcription & translation on HiNotes (Newswire)
Deepfake-as-a-Service revolutionizing biometrics spoofing (Biometric Update)
Evernote v11: A new chapter in AI-powered productivity (Newswire Korea)
The future of AI voice agents: Trends & business applications (RingCentral)
Conversational intelligence is reshaping modern staffing decisions (StaffingTalk)
Xiaomi smart audio glasses record meetings in a lighter design (HardwareZone)
roverIQ launches Ava voice assistant for StayNTouch hotels (GlobeNewswire)
Cadence launches sixth-generation Tensilica HiFi iQ DSP for voice AI and immersive audio (New Electronics)

Engineering Corner 😎

Qwen3-TTS is officially live. They’ve open-sourced the full family (X)
PersonaPlex-7B: An open-source, full-duplex conversational model (X)
Supertonic-2: Lightning fast, on-device, multilingual TTS (X)
Velma: Understand the true meaning of every conversation (Modulate)
The NVIDIA Nemotron Stack for production agents (HackerNoon)
Offline STT on iOS and macOS with Whisper Notes (TrendHunter)
The best TTS tools: Expert tested (ZDNet)
Voice task manager: LIA Workday (Freelancer)
AI voice agents: How to get started (Social Media Examiner)
Garo ASR - STT AI model for Garo language (A’chik) (LINGUIST List)
Severity-controllable pathological TTS for clinical applications (IEEE Xplore)
Advances and challenges in speech recognition and NLP (MDPI)
Introducing the Gladia STT plugin in VideoSDK (DEV)
Comparing multi-scale and pipeline models for speaker change detection (MDPI)

Voice AI Newsletter
