New TTS challenges ElevenLabs, Perplexity launches iOS voice assistant, and more 🔥
Voice AI weekly digest
A new open-source TTS model Dia challenges ElevenLabs.
Top Updates 💪
A new open-source TTS model Dia challenges ElevenLabs, OpenAI (VentureBeat)
Perplexity’s AI voice assistant is now available on iOS (The Verge)
Rime unveils Arcana, a new ultra-realistic speech model (Rime)
Jericho Security raises $15M to stop deepfake fraud (VentureBeat)
SignalWire unveils first fully integrated AI Voice Stack (Telecom Reseller)
Korean telcos roll out tools for disability support (Communications Today)
NVIDIA NeMo microservices are generally available (Constellation Research)
Alibaba and Meta face off in simultaneous AI translation (Slator)
IBM unveils Granite-3-3 with AI STT and multilingual translation (Cloud Wars)
Atomicwork launches its universal agent (PR Newswire)
Listen Labs raises $27M in seed and Series A financing (Finsmes)
Should your AI sound human or like you? (UX Design)
VoiceCare AI at Becker’s 15th Annual Meeting (Becker's Hospital Review)
RJ Burnham on the future of voice AI and the role of vCons (Telecom Reseller)
AI call centre agents: Benefits, risks, & what the future holds (Call Centre Helper)
Microsoft Copilot AI podcast feature enters early testing (TestingCatalog)
Infinix AI Buds Malaysia release: real-time AI translator and ANC (TechNave)
Lace AI raises $19M to revolutionize home services (Trending Topics)
Voice AI Podcast 🎙️
In case you missed the latest episode of Voice AI Podcast…
Why Calls with AI Agents Fail | Tom Shapland (CEO, Canonical AI)
In the Future of Voice AI series of interviews, I ask three questions to my guests: - What problems do you currently see in Enterprise Voice AI? - How does your company solve these problems? - What solutions do you envision in the next 5 years?
Engineering Corner 😎
Transformer-based language-independent gender recognition in noisy audio environments (Nature)
How matrices are used for speech recognition (Medium)
Open-source solutions for AI agent developers (Hackernoon)
7 of the best AI dubbing tools to translate videos (Vimeo Blog)
Best AI transcription tools – tips and comparison guide (Noupe)
Sybill: Testing its AI sales assistant and meeting recaps (Geekflare)
AI multilingual translator: Kaggle notebook and Telegram bot project (Dev.to)
MixDiff-TTS: Mixture alignment and diffusion model for TTS (MDPI)