Top Updates 💪
Mati Staniszewski: Why voice will be the main interface for tech (Sequoia Capital)
Zoom enables next-gen AI with real-time media streams (Telecom Reseller)
ChatGPT’s conversational voice mode has been upgraded (TechCrunch)
Why Conversational Intelligence is crucial to your 2025 UC strategy (UC Today)
Google flags data quality issues in public multilingual speech datasets (Slator)
Kyutai releases 2B‑parameter streaming TTS with 220 ms latency (MarkTechPost)
Neural implant translates neural activity into speech almost instantly (PC Gamer)
RingCentral launches AI Receptionist for automated call handling (CMSWire)
English-learning startup SpeakX looks to raise $15 million (LiveMint)
Mahindra selects Cerence Audio AI for in-car voice interaction (Yahoo Finance)
TTS software market size, trends & growth forecast 2025 (EIN Presswire)
Ultatel rolls out Intelligent Voice AI Agent for task automation (Destination CRM)
Voice tech startup Intron launches new AI models (TechCabal)
How RevRag.AI is improving digital onboarding in BFSI sector (Inc42)
How Conversational AI is rewriting the future of work (CXO Today)
Testlify launches Conversational AI Interviews (TechBullion)
Roblox releases a TTS beta (Gameranx)
Voice Design v3 from ElevenLabs creates expressive AI voices (Geeky Gadgets)
Global noise-cancelling headphones market set to double by 2030, reaching a forecasted $35.9 billion (GlobeNewswire)
SpectraLayers 12 adds new audio unmixing and AI voice tools (Yamaha Musicians)
Voice AI Podcast 🎙️
In case you missed the latest episode of Voice AI Podcast…
CX AI: What the Data Says | Jordan Zivoder (Quantitative Research Lead at Customer Management Practice)
In the Future of Voice AI interview series, I ask my guests three questions:
- What problems do you currently see in Enterprise Voice AI?
- How does your company solve these problems?
- What solutions do you envision in the next 5 years?
Engineering Corner 😎
AI cloning tool is letting a Quebec man with ALS keep his voice (CBC News)
More accessible group conversations with sound localization (Google Research)
How accurate is Apple’s new transcription AI? Tested against Whisper and Parakeet (9to5Mac)
ATBU student invents smart glove to convert sign language to speech (Guardian Nigeria)
VocaLearn: AI-powered language learning with Murf-AI challenge project (DEV)
Whisper STT on Mac M4: Performance analysis and benchmarks (DEV)
Speech-based Parkinson’s detection using pre-trained self-supervised ASR models and supervised contrastive learning (MDPI)
Building modular STT workflows: Architecture and performance analysis of a CLI AI agent (HackerNoon)
Mandarin electrolaryngeal speech voice conversion with speech encoder loss learning and seq2seq modeling (ResearchGate)
Best 15.ai alternatives for AI voiceovers (Techpoint Africa)
BAST-Mamba: Binaural Audio Spectrogram Mamba Transformer for binaural sound localization (ScienceDirect)