Top Updates 💪
Google launches 'Search Live' for real-time voice interactions (Google Blog)
You sound like ChatGPT (The Verge)
Conversational AI market to reach $57B by 2032 (MENAFN)
8x8’s vision for the future of customer experience (ZDNet)
Apple devices offer amazing STT transcription in developer betas (9to5Mac)
How the TIME AI Audio Brief was built (Time)
IBM Granite tops Hugging Face speech recognition leaderboard (IBM Research)
SoundHound’s Amelia 7.0 launch: A turning point for voice AI? (Yahoo Finance)
Target Dial sets a new standard in AI voice automation (GlobeNewswire)
Deepgram launches enterprise-ready, real-time voice agent API (Business Wire)
Darwix AI raises $1.5M to build GenAI sales conversation stack (BW Disrupt)
AI Audio Stemmer instantly splits songs into editable audio stems (TrendHunter)
Appy Pie Agents launches AI voice agent to automate calls (WIVB/EIN Presswire)
Warning: Voice deepfakes continue to improve (KnowBe4)
Nurix AI launches NuPlay to power voice agents (Smart Customer Service)
Vodex voice AI agents power next-gen debt collection (PR Newswire)
The future of conversational AI in business (The Independent)
Alibaba recruits top Chinese AI scientist Li Xiangang to lead speech-recognition push (SCMP)
On X
Engineering Corner 😎
Top B2B AI Agents: Which ones actually work? (DesignRush)
4 of the best TTS apps for foreign languages (MSN)
Soniox: Voice AI that works in the real world (Soniox)
ElevenLabs introduces MCP for conversational AI customization (ElevenLabs)
How to build Agentic AI chatbots: A step-by-step guide (Dev.to)
Perplexity AI vs Grok AI comparison: which ai assistant is better? (ClickUp)
StepFun introduces Step-Audio-AQAA: A fully end-to-end audio language model for natural voice interaction (MarkTechPost)
Evaluating multimodal speech models across diverse audio tasks (HackerNoon)
Restoring voices and identity with neuroengineering (UC Davis Health)
The science behind audio‑aware language models (HackerNoon)
Vosk API: Offline open source speech recognition toolkit (GitHub)
Build an audio‑to‑text conversion tool using Azure AI Speech SDK (Dev.to)
Give your app a voice: a guide to integrating AI voice APIs (CSSAuthor)
GCP fundamentals Cloud STT API (Dev.to)