a16z: How AI will disrupt BPOs 🚀

Feb 17, 2025

Top Updates 💪

Unbundling the BPO: How AI will disrupt outsourced work (a16z)
AI voice generators market size to reach $6.40B in 2025 (Straitsresearch)
Introducing Nova-3: Setting a new standard for AI-driven STT (Deepgram)
Voice AI: Solving healthcare’s workforce challenges with Ankit Jain (a16z)
Contact Center Trends for 2025: What’s Hot and What’s Not? (CxToday)
Top news in customer contact (Customercontactweekdigital)
You can now talk to Microsoft Copilot Voice in 40 more languages (TechRadar)
Adobe's audio enhancement features have blown them away (Digitalcameraworld)
TrialWire launched its groundbreaking AI Voice Screen service (Biospace)
Cisco Webex COO: AI agents are about to change customer service (Mediapost)
Anthropic Stuns with Fact About Claude Use for AI Translation (Slator)
YouTube expands text-to-speech for shorts (Socialmediatoday)
Why Speech-to-AI will be the natural interface of the future (Pootlepress)
Galaxy S25 excels in voice control, transcripts & Gemini (Hardwarezone)
MiaRec’s RingCentral integration brings new auto QA to contact centre (Uctoday)
Vitalchat raises $6M in Series A funding (Finsmes)
Nagish brings AI-powered phone calls to deaf and hard-of-hearing individuals (Prnewswire)
R7 Pro smartwatch has built-in headphones & live translation (Gadgetsandwearables)

In case you missed the latest episode of Voice AI Podcast…

How to build the world's fastest voice bot: Kwindla Hultman Kramer (YouTube)
Meta AI unveils Brain2Qwerty: A deep learning model for decoding brain activity via EEG/MEG during QWERTY typing (Marktechpost)
Best AI note taking apps for iPhone in 2025 (Ioshacker)
They analyzed 13 AI Voice solutions that are selling right now (Reddit)
SpeechCompass: Enhancing mobile captioning with diarization and directional guidance via multi-microphone localization (Arxiv)
A study on model training strategies for speaker-independent and vocabulary-mismatched dysarthric speech recognition (Mdpi)
Zonos-v0.1: A game-changer in open-source TTS technology (Generativeai)
FireRedASR: Open-source industrial-grade Mandarin speech recognition models from encoder-decoder to LLM integration (Paperswithcode)
Advancing scalable TTS synthesis: Llasa’s transformer-based framework for improved speech quality and emotional expressiveness (Marktechpost)
Kyutai releases Hibiki: A 2.7B real-time speech-to-speech and speech-to-text translation model with near-human quality and voice transfer (Marktechpost)