a16z thesis on AI Voice Agents π Updates from ElevenLabs, Avaya, Truecaller and others π₯
Top Updates πͺ
AI narrows specific sounds for headphone users
Truecaller's launches an AI scanner detecting voice scam
The expanded Avaya-RingCentral partnership: A closer look
Cerence to power Zeekr multilingual AI assistant
Kardome partners with KT corporation to bring Voice AI to IPTV users
ElevenLabs moves beyond speech with AI-generated Sound Effects
Noteworthy πͺ
Klick Labs develops deepfake detection method focusing on vocal biomarkers
Speakly AI shines at VivaTech 2024 with Its LLM-based conversation intelligence
Conversational Voice AI for L&D: Coaching, role playing, and more
The AI voice as a tool for captivating consumers
The rise of AI assistants β How theyβre reshaping everyday life
How denoising LM (DLM) improves speech recognition accuracy
Maven AGIβs $28M funding signals the rise of generative AI in customer support
A recent FTC challenge: New techniques emerge to stop audio deepfakes
Is texting a thing of the past? Voice notes take center stage
Top 10: AI tools with which you can convert from audio to text
Science and Demo Corner π
LookOnceToHear: Target speech hearing with noisy examples
A new model to produce more natural synthesized speech
ChatTTS: A TTS model created for dialogues that supports English and Chinese
Announcing Sonic: A low-latency voice model for lifelike speech
Pipecat: An open-source framework for voice and multimodal conversational AI
Pretraining and adaptation techniques for electrolaryngeal speech recognition
Morse wavelet transform-based features for voice liveness detection
Khoj's AI agentsΒ allows you to create always-available, personal AI agents
ElevenLabs Reader: AI Audio