Salesforce’s AgentForce: The AI assistants that want to run your entire business.
Top Updates 💪
Yellow.ai launches voice platform for customer service AI agents
Kyutai Labs open sourced on-device speech to speech foundation model Moshi
Tencent’s EzAudio AI transforms text to lifelike sound
Deepgram's groundbreaking voice agent API brings AI to life
Google Gemini Live's AI voice adds new styles that take inspiration from the stars
Slack can use AI to transcribe your Huddle conversations now
11x.ai raises $24M led by Benchmark to build AI digital employees
Fal.ai, which hosts media-generating AI models, raises $23M from a16z and others
AI notetaker Fathom raises $17M
AI startup Rep.ai raises $7.5M to launch ‘digital twin’ sales representatives
Voice AI Podcast 🎙️
In case you missed the latest episode of Voice AI Podcast…
Noteworthy 💪
How companies can use AI to better serve deaf and hard-of-fearing customers
WellSaid Labs: How this company is using AI to drive human parity in voice
Google working on a mobile version of "Take Notes with Gemini" for Meet
Can ChatGPT transcribe audio?
Remove background noise from audio: Techniques & tools
AirPods 4 reviews: Impressive noise cancellation in an open-ear design
W4 Pro: AI interpreter earbuds for real-time translations by Timekettle
Science and Demo Corner 😎
Two Instances of OpenAI’s ChatGPT-4 try to end a conversation, hilarity ensues
Introducing Ascle: A state-of-the-art framework for efficient NLP
A linear-time complexity alternative to self-attention, to streaming STT
ZMM-TTS: Zero-shot multilingual and multispeaker speech synthesis
What’s slowing down TTS systems and how can it be fixed?
Analysis of progress in speech recognition models
OpenAI text-to-speech voice API
Comparative study on the accuracy of STT using a contact microphone
Technical papers 2024: Audio & speech – advances in production
10 best AI voice generator tools for 2024 edition