Bessemer has published their vision around Voice AI: Voice AI isn’t just an upgrade to software’s UI — it's transforming how businesses and customers connect.
Top Updates 💪
Alorica wins BPO Partner of the year at CallMiner’s LISTEN 2024
RingCentral migrates its Agents away from NICE-powered platform
8×8 expands its cloud platform with AI tools
Universal-2 outperforms Whisper in STT model comparison
Microsoft enhances AI Copilot with voice, vision & deeper thinking
Speechmatics launches Flow: The ultimate API for seamless voice interactions
Bengaluru AI startup smallest.ai unveils lightning, new TTS model
RecCloud’s STT: Unlocking the power of AI
Voice AI Podcast 🎙️
In case you missed the latest episode of Voice AI Podcast…
Noteworthy 💪
Enter the ‘Whisperverse’: How AI voice agents will guide us through our days
Elevating audio and video performance in modern meeting spaces
Automated voice message: Enhancing communication for modern businesses
Extract insights from Amazon Transcribe audio transcripts with Amazon Bedrock
How to enforce IVR authentication without annoying callers
Google, University of Ghana partner on Project Euphonia for improved STT
Why customers and companies love interactive voice response
What is NotebookLM? Features, benefits, and use cases
Science and Demo Corner 😎
Introducing Fish Agent v0.1 + Fish Speech 1.4
OuteTTS-0.1-350M - Zero shot voice cloning, built on LLaMa architecture
Mozilla is now offering free AI voice training data in 180 languages
A robust accent classification system based on variational mode decomposition
How to Integrate OpenAI for text generation, TTS, and STT in .NET
Multilingual meta-transfer learning for low-resource speech recognition
Optimizing contextual STT using vector quantization for efficient retrieval
MaskGCT: A guide to Amphion’s zero-shot TTS model with Gradio
STT using an English multimodal corpus with integrated image and depth data
The best open-source AI models: All your free-to-use options explained
Best video to text converter for 2024