On-Device AI getting strong 👀 Updates from RingCentral, NVIDIA, Salesforce and much more 🔥

Davit Baghdasaryan

Aug 26, 2024

Top Updates 💪

RingCentral teases native AI assistants for RingCX, unveils more solutions

NVIDIA’s on-device small language model makes digital humans more lifelike
Boost AI with Azure's new Phi model, streamlined RAG and generative models

ElevenLabs’ text-to-speech app Reader is now available globally

Elevate healthcare interaction with Amazon Bedrock and Amazon Transcribe
McAfee unleashes AI deepfake audio detector - but how reliable can it be?
Microsoft: New realistic multilingual voices optimized for conversations
No Jitter Midroll: Salesforce launches AI Agents for sales
SleekFlow snaps up $7M to tap the conversational AI opportunity across Asia
D-ID launches an AI video translation tool that has voice cloning and lip sync

Voice AI Podcast 🎙️

In case you missed the latest episode of Voice AI Podcast…

Voice AI Newsletter

AI Agents lead to customer turnover, but... | John Robb (Head of Customer Success, Tenyx)

In The Future of Voice AI series of interviews, I ask three questions to my guests: - What problems do you currently see in Enterprise Voice AI? - How does your company solve these problems? - What solutions do you envision in the next 5 years…

2 years ago · 15 likes · Davit Baghdasaryan

Noteworthy 💪

GPT-4o Advanced Voice turned out to be even better than expected

Scammers using AI to clone voices: How to protect yourself and your information
How AI analytics can improve call centre performance

AI summarization vs. manual note-taking: What could possibly go wrong?
From Robo to Relatable: Make AI in customer service more human
History and evolution of contact centers
Harnessing Voice AI: Building authentic customer connections
How to better understand the voice of the customer with Speech AI
The evolution and impact of AI note takers on modern workflows

Science and Demo Corner 😎

How AI deciphers neural signals to help a man with ALS speak
Meta's Research SuperCluster for real-time voice translation AI systems
Restoring speaker voices with zero-shot cross-lingual voice transfer for TTS
AI-Based Interactive Voice Response
TrintAI: An open-source tool for STT, summarization and sentiment detection
Semantic dependency and local convolution for enhancing naturalness in TTS
AV-CPL: Continuous pseudo-labeling for audio-visual speech recognition
Coherence-based phonemic analysis of reverberation effects on STT
Quantification of STT system performance on d/Deaf and hard-of-hearing speech
LLM-based speech recognition and translation models from Moore Threads
Text-to-speech in Python: On-device solutions
Adding speaker diarization to OpenAI Whisper using Picovoice Falcon

Discussion about this post

No posts

Ready for more?

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts