On-Device AI getting strong: Updates from RingCentral, NVIDIA, Salesforce and much more
Top Updates
RingCentral teases native AI assistants for RingCX, unveils more solutions
NVIDIA's on-device small language model makes digital humans more lifelike
Boost AI with Azure's new Phi model, streamlined RAG and generative models
ElevenLabs' text-to-speech app Reader is now available globally
Elevate healthcare interaction with Amazon Bedrock and Amazon Transcribe
McAfee unleashes AI deepfake audio detector - but how reliable can it be?
Microsoft: New realistic multilingual voices optimized for conversations
No Jitter Midroll: Salesforce launches AI Agents for sales
SleekFlow snaps up $7M to tap the conversational AI opportunity across Asia
D-ID launches an AI video translation tool that has voice cloning and lip sync
Voice AI Podcast
In case you missed the latest episode of Voice AI Podcast…
Noteworthy
GPT-4o Advanced Voice turned out to be even better than expected
Scammers using AI to clone voices: How to protect yourself and your information
How AI analytics can improve call centre performance
AI summarization vs. manual note-taking: What could possibly go wrong?
From Robo to Relatable: Make AI in customer service more human
History and evolution of contact centers
Harnessing Voice AI: Building authentic customer connections
How to better understand the voice of the customer with Speech AI
The evolution and impact of AI note takers on modern workflows
Science and Demo Corner
How AI deciphers neural signals to help a man with ALS speak
Meta's Research SuperCluster for real-time voice translation AI systems
Restoring speaker voices with zero-shot cross-lingual voice transfer for TTS
AI-Based Interactive Voice Response
TrintAI: An open-source tool for STT, summarization and sentiment detection
Semantic dependency and local convolution for enhancing naturalness in TTS
AV-CPL: Continuous pseudo-labeling for audio-visual speech recognition
Coherence-based phonemic analysis of reverberation effects on STT
Quantification of STT system performance on d/Deaf and hard-of-hearing speech
LLM-based speech recognition and translation models from Moore Threads
Text-to-speech in Python: On-device solutions
Adding speaker diarization to OpenAI Whisper using Picovoice Falcon