Using text and audio as inputs, a new generative AI model from NVIDIA can create any combination of music, voices and sounds.
Top Updates 💪
Microsoft’s new Copilot AI Voice mode is now available for everyone for free
Ircam Amplify introduces a technology to identify AI-generated voice clones
Talkdesk embeds conversational, artificial intelligence in CRM, helpdesks
HitPaw VoicePea V2.3.0 released: New TTS features enhance voice creation
Orange partners with Meta and OpenAI for African languages AI models
PlayAI snags $21m in funding to launch AI voice-cloning platform
Noteworthy 💪
Voice cloning tech is breaking customer authentication systems
Human-centric AI drives customer experience loyalty
Why can’t automatic speech recognition systems understand kids?
How Generative AI for sales is shaping the future of personalization
Hear this! Transforming health care with speech-to-text technology
VocaEase magSafe AI translation ring powered by GPT
Is there something special about the human voice?
Circleback is out to become the best meeting notetaker
Voice recognition and AI: Shaping the future of call centers in the BPO industry
Science and Demo Corner 😎
Voice-Pro: The best gradio web-ui for transcription, translation and TTS
10 best AI phone platforms & agents for call centers
Code-switched Hindi-Marathi data and transformer-based architecture for STT
D-FBSS: A deep learning algorithm for noise reduction and speech enhancement
How to transcribe Zoom participant recordings (multichannel)
VisAssist: An accessible transcription assistant for auditory impaired individuals
Speech is more than words: Do STT translation systems leverage prosody?
Multi-scale dynamic feature extraction network for pathological voice detection
A voice transcription and translation app with OpenAI Whisper and Streamlit
The ultimate guide to mastering natural language generation
VoiceScribe: Revolutionizing real-time STT
What is RVC AI? Learn to make RVC voice models