Gemini 1.5 now has ears👂, Updates from Galaxy AI, Bakstage.AI, Tenyx Voice, Read AI and more 🔥
Google’s Gemini 1.5 Pro can now hear
Top Updates 💪
FTC names winners in AI voice authentication challenge
Detecting AI fakes in video, audio, and text as deepfakes go mainstream
OpenAI cautiously debuts multilingual generative AI Voice Engine
Microsoft revamps Meeting Details experience in OneNote for Windows
HuggingFace releases Parler-TTS: An inference and training library TTS models
Bakstage.AI leverages IBM AI to better personalize customer bot conversations
Galaxy AI now supports more languages with latest update
NETINT unveils automated subtitling with OpenAI Whisper
Tenyx Voice: Revolutionizing customer service with conversational AI
Read AI raises $21M to unify communications across meetings, emails and chats
Noteworthy 💪
Trust Stamp introduces biometric authentication against deepfake voice attacks
Amazon Transcribe for accurate Speech-to-text conversion
How to protect yourself from AI scam calls
Exploring native and non-native English child speech recognition with Whisper
You don’t have to type anymore: Welcome to the golden age of voice dictation
Science and Demo Corner 😎
SpeechAlign: Enhancing naturalness in speech synthesis with human feedback
Introducing Resemble Enhance: Open source speech super resolution AI model
Accented TTS synthesis with fine and coarse-grained intensity rendering
Notta: Audio & video transcription powered by AI
Sound AiSleep: Audiobooks in your voice
Interviewer: Human-like voice interviews