ChatGPT Voice goes GA, Krisp hits 10M meeting transcripts! 🔥

Nov 27, 2023

ElevenLabs’s AI STS Converter is super impressive! ❤️

Top Updates 💪

ElevenLabs launched AI Speech to Speech Converter (STS). STS is a voice conversion tool that lets you turn the recording of one voice to sound as if spoken by another
ChatGPT Voice now available for everyone, including free users
Krisp hits 10M meeting transcripts generated with on-device AI

Zoom AI Companion has new features
- Ask questions and get summaries in multiple languages
- Get feedback on your presentation skills
Google’s SynthID can now watermark AI-created audio

How AI helps healthcare systems deliver improved staff and patient experience
Assurance IQ is analyzing 15M+ calls with Observe.AI 🔥
Zoom reaches 700 CCaaS customers 🚀
PolyAI VOX 2023 recap. Can AI take the pressure off the contact center?
The Psychology of Speech: How TTS Influences Perception. Research indicates that our perception of speakers can be influenced by factors such as accent, pitch, and speed of speech. In the realm of TTS, the choice of voice unintentionally reinforces existing biases and stereotypes.
What is Call Blasting? How it Works & Mistakes to Avoid?
Powder, an AI clipping tool for gaming, detects “yelling” during a stream
10 AI Voice generator apps
TTS in autonomous vehicles: enhancing human-machine interaction

Native whisper.cpp server with Open AI-like API. This is a very convenient way to run an efficient transcription service locally on any kind of hardware.
Open Whisper-style speech model. It reproduces Whisper training using an open-source toolkit (ESPNet) and publicly available datasets.
StyleTTS 2: Towards human-level TTS through Style Diffusion
HierSpeech++: Bridging the gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis