This GPT-4o voice demo is 🤯 Updates from Meta, PolyAI, GreyLabs, WellSaid and much more
GPT-4o voice mode early access demo showing its capabilities:
AI researchers developed a new Listening-While-Speaking Language Model:
Top Updates
SoundHound acquires Amelia AI for $80M after it raised $189M+
OpenAI finds that GPT-4o does some truly bizarre stuff sometimes
PolyAI partners with AWS to boost next-gen voice AI in customer service
How AI is reshaping the customer experience
WhatsApp will add the ability to convert speech to text
Bolt upgrades Driver App chat with ability to translate speech to text
SoftBank’s balancing act; Analysing conversations with GenAI
RingCentral sees double-digit revenue growth, enjoys a surge in RingCX bookings
GreyLabs AI bets on GenAI to analyze customer conversations and get insights
Interra Systems presents Media QC, monitoring, analysis solutions at IBC2024
Trial lawyer’s AI-powered voice tool brought Lori Cohen back to her life
Google project Astra: The AI assistant we have been waiting for?
WellSaid unveils verbal cues, phonetic respellings, and enhanced security
Ema raises $36M to build universal AI employees for enterprises
People can now speak all languages in their own voice with GalaxyVoice.ai
Bee raises $7M for its wearable AI assistant that learns from your conversations
Noteworthy
Transforming business communication: The power of AI-driven phone calls
FCC to require improved closed captioning accessibility for English and Spanish
UC round table: Conversational intelligence and analytics
This caller does not exist: Using AI to conduct vishing attacks
Deepfakes: The AI scam you didn’t see coming
A guide to AI voice agents for business owners and leaders
TTS and virtual reality agents in primary school classroom environments
AI in business: Elevating CX and energising employees
Science and Demo Corner
ByteDance researchers present cross-language agent
Real-time speech translation on a VoIP number
Guidebook to reduce latency for Azure STT and TTS applications
Audio-powered robots: A new frontier in AI development
VioLA: Conditional language models for STT, TTS, and translation
A Beginner’s guide to TTS algorithms with real-life examples
Speech recognition: Metrics and architecture
Multi-granularity generative error correction with LLM for joint accent and STT
Keyword guided target speech recognition
Tibetan speech synthesis based on pre-trained mixture alignment FastSpeech2
Speaker identification in single-track productions
I've never been so excited to read a newsletter, thanks for it!