Chinese SenseNova 5.5 beats GPT-4o 👀 Updates from a16z, Murf, ElevenLabs, Five9 and others🔥

Davit Baghdasaryan

Jul 22, 2024

Andreessen Horowitz shared their thesis on Scribes (AI meeting notes).

Top Updates 💪

Chinese company SenseTime releases SenseNova 5.5, beats OpenAI’s GPT-4o
Conversational AI market to grow with a striking 22.60% CAGR through 2031
New AI model creates ultra-realistic voices in more than 20 languages
ElevenLabs unveils text-to-speech Turbo 2.5 model with 32 languages
Five9 onboards Einstein AI, achieves landmark Salesforce partner status

Kaizen launches AI-powered voice surveillance
Deepfake-detecting firm Pindrop lands $100M loan to grow its offerings
GenAI powers CallMiner’s post-interaction & real-time summaries
Conversational AI will become 'industry standard' for guidance by 2030
Omilia launches pathfinder to reduce Conversational AI deployment times by 80%
NetSfere integrates with Microsoft Nuance’s Dragon Medical speech recognition

How Parrot AI turns conversations into results
Nvidia AI watermarking patent could help separate real from fake
Voice-enabled Volley secures $55 million in Series C funding round
Reverie sets a new benchmark in STT accuracy for Indian native languages

Noteworthy 💪

Evaluating AI-Powered Conversational Agents as Virtual Health Carers

Patagonia sued for using AI-based software to analyze customer calls
LogOn: AI helps spot audio deepfakes amid election disinformation threat
The evolution of speech recognition: A Sista AI perspective
The promise and peril of ‘agentic AI’
Rise of text-to-speech AI models: Intellectual property issues
Echo AI included in Forrester’s report for AI Analysis Capabilities
Unleashing the power of voice chatbots in customer support
How speech recognition technology enhances business processes in call centers
How conversational intelligence revolutionizes sales meetings
Unlocking the power of data with speech analytics
A tour of “emotionally intelligent” AI

Science and Demo Corner 😎

Listen to the oldest known recording of a human voice
Smart reception: An AI-driven bangla language based receptionist system
MELLE: A continuous-valued tokens-based language modeling approach for TTS

Enhancing CTC-based speech recognition with diverse modeling units
Femtosense introduces the AI-ADAM-100 for efficient AI-based voice processing
Your voice, your way: Explore free AI voice cloning tools

Discussion about this post

No posts

Ready for more?

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts