Updates from Salesforce, Yellow.ai, Kyutai, Deepgram, Slack and much more 🔥

Davit Baghdasaryan

Sep 23, 2024

Salesforce’s AgentForce: The AI assistants that want to run your entire business.

Top Updates 💪

Yellow.ai launches voice platform for customer service AI agents
Kyutai Labs open sourced on-device speech to speech foundation model Moshi
Tencent’s EzAudio AI transforms text to lifelike sound
Deepgram's groundbreaking voice agent API brings AI to life
Google Gemini Live's AI voice adds new styles that take inspiration from the stars
Slack can use AI to transcribe your Huddle conversations now

11x.ai raises $24M led by Benchmark to build AI digital employees

Fal.ai, which hosts media-generating AI models, raises $23M from a16z and others
AI notetaker Fathom raises $17M
AI startup Rep.ai raises $7.5M to launch ‘digital twin’ sales representatives

Voice AI Podcast 🎙️

In case you missed the latest episode of Voice AI Podcast…

Voice AI Newsletter

The fastest growing Voice Bot startup | Hakob Astabatsyan (CEO & Co-Founder, Synthflow AI)

In The Future of Voice AI series of interviews, I ask three questions to my guests: - What problems do you currently see in Enterprise Voice AI? - How does your company solve these problems? - What solutions do you envision in the next 5 years…

2 years ago · 9 likes · Davit Baghdasaryan

Noteworthy 💪

How companies can use AI to better serve deaf and hard-of-fearing customers
WellSaid Labs: How this company is using AI to drive human parity in voice
Google working on a mobile version of "Take Notes with Gemini" for Meet

Can ChatGPT transcribe audio?
Remove background noise from audio: Techniques & tools
AirPods 4 reviews: Impressive noise cancellation in an open-ear design
W4 Pro: AI interpreter earbuds for real-time translations by Timekettle

Science and Demo Corner 😎

Two Instances of OpenAI’s ChatGPT-4 try to end a conversation, hilarity ensues
Introducing Ascle: A state-of-the-art framework for efficient NLP
A linear-time complexity alternative to self-attention, to streaming STT
ZMM-TTS: Zero-shot multilingual and multispeaker speech synthesis
What’s slowing down TTS systems and how can it be fixed?
Analysis of progress in speech recognition models
OpenAI text-to-speech voice API
Comparative study on the accuracy of STT using a contact microphone
Technical papers 2024: Audio & speech – advances in production
10 best AI voice generator tools for 2024 edition

Discussion about this post

No posts

Ready for more?

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts