In the Future of Voice AI series of interviews, I ask my guests three questions:
- What problems do you currently see in Enterprise Voice AI?
- How does your company solve these problems?
- What solutions do you envision in the next 5 years?
This episode’s guest is Alex Bordanova, Chief Product & Technology Officer at Voicemod.
Alex Bordanova is the Chief Product & Technology Officer at Voicemod, which enables millions of gamers, streamers, and creators to enhance their online interactions.
Before joining Voicemod, Alex served as CTO at the intersection of interactive technology, design, and communication, drawing on his background in Audio Engineering. Now, at Voicemod, he oversees the development of AI-driven voice technology, shaping next-generation real-time audio experiences to redefine how users engage with voice in video games, social communication, and beyond.
Voicemod is the global leader in real-time AI voice transformation and interactive audio, enabling users to change voices as easily as they switch skins.
Takeaways
Voicemod is a real-time voice-changing platform with over 50 million downloads.
They have created official content for Warner Brothers, World of Tanks (WoT), Nvidia, and Angry Birds characters.
Alex sees a gap between visual and audio experiences, and expects it to drive demand for deeper immersion.
Real-time voice alteration can protect vulnerable players from toxic behavior.
Voicemod recently released a device that gives console gamers access to the same voice modifications.
Voicemod’s vibrant community has crafted over 600,000 sounds and 2,000 custom voices, fueling the rapid creation of memes, trends, and cultural references in real time.
User-created sound memes and trends spread instantly, capturing viral moments like the Squid Game craze overnight.
The more content users generate, the easier it is for them to find exactly what they're looking for.
Phoneme-to-phoneme tech must stay ultra low-latency to keep speech natural.
Balancing latency, CPU use, and audio clarity is the toughest engineering puzzle.
Voice and visual avatars may merge into one AI-based system for seamless realism.
Prompt-to-voice tools could let anyone make a brand-new voice from just text or an image.
Licensing IP is one of the biggest business opportunities in entertainment.
Because IP licensing deals with large companies can be drawn out, it's critical to be small and easy to integrate.
Text-to-speech feels more like a commodity, with real-time voice conversion taking center stage.