A year and a half ago, I wrote an article comparing the 8 most important Voice AI problems and their state of readiness.
I thought it was time to revisit the scores.
“Business Pain” represents how important the problem is for the customer. The higher the pain, the more urgent it is for the customer to solve it.
“AI Readiness” represents the industry’s technological readiness to solve the pain.
Below is my Q2 2025 version.
Here is what changed in 1.5 years:
AI Note-Taking went up by 3 points! 🔥🔥🔥
AI Voice Translation went up by 3 points! 🔥🔥🔥
AI Accent Conversion went up by 2 points! 🔥🔥
AI Voice Agents went up by 1 point! 🔥
AI Live Assist/Guidance went down by 1 point 😞
I’ve removed AI Voice Conversion altogether 🤔
Let’s go over the pains one by one.
1) Conversation visibility at scale
You have 100 sales reps (or call center agents) and they receive/place calls all the time trying to close deals or serve customers.
How do you know if they are doing a good job? Just a couple of years ago, this was quite difficult to do. These days, all you need to have is a Conversational Intelligence tool (aka Speech Analytics) and you will get full visibility into every conversation.
Every customer conversation can be recorded, transcribed and summarized for you and the team. These tools are super convenient and there are plenty of them on the market: Cresta, Observe, Gong, Avoma, CallMiner, SalesLoft, etc.
AI Readiness: 10/10, Pain: 10/10
2) Language barrier
The ability to communicate verbally in an effective way is a foundational capability of any team and business. The language barrier has always been one of the top problems for humanity. It takes years for people to learn to speak in a non-native language. The pain is real and any business that has this pain would pay a lot of money to eliminate it.
Imagine a real-time speech-to-speech translation AI that people could use for communication over Zoom, Teams or Krisp. This would be a game-changer.
Krisp, OneMeta, MS Teams, and Google Meet already have solutions for this.
AI Readiness: 6/10, Pain: 10/10
3) Conversations at scale
Many roles require a person to receive or place calls and talk to another human being on the other side: call center agents, sales and business development reps, recruiters, and others.
Maintaining and growing such teams is exceptionally difficult. Businesses need to recruit people, then onboard and retain them.
This is an expensive endeavor. No doubt, Voice AI Agents are taking over and automating some of these functions.
The space is booming, and there are plenty of startups in it.
AI Readiness: 6/10, Pain: 10/10
4) Taking meeting notes
In many companies, there are people dedicated to taking notes in meetings. For many years this was a manual task, but it can now be automated with Voice AI.
Many tools already offer meeting transcription, summary, and follow-up generation.
For now, the quality varies from 60% to 80%. No doubt it will keep improving and the manual work will be fully automated in the coming year or two.
There are already multiple companies doing this.
AI Readiness: 7/10, Pain: 8/10
5) Onboarding and training of associates
Call center agent turnover rate is between 30% and 45%. This means that a huge number of agents leave every year and managers need to find replacements and onboard/train them. This is a costly process and any automation/simplification of the process has a clear ROI.
Similarly, companies that hire a large number of SDRs or AEs need to onboard and train them; otherwise, their sales conversion rates would decrease. Again, a high-ROI endeavor.
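To put the call center turnover range above in concrete terms, here is a minimal Python sketch of the arithmetic. The 100-agent team size is an assumption picked for illustration, and the function name `yearly_replacements` is hypothetical; only the 30% and 45% rates come from the text.

```python
def yearly_replacements(team_size: int, turnover_rate: float) -> int:
    """Agents who leave, and must be replaced, in one year at the given rate."""
    return round(team_size * turnover_rate)


# Assumption: a 100-agent team, chosen only to make the numbers easy to read.
team_size = 100

# The two ends of the turnover range quoted in the text.
low = yearly_replacements(team_size, 0.30)
high = yearly_replacements(team_size, 0.45)

print(f"{team_size} agents: {low} to {high} replacements per year")
```

For a team of that size, this works out to 30 to 45 agents to recruit, onboard and train every year, which is why even a modest reduction in ramp-up time adds up quickly.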
Imagine a bot sitting on an agent’s machine that listens to the customer conversation and gives real-time hints that have historically led to better conversion rates or customer satisfaction. With this technology, the agent ramps up more quickly.
This technology already exists and is called AI Live Assist. Multiple companies have already shipped products with such technology.
AI Readiness: 6/10, Pain: 7/10
6) Accent barrier
As with the language barrier, a speaker’s accent is a serious barrier to understanding and comprehension in business conversations. Nearly all humans have accents when speaking in non-native languages, and accents are extremely difficult to retrain.
Call centers have special training programs for their agents to reduce their accents. The cognitive load and stress on agents for such tasks are intense.
Imagine a Voice AI technology that would, in real time, localize the speaker’s accent to the listener’s accent to improve understanding and comprehension.
Krisp and Sanas have already deployed such technology in the call center industry.
AI Readiness: 8/10, Pain: 7/10
7) Background noises & voices
The problem of background noises and voices in calls has been around for more than 30 years. It distracts the call participants and prevents them from focusing on the core conversation. Background noise also creates constant stress for the speakers.
In call centers, background noise can result in lower customer satisfaction, longer conversations and mental stress for agents.
Luckily, AI-powered Noise Cancellation technology can fully solve this problem. Krisp has pioneered this technology in the industry and has large-scale deployments of it. It removes both background noises and background voices. Zoom, MS Teams and other applications have also invested in such technologies.
AI Readiness: 10/10, Pain: 6/10