<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Voice AI Newsletter: Articles]]></title><description><![CDATA[Industry analysis]]></description><link>https://voice-ai-newsletter.krisp.ai/s/industry</link><image><url>https://substackcdn.com/image/fetch/$s_!YLgs!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png</url><title>Voice AI Newsletter: Articles</title><link>https://voice-ai-newsletter.krisp.ai/s/industry</link></image><generator>Substack</generator><lastBuildDate>Tue, 07 Apr 2026 07:28:43 GMT</lastBuildDate><atom:link href="https://voice-ai-newsletter.krisp.ai/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Krisp Technologies]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[krispai@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[krispai@substack.com]]></itunes:email><itunes:name><![CDATA[Davit Baghdasaryan]]></itunes:name></itunes:owner><itunes:author><![CDATA[Davit Baghdasaryan]]></itunes:author><googleplay:owner><![CDATA[krispai@substack.com]]></googleplay:owner><googleplay:email><![CDATA[krispai@substack.com]]></googleplay:email><googleplay:author><![CDATA[Davit Baghdasaryan]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Voice AI Takeaways Worth Carrying Into 2026]]></title><description><![CDATA[2025 produced a lot of AI activity. It also exposed where CX breaks.]]></description><link>https://voice-ai-newsletter.krisp.ai/p/voice-ai-takeaways-worth-carrying</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/voice-ai-takeaways-worth-carrying</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 08 Jan 2026 15:35:54 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!x0Vp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb75b5748-2c8b-4a02-974a-c078ac0780a8_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!x0Vp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb75b5748-2c8b-4a02-974a-c078ac0780a8_1536x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!x0Vp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb75b5748-2c8b-4a02-974a-c078ac0780a8_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!x0Vp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb75b5748-2c8b-4a02-974a-c078ac0780a8_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!x0Vp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb75b5748-2c8b-4a02-974a-c078ac0780a8_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!x0Vp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb75b5748-2c8b-4a02-974a-c078ac0780a8_1536x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!x0Vp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb75b5748-2c8b-4a02-974a-c078ac0780a8_1536x1024.png" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b75b5748-2c8b-4a02-974a-c078ac0780a8_1536x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2112047,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/183567576?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb75b5748-2c8b-4a02-974a-c078ac0780a8_1536x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!x0Vp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb75b5748-2c8b-4a02-974a-c078ac0780a8_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!x0Vp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb75b5748-2c8b-4a02-974a-c078ac0780a8_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!x0Vp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb75b5748-2c8b-4a02-974a-c078ac0780a8_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!x0Vp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb75b5748-2c8b-4a02-974a-c078ac0780a8_1536x1024.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>2025 produced a lot of AI activity. It also exposed where CX breaks.</p><p>As Voice AI moved from pilots to real deployments, the gaps became hard to ignore. Some approaches scaled, others added friction, and many revealed that the hardest problems in CX still live inside live conversations.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>This roundup surfaces the shifts, data, and ideas that matter heading into 2026.</p><div><hr></div><h3>1. The State of Voice AI </h3><p>Voice didn&#8217;t just improve in 2025. It showed the industry where CX actually breaks. This report highlights where legacy systems, weak adoption, and language gaps continue to create cost and inconsistency across contact centers.</p><p><strong>Read this to understand how voice AI is being used today and where teams still struggle to scale it effectively.</strong></p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;7621c5e7-c591-4569-ad89-7d387ca0e51f&quot;,&quot;caption&quot;:&quot;Voice AI in CX: What 819 Leaders Reveal About the Future of Voice&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;2025 State of Voice in CX&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:32916364,&quot;name&quot;:&quot;Davit Baghdasaryan&quot;,&quot;bio&quot;:&quot;CEO &amp; Co-Founder of Krisp, early pioneer in Voice AI.\n20+ years in engineering. 18 US patent applications, ex Twilion&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/23088dde-6cb0-44df-b220-5f22830cdd4c_1179x960.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2025-08-28T14:30:51.846Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/da64818c-0d57-45fd-8784-d1d86b8be1ca_2400x1257.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/p/2025-state-of-voice-in-cx&quot;,&quot;section_name&quot;:&quot;Articles&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:171891807,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:26,&quot;comment_count&quot;:1,&quot;publication_id&quot;:2073467,&quot;publication_name&quot;:&quot;Voice AI Newsletter&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!YLgs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h3>2. 2026 Voice AI Productivity Predictions</h3><p>2026 won&#8217;t be about more AI. It will be about where AI holds up in live conversations and at scale. These predictions focus on where Voice AI reduces friction in real time, improves understanding, and helps agents resolve issues faster without breaking trust.</p><p><strong>Read this to understand what actually drives Voice AI productivity and why clarity, not automation, is the lever that scales.</strong></p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;6fd29fe3-000d-43a5-974b-834b3d2c8f1c&quot;,&quot;caption&quot;:&quot;Voice AI Productivity is entering its execution phase.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;5 Predictions for Voice AI Productivity in 2026&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:32916364,&quot;name&quot;:&quot;Davit Baghdasaryan&quot;,&quot;bio&quot;:&quot;CEO &amp; Co-Founder of Krisp, early pioneer in Voice AI.\n20+ years in engineering. 18 US patent applications, ex Twilion&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/23088dde-6cb0-44df-b220-5f22830cdd4c_1179x960.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2025-11-21T14:03:19.843Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!XaEY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/p/5-predictions-for-voice-ai-productivity&quot;,&quot;section_name&quot;:&quot;Articles&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:179154469,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:13,&quot;comment_count&quot;:4,&quot;publication_id&quot;:2073467,&quot;publication_name&quot;:&quot;Voice AI Newsletter&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!YLgs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h3>3. From Fragmentation to Focus: A Playbook for Eliminating Agent Burnout</h3><p>Burnout is a downstream symptom. The real problem is fragmented CX systems, misaligned metrics, and automation pushed too far.</p><p>This guide breaks down how today&#8217;s contact center stacks create cognitive overload and where purposeful, real-time Voice AI can support agents without removing human judgment from the conversation.</p><p><strong>Read this to understand why burnout shows up operationally and what changes actually reduce agent load.</strong></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://resources.krisp.ai/hubfs/WhitePaper/202601%20Tech_vs._Humanity%20Practicality%20Guide.pdf?utm_source=substack&amp;utm_medium=newsletter&amp;utm_campaign=2026+kickoff&quot;,&quot;text&quot;:&quot;Get the Playbook&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://resources.krisp.ai/hubfs/WhitePaper/202601%20Tech_vs._Humanity%20Practicality%20Guide.pdf?utm_source=substack&amp;utm_medium=newsletter&amp;utm_campaign=2026+kickoff"><span>Get the Playbook</span></a></p><div><hr></div><h2>In the News</h2><h3>4. Why 95% of AI Pilots Fail and What the 5% Do Differently</h3><p>This isn&#8217;t about models falling short. It&#8217;s about <strong>how AI is deployed</strong>. The companies that succeed aren&#8217;t leading with autonomous, customer-facing agents. They&#8217;re starting with co-pilots that augment humans, where trust is built into the workflow. This piece lays out what separates experimentation from production.</p><p><strong>Read this to understand why copilots scale, autonomous agents stall, and why deployment choices matter more than ambition.</strong></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.forbes.com/sites/tarungalagali/2025/10/28/why-95-of-ai-pilots-fail-and-what-the-5-do-differently/&quot;,&quot;text&quot;:&quot;Read the full story&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.forbes.com/sites/tarungalagali/2025/10/28/why-95-of-ai-pilots-fail-and-what-the-5-do-differently/"><span>Read the full story</span></a></p><div><hr></div><h3>5. The State of CX: What 2025 Taught Us</h3><p>Customers now expect more than efficiency. In 2025, AI moved from experiment to expectation, but the gap between brand promise and delivery widened. Teams advanced automation, personalization, and data use, but cracks showed up in empathy, voice of the customer, and emotional connection.</p><p><strong>Read this to understand where CX investments outpaced real customer experience and why connection still matters as much as capability.</strong></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.forbes.com/sites/tarungalagali/2025/10/28/why-95-of-ai-pilots-fail-and-what-the-5-do-differently/&quot;,&quot;text&quot;:&quot;Read the full story&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.forbes.com/sites/tarungalagali/2025/10/28/why-95-of-ai-pilots-fail-and-what-the-5-do-differently/"><span>Read the full story</span></a></p><div><hr></div><h3>6. 48% of CX Leaders Plan to Access AI via BPO Partners</h3><p>As Voice AI adoption accelerates, BPOs are playing a larger role in making AI usable at scale and are often better positioned to operationalize it inside live contact center environments. Rather than building everything in-house, teams are turning to BPOs to reduce risk, speed deployment, and embed AI into real workflows.</p><p><strong>Read this to understand why Voice AI adoption is shifting toward partners who can operationalize it, not just vendors who sell it.</strong></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://360magazine.com/2025/08/27/48-of-cx-leaders-plan-to-access-ai-via-bpo-partners/&quot;,&quot;text&quot;:&quot;Read the full story&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://360magazine.com/2025/08/27/48-of-cx-leaders-plan-to-access-ai-via-bpo-partners/"><span>Read the full story</span></a></p><div><hr></div><h2>Worth Watching</h2><h3>7. Accent Conversion&#8217;s 85+ NPS Impact </h3><p>A concrete example of what happens when you remove friction from voice conversations at scale. The outcome wasn&#8217;t marginal. It was structural.</p><p><strong>Watch this to see how improving comprehension in real time can drive measurable CX outcomes, not marginal gains.</strong></p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;00df8a0e-8fff-42d7-ae92-fdc6022e23fe&quot;,&quot;caption&quot;:&quot;In this special edition of the Future of Voice AI series of interviews, we're joined by industry vets to unpack: - How clarity became a measurable KPI for CX quality and trust - How TTEC identified and solved global voice challenges across regions - Real results: customer satisfaction, agent confidence, cost efficiency improvements and more&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Accent AI&#8217;s 85+ NPS Impact in India | James Bednar and Biju Pillai (TTEC)&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:32916364,&quot;name&quot;:&quot;Davit Baghdasaryan&quot;,&quot;bio&quot;:&quot;CEO &amp; Co-Founder of Krisp, early pioneer in Voice AI.\n20+ years in engineering. 18 US patent applications, ex Twilion&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/23088dde-6cb0-44df-b220-5f22830cdd4c_1179x960.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2025-11-06T15:35:08.685Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f812d60a-00fa-4d60-8675-495eac61b55b_1721x965.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/p/accent-ais-80-nps-impact-in-india&quot;,&quot;section_name&quot;:&quot;Podcast&quot;,&quot;video_upload_id&quot;:&quot;3220c462-aa76-4e78-b1c4-f0162485d44d&quot;,&quot;id&quot;:178026252,&quot;type&quot;:&quot;podcast&quot;,&quot;reaction_count&quot;:12,&quot;comment_count&quot;:0,&quot;publication_id&quot;:2073467,&quot;publication_name&quot;:&quot;Voice AI Newsletter&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!YLgs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h3>8. Inside the Data: The State of Voice in CX Unpacked</h3><p>The session also digs into where deployments fail, why overpromising slows adoption, and how measurable outcomes are replacing futuristic demos as the bar for investment.</p><p><strong>Watch this to understand where voice AI is delivering real value today and why pragmatism, not hype, is driving the next phase of CX adoption.</strong></p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;b87e299f-50d1-413a-a213-1389c9010a5c&quot;,&quot;caption&quot;:&quot;In the Future of Voice AI series of interviews, I ask three questions to my guests: - What problems do you currently see in Enterprise Voice AI? - How does your company solve these problems? - What solutions do you envision in the next 5 years?&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Inside the Data: The State of Voice in CX Unpacked | Peter Ryan ( Ryan Strategic Advisory)&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:32916364,&quot;name&quot;:&quot;Davit Baghdasaryan&quot;,&quot;bio&quot;:&quot;CEO &amp; Co-Founder of Krisp, early pioneer in Voice AI.\n20+ years in engineering. 18 US patent applications, ex Twilion&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/23088dde-6cb0-44df-b220-5f22830cdd4c_1179x960.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2025-09-04T14:25:40.165Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4f0aeb0c-3293-42f8-a50e-9f77a9b78bd8_1165x776.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/p/inside-the-data-the-state-of-voice&quot;,&quot;section_name&quot;:&quot;Podcast&quot;,&quot;video_upload_id&quot;:&quot;435183ab-cf9f-4912-87f0-647c9f35a6ad&quot;,&quot;id&quot;:171576167,&quot;type&quot;:&quot;podcast&quot;,&quot;reaction_count&quot;:27,&quot;comment_count&quot;:0,&quot;publication_id&quot;:2073467,&quot;publication_name&quot;:&quot;Voice AI Newsletter&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!YLgs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h3>9. The Rise of Voice Productivity</h3><p>A conversation on why voice remains the highest-stakes channel in CX and how teams are redefining productivity beyond speed alone. Voice productivity is about reducing friction in live conversations. When clarity improves, teams see fewer repeats, faster resolution, and better outcomes for both customers and agents.</p><p><strong>Watch this to understand why clarity, not automation, is the real driver of voice productivity.</strong></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://slator.com/the-rise-of-voice-productivity-with-krisp-ceo-davit-baghdasaryan/&quot;,&quot;text&quot;:&quot;Watch now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://slator.com/the-rise-of-voice-productivity-with-krisp-ceo-davit-baghdasaryan/"><span>Watch now</span></a></p><div><hr></div><h2>10. What We&#8217;re Carrying Into 2026</h2><ul><li><p><strong>Voice quality drives outcomes.</strong> When conversations break down, everything slows: resolution, satisfaction, and agent capacity. Teams that invest in clear, reliable conversations resolve issues faster, protect CSAT, and give agents more capacity to do real work.</p></li><li><p><strong>Language and accent friction is expensive.</strong> The cost shows up in repetition, longer calls, and churn long before it shows up in reports. When understanding improves, calls shorten, repeats drop, and loyalty increases across global customer bases.</p></li><li><p><strong>AI creates value in the moment.</strong> The biggest gains come from supporting agents during live interactions, not from adding more layers of automation. When AI reduces friction in real time, agents stay focused, errors drop, and customers get to resolution faster.</p></li></ul><p>If 2026 is about anything, it&#8217;s this: CX improves when conversations get easier for the people on both sides of the call.</p><p><strong>Voice AI in 2026 is about real-time clarity, not more tools.</strong></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[5 Predictions for Voice AI Productivity in 2026]]></title><description><![CDATA[Voice AI Productivity is entering its execution phase.]]></description><link>https://voice-ai-newsletter.krisp.ai/p/5-predictions-for-voice-ai-productivity</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/5-predictions-for-voice-ai-productivity</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Fri, 21 Nov 2025 14:03:19 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!XaEY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Voice AI Productivity is entering its execution phase.</p><p>Over the past year, we&#8217;ve seen a sharp rise in real-time voice adoption across contact centers, collaboration tools, and consumer AI. More than <strong>80 percent of contact center leaders</strong> now list AI productivity as a top priority. Models are faster, latency is dropping, and the gap between human and machine interactions is shrinking.</p><p>At Krisp, we process <strong>over 80B minutes of voice every month</strong> and work closely with enterprises, BPOs, and developers building the next generation of voice-driven applications. These predictions come from what we see in the market, the problems customers are trying to solve, and the technology that&#8217;s now mature enough to deploy at scale.</p><p>Here&#8217;s where Voice AI Productivity is headed in 2026.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe to receive new posts and the podcast.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XaEY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XaEY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!XaEY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!XaEY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!XaEY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XaEY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:622689,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/179154469?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!XaEY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!XaEY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!XaEY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!XaEY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb36421de-f26a-4768-8d5a-225faeea9f81_1920x1080.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>1. Call center agent copilots become the new standard</strong></h2><p>AI copilots move from early adoption to everyday use.</p><p>In 2025, real-time call guidance grew faster than any other contact center AI tool. In 2026, copilots will be part of every agent&#8217;s workflow, giving them live prompts, better clarity, and instant context.</p><p><strong>This shift is driven by three forces:</strong></p><ul><li><p>Rising cost pressure on service teams</p></li><li><p>Higher expectations from customers</p></li><li><p>Lower cost and higher quality of real-time STT</p></li></ul><p>By the end of 2026, many global call centers will deploy copilots across entire programs, not just pilot groups. </p><p>The impact is fewer escalations, shorter handle times, and more consistent service from every agent.</p><h2><strong>2. Language barriers disappear at work</strong></h2><p>We enter the first real year of &#8220;global teams sounding local.&#8221;</p><p>Real-time accent conversion and language translation get embedded directly into meeting tools, contact center platforms, and collaboration apps. Latency drops. Accuracy improves. And companies start expecting voice clarity as a built-in feature, not an add-on.</p><p>Today, <strong>language issues contribute to over 20 percent of repeated calls</strong> in offshore teams. With better translation and accent clarity, clarity becomes the new driver of global productivity.</p><p><strong>The impact is simple:</strong></p><ul><li><p>Fewer misunderstandings</p></li><li><p>Faster resolutions</p></li><li><p>Improved and deepened customer trust</p></li><li><p>A more inclusive workplace</p></li></ul><p>By 2026, most major communication tools will ship with real-time translation or accent clarity as standard capabilities.</p><h2><strong>3. Agent coaching moves from training rooms to live calls</strong></h2><p>The model for agent development flips.</p><p>Traditional coaching cycles rely on manual scorecards, delayed feedback, and hours of formal training. In 2026, the first layer of coaching will happen inside the call itself.</p><p><strong>Real-time AI will:</strong></p><ul><li><p>Provide next-best-action prompts</p></li><li><p>Catch compliance violations</p></li><li><p>Offer guidance on tone or empathy</p></li></ul><p>Early data from call centers using real-time coaching shows <strong>10 to 20 percent improvements in AHT and CSAT</strong> within weeks (based on aggregated benchmarks from Krisp deployments and public case studies from Cresta, Cogito, and Balto). </p><p>Companies will shift training budgets from retraining to reinforcement because AI now handles the micro-corrections that drive performance.</p><h2><strong>4. Call Centers turn voice data into a new revenue line</strong></h2><p>2026 is the turning point for outsourced contact centers.</p><p>Historically, Call Center BPOs haven&#8217;t had full access to transcripts or voice data due to security restrictions. With privacy-safe architectures like Krisp&#8217;s, BPOs can finally apply AI across every seat without moving or storing sensitive audio.</p><p><strong>This unlocks a new business model:</strong></p><ul><li><p>Real-time speech analytics</p></li><li><p>Predictive insights for clients</p></li><li><p>Program-level performance dashboards</p></li><li><p>New premium-tier service offerings</p></li></ul><p>BPOs shift from cost-driven execution to insight-driven value. By the end of 2026, the top global BPOs have speech analytics and AI-enhanced insights as core differentiators.</p><h2><strong>5. Meeting productivity shifts from note-taking to real-time intelligence</strong></h2><p>AI meeting assistants grow up.</p><p>Millions already use AI note-takers, but transcription alone is no longer enough. Teams want accurate insights while the meeting is happening, not after the fact.</p><p>In 2026, AI becomes the second brain in every meeting.</p><p><strong>It will:</strong></p><ul><li><p>Detect decisions as they&#8217;re made</p></li><li><p>Track action items across teams</p></li><li><p>Summarize long discussions into structured, usable outputs</p></li><li><p>Improve clarity for global teams with localization, insights, and context</p></li></ul><p>The shift is already underway. Platforms already process millions of summaries per month. This pace only accelerates as real-time intelligence becomes the baseline expectation.</p><div><hr></div><h2><strong>The bottom line</strong></h2><p>2026 is the year Voice AI becomes infrastructure.</p><p>The winners will be the teams that focus on measurable outcomes, not experiments. Real-time clarity, copilots, translation, and meeting intelligence are no longer emerging areas. They are becoming required capabilities for modern work.</p><p>Companies that invest now will transform their service quality, workforce productivity, and global collaboration. Companies that wait will fall behind and fast.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive the weekly digest, research, and the podcast.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Africa’s Voice AI Moment]]></title><description><![CDATA[Accent Conversion arrives to support a global CX hub]]></description><link>https://voice-ai-newsletter.krisp.ai/p/africas-voice-ai-moment</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/africas-voice-ai-moment</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 23 Oct 2025 14:02:36 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!d2MG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f149a68-e621-4892-b14f-02c821298620_2401x1255.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d2MG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f149a68-e621-4892-b14f-02c821298620_2401x1255.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d2MG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f149a68-e621-4892-b14f-02c821298620_2401x1255.png 424w, https://substackcdn.com/image/fetch/$s_!d2MG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f149a68-e621-4892-b14f-02c821298620_2401x1255.png 848w, https://substackcdn.com/image/fetch/$s_!d2MG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f149a68-e621-4892-b14f-02c821298620_2401x1255.png 1272w, https://substackcdn.com/image/fetch/$s_!d2MG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f149a68-e621-4892-b14f-02c821298620_2401x1255.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d2MG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f149a68-e621-4892-b14f-02c821298620_2401x1255.png" width="1456" height="761" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3f149a68-e621-4892-b14f-02c821298620_2401x1255.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:761,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:5592708,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/176263421?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f149a68-e621-4892-b14f-02c821298620_2401x1255.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!d2MG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f149a68-e621-4892-b14f-02c821298620_2401x1255.png 424w, https://substackcdn.com/image/fetch/$s_!d2MG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f149a68-e621-4892-b14f-02c821298620_2401x1255.png 848w, https://substackcdn.com/image/fetch/$s_!d2MG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f149a68-e621-4892-b14f-02c821298620_2401x1255.png 1272w, https://substackcdn.com/image/fetch/$s_!d2MG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f149a68-e621-4892-b14f-02c821298620_2401x1255.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>When people talk about AI and automation, they often forget where customer experience truly happens &#8212; in conversation.</p><p>Voice is still how trust is built, problems are solved, and loyalty is earned.</p><p>That&#8217;s why the launch of <strong>Accent Conversion for Africa</strong> is about more than product.</p><p>It&#8217;s a signal of how Voice AI is reshaping global communication, from the sound of support to the scale of inclusion.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h3><strong>A rising CX powerhouse</strong></h3><p>Africa is quietly becoming one of the most important regions for CX support.</p><ul><li><p>Global enterprises are setting up operations across <strong>South Africa, Kenya, Uganda, and Nigeria</strong>.</p></li><li><p>The workforce is young, skilled, and speaks English.</p></li><li><p>Cultural alignment with Western markets is strong.</p></li></ul><p>But even in the best contact centers, accent gaps still cause friction: longer calls and missed opportunities.</p><div><hr></div><h3><strong>Technology that changes the conversation</strong></h3><p>Krisp Accent Conversion for Africa bridges those gaps in real time:</p><ul><li><p>Converts African English accents to a neutral American English</p></li><li><p>Keeps each speaker&#8217;s <strong>authentic voice and tone</strong> intact</p></li><li><p>Runs securely on-device with no lag or data exposure</p></li></ul><p>This means agents don&#8217;t have to change who they are to be understood.</p><p>Clarity becomes universal, not cultural.</p><div><hr></div><h3><strong>Why this matters globally</strong></h3><p>Accent Conversion is part of a broader shift in <strong>Voice AI infrastructure</strong>, one that makes the voice channel more inclusive and scalable.</p><p>It removes the need for accent-neutralization training, saving time, cutting costs, and expanding access to high-value CX jobs across Africa. It also eliminates the cognitive load offshore agents carry to do their day-to-day jobs.</p><p>It&#8217;s the foundation of a more connected and fair voice economy where:</p><ul><li><p><strong>Talent</strong> isn&#8217;t limited by geography or accent</p></li><li><p><strong>Companies</strong> can hire globally without communication barriers</p></li><li><p><strong>Customers</strong> experience clarity, empathy, and speed in every call</p></li></ul><p>When every call is understood the first time, efficiency and satisfaction rise together.</p><div><hr></div><h3><strong>A broader shift in global CX</strong></h3><p>Africa&#8217;s CX evolution mirrors what we&#8217;ve seen in India, the Philippines, Pakistan, and Latin America: a combination of human skill, linguistic diversity, and now, expanded reach with voice AI.</p><p>With <strong>Accent Conversion</strong>, <strong>Noise Cancellation</strong>, and <strong>Voice Translation</strong> working together, Krisp is building the <strong>core Voice AI layer</strong> for the modern contact center to help African BPOs compete with the world&#8217;s leading hubs.</p><p>Even in an AI-driven industry, it&#8217;s people who build trust, and AI that ensures they&#8217;re heard clearly.</p><div><hr></div><h3><strong>What&#8217;s next</strong></h3><p>Voice AI is entering an era where clarity, empathy, and speed define great customer experiences.</p><p>Accent Conversion for Africa marks another step toward frictionless voice.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://krisp.ai/blog/category/enterprise/ai-accent-conversion/&quot;,&quot;text&quot;:&quot;Read the Full Announcement&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://krisp.ai/blog/category/enterprise/ai-accent-conversion/"><span>Read the Full Announcement</span></a></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Takeaways from VapiCon 2025]]></title><description><![CDATA[On Oct 2nd, I attended the very first VapiCon organized by Vapi.]]></description><link>https://voice-ai-newsletter.krisp.ai/p/takeaways-from-vapicon-2025</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/takeaways-from-vapicon-2025</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 09 Oct 2025 18:19:29 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!TcZJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F643ab499-c7c6-4475-a968-0e52b633601d_982x730.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TcZJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F643ab499-c7c6-4475-a968-0e52b633601d_982x730.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TcZJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F643ab499-c7c6-4475-a968-0e52b633601d_982x730.png 424w, https://substackcdn.com/image/fetch/$s_!TcZJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F643ab499-c7c6-4475-a968-0e52b633601d_982x730.png 848w, https://substackcdn.com/image/fetch/$s_!TcZJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F643ab499-c7c6-4475-a968-0e52b633601d_982x730.png 1272w, https://substackcdn.com/image/fetch/$s_!TcZJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F643ab499-c7c6-4475-a968-0e52b633601d_982x730.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TcZJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F643ab499-c7c6-4475-a968-0e52b633601d_982x730.png" width="700" height="520.3665987780041" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/643ab499-c7c6-4475-a968-0e52b633601d_982x730.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:730,&quot;width&quot;:982,&quot;resizeWidth&quot;:700,&quot;bytes&quot;:81508,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/175576579?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F643ab499-c7c6-4475-a968-0e52b633601d_982x730.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!TcZJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F643ab499-c7c6-4475-a968-0e52b633601d_982x730.png 424w, https://substackcdn.com/image/fetch/$s_!TcZJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F643ab499-c7c6-4475-a968-0e52b633601d_982x730.png 848w, https://substackcdn.com/image/fetch/$s_!TcZJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F643ab499-c7c6-4475-a968-0e52b633601d_982x730.png 1272w, https://substackcdn.com/image/fetch/$s_!TcZJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F643ab499-c7c6-4475-a968-0e52b633601d_982x730.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>On Oct 2nd, I attended the very first VapiCon organized by <a href="https://vapi.ai/">Vapi</a>.</p><p>This was by far the largest Voice AI gathering so far. Not my words, but pretty much everyone was asserting this.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/Speechmatics/status/1975187697817022856" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wq5O!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbf14cb4-f610-4da0-98f8-39b20e2ea19d_1182x1344.png 424w, https://substackcdn.com/image/fetch/$s_!wq5O!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbf14cb4-f610-4da0-98f8-39b20e2ea19d_1182x1344.png 848w, https://substackcdn.com/image/fetch/$s_!wq5O!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbf14cb4-f610-4da0-98f8-39b20e2ea19d_1182x1344.png 1272w, https://substackcdn.com/image/fetch/$s_!wq5O!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbf14cb4-f610-4da0-98f8-39b20e2ea19d_1182x1344.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wq5O!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbf14cb4-f610-4da0-98f8-39b20e2ea19d_1182x1344.png" width="1182" height="1344" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dbf14cb4-f610-4da0-98f8-39b20e2ea19d_1182x1344.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1344,&quot;width&quot;:1182,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:661608,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/Speechmatics/status/1975187697817022856&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/175576579?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbf14cb4-f610-4da0-98f8-39b20e2ea19d_1182x1344.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wq5O!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbf14cb4-f610-4da0-98f8-39b20e2ea19d_1182x1344.png 424w, https://substackcdn.com/image/fetch/$s_!wq5O!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbf14cb4-f610-4da0-98f8-39b20e2ea19d_1182x1344.png 848w, https://substackcdn.com/image/fetch/$s_!wq5O!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbf14cb4-f610-4da0-98f8-39b20e2ea19d_1182x1344.png 1272w, https://substackcdn.com/image/fetch/$s_!wq5O!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbf14cb4-f610-4da0-98f8-39b20e2ea19d_1182x1344.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I heard the demand turned out to be much bigger than they could fit.</p><p>The keynote speakers did a great job laying out the state of Voice AI.</p><p>Big kudos to Jordan@Vapi, Scott@Deepgram, Justin@OpenAI and Dylan@Assembly.</p><p>Also big kudos to the event staff. They were very caring and accommodating.</p><p>Unfortunately, I had to miss a lot of sessions but here are my takeaways from the ones I did attend:</p><ul><li><p>The Voice AI community is vibrant and energized. It&#8217;s rare to see so many young folks focused on an industry. The future is clearly being built here.</p></li><li><p>I think most of the attendees were developers from startups. I estimate that there are between 500-1000 startups in Voice AI now.</p></li><li><p>$2B+ has been invested in startups since 2024.</p></li><li><p>STT accuracy, turn-taking, speaker-separation, latency, TTS accuracy, hallucinations, function-calling, lack-of-context are the main technical challenges the industry is trying to solve.</p></li><li><p>Most of these issues have significantly improved in the last 2 years but they all STILL remain big challenges (except maybe latency?)</p></li><li><p>There is a big debate about speech-to-speech (S2S) models and the Cascading approach (STT&#8594;LLM&#8594;TTS). There are clear pros and cons here.</p></li><li><p>The vast majority of the deployed technology uses the Cascading approach</p></li><li><p>It is assumed that most of the technical problems will go away once S2S models mature, but this is yet to be seen.</p></li></ul><p>While the event was super energized, I couldn&#8217;t help but think about a topic that was in the air. I think people are NOT talking about this enough, while everyone does know about it.</p><blockquote><p><strong>Most Voice AI Agents in production are still quite unstable today. <br>And this SLOWS the industry down.</strong></p></blockquote><p>I think there are 2 main reasons why they keep failing:</p><ol><li><p><strong>The real world is complex:</strong> there is too much context and too many edge cases out there, and AI agents simply don&#8217;t know how to handle these yet. In contrast, people are really good at this because of our experience, context and multi-modality.<br><em>During VapiCon, I witnessed several live demos fail because of these exact issues&#8212;background noise, hallucinations, and AI agents not taking turns properly. It was a clear reminder that even advanced systems still struggle when faced with messy, unpredictable real-world conditions.</em></p></li><li><p><strong>STT is failing us</strong>: In most cases STTs do a great job of capturing what people said. However, the error rate of properly capturing <strong>numbers, emails, addresses</strong>, and <strong>nouns</strong> remains quite high. This might seem like a small issue but it&#8217;s not, especially in B2B use cases. Perhaps it&#8217;s the <strong>most important technical problem</strong> today, preventing the industry to grow. I really do hope this will get better in 2026.</p></li></ol><p>I estimate the Voice AI Agents traffic to be around <strong>3B/mins/month</strong> today. </p><p>If we solve the above two problems, the traffic will skyrocket &#128640;</p><p>Here is to reaching <strong>100B/mins/month</strong> by next VapiCon!</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[2025 State of Voice in CX]]></title><description><![CDATA[Findings based on insights from 800+ CX leaders]]></description><link>https://voice-ai-newsletter.krisp.ai/p/2025-state-of-voice-in-cx</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/2025-state-of-voice-in-cx</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 28 Aug 2025 14:30:51 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/da64818c-0d57-45fd-8784-d1d86b8be1ca_2400x1257.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cz4k!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb9db75-85e5-4a3e-a527-ee673d504d02_1201x501.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cz4k!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb9db75-85e5-4a3e-a527-ee673d504d02_1201x501.png 424w, https://substackcdn.com/image/fetch/$s_!cz4k!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb9db75-85e5-4a3e-a527-ee673d504d02_1201x501.png 848w, https://substackcdn.com/image/fetch/$s_!cz4k!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb9db75-85e5-4a3e-a527-ee673d504d02_1201x501.png 1272w, https://substackcdn.com/image/fetch/$s_!cz4k!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb9db75-85e5-4a3e-a527-ee673d504d02_1201x501.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cz4k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb9db75-85e5-4a3e-a527-ee673d504d02_1201x501.png" width="1201" height="501" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bb9db75-85e5-4a3e-a527-ee673d504d02_1201x501.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:501,&quot;width&quot;:1201,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:108399,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/171891807?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb9db75-85e5-4a3e-a527-ee673d504d02_1201x501.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cz4k!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb9db75-85e5-4a3e-a527-ee673d504d02_1201x501.png 424w, https://substackcdn.com/image/fetch/$s_!cz4k!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb9db75-85e5-4a3e-a527-ee673d504d02_1201x501.png 848w, https://substackcdn.com/image/fetch/$s_!cz4k!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb9db75-85e5-4a3e-a527-ee673d504d02_1201x501.png 1272w, https://substackcdn.com/image/fetch/$s_!cz4k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb9db75-85e5-4a3e-a527-ee673d504d02_1201x501.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>Voice AI in CX: What 819 Leaders Reveal About the Future of Voice</h1><p>Contact centers are under more pressure than ever. Rising labor costs, demanding customers, and the complexity of serving global audiences are forcing operators to rethink their playbook. Krisp, in partnership with Ryan Strategic Advisory, surveyed <strong>819 enterprise CX leaders</strong> and one thing is clear: AI isn&#8217;t a &#8220;someday&#8221; investment&#8212;it&#8217;s a <strong>requirement in 2025</strong>.</p><p>This research highlights how enterprises are navigating the transition from legacy systems to AI-powered voice tools, and what risks they face if they wait too long.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe to receive new posts and the podcast.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div><hr></div><h2>Most are still stuck in transition</h2><p>The data shows a sharp divide. Many contact centers have adopted modern AI solutions like <strong>noise cancellation software, translation, or accent conversion</strong>, but most still rely heavily on <strong>labor-intensive fixes</strong> like human translators or expensive hardware.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!C2QX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25965d70-5b32-4043-8c65-86964eba44c9_1527x826.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!C2QX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25965d70-5b32-4043-8c65-86964eba44c9_1527x826.png 424w, https://substackcdn.com/image/fetch/$s_!C2QX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25965d70-5b32-4043-8c65-86964eba44c9_1527x826.png 848w, https://substackcdn.com/image/fetch/$s_!C2QX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25965d70-5b32-4043-8c65-86964eba44c9_1527x826.png 1272w, https://substackcdn.com/image/fetch/$s_!C2QX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25965d70-5b32-4043-8c65-86964eba44c9_1527x826.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!C2QX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25965d70-5b32-4043-8c65-86964eba44c9_1527x826.png" width="1456" height="788" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/25965d70-5b32-4043-8c65-86964eba44c9_1527x826.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:788,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:81377,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/171891807?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25965d70-5b32-4043-8c65-86964eba44c9_1527x826.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!C2QX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25965d70-5b32-4043-8c65-86964eba44c9_1527x826.png 424w, https://substackcdn.com/image/fetch/$s_!C2QX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25965d70-5b32-4043-8c65-86964eba44c9_1527x826.png 848w, https://substackcdn.com/image/fetch/$s_!C2QX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25965d70-5b32-4043-8c65-86964eba44c9_1527x826.png 1272w, https://substackcdn.com/image/fetch/$s_!C2QX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25965d70-5b32-4043-8c65-86964eba44c9_1527x826.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>That patchwork approach creates hidden costs:</p><ul><li><p>Customer experience suffers from slow or inconsistent service</p></li><li><p>Agents juggle tools instead of focusing on conversations</p></li><li><p>Quality varies across regions and languages</p></li><li><p>Costs stay high as human services and hardware pile up</p></li></ul><div><hr></div><h2>A wave of new adoption is coming</h2><p>The next 6&#8211;12 months will be decisive. Large portions of CX leaders plan to roll out <strong>AI accent reduction, translation, and noise suppression</strong> in the short term. This signals a market ready to modernize voice infrastructure fast.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1VSL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f677190-27ec-4844-a836-fa7d27ba9a66_1451x591.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1VSL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f677190-27ec-4844-a836-fa7d27ba9a66_1451x591.png 424w, https://substackcdn.com/image/fetch/$s_!1VSL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f677190-27ec-4844-a836-fa7d27ba9a66_1451x591.png 848w, https://substackcdn.com/image/fetch/$s_!1VSL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f677190-27ec-4844-a836-fa7d27ba9a66_1451x591.png 1272w, https://substackcdn.com/image/fetch/$s_!1VSL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f677190-27ec-4844-a836-fa7d27ba9a66_1451x591.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1VSL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f677190-27ec-4844-a836-fa7d27ba9a66_1451x591.png" width="1451" height="591" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8f677190-27ec-4844-a836-fa7d27ba9a66_1451x591.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:591,&quot;width&quot;:1451,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:53041,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/171891807?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f677190-27ec-4844-a836-fa7d27ba9a66_1451x591.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1VSL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f677190-27ec-4844-a836-fa7d27ba9a66_1451x591.png 424w, https://substackcdn.com/image/fetch/$s_!1VSL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f677190-27ec-4844-a836-fa7d27ba9a66_1451x591.png 848w, https://substackcdn.com/image/fetch/$s_!1VSL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f677190-27ec-4844-a836-fa7d27ba9a66_1451x591.png 1272w, https://substackcdn.com/image/fetch/$s_!1VSL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f677190-27ec-4844-a836-fa7d27ba9a66_1451x591.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>But while some are preparing to leap ahead, others are holding back, waiting for &#8220;perfect&#8221; solutions. That hesitation could prove costly.</p><div><hr></div><h2>BPOs are the shortcut&#8212;for now</h2><p>Nearly half of enterprises say they&#8217;ll access AI through <strong>BPO partners</strong> in the near term. It&#8217;s seen as the fastest path forward, but it comes with trade-offs: less control, less flexibility, and higher long-term risk.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!espi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf517290-5135-4f54-ae25-171cc46f5835_1463x898.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!espi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf517290-5135-4f54-ae25-171cc46f5835_1463x898.png 424w, https://substackcdn.com/image/fetch/$s_!espi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf517290-5135-4f54-ae25-171cc46f5835_1463x898.png 848w, https://substackcdn.com/image/fetch/$s_!espi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf517290-5135-4f54-ae25-171cc46f5835_1463x898.png 1272w, https://substackcdn.com/image/fetch/$s_!espi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf517290-5135-4f54-ae25-171cc46f5835_1463x898.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!espi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf517290-5135-4f54-ae25-171cc46f5835_1463x898.png" width="1456" height="894" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/df517290-5135-4f54-ae25-171cc46f5835_1463x898.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:894,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:54828,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/171891807?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf517290-5135-4f54-ae25-171cc46f5835_1463x898.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!espi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf517290-5135-4f54-ae25-171cc46f5835_1463x898.png 424w, https://substackcdn.com/image/fetch/$s_!espi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf517290-5135-4f54-ae25-171cc46f5835_1463x898.png 848w, https://substackcdn.com/image/fetch/$s_!espi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf517290-5135-4f54-ae25-171cc46f5835_1463x898.png 1272w, https://substackcdn.com/image/fetch/$s_!espi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf517290-5135-4f54-ae25-171cc46f5835_1463x898.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Those that invest directly in AI will be better positioned to reduce costs and build sustainable, differentiated customer experiences.</p><div><hr></div><h2>Budgets tell the real story</h2><p>The survey also reveals where the pain is sharpest:</p><ul><li><p><strong>Over 50%</strong> of all voice budgets still go to <strong>onshore live agents</strong></p></li><li><p>Offshore staffing, automation, and AI account for much smaller shares</p></li></ul><p>This imbalance shows how much potential savings and efficiency are still on the table if AI can close the accent and language gap.</p><div><hr></div><h2>What this means for 2025</h2><ul><li><p>AI is entering a <strong>mainstream deployment phase</strong>&#8212;waiting is no longer safe.</p></li><li><p><strong>Accent and language barriers</strong> remain the toughest challenge.</p></li><li><p><strong>BPOs are filling the AI gap</strong>, but enterprises that own their AI future will win.</p></li><li><p><strong>Onshore agents drive the cost burden</strong>, making offshore + AI an attractive path forward.</p></li></ul><p>AI in CX is no longer hype. It&#8217;s a practical tool that&#8217;s already reshaping budgets, strategies, and outcomes. Leaders who act now stand to gain speed, lower costs, and a lasting competitive edge.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://resources.krisp.ai/hubfs/Ebooks/2025%20State%20of%20Voice%20in%20CX.pdf&quot;,&quot;text&quot;:&quot;Read the full report&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://resources.krisp.ai/hubfs/Ebooks/2025%20State%20of%20Voice%20in%20CX.pdf"><span>Read the full report</span></a></p><p></p><div><hr></div><h2><strong>Next week on the podcast</strong> </h2><p>Ryan Strategic Advisory takes us inside the data on Voice AI adoption to unpack what&#8217;s real and what&#8217;s hype. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5pnj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8445ac10-81a1-4866-b0de-eeeed6e3feed_1165x776.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5pnj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8445ac10-81a1-4866-b0de-eeeed6e3feed_1165x776.png 424w, https://substackcdn.com/image/fetch/$s_!5pnj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8445ac10-81a1-4866-b0de-eeeed6e3feed_1165x776.png 848w, https://substackcdn.com/image/fetch/$s_!5pnj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8445ac10-81a1-4866-b0de-eeeed6e3feed_1165x776.png 1272w, https://substackcdn.com/image/fetch/$s_!5pnj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8445ac10-81a1-4866-b0de-eeeed6e3feed_1165x776.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5pnj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8445ac10-81a1-4866-b0de-eeeed6e3feed_1165x776.png" width="1165" height="776" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8445ac10-81a1-4866-b0de-eeeed6e3feed_1165x776.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:776,&quot;width&quot;:1165,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:300268,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/171891807?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8445ac10-81a1-4866-b0de-eeeed6e3feed_1165x776.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5pnj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8445ac10-81a1-4866-b0de-eeeed6e3feed_1165x776.png 424w, https://substackcdn.com/image/fetch/$s_!5pnj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8445ac10-81a1-4866-b0de-eeeed6e3feed_1165x776.png 848w, https://substackcdn.com/image/fetch/$s_!5pnj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8445ac10-81a1-4866-b0de-eeeed6e3feed_1165x776.png 1272w, https://substackcdn.com/image/fetch/$s_!5pnj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8445ac10-81a1-4866-b0de-eeeed6e3feed_1165x776.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive the weekly digest, research, and the podcast.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Krisp Accent Conversion v3.7, Major Leap in Naturalness and Stability]]></title><description><![CDATA[Krisp&#8217;s Accent Conversion technology has been on a rapid innovation track since v3 launched in March 2025, when it first became mature enough for wide-scale deployment.]]></description><link>https://voice-ai-newsletter.krisp.ai/p/krisp-accent-conversion-v37-major</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/krisp-accent-conversion-v37-major</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 14 Aug 2025 14:02:17 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/11a95bf4-dc71-4602-8046-f6468e5e0241_1000x700.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GJzw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d20a3d1-2fe4-4713-8065-50d589bffac0_1000x700.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GJzw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d20a3d1-2fe4-4713-8065-50d589bffac0_1000x700.png 424w, https://substackcdn.com/image/fetch/$s_!GJzw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d20a3d1-2fe4-4713-8065-50d589bffac0_1000x700.png 848w, https://substackcdn.com/image/fetch/$s_!GJzw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d20a3d1-2fe4-4713-8065-50d589bffac0_1000x700.png 1272w, https://substackcdn.com/image/fetch/$s_!GJzw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d20a3d1-2fe4-4713-8065-50d589bffac0_1000x700.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GJzw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d20a3d1-2fe4-4713-8065-50d589bffac0_1000x700.png" width="1000" height="700" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8d20a3d1-2fe4-4713-8065-50d589bffac0_1000x700.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:700,&quot;width&quot;:1000,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:116626,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/170805597?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d20a3d1-2fe4-4713-8065-50d589bffac0_1000x700.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GJzw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d20a3d1-2fe4-4713-8065-50d589bffac0_1000x700.png 424w, https://substackcdn.com/image/fetch/$s_!GJzw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d20a3d1-2fe4-4713-8065-50d589bffac0_1000x700.png 848w, https://substackcdn.com/image/fetch/$s_!GJzw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d20a3d1-2fe4-4713-8065-50d589bffac0_1000x700.png 1272w, https://substackcdn.com/image/fetch/$s_!GJzw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d20a3d1-2fe4-4713-8065-50d589bffac0_1000x700.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Krisp&#8217;s Accent Conversion technology has been on a rapid path of continued innovation since v3 launched in March 2025, when it became mature enough for wide-scale deployment.</p><p>In just a few months, the technology has delivered major advancements:</p><ul><li><p><strong>May 2025, v3.5:</strong> Delivered a 20% quality boost for Filipino English and Indian English accents. Rolled out to 95% of Krisp desktop users in two days with strong agent and customer feedback.</p></li><li><p><strong>July 2025, LATAM Pack:</strong> Added Latin American English accent support, now deployed across thousands of agents.</p></li><li><p><strong>August 2025, v3.7:</strong> Focused on the Indian English accent pack, delivering significant gains in naturalness, voice stability, and clarity.</p></li></ul><h2><strong>What&#8217;s New in v3.7</strong></h2><ul><li><p>14% naturalness improvement (expert-rated) with more human-like speech and better handling of filler sounds</p></li><li><p>Improved voice stability with smoother pitch and tone, especially for thick accents</p></li><li><p>5% clarity boost with fewer artifacts and more intelligible speech</p></li><li><p>Better pronunciation accuracy with a 4% reduction in phoneme errors, improving difficult sounds like &#8220;R&#8221; and &#8220;L&#8221;</p></li></ul><h2><strong>How It Was Measured</strong></h2><ul><li><p>78 real-world agent call samples evaluated</p></li><li><p>Expert panel scoring, crowdsourced listening tests (3,120 votes), and objective metrics from Meta Audiobox Aesthetics and Facebook NN Phonemizer</p></li><li><p>Crowdsourced preference showed v3.7 was chosen 20% more often for sounding more natural</p></li></ul><p><strong>Read the full technical deep dive with benchmark data and audio samples.</strong></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://krisp.ai/blog/introducing-krisp-accent-conversion-v3-7/&quot;,&quot;text&quot;:&quot;Read the Technical Article&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://krisp.ai/blog/introducing-krisp-accent-conversion-v3-7/"><span>Read the Technical Article</span></a></p><h2><strong>Why It Matters</strong></h2><p>Accent Conversion v3.7 raises the bar for voice naturalness in high-variance accents, addressing key call center challenges around intelligibility and customer experience. </p>]]></content:encoded></item><item><title><![CDATA[AI, Voice, and the Human Edge: CCW 2025 Recap]]></title><description><![CDATA[At CCW Vegas 2025, it was clear that AI alone isn&#8217;t the story; AI that elevates human experience is.]]></description><link>https://voice-ai-newsletter.krisp.ai/p/ai-voice-and-the-human-edge-ccw-2025</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/ai-voice-and-the-human-edge-ccw-2025</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 26 Jun 2025 14:25:14 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/f7f17016-df6f-4c2e-8f3f-0a755bccac39_1920x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;5c3629c6-4a91-4b9a-b2ce-ce9d935cc886&quot;,&quot;duration&quot;:null}"></div><p>At CCW Vegas 2025, it was clear that AI alone isn&#8217;t the story; AI that elevates human experience is.</p><p>Across sessions and show floor conversations, the most forward-looking leaders weren&#8217;t chasing the latest hype. They&#8217;re focused on strategic, human-centered approaches to AI in CX.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2><strong>AI + Humans: Collaboration, Not Competition</strong></h2><p>AI tools work best when they enhance the strengths of human agents. Leaders across the industry are rethinking AI&#8217;s role: from automation-first to collaboration-first. </p><p>The goal is no longer to replace agents but to help them deliver better, faster, more empathetic customer experiences&#8212;while improving agent experience, too.</p><h2><strong>Strategy First: Align AI with Human Touchpoints</strong></h2><p>Successful AI investments start with a clear strategy. Success means aligning AI tools with the right moments in the customer journey, maintaining transparency with customers, and actively supporting employees through adoption. </p><p>Without this foundation, even the most advanced AI can fall flat.</p><h2><strong>The ROI Reality: It&#8217;s in the Rollout</strong></h2><p>How you implement and deploy AI determines whether you see ROI. Sessions highlighted that thoughtful rollout&#8212;designed for real gains in CSAT, first-call resolution, and agent productivity&#8212;is what separates success from wasted spend. </p><p>A poor deployment can erode trust and negate value.</p><h2><strong>Voice Still Rules</strong></h2><p>Even in an increasingly digital-first world, voice is the highest-stakes channel for support. Complex, emotional, or high-value interactions still rely on human voice and interactions. </p><p>Innovations that remove friction and bridge communication gaps are helping make voice an even stronger channel.</p><h2><strong>Agentic AI: Still a Gray Area</strong></h2><p>Agentic AI was a buzzword at CCW, but many CX leaders still aren&#8217;t sure what it means, or whether they actually need it. In theory, it refers to AI that can take initiative, make decisions, and act autonomously. </p><p>For now, most CX organizations are focused on more immediate, proven AI use cases that drive real impact.</p><h2><strong>Hype vs. Reality: Automation Has Limits</strong></h2><p>While automation was everywhere, a lot of companies think that over-automating means losing the human touch. Over-automation risks damaging customer trust and experience. </p><p>Many CX leaders are taking a balanced approach: using automation where it drives efficiency but keeping humans in the loop where empathy matters.</p><h2><strong>Personal Perspective</strong></h2><p>The conversations at CCW Vegas reinforced our belief that the future of CX is powered by voice-first, human-centered AI. We heard strong demand for solutions that enhance, not replace, the human element in customer interactions. </p><p>It was clear that innovations like Krisp&#8217;s real-time Voice AI platform are helping companies meet that need&#8212;boosting both agent and customer experience across high-stakes voice channels.</p><h2><strong>Krisp Milestones</strong></h2><p>At CCW Vegas, we unveiled our advanced real-time Voice AI platform complete with Accent Conversion v3.5 support for India, the Philippines and Latin America, AI Live Interpreter, Agent Assist (Supervisor Assist coming soon!), and our award-winning noise cancellation technology. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://cxfoundation.com/news/krisp-launches-ai-voice" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_kkh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12789d94-e233-4669-b836-7a240aa54e6b_3200x1800.png 424w, https://substackcdn.com/image/fetch/$s_!_kkh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12789d94-e233-4669-b836-7a240aa54e6b_3200x1800.png 848w, https://substackcdn.com/image/fetch/$s_!_kkh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12789d94-e233-4669-b836-7a240aa54e6b_3200x1800.png 1272w, https://substackcdn.com/image/fetch/$s_!_kkh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12789d94-e233-4669-b836-7a240aa54e6b_3200x1800.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_kkh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12789d94-e233-4669-b836-7a240aa54e6b_3200x1800.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/12789d94-e233-4669-b836-7a240aa54e6b_3200x1800.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:165537,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://cxfoundation.com/news/krisp-launches-ai-voice&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/166264987?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12789d94-e233-4669-b836-7a240aa54e6b_3200x1800.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_kkh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12789d94-e233-4669-b836-7a240aa54e6b_3200x1800.png 424w, https://substackcdn.com/image/fetch/$s_!_kkh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12789d94-e233-4669-b836-7a240aa54e6b_3200x1800.png 848w, https://substackcdn.com/image/fetch/$s_!_kkh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12789d94-e233-4669-b836-7a240aa54e6b_3200x1800.png 1272w, https://substackcdn.com/image/fetch/$s_!_kkh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12789d94-e233-4669-b836-7a240aa54e6b_3200x1800.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Krisp&#8217;s AI Live Interpreter earned runner-up for Disruptive Technology of the Year, recognized for delivering true bidirectional speech-to-speech translation across over 80 languages, well beyond any alternative in the market today.</p><p>We also announced a new strategic partnership with Everise, deploying Accent Conversion and AI-powered bidirectional noise and voice cancellation to over 10,000 seats globally, with rapid expansion underway. AI Live Interpreter and Agent Assist are also planned for rollout as part of the partnership.</p><blockquote><p><em>"Krisp's technology has consistently outperformed in head-to-head evaluations across clarity, naturalness, and accent accuracy." &#8212;</em> <em>Sudhir Agarwal, Founder and CEO of Everise</em></p></blockquote><p>Finally, we were proud to host this year&#8217;s CCW Excellence Awards. A huge thanks to our customers and partners who visited our booth and made the week a success.</p><h2><strong>What&#8217;s Next</strong></h2><p>Looking ahead, the industry&#8217;s sharpening its focus on purposeful AI and solutions that clearly improve human outcomes, both for agents and customers. </p><p>Voice AI continues to stand out as a high-impact area. As AI maturity evolves, expect to see more thoughtful and orchestrated human-AI experiences across the industry.</p><p>Stay tuned for the upcoming <em>Future of Voice AI</em> podcast CCW series, featuring industry experts shaping the next wave of CX innovation. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GqhB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc16982a4-c232-4287-bb66-e938d242d6f0_1920x1280.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GqhB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc16982a4-c232-4287-bb66-e938d242d6f0_1920x1280.png 424w, https://substackcdn.com/image/fetch/$s_!GqhB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc16982a4-c232-4287-bb66-e938d242d6f0_1920x1280.png 848w, https://substackcdn.com/image/fetch/$s_!GqhB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc16982a4-c232-4287-bb66-e938d242d6f0_1920x1280.png 1272w, https://substackcdn.com/image/fetch/$s_!GqhB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc16982a4-c232-4287-bb66-e938d242d6f0_1920x1280.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GqhB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc16982a4-c232-4287-bb66-e938d242d6f0_1920x1280.png" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c16982a4-c232-4287-bb66-e938d242d6f0_1920x1280.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1050832,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/166264987?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc16982a4-c232-4287-bb66-e938d242d6f0_1920x1280.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GqhB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc16982a4-c232-4287-bb66-e938d242d6f0_1920x1280.png 424w, https://substackcdn.com/image/fetch/$s_!GqhB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc16982a4-c232-4287-bb66-e938d242d6f0_1920x1280.png 848w, https://substackcdn.com/image/fetch/$s_!GqhB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc16982a4-c232-4287-bb66-e938d242d6f0_1920x1280.png 1272w, https://substackcdn.com/image/fetch/$s_!GqhB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc16982a4-c232-4287-bb66-e938d242d6f0_1920x1280.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[7 top problems Voice AI is solving 💪 (2nd edition)]]></title><description><![CDATA[1.5 years ago, I wrote an article comparing the 8 most important Voice AI problems and their state of readiness.]]></description><link>https://voice-ai-newsletter.krisp.ai/p/7-top-problems-voice-ai-is-solving</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/7-top-problems-voice-ai-is-solving</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 22 May 2025 14:01:17 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!d1_3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff81123d4-9d64-46ab-ad4b-4805c2ad71e1_1664x640.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>1.5 years ago, I wrote an article comparing the 8 most important Voice AI problems and their state of readiness.</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;637144af-2d40-4c77-bf8c-6747f308259d&quot;,&quot;caption&quot;:&quot;Voice AI has come a long way. In this article, we explore what customer problems it can realistically solve today and which ones will be solved in the next couple of years.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;8 most important customer problems that Conversational Voice AI is solving &#128170;&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:32916364,&quot;name&quot;:&quot;Davit Baghdasaryan&quot;,&quot;bio&quot;:&quot;CEO &amp; Co-Founder of Krisp, early pioneer in Voice AI.\n20+ years in engineering. 18 US patent applications, ex Twilion&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/23088dde-6cb0-44df-b220-5f22830cdd4c_1179x960.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2023-11-23T14:10:35.912Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb19e9d50-220a-4dcb-adeb-c388811223d9_1670x704.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/p/8-most-important-customer-problems&quot;,&quot;section_name&quot;:&quot;Articles&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139009529,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:1,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;Voice AI Newsletter&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p>I thought it was time to revisit the scores.</p><ul><li><p><strong>&#8220;Business Pain&#8221;</strong> represents how important the problem is for the customer. The higher the pain, the more urgent it is for the customer to solve it.</p></li><li><p><strong>&#8220;AI Readiness&#8221;</strong> represents the industry&#8217;s technological readiness to solve the pain.</p></li></ul><p>Below is my Q2 2025 version.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d1_3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff81123d4-9d64-46ab-ad4b-4805c2ad71e1_1664x640.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d1_3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff81123d4-9d64-46ab-ad4b-4805c2ad71e1_1664x640.png 424w, https://substackcdn.com/image/fetch/$s_!d1_3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff81123d4-9d64-46ab-ad4b-4805c2ad71e1_1664x640.png 848w, https://substackcdn.com/image/fetch/$s_!d1_3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff81123d4-9d64-46ab-ad4b-4805c2ad71e1_1664x640.png 1272w, https://substackcdn.com/image/fetch/$s_!d1_3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff81123d4-9d64-46ab-ad4b-4805c2ad71e1_1664x640.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d1_3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff81123d4-9d64-46ab-ad4b-4805c2ad71e1_1664x640.png" width="1456" height="560" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f81123d4-9d64-46ab-ad4b-4805c2ad71e1_1664x640.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:560,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:174067,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/164014781?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff81123d4-9d64-46ab-ad4b-4805c2ad71e1_1664x640.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!d1_3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff81123d4-9d64-46ab-ad4b-4805c2ad71e1_1664x640.png 424w, https://substackcdn.com/image/fetch/$s_!d1_3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff81123d4-9d64-46ab-ad4b-4805c2ad71e1_1664x640.png 848w, https://substackcdn.com/image/fetch/$s_!d1_3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff81123d4-9d64-46ab-ad4b-4805c2ad71e1_1664x640.png 1272w, https://substackcdn.com/image/fetch/$s_!d1_3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff81123d4-9d64-46ab-ad4b-4805c2ad71e1_1664x640.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Here is what changed in 1.5 years:</p><ul><li><p>AI Note-Taking went up by 3 points! &#128293;&#128293;&#128293;</p></li><li><p>AI Voice Translation went up by 3 points! &#128293;&#128293;&#128293;</p></li><li><p>AI Accent Conversion went up by 2 points! &#128293;&#128293;</p></li><li><p>AI Voice Agents went up by 1 point! &#128293;</p></li><li><p>AI Live Assist/Guidance went down by 1 point &#128542;</p></li><li><p>I&#8217;ve removed AI Voice Conversion altogether &#129300;</p></li></ul><p>Let&#8217;s go over the pains one by one.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://voice-ai-newsletter.krisp.ai/subscribe?"><span>Subscribe now</span></a></p><h3>1) Conversations visibility at scale</h3><p>You have 100 sales reps (or call center agents) and they receive/place calls all the time trying to close deals or serve customers.</p><p>How do you know if they are doing a good job? Just a couple of years ago, this was quite difficult to do. These days, all you need to have is a Conversational Intelligence tool (aka Speech Analytics) and you will get full visibility into every conversation.</p><p>Every customer can be recorded, transcribed and summarized for you and the team. These tools are super convenient and there are plenty of them available in the market - <a href="http://cresta.ai">Cresta</a>, <a href="http://observe.ai">Observe</a>, <a href="http://gong.io">Gong</a>, <a href="http://Avoma.com">Avoma</a>, <a href="http://CallMiner.com">CallMiner</a>, <a href="http://SalesLoft.com">SalesLoft</a>, etc.</p><p><code>AI Readiness: 10/10, Pain: 10/10</code></p><h3>2) Language barrier</h3><p>The ability to communicate verbally in an effective way is a foundational capability of any team and business. The language barrier has always been one of the top problems for humanity. It takes years for people to learn to speak in a non-native language. The pain is real and any business that has this pain would pay a lot of money to eliminate it. </p><p>Imagine there being a real-time speech-to-speech translation AI that people could use for communication over Zoom, Teams or Krisp. This would be a game-changer.</p><p><a href="https://krisp.ai/ai-interpreter/">Krisp</a>, <a href="https://www.onemeta.ai/">OneMeta</a>, MS Teams, Google Meet already have solutions for this.</p><p><code>AI Readiness: 6/10, Pain: 10/10</code></p><h3>3) Conversations at scale</h3><p>There are multiple roles where the person needs to receive or place calls and talk to another human being on the other side. Call center agents, sales and business development reps, recruiters, and others.</p><p>Maintaining and growing such teams is exceptionally difficult. Businesses need to recruit talent, and then they need to onboard and retain them.</p><p>This is an expensive endeavor. No doubt, Voice AI Agents are taking over and automate some of these functions. </p><p>The space is booming. There are plenty of startups in this space.</p><p><code>AI Readiness: 6/10, Pain: 10/10</code></p><h3>4) Taking meeting notes</h3><p>In many companies, there are people dedicated to taking notes in meetings. For many years this has been a manual task that can be automated with Voice AI now.</p><p>Many tools already offer meeting transcription, summary, and follow-up generation. </p><p>The quality varies from 60%-80% for now. No doubt it will keep improving and the manual work will be fully automated in the coming year or two.</p><p>There are already multiple companies doing this:</p><ul><li><p><a href="http://krisp.ai">Krisp</a>, <a href="http://Fireflies.ai">Fireflies</a>, <a href="http://otter.ai">Otter</a>, <a href="https://tldv.io">tl;dv</a>, <a href="https://fathom.video/">Fathom</a>, <a href="https://www.avoma.com/">Avoma</a> and others</p></li></ul><p><code>AI Readiness: 7/10, Pain: 8/10</code></p><h3>5) Onboarding and training of associates</h3><p>Call center agent turnover rate is between 30%-45%. This means that a huge number of agents leave every year and managers need to find replacements and onboard/train them. This is a costly process and any automation/simplification of the process has a clear ROI.</p><p>Similarly, companies that need to hire a high number of SDRs or AEs, need to onboard and train them, otherwise, their sales conversion rates would decrease. Again, high-ROI endeavor.</p><p>Imagine a bot sitting on an agent&#8217;s machine that listens to the customer conversation and gives real-time hints that have a history of better conversion rates or customer satisfaction. The agent ramps up quicker due to this technology.</p><p>This technology already exists and is called AI Live Assist. Multiple companies already have shipped products with such technology:</p><ul><li><p><a href="https://www.balto.ai/">Balto</a>, <a href="https://observe.ai/">Observe</a>, <a href="https://cloud.google.com/agent-assist?hl=en">Google</a>, <a href="https://aigent.ai/">Aigent</a>, <a href="https://www.five9.com/products/capabilities/agent-assist">Five9</a>, <a href="https://www.uniphore.com/products/u/u-assist/">Uniphore</a>, <a href="https://thelevel.ai/agent-assist/">LevelAI</a> and others</p></li></ul><p><code>AI Readiness: 6/10, Pain: 7/10</code></p><h3>6) Accent barrier</h3><p>As with the language barrier, human accent is a serious barrier that impacts understanding and comprehension of business conversations. Nearly all humans have accents when speaking in non-native languages and it&#8217;s extremely difficult to retrain them.</p><p>Call centers have special training programs for their agents to reduce accent. The cognitive load and stress on agents for such tasks are intense. </p><p>Imagine a Voice AI technology that would, in real-time, localize the speaker&#8217;s accent to the listener&#8217;s accent to improve understanding and comprehension. </p><p><a href="http://krisp.ai">Krisp</a> and <a href="http://sanas.ai">Sanas</a> have already deployed such technology in the call center industry.</p><p><code>AI Readiness: 8/10, Pain: 7/10</code></p><h3>7) Background noises &amp; voices</h3><p>The problem of background noises and voices in calls has been around for more than 30 years. It creates a distraction for the call participants and prevents them from focusing on the core conversation. Background noise also creates a constant stress for the speakers.</p><p>In call centers, background noise can result in a customer satisfaction drop, longer conversations and mental stress for agents.</p><p>Luckily, AI-powered Noise Cancellation technology can fully solve this problem. <a href="http://krisp.ai">Krisp</a> has pioneered this technology in the industry and has large-scale deployments of it. It solves both the problem of noises as well as background voices. Zoom, MS teams and other applications also have invested in such technologies.</p><p><code>AI Readiness: 10/10, Pain: 6/10</code></p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Where AI Voice Agents Fail the Most Today]]></title><description><![CDATA[Here&#8217;s How to Fix It]]></description><link>https://voice-ai-newsletter.krisp.ai/p/where-ai-voice-agents-fail-the-most</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/where-ai-voice-agents-fail-the-most</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 27 Mar 2025 14:25:41 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/ec812e43-7d16-472b-82fe-420b843cfbd6_1000x700.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>AI voice agents are everywhere&#8212;handling customer service calls, booking appointments, and assisting in day-to-day business. While their quality has been improving fast, one problem keeps getting in the way: they don&#8217;t know when to talk and when to listen. </p><p>The technical term for this is <strong>turn-taking</strong>, or <strong>interrupt-handling</strong>.</p><h2>Turn-Taking</h2><p>Turn-taking is a hot problem these days, with many companies trying to solve it. </p><ul><li><p>LiveKit has published <a href="https://blog.livekit.io/using-a-transformer-to-improve-end-of-turn-detection/">an article</a> showing how they tackle it</p></li><li><p>Daily recently started an open-source project called <a href="https://github.com/pipecat-ai/smart-turn">smart-turn</a></p></li><li><p>OpenAI recently launched a variant of it called &#8220;<a href="https://platform.openai.com/docs/guides/realtime-vad#:~:text=periods%20of%20silence.-,semantic_vad,-%3A%20Chunks%20the%20audio">semantic VAD</a>&#8221;. </p></li></ul><p>A particular case where turn-taking fails miserably is in noisy environments. </p><p>Whenever there is background noise or chatter, AI agents get confused and start to interrupt us at the wrong time, talk over us, or miss what&#8217;s actually being said. This makes conversations frustrating and unnatural. </p><p>In a normal conversation, humans naturally know when to pause, respond, or wait their turn. AI agents don&#8217;t have that instinct. Today, they have to rely on <strong>Voice Activity Detection (VAD)</strong>&#8212;a technology that decides when a piece of audio is human speech or not. An AI agent looks into VAD&#8217;s output and if there is enough &#8220;non-speech&#8221; data, they decide the person finished speaking.</p><p>However VAD-based turn-detection is too primitive for real-life scenarios. There are many situations where people could pause without finishing their speech.</p><p>Here is a great deep-dive video on turn-taking.</p><div id="youtube2-xWhI8RkRSGQ" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;xWhI8RkRSGQ&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/xWhI8RkRSGQ?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><h1><strong>Improving turn-taking with Noise and Voice Cancellation</strong></h1><p>While solving turn-taking is difficult, we can improve it with noise and voice cancellation technology by placing it just before VAD and speech recognition models.</p><p>By filtering out background noise and voices in real time, AI agents get only the speech that matters. That means:</p><ul><li><p>No more false interruptions</p></li><li><p>No more missed responses</p></li><li><p>Smoother, more human-like conversations</p></li></ul><p>Here&#8217;s how a voice agent performs with Daily, Pipecat, and Gemini in a noisy environment&#8212;with vs. without noise cancellation:</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;f114d5c5-bd57-4835-8d39-79f16bf140d1&quot;,&quot;duration&quot;:null}"></div><h2><strong>The Results</strong></h2><p>Real-world tests show that when background voice and noise cancellation are applied before VAD, AI agents perform much better:</p><ul><li><p><strong>3.5x fewer false interruptions</strong> &#8594; A 71% decrease in AI cutting off users unnecessarily.</p></li><li><p><strong>2x better speech recognition accuracy</strong> &#8594; AI agents hear and respond more accurately.</p></li><li><p><strong>50% decrease in call drops</strong> &#8594; Less conversations abandoned due to frustrating interruptions.</p></li><li><p><strong>30% increase in CSAT</strong> &#8594; Smoother interactions make happier customers.</p></li></ul><p>Leading Conversational AI platforms&#8212;including <strong>Vodex, Fixie, Daily, LiveKit, and Fluidworks</strong>&#8212;have already integrated noise and voice cancellation to fix turn-taking and improve response accuracy.</p><p>Below is a technical report on how exactly this works:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://krisp.ai/blog/improving-turn-taking-of-ai-voice-agents-with-background-voice-cancellation/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!m7VL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F099d67cb-9690-4374-b333-ce57f8d4e2c1_2392x904.png 424w, https://substackcdn.com/image/fetch/$s_!m7VL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F099d67cb-9690-4374-b333-ce57f8d4e2c1_2392x904.png 848w, https://substackcdn.com/image/fetch/$s_!m7VL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F099d67cb-9690-4374-b333-ce57f8d4e2c1_2392x904.png 1272w, https://substackcdn.com/image/fetch/$s_!m7VL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F099d67cb-9690-4374-b333-ce57f8d4e2c1_2392x904.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!m7VL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F099d67cb-9690-4374-b333-ce57f8d4e2c1_2392x904.png" width="1456" height="550" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/099d67cb-9690-4374-b333-ce57f8d4e2c1_2392x904.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:550,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:806367,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://krisp.ai/blog/improving-turn-taking-of-ai-voice-agents-with-background-voice-cancellation/&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://voice-ai-newsletter.krisp.ai/i/159289606?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F099d67cb-9690-4374-b333-ce57f8d4e2c1_2392x904.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!m7VL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F099d67cb-9690-4374-b333-ce57f8d4e2c1_2392x904.png 424w, https://substackcdn.com/image/fetch/$s_!m7VL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F099d67cb-9690-4374-b333-ce57f8d4e2c1_2392x904.png 848w, https://substackcdn.com/image/fetch/$s_!m7VL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F099d67cb-9690-4374-b333-ce57f8d4e2c1_2392x904.png 1272w, https://substackcdn.com/image/fetch/$s_!m7VL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F099d67cb-9690-4374-b333-ce57f8d4e2c1_2392x904.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><h2><strong>What This Means for AI Teams</strong></h2><p>If you&#8217;re building or deploying AI voice agents, this is a must-have. Without noise cancellation, AI models are guessing when to talk and when to listen, which leads to broken conversations.</p><p>Clean audio means better AI decisions. And better AI decisions mean better user experiences. It&#8217;s that simple.</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive weekly updates and news.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[AI Voice Translation: Breaking Language Barriers in Call Centers]]></title><description><![CDATA[Language barriers have always posed challenges in contact centers.]]></description><link>https://voice-ai-newsletter.krisp.ai/p/ai-live-interpretation-breaking-language</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/ai-live-interpretation-breaking-language</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 19 Dec 2024 14:02:33 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!mgeJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ee02280-cf93-426b-a952-b175505beede_1080x720.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mgeJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ee02280-cf93-426b-a952-b175505beede_1080x720.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mgeJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ee02280-cf93-426b-a952-b175505beede_1080x720.png 424w, https://substackcdn.com/image/fetch/$s_!mgeJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ee02280-cf93-426b-a952-b175505beede_1080x720.png 848w, https://substackcdn.com/image/fetch/$s_!mgeJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ee02280-cf93-426b-a952-b175505beede_1080x720.png 1272w, https://substackcdn.com/image/fetch/$s_!mgeJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ee02280-cf93-426b-a952-b175505beede_1080x720.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mgeJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ee02280-cf93-426b-a952-b175505beede_1080x720.png" width="1080" height="720" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8ee02280-cf93-426b-a952-b175505beede_1080x720.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:720,&quot;width&quot;:1080,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:78523,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mgeJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ee02280-cf93-426b-a952-b175505beede_1080x720.png 424w, https://substackcdn.com/image/fetch/$s_!mgeJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ee02280-cf93-426b-a952-b175505beede_1080x720.png 848w, https://substackcdn.com/image/fetch/$s_!mgeJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ee02280-cf93-426b-a952-b175505beede_1080x720.png 1272w, https://substackcdn.com/image/fetch/$s_!mgeJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ee02280-cf93-426b-a952-b175505beede_1080x720.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Language barriers have always posed challenges in contact centers. Traditionally, companies relied on human interpreters, but this approach is expensive, slow, and inefficient. </p><p>Now, AI technology is transforming how businesses deliver multilingual support, offering scalable, cost-effective, and practical solutions.</p><div><hr></div><h3>The History and Challenges of Human Interpreters</h3><p>Human interpreters have been the go-to solution for enabling communication across languages. This involves hiring multilingual staff or outsourcing to over-the-phone interpretation (OPI) services. While these methods work, they come with drawbacks:</p><ul><li><p><strong>High Costs</strong>: Multilingual agents and OPI services are expensive, especially in high-volume environments.</p></li><li><p><strong>Delays</strong>: Connecting to human interpreters adds time, increasing handle times, frustrating customers, and compromising CSAT.</p></li><li><p><strong>Scalability</strong>: Managing demand surges during peak times is resource-intensive.</p></li><li><p><strong>Security Risks</strong>: Third-party interpreters raise concerns about data privacy and compliance with sensitive customer information.</p></li></ul><p>Human interpreters are invaluable in some scenarios, but the need for more scalable and efficient solutions is undeniable. The good news is, these solutions are here. </p><div><hr></div><h3>AI Interpretation is Revolutionizing Contact Centers</h3><p>AI-powered tools are transforming multilingual support by addressing the limitations of human interpreters. Built on advancements in translation AI and GenAI technologies, these systems provide reliable, real-time interpretation. </p><h4>Introducing Krisp&#8217;s AI Live Interpreter</h4><p>Krisp recently launched <strong>AI Live Interpreter</strong>, the industry&#8217;s first AI live interpretation solution, offering real-time, bi-directional translation. With enterprise-grade scalability and security-first design, it helps call centers eliminate language barriers with the click of a button.</p><p>Key benefits:</p><ul><li><p><strong>Instant Availability</strong>: It&#8217;s available 24/7, eliminating delays.</p></li><li><p><strong>Cost Efficiency</strong>: AI Live Interpreter operates at a fraction of the cost of human interpreters.</p></li><li><p><strong>Scalability</strong>: Works with all softphones out-of-the-box, and is built on systems designed to handle unlimited simultaneous sessions, adapting effortlessly to peak demand.</p></li><li><p><strong>User Experience</strong>: Agents sees both live transcription as well translation in front of them which helps to get more context</p></li><li><p><strong>Security</strong>: Many AI solutions are privacy-first, reducing risks associated with third-party interpreters.</p></li></ul><p>Krisp supports over 25 languages with high quality and counting. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2qoQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce17f51-76ba-43bf-b055-e48a48fe073d_1180x992.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2qoQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce17f51-76ba-43bf-b055-e48a48fe073d_1180x992.png 424w, https://substackcdn.com/image/fetch/$s_!2qoQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce17f51-76ba-43bf-b055-e48a48fe073d_1180x992.png 848w, https://substackcdn.com/image/fetch/$s_!2qoQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce17f51-76ba-43bf-b055-e48a48fe073d_1180x992.png 1272w, https://substackcdn.com/image/fetch/$s_!2qoQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce17f51-76ba-43bf-b055-e48a48fe073d_1180x992.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2qoQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce17f51-76ba-43bf-b055-e48a48fe073d_1180x992.png" width="1180" height="992" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4ce17f51-76ba-43bf-b055-e48a48fe073d_1180x992.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:992,&quot;width&quot;:1180,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:297466,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2qoQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce17f51-76ba-43bf-b055-e48a48fe073d_1180x992.png 424w, https://substackcdn.com/image/fetch/$s_!2qoQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce17f51-76ba-43bf-b055-e48a48fe073d_1180x992.png 848w, https://substackcdn.com/image/fetch/$s_!2qoQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce17f51-76ba-43bf-b055-e48a48fe073d_1180x992.png 1272w, https://substackcdn.com/image/fetch/$s_!2qoQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce17f51-76ba-43bf-b055-e48a48fe073d_1180x992.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Let&#8217;s see it in action.</p><h5><strong>English + Mexican Spanish language pairing:</strong> </h5><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;3898c8a9-2f8c-4386-9e9b-66fc99b74b20&quot;,&quot;duration&quot;:null}"></div><h5>English + Hindi language pairing:</h5><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;077381d0-bd0a-4825-a668-ee4212604ce3&quot;,&quot;duration&quot;:null}"></div><p>See it live and learn how Krisp&#8217;s AI Live Interpreter can transform your global support operations. </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://krisp.ai/ai-interpreter/#contact&quot;,&quot;text&quot;:&quot;Book a demo&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://krisp.ai/ai-interpreter/#contact"><span>Book a demo</span></a></p><div><hr></div><h3>BLEU: Setting Standards for Translation Quality</h3><p>Ensuring translation accuracy is crucial for customer interactions. The <strong>Bilingual Evaluation Understudy (BLEU)</strong> methodology is a widely recognized metric for measuring the quality of translations&#8212;both human and AI-generated. It compares machine-generated translations to human references and assigns a score based on similarity.</p><p>Here&#8217;s how BLEU scores are interpreted:</p><ul><li><p><strong>Human Translations</strong>: Typically score around <strong>60</strong>, reflecting their flexibility and nuance.</p></li><li><p><strong>AI Translation</strong>: Scores of <strong>20-30</strong> are considered sufficient for effective communication, with scores above <strong>30</strong> indicating high-quality outputs.</p></li><li><p><strong>Short Phrases (~5 seconds)</strong>: AI translations scoring above <strong>40</strong> are nearly indistinguishable from human translations.</p></li><li><p><strong>Krisp&#8217;s</strong>:  Sees Scores averaging <strong>30-45</strong> across supported languages, ensuring robust performance for real-world customer interactions.</p></li></ul><p>Although AI systems may not match human-level nuance, their BLEU scores demonstrate they are highly capable for contact center needs. Businesses can rely on these solutions for clear, consistent, and accurate communication in real time.</p><div><hr></div><h3>Democratizing Multilingual Support</h3><p>AI interpretation technology is leveling the playing field for businesses of all sizes, enabling them to offer multilingual support without massive overhead:</p><ul><li><p><strong>For Small Businesses</strong>: Affordable, easy-to-deploy solutions help smaller companies compete globally.</p></li><li><p><strong>In Emerging Markets</strong>: Companies in regions with diverse languages can scale quickly without hiring large multilingual teams.</p></li></ul><p>By removing cost and scalability barriers, AI is making multilingual customer support a practical standard rather than a costly exception.</p><div><hr></div><h3>Challenges and the Road Ahead</h3><p>Despite its advantages, AI interpretation still has limitations:</p><ul><li><p><strong>Accuracy</strong>: Industry-specific jargon and heavy accents can impact performance.</p></li><li><p><strong>Cultural Nuance</strong>: Machines may struggle with subtle emotional or cultural cues.</p></li><li><p><strong>Integration</strong>: Ensuring AI tools align seamlessly with workflows requires thoughtful implementation.</p></li></ul><p>However, as machine learning models continue to improve, AI tools are becoming more adaptable and precise. The use of BLEU scores validates their progress and readiness for real-world applications.</p><div><hr></div><h3>Looking Ahead</h3><p>The future of customer support lies in AI-powered tools. Businesses that embrace these technologies early can streamline operations, reduce costs, and enhance customer satisfaction. AI interpretation is no longer a luxury but a necessity in today&#8217;s interconnected market.</p><p>AI is transforming contact centers, making multilingual support a reality for everyone. For companies looking to stay ahead, now is the time to act.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://krisp.ai/ai-interpreter/#contact&quot;,&quot;text&quot;:&quot;Learn more&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://krisp.ai/ai-interpreter/#contact"><span>Learn more</span></a></p><p></p>]]></content:encoded></item><item><title><![CDATA[GPT Voice vs. Human Agents]]></title><description><![CDATA[Last week, OpenAI finally launched the anticipated Voice mode, and one thing stood out: its cost.]]></description><link>https://voice-ai-newsletter.krisp.ai/p/gpt-voice-vs-human-agents</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/gpt-voice-vs-human-agents</guid><pubDate>Thu, 17 Oct 2024 14:01:22 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!p9Qi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb39eac68-93d9-43b8-9255-1d60709b4e9f_1983x1536.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Last week, OpenAI finally launched the anticipated Voice mode, and one thing stood out: its cost.</p><pre><code>The Realtime API uses both text tokens and audio tokens. 
Text input tokens are priced at $5 per 1M and $20 per 1M output tokens. Audio input is priced at $100 per 1M tokens and output is $200 per 1M tokens. 
This equates to approximately $0.06 per minute of audio input and $0.24 per minute of audio output.</code></pre><p>This translates to $0.15 per minute. It&#8217;s arguably quite expensive and higher than expected, leaving many wondering how it compares to the cost of human agents.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!p9Qi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb39eac68-93d9-43b8-9255-1d60709b4e9f_1983x1536.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!p9Qi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb39eac68-93d9-43b8-9255-1d60709b4e9f_1983x1536.jpeg 424w, https://substackcdn.com/image/fetch/$s_!p9Qi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb39eac68-93d9-43b8-9255-1d60709b4e9f_1983x1536.jpeg 848w, https://substackcdn.com/image/fetch/$s_!p9Qi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb39eac68-93d9-43b8-9255-1d60709b4e9f_1983x1536.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!p9Qi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb39eac68-93d9-43b8-9255-1d60709b4e9f_1983x1536.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!p9Qi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb39eac68-93d9-43b8-9255-1d60709b4e9f_1983x1536.jpeg" width="588" height="455.53846153846155" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b39eac68-93d9-43b8-9255-1d60709b4e9f_1983x1536.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1128,&quot;width&quot;:1456,&quot;resizeWidth&quot;:588,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!p9Qi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb39eac68-93d9-43b8-9255-1d60709b4e9f_1983x1536.jpeg 424w, https://substackcdn.com/image/fetch/$s_!p9Qi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb39eac68-93d9-43b8-9255-1d60709b4e9f_1983x1536.jpeg 848w, https://substackcdn.com/image/fetch/$s_!p9Qi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb39eac68-93d9-43b8-9255-1d60709b4e9f_1983x1536.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!p9Qi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb39eac68-93d9-43b8-9255-1d60709b4e9f_1983x1536.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This pricing is less expensive than the hourly rate of human agents in the U.S. and U.K., but 2x the cost of human agents in the Philippines and 5x the cost of those in India.</p><h2><strong>What does this mean for customer support?</strong></h2><p>There&#8217;s been a lot of talk about what Voice mode means for the industry. </p><p>There are two main opinions on its impact:</p><ul><li><p>Bot enthusiasts: People who boast AI bots will very soon completely replace human agents. Likely affiliated with voice bot companies.</p></li><li><p>Traditionalists: Legacy thinkers who are skeptical of AI&#8217;s potential to replace agents due to the complexity of the industry and technology&#8217;s immaturity. </p></li></ul><p>The truth, as almost always, falls somewhere in the middle&#8212;AI will eventually disrupt the industry entirely, however it will take way more time.</p><h3><strong>Let&#8217;s get some things straight</strong></h3><ul><li><p>The contact center industry is huge and complex</p></li><li><p>Major changes will not happen overnight</p></li><li><p>Customer acceptance will be the ultimate driver in bot deployment and adoption</p></li><li><p>Millions of people losing jobs is a major political problem. If the process starts to get regulated, it may take much longer.</p></li></ul><h4><strong>Voice Bots: Pros and Cons</strong></h4><ul><li><p>Pros</p><ul><li><p>The price of Voice AI will go down by 2-3x in a year</p></li><li><p>Voice bots will cover more and more use cases</p></li><li><p>Much easier to manage bots than people</p></li><li><p>Bots work 24/7, without breaks and don&#8217;t get tired</p></li><li><p>No paying overtime</p></li><li><p>Speaks multiple languages</p></li><li><p>No onboarding and training</p></li><li><p>Easy to scale. </p></li></ul></li><li><p>Cons: </p><ul><li><p>AI hallucinates and, until this is solved, AI won&#8217;t be trusted</p></li><li><p>Hallucination errors add up and compound when multiple agentic flows are bound together</p></li><li><p>Large integration cost with legacy systems</p></li></ul></li></ul><h4><strong>Where Humans Outperform</strong></h4><ul><li><p>Show empathy, read between the lines, and connect with customers</p></li><li><p>Have intuition, can adapt, and are better at handling complex or sensitive issues</p></li><li><p>Create trust and rapport with customers</p></li></ul><h1><strong>What happens next?</strong></h1><p>Few things should happen for the adoption of Bots to accelerate</p><ul><li><p>Must stop hallucinating</p></li><li><p>Reasoning must get better</p></li><li><p>The price must go down</p></li><li><p>Must have more integrations into enterprise systems and tooling</p></li></ul><p></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://voice-ai-newsletter.krisp.ai/subscribe?"><span>Subscribe now</span></a></p><p></p>]]></content:encoded></item><item><title><![CDATA[Why Voice AI changes CX]]></title><description><![CDATA[I&#8217;ve talked about Voice channel before:]]></description><link>https://voice-ai-newsletter.krisp.ai/p/why-voice-ai-changes-cx</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/why-voice-ai-changes-cx</guid><pubDate>Thu, 12 Sep 2024 14:00:55 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f08f52-5197-4eb8-adb6-aa1ba6234c54_2000x1352.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I&#8217;ve talked about the significance of Voice channel before:</p><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:139912047,&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/p/voice-channel-is-and-will-stay-1&quot;,&quot;publication_id&quot;:2073467,&quot;publication_name&quot;:&quot;Voice AI Newsletter&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png&quot;,&quot;title&quot;:&quot;Voice channel is and will stay #1 in CX&quot;,&quot;truncated_body_text&quot;:&quot;For years, we&#8217;ve heard that the voice channel (phone/live agents) in contact centers is fading into obsolescence. Yet despite the rapid growth of digital platforms, customers continue to seek human interaction for support.&quot;,&quot;date&quot;:&quot;2023-12-21T13:45:53.726Z&quot;,&quot;like_count&quot;:2,&quot;comment_count&quot;:0,&quot;bylines&quot;:[{&quot;id&quot;:32916364,&quot;name&quot;:&quot;Davit Baghdasaryan&quot;,&quot;handle&quot;:&quot;davitb&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/23088dde-6cb0-44df-b220-5f22830cdd4c_1179x960.jpeg&quot;,&quot;bio&quot;:&quot;CEO &amp; Co-Founder of Krisp, early pioneer in Voice AI.\n20+ years in engineering. 18 US patent applications, ex Twilion&quot;,&quot;profile_set_up_at&quot;:&quot;2023-11-01T10:17:13.117Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:2076258,&quot;user_id&quot;:32916364,&quot;publication_id&quot;:2073467,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:false,&quot;publication&quot;:{&quot;id&quot;:2073467,&quot;name&quot;:&quot;Voice AI Newsletter&quot;,&quot;subdomain&quot;:&quot;krispai&quot;,&quot;custom_domain&quot;:&quot;voice-ai-newsletter.krisp.ai&quot;,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Voice AI insights from Krisp's CEO&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png&quot;,&quot;author_id&quot;:32916364,&quot;theme_var_background_pop&quot;:&quot;#6B26FF&quot;,&quot;created_at&quot;:&quot;2023-11-01T10:17:40.391Z&quot;,&quot;rss_website_url&quot;:null,&quot;email_from_name&quot;:&quot;Voice AI Newsletter&quot;,&quot;copyright&quot;:&quot;Krisp Technologies&quot;,&quot;founding_plan_name&quot;:null,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;disabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;is_personal_mode&quot;:false}},{&quot;id&quot;:2739922,&quot;user_id&quot;:32916364,&quot;publication_id&quot;:2700608,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:false,&quot;publication&quot;:{&quot;id&quot;:2700608,&quot;name&quot;:&quot;Physmath School Newsletter&quot;,&quot;subdomain&quot;:&quot;physmath&quot;,&quot;custom_domain&quot;:&quot;www.physmath-newsletter.com&quot;,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;All the interesting info on Physmath School and Community&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6381e514-b8d6-44e4-b747-3c1b3c1f4922_674x674.png&quot;,&quot;author_id&quot;:32916364,&quot;theme_var_background_pop&quot;:&quot;#D10000&quot;,&quot;created_at&quot;:&quot;2024-06-12T12:54:43.809Z&quot;,&quot;rss_website_url&quot;:null,&quot;email_from_name&quot;:&quot;Physmath School&quot;,&quot;copyright&quot;:&quot;Physmath School&quot;,&quot;founding_plan_name&quot;:null,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;disabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;is_personal_mode&quot;:false}}],&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:false,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://voice-ai-newsletter.krisp.ai/p/voice-channel-is-and-will-stay-1?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!YLgs!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png"><span class="embedded-post-publication-name">Voice AI Newsletter</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">Voice channel is and will stay #1 in CX</div></div><div class="embedded-post-body">For years, we&#8217;ve heard that the voice channel (phone/live agents) in contact centers is fading into obsolescence. Yet despite the rapid growth of digital platforms, customers continue to seek human interaction for support&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">2 years ago &#183; 2 likes &#183; Davit Baghdasaryan</div></a></div><p>CCW reports that over 80% of customers still prefer voice-based human support. Customer service needs to be quick and efficient.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><ul><li><p>64.4% of inbound interactions are voice calls (live agent) &#128293; Whopping 84% in the public sector &#128562;</p></li><li><p>89% of customers want the reassurance of talking to a person</p></li><li><p>25% CX leaders think the voice channel will "greatly increase" in 2024 &#128200;</p></li><li><p>35% CX leaders will invest more in the Voice channel in 2024</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kyeM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f08f52-5197-4eb8-adb6-aa1ba6234c54_2000x1352.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kyeM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f08f52-5197-4eb8-adb6-aa1ba6234c54_2000x1352.png 424w, https://substackcdn.com/image/fetch/$s_!kyeM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f08f52-5197-4eb8-adb6-aa1ba6234c54_2000x1352.png 848w, https://substackcdn.com/image/fetch/$s_!kyeM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f08f52-5197-4eb8-adb6-aa1ba6234c54_2000x1352.png 1272w, https://substackcdn.com/image/fetch/$s_!kyeM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f08f52-5197-4eb8-adb6-aa1ba6234c54_2000x1352.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kyeM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f08f52-5197-4eb8-adb6-aa1ba6234c54_2000x1352.png" width="1456" height="984" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d4f08f52-5197-4eb8-adb6-aa1ba6234c54_2000x1352.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:984,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kyeM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f08f52-5197-4eb8-adb6-aa1ba6234c54_2000x1352.png 424w, https://substackcdn.com/image/fetch/$s_!kyeM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f08f52-5197-4eb8-adb6-aa1ba6234c54_2000x1352.png 848w, https://substackcdn.com/image/fetch/$s_!kyeM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f08f52-5197-4eb8-adb6-aa1ba6234c54_2000x1352.png 1272w, https://substackcdn.com/image/fetch/$s_!kyeM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f08f52-5197-4eb8-adb6-aa1ba6234c54_2000x1352.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>However long wait times, poor quality, language barriers and repetitive questions frustrate both customers and agents. Modernizing CX with voice AI is crucial for solving these pain points.</p><h2><strong>Challenges today</strong></h2><p>Though customers prefer voice over other channels, it&#8217;s not without its flaws. There are three main pain points:</p><ul><li><p><strong>Long wait times</strong></p></li><li><p><strong>Repetitions</strong></p></li><li><p><strong>Poor audio quality</strong></p></li><li><p><strong>Poorly trained agents</strong></p></li><li><p><strong>Accents and language barriers</strong></p></li></ul><p>These problems lead to misunderstandings, repeated information, and customer dissatisfaction. For businesses, this means more time spent on each call and higher operational costs.</p><h2><strong>So why does voice still win?</strong></h2><p>Even though voice has multiple challenges, customers still prefer it when the problem they have meets one of the following criteria:</p><ul><li><p>Complex, OR</p></li><li><p>Emotional and Complex, OR</p></li><li><p>Emotional and Complex and Urgent</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qiZ2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F617f2b6e-9714-450f-8343-28f125de9c1c_1212x1070.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qiZ2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F617f2b6e-9714-450f-8343-28f125de9c1c_1212x1070.png 424w, https://substackcdn.com/image/fetch/$s_!qiZ2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F617f2b6e-9714-450f-8343-28f125de9c1c_1212x1070.png 848w, https://substackcdn.com/image/fetch/$s_!qiZ2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F617f2b6e-9714-450f-8343-28f125de9c1c_1212x1070.png 1272w, https://substackcdn.com/image/fetch/$s_!qiZ2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F617f2b6e-9714-450f-8343-28f125de9c1c_1212x1070.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qiZ2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F617f2b6e-9714-450f-8343-28f125de9c1c_1212x1070.png" width="1212" height="1070" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/617f2b6e-9714-450f-8343-28f125de9c1c_1212x1070.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1070,&quot;width&quot;:1212,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1254171,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qiZ2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F617f2b6e-9714-450f-8343-28f125de9c1c_1212x1070.png 424w, https://substackcdn.com/image/fetch/$s_!qiZ2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F617f2b6e-9714-450f-8343-28f125de9c1c_1212x1070.png 848w, https://substackcdn.com/image/fetch/$s_!qiZ2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F617f2b6e-9714-450f-8343-28f125de9c1c_1212x1070.png 1272w, https://substackcdn.com/image/fetch/$s_!qiZ2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F617f2b6e-9714-450f-8343-28f125de9c1c_1212x1070.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>Voice AI will fix all this</strong></h2><p>I have no doubt, Voice AI will solve all these challenges.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ks8b!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3acf2489-1ab4-4388-a272-b650cac6c7b0_972x904.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ks8b!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3acf2489-1ab4-4388-a272-b650cac6c7b0_972x904.png 424w, https://substackcdn.com/image/fetch/$s_!Ks8b!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3acf2489-1ab4-4388-a272-b650cac6c7b0_972x904.png 848w, https://substackcdn.com/image/fetch/$s_!Ks8b!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3acf2489-1ab4-4388-a272-b650cac6c7b0_972x904.png 1272w, https://substackcdn.com/image/fetch/$s_!Ks8b!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3acf2489-1ab4-4388-a272-b650cac6c7b0_972x904.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ks8b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3acf2489-1ab4-4388-a272-b650cac6c7b0_972x904.png" width="510" height="474.320987654321" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3acf2489-1ab4-4388-a272-b650cac6c7b0_972x904.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:904,&quot;width&quot;:972,&quot;resizeWidth&quot;:510,&quot;bytes&quot;:318924,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Ks8b!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3acf2489-1ab4-4388-a272-b650cac6c7b0_972x904.png 424w, https://substackcdn.com/image/fetch/$s_!Ks8b!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3acf2489-1ab4-4388-a272-b650cac6c7b0_972x904.png 848w, https://substackcdn.com/image/fetch/$s_!Ks8b!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3acf2489-1ab4-4388-a272-b650cac6c7b0_972x904.png 1272w, https://substackcdn.com/image/fetch/$s_!Ks8b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3acf2489-1ab4-4388-a272-b650cac6c7b0_972x904.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I strongly believe that the impact of Voice AI will be very significant on voice channel.</p><div><hr></div><p>We&#8217;ll dive into all of this live at <a href="https://resources.krisp.ai/fullband-2024">Krisp Fullband 2024</a> on Sep 25th. </p><p>Tune in to unpack 10 months of insights from CX leaders, groundbreaking research and more. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://resources.krisp.ai/fullband-2024" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xMLa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2021b166-62ea-43c0-a428-55baba11f81b_2000x947.png 424w, https://substackcdn.com/image/fetch/$s_!xMLa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2021b166-62ea-43c0-a428-55baba11f81b_2000x947.png 848w, https://substackcdn.com/image/fetch/$s_!xMLa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2021b166-62ea-43c0-a428-55baba11f81b_2000x947.png 1272w, https://substackcdn.com/image/fetch/$s_!xMLa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2021b166-62ea-43c0-a428-55baba11f81b_2000x947.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xMLa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2021b166-62ea-43c0-a428-55baba11f81b_2000x947.png" width="1456" height="689" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2021b166-62ea-43c0-a428-55baba11f81b_2000x947.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:689,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:400138,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://resources.krisp.ai/fullband-2024&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!xMLa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2021b166-62ea-43c0-a428-55baba11f81b_2000x947.png 424w, https://substackcdn.com/image/fetch/$s_!xMLa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2021b166-62ea-43c0-a428-55baba11f81b_2000x947.png 848w, https://substackcdn.com/image/fetch/$s_!xMLa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2021b166-62ea-43c0-a428-55baba11f81b_2000x947.png 1272w, https://substackcdn.com/image/fetch/$s_!xMLa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2021b166-62ea-43c0-a428-55baba11f81b_2000x947.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p> </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[$5.5B cost of accents]]></title><description><![CDATA[In a new study that sheds light on the significant impact accents have in offshore contact centers, ContactBabel reveals incredible data that quantifies the impact of language barriers on BPOs.]]></description><link>https://voice-ai-newsletter.krisp.ai/p/55b-cost-of-accents</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/55b-cost-of-accents</guid><dc:creator><![CDATA[Shara]]></dc:creator><pubDate>Thu, 05 Sep 2024 14:02:30 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!ZJA9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf53f6a-2263-4786-96bf-89d731772346_797x488.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In a new study that sheds light on the significant impact accents have in offshore contact centers, ContactBabel reveals incredible data that quantifies the impact of language barriers on BPOs. </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://voice-ai-newsletter.krisp.ai/subscribe?"><span>Subscribe now</span></a></p><p>The study surveyed BPOs with offshore operations and U.S. consumers, revealing important insights into how accent-based misunderstandings affect both customer experience and agent well-being.</p><p><strong>Key Findings:</strong></p><ul><li><p><strong>Customer Experience Discrepancy</strong>: There&#8217;s a significant gap between what contact centers believe and what customers actually want. Only 16% of contact centers thought that U.S.-based agents were a top priority for customers but 35% of U.S. consumers say it is a key factor in their customer experience. This shows that many BPOs do not fully understand their customers' needs.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZJA9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf53f6a-2263-4786-96bf-89d731772346_797x488.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZJA9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf53f6a-2263-4786-96bf-89d731772346_797x488.png 424w, https://substackcdn.com/image/fetch/$s_!ZJA9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf53f6a-2263-4786-96bf-89d731772346_797x488.png 848w, https://substackcdn.com/image/fetch/$s_!ZJA9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf53f6a-2263-4786-96bf-89d731772346_797x488.png 1272w, https://substackcdn.com/image/fetch/$s_!ZJA9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf53f6a-2263-4786-96bf-89d731772346_797x488.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZJA9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf53f6a-2263-4786-96bf-89d731772346_797x488.png" width="797" height="488" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7bf53f6a-2263-4786-96bf-89d731772346_797x488.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:488,&quot;width&quot;:797,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:35300,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZJA9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf53f6a-2263-4786-96bf-89d731772346_797x488.png 424w, https://substackcdn.com/image/fetch/$s_!ZJA9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf53f6a-2263-4786-96bf-89d731772346_797x488.png 848w, https://substackcdn.com/image/fetch/$s_!ZJA9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf53f6a-2263-4786-96bf-89d731772346_797x488.png 1272w, https://substackcdn.com/image/fetch/$s_!ZJA9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf53f6a-2263-4786-96bf-89d731772346_797x488.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p><strong>Impact on Hiring</strong>: Every BPO in the study admitted that an agent's accent plays a crucial role in hiring decisions. On average, 64% of potential agents are not hired because of their accent, rising even more in South Africa and India. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!l1ZC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3158db30-2d05-46f7-a4f2-225e8c4d88c3_627x440.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!l1ZC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3158db30-2d05-46f7-a4f2-225e8c4d88c3_627x440.png 424w, https://substackcdn.com/image/fetch/$s_!l1ZC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3158db30-2d05-46f7-a4f2-225e8c4d88c3_627x440.png 848w, https://substackcdn.com/image/fetch/$s_!l1ZC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3158db30-2d05-46f7-a4f2-225e8c4d88c3_627x440.png 1272w, https://substackcdn.com/image/fetch/$s_!l1ZC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3158db30-2d05-46f7-a4f2-225e8c4d88c3_627x440.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!l1ZC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3158db30-2d05-46f7-a4f2-225e8c4d88c3_627x440.png" width="627" height="440" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3158db30-2d05-46f7-a4f2-225e8c4d88c3_627x440.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:440,&quot;width&quot;:627,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:18773,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!l1ZC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3158db30-2d05-46f7-a4f2-225e8c4d88c3_627x440.png 424w, https://substackcdn.com/image/fetch/$s_!l1ZC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3158db30-2d05-46f7-a4f2-225e8c4d88c3_627x440.png 848w, https://substackcdn.com/image/fetch/$s_!l1ZC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3158db30-2d05-46f7-a4f2-225e8c4d88c3_627x440.png 1272w, https://substackcdn.com/image/fetch/$s_!l1ZC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3158db30-2d05-46f7-a4f2-225e8c4d88c3_627x440.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p><strong>Repetitions and Costs</strong>: Accent-related misunderstandings lead to frequent repetition requests. About 50% of U.S. consumers feel uncomfortable asking offshore agents to repeat themselves, yet 80% of calls involve some form of repetition. These misunderstandings frustrate customers and increase call times, costing the industry <strong>over $5.5 billion annually</strong>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Z0vJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea2581ae-20fb-475c-a1b8-d58351cdb9f7_922x392.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Z0vJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea2581ae-20fb-475c-a1b8-d58351cdb9f7_922x392.png 424w, https://substackcdn.com/image/fetch/$s_!Z0vJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea2581ae-20fb-475c-a1b8-d58351cdb9f7_922x392.png 848w, https://substackcdn.com/image/fetch/$s_!Z0vJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea2581ae-20fb-475c-a1b8-d58351cdb9f7_922x392.png 1272w, https://substackcdn.com/image/fetch/$s_!Z0vJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea2581ae-20fb-475c-a1b8-d58351cdb9f7_922x392.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Z0vJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea2581ae-20fb-475c-a1b8-d58351cdb9f7_922x392.png" width="696" height="295.91323210412145" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ea2581ae-20fb-475c-a1b8-d58351cdb9f7_922x392.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:392,&quot;width&quot;:922,&quot;resizeWidth&quot;:696,&quot;bytes&quot;:73357,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Z0vJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea2581ae-20fb-475c-a1b8-d58351cdb9f7_922x392.png 424w, https://substackcdn.com/image/fetch/$s_!Z0vJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea2581ae-20fb-475c-a1b8-d58351cdb9f7_922x392.png 848w, https://substackcdn.com/image/fetch/$s_!Z0vJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea2581ae-20fb-475c-a1b8-d58351cdb9f7_922x392.png 1272w, https://substackcdn.com/image/fetch/$s_!Z0vJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea2581ae-20fb-475c-a1b8-d58351cdb9f7_922x392.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li></ul><p>This study highlights the need for BPOs to better understand and address the challenges posed by accents in offshore contact centers. </p><p>By recognizing these issues and considering voice AI technology, like <a href="https://krisp.ai/accent-localization/">AI Accent Localization</a> and <a href="https://krisp.ai/ai-interpreter/">AI Interpreter</a>, BPOs can improve both customer satisfaction and operational efficiency.</p><p>&#128202;Access the full report <strong><a href="https://resources.krisp.ai/5.5b-impact-of-accent-and-language-barriers-on-cx">&#8220;</a></strong><em><strong><a href="https://resources.krisp.ai/5.5b-impact-of-accent-and-language-barriers-on-cx">Can you repeat that?</a></strong></em><strong><a href="https://resources.krisp.ai/5.5b-impact-of-accent-and-language-barriers-on-cx">&#8221; The $5.5 Billion Impact of Language Barriers on Offshore Agents &amp; Contact Centers</a></strong></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://resources.krisp.ai/5.5b-impact-of-accent-and-language-barriers-on-cx&quot;,&quot;text&quot;:&quot;Read the Report&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://resources.krisp.ai/5.5b-impact-of-accent-and-language-barriers-on-cx"><span>Read the Report</span></a></p><p><br><strong>Also,</strong> <strong>on Sep 25th join me and Steve Morrell from ContactBabel as we discuss more key findings from the report, live at Fullband 2024 </strong>&#128293;</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://resources.krisp.ai/fullband-2024" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!apjT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac086f17-8979-4d38-b8e9-dd0325eb8a8a_3701x713.png 424w, https://substackcdn.com/image/fetch/$s_!apjT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac086f17-8979-4d38-b8e9-dd0325eb8a8a_3701x713.png 848w, https://substackcdn.com/image/fetch/$s_!apjT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac086f17-8979-4d38-b8e9-dd0325eb8a8a_3701x713.png 1272w, https://substackcdn.com/image/fetch/$s_!apjT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac086f17-8979-4d38-b8e9-dd0325eb8a8a_3701x713.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!apjT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac086f17-8979-4d38-b8e9-dd0325eb8a8a_3701x713.png" width="1456" height="280" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ac086f17-8979-4d38-b8e9-dd0325eb8a8a_3701x713.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:280,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:589925,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://resources.krisp.ai/fullband-2024&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!apjT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac086f17-8979-4d38-b8e9-dd0325eb8a8a_3701x713.png 424w, https://substackcdn.com/image/fetch/$s_!apjT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac086f17-8979-4d38-b8e9-dd0325eb8a8a_3701x713.png 848w, https://substackcdn.com/image/fetch/$s_!apjT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac086f17-8979-4d38-b8e9-dd0325eb8a8a_3701x713.png 1272w, https://substackcdn.com/image/fetch/$s_!apjT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac086f17-8979-4d38-b8e9-dd0325eb8a8a_3701x713.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://resources.krisp.ai/fullband-2024&quot;,&quot;text&quot;:&quot;Register now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://resources.krisp.ai/fullband-2024"><span>Register now</span></a></p><div><hr></div><p>Sources:</p><ol><li><p>&nbsp;ContactBabel, &#8220;2024 US Contact Center Decision Makers&#8217; Guide&#8221;</p></li><li><p>&nbsp;ContactBabel, &#8220;US Contact Centers 2023-2027: The State of the Industry&#8221;</p></li><li><p>&nbsp;Helpware, &#8220;Global Adoption of Offshore Outsourcing: Statistics&#8221;</p></li></ol>]]></content:encoded></item><item><title><![CDATA[On-Device Speech-to-Text, 10x cheaper 🔥]]></title><description><![CDATA[All modern laptops and PCs are actively equipped with AI chips (NPUs).]]></description><link>https://voice-ai-newsletter.krisp.ai/p/on-device-speech-to-text-10x-cheaper</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/on-device-speech-to-text-10x-cheaper</guid><pubDate>Thu, 29 Aug 2024 14:02:01 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!VqJW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f7392e1-39fd-40a2-b5db-9de304dfba0d_1842x1046.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>All modern laptops and PCs are actively equipped with AI chips (NPUs).</p><p>I wrote about it here:</p><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:141917072,&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/p/the-ai-pc-era-for-call-centers-is&quot;,&quot;publication_id&quot;:2073467,&quot;publication_name&quot;:&quot;Voice AI Newsletter&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png&quot;,&quot;title&quot;:&quot;The AI PC Era &#128640; for Call Centers is here&quot;,&quot;truncated_body_text&quot;:&quot;&#8220;The AI PC will be a sea change moment in technical innovation&#8221;&quot;,&quot;date&quot;:&quot;2024-02-22T14:02:18.129Z&quot;,&quot;like_count&quot;:5,&quot;comment_count&quot;:0,&quot;bylines&quot;:[],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:false,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://voice-ai-newsletter.krisp.ai/p/the-ai-pc-era-for-call-centers-is?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!YLgs!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png"><span class="embedded-post-publication-name">Voice AI Newsletter</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">The AI PC Era &#128640; for Call Centers is here</div></div><div class="embedded-post-body">&#8220;The AI PC will be a sea change moment in technical innovation&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">2 years ago &#183; 5 likes</div></a></div><p>And here:</p><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:139719477,&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/p/on-device-transcription-call-centers&quot;,&quot;publication_id&quot;:2073467,&quot;publication_name&quot;:&quot;Voice AI Newsletter&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png&quot;,&quot;title&quot;:&quot;The Power of On-Device Transcription in Call Centers &#128170;&quot;,&quot;truncated_body_text&quot;:&quot;With advancements in Speech-to-text AI and on-device AI, the call center industry is approaching a transformative change. We should start rethinking the traditional approach of cloud-based transcriptions, bringing the process directly onto the agents' devices.&quot;,&quot;date&quot;:&quot;2023-12-14T14:04:12.423Z&quot;,&quot;like_count&quot;:0,&quot;comment_count&quot;:0,&quot;bylines&quot;:[{&quot;id&quot;:32916364,&quot;name&quot;:&quot;Davit Baghdasaryan&quot;,&quot;handle&quot;:&quot;davitb&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/23088dde-6cb0-44df-b220-5f22830cdd4c_1179x960.jpeg&quot;,&quot;bio&quot;:&quot;CEO &amp; Co-Founder of Krisp, early pioneer in Voice AI.\n20+ years in engineering. 18 US patent applications, ex Twilion&quot;,&quot;profile_set_up_at&quot;:&quot;2023-11-01T10:17:13.117Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:2076258,&quot;user_id&quot;:32916364,&quot;publication_id&quot;:2073467,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:false,&quot;publication&quot;:{&quot;id&quot;:2073467,&quot;name&quot;:&quot;Voice AI Newsletter&quot;,&quot;subdomain&quot;:&quot;krispai&quot;,&quot;custom_domain&quot;:&quot;voice-ai-newsletter.krisp.ai&quot;,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Voice AI insights from Krisp's CEO&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png&quot;,&quot;author_id&quot;:32916364,&quot;theme_var_background_pop&quot;:&quot;#6B26FF&quot;,&quot;created_at&quot;:&quot;2023-11-01T10:17:40.391Z&quot;,&quot;rss_website_url&quot;:null,&quot;email_from_name&quot;:&quot;Voice AI Newsletter&quot;,&quot;copyright&quot;:&quot;Krisp Technologies&quot;,&quot;founding_plan_name&quot;:null,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;disabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;is_personal_mode&quot;:false}},{&quot;id&quot;:2739922,&quot;user_id&quot;:32916364,&quot;publication_id&quot;:2700608,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:false,&quot;publication&quot;:{&quot;id&quot;:2700608,&quot;name&quot;:&quot;Physmath School Newsletter&quot;,&quot;subdomain&quot;:&quot;physmath&quot;,&quot;custom_domain&quot;:&quot;www.physmath-newsletter.com&quot;,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;All the interesting info on Physmath School and Community&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6381e514-b8d6-44e4-b747-3c1b3c1f4922_674x674.png&quot;,&quot;author_id&quot;:32916364,&quot;theme_var_background_pop&quot;:&quot;#D10000&quot;,&quot;created_at&quot;:&quot;2024-06-12T12:54:43.809Z&quot;,&quot;rss_website_url&quot;:null,&quot;email_from_name&quot;:&quot;Physmath School&quot;,&quot;copyright&quot;:&quot;Physmath School&quot;,&quot;founding_plan_name&quot;:null,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;disabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;is_personal_mode&quot;:false}}],&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:false,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://voice-ai-newsletter.krisp.ai/p/on-device-transcription-call-centers?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!YLgs!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F831a2f7e-d0a7-4e3d-87a8-c42c65d0b71c_1000x1000.png"><span class="embedded-post-publication-name">Voice AI Newsletter</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">The Power of On-Device Transcription in Call Centers &#128170;</div></div><div class="embedded-post-body">With advancements in Speech-to-text AI and on-device AI, the call center industry is approaching a transformative change. We should start rethinking the traditional approach of cloud-based transcriptions, bringing the process directly onto the agents' devices&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">2 years ago &#183; Davit Baghdasaryan</div></a></div><p>This trend allows more Speech-to-Text workloads to be moved to on-device AI.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VqJW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f7392e1-39fd-40a2-b5db-9de304dfba0d_1842x1046.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VqJW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f7392e1-39fd-40a2-b5db-9de304dfba0d_1842x1046.png 424w, https://substackcdn.com/image/fetch/$s_!VqJW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f7392e1-39fd-40a2-b5db-9de304dfba0d_1842x1046.png 848w, https://substackcdn.com/image/fetch/$s_!VqJW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f7392e1-39fd-40a2-b5db-9de304dfba0d_1842x1046.png 1272w, https://substackcdn.com/image/fetch/$s_!VqJW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f7392e1-39fd-40a2-b5db-9de304dfba0d_1842x1046.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VqJW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f7392e1-39fd-40a2-b5db-9de304dfba0d_1842x1046.png" width="500" height="283.99725274725273" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6f7392e1-39fd-40a2-b5db-9de304dfba0d_1842x1046.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:827,&quot;width&quot;:1456,&quot;resizeWidth&quot;:500,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VqJW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f7392e1-39fd-40a2-b5db-9de304dfba0d_1842x1046.png 424w, https://substackcdn.com/image/fetch/$s_!VqJW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f7392e1-39fd-40a2-b5db-9de304dfba0d_1842x1046.png 848w, https://substackcdn.com/image/fetch/$s_!VqJW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f7392e1-39fd-40a2-b5db-9de304dfba0d_1842x1046.png 1272w, https://substackcdn.com/image/fetch/$s_!VqJW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f7392e1-39fd-40a2-b5db-9de304dfba0d_1842x1046.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This is incredibly valuable in call centers and BPOs.</p><p>The problem with STT is that you need to first obtain the recordings from SoftPhone platforms and unfortunately it's not always possible.</p><p>- There might be no integration available with Softphone</p><p>- The cost might be prohibitive</p><p>- Customer might not allow it due to compliance</p><p>- It might take days to obtain it</p><h2>Introducing Krisp Speech-to-Text API &#128293;</h2><p>At <a href="https://www.linkedin.com/company/krisphq/">Krisp</a>, we just launched an on-device <a href="https://krisp.ai/speech-to-text-call-center/">Speech-to-Text API</a>, specifically designed for call centers and BPOs.</p><ul><li><p>Automatically supports all voice platforms (no integration needed)</p></li><li><p>Up to 10x cheaper than the industry</p></li><li><p>Customer data is not sent to any servers (including Krisp's)</p></li><li><p>Both post-call and real-time</p></li><li><p>Even PII/PCI is removed on-device</p></li></ul><h3><strong>Who is it built for?</strong></h3><p>We have been building this technology for &gt;2 years and are excited to bring it to our call center customers.</p><p>The solution is perfect for call centers and BPOs wanting to build Speech Analytics, Customer QA or Agent Assist technologies to make their operations more effective. Speech-to-Text is a core building block for these technologies.<br><br>It&#8217;s also perfect for enterprises that don&#8217;t want to share internal or customer data with 3rd parties.</p><p>Since Speech-to-Text happens on-device, the pricing is disruptive.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZHts!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F36ac4fad-1137-4e56-997e-f78792ae5717_1600x818.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZHts!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F36ac4fad-1137-4e56-997e-f78792ae5717_1600x818.png 424w, https://substackcdn.com/image/fetch/$s_!ZHts!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F36ac4fad-1137-4e56-997e-f78792ae5717_1600x818.png 848w, https://substackcdn.com/image/fetch/$s_!ZHts!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F36ac4fad-1137-4e56-997e-f78792ae5717_1600x818.png 1272w, https://substackcdn.com/image/fetch/$s_!ZHts!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F36ac4fad-1137-4e56-997e-f78792ae5717_1600x818.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZHts!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F36ac4fad-1137-4e56-997e-f78792ae5717_1600x818.png" width="1456" height="744" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/36ac4fad-1137-4e56-997e-f78792ae5717_1600x818.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:744,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZHts!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F36ac4fad-1137-4e56-997e-f78792ae5717_1600x818.png 424w, https://substackcdn.com/image/fetch/$s_!ZHts!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F36ac4fad-1137-4e56-997e-f78792ae5717_1600x818.png 848w, https://substackcdn.com/image/fetch/$s_!ZHts!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F36ac4fad-1137-4e56-997e-f78792ae5717_1600x818.png 1272w, https://substackcdn.com/image/fetch/$s_!ZHts!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F36ac4fad-1137-4e56-997e-f78792ae5717_1600x818.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3><strong>How it works</strong></h3><p>The installation process is straightforward:</p><ol><li><p>Install Krisp app on agents&#8217; devices (the same app that provides <a href="https://krisp.ai/call-center-noise-cancellation/">Noise Cancellation</a> and <a href="https://krisp.ai/accent-localization/">Accent Localization</a>)</p></li><li><p>Turn on Speech-to-Text from Krisp&#8217;s web admin dashboard</p></li><li><p>Specify the <strong>private cloud</strong> where you want call transcripts to be uploaded (e.g. S3)</p></li></ol><p>Once set up, as soon as a call ends, Krisp will upload the transcript to the private cloud location, with &lt; 1 second latency.</p><pre><code><strong>If Recording is enabled, Krisp will also record calls and upload recordings to the same private cloud, along with call transcripts.</strong></code></pre><h3><strong>Integration with softphone systems</strong></h3><p>The solution automatically integrates with top CX and voice platforms such as Genesys, Avaya, TalkDesk, Teams, Zoom, and more, simplifying the implementation process. No integration is required.</p><h3>Accuracy</h3><p>Krisp-generated transcripts go through several post-processing steps to make sure the transcript has the highest quality level:</p><ul><li><p>Accuracy with a WER (Word Error Rate) of only 7.96%</p></li><li><p>Adds <strong>punctuation</strong>, <strong>capitalization</strong>, and <strong>numerical values</strong></p></li><li><p>Assigns text to <strong>speakers</strong> with <strong>timestamps</strong></p></li><li><p>If enabled, removes <strong>PII/PCI</strong> and filler words</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mwSH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1e388c7-e3a8-4808-a7cf-4c1ec0e04c49_1660x922.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mwSH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1e388c7-e3a8-4808-a7cf-4c1ec0e04c49_1660x922.png 424w, https://substackcdn.com/image/fetch/$s_!mwSH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1e388c7-e3a8-4808-a7cf-4c1ec0e04c49_1660x922.png 848w, https://substackcdn.com/image/fetch/$s_!mwSH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1e388c7-e3a8-4808-a7cf-4c1ec0e04c49_1660x922.png 1272w, https://substackcdn.com/image/fetch/$s_!mwSH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1e388c7-e3a8-4808-a7cf-4c1ec0e04c49_1660x922.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mwSH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1e388c7-e3a8-4808-a7cf-4c1ec0e04c49_1660x922.png" width="586" height="325.60027472527474" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a1e388c7-e3a8-4808-a7cf-4c1ec0e04c49_1660x922.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:809,&quot;width&quot;:1456,&quot;resizeWidth&quot;:586,&quot;bytes&quot;:492482,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mwSH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1e388c7-e3a8-4808-a7cf-4c1ec0e04c49_1660x922.png 424w, https://substackcdn.com/image/fetch/$s_!mwSH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1e388c7-e3a8-4808-a7cf-4c1ec0e04c49_1660x922.png 848w, https://substackcdn.com/image/fetch/$s_!mwSH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1e388c7-e3a8-4808-a7cf-4c1ec0e04c49_1660x922.png 1272w, https://substackcdn.com/image/fetch/$s_!mwSH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1e388c7-e3a8-4808-a7cf-4c1ec0e04c49_1660x922.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Supporting 4 languages</h3><p>Krisp&#8217;s STT API currently supports English, German, French and Spanish. More languages will come over time.</p><h3>Learn More</h3><p>You can learn more <a href="https://krisp.ai/speech-to-text-call-center/#contact">here</a>.</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! </p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Learnings from CCW 2024 🔥]]></title><description><![CDATA[This year&#8217;s CCW Las Vegas was a great experience. It&#8217;s great seeing that the conference is back to pre-pandemic levels and growing after COVID. From what we&#8217;ve heard, it will be even bigger next year. CCW will expect 5000 attendees in 2025. Krisp&#8217;s booth at the Expo:]]></description><link>https://voice-ai-newsletter.krisp.ai/p/learnings-from-ccw-2024</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/learnings-from-ccw-2024</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 27 Jun 2024 14:02:24 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/a66a37be-b19d-4594-9898-b24704846f17_2016x1512.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>This year&#8217;s <a href="https://www.customercontactweek.com/">CCW Las Vegas</a> was a great experience. It&#8217;s great seeing that the conference is back to pre-pandemic levels and growing after COVID.</p><p>From what we&#8217;ve heard, it will be even bigger next year. CCW will expect 5000 attendees in 2025.</p><p>Krisp&#8217;s booth at the Expo:</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;8fdc8f8e-6149-42aa-90a0-49499a6ecf95&quot;,&quot;duration&quot;:null}"></div><p>There were 2 Expo halls this year and a lot of companies showcasing their products.</p><p>Summary of the companies exhibiting:</p><ul><li><p>A lot of companies showcasing AI Voice Agent products</p></li><li><p>A lot of companies showcasing AI Agent Assist products</p></li><li><p>3 companies showcasing AI Accent Localization</p></li><li><p>(at least) 2 companies showcasing AI Voice Translation</p></li></ul><p>It&#8217;s becoming clear that there are two camps of Voice AI products in CX:</p><ul><li><p>Agent Assists - products that help agents to be more productive</p></li><li><p>AI Voice Agents - products that build autonomous voice-based agents</p></li></ul><p>I sat down with 5 amazing executives and thought leaders onsite to discuss the Future of Voice AI:</p><ul><li><p><a href="https://www.linkedin.com/in/eric-guarro-0083baa4/">Eric Guarro</a>, SVP of Digital Transformation, ibex</p></li><li><p><a href="https://www.linkedin.com/in/nataliedbeckerman1/">Natalie Beckerman</a>, Global Head of Customer Support Operations, IHG</p></li><li><p><a href="https://www.linkedin.com/in/shephyken/">Shep Hyken</a>, Customer Experience Expert | NYT Bestselling Author</p></li><li><p><a href="https://www.linkedin.com/in/philip-b-9a8535b/">Philip Bennett</a>, Head of Innovation, Empire Today</p></li><li><p><a href="https://www.linkedin.com/in/praveer-chadha-5346a48/">Praveer Chadha</a>, SVP Customer Management, Datamatics</p></li></ul><p>Here is the recap of the interviews.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;bfec8ee0-0def-4738-ae5a-a5391bb1733d&quot;,&quot;duration&quot;:null}"></div><p>I will be publishing these insightful interviews in a 4-week CCW series.</p><p>In our discussions, key themes emerged: integrating AI, training AI Agents in emotional intelligence, scaling multi-language support for global growth, and more. I'll explore these topics in depth throughout the series.</p><p>One thing everyone seems to be aligned on is the narrative that </p><ul><li><p>AI Agents will take over easy, routine tasks while frontline workers will focus on more complicated customer interactions and escalations. </p></li><li><p>Powerful and seamless Agent Assist products will enable human agents to provide deeper, more complex support, enhancing their productivity and effectiveness.   </p></li></ul><p>And both are super important categories for the future of customer experience. </p><p>This synergy promises to elevate customer satisfaction and operational efficiency across the industry.</p><p>Stay tuned!</p><p></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://voice-ai-newsletter.krisp.ai/subscribe?"><span>Subscribe now</span></a></p><p></p>]]></content:encoded></item><item><title><![CDATA[On-Device Voice AI is 🔥]]></title><description><![CDATA[In today's fast-paced digital world, customer expectations are higher than ever.]]></description><link>https://voice-ai-newsletter.krisp.ai/p/on-device-voice-ai-is</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/on-device-voice-ai-is</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 13 Jun 2024 14:01:53 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!FSpu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb305cdc9-9a05-462b-af91-26a91cc83ce9_1024x1024.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In today's fast-paced digital world, customer expectations are higher than ever. They demand quick, efficient, and seamless interactions with businesses. Contact centers are at the forefront of this customer experience (CX) revolution, and the need for real-time support has never been greater. Enter On-Device AI&#8212;an emerging technology that promises to bring intelligence closer to customer interactions, significantly reducing latency and costs, while enhancing security and privacy.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://voice-ai-newsletter.krisp.ai/subscribe?"><span>Subscribe now</span></a></p><h2><strong>What is On-Device AI?</strong></h2><p>On-Device AI refers to the deployment of AI models directly on-device. Unlike Cloud AI, which relies on centralized cloud-based systems for processing and decision-making, On-Device AI processes data locally on the device. This shift in the computing paradigm offers numerous advantages for contact centers aiming to improve customer service.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FSpu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb305cdc9-9a05-462b-af91-26a91cc83ce9_1024x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FSpu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb305cdc9-9a05-462b-af91-26a91cc83ce9_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!FSpu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb305cdc9-9a05-462b-af91-26a91cc83ce9_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!FSpu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb305cdc9-9a05-462b-af91-26a91cc83ce9_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!FSpu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb305cdc9-9a05-462b-af91-26a91cc83ce9_1024x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FSpu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb305cdc9-9a05-462b-af91-26a91cc83ce9_1024x1024.webp" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b305cdc9-9a05-462b-af91-26a91cc83ce9_1024x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:754172,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FSpu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb305cdc9-9a05-462b-af91-26a91cc83ce9_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!FSpu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb305cdc9-9a05-462b-af91-26a91cc83ce9_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!FSpu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb305cdc9-9a05-462b-af91-26a91cc83ce9_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!FSpu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb305cdc9-9a05-462b-af91-26a91cc83ce9_1024x1024.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><h2><strong>Benefits of On-Device AI in Contact Centers</strong></h2><h3><strong>1. Reduced Latency</strong></h3><p>One of the most significant advantages of on-device AI is its ability to reduce latency. In a contact center environment, every millisecond counts. Cloud-based technology requires data to be sent to a remote server for processing, which can introduce delays. AI that runs on-device eliminates this round-trip by processing data locally, resulting in near-instantaneous responses.</p><p>For instance, AI-powered speech to text technology operating on-device can understand and process spoken words in real-time, without the lag associated with cloud processing. This leads to more accessible transcriptions.</p><h3><strong>2. Improved Real-Time Decision-Making</strong></h3><p>Contact centers often deal with a high volume of customer interactions, each requiring quick and accurate decisions. On-Device AI enables real-time data processing and analysis, empowering agents with actionable insights at the moment of interaction. This can include everything from sentiment analysis and customer behavior prediction to real-time translation and personalized recommendations.</p><p>By bringing intelligence closer to the customer interaction, contact centers can make more informed decisions faster, ultimately enhancing the overall customer experience.</p><h3><strong>3. Cost Efficiency</strong></h3><p>Processing data in the cloud can be expensive, especially when dealing with large volumes of data and high-frequency interactions. On-Device AI reduces the need for constant data transmission to and from the cloud, resulting in lower bandwidth costs and reduced cloud service fees.</p><p>Additionally, devices can handle many AI tasks independently, reducing the load on central servers and allowing for more efficient resource allocation. This cost efficiency can be a game-changer for contact centers looking to optimize their operations without compromising on performance.</p><h3><strong>4. Enhanced Security and Privacy</strong></h3><p>Data security and privacy are paramount in contact centers, where sensitive customer information is often handled. On-Device AI enhances security by keeping data processing local, minimizing the exposure of sensitive information to potential breaches during transmission.</p><h2><strong>Real-World Applications of On-Device Voice AI in Contact Centers</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YHCI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febc69539-06e0-446a-af90-5e10ec19ae57_1792x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YHCI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febc69539-06e0-446a-af90-5e10ec19ae57_1792x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!YHCI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febc69539-06e0-446a-af90-5e10ec19ae57_1792x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!YHCI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febc69539-06e0-446a-af90-5e10ec19ae57_1792x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!YHCI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febc69539-06e0-446a-af90-5e10ec19ae57_1792x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YHCI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febc69539-06e0-446a-af90-5e10ec19ae57_1792x1024.webp" width="1456" height="832" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ebc69539-06e0-446a-af90-5e10ec19ae57_1792x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:832,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1055344,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YHCI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febc69539-06e0-446a-af90-5e10ec19ae57_1792x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!YHCI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febc69539-06e0-446a-af90-5e10ec19ae57_1792x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!YHCI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febc69539-06e0-446a-af90-5e10ec19ae57_1792x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!YHCI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febc69539-06e0-446a-af90-5e10ec19ae57_1792x1024.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><h3><strong>AI Accent Localization</strong></h3><p>AI Accent Localization is a compelling application of on-device AI in contact centers, enabling the understanding and processing of diverse accents in real-time. This technology ensures that customer interactions are smooth and efficient, regardless of the customer's native accent or regional dialect. By operating directly on the device, Accent Localization provides quick, accurate responses without the delays associated with cloud processing.</p><h3><strong>Real-Time Transcription and Sentiment Analysis</strong></h3><p>Understanding customer sentiment is crucial for delivering personalized and empathetic service. On-device transcription and AI can analyze customer sentiment in real-time during interactions, enabling agents to adjust their approach based on the customer's emotional state. This leads to more positive and meaningful customer experiences.</p><h3><strong>Real-time Noise Cancellation</strong></h3><p>On-device AI-powered noise cancellation significantly enhances the clarity of customer interactions by filtering out background noise in real-time. This technology ensures that both customers and agents communicate more effectively, leading to higher quality conversations and improved customer satisfaction. In high-volume environments like contact centers, this capability drastically reduces misunderstandings and the need for repetition, streamlining operations and enhancing overall efficiency.</p><h3><strong>Fraud Detection and Prevention</strong></h3><p>Enhancing security measures by detecting fraudulent activities in real-time, AI can analyze voice patterns during a call to identify potential fraudsters, alerting agents to take necessary precautions. This proactive approach helps protect both the contact center and its customers from malicious activities.</p><h2><strong>Conclusion</strong></h2><p>As contact centers strive to meet the evolving demands of customers, On-Device AI emerges as a powerful tool for reducing latency, improving real-time decision-making, and enhancing overall customer experience. By processing data locally and bringing intelligence closer to customer interactions, On-Device AI offers numerous benefits, including cost efficiency, enhanced security, and simplified management.</p>]]></content:encoded></item><item><title><![CDATA[OpenAI GPT-4o and Voice Bots]]></title><description><![CDATA[On May 13th, OpenAI showed their new, multi-model called GPT-4o (omni).]]></description><link>https://voice-ai-newsletter.krisp.ai/p/openai-gpt-4o-voice-bots</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/openai-gpt-4o-voice-bots</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 16 May 2024 14:00:43 GMT</pubDate><enclosure url="https://substackcdn.com/image/youtube/w_728,c_limit/vgYi3Wr7v_g" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>On May 13th, OpenAI showed their new, multi-model called GPT-4o (omni). </p><div id="youtube2-vgYi3Wr7v_g" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;vgYi3Wr7v_g&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/vgYi3Wr7v_g?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>The demo app was ChatGPT and the demo&#8217;s focus was their new Voice mode.</p><p>The demos were <strong>exceptional</strong> and <strong>quite futuristic</strong>!</p><p>OpenAI&#8217;s engineering team has figured out a way to map audio to audio directly as a first-class modality which reduced the latency and added more &#8220;audio intelligence&#8221; to the model. </p><p>The result is a low-latency and natural-sounding conversational AI.</p><p>Many startups have been trying to do this for a while but bringing the latency down was a challenge.</p><p>It turns out that having an end-to-end trained speech foundational model was the solution.</p><p>The beauty of this model is that it is able to perform many tasks in parallel:</p><ul><li><p>Transcribe (even better than Whisper)</p></li><li><p>Translate (better than many existing models)</p></li><li><p>Reason better than GPT-4.5 and other models</p></li><li><p>Generate fast response</p></li></ul><div id="youtube2-WzUnEfiIqP4" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;WzUnEfiIqP4&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/WzUnEfiIqP4?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>So how will this impact Voice Bots (e.g. in Call Centers)?</p><p>Once GPT-4o <strong>voice</strong> mode is made available, the companies will switch to it. Their voice bots will:</p><ul><li><p>sound more natural</p></li><li><p>will have 2-3x lower latency</p></li><li><p>will speak different languages</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!e7Ig!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda5dfb1d-3036-455e-bfdf-88b7318b9908_2068x1044.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!e7Ig!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda5dfb1d-3036-455e-bfdf-88b7318b9908_2068x1044.jpeg 424w, https://substackcdn.com/image/fetch/$s_!e7Ig!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda5dfb1d-3036-455e-bfdf-88b7318b9908_2068x1044.jpeg 848w, https://substackcdn.com/image/fetch/$s_!e7Ig!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda5dfb1d-3036-455e-bfdf-88b7318b9908_2068x1044.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!e7Ig!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda5dfb1d-3036-455e-bfdf-88b7318b9908_2068x1044.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!e7Ig!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda5dfb1d-3036-455e-bfdf-88b7318b9908_2068x1044.jpeg" width="1456" height="735" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/da5dfb1d-3036-455e-bfdf-88b7318b9908_2068x1044.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:735,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:144595,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!e7Ig!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda5dfb1d-3036-455e-bfdf-88b7318b9908_2068x1044.jpeg 424w, https://substackcdn.com/image/fetch/$s_!e7Ig!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda5dfb1d-3036-455e-bfdf-88b7318b9908_2068x1044.jpeg 848w, https://substackcdn.com/image/fetch/$s_!e7Ig!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda5dfb1d-3036-455e-bfdf-88b7318b9908_2068x1044.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!e7Ig!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda5dfb1d-3036-455e-bfdf-88b7318b9908_2068x1044.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The adoption of Voice Bots products will simply accelerate. Exciting times ahead!</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Call Center: SuperAgent vs AI Bot]]></title><description><![CDATA[tl;dr AI bots will have a transformational impact on BPOs in the Call Center industry The result will be that more frontline workers will switch to solving L2 and L3 customer problems instead of L1 BPOs will need to empower their workers with AI to solve more L2 and L3 problems]]></description><link>https://voice-ai-newsletter.krisp.ai/p/call-center-superagent-vs-ai-bot</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/call-center-superagent-vs-ai-bot</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 25 Apr 2024 14:02:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c0dd20-0f5d-46b5-8583-33d873ac692a_992x992.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2><strong>tl;dr</strong></h2><ul><li><p>AI bots will  have a transformational impact on Call Center BPOs</p></li><li><p>More frontline workers will switch to solving L2 and L3 customer problems instead of L1</p></li><li><p>BPOs will need to empower their workers with AI and create &#8220;SuperAgents&#8221;</p></li><li><p>Transcription technology is a key infrastructure for SuperAgents however transcription is quite expensive. Hence cheap transcription technology is a critical component.</p></li><li><p>On-device transcription running on the agent&#8217;s device could be the perfect solution to this. There are <a href="https://krisp.ai/call-center-transcription/">already providers</a> delivering such products.</p></li></ul><h2>AI Voice Bots and BPOs</h2><p>After ChatGPT, all BPO call center companies are trying to understand how AI is going to impact the future of their companies. </p><pre><code>Call center BPOs (Business Process Outsourcing) are companies that manage customer interactions on behalf of other businesses. They handle a wide range of services, including customer support, technical assistance, sales, and other client-related functions. These centers can operate across various communication channels like phone, email, chat, and social media. 

BPOs often provide cost-effective solutions for businesses by handling large volumes of calls and communications, allowing companies to focus on their core activities while maintaining quality customer service. They can operate domestically or internationally, leveraging global talent to provide 24/7 support and services in multiple languages.</code></pre><p>The number of AI Voice Bot startups attempting to replace frontline workers (agents) with AI is growing significantly. These are some of the hottest startups attracting VC funding these days. The promise is that businesses can support their customers entirely with AI, reducing operational costs and increasing customer satisfaction.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Vokx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65763f95-a559-476b-940a-50980a528a98_1530x1364.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Vokx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65763f95-a559-476b-940a-50980a528a98_1530x1364.png 424w, https://substackcdn.com/image/fetch/$s_!Vokx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65763f95-a559-476b-940a-50980a528a98_1530x1364.png 848w, https://substackcdn.com/image/fetch/$s_!Vokx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65763f95-a559-476b-940a-50980a528a98_1530x1364.png 1272w, https://substackcdn.com/image/fetch/$s_!Vokx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65763f95-a559-476b-940a-50980a528a98_1530x1364.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Vokx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65763f95-a559-476b-940a-50980a528a98_1530x1364.png" width="1456" height="1298" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/65763f95-a559-476b-940a-50980a528a98_1530x1364.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1298,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Vokx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65763f95-a559-476b-940a-50980a528a98_1530x1364.png 424w, https://substackcdn.com/image/fetch/$s_!Vokx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65763f95-a559-476b-940a-50980a528a98_1530x1364.png 848w, https://substackcdn.com/image/fetch/$s_!Vokx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65763f95-a559-476b-940a-50980a528a98_1530x1364.png 1272w, https://substackcdn.com/image/fetch/$s_!Vokx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65763f95-a559-476b-940a-50980a528a98_1530x1364.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>One might think that AI is simply going to &#8220;eat&#8221; customer support and call centers, however of course things are more nuanced, as everything else in life.</p><h2>Customer Ticket Levels</h2><p>First, we need to understand that customer support doesn&#8217;t come in one shape. Typically there are 4 levels of tickets, with varying complexity.</p><ul><li><p><strong>L0: Self-service</strong><br>No direct support; customers use FAQs and automated tools.</p></li><li><p><strong>L1: Basic Support</strong><br>Handles common, simple issues like password resets.</p></li><li><p><strong>L2: Technical Support</strong><br>Deals with more complex problems requiring technical skills.</p></li><li><p><strong>L3: Expert Support</strong><br>Addresses the most complex issues, often involving system changes or code fixes.</p></li></ul><p>L0 tickets are already handled by automated tools like chatbots and helpdesk.</p><p>It&#8217;s quite clear that LLM-powered AI is going to have large impact on L1 tickets. In the next 2-5 years AI-powered voice and chat bots are going to handle the majority of such tickets. In other words, the scope of L0 is going to expand into L1</p><p>The number of frontline workers focused on L1 tickets will obviously decrease.</p><p>However, when it comes to more technically challenging tickets (L2 and L3), AI&#8217;s impact is not clear yet. LLMs are not smart enough to be trusted to autonomously solve complex technical problems.</p><h2>More Workers in L2 and L3 </h2><p>As AI takes over L1 tickets, companies will save money. </p><p>With more budget in hand, they will want to invest in improving the support of L2 and L3 tickets. After all, most companies always strive for better customer satisfaction.</p><p>Re-qualifying people from L1 to L2 and hiring new people to handle L2/L3 tickets will become a trend. </p><p>This is one area where BPOs will continue provide value to businesses - qualified workers handling more difficult tasks than AI can do.</p><h2>Enter SuperAgents</h2><p>Another trend that will indeed happen is the introduction of &#8216;Super Agents&#8217; - frontline workers equipped with amazing AI tools that empower them to be more productive.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OpE7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c0dd20-0f5d-46b5-8583-33d873ac692a_992x992.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OpE7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c0dd20-0f5d-46b5-8583-33d873ac692a_992x992.webp 424w, https://substackcdn.com/image/fetch/$s_!OpE7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c0dd20-0f5d-46b5-8583-33d873ac692a_992x992.webp 848w, https://substackcdn.com/image/fetch/$s_!OpE7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c0dd20-0f5d-46b5-8583-33d873ac692a_992x992.webp 1272w, https://substackcdn.com/image/fetch/$s_!OpE7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c0dd20-0f5d-46b5-8583-33d873ac692a_992x992.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OpE7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c0dd20-0f5d-46b5-8583-33d873ac692a_992x992.webp" width="992" height="992" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/54c0dd20-0f5d-46b5-8583-33d873ac692a_992x992.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:992,&quot;width&quot;:992,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!OpE7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c0dd20-0f5d-46b5-8583-33d873ac692a_992x992.webp 424w, https://substackcdn.com/image/fetch/$s_!OpE7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c0dd20-0f5d-46b5-8583-33d873ac692a_992x992.webp 848w, https://substackcdn.com/image/fetch/$s_!OpE7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c0dd20-0f5d-46b5-8583-33d873ac692a_992x992.webp 1272w, https://substackcdn.com/image/fetch/$s_!OpE7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54c0dd20-0f5d-46b5-8583-33d873ac692a_992x992.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Frontline workers will use CoPilots before, during and post customer interactions.</p><p>These technologies are already being deployed in call centers and BPOs and are already showing great results in reducing waiting queues, handling time and increasing CSAT.</p><p>Examples:</p><ul><li><p>AI Noise Cancellation</p></li><li><p>AI Accent Localization</p></li><li><p>AI Speech-to-Speech Translation</p></li><li><p>Chat-based CoPilot for Agents integrated with knowledge base</p></li><li><p>Voice-based CoPilot for Agents integrated with knowledge base</p></li><li><p>CoPilots for extracting insights from product logs</p></li><li><p>CoPilots for suggesting the best possible responses on calls</p></li></ul><p>Today, many large BPOs are already integrating (or building) such technologies, supporting the vision of SuperAgents.</p><h2>Transcription is Critical</h2><p>One of the foundational technologies to enable super agents in BPOs is Speech-to-Text (aka Transcription). </p><p>Many modern BPOs still struggle getting access to conversations from their thousands of frontline workers. There are various reasons for this:</p><ul><li><p>Cloud-based Speech-to-Text is expensive</p></li><li><p>BPO customers don&#8217;t want to share transcriptions with BPOs</p></li><li><p>Even if they share, it might take weeks to get these</p></li><li><p>BPOs operate in many locations, with agents using different CCaaS stacks, so getting transcriptions is consistently difficult </p></li></ul><p>We have talked before about 2 new global trends solving these challenges</p><ul><li><p><a href="https://voice-ai-newsletter.krisp.ai/p/on-device-transcription-call-centers">On-Device Transcription</a></p></li><li><p><a href="https://voice-ai-newsletter.krisp.ai/p/the-ai-pc-era-for-call-centers-is">AI PC Era</a></p></li></ul><p>Below is a demo of how On-Device Transcription works for Call Centers.</p><div id="youtube2-jbiTNRbH9-s" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;jbiTNRbH9-s&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/jbiTNRbH9-s?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>These two trends will enable BPOs to easily fill the gap in transcriptions and build powerful CoPilots for future Super Agents.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Noise Cancellation: Headsets vs AI]]></title><description><![CDATA[Exploring the links between audio quality, agent productivity, and customer satisfaction]]></description><link>https://voice-ai-newsletter.krisp.ai/p/audio-improvement-and-noise-reduction</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/audio-improvement-and-noise-reduction</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 11 Apr 2024 17:12:27 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!gAkS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40d909a5-8b78-4887-8032-30177998bbe0_860x646.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In today&#8217;s evolving call center landscape, exceptional audio quality and comprehension can make or break customer interactions.&nbsp; </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gAkS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40d909a5-8b78-4887-8032-30177998bbe0_860x646.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gAkS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40d909a5-8b78-4887-8032-30177998bbe0_860x646.png 424w, https://substackcdn.com/image/fetch/$s_!gAkS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40d909a5-8b78-4887-8032-30177998bbe0_860x646.png 848w, https://substackcdn.com/image/fetch/$s_!gAkS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40d909a5-8b78-4887-8032-30177998bbe0_860x646.png 1272w, https://substackcdn.com/image/fetch/$s_!gAkS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40d909a5-8b78-4887-8032-30177998bbe0_860x646.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gAkS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40d909a5-8b78-4887-8032-30177998bbe0_860x646.png" width="494" height="371.07441860465116" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/40d909a5-8b78-4887-8032-30177998bbe0_860x646.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:646,&quot;width&quot;:860,&quot;resizeWidth&quot;:494,&quot;bytes&quot;:237081,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gAkS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40d909a5-8b78-4887-8032-30177998bbe0_860x646.png 424w, https://substackcdn.com/image/fetch/$s_!gAkS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40d909a5-8b78-4887-8032-30177998bbe0_860x646.png 848w, https://substackcdn.com/image/fetch/$s_!gAkS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40d909a5-8b78-4887-8032-30177998bbe0_860x646.png 1272w, https://substackcdn.com/image/fetch/$s_!gAkS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40d909a5-8b78-4887-8032-30177998bbe0_860x646.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This guide, created in partnership with ContactBabel, contains comprehensive research, actionable insights and guidance to help contact centers navigate the future of voice communication in the call center industry, where clarity, comprehension, and connection redefine customer interactions.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>The Effect of Audio Improvement on Productivity and CX</h2><p>In a ContactBabel survey of 1,000 customers, 60% of customers frequently face challenges in understanding agents due to poor audio quality.&nbsp;</p><p>Some businesses experience higher levels of audio interference due to their contact center environment, use of remote working, and type of customer (e.g. older customers experience this the most). Those taking calls from customers on mobile phones are more likely to have higher rates of repetition.&nbsp;</p><p>Lack of audio clarity is not just a problem on the contact center&#8217;s side of the conversation. With more people than ever using mobile phones to speak with organizations, both agents and customers have to concentrate very hard on the conversation, causing stress and frustration, particularly for the agent who may handle 80-100 calls each day.</p><h4>Real-world example</h4><p>A Spanish contact center gave some sets of headsets with digital audio processors to employees, while others used the more traditional headset. The first group's technology had the effect of 'cleaning up' unwanted noise at either end of the line, allowing the customer and employee to communicate more effectively. Calls were handled more quickly, fewer mistakes were made with data collection (with the attendant knock-on effect that fewer repeat calls were required), and overall, employees handled an average of 10% more calls per day compared to the control group.&nbsp;</p><p>AI-enabled voice isolation can intelligently remove background noise from both sides of the conversation, in real-time, to assist the smooth and accurate flow of the conversation, as well as in recordings to improve post-call analytics and voice-to-text transcription. This also means that businesses spend significantly less on upgrading and replacing top-of-the-line headsets.&nbsp;</p><h2>Noise Canceling Headsets and Technology</h2><p>61% of surveyed individuals indicate their headsets are equipped with noise-canceling microphones to reduce background noise, enhancing call clarity for the caller and reducing the need for repetition. </p><p>However, only 37% have noise-canceling headphones for all headsets, leaving a significant number of agents exposed to noisy environments. And these headsets don&#8217;t address inbound noise coming from the customer&#8217;s environment. The implication is reduced&nbsp;focus, accuracy, and overall performance, potentially prolonging calls.&nbsp;</p><p><strong>Figure 2: Use of noise-canceling microphones and headphones/earphones</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OTUL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd06e447c-1a34-463b-99b3-5bf70896a426_1448x924.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OTUL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd06e447c-1a34-463b-99b3-5bf70896a426_1448x924.png 424w, https://substackcdn.com/image/fetch/$s_!OTUL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd06e447c-1a34-463b-99b3-5bf70896a426_1448x924.png 848w, https://substackcdn.com/image/fetch/$s_!OTUL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd06e447c-1a34-463b-99b3-5bf70896a426_1448x924.png 1272w, https://substackcdn.com/image/fetch/$s_!OTUL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd06e447c-1a34-463b-99b3-5bf70896a426_1448x924.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OTUL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd06e447c-1a34-463b-99b3-5bf70896a426_1448x924.png" width="1448" height="924" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d06e447c-1a34-463b-99b3-5bf70896a426_1448x924.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:924,&quot;width&quot;:1448,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!OTUL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd06e447c-1a34-463b-99b3-5bf70896a426_1448x924.png 424w, https://substackcdn.com/image/fetch/$s_!OTUL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd06e447c-1a34-463b-99b3-5bf70896a426_1448x924.png 848w, https://substackcdn.com/image/fetch/$s_!OTUL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd06e447c-1a34-463b-99b3-5bf70896a426_1448x924.png 1272w, https://substackcdn.com/image/fetch/$s_!OTUL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd06e447c-1a34-463b-99b3-5bf70896a426_1448x924.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Alternatively, noise-canceling&nbsp;software offers a higher quality, more accessible and scalable solution compared to traditional noise-canceling headsets. Because this technology works directly on agent devices and can be centrally managed, implementation and adoption are easy and streamlined for call centers of all sizes. </p><p>Unlike headsets that require individual distribution and maintenance and hardware refresh, AI-powered noise cancellation technology can be updated and improved from one central point, ensuring the on-device technology is always up-to-date and active, making high-quality audio more achievable and consistent.</p><h3>Improving call center audio quality and eliminating noise with Voice AI</h3><p>AI-powered voice technology is revolutionizing the way call centers manage voice interactions and quality. By filtering out background noise, echoes, and other human voices, Voice AI ensures that the primary message is heard, understood, and improves the overall customer experience and satisfaction.</p><p>Voice AI technology cleanses both inbound and outbound audio streams, ensuring clear, concise communication. This breakthrough not only improves the quality of each call but also boosts the efficiency and effectiveness of call center operations, which has massive impact at scale&#8212;reducing the $1.34B cost of poor audio quality for contact centers, industry wide.&nbsp;</p><h4>&#8216;I&#8217;m sorry, can you repeat that?&#8217; </h4><p>Reducing the number of times an agent or customer has to repeat themselves can make a huge impact on call center costs by reducing call times (and thus queue lengths)&nbsp; while also improving the overall customer experience.&nbsp;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Fg4j!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F900b03f8-5fd0-49ec-9c4d-61cddc5e98a0_836x1600.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Fg4j!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F900b03f8-5fd0-49ec-9c4d-61cddc5e98a0_836x1600.png 424w, https://substackcdn.com/image/fetch/$s_!Fg4j!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F900b03f8-5fd0-49ec-9c4d-61cddc5e98a0_836x1600.png 848w, https://substackcdn.com/image/fetch/$s_!Fg4j!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F900b03f8-5fd0-49ec-9c4d-61cddc5e98a0_836x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!Fg4j!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F900b03f8-5fd0-49ec-9c4d-61cddc5e98a0_836x1600.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Fg4j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F900b03f8-5fd0-49ec-9c4d-61cddc5e98a0_836x1600.png" width="836" height="1600" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/900b03f8-5fd0-49ec-9c4d-61cddc5e98a0_836x1600.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1600,&quot;width&quot;:836,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Fg4j!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F900b03f8-5fd0-49ec-9c4d-61cddc5e98a0_836x1600.png 424w, https://substackcdn.com/image/fetch/$s_!Fg4j!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F900b03f8-5fd0-49ec-9c4d-61cddc5e98a0_836x1600.png 848w, https://substackcdn.com/image/fetch/$s_!Fg4j!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F900b03f8-5fd0-49ec-9c4d-61cddc5e98a0_836x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!Fg4j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F900b03f8-5fd0-49ec-9c4d-61cddc5e98a0_836x1600.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Call clarity directly translates to improved first call resolution rates. Issues are resolved faster, and the need for callbacks or escalations are significantly reduced. This efficiency is a win-win: customers experience improved satisfaction, and contact centers benefit from reduced operational costs and experience 10% increases in agent productivity, as seen in the example referenced earlier.</p><p>The reduction in noise complaints speaks volumes&#8212;literally. </p><p>With up to a 78% decrease in noise-related issues, Voice AI is improving call productivity, while eliminating agent stress and customer frustration. </p><p>By effectively eliminating background noise and improving audio quality, <a href="https://krisp.ai/contact-center/">AI Noise Cancellation</a> reduces the need for expensive headsets and physical modifications like soundproof partitions or white noise machines. </p><h4>Replacing traditional, expensive, noise cancellation solutions</h4><p>The shift from hardware to AI Noise Cancellation software lowers initial investment costs while decreasing ongoing maintenance expenses and providing scalability without compromising quality or incurring additional overhead. As a result, contact centers can allocate their resources more efficiently, investing in areas that directly contribute to growth, customer satisfaction, and scaling their operations.&nbsp;</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Voice AI Newsletter! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Technical Deep Dive: AI Accent Localization for Call Centers]]></title><description><![CDATA[This article was originally posted on Krisp AI blog. In this article, we dive deep into a new disruptive technology called AI Accent Localization, which in real-time translates a speaker&#8217;s accent to the listener&#8217;s natively understood accent, using AI.]]></description><link>https://voice-ai-newsletter.krisp.ai/p/deep-dive-ai-accent-localization</link><guid isPermaLink="false">https://voice-ai-newsletter.krisp.ai/p/deep-dive-ai-accent-localization</guid><dc:creator><![CDATA[Davit Baghdasaryan]]></dc:creator><pubDate>Thu, 21 Mar 2024 14:01:39 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b35542f-6e42-4b69-8d27-cd126f3b23ac_1600x899.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote><p>This article was originally posted on <a href="https://krisp.ai/blog/deep-dive-ai-accent-localization-for-call-centers/">Krisp AI blog</a>.</p></blockquote><p>In this article, we dive deep into a new disruptive technology called AI Accent Localization, which in real-time translates a speaker&#8217;s accent to the listener&#8217;s natively understood accent, using AI.</p><p>Accent refers to the distinctive way in which a group of people pronounce words, influenced by their region, country, or social background. In broad terms, English accents can be categorized into major groups such as British, American, Australian, South African, and Indian among others.&nbsp;</p><p>Accents can often be a barrier to communication, affecting the clarity and comprehension of speech. Differences in pronunciation, intonation, and rhythm can lead to misunderstandings.&nbsp;</p><p>While the importance of this topic goes beyond call centers, our primary focus is this industry.</p><h2>Accented speech in call centers</h2><p>The call center industry in the United States has <a href="https://www.siteselectiongroup.com/whitepapers">experienced</a> substantial growth, with a noticeable surge in the creation of new jobs from 2020-onward, both on-shore and globally.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!g8E7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98aa31ef-4927-44e6-9070-e5c00fed9efe_1156x930.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!g8E7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98aa31ef-4927-44e6-9070-e5c00fed9efe_1156x930.png 424w, https://substackcdn.com/image/fetch/$s_!g8E7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98aa31ef-4927-44e6-9070-e5c00fed9efe_1156x930.png 848w, https://substackcdn.com/image/fetch/$s_!g8E7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98aa31ef-4927-44e6-9070-e5c00fed9efe_1156x930.png 1272w, https://substackcdn.com/image/fetch/$s_!g8E7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98aa31ef-4927-44e6-9070-e5c00fed9efe_1156x930.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!g8E7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98aa31ef-4927-44e6-9070-e5c00fed9efe_1156x930.png" width="526" height="423.16608996539793" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/98aa31ef-4927-44e6-9070-e5c00fed9efe_1156x930.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:930,&quot;width&quot;:1156,&quot;resizeWidth&quot;:526,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!g8E7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98aa31ef-4927-44e6-9070-e5c00fed9efe_1156x930.png 424w, https://substackcdn.com/image/fetch/$s_!g8E7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98aa31ef-4927-44e6-9070-e5c00fed9efe_1156x930.png 848w, https://substackcdn.com/image/fetch/$s_!g8E7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98aa31ef-4927-44e6-9070-e5c00fed9efe_1156x930.png 1272w, https://substackcdn.com/image/fetch/$s_!g8E7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98aa31ef-4927-44e6-9070-e5c00fed9efe_1156x930.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"></figcaption></figure></div><p>In 2021, many US based call centers expanded their footprints thanks to the pandemic-fueled adoption of remote work, but growth slowed substantially in 2022. Inflated salaries and limited resources drove call centers to deepen their offshore operations, both in existing and new geographies.</p><p>There are several strong incentives for businesses to expand call centers operations to off-shore locations, including:</p><p><strong>Cost savings</strong>: Labor costs in offshore locations such as India, the Philippines, and Eastern Europe are up to 70% lower than in the United States.</p><p><strong>Access to diverse talent pools:</strong> Offshoring enables access to a diverse talent pool, often with multilingual capabilities, facilitating a more comprehensive customer support service.</p><p><strong>24/7 coverage</strong>: Time zone differences allow for 24/7 coverage, enhancing operational continuity.</p><p>However, offshore operations come with a cost. One major challenge offshore call centers face is decreased language comprehension. Accents, varying fluency levels, cultural nuances and inherent biases lead to misunderstandings and frustration among customers.&nbsp;</p><p>According to Reuters, as many as <a href="https://www.reuters.com/article/idUSTRE5AN37C/">65% of customers</a> have cited difficulties in understanding offshore agents due to language-related issues. Over a third of consumers say working with US-based agents is most important to them when contacting an organization.</p><h2>Accents create challenges in call centers</h2><p>While the world celebrates global and diverse workforces at large, <a href="https://journals.sagepub.com/doi/10.1177/0023830910372495">research </a>shows that misalignment of native language backgrounds between speakers leads to a lack of comprehension and inefficient communication.&nbsp;</p><ul><li><p><strong>Longer calls:</strong> Thick accents contribute to comprehension difficulties, causing higher average handle time (AHT) and also lower first call resolutions (FCR).<br>According to ContactBabel&#8217;s &#8220;2024 US Contact Center Decision Maker&#8217;s Guide&#8221; the cost of mishearing and repetition per year for a 250-seat contact center exceeds $155,000 per year.<br></p></li><li><p><strong>Decreased customer satisfaction</strong>: Language barriers are among the primary contributors to lower customer satisfaction scores within off-shore call centers. According to ContactBabel, 35% of consumers say working with US-based call center agents is most important to them when contacting an organization.</p></li><li><p><strong>High agent attrition rates:</strong> Decreased customer satisfaction creates high stress, in turn decreasing agent morale. The result is higher employee turnover rates. In 2023, US contact centers saw an average annual agent attrition rate of 31%, according to <a href="https://resources.krisp.ai/guide-to-agent-engagement-and-empowerment">The US Contact Center Decision Makers&#8217; Guide to Agent Engagement and Empowerment</a>.</p></li><li><p><strong>Increased onboarding costs: </strong>The need for specialized training programs to address language and cultural nuances further adds to onboarding costs.&nbsp;</p></li><li><p><strong>Limited talent pool: </strong>Finding individuals who meet the required linguistic criteria within the available talent pool is challenging. The competitive demand for specialized language skills leads to increased recruitment costs.&nbsp;</p></li></ul><h2>How do call centers mitigate accent challenges?</h2><h3>Training</h3><p>Accent neutralization training is used as a solution to improve communication clarity in these environments. Call Centers invest in weeks-long accent neutralization training as part of agent onboarding and ongoing improvement. Depending on&nbsp; geography, duration, and training method, training costs can run $500-$1500 per agent during onboarding. The effectiveness of these training programs can be limited due to the inherent challenges in altering long-established accent habits. So, call centers may find it necessary to temporarily remove agents from their operational roles for further retraining, incurring additional costs in the process.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DM4I!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0cddc6bc-bf5f-4460-8cbc-9139984960ec_937x528.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DM4I!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0cddc6bc-bf5f-4460-8cbc-9139984960ec_937x528.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DM4I!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0cddc6bc-bf5f-4460-8cbc-9139984960ec_937x528.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DM4I!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0cddc6bc-bf5f-4460-8cbc-9139984960ec_937x528.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!DM4I!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0cddc6bc-bf5f-4460-8cbc-9139984960ec_937x528.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DM4I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0cddc6bc-bf5f-4460-8cbc-9139984960ec_937x528.jpeg" width="556" height="313.30629669156883" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0cddc6bc-bf5f-4460-8cbc-9139984960ec_937x528.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:528,&quot;width&quot;:937,&quot;resizeWidth&quot;:556,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DM4I!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0cddc6bc-bf5f-4460-8cbc-9139984960ec_937x528.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DM4I!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0cddc6bc-bf5f-4460-8cbc-9139984960ec_937x528.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DM4I!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0cddc6bc-bf5f-4460-8cbc-9139984960ec_937x528.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!DM4I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0cddc6bc-bf5f-4460-8cbc-9139984960ec_937x528.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3><strong>Limited geography for expansion</strong></h3><p>Call centers limit their site selection to regions and countries where accents of the available talent pool is considered to be more neutral to the customer&#8217;s native language, sacrificing locations that would be more cost-effective.<br></p><h2>Enter AI-Powered Accent Localization</h2><p>Recent advancements in AI have introduced new accent localization technology. This technology leverages AI to translate source accents to targets accent in real-time, with the click of a button. While the technologies in production don&#8217;t support multiple accents in parallel, over time this will be solved as well.</p><h3>Demo</h3><div id="youtube2-r3fen6lefUk" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;r3fen6lefUk&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/r3fen6lefUk?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>Below is the evolution of Krisp&#8217;s AI Accent Localization technology over the past 2 years.</p><p><strong>v0.1 The Very First model (it was bad)</strong></p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;41fa09d8-e1e8-4d1c-948e-8657b120e9fe&quot;,&quot;duration&quot;:3.004082,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><p><strong>v0.2 A bit more natural sound</strong></p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;f5bf1cc3-a542-4cee-8dda-d112ec20f4fc&quot;,&quot;duration&quot;:3.004082,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><p><strong>v0.3 A bit more natural sound</strong></p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;4a020482-421d-45cd-8d4c-8fc5c3ad1203&quot;,&quot;duration&quot;:3.004082,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><p><strong>v0.4 Improved voice</strong></p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;abc0b110-a6e1-4e13-a333-3dec91c5f3dc&quot;,&quot;duration&quot;:3.004082,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><p><strong>v0.5 Improved intonation</strong></p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;85b1b160-040d-4fda-9c23-1342716a0d82&quot;,&quot;duration&quot;:3.004082,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><h2>Deploying in the call center</h2><p>There are various ways AI Accent Localization can be integrated into a call center&#8217;s tech stack.</p><p>It can be embedded into a call center&#8217;s existing CX software (e.g. CCaaS and UCaaS) or installed as a separate application on the agent&#8217;s machine (e.g. <a href="https://krisp.ai/contact-center/">Krisp</a>).</p><p>Currently, there are no CX solutions in market with accent localization capabilities, leaving the latter as the only possible path forward for call centers looking to leverage this technology today.</p><p>Applications like <a href="https://krisp.ai/contact-center/">Krisp</a> have accent localization built in their offerings.&nbsp;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4grB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde5a2cea-1cc6-48f8-963c-769fcd0607e5_524x684.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4grB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde5a2cea-1cc6-48f8-963c-769fcd0607e5_524x684.png 424w, https://substackcdn.com/image/fetch/$s_!4grB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde5a2cea-1cc6-48f8-963c-769fcd0607e5_524x684.png 848w, https://substackcdn.com/image/fetch/$s_!4grB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde5a2cea-1cc6-48f8-963c-769fcd0607e5_524x684.png 1272w, https://substackcdn.com/image/fetch/$s_!4grB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde5a2cea-1cc6-48f8-963c-769fcd0607e5_524x684.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4grB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde5a2cea-1cc6-48f8-963c-769fcd0607e5_524x684.png" width="328" height="428.1526717557252" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/de5a2cea-1cc6-48f8-963c-769fcd0607e5_524x684.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:684,&quot;width&quot;:524,&quot;resizeWidth&quot;:328,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4grB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde5a2cea-1cc6-48f8-963c-769fcd0607e5_524x684.png 424w, https://substackcdn.com/image/fetch/$s_!4grB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde5a2cea-1cc6-48f8-963c-769fcd0607e5_524x684.png 848w, https://substackcdn.com/image/fetch/$s_!4grB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde5a2cea-1cc6-48f8-963c-769fcd0607e5_524x684.png 1272w, https://substackcdn.com/image/fetch/$s_!4grB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde5a2cea-1cc6-48f8-963c-769fcd0607e5_524x684.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>These applications are on-device, meaning they sit locally on the agent&#8217;s machine. They support all CX software platforms out of the box since they are installed as a virtual microphone and speaker.&nbsp;</p><p>AI runs on an agent&#8217;s device so there is no additional load on the network.</p><p>The deployment and management can be done remotely, and at scale, from the admin dashboard.</p><h2>Challenges of building such technology</h2><p>At a fundamental level, speech can be divided into 4 parts: voice, text, prosody and accent.</p><p>Accents can be divided into 4 parts as well &#8211; phoneme, intonation, stress and rhythm.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ws8U!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2209675-7d50-4f92-9e63-41275fc7a6b1_1368x790.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ws8U!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2209675-7d50-4f92-9e63-41275fc7a6b1_1368x790.png 424w, https://substackcdn.com/image/fetch/$s_!Ws8U!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2209675-7d50-4f92-9e63-41275fc7a6b1_1368x790.png 848w, https://substackcdn.com/image/fetch/$s_!Ws8U!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2209675-7d50-4f92-9e63-41275fc7a6b1_1368x790.png 1272w, https://substackcdn.com/image/fetch/$s_!Ws8U!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2209675-7d50-4f92-9e63-41275fc7a6b1_1368x790.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ws8U!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2209675-7d50-4f92-9e63-41275fc7a6b1_1368x790.png" width="504" height="291.05263157894734" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d2209675-7d50-4f92-9e63-41275fc7a6b1_1368x790.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:790,&quot;width&quot;:1368,&quot;resizeWidth&quot;:504,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Ws8U!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2209675-7d50-4f92-9e63-41275fc7a6b1_1368x790.png 424w, https://substackcdn.com/image/fetch/$s_!Ws8U!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2209675-7d50-4f92-9e63-41275fc7a6b1_1368x790.png 848w, https://substackcdn.com/image/fetch/$s_!Ws8U!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2209675-7d50-4f92-9e63-41275fc7a6b1_1368x790.png 1272w, https://substackcdn.com/image/fetch/$s_!Ws8U!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2209675-7d50-4f92-9e63-41275fc7a6b1_1368x790.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In order to localize or translate an accent, three of these parts must be changed &#8211; phoneme pronunciation, intonation, and stress. Doing this in real-time is an extremely difficult technical problem.</p><p>While there are numerous technical challenges in building this technology, we will focus on eight majors.</p><ol><li><p>Data Collection</p></li><li><p>Speech Synthesis</p></li><li><p>Low Latency</p></li><li><p>Background Noises and Voices</p></li><li><p>Acoustic Conditions</p></li><li><p>Maintaining Correct Intonation</p></li><li><p>Maintaining Speaker&#8217;s Voice</p></li><li><p>Wrong Pronunciations</p></li></ol><p>Let&#8217;s discuss them individually.</p><h3>1) Data collection</h3><p>Collecting accented speech data is a tough process. The data must be highly representative of different dialects spoken in the source language. Also, it should cover various voices, age groups, speaking rates, prosody, and emotion variations. For call centers, it is preferable to have natural conversational speech samples with rich vocabulary targeted for the use case.</p><p>There are two options: buy ready data or record and capture the data in-house. In practice, both can be done in parallel.</p><p>An ideal dataset would consist of thousands of hours of speech where source accent utterance is mapped to each target accent utterance and aligned with it accurately.&nbsp;</p><p>However, getting precise alignment is exceedingly challenging due to variations in the duration of phoneme pronunciations. Nonetheless, improved alignment accuracy contributes to superior results.</p><h3>2) Speech synthesis</h3><p>The speech synthesis part of the model, which is sometimes referred to as the vocoder algorithm in research, should produce a high-quality, natural-sounding speech waveform.&nbsp; It is expected to sound closer to the target accent, have high intelligibility, be low-latency, convey natural emotions and intonation, be robust against noise and background voices, and be compatible with various acoustic environments.</p><h3>3) Low latency</h3><p>As studies by the International Telecommunication Union show (G.114 recommendation), speech transmission maintains acceptable quality during real-time communication if the one-way delay is less than approximately 300 ms. Therefore, the latency of the end-to-end accent localization system should be within that range to ensure it does not impact the quality of real-time conversation.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jucD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F010675f6-0824-4d75-9b36-67559dea5999_906x640.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jucD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F010675f6-0824-4d75-9b36-67559dea5999_906x640.png 424w, https://substackcdn.com/image/fetch/$s_!jucD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F010675f6-0824-4d75-9b36-67559dea5999_906x640.png 848w, https://substackcdn.com/image/fetch/$s_!jucD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F010675f6-0824-4d75-9b36-67559dea5999_906x640.png 1272w, https://substackcdn.com/image/fetch/$s_!jucD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F010675f6-0824-4d75-9b36-67559dea5999_906x640.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jucD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F010675f6-0824-4d75-9b36-67559dea5999_906x640.png" width="514" height="363.0905077262693" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/010675f6-0824-4d75-9b36-67559dea5999_906x640.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:640,&quot;width&quot;:906,&quot;resizeWidth&quot;:514,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jucD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F010675f6-0824-4d75-9b36-67559dea5999_906x640.png 424w, https://substackcdn.com/image/fetch/$s_!jucD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F010675f6-0824-4d75-9b36-67559dea5999_906x640.png 848w, https://substackcdn.com/image/fetch/$s_!jucD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F010675f6-0824-4d75-9b36-67559dea5999_906x640.png 1272w, https://substackcdn.com/image/fetch/$s_!jucD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F010675f6-0824-4d75-9b36-67559dea5999_906x640.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>There are two ways to run this technology: locally or in the cloud. While both have theoretical advantages, in practice, more systems with similar characteristics (e.g. AI-powered noise cancellation, voice conversion, etc.) have been successfully deployed locally. This is mostly due to hard requirements around latency and scale.</p><p>To be able to run locally, the end-to-end neural network must be small and highly optimized, which requires significant engineering resources.</p><h3>4) Background noise and voices</h3><p>Having a sophisticated noise cancellation system is crucial for this Voice AI technology. Otherwise, the speech synthesizing model will generate unwanted artifacts.</p><p>Not only should it eliminate the input background noise but also the input background voices. Any sound that is not the speaker&#8217;s voice must be suppressed.</p><p>This is especially important in call center environments where multiple agents sit in close proximity to each other, serving multiple customers simultaneously over the phone.&nbsp;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9L7B!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b35542f-6e42-4b69-8d27-cd126f3b23ac_1600x899.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9L7B!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b35542f-6e42-4b69-8d27-cd126f3b23ac_1600x899.png 424w, https://substackcdn.com/image/fetch/$s_!9L7B!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b35542f-6e42-4b69-8d27-cd126f3b23ac_1600x899.png 848w, https://substackcdn.com/image/fetch/$s_!9L7B!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b35542f-6e42-4b69-8d27-cd126f3b23ac_1600x899.png 1272w, https://substackcdn.com/image/fetch/$s_!9L7B!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b35542f-6e42-4b69-8d27-cd126f3b23ac_1600x899.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9L7B!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b35542f-6e42-4b69-8d27-cd126f3b23ac_1600x899.png" width="532" height="298.88461538461536" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1b35542f-6e42-4b69-8d27-cd126f3b23ac_1600x899.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:818,&quot;width&quot;:1456,&quot;resizeWidth&quot;:532,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9L7B!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b35542f-6e42-4b69-8d27-cd126f3b23ac_1600x899.png 424w, https://substackcdn.com/image/fetch/$s_!9L7B!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b35542f-6e42-4b69-8d27-cd126f3b23ac_1600x899.png 848w, https://substackcdn.com/image/fetch/$s_!9L7B!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b35542f-6e42-4b69-8d27-cd126f3b23ac_1600x899.png 1272w, https://substackcdn.com/image/fetch/$s_!9L7B!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b35542f-6e42-4b69-8d27-cd126f3b23ac_1600x899.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Detecting and filtering out other human voices is a very difficult problem. As of this writing, to our knowledge, there is only one system doing it properly today &#8211; Krisp&#8217;s <a href="https://krisp.ai/contact-center/">AI Noise Cancellation</a> technology.</p><h3>5) Acoustic conditions</h3><p>Acoustic conditions differ for call center agents. The sheer volume of combinations of device microphones and room setups (accountable for room echo) makes it very difficult to design a robust system against such input variations.&nbsp;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ysme!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2420eb18-22c2-4bba-9925-9ce2955302f8_992x992.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ysme!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2420eb18-22c2-4bba-9925-9ce2955302f8_992x992.png 424w, https://substackcdn.com/image/fetch/$s_!ysme!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2420eb18-22c2-4bba-9925-9ce2955302f8_992x992.png 848w, https://substackcdn.com/image/fetch/$s_!ysme!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2420eb18-22c2-4bba-9925-9ce2955302f8_992x992.png 1272w, https://substackcdn.com/image/fetch/$s_!ysme!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2420eb18-22c2-4bba-9925-9ce2955302f8_992x992.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ysme!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2420eb18-22c2-4bba-9925-9ce2955302f8_992x992.png" width="534" height="534" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2420eb18-22c2-4bba-9925-9ce2955302f8_992x992.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:992,&quot;width&quot;:992,&quot;resizeWidth&quot;:534,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ysme!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2420eb18-22c2-4bba-9925-9ce2955302f8_992x992.png 424w, https://substackcdn.com/image/fetch/$s_!ysme!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2420eb18-22c2-4bba-9925-9ce2955302f8_992x992.png 848w, https://substackcdn.com/image/fetch/$s_!ysme!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2420eb18-22c2-4bba-9925-9ce2955302f8_992x992.png 1272w, https://substackcdn.com/image/fetch/$s_!ysme!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2420eb18-22c2-4bba-9925-9ce2955302f8_992x992.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>6) Maintaining the speaker&#8217;s intonation</h3><p>Not transferring the speaker&#8217;s intonation in the generated speech will result in a robotic speech that sounds worse than the original.&nbsp;</p><p>Krisp addressed this issue by developing an algorithm capturing input speaker&#8217;s intonation details in real-time and leveraging this information in the synthesized speech. Solving this challenging problem allowed us to increase the naturalness of the generated speech.&nbsp;&nbsp;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WGHT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f6fca37-d566-4cb9-b1cd-d053ee903fda_992x992.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WGHT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f6fca37-d566-4cb9-b1cd-d053ee903fda_992x992.png 424w, https://substackcdn.com/image/fetch/$s_!WGHT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f6fca37-d566-4cb9-b1cd-d053ee903fda_992x992.png 848w, https://substackcdn.com/image/fetch/$s_!WGHT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f6fca37-d566-4cb9-b1cd-d053ee903fda_992x992.png 1272w, https://substackcdn.com/image/fetch/$s_!WGHT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f6fca37-d566-4cb9-b1cd-d053ee903fda_992x992.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WGHT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f6fca37-d566-4cb9-b1cd-d053ee903fda_992x992.png" width="560" height="560" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2f6fca37-d566-4cb9-b1cd-d053ee903fda_992x992.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:992,&quot;width&quot;:992,&quot;resizeWidth&quot;:560,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!WGHT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f6fca37-d566-4cb9-b1cd-d053ee903fda_992x992.png 424w, https://substackcdn.com/image/fetch/$s_!WGHT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f6fca37-d566-4cb9-b1cd-d053ee903fda_992x992.png 848w, https://substackcdn.com/image/fetch/$s_!WGHT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f6fca37-d566-4cb9-b1cd-d053ee903fda_992x992.png 1272w, https://substackcdn.com/image/fetch/$s_!WGHT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f6fca37-d566-4cb9-b1cd-d053ee903fda_992x992.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>7) Maintaining the speaker&#8217;s voice</h3><p>It is desirable to maintain the speaker&#8217;s vocal characteristics (e.g., formants, timbre) while generating output speech. This is a major challenge and one potential solution is designing the speech synthesis component so that it generates speech conditioned on the input speaker&#8217;s voice &#8216;fingerprint&#8217; &#8211; a special vector encoding a unique acoustic representation of an individual&#8217;s voice.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qQVq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fc84a35-3fe8-45b2-af42-824673514a3a_512x319.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qQVq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fc84a35-3fe8-45b2-af42-824673514a3a_512x319.png 424w, https://substackcdn.com/image/fetch/$s_!qQVq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fc84a35-3fe8-45b2-af42-824673514a3a_512x319.png 848w, https://substackcdn.com/image/fetch/$s_!qQVq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fc84a35-3fe8-45b2-af42-824673514a3a_512x319.png 1272w, https://substackcdn.com/image/fetch/$s_!qQVq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fc84a35-3fe8-45b2-af42-824673514a3a_512x319.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qQVq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fc84a35-3fe8-45b2-af42-824673514a3a_512x319.png" width="512" height="319" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0fc84a35-3fe8-45b2-af42-824673514a3a_512x319.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:319,&quot;width&quot;:512,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qQVq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fc84a35-3fe8-45b2-af42-824673514a3a_512x319.png 424w, https://substackcdn.com/image/fetch/$s_!qQVq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fc84a35-3fe8-45b2-af42-824673514a3a_512x319.png 848w, https://substackcdn.com/image/fetch/$s_!qQVq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fc84a35-3fe8-45b2-af42-824673514a3a_512x319.png 1272w, https://substackcdn.com/image/fetch/$s_!qQVq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fc84a35-3fe8-45b2-af42-824673514a3a_512x319.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>8) Wrong pronunciations</h3><p>Mispronounced words can be difficult to correct in real-time, as the general setup would require separate automatic speech recognition and language modeling blocks, which introduce significant algorithmic delays and fail to meet the low latency criterion.</p><h2>3 Technical Approaches</h2><h3>Approach 1: Speech &#8594; STT &#8594; Speech</h3><p>One approach to accent localization involves applying Speech-to-Text (STT) to the input speech and subsequently utilizing Text-to-Speech (TTS) algorithms to synthesize the target speech.&nbsp;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5AAR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5785eeee-d567-49f7-b0c6-c06ff7004292_1000x257.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5AAR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5785eeee-d567-49f7-b0c6-c06ff7004292_1000x257.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5AAR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5785eeee-d567-49f7-b0c6-c06ff7004292_1000x257.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5AAR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5785eeee-d567-49f7-b0c6-c06ff7004292_1000x257.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5AAR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5785eeee-d567-49f7-b0c6-c06ff7004292_1000x257.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5AAR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5785eeee-d567-49f7-b0c6-c06ff7004292_1000x257.jpeg" width="1000" height="257" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5785eeee-d567-49f7-b0c6-c06ff7004292_1000x257.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:257,&quot;width&quot;:1000,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5AAR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5785eeee-d567-49f7-b0c6-c06ff7004292_1000x257.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5AAR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5785eeee-d567-49f7-b0c6-c06ff7004292_1000x257.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5AAR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5785eeee-d567-49f7-b0c6-c06ff7004292_1000x257.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5AAR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5785eeee-d567-49f7-b0c6-c06ff7004292_1000x257.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This approach is relatively straightforward and involves common technologies like STT and TTS, making it conceptually simple to implement.&nbsp;</p><p>STT and TTS are well-established, with existing solutions and tools readily available.&nbsp;</p><p>Integration into the algorithm can leverage these technologies effectively. These represent the strengths of the method, yet it is not without its drawbacks. There are 3 of them:</p><ul><li><p>The difficulty of having accent-robust STT with a very low word error rate.</p></li><li><p>The TTS algorithm must possess capabilities to manage emotions, intonation, and speaking rate, which should come from original accented input and produce speech that sounds natural.</p></li><li><p>Algorithmic delay within the STT plus TTS pipeline may fall short of meeting the demands of real-time communication.</p></li></ul><h3>Approach 2: Speech &#8594; Phoneme &#8594; Speech</h3><p>First, let&#8217;s define what a phoneme is. A phoneme is the smallest unit of sound in a language that can distinguish words from each other. It is an abstract concept used in linguistics to understand how language sounds function to encode meaning. Different languages have different sets of phonemes; the number of phonemes in a language can vary widely, from as few as 11 to over 100. Phonemes themselves do not have inherent meaning but work within the system of a language to create meaningful distinctions between words. For example, the English phonemes /p/ and /b/ differentiate the words &#8220;pat&#8221; and &#8220;bat.&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!s6t8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61932329-dfb5-40ef-ac0e-2517572677b4_502x512.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!s6t8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61932329-dfb5-40ef-ac0e-2517572677b4_502x512.png 424w, https://substackcdn.com/image/fetch/$s_!s6t8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61932329-dfb5-40ef-ac0e-2517572677b4_502x512.png 848w, https://substackcdn.com/image/fetch/$s_!s6t8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61932329-dfb5-40ef-ac0e-2517572677b4_502x512.png 1272w, https://substackcdn.com/image/fetch/$s_!s6t8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61932329-dfb5-40ef-ac0e-2517572677b4_502x512.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!s6t8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61932329-dfb5-40ef-ac0e-2517572677b4_502x512.png" width="388" height="395.72908366533864" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/61932329-dfb5-40ef-ac0e-2517572677b4_502x512.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:512,&quot;width&quot;:502,&quot;resizeWidth&quot;:388,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;table of English phenomes, mapping source speech to a phonetic representation, then the result to the target speech&#8217;s phonetic representation&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="table of English phenomes, mapping source speech to a phonetic representation, then the result to the target speech&#8217;s phonetic representation" title="table of English phenomes, mapping source speech to a phonetic representation, then the result to the target speech&#8217;s phonetic representation" srcset="https://substackcdn.com/image/fetch/$s_!s6t8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61932329-dfb5-40ef-ac0e-2517572677b4_502x512.png 424w, https://substackcdn.com/image/fetch/$s_!s6t8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61932329-dfb5-40ef-ac0e-2517572677b4_502x512.png 848w, https://substackcdn.com/image/fetch/$s_!s6t8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61932329-dfb5-40ef-ac0e-2517572677b4_502x512.png 1272w, https://substackcdn.com/image/fetch/$s_!s6t8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61932329-dfb5-40ef-ac0e-2517572677b4_502x512.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The objective is to first map the source speech to a phonetic representation, then map the result to the target speech&#8217;s phonetic representation (content), and then synthesize the target speech from it.</p><p>This approach enables the achievement of comparatively smaller delays than Approach 1. However, it faces the challenge of generating natural-sounding speech output, and reliance solely on phoneme information is insufficient for accurately reconstructing the target speech. To address this issue, the model should also extract additional features such as speaking rate, emotions, loudness, and vocal characteristics. These features should then be integrated with the target speech content to synthesize the target speech based on these attributes.</p><h3>Approach 3: Speech &#8594; Speech</h3><p>Another approach is to create parallel data using deep learning or digital signal processing techniques. This entails generating a native target-accent sounding output for each accented speech input, maintaining consistent emotions, naturalness, and vocal characteristics, and achieving an ideal frame-by-frame alignment with the input data.&nbsp;</p><p>If high-quality parallel data are available, the accent localization model can be implemented as a single neural network algorithm trained to directly map input accented speech to target native speech.</p><p>The biggest challenge of this approach is obtaining high-quality parallel data.The quality of the final model directly depends on the quality of parallel data.&nbsp;</p><p>Another drawback is the lack of integrated explicit control over speech characteristics, such as intonation, voice, or loudness. Without this control, the model may fail to accurately learn these important aspects.</p><h2>Measuring the speech quality</h2><p>High-quality output of accent localization technology should:</p><ol><li><p>Be intelligible</p></li><li><p>Have little or no accentedness (the degree of deviation from the native accent)</p></li><li><p>Sound natural</p></li></ol><p>To evaluate these quality features, we use the following objective metrics:&nbsp;</p><ul><li><p>Word Error Rate (WER)</p></li><li><p>Phoneme Error Rate (PER)</p></li><li><p>Naturalness prediction</p></li></ul><h3>Word Error Rate (WER)</h3><p>WER is a crucial metric used to assess STT systems&#8217; accuracy. It quantifies the word level errors of predicted transcription compared to a reference transcription.&nbsp;</p><p>To compute WER we use a high-quality STT system on generated speech from test audios that come with predefined transcripts.&nbsp;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!niy0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F383f30d2-ef25-475a-aa0e-512edb3cb501_1000x502.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!niy0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F383f30d2-ef25-475a-aa0e-512edb3cb501_1000x502.jpeg 424w, https://substackcdn.com/image/fetch/$s_!niy0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F383f30d2-ef25-475a-aa0e-512edb3cb501_1000x502.jpeg 848w, https://substackcdn.com/image/fetch/$s_!niy0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F383f30d2-ef25-475a-aa0e-512edb3cb501_1000x502.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!niy0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F383f30d2-ef25-475a-aa0e-512edb3cb501_1000x502.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!niy0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F383f30d2-ef25-475a-aa0e-512edb3cb501_1000x502.jpeg" width="540" height="271.08" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/383f30d2-ef25-475a-aa0e-512edb3cb501_1000x502.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:502,&quot;width&quot;:1000,&quot;resizeWidth&quot;:540,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!niy0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F383f30d2-ef25-475a-aa0e-512edb3cb501_1000x502.jpeg 424w, https://substackcdn.com/image/fetch/$s_!niy0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F383f30d2-ef25-475a-aa0e-512edb3cb501_1000x502.jpeg 848w, https://substackcdn.com/image/fetch/$s_!niy0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F383f30d2-ef25-475a-aa0e-512edb3cb501_1000x502.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!niy0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F383f30d2-ef25-475a-aa0e-512edb3cb501_1000x502.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The evaluation process is the following:</p><ul><li><p>The test set is processed through the candidate accent localization (AL) model to obtain the converted speech samples.</p></li><li><p>These converted speech samples are then fed into the STT system to generate the predicted transcriptions.</p></li><li><p>WER is calculated using the predicted and the reference texts.</p></li></ul><p>The assumption in this methodology is that a model demonstrating better intelligibility will have a lower WER score.</p><h3>Phoneme Error Rate (PER)</h3><p>The AL model may retain some aspects of the original accent in the converted speech, notably in the pronunciation of phonemes. Given that state-of-the-art STT systems are designed to be robust to various accents, they might still achieve low WER scores even when the speech exhibits accented characteristics.&nbsp;</p><p>To identify phonetic mistakes, we employ the Phoneme Error Rate (PER) as a more suitable metric than WER. PER is calculated in a manner similar to WER, focusing on phoneme errors in the transcription, rather than word-level errors.</p><p>For PER calculation, a high-quality phoneme recognition model is used, such as the one available at <a href="https://huggingface.co/facebook/wav2vec2-xlsr-53-espeak-cv-ft">https://huggingface.co/facebook/wav2vec2-xlsr-53-espeak-cv-ft</a>. The evaluation process is as follows:</p><ul><li><p>The test set is processed by the candidate AL model to produce the converted speech samples.</p></li><li><p>These converted speech samples are fed into the phoneme recognition system to obtain the predicted phonetic transcriptions.</p></li><li><p>PER is calculated using predicted and reference phonetic transcriptions.</p></li></ul><p>This method addresses the phonetic precision of the AL model to a certain extent.</p><h3>Naturalness Prediction</h3><p>To assess the naturalness of generated speech, one common method involves conducting subjective listening tests. In these tests, listeners are asked to rate the speech samples on a 5-point scale, where 1 denotes very robotic speech and 5 denotes highly natural speech.&nbsp;</p><p>The average of these ratings, known as the Mean Opinion Score (MOS), serves as the naturalness score for the given sample.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YYsE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2ffe0b1-3dee-4500-adc3-81fad5ac0d7c_1000x502.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YYsE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2ffe0b1-3dee-4500-adc3-81fad5ac0d7c_1000x502.jpeg 424w, https://substackcdn.com/image/fetch/$s_!YYsE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2ffe0b1-3dee-4500-adc3-81fad5ac0d7c_1000x502.jpeg 848w, https://substackcdn.com/image/fetch/$s_!YYsE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2ffe0b1-3dee-4500-adc3-81fad5ac0d7c_1000x502.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!YYsE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2ffe0b1-3dee-4500-adc3-81fad5ac0d7c_1000x502.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YYsE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2ffe0b1-3dee-4500-adc3-81fad5ac0d7c_1000x502.jpeg" width="498" height="249.996" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f2ffe0b1-3dee-4500-adc3-81fad5ac0d7c_1000x502.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:502,&quot;width&quot;:1000,&quot;resizeWidth&quot;:498,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YYsE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2ffe0b1-3dee-4500-adc3-81fad5ac0d7c_1000x502.jpeg 424w, https://substackcdn.com/image/fetch/$s_!YYsE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2ffe0b1-3dee-4500-adc3-81fad5ac0d7c_1000x502.jpeg 848w, https://substackcdn.com/image/fetch/$s_!YYsE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2ffe0b1-3dee-4500-adc3-81fad5ac0d7c_1000x502.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!YYsE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2ffe0b1-3dee-4500-adc3-81fad5ac0d7c_1000x502.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In addition to subjective evaluations, obtaining an objective measure of speech naturalness is also feasible. It is a distinct research direction&#8212;predicting the naturalness of generated speech using AI. Models in this domain are developed using large datasets comprised of subjective listening assessments of the naturalness of generated speech (obtained from various speech-generating systems like text-to-speech, voice conversion, etc).&nbsp;</p><p>These models are designed to predict the MOS score for a speech sample based on its characteristics. Developing such models is a great challenge and remains an active area of research. Therefore, one should be careful when using these models to predict naturalness. Notable examples include the self-supervised learned MOS predictor and NISQA, which represent significant advances in this field.</p><p>In addition to objective metrics mentioned above, we conduct subjective listening tests and calculate objective scores using MOS predictors. We also manually examine the quality of these objective assessments. This approach enables a thorough analysis of the naturalness of our AL models, ensuring a well-rounded evaluation of their performance.</p><h2>Closing</h2><p>AI Accent Localization technology is a disruptive innovation, primed to bridge language barriers and elevate customer service while expanding talent pools, reducing costs, and revolutionizing CX.</p><p></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://voice-ai-newsletter.krisp.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://voice-ai-newsletter.krisp.ai/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item></channel></rss>