The Measurement Problem Nobody Wants to Admit

A year ago, the big question in customer operations was whether to invest in AI automation. That debate is effectively over. The new question — and the harder one — is whether you can prove the investment was worth it. According to Antoine Nasr, Head of AI at Forethought, most enterprises cannot. Not because the results are not there, but because the way they are measuring CX AI ROI is fundamentally broken.

The CX Today report on Forethought's findings lands on something that operations leaders already sense but rarely say out loud: the metrics inherited from pre-AI customer service programmes are the wrong instruments for an AI-augmented environment. Cost-per-ticket and handle time were designed for a world where volume and speed were the only levers you had. When AI enters the picture, those metrics stop telling the whole story — and in some cases, they actively mislead.

What "Broken" Actually Looks Like in Practice

Here is the failure mode playing out across contact centres right now. A team deploys an AI deflection layer, watches containment rates climb, and declares success. Handle time goes down. Cost-per-contact improves on the dashboard. Leadership signs off on the next phase of automation. Then, six months later, CSAT quietly erodes. Re-contact rates tick upward. A segment of customers stops reaching out altogether — not because their problems were solved, but because they gave up.

The traditional ROI framework captured the efficiency gain and missed the experience loss entirely. The two numbers were never in the same spreadsheet.

What Nasr and the Forethought team argue — and what a growing body of contact centre evidence supports — is that CX AI ROI has to be measured across at least three dimensions simultaneously: operational efficiency (the classic cost metrics), experience quality (resolution accuracy, first-contact resolution, customer effort), and business outcome (retention, lifetime value, revenue influence). Strip out any one of those dimensions and the picture you get is not just incomplete. It is dangerous, because it will send you in the wrong direction with confidence.

Why This Matters More When AI Is Involved

Human agents have a natural corrective mechanism built in. A skilled agent who senses customer frustration adjusts — slows down, reframes, escalates their own empathy. They generate soft signals that managers can observe and coach. AI systems do not self-correct in the same way, and they do not surface distress signals unless you have explicitly instrumented them to do so. That means the gap between "the AI resolved the query" and "the customer felt genuinely helped" can grow invisible and unchecked unless your measurement framework is designed to catch it.

This is not an argument against AI. It is an argument for measuring it honestly — which is actually a higher standard than most organisations have historically applied to their human operations.

The Hybrid Model as a Measurement Advantage

Here is where the hybrid human-plus-AI model earns its keep beyond the obvious quality-assurance argument. When skilled human agents remain embedded in your operation — handling escalations, managing complex or emotionally charged interactions, and working alongside AI on blended queues — they generate exactly the kind of qualitative signal that pure automation cannot produce. They are, in effect, a continuous calibration layer for your AI performance.

At Conveneo, this is not a theoretical position. Multilingual human specialists operating alongside AI tools create feedback loops that isolated automation simply cannot replicate. When a Dutch-speaking agent steps in after an AI handoff and spots a recurring misclassification pattern, that insight closes a measurement gap that no dashboard metric would have caught in time. The human layer does not just handle the exceptions — it illuminates them.

Operations leaders who are serious about proving AI ROI in 2026 need to do three things: rebuild their measurement frameworks to cover efficiency, experience quality, and business outcome together; instrument their AI systems to surface failure signals, not just success signals; and retain human expertise in the loop precisely because humans remain the most sensitive detectors of where the numbers are not telling the truth.

The investment in AI is already made. Now the work is making sure you know what it is actually doing.