AI ITSM deflection + customer-success benchmarks (May 2026)

Source report: /tmp/deflection-benchmarks-2026-05-11-v2.md (569 lines, 46KB).

TL;DR

The vendor field has no normalized metric. Five distinct things get called “deflection”:

  1. Deflection = ticket never reaches human
  2. Containment = ticket stays within self-service
  3. Auto-resolution = ticket closed by AI (may or may not have routed to human first)
  4. Augmentation = AI assisted but human resolved
  5. Response automation = first reply from AI

Even within a single vendor (notably Risotto), the definition drifts page-to-page.

Industry baseline (Fixify 2026 / Forrester / Salesforce)

  • ~35-45% average at typical AI-ITSM deployment
  • 55-70% mature deployment after 6-12 month ramp
  • ~41% median per Salesforce industry stats

A 60-70% rate reads as above-market without triggering CIO diligence skepticism; rates above 75% start drawing scrutiny.

Cleanest single number per vendor (after normalization)

VendorClaimCaveat
Atomicwork65% auto-resolution at Ammex (6-mo ramp from 20%)Apples-to-apples auto-resolution
Serval80% auto-resolution at Verkada in 24-hr deployFounder ex-Verkada — strongest selection bias
ConsoleSynthesia 25% → 70-75%Strongest baseline-paired claim in the entire field
Risotto55% auto-resolution at Gusto (60% per TechCrunch)Vendor’s own numbers internally inconsistent
Treeline98% augmented-or-resolvedAugmentation-weighted, NOT deflection-comparable
Aisera84% at GAPWas “up to 80%” pre-May 2026
Moveworks88% at BroadcomMulti-year deployment

Atomicwork “98%” disambiguation

Wiki correctly attributes Atomicwork’s 98% to Pepper Money ESAT (satisfaction 77→98%), NOT to ticket resolution. The “98% augmented or directly resolved” claim belongs to Treeline. The deflection-benchmarks research pass flagged this as a common conflation in external write-ups; the wiki is currently correct on this point.

Vendors with no published rates

  • Echelon — pivoted to “ServiceNow build partner” positioning; no rates.
  • ClearFeed — no published deflection rate.
  • Thrive MSP — no published AI metrics; uses “AI services managed” framing.
  • Modern (YC W26, 2 people) — has a 80%+ marketing claim with zero named customers.

Notes

  • White space: paired autonomous-resolution + augmentation-effectiveness (handle-time reduction on human-resolved tickets) metrics. No competitor has built a clean “AI co-pilot effectiveness” headline as of May 2026, despite augmentation being most of the field today.
  • Single-pass deflection underestimates k-run buyer experience; pass^k consistency reporting is unused across the competitor set (see academic-papers-agent-reliability-2026).

Honest verification gaps

  • ASOS-side confirmation of the Edra claim — still open (see edra errata).
  • HubSpot-side confirmation of the 150K conversations / 600 KB updates / 12% handoff Edra metric — still open.
  • Gartner IT Key Metrics 2025 paywalled — exact tier-1 deflection benchmark not extracted.