ClaimLens — Presentation

The problem

The signal is buried in prose

In commercial-vehicle field quality, the slowest step of root-cause analysis is reading thousands of free-text warranty claims, field-service narratives and equipment logs to figure out what is actually failing, and how often.

A large share of those returns are overcycle anomalies — repeated abnormal device cycling (soft resets, cloud-sync failures, power cycles) — that look like hardware faults but aren't. Telling them apart by hand is slow and inconsistent.

How it works

Extract → classify → trend → hand off

Claim — tagged by source: customer complaint · dealer RO · field log │ ├─► Extract component · failure mode · symptom · action · part # (rules) │ source-aware: dealer RO → action · field log → overcycle ├─► Classify overcycle anomaly: Soft Reset · Cloud Sync · … (TF-IDF + LogReg) └─► Aggregate Pareto by label / component / failure mode / source │ ▼ /handoff → QualityMind-RAG 5-Why / 8D problem_statement

Intake

Source-typed

Each narrative is tagged customer complaint, dealer RO or field log — driving a by-source Pareto and per-stream extraction emphasis. Regex + gazetteers, zero model download.

Classify

Overcycle anomaly

TF-IDF + balanced logistic regression over a locked 5-label taxonomy, reproducible (seed 42); low-confidence (<0.55) flagged needs_review.

Hand off

RCA bridge

Dominant trend becomes a QualityMind-ready 8D / 5-Why payload, POSTed through an SSRF-guarded client — narrative to corrective action.

Measured, not asserted

macro-F1 0.90 on 1,200 narratives

Class	Precision	Recall	F1
Soft Reset	0.83	0.84	0.83
Cloud Sync	0.91	0.87	0.89
Connectivity Loss	0.88	0.95	0.91
Power Cycle	0.96	0.94	0.95
No Fault Found	0.90	0.88	0.89
macro avg	0.90	0.90	0.90

Per-class F1

Soft Reset

0.83

Cloud Sync

0.89

Conn. Loss

0.91

Power Cycle

0.95

No Fault

0.89

macro avg

0.90

scale 0–1.0 · dashed line = 0.88 macro-F1 gate