Skip to content

skill-auditor: MatrixScan AR coverage audit + gap fill#65

Merged
raulpellarini merged 13 commits into
masterfrom
feature/skill-auditor-matrixscan-ar
Jun 17, 2026
Merged

skill-auditor: MatrixScan AR coverage audit + gap fill#65
raulpellarini merged 13 commits into
masterfrom
feature/skill-auditor-matrixscan-ar

Conversation

@ilber

@ilber ilber commented Jun 11, 2026

Copy link
Copy Markdown
Member

What

Runs the internal/skill-auditor flow over the MatrixScan AR (BarcodeAr) skill family — taxonomy → coverage matrix → gap fill → build-gate → eval harness — closing every coverage gap across all 10 platforms.

Taxonomy + iOS modeling

  • New taxonomies/matrixscan-ar.yaml: 25 curated edge-case features, mined from public docs, the SDK .rst :available: directives (availability ground truth), the sample apps, and existing evals.
  • MatrixScan AR splits iOS into three skills (-ios core + -highlight-ios + -annotation-ios) while every other platform uses one. Added a generic platform_aliases mechanism to coverage_matrix.py + manifest.json so the three fold into one logical ios column with aggregated evals — a no-op for every other product.

Coverage: 63 → 0 gaps (0 required)

Parallel per-platform fills added tagged, convention-compliant evals (and reference coverage where missing), grounded in the SDK docs.

The build gate earned its keep

Anti-hallucination compile checks against the real resolved SDK caught issues string/semantic evals never would:

  • Hallucinated ScanditIconType.Info/Plus in the RN reference → fixed to the real ScanditIconBuilder API.
  • .NET namespace mismatches (ScanditIconBuilderCore.UI.Icon, etc.).
  • BarcodeArFilter is documented at 8.5/8.6 but not in any published JS/Flutter/.NET package — would have shipped non-compilable code.
  • Pre-existing maui bug: hallucinated ScanditIconType.Information/ShoppingCart → real InspectItem/ToPick (GATE-PASS).

barcode-filter policy

Documented at 8.5 (Flutter/RN/Cordova/Capacitor) / 8.6 (.NET) but those packages aren't published yet. Per decision, excluded until published — kept only on iOS/Android (8.1, buildable today); reference notes accurately say "documented at 8.5, don't generate yet."

Honest exclusions

custom-highlight/custom-annotation are scoped to where a real path exists (android/flutter/ios/rn via native protocol subclassing or the BarcodeArCustom* class) — .NET has no createView/update hook, web/JS-bridge frameworks have neither.

Eval harness — 97% (1497/1537)

Full run over all 12 skill dirs (Sonnet generator + Opus judge, deterministic-first):

Skill Pass
android 118/119 (99%)
annotation-ios 196/204 (96%)
capacitor 131/134 (98%)
cordova 133/138 (96%)
flutter 134/136 (99%)
highlight-ios 122/127 (96%)
ios 107/113 (95%)
maui 121/125 (97%)
net-android 110/112 (98%)
net-ios 107/108 (99%)
rn 141/143 (99%)
web 77/78 (99%)
Overall 1497/1537 (97%)

A first run scored 92%; the gap was mostly measurement artifacts — migration old-API-absence was judged against the model's explanatory prose (which names the old API) instead of the code blocks, plus "X conformance is present" literal substring checks. Fixed the harness (code-block scoping for migration negatives, per the documented convention) and routed conformance checks to the semantic judge; the corrected run is 97%. The residual ~3% are genuine generator-variance misses (e.g. the model occasionally emits .with(systemName:) instead of .withIcon(.checkmark), or CGSize instead of a CGFloat size) where the iOS reference doesn't pin the exact Swift API — flagged as a reference-hardening follow-up, not papered over.

🤖 Generated with Claude Code

1lb3r and others added 13 commits June 11, 2026 15:32
Add the MatrixScan AR (BarcodeAr) edge-case taxonomy (25 curated features)
and a generic `platform_aliases` mechanism so several skill dirs can fold
into one logical platform with aggregated evals.

MatrixScan AR splits iOS into three skills (matrixscan-ar-ios core +
-highlight-ios + -annotation-ios) while every other platform uses one. The
manifest now aliases the two iOS sub-skills onto matrixscan-ar-ios, so
coverage_matrix audits a single accurate `ios` column instead of treating
the sub-skills as pseudo-platforms. The mechanism is a no-op for every
other product (empty alias map).

Taxonomy encodes SDK :available: ground truth as exclusions: barcode-filter
is buildable today only on iOS/Android (8.1); custom-highlight/-annotation
exist only where a real path does (android/flutter/ios/rn).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Add tagged evals (and reference coverage where missing) for the gaps the
skill-auditor surfaced for this platform, grounded in the SDK :available:
docs. New code snippets/fixtures were build-gate verified against the real
resolved Scandit SDK where a cheap gate exists.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Add tagged evals (and reference coverage where missing) for the gaps the
skill-auditor surfaced for this platform, grounded in the SDK :available:
docs. New code snippets/fixtures were build-gate verified against the real
resolved Scandit SDK where a cheap gate exists.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Add tagged evals (and reference coverage where missing) for the gaps the
skill-auditor surfaced for this platform, grounded in the SDK :available:
docs. New code snippets/fixtures were build-gate verified against the real
resolved Scandit SDK where a cheap gate exists.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Add tagged evals (and reference coverage where missing) for the gaps the
skill-auditor surfaced for this platform, grounded in the SDK :available:
docs. New code snippets/fixtures were build-gate verified against the real
resolved Scandit SDK where a cheap gate exists.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Add tagged evals (and reference coverage where missing) for the gaps the
skill-auditor surfaced for this platform, grounded in the SDK :available:
docs. New code snippets/fixtures were build-gate verified against the real
resolved Scandit SDK where a cheap gate exists.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Add tagged evals (and reference coverage where missing) for the gaps the
skill-auditor surfaced for this platform, grounded in the SDK :available:
docs. New code snippets/fixtures were build-gate verified against the real
resolved Scandit SDK where a cheap gate exists.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Add tagged evals (and reference coverage where missing) for the gaps the
skill-auditor surfaced for this platform, grounded in the SDK :available:
docs. New code snippets/fixtures were build-gate verified against the real
resolved Scandit SDK where a cheap gate exists.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Add tagged evals (and reference coverage where missing) for the gaps the
skill-auditor surfaced for this platform, grounded in the SDK :available:
docs. New code snippets/fixtures were build-gate verified against the real
resolved Scandit SDK where a cheap gate exists.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Add tagged evals (and reference coverage where missing) for the gaps the
skill-auditor surfaced for this platform, grounded in the SDK :available:
docs. New code snippets/fixtures were build-gate verified against the real
resolved Scandit SDK where a cheap gate exists.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Add tagged evals (and reference coverage where missing) for the gaps the
skill-auditor surfaced for this platform, grounded in the SDK :available:
docs. New code snippets/fixtures were build-gate verified against the real
resolved Scandit SDK where a cheap gate exists.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Add tagged evals (and reference coverage where missing) for the gaps the
skill-auditor surfaced for this platform, grounded in the SDK :available:
docs. New code snippets/fixtures were build-gate verified against the real
resolved Scandit SDK where a cheap gate exists.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Add tagged evals (and reference coverage where missing) for the gaps the
skill-auditor surfaced for this platform, grounded in the SDK :available:
docs. New code snippets/fixtures were build-gate verified against the real
resolved Scandit SDK where a cheap gate exists.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@ilber ilber marked this pull request as draft June 11, 2026 13:39
@ilber ilber marked this pull request as ready for review June 11, 2026 14:01
@raulpellarini raulpellarini merged commit 1a3a8e9 into master Jun 17, 2026
@ilber ilber deleted the feature/skill-auditor-matrixscan-ar branch June 17, 2026 08:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants