Atlas Heritage Systems
Diagnostic Suite
Six governed instruments · All CISP-compliant · Live May 2026
Behavioral vocabulary layer
Gap structure + divergence
Extracted from BSA Phase 2
Pressure-response fingerprint
Self-assessment calibration
deferredLoss landscape geometry
deferredECM defines the behavioral vocabulary all active instruments share. BSA and EPG are the flagship data collection instruments — both use ECM quadrant classifications. DIV runs as a lightweight standalone extracted from BSA's Phase 2. CISP governs every active run.
Instruments
Epistemic Canary Matrix
Classification framework
Behavioral characterization instrument. Maps model behavior onto a two-axis matrix: Token Economy (Verbose/Surgical) × Epistemic Stance (Compliant/Combative). Tracks quadrant migration under epistemic load. Measures token ratio R, preamble length P, and resolution code.
Behavioral Signal Assessment
Behavioral signal assessment
Measures gap structure, hallucination rates, and knowledge density on contested stimuli. Factorial design: 3×2 (Model × Grounding). Includes Divergence Testing as Phase 2 sub-component. Gemini shakedown complete; Canary Ensemble next.
Divergence Testing
Rapid ensemble probe
Semantic similarity scoring and spread matrix across the model ensemble. Extracted from BSA Phase 2 as a standalone instrument. 20 models, 30 stimulus pairs, 600 data points in Run 3.
Epistemic Pressure Gauge
Epistemic pressure gauge
Tracks how a model's verbosity, structure, and hedging behavior shift under progressively harder, more ambiguous, or more epistemically loaded prompts — producing a pressure-response fingerprint that complements BSA's stimulus-pair divergence metrics.
Global Geometry Concept Self-Assessment Pilot
Concept self-assessment
Probes self-assessment calibration across 20 lossyscape vocabulary terms. Each model rates conceptual difficulty, abstractness, global deviation, and truthfulness. Two absurd calibration items embedded as internal validity checks.
PyHessian Geometric Analysis
Loss landscape geometry
The geometric layer. Measures Hessian eigenvalues, trace, and basin sharpness on live model weights to prove or falsify the framework's terrain claims. Default specimen: GPT-2 small. Connects directly to ECM working hypotheses.
Supporting Documents
Clean Injection Synthesis Protocol — governs all instrument runs
Step-by-step operational guide — what to do, in what order, every time
Pre- and post-run operator notation protocol
Seed contamination tracking and session isolation requirements
Prompt construction standards for governed instrument runs
Data Pipeline
All instrument runs log to the atlas-pipeline/ folder. Phase 1 (schemas) and Phase 2 (scripts) are complete.