Deterministic Structural Instrumentation

Structure has geometry. Measure it.

The domains are unrelated. The principle is not. Wherever a system generates structure — an image model arranging visual mass, a language model producing text, a tumor growing through tissue — that structure has geometry. Geometry is measurable without knowing what the output means.

Parallax Metrology builds deterministic structural instruments that measure the geometry of output. No model in the loop. No semantic judgment. No guesswork. Multiple applications of one principle.

VTL measurement coordinate system — structural geometry applied to image analysis
01 — Visual Structure
Visual Thinking Lens

Geometric fingerprinting for image generation models. Measures compositional bias, spatial priors, and structural drift — invisible to semantic evaluation.

K = [Δx, rᵥ, ρᵣ, μ, xp, θ, ds] · 6,000+ images validated
02 — Linguistic Structure
Linguistic Kernel

Structural telemetry for LLM outputs. Eight dimensions. Deterministic. Every coordinate backed by countable evidence. No judge model required.

4,000+ responses · Cross-engine validated · Cohen's d = 2.47
03 — Biological Structure
Parallax Pathology

Deterministic structural measurement for H&E histology. 58 axes. Zero trained parameters. The geometry of deviation is the diagnosis.

91.5% nine-class accuracy · 0 trained parameters · HR=1.872 DSS
04 — Semiconductor Inspection
Visual Topology of Light

Physics-grounded structural coordinates for semiconductor inspection imagery. 15 axes. No learned parameters. Same coordinates transfer across SEM, wafer maps, and aerial lithography without retraining.

K = [NILS, LER, δx, μ, θ, d_s…] · 93.4% SEM accuracy · 3 modalities
05 — Seismic Array Analysis
Parallax Seismology

Independent structural agreement as a field coherence diagnostic. Ω measures how many competing organizations a seismic field supports simultaneously — what a single-answer instrument cannot see.

Ω · 6.9× independence gain · 5 events · Mw 7.0–9.0
06 — Materials
Structural Topology Kernel

A fixed 50-coordinate descriptor reaching incumbent-tier performance across microstructure classification, permeability prediction, and label-ambiguity quantification — from one training-free representation.

STK-50 · 0.832 AUC · R²=0.939 permeability · 3 tasks
Structure Has Geometry · Always Geometry Is Measurable · Without Knowing What It Means Deterministic · No Model In The Loop Same Input · Same Output Countable Evidence · Every Coordinate The Ruler, Not The Critic · Parallax Metrology Structure Has Geometry · Always Geometry Is Measurable · Without Knowing What It Means Deterministic · No Model In The Loop Same Input · Same Output Countable Evidence · Every Coordinate The Ruler, Not The Critic · Parallax Metrology

Image models repeat
the same geometry.

Semantic diversity explains less than 10% of observed spatial variance in text-to-image systems. Composition is not prompt-driven. It is prior model-driven.


400 MidJourney prompts. 8 semantic categories. One geometric attractor. 100% of outputs within 0.15 radius of center. Different prompts, different subjects, identical compositional bias.


The VTL measures the spatial signature each engine learned from its training data — the prior it applies regardless of what you ask for. That signature is stable, reproducible, and invisible to standard evaluation.

"Spatial prompt intensity explains 0–0.1% of compositional displacement variance. The model has already decided where mass goes."

VTL · Seven geometric primitives · Engine-agnostic

Instrument Pipeline

Visual Thinking Lens measurement pipeline
0.15
Attractor Radius
100% of MidJourney outputs fall within 0.15 radius of geometric center under standard prompting
6%
Semantic Variance Explained
Subject category explains 6% of spatial variance. 94% is structural prior, not prompt.
6,000+
Images Validated
Cross-platform: MidJourney, Sora, GPT, SDXL, Firefly, OpenArt
3–4
Steps Early Warning
Compositional metrics detect model degradation 3–4 inference steps before semantic breakdown
Δx,y
Mass Displacement
Where compositional mass sits — placement offset from center. The primary axis for detecting attractor behavior and steering.
rᵥ
Void Ratio
How much empty space surrounds the mass. Captures openness, compression, and negative space behavior.
ρᵣ
Packing Density
How compressed the marks are. Measures spatial concentration of compositional elements.
μ
Compositional Unity
How unified the composition reads as a field. Structural coherence across the image plane.
xp
Peripheral Pull
How hard the edges pull against center. Measures resistance to the central attractor.
θ
Orientation Stability
Directional coherence of structural elements. High values indicate trained directional priors.
ds
Structural Thickness
Surface depth and layering. Measures how models build spatial complexity above a base plane.
Try the Demo Web Apps Learn More

Text has shape.
Measure it.

Most production failures in LLM deployments are not benchmark failures. They are structural reliability failures. The model breaks an output format, drifts structurally across conversation turns, compresses under stress, or fails to recover after adversarial input. These failures are invisible to benchmark evaluation.


The Linguistic Kernel is a deterministic, behavior-only instrumentation layer. It reduces any text string to a coordinate in eight-dimensional structural space. Same input, same output, every time. No model in the loop. Every number backed by countable evidence.


It is not a scoring system. It is a location system. Two responses can be equally correct and land at completely different coordinates. That is not a flaw — it is the point.

01 Tokenize & Segment Words, sentences, paragraphs
02 Extract Evidence All countable features
03 Compute Coordinates Eight dimensions, versioned math
04 Classify Basin + rhetorical state
05 Assemble Output Full audit bundle returned

Instrument Pipeline

Linguistic Kernel measurement pipeline
2.47
Cohen's d
Δy separation between normal and constrained responses. Largest effect size across all eight dimensions.
4,000+
Responses Validated
Cross-engine: Claude, GPT, Gemini. Attractor basin coherence confirmed across all three.
100%
Collapse Detection
v19 three-tier detector: TP=15, FP=0, FN=0 on GPT EMS corpus (n=400).
0
Models In The Loop
Entirely deterministic. No external calls. ~0.3ms per 1,000 tokens.
Finding 01
Structural behavior is stable across runs
For any given prompt under normal conditions, kernel coordinates cluster tightly around a consistent centroid. 89% of Gemini EMS responses fall within a single geometrically coherent attractor basin. No random structural wandering.
Finding 02
Constraint prompts induce predictable deformation
The model bends before it breaks. Under constraint: rᵥ collapsed from 0.349 → 0.196, Δy spiked from 0.248 → 1.450, θ collapsed from 0.644 → 0.141. Bootstrap CIs show no overlap with normal intervals.
Finding 03
Two failure modes are geometrically opposite
Constraint-compliance (rᵥ collapse, Δy spike) and formatting collapse (rᵥ inflation, Δy negative) occupy opposite corners of kernel space. A single detection threshold cannot catch both. Domain predicts which failure mode is likely.
Finding 04
Cohesion is invariant under structural stress
μ coefficient of variation: 0.056 across all 400 responses including constrained and truncated groups. When μ moves significantly, something more fundamental than formatting pressure has changed. Reliable stability baseline.
Try the Demo The Instrument Learn More

The pathology is the
absence of normal form.

Standard computational pathology trains models to learn what disease looks like. Parallax Pathology asks a different question with a different instrument.


The system encodes the structural laws of normal tissue. Deviation from those laws is the signal. A rare disease the system has never seen will register as structurally distant from all known normals. That is not a misclassification — it is the correct output.


58 deterministic axes. 16 structural theories applied simultaneously. Zero trained parameters. Every output traceable to a specific geometric property. The explanation is the measurement, not post-hoc attribution.

"The average tumor state does not predict outcome. What predicts outcome is how structural states are distributed across tumor regions."

TCGA-COAD · n=180 · HR=1.872 DSS · p=0.0002

Instrument Pipeline

Pathology Kernel measurement pipeline
91.5%
Nine-Class Accuracy
CRC-VAL-HE-7K, 5-fold cross-validation. 1,800 patches. Zero trained parameters.
16.3pp
Emergence Gap
Best single layer: 75.2%. Full 58-axis system: 91.5%. The combination is genuinely greater than any component.
1.872
Hazard Ratio (DSS)
Disease-specific survival. No outcome training. No molecular data. Stage-adjusted. p=0.0002. C-index: 0.808.
3.3×
Mortality Difference
Low vs. high composite tertile. Structural geometry predicts survival without knowing outcomes.
Validation 01 — CRC-VAL-HE-7K
Tissue classification without training
Nine tissue classes. 91.5% accuracy. Zero trained parameters. Deviant TUM patches stratify into two geometrically distinct failure modes — mucinous differentiation and desmoplastic reaction — detected from geometry alone, without pathological labels.
Validation 02 — EBHI-SEG
Dysplasia grading as continuous measurement
Six-class dysplasia grading from Normal through Adenocarcinoma (n=920). Spearman ρ = 0.716. 84.3% of predictions correct or within one adjacent grade step. The system correctly places serrated adenoma structurally distinct from conventional low-grade neoplasia.
Validation 03 — GTEx Cross-Dataset
The frame is universal. The memory is not.
Three independent GTEx whole slide images. Different fixation chemistry from training data. Structural centers emerge independently within each donor without labels. The frame invariant. The reference swappable.
Validation 04 — TCGA-COAD Survival
The honest null, and what it revealed
Primary hypothesis (mean Ω predicts OS) was not supported. HR=0.766, p=0.132. Reported directly. What emerged: spatial distribution across tumor regions predicts outcome. HR=1.872 DSS, p=0.0002. The average state is not the signal — the field distribution is.
Try the Demo Read the Paper Learn More

Semiconductor patterns
have topology. Measure it.

Semiconductor inspection imagery is evaluated through classification or generic feature descriptors. Those approaches detect separation — but they do not preserve the structural route by which an image changed. The practical inspection problem is not only whether an image is unusual. It is whether the pattern has stopped behaving like the structure it was supposed to be.


Visual Topology of Light (VTL) is a 15-coordinate physics-grounded framework derived from image formation first principles: edge sharpness (NILS), edge roughness (LER), centroid geometry, spectral band decomposition. No learned parameters. The same coordinates transfer across SEM defect images, binary wafer maps, and simulated aerial lithography — without retraining.


PTD-Z extends VTL into a Pattern Topology Drift Monitor: routed structural telemetry that decomposes measurement into six interpretable route families — image geometry, pitch/phase, material topology, contrast/gradient, signal/noise, and residual/support. Each route carries a process-facing hypothesis and a refusal condition for claims the evidence cannot yet support.

"A fixed coordinate system derived from image formation physics reveals systematic organization shifts across SEM, wafer maps, and aerial imagery — without domain-specific retraining."

VTL v3.1 · PTD-Z · 15 physics coordinates · 3 imaging modalities

Instrument Pipeline

Visual Topology of Light measurement pipeline
93.4%
SEM Balanced Accuracy
Carinthia semiconductor SEM · 4,579 images · 5-fold CV · first among all classical baselines
94.9%
Wafer Map Accuracy
MixedWM38 · 7,015 maps · 7.59× lift over Hu moments, which collapse to chance on wafer maps
0.9477
NIST Breach Ratio
PTD-Z envelope breach tracking across 3,402 NIST SEM degradation images. Pattern topology drift confirmed.
0
Learned Parameters
All 15 VTL coordinates derived from image formation physics. No training required across any modality.
NILS
Edge Sharpness
Normalized Image Log Slope — standard lithography metrology quantity. Strongest reference-free focus indicator (R²=0.975 against |focus|). Leads on SEM; confirms coherent defocus in aerial images.
LER
Edge Roughness
Std of lateral contour deviation, normalized by image diagonal. Identically zero on coherent simulated aerial images — an independent negative prediction confirmed before any real data was examined.
δx,y
Centroid Geometry
Intensity-weighted centroid displacement. Largest residual route after classical descriptors in Carinthia and NFFA audits. Carries mesoscale organizational signal that texture and moment descriptors do not preserve.
d_s
Spectral Spread
FFT power spectrum center of mass. Encodes spatial frequency organization — coarse vs. fine structure. Leads on density-encoded binary wafer maps where defect topology is encoded in spatial frequency.
θ
Gradient Orientation
Dominant gradient direction weighted by magnitude. Detects scratches, line patterns, oriented defects. Part of pitch/phase route — the most defensible independence route in PTD-Z audits.
Φ
Route Grammar
PTD-Z decomposes 15 coordinates into six route families, each carrying a process-facing hypothesis (overlay, pitch drift, etch bias, focus) and a refusal condition blocking unsupported causal claims.
Finding 01 — Cross-Modality Transfer
Coordinate dominance shifts systematically with imaging physics
LER leads production SEM (physical edge roughness). μ and d_s dominate wafer maps (density + spatial frequency). This shift is reproducible across three independent datasets with no retraining — the framework tracks visual organization, not domain statistics.
Finding 02 — Hybrid Complementarity
Structural organization provides residual signal after classical descriptors
Adding PTD-Z to Classical_All improves Carinthia macro F1 by +0.0214 and NFFA by +0.0144. Hybrid systems outperform either alone. image_geometry and pitch_phase are the most defensible residual routes — they persist in independence audits where signal/noise and contrast drift toward overlap roles.
Finding 03 — Refusal as Method
The system knows what it cannot say
PTD-Z treats refusal as a first-class component. Each route carries a refusal condition: when the margin is thin, when the selected channel is support-heavy, when no process-linked sequence data exists. No public image dataset provides direct fab process-cause labels. PTD-Z withholds causal language when evidence is absent.
Finding 04 — Wafer Grammar
Constrained learned layer confirms coordinate signal
WM-811K wafer pattern grammar (4,149 maps, 9 classes): hand grammar alone reaches macro F1 0.5609. A constrained logistic layer over deterministic coordinates reaches 0.8859. Null-label shuffle collapses to chance — confirming the coordinates carry real class signal, not artifact.
Try the Demo VTL Paper PTD-Z Paper Learn More

The field supports
multiple answers. Measure it.

Standard seismic processing assumes signal rises above a stochastic background. When the background is not stochastic — when it contains its own coherent structure — coherence and correctness become separable variables. A field can be highly organized around the wrong answer.


Ω is the mean pairwise disagreement among N physically independent structural reads of a field. When five theories reading genuinely different physical observables — amplitude, onset, frequency, similarity, arrival-time geometry — all agree, the field has collapsed to one dominant organization. When they scatter, the field is supporting multiple competing structures simultaneously. That is the signal SNR cannot see.


In Japan 2021 at PDAR, the direct P-wave was present at 71% of FK peak power. The beamformer chose the peak. The peak was 59.7° off. The P-wave was not absent — it just did not win. Ω measures the structural conditions under which that can happen.

"Coherence and correctness are separable variables. When the field supports multiple simultaneous coherent organizations, a single-answer instrument will pick one, and may pick the wrong one."

Ω · 5 events · Mw 7.0–9.0 · USArray + IMS-class arrays

Instrument Pipeline

Parallax Seismology measurement pipeline
6.9×
Independence Gain
Discriminating power increase from reading genuinely different physical observables vs. different transforms of the same input.
1.78
Cohen's d
P-window vs. pre-arrival Ω separation under data-driven conditions. 4 of 5 events. No earthquake geometry required.
12
FK Failure Cases
Cases where the direct P-wave was present and physically real yet did not win. 15 total failures, 4 arrays, 2 continents.
0
Earthquake Priors
Data-driven arrival-time picks produce pre-arrival Ω = 0.100 vs. P-window Ω = 0.075 with no source location or travel-time model.
Finding 01
Pre-arrival ambient field has coherent directional structure
The pre-arrival Ω signal decomposes into three separable layers: a floor (~0.040) on quiet days, an ambient timing layer (+0.060) recovered by FK beamforming without earthquake priors, and a geometric amplification layer (+0.069) from event-geometry arrival-time picks. The geometric stagger amplifies a genuine ambient signal approximately 2-fold — it does not create it.
Finding 02
Physical independence is the source of discriminating power
Five theories reading the same amplitude image (different mathematical transforms) produced P/coda mean pairwise separation of 0.004. Five theories reading genuinely different physical observables produced separation of 0.137. The gain is 6.9×. It came entirely from reading different physics — not from more theories or more data.
Finding 03
Ω collapses during P-wave arrival — reproducibly
Across five events spanning winter and summer, NW and SE source directions, and magnitudes Mw 7.0–9.0: Ω collapses during the direct P window as all five theories pull toward a common structural center, then expands again during coda as the field reorganizes. The three-window structure (diverge → converge → diverge) is the measurement.
Finding 04
SNR does not separate the three FK failure families
Twelve cases where the direct P-wave was energetically present but did not win FK beamforming. SNR does not flag these. Station-removal robustness does. The same operational symptom — wrong answer on a coherent field — emerges from pre-existing ambient dominance, fragile knife-edge coherence, and a robust coherent competitor coupled to P-wave onset.
Read the Paper FK Reliability Paper Learn More

One representation.
Three tasks. No training.

Materials informatics typically fragments by task. Phase classification reaches for texture descriptors. Property regression reaches for geometry descriptors. Label-quality auditing is a separate, often manual, expert exercise. STK's central claim is that these are manifestations of the same topological information. Because the 50 coordinates encode local texture and global structure together, the same fixed vector feeds a classification head, a permeability-regression head, and an ambiguity readout.


On the UHCSDB ultrahigh-carbon-steel benchmark (961 SEM images, 43 parent specimens), STK reaches 0.832 macro AUC under GroupKFold by specimen — beating HOG (0.728), Haralick (0.629), and the dimensionality-matched union of all three by +0.11. The win is structural: removing all tonal and spectral channels leaves it at 0.829. The same fixed kernel scores 0.964 AUC on a second steel set and reaches R²=0.939 on 2D permeability — classical-descriptor tier, matching porosity+S2+Euler.


The representation is static. The task is a choice of head. A per-task toolbox cannot, even in principle, report where its labels are soft or why a property estimate is uncertain. Because STK's classification, property, and ambiguity readouts are the same fifty coordinates, a model's uncertainty on one task is legible as structural complexity in the others.

"One fixed, interpretable coordinate set operating at incumbent tier across tasks that the field normally addresses with separate, non-overlapping pipelines — and doing two things no specialist pipeline offers at all: auditability across tasks and a measurable readout of label ambiguity."

STK-50 · UHCSDB · 2D porous media · bainite · training-free

Instrument Pipeline

Structural Topology Kernel measurement pipeline
0.832
UHCSDB Macro AUC
Grouped-by-specimen on 961 SEM images / 43 parent specimens. Beats HOG, Haralick, and their dimensionality-matched PCA-50 union.
0.939
Permeability R²
2D porous media permeability (held-out 2,000-image test). Matches classical-descriptor tier: porosity + specific-surface + Euler + S2.
50
Fixed Coordinates
9 measurement layers. No training. No tunable per-dataset parameters. One fixed deterministic kernel across all tasks and datasets.
+0.11
AUC Above Incumbent Union
Above the dimensionality-matched PCA-50 union of S2 + Haralick + HOG. The advantage is intrinsic, not feature-compactness.
Finding 01
Classification win is structural, not an acquisition fingerprint
Removing all tonal, spectral, and edge channels from STK leaves classification at 0.829 AUC — nearly identical to the full-kernel 0.832. The gain over HOG and Haralick is structural: higher-order spatial information (connectivity, topology) that second-order descriptors cannot encode by construction. STK's resistance-graph coordinates add +0.042 AUC conditioned on S2.
Finding 02
Metallurgically interpretable — each class discriminated by the right axis
Pearlite (lamellar) ← orientation entropy (single-coordinate one-vs-rest AUC 0.91). Network (grain-boundary cementite) ← cohesion/connectivity (0.80). Spheroidite (spherical particles) ← void-topology (0.78). Martensite (fine lath) ← fine-scale texture. The coordinates that discriminate each class are the ones a metallurgist would name.
Finding 03
Label ambiguity is measurable, concentrated, and intrinsic
~40% of UHCSDB samples sit closer to another class centroid than their own label. The ambiguity is concentrated in mixed classes (56–68%) vs. structurally distinct classes (28–34%), and falls along structurally-adjacent pairs — not at random. STK's ambiguity margin predicts the misclassifications of HOG and Haralick (error AUCs 0.604, 0.645) — the samples STK flags are genuinely hard for other methods too.
Finding 04
Generalization across datasets and a second task type
0.964 AUC on bainite M-A-island subclassification (second independent steel dataset, 26 specimens, GroupKFold). R²=0.939 on 2D permeability — classical-descriptor tier with zero training. Scale-robust by direct ablation: percolation coordinates degenerate at 128 px wake up at 256 px, with no material change in residual over the strengthened incumbent.
Read the Paper Learn More