Structural Topology Kernel · Materials Microstructure
STK is a fixed 50-coordinate, deterministic descriptor of grayscale structural fields. The same kernel that classifies microstructure phases also predicts permeability and makes label ambiguity measurable — from a single, auditable representation.
Materials informatics typically fragments by task. STK's central claim is that classification, property prediction, and label auditing are manifestations of the same topological information.
STK is a fixed, deterministic operator: it maps a grayscale field to 50 real-valued coordinates with no training, no tunable per-dataset parameters, and no stochasticity. It reads structure in any grayscale field — SEM micrographs, EBSD-derived maps, binary masks — and makes no assumption about how the field was formed.
On the UHCSDB ultrahigh-carbon-steel benchmark, STK-50 scores 0.832 macro AUC under GroupKFold by specimen — the correct evaluation respecting specimen structure, since 961 images derive from only 43 parents. The win is not feature compactness: STK also beats the dimensionality-matched PCA-50 union of S2+Haralick+HOG by +0.11. Removing all tonal, spectral, and edge channels leaves STK at 0.829 — the gain is structural, not an acquisition fingerprint.
STK scores 0.964 macro AUC on bainite M-A-island subclassification under GroupKFold by specimen. The dataset is independently collected, different detector, different steel system, different task — yet the same fixed 50 coordinates produce a strong classification result. The UHCSDB win is not dataset-specific. Breadth caveat: 26 specimens is moderate statistical power; the result is consistent but not as strongly powered as UHCSDB.
2D permeability of binary porous microstructures is the sharpest test of whether STK reads property-relevant structure: it is 2D-governed (matching STK's 2D reading), and the mechanism is percolation and connectivity — exactly the coordinate class STK's island/resistance layer targets. STK-50 alone matches porosity + specific-surface + Euler + S2 and a connectivity-aware strengthened incumbent. The task-trained ConvNeXt ceiling (~0.995) represents fine pore geometry no 50-coordinate summary descriptor captures — that gap is expected and conceded.
Microstructure "classes" are often continua forced into categories. STK provides a reproducible continuous position that places mixed classes between their pure parents and flags label ambiguity. The robust finding is the structure of this ambiguity: it is concentrated in mixed classes (56–68% ambiguous) and falls along structurally-adjacent class pairs, not at random. STK's ambiguity margin predicts the misclassifications of independent descriptors (HOG-error AUC 0.604, Haralick-error AUC 0.645) — the samples STK flags are genuinely hard for other methods too. Cross-domain mechanism check: ρ=0.26 convergence with six-pathologist disagreement on H&E (Gleason 2019).
STK does not beat task-specialized methods on any single benchmark — its value is that one fixed coordinate set operates at incumbent tier across tasks normally addressed with separate, non-overlapping pipelines. The permeability result is on a synthetic, noiseless distribution and a 2,000/2,000 subset; a task-trained ConvNeXt reaches ~0.995 that no 50-coordinate summary approaches, because the fine pore geometry that sets the last ~6% is compressed away. The label-ambiguity contribution is proxy-validated in materials with no native inter-observer gold standard; the mechanism is checked cross-domain only (Gleason ρ=0.26). Cross-lab breadth rests on one strongly-powered dataset (UHCSDB) with bainite replication at 26 specimens. STK is not computationally cheaper than simple topology descriptors. The boundary is part of the contribution: STK is a portable structural measurement layer with a now-characterized ceiling, not a magic residual-finder.
Materials informatics typically uses HOG and Haralick for phase classification, porosity and Euler number for property regression, and manual expert review for label quality auditing. Each is optimized for its task. None can, even in principle, report where its own labels are soft or why a property estimate is uncertain — because the descriptor that classifies and the procedure that audits are different objects computed from different pixels.
Because STK's classification, property, and ambiguity readouts share the same fifty coordinates, a model's uncertainty on one task is legible as structural complexity in the others. The ambiguity STK flags in classification is the same margin that predicts other descriptors' errors — measured on the identical axes. This is not a packaging convenience: unification is what makes the auditing possible, and it is precisely what siloed specialists give up.
The practical consequence is a single transparent, auditable, reproducible interface — one fixed coordinate set that replaces a toolbox of siloed per-task descriptors with a representation that is easier to validate and debug, and where a model's uncertainty can be read as structural complexity rather than discarded as noise. STK is a portable structural measurement layer, not a task-specific winner.