A.rtist I.nfluencer  ·  Visual Thinking Lens  ·  Parallax Portrait Basin

A Deterministic Measurement of Image-System Independence
in Authored and Generative Portraiture

The Geometry of Authored Portraiture: Image-System Independence
1,341 Images  ·  4 Classes  ·  8 Traditions  ·  A 3-Axis Treatment Basin
Russell Parrish  ·  Parallax  ·  Portrait Basin  ·  VTL Structural + Color Kernel  ·  June 2026
Sample of the 1,341-image corpus

Sample of the 1,341-image corpus: pre-modern oil portraits, Midjourney default, and Midjourney steered

Abstract One might expect authored and generated portraiture to separate by placement, where the visual mass sits in the frame. In this corpus, they do not. Both center. Separation emerged instead from treatment, and beneath the individual treatment axes lies a single principle: the degree to which an image's subsystems — mass, color, contour, and value — are allowed to behave independently of one another. Authored portraits decouple these systems (coherent mass, Cohen d = 1.25; color autonomy, d = 0.85); the generative default keeps them coupled. Measured deterministically across a matched corpus of 446 oil portraits and 927 Midjourney portraits, every statistic bootstrapped and permutation-tested, this decoupling forms a compact basin (mass and value-independent color) separating the classes at ~78–80% held-out (91% by the full instrument), and authored construction occupies ~7× the operating volume on directional construction. The coupling is also differentially accessible: composition steering moves mass toward the authored range, but color autonomy does not move even under prompts that name the exact colorist mechanism — it is reachable, if at all, only in the weights, not the prompt.

Executive Summary

This report measures what separates an authored portrait from a generative one, using the deterministic VTL structural + color kernel. The corpus is a matched design: 446 pre-modern European and American oil portraits (CC0, museum open-access) against 447 Midjourney portraits from naive prompts, plus 448 composition-steered and 32 color-steered Midjourney portraits as two same-render controls. Every measurement is deterministic and reproducible from the source image; every headline statistic below carries a bootstrapped 95% confidence interval (Appendix B) and a permutation test.

#FindingKey statisticPlain reading§
1Position is weakboth centerYou cannot separate them by where the mass sits; artists center more3
2The tell is treatment~78% / 91% held-outMass + value-independent color separate them, not position4
3Mass vs gradientd = 1.25Artists build broad masses; MJ distributes gradient5
4The Mourgue resultCBI d = 0.85Color autonomy recovers art history (Rubens, Whistler, Goya)6
5Operating volumeoil ~7× MJOn directional construction; the engine collapses to a narrow default7
6Tradition barely moves MJη² = 0.15Naming a tradition shifts mass a little, color not at all8
7Mass is promptabler_v p < 10⁻¹⁰Steering moves MJ mass toward oil — composition, not render9
8Color is prompt-invarianthue↑ d=1.60Targeted color prompts add hues, never autonomy (likely weight-level)10
The one-sentence finding. Authored portraits and the Midjourney default both center — the difference is entirely in how the centered image is constructed: the artist builds with coherent mass and lets color act independently of light, across a wide operating volume; the engine balances through distributed gradient, couples color to luminance, and collapses to a narrow default that the named tradition barely perturbs. Mass is promptable; color authorship is not.

1. Overview & Dataset Structure

The VTL kernel is a deterministic image-measurement device. It converts a rendered 2-D image into a fixed vector of structural and chromatic coordinates — gradient mass, void ratio, cohesion, dispersion, OKLab color, and color–structure coupling — using classical computer vision (Sobel gradients, connected-component labelling, percentile thresholds). It ranks nothing; it locates an image in coordinate space, the way CIE L*a*b* locates a color or GPS locates a point. Its diagnostic power comes from distributions across a dataset: when images that should differ cluster tightly, that reveals a default; when they spread, that reveals range.

This study points the kernel at one coherent basin and its generative shadow: pre-modern (≤1900) European and American oil portraiture vs the Midjourney portrait default. The corpus is fully matched across four classes.

ClassnWhat it is
premodern_oil446CC0 oil portraits (Art Institute of Chicago, the Met, Cleveland), ≤1900, single sitter, manually culled for membership in the tradition. Reproductions.
ai_default447Midjourney, naive oil-portrait prompts, 112 sitters across 8 historical registers, default composition
ai_steered448Midjourney, composition-steered prompts (same 112 sitters), the same-render control
ai_color32Midjourney, color-autonomy-steered prompts, the color control (§10)

Six research questions structure the report. (1) Can the two classes be told apart by where the visual mass sits? (2) If not, which treatment axes separate them? (3) How is the difference physically constructed, image by image? (4) How much of the coordinate space does each class occupy? (5) Does naming a historical tradition move the engine, or does it default regardless? (6) Which of these behaviors is reachable by prompt, and which is locked in the weights?

Two senses of "distance." The engine reports a deviation-Ω, inverse-variance-weighted distance from the authored reference distribution, used here as the separation metric. Selection of the axes that define the basin runs through an explicit gauntlet: F-statistic > 50 (the axis discriminates), bootstrap coefficient-of-variation < 0.20 (it is stable), and pairwise |r| < 0.70 (it is non-redundant). Only axes that survive every confound control (§12) enter the canonical basin.

The principle beneath the axes. It is tempting to read the findings as a list — mass, then color, then contour, then variance. They are better read as instances of one thing: the degree of coupling between an image's subsystems. An authored portrait lets its value structure, its color, its contour, and its mass act as semi-independent systems — color decides where light does not, contour lives on its own, mass is composed rather than accumulated. The generative default keeps these systems coupled: color rides luminance, edges track detail, mass falls out of even rendering. Every axis below, and the recognition result in §13, is a measurement of that single underlying quantity.

2. Data Integrity & Measurement

The authored corpus was fetched under a scope filter enforced at the source (oil + painting + European/American + end-date ≤1900) with full provenance — source, accession ID, title, artist, date, license, dimensions — retained for every image. A single manual cull step removed 30 multi-figure or non-portrait works and one low-resolution scan; the criterion was membership in the single-sitter portrait tradition, never taste. Range is the asset, not a liability, so no image was dropped for being unusual.

The Midjourney classes were generated from a prompt deck of 112 sitters spanning eight registers — Dutch Golden Age, Spanish Golden Age, 18th-century English, Flemish Baroque, Italian Renaissance, early American Colonial, 18th-century French Rococo, and Northern Renaissance — with the historical tradition embedded in each filename, which is what makes the per-tradition analysis (§8) possible. All four variants of each generation were kept; no curation toward a house look.

A confound stated up front. Oil paintings and photographic-style AI renders differ in medium as well as authorship. This study does not pretend otherwise. Two devices address it: the two-regime audit (every axis is also measured luminance-normalized, so lighting-only effects are caught) and the same-render control (§9, Midjourney-vs-Midjourney), which isolates composition from render character. The honest frame is structural persistence — what survives transformation and same-render — not structural truth.

3. Finding 1: Position Is Weak — Both Center

Every position axis is weak (|Cohen d| ≤ 0.46), and where it tips, artists center more than Midjourney does. The pre-registered hypothesis — that AI portraits would read as centered, symmetric, and bokeh-blurred against looser human composition — failed. The pipeline reported the failure rather than confirming it. The condition actually shouldn't be a surprise. While an artist might semantically place a subject to one side, through mass and distribution, an artist typically uses offsets to "balance" the composition, orienting the gravity to the center.

Position axes
Figure 1. Position axes carry almost no separating signal; r_v (a treatment axis) is shown for scale. Frame-gravity (DGI), horizontal/vertical centroid offset, and bilateral symmetry barely move, and DGI actually runs higher in oil.

This is a credibility result, not an embarrassment. Formal pre-modern portraiture is mass-center by convention. Artists were largely not studying composition — "left-leaning state portrait," or any other state — and the engine found exactly that: frame-gravity is higher in oil (DGI 40.45 vs 34.89), the opposite of the prior. The hand-designed position priors were wrong in direction, the gauntlet correctly demoted them, and the search moved to where the signal actually lives: in treatment, not placement. A measurement system that only ever confirms its author's hypothesis is not measuring; this one disconfirmed its own.

4. Finding 2: The Tell Is Treatment

With position excluded, the discriminating axes resolve into two co-equal pillars: mass (how coherently visual weight is built) and value-independent color (whether color makes structural decisions independent of light). Every treatment axis below clears its permutation test at p < 0.001 (5,000 label shuffles), and the confidence intervals exclude zero by a wide margin. A note on dimensionality: the color pillar is one axis. The kernel offers two color-autonomy measures, CBI and color_independence, but they correlate r = 0.94 (color_independence is defined as half CBI), so we carry only CBI in the core, in keeping with the gauntlet's own non-redundancy rule (|r| < 0.70). The canonical basin is therefore three axes: r_v, mu, CBI.

Cohen forest plot
Figure 2. Treatment-axis effect sizes (oil − MJ-default), Cohen's d with 95% bootstrap CIs (5,000 resamples). r_v (mass) leads at d = 1.25; the color-autonomy axis (CBI) and the contour/multi-scale axes follow. color_independence is shown for completeness but is the redundant twin of CBI (r = 0.94). + = oil higher.
Axis subsetHeld-out separationNote
CANONICAL: mass + value-independent color (3 axes: r_v, mu, CBI)80% Ω / 78% LRSurvives every control; non-redundant
Mass only (r_v, mu)74.7%One pillar alone
Value-independent color only (CBI)75.3%The other pillar alone
Full 8-axis gauntlet basin83.2% Ω / 91.2% LRMaximum separation, but leans on render/frame-adjacent axes

The separation is real, not an Ω artifact. Independent classifiers trained on the same axes — LogisticRegression and Linear Discriminant Analysis — both reach 91.2% five-fold held-out, so the structure is in the data, not in the distance function. Gauntlet selection is stable across random seeds (Jaccard 1.00). The 3-axis core is the defensible scientific object; the 91% full-basin classifier is the ceiling. We report both and headline the conservative one. (Dropping the redundant color twin costs ~2 points of hold-out, inside the noise, and buys a non-redundant core.)

Basin scatter PCA
Figure 3. The basin made visible: the three core axes reduced to two principal components (82% of variance). Authored oil (green) separates to the right; the two Midjourney clusters sit left and tighter. The solid arrow is composition steering (§9) — it advances on the mass axis but trades color autonomy away, so it does not close the gap to oil. The dashed arrow is the color test (§10), which moves away entirely — hue without autonomy. Centroids as diamonds.

5. Finding 3: Mass vs Gradient — How the Quiet Is Built

The dominant axis is r_v (Cohen d = 1.25, 95% CI [1.08, 1.43]): the artist builds with coherent mass — a quiet gradient field broken by deliberate edges — while Midjourney builds with distributed gradient, a busy field of small activations. Oil mean r_v = 0.78 vs MJ 0.62. It survives luminance-normalization, so this is structure, not lighting. To anchor the effect: a random oil portrait carries more mass than a random Midjourney portrait 82% of the time (common-language effect size).

r_v distribution by class
Figure 4. r_v (gradient-quiet fraction) by class. Oil sits well above both Midjourney classes; steering (§9) pulls MJ partway up.
Mass extremes named
Figure 5. The extremes, named. Left: Whistler, "Arrangement in Flesh Color and Brown: Portrait of Arthur Jerome Eddy," the highest r_v in the corpus (0.99), almost the entire frame built from broad tonal masses. Right: the lowest-r_v Midjourney default (0.31), a Spanish-register monk rendered as a near-uniform field of fine gradient.

The named extremes make the axis concrete. The corpus maximum is Whistler's portrait of Arthur Jerome Eddy — a tonalist who titled his own painting as a structural arrangement ("Arrangement in Flesh Color and Brown") — and whose r_v of 0.99 is the engine agreeing with him. The minimum is a Midjourney monk whose every square inch carries fine rendered detail, so almost no region reads as quiet. Goya's "Portrait of General José Manuel Romero" holds the highest cohesion in the corpus (mu = 0.89): a single dominant mass against a resolved ground.

Gradient decomposition visual
Figure 6. The metric made visible: original → gradient field (bright = detail) → quiet map (gold = bright sheen, blue = dark shadow). r_v counts the quiet pixels; the coloring shows how the quiet is constructed.

r_v measures amount, not arrangement, and the quiet is built two ways. Two images can share an r_v by opposite means, so the brightness of the quiet pixels is diagnostic. Oil's quiet is predominantly dark shadow-mass (chiaroscuro): dark-quiet 63.6% vs MJ 51.1%. Midjourney leans roughly 1.5× more on bright sheen (5.3% vs 3.5%), glossy highlights deployed as an artificial balancing device where the artist would use shadow. Same r_v, reached by shadow versus gloss; the steering experiment (§9) later moves MJ off sheen and toward shadow, confirming the mechanism.

r_v collision — same score different construction
Figure 7. Two Midjourney defaults with near-identical r_v (0.442 vs 0.441) but opposite construction (mu 0.09 vs 0.31; frame-gravity 33 vs 4). A shared r_v is not shared construction, which is why the basin is a vector, not a score.

Sensitivity. Removing the most extreme 5% of each class on r_v does not weaken the finding — it strengthens it, from d = 1.25 to d = 1.68. The separation is carried by the bulk of the distribution, not by outliers (full table in Appendix B).

6. Finding 4: Color Autonomy — the Mourgue Result

The second pillar is value-independent color — whether color draws boundaries the light does not — measured by CBI (chromatic boundary independence): the fraction of strong chroma edges that fall where luminance is flat. Authored color does this structural work (CBI d = 0.85; the redundant twin color_independence agrees at d = 0.95); Midjourney's color rides luminance. And the instrument, pointed at paintings, recovers art history unprompted. (CBI captures one color grammar, color decoupled from value; it does not capture relational, all-over color where hue and value move together — a separate study.)

Color autonomy extremes named
Figure 8. The color-autonomy extremes, named. Left: "Madame Philippe Panon Desbassayns de Richemont," Jeanne Eglé Mourgue, high CBI (0.27). Right: the lowest-autonomy Midjourney default (CBI 0.002), color fully bound to value.
CBI decomposition
Figure 9. The color metric made visible (faithful to the CBI computation): original → chroma-edge map → "color-only edges," the pixels where a strong chroma edge falls with no coincident luminance edge. These are the CBI pixels. Top: Benoist's portrait of Jeanne Eglé Mourgue (CBI 0.27), a concentrated band on the green sash, red drape, and the colored shadows of the white dress — color decided where value is flat. Bottom: the Midjourney default (CBI 0.00) has almost none; its color rides the value structure.

The metric rediscovers what painters already knew. A color-autonomous image in 1,496 Mourgue has a large, smooth, near-white field of a dress — a huge reservoir of weak-luminance pixels — isolated saturated accents on a muted ground. The green bow, the gold bracelet, the orange-red drape, the boy's blue jacket: each a high-chroma island against low-chroma surroundings, so each border clears the 85th-percentile chroma-edge bar. Cool-shadow, warm-light rather than pure value. Hue turning where the light barely does is precisely color-without-luminance-edge, registered across the flesh and fabric. The engine did not know it was a Mourgue; it measured CBI = 0.271 and put it at the top. Within authored oil overall, the Flemish tradition leads color autonomy (color_independence 0.122), while the tonal Dutch and American traditions lead mass (r_v 0.82 and 0.82). A measurement system that reproduces the Venetian/Flemish colorito-versus-tonal divide from pixels alone is measuring something real about authored treatment. A random oil portrait shows more color autonomy than a random Midjourney portrait 78% of the time.

Oil traditions by axis
Figure 10. Within authored oil, by tradition: Flemish leads color autonomy (the Rubens lineage); Dutch and American lead mass (tonal traditions). The instrument recovers the colorito-vs-tonal divide from pixels alone.

Classical technique gives the mechanism a name: form modeled by warm–cool temperature at equal value, colored shadows instead of merely dark ones, hue boundaries placed where no luminance boundary falls. That is precisely what CBI counts — strong chroma edges with weak coincident luminance edges — and precisely what Midjourney's near-zero scores show it does not do.

7. Finding 5: Operating Volume & Monoculture

The deepest difference is not where Midjourney centers — it is how much structural possibility it can produce at all. On the directional treatment axes — cross-scale mass wander, orthogonal partitioning, vertical and diagonal energy — authored portraiture spans 6.76× the operating volume of the Midjourney default (generalized volume, the √det of standardized covariance: oil 0.379 vs MJ 0.056). These are the construction choices nobody is prompting for, and it is exactly there that the engine collapses to a narrow monoculture. The effect is direction-specific, and being precise about it is what makes it strong: on the core discriminating axes (mass + value-independent color) the gap is only 1.10× — expected, since the core is where the classes are defined and where steering can reach, so MJ keeps some range. The monoculture is not in the signal we classify on; it is in the unprompted treatment.

Classification asks whether two distributions differ; operating volume asks how much structural possibility each contains. Two generators could share a mean and still differ enormously in the range of constructions they can produce.

Operating volume
Monoculture variance ratios
Figure 11–12. Left: generalized volume on the directional treatment axes (oil 6.76× default). Right: per-axis variance ratio (artist σ / MJ σ) on those axes — artists range 1.7–2.5× wider; the engine collapses to a narrow default. Dashed line = parity. (On the core axes the gap is only 1.10× — the expansion lives in the directional construction, not the discriminating core.)

On the most directional axes — cross-scale mass wander, orthogonal partitioning, vertical and diagonal energy — the artist distribution is 2.46×, 2.22×, 1.89×, and 1.70× wider than Midjourney's. This reframes the question as compressed versus expansive construction strategy. It may eventually outrank the classification result: the same machine could ask whether some human traditions are already "AI-like" (compressed) while others explode the space — the library-defines-the-basin thesis, measured directly.

8. Finding 6: Does the Tradition Move the Engine?

Naming a historical tradition (Dutch Golden Age, Spanish, Flemish, Rococo…) moves Midjourney's mass a little (r_v η² = 0.15) and its color autonomy essentially not at all (CBI η² = 0.08). The engine defaults regardless of the tradition it is asked for.

Register effect within MJ
Figure 13. Within MJ-default, by register: mass (r_v) and color autonomy (CBI ×3 for visibility). Mass shifts modestly — Dutch-register prompts run densest (r_v 0.72), Rococo lightest (0.56) — but no register approaches the oil mean (dashed). Color autonomy is flat across all eight.

This is the mirror image of a content-sensitivity result. Some structural response to the tradition exists — "Dutch Golden Age" does pull the engine toward a denser, darker field, consistent with the tradition's tonal character — but it explains only 15.2% of the variance in mass and less than 8.0% in color autonomy. The deepest authored behavior, color leading structure, does not arrive by naming a colorist tradition; §10 shows it does not arrive even by describing the colorist mechanism.

9. Finding 7: Same-Render Control — Mass Is Promptable

Holding the render engine constant (Midjourney) and varying only composition (448 steered portraits, same sitters) isolates treatment from render character. Because both classes are Midjourney-native, this contrast is resolution-matched for free, and the result holds at 1536, 1200, and 1024.

Class-median r_v exemplars
Figure 14. Class-median-r_v exemplars: MJ default → MJ steered → oil. Steering blocks in heavier masses (read per-image; the sheen/shadow tendency is a class average).
DF4 rotation
Figure 15. Steering is a rotation, not a translation (steered − naive, oriented toward oil). Toward oil: mass, palette economy, luma/chroma decoupling. Away: independence, contour liveliness, midtones.

Mass is promptable — r_v moves toward oil (steered − naive d = 0.45, p < 10⁻¹⁰), identical at every resolution. So the dominant axis is composition, not render character. But steering is a rotation: MJ gains mass and palette economy while losing independence, contour liveliness, and midtones. Steered Midjourney is a new construction regime — not default-plus-oil — which is why it reads as a distinct third cluster, not an interpolation.

9.1 Which steering strategy moves which axis

The 448 steered images were generated under six rotated strategies; matching each image to its strategy through the prompt deck (432 of 448 recovered) lets us see the dissociation from inside the steered set. Mass strategies move mass; no strategy moves color autonomy toward oil, and the strategy explicitly built to create color autonomy moves it the wrong way.

Steering strategynd r_v (mass)d CBI (color)What moved
Off-center mass72+0.93−0.16mass ✓ color ✗
Chiaroscuro / value-structure72+0.71+0.02mass ✓ color ✗
Mass economy64+0.20−0.09mass (weak) color ✗
Color autonomy76+0.46−0.27color ✗ (moved away)
Tonal opposition76+0.32−0.25color ✗ (moved away)
Environmental asymmetry72+0.04−0.27position only

d = Cohen's d vs the naive default; + = toward oil. The diagonal is clean for mass and absent for color — internal corroboration of §10.

9.2 Where do steered images land — interpolation or rotation?

Train the oil-vs-default classifier (3 core axes) and apply it to the steered set. If steering were interpolation toward oil, most steered images would cross to the oil side. They do not.

Set classified by the oil-vs-default model% called "oil"
Oil (self-classification)78%
MJ default (baseline)20%
MJ steered28%

Steering lifts the steered set from 20% to 28% "oil" — real movement, but a minority crossing, not a flip. The steered images gained mass while shedding other authored properties, so the classifier still reads most of them as generated. This is the quantitative signature of a rotation, not an interpolation: a new construction regime adjacent to oil on one axis, distant on others.

Steering trades color for mass — it does not approach oil. Read on the three core axes, the trade is explicit: r_v moves 0.62 → 0.68 (toward oil 0.78), mu 0.24 → 0.28 (toward oil), but CBI moves 0.097 → 0.084, away from oil (0.159). Two axes improve, one degrades, and the net basin distance is flat: the steered set closes only ~2% of the naive→oil gap in deviation-Ω. The honest statement is not that steering moves Midjourney toward authored portraiture, but that it pivots — gaining composed mass at the cost of color autonomy.

10. Finding 8: Color Autonomy Is Not Reachable by Prompt

If mass is promptable, is color autonomy? Eight register-matched colorist prompts — El Greco, Rubens, Venetian colorito — aimed at the exact mechanism (colored shadows at equal value, warm–cool modeling, hue boundaries without value boundaries) tried to make Midjourney's color lead structure.

Color test results
Figure 16. Color test (32 images) vs baselines. hue_entropy explodes (d = +1.60, past oil); color_independence and CBI do not move toward oil.
AxisNaiveColor-testOilResult
color_independence0.0570.0480.095No move (p=0.27)
CBI0.0970.0790.159No move (p=0.26)
hue_entropy0.5530.6790.489+1.60 d, p < .001

Dissociation: hue variety is promptable; color autonomy is not. Midjourney heard "color" and poured in more hues — hue_entropy shot past the oil mean — but kept every one of them coupled to luminance. The autonomy axes did not move toward oil (both p ≈ 0.26). What this establishes experimentally is narrow and firm: color autonomy is invariant under prompt manipulation, even manipulation aimed precisely at the mechanism. The most parsimonious explanation is that the color–luminance coupling lives in the weights rather than the prompt, but that is the interpretation; the invariance is the result.

This completes the dissociation that organizes the whole report: mass is promptable (§9), hue variety is promptable (this section), and color authorship is not (this section). The prompt can dodge the basin on the first two; the deepest part of authored treatment is reachable only by the weights — the argument for open-weight models as the next sandbox.

11. Robustness

Resolution, the first external attack, is ruled out. The authored images were measured at 1536, the Midjourney images near 1200 native, raising the worry that the difference was a resampling artifact. Re-measuring both classes at matched 1024 and 1200 long-edge removes it: every treatment axis survives, and the two matched resolutions agree (1024 ≈ 1200), which a resampling artifact would not.

Resolution control
Figure 17. Treatment-axis Cohen's d across three resolutions — nothing collapses; the core axes are flat across scale.
ConfoundVerdict on the mass + color core
Ω weightingReal; independent LR/LDA reach 91% on the same axes
Luminance / lightingSurvives; two-regime audit (raw + luminance-normalized)
ResolutionRuled out; 1024 and 1200 matched re-measure agree
Frames / canvasClean; border probe — frames inflate only orientation axes within oil, mass + color untouched
Render characterResolved; r_v promptable (§9); color autonomy prompt-invariant (§10)
OutliersStrengthens; trimming 5% raises every core effect (Appendix B)

One residual, stated plainly. The negative-space-fraction axis was found degenerate during review (near-constant in MJ, σ = 0.001, which inflated its inverse-variance weight ~130× and skewed Ω); it was dropped, costing essentially no separation and rebalancing the per-class accuracy. The guard — watch any axis whose class variance approaches the regularizer — is now part of the protocol.

12. The Recognition Test — the Gate

A coordinate system earns its keep only if its ordering matches a trained eye. The gate is a blind forced-choice test: shown two portraits, which one "holds"? Ω's pick is scored against the judge's. This is a pilot (n = 1, a generalist judge, 60 pairs), reported as a pilot.

Recognition test results
Figure 18. Agreement with Ω. Cross-class (oil vs AI): 90%, 20/20 on the real oil. Within-class: chance (oil) and inverted (within MJ).

Ω is validated at the class/medium level, not yet at within-class ordering. The judge separated authored from generated almost perfectly (the minimum "extremes" bar), but within Midjourney she picked the Ω-default as "holds" — an inversion. Several explanations remain plausible, and this pilot cannot distinguish them: (1) the judge tracked polish, not authorship — if Ω ≈ independence and a lay eye ≈ finish, then within AI "less default" reads as "less finished," and the two anti-correlate by construction; (2) Ω's within-class ordering is genuinely wrong even though its class-level resolution is right; (3) the judge was a generalist, not the brief's trained-eye target, and a portrait professional would order differently; (4) n = 1, and a single judge's within-class picks are high-variance. The decisive next step is the same 60 pairs run by trained eyes, which separates "Ω is wrong" from "the judge tracked finish." We report the inversion and decline to resolve it here.

13. Interpretive Synthesis

Authored portraiture and the generative default both center; they part company in the treatment, and most of all in how independently their image systems are allowed to behave.

The concurrent behaviors

What the coordinate system does not say

These findings locate the two classes; they do not rank them. A low CBI is not "worse" than a high one — it is a different compositional philosophy. The engine measures; it does not evaluate. The cross-engine comparison reported here is exactly the study that single-engine measurement papers defer: both datasets run through the same kernel in the same configuration, with the medium confound named and bracketed rather than hidden.

What this paper is really about. This is, in the end, less a paper about Midjourney than a demonstration of deterministic structural measurement — Midjourney is the experimental vehicle, and the authored/generated contrast is the convenient first case where the coupling principle is easy to see. The lasting claim is the method and the principle: how independently an image's subsystems behave, measured without a model. If that principle survives other generators, digital painting, illustration, and photography, this becomes the first domain-specific instance of a general measurement framework, not an AI-comparison result. That is where its long-term value lies.


Appendix A: Scope of Claims

Scope confusion is the most common source of misreading in measurement research. Reading this list first prevents most interpretive errors.

What this paper does

What this paper does not do

Appendix B: Statistical Foundations

All corpus-level statistics are bootstrapped (n = 10,000 resamples, seed = 42, percentile method). Effect-size CIs use 5,000 paired bootstrap resamples; permutation p-values use 5,000 label shuffles (reported at the 1/5001 floor as "< .001"). Cohen's d = (μ_oil − μ_MJ)/σ_pooled; positive = oil higher.

B.1 Core axis means with 95% CI

AxisOil [95% CI]MJ-default [95% CI]MJ-steered
r_v0.78 [0.77, 0.79]0.62 [0.61, 0.63]0.68
mu0.31 [0.29, 0.32]0.24 [0.22, 0.25]0.28
CBI0.159 [0.154, 0.165]0.097 [0.089, 0.105]0.084

The three canonical core axes. color_independence (oil 0.095, MJ 0.057) is omitted as the redundant twin of CBI (r = 0.94).

B.2 Effect sizes (oil vs MJ-default): d, CI, permutation p

AxisCohen d95% CIPerm p
r_v1.25[1.08, 1.43]< .001
color_independence0.95[0.80, 1.13]< .001
chromatic_boundary_independence0.85[0.69, 1.03]< .001
mu0.44[0.31, 0.58]< .001
contour_kvar0.85[0.72, 0.97]< .001
vertical_energy0.74[0.63, 0.86]< .001
mscale_centroid_curv0.59[0.49, 0.69]< .001
dgi0.40[0.26, 0.53]< .001
hue_entropy−0.49[−0.61, −0.37]< .001
sdi−0.17[−0.30, −0.04]0.014

B.3 Sensitivity (trim extreme 5% per class)

Core axisd (full)d (trimmed 5%)
r_v1.251.68
mu0.440.57
color_independence0.951.48
chromatic_boundary_independence0.851.36

Trimming raises every core effect — the separation is carried by the bulk of the distribution, not by outliers.

Appendix C: Instrument Transparency

The measurement instrument is the VTL structural + color kernel. It is deterministic, model-free, and operates on rendered pixel data only — no prompts, weights, attention maps, or metadata. The same image always produces the same output, to floating-point precision.

Every metric has a closed mathematical definition (Appendix F), not a learned representation, so any value is independently verifiable from the image and the code. r_v is the fraction of pixels whose normalized Sobel gradient falls below a fixed quiet threshold; CBI is the fraction of strong chroma-edge pixels (85th-percentile chroma gradient) lacking a coincident luminance edge (30th-percentile luminance gradient); mu is the concentration of structural energy into coherent regions. Working size is 1536 long-edge (downscale-only); a configuration fingerprint is stamped into every output row and every basin file, so a result cannot silently mix configurations. r_v is computed after per-image gradient normalization, which makes it cross-image stable but removes absolute edge-energy sensitivity — values here are not directly comparable to earlier non-normalized work.

Appendix D: Anticipated Objections & Responses

These are the questions raised across five external review rounds, answered here rather than left to reframe the findings.

Q: Oil was measured at higher resolution than Midjourney — the difference is a resampling artifact.

A: Ruled out. Both classes were re-measured at matched 1024 and 1200 long-edge; every treatment axis survives and the two matched resolutions agree (1024 ≈ 1200). A resampling artifact would not be scale-stable. The raw-gradient negative control persists, locating that residual in render character, not resolution.

Q: The separation could be an artifact of the inverse-variance Ω weighting.

A: No. Independent LogisticRegression and LDA classifiers reach 91.2% five-fold held-out on the full instrument; the structure is in the data, not the distance function. Two redundancy fixes are reported: negative_space_fraction was dropped (degenerate under inverse-variance weighting), and color_independence was dropped from the core as the r=0.94 twin of CBI, leaving a clean three-axis core.

Q: Oil reproductions include frames and canvas edges.

A: A border-salience probe shows frame energy is not class-wide (oil 0.108 vs AI 0.113). Within oil, frames inflate orientation axes (vertical/orthogonal energy) by roughly a third to a half — which is exactly why those axes are excluded from the canonical core. Mass and color autonomy are untouched.

Q: Oil paintings vs photographic-style renders differ in medium, not just authorship.

A: Acknowledged and bracketed, not hidden. The two-regime audit measures every axis luminance-normalized to catch lighting-only effects, and the same-render control (§9, Midjourney-vs-Midjourney) isolates composition from render. The honest claim is structural persistence — what survives same-render and normalization — not structural truth.

Q: Color autonomy might be render character, not a weight-level property.

A: The color test (§10) settles this. Eight prompts aimed at the exact colorist mechanism moved hue variety hard (d = +1.60) and color autonomy not at all (p ≈ 0.26). A render setting reachable by description would have moved; this did not.

Q: The recognition test is n = 1.

A: It is a pilot and labelled as one. It validates Ω at the class/medium level (20/20 on real oil); the within-class inversion is unresolved and explicitly awaits a trained eye on the same pairs. No within-class ordering claim is made on this evidence.

Q: You are an independent researcher without institutional affiliation.

A: The findings should be weighted on their own evidence, which is documented in full. Every measurement is reproducible from the source images and the published kernel; every statistic is bootstrapped with stated seed and method. Affiliation does not change whether a Sobel gradient is computed correctly.

Appendix E: Methodology for Replication

E.1 Corpus

E.2 Measurement & statistics

Full command trail: REPRODUCE.md. Findings narrative: FINDINGS.md. Both ship with the engine package.

Appendix F: Operational Definitions

TermDefinition (a measurement, not a judgment)
r_v (void / mass ratio)Fraction of pixels whose normalized gradient is below the quiet threshold. High = built from broad masses.
mu (cohesion)Degree to which structural energy concentrates into coherent regions rather than distributed fields.
color_independenceAggregate chromatic autonomy: how much color varies where luminance does not.
CBI (chromatic boundary independence)Fraction of strong chroma edges lacking a coincident luminance edge. High = color draws its own boundaries.
hue_entropyDistributional diversity of hue across the image. Higher = broader palette (variety, not autonomy).
mscale_centroid_curvWander of the mass centroid across spatial scales, a directional / multi-scale treatment axis.
contour_kvarVariation in contour curvature, liveliness of edges.
DGI (frame-gravity)Degree to which mass organizes relative to the frame center.
SDI (dispersion)Spread of structural activation across the image.
operating volume√det of the standardized covariance on a chosen axis set — how much coordinate space a class occupies. Modest on the core; ~7× wider for oil on the directional treatment axes.
deviation-ΩInverse-variance-weighted distance from the authored reference distribution.

Appendix G: References