Parallax Pathology v4.1

The pathology is
the absence
of normal form.

A deterministic, training-free structural measurement system for H&E histology. Sixteen structural theories. Fifty-eight axes. No labeled pathological examples required.

Rather than learning what pathology looks like, the system learns what normal structure looks like — and measures deviation from that norm.

Benchmark CRC-VAL-HE-7K · 1,800 patches · 9 tissue classes Accuracy 91.5% nine-class linear accuracy · zero training External validation EBHI-SEG dysplasia grading · GTEx cross-dataset · TCGA-COAD survival Published 2026 · All data, code, and results available
91.5%
Nine-class accuracy
CRC-VAL-HE-7K, 5-fold cross-validation. No training. 58 deterministic axes vs. the benchmark.
r = 0.72
Dysplasia grade correlation
Spearman ρ on EBHI-SEG (n=920). Structural geometry tracks a continuous biological progression it was never shown.
16.3pp
Emergence gap
Best single layer scores 75.2%. Full 58-axis system: 91.5%. The combination is genuinely greater than any component.
0
Trained parameters
Every output is traceable to a specific geometric relationship. The explanation is the measurement — not post-hoc attribution.

A different question requires
a different instrument.

Standard approach

Train on labeled pathology. Learn what disease looks like.

Foundation models trained on millions of labeled patches achieve 95–99% accuracy on standard benchmarks. Their predictions are confident and often correct. But they require large labeled datasets of pathological examples — and they cannot explain which geometric properties drove the prediction. They produce a verdict, not a reading.

Parallax Pathology

Learn what normal looks like. Measure the departure.

The system encodes the structural laws of normal tissue — what orientation coherence, void topology, boundary permissiveness, and tonal distribution look like when tissue is doing what tissue of its type does. Deviation from those laws is the signal. A rare disease the system has never seen will register as structurally distant from all known normals. That is not a misclassification. It is the correct output.

The Frame — invariant
Fixed axes

Six structural axes — Δᵣ, θ, Γ, k_rv, highlight_mass, midtone_mass — plus 52 additional geometric, texture, and microstructure features. The frame defines how structure is measured, not what structure should be. It does not change between a colorectal biopsy and a breast biopsy.

The thermometer.
Ω — structural deviation
The reading

Ω(x) = √( Σᵢ wᵢ · (xᵢ − μᵢ)² ) where wᵢ = 1/variance. A continuous scalar measuring how far a patch sits from the structural expectation of a chosen reference class. Decomposed by axis — the source of deviation is always named.

Low Ω = tissue doing what that tissue does.
High Ω = failing to be itself.
The Memory — inserted
Swappable reference

A reference distribution built from representative normal examples of any tissue type. The memory is not part of the instrument — it is context provided to the instrument. Swap the memory for breast, kidney, or skin tissue. The frame stays. This is the Library of Normals architecture.

The reference temperature.

Nine layers. Fifty-eight axes.
All deterministic.

L1
Kernel Primitives
Nine geometric scalars on the confirmed edge field and luminance baseline. Centroid, void ratio, mass concentration, packing density, orientation coherence, structural thickness. The same compositional primitives that describe painting structure describe tissue architecture.
9 axes
L2
16 Mask Coherence
Sixteen independent structural theories applied simultaneously: VTL baseline, LMS cone channels, opponent channels, color-deficiency ablations, H&E deconvolution, Canny-derived masks. Four coherence scores from their pattern of agreement and disagreement.
+4 axes
L3
Structural Complexity
Centroid wander across scale (G1), void topology (G2), contour curvature variance (G3), orientation entropy slope over log(σ) (G4), and Structural Coherence Index. G4 is the strongest single axis for dysplasia progression detection.
+5 axes
L4–5
Radial Compliance + Tonal Structure
Mass organization relative to frame center vs. own center of gravity. Tonal structure maps LAB L* into shadow, midtone, and highlight mass fractions — the tonal axes are the primary carriers of survival signal in TCGA-COAD.
+6 axes
L6
Structural Expectation (Ω)
The deviation layer. Reference encodes per-class mean and variance-inverse weights. Ω computed for all reference classes simultaneously. Returns nearest class, own-class distance, top driving axis, and full per-class distance profile. Label-agnostic by design.
+1 axis
L7
Gap Mask Characterization
Surfaces the Sobel-minus-Canny residual — soft gradient regions that lack hard boundaries. Captures diffuse infiltrates, early stromal remodeling, nuclear membranes in transition. gap_rv (void ratio of the gap field) is a progression marker.
+5 axes
L8 / L8b
GLCM Texture (L* and H/E channels)
Gray-Level Co-occurrence Matrix at fine (cellular) and coarse (tissue) scales on luminance channel, plus separately on Beer-Lambert deconvolved hematoxylin and eosin channels. tex_homogeneity_fine produced the single largest accuracy gain: +5.1pp.
+14 axes
L9
Blob Microstructure (DoG Nuclear)
Difference-of-Gaussians detector on the hematoxylin channel. Nuclear density, mean size, and size coefficient of variation — a nuclear pleomorphism index — without requiring individual cell segmentation. Adipose correctly returns near-zero blob counts.
+5 axes

Instrument Pipeline

Pathology Kernel measurement pipeline

Three datasets.
Three different questions.

CRC-VAL-HE-7K · Tissue Classification
Tissue types occupy geometrically distinct, linearly separable regions.

1,800 Macenko-normalized 224px patches across 9 classes. Every tissue class shows visually confirmed canonical/deviant separation. Deviant TUM patches stratify into two geometrically distinct failure modes corresponding to mucinous differentiation and desmoplastic reaction — detected from geometry alone, without pathological labels.

Overall accuracy 91.5%
ADI / BACK 99% / 100%
Emergence gap 16.3pp
Trained parameters 0
EBHI-SEG · Dysplasia Grading
The instrument tracks a continuous biological progression it was never trained on.

Six-class dysplasia grading from Normal through Adenocarcinoma (n=920, 224px PNG). Adjacent classes overlap structurally by nature — this is not a classification problem, it is a measurement problem. 84.3% of predictions are correct or within one adjacent grade step. The system correctly places serrated adenoma structurally distinct from conventional low-grade neoplasia, consistent with its separate BRAF molecular pathway.

Spearman ρ (grade vs. predicted) 0.716
Correct + adjacent 84.3%
Progression axes p < 10⁻¹⁸ 11
Ridge R² (grade variance explained) 0.41
GTEx · Cross-Dataset Fixation
The frame is universal. The memory is not — and should not be.

Three independent GTEx whole slide images (PAXgene fixation — different preparation method from the formalin-fixed CRC-VAL data). Structural centers emerge independently within each donor without labels. Within-set Ω distributions (mean 2.09–2.26) are consistent with CRC-VAL results. A preparation chemistry finding: Δᵣ inverts its role in the deviation field between fixation methods, correctly detecting a real property of the tissue-chemistry interaction.

Donors tested 3 independent
Mean Ω convergence 2.09–2.26
Cross-preparation stable axes k_rv, θ
Labels required 0
TUM Deviant Analysis · Failure Mode 1
Density Loss

highlight_mass jumps from canonical mean 0.229 to 0.54–0.61. The tumor field opens, cellular density gives way to optically permissive space. Geometric signature corresponds to mucinous differentiation, intratumoral necrosis, or poorly cohesive growth — detected without training on either.

highlight_mass (Δ) +0.22 ↑ field opening
Γ — boundary permissiveness +0.21 ↑ losing definition
midtone_mass (Δ) −0.24 ↓ core density failing
TUM Deviant Analysis · Failure Mode 2
Directional Acquisition

θ jumps to 0.12–0.13, nearly five times the canonical TUM mean of 0.024. These patches have acquired directional structure that the tumor normally lacks. Geometric signature corresponds to desmoplastic reaction: host stromal fibrosis within the tumor field.

θ — orientation coherence 5× canonical → desmoplastic
Δᵣ — luminance-chromatic +0.02 ↑ structural tension
Drift destination → STR (stroma, 80%)
The honest null, and what it revealed.

TCGA-COAD survival analysis (n=180 colorectal patients, 47 OS events). The pre-specified primary hypothesis — that mean structural deviation (Ω) predicts overall survival — was not supported. HR=0.87, p=0.30. This result is reported directly, not minimized. It is also informative: the average tumor state does not predict outcome. What predicts outcome is how the structural state is distributed across the tumor. Intra-tumor structural variability and tonal density were significantly associated with survival independent of pathological stage. The Tumor Consolidation Index stratifies patients with observed mortality of 15.0%, 26.7%, and 36.7% across tertiles. Concordance index on disease-specific survival: 0.850 — without any outcome training.

2.4× mortality difference,
low vs. high TCI tertile
0.850 DSS concordance index,
no outcome training
p=0.30 primary hypothesis,
honestly null
Structural Evidence Two patients. Same stage. Different structural fates.
H&E histology comparison — structural heterogeneity vs consolidation, same stage, different survival
751 days
Structural heterogeneity · Stage IV
Hybrid composite: −5.0
Variable architecture across regions. Permissive, regionally variable organization.
61 days
Structural consolidation · Stage IV
Hybrid composite: +4.6
Uniform density, minimal patch-to-patch variation. Compressed, monotone field organization.
Higher composite scores indicate compressed, monotone field organization. Lower scores indicate permissive, regionally variable organization. Scores are z-scored composites of four structural axes from Macenko-normalized H&E tiles — no clinical outcome data used. After normalization, tonal mean axes were removed; scores reflect geometric structure, void ratios, luminance-chromatic disagreement, and intra-tumor variability, not staining intensity.

A different question
answered by a
different instrument.

A foundation model answers
What is this tissue?

Confident categorical prediction. 95–99% accuracy on standard benchmarks. Optimized for classification. Requires large labeled datasets of pathological examples. Cannot explain which geometric properties drove the confidence. The gap in accuracy between deep learning and Parallax Pathology is real and expected — it reflects genuinely non-geometric information that no hand-designed feature encodes. This gap is not hidden. It is documented explicitly in the paper.

Parallax Pathology answers
How far is this tissue from what tissue of this type is supposed to do structurally, and which geometric axis is failing?

Continuous deviation scalar. Axis-level explanation built into the measurement itself. No training on pathological examples. No training distribution — high Ω on an unseen tissue type means the geometry does not fit any known structural regime. That is not a misclassification. It is the correct output. Rare diseases surface as high-deviation from all normals.

The structural metric vector — 58 deterministic scalar outputs per patch — is a structured, biologically interpretable feature representation. It can be concatenated with any foundation model's learned embedding. Using Ω as an out-of-distribution signal alongside a deep classifier produces a system that is both accurate and capable of flagging patches outside the training distribution. The intended relationship is hybrid, not competitive. The 91.5% figure is evidence that the measurement space has real geometric structure. Its value as a hybrid component is not bounded by that ceiling.

The frontier
is explicit.

01
Library of Normals

References for breast, lung, kidney, and skin have not been built. The architecture supports them. Building each requires only normal tissue examples — not labeled pathological material.

02
448px Field of View

The current benchmark is fixed at 224px. The hypothesis that tissue-scale fiber organization reads more cleanly at larger field width requires tiling from source WSIs. The experiment was designed; the data was not available.

03
Clinical Ground Truth

No correlation between structural deviation and pathological grade, molecular subtype, or clinical outcome has been formally validated. TCGA-COAD is exploratory. This is the most important open question for translational relevance.

04
Longitudinal Drift

Multiple biopsies from the same patient produce a trajectory in Ω space. Tissue drifting toward a pathological state should show increasing deviation before a defined histological diagnosis. Serial biopsy datasets with known outcomes are required to test this.

05
Stain Normalization Study

Tonal axes carry staining-protocol sensitivity. Macenko normalization applied to TCGA-COAD would cleanly separate the staining artifact from the architectural signal. If highlight_mass survives normalization, the finding is architectural.

06
Foundation Model Integration

The 58-axis structural vector as a feature layer concatenated with UNI or CONCH embeddings. The combination carries both high-accuracy learned features and geometrically interpretable structural measurements. Not yet tested.

Geometry as a
perceptual instrument.

Origin
The Visual Thinking Lens

Parallax Pathology extends the Visual Thinking Lens (VTL), a geometric kernel originally developed for compositional analysis of AI-generated images and visual art. The same field-agnostic primitives — centroid, void ratio, orientation coherence, spatial dispersion — that describe compositional structure in paintings describe tissue architecture in H&E sections. The frame is universal. The application is not.

Principle
Finding the ceiling of what geometry can see

The system is explicitly not competing with foundation models. It is asking a different question with a different instrument: how far can deterministic geometric measurement go? At 91.5% nine-class accuracy without any training, the answer is: further than expected. The 16.3pp emergence gap confirms that the structural layers encode genuinely complementary information — not redundant signals averaging together, but a genuinely new capability from their combination.