A neuro-symbolic architecture that bridges abstract biology and continuous physics. One shared encoder, two paradigms: mechanistic interpretation (V1) and gradient-based treatment optimization (V2).
Building a tumor digital twin requires both biological depth and temporal density. No single data source has both — each fills the other's fatal gap.
6 complete modalities per patient
Measurements every 6-12 weeks. Only 3-4 data points per patient. Cannot learn continuous tumor dynamics from snapshots.
Measurements every 2-3 days. Dense longitudinal time-series that reveal growth dynamics, drug response curves, and resistance emergence.
Feature completeness across 6 modalities. The biological target for domain alignment.
Clinical predictions rely on PDX-learned physics. PDX relies on human-learned feature imputation.
Dense time-series reveal growth dynamics needed to train neural ODEs.
Two paths, one foundation — hover to explore each path
Full multi-modal latent — direct input
UNI2-h histopathology via late gated fusion
ConditionedGatingModelV3 + FiLM + physics bottleneck
RNA-derived portion of VAE latent
DSNHypernetwork + FiLM + BatchNorm
Both paths share the same VAE encoder, serving different clinical and research needs
"Tell me what is happening"
"Tell me when progression occurs"
The H-BDVAE compresses all available tumor data into a unified biological latent state (328d) that captures underlying disease biology while factoring out technical artifacts. Uses Product-of-Experts fusion for graceful handling of missing modalities.
Probabilistic Encoder Self-Distillation will train a student encoder to match a teacher that sees all modalities. Currently, missing modalities are handled via Product-of-Experts zero-masking.
All ODE parameters satisfy biological constraints: ρ∈[0,0.3], β∈[0,1], ω>0. 0.00% violation rate. Violations fail regardless of C-index.
Three independent checks — physics compliance, fidelity to data, and clinical utility — must all pass before any prediction is served.
Seven interconnected models from data encoding to evolutionary simulation
Multi-modal encoder with Product-of-Experts fusion
Strips mouse stroma, imputes missing epigenetics from RNA
Context-aware driver gene identification via GATv2
Drug sensitivity with concept bottleneck architecture
Dual-path physics-informed parameter generation
Continuous tumor dynamics with Dopri5 solver
Stochastic clonal evolution with Euler-Maruyama SDE
Every DNAI prediction decomposes into a chain of inspectable, biologically-named computations. From raw gene expression through 328 named latent dimensions, six physics parameters with physiological units, to time-resolved trajectories with calibrated uncertainty — nothing is opaque.
Explore Prediction TraceabilityReview validation metrics, benchmark comparisons, and model performance