Longevity Science

What the science actually says
about reversing biological age.

A decade ago, "anti-aging" was the domain of supplement marketing and wishful thinking. Today it's Nobel Prize-winning biology, $3 billion venture rounds, and the first FDA-accepted clinical trial targeting aging as a disease. This is what the research actually shows, and what it doesn't.

DISCLAIMER, This page summarizes published peer-reviewed research. Antiaging Labs does not administer drugs or hormones. Our protocols are lifestyle-based: nutrition, training, recovery, and evidence-based supplements. Some research below involves pharmaceutical interventions not used in our program.
01, The framework

The 12 Hallmarks of Aging

In 2013, López-Otín et al. published a landmark paper in Cell establishing 9 distinct biological mechanisms that drive aging. A decade of evidence led to a 2023 update, adding 3 more hallmarks and cementing this as the field's operating framework. Every serious longevity intervention targets one or more of these. Antiaging Labs's 4-lever protocol addresses 7 of the 12.

01
Genomic Instability
DNA damage accumulates over time from oxidative stress, replication errors, and radiation. Repair mechanisms become less efficient.
02
Telomere Attrition
Protective caps on chromosome ends shorten with each cell division. Short telomeres trigger cell senescence or death.
03
Epigenetic Alterations
DNA methylation patterns, histone modifications, and chromatin structure drift away from their youthful configuration, measurable via epigenetic clocks.
04
Loss of Proteostasis
Protein folding quality control degrades. Damaged proteins accumulate. Autophagy (cellular housekeeping) becomes less efficient.
05
Deregulated Nutrient Sensing
mTOR, AMPK, sirtuins, and insulin/IGF-1 signaling pathways lose calibration. Cells grow when they should repair, and vice versa.
06
Mitochondrial Dysfunction
Mitochondria produce more reactive oxygen species and less ATP. Energy availability declines. Cell signaling degrades.
07
Cellular Senescence
Damaged cells stop dividing but refuse to die. They secrete a cocktail of inflammatory signals (SASP) that poison neighbouring tissue.
08
Stem Cell Exhaustion
Tissue-specific stem cells decline in number and function. Regenerative capacity falls across most organs and tissues.
09
Altered Intercellular Communication
Hormones, cytokines, and exosomes that coordinate tissue function become dysregulated. Chronic low-grade inflammation (inflammaging) takes hold.
10
Disabled Macroautophagy
The cell's recycling system, responsible for clearing damaged organelles and misfolded proteins, becomes progressively impaired.
Added 2023
11
Chronic Inflammation
Low-grade sterile inflammation (inflammaging) persists systemically, driving cardiovascular disease, neurodegeneration, and metabolic dysfunction.
Added 2023
12
Dysbiosis
The gut microbiome shifts toward a pro-inflammatory composition with age, contributing to immune dysregulation and systemic inflammation.
Added 2023

Source: López-Otín et al., Cell 2013 (9 hallmarks); López-Otín et al., Cell 2023 (updated to 12 hallmarks)

02, Cellular reprogramming

Yamanaka Factors and the
age reversal breakthrough

In 2006, Shinya Yamanaka discovered that four transcription factors, Oct4, Sox2, Klf4, and c-Myc (OSKM), could reprogram any adult cell back to a pluripotent stem cell state. The implication: the genome retains a full "memory" of youth. Aging isn't a one-way ratchet. In 2012, Yamanaka received the Nobel Prize in Medicine. Since then, the question has shifted from "can we reprogram cells?" to "can we do it safely, partially, in living organisms?"

"We can reset the age of a cell, and it remembers its youthful state. Aging has an address, and we've found it."

David Sinclair, Harvard Medical School, paraphrased from multiple interviews, 2020–2023
03, David Sinclair & the information theory

Aging as information loss, and how to get it back

Professor David Sinclair (Harvard Medical School, Paul F. Glenn Center for Biology of Aging Research) has built the most coherent single theory of aging currently in science. His 2019 book Lifespan brought these ideas to a mainstream audience. The core claim: aging is not wear-and-tear or mutation accumulation, it's the progressive loss of the epigenetic information that tells cells who they are and how to behave.

The Core Theory · Published 2019–2023
The Information Theory of Aging
Sinclair & LaPlante, Lifespan (2019) · Multiple Cell / Nature publications
DNA
is the CD.
Epigenome is the music.

The hypothesis

The genome is like a compact disc, it contains all the information needed for life. The epigenome is like the music encoded on it. With age, the disc gets scratched. The information is still there, but it can't be read correctly. Cells forget who they're supposed to be. Sinclair argues this is the primary cause of aging, not genetic mutation.

The supporting evidence

  • Reprogramming works, cells can be reset to a younger epigenetic state, proving the original information is preserved
  • Sirtuins as guardians, these NAD+-dependent enzymes maintain epigenetic fidelity. When they're busy repairing DNA damage, they neglect epigenetic maintenance
  • NAD+ decline, NAD+ drops ~50% between 20 and 60, reducing sirtuin activity and accelerating epigenetic drift
  • Stress accelerates clocks, epigenetic clocks run faster under chronic stress, poor sleep, and sedentary lifestyle, consistent with the theory
NAD+ & Sirtuins · Clinical Research 2019–2023
NMN, NR, and the NAD+ restoration debate
Yoshino et al. 2021 (Science) · Liao et al. 2021 · Dollerup et al. 2020
~50%
NAD+ decline
ages 20 → 60

What the trials show

  • Yoshino et al. 2021 (Science), NMN supplementation improved muscle insulin sensitivity and gene expression in postmenopausal women with prediabetes. Double-blind, placebo-controlled.
  • Liao et al. 2021, NMN safe and well-tolerated in older adults over 60 days at up to 900mg/day. Raised blood NAD+ levels.
  • Dollerup et al. 2020, NR raised blood NAD+ but showed no significant metabolic benefit at 12 weeks. Stirred debate about bioavailability and what raising blood NAD+ actually means.

Honest assessment

The NAD+ space is contested. Blood NAD+ can be raised, the question is whether this translates to meaningful intracellular effects in aged tissues. Phase II trials are ongoing. Exercise remains the most robustly proven method of raising NAD+ and activating sirtuins, which is why it's the backbone of every serious longevity protocol, including Antiaging Labs's.

Sinclair's self-experiments (1g NMN daily) are widely publicized but are n=1 anecdotes, not clinical data. He acknowledges this.

04, Senolytics

Clearing the cells that shouldn't still be there

Cellular senescence is one of the most actionable hallmarks of aging. Senescent cells stop dividing after DNA damage, telomere shortening, or oncogene activation, a short-term safety mechanism that prevents cancer. But over time, the immune system fails to clear them efficiently, and they accumulate throughout the body, releasing a toxic cocktail of inflammatory signals (SASP: Senescence-Associated Secretory Phenotype). Senolytics are drugs that selectively kill these cells.

SASP
IL-6, IL-8, MMP3, PAI-1, the inflammatory molecules senescent cells secrete, damaging neighbouring tissue
D + Q
Dasatinib + Quercetin, the leading senolytic combination. Dasatinib is a cancer drug; quercetin is a plant flavonoid.
2019
First human senolytic trial published (Kirkland, Mayo Clinic), 11 patients, IPF. Reduced senescent cell markers.
2023
D+Q shown to reduce senescent cell burden in diabetic kidney disease. Improvement in physical function.
Mayo Clinic · 2019–2023
Kirkland Lab: Dasatinib + Quercetin in humans
Kirkland & Tchkonia, Cell Metabolism 2020 · Quercetin + Dasatinib human trials, EBioMedicine 2019, Aging Cell 2023
Phase II
Multiple active human trials across conditions

What the research shows

  • Intermittent D+Q dosing (3 days on, 2 weeks off) selectively eliminates senescent cells in human tissue
  • 2019 IPF trial: reduced senescent cell markers (p16, p21), improved 6-minute walk distance
  • 2023 DKD trial: reduced senescent cell burden in kidneys, improved eGFR in some patients
  • Frailty trials ongoing at multiple Mayo sites

Current limitations

  • Small trials, most n < 50. Effect sizes need replication at scale.
  • SASP creates feedback loops, clearing too aggressively may trigger immune response
  • Dasatinib is not a benign drug, side effects include pleural effusion at therapeutic doses
  • No FDA-approved senolytic therapy exists yet. Unity Biotechnology's UBX0101 failed Phase II for knee osteoarthritis (2020)
  • Quercetin alone has poor bioavailability, the D+Q combination appears synergistic
05, mTOR & Rapamycin

The most robust longevity drug
in animal models

mTOR (mechanistic Target of Rapamycin) is the master regulator of cellular growth, protein synthesis, and autophagy. When it's chronically active, cells prioritize growth over repair, accumulating damage faster. Caloric restriction works partly by reducing mTOR. Rapamycin, a drug originally developed as an immunosuppressant, is its most potent inhibitor, and has the most replicated lifespan-extension data of any drug in mammalian models.

NIA Interventions Testing Program · 2009, ongoing
Rapamycin: the only drug to repeatedly extend lifespan across independent labs
Harrison et al., Nature 2009 · Multiple ITP replication papers · Miller et al., Aging Cell
+13%
median lifespan extension in mice, even when started at 20 months (equivalent to ~60 human years)

What makes this remarkable

  • Late intervention still works, Harrison 2009: rapamycin fed to mice at 20 months (equivalent to starting a human at ~60) still extended lifespan 9–14%. Most drugs only work if given from youth.
  • Three independent labs, the ITP requires replication at Michigan, Maine, and Texas. Rapamycin passed. Only a handful of interventions have.
  • Immune rejuvenation in humans, Mannick et al. 2014 (Science TM): low-dose rapamycin improved vaccine response and immune function in elderly humans. Reversal of immunosenescence.

The open questions

  • Dosing problem, chronic high-dose rapamycin (transplant doses) impairs insulin signaling and wound healing. Intermittent dosing (1–3x/week) appears to preserve benefits while reducing side effects.
  • Dog Aging Project, Matt Kaeberlein's rapamycin trials in middle-aged pet dogs show improved cardiac function and apparent slowing of biological aging. Largest dog longevity trial ever run.
  • TRIAD Trial, Phase II/III human trial of rapamycin for aging. Enrolling 2024–2025. First proper human longevity data coming.
Albert Einstein College of Medicine · TAME Trial
Metformin: the first FDA-accepted trial targeting aging as a disease
Barzilai et al., Cell Metabolism 2016 · TAME Trial design, 2020 · Enrolling 2023–2024
3,000
participants · 14 clinical sites · 6-year follow-up · $75M trial

Why TAME matters

TAME (Targeting Aging with Metformin) is historic not because metformin is the most powerful longevity drug, it probably isn't, but because the FDA accepted "aging" as a therapeutic target for the first time. If TAME produces positive data, it opens a regulatory pathway for every future longevity drug. The trial enrolls adults 65–79 who already have one age-related condition. Primary endpoint: time to development of any additional age-related disease.

Why metformin?

  • 40+ years of safety data, widely prescribed for type 2 diabetes
  • Epidemiological data shows diabetics on metformin live longer than non-diabetics not on it (Bannister et al. 2014, Diabetes, Obesity and Metabolism)
  • Activates AMPK (the cellular energy sensor) and modestly inhibits mTOR
  • Anti-inflammatory, reduces oxidative stress
  • Nir Barzilai's bet: affordable, safe, scalable. Not the best drug, the best first drug.
06, Measuring biological age

Epigenetic clocks, the measuring instruments of the longevity revolution

You cannot manage what you cannot measure. The epigenetic clock revolution gave aging research its first rigorous outcome metrics, molecular timekeepers that quantify biological age from DNA methylation patterns or blood chemistry. These clocks are why the field can now run clinical trials and produce quantifiable results. They're also what Antiaging Labs uses operationally to measure client outcomes.

Horvath Clock · 2013

Published: Steve Horvath, Genome Biology

353 CpG methylation sites predict biological age across 51 different tissues and cell types. The first pan-tissue epigenetic clock. Measures cumulative epigenetic aging. Slows with caloric restriction; responds to reprogramming experiments.

GrimAge · 2019

Published: Lu et al., Aging

The strongest predictor of time-to-death among all existing clocks. Trained on mortality data directly. Integrates smoking history via methylation proxies. Most clinically relevant for longevity interventions. Requires methylation array ($300–500/sample).

DunedinPACE · 2022

Published: Belsky et al., eLife

Measures the pace of aging, not your biological age, but how fast you're aging right now. A DunedinPACE of 0.9 means aging 10% slower than average. The most sensitive clock for detecting short-term intervention effects. Used in the CALERIE trial.

What the clocks say about interventions
−3.2 yr
Horvath clock reversal in 8 weeks via diet, exercise, sleep, stress (Fitzgerald et al. 2021, Aging). No drugs.
−2.5 yr
Horvath clock reversal over 12 months with GH + DHEA + metformin (Fahy et al. 2019, TRIIM Trial, Aging). First controlled reversal trial.
↓ Pace
DunedinPACE significantly slowed by 2-year caloric restriction in 220 adults (CALERIE Trial, Nature Aging 2023). Largest longevity RCT in non-obese humans.
What Antiaging Labs uses

Antiaging Labs calculates PhenoAge (Levine 2018, Nature Medicine) from your 50+ marker blood panel. PhenoAge is a blood-chemistry-based model trained on mortality risk, 9 markers, fully calculable from standard bloodwork, validated across multiple cohorts. It is not an epigenetic clock (that requires methylation arrays at $300–500/sample), but it is the most clinically accessible and well-validated biological age model available at population scale. For our 90-day program, PhenoAge delta is the right metric. We are separately building a model to predict epigenetic clock scores from blood chemistry, see Our Research below.

07, Active clinical trials

The trials that will define the next decade

Clinical evidence in longevity is still early but moving fast. These are the most significant active or recently completed trials targeting biological aging in humans.

Trial Intervention N Duration Primary Endpoint Status
TAME Barzilai, Albert Einstein College of Medicine Metformin 1500mg/day 3,000 6 years Composite of 5 age-related conditions (CVD, cancer, dementia, disability, death) Enrolling
TRIIM-X Fahy, Intervene Immune GH + DHEA + metformin (replication of TRIIM) ~100 12 months Epigenetic clock reversal (Horvath, GrimAge) Ongoing
Dog Aging Project / TRIAD Kaeberlein, Univ. of Washington Rapamycin (low-dose, intermittent) 580 dogs → human trial planning 4 years (dogs) All-cause mortality, healthspan, cardiac function Active
CALERIE II Duke, Tufts, Washington Univ. 25% caloric restriction 220 2 years DunedinPACE, cardiometabolic risk Completed · Nature Aging 2023
PEARL (NMN) Procter & Gamble / Keio University NMN 250mg/day ~30 per arm 12 weeks Muscle function, insulin sensitivity, NAD+ levels Completed · Science 2021
Senolytic D+Q Kirkland, Mayo Clinic, multiple studies Dasatinib + Quercetin (intermittent) 11–100 (varies by arm) 3–6 months Senescent cell markers (p16, p21), physical function Phase II ongoing
Rapamycin in Healthy Adults Multiple academic sites, 2024–2025 Low-dose rapamycin (weekly) ~250 12 months Biological age clocks, immune function, cardiometabolic markers Enrolling 2024–25
08, The industry

Where the serious money is going

Longevity has moved from fringe to the most-funded area in biotechnology. These are the companies that matter, not supplement brands or wellness influencers, but organizations with serious scientific leadership and institutional capital.

Altos Labs
$3B · Founded 2022
Cellular rejuvenation programming via partial reprogramming. The largest biotech launch in history. Led by world-class scientists including Juan Carlos Izpisua Belmonte, Deepak Srivastava. Yamanaka on scientific advisory board.
Backed by: Jeff Bezos, Yuri Milner
Calico (Alphabet)
$1.5B+ · Founded 2013
Google's longevity bet. Research-heavy, publications-focused. Long-term biology of aging across organisms. Partnered with AbbVie for drug development. Notoriously private about results.
Backed by: Alphabet / Google
Retro Biosciences
$180M · Founded 2022
Partial reprogramming, autophagy enhancement, plasma-inspired therapeutics. Targeting a 10-year extension of healthy human lifespan. Focused on translating longevity science into clinical therapies.
Backed by: Sam Altman
NewLimit
$130M · Founded 2021
Partial epigenetic reprogramming. Using machine learning to design reprogramming regimens. Founded by Brian Armstrong (Coinbase CEO) and Blake Byers. Sinclair on scientific board.
Backed by: Coinbase founders, institutional VCs
Unity Biotechnology
Public (UBX)
Senolytics, drugs that selectively eliminate senescent cells. Had a Phase II setback in knee osteoarthritis (UBX0101, 2020). Pivoted to ophthalmology (UBX2954). Still the pioneer in clinical senolytic development.
Backed by: Bezos, Mayo Clinic collaboration
BioAge Labs
$90M+
Using longitudinal biomarker data and machine learning to identify novel drug targets for aging. Partnered with Abbvie. The "data-first" approach to longevity drug discovery, closest in philosophy to what Antiaging Labs is building on the research side.
Backed by: a16z, GV (Google Ventures)

"We are at the beginning of a new era in medicine. Not treating diseases of aging, but treating aging itself, before those diseases emerge."

Nir Barzilai, Director of the TAME Trial, Interview, Nature Medicine 2022
Our Research

Beyond running the Antiaging Labs program, we are building the deep technology to make epigenetic age testing 10× cheaper and accessible to any clinical panel. Below is our full research blueprint.

Full Research Blueprint: Tabular Foundation Models as Surrogate Epigenetic Clocks


The Problem, Precisely Stated

Gold standard: Epigenetic clocks (GrimAge, DunedinPACE, Horvath, PhenoAge-DNAm) require Illumina EPIC 850K methylation arrays. $300–500/sample. 3–6 week lab processing. Specialized bioinformatics pipeline.

Current blood-based alternative: PhenoAge (Levine et al., 2018, Nature Medicine), linear regression on 9 blood markers trained to predict chronological age, not methylation clock outputs. Linear. 2018. Never updated. Not actually trained to approximate GrimAge or DunedinPACE.

The gap we fill: Train a deep tabular model directly on the 30–60 blood chemistry markers available in a standard clinical panel, with methylation clock scores as the prediction target. Multi-task across 4 clocks simultaneously. Calibrated uncertainty. Graceful missingness. Deployable for $30/sample.

Our unique edge: Antiaging Labs is operationally running PhenoAge on real clients. We understand what 50-marker panels look like in practice, what's missable, and what "clinical deployment" requires. No competing researcher has that grounding.


Section 1, Dataset Selection

Primary Dataset: UK Biobank (UKB)

This is the only dataset that gives you scale, breadth, and both methylation + blood chemistry simultaneously.

What it has:

How to calculate your targets from this data:

Training data becomes: [blood chemistry vector] → [4 clock output scores]

How to apply:

  1. Register at ukbiobank.ac.uk, the process takes 2–4 months. Start today.
  2. Complete mandatory GDPR and data handling training
  3. Submit a project application with your research question (below)
  4. Data delivered to a Tier 2 secure compute environment

Application pitch:

"We aim to develop deep learning surrogate models for epigenetic biological age clocks using standard clinical blood biochemistry. Using paired methylation array and blood biochemistry data, we will train tabular transformer architectures to predict established clock outputs (GrimAge, DunedinPACE, Horvath's DNAmAge, PhenoAge-DNAm) from routine blood markers alone. This work aims to enable scalable, low-cost population-level biological age monitoring without requiring DNA methylation arrays."

Start this application this week. It is the long pole in the tent.


Secondary Datasets (external validation, critical for Nature-tier papers)

Dataset N (methylation) Blood chemistry Access Use
Generation Scotland (GS) ~18,000 Yes, clinical biochemistry Application to GS DataSafe Haven Primary external validation
MESA ~1,200 Yes, extensive dbGaP application (free) Cross-ethnic validation
Framingham Heart Study ~2,500 Yes dbGaP application (free) Longitudinal validation
Lothian Birth Cohorts (LBC1936) ~1,000 Yes Collaboration with Edinburgh Age-specific validation
KORA cohort ~1,800 Yes, German clinical Collaboration with Helmholtz European replication

The Indian cohort angle (our unique contribution):
Antiaging Labs clients, even a small cohort of 30–50, give us something no competing lab has: Indian urban professional blood chemistry paired with biological age calculation. After the main model, we validate on this cohort. Cross-ancestry generalizability writes itself as a finding. Most epigenetic clock validation is 95% European-ancestry cohorts.


Section 2, Architecture Design

Baselines (we must beat all of these)

Model Purpose
PhenoAge (Levine 2018)Existing clinical standard, 9 markers, linear
Ridge/ElasticNet on all featuresShows linear benefit of more features
Random ForestTree ensemble baseline
XGBoost / LightGBMStrong GBDT baseline, hardest to beat
MLP (vanilla deep net)Shows basic DL doesn't automatically win
FT-Transformer (Gorishniy 2021)Best off-the-shelf tabular transformer
TabNet (Arik & Pfister 2019)Interpretable attention baseline

Our novel architecture: BioAge-Transformer

Core design principles:

  1. Feature tokenization (each biomarker becomes a vector, not a raw float)
  2. Biological pathway-aware attention (metabolic markers attend to each other, etc.)
  3. Learned missing value tokens (not imputation, critical for clinical use)
  4. Multi-task heads (one per clock, shared encoder, separate decoders)
  5. Calibrated uncertainty (Gaussian NLL outputs mean + variance per clock)

Architecture in full:

class BioAgeTransformer(nn.Module):
    """
    Multi-task tabular transformer for surrogate epigenetic clock prediction.
    Predicts GrimAge, DunedinPACE, Horvath, and PhenoAge-DNAm
    from standard clinical blood chemistry panels.
    """
    def __init__(
        self,
        n_features: int,          # ~42 blood markers
        d_model: int = 192,
        n_heads: int = 8,
        n_layers: int = 6,
        n_clocks: int = 4,
        n_pathways: int = 6,      # metabolic, lipid, inflammation,
                                   # hormonal, CBC, nutrients
        dropout: float = 0.1,
    ):
        super().__init__()

        # ── Feature tokenization ──────────────────────────────────────
        self.feature_embedding = nn.Embedding(n_features, d_model)
        self.value_projector   = nn.Linear(1, d_model)

        # Learnable missing-value token (one per feature, not global)
        self.missing_tokens = nn.Parameter(torch.randn(n_features, d_model))

        # ── Biological pathway embeddings ────────────────────────────
        self.pathway_embedding = nn.Embedding(n_pathways, d_model)

        # ── [CLS] token ───────────────────────────────────────────────
        self.cls_token = nn.Parameter(torch.randn(1, 1, d_model))

        # ── Transformer encoder ───────────────────────────────────────
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=d_model, nhead=n_heads,
            dim_feedforward=4 * d_model, dropout=dropout,
            activation='gelu', batch_first=True, norm_first=True,
        )
        self.transformer = nn.TransformerEncoder(encoder_layer, n_layers)

        # ── Shared projection head ─────────────────────────────────────
        self.shared_head = nn.Sequential(
            nn.LayerNorm(d_model), nn.Linear(d_model, 128),
            nn.GELU(), nn.Dropout(dropout),
        )

        # ── Per-clock output heads (mean + log_variance) ──────────────
        self.clock_heads = nn.ModuleList([
            nn.Sequential(nn.Linear(128, 64), nn.GELU(), nn.Linear(64, 2))
            for _ in range(n_clocks)
        ])

        # ── Auxiliary head: chronological age ────────────────────────
        self.age_head = nn.Linear(128, 1)

    def forward(self, x: Tensor, mask: Tensor = None):
        B = x.shape[0]
        feat_ids = torch.arange(x.shape[1], device=x.device)
        feat_emb = self.feature_embedding(feat_ids)
        val_emb  = self.value_projector(x.unsqueeze(-1))
        tokens   = feat_emb.unsqueeze(0) + val_emb

        if mask is not None:
            missing_emb = self.missing_tokens.unsqueeze(0)
            tokens = torch.where(mask.unsqueeze(-1).expand_as(tokens),
                                 missing_emb.expand(B, -1, -1), tokens)

        pathway_emb = self.pathway_embedding(self.feature_to_pathway)
        tokens = tokens + pathway_emb.unsqueeze(0)

        cls    = self.cls_token.expand(B, -1, -1)
        tokens = torch.cat([cls, tokens], dim=1)
        out    = self.transformer(tokens)
        shared = self.shared_head(out[:, 0])

        clock_preds = [head(shared) for head in self.clock_heads]
        age_pred    = self.age_head(shared).squeeze(-1)
        return clock_preds, age_pred

Training loss:

def gaussian_nll(pred, target):
    mean, log_var = pred[:, 0], pred[:, 1].clamp(-6, 6)
    return (0.5 * (log_var + (target - mean)**2 / log_var.exp())).mean()

def total_loss(clock_preds, clock_targets, age_pred, age_target,
               clock_weights=(1.0, 1.0, 0.8, 0.8), lambda_age=0.1):
    clock_loss = sum(w * gaussian_nll(p, t)
                     for w, p, t in zip(clock_weights, clock_preds, clock_targets))
    return clock_loss + lambda_age * F.mse_loss(age_pred, age_target)

Section 3, The Full Experiment Suite

Experiment 1: Main comparison table

Run all models on same train/val/test split (80/10/10, stratified by age decade and sex). Metrics: MAE, R², Pearson r, Spearman ρ per model × clock. Target for GrimAge: r > 0.80 (prior blood-based approaches achieve ~0.65–0.75).

Experiment 2: Feature minimization curve

Start with all 42 features. Remove in reverse order of SHAP importance. Plot MAE vs. panel size. Key finding to demonstrate: "A 20-marker panel achieves 95% of full-panel performance. The 9-marker PhenoAge panel achieves only 78%." This is the clinical deployment argument.

Experiment 3: Missing data robustness

Introduce missingness at 5%, 10%, 20%, 30%, 50%, 70% under MCAR, MAR, and MNAR mechanisms. Compare learned token approach vs. mean imputation, iterative imputation, KNN, and GAIN. Expected result: learned token outperforms all imputation strategies at all missingness levels.

Experiment 4: Multi-task vs. single-task

Train 4 single-task models vs. BioAge-Transformer. Expected result: multi-task model outperforms all single-task models, especially on DunedinPACE (hardest clock to predict from blood alone).

Experiment 5: Calibration analysis

Reliability diagram: for each 10th-percentile prediction interval, what fraction of true values fall inside? Report Expected Calibration Error (ECE) per model. Clinical framing: our model's 90% intervals should contain the true value 90% of the time.

Experiment 6: External validation

Train on UK Biobank (~40K). Validate on Generation Scotland (~3K), MESA (~1.2K), Framingham (~2K). Expect 5–15% MAE increase on external cohorts. Show it's not catastrophic.

Experiment 7: Longitudinal tracking

Apply model to participants with repeat blood draws (Framingham, LBC). Does predicted biological age increase at the expected rate? Does it rise faster in people who develop disease?

Experiment 8: Interpretability

Attention weight heatmap per head. SHAP values per feature per clock. Expected biological story: GrimAge dominated by albumin, inflammation, ApoA; DunedinPACE dominated by metabolic markers. This section should read like biology, not ML.

Experiment 9: Subgroup analysis

All metrics stratified by sex, age decade, BMI, smoking status, ancestry. Check for heterogeneous performance. Investigate and fix, or disclose clearly.

Experiment 10: Comparison to existing DL papers

Directly compare against Lee et al. 2023 (Aging), Bae et al. 2021 (npj Digital Medicine), Putin et al. 2016 (Aging). Run their models on our test set. Show superiority directly.


Section 4, Figures Strategy

Figure 1, Architecture: Clean schematic of the full BioAge-Transformer. Pathway-color-coded. Should be readable by a non-ML clinician.

Figure 2, Main results: Grouped bar charts, MAE per model per clock, 95% bootstrap CIs.

Figure 3, Feature minimization curve: x = panel size, y = normalized MAE. Mark the "20-marker elbow." Add cost axis on right.

Figure 4, Calibration: Reliability diagram. Our model follows the diagonal; others don't.

Figure 5, Interpretability: (A) Attention heatmap by pathway. (B) SHAP dot plot, top 15 features per clock.

Figure 6, External validation: Performance on UKB test set vs. each external cohort, with error bars.


Section 5, Week-by-Week Timeline

Weeks 1–2: Apply for UK Biobank access (critical, 2–4 month wait). Apply MESA via dbGaP. Download NHANES (free, immediate). Set up GPU compute.

Weeks 3–6: Build full data pipeline on NHANES + GEO data while waiting for UKB. Implement all baseline models.

Weeks 7–12: UK Biobank access arrives. Calculate clock targets from methylation data in R. Extract and join blood chemistry features. Run baselines.

Weeks 13–18: Implement BioAge-Transformer in PyTorch. Optuna hyperparameter sweep (200 trials). Run all 10 experiments. Log everything in W&B.

Weeks 19–24: External validation. Interpretability analysis. Statistical analysis (bootstrap CIs). Supplementary material.

Weeks 25–30: Paper writing in LaTeX. Figure production (publication-quality matplotlib/seaborn). Submit to npj Digital Medicine.

Compute requirements

TaskHardwareTime estimate
Baseline modelsCPU, 16GB RAM2–4 hours
BioAge-Transformer training1× A100 (40GB)6–12 hours per run
Hyperparameter sweep (200 trials)4× A100 parallel2–3 days
Interpretability (SHAP on 40K)GPU recommended4–8 hours

GCP a2-highgpu-1g (1× A100) = ~$3.67/hr. Full project: budget ~$500–800 compute.


Section 6, The 5 Novelty Claims

  1. First deep tabular model trained to directly approximate methylation-derived epigenetic clocks from blood chemistry, PhenoAge approximates mortality risk, not GrimAge/DunedinPACE/Horvath. These are different targets.
  2. Multi-task learning across 4 biological age clocks, joint training exploits shared variance and improves performance on harder clocks like DunedinPACE.
  3. Principled missingness handling via learned tokens, all prior work uses imputation. Our model handles arbitrary missingness patterns without separate imputation.
  4. Calibrated uncertainty quantification, clinical deployment requires confidence intervals. We provide per-sample prediction intervals that are demonstrably well-calibrated.
  5. Feature minimization analysis establishing minimum effective panels, what is the minimum blood panel for each level of clock prediction performance?

Section 7, Journal Selection

Primary target: npj Digital Medicine (Nature Portfolio), IF ~15, scope exactly this, open access, 4–8 weeks to first decision.

Secondary: Nature Communications, broader scope, slightly harder.

Tertiary: Aging (Albany NY), home journal of most clock papers, IF ~4–6, more likely if external validation is limited.

Do not target Nature Medicine, Cell, or Lancet Digital Health on first submission.


Section 8, The Antiaging Labs Data Angle

"To demonstrate real-world clinical utility, we apply BioAge-Transformer to a prospectively collected cohort of n=X urban Indian professionals enrolled in the Antiaging Labs longevity program (Hyderabad, India). This cohort differs from UK Biobank in ancestry, diet, and metabolic risk profile. Despite these differences, BioAge-Transformer [maintains performance / shows X% performance drop], suggesting [generalizability beyond European-ancestry populations / the need for fine-tuning in South Asian populations]. Notably, n=Y clients had repeat blood draws at 90-day intervals following a structured lifestyle intervention, the first demonstration of surrogate clock tracking over a short-term intervention period."

Even n=20 paired before/after samples is a publishable finding if the surrogate clock moves with the intervention. Apply to an Indian IRB before collecting data for research purposes.


The 8 Papers to Read This Week

  1. Levine et al., 2018, PhenoAge, Nature Medicine. Your benchmark.
  2. Lu et al., 2019, GrimAge, Aging. Primary prediction target.
  3. Belsky et al., 2022, DunedinPACE, eLife. Hardest prediction target.
  4. Horvath, 2013, Original epigenetic clock, Genome Biology.
  5. Gorishniy et al., 2021, FT-Transformer, NeurIPS. Primary architecture baseline.
  6. Hollmann et al., 2022, TabPFN, NeurIPS. Second architecture baseline.
  7. Lee et al., 2023, Deep learning for biological age from blood. Closest existing work.
  8. Bae et al., 2021, DL biological age from CBC, npj Digital Medicine. Primary "we improve upon" citation.

Immediate Action Items

PriorityTaskTime
🔴 CriticalApply for UK Biobank access3 hours
🔴 CriticalDownload NHANES data, start feature pipeline4 hours
🟠 HighApply for MESA via dbGaP1 hour
🟠 HighSet up GPU compute environment (GCP/AWS)2 hours
🟠 HighRead the 8 papers aboveThis week
🟡 MediumSet up GitHub repo (public, MIT license)1 hour
🟡 MediumApply to Indian IRB for Antiaging Labs data2 hours
🟢 LaterGeneration Scotland collaboration email1 hour

The UK Biobank application is the hardest blocker. Start today.