A decade ago, "anti-aging" was the domain of supplement marketing and wishful thinking. Today it's Nobel Prize-winning biology, $3 billion venture rounds, and the first FDA-accepted clinical trial targeting aging as a disease. This is what the research actually shows, and what it doesn't.
In 2013, López-Otín et al. published a landmark paper in Cell establishing 9 distinct biological mechanisms that drive aging. A decade of evidence led to a 2023 update, adding 3 more hallmarks and cementing this as the field's operating framework. Every serious longevity intervention targets one or more of these. Antiaging Labs's 4-lever protocol addresses 7 of the 12.
Source: López-Otín et al., Cell 2013 (9 hallmarks); López-Otín et al., Cell 2023 (updated to 12 hallmarks)
In 2006, Shinya Yamanaka discovered that four transcription factors, Oct4, Sox2, Klf4, and c-Myc (OSKM), could reprogram any adult cell back to a pluripotent stem cell state. The implication: the genome retains a full "memory" of youth. Aging isn't a one-way ratchet. In 2012, Yamanaka received the Nobel Prize in Medicine. Since then, the question has shifted from "can we reprogram cells?" to "can we do it safely, partially, in living organisms?"
"We can reset the age of a cell, and it remembers its youthful state. Aging has an address, and we've found it."
Professor David Sinclair (Harvard Medical School, Paul F. Glenn Center for Biology of Aging Research) has built the most coherent single theory of aging currently in science. His 2019 book Lifespan brought these ideas to a mainstream audience. The core claim: aging is not wear-and-tear or mutation accumulation, it's the progressive loss of the epigenetic information that tells cells who they are and how to behave.
The genome is like a compact disc, it contains all the information needed for life. The epigenome is like the music encoded on it. With age, the disc gets scratched. The information is still there, but it can't be read correctly. Cells forget who they're supposed to be. Sinclair argues this is the primary cause of aging, not genetic mutation.
The NAD+ space is contested. Blood NAD+ can be raised, the question is whether this translates to meaningful intracellular effects in aged tissues. Phase II trials are ongoing. Exercise remains the most robustly proven method of raising NAD+ and activating sirtuins, which is why it's the backbone of every serious longevity protocol, including Antiaging Labs's.
Sinclair's self-experiments (1g NMN daily) are widely publicized but are n=1 anecdotes, not clinical data. He acknowledges this.
Cellular senescence is one of the most actionable hallmarks of aging. Senescent cells stop dividing after DNA damage, telomere shortening, or oncogene activation, a short-term safety mechanism that prevents cancer. But over time, the immune system fails to clear them efficiently, and they accumulate throughout the body, releasing a toxic cocktail of inflammatory signals (SASP: Senescence-Associated Secretory Phenotype). Senolytics are drugs that selectively kill these cells.
mTOR (mechanistic Target of Rapamycin) is the master regulator of cellular growth, protein synthesis, and autophagy. When it's chronically active, cells prioritize growth over repair, accumulating damage faster. Caloric restriction works partly by reducing mTOR. Rapamycin, a drug originally developed as an immunosuppressant, is its most potent inhibitor, and has the most replicated lifespan-extension data of any drug in mammalian models.
TAME (Targeting Aging with Metformin) is historic not because metformin is the most powerful longevity drug, it probably isn't, but because the FDA accepted "aging" as a therapeutic target for the first time. If TAME produces positive data, it opens a regulatory pathway for every future longevity drug. The trial enrolls adults 65–79 who already have one age-related condition. Primary endpoint: time to development of any additional age-related disease.
You cannot manage what you cannot measure. The epigenetic clock revolution gave aging research its first rigorous outcome metrics, molecular timekeepers that quantify biological age from DNA methylation patterns or blood chemistry. These clocks are why the field can now run clinical trials and produce quantifiable results. They're also what Antiaging Labs uses operationally to measure client outcomes.
Published: Steve Horvath, Genome Biology
353 CpG methylation sites predict biological age across 51 different tissues and cell types. The first pan-tissue epigenetic clock. Measures cumulative epigenetic aging. Slows with caloric restriction; responds to reprogramming experiments.
Published: Lu et al., Aging
The strongest predictor of time-to-death among all existing clocks. Trained on mortality data directly. Integrates smoking history via methylation proxies. Most clinically relevant for longevity interventions. Requires methylation array ($300–500/sample).
Published: Belsky et al., eLife
Measures the pace of aging, not your biological age, but how fast you're aging right now. A DunedinPACE of 0.9 means aging 10% slower than average. The most sensitive clock for detecting short-term intervention effects. Used in the CALERIE trial.
Antiaging Labs calculates PhenoAge (Levine 2018, Nature Medicine) from your 50+ marker blood panel. PhenoAge is a blood-chemistry-based model trained on mortality risk, 9 markers, fully calculable from standard bloodwork, validated across multiple cohorts. It is not an epigenetic clock (that requires methylation arrays at $300–500/sample), but it is the most clinically accessible and well-validated biological age model available at population scale. For our 90-day program, PhenoAge delta is the right metric. We are separately building a model to predict epigenetic clock scores from blood chemistry, see Our Research below.
Clinical evidence in longevity is still early but moving fast. These are the most significant active or recently completed trials targeting biological aging in humans.
| Trial | Intervention | N | Duration | Primary Endpoint | Status |
|---|---|---|---|---|---|
| TAME Barzilai, Albert Einstein College of Medicine | Metformin 1500mg/day | 3,000 | 6 years | Composite of 5 age-related conditions (CVD, cancer, dementia, disability, death) | Enrolling |
| TRIIM-X Fahy, Intervene Immune | GH + DHEA + metformin (replication of TRIIM) | ~100 | 12 months | Epigenetic clock reversal (Horvath, GrimAge) | Ongoing |
| Dog Aging Project / TRIAD Kaeberlein, Univ. of Washington | Rapamycin (low-dose, intermittent) | 580 dogs → human trial planning | 4 years (dogs) | All-cause mortality, healthspan, cardiac function | Active |
| CALERIE II Duke, Tufts, Washington Univ. | 25% caloric restriction | 220 | 2 years | DunedinPACE, cardiometabolic risk | Completed · Nature Aging 2023 |
| PEARL (NMN) Procter & Gamble / Keio University | NMN 250mg/day | ~30 per arm | 12 weeks | Muscle function, insulin sensitivity, NAD+ levels | Completed · Science 2021 |
| Senolytic D+Q Kirkland, Mayo Clinic, multiple studies | Dasatinib + Quercetin (intermittent) | 11–100 (varies by arm) | 3–6 months | Senescent cell markers (p16, p21), physical function | Phase II ongoing |
| Rapamycin in Healthy Adults Multiple academic sites, 2024–2025 | Low-dose rapamycin (weekly) | ~250 | 12 months | Biological age clocks, immune function, cardiometabolic markers | Enrolling 2024–25 |
Longevity has moved from fringe to the most-funded area in biotechnology. These are the companies that matter, not supplement brands or wellness influencers, but organizations with serious scientific leadership and institutional capital.
"We are at the beginning of a new era in medicine. Not treating diseases of aging, but treating aging itself, before those diseases emerge."
Beyond running the Antiaging Labs program, we are building the deep technology to make epigenetic age testing 10× cheaper and accessible to any clinical panel. Below is our full research blueprint.
Gold standard: Epigenetic clocks (GrimAge, DunedinPACE, Horvath, PhenoAge-DNAm) require Illumina EPIC 850K methylation arrays. $300–500/sample. 3–6 week lab processing. Specialized bioinformatics pipeline.
Current blood-based alternative: PhenoAge (Levine et al., 2018, Nature Medicine), linear regression on 9 blood markers trained to predict chronological age, not methylation clock outputs. Linear. 2018. Never updated. Not actually trained to approximate GrimAge or DunedinPACE.
The gap we fill: Train a deep tabular model directly on the 30–60 blood chemistry markers available in a standard clinical panel, with methylation clock scores as the prediction target. Multi-task across 4 clocks simultaneously. Calibrated uncertainty. Graceful missingness. Deployable for $30/sample.
Our unique edge: Antiaging Labs is operationally running PhenoAge on real clients. We understand what 50-marker panels look like in practice, what's missable, and what "clinical deployment" requires. No competing researcher has that grounding.
This is the only dataset that gives you scale, breadth, and both methylation + blood chemistry simultaneously.
What it has:
How to calculate your targets from this data:
methylclock R packagegrimage R package (Lu et al., 2019)DunedinPACE R package (Belsky et al., 2020)Training data becomes: [blood chemistry vector] → [4 clock output scores]
How to apply:
ukbiobank.ac.uk, the process takes 2–4 months. Start today.Application pitch:
"We aim to develop deep learning surrogate models for epigenetic biological age clocks using standard clinical blood biochemistry. Using paired methylation array and blood biochemistry data, we will train tabular transformer architectures to predict established clock outputs (GrimAge, DunedinPACE, Horvath's DNAmAge, PhenoAge-DNAm) from routine blood markers alone. This work aims to enable scalable, low-cost population-level biological age monitoring without requiring DNA methylation arrays."
Start this application this week. It is the long pole in the tent.
| Dataset | N (methylation) | Blood chemistry | Access | Use |
|---|---|---|---|---|
| Generation Scotland (GS) | ~18,000 | Yes, clinical biochemistry | Application to GS DataSafe Haven | Primary external validation |
| MESA | ~1,200 | Yes, extensive | dbGaP application (free) | Cross-ethnic validation |
| Framingham Heart Study | ~2,500 | Yes | dbGaP application (free) | Longitudinal validation |
| Lothian Birth Cohorts (LBC1936) | ~1,000 | Yes | Collaboration with Edinburgh | Age-specific validation |
| KORA cohort | ~1,800 | Yes, German clinical | Collaboration with Helmholtz | European replication |
The Indian cohort angle (our unique contribution):
Antiaging Labs clients, even a small cohort of 30–50, give us something no competing lab has: Indian urban professional blood chemistry paired with biological age calculation. After the main model, we validate on this cohort. Cross-ancestry generalizability writes itself as a finding. Most epigenetic clock validation is 95% European-ancestry cohorts.
| Model | Purpose |
|---|---|
| PhenoAge (Levine 2018) | Existing clinical standard, 9 markers, linear |
| Ridge/ElasticNet on all features | Shows linear benefit of more features |
| Random Forest | Tree ensemble baseline |
| XGBoost / LightGBM | Strong GBDT baseline, hardest to beat |
| MLP (vanilla deep net) | Shows basic DL doesn't automatically win |
| FT-Transformer (Gorishniy 2021) | Best off-the-shelf tabular transformer |
| TabNet (Arik & Pfister 2019) | Interpretable attention baseline |
Core design principles:
Architecture in full:
class BioAgeTransformer(nn.Module):
"""
Multi-task tabular transformer for surrogate epigenetic clock prediction.
Predicts GrimAge, DunedinPACE, Horvath, and PhenoAge-DNAm
from standard clinical blood chemistry panels.
"""
def __init__(
self,
n_features: int, # ~42 blood markers
d_model: int = 192,
n_heads: int = 8,
n_layers: int = 6,
n_clocks: int = 4,
n_pathways: int = 6, # metabolic, lipid, inflammation,
# hormonal, CBC, nutrients
dropout: float = 0.1,
):
super().__init__()
# ── Feature tokenization ──────────────────────────────────────
self.feature_embedding = nn.Embedding(n_features, d_model)
self.value_projector = nn.Linear(1, d_model)
# Learnable missing-value token (one per feature, not global)
self.missing_tokens = nn.Parameter(torch.randn(n_features, d_model))
# ── Biological pathway embeddings ────────────────────────────
self.pathway_embedding = nn.Embedding(n_pathways, d_model)
# ── [CLS] token ───────────────────────────────────────────────
self.cls_token = nn.Parameter(torch.randn(1, 1, d_model))
# ── Transformer encoder ───────────────────────────────────────
encoder_layer = nn.TransformerEncoderLayer(
d_model=d_model, nhead=n_heads,
dim_feedforward=4 * d_model, dropout=dropout,
activation='gelu', batch_first=True, norm_first=True,
)
self.transformer = nn.TransformerEncoder(encoder_layer, n_layers)
# ── Shared projection head ─────────────────────────────────────
self.shared_head = nn.Sequential(
nn.LayerNorm(d_model), nn.Linear(d_model, 128),
nn.GELU(), nn.Dropout(dropout),
)
# ── Per-clock output heads (mean + log_variance) ──────────────
self.clock_heads = nn.ModuleList([
nn.Sequential(nn.Linear(128, 64), nn.GELU(), nn.Linear(64, 2))
for _ in range(n_clocks)
])
# ── Auxiliary head: chronological age ────────────────────────
self.age_head = nn.Linear(128, 1)
def forward(self, x: Tensor, mask: Tensor = None):
B = x.shape[0]
feat_ids = torch.arange(x.shape[1], device=x.device)
feat_emb = self.feature_embedding(feat_ids)
val_emb = self.value_projector(x.unsqueeze(-1))
tokens = feat_emb.unsqueeze(0) + val_emb
if mask is not None:
missing_emb = self.missing_tokens.unsqueeze(0)
tokens = torch.where(mask.unsqueeze(-1).expand_as(tokens),
missing_emb.expand(B, -1, -1), tokens)
pathway_emb = self.pathway_embedding(self.feature_to_pathway)
tokens = tokens + pathway_emb.unsqueeze(0)
cls = self.cls_token.expand(B, -1, -1)
tokens = torch.cat([cls, tokens], dim=1)
out = self.transformer(tokens)
shared = self.shared_head(out[:, 0])
clock_preds = [head(shared) for head in self.clock_heads]
age_pred = self.age_head(shared).squeeze(-1)
return clock_preds, age_pred
Training loss:
def gaussian_nll(pred, target):
mean, log_var = pred[:, 0], pred[:, 1].clamp(-6, 6)
return (0.5 * (log_var + (target - mean)**2 / log_var.exp())).mean()
def total_loss(clock_preds, clock_targets, age_pred, age_target,
clock_weights=(1.0, 1.0, 0.8, 0.8), lambda_age=0.1):
clock_loss = sum(w * gaussian_nll(p, t)
for w, p, t in zip(clock_weights, clock_preds, clock_targets))
return clock_loss + lambda_age * F.mse_loss(age_pred, age_target)
Run all models on same train/val/test split (80/10/10, stratified by age decade and sex). Metrics: MAE, R², Pearson r, Spearman ρ per model × clock. Target for GrimAge: r > 0.80 (prior blood-based approaches achieve ~0.65–0.75).
Start with all 42 features. Remove in reverse order of SHAP importance. Plot MAE vs. panel size. Key finding to demonstrate: "A 20-marker panel achieves 95% of full-panel performance. The 9-marker PhenoAge panel achieves only 78%." This is the clinical deployment argument.
Introduce missingness at 5%, 10%, 20%, 30%, 50%, 70% under MCAR, MAR, and MNAR mechanisms. Compare learned token approach vs. mean imputation, iterative imputation, KNN, and GAIN. Expected result: learned token outperforms all imputation strategies at all missingness levels.
Train 4 single-task models vs. BioAge-Transformer. Expected result: multi-task model outperforms all single-task models, especially on DunedinPACE (hardest clock to predict from blood alone).
Reliability diagram: for each 10th-percentile prediction interval, what fraction of true values fall inside? Report Expected Calibration Error (ECE) per model. Clinical framing: our model's 90% intervals should contain the true value 90% of the time.
Train on UK Biobank (~40K). Validate on Generation Scotland (~3K), MESA (~1.2K), Framingham (~2K). Expect 5–15% MAE increase on external cohorts. Show it's not catastrophic.
Apply model to participants with repeat blood draws (Framingham, LBC). Does predicted biological age increase at the expected rate? Does it rise faster in people who develop disease?
Attention weight heatmap per head. SHAP values per feature per clock. Expected biological story: GrimAge dominated by albumin, inflammation, ApoA; DunedinPACE dominated by metabolic markers. This section should read like biology, not ML.
All metrics stratified by sex, age decade, BMI, smoking status, ancestry. Check for heterogeneous performance. Investigate and fix, or disclose clearly.
Directly compare against Lee et al. 2023 (Aging), Bae et al. 2021 (npj Digital Medicine), Putin et al. 2016 (Aging). Run their models on our test set. Show superiority directly.
Figure 1, Architecture: Clean schematic of the full BioAge-Transformer. Pathway-color-coded. Should be readable by a non-ML clinician.
Figure 2, Main results: Grouped bar charts, MAE per model per clock, 95% bootstrap CIs.
Figure 3, Feature minimization curve: x = panel size, y = normalized MAE. Mark the "20-marker elbow." Add cost axis on right.
Figure 4, Calibration: Reliability diagram. Our model follows the diagonal; others don't.
Figure 5, Interpretability: (A) Attention heatmap by pathway. (B) SHAP dot plot, top 15 features per clock.
Figure 6, External validation: Performance on UKB test set vs. each external cohort, with error bars.
Weeks 1–2: Apply for UK Biobank access (critical, 2–4 month wait). Apply MESA via dbGaP. Download NHANES (free, immediate). Set up GPU compute.
Weeks 3–6: Build full data pipeline on NHANES + GEO data while waiting for UKB. Implement all baseline models.
Weeks 7–12: UK Biobank access arrives. Calculate clock targets from methylation data in R. Extract and join blood chemistry features. Run baselines.
Weeks 13–18: Implement BioAge-Transformer in PyTorch. Optuna hyperparameter sweep (200 trials). Run all 10 experiments. Log everything in W&B.
Weeks 19–24: External validation. Interpretability analysis. Statistical analysis (bootstrap CIs). Supplementary material.
Weeks 25–30: Paper writing in LaTeX. Figure production (publication-quality matplotlib/seaborn). Submit to npj Digital Medicine.
| Task | Hardware | Time estimate |
|---|---|---|
| Baseline models | CPU, 16GB RAM | 2–4 hours |
| BioAge-Transformer training | 1× A100 (40GB) | 6–12 hours per run |
| Hyperparameter sweep (200 trials) | 4× A100 parallel | 2–3 days |
| Interpretability (SHAP on 40K) | GPU recommended | 4–8 hours |
GCP a2-highgpu-1g (1× A100) = ~$3.67/hr. Full project: budget ~$500–800 compute.
Primary target: npj Digital Medicine (Nature Portfolio), IF ~15, scope exactly this, open access, 4–8 weeks to first decision.
Secondary: Nature Communications, broader scope, slightly harder.
Tertiary: Aging (Albany NY), home journal of most clock papers, IF ~4–6, more likely if external validation is limited.
Do not target Nature Medicine, Cell, or Lancet Digital Health on first submission.
"To demonstrate real-world clinical utility, we apply BioAge-Transformer to a prospectively collected cohort of n=X urban Indian professionals enrolled in the Antiaging Labs longevity program (Hyderabad, India). This cohort differs from UK Biobank in ancestry, diet, and metabolic risk profile. Despite these differences, BioAge-Transformer [maintains performance / shows X% performance drop], suggesting [generalizability beyond European-ancestry populations / the need for fine-tuning in South Asian populations]. Notably, n=Y clients had repeat blood draws at 90-day intervals following a structured lifestyle intervention, the first demonstration of surrogate clock tracking over a short-term intervention period."
Even n=20 paired before/after samples is a publishable finding if the surrogate clock moves with the intervention. Apply to an Indian IRB before collecting data for research purposes.
| Priority | Task | Time |
|---|---|---|
| 🔴 Critical | Apply for UK Biobank access | 3 hours |
| 🔴 Critical | Download NHANES data, start feature pipeline | 4 hours |
| 🟠 High | Apply for MESA via dbGaP | 1 hour |
| 🟠 High | Set up GPU compute environment (GCP/AWS) | 2 hours |
| 🟠 High | Read the 8 papers above | This week |
| 🟡 Medium | Set up GitHub repo (public, MIT license) | 1 hour |
| 🟡 Medium | Apply to Indian IRB for Antiaging Labs data | 2 hours |
| 🟢 Later | Generation Scotland collaboration email | 1 hour |
The UK Biobank application is the hardest blocker. Start today.