AI-powered LifeClock forecasts diseases years before symptoms appear

Using transformer-based AI, scientists created a full life-cycle biological clock that predicts future disease risk and uncovers the separate biological rhythms of growth and aging.

Study: A full life cycle biological clock based on routine clinical data and its impact in health and diseases. Image Credit: vectorfusionart / Shutterstock

Study: A full life cycle biological clock based on routine clinical data and its impact in health and diseases. Image Credit: vectorfusionart / Shutterstock

In a recent study in the journal Nature Medicine, researchers describe and validate a novel artificial intelligence (AI) model trained to use routine electronic health records (EHRs) to determine biological age across the entire human lifespan.

The model, named "LifeClock," identified two distinct clocks, one for pediatric development and another for adult aging, and could predict the risk of major diseases years before their occurrence. This framework offers a robust, low-cost tool for advancing precision health, accessibility, and personalized medicine.

Background

Chronological age, the number of years we have been alive, has long been leveraged as the benchmark of an individual's chronic, non-communicable disease risk. However, modern research has increasingly focused on biological age (BA), a measure of the body's accumulated damage and functional decline compared to the average.

Recent research has revealed the latter metric as a far better predictor of disease risk and mortality, as two people of the same chronological age can have vastly different health profiles due to combinations of genetics and lifestyle.

Early methods for estimating BA relied on complex and expensive molecular data (e.g., DNA methylation patterns). While effective, these "aging clocks" were often limited in scope.

A critical gap in research was the lack of a biological clock that could span the entire human life cycle, particularly the crucial stages of infancy and childhood. Physiological changes during these periods were found to represent scripted development rather than aging-related damage. 

Research, therefore, seeks a more accessible way to monitor health trajectories from birth onward using widely available data such as electronic health records (EHR) records.

About the Study

The present study addressed this knowledge gap by introducing "LifeClock", a biological clock built on a powerful transformer-based artificial intelligence (AI) model called EHRFormer, which uses input-output dual stochastic masking to handle sparse data, adversarial training to eliminate batch effects, and an autoregressive design for longitudinal prediction.

The model was trained using an extensive dataset from the China Health Aging Investigation (CHAI) project. The dataset comprised 24.6 million longitudinal clinical visits from 9.6 million unique individuals. This longitudinal data, which tracks patients over time, included 184 routine clinical indicators such as laboratory test results and vital signs.

The EHRFormer model was designed to create a "digital representation" of health by analyzing the sequence of clinical visits and their associated routine EHR records. The model's architecture included sophisticated strategies to address common challenges in medical data, such as imputing missing values and eliminating batch effects (variations across hospitals or equipment). The model was trained on data from healthy individuals to establish a baseline for normal development and aging.

Finally, the study evaluated model performance on separate internal and external datasets, including the UK Biobank, thereby ensuring its predictions were robust and generalizable across different populations.

Study Findings

The study's analyses revealed two distinct and separate biological clocks: (1) a "development clock" for individuals under 18, and (2) an "aging clock" for adults. Training specialized models for each phase significantly improved prediction accuracy, underscoring distinct biological processes during development versus aging.

The biomarkers driving these clocks were almost entirely different. The pediatric clock was strongly influenced by markers related to growth, such as high creatinine and total protein levels. In contrast, the adult clock was found to be driven by indicators of age-related decline, including high urea, low albumin, and high red cell distribution width (RDW).

Encouragingly, both clocks proved highly effective at predicting disease risk. The pediatric clock accurately forecasted the future risk of conditions such as malnutrition, growth, and developmental abnormalities (including growth hormone deficiency).

For example, an analysis of EHR data from children under 12 could predict which individuals in Cluster 14 were at a higher risk for developing pituitary hyperfunction (15.36 times higher risk) and obesity (11.07 times higher risk) later in childhood.

The adult clock similarly proved to accurately predict the risk of major age-related diseases, including diabetes (type 2 diabetes (T2D)), stroke, renal failure, and cardiovascular disease (CVD), with specific clusters showing dramatically elevated risks (e.g., Cluster 20 had 37.7 times higher renal failure risk).

Critically, the model distinctly supported both diagnosing current diseases and predicting future risks: after fine-tuning, it achieved an area under the curve (AUC) of 0.98 for diabetes diagnosis and 0.91 for future diabetes prediction. Furthermore, EHRFormer outperformed traditional models (RNN and XGBoost) in both tasks.

Conclusions

The present study successfully demonstrates that EHRFormer can be leveraged to generate a powerful biological clock, constructed from routine, widely available, and low-cost EHR data.

LifeClock provides a novel framework for understanding the distinct processes of pediatric development and adult aging. The model moves beyond simple chronological age assessment to offer a more dynamic and precise picture of an individual's health.

By identifying at-risk individuals years before symptoms appear, this technology holds the potential to revolutionize preventive medicine and guide personalized interventions. 

Future work may potentially involve integrating data from wearable devices and other real-time biometric and health data sources to create an even more adaptive and accurate system for promoting healthy aging.

Journal reference:
  • Wang, K., Liu, F., Wu, W., Hu, C., Shen, X., Wang, M., Li, G., Zeng, F., Liu, L., Wong, I. N., Liu, S., Zou, Z., Li, B., Li, J., Huang, X., Jin, S., Li, Z., Xu, H., Chen, G., Chen, X. (2025). A full life cycle biological clock based on routine clinical data and its impact in health and diseases. Nature Medicine. DOI – 10.1038/s41591-025-04006-w, https://www.nature.com/articles/s41591-025-04006-w
Hugo Francisco de Souza

Written by

Hugo Francisco de Souza

Hugo Francisco de Souza is a scientific writer based in Bangalore, Karnataka, India. His academic passions lie in biogeography, evolutionary biology, and herpetology. He is currently pursuing his Ph.D. from the Centre for Ecological Sciences, Indian Institute of Science, where he studies the origins, dispersal, and speciation of wetland-associated snakes. Hugo has received, amongst others, the DST-INSPIRE fellowship for his doctoral research and the Gold Medal from Pondicherry University for academic excellence during his Masters. His research has been published in high-impact peer-reviewed journals, including PLOS Neglected Tropical Diseases and Systematic Biology. When not working or writing, Hugo can be found consuming copious amounts of anime and manga, composing and making music with his bass guitar, shredding trails on his MTB, playing video games (he prefers the term ‘gaming’), or tinkering with all things tech.

Citations

Please use one of the following formats to cite this article in your essay, paper or report:

  • APA

    Francisco de Souza, Hugo. (2025, October 28). AI-powered LifeClock forecasts diseases years before symptoms appear. News-Medical. Retrieved on October 28, 2025 from https://www.news-medical.net/news/20251028/AI-powered-LifeClock-forecasts-diseases-years-before-symptoms-appear.aspx.

  • MLA

    Francisco de Souza, Hugo. "AI-powered LifeClock forecasts diseases years before symptoms appear". News-Medical. 28 October 2025. <https://www.news-medical.net/news/20251028/AI-powered-LifeClock-forecasts-diseases-years-before-symptoms-appear.aspx>.

  • Chicago

    Francisco de Souza, Hugo. "AI-powered LifeClock forecasts diseases years before symptoms appear". News-Medical. https://www.news-medical.net/news/20251028/AI-powered-LifeClock-forecasts-diseases-years-before-symptoms-appear.aspx. (accessed October 28, 2025).

  • Harvard

    Francisco de Souza, Hugo. 2025. AI-powered LifeClock forecasts diseases years before symptoms appear. News-Medical, viewed 28 October 2025, https://www.news-medical.net/news/20251028/AI-powered-LifeClock-forecasts-diseases-years-before-symptoms-appear.aspx.

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News Medical.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.