Suppr超能文献

使用随机森林模型根据血液检测结果进行年龄估计。

Age Estimation From Blood Test Results Using a Random Forest Model.

作者信息

Kodera Satomi, Yokoi Osamu, Kaneko Masaki, Sato Yuka, Ito Susumu, Hata Katsuhiko

机构信息

KYB Medical Service Co., LTD, Tokyo, Japan.

Department of Neuroscience, Research Centre for Mathematical Medicine, Tokyo, Japan.

出版信息

J Clin Lab Anal. 2025 Jul;39(14):e70064. doi: 10.1002/jcla.70064. Epub 2025 Jun 12.

Abstract

BACKGROUND AND OBJECTIVES

From a preventive medicine perspective, this study aims to clarify the role of screening data in aging and health problems by estimating age from screening data and verifying the number of data items required in widely used screening tests.

MATERIALS AND METHODS

A random forest model was applied to 11554 men and women (3043 and 8511, respectively) aged 0-95 years who underwent screening tests (60 blood tests, 8 urine tests and 2 saliva tests) between February 2020 and August 2023. All analyses were conducted in Python 3.10.12.

RESULTS

Using all 71 items including gender, a high accuracy of R  = 0.7010 was achieved with 9243 training datasets (80% of total). R decreased slightly to 0.6937 when data items were reduced to 15 by removing less important variables. When datasets numbered fewer than 800 or data items fewer than 7, R fell below 0.6. Notably, postmenopausal women tended to have higher estimated ages compared to premenopausal women.

CONCLUSIONS

Age estimation from blood data using the random forest model (blood age) is sufficiently precise for assessing physical aging state. Blood age, as well as other biological ages estimated from various omics estimators, was shown to be a very promising method for exploring the problems of aging such as metabolic syndrome and frail syndrome.

摘要

背景与目的

从预防医学的角度来看,本研究旨在通过从筛查数据中估计年龄并验证广泛使用的筛查测试所需的数据项数量,来阐明筛查数据在衰老和健康问题中的作用。

材料与方法

将随机森林模型应用于2020年2月至2023年8月期间接受筛查测试(60项血液测试、8项尿液测试和2项唾液测试)的11554名0至95岁的男性和女性(分别为3043名和8511名)。所有分析均在Python 3.10.12中进行。

结果

使用包括性别在内的所有71项数据,9243个训练数据集(占总数的80%)实现了较高的准确率,R = 0.7010。通过去除不太重要的变量将数据项减少到15项时,R略有下降至0.6937。当数据集数量少于800或数据项少于7项时,R降至0.6以下。值得注意的是,绝经后女性的估计年龄往往比绝经前女性更高。

结论

使用随机森林模型从血液数据估计年龄(血液年龄)对于评估身体衰老状态足够精确。血液年龄以及从各种组学估计器估计的其他生物学年龄,被证明是探索诸如代谢综合征和衰弱综合征等衰老问题的非常有前景的方法。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验