• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从 DNA 甲基化数据预测年龄:一种适用于小数据集和有限预测因子的机器学习方法。

Predicting Chronological Age from DNA Methylation Data: A Machine Learning Approach for Small Datasets and Limited Predictors.

机构信息

King's Forensics, Department of Analytical, Environmental and Forensic Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK.

出版信息

Methods Mol Biol. 2022;2432:187-200. doi: 10.1007/978-1-0716-1994-0_14.

DOI:10.1007/978-1-0716-1994-0_14
PMID:35505216
Abstract

Recent research studies using epigenetic data have been exploring whether it is possible to estimate how old someone is using only their DNA. This application stems from the strong correlation that has been observed in humans between the methylation status of certain DNA loci and chronological age. While genome-wide methylation sequencing has been the most prominent approach in epigenetics research, recent studies have shown that targeted sequencing of a limited number of loci can be successfully used for the estimation of chronological age from DNA samples, even when using small datasets. Following this shift, the need to investigate further into the appropriate statistics behind the predictive models used for DNA methylation-based prediction has been identified in multiple studies. This chapter will look into an example of basic data manipulation and modeling that can be applied to small DNA methylation datasets (100-400 samples) produced through targeted methylation sequencing for a small number of predictors (10-25 methylation sites). Data manipulation will focus on converting the obtained methylation values for the different predictors to a statistically meaningful dataset, followed by a basic introduction into importing such datasets in R, as well as randomizing and splitting into appropriate training and test sets for modeling. Finally, a basic introduction to R modeling will be outlined, starting with feature selection algorithms and continuing with a simple modeling example (linear model) as well as a more complex algorithm (Support Vector Machine).

摘要

最近使用表观遗传数据的研究一直在探索是否可以仅通过 DNA 来估计一个人的年龄。这种应用源于在人类中观察到的一个很强的相关性,即某些 DNA 位点的甲基化状态与年龄呈正相关。虽然全基因组甲基化测序一直是表观遗传学研究中最突出的方法,但最近的研究表明,即使使用小数据集,对有限数量的位点进行靶向测序也可以成功地用于从 DNA 样本中估计年龄。随着这种转变,多个研究已经确定需要进一步研究用于基于 DNA 甲基化预测的预测模型背后的适当统计数据。本章将探讨一个适用于通过靶向甲基化测序生成的小型 DNA 甲基化数据集(100-400 个样本)的基本数据操作和建模示例,这些数据集的预测因子数量较少(10-25 个甲基化位点)。数据操作将侧重于将不同预测因子的获得的甲基化值转换为具有统计学意义的数据集,然后介绍在 R 中导入此类数据集的基础知识,以及对建模进行随机化和分割为适当的训练集和测试集。最后,将概述 R 建模的基础知识,从特征选择算法开始,然后继续介绍一个简单的建模示例(线性模型)以及更复杂的算法(支持向量机)。

相似文献

1
Predicting Chronological Age from DNA Methylation Data: A Machine Learning Approach for Small Datasets and Limited Predictors.从 DNA 甲基化数据预测年龄:一种适用于小数据集和有限预测因子的机器学习方法。
Methods Mol Biol. 2022;2432:187-200. doi: 10.1007/978-1-0716-1994-0_14.
2
DNA methylation-based age prediction using massively parallel sequencing data and multiple machine learning models.基于大规模平行测序数据和多种机器学习模型的 DNA 甲基化年龄预测。
Forensic Sci Int Genet. 2018 Nov;37:215-226. doi: 10.1016/j.fsigen.2018.09.003. Epub 2018 Sep 8.
3
Recalibrating the epigenetic clock: implications for assessing biological age in the human cortex.重新校准表观遗传时钟:评估人类大脑皮层生物学年龄的意义。
Brain. 2020 Dec 1;143(12):3763-3775. doi: 10.1093/brain/awaa334.
4
Chronological age prediction based on DNA methylation: Massive parallel sequencing and random forest regression.基于DNA甲基化的年龄预测:大规模平行测序与随机森林回归
Forensic Sci Int Genet. 2017 Nov;31:19-28. doi: 10.1016/j.fsigen.2017.07.015. Epub 2017 Aug 1.
5
Development of a model for the prediction of biological age.生物年龄预测模型的建立。
Comput Methods Programs Biomed. 2023 Oct;240:107686. doi: 10.1016/j.cmpb.2023.107686. Epub 2023 Jun 24.
6
EpiTEAmDNA: Sequence feature representation via transfer learning and ensemble learning for identifying multiple DNA epigenetic modification types across species.EpiTEAmDNA:通过迁移学习和集成学习进行序列特征表示,以跨物种识别多种 DNA 表观遗传修饰类型。
Comput Biol Med. 2023 Jun;160:107030. doi: 10.1016/j.compbiomed.2023.107030. Epub 2023 May 11.
7
Male-specific age estimation based on Y-chromosomal DNA methylation.基于 Y 染色体 DNA 甲基化的男性特异性年龄估计。
Aging (Albany NY). 2021 Mar 11;13(5):6442-6458. doi: 10.18632/aging.202775.
8
Evaluation of marker selection methods and statistical models for chronological age prediction based on DNA methylation.基于DNA甲基化的年龄预测中标记选择方法和统计模型的评估
Leg Med (Tokyo). 2020 Nov;47:101744. doi: 10.1016/j.legalmed.2020.101744. Epub 2020 Jul 1.
9
Epigenome-wide cross-tissue predictive modeling and comparison of cord blood and placental methylation in a birth cohort.出生队列中全表观基因组跨组织预测建模及脐血与胎盘甲基化比较
Epigenomics. 2017 Mar;9(3):231-240. doi: 10.2217/epi-2016-0109. Epub 2017 Feb 17.
10
A Novel Computational Method for Detecting DNA Methylation Sites with DNA Sequence Information and Physicochemical Properties.一种基于 DNA 序列信息和理化性质的新型 DNA 甲基化位点检测计算方法。
Int J Mol Sci. 2018 Feb 8;19(2):511. doi: 10.3390/ijms19020511.

引用本文的文献

1
Development of a novel forensic age estimation strategy for aged blood samples by combining piRNA and miRNA markers.通过结合 piRNA 和 miRNA 标志物开发新型法医年龄估计策略,用于陈旧血样。
Int J Legal Med. 2023 Sep;137(5):1327-1335. doi: 10.1007/s00414-023-03028-8. Epub 2023 Jun 1.

本文引用的文献

1
DNA methylation of the ELOVL2, FHL2, KLF14, C1orf132/MIR29B2C, and TRIM59 genes for age prediction from blood, saliva, and buccal swab samples.从血液、唾液和口腔拭子样本中预测年龄的 ELOVL2、FHL2、KLF14、C1orf132/MIR29B2C 和 TRIM59 基因的 DNA 甲基化。
Forensic Sci Int Genet. 2019 Jan;38:1-8. doi: 10.1016/j.fsigen.2018.09.010. Epub 2018 Sep 29.
2
DNA methylation-based age prediction using massively parallel sequencing data and multiple machine learning models.基于大规模平行测序数据和多种机器学习模型的 DNA 甲基化年龄预测。
Forensic Sci Int Genet. 2018 Nov;37:215-226. doi: 10.1016/j.fsigen.2018.09.003. Epub 2018 Sep 8.
3
Proof of concept study of age-dependent DNA methylation markers across different tissues by massive parallel sequencing.
通过大规模平行测序研究不同组织中与年龄相关的 DNA 甲基化标记的概念验证研究。
Forensic Sci Int Genet. 2018 Sep;36:152-159. doi: 10.1016/j.fsigen.2018.07.007. Epub 2018 Jul 7.
4
Tracking age-correlated DNA methylation markers in the young.追踪年轻人群中与年龄相关的 DNA 甲基化标记物
Forensic Sci Int Genet. 2018 Sep;36:50-59. doi: 10.1016/j.fsigen.2018.06.011. Epub 2018 Jun 13.
5
Systematic feature selection improves accuracy of methylation-based forensic age estimation in Han Chinese males.系统的特征选择可提高汉族男性基于甲基化的法医年龄估计的准确性。
Forensic Sci Int Genet. 2018 Jul;35:38-45. doi: 10.1016/j.fsigen.2018.03.009. Epub 2018 Mar 23.
6
Age Estimation with DNA: From Forensic DNA Fingerprinting to Forensic (Epi)Genomics: A Mini-Review.基于 DNA 的年龄估计:从法医 DNA 指纹分析到法医(表观)基因组学:一个迷你综述。
Gerontology. 2018;64(4):326-332. doi: 10.1159/000486239. Epub 2018 Jan 23.
7
Forensic DNA methylation profiling from minimal traces: How low can we go?从微量痕迹进行法医 DNA 甲基化分析:我们能做到多低?
Forensic Sci Int Genet. 2018 Mar;33:17-23. doi: 10.1016/j.fsigen.2017.11.004. Epub 2017 Nov 13.
8
Chronological age prediction based on DNA methylation: Massive parallel sequencing and random forest regression.基于DNA甲基化的年龄预测:大规模平行测序与随机森林回归
Forensic Sci Int Genet. 2017 Nov;31:19-28. doi: 10.1016/j.fsigen.2017.07.015. Epub 2017 Aug 1.
9
Independent validation of DNA-based approaches for age prediction in blood.基于DNA的血液年龄预测方法的独立验证。
Forensic Sci Int Genet. 2017 Jul;29:250-256. doi: 10.1016/j.fsigen.2017.04.020. Epub 2017 Apr 28.
10
DNA methylation-based age prediction from saliva: High age predictability by combination of 7 CpG markers.基于唾液DNA甲基化的年龄预测:通过7个CpG标记物组合实现高年龄预测性
Forensic Sci Int Genet. 2017 Jul;29:118-125. doi: 10.1016/j.fsigen.2017.04.006. Epub 2017 Apr 9.