Suppr超能文献

1型糖尿病儿童和青少年胰岛素基因启动子的甲基化单倍型:降维方法能否预测该疾病?

Methylation haplotypes of the insulin gene promoter in children and adolescents with type 1 diabetes: Can a dimensionality reduction approach predict the disease?

作者信息

Kotanidou Eleni P, Kosvyra Alexandra, Mouzaki Konstantina, Giza Styliani, Tsinopoulou Vasiliki Rengina, Serbis Anastasios, Chouvarda Ioanna, Galli-Tsinopoulou Assimina

机构信息

Second Department of Pediatrics, Unit of Pediatric Endocrinology and Metabolism, Faculty of Health Sciences, School of Medicine, Aristotle University of Thessaloniki, AHEPA University Hospital, 54636 Thessaloniki, Greece.

Laboratory of Computing, Medical Informatics and Biomedical Imaging Technologies, Faculty of Health Sciences, School of Medicine, Aristotle University of Thessaloniki, 54636 Thessaloniki, Greece.

出版信息

Exp Ther Med. 2023 Aug 8;26(4):461. doi: 10.3892/etm.2023.12160. eCollection 2023 Oct.

Abstract

DNA methylation of cytosine-guanine sites (CpGs) is associated with type 1 diabetes (T1D). The sequence of methylated and non-methylated sites in a specific genetic region constitutes its methyl-haplotype. The aim of the present study was to identify insulin gene promoter (IGP) methyl-haplotypes among children and adolescents with T1D and suggest a predictive model for the discrimination of cases and controls according to methyl-haplotypes. A total of 40 individuals (20 T1D) participated. The IGP region from peripheral whole blood DNA of 40 participants (20 T1D) was sequenced using next-generation sequencing, sequences were read using FASTQ files and methylation status was calculated by python-based pipeline for targeted deep bisulfite sequenced amplicons (ampliMethProfiler). Methylation profile at 10 CpG sites proximal to transcription start site of the IGP was recorded and coded as 0 for unmethylation or 1 for methylation. A single read could result in '1111111111' methyl-haplotype (all methylated), '000000000' methyl-haplotype (all unmethylated) or any other combination. Principal component analysis was applied to the generated methyl-haplotypes for dimensionality reduction, and the first three principal components were employed as features with five different classifiers (random forest, decision tree, logistic regression, Naive Bayes, support vector machine). Naive Bayes was the best-performing classifier, with 0.9 accuracy. Predictive models were evaluated using receiver operating characteristics (AUC 0.96). Methyl-haplotypes '1111111111', '1111111011', '1110111111', '1111101111' and '1110101111' were revealed to be the most significantly associated with T1D according to the dimensionality reduction method. Methylation-based biomarkers such as IGP methyl-haplotypes could serve to identify individuals at high risk for T1D.

摘要

胞嘧啶 - 鸟嘌呤位点(CpG)的DNA甲基化与1型糖尿病(T1D)相关。特定基因区域中甲基化和非甲基化位点的序列构成其甲基单倍型。本研究的目的是在患有T1D的儿童和青少年中鉴定胰岛素基因启动子(IGP)甲基单倍型,并根据甲基单倍型提出一种区分病例和对照的预测模型。共有40人(20例T1D患者)参与。使用下一代测序对40名参与者(20例T1D患者)外周全血DNA的IGP区域进行测序,使用FASTQ文件读取序列,并通过基于Python的靶向深度亚硫酸氢盐测序扩增子管道(ampliMethProfiler)计算甲基化状态。记录IGP转录起始位点近端10个CpG位点的甲基化谱,未甲基化编码为0,甲基化编码为1。单个读数可能产生“1111111111”甲基单倍型(全部甲基化)、“000000000”甲基单倍型(全部未甲基化)或任何其他组合。对生成的甲基单倍型应用主成分分析进行降维,并将前三个主成分用作五种不同分类器(随机森林、决策树、逻辑回归、朴素贝叶斯、支持向量机)的特征。朴素贝叶斯是表现最佳的分类器,准确率为0.9。使用受试者工作特征曲线(AUC 0.96)评估预测模型。根据降维方法,甲基单倍型“1111111111”、“1111111011”、“1110111111”、“1111101111”和“1110101111”被发现与T1D最显著相关。基于甲基化的生物标志物如IGP甲基单倍型可用于识别T1D高危个体。

相似文献

本文引用的文献

8
Eigenvectors from Eigenvalues Sparse Principal Component Analysis (EESPCA).来自特征值稀疏主成分分析(EESPCA)的特征向量。
J Comput Graph Stat. 2022;31(2):486-501. doi: 10.1080/10618600.2021.1987254. Epub 2021 Nov 12.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验