• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

FeSTwo,一种基于特征工程和采样的两步特征选择算法,用于解决年龄回归问题。

FeSTwo, a two-step feature selection algorithm based on feature engineering and sampling for the chronological age regression problem.

作者信息

Wei Zhipeng, Ding Shiying, Duan Meiyu, Liu Shuai, Huang Lan, Zhou Fengfeng

机构信息

Health Informatics Lab, College of Computer Science and Technology, and Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education, Jilin University, Changchun, Jilin, 130012, China.

Health Informatics Lab, College of Software, and Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education, Jilin University, Changchun, Jilin, 130012, China.

出版信息

Comput Biol Med. 2020 Oct;125:104008. doi: 10.1016/j.compbiomed.2020.104008. Epub 2020 Sep 26.

DOI:10.1016/j.compbiomed.2020.104008
PMID:33035960
Abstract

Accurate determination of the sample's chronological age is an important forensic problem. This regression problem may be improved by selecting appropriate methylomic features. Most of the existing feature selection algorithms, however, optimize the regression performance by considering only the original features. This study proposed four feature engineering strategies to transform the original methylomic features. The regression performance of the age regression model was improved by the resampling-based feature selection algorithm FeSTwo proposed in this study. FeSTwo outperformed the parallel algorithms used in the previous studies even with the electronic health record data. The age prediction performance of the FeSTwo-detected features was also confirmed for another independent dataset. The study results demonstrated that the proposed model, FeSTwo, led to a more than 8% reduction in root-mean-square error (RMSE) on the test dataset with only 70 features.

摘要

准确测定样本的年代年龄是一个重要的法医学问题。通过选择合适的甲基化组特征,这个回归问题可能会得到改善。然而,大多数现有的特征选择算法仅通过考虑原始特征来优化回归性能。本研究提出了四种特征工程策略来转换原始甲基化组特征。本研究提出的基于重采样的特征选择算法FeSTwo提高了年龄回归模型的回归性能。即使使用电子健康记录数据,FeSTwo也优于先前研究中使用的并行算法。对于另一个独立数据集,也证实了FeSTwo检测到的特征的年龄预测性能。研究结果表明,所提出的模型FeSTwo在仅使用70个特征的测试数据集上使均方根误差(RMSE)降低了8%以上。

相似文献

1
FeSTwo, a two-step feature selection algorithm based on feature engineering and sampling for the chronological age regression problem.FeSTwo,一种基于特征工程和采样的两步特征选择算法,用于解决年龄回归问题。
Comput Biol Med. 2020 Oct;125:104008. doi: 10.1016/j.compbiomed.2020.104008. Epub 2020 Sep 26.
2
AgeGuess, a Methylomic Prediction Model for Human Ages.AgeGuess,一种用于预测人类年龄的甲基化组学模型。
Front Bioeng Biotechnol. 2020 Mar 10;8:80. doi: 10.3389/fbioe.2020.00080. eCollection 2020.
3
Modified Bat Algorithm for Feature Selection with the Wisconsin Diagnosis Breast Cancer (WDBC) Dataset.基于威斯康星州诊断乳腺癌(WDBC)数据集的特征选择改进蝙蝠算法
Asian Pac J Cancer Prev. 2017 May 1;18(5):1257-1264. doi: 10.22034/APJCP.2017.18.5.1257.
4
Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms.动物园:通过集成受动物启发的群体智能特征选择算法来选择转录组学和甲基组学生物标志物。
Genes (Basel). 2021 Nov 18;12(11):1814. doi: 10.3390/genes12111814.
5
A universal deep learning approach for modeling the flow of patients under different severities.一种通用的深度学习方法,用于对不同严重程度的患者进行建模。
Comput Methods Programs Biomed. 2018 Feb;154:191-203. doi: 10.1016/j.cmpb.2017.11.003. Epub 2017 Nov 7.
6
An OMIC biomarker detection algorithm TriVote and its application in methylomic biomarker detection.一种 OMIC 生物标志物检测算法 TriVote 及其在甲基化组生物标志物检测中的应用。
Epigenomics. 2018 Apr;10(4):335-347. doi: 10.2217/epi-2017-0097. Epub 2018 Jan 19.
7
A Feature and Algorithm Selection Method for Improving the Prediction of Protein Structural Class.一种用于改进蛋白质结构类预测的特征与算法选择方法
Comb Chem High Throughput Screen. 2017;20(7):612-621. doi: 10.2174/1386207320666170314103147.
8
Chronological age prediction based on DNA methylation: Massive parallel sequencing and random forest regression.基于DNA甲基化的年龄预测:大规模平行测序与随机森林回归
Forensic Sci Int Genet. 2017 Nov;31:19-28. doi: 10.1016/j.fsigen.2017.07.015. Epub 2017 Aug 1.
9
An Efficient Feature Subset Selection Algorithm for Classification of Multidimensional Dataset.一种用于多维数据集分类的高效特征子集选择算法。
ScientificWorldJournal. 2015;2015:821798. doi: 10.1155/2015/821798. Epub 2015 Sep 28.
10
Multi-objective Evolutionary Approach for the Performance Improvement of Learners using Ensembling Feature Selection and Discretization Technique on Medical Data.基于集成特征选择和离散化技术的医学数据中学习者性能改进的多目标进化方法。
Curr Med Imaging. 2020;16(4):355-370. doi: 10.2174/1573405614666180903114534.

引用本文的文献

1
ResnetAge: A Resnet-Based DNA Methylation Age Prediction Method.ResnetAge:一种基于Resnet的DNA甲基化年龄预测方法。
Bioengineering (Basel). 2023 Dec 28;11(1):34. doi: 10.3390/bioengineering11010034.
2
A voting-based machine learning approach for classifying biological and clinical datasets.基于投票的机器学习方法在生物和临床数据集分类中的应用。
BMC Bioinformatics. 2023 Apr 11;24(1):140. doi: 10.1186/s12859-023-05274-4.
3
A polygenic stacking classifier revealed the complicated platelet transcriptomic landscape of adult immune thrombocytopenia.
一种多基因叠加分类器揭示了成人免疫性血小板减少症复杂的血小板转录组图谱。
Mol Ther Nucleic Acids. 2022 Apr 6;28:477-487. doi: 10.1016/j.omtn.2022.04.004. eCollection 2022 Jun 14.
4
Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms.动物园:通过集成受动物启发的群体智能特征选择算法来选择转录组学和甲基组学生物标志物。
Genes (Basel). 2021 Nov 18;12(11):1814. doi: 10.3390/genes12111814.