• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PerSEveML:一种基于网络的工具,用于使用集成机器学习方法识别罕见事件的持久性生物标志物结构。

PerSEveML: a web-based tool to identify persistent biomarker structure for rare events using an integrative machine learning approach.

机构信息

Department of Biostatistics & Data Science, University of Kansas Medical Center, Kansas City, Kansas, USA.

University of Kansas Cancer Center, Kansas City, USA.

出版信息

Mol Omics. 2024 Jun 10;20(5):348-358. doi: 10.1039/d4mo00008k.

DOI:10.1039/d4mo00008k
PMID:38690925
Abstract

Omics data sets often pose a computational challenge due to their high dimensionality, large size, and non-linear structures. Analyzing these data sets becomes especially daunting in the presence of rare events. Machine learning (ML) methods have gained traction for analyzing rare events, yet there has been limited exploration of bioinformatics tools that integrate ML techniques to comprehend the underlying biology. Expanding upon our previously developed computational framework of an integrative machine learning approach, we introduce PerSEveML, an interactive web-based tool that uses crowd-sourced intelligence to predict rare events and determine feature selection structures. PerSEveML provides a comprehensive overview of the integrative approach through evaluation metrics that help users understand the contribution of individual ML methods to the prediction process. Additionally, PerSEveML calculates entropy and rank scores, which visually organize input features into a persistent structure of selected, unselected, and fluctuating categories that help researchers uncover meaningful hypotheses regarding the underlying biology. We have evaluated PerSEveML on three diverse biologically complex data sets with extremely rare events from small to large scale and have demonstrated its ability to generate valid hypotheses. PerSEveML is available at https://biostats-shinyr.kumc.edu/PerSEveML/ and https://github.com/sreejatadutta/PerSEveML.

摘要

组学数据集由于其高维性、大数据量和非线性结构,常常带来计算上的挑战。在稀有事件存在的情况下,分析这些数据集尤其具有挑战性。机器学习 (ML) 方法已被广泛应用于分析稀有事件,但对于集成 ML 技术以理解潜在生物学的生物信息学工具的探索还很有限。在我们之前开发的综合机器学习方法的计算框架的基础上,我们引入了 PerSEveML,这是一个基于网络的交互式工具,利用众包智能来预测稀有事件并确定特征选择结构。PerSEveML 通过评估指标提供了综合方法的全面概述,帮助用户了解单个 ML 方法对预测过程的贡献。此外,PerSEveML 计算熵和排名分数,将输入特征以选择、未选择和波动类别的持久结构进行可视化组织,帮助研究人员发现有关潜在生物学的有意义的假设。我们已经在三个具有小到大规模的极其稀有事件的不同生物复杂性数据集上评估了 PerSEveML,并展示了其生成有效假设的能力。PerSEveML 可在 https://biostats-shinyr.kumc.edu/PerSEveML/ 和 https://github.com/sreejatadutta/PerSEveML 上获得。

相似文献

1
PerSEveML: a web-based tool to identify persistent biomarker structure for rare events using an integrative machine learning approach.PerSEveML:一种基于网络的工具,用于使用集成机器学习方法识别罕见事件的持久性生物标志物结构。
Mol Omics. 2024 Jun 10;20(5):348-358. doi: 10.1039/d4mo00008k.
2
PerSEveML: A Web-Based Tool to Identify Persistent Biomarker Structure for Rare Events Using Integrative Machine Learning Approach.PerSEveML:一种基于网络的工具,使用集成机器学习方法识别罕见事件的持久性生物标志物结构。
bioRxiv. 2023 Oct 30:2023.10.25.564000. doi: 10.1101/2023.10.25.564000.
3
Identifying dynamical persistent biomarker structures for rare events using modern integrative machine learning approach.利用现代综合机器学习方法识别罕见事件的动态持久生物标志物结构。
Proteomics. 2023 Nov;23(21-22):e2200290. doi: 10.1002/pmic.202200290. Epub 2023 Mar 10.
4
omniBiomarker: A Web-Based Application for Knowledge-Driven Biomarker Identification.全生物标志物:一个基于网络的知识驱动型生物标志物识别应用程序。
IEEE Trans Biomed Eng. 2013 Dec;60(12):3364-7. doi: 10.1109/TBME.2012.2212438. Epub 2012 Aug 8.
5
BioM2: biologically informed multi-stage machine learning for phenotype prediction using omics data.BioM2:基于生物学信息的多阶段机器学习,用于使用组学数据进行表型预测。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae384.
6
FEPS: A Tool for Feature Extraction from Protein Sequence.FEPS:一种从蛋白质序列中提取特征的工具。
Methods Mol Biol. 2022;2499:65-104. doi: 10.1007/978-1-0716-2317-6_3.
7
FlowAtlas: an interactive tool for high-dimensional immunophenotyping analysis bridging FlowJo with computational tools in Julia.FlowAtlas:一种用于高维免疫表型分析的交互式工具,将FlowJo与Julia中的计算工具相连接。
Front Immunol. 2024 Jul 17;15:1425488. doi: 10.3389/fimmu.2024.1425488. eCollection 2024.
8
iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.iLearnPlus:一个全面的、自动化的机器学习平台,用于核酸和蛋白质序列分析、预测和可视化。
Nucleic Acids Res. 2021 Jun 4;49(10):e60. doi: 10.1093/nar/gkab122.
9
Fast and interpretable genomic data analysis using multiple approximate kernel learning.使用多种近似核学习进行快速且可解释的基因组数据分析。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i77-i83. doi: 10.1093/bioinformatics/btac241.
10
Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.将机器学习中的手工特征与潜在变量相结合,以预测放射性肺损伤。
Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.

引用本文的文献

1
Machine learning in healthcare citizen science: A scoping review.医疗保健公民科学中的机器学习:一项范围综述。
Int J Med Inform. 2025 Mar;195:105766. doi: 10.1016/j.ijmedinf.2024.105766. Epub 2024 Dec 19.