• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MRMD3.0:一个基于集成策略的用于降维和数据可视化的 Python 工具和 Web 服务器。

MRMD3.0: A Python Tool and Webserver for Dimensionality Reduction and Data Visualization via an Ensemble Strategy.

机构信息

Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, China; Department of Computer Science, University of Tsukuba, Tsukuba, Ibaraki 305-8577, Japan.

Department of Computer Science, University of Tsukuba, Tsukuba, Ibaraki 305-8577, Japan.

出版信息

J Mol Biol. 2023 Jul 15;435(14):168116. doi: 10.1016/j.jmb.2023.168116. Epub 2023 Apr 21.

DOI:10.1016/j.jmb.2023.168116
PMID:37356901
Abstract

Dimensionality reduction is a hot topic in machine learning that can help researchers find key features from complex medical or biological data, which is crucial for biological sequence research, drug development, etc. However, when applied to specific datasets, different dimensionality reduction methods generate different results, which produces instability and makes tuning the parameters a time-consuming task. Exploring high quality features, genes, or attributes from complex data is an important task and challenge. To ensure the efficiency, robustness, and accuracy of experiments, in this work, we developed a dimensionality reduction tool MRMD3.0 based on the ensemble strategy of link analysis. It is mainly divided into two steps: first, the ensemble method is used to integrate different feature ranking algorithms to calculate feature importance, and then the forward feature search strategy combined with cross-validation is used to explore the proper feature combination. Compared with the previously developed version, MRMD3.0 has added more link-based ensemble algorithms, including PageRank, HITS, LeaderRank, and TrustRank. At the same time, more feature ranking algorithms have been added, and their effect and calculation speed have been greatly improved. In addition, the newest version provides an interface used by each feature ranking method and five kinds of charts to help users analyze features. Finally, we also provide an online webserver to help researchers analyze the data. Availability and implementation Webserver: http://lab.malab.cn/soft/MRMDv3/home.html. GitHub: https://github.com/heshida01/MRMD3.0.

摘要

降维是机器学习中的一个热门话题,它可以帮助研究人员从复杂的医学或生物学数据中找到关键特征,这对于生物序列研究、药物开发等至关重要。然而,当应用于特定数据集时,不同的降维方法会产生不同的结果,这会产生不稳定性,并且调整参数是一项耗时的任务。从复杂数据中探索高质量的特征、基因或属性是一项重要的任务和挑战。为了确保实验的效率、鲁棒性和准确性,在这项工作中,我们基于链接分析的集成策略开发了一个名为 MRMD3.0 的降维工具。它主要分为两个步骤:首先,集成方法用于整合不同的特征排序算法来计算特征重要性,然后使用结合交叉验证的前向特征搜索策略来探索合适的特征组合。与之前开发的版本相比,MRMD3.0 增加了更多基于链接的集成算法,包括 PageRank、HITS、LeaderRank 和 TrustRank。同时,还添加了更多的特征排序算法,并大大提高了它们的效果和计算速度。此外,最新版本提供了每个特征排序方法的接口和五种图表,以帮助用户分析特征。最后,我们还提供了一个在线网络服务器,以帮助研究人员分析数据。

可用性和实施

Web 服务器:http://lab.malab.cn/soft/MRMDv3/home.html。

GitHub:https://github.com/heshida01/MRMD3.0。

相似文献

1
MRMD3.0: A Python Tool and Webserver for Dimensionality Reduction and Data Visualization via an Ensemble Strategy.MRMD3.0:一个基于集成策略的用于降维和数据可视化的 Python 工具和 Web 服务器。
J Mol Biol. 2023 Jul 15;435(14):168116. doi: 10.1016/j.jmb.2023.168116. Epub 2023 Apr 21.
2
ShinyLearner: A containerized benchmarking tool for machine-learning classification of tabular data.ShinyLearner:一个用于表格数据机器学习分类的容器化基准测试工具。
Gigascience. 2020 Apr 1;9(4). doi: 10.1093/gigascience/giaa026.
3
Enhancer-FRL: Improved and Robust Identification of Enhancers and Their Activities Using Feature Representation Learning.增强子-FRL:利用特征表示学习改进并稳健识别增强子及其活性
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):967-975. doi: 10.1109/TCBB.2022.3204365. Epub 2023 Apr 3.
4
Drug-target interaction prediction using ensemble learning and dimensionality reduction.基于集成学习和降维的药物-靶点相互作用预测。
Methods. 2017 Oct 1;129:81-88. doi: 10.1016/j.ymeth.2017.05.016. Epub 2017 May 24.
5
Large-scale comparative review and assessment of computational methods for anti-cancer peptide identification.大规模比较综述与评估抗癌肽鉴定的计算方法。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa312.
6
iAtbP-Hyb-EnC: Prediction of antitubercular peptides via heterogeneous feature representation and genetic algorithm based ensemble learning model.iAtbP-Hyb-EnC:基于异质特征表示和遗传算法的集合学习模型对抗结核肽的预测。
Comput Biol Med. 2021 Oct;137:104778. doi: 10.1016/j.compbiomed.2021.104778. Epub 2021 Aug 25.
7
ECAmyloid: An amyloid predictor based on ensemble learning and comprehensive sequence-derived features.ECAmyloid:一种基于集成学习和综合序列衍生特征的淀粉样蛋白预测器。
Comput Biol Chem. 2023 Jun;104:107853. doi: 10.1016/j.compbiolchem.2023.107853. Epub 2023 Mar 23.
8
iLearn: an integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data.iLearn:一个集成平台和元学习者,用于 DNA、RNA 和蛋白质序列数据的特征工程、机器学习分析和建模。
Brief Bioinform. 2020 May 21;21(3):1047-1057. doi: 10.1093/bib/bbz041.
9
ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning.ATGPred-FL:基于特征表示学习的自噬蛋白序列预测。
Amino Acids. 2022 May;54(5):799-809. doi: 10.1007/s00726-022-03145-5. Epub 2022 Mar 14.
10
PeNGaRoo, a combined gradient boosting and ensemble learning framework for predicting non-classical secreted proteins.PeNGaRoo,一种组合梯度提升和集成学习框架,用于预测非经典分泌蛋白。
Bioinformatics. 2020 Feb 1;36(3):704-712. doi: 10.1093/bioinformatics/btz629.

引用本文的文献

1
DTI-RME: a robust and multi-kernel ensemble approach for drug-target interaction prediction.DTI-RME:一种用于药物-靶点相互作用预测的稳健多内核集成方法。
BMC Biol. 2025 Jul 28;23(1):225. doi: 10.1186/s12915-025-02340-6.
2
Classification of Acid and Alkaline Enzymes Based on Normalized Van der Waals Volume Features.基于归一化范德华体积特征的酸碱酶分类
Proteomics Clin Appl. 2025 Jul;19(4):e70009. doi: 10.1002/prca.70009. Epub 2025 May 31.
3
iNP_ESM: Neuropeptide Identification Based on Evolutionary Scale Modeling and Unified Representation Embedding Features.
iNP_ESM:基于进化尺度建模和统一表示嵌入特征的神经肽识别。
Int J Mol Sci. 2024 Jun 27;25(13):7049. doi: 10.3390/ijms25137049.
4
Prediction of miRNAs and diseases association based on sparse autoencoder and MLP.基于稀疏自编码器和多层感知器的微小RNA与疾病关联预测
Front Genet. 2024 May 30;15:1369811. doi: 10.3389/fgene.2024.1369811. eCollection 2024.
5
CircRNA identification and feature interpretability analysis.环状 RNA 鉴定和特征可解释性分析。
BMC Biol. 2024 Feb 27;22(1):44. doi: 10.1186/s12915-023-01804-x.
6
MV-CVIB: microbiome-based multi-view convolutional variational information bottleneck for predicting metastatic colorectal cancer.MV-CVIB:基于微生物组的多视图卷积变分信息瓶颈用于预测转移性结直肠癌
Front Microbiol. 2023 Aug 22;14:1238199. doi: 10.3389/fmicb.2023.1238199. eCollection 2023.