• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于基于RNA的样本鉴别和层次分类的用户驱动机器学习方法。

A user-driven machine learning approach for RNA-based sample discrimination and hierarchical classification.

作者信息

Imtiaz Tashifa, Nanayakkara Jina, Fang Alexis, Jomaa Danny, Mayotte Harrison, Damiani Simona, Javed Fiza, Jones Tristan, Kaczmarek Emily, Adebayo Flourish Omolara, Imtiaz Uroosa, Li Yiheng, Zhang Richard, Mousavi Parvin, Renwick Neil, Tyryshkin Kathrin

机构信息

Laboratory of Translational RNA Biology, Department of Pathology and Molecular Medicine, Queen's University, 88 Stuart St, Kingston, ON K7L 3N6, Canada.

Laboratory of Translational RNA Biology, Department of Pathology and Molecular Medicine, Queen's University, 88 Stuart St, Kingston, ON K7L 3N6, Canada.

出版信息

STAR Protoc. 2023 Oct 27;4(4):102661. doi: 10.1016/j.xpro.2023.102661.

DOI:10.1016/j.xpro.2023.102661
PMID:39491552
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10751557/
Abstract

RNA-based sample discrimination and classification can be used to provide biological insights and/or distinguish between clinical groups. However, finding informative differences between sample groups can be challenging due to the multidimensional and noisy nature of sequencing data. Here, we apply a machine learning approach for hierarchical discrimination and classification of samples with high-dimensional miRNA expression data. Our protocol comprises data preprocessing, unsupervised learning, feature selection, and machine-learning-based hierarchical classification, alongside open-source MATLAB code.

摘要

基于RNA的样本鉴别和分类可用于提供生物学见解和/或区分临床组。然而,由于测序数据具有多维度和噪声的特性,在样本组之间找到信息性差异可能具有挑战性。在这里,我们应用一种机器学习方法,对具有高维miRNA表达数据的样本进行分层鉴别和分类。我们的方案包括数据预处理、无监督学习、特征选择和基于机器学习的分层分类,以及开源的MATLAB代码。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/ec34900f1a56/gr38.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/d7faa94f7006/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/45d07f2f1dcc/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/b64e9401feaf/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/1a874acd4b54/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/97197c4496e1/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/e856410dd005/gr11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/eca82f8c21af/gr13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/9fea28fd974d/gr15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/2efed1a93aad/gr16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/579082d6c762/gr17.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/790c37708533/gr18.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/e13cf5a4bf0c/gr19.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/6ca4f47b8f34/gr20.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/54da3a79512a/gr21.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/8996e69c7119/gr22.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/5e1580929b66/gr23.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/eb0c75da1f31/gr24.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/75304edbc91c/gr25.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/c25b3162f01a/gr26.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/73e94a25afa1/gr27.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/5536af663b60/gr28.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/5a0e8a81928c/gr29.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/b3b03acd1e89/gr30.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/c22c0b0f5464/gr31.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/8c3e02d5fc59/gr32.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/5b19215bcf31/gr33.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/f1d86e7e83dd/gr34.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/5bf3cae068b1/gr35.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/735b95b22301/gr36.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/b8b9cb09ce70/gr37.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/ec34900f1a56/gr38.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/d7faa94f7006/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/45d07f2f1dcc/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/b64e9401feaf/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/1a874acd4b54/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/97197c4496e1/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/e856410dd005/gr11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/eca82f8c21af/gr13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/9fea28fd974d/gr15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/2efed1a93aad/gr16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/579082d6c762/gr17.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/790c37708533/gr18.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/e13cf5a4bf0c/gr19.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/6ca4f47b8f34/gr20.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/54da3a79512a/gr21.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/8996e69c7119/gr22.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/5e1580929b66/gr23.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/eb0c75da1f31/gr24.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/75304edbc91c/gr25.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/c25b3162f01a/gr26.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/73e94a25afa1/gr27.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/5536af663b60/gr28.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/5a0e8a81928c/gr29.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/b3b03acd1e89/gr30.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/c22c0b0f5464/gr31.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/8c3e02d5fc59/gr32.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/5b19215bcf31/gr33.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/f1d86e7e83dd/gr34.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/5bf3cae068b1/gr35.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/735b95b22301/gr36.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/b8b9cb09ce70/gr37.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d12/10751557/ec34900f1a56/gr38.jpg

相似文献

1
A user-driven machine learning approach for RNA-based sample discrimination and hierarchical classification.一种用于基于RNA的样本鉴别和层次分类的用户驱动机器学习方法。
STAR Protoc. 2023 Oct 27;4(4):102661. doi: 10.1016/j.xpro.2023.102661.
2
Unsupervised Feature Selection to Identify Important ICD-10 and ATC Codes for Machine Learning on a Cohort of Patients With Coronary Heart Disease: Retrospective Study.无监督特征选择以识别冠心病患者队列机器学习中的重要国际疾病分类第十版(ICD - 10)和解剖治疗化学分类系统(ATC)编码:回顾性研究
JMIR Med Inform. 2024 Jul 26;12:e52896. doi: 10.2196/52896.
3
Biological classification with RNA-seq data: Can alternatively spliced transcript expression enhance machine learning classifiers?基于 RNA-seq 数据的生物学分类:剪接转录本表达能否增强机器学习分类器?
RNA. 2018 Sep;24(9):1119-1132. doi: 10.1261/rna.062802.117. Epub 2018 Jun 25.
4
Gene targeting in amyotrophic lateral sclerosis using causality-based feature selection and machine learning.使用基于因果关系的特征选择和机器学习进行肌萎缩侧索硬化症的基因靶向治疗。
Mol Med. 2023 Jan 24;29(1):12. doi: 10.1186/s10020-023-00603-y.
5
An ensemble learning-based feature selection algorithm for identification of biomarkers of renal cell carcinoma.一种基于集成学习的用于识别肾细胞癌生物标志物的特征选择算法。
PeerJ Comput Sci. 2024 Jan 4;10:e1768. doi: 10.7717/peerj-cs.1768. eCollection 2024.
6
Differentiating between liver diseases by applying multiclass machine learning approaches to transcriptomics of liver tissue or blood-based samples.通过将多类机器学习方法应用于肝组织或血液样本的转录组学来区分肝脏疾病。
JHEP Rep. 2022 Aug 18;4(10):100560. doi: 10.1016/j.jhepr.2022.100560. eCollection 2022 Oct.
7
Improving platelet-RNA-based diagnostics: a comparative analysis of machine learning models for cancer detection and multiclass classification.提高基于血小板 RNA 的诊断:用于癌症检测和多类分类的机器学习模型的比较分析。
Mol Oncol. 2024 Nov;18(11):2743-2754. doi: 10.1002/1878-0261.13689. Epub 2024 Jun 17.
8
Alignment-free supervised classification of metagenomes by recursive SVM.基于递归 SVM 的无比对监督宏基因组分类。
BMC Genomics. 2013 Sep 22;14:641. doi: 10.1186/1471-2164-14-641.
9
Machine learning integrated ensemble of feature selection methods followed by survival analysis for predicting breast cancer subtype specific miRNA biomarkers.机器学习集成特征选择方法的集合,然后进行生存分析,以预测乳腺癌亚型特异性 miRNA 生物标志物。
Comput Biol Med. 2021 Apr;131:104244. doi: 10.1016/j.compbiomed.2021.104244. Epub 2021 Jan 28.
10
Source allocation of per- and polyfluoroalkyl substances (PFAS) with supervised machine learning: Classification performance and the role of feature selection in an expanded dataset.基于监督机器学习的全氟和多氟烷基物质(PFAS)源分配:在扩展数据集下的分类性能和特征选择的作用。
Chemosphere. 2021 Jul;275:130124. doi: 10.1016/j.chemosphere.2021.130124. Epub 2021 Feb 26.

本文引用的文献

1
A Computational Approach to Identification of Candidate Biomarkers in High-Dimensional Molecular Data.一种用于在高维分子数据中识别候选生物标志物的计算方法。
Diagnostics (Basel). 2022 Aug 18;12(8):1997. doi: 10.3390/diagnostics12081997.
2
Sex differences in the aging murine urinary bladder and influence on the tumor immune microenvironment of a carcinogen-induced model of bladder cancer.衰老的小鼠膀胱中的性别差异及其对致癌物诱导膀胱癌模型肿瘤免疫微环境的影响。
Biol Sex Differ. 2022 May 3;13(1):19. doi: 10.1186/s13293-022-00428-0.
3
Blood extracellular vesicles from healthy individuals regulate hematopoietic stem cells as humans age.
健康个体的血液细胞外囊泡随年龄增长调节造血干细胞。
Aging Cell. 2020 Nov;19(11):e13245. doi: 10.1111/acel.13245. Epub 2020 Oct 7.
4
Classifying Lung Neuroendocrine Neoplasms through MicroRNA Sequence Data Mining.通过微小RNA序列数据挖掘对肺神经内分泌肿瘤进行分类
Cancers (Basel). 2020 Sep 17;12(9):2653. doi: 10.3390/cancers12092653.
5
Characterizing and classifying neuroendocrine neoplasms through microRNA sequencing and data mining.通过微小RNA测序和数据挖掘对神经内分泌肿瘤进行特征描述和分类。
NAR Cancer. 2020 Sep;2(3):zcaa009. doi: 10.1093/narcan/zcaa009. Epub 2020 Jul 15.
6
A large-scale comparative study of isoform expressions measured on four platforms.四种平台检测的异构体表达的大规模比较研究。
BMC Genomics. 2020 Mar 30;21(1):272. doi: 10.1186/s12864-020-6643-8.
7
Neutrophil recruitment and function in endometriosis patients and a syngeneic murine model.子宫内膜异位症患者和同源小鼠模型中的中性粒细胞募集和功能。
FASEB J. 2020 Jan;34(1):1558-1575. doi: 10.1096/fj.201902272R. Epub 2019 Dec 2.
8
Plasma microRNA expression levels and their targeted pathways in patients with major depressive disorder who are responsive to duloxetine treatment.接受度洛西汀治疗的反应性重性抑郁障碍患者的血浆 microRNA 表达水平及其靶向通路。
J Psychiatr Res. 2019 Mar;110:38-44. doi: 10.1016/j.jpsychires.2018.12.007. Epub 2018 Dec 8.
9
Evaluating gastroenteropancreatic neuroendocrine tumors through microRNA sequencing.通过 microRNA 测序评估胃肠胰神经内分泌肿瘤。
Endocr Relat Cancer. 2019 Jan 1;26(1):47-57. doi: 10.1530/ERC-18-0244.
10
Novel prognostic and predictive microRNA targets for triple-negative breast cancer.三阴性乳腺癌的新型预后和预测性微小RNA靶点
FASEB J. 2018 May 29:fj201800120R. doi: 10.1096/fj.201800120R.