• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估液体活检来源的RNA测序数据中数量增加的生物学相关特征所提供的补充信息。

Assessing the complementary information from an increased number of biologically relevant features in liquid biopsy-derived RNA-Seq data.

作者信息

Giannoukakos Stavros, D'Ambrosi Silvia, Koppers-Lalic Danijela, Gómez-Martín Cristina, Fernandez Alberto, Hackenberg Michael

机构信息

Department of Genetics, Faculty of Science, University of Granada, Granada, 18071, Spain.

Bioinformatics Laboratory, Biomedical Research Centre (CIBM), PTS, Granada, 18100, Spain.

出版信息

Heliyon. 2024 Mar 12;10(6):e27360. doi: 10.1016/j.heliyon.2024.e27360. eCollection 2024 Mar 30.

DOI:10.1016/j.heliyon.2024.e27360
PMID:38515664
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10955244/
Abstract

Liquid biopsy-derived RNA sequencing (lbRNA-seq) exhibits significant promise for clinic-oriented cancer diagnostics due to its non-invasiveness and ease of repeatability. Despite substantial advancements, obstacles like technical artefacts and process standardisation impede seamless clinical integration. Alongside addressing technical aspects such as normalising fluctuating low-input material and establishing a standardised clinical workflow, the lack of result validation using independent datasets remains a critical factor contributing to the often low reproducibility of liquid biopsy-detected biomarkers. Considering the outlined drawbacks, our objective was to establish a workflow/methodology characterised by: 1. Harness the rich diversity of biological features accessible through lbRNA-seq data, encompassing a holistic range of molecular and functional attributes. These components are seamlessly integrated via a Machine Learning-based Ensemble Classification framework, enabling a unified and comprehensive analysis of the intricate information encoded within the data. 2. Implementing and rigorously benchmarking intra-sample normalisation methods to heighten their relevance within clinical settings. 3. Thoroughly assessing its efficacy across independent test sets to ascertain its robustness and potential utility. Using ten datasets from several studies comprising three different sources of biological material, we first show that while the best-performing normalisation methods depend strongly on the dataset and coupled Machine Learning method, the rather simple Counts Per Million method is generally very robust, showing comparable performance to cross-sample methods. Subsequently, we demonstrate that the innovative biofeature types introduced in this study, such as the Fraction of Canonical Transcript, harbour complementary information. Consequently, their inclusion consistently enhances prediction power compared to models relying solely on gene expression-based biofeatures. Finally, we demonstrate that the workflow is robust on completely independent datasets, generally from different labs and/or different protocols. Taken together, the workflow presented here outperforms generally employed methods in prediction accuracy and may hold potential for clinical diagnostics application due to its specific design.

摘要

液体活检衍生的RNA测序(lbRNA-seq)因其非侵入性和易于重复性,在面向临床的癌症诊断中展现出巨大潜力。尽管取得了重大进展,但技术假象和流程标准化等障碍阻碍了其与临床的无缝整合。除了解决诸如对波动的低输入材料进行归一化以及建立标准化临床工作流程等技术问题外,缺乏使用独立数据集进行结果验证仍然是导致液体活检检测到的生物标志物重复性往往较低的关键因素。考虑到上述缺点,我们的目标是建立一种具有以下特点的工作流程/方法:1. 利用通过lbRNA-seq数据可获取的丰富多样的生物学特征,涵盖分子和功能属性的全面范围。这些组件通过基于机器学习的集成分类框架无缝集成,能够对数据中编码的复杂信息进行统一和全面的分析。2. 实施并严格基准测试样本内归一化方法,以提高其在临床环境中的相关性。3. 在独立测试集上全面评估其功效,以确定其稳健性和潜在效用。使用来自几项研究的十个数据集,这些数据集包含三种不同来源的生物材料,我们首先表明,虽然表现最佳的归一化方法强烈依赖于数据集和相关的机器学习方法,但相当简单的每百万计数法通常非常稳健,其性能与跨样本方法相当。随后,我们证明了本研究中引入的创新生物特征类型,如标准转录本分数,具有互补信息。因此,与仅依赖基于基因表达的生物特征的模型相比,将它们纳入始终能提高预测能力。最后,我们证明该工作流程在完全独立的数据集上是稳健的,这些数据集通常来自不同的实验室和/或不同的方案。综上所述,本文提出的工作流程在预测准确性方面优于一般采用的方法,并且由于其特定设计可能在临床诊断应用中具有潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/35e5dbdcd24c/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/554fb832c6e3/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/16caa66988f4/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/267e9ee19309/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/f25f0a5eb812/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/ccc5d38cc923/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/8d877b31cf82/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/35e5dbdcd24c/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/554fb832c6e3/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/16caa66988f4/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/267e9ee19309/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/f25f0a5eb812/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/ccc5d38cc923/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/8d877b31cf82/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c573/10955244/35e5dbdcd24c/gr7.jpg

相似文献

1
Assessing the complementary information from an increased number of biologically relevant features in liquid biopsy-derived RNA-Seq data.评估液体活检来源的RNA测序数据中数量增加的生物学相关特征所提供的补充信息。
Heliyon. 2024 Mar 12;10(6):e27360. doi: 10.1016/j.heliyon.2024.e27360. eCollection 2024 Mar 30.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
4
Gene filtering strategies for machine learning guided biomarker discovery using neonatal sepsis RNA-seq data.使用新生儿败血症RNA测序数据进行机器学习引导的生物标志物发现的基因筛选策略。
Front Genet. 2023 Apr 11;14:1158352. doi: 10.3389/fgene.2023.1158352. eCollection 2023.
5
Differentiating between liver diseases by applying multiclass machine learning approaches to transcriptomics of liver tissue or blood-based samples.通过将多类机器学习方法应用于肝组织或血液样本的转录组学来区分肝脏疾病。
JHEP Rep. 2022 Aug 18;4(10):100560. doi: 10.1016/j.jhepr.2022.100560. eCollection 2022 Oct.
6
DEGnext: classification of differentially expressed genes from RNA-seq data using a convolutional neural network with transfer learning.DEGnext:使用具有迁移学习的卷积神经网络对 RNA-seq 数据进行差异表达基因分类。
BMC Bioinformatics. 2022 Jan 6;23(1):17. doi: 10.1186/s12859-021-04527-4.
7
Machine Learning Analysis of RNA-seq Data for Diagnostic and Prognostic Prediction of Colon Cancer.基于 RNA-seq 数据的机器学习分析用于结直肠癌的诊断和预后预测。
Sensors (Basel). 2023 Mar 13;23(6):3080. doi: 10.3390/s23063080.
8
Identifying novel transcript biomarkers for hepatocellular carcinoma (HCC) using RNA-Seq datasets and machine learning.利用 RNA-Seq 数据集和机器学习技术鉴定肝细胞癌(HCC)的新型转录生物标志物。
BMC Cancer. 2021 Aug 27;21(1):962. doi: 10.1186/s12885-021-08704-9.
9
An integrative machine learning strategy for improved prediction of essential genes in Escherichia coli metabolism using flux-coupled features.一种利用通量耦合特征改进大肠杆菌代谢中必需基因预测的综合机器学习策略。
Mol Biosyst. 2017 Jul 25;13(8):1584-1596. doi: 10.1039/c7mb00234c.
10
RNA-seq assistant: machine learning based methods to identify more transcriptional regulated genes.RNA-seq 辅助工具:基于机器学习的方法,以鉴定更多受转录调控的基因。
BMC Genomics. 2018 Jul 20;19(1):546. doi: 10.1186/s12864-018-4932-2.

引用本文的文献

1
Variation in bulk RNA-seq and estimated cell type proportion using deconvolution when comparing pancreatic cancer samples within the same individual.在比较同一个体内的胰腺癌样本时,使用反卷积方法对批量RNA测序和估计的细胞类型比例进行的变异分析。
medRxiv. 2025 May 6:2025.05.05.25326976. doi: 10.1101/2025.05.05.25326976.

本文引用的文献

1
Tumor-educated platelet blood tests for Non-Small Cell Lung Cancer detection and management.肿瘤教育血小板血液检测用于非小细胞肺癌的检测和管理。
Sci Rep. 2023 Jun 8;13(1):9359. doi: 10.1038/s41598-023-35818-w.
2
NORMSEQ: a tool for evaluation, selection and visualization of RNA-Seq normalization methods.NORMSEQ:一种用于 RNA-Seq 归一化方法评估、选择和可视化的工具。
Nucleic Acids Res. 2023 Jul 5;51(W1):W372-W378. doi: 10.1093/nar/gkad429.
3
From patterns to patients: Advances in clinical machine learning for cancer diagnosis, prognosis, and treatment.
从模式到患者:癌症诊断、预后和治疗的临床机器学习进展。
Cell. 2023 Apr 13;186(8):1772-1791. doi: 10.1016/j.cell.2023.01.035. Epub 2023 Mar 10.
4
Tumour-educated platelets for breast cancer detection: biological and technical insights.肿瘤教育血小板用于乳腺癌检测:生物学和技术见解。
Br J Cancer. 2023 Apr;128(8):1572-1581. doi: 10.1038/s41416-023-02174-5. Epub 2023 Feb 10.
5
Detection and localization of early- and late-stage cancers using platelet RNA.利用血小板 RNA 检测和定位早期和晚期癌症。
Cancer Cell. 2022 Sep 12;40(9):999-1009.e6. doi: 10.1016/j.ccell.2022.08.006. Epub 2022 Sep 1.
6
RNA Sequencing of Tumor-Educated Platelets Reveals a Three-Gene Diagnostic Signature in Esophageal Squamous Cell Carcinoma.肿瘤相关血小板的RNA测序揭示了食管鳞状细胞癌的三基因诊断标志物
Front Oncol. 2022 May 9;12:824354. doi: 10.3389/fonc.2022.824354. eCollection 2022.
7
RNA profiling of blood platelets noninvasively differentiates colorectal cancer from healthy donors and noncancerous intestinal diseases: a retrospective cohort study.经外周血血小板 RNA 特征分析可无创性地区分结直肠癌与健康供者和非癌性肠道疾病:一项回顾性队列研究。
Genome Med. 2022 Mar 2;14(1):26. doi: 10.1186/s13073-022-01033-x.
8
The emerging roles of NGS in clinical oncology and personalized medicine.NGS 在临床肿瘤学和个性化医学中的新兴作用。
Pathol Res Pract. 2022 Feb;230:153760. doi: 10.1016/j.prp.2022.153760. Epub 2022 Jan 10.
9
The Sequence Read Archive: a decade more of explosive growth.序列读取档案:十年的爆炸式增长。
Nucleic Acids Res. 2022 Jan 7;50(D1):D387-D390. doi: 10.1093/nar/gkab1053.
10
Toward platelet transcriptomics in cancer diagnosis, prognosis and therapy.迈向血小板转录组学在癌症诊断、预后及治疗中的应用
Br J Cancer. 2022 Feb;126(3):316-322. doi: 10.1038/s41416-021-01627-z. Epub 2021 Nov 22.