• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用文献挖掘和转录组机器学习开发剪接特征数据库以描绘癌症通路。

Splicing signature database development to delineate cancer pathways using literature mining and transcriptome machine learning.

作者信息

Lee Kyubin, Hyung Daejin, Cho Soo Young, Yu Namhee, Hong Sewha, Kim Jihyun, Kim Sunshin, Han Ji-Youn, Park Charny

机构信息

Research Institute, National Cancer Center, 232 Ilsan-ro, Goyang-si, Gyeonggi-do 10408, Republic of Korea.

Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA.

出版信息

Comput Struct Biotechnol J. 2023 Mar 2;21:1978-1988. doi: 10.1016/j.csbj.2023.02.052. eCollection 2023.

DOI:10.1016/j.csbj.2023.02.052
PMID:36942103
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10023904/
Abstract

Alternative splicing (AS) events modulate certain pathways and phenotypic plasticity in cancer. Although previous studies have computationally analyzed splicing events, it is still a challenge to uncover biological functions induced by reliable AS events from tremendous candidates. To provide essential splicing event signatures to assess pathway regulation, we developed a database by collecting two datasets: (i) reported literature and (ii) cancer transcriptome profile. The former includes knowledge-based splicing signatures collected from 63,229 PubMed abstracts using natural language processing, extracted for 202 pathways. The latter is the machine learning-based splicing signatures identified from pan-cancer transcriptome for 16 cancer types and 42 pathways. We established six different learning models to classify pathway activities from splicing profiles as a learning dataset. Top-ranked AS events by learning model feature importance became the signature for each pathway. To validate our learning results, we performed evaluations by (i) performance metrics, (ii) differential AS sets acquired from external datasets, and (iii) our knowledge-based signatures. The area under the receiver operating characteristic values of the learning models did not exhibit any drastic difference. However, random-forest distinctly presented the best performance to compare with the AS sets identified from external datasets and our knowledge-based signatures. Therefore, we used the signatures obtained from the random-forest model. Our database provided the clinical characteristics of the AS signatures, including survival test, molecular subtype, and tumor microenvironment. The regulation by splicing factors was additionally investigated. Our database for developed signatures supported retrieval and visualization system.

摘要

可变剪接(AS)事件可调节癌症中的某些信号通路和表型可塑性。尽管先前的研究已对剪接事件进行了计算分析,但从大量候选事件中发现由可靠的AS事件诱导的生物学功能仍是一项挑战。为了提供关键的剪接事件特征以评估信号通路调控,我们通过收集两个数据集开发了一个数据库:(i)已发表的文献和(ii)癌症转录组图谱。前者包括使用自然语言处理从63,229篇PubMed摘要中收集的基于知识的剪接特征,共提取了202条信号通路。后者是从16种癌症类型和42条信号通路的泛癌转录组中识别出的基于机器学习的剪接特征。我们建立了六种不同的学习模型,将剪接图谱中的信号通路活性分类为学习数据集。根据学习模型特征重要性排名靠前的AS事件成为每条信号通路的特征。为了验证我们的学习结果,我们通过(i)性能指标、(ii)从外部数据集中获取的差异AS集以及(iii)我们基于知识的特征进行了评估。学习模型的受试者工作特征值下的面积没有显示出任何显著差异。然而,随机森林与从外部数据集中识别出的AS集以及我们基于知识的特征相比,明显表现出最佳性能。因此,我们使用了从随机森林模型获得的特征。我们的数据库提供了AS特征的临床特征,包括生存测试、分子亚型和肿瘤微环境。此外,还研究了剪接因子的调控作用。我们用于开发特征的数据库支持检索和可视化系统。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a37/10023904/6f2dc673482d/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a37/10023904/71e048901cba/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a37/10023904/d802dc030821/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a37/10023904/78dff7d6932c/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a37/10023904/505430299076/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a37/10023904/2b5919cf034f/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a37/10023904/6f2dc673482d/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a37/10023904/71e048901cba/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a37/10023904/d802dc030821/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a37/10023904/78dff7d6932c/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a37/10023904/505430299076/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a37/10023904/2b5919cf034f/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a37/10023904/6f2dc673482d/gr5.jpg

相似文献

1
Splicing signature database development to delineate cancer pathways using literature mining and transcriptome machine learning.利用文献挖掘和转录组机器学习开发剪接特征数据库以描绘癌症通路。
Comput Struct Biotechnol J. 2023 Mar 2;21:1978-1988. doi: 10.1016/j.csbj.2023.02.052. eCollection 2023.
2
Exploration of predictive and prognostic alternative splicing signatures in lung adenocarcinoma using machine learning methods.使用机器学习方法探索肺腺癌中预测性和预后性可变剪接特征
J Transl Med. 2020 Dec 7;18(1):463. doi: 10.1186/s12967-020-02635-y.
3
Characterization of alternative splicing events and prognostic signatures in breast cancer.乳腺癌中可变剪接事件和预后特征的分析。
BMC Cancer. 2021 May 22;21(1):587. doi: 10.1186/s12885-021-08305-6.
4
Comprehensive analysis and establishment of a prediction model of alternative splicing events reveal the prognostic predictor and immune microenvironment signatures in triple negative breast cancer.全面分析和建立剪接事件的预测模型,揭示三阴性乳腺癌的预后预测因子和免疫微环境特征。
J Transl Med. 2020 Jul 28;18(1):286. doi: 10.1186/s12967-020-02454-1.
5
Novel, alternative splicing signature to detect lymph node metastasis in prostate adenocarcinoma with machine learning.利用机器学习检测前列腺腺癌淋巴结转移的新型可变剪接特征
Front Oncol. 2023 Jan 13;12:1084403. doi: 10.3389/fonc.2022.1084403. eCollection 2022.
6
N6-methyladenosine RNA methylation regulator-related alternative splicing gene signature as prognostic predictor and in immune microenvironment characterization of patients with low-grade glioma.N6-甲基腺苷RNA甲基化调节因子相关的可变剪接基因特征作为低级别胶质瘤患者的预后预测指标及免疫微环境特征分析
Front Genet. 2022 Jul 22;13:872186. doi: 10.3389/fgene.2022.872186. eCollection 2022.
7
Machine learning assisted analysis of breast cancer gene expression profiles reveals novel potential prognostic biomarkers for triple-negative breast cancer.机器学习辅助分析乳腺癌基因表达谱揭示了三阴性乳腺癌新的潜在预后生物标志物。
Comput Struct Biotechnol J. 2022 Mar 24;20:1618-1631. doi: 10.1016/j.csbj.2022.03.019. eCollection 2022.
8
Survival-Associated Alternative Messenger RNA Splicing Signatures in Pancreatic Ductal Adenocarcinoma: A Study Based on RNA-Sequencing Data.胰腺导管腺癌中与生存相关的可变剪接信使 RNA 特征:基于 RNA 测序数据的研究。
DNA Cell Biol. 2019 Nov;38(11):1207-1222. doi: 10.1089/dna.2019.4862. Epub 2019 Sep 4.
9
Prognostic Signatures of Alternative Splicing Events in Esophageal Carcinoma Based on TCGA Splice-Seq Data.基于TCGA剪接序列数据的食管癌可变剪接事件的预后特征
Front Oncol. 2021 Oct 5;11:658262. doi: 10.3389/fonc.2021.658262. eCollection 2021.
10
Genome-Wide Profiling of Alternative Splicing Signatures Associated with Prognosis and Immune Microenvironment of Hepatocellular Carcinoma.全基因组分析与肝细胞癌预后和免疫微环境相关的可变剪接特征。
Med Sci Monit. 2021 Aug 18;27:e930052. doi: 10.12659/MSM.930052.

引用本文的文献

1
Alternative Splicing in Tumorigenesis and Cancer Therapy.肿瘤发生与癌症治疗中的可变剪接
Biomolecules. 2025 May 29;15(6):789. doi: 10.3390/biom15060789.
2
ASpedia-R: a package to retrieve junction-incorporating features and knowledge-based functions of human alternative splicing events.ASpedia-R:一个用于检索人类可变剪接事件的包含连接点特征和基于知识功能的软件包。
Bioinform Adv. 2024 May 11;4(1):vbae071. doi: 10.1093/bioadv/vbae071. eCollection 2024.

本文引用的文献

1
A comprehensive survey of the approaches for pathway analysis using multi-omics data integration.多组学数据整合的通路分析方法的全面综述。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac435.
2
Identifying common transcriptome signatures of cancer by interpreting deep learning models.通过解读深度学习模型来识别癌症的常见转录组特征。
Genome Biol. 2022 May 17;23(1):117. doi: 10.1186/s13059-022-02681-3.
3
ASpediaFI: Functional Interaction Analysis of Alternative Splicing Events.ASpediaFI:可变剪接事件的功能相互作用分析
Genomics Proteomics Bioinformatics. 2022 Jun;20(3):466-482. doi: 10.1016/j.gpb.2021.10.004. Epub 2022 Jan 25.
4
Aberrant Transcript Usage Is Associated with Homologous Recombination Deficiency and Predicts Therapeutic Response.异常转录本的使用与同源重组缺陷相关,并可预测治疗反应。
Cancer Res. 2022 Jan 1;82(1):142-154. doi: 10.1158/0008-5472.CAN-21-2023. Epub 2021 Oct 28.
5
Alternative splicing and cancer: a systematic review.可变剪接与癌症:系统性综述。
Signal Transduct Target Ther. 2021 Feb 24;6(1):78. doi: 10.1038/s41392-021-00486-7.
6
The Gene Ontology resource: enriching a GOld mine.基因本体论资源:丰富一个 GOld 矿。
Nucleic Acids Res. 2021 Jan 8;49(D1):D325-D334. doi: 10.1093/nar/gkaa1113.
7
ChiTaRS 5.0: the comprehensive database of chimeric transcripts matched with druggable fusions and 3D chromatin maps.ChiTaRS 5.0:匹配可成药性融合和 3D 染色质图谱的嵌合转录本综合数据库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D825-D834. doi: 10.1093/nar/gkz1025.
8
The reactome pathway knowledgebase.Reactome 通路知识库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D498-D503. doi: 10.1093/nar/gkz1031.
9
ProtFus: A Comprehensive Method Characterizing Protein-Protein Interactions of Fusion Proteins.ProtFus:一种全面的融合蛋白蛋白质相互作用特征分析方法。
PLoS Comput Biol. 2019 Aug 22;15(8):e1007239. doi: 10.1371/journal.pcbi.1007239. eCollection 2019 Aug.
10
EWS-FLI1 modulated alternative splicing of ARID1A reveals novel oncogenic function through the BAF complex.EWS-FLI1 调控 ARID1A 的可变剪接,通过 BAF 复合物揭示新的致癌功能。
Nucleic Acids Res. 2019 Oct 10;47(18):9619-9636. doi: 10.1093/nar/gkz699.