• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

NMFP:一种基于非负矩阵分解的预选方法,用于提高从RNA测序数据中识别mRNA异构体的准确性。

NMFP: a non-negative matrix factorization based preselection method to increase accuracy of identifying mRNA isoforms from RNA-seq data.

作者信息

Ye Yuting, Li Jingyi Jessica

机构信息

Division of Biostatistics, University of California, Berkeley, 94720, Berkeley, CA, USA.

Department of Statistics, 8125 Math Sciences Bldg., University of California, Los Angeles, Los Angeles, 90095-1554, CA, USA.

出版信息

BMC Genomics. 2016 Jan 11;17 Suppl 1(Suppl 1):11. doi: 10.1186/s12864-015-2304-8.

DOI:10.1186/s12864-015-2304-8
PMID:26818007
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4895266/
Abstract

BACKGROUND

The advent of next-generation RNA sequencing (RNA-seq) has greatly advanced transcriptomic studies, including system-wide identification and quantification of mRNA isoforms under various biological conditions. A number of computational methods have been developed to systematically identify mRNA isoforms in a high-throughput manner from RNA-seq data. However, a common drawback of these methods is that their identified mRNA isoforms contain a high percentage of false positives, especially for genes with complex splicing structures, e.g., many exons and exon junctions.

RESULTS

We have developed a preselection method called "Non-negative Matrix Factorization Preselection" (NMFP) which is designed to improve the accuracy of computational methods in identifying mRNA isoforms from RNA-seq data. We demonstrated through simulation and real data studies that NMFP can effectively shrink the search space of isoform candidates and increase the accuracy of two mainstream computational methods, Cufflinks and SLIDE, in their identification of mRNA isoforms.

CONCLUSION

NMFP is a useful tool to preselect mRNA isoform candidates for downstream isoform discovery methods. It can greatly reduce the number of isoform candidates while maintaining a good coverage of unknown true isoforms. Adding NMFP as an upstream step, computational methods are expected to achieve better accuracy in identifying mRNA isoforms from RNA-seq data.

摘要

背景

新一代RNA测序(RNA-seq)技术的出现极大地推动了转录组学研究,包括在各种生物学条件下对mRNA异构体进行全系统的鉴定和定量分析。已经开发了许多计算方法,用于从RNA-seq数据中以高通量方式系统地鉴定mRNA异构体。然而,这些方法的一个共同缺点是,它们鉴定出的mRNA异构体中假阳性比例很高,特别是对于具有复杂剪接结构的基因,例如包含许多外显子和外显子连接的基因。

结果

我们开发了一种称为“非负矩阵分解预选”(NMFP)的预选方法,旨在提高从RNA-seq数据中鉴定mRNA异构体的计算方法的准确性。通过模拟和实际数据研究,我们证明NMFP可以有效地缩小异构体候选物的搜索空间,并提高两种主流计算方法Cufflinks和SLIDE在鉴定mRNA异构体时的准确性。

结论

NMFP是一种用于为下游异构体发现方法预选mRNA异构体候选物的有用工具。它可以大大减少异构体候选物的数量,同时保持对未知真实异构体的良好覆盖。将NMFP作为上游步骤,预计计算方法在从RNA-seq数据中鉴定mRNA异构体时将获得更高的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/98646a3ad412/12864_2015_2304_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/4795ec51eb92/12864_2015_2304_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/d3929ae39be6/12864_2015_2304_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/5049ca89b524/12864_2015_2304_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/1f8885af18da/12864_2015_2304_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/d74c51b551ff/12864_2015_2304_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/469aa70fb6b7/12864_2015_2304_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/0331fc4fff1f/12864_2015_2304_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/cfcb6899dfe2/12864_2015_2304_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/98646a3ad412/12864_2015_2304_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/4795ec51eb92/12864_2015_2304_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/d3929ae39be6/12864_2015_2304_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/5049ca89b524/12864_2015_2304_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/1f8885af18da/12864_2015_2304_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/d74c51b551ff/12864_2015_2304_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/469aa70fb6b7/12864_2015_2304_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/0331fc4fff1f/12864_2015_2304_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/cfcb6899dfe2/12864_2015_2304_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6660/4895266/98646a3ad412/12864_2015_2304_Fig9_HTML.jpg

相似文献

1
NMFP: a non-negative matrix factorization based preselection method to increase accuracy of identifying mRNA isoforms from RNA-seq data.NMFP:一种基于非负矩阵分解的预选方法,用于提高从RNA测序数据中识别mRNA异构体的准确性。
BMC Genomics. 2016 Jan 11;17 Suppl 1(Suppl 1):11. doi: 10.1186/s12864-015-2304-8.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Deep sequencing of pre-translational mRNPs reveals hidden flux through evolutionarily conserved alternative splicing nonsense-mediated decay pathways.对翻译前 mRNP 的深度测序揭示了通过进化保守的选择性剪接无意义介导的衰变途径隐藏的通量。
Genome Biol. 2021 May 3;22(1):132. doi: 10.1186/s13059-021-02309-y.
4
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
5
Short-Term Memory Impairment短期记忆障碍
6
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
7
A systematic approach to RNA-associated motif discovery.一种系统的 RNA 相关基序发现方法。
BMC Genomics. 2018 Feb 14;19(1):146. doi: 10.1186/s12864-018-4528-x.
8
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA?一项初步评估。
Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.
9
Electrophoresis电泳
10
SAKit: An all-in-one analysis pipeline for identifying novel proteins resulting from variant events at both large and small scales.SAKit:一种用于鉴定由大尺度和小尺度变异事件产生的新型蛋白质的一体化分析管道。
J Bioinform Comput Biol. 2024 Oct;22(5):2450022. doi: 10.1142/S0219720024500227. Epub 2024 Oct 1.

引用本文的文献

1
ECT2 cell group acts as cancer stem cell in malignant pleomorphic adenoma.ECT2细胞群在恶性多形性腺瘤中充当癌症干细胞。
NPJ Precis Oncol. 2025 Jun 17;9(1):189. doi: 10.1038/s41698-025-00974-x.
2
An NMF-based approach to discover overlooked differentially expressed gene regions from single-cell RNA-seq data.一种基于非负矩阵分解的方法,用于从单细胞RNA测序数据中发现被忽视的差异表达基因区域。
NAR Genom Bioinform. 2019 Dec 16;2(1):lqz020. doi: 10.1093/nargab/lqz020. eCollection 2020 Mar.
3
scPNMF: sparse gene encoding of single cells to facilitate gene selection for targeted gene profiling.

本文引用的文献

1
Normalization of RNA-seq data using factor analysis of control genes or samples.使用对照基因或样本的因子分析对RNA测序数据进行标准化。
Nat Biotechnol. 2014 Sep;32(9):896-902. doi: 10.1038/nbt.2931. Epub 2014 Aug 24.
2
Assessment of transcript reconstruction methods for RNA-seq.RNA-seq 转录本重构方法评估。
Nat Methods. 2013 Dec;10(12):1177-84. doi: 10.1038/nmeth.2714. Epub 2013 Nov 3.
3
Modelling and simulating generic RNA-Seq experiments with the flux simulator.使用通量模拟器对通用 RNA-Seq 实验进行建模和模拟。
scPNMF:稀疏的单细胞基因编码,以方便选择用于靶向基因分析的基因。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i358-i366. doi: 10.1093/bioinformatics/btab273.
4
A robust semi-supervised NMF model for single cell RNA-seq data.一种用于单细胞RNA测序数据的强大半监督非负矩阵分解模型。
PeerJ. 2020 Oct 16;8:e10091. doi: 10.7717/peerj.10091. eCollection 2020.
5
AIDE: annotation-assisted isoform discovery with high precision.AIDE:高精度注释辅助的异构体发现。
Genome Res. 2019 Dec;29(12):2056-2072. doi: 10.1101/gr.251108.119. Epub 2019 Nov 6.
6
MSIQ: JOINT MODELING OF MULTIPLE RNA-SEQ SAMPLES FOR ACCURATE ISOFORM QUANTIFICATION.MSIQ:用于精确异构体定量的多个RNA测序样本的联合建模
Ann Appl Stat. 2018 Mar;12(1):510-539. doi: 10.1214/17-AOAS1100. Epub 2018 Mar 9.
7
Erratum to: 'NMFP: a non-negative matrix factorization based preselection method to increase accuracy of identifying mRNA isoforms from RNA-seq data'.勘误:“NMFP:一种基于非负矩阵分解的预选择方法,用于提高从RNA测序数据中识别mRNA异构体的准确性”。
BMC Genomics. 2016 Apr 19;17:296. doi: 10.1186/s12864-016-2621-6.
Nucleic Acids Res. 2012 Nov 1;40(20):10073-83. doi: 10.1093/nar/gks666. Epub 2012 Sep 7.
4
Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration.综合基因组浏览器(IGV):高性能基因组学数据可视化和探索。
Brief Bioinform. 2013 Mar;14(2):178-92. doi: 10.1093/bib/bbs017. Epub 2012 Apr 19.
5
Sparse linear modeling of next-generation mRNA sequencing (RNA-Seq) data for isoform discovery and abundance estimation.基于下一代 mRNA 测序(RNA-Seq)数据的稀疏线性建模用于发现异构体和丰度估计。
Proc Natl Acad Sci U S A. 2011 Dec 13;108(50):19867-72. doi: 10.1073/pnas.1113972108. Epub 2011 Dec 1.
6
Identification of novel transcripts in annotated genomes using RNA-Seq.利用 RNA-Seq 鉴定注释基因组中的新型转录本。
Bioinformatics. 2011 Sep 1;27(17):2325-9. doi: 10.1093/bioinformatics/btr355. Epub 2011 Jun 21.
7
Integrative genomics viewer.整合基因组浏览器。
Nat Biotechnol. 2011 Jan;29(1):24-6. doi: 10.1038/nbt.1754.
8
Improvements to previous algorithms to predict gene structure and isoform concentrations using Affymetrix Exon arrays.利用 Affymetrix 外显子芯片改进先前的算法,以预测基因结构和异构体浓度。
BMC Bioinformatics. 2010 Nov 26;11:578. doi: 10.1186/1471-2105-11-578.
9
Modeling non-uniformity in short-read rates in RNA-Seq data.RNA-Seq 数据中短读率非均匀性建模。
Genome Biol. 2010;11(5):R50. doi: 10.1186/gb-2010-11-5-r50. Epub 2010 May 11.
10
Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.通过 RNA-Seq 进行转录本组装和定量分析揭示了细胞分化过程中未注释的转录本和异构体转换。
Nat Biotechnol. 2010 May;28(5):511-5. doi: 10.1038/nbt.1621. Epub 2010 May 2.