• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BEERS2:通过高保真的计算机模拟进行 RNA-Seq 模拟。

BEERS2: RNA-Seq simulation through high fidelity in silico modeling.

机构信息

Institute for Translational Medicine and Therapeutics, University of Pennsylvania, PA, USA.

Current address: National Institute on Aging, National Institutes of Health, Baltimore, MD, USA.

出版信息

Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae164.

DOI:10.1093/bib/bbae164
PMID:38605641
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11009461/
Abstract

Simulation of RNA-seq reads is critical in the assessment, comparison, benchmarking and development of bioinformatics tools. Yet the field of RNA-seq simulators has progressed little in the last decade. To address this need we have developed BEERS2, which combines a flexible and highly configurable design with detailed simulation of the entire library preparation and sequencing pipeline. BEERS2 takes input transcripts (typically fully length messenger RNA transcripts with polyA tails) from either customizable input or from CAMPAREE simulated RNA samples. It produces realistic reads of these transcripts as FASTQ, SAM or BAM formats with the SAM or BAM formats containing the true alignment to the reference genome. It also produces true transcript-level quantification values. BEERS2 combines a flexible and highly configurable design with detailed simulation of the entire library preparation and sequencing pipeline and is designed to include the effects of polyA selection and RiboZero for ribosomal depletion, hexamer priming sequence biases, GC-content biases in polymerase chain reaction (PCR) amplification, barcode read errors and errors during PCR amplification. These characteristics combine to make BEERS2 the most complete simulation of RNA-seq to date. Finally, we demonstrate the use of BEERS2 by measuring the effect of several settings on the popular Salmon pseudoalignment algorithm.

摘要

RNA-seq 模拟在生物信息学工具的评估、比较、基准测试和开发中至关重要。然而,在过去的十年中,RNA-seq 模拟器领域的进展甚微。为了解决这一需求,我们开发了 BEERS2,它将灵活且高度可配置的设计与整个文库制备和测序流程的详细模拟相结合。BEERS2 从可定制的输入或来自 CAMPAREE 模拟的 RNA 样本中获取输入转录本(通常是带有 polyA 尾巴的全长信使 RNA 转录本)。它以 FASTQ、SAM 或 BAM 格式生成这些转录本的真实读取,其中 SAM 或 BAM 格式包含与参考基因组的真实对齐。它还生成真实的转录本水平定量值。BEERS2 将灵活且高度可配置的设计与整个文库制备和测序流程的详细模拟相结合,旨在包括 polyA 选择和 RiboZero 对核糖体耗竭的影响、六聚体引发序列偏差、聚合酶链反应 (PCR) 扩增中的 GC 含量偏差、条形码读取错误和 PCR 扩增过程中的错误。这些特征结合起来,使 BEERS2 成为迄今为止最完整的 RNA-seq 模拟。最后,我们通过测量几个设置对流行的 Salmon 伪比对算法的影响来展示 BEERS2 的用途。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb98/11009461/37e2be01031a/bbae164f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb98/11009461/76eefe3f52da/bbae164f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb98/11009461/b8601a7a3e7f/bbae164f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb98/11009461/4e9d6ec7d01b/bbae164f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb98/11009461/37e2be01031a/bbae164f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb98/11009461/76eefe3f52da/bbae164f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb98/11009461/b8601a7a3e7f/bbae164f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb98/11009461/4e9d6ec7d01b/bbae164f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb98/11009461/37e2be01031a/bbae164f4.jpg

相似文献

1
BEERS2: RNA-Seq simulation through high fidelity in silico modeling.BEERS2:通过高保真的计算机模拟进行 RNA-Seq 模拟。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae164.
2
BEERS2: RNA-Seq simulation through high fidelity modeling.BEERS2:通过高保真建模进行RNA测序模拟
bioRxiv. 2023 Apr 24:2023.04.21.537847. doi: 10.1101/2023.04.21.537847.
3
CAMPAREE: a robust and configurable RNA expression simulator.CAMPAREE:一款功能强大且可配置的RNA表达模拟器。
BMC Genomics. 2021 Sep 25;22(1):692. doi: 10.1186/s12864-021-07934-2.
4
Grape RNA-Seq analysis pipeline environment.葡萄 RNA-Seq 分析管道环境。
Bioinformatics. 2013 Mar 1;29(5):614-21. doi: 10.1093/bioinformatics/btt016. Epub 2013 Jan 17.
5
Comparative analysis of RNA-Seq alignment algorithms and the RNA-Seq unified mapper (RUM).RNA-Seq 比对算法与 RNA-Seq 统一映射器(RUM)的比较分析。
Bioinformatics. 2011 Sep 15;27(18):2518-28. doi: 10.1093/bioinformatics/btr427. Epub 2011 Jul 19.
6
A Nile rat transcriptomic landscape across 22 organs by ultra-deep sequencing and comparative RNA-seq pipeline (CRSP).通过超深度测序和比较RNA测序流程(CRSP)构建的尼罗大鼠22个器官的转录组图谱。
Comput Biol Chem. 2023 Feb;102:107795. doi: 10.1016/j.compbiolchem.2022.107795. Epub 2022 Nov 23.
7
A computational method for estimating the PCR duplication rate in DNA and RNA-seq experiments.一种用于估计DNA和RNA测序实验中PCR重复率的计算方法。
BMC Bioinformatics. 2017 Mar 14;18(Suppl 3):43. doi: 10.1186/s12859-017-1471-9.
8
Benchmarking differential expression analysis tools for RNA-Seq: normalization-based vs. log-ratio transformation-based methods.RNA-Seq 差异表达分析工具的基准测试:基于标准化与基于对数比变换的方法。
BMC Bioinformatics. 2018 Jul 18;19(1):274. doi: 10.1186/s12859-018-2261-8.
9
CapTrap-seq: a platform-agnostic and quantitative approach for high-fidelity full-length RNA sequencing.CapTrap-seq:一种平台无关且定量的全长 RNA 测序方法。
Nat Commun. 2024 Jun 27;15(1):5278. doi: 10.1038/s41467-024-49523-3.
10
Transcriptome profiling of mouse samples using nanopore sequencing of cDNA and RNA molecules.使用 cDNA 和 RNA 分子的纳米孔测序对小鼠样本进行转录组谱分析。
Sci Rep. 2019 Oct 17;9(1):14908. doi: 10.1038/s41598-019-51470-9.

引用本文的文献

1
Sources of non-uniform coverage in short-read RNA-Seq data.短读长RNA测序数据中覆盖度不均匀的来源。
bioRxiv. 2025 Feb 6:2025.01.30.634337. doi: 10.1101/2025.01.30.634337.

本文引用的文献

1
Access to ground truth at unconstrained size makes simulated data as indispensable as experimental data for bioinformatics methods development and benchmarking.能够获取任意规模的真实数据使得模拟数据对于生物信息学方法的开发和基准测试而言,与实验数据一样不可或缺。
Bioinformatics. 2022 Oct 31;38(21):4994-4996. doi: 10.1093/bioinformatics/btac612.
2
Alevin-fry unlocks rapid, accurate and memory-frugal quantification of single-cell RNA-seq data.Alevin-fry实现了对单细胞RNA测序数据的快速、准确且节省内存的定量分析。
Nat Methods. 2022 Mar;19(3):316-322. doi: 10.1038/s41592-022-01408-3. Epub 2022 Mar 11.
3
A benchmark study of simulation methods for single-cell RNA sequencing data.
单细胞 RNA 测序数据模拟方法的基准研究。
Nat Commun. 2021 Nov 25;12(1):6911. doi: 10.1038/s41467-021-27130-w.
4
CAMPAREE: a robust and configurable RNA expression simulator.CAMPAREE:一款功能强大且可配置的RNA表达模拟器。
BMC Genomics. 2021 Sep 25;22(1):692. doi: 10.1186/s12864-021-07934-2.
5
MOCCASIN: a method for correcting for known and unknown confounders in RNA splicing analysis.MOCCASIN:一种在RNA剪接分析中校正已知和未知混杂因素的方法。
Nat Commun. 2021 Jun 7;12(1):3353. doi: 10.1038/s41467-021-23608-9.
6
Sustainable data analysis with Snakemake.使用 Snakemake 进行可持续数据分析。
F1000Res. 2021 Jan 18;10:33. doi: 10.12688/f1000research.29032.2. eCollection 2021.
7
Comparative evaluation of full-length isoform quantification from RNA-Seq.全长异构体定量的 RNA-Seq 比较评估。
BMC Bioinformatics. 2021 May 25;22(1):266. doi: 10.1186/s12859-021-04198-1.
8
Separating measurement and expression models clarifies confusion in single-cell RNA sequencing analysis.分离测量和表达模型可澄清单细胞 RNA 测序分析中的混淆。
Nat Genet. 2021 Jun;53(6):770-777. doi: 10.1038/s41588-021-00873-4. Epub 2021 May 24.
9
ASimulatoR: splice-aware RNA-Seq data simulation.ASimulatoR:剪接感知RNA测序数据模拟
Bioinformatics. 2021 Sep 29;37(18):3008-3010. doi: 10.1093/bioinformatics/btab142.
10
Preprocessing choices affect RNA velocity results for droplet scRNA-seq data.预处理选择会影响液滴 scRNA-seq 数据的 RNA 速度结果。
PLoS Comput Biol. 2021 Jan 11;17(1):e1008585. doi: 10.1371/journal.pcbi.1008585. eCollection 2021 Jan.