• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Boquila:用于消除序列分析中读取核苷酸偏差的二代测序读段模拟器。

Boquila: NGS read simulator to eliminate read nucleotide bias in sequence analysis.

作者信息

Akköse Ümit, Adebali Ogün

机构信息

Faculty of Engineering and Natural Sciences, Sabancı University, İstanbul, Turkey.

TÜBİTAK Research Institute for Fundamental Sciences, Gebze, Turkey.

出版信息

Turk J Biol. 2023 Feb 21;47(2):158-163. doi: 10.55730/1300-0152.2650. eCollection 2023.

DOI:10.55730/1300-0152.2650
PMID:37529166
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10387831/
Abstract

Sequence content is heterogeneous throughout genomes. Therefore, genome-wide next-generation sequencing (NGS) reads biased towards specific nucleotide profiles are affected by the genome-wide heterogeneous nucleotide distribution. Boquila generates sequences that mimic the nucleotide profile of true reads, which can be used to correct the nucleotide-based bias of genome-wide distribution of NGS reads. Boquila can be configured to generate reads from only specified regions of the reference genome. It also allows the use of input DNA sequencing to correct the bias due to the copy number variations in the genome. Boquila uses standard file formats for input and output data, and it can be easily integrated into any workflow for high-throughput sequencing applications.

摘要

基因组中的序列内容在整个基因组中是异质的。因此,偏向特定核苷酸谱的全基因组下一代测序(NGS)读数会受到全基因组异质核苷酸分布的影响。Boquila生成模仿真实读数核苷酸谱的序列,可用于校正NGS读数全基因组分布的基于核苷酸的偏差。Boquila可以配置为仅从参考基因组的指定区域生成读数。它还允许使用输入DNA测序来校正由于基因组中拷贝数变异引起的偏差。Boquila使用标准文件格式进行输入和输出数据,并且可以很容易地集成到任何高通量测序应用的工作流程中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa6a/10387831/17dbfa4b7147/turkjbiol-47-2-141f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa6a/10387831/17dbfa4b7147/turkjbiol-47-2-141f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa6a/10387831/17dbfa4b7147/turkjbiol-47-2-141f1.jpg

相似文献

1
Boquila: NGS read simulator to eliminate read nucleotide bias in sequence analysis.Boquila:用于消除序列分析中读取核苷酸偏差的二代测序读段模拟器。
Turk J Biol. 2023 Feb 21;47(2):158-163. doi: 10.55730/1300-0152.2650. eCollection 2023.
2
Annotation-based genome-wide SNP discovery in the large and complex Aegilops tauschii genome using next-generation sequencing without a reference genome sequence.基于注释的全基因组 SNP 发现利用下一代测序技术在没有参考基因组序列的情况下在大型复杂的粗山羊草基因组中
BMC Genomics. 2011 Jan 25;12:59. doi: 10.1186/1471-2164-12-59.
3
SeqAssist: a novel toolkit for preliminary analysis of next-generation sequencing data.SeqAssist:一种用于下一代测序数据初步分析的新型工具包。
BMC Bioinformatics. 2014;15 Suppl 11(Suppl 11):S10. doi: 10.1186/1471-2105-15-S11-S10. Epub 2014 Oct 21.
4
Evaluation and assessment of read-mapping by multiple next-generation sequencing aligners based on genome-wide characteristics.基于全基因组特征,对多种新一代测序比对器的读段比对进行评估。
Genomics. 2017 Jul;109(3-4):186-191. doi: 10.1016/j.ygeno.2017.03.001. Epub 2017 Mar 9.
5
UMI-Gen: A UMI-based read simulator for variant calling evaluation in paired-end sequencing NGS libraries.UMI-Gen:一种基于单分子唯一分子标识符(UMI)的读段模拟器,用于双端测序NGS文库中的变异检测评估。
Comput Struct Biotechnol J. 2020 Aug 27;18:2270-2280. doi: 10.1016/j.csbj.2020.08.011. eCollection 2020.
6
SInC: an accurate and fast error-model based simulator for SNPs, Indels and CNVs coupled with a read generator for short-read sequence data.SInC:一种准确且快速的基于错误模型的 SNP、Indel 和 CNV 模拟器,结合了用于短读序列数据的读取生成器。
BMC Bioinformatics. 2014 Feb 5;15:40. doi: 10.1186/1471-2105-15-40.
7
cPlot: Contig-Plotting Visualization for the Analysis of Short-Read Nucleotide Sequence Alignments.cPlot:用于分析短读核苷酸序列比对的重叠群绘图可视化工具。
Int J Mol Sci. 2022 Sep 29;23(19):11484. doi: 10.3390/ijms231911484.
8
Bioinformatics Basics for High-Throughput Hybridization-Based Targeted DNA Sequencing from FFPE-Derived Tumor Specimens: From Reads to Variants.基于FFPE来源肿瘤标本的高通量杂交靶向DNA测序的生物信息学基础:从 reads 到变异体
Methods Mol Biol. 2019;1908:37-48. doi: 10.1007/978-1-4939-9004-7_3.
9
BamView: visualizing and interpretation of next-generation sequencing read alignments.BamView:下一代测序读取比对的可视化和解释。
Brief Bioinform. 2013 Mar;14(2):203-12. doi: 10.1093/bib/bbr073. Epub 2012 Jan 16.
10
Pseudo-Sanger sequencing: massively parallel production of long and near error-free reads using NGS technology.伪桑格测序:使用下一代测序(NGS)技术大规模并行产生长且近乎无错误的 reads。
BMC Genomics. 2013 Oct 17;14(1):711. doi: 10.1186/1471-2164-14-711.

引用本文的文献

1
UV-induced reorganization of 3D genome mediates DNA damage response.紫外线诱导的三维基因组重组介导DNA损伤反应。
Nat Commun. 2025 Feb 5;16(1):1376. doi: 10.1038/s41467-024-55724-7.
2
Global repair is the primary nucleotide excision repair subpathway for the removal of pyrimidine-pyrimidone (6-4) damage from the Arabidopsis genome.全球修复是拟南芥基因组中嘧啶-嘧啶酮(6-4)损伤去除的主要核苷酸切除修复亚途径。
Sci Rep. 2024 Feb 8;14(1):3308. doi: 10.1038/s41598-024-53472-8.
3
The interplay of 3D genome organization with UV-induced DNA damage and repair.

本文引用的文献

1
SomatoSim: precision simulation of somatic single nucleotide variants.SomatoSim:体细胞单核苷酸变异的精确模拟。
BMC Bioinformatics. 2021 Mar 6;22(1):109. doi: 10.1186/s12859-021-04024-8.
2
SimuSCoP: reliably simulate Illumina sequencing data based on position and context dependent profiles.SimuSCoP:基于位置和上下文相关的分布可靠地模拟 Illumina 测序数据。
BMC Bioinformatics. 2020 Jul 23;21(1):331. doi: 10.1186/s12859-020-03665-5.
3
PaSS: a sequencing simulator for PacBio sequencing.PaSS:一种用于 PacBio 测序的测序模拟程序。
三维基因组组织与紫外线诱导的 DNA 损伤和修复的相互作用。
J Biol Chem. 2023 May;299(5):104679. doi: 10.1016/j.jbc.2023.104679. Epub 2023 Apr 5.
BMC Bioinformatics. 2019 Jun 21;20(1):352. doi: 10.1186/s12859-019-2901-7.
4
Systematic benchmarking of omics computational tools.系统生物学计算工具的基准测试。
Nat Commun. 2019 Mar 27;10(1):1393. doi: 10.1038/s41467-019-09406-4.
5
Xome-Blender: A novel cancer genome simulator.Xome-Blender:一种新型癌症基因组模拟器。
PLoS One. 2018 Apr 5;13(4):e0194472. doi: 10.1371/journal.pone.0194472. eCollection 2018.
6
Correcting nucleotide-specific biases in high-throughput sequencing data.校正高通量测序数据中的核苷酸特异性偏差。
BMC Bioinformatics. 2017 Aug 1;18(1):357. doi: 10.1186/s12859-017-1766-x.
7
Dynamic maps of UV damage formation and repair for the human genome.人类基因组中紫外线损伤形成和修复的动态图谱。
Proc Natl Acad Sci U S A. 2017 Jun 27;114(26):6758-6763. doi: 10.1073/pnas.1706522114. Epub 2017 Jun 12.
8
Pysim-sv: a package for simulating structural variation data with GC-biases.Pysim-sv:一个用于模拟具有GC偏差的结构变异数据的软件包。
BMC Bioinformatics. 2017 Mar 14;18(Suppl 3):53. doi: 10.1186/s12859-017-1464-8.
9
NanoSim: nanopore sequence read simulator based on statistical characterization.NanoSim:基于统计特征的纳米孔序列读取模拟器。
Gigascience. 2017 Apr 1;6(4):1-6. doi: 10.1093/gigascience/gix010.
10
Genome-wide transcription-coupled repair in is mediated by the Mfd translocase.细菌中的全基因组转录偶联修复由Mfd转位酶介导。
Proc Natl Acad Sci U S A. 2017 Mar 14;114(11):E2116-E2125. doi: 10.1073/pnas.1700230114. Epub 2017 Feb 6.