• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ShRangeSim:下一代测序数据中单核甘酸多态性簇的模拟

ShRangeSim: Simulation of Single Nucleotide Polymorphism Clusters in Next-Generation Sequencing Data.

作者信息

Boenn Markus

机构信息

1 Institute of Computer Science, Martin Luther University Halle-Wittenberg , Halle/Saale, Germany .

2 Department of Soil Ecology, UFZ - Helmholtz Centre for Environmental Research , Halle/Saale, Germany .

出版信息

J Comput Biol. 2018 Jun;25(6):613-622. doi: 10.1089/cmb.2018.0007. Epub 2018 Apr 16.

DOI:10.1089/cmb.2018.0007
PMID:29658778
Abstract

Genomic variations are in the focus of research to uncover mechanisms of host-pathogen interactions and diseases such as cancer. Nowadays, next-generation sequencing (NGS) data are analyzed through dedicated pipelines to detect them. Surrogate NGS data in conjunction with genomic variations help to evaluate pipelines and validate their outcomes, fostering selection of proper tools for a given scientific question. I describe how existing approaches for simulating NGS data in conjunction with genomic variations fail to model local enrichments of single nucleotide polymorphisms (SNPs), so called SNP clusters. Two distributions for count data are applied to publicly available collections of genomic variations. The results suggest modeling of SNP cluster sizes by overdispersion-aware distributions.

摘要

基因组变异是揭示宿主-病原体相互作用机制以及癌症等疾病机制研究的重点。如今,通过专门的流程对下一代测序(NGS)数据进行分析以检测这些变异。替代NGS数据与基因组变异相结合有助于评估流程并验证其结果,促进针对特定科学问题选择合适的工具。我描述了现有的结合基因组变异模拟NGS数据的方法是如何无法对单核苷酸多态性(SNP)的局部富集(即所谓的SNP簇)进行建模的。将两种计数数据分布应用于公开可用的基因组变异集合。结果表明可用考虑过离散的分布对SNP簇大小进行建模。

相似文献

1
ShRangeSim: Simulation of Single Nucleotide Polymorphism Clusters in Next-Generation Sequencing Data.ShRangeSim:下一代测序数据中单核甘酸多态性簇的模拟
J Comput Biol. 2018 Jun;25(6):613-622. doi: 10.1089/cmb.2018.0007. Epub 2018 Apr 16.
2
Review of alignment and SNP calling algorithms for next-generation sequencing data.下一代测序数据的比对和单核苷酸多态性(SNP)检测算法综述。
J Appl Genet. 2016 Feb;57(1):71-9. doi: 10.1007/s13353-015-0292-7. Epub 2015 Jun 9.
3
NGS_SNPAnalyzer: a desktop software supporting genome projects by identifying and visualizing sequence variations from next-generation sequencing data.NGS_SNPAnalyzer:一款桌面软件,通过识别和可视化来自下一代测序数据的序列变异,支持基因组项目。
Genes Genomics. 2020 Nov;42(11):1311-1317. doi: 10.1007/s13258-020-00997-7. Epub 2020 Sep 26.
4
Genome-Wide SNP Calling from Genotyping by Sequencing (GBS) Data: A Comparison of Seven Pipelines and Two Sequencing Technologies.基于测序基因分型(GBS)数据的全基因组单核苷酸多态性(SNP)检测:七种流程和两种测序技术的比较
PLoS One. 2016 Aug 22;11(8):e0161333. doi: 10.1371/journal.pone.0161333. eCollection 2016.
5
Challenges in exome analysis by LifeScope and its alternative computational pipelines.LifeScope及其替代计算流程在全外显子组分析中的挑战。
BMC Res Notes. 2015 Sep 7;8:421. doi: 10.1186/s13104-015-1385-4.
6
Comparison of seven SNP calling pipelines for the next-generation sequencing data of chickens.比较用于鸡下一代测序数据的七种 SNP 调用管道。
PLoS One. 2022 Jan 31;17(1):e0262574. doi: 10.1371/journal.pone.0262574. eCollection 2022.
7
Pipeliner: software to evaluate the performance of bioinformatics pipelines for next-generation resequencing.Pipeliner:用于评估下一代重测序生物信息学管道性能的软件。
Mol Ecol Resour. 2015 Jan;15(1):99-106. doi: 10.1111/1755-0998.12286. Epub 2014 Jun 28.
8
A comparison of tools for the simulation of genomic next-generation sequencing data.用于模拟基因组下一代测序数据的工具比较。
Nat Rev Genet. 2016 Aug;17(8):459-69. doi: 10.1038/nrg.2016.57. Epub 2016 Jun 20.
9
Bioinformatics: identification of markers from next-generation sequence data.生物信息学:从下一代测序数据中识别标记物。
Methods Mol Biol. 2015;1245:29-47. doi: 10.1007/978-1-4939-1966-6_3.
10
Bioinformatics Data Analysis of Next-Generation Sequencing Data from Heterogeneous Tumor Samples.来自异质性肿瘤样本的下一代测序数据的生物信息学数据分析
Methods Mol Biol. 2017;1633:185-192. doi: 10.1007/978-1-4939-7142-8_12.

引用本文的文献

1
GeDi: applying suffix arrays to increase the repertoire of detectable SNVs in tumour genomes.GeDi:应用后缀数组增加肿瘤基因组中可检测 SNV 的种类。
BMC Bioinformatics. 2020 Feb 5;21(1):45. doi: 10.1186/s12859-020-3367-3.
2
A broad survey of DNA sequence data simulation tools.DNA 序列数据模拟工具的广泛调查。
Brief Funct Genomics. 2020 Jan 22;19(1):49-59. doi: 10.1093/bfgp/elz033.