• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

NGSEP3:跨物种和测序协议的准确变异调用。

NGSEP3: accurate variant calling across species and sequencing protocols.

机构信息

Systems and Computing Engineering Department, Universidad de los Andes, Bogotá 111711, Colombia.

Biotechnology lab, Centro de Investigación de la caña de azúcar de Colombia, CENICAÑA, Cali 760046, Colombia.

出版信息

Bioinformatics. 2019 Nov 1;35(22):4716-4723. doi: 10.1093/bioinformatics/btz275.

DOI:10.1093/bioinformatics/btz275
PMID:31099384
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6853766/
Abstract

MOTIVATION

Accurate detection, genotyping and downstream analysis of genomic variants from high-throughput sequencing data are fundamental features in modern production pipelines for genetic-based diagnosis in medicine or genomic selection in plant and animal breeding. Our research group maintains the Next-Generation Sequencing Experience Platform (NGSEP) as a precise, efficient and easy-to-use software solution for these features.

RESULTS

Understanding that incorrect alignments around short tandem repeats are an important source of genotyping errors, we implemented in NGSEP new algorithms for realignment and haplotype clustering of reads spanning indels and short tandem repeats. We performed extensive benchmark experiments comparing NGSEP to state-of-the-art software using real data from three sequencing protocols and four species with different distributions of repetitive elements. NGSEP consistently shows comparative accuracy and better efficiency compared to the existing solutions. We expect that this work will contribute to the continuous improvement of quality in variant calling needed for modern applications in medicine and agriculture.

AVAILABILITY AND IMPLEMENTATION

NGSEP is available as open source software at http://ngsep.sf.net.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

从高通量测序数据中准确检测、基因分型和下游分析基因组变异是医学中基于遗传的诊断或植物和动物育种中基于基因组选择的现代生产管道中的基本特征。我们的研究小组维护着下一代测序体验平台(NGSEP),作为一种精确、高效和易于使用的软件解决方案,具有这些功能。

结果

我们认识到短串联重复周围不正确的比对是基因分型错误的一个重要来源,因此在 NGSEP 中实现了新的算法,用于重新比对和单倍型聚类跨越插入缺失和短串联重复的读取。我们使用来自三种测序方案和四种具有不同重复元件分布的物种的真实数据,对 NGSEP 与最先进的软件进行了广泛的基准测试实验。与现有解决方案相比,NGSEP 始终表现出相当的准确性和更好的效率。我们期望这项工作将有助于不断提高医学和农业现代应用中所需的变异调用质量。

可用性和实现

NGSEP 可在 http://ngsep.sf.net 上作为开源软件获得。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1013/6853766/bc4e28307449/btz275f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1013/6853766/471adcfbea71/btz275f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1013/6853766/91ed781d7a23/btz275f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1013/6853766/9cd6e5b826c7/btz275f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1013/6853766/bc4e28307449/btz275f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1013/6853766/471adcfbea71/btz275f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1013/6853766/91ed781d7a23/btz275f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1013/6853766/9cd6e5b826c7/btz275f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1013/6853766/bc4e28307449/btz275f4.jpg

相似文献

1
NGSEP3: accurate variant calling across species and sequencing protocols.NGSEP3:跨物种和测序协议的准确变异调用。
Bioinformatics. 2019 Nov 1;35(22):4716-4723. doi: 10.1093/bioinformatics/btz275.
2
Bioinformatic analysis of genotype by sequencing (GBS) data with NGSEP.使用NGSEP对测序基因分型(GBS)数据进行生物信息学分析。
BMC Genomics. 2016 Aug 31;17 Suppl 5(Suppl 5):498. doi: 10.1186/s12864-016-2827-7.
3
An integrated framework for discovery and genotyping of genomic variants from high-throughput sequencing experiments.一个用于从高通量测序实验中发现基因组变异并进行基因分型的综合框架。
Nucleic Acids Res. 2014 Apr;42(6):e44. doi: 10.1093/nar/gkt1381. Epub 2014 Jan 11.
4
Accurate, Efficient and User-Friendly Mutation Calling and Sample Identification for TILLING Experiments.用于定向诱导基因组局部突变(TILLING)实验的准确、高效且用户友好的突变检测与样本识别
Front Genet. 2021 Feb 3;12:624513. doi: 10.3389/fgene.2021.624513. eCollection 2021.
5
Comparative analysis of algorithms for next-generation sequencing read alignment.下一代测序读段比对算法的比较分析。
Bioinformatics. 2011 Oct 15;27(20):2790-6. doi: 10.1093/bioinformatics/btr477. Epub 2011 Aug 19.
6
mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development.mInDel:一种用于全基因组插入缺失标记开发的高通量高效流程
BMC Genomics. 2016 Apr 14;17:290. doi: 10.1186/s12864-016-2614-5.
7
Robust and efficient software for reference-free genomic diversity analysis of genotyping-by-sequencing data on diploid and polyploid species.用于对二倍体和多倍体物种的测序基因分型数据进行无参考基因组多样性分析的强大且高效的软件。
Mol Ecol Resour. 2022 Jan;22(1):439-454. doi: 10.1111/1755-0998.13477. Epub 2021 Jul 29.
8
ReliableGenome: annotation of genomic regions with high/low variant calling concordance.可靠基因组:具有高/低变异检测一致性的基因组区域注释。
Bioinformatics. 2017 Jan 15;33(2):155-160. doi: 10.1093/bioinformatics/btw587. Epub 2016 Sep 7.
9
Improved indel detection in DNA and RNA via realignment with ABRA2.通过与 ABRA2 重新比对,提高 DNA 和 RNA 中的插入缺失检测。
Bioinformatics. 2019 Sep 1;35(17):2966-2973. doi: 10.1093/bioinformatics/btz033.
10
INDELseek: detection of complex insertions and deletions from next-generation sequencing data.INDELseek:从下一代测序数据中检测复杂插入和缺失
BMC Genomics. 2017 Jan 5;18(1):16. doi: 10.1186/s12864-016-3449-9.

引用本文的文献

1
Flavonoid pathway intermediates implicate UVR8 in functions beyond canonical UV-B signaling.类黄酮途径中间体表明UVR8具有超出经典UV-B信号传导的功能。
Nat Commun. 2025 Aug 21;16(1):7810. doi: 10.1038/s41467-025-63010-3.
2
Holobiont-based genetic analysis reveals new plant and microbial markers for resistance against a root rot pathogen complex in pea.基于共生功能体的遗传分析揭示了豌豆抗根腐病原菌复合体的新植物和微生物标记。
BMC Plant Biol. 2025 Aug 9;25(1):1053. doi: 10.1186/s12870-025-06995-9.
3
Plastomic studies inform the mechanisms of edaphic adaptation in North American species in the tribe Thelypodieae (Brassicaceae).

本文引用的文献

1
A universal SNP and small-indel variant caller using deep neural networks.使用深度神经网络的通用 SNP 和小插入缺失变体调用器。
Nat Biotechnol. 2018 Nov;36(10):983-987. doi: 10.1038/nbt.4235. Epub 2018 Sep 24.
2
Strelka2: fast and accurate calling of germline and somatic variants.Strelka2:快速准确地调用种系和体细胞变异。
Nat Methods. 2018 Aug;15(8):591-594. doi: 10.1038/s41592-018-0051-x. Epub 2018 Jul 16.
3
A synthetic-diploid benchmark for accurate variant-calling evaluation.用于准确变异呼叫评估的合成二倍体基准。
质体基因组研究揭示了北美芥族(十字花科)物种对土壤环境适应的机制。
Am J Bot. 2025 Jul;112(7):e70071. doi: 10.1002/ajb2.70071. Epub 2025 Jul 14.
4
Genetic diversity and comparative genomics across Leishmania (Viannia) species.利什曼原虫(维扬亚属)物种间的遗传多样性与比较基因组学
Commun Biol. 2025 Jun 14;8(1):925. doi: 10.1038/s42003-025-08331-1.
5
Chromosome-scale assemblies of three Ormosia species: repetitive sequences distribution and structural rearrangement.三种红豆属植物的染色体水平组装:重复序列分布与结构重排
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf047.
6
Genomic characterization of Leishmania (V.) braziliensis associated with antimony therapeutic failure and variable in vitro tolerance to amphotericin B.与锑治疗失败及对两性霉素B的体外耐受性可变相关的巴西利什曼原虫(Viannia亚属)的基因组特征
Sci Rep. 2025 Apr 15;15(1):12973. doi: 10.1038/s41598-025-96849-z.
7
Unlocking the geography of Azobé timber (Lophira alata): revealing spatial genetic structure beyond species boundaries.揭开红铁木(Lophira alata)的地理分布:揭示超越物种界限的空间遗传结构。
BMC Plant Biol. 2025 Mar 12;25(1):315. doi: 10.1186/s12870-025-06287-2.
8
GWAS-assisted and multitrait genomic prediction for improvement of seed yield and canning quality traits in a black bean breeding panel.在一个黑豆育种群体中,利用全基因组关联研究辅助和多性状基因组预测来改良种子产量和罐头品质性状。
G3 (Bethesda). 2025 Mar 18;15(3). doi: 10.1093/g3journal/jkaf007.
9
Genetic mechanisms of axial patterning in .……中轴向模式形成的遗传机制 。 (你提供的原文不完整,我只能根据已有内容翻译到这种程度。)
Evol Lett. 2024 Aug 7;8(6):893-901. doi: 10.1093/evlett/qrae041. eCollection 2024 Dec.
10
Genetic diversity of Anadara tuberculosa in two localities of the Colombian Pacific Coast.哥伦比亚太平洋沿岸两个地区的泥蚶遗传多样性。
Sci Rep. 2024 Nov 18;14(1):28467. doi: 10.1038/s41598-024-78869-3.
Nat Methods. 2018 Aug;15(8):595-597. doi: 10.1038/s41592-018-0054-7. Epub 2018 Jul 16.
4
A review of somatic single nucleotide variant calling algorithms for next-generation sequencing data.用于下一代测序数据的体细胞单核苷酸变异检测算法综述。
Comput Struct Biotechnol J. 2018 Feb 6;16:15-24. doi: 10.1016/j.csbj.2018.01.003. eCollection 2018.
5
A tandem simulation framework for predicting mapping quality.一种用于预测映射质量的串联模拟框架。
Genome Biol. 2017 Aug 10;18(1):152. doi: 10.1186/s13059-017-1290-3.
6
Optimized detection of insertions/deletions (INDELs) in whole-exome sequencing data.全外显子组测序数据中插入/缺失(INDELs)的优化检测
PLoS One. 2017 Aug 9;12(8):e0182272. doi: 10.1371/journal.pone.0182272. eCollection 2017.
7
Evaluating Variant Calling Tools for Non-Matched Next-Generation Sequencing Data.评估用于非配对下一代测序数据的变异调用工具。
Sci Rep. 2017 Feb 24;7:43169. doi: 10.1038/srep43169.
8
Detailed simulation of cancer exome sequencing data reveals differences and common limitations of variant callers.癌症外显子组测序数据的详细模拟揭示了变异检测工具的差异和常见局限性。
BMC Bioinformatics. 2017 Jan 3;18(1):8. doi: 10.1186/s12859-016-1417-7.
9
VarMatch: robust matching of small variant datasets using flexible scoring schemes.VarMatch:使用灵活评分方案对小变异数据集进行稳健匹配。
Bioinformatics. 2017 May 1;33(9):1301-1308. doi: 10.1093/bioinformatics/btw797.
10
A reference data set of 5.4 million phased human variants validated by genetic inheritance from sequencing a three-generation 17-member pedigree.通过对一个包含17名成员的三代家系进行测序,经遗传继承验证的540万个定相人类变异的参考数据集。
Genome Res. 2017 Jan;27(1):157-164. doi: 10.1101/gr.210500.116. Epub 2016 Nov 30.