• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用Hydra-Multi进行基于人群的结构变异发现。

Population-based structural variation discovery with Hydra-Multi.

作者信息

Lindberg Michael R, Hall Ira M, Quinlan Aaron R

机构信息

Department of Biochemistry and Molecular Genetics, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA, Department of Medicine, The Genome Institute, Washington University School of Medicine, St. Louis MO, USA and Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA.

Department of Biochemistry and Molecular Genetics, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA, Department of Medicine, The Genome Institute, Washington University School of Medicine, St. Louis MO, USA and Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA Department of Biochemistry and Molecular Genetics, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA, Department of Medicine, The Genome Institute, Washington University School of Medicine, St. Louis MO, USA and Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA Department of Biochemistry and Molecular Genetics, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA, Department of Medicine, The Genome Institute, Washington University School of Medicine, St. Louis MO, USA and Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA Department of Biochemistry and Molecular Genetics, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA, Department of Medicine, The Genome Institute, Washington University School of Medicine, St. Louis MO, USA and Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA.

出版信息

Bioinformatics. 2015 Apr 15;31(8):1286-9. doi: 10.1093/bioinformatics/btu771. Epub 2014 Dec 2.

DOI:10.1093/bioinformatics/btu771
PMID:25527832
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4393510/
Abstract

UNLABELLED

Current strategies for SNP and INDEL discovery incorporate sequence alignments from multiple individuals to maximize sensitivity and specificity. It is widely accepted that this approach also improves structural variant (SV) detection. However, multisample SV analysis has been stymied by the fundamental difficulties of SV calling, e.g. library insert size variability, SV alignment signal integration and detecting long-range genomic rearrangements involving disjoint loci. Extant tools suffer from poor scalability, which limits the number of genomes that can be co-analyzed and complicates analysis workflows. We have developed an approach that enables multisample SV analysis in hundreds to thousands of human genomes using commodity hardware. Here, we describe Hydra-Multi and measure its accuracy, speed and scalability using publicly available datasets provided by The 1000 Genomes Project and by The Cancer Genome Atlas (TCGA).

AVAILABILITY AND IMPLEMENTATION

Hydra-Multi is written in C++ and is freely available at https://github.com/arq5x/Hydra.

CONTACT

aaronquinlan@gmail.com or ihall@genome.wustl.edu

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

未标注

当前用于单核苷酸多态性(SNP)和插入缺失(INDEL)发现的策略纳入了多个个体的序列比对,以最大化敏感性和特异性。人们普遍认为这种方法也能改善结构变异(SV)检测。然而,多样本SV分析一直受到SV检测基本难题的阻碍,例如文库插入片段大小的变异性、SV比对信号整合以及检测涉及不连续位点的长程基因组重排。现有工具存在扩展性差的问题,这限制了可共同分析的基因组数量,并使分析工作流程复杂化。我们开发了一种方法,能够使用商用硬件对数百至数千个人类基因组进行多样本SV分析。在此,我们描述了Hydra-Multi,并使用千人基因组计划和癌症基因组图谱(TCGA)提供的公开可用数据集来衡量其准确性、速度和扩展性。

可用性与实现方式

Hydra-Multi用C++编写,可在https://github.com/arq5x/Hydra上免费获取。

联系方式

aaronquinlan@gmail.com或ihall@genome.wustl.edu

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
Population-based structural variation discovery with Hydra-Multi.使用Hydra-Multi进行基于人群的结构变异发现。
Bioinformatics. 2015 Apr 15;31(8):1286-9. doi: 10.1093/bioinformatics/btu771. Epub 2014 Dec 2.
2
SVScore: an impact prediction tool for structural variation.SVScore:一种用于结构变异的影响预测工具。
Bioinformatics. 2017 Apr 1;33(7):1083-1085. doi: 10.1093/bioinformatics/btw789.
3
Seeksv: an accurate tool for somatic structural variation and virus integration detection.SeekSV:一种用于检测体细胞结构变异和病毒整合的精确工具。
Bioinformatics. 2017 Jan 15;33(2):184-191. doi: 10.1093/bioinformatics/btw591. Epub 2016 Sep 14.
4
SVmine improves structural variation detection by integrative mining of predictions from multiple algorithms.SVmine 通过整合来自多种算法的预测结果来提高结构变异检测的效果。
Bioinformatics. 2017 Nov 1;33(21):3348-3354. doi: 10.1093/bioinformatics/btx455.
5
RAPTR-SV: a hybrid method for the detection of structural variants.RAPTR-SV:一种用于检测结构变异的混合方法。
Bioinformatics. 2015 Jul 1;31(13):2084-90. doi: 10.1093/bioinformatics/btv086. Epub 2015 Feb 16.
6
SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability.SV-Bay:利用贝叶斯方法检测癌症基因组中的结构变异,并对GC含量和读段可映射性进行校正。
Bioinformatics. 2016 Apr 1;32(7):984-92. doi: 10.1093/bioinformatics/btv751. Epub 2016 Jan 6.
7
SVExpress: identifying gene features altered recurrently in expression with nearby structural variant breakpoints.SVExpress:识别在表达上与附近结构变异断点反复改变的基因特征。
BMC Bioinformatics. 2021 Mar 21;22(1):135. doi: 10.1186/s12859-021-04072-0.
8
chainCleaner improves genome alignment specificity and sensitivity.链清洁器提高了基因组比对的特异性和灵敏度。
Bioinformatics. 2017 Jun 1;33(11):1596-1603. doi: 10.1093/bioinformatics/btx024.
9
rMFilter: acceleration of long read-based structure variation calling by chimeric read filtering.rMFilter:通过嵌合读段过滤加速基于长读段的结构变异检测
Bioinformatics. 2017 Sep 1;33(17):2750-2752. doi: 10.1093/bioinformatics/btx279.
10
BreakPoint Surveyor: a pipeline for structural variant visualization.断点勘探者:一种用于结构变异可视化的管道。
Bioinformatics. 2017 Oct 1;33(19):3121-3122. doi: 10.1093/bioinformatics/btx362.

引用本文的文献

1
A Systematic Review of the Advances and New Insights into Copy Number Variations in Plant Genomes.植物基因组拷贝数变异研究进展与新见解的系统综述
Plants (Basel). 2025 May 6;14(9):1399. doi: 10.3390/plants14091399.
2
Different structural variant prediction tools yield considerably different results in Caenorhabditis elegans.不同的结构变异预测工具在秀丽隐杆线虫中产生了相当不同的结果。
PLoS One. 2022 Dec 30;17(12):e0278424. doi: 10.1371/journal.pone.0278424. eCollection 2022.
3
Current status of structural variation studies in plants.

本文引用的文献

1
Breakpoint profiling of 64 cancer genomes reveals numerous complex rearrangements spawned by homology-independent mechanisms.64 例癌症基因组的断点分析揭示了许多由同源非依赖性机制产生的复杂重排。
Genome Res. 2013 May;23(5):762-76. doi: 10.1101/gr.143677.112. Epub 2013 Feb 14.
2
DELLY: structural variant discovery by integrated paired-end and split-read analysis.DELLY:通过整合的 paired-end 和 split-read 分析进行结构变异发现。
Bioinformatics. 2012 Sep 15;28(18):i333-i339. doi: 10.1093/bioinformatics/bts378.
3
SomaticSniper: identification of somatic point mutations in whole genome sequencing data.
植物结构变异研究的现状。
Plant Biotechnol J. 2021 Nov;19(11):2153-2163. doi: 10.1111/pbi.13646. Epub 2021 Jul 20.
4
Statistical Considerations on NGS Data for Inferring Copy Number Variations.关于推断拷贝数变异的 NGS 数据的统计考虑。
Methods Mol Biol. 2021;2243:27-58. doi: 10.1007/978-1-0716-1103-6_2.
5
MaizeCUBIC: a comprehensive variation database for a maize synthetic population.玉米 CUBIC:一个玉米合成群体的综合变异数据库。
Database (Oxford). 2020 Jan 1;2020. doi: 10.1093/database/baaa044.
6
Sequencing XMET genes to promote genotype-guided risk assessment and precision medicine.对 XMET 基因进行测序,以促进基于基因型的风险评估和精准医学。
Sci China Life Sci. 2019 Jul;62(7):895-904. doi: 10.1007/s11427-018-9479-5. Epub 2019 May 20.
7
The sequencing and interpretation of the genome obtained from a Serbian individual.从一名塞尔维亚个体中获得的基因组的测序和解读。
PLoS One. 2018 Dec 19;13(12):e0208901. doi: 10.1371/journal.pone.0208901. eCollection 2018.
8
Human copy number variants are enriched in regions of low mappability.人类拷贝数变异在可定位性低的区域中富集。
Nucleic Acids Res. 2018 Aug 21;46(14):7236-7249. doi: 10.1093/nar/gky538.
9
The Tandem Duplicator Phenotype Is a Prevalent Genome-Wide Cancer Configuration Driven by Distinct Gene Mutations.串联重复表型是一种普遍存在的全基因组癌症结构,由不同的基因突变驱动。
Cancer Cell. 2018 Aug 13;34(2):197-210.e5. doi: 10.1016/j.ccell.2018.06.008. Epub 2018 Jul 12.
10
FusorSV: an algorithm for optimally combining data from multiple structural variation detection methods. FusorSV:一种用于最优组合来自多种结构变异检测方法的数据的算法。
Genome Biol. 2018 Mar 20;19(1):38. doi: 10.1186/s13059-018-1404-6.
SomaticSniper:全基因组测序数据中体细胞点突变的识别。
Bioinformatics. 2012 Feb 1;28(3):311-7. doi: 10.1093/bioinformatics/btr665. Epub 2011 Dec 6.
4
Simultaneous structural variation discovery among multiple paired-end sequenced genomes.同时发现多个配对末端测序基因组中的结构变异。
Genome Res. 2011 Dec;21(12):2203-12. doi: 10.1101/gr.120501.111. Epub 2011 Nov 2.
5
Genome sequencing of mouse induced pluripotent stem cells reveals retroelement stability and infrequent DNA rearrangement during reprogramming.对小鼠诱导多能干细胞进行基因组测序揭示了重编程过程中逆转录元件的稳定性和罕见的 DNA 重排。
Cell Stem Cell. 2011 Oct 4;9(4):366-73. doi: 10.1016/j.stem.2011.07.018.
6
Discovery and genotyping of genome structural polymorphism by sequencing on a population scale.基于人群规模测序的基因组结构多态性的发现和基因分型。
Nat Genet. 2011 Mar;43(3):269-76. doi: 10.1038/ng.768. Epub 2011 Feb 13.
7
Mapping copy number variation by population-scale genome sequencing.通过群体规模的基因组测序来绘制拷贝数变异图谱。
Nature. 2011 Feb 3;470(7332):59-65. doi: 10.1038/nature09708.
8
The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.基因组分析工具包:一种用于分析下一代 DNA 测序数据的 MapReduce 框架。
Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19.
9
Genome-wide mapping and assembly of structural variant breakpoints in the mouse genome.在小鼠基因组中进行全基因组范围内结构变异断点的图谱绘制和组装。
Genome Res. 2010 May;20(5):623-35. doi: 10.1101/gr.102970.109. Epub 2010 Mar 22.
10
VarScan: variant detection in massively parallel sequencing of individual and pooled samples.VarScan:个体样本与混合样本大规模平行测序中的变异检测
Bioinformatics. 2009 Sep 1;25(17):2283-5. doi: 10.1093/bioinformatics/btp373. Epub 2009 Jun 19.