• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Discovery of tandem and interspersed segmental duplications using high-throughput sequencing.利用高通量测序发现串联和散在的片段重复。
Bioinformatics. 2019 Oct 15;35(20):3923-3930. doi: 10.1093/bioinformatics/btz237.
2
Discovery of large genomic inversions using long range information.利用长程信息发现大型基因组倒位。
BMC Genomics. 2017 Jan 10;18(1):65. doi: 10.1186/s12864-016-3444-1.
3
iSVP: an integrated structural variant calling pipeline from high-throughput sequencing data.iSVP:一种基于高通量测序数据的整合结构变异检测流程
BMC Syst Biol. 2013;7 Suppl 6(Suppl 6):S8. doi: 10.1186/1752-0509-7-S6-S8. Epub 2013 Dec 13.
4
MUM&Co: accurate detection of all SV types through whole-genome alignment.MUM&Co:通过全基因组比对准确检测所有 SV 类型。
Bioinformatics. 2020 May 1;36(10):3242-3243. doi: 10.1093/bioinformatics/btaa115.
5
Sensitive alignment using paralogous sequence variants improves long-read mapping and variant calling in segmental duplications.利用直系同源序列变异进行敏感比对可提高大片段重复区域的长读长序列比对和变异calling 效率。
Nucleic Acids Res. 2020 Nov 4;48(19):e114. doi: 10.1093/nar/gkaa829.
6
SVIM: structural variant identification using mapped long reads.SVIM:基于比对的长读段的结构变异识别。
Bioinformatics. 2019 Sep 1;35(17):2907-2915. doi: 10.1093/bioinformatics/btz041.
7
Toolkit for automated and rapid discovery of structural variants.用于自动化和快速发现结构变体的工具包。
Methods. 2017 Oct 1;129:3-7. doi: 10.1016/j.ymeth.2017.05.030. Epub 2017 Jun 2.
8
GGTyper: genotyping complex structural variants using short-read sequencing data.GGTyper:使用短读测序数据进行基因分型复杂结构变异。
Bioinformatics. 2024 Sep 1;40(Suppl 2):ii11-ii19. doi: 10.1093/bioinformatics/btae391.
9
invMap: a sensitive mapping tool for long noisy reads with inversion structural variants.invMap:一种用于具有反转结构变体的长噪声读取的敏感映射工具。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad726.
10
SV2: accurate structural variation genotyping and de novo mutation detection from whole genomes.SV2:全基因组中精确的结构变异基因分型和从头突变检测。
Bioinformatics. 2018 May 15;34(10):1774-1777. doi: 10.1093/bioinformatics/btx813.

引用本文的文献

1
SV-MeCa: an XGBoost-based meta-caller approach for structural variant calling from short-read data.SV-MeCa:一种基于XGBoost的元调用方法,用于从短读长数据中进行结构变异检测。
BMC Bioinformatics. 2025 Aug 20;26(1):218. doi: 10.1186/s12859-025-06246-6.
2
Genome-Wide Analysis of Genes: Identification, Evolution, Comparative Genomics, Expression Dynamics, and Sub-Cellular Localization in .基因的全基因组分析:在……中的鉴定、进化、比较基因组学、表达动态及亚细胞定位
Plants (Basel). 2025 Jul 14;14(14):2167. doi: 10.3390/plants14142167.
3
Comparative study of tools for copy number variation detection using next-generation sequencing data.使用下一代测序数据进行拷贝数变异检测工具的比较研究
Sci Rep. 2025 Jul 1;15(1):22145. doi: 10.1038/s41598-025-06527-3.
4
Comprehensive evaluation and guidance of structural variation detection tools in chicken whole genome sequence data.鸡全基因组序列数据中结构变异检测工具的综合评估和指导
BMC Genomics. 2024 Oct 16;25(1):970. doi: 10.1186/s12864-024-10875-1.
5
VISTA: an integrated framework for structural variant discovery.VISTA:一个用于结构变异发现的集成框架。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae462.
6
DTDHM: detection of tandem duplications based on hybrid methods using next-generation sequencing data.基于下一代测序数据的混合方法的串联重复检测(DTDHM)。
PeerJ. 2024 Jul 26;12:e17748. doi: 10.7717/peerj.17748. eCollection 2024.
7
Uncovering structural variants associated with body weight and obesity risk in labrador retrievers: a genome-wide study.揭示拉布拉多猎犬中与体重和肥胖风险相关的结构变异:一项全基因组研究。
Front Genet. 2023 Sep 20;14:1235821. doi: 10.3389/fgene.2023.1235821. eCollection 2023.
8
Comparative genome analysis using sample-specific string detection in accurate long reads.在准确的长读段中使用样本特异性字符串检测进行比较基因组分析。
Bioinform Adv. 2021 May 31;1(1):vbab005. doi: 10.1093/bioadv/vbab005. eCollection 2021.
9
SVDSS: structural variation discovery in hard-to-call genomic regions using sample-specific strings from accurate long reads.SVDSS:使用准确长读段中样本特异性字符串在难以测序的基因组区域发现结构变异。
Nat Methods. 2023 Apr;20(4):550-558. doi: 10.1038/s41592-022-01674-1. Epub 2022 Dec 22.
10
CONGA: Copy number variation genotyping in ancient genomes and low-coverage sequencing data.CONGA:古基因组和低覆盖度测序数据中的拷贝数变异基因分型。
PLoS Comput Biol. 2022 Dec 14;18(12):e1010788. doi: 10.1371/journal.pcbi.1010788. eCollection 2022 Dec.

本文引用的文献

1
Multi-platform discovery of haplotype-resolved structural variation in human genomes.多平台发现人类基因组中单体型分辨率结构变异。
Nat Commun. 2019 Apr 16;10(1):1784. doi: 10.1038/s41467-018-08148-z.
2
, an efficient and comprehensive structural variant caller for massive parallel sequencing data.,一种用于大规模平行测序数据的高效且全面的结构变异检测工具。
F1000Res. 2017 May 10;6:664. doi: 10.12688/f1000research.11168.2. eCollection 2017.
3
Toolkit for automated and rapid discovery of structural variants.用于自动化和快速发现结构变体的工具包。
Methods. 2017 Oct 1;129:3-7. doi: 10.1016/j.ymeth.2017.05.030. Epub 2017 Jun 2.
4
Y chromosome palindromes and gene conversion.Y染色体回文序列与基因转换。
Hum Genet. 2017 May;136(5):605-619. doi: 10.1007/s00439-017-1777-8. Epub 2017 Mar 16.
5
Discovery and genotyping of structural variation from long-read haploid genome sequence data.从长读单倍体基因组序列数据中发现结构变异并进行基因分型。
Genome Res. 2017 May;27(5):677-685. doi: 10.1101/gr.214007.116. Epub 2016 Nov 28.
6
Resolving complex structural genomic rearrangements using a randomized approach.使用随机方法解析复杂的结构基因组重排。
Genome Biol. 2016 Jun 10;17(1):126. doi: 10.1186/s13059-016-0993-1.
7
SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability.SV-Bay:利用贝叶斯方法检测癌症基因组中的结构变异,并对GC含量和读段可映射性进行校正。
Bioinformatics. 2016 Apr 1;32(7):984-92. doi: 10.1093/bioinformatics/btv751. Epub 2016 Jan 6.
8
Genetic variation and the de novo assembly of human genomes.人类基因组的遗传变异与从头组装
Nat Rev Genet. 2015 Nov;16(11):627-40. doi: 10.1038/nrg3933. Epub 2015 Oct 7.
9
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
10
Global diversity, population stratification, and selection of human copy-number variation.人类拷贝数变异的全球多样性、群体分层及选择
Science. 2015 Sep 11;349(6253):aab3761. doi: 10.1126/science.aab3761. Epub 2015 Aug 6.

利用高通量测序发现串联和散在的片段重复。

Discovery of tandem and interspersed segmental duplications using high-throughput sequencing.

机构信息

Department of Computer Engineering, Bilkent University, Ankara.

Department of Computer Engineering, Konya Food and Agriculture University, Konya, Turkey.

出版信息

Bioinformatics. 2019 Oct 15;35(20):3923-3930. doi: 10.1093/bioinformatics/btz237.

DOI:10.1093/bioinformatics/btz237
PMID:30937433
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6792081/
Abstract

MOTIVATION

Several algorithms have been developed that use high-throughput sequencing technology to characterize structural variations (SVs). Most of the existing approaches focus on detecting relatively simple types of SVs such as insertions, deletions and short inversions. In fact, complex SVs are of crucial importance and several have been associated with genomic disorders. To better understand the contribution of complex SVs to human disease, we need new algorithms to accurately discover and genotype such variants. Additionally, due to similar sequencing signatures, inverted duplications or gene conversion events that include inverted segmental duplications are often characterized as simple inversions, likewise, duplications and gene conversions in direct orientation may be called as simple deletions. Therefore, there is still a need for accurate algorithms to fully characterize complex SVs and thus improve calling accuracy of more simple variants.

RESULTS

We developed novel algorithms to accurately characterize tandem, direct and inverted interspersed segmental duplications using short read whole genome sequencing datasets. We integrated these methods to our TARDIS tool, which is now capable of detecting various types of SVs using multiple sequence signatures such as read pair, read depth and split read. We evaluated the prediction performance of our algorithms through several experiments using both simulated and real datasets. In the simulation experiments, using a 30× coverage TARDIS achieved 96% sensitivity with only 4% false discovery rate. For experiments that involve real data, we used two haploid genomes (CHM1 and CHM13) and one human genome (NA12878) from the Illumina Platinum Genomes set. Comparison of our results with orthogonal PacBio call sets from the same genomes revealed higher accuracy for TARDIS than state-of-the-art methods. Furthermore, we showed a surprisingly low false discovery rate of our approach for discovery of tandem, direct and inverted interspersed segmental duplications prediction on CHM1 (<5% for the top 50 predictions).

AVAILABILITY AND IMPLEMENTATION

TARDIS source code is available at https://github.com/BilkentCompGen/tardis, and a corresponding Docker image is available at https://hub.docker.com/r/alkanlab/tardis/.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

已经开发了几种算法,这些算法使用高通量测序技术来描述结构变异(SV)。现有的大多数方法都侧重于检测相对简单的 SV 类型,例如插入、缺失和短倒置。事实上,复杂的 SV 至关重要,其中一些与基因组疾病有关。为了更好地理解复杂 SV 对人类疾病的贡献,我们需要新的算法来准确发现和分型此类变体。此外,由于具有相似的测序特征,包括倒置片段重复的倒置重复或基因转换事件通常被描述为简单倒置,同样,直接定向的重复和基因转换可能被称为简单缺失。因此,仍然需要准确的算法来充分描述复杂的 SV,从而提高更简单变体的调用准确性。

结果

我们开发了新的算法,用于使用短读全基因组测序数据集准确地描述串联、直接和倒置分散的片段重复。我们将这些方法集成到我们的 TARDIS 工具中,该工具现在能够使用多个序列特征(如读对、读深度和分裂读)来检测各种类型的 SV。我们通过使用模拟和真实数据集的几个实验来评估我们算法的预测性能。在模拟实验中,使用 30×覆盖的 TARDIS 实现了 96%的灵敏度,假阳性率仅为 4%。对于涉及真实数据的实验,我们使用了 Illumina Platinum Genomes 集中的两个单倍体基因组(CHM1 和 CHM13)和一个人类基因组(NA12878)。与来自同一基因组的正交 PacBio 调用集的结果比较表明,TARDIS 的准确性高于最先进的方法。此外,我们还展示了我们的方法在 CHM1 上预测串联、直接和倒置分散的片段重复时的假阳性率非常低(前 50 个预测的假阳性率<5%)。

可用性和实现

TARDIS 的源代码可在 https://github.com/BilkentCompGen/tardis 上获得,相应的 Docker 镜像可在 https://hub.docker.com/r/alkanlab/tardis/ 上获得。

补充信息

补充数据可在《生物信息学》在线获得。