• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

改进结构变异聚类,以减少断点不确定性问题的负面影响。

Improving structural variant clustering to reduce the negative effect of the breakpoint uncertainty problem.

机构信息

Department of Biology and Medical Genetics, Second Faculty of Medicine, Charles University and University Hospital Motol, V Úvalu 84, 15006, Prague, Czech Republic.

Department of Biology and Medical Genetics, First Faculty of Medicine, Charles University and General University Hospital in Prague, Albertov 4, 128 00, Prague, Czech Republic.

出版信息

BMC Bioinformatics. 2021 Sep 27;22(1):464. doi: 10.1186/s12859-021-04374-3.

DOI:10.1186/s12859-021-04374-3
PMID:34579642
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8474851/
Abstract

BACKGROUND

Structural variants (SVs) represent an important source of genetic variation. One of the most critical problems in their detection is breakpoint uncertainty associated with the inability to determine their exact genomic position. Breakpoint uncertainty is a characteristic issue of structural variants detected via short-read sequencing methods and complicates subsequent population analyses. The commonly used heuristic strategy reduces this issue by clustering/merging nearby structural variants of the same type before the data from individual samples are merged.

RESULTS

We compared the two most used dissimilarity measures for SV clustering in terms of Mendelian inheritance errors (MIE), kinship prediction, and deviation from Hardy-Weinberg equilibrium. We analyzed the occurrence of Mendelian-inconsistent SV clusters that can be collapsed into one Mendelian-consistent SV as a new measure of dataset consistency. We also developed a new method based on constrained clustering that explicitly identifies these types of clusters.

CONCLUSIONS

We found that the dissimilarity measure based on the distance between SVs breakpoints produces slightly better results than the measure based on SVs overlap. This difference is evident in trivial and corrected clustering strategy, but not in constrained clustering strategy. However, constrained clustering strategy provided the best results in all aspects, regardless of the dissimilarity measure used.

摘要

背景

结构变异(SVs)是遗传变异的重要来源之一。在检测它们时,最关键的问题之一是由于无法确定其确切的基因组位置而导致的断点不确定性。断点不确定性是通过短读测序方法检测到的结构变异的一个特征问题,这使得后续的群体分析变得复杂。常用的启发式策略通过在合并来自个体样本的数据之前,对同一类型的附近结构变异进行聚类/合并,来减少这个问题。

结果

我们比较了两种最常用的用于 SV 聚类的距离度量方法,即孟德尔遗传错误(MIE)、亲缘关系预测和偏离哈迪-温伯格平衡。我们分析了可以合并为一个孟德尔一致的 SV 的 Mendelian 不一致的 SV 聚类的发生情况,作为数据集一致性的新度量。我们还开发了一种基于约束聚类的新方法,该方法可以明确识别这些类型的聚类。

结论

我们发现,基于 SV 断点之间距离的距离度量方法比基于 SV 重叠的度量方法产生的结果略好。这种差异在琐碎和校正的聚类策略中很明显,但在约束聚类策略中不明显。然而,无论使用哪种距离度量方法,约束聚类策略都在所有方面都提供了最佳的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a6/8474851/9bab7d4b4e63/12859_2021_4374_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a6/8474851/a06da26739b7/12859_2021_4374_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a6/8474851/4669c295d182/12859_2021_4374_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a6/8474851/6ad070b580f9/12859_2021_4374_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a6/8474851/611e61d69856/12859_2021_4374_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a6/8474851/9bab7d4b4e63/12859_2021_4374_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a6/8474851/a06da26739b7/12859_2021_4374_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a6/8474851/4669c295d182/12859_2021_4374_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a6/8474851/6ad070b580f9/12859_2021_4374_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a6/8474851/611e61d69856/12859_2021_4374_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10a6/8474851/9bab7d4b4e63/12859_2021_4374_Fig5_HTML.jpg

相似文献

1
Improving structural variant clustering to reduce the negative effect of the breakpoint uncertainty problem.改进结构变异聚类,以减少断点不确定性问题的负面影响。
BMC Bioinformatics. 2021 Sep 27;22(1):464. doi: 10.1186/s12859-021-04374-3.
2
Small polymorphisms are a source of ancestral bias in structural variant breakpoint placement.小的多态性是结构变异断点位置中祖先偏见的一个来源。
Genome Res. 2024 Feb 7;34(1):7-19. doi: 10.1101/gr.278203.123.
3
svclassify: a method to establish benchmark structural variant calls.svclassify:一种建立基准结构变异调用的方法。
BMC Genomics. 2016 Jan 16;17:64. doi: 10.1186/s12864-016-2366-2.
4
SVExpress: identifying gene features altered recurrently in expression with nearby structural variant breakpoints.SVExpress:识别在表达上与附近结构变异断点反复改变的基因特征。
BMC Bioinformatics. 2021 Mar 21;22(1):135. doi: 10.1186/s12859-021-04072-0.
5
nanotatoR: a tool for enhanced annotation of genomic structural variants.纳米标记器:一种用于增强基因组结构变异注释的工具。
BMC Genomics. 2021 Jan 6;22(1):10. doi: 10.1186/s12864-020-07182-w.
6
Detecting inherited and novel structural variants in low-coverage parent-child sequencing data.检测低覆盖度亲子测序数据中的遗传和新结构变异。
Methods. 2020 Feb 15;173:61-68. doi: 10.1016/j.ymeth.2019.06.025. Epub 2019 Jul 2.
7
Enhancing breakpoint resolution with deep segmentation model: A general refinement method for read-depth based structural variant callers.利用深度分割模型提高断点分辨率:一种基于读取深度的结构变异调用器的通用细化方法。
PLoS Comput Biol. 2021 Oct 11;17(10):e1009186. doi: 10.1371/journal.pcbi.1009186. eCollection 2021 Oct.
8
Genome sequencing in cytogenetics: Comparison of short-read and linked-read approaches for germline structural variant detection and characterization.基因组测序在细胞遗传学中的应用:短读长和连接读长方法在种系结构变异检测和特征分析中的比较。
Mol Genet Genomic Med. 2020 Mar;8(3):e1114. doi: 10.1002/mgg3.1114. Epub 2020 Jan 27.
9
Small allelic variants are a source of ancestral bias in structural variant breakpoint placement.小等位基因变异是结构变异断点定位中祖先偏差的一个来源。
bioRxiv. 2023 Jun 26:2023.06.25.546295. doi: 10.1101/2023.06.25.546295.
10
Complex structural variants in Mendelian disorders: identification and breakpoint resolution using short- and long-read genome sequencing.孟德尔疾病中的复杂结构变异:使用短读长和长读长基因组测序进行鉴定和断点解析。
Genome Med. 2018 Dec 7;10(1):95. doi: 10.1186/s13073-018-0606-6.

本文引用的文献

1
Improved structural variant interpretation for hereditary cancer susceptibility using long-read sequencing.利用长读测序提高遗传性癌症易感性的结构变异解读。
Genet Med. 2020 Nov;22(11):1892-1897. doi: 10.1038/s41436-020-0880-8. Epub 2020 Jul 6.
2
Long-read human genome sequencing and its applications.长读长基因组测序及其应用。
Nat Rev Genet. 2020 Oct;21(10):597-614. doi: 10.1038/s41576-020-0236-x. Epub 2020 Jun 5.
3
A structural variation reference for medical and population genetics.医学和人群遗传学的结构变异参考
Nature. 2020 May;581(7809):444-451. doi: 10.1038/s41586-020-2287-8. Epub 2020 May 27.
4
Mapping and characterization of structural variation in 17,795 human genomes.人类基因组 17795 号结构变异的定位与特征分析。
Nature. 2020 Jul;583(7814):83-89. doi: 10.1038/s41586-020-2371-0. Epub 2020 May 27.
5
Complex structural variants in Mendelian disorders: identification and breakpoint resolution using short- and long-read genome sequencing.孟德尔疾病中的复杂结构变异:使用短读长和长读长基因组测序进行鉴定和断点解析。
Genome Med. 2018 Dec 7;10(1):95. doi: 10.1186/s13073-018-0606-6.
6
Demerelate: calculating interindividual relatedness for kinship analysis based on codominant diploid genetic markers using R.Demerelate:使用 R 计算基于共显性二倍体遗传标记的亲缘关系分析中的个体间相关性。
Mol Ecol Resour. 2017 Nov;17(6):1371-1377. doi: 10.1111/1755-0998.12666. Epub 2017 Apr 6.
7
Transient structural variations have strong effects on quantitative traits and reproductive isolation in fission yeast.瞬时结构变异对裂殖酵母的数量性状和生殖隔离有强烈影响。
Nat Commun. 2017 Jan 24;8:14061. doi: 10.1038/ncomms14061.
8
Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications.Manta:用于种系和癌症测序应用的结构变异和插入缺失的快速检测。
Bioinformatics. 2016 Apr 15;32(8):1220-2. doi: 10.1093/bioinformatics/btv710. Epub 2015 Dec 8.
9
An integrated map of structural variation in 2,504 human genomes.2504个人类基因组结构变异的整合图谱。
Nature. 2015 Oct 1;526(7571):75-81. doi: 10.1038/nature15394.
10
Estimation of relatedness by DNA fingerprinting.通过DNA指纹图谱估计亲缘关系。
Mol Biol Evol. 1988 Sep;5(5):584-99. doi: 10.1093/oxfordjournals.molbev.a040518.