• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从全基因组和全外显子组测序中发现等位基因特异性拷贝数

Allele-specific copy-number discovery from whole-genome and whole-exome sequencing.

作者信息

Wang WeiBo, Wang Wei, Sun Wei, Crowley James J, Szatkiewicz Jin P

机构信息

Department of Computer Science, University of North Carolina at Chapel Hill, NC 27599-3175, USA.

Department of Computer Science, University of California, Los Angeles, CA 90095, USA.

出版信息

Nucleic Acids Res. 2015 Aug 18;43(14):e90. doi: 10.1093/nar/gkv319. Epub 2015 Apr 16.

DOI:10.1093/nar/gkv319
PMID:25883151
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4538801/
Abstract

Copy-number variants (CNVs) are a major form of genetic variation and a risk factor for various human diseases, so it is crucial to accurately detect and characterize them. It is conceivable that allele-specific reads from high-throughput sequencing data could be leveraged to both enhance CNV detection and produce allele-specific copy number (ASCN) calls. Although statistical methods have been developed to detect CNVs using whole-genome sequence (WGS) and/or whole-exome sequence (WES) data, information from allele-specific read counts has not yet been adequately exploited. In this paper, we develop an integrated method, called AS-GENSENG, which incorporates allele-specific read counts in CNV detection and estimates ASCN using either WGS or WES data. To evaluate the performance of AS-GENSENG, we conducted extensive simulations, generated empirical data using existing WGS and WES data sets and validated predicted CNVs using an independent methodology. We conclude that AS-GENSENG not only predicts accurate ASCN calls but also improves the accuracy of total copy number calls, owing to its unique ability to exploit information from both total and allele-specific read counts while accounting for various experimental biases in sequence data. Our novel, user-friendly and computationally efficient method and a complete analytic protocol is freely available at https://sourceforge.net/projects/asgenseng/.

摘要

拷贝数变异(CNV)是遗传变异的主要形式,也是多种人类疾病的风险因素,因此准确检测和表征它们至关重要。可以想象,高通量测序数据中的等位基因特异性读数可用于增强CNV检测并生成等位基因特异性拷贝数(ASCN)调用。尽管已经开发了统计方法来使用全基因组序列(WGS)和/或全外显子组序列(WES)数据检测CNV,但等位基因特异性读数计数的信息尚未得到充分利用。在本文中,我们开发了一种名为AS-GENSENG的综合方法,该方法在CNV检测中纳入等位基因特异性读数计数,并使用WGS或WES数据估计ASCN。为了评估AS-GENSENG的性能,我们进行了广泛的模拟,使用现有的WGS和WES数据集生成经验数据,并使用独立方法验证预测的CNV。我们得出结论,AS-GENSENG不仅能预测准确的ASCN调用,还能提高总拷贝数调用的准确性,这得益于其独特的能力,即在考虑序列数据中的各种实验偏差的同时,利用总读数计数和等位基因特异性读数计数的信息。我们新颖、用户友好且计算高效的方法以及完整的分析协议可在https://sourceforge.net/projects/asgenseng/上免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b6/4538801/99d0c79b6b25/gkv319fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b6/4538801/aa698ca60b4e/gkv319fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b6/4538801/a4043520163c/gkv319fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b6/4538801/3aaa528fccb2/gkv319fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b6/4538801/99d0c79b6b25/gkv319fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b6/4538801/aa698ca60b4e/gkv319fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b6/4538801/a4043520163c/gkv319fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b6/4538801/3aaa528fccb2/gkv319fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/22b6/4538801/99d0c79b6b25/gkv319fig4.jpg

相似文献

1
Allele-specific copy-number discovery from whole-genome and whole-exome sequencing.从全基因组和全外显子组测序中发现等位基因特异性拷贝数
Nucleic Acids Res. 2015 Aug 18;43(14):e90. doi: 10.1093/nar/gkv319. Epub 2015 Apr 16.
2
An evaluation of copy number variation detection tools for cancer using whole exome sequencing data.使用全外显子组测序数据对癌症拷贝数变异检测工具的评估
BMC Bioinformatics. 2017 May 31;18(1):286. doi: 10.1186/s12859-017-1705-x.
3
Evaluation of somatic copy number estimation tools for whole-exome sequencing data.全外显子组测序数据的体细胞拷贝数估计工具评估
Brief Bioinform. 2016 Mar;17(2):185-92. doi: 10.1093/bib/bbv055. Epub 2015 Jul 25.
4
Enhanced copy number variants detection from whole-exome sequencing data using EXCAVATOR2.使用EXCAVATOR2从全外显子组测序数据中增强拷贝数变异检测。
Nucleic Acids Res. 2016 Nov 16;44(20):e154. doi: 10.1093/nar/gkw695. Epub 2016 Aug 9.
5
FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing.FACETS:用于高通量DNA测序的等位基因特异性拷贝数和克隆异质性分析工具。
Nucleic Acids Res. 2016 Sep 19;44(16):e131. doi: 10.1093/nar/gkw520. Epub 2016 Jun 7.
6
Exome sequence read depth methods for identifying copy number changes.用于识别拷贝数变化的外显子序列读取深度方法。
Brief Bioinform. 2015 May;16(3):380-92. doi: 10.1093/bib/bbu027. Epub 2014 Aug 28.
7
Polishing copy number variant calls on exome sequencing data via deep learning.通过深度学习对外显子组测序数据进行拷贝数变异的精确分析。
Genome Res. 2022 Jun;32(6):1170-1182. doi: 10.1101/gr.274845.120. Epub 2022 Jun 13.
8
Bamgineer: Introduction of simulated allele-specific copy number variants into exome and targeted sequence data sets.Bamgineer:外显子组和靶向序列数据集模拟等位基因特异性拷贝数变异的引入。
PLoS Comput Biol. 2018 Mar 28;14(3):e1006080. doi: 10.1371/journal.pcbi.1006080. eCollection 2018 Mar.
9
Determining multiallelic complex copy number and sequence variation from high coverage exome sequencing data.从高覆盖度外显子组测序数据中确定多等位基因复合体拷贝数和序列变异
BMC Genomics. 2015 Nov 2;16:891. doi: 10.1186/s12864-015-2123-y.
10
An evaluation of copy number variation detection tools from whole-exome sequencing data.基于全外显子组测序数据的拷贝数变异检测工具评估
Hum Mutat. 2014 Jul;35(7):899-907. doi: 10.1002/humu.22537. Epub 2014 May 1.

引用本文的文献

1
Progress in Methods for Copy Number Variation Profiling.拷贝数变异分析方法的研究进展。
Int J Mol Sci. 2022 Feb 15;23(4):2143. doi: 10.3390/ijms23042143.
2
A random forest-based framework for genotyping and accuracy assessment of copy number variations.一种基于随机森林的拷贝数变异基因分型及准确性评估框架。
NAR Genom Bioinform. 2020 Sep 22;2(3):lqaa071. doi: 10.1093/nargab/lqaa071. eCollection 2020 Sep.
3
DINTD: Detection and Inference of Tandem Duplications From Short Sequencing Reads.DINTD:从短测序 reads 中检测和推断串联重复序列

本文引用的文献

1
Structural architecture of SNP effects on complex traits.单核苷酸多态性(SNP)对复杂性状影响的结构架构。
Am J Hum Genet. 2014 Nov 6;95(5):477-89. doi: 10.1016/j.ajhg.2014.09.009. Epub 2014 Oct 9.
2
Inferring copy number and genotype in tumour exome data.推断肿瘤外显子组数据中的拷贝数和基因型。
BMC Genomics. 2014 Aug 28;15(1):732. doi: 10.1186/1471-2164-15-732.
3
Digital genotyping of macrosatellites and multicopy genes reveals novel biological functions associated with copy number variation of large tandem repeats.巨卫星和多拷贝基因的数字基因分型揭示了与大串联重复序列拷贝数变异相关的新生物学功能。
Front Genet. 2020 Aug 11;11:924. doi: 10.3389/fgene.2020.00924. eCollection 2020.
4
RBV: Read balance validator, a tool for prioritising copy number variations in germline conditions.RBV:读平衡验证器,一种用于优先考虑种系条件下拷贝数变异的工具。
Sci Rep. 2019 Nov 15;9(1):16934. doi: 10.1038/s41598-019-53181-7.
5
Comprehensive evaluation of structural variation detection algorithms for whole genome sequencing.全基因组测序结构变异检测算法的综合评估。
Genome Biol. 2019 Jun 3;20(1):117. doi: 10.1186/s13059-019-1720-5.
6
Association analysis using somatic mutations.基于体细胞突变的关联分析。
PLoS Genet. 2018 Nov 2;14(11):e1007746. doi: 10.1371/journal.pgen.1007746. eCollection 2018 Nov.
7
A randomized approach to speed up the analysis of large-scale read-count data in the application of CNV detection.一种用于加速大规模读计数数据在 CNV 检测中的分析的随机化方法。
BMC Bioinformatics. 2018 Mar 1;19(1):74. doi: 10.1186/s12859-018-2077-6.
8
Assessing the reproducibility of exome copy number variations predictions.评估外显子拷贝数变异预测的可重复性。
Genome Med. 2016 Aug 8;8(1):82. doi: 10.1186/s13073-016-0336-6.
9
Allele-Specific Quantification of Structural Variations in Cancer Genomes.癌症基因组中结构变异的等位基因特异性定量。
Cell Syst. 2016 Jul;3(1):21-34. doi: 10.1016/j.cels.2016.05.007. Epub 2016 Jul 21.
10
Causal variants screened by whole exome sequencing in a patient with maternal uniparental isodisomy of chromosome 10 and a complicated phenotype.对一名患有10号染色体母源性单亲等臂染色体且具有复杂表型的患者进行全外显子组测序筛查出的因果变异。
Exp Ther Med. 2016 Jun;11(6):2247-2253. doi: 10.3892/etm.2016.3241. Epub 2016 Apr 11.
PLoS Genet. 2014 Jun 19;10(6):e1004418. doi: 10.1371/journal.pgen.1004418. eCollection 2014 Jun.
4
An evaluation of copy number variation detection tools from whole-exome sequencing data.基于全外显子组测序数据的拷贝数变异检测工具评估
Hum Mutat. 2014 Jul;35(7):899-907. doi: 10.1002/humu.22537. Epub 2014 May 1.
5
Comparative study of exome copy number variation estimation tools using array comparative genomic hybridization as control.以外显子拷贝数变异估计工具为研究对象,以阵列比较基因组杂交为对照的比较研究。
Biomed Res Int. 2013;2013:915636. doi: 10.1155/2013/915636. Epub 2013 Nov 4.
6
SomatiCA: identifying, characterizing and quantifying somatic copy number aberrations from cancer genome sequencing data.SomatiCA:从癌症基因组测序数据中识别、描述和量化体细胞拷贝数异常。
PLoS One. 2013 Nov 12;8(11):e78143. doi: 10.1371/journal.pone.0078143. eCollection 2013.
7
WaveCNV: allele-specific copy number alterations in primary tumors and xenograft models from next-generation sequencing.WaveCNV:下一代测序中外源模型和原发性肿瘤的等位基因特异性拷贝数改变。
Bioinformatics. 2014 Mar 15;30(6):768-74. doi: 10.1093/bioinformatics/btt611. Epub 2013 Nov 4.
8
Advantage of using allele-specific copy numbers when testing for association in regions with common copy number variants.在具有常见拷贝数变异的区域进行关联测试时使用等位基因特异性拷贝数的优势。
PLoS One. 2013 Sep 10;8(9):e75350. doi: 10.1371/journal.pone.0075350. eCollection 2013.
9
The complex SNP and CNV genetic architecture of the increased risk of congenital heart defects in Down syndrome.唐氏综合征先天性心脏缺陷风险增加的复杂 SNP 和 CNV 遗传结构。
Genome Res. 2013 Sep;23(9):1410-21. doi: 10.1101/gr.147991.112. Epub 2013 Jun 19.
10
eQTL Mapping Using RNA-seq Data.使用RNA测序数据进行表达定量性状位点定位
Stat Biosci. 2013 May 1;5(1):198-219. doi: 10.1007/s12561-012-9068-3.