• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过比较和整合来自两种不同测序平台的数据来修正个人基因组。

Revising a personal genome by comparing and combining data from two different sequencing platforms.

机构信息

Lee Gil Ya Cancer and Diabetes Institute, Gachon University, Incheon, Korea.

出版信息

PLoS One. 2013 Apr 8;8(4):e60585. doi: 10.1371/journal.pone.0060585. Print 2013.

DOI:10.1371/journal.pone.0060585
PMID:23593254
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3620462/
Abstract

For the robust practice of genomic medicine, sequencing results must be compatible, regardless of the sequencing technologies and algorithms used. Presently, genome sequencing is still an imprecise science and is complicated by differences in the chemistry, coverage, alignment, and variant-calling algorithms. We identified 3.33 million single nucleotide variants (SNVs) and ~3.62 million SNVs in the SJK genome using SOLiD and Illumina data, respectively. Approximately 3 million SNVs were concordant between the two platforms while 68,532 SNVs were discordant; 219,616 SNVs were SOLiD-specific and 516,080 SNVs were Illumina-specific (i.e., platform-specific). Concordant, discordant, and platform-specific SNVs were further analyzed and characterized. Overall, a large portion of heterozygous SNVs that were discordant with genotyping calls of single nucleotide polymorphism chips were highly confident. Approximately 70% of the platform-specific SNVs were located in regions containing repetitive sequences. Such platform-specificity may arise from differences between platforms, with regard to read length (36 bp and 72 bp vs. 50 bp), insert size (100-300 bp vs. ~1-2 kb), sequencing chemistry (sequencing-by-synthesis using single nucleotides vs. ligation-based sequencing using oligomers), and sequencing quality. When data from the two platforms were merged for variant calling, the proportion of callable regions of the reference genome increased to 99.66%, which was 1.43% higher than the average callability of the two platforms, representing ~40 million bases. In this study, we compared the differences in sequencing results between two sequencing platforms. Approximately 90% of the SNVs were concordant between the two platforms, yet ~10% of the SNVs were either discordant or platform-specific, indicating that each platform had its own strengths and weaknesses. When data from the two platforms were merged, both the overall callability of the reference genome and the overall accuracy of the SNVs improved, demonstrating the likelihood that a re-sequenced genome can be revised using complementary data.

摘要

为了实现基因组医学的稳健实践,无论使用何种测序技术和算法,测序结果都必须具有一致性。目前,基因组测序仍然是一门不精确的科学,并且由于化学、覆盖度、比对和变异调用算法的差异而变得复杂。我们分别使用 SOLiD 和 Illumina 数据在 SJK 基因组中鉴定出约 333 万个单核苷酸变异(SNV)和约 362 万个 SNV。两个平台之间约有 300 万个 SNV 是一致的,而有 68532 个 SNV 是不一致的;219616 个 SNV 是 SOLiD 特异性的,而 516080 个 SNV 是 Illumina 特异性的(即平台特异性的)。我们进一步分析和描述了一致的、不一致的和平台特异性的 SNV。总的来说,大部分与单核苷酸多态性芯片的基因分型调用不一致的杂合性 SNV 具有高度可信度。大约 70%的平台特异性 SNV 位于含有重复序列的区域。这种平台特异性可能源于平台之间的差异,包括读长(36 个碱基对和 72 个碱基对与 50 个碱基对)、插入大小(100-300 个碱基对与1-2 kb)、测序化学(使用单核苷酸的合成测序与使用寡核苷酸的连接测序)和测序质量。当将两个平台的数据合并进行变异调用时,参考基因组的可调用区域比例增加到 99.66%,比两个平台的平均可调用性高 1.43%,代表约 4000 万个碱基。在这项研究中,我们比较了两个测序平台的测序结果差异。两个平台之间约 90%的 SNV 是一致的,但约 10%的 SNV 是不一致的或平台特异性的,这表明每个平台都有其自身的优势和劣势。当将两个平台的数据合并时,参考基因组的整体可调用性和 SNV 的整体准确性都得到了提高,这表明使用互补数据可以对重新测序的基因组进行修订。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/07b4/3620462/eb1e7b745b2b/pone.0060585.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/07b4/3620462/8b6970f46992/pone.0060585.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/07b4/3620462/94e70e7dbef1/pone.0060585.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/07b4/3620462/eb1e7b745b2b/pone.0060585.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/07b4/3620462/8b6970f46992/pone.0060585.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/07b4/3620462/94e70e7dbef1/pone.0060585.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/07b4/3620462/eb1e7b745b2b/pone.0060585.g003.jpg

相似文献

1
Revising a personal genome by comparing and combining data from two different sequencing platforms.通过比较和整合来自两种不同测序平台的数据来修正个人基因组。
PLoS One. 2013 Apr 8;8(4):e60585. doi: 10.1371/journal.pone.0060585. Print 2013.
2
Performance comparison of whole-genome sequencing platforms.全基因组测序平台的性能比较。
Nat Biotechnol. 2011 Dec 18;30(1):78-82. doi: 10.1038/nbt.2065.
3
Short and long-read genome sequencing methodologies for somatic variant detection; genomic analysis of a patient with diffuse large B-cell lymphoma.短读长读基因组测序方法用于体细胞变异检测;弥漫性大 B 细胞淋巴瘤患者的基因组分析。
Sci Rep. 2021 Mar 19;11(1):6408. doi: 10.1038/s41598-021-85354-8.
4
Coverage bias and sensitivity of variant calling for four whole-genome sequencing technologies.四种全基因组测序技术的变异调用覆盖偏差和灵敏度。
PLoS One. 2013 Jun 11;8(6):e66621. doi: 10.1371/journal.pone.0066621. Print 2013.
5
SNVHMM: predicting single nucleotide variants from next generation sequencing.SNVHMM:从下一代测序中预测单核苷酸变异。
BMC Bioinformatics. 2013 Jul 15;14:225. doi: 10.1186/1471-2105-14-225.
6
Comparing a few SNP calling algorithms using low-coverage sequencing data.比较几种使用低覆盖度测序数据的 SNP calling 算法。
BMC Bioinformatics. 2013 Sep 17;14:274. doi: 10.1186/1471-2105-14-274.
7
Similarities and differences between variants called with human reference genome HG19 or HG38.与使用人类参考基因组 HG19 或 HG38 调用的变体之间的相似性和差异。
BMC Bioinformatics. 2019 Mar 14;20(Suppl 2):101. doi: 10.1186/s12859-019-2620-0.
8
Evaluation of whole-genome sequencing of four Chinese crested dogs for variant detection using the ion proton system.使用离子质子系统对四只中国冠毛犬进行全基因组测序以检测变异的评估。
Canine Genet Epidemiol. 2015 Oct 8;2:16. doi: 10.1186/s40575-015-0029-2. eCollection 2015.
9
Comparison and evaluation of two exome capture kits and sequencing platforms for variant calling.两种外显子捕获试剂盒和测序平台用于变异检测的比较与评估
BMC Genomics. 2015 Aug 5;16(1):581. doi: 10.1186/s12864-015-1796-6.
10
Quality control metrics improve repeatability and reproducibility of single-nucleotide variants derived from whole-genome sequencing.质量控制指标提高了源自全基因组测序的单核苷酸变异的可重复性和再现性。
Pharmacogenomics J. 2015 Aug;15(4):298-309. doi: 10.1038/tpj.2014.70. Epub 2014 Nov 11.

引用本文的文献

1
Characterization of H5N1 Influenza Virus Quasispecies with Adaptive Hemagglutinin Mutations from Single-Virus Infections of Human Airway Cells.从人类气道细胞的单病毒感染中具有适应性血凝素突变的 H5N1 流感病毒准种的特征。
J Virol. 2018 May 14;92(11). doi: 10.1128/JVI.02004-17. Print 2018 Jun 1.
2
THE BENEFITS OF CUSTOMIZED DNA DIRECTED NUTRITION TO BALANCE THE BRAIN REWARD CIRCUITRY AND REDUCE ADDICTIVE BEHAVIORS.定制化DNA导向营养对平衡大脑奖赏回路及减少成瘾行为的益处。
Precis Med (Bangalore). 2016;1(1):18-33. Epub 2016 Aug 1.
3
An emerging place for lung cancer genomics in 2013.

本文引用的文献

1
Integrated genomic analyses identify ARID1A and ARID1B alterations in the childhood cancer neuroblastoma.基因组综合分析鉴定儿童癌症神经母细胞瘤中的 ARID1A 和 ARID1B 改变。
Nat Genet. 2013 Jan;45(1):12-7. doi: 10.1038/ng.2493. Epub 2012 Dec 2.
2
Exome sequencing of hepatitis B virus-associated hepatocellular carcinoma.乙型肝炎病毒相关性肝细胞癌的外显子组测序。
Nat Genet. 2012 Oct;44(10):1117-21. doi: 10.1038/ng.2391. Epub 2012 Aug 26.
3
Shigella sonnei genome sequencing and phylogenetic analysis indicate recent global dissemination from Europe.
2013年肺癌基因组学的一个新兴领域。
J Thorac Dis. 2013 Oct;5 Suppl 5(Suppl 5):S491-7. doi: 10.3978/j.issn.2072-1439.2013.10.06.
宋内志贺菌全基因组测序和系统发育分析表明其近期在全球范围内由欧洲传播而来。
Nat Genet. 2012 Sep;44(9):1056-9. doi: 10.1038/ng.2369. Epub 2012 Aug 5.
4
Performance comparison of benchtop high-throughput sequencing platforms.桌面高通量测序平台的性能比较。
Nat Biotechnol. 2012 May;30(5):434-9. doi: 10.1038/nbt.2198.
5
De novo mutations revealed by whole-exome sequencing are strongly associated with autism.全外显子组测序揭示的新生突变与自闭症强烈相关。
Nature. 2012 Apr 4;485(7397):237-41. doi: 10.1038/nature10945.
6
New insights into the Tyrolean Iceman's origin and phenotype as inferred by whole-genome sequencing.全基因组测序揭示蒂罗尔冰人的起源和表型新见解。
Nat Commun. 2012 Feb 28;3:698. doi: 10.1038/ncomms1701.
7
Sequencing of neuroblastoma identifies chromothripsis and defects in neuritogenesis genes.神经母细胞瘤的测序鉴定了染色体重排和神经突生成基因的缺陷。
Nature. 2012 Feb 22;483(7391):589-93. doi: 10.1038/nature10910.
8
Detecting false-positive signals in exome sequencing.检测外显子组测序中的假阳性信号。
Hum Mutat. 2012 Apr;33(4):609-13. doi: 10.1002/humu.22033. Epub 2012 Mar 5.
9
Clonal evolution in relapsed acute myeloid leukaemia revealed by whole-genome sequencing.全基因组测序揭示复发急性髓系白血病的克隆进化。
Nature. 2012 Jan 11;481(7382):506-10. doi: 10.1038/nature10738.
10
Accurate and comprehensive sequencing of personal genomes.对个人基因组进行准确而全面的测序。
Genome Res. 2011 Sep;21(9):1498-505. doi: 10.1101/gr.123638.111. Epub 2011 Jul 19.