• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

建立社区参考样本、数据和调用集,用于使用全基因组测序进行癌症突变检测的基准测试。

Establishing community reference samples, data and call sets for benchmarking cancer mutation detection using whole-genome sequencing.

机构信息

Bioinformatics Research & Early Development, Roche Sequencing Solutions Inc., Belmont, CA, USA.

Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA.

出版信息

Nat Biotechnol. 2021 Sep;39(9):1151-1160. doi: 10.1038/s41587-021-00993-6. Epub 2021 Sep 9.

DOI:10.1038/s41587-021-00993-6
PMID:34504347
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8532138/
Abstract

The lack of samples for generating standardized DNA datasets for setting up a sequencing pipeline or benchmarking the performance of different algorithms limits the implementation and uptake of cancer genomics. Here, we describe reference call sets obtained from paired tumor-normal genomic DNA (gDNA) samples derived from a breast cancer cell line-which is highly heterogeneous, with an aneuploid genome, and enriched in somatic alterations-and a matched lymphoblastoid cell line. We partially validated both somatic mutations and germline variants in these call sets via whole-exome sequencing (WES) with different sequencing platforms and targeted sequencing with >2,000-fold coverage, spanning 82% of genomic regions with high confidence. Although the gDNA reference samples are not representative of primary cancer cells from a clinical sample, when setting up a sequencing pipeline, they not only minimize potential biases from technologies, assays and informatics but also provide a unique resource for benchmarking 'tumor-only' or 'matched tumor-normal' analyses.

摘要

由于缺乏用于生成标准化 DNA 数据集的样本,从而难以建立测序流程或对不同算法的性能进行基准测试,这限制了癌症基因组学的实施和应用。在这里,我们描述了从乳腺癌细胞系(具有非整倍体基因组且体细胞突变高度富集)及其匹配的淋巴母细胞系衍生的配对肿瘤-正常基因组 DNA(gDNA)样本中获得的参考调用集。我们通过不同测序平台的全外显子组测序(WES)和 >2000 倍覆盖度的靶向测序对这些调用集中的体细胞突变和种系变异进行了部分验证,涵盖了具有高置信度的 82%基因组区域。尽管 gDNA 参考样本不能代表来自临床样本的原发性癌细胞,但在建立测序流程时,它们不仅可以最大限度地减少技术、检测和信息学带来的潜在偏差,还为“仅肿瘤”或“匹配的肿瘤-正常”分析提供了独特的基准资源。

相似文献

1
Establishing community reference samples, data and call sets for benchmarking cancer mutation detection using whole-genome sequencing.建立社区参考样本、数据和调用集,用于使用全基因组测序进行癌症突变检测的基准测试。
Nat Biotechnol. 2021 Sep;39(9):1151-1160. doi: 10.1038/s41587-021-00993-6. Epub 2021 Sep 9.
2
Toward best practice in cancer mutation detection with whole-genome and whole-exome sequencing.实现全基因组和全外显子组测序中癌症基因突变检测的最佳实践。
Nat Biotechnol. 2021 Sep;39(9):1141-1150. doi: 10.1038/s41587-021-00994-5. Epub 2021 Sep 9.
3
Clinical Validation of a Next-Generation Sequencing Genomic Oncology Panel via Cross-Platform Benchmarking against Established Amplicon Sequencing Assays.通过与既定的扩增子测序检测进行跨平台基准测试对新一代测序基因组肿瘤学检测板进行临床验证。
J Mol Diagn. 2017 Jan;19(1):43-56. doi: 10.1016/j.jmoldx.2016.07.012. Epub 2016 Nov 9.
4
Whole genome and exome sequencing reference datasets from a multi-center and cross-platform benchmark study.多中心、跨平台基准研究的全基因组和外显子组测序参考数据集。
Sci Data. 2021 Nov 9;8(1):296. doi: 10.1038/s41597-021-01077-5.
5
Validated WGS and WES protocols proved saliva-derived gDNA as an equivalent to blood-derived gDNA for clinical and population genomic analyses.经验证的 WGS 和 WES 方案证明,唾液来源的 gDNA 可与血液来源的 gDNA 相媲美,可用于临床和人群基因组分析。
BMC Genomics. 2024 Feb 17;25(1):187. doi: 10.1186/s12864-024-10080-0.
6
Whole genome deep sequencing analysis of cell-free DNA in samples with low tumour content.对低肿瘤含量样本中的游离 DNA 进行全基因组深度测序分析。
BMC Cancer. 2022 Jan 20;22(1):85. doi: 10.1186/s12885-021-09160-1.
7
Deep whole-genome sequencing of 3 cancer cell lines on 2 sequencing platforms.对 2 个测序平台上的 3 种癌细胞系进行深度全基因组测序。
Sci Rep. 2019 Dec 13;9(1):19123. doi: 10.1038/s41598-019-55636-3.
8
A computational approach to distinguish somatic vs. germline origin of genomic alterations from deep sequencing of cancer specimens without a matched normal.一种计算方法,用于从无匹配正常样本的癌症标本深度测序中区分基因组改变的体细胞起源与种系起源。
PLoS Comput Biol. 2018 Feb 7;14(2):e1005965. doi: 10.1371/journal.pcbi.1005965. eCollection 2018 Feb.
9
Systematic comparison of germline variant calling pipelines cross multiple next-generation sequencers.跨多种下一代测序仪的种系变异调用管道的系统比较。
Sci Rep. 2019 Jun 27;9(1):9345. doi: 10.1038/s41598-019-45835-3.
10
Benchmarking workflows to assess performance and suitability of germline variant calling pipelines in clinical diagnostic assays.评估用于临床诊断检测的种系变异calling 管道性能和适用性的基准测试工作流程。
BMC Bioinformatics. 2021 Feb 24;22(1):85. doi: 10.1186/s12859-020-03934-3.

引用本文的文献

1
Multi-omics Quality Assessment in Personalized Medicine Through European Infrastructure for Translational Medicine (EATRIS): An Overview.通过欧洲转化医学基础设施(EATRIS)进行的个性化医疗中的多组学质量评估:概述
Phenomics. 2025 Apr 1;5(3):311-325. doi: 10.1007/s43657-024-00170-0. eCollection 2025 Jun.
2
CaMutQC: An R package for integrative quality control and filtration of cancer somatic mutations.CaMutQC:一个用于癌症体细胞突变综合质量控制和过滤的R软件包。
Comput Struct Biotechnol J. 2025 Jul 16;27:3147-3154. doi: 10.1016/j.csbj.2025.07.011. eCollection 2025.
3
Development and extensive sequencing of a broadly-consented Genome in a Bottle matched tumor-normal pair.

本文引用的文献

1
Personalized genome assembly for accurate cancer somatic mutation discovery using tumor-normal paired reference samples.使用肿瘤-正常配对参考样本进行个性化基因组组装,以准确发现癌症体细胞突变。
Genome Biol. 2022 Nov 9;23(1):237. doi: 10.1186/s13059-022-02803-x.
2
Whole genome and exome sequencing reference datasets from a multi-center and cross-platform benchmark study.多中心、跨平台基准研究的全基因组和外显子组测序参考数据集。
Sci Data. 2021 Nov 9;8(1):296. doi: 10.1038/s41597-021-01077-5.
3
Toward best practice in cancer mutation detection with whole-genome and whole-exome sequencing.
对广泛认可的“瓶中基因组”匹配的肿瘤-正常样本对进行开发及深度测序。
Sci Data. 2025 Jul 16;12(1):1195. doi: 10.1038/s41597-025-05438-2.
4
Multiplexed, universal probe-based rare variant detection with USE-PCR.使用通用单引物延伸PCR(USE-PCR)进行多重、基于通用探针的罕见变异检测。
Sci Rep. 2025 Jul 4;15(1):23947. doi: 10.1038/s41598-025-08814-5.
5
Computational methods and data resources for predicting tumor neoantigens.预测肿瘤新抗原的计算方法和数据资源
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf302.
6
A benchmarking study of copy number variation inference methods using single-cell RNA-sequencing data.一项使用单细胞RNA测序数据进行拷贝数变异推断方法的基准研究。
Precis Clin Med. 2025 Jun 4;8(2):pbaf011. doi: 10.1093/pcmedi/pbaf011. eCollection 2025 Jun.
7
SAVANA: reliable analysis of somatic structural variants and copy number aberrations using long-read sequencing.SAVANA:利用长读长测序技术对体细胞结构变异和拷贝数畸变进行可靠分析。
Nat Methods. 2025 May 28. doi: 10.1038/s41592-025-02708-0.
8
Tumor ploidy determination in low-pass whole genome sequencing and allelic copy number visualization using the Constellation Plot.利用星座图在低深度全基因组测序中确定肿瘤倍性并可视化等位基因拷贝数
Genome Biol. 2025 May 20;26(1):132. doi: 10.1186/s13059-025-03599-2.
9
ATRX, OLIG2, MGMT, and IDH2 in Glioblastoma: Essential Molecular Mechanisms and Therapeutic Significance.胶质母细胞瘤中的ATRX、OLIG2、MGMT和IDH2:重要分子机制及治疗意义
Medicina (Kaunas). 2025 Apr 10;61(4):697. doi: 10.3390/medicina61040697.
10
Severus detects somatic structural variation and complex rearrangements in cancer genomes using long-read sequencing.西弗勒斯使用长读长测序技术检测癌症基因组中的体细胞结构变异和复杂重排。
Nat Biotechnol. 2025 Apr 4. doi: 10.1038/s41587-025-02618-8.
实现全基因组和全外显子组测序中癌症基因突变检测的最佳实践。
Nat Biotechnol. 2021 Sep;39(9):1141-1150. doi: 10.1038/s41587-021-00994-5. Epub 2021 Sep 9.
4
A multi-center cross-platform single-cell RNA sequencing reference dataset.一个多中心跨平台单细胞 RNA 测序参考数据集。
Sci Data. 2021 Feb 2;8(1):39. doi: 10.1038/s41597-021-00809-x.
5
A multicenter study benchmarking single-cell RNA sequencing technologies using reference samples.一项使用参考试样对单细胞 RNA 测序技术进行基准测试的多中心研究。
Nat Biotechnol. 2021 Sep;39(9):1103-1114. doi: 10.1038/s41587-020-00748-9. Epub 2020 Dec 21.
6
A hidden Markov modeling approach for identifying tumor subclones in next-generation sequencing studies.一种用于鉴定下一代测序研究中肿瘤亚克隆的隐马尔可夫模型方法。
Biostatistics. 2022 Jan 13;23(1):69-82. doi: 10.1093/biostatistics/kxaa013.
7
Systematic comparison of somatic variant calling performance among different sequencing depth and mutation frequency.不同测序深度和突变频率下体细胞变异调用性能的系统比较
Sci Rep. 2020 Feb 26;10(1):3501. doi: 10.1038/s41598-020-60559-5.
8
Preparation of the standard cell lines for reference mutations in cancer gene-panels by genome editing in HEK 293 T/17 cells.通过在HEK 293 T/17细胞中进行基因组编辑制备癌症基因面板中参考突变的标准细胞系。
Genes Environ. 2020 Feb 11;42:8. doi: 10.1186/s41021-020-0147-2. eCollection 2020.
9
SuperFreq: Integrated mutation detection and clonal tracking in cancer.SuperFreq:癌症中的综合突变检测和克隆追踪。
PLoS Comput Biol. 2020 Feb 13;16(2):e1007603. doi: 10.1371/journal.pcbi.1007603. eCollection 2020 Feb.
10
The evolutionary history of 2,658 cancers.2658 种癌症的进化史。
Nature. 2020 Feb;578(7793):122-128. doi: 10.1038/s41586-019-1907-7. Epub 2020 Feb 6.