College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China.
Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin, 300308, China.
BMC Med Genomics. 2023 Apr 5;16(1):74. doi: 10.1186/s12920-023-01495-x.
With advances in massive parallel sequencing (MPS) technology, whole-genome sequencing (WGS) has gradually evolved into the first-tier diagnostic test for genetic disorders. However, deployment practice and pipeline testing for clinical WGS are lacking.
In this study, we introduced a whole WGS pipeline for genetic disorders, which included the entire process from obtaining a sample to clinical reporting. All samples that underwent WGS were constructed using polymerase chain reaction (PCR)-free library preparation protocols and sequenced on the MGISEQ-2000 platform. Bioinformatics pipelines were developed for the simultaneous detection of various types of variants, including single nucleotide variants (SNVs), insertions and deletions (indels), copy number variants (CNVs) and balanced rearrangements, mitochondrial (MT) variants, and other complex variants such as repeat expansion, pseudogenes and absence of heterozygosity (AOH). A semiautomatic pipeline was developed for the interpretation of potential SNVs and CNVs. Forty-five samples (including 14 positive commercially available samples, 23 laboratory-held positive cell lines and 8 clinical cases) with known variants were used to validate the whole pipeline.
In this study, a whole WGS pipeline for genetic disorders was developed and optimized. Forty-five samples with known variants (6 with SNVs and Indels, 3 with MT variants, 5 with aneuploidies, 1 with triploidy, 23 with CNVs, 5 with balanced rearrangements, 2 with repeat expansions, 1 with AOHs, and 1 with exon 7-8 deletion of SMN1 gene) validated the effectiveness of our pipeline.
This study has been piloted in test development, optimization, and validation of the WGS pipeline for genetic disorders. A set of best practices were recommended using our pipeline, along with a dataset of positive samples for benchmarking.
随着高通量平行测序(MPS)技术的进步,全基因组测序(WGS)已逐渐成为遗传疾病的一线诊断测试。然而,临床 WGS 的部署实践和管道测试仍有所欠缺。
本研究引入了一种用于遗传疾病的全 WGS 管道,其中包括从获取样本到临床报告的整个过程。所有进行 WGS 的样本均采用无聚合酶链反应(PCR)文库制备方案构建,并在 MGISEQ-2000 平台上进行测序。我们开发了生物信息学管道,用于同时检测各种类型的变体,包括单核苷酸变体(SNVs)、插入和缺失(indels)、拷贝数变异(CNVs)和平衡重排、线粒体(MT)变体以及其他复杂变体,如重复扩展、假基因和杂合性缺失(AOH)。开发了一个半自动管道,用于解释潜在的 SNVs 和 CNVs。使用 45 个具有已知变体的样本(包括 14 个阳性商业样本、23 个实验室持有的阳性细胞系和 8 个临床病例)来验证整个管道。
本研究开发并优化了用于遗传疾病的全 WGS 管道。使用 45 个具有已知变体的样本(包括 6 个 SNVs 和 Indels、3 个 MT 变体、5 个非整倍体、1 个三倍体、23 个 CNVs、5 个平衡重排、2 个重复扩展、1 个 AOHs 和 1 个 SMN1 基因外显子 7-8 缺失)验证了我们管道的有效性。
本研究已在遗传疾病 WGS 管道的测试开发、优化和验证中进行了试点。我们推荐了一套使用我们的管道的最佳实践,并提供了一个阳性样本数据集作为基准。