基于基因组测量共识建模评估跨平台和实验室间一致性。

Evaluation of cross-platform and interlaboratory concordance via consensus modelling of genomic measurements.

机构信息

Epigenetics Laboratory, Genomics and Epigenetics Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia.

South Western Sydney Clinical School, Faculty of Medicine, University of New South Wales, Liverpool, NSW, Australia.

出版信息

Bioinformatics. 2019 Feb 15;35(4):560-570. doi: 10.1093/bioinformatics/bty675.

DOI:10.1093/bioinformatics/bty675

PMID:30084929

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6378945/

Abstract

MOTIVATION

A synoptic view of the human genome benefits chiefly from the application of nucleic acid sequencing and microarray technologies. These platforms allow interrogation of patterns such as gene expression and DNA methylation at the vast majority of canonical loci, allowing granular insights and opportunities for validation of original findings. However, problems arise when validating against a "gold standard" measurement, since this immediately biases all subsequent measurements towards that particular technology or protocol. Since all genomic measurements are estimates, in the absence of a "gold standard" we instead empirically assess the measurement precision and sensitivity of a large suite of genomic technologies via a consensus modelling method called the row-linear model. This method is an application of the American Society for Testing and Materials Standard E691 for assessing interlaboratory precision and sources of variability across multiple testing sites. Both cross-platform and cross-locus comparisons can be made across all common loci, allowing identification of technology- and locus-specific tendencies.

RESULTS

We assess technologies including the Infinium MethylationEPIC BeadChip, whole genome bisulfite sequencing (WGBS), two different RNA-Seq protocols (PolyA+ and Ribo-Zero) and five different gene expression array platforms. Each technology thus is characterised herein, relative to the consensus. We showcase a number of applications of the row-linear model, including correlation with known interfering traits. We demonstrate a clear effect of cross-hybridisation on the sensitivity of Infinium methylation arrays. Additionally, we perform a true interlaboratory test on a set of samples interrogated on the same platform across twenty-one separate testing laboratories.

AVAILABILITY AND IMPLEMENTATION

A full implementation of the row-linear model, plus extra functions for visualisation, are found in the R package consensus at https://github.com/timpeters82/consensus.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

人类基因组的综合视图主要受益于核酸测序和微阵列技术的应用。这些平台允许在绝大多数规范基因座上检测基因表达和 DNA 甲基化等模式，从而提供细微的见解和验证原始发现的机会。然而，当与“金标准”测量值进行验证时，会出现问题，因为这会立即使所有后续测量值偏向于该特定技术或协议。由于所有基因组测量值都是估计值，因此在没有“金标准”的情况下，我们通过一种称为行线性模型的共识建模方法来经验性地评估大量基因组技术的测量精度和灵敏度。这种方法是美国测试材料协会标准 E691 的应用，用于评估跨多个测试站点的实验室间精度和变异性来源。可以在所有常见基因座上进行跨平台和跨基因座的比较，从而确定技术和基因座特异性的趋势。

结果

我们评估了包括 Infinium MethylationEPIC BeadChip、全基因组亚硫酸氢盐测序 (WGBS)、两种不同的 RNA-Seq 方案 (PolyA+和 Ribo-Zero) 和五种不同的基因表达阵列平台在内的技术。因此，每种技术都相对于共识进行了描述。我们展示了行线性模型的一些应用，包括与已知干扰特征的相关性。我们清楚地表明了 Infinium 甲基化阵列的交叉杂交对灵敏度的影响。此外，我们在二十一个独立的测试实验室对同一平台上检测的一组样本进行了真正的实验室间测试。

可用性和实施

行线性模型的完整实现以及用于可视化的额外功能可在 https://github.com/timpeters82/consensus 上的 R 包 consensus 中找到。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d85c/6378945/484680205995/bty675f1.jpg

相似文献

Evaluation of cross-platform and interlaboratory concordance via consensus modelling of genomic measurements.

Bioinformatics. 2019 Feb 15;35(4):560-570. doi: 10.1093/bioinformatics/bty675.

Genome-wide DNA methylation profiling in human breast tissue by Illumina TruSeq methyl capture EPIC sequencing and infinium methylationEPIC beadchip microarray.

Epigenetics. 2021 Jun-Jul;16(7):754-769. doi: 10.1080/15592294.2020.1827703. Epub 2020 Oct 13.

Critical evaluation of the Illumina MethylationEPIC BeadChip microarray for whole-genome DNA methylation profiling.

Genome Biol. 2016 Oct 7;17(1):208. doi: 10.1186/s13059-016-1066-1.

missMethyl: an R package for analyzing data from Illumina's HumanMethylation450 platform.

Bioinformatics. 2016 Jan 15;32(2):286-8. doi: 10.1093/bioinformatics/btv560. Epub 2015 Sep 30.

Usability of human Infinium MethylationEPIC BeadChip for mouse DNA methylation studies.

BMC Bioinformatics. 2017 Nov 15;18(1):486. doi: 10.1186/s12859-017-1870-y.

methyLiftover: cross-platform DNA methylation data integration.

Bioinformatics. 2016 Aug 15;32(16):2517-9. doi: 10.1093/bioinformatics/btw180. Epub 2016 Apr 8.

Computationally expanding infinium HumanMethylation450 BeadChip array data to reveal distinct DNA methylation patterns of rheumatoid arthritis.

Bioinformatics. 2016 Jun 15;32(12):1773-8. doi: 10.1093/bioinformatics/btw089. Epub 2016 Feb 15.

mLiftOver: harmonizing data across Infinium DNA methylation platforms.

Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae423.

BoostMe accurately predicts DNA methylation values in whole-genome bisulfite sequencing of multiple human tissues.

BMC Genomics. 2018 May 23;19(1):390. doi: 10.1186/s12864-018-4766-y.

CpGFilter: model-based CpG probe filtering with replicates for epigenome-wide association studies.

Bioinformatics. 2016 Feb 1;32(3):469-71. doi: 10.1093/bioinformatics/btv577. Epub 2015 Oct 7.

引用本文的文献

Exploring the influence of pre-analytical variables on gene expression measurements and relative expression orderings in cancer research.

Sci Rep. 2025 Feb 6;15(1):4489. doi: 10.1038/s41598-025-88756-0.

Comparing methylation levels assayed in GC-rich regions with current and emerging methods.

BMC Genomics. 2024 Jul 30;25(1):741. doi: 10.1186/s12864-024-10605-7.

Characterisation and reproducibility of the HumanMethylationEPIC v2.0 BeadChip for DNA methylation profiling.

BMC Genomics. 2024 Mar 6;25(1):251. doi: 10.1186/s12864-024-10027-5.

Transcriptomics for Clinical and Experimental Biology Research: Hang on a Seq.

Adv Genet (Hoboken). 2023 Jan 17;4(2):2200024. doi: 10.1002/ggn2.202200024. eCollection 2023 Jun.

Epigenetic Mechanisms and Nephrotic Syndrome: A Systematic Review.

Biomedicines. 2023 Feb 10;11(2):514. doi: 10.3390/biomedicines11020514.

Calling differentially methylated regions from whole genome bisulphite sequencing with DMRcate.

Nucleic Acids Res. 2021 Nov 8;49(19):e109. doi: 10.1093/nar/gkab637.

Integrated Analysis of Multiple Microarray Studies to Identify Novel Gene Signatures in Non-alcoholic Fatty Liver Disease.

Front Endocrinol (Lausanne). 2019 Aug 30;10:599. doi: 10.3389/fendo.2019.00599. eCollection 2019.

Sequential analysis of myocardial gene expression with phenotypic change: Use of cross-platform concordance to strengthen biologic relevance.

PLoS One. 2019 Aug 30;14(8):e0221519. doi: 10.1371/journal.pone.0221519. eCollection 2019.

Aberrant Expressions of Co-stimulatory and Co-inhibitory Molecules in Autoimmune Diseases.

Front Immunol. 2019 Feb 20;10:261. doi: 10.3389/fimmu.2019.00261. eCollection 2019.

本文引用的文献

Enduring epigenetic landmarks define the cancer microenvironment.

Genome Res. 2018 May;28(5):625-638. doi: 10.1101/gr.229070.117. Epub 2018 Apr 12.

RNA sequencing and transcriptome arrays analyses show opposing results for alternative splicing in patient derived samples.

BMC Genomics. 2017 Jun 6;18(1):443. doi: 10.1186/s12864-017-3819-y.

Reproducible RNA-seq analysis using recount2.

Nat Biotechnol. 2017 Apr 11;35(4):319-321. doi: 10.1038/nbt.3838.

Making sense of replications.

Elife. 2017 Jan 19;6:e23383. doi: 10.7554/eLife.23383.

International Interlaboratory Digital PCR Study Demonstrating High Reproducibility for the Measurement of a Rare Sequence Variant.

Anal Chem. 2017 Feb 7;89(3):1724-1733. doi: 10.1021/acs.analchem.6b03980. Epub 2017 Jan 18.

RNA-seq mixology: designing realistic control experiments to compare protocols and analysis methods.

Nucleic Acids Res. 2017 Mar 17;45(5):e30. doi: 10.1093/nar/gkw1063.

Critical evaluation of the Illumina MethylationEPIC BeadChip microarray for whole-genome DNA methylation profiling.

Genome Biol. 2016 Oct 7;17(1):208. doi: 10.1186/s13059-016-1066-1.

Risk-conscious correction of batch effects: maximising information extraction from high-throughput genomic datasets.

BMC Bioinformatics. 2016 Sep 1;17(1):332. doi: 10.1186/s12859-016-1212-5.

1,500 scientists lift the lid on reproducibility.

Nature. 2016 May 26;533(7604):452-4. doi: 10.1038/533452a.

Cross-platform normalization of microarray and RNA-seq data for machine learning applications.

PeerJ. 2016 Jan 21;4:e1621. doi: 10.7717/peerj.1621. eCollection 2016.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于基因组测量共识建模评估跨平台和实验室间一致性。

Evaluation of cross-platform and interlaboratory concordance via consensus modelling of genomic measurements.

机构信息

Epigenetics Laboratory, Genomics and Epigenetics Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia.

South Western Sydney Clinical School, Faculty of Medicine, University of New South Wales, Liverpool, NSW, Australia.