Suppr超能文献

一种基于细菌微阵列的比较基因组杂交分析新方法:实证研究的见解

A new approach for the analysis of bacterial microarray-based Comparative Genomic Hybridization: insights from an empirical study.

作者信息

Taboada Eduardo N, Acedillo Rey R, Luebbert Christian C, Findlay Wendy A, Nash John H E

机构信息

Pathogen Genomics Group, Institute for Biological Sciences, National Research Council, 100 Sussex Drive, Ottawa, Ontario, K1A 0R6, Canada.

出版信息

BMC Genomics. 2005 May 27;6:78. doi: 10.1186/1471-2164-6-78.

Abstract

BACKGROUND

Microarray-based Comparative Genomic Hybridization (M-CGH) has been used to characterize the extensive intraspecies genetic diversity found in bacteria at the whole-genome level. Although conventional microarray analytical procedures have proved adequate in handling M-CGH data, data interpretation using these methods is based on a continuous character model in which gene divergence and gene absence form a spectrum of decreasing gene conservation levels. However, whereas gene divergence may yet be accompanied by retention in gene function, gene absence invariably leads to loss of function. This distinction, if ignored, leads to a loss in the information to be gained from M-CGH data. We present here results from experiments in which two genome-sequenced strains of C. jejuni were compared against each other using M-CGH. Because the gene content of both strains was known a priori, we were able to closely examine the effects of sequence divergence and gene absence on M-CGH data in order to define analytical parameters for M-CGH data interpretation. This would facilitate the examination of the relative effects of sequence divergence or gene absence in comparative genomics analyses of multiple strains of any species for which genome sequence data and a DNA microarray are available.

RESULTS

As a first step towards improving the analysis of M-CGH data, we estimated the degree of experimental error in a series of experiments in which identical samples were compared against each other by M-CGH. This variance estimate was used to validate a Log Ratio-based methodology for identification of outliers in M-CGH data. We compared two genome strains by M-CGH to examine the effect of probe/target identity on the Log Ratios of signal intensities using prior knowledge of gene divergence and gene absence to establish Log Ratio thresholds for the identification of absent and conserved genes.

CONCLUSION

The results from this empirical study validate the Log Ratio thresholds that have been used in other studies to establish gene divergence/absence. Moreover, the analytical framework presented here enhances the information content derived from M-CGH data by shifting the focus from divergent/absent gene detection to accurate detection of conserved and absent genes. This approach closely aligns the technical limitations of M-CGH analysis with practical limitations on the biological interpretation of comparative genomics data.

摘要

背景

基于微阵列的比较基因组杂交技术(M-CGH)已被用于在全基因组水平上表征细菌中广泛存在的种内遗传多样性。尽管传统的微阵列分析程序已被证明足以处理M-CGH数据,但使用这些方法进行数据解释是基于一个连续特征模型,其中基因差异和基因缺失形成了一系列基因保守水平降低的情况。然而,虽然基因差异可能仍伴随着基因功能的保留,但基因缺失总是导致功能丧失。如果忽略这种区别,就会导致从M-CGH数据中获得的信息丢失。我们在此展示了使用M-CGH将空肠弯曲菌的两个基因组测序菌株相互比较的实验结果。由于两个菌株的基因内容是先验已知的,我们能够密切研究序列差异和基因缺失对M-CGH数据的影响,以便为M-CGH数据解释定义分析参数。这将有助于在对任何有基因组序列数据和DNA微阵列的物种的多个菌株进行比较基因组分析时,研究序列差异或基因缺失的相对影响。

结果

作为改进M-CGH数据分析的第一步,我们在一系列实验中估计了实验误差程度,在这些实验中,通过M-CGH将相同样本相互比较。这个方差估计用于验证一种基于对数比值的方法,用于识别M-CGH数据中的异常值。我们通过M-CGH比较了两个基因组菌株,利用基因差异和基因缺失的先验知识来建立信号强度对数比值的阈值,以识别缺失和保守基因,从而研究探针/靶标同一性对信号强度对数比值的影响。

结论

这项实证研究的结果验证了其他研究中用于确定基因差异/缺失的对数比值阈值。此外,这里提出的分析框架通过将重点从检测差异/缺失基因转移到准确检测保守和缺失基因,提高了从M-CGH数据中获得的信息含量。这种方法使M-CGH分析的技术局限性与比较基因组学数据生物学解释的实际局限性紧密结合。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3627/1168892/807229bdbaf3/1471-2164-6-78-2.jpg

相似文献

2
Large-scale comparative genomics meta-analysis of Campylobacter jejuni isolates reveals low level of genome plasticity.
J Clin Microbiol. 2004 Oct;42(10):4566-76. doi: 10.1128/JCM.42.10.4566-4576.2004.
6
Accurate detection of aneuploidies in array CGH and gene expression microarray data.
Bioinformatics. 2004 Dec 12;20(18):3533-43. doi: 10.1093/bioinformatics/bth440. Epub 2004 Jul 29.
8
Survey of genomic diversity among Enterococcus faecalis strains by microarray-based comparative genomic hybridization.
Appl Environ Microbiol. 2007 Apr;73(7):2207-17. doi: 10.1128/AEM.01599-06. Epub 2007 Jan 12.
9
A stepwise framework for the normalization of array CGH data.
BMC Bioinformatics. 2005 Nov 18;6:274. doi: 10.1186/1471-2105-6-274.

引用本文的文献

2
4
Broad conservation of milk utilization genes in Bifidobacterium longum subsp. infantis as revealed by comparative genomic hybridization.
Appl Environ Microbiol. 2010 Nov;76(22):7373-81. doi: 10.1128/AEM.00675-10. Epub 2010 Aug 27.
7
Microarray comparative genomic hybridisation analysis incorporating genomic organisation, and application to enterobacterial plant pathogens.
PLoS Comput Biol. 2009 Aug;5(8):e1000473. doi: 10.1371/journal.pcbi.1000473. Epub 2009 Aug 21.
9
Genomics-based molecular epidemiology of Campylobacter jejuni isolates from feedlot cattle and from people in Alberta, Canada.
J Clin Microbiol. 2009 Feb;47(2):410-20. doi: 10.1128/JCM.01432-08. Epub 2008 Nov 26.

本文引用的文献

1
Major structural differences and novel potential virulence mechanisms from the genomes of multiple campylobacter species.
PLoS Biol. 2005 Jan;3(1):e15. doi: 10.1371/journal.pbio.0030015. Epub 2005 Jan 4.
2
Large-scale comparative genomics meta-analysis of Campylobacter jejuni isolates reveals low level of genome plasticity.
J Clin Microbiol. 2004 Oct;42(10):4566-76. doi: 10.1128/JCM.42.10.4566-4576.2004.
3
Identification of Campylobacter jejuni ATCC 43431-specific genes by whole microbial genome comparisons.
J Bacteriol. 2004 Jul;186(14):4781-95. doi: 10.1128/JB.186.14.4781-4795.2004.
5
Genomic approaches to antibacterial discovery.
Methods Mol Biol. 2004;266:231-59. doi: 10.1385/1-59259-763-7:231.
6
Microarrays for bacterial typing: realistic hope or holy grail?
Methods Mol Biol. 2004;266:213-27. doi: 10.1385/1-59259-763-7:213.
7
Comparative genomics: digging for data.
Methods Mol Biol. 2004;266:47-69. doi: 10.1385/1-59259-763-7:047.
8
Genome sequencing and annotation: an overview.
Methods Mol Biol. 2004;266:29-45. doi: 10.1385/1-59259-763-7:029.
9
Evaluation of 50-mer oligonucleotide arrays for detecting microbial populations in environmental samples.
Biotechniques. 2004 Apr;36(4):664-70, 672, 674-5. doi: 10.2144/04364RR02.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验