Detection of divergent genes in microbial aCGH experiments.

Authors

Snipen Lars, Repsilber Dirk, Nyquist Ludvig, Ziegler Andreas, Aakra Agot, Aastveit Are

Affiliation

Biostatistics, Department of Chemistry, Biotechnology and Food Sciences, Norwegian University of Life Sciences, N-1432 As, Norway.

Publication

BMC Bioinformatics. 2006 Mar 30;7:181. doi: 10.1186/1471-2105-7-181.

DOI:10.1186/1471-2105-7-181

PMID:16573812

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1563484/

Abstract

BACKGROUND

Array-based comparative genome hybridization (aCGH) is a tool for rapid comparison of genomes from different bacterial strains. The purpose of such an analysis is to detect genes that are highly divergent or absent in a sample strain compared to an index strain. Development of methods for analyzing aCGH data has primarily focused on copy number aberrations in cancer research. In microbial aCGH analyses, genes are typically ranked by log-ratio and classified as divergent or present by choosing a cutoff log-ratio, either manually or from statistics calculated on the log-ratio distribution. Because experimental settings vary considerably, it is not possible to develop a classical discriminant or statistical learning approach.
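The standard cutoff procedure described above can be sketched as follows. This is a minimal illustration, not code from the paper: the function names, the sign convention (log-ratio = log2(sample / index), so divergent genes have strongly negative values), the default cutoff of -1.0, and the median/MAD rule for deriving a cutoff from the distribution are all assumptions made for the example.

```python
import numpy as np

def classify_by_cutoff(log_ratios, cutoff=-1.0):
    """Flag a gene as divergent when its log-ratio lies below the cutoff.

    Assumes log-ratio = log2(sample / index), so a weak sample signal
    gives a negative value. The default cutoff is illustrative only.
    """
    log_ratios = np.asarray(log_ratios, dtype=float)
    return log_ratios < cutoff

def cutoff_from_distribution(log_ratios, n_sd=2.0):
    """Derive a cutoff from the log-ratio distribution itself.

    Hypothetical rule: median minus n_sd robust standard deviations,
    where the robust SD is the scaled median absolute deviation (MAD).
    """
    log_ratios = np.asarray(log_ratios, dtype=float)
    center = np.median(log_ratios)
    robust_sd = 1.4826 * np.median(np.abs(log_ratios - center))
    return center - n_sd * robust_sd

# Usage: rank genes by log-ratio, then apply a data-driven cutoff.
log_ratios = np.array([0.1, -0.2, 0.05, -2.8, 0.3, -3.1, -0.1])
ranking = np.argsort(log_ratios)  # most negative (most divergent) first
divergent = classify_by_cutoff(log_ratios, cutoff_from_distribution(log_ratios))
```

A hard cutoff of this kind yields a yes/no call per gene but no measure of how confident that call is, which is the gap a model-based score is meant to fill.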

METHODS

We introduce a more efficient method for analyzing microbial aCGH data, based on a finite mixture model and a data rotation scheme. The model is fitted to the log-ratios both before and after rotation, and the average of the resulting posterior probabilities gives a score for each gene. We demonstrate the advantages of this score for ranking and detecting divergent genes with higher specificity and sensitivity.
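The scoring idea can be sketched as below. The abstract does not specify the mixture components or the rotation, so this is only an illustration under stated assumptions: a two-component Gaussian mixture fitted by EM, the lower-mean component taken as "divergent", a rotation of the (average log-intensity, log-ratio) plane, and a hypothetical fixed angle `theta`. None of these names or settings come from the paper.

```python
import numpy as np

def _normal_pdf(x, mean, sd):
    return np.exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * np.sqrt(2.0 * np.pi))

def divergent_posterior(x, n_iter=200):
    """Fit a two-component Gaussian mixture by EM and return
    P(divergent | x) for each gene. Assumption: the component with
    the lower mean represents divergent genes."""
    x = np.asarray(x, dtype=float)
    # Crude initialisation: lower tail versus bulk of the distribution.
    mu = np.array([np.percentile(x, 5), np.percentile(x, 60)])
    sd = np.full(2, x.std() + 1e-6)
    w = np.array([0.2, 0.8])

    def responsibilities():
        dens = np.stack([w[k] * _normal_pdf(x, mu[k], sd[k]) for k in range(2)])
        return dens / np.maximum(dens.sum(axis=0), 1e-300)

    for _ in range(n_iter):
        resp = responsibilities()                      # E-step
        nk = resp.sum(axis=1) + 1e-12                  # M-step
        w = nk / nk.sum()
        mu = (resp * x).sum(axis=1) / nk
        sd = np.sqrt((resp * (x - mu[:, None]) ** 2).sum(axis=1) / nk) + 1e-6
    return responsibilities()[np.argmin(mu)]

def rotated_log_ratio(a, m, theta):
    """Rotate the points (a, m) by theta radians; return the new m coordinate."""
    return np.sin(theta) * a + np.cos(theta) * m

def divergence_score(log_index, log_sample, theta=np.pi / 12):
    """Average the posterior probabilities obtained before and after rotation.
    The angle theta is a hypothetical placeholder, not a value from the paper."""
    log_index = np.asarray(log_index, dtype=float)
    log_sample = np.asarray(log_sample, dtype=float)
    a = 0.5 * (log_sample + log_index)   # average log-intensity
    m = log_sample - log_index           # log-ratio
    p_before = divergent_posterior(m)
    p_after = divergent_posterior(rotated_log_ratio(a, m, theta))
    return 0.5 * (p_before + p_after)
```

Each gene receives a score between 0 and 1, so genes can be ranked by the score or classified by thresholding it (for example at 0.5) instead of thresholding the raw log-ratio.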

RESULTS

The procedure is tested and compared to other approaches on simulated data sets, as well as on four experimental validation data sets for aCGH analysis on fully sequenced strains of Staphylococcus aureus and Streptococcus pneumoniae.

CONCLUSION

When tested on simulated data as well as on four different experimental validation data sets from experiments with only fully sequenced strains, our procedure outperforms the standard procedure of using a simple log-ratio cutoff to classify genes as present or divergent.
