• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有现实特征的微阵列数据模拟。

Simulation of microarray data with realistic characteristics.

作者信息

Nykter Matti, Aho Tommi, Ahdesmäki Miika, Ruusuvuori Pekka, Lehmussola Antti, Yli-Harja Olli

机构信息

Institute of Signal Processing, Tampere University of Technology, Tampere, Finland.

出版信息

BMC Bioinformatics. 2006 Jul 18;7:349. doi: 10.1186/1471-2105-7-349.

DOI:10.1186/1471-2105-7-349
PMID:16848902
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1574357/
Abstract

BACKGROUND

Microarray technologies have become common tools in biological research. As a result, a need for effective computational methods for data analysis has emerged. Numerous different algorithms have been proposed for analyzing the data. However, an objective evaluation of the proposed algorithms is not possible due to the lack of biological ground truth information. To overcome this fundamental problem, the use of simulated microarray data for algorithm validation has been proposed.

RESULTS

We present a microarray simulation model which can be used to validate different kinds of data analysis algorithms. The proposed model is unique in the sense that it includes all the steps that affect the quality of real microarray data. These steps include the simulation of biological ground truth data, applying biological and measurement technology specific error models, and finally simulating the microarray slide manufacturing and hybridization. After all these steps are taken into account, the simulated data has realistic biological and statistical characteristics. The applicability of the proposed model is demonstrated by several examples.

CONCLUSION

The proposed microarray simulation model is modular and can be used in different kinds of applications. It includes several error models that have been proposed earlier and it can be used with different types of input data. The model can be used to simulate both spotted two-channel and oligonucleotide based single-channel microarrays. All this makes the model a valuable tool for example in validation of data analysis algorithms.

摘要

背景

微阵列技术已成为生物学研究中的常用工具。因此,出现了对用于数据分析的有效计算方法的需求。已经提出了许多不同的算法来分析数据。然而,由于缺乏生物学真值信息,无法对所提出的算法进行客观评估。为了克服这个基本问题,有人提出使用模拟微阵列数据进行算法验证。

结果

我们提出了一种微阵列模拟模型,可用于验证不同类型的数据分析算法。所提出的模型的独特之处在于它包含了影响真实微阵列数据质量的所有步骤。这些步骤包括模拟生物学真值数据、应用特定于生物学和测量技术的误差模型,以及最后模拟微阵列载玻片制造和杂交。在考虑了所有这些步骤之后,模拟数据具有现实的生物学和统计特征。通过几个例子证明了所提出模型的适用性。

结论

所提出的微阵列模拟模型是模块化的,可用于不同类型的应用。它包括几个先前提出的误差模型,并且可以与不同类型的输入数据一起使用。该模型可用于模拟点阵式双通道和基于寡核苷酸的单通道微阵列。所有这些使得该模型成为例如在验证数据分析算法方面的有价值工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d101/1574357/a4120addd6ae/1471-2105-7-349-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d101/1574357/ea9ce83144c3/1471-2105-7-349-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d101/1574357/9c53d70c300c/1471-2105-7-349-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d101/1574357/509bb664ab7b/1471-2105-7-349-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d101/1574357/84896e6f9280/1471-2105-7-349-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d101/1574357/a4120addd6ae/1471-2105-7-349-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d101/1574357/ea9ce83144c3/1471-2105-7-349-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d101/1574357/9c53d70c300c/1471-2105-7-349-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d101/1574357/509bb664ab7b/1471-2105-7-349-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d101/1574357/84896e6f9280/1471-2105-7-349-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d101/1574357/a4120addd6ae/1471-2105-7-349-6.jpg

相似文献

1
Simulation of microarray data with realistic characteristics.具有现实特征的微阵列数据模拟。
BMC Bioinformatics. 2006 Jul 18;7:349. doi: 10.1186/1471-2105-7-349.
2
A novel approach to the clustering of microarray data via nonparametric density estimation.一种基于非参数密度估计的微阵列数据聚类新方法。
BMC Bioinformatics. 2011 Feb 8;12:49. doi: 10.1186/1471-2105-12-49.
3
Evaluating concentration estimation errors in ELISA microarray experiments.评估酶联免疫吸附测定微阵列实验中的浓度估计误差。
BMC Bioinformatics. 2005 Jan 26;6:17. doi: 10.1186/1471-2105-6-17.
4
Robust multi-scale clustering of large DNA microarray datasets with the consensus algorithm.使用一致性算法对大型DNA微阵列数据集进行稳健的多尺度聚类
Bioinformatics. 2006 Jan 1;22(1):58-67. doi: 10.1093/bioinformatics/bti746. Epub 2005 Oct 27.
5
In silico microdissection of microarray data from heterogeneous cell populations.对来自异质细胞群体的微阵列数据进行计算机模拟显微切割。
BMC Bioinformatics. 2005 Mar 14;6:54. doi: 10.1186/1471-2105-6-54.
6
Micro-Analyzer: automatic preprocessing of Affymetrix microarray data.微分析器:Affymetrix 微阵列数据的自动预处理。
Comput Methods Programs Biomed. 2013 Aug;111(2):402-9. doi: 10.1016/j.cmpb.2013.04.006. Epub 2013 May 31.
7
A novel Mixture Model Method for identification of differentially expressed genes from DNA microarray data.一种从DNA微阵列数据中识别差异表达基因的新型混合模型方法。
BMC Bioinformatics. 2004 Dec 16;5:201. doi: 10.1186/1471-2105-5-201.
8
Microarray data analysis: from disarray to consolidation and consensus.微阵列数据分析:从混乱到整合与共识。
Nat Rev Genet. 2006 Jan;7(1):55-65. doi: 10.1038/nrg1749.
9
A mathematical model for the validation of gene selection methods.用于验证基因选择方法的数学模型。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Sep-Oct;8(5):1385-92. doi: 10.1109/TCBB.2010.83.
10
Simulation of DNA array hybridization experiments and evaluation of critical parameters during subsequent image and data analysis.DNA 阵列杂交实验的模拟以及后续图像和数据分析过程中关键参数的评估。
BMC Bioinformatics. 2002 Oct 22;3:29. doi: 10.1186/1471-2105-3-29.

引用本文的文献

1
A statistical simulation model to guide the choices of analytical methods in arrayed CRISPR screen experiments.用于指导阵列 CRISPR 筛选实验中分析方法选择的统计模拟模型。
PLoS One. 2024 Aug 20;19(8):e0307445. doi: 10.1371/journal.pone.0307445. eCollection 2024.
2
A statistical model for describing and simulating microbial community profiles.用于描述和模拟微生物群落分布的统计模型。
PLoS Comput Biol. 2021 Sep 13;17(9):e1008913. doi: 10.1371/journal.pcbi.1008913. eCollection 2021 Sep.
3
A Bootstrap Based Measure Robust to the Choice of Normalization Methods for Detecting Rhythmic Features in High Dimensional Data.

本文引用的文献

1
Unsupervised analysis uncovers changes in histopathologic diagnosis in supervised genomic studies.无监督分析揭示了有监督基因组研究中组织病理学诊断的变化。
Technol Cancer Res Treat. 2006 Apr;5(2):177-82. doi: 10.1177/153303460600500209.
2
BGX: a fully Bayesian integrated approach to the analysis of Affymetrix GeneChip data.BGX:一种用于分析Affymetrix基因芯片数据的全贝叶斯集成方法。
Biostatistics. 2005 Jul;6(3):349-73. doi: 10.1093/biostatistics/kxi016. Epub 2005 Apr 14.
3
In silico microdissection of microarray data from heterogeneous cell populations.
一种基于自助法的度量,对用于检测高维数据中节律特征的归一化方法的选择具有鲁棒性。
Front Genet. 2018 Feb 2;9:24. doi: 10.3389/fgene.2018.00024. eCollection 2018.
4
Robust gene selection methods using weighting schemes for microarray data analysis.用于微阵列数据分析的采用加权方案的稳健基因选择方法。
BMC Bioinformatics. 2017 Sep 2;18(1):389. doi: 10.1186/s12859-017-1810-x.
5
A Flexible Microarray Data Simulation Model.一种灵活的微阵列数据模拟模型。
Microarrays (Basel). 2013 Apr 17;2(2):115-30. doi: 10.3390/microarrays2020115.
6
A Synthetic Kinome Microarray Data Generator.一种合成激酶组微阵列数据生成器。
Microarrays (Basel). 2015 Oct 16;4(4):432-53. doi: 10.3390/microarrays4040432.
7
Evaluating a common semi-mechanistic mathematical model of gene-regulatory networks.评估基因调控网络的一种常见半机制数学模型。
BMC Syst Biol. 2015;9 Suppl 5(Suppl 5):S2. doi: 10.1186/1752-0509-9-S5-S2. Epub 2015 Sep 1.
8
SITDEM: a simulation tool for disease/endpoint models of association studies based on single nucleotide polymorphism genotypes.SITDEM:基于单核苷酸多态性基因型的关联研究疾病/终点模型的仿真工具。
Comput Biol Med. 2014 Feb;45:136-42. doi: 10.1016/j.compbiomed.2013.11.021. Epub 2013 Dec 19.
9
Symbolic data analysis to defy low signal-to-noise ratio in microarray data for breast cancer prognosis.用于克服微阵列数据中低信噪比以进行乳腺癌预后分析的符号数据分析。
J Comput Biol. 2013 Aug;20(8):610-20. doi: 10.1089/cmb.2012.0249.
10
Sources of variation in false discovery rate estimation include sample size, correlation, and inherent differences between groups.假发现率估计的变异来源包括样本量、相关性和组间固有差异。
BMC Bioinformatics. 2012;13 Suppl 13(Suppl 13):S1. doi: 10.1186/1471-2105-13-S13-S1. Epub 2012 Aug 24.
对来自异质细胞群体的微阵列数据进行计算机模拟显微切割。
BMC Bioinformatics. 2005 Mar 14;6:54. doi: 10.1186/1471-2105-6-54.
4
Simulation tools for biochemical networks: evaluation of performance and usability.生化网络的模拟工具:性能与可用性评估
Bioinformatics. 2005 Feb 1;21(3):357-63. doi: 10.1093/bioinformatics/bti018. Epub 2004 Sep 9.
5
Noise factor analysis for cDNA microarrays.cDNA微阵列的噪声因子分析
J Biomed Opt. 2004 Jul-Aug;9(4):663-78. doi: 10.1117/1.1755232.
6
Integrative analysis of cell cycle control in budding yeast.芽殖酵母细胞周期调控的综合分析
Mol Biol Cell. 2004 Aug;15(8):3841-62. doi: 10.1091/mbc.e03-11-0794. Epub 2004 May 28.
7
Noise minimization in eukaryotic gene expression.真核基因表达中的噪声最小化
PLoS Biol. 2004 Jun;2(6):e137. doi: 10.1371/journal.pbio.0020137. Epub 2004 Apr 27.
8
Spot shape modelling and data transformations for microarrays.微阵列的斑点形状建模与数据转换
Bioinformatics. 2004 Sep 22;20(14):2270-8. doi: 10.1093/bioinformatics/bth237. Epub 2004 Apr 1.
9
Bayesian hierarchical error model for analysis of gene expression data.用于基因表达数据分析的贝叶斯分层误差模型。
Bioinformatics. 2004 Sep 1;20(13):2016-25. doi: 10.1093/bioinformatics/bth192. Epub 2004 Mar 25.
10
Interpretation of skin biopsies by general pathologists: diagnostic discrepancy rate measured by blinded review.普通病理学家对皮肤活检的解读:通过盲法评估测量诊断差异率。
Arch Pathol Lab Med. 2003 Nov;127(11):1489-92. doi: 10.5858/2003-127-1489-IOSBBG.