• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在微阵列数据集之间发现一致的疾病子网络。

Finding consistent disease subnetworks across microarray datasets.

机构信息

National University of Singapore, 13 Computing Drive, Singapore 117417.

出版信息

BMC Bioinformatics. 2011;12 Suppl 13(Suppl 13):S15. doi: 10.1186/1471-2105-12-S13-S15. Epub 2011 Nov 30.

DOI:10.1186/1471-2105-12-S13-S15
PMID:22372958
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3278831/
Abstract

BACKGROUND

While contemporary methods of microarray analysis are excellent tools for studying individual microarray datasets, they have a tendency to produce different results from different datasets of the same disease. We aim to solve this reproducibility problem by introducing a technique (SNet). SNet provides both quantitative and descriptive analysis of microarray datasets by identifying specific connected portions of pathways that are significant. We term such portions within pathways as "subnetworks".

RESULTS

We tested SNet on independent datasets of several diseases, including childhood ALL, DMD and lung cancer. For each of these diseases, we obtained two independent microarray datasets produced by distinct labs on distinct platforms. In each case, our technique consistently produced almost the same list of significant nontrivial subnetworks from two independent sets of microarray data. The gene-level agreement of these significant subnetworks was between 51.18% to 93.01%. In contrast, when the same pairs of microarray datasets were analysed using GSEA, t-test and SAM, this percentage fell between 2.38% to 28.90% for GSEA, 49.60% tp 73.01% for t-test, and 49.96% to 81.25% for SAM. Furthermore, the genes selected using these existing methods did not form subnetworks of substantial size. Thus it is more probable that the subnetworks selected by our technique can provide the researcher with more descriptive information on the portions of the pathway actually affected by the disease.

CONCLUSIONS

These results clearly demonstrate that our technique generates significant subnetworks and genes that are more consistent and reproducible across datasets compared to the other popular methods available (GSEA, t-test and SAM). The large size of subnetworks which we generate indicates that they are generally more biologically significant (less likely to be spurious). In addition, we have chosen two sample subnetworks and validated them with references from biological literature. This shows that our algorithm is capable of generating descriptive biologically conclusions.

摘要

背景

虽然当代的微阵列分析方法是研究单个微阵列数据集的出色工具,但它们往往会从同一疾病的不同数据集产生不同的结果。我们旨在通过引入一种技术(SNet)来解决这个可重复性问题。SNet 通过识别有意义的特定连接路径部分,提供微阵列数据集的定量和描述性分析。我们将这些路径中的部分称为“子网”。

结果

我们在包括儿童 ALL、DMD 和肺癌在内的几种疾病的独立数据集上测试了 SNet。对于每种疾病,我们都获得了两个由不同实验室在不同平台上产生的独立微阵列数据集。在每种情况下,我们的技术都从两组独立的微阵列数据中产生了几乎相同的显著非平凡子网列表。这些显著子网的基因水平一致性在 51.18%至 93.01%之间。相比之下,当使用 GSEA、t 检验和 SAM 分析相同的微阵列数据集对时,GSEA 的百分比在 2.38%至 28.90%之间,t 检验的百分比在 49.60%至 73.01%之间,SAM 的百分比在 49.96%至 81.25%之间。此外,使用这些现有方法选择的基因没有形成实质性大小的子网。因此,我们的技术选择的子网更有可能为研究人员提供有关疾病实际影响的通路部分的更具描述性的信息。

结论

这些结果清楚地表明,与其他流行方法(GSEA、t 检验和 SAM)相比,我们的技术在数据集之间产生更一致和可重复的显著子网和基因。我们生成的子网的大规模表明它们通常更具有生物学意义(不太可能是虚假的)。此外,我们选择了两个示例子网,并通过生物学文献中的参考文献进行了验证。这表明我们的算法能够生成描述性的生物学结论。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e4/3278831/535c7bb3cf44/1471-2105-12-S13-S15-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e4/3278831/6a7d67b3c36b/1471-2105-12-S13-S15-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e4/3278831/0d1e4c5ce176/1471-2105-12-S13-S15-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e4/3278831/b0416a78ba18/1471-2105-12-S13-S15-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e4/3278831/7a6bf1c39e2b/1471-2105-12-S13-S15-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e4/3278831/535c7bb3cf44/1471-2105-12-S13-S15-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e4/3278831/6a7d67b3c36b/1471-2105-12-S13-S15-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e4/3278831/0d1e4c5ce176/1471-2105-12-S13-S15-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e4/3278831/b0416a78ba18/1471-2105-12-S13-S15-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e4/3278831/7a6bf1c39e2b/1471-2105-12-S13-S15-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05e4/3278831/535c7bb3cf44/1471-2105-12-S13-S15-5.jpg

相似文献

1
Finding consistent disease subnetworks across microarray datasets.在微阵列数据集之间发现一致的疾病子网络。
BMC Bioinformatics. 2011;12 Suppl 13(Suppl 13):S15. doi: 10.1186/1471-2105-12-S13-S15. Epub 2011 Nov 30.
2
Finding consistent disease subnetworks using PFSNet.使用 PFSNet 发现一致的疾病子网络。
Bioinformatics. 2014 Jan 15;30(2):189-96. doi: 10.1093/bioinformatics/btt625. Epub 2013 Nov 28.
3
A quantum leap in the reproducibility, precision, and sensitivity of gene expression profile analysis even when sample size is extremely small.即使样本量极小,基因表达谱分析的可重复性、精确性和灵敏度也有了巨大飞跃。
J Bioinform Comput Biol. 2015 Aug;13(4):1550018. doi: 10.1142/S0219720015500183. Epub 2015 May 26.
4
Improving gene set analysis of microarray data by SAM-GS.通过SAM-GS改进微阵列数据的基因集分析
BMC Bioinformatics. 2007 Jul 5;8:242. doi: 10.1186/1471-2105-8-242.
5
Identifying consistent disease subnetworks using DNet.使用 DNet 识别一致的疾病子网络。
Methods. 2017 Dec 1;131:104-110. doi: 10.1016/j.ymeth.2017.07.024. Epub 2017 Aug 12.
6
Hybrid genetic algorithm-neural network: feature extraction for unpreprocessed microarray data.混合遗传算法-神经网络:未预处理微阵列数据的特征提取。
Artif Intell Med. 2011 Sep;53(1):47-56. doi: 10.1016/j.artmed.2011.06.008. Epub 2011 Jul 19.
7
GAGE: generally applicable gene set enrichment for pathway analysis.GAGE:用于通路分析的通用基因集富集分析
BMC Bioinformatics. 2009 May 27;10:161. doi: 10.1186/1471-2105-10-161.
8
Accurate molecular classification of cancer using simple rules.使用简单规则进行准确的癌症分子分类。
BMC Med Genomics. 2009 Oct 30;2:64. doi: 10.1186/1755-8794-2-64.
9
Comparative evaluation of gene-set analysis methods.基因集分析方法的比较评估
BMC Bioinformatics. 2007 Nov 7;8:431. doi: 10.1186/1471-2105-8-431.
10
Group testing for pathway analysis improves comparability of different microarray datasets.用于通路分析的分组检验可提高不同微阵列数据集的可比性。
Bioinformatics. 2006 Oct 15;22(20):2500-6. doi: 10.1093/bioinformatics/btl424. Epub 2006 Aug 7.

引用本文的文献

1
Resolving missing protein problems using functional class scoring.利用功能分类评分解决缺失蛋白问题。
Sci Rep. 2022 Jul 5;12(1):11358. doi: 10.1038/s41598-022-15314-3.
2
SPSNet: subpopulation-sensitive network-based analysis of heterogeneous gene expression data.
BMC Syst Biol. 2018 Mar 19;12(Suppl 2):28. doi: 10.1186/s12918-018-0538-1.
3
Protein complex-based analysis is resistant to the obfuscating consequences of batch effects --- a case study in clinical proteomics.基于蛋白质复合物的分析可抵抗批次效应带来的混淆影响——临床蛋白质组学的一个案例研究

本文引用的文献

1
Consistency, comprehensiveness, and compatibility of pathway databases.通路数据库的一致性、全面性和兼容性。
BMC Bioinformatics. 2010 Sep 7;11:449. doi: 10.1186/1471-2105-11-449.
2
Overexpression of Rac1 in leukemia patients and its role in leukemia cell migration and growth.Rac1在白血病患者中的过表达及其在白血病细胞迁移和生长中的作用。
Biochem Biophys Res Commun. 2009 Sep 4;386(4):769-74. doi: 10.1016/j.bbrc.2009.06.125. Epub 2009 Jun 27.
3
Evaluating reproducibility of differential expression discoveries in microarray studies by considering correlated molecular changes.
BMC Genomics. 2017 Mar 14;18(Suppl 2):142. doi: 10.1186/s12864-017-3490-3.
4
Fuzzy-FishNET: a highly reproducible protein complex-based approach for feature selection in comparative proteomics.模糊鱼网(Fuzzy-FishNET):一种用于比较蛋白质组学中特征选择的基于高度可重复蛋白质复合物的方法。
BMC Med Genomics. 2016 Dec 5;9(Suppl 3):67. doi: 10.1186/s12920-016-0228-z.
5
Quantitative proteomics signature profiling based on network contextualization.基于网络情境化的定量蛋白质组学特征分析
Biol Direct. 2015 Dec 15;10:71. doi: 10.1186/s13062-015-0098-x.
6
Bayesian inference with historical data-based informative priors improves detection of differentially expressed genes.基于历史数据的信息先验的贝叶斯推理可改善差异表达基因的检测。
Bioinformatics. 2016 Mar 1;32(5):682-9. doi: 10.1093/bioinformatics/btv631. Epub 2015 Oct 30.
7
Contemporary network proteomics and its requirements.当代网络蛋白质组学及其要求。
Biology (Basel). 2013 Dec 20;3(1):22-38. doi: 10.3390/biology3010022.
8
Embracing noise to improve cross-batch prediction accuracy.利用噪声提高跨批次预测准确性。
BMC Syst Biol. 2012;6 Suppl 2(Suppl 2):S3. doi: 10.1186/1752-0509-6-S2-S3. Epub 2012 Dec 12.
9
Current challenges in development of differentially expressed and prognostic prostate cancer biomarkers.差异表达和预后性前列腺癌生物标志物开发中的当前挑战。
Prostate Cancer. 2012;2012:640968. doi: 10.1155/2012/640968. Epub 2012 Aug 28.
10
Towards big data science in the decade ahead from ten years of InCoB and the 1st ISCB-Asia Joint Conference.展望未来十年的大数据科学:来自 InCoB 十年和第一届 ISCB-Asia 联合会议。
BMC Bioinformatics. 2011;12 Suppl 13(Suppl 13):S1. doi: 10.1186/1471-2105-12-S13-S1. Epub 2011 Nov 30.
通过考虑相关分子变化评估微阵列研究中差异表达发现的可重复性。
Bioinformatics. 2009 Jul 1;25(13):1662-8. doi: 10.1093/bioinformatics/btp295. Epub 2009 May 5.
4
Molecular networks in microarray analysis.微阵列分析中的分子网络
J Bioinform Comput Biol. 2007 Apr;5(2B):429-56. doi: 10.1142/s0219720007002795.
5
Flesh and blood: the story of Vav1, a gene that signals in hematopoietic cells but can be transforming in human malignancies.有血有肉:Vav1基因的故事,它在造血细胞中发挥信号作用,但在人类恶性肿瘤中可导致细胞转化
Cancer Lett. 2007 Oct 8;255(2):241-54. doi: 10.1016/j.canlet.2007.04.015. Epub 2007 Jun 21.
6
GenMAPP 2: new features and resources for pathway analysis.基因图谱2:通路分析的新特性与资源
BMC Bioinformatics. 2007 Jun 24;8:217. doi: 10.1186/1471-2105-8-217.
7
Network-based analysis of affected biological processes in type 2 diabetes models.基于网络的2型糖尿病模型中受影响生物过程的分析。
PLoS Genet. 2007 Jun;3(6):e96. doi: 10.1371/journal.pgen.0030096.
8
Relationships between probabilistic Boolean networks and dynamic Bayesian networks as models of gene regulatory networks.作为基因调控网络模型的概率布尔网络与动态贝叶斯网络之间的关系。
Signal Processing. 2006 Apr;86(4):814-834. doi: 10.1016/j.sigpro.2005.06.008.
9
Gene expression profiling in the early phases of DMD: a constant molecular signature characterizes DMD muscle from early postnatal life throughout disease progression.杜氏肌营养不良症早期阶段的基因表达谱分析:一种恒定的分子特征在疾病进展过程中从出生后早期就可表征杜氏肌营养不良症的肌肉。
FASEB J. 2007 Apr;21(4):1210-26. doi: 10.1096/fj.06-7285com. Epub 2007 Jan 30.
10
The outcomes of pathway database computations depend on pathway ontology.通路数据库计算的结果取决于通路本体论。
Nucleic Acids Res. 2006 Aug 7;34(13):3687-97. doi: 10.1093/nar/gkl438. Print 2006.