Meta-analysis of cell-specific transcriptomic data using fuzzy c-means clustering discovers versatile viral responsive genes.

Authors

Khan Atif, Katanic Dejan, Thakar Juilee

Affiliations

Department of Microbiology and Immunology, University of Rochester, Rochester, NY, 14642, USA.

Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY, 14642, USA.

Publication

BMC Bioinformatics. 2017 Jun 6;18(1):295. doi: 10.1186/s12859-017-1669-x.

DOI:10.1186/s12859-017-1669-x
PMID:28587632
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5461682/
Abstract

BACKGROUND

Despite advances in gene-set enrichment analysis methods, inadequate definitions of gene-sets remain a major limitation in the discovery of novel biological processes from transcriptomic datasets. Typically, gene-sets are obtained from publicly available pathway databases, which contain generalized definitions frequently derived by manual curation. Recently, unsupervised clustering algorithms have been proposed to identify gene-sets from transcriptomic datasets deposited in the public domain. These data-driven gene-set definitions can be context-specific, revealing novel biological mechanisms. However, the previously proposed algorithms for identifying data-driven gene-sets are based on hard clustering, which does not allow overlap across clusters, a characteristic that is predominantly observed across biological pathways.

RESULTS

We developed a pipeline using the fuzzy C-means (FCM) soft clustering approach to identify gene-sets that recapitulate topological characteristics of biological pathways. Specifically, we apply our pipeline to derive gene-sets from transcriptomic data measuring the response of monocyte-derived dendritic cells and A549 epithelial cells to influenza infections. Our approach applies Ward's method for the selection of initial conditions, optimizes the parameters of the FCM algorithm for human cell-specific transcriptomic data, and identifies robust gene-sets along with versatile viral responsive genes.
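The combination of Ward's method and FCM described above can be illustrated with a minimal, standard-library Python sketch. It is not the FIGS implementation: the function names, the toy two-dimensional "expression profiles", the fuzzifier m = 2, and the convergence tolerance are all assumptions chosen for illustration. Ward's agglomeration supplies the starting centroids, and the standard FCM updates then assign each gene a graded membership in every cluster.

```python
def sqdist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def ward_centroids(points, k):
    """Merge clusters by Ward's criterion until k remain; return their centroids."""
    clusters = [(1, list(p)) for p in points]  # each entry: (size, centroid)
    while len(clusters) > k:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                (ni, ci), (nj, cj) = clusters[i], clusters[j]
                # Increase in within-cluster sum of squares if i and j merge.
                cost = ni * nj / (ni + nj) * sqdist(ci, cj)
                if best is None or cost < best[0]:
                    best = (cost, i, j)
        _, i, j = best
        (ni, ci), (nj, cj) = clusters[i], clusters[j]
        merged = (ni + nj, [(ni * a + nj * b) / (ni + nj) for a, b in zip(ci, cj)])
        clusters = [c for idx, c in enumerate(clusters) if idx not in (i, j)]
        clusters.append(merged)
    return [c for _, c in clusters]


def memberships(points, centroids, m):
    """Standard FCM membership update: each row sums to 1 across clusters."""
    exponent = 2.0 / (m - 1.0)
    rows = []
    for p in points:
        # Clamp distances away from zero to avoid division by zero.
        d = [max(sqdist(p, c), 1e-24) ** 0.5 for c in centroids]
        rows.append([1.0 / sum((dk / dj) ** exponent for dj in d) for dk in d])
    return rows


def fuzzy_c_means(points, centroids, m=2.0, tol=1e-6, max_iter=200):
    """Alternate membership and centroid updates until centroids stop moving."""
    dims = len(points[0])
    for _ in range(max_iter):
        u = memberships(points, centroids, m)
        new = []
        for k in range(len(centroids)):
            w = [row[k] ** m for row in u]  # fuzzified weights for cluster k
            total = sum(w)
            new.append([sum(wi * p[d] for wi, p in zip(w, points)) / total
                        for d in range(dims)])
        shift = max(sqdist(a, b) for a, b in zip(centroids, new))
        centroids = new
        if shift < tol:  # tol is compared against the squared centroid shift
            break
    return memberships(points, centroids, m), centroids


# Hypothetical toy data: two tight groups plus one profile midway between them.
points = [[0.0, 0.1], [0.1, 0.0], [0.2, 0.1],
          [2.0, 2.1], [2.1, 2.0], [1.9, 2.0],
          [1.05, 1.05]]
U, centers = fuzzy_c_means(points, ward_centroids(points, 2))
```

In this toy run the last profile sits between the two groups, so its membership is split roughly evenly, whereas a hard clustering would force it into a single cluster. Starting from Ward centroids also makes the result deterministic, unlike random initialization.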

CONCLUSION

We validate our gene-sets and demonstrate that, by identifying genes associated with multiple gene-sets, the FCM clustering algorithm significantly improves interpretation of transcriptomic data, facilitating investigation of novel biological processes by leveraging transcriptomic data available in the public domain. We develop an interactive 'Fuzzy Inference of Gene-sets (FIGS)' package (GitHub: https://github.com/Thakar-Lab/FIGS) to facilitate use of the pipeline. Future extension of FIGS across different immune cell types will improve mechanistic investigation followed by high-throughput omics studies.
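One way to picture "genes associated with multiple gene-sets" is to threshold a fuzzy membership matrix and keep the genes that pass in more than one cluster. The sketch below assumes exactly that; the abstract does not state how FIGS converts memberships into gene-sets, so the threshold of 0.3, the placeholder gene names, and the membership values are hypothetical.

```python
def assign_gene_sets(membership, threshold):
    """Map each gene to the clusters where its membership meets the threshold."""
    return {gene: [k for k, u in enumerate(row) if u >= threshold]
            for gene, row in membership.items()}


def multi_set_genes(assignments):
    """Return genes assigned to more than one gene-set, sorted by name."""
    return sorted(g for g, sets in assignments.items() if len(sets) > 1)


# Hypothetical membership rows (one value per cluster, each row sums to 1).
membership = {
    "gene_A": [0.90, 0.05, 0.05],
    "gene_B": [0.45, 0.50, 0.05],
    "gene_C": [0.10, 0.15, 0.75],
    "gene_D": [0.35, 0.30, 0.35],
}

assignments = assign_gene_sets(membership, threshold=0.3)
shared = multi_set_genes(assignments)
```

Here `gene_B` and `gene_D` land in more than one set, which is the kind of overlap that hard clustering cannot represent. The threshold trades off set size against specificity and would need tuning on real data.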

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7553/5461682/42324178d9ab/12859_2017_1669_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7553/5461682/09cf97f54e89/12859_2017_1669_Fig3_HTML.jpg
