• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

分类器:一种基于语义相似性将基因分类到用户定义的生物组中的工具。

Categorizer: a tool to categorize genes into user-defined biological groups based on semantic similarity.

作者信息

Na Dokyun, Son Hyungbin, Gsponer Jörg

机构信息

Department of Biochemistry and Molecular Biology, Centre for High-throughput Biology, University of British Columbia, 2125 East Mall, Vancouver, BC V6T 1Z4, Canada.

出版信息

BMC Genomics. 2014 Dec 11;15(1):1091. doi: 10.1186/1471-2164-15-1091.

DOI:10.1186/1471-2164-15-1091
PMID:25495442
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4298957/
Abstract

BACKGROUND

Communalities between large sets of genes obtained from high-throughput experiments are often identified by searching for enrichments of genes with the same Gene Ontology (GO) annotations. The GO analysis tools used for these enrichment analyses assume that GO terms are independent and the semantic distances between all parent-child terms are identical, which is not true in a biological sense. In addition these tools output lists of often redundant or too specific GO terms, which are difficult to interpret in the context of the biological question investigated by the user. Therefore, there is a demand for a robust and reliable method for gene categorization and enrichment analysis.

RESULTS

We have developed Categorizer, a tool that classifies genes into user-defined groups (categories) and calculates p-values for the enrichment of the categories. Categorizer identifies the biologically best-fit category for each gene by taking advantage of a specialized semantic similarity measure for GO terms. We demonstrate that Categorizer provides improved categorization and enrichment results of genetic modifiers of Huntington's disease compared to a classical GO Slim-based approach or categorizations using other semantic similarity measures.

CONCLUSION

Categorizer enables more accurate categorizations of genes than currently available methods. This new tool will help experimental and computational biologists analyzing genomic and proteomic data according to their specific needs in a more reliable manner.

摘要

背景

从高通量实验中获得的大量基因之间的共性通常通过搜索具有相同基因本体论(GO)注释的基因富集情况来确定。用于这些富集分析的GO分析工具假定GO术语是独立的,并且所有父子术语之间的语义距离是相同的,而这在生物学意义上并不成立。此外,这些工具输出的往往是冗余或过于具体的GO术语列表,在用户所研究的生物学问题背景下很难解释。因此,需要一种强大且可靠的基因分类和富集分析方法。

结果

我们开发了Categorizer工具,它可以将基因分类到用户定义的组(类别)中,并计算类别的富集p值。Categorizer通过利用一种专门的GO术语语义相似性度量,为每个基因确定生物学上最匹配的类别。我们证明,与基于经典GO Slim的方法或使用其他语义相似性度量的分类方法相比,Categorizer在亨廷顿舞蹈病基因修饰因子的分类和富集结果方面有改进。

结论

与现有方法相比,Categorizer能够对基因进行更准确的分类。这个新工具将帮助实验生物学家和计算生物学家根据他们的特定需求,以更可靠的方式分析基因组和蛋白质组数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e81a/4298957/84138671d682/12864_2014_6898_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e81a/4298957/9fb725f24075/12864_2014_6898_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e81a/4298957/e0f44228b1f6/12864_2014_6898_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e81a/4298957/c0aff87c4ca6/12864_2014_6898_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e81a/4298957/3af61d92985e/12864_2014_6898_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e81a/4298957/84138671d682/12864_2014_6898_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e81a/4298957/9fb725f24075/12864_2014_6898_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e81a/4298957/e0f44228b1f6/12864_2014_6898_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e81a/4298957/c0aff87c4ca6/12864_2014_6898_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e81a/4298957/3af61d92985e/12864_2014_6898_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e81a/4298957/84138671d682/12864_2014_6898_Fig5_HTML.jpg

相似文献

1
Categorizer: a tool to categorize genes into user-defined biological groups based on semantic similarity.分类器:一种基于语义相似性将基因分类到用户定义的生物组中的工具。
BMC Genomics. 2014 Dec 11;15(1):1091. doi: 10.1186/1471-2164-15-1091.
2
GOcats: A tool for categorizing Gene Ontology into subgraphs of user-defined concepts.GOcats:一个将基因本体论分类为用户定义概念子图的工具。
PLoS One. 2020 Jun 11;15(6):e0233311. doi: 10.1371/journal.pone.0233311. eCollection 2020.
3
DaGO-Fun: tool for Gene Ontology-based functional analysis using term information content measures.DAGO-Fun:一种基于基因本体论的功能分析工具,使用术语信息内容度量。
BMC Bioinformatics. 2013 Sep 25;14:284. doi: 10.1186/1471-2105-14-284.
4
Using Semantic Similarities and csbl.go for Analyzing Microarray Data.利用语义相似性和csbl.go分析微阵列数据。
Methods Mol Biol. 2016;1375:105-16. doi: 10.1007/7651_2015_241.
5
TopoICSim: a new semantic similarity measure based on gene ontology.TopoICSim:一种基于基因本体论的新语义相似性度量方法。
BMC Bioinformatics. 2016 Jul 29;17(1):296. doi: 10.1186/s12859-016-1160-0.
6
GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists.GOrilla:一种用于在排序后的基因列表中发现和可视化富集的基因本体(GO)术语的工具。
BMC Bioinformatics. 2009 Feb 3;10:48. doi: 10.1186/1471-2105-10-48.
7
GeneTools--application for functional annotation and statistical hypothesis testing.基因工具——用于功能注释和统计假设检验的应用程序。
BMC Bioinformatics. 2006 Oct 24;7:470. doi: 10.1186/1471-2105-7-470.
8
Interspecies gene function prediction using semantic similarity.基于语义相似性的跨物种基因功能预测
BMC Syst Biol. 2016 Dec 23;10(Suppl 4):121. doi: 10.1186/s12918-016-0361-5.
9
DynGO: a tool for visualizing and mining of Gene Ontology and its associations.DynGO:一种用于可视化和挖掘基因本体及其关联的工具。
BMC Bioinformatics. 2005 Aug 9;6:201. doi: 10.1186/1471-2105-6-201.
10
REVIGO summarizes and visualizes long lists of gene ontology terms.REVIGO 对基因本体论术语的长列表进行总结和可视化。
PLoS One. 2011;6(7):e21800. doi: 10.1371/journal.pone.0021800. Epub 2011 Jul 18.

引用本文的文献

1
Hypoxia-induced immortalization of primary cells depends on Tfcp2L1 expression.缺氧诱导的原代细胞永生化依赖于 Tfcp2L1 的表达。
Cell Death Dis. 2024 Feb 28;15(2):177. doi: 10.1038/s41419-024-06567-z.
2
GCN2 is required to maintain core body temperature in mice during acute cold.GCN2 在急性冷暴露期间维持小鼠核心体温是必需的。
Am J Physiol Endocrinol Metab. 2023 Nov 1;325(5):E624-E637. doi: 10.1152/ajpendo.00181.2023. Epub 2023 Oct 4.
3
Regulated IRE1α-dependent decay (RIDD)-mediated reprograming of lipid metabolism in cancer.

本文引用的文献

1
Annotation enrichment analysis: an alternative method for evaluating the functional properties of gene sets.注释富集分析:一种评估基因集功能特性的替代方法。
Sci Rep. 2014 Feb 26;4:4191. doi: 10.1038/srep04191.
2
NeuroGeM, a knowledgebase of genetic modifiers in neurodegenerative diseases.神经遗传学数据库(NeuroGeM),一个神经退行性疾病遗传修饰因子知识库。
BMC Med Genomics. 2013 Nov 14;6:52. doi: 10.1186/1755-8794-6-52.
3
Information content-based gene ontology semantic similarity approaches: toward a unified framework theory.基于信息内容的基因本体语义相似性方法:迈向统一的框架理论。
调控的 IRE1α 依赖的降解(RIDD)介导的癌症中脂质代谢的重编程。
Nat Commun. 2022 May 6;13(1):2493. doi: 10.1038/s41467-022-30159-0.
4
Whether Gametophytes are Reduced or Unreduced in Angiosperms Might Be Determined Metabolically.被子植物的配子体是减数分裂的还是未减数分裂的,可能取决于其代谢方式。
Genes (Basel). 2020 Dec 2;11(12):1449. doi: 10.3390/genes11121449.
5
Graph-based exploitation of gene ontology using GOxploreR for scrutinizing biological significance.基于图的基因本体论挖掘:使用 GOxploreR 分析生物学意义
Sci Rep. 2020 Oct 7;10(1):16672. doi: 10.1038/s41598-020-73326-3.
6
GOcats: A tool for categorizing Gene Ontology into subgraphs of user-defined concepts.GOcats:一个将基因本体论分类为用户定义概念子图的工具。
PLoS One. 2020 Jun 11;15(6):e0233311. doi: 10.1371/journal.pone.0233311. eCollection 2020.
7
Epigenomic and Transcriptomic Changes During Human RPE EMT in a Stem Cell Model of Epiretinal Membrane Pathogenesis and Prevention by Nicotinamide.人视网膜色素上皮细胞在胚胎干细胞模型中的上皮间质转化过程中的表观基因组和转录组变化及烟酰胺对其的预防作用
Stem Cell Reports. 2020 Apr 14;14(4):631-647. doi: 10.1016/j.stemcr.2020.03.009. Epub 2020 Apr 2.
8
Advances in gene ontology utilization improve statistical power of annotation enrichment.基因本体论利用的进展提高了注释富集的统计功效。
PLoS One. 2019 Aug 15;14(8):e0220728. doi: 10.1371/journal.pone.0220728. eCollection 2019.
9
Indoxyl Sulfate Induces Apoptosis Through Oxidative Stress and Mitogen-Activated Protein Kinase Signaling Pathway Inhibition in Human Astrocytes.硫酸吲哚酚通过氧化应激和丝裂原活化蛋白激酶信号通路抑制诱导人星形胶质细胞凋亡。
J Clin Med. 2019 Feb 5;8(2):191. doi: 10.3390/jcm8020191.
10
A new method for evaluating the impacts of semantic similarity measures on the annotation of gene sets.一种评估语义相似性度量对基因集注释影响的新方法。
PLoS One. 2018 Nov 27;13(11):e0208037. doi: 10.1371/journal.pone.0208037. eCollection 2018.
Biomed Res Int. 2013;2013:292063. doi: 10.1155/2013/292063. Epub 2013 Sep 2.
4
DaGO-Fun: tool for Gene Ontology-based functional analysis using term information content measures.DAGO-Fun:一种基于基因本体论的功能分析工具,使用术语信息内容度量。
BMC Bioinformatics. 2013 Sep 25;14:284. doi: 10.1186/1471-2105-14-284.
5
Improving the measurement of semantic similarity between gene ontology terms and gene products: insights from an edge- and IC-based hybrid method.改进基因本体术语和基因产物之间语义相似度的测量:基于边缘和 IC 的混合方法的见解。
PLoS One. 2013 May 31;8(5):e66745. doi: 10.1371/journal.pone.0066745. Print 2013.
6
A novel insight into Gene Ontology semantic similarity.基因本体论语义相似性的新见解。
Genomics. 2013 Jun;101(6):368-75. doi: 10.1016/j.ygeno.2013.04.010. Epub 2013 Apr 26.
7
A topology-based metric for measuring term similarity in the gene ontology.一种用于衡量基因本体中术语相似性的基于拓扑结构的度量方法。
Adv Bioinformatics. 2012;2012:975783. doi: 10.1155/2012/975783. Epub 2012 May 15.
8
Proteomic analysis of wild-type and mutant huntingtin-associated proteins in mouse brains identifies unique interactions and involvement in protein synthesis.在野生型和突变型亨廷顿相关蛋白的小鼠脑中进行蛋白质组分析,鉴定出独特的相互作用并参与蛋白质合成。
J Biol Chem. 2012 Jun 22;287(26):21599-614. doi: 10.1074/jbc.M112.359307. Epub 2012 May 3.
9
Alternative splicing of mRNA in the molecular pathology of neurodegenerative diseases.mRNA 在神经退行性疾病分子病理学中的可变剪接。
Neurobiol Aging. 2012 May;33(5):1012.e11-24. doi: 10.1016/j.neurobiolaging.2011.10.030. Epub 2011 Nov 26.
10
Disjunctive shared information between ontology concepts: application to Gene Ontology.本体概念之间的析取共享信息:在基因本体中的应用
J Biomed Semantics. 2011 Aug 31;2:5. doi: 10.1186/2041-1480-2-5.