• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过方程选择优化任意阶轨道计数。

Optimising orbit counting of arbitrary order by equation selection.

机构信息

Ghent University - imec, IDLab, Technologiepark 15, Ghent, 9052, Belgium.

Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium.

出版信息

BMC Bioinformatics. 2019 Jan 15;20(1):27. doi: 10.1186/s12859-018-2483-9.

DOI:10.1186/s12859-018-2483-9
PMID:30646859
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6334470/
Abstract

BACKGROUND

Graphlets are useful for bioinformatics network analysis. Based on the structure of Hočevar and Demšar's ORCA algorithm, we have created an orbit counting algorithm, named Jesse. This algorithm, like ORCA, uses equations to count the orbits, but unlike ORCA it can count graphlets of any order. To do so, it generates the required internal structures and equations automatically. Many more redundant equations are generated, however, and Jesse's running time is highly dependent on which of these equations are used. Therefore, this paper aims to investigate which equations are most efficient, and which factors have an effect on this efficiency.

RESULTS

With appropriate equation selection, Jesse's running time may be reduced by a factor of up to 2 in the best case, compared to using randomly selected equations. Which equations are most efficient depends on the density of the graph, but barely on the graph type. At low graph density, equations with terms in their right-hand side with few arguments are more efficient, whereas at high density, equations with terms with many arguments in the right-hand side are most efficient. At a density between 0.6 and 0.7, both types of equations are about equally efficient.

CONCLUSIONS

Our Jesse algorithm became up to a factor 2 more efficient, by automatically selecting the best equations based on graph density. It was adapted into a Cytoscape App that is freely available from the Cytoscape App Store to ease application by bioinformaticians.

摘要

背景

图元在生物信息学网络分析中很有用。基于 Hočevar 和 Demšar 的 ORCA 算法的结构,我们创建了一个轨道计数算法,命名为 Jesse。该算法与 ORCA 一样,使用方程来计算轨道,但与 ORCA 不同的是,它可以计算任何阶数的图元。为此,它自动生成所需的内部结构和方程。然而,生成的冗余方程更多,并且 Jesse 的运行时间高度依赖于使用哪些方程。因此,本文旨在研究哪些方程效率最高,以及哪些因素对这种效率有影响。

结果

通过适当的方程选择,与随机选择方程相比,Jesse 的运行时间在最佳情况下可能减少多达 2 倍。哪些方程效率最高取决于图的密度,但几乎与图的类型无关。在低图密度下,右侧带有较少参数的项的方程效率更高,而在高密度下,右侧带有许多参数的项的方程效率更高。在密度介于 0.6 和 0.7 之间时,这两种类型的方程效率大致相同。

结论

我们的 Jesse 算法通过根据图密度自动选择最佳方程,效率提高了多达 2 倍。它已被改编为 Cytoscape App,并可从 Cytoscape App Store 免费获得,以方便生物信息学家使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ba/6334470/6d6b1d5f98d1/12859_2018_2483_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ba/6334470/c1cb32120a73/12859_2018_2483_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ba/6334470/780d879014f6/12859_2018_2483_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ba/6334470/f743fe47d255/12859_2018_2483_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ba/6334470/0a6825355f30/12859_2018_2483_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ba/6334470/6d6b1d5f98d1/12859_2018_2483_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ba/6334470/c1cb32120a73/12859_2018_2483_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ba/6334470/780d879014f6/12859_2018_2483_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ba/6334470/f743fe47d255/12859_2018_2483_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ba/6334470/0a6825355f30/12859_2018_2483_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ba/6334470/6d6b1d5f98d1/12859_2018_2483_Fig5_HTML.jpg

相似文献

1
Optimising orbit counting of arbitrary order by equation selection.通过方程选择优化任意阶轨道计数。
BMC Bioinformatics. 2019 Jan 15;20(1):27. doi: 10.1186/s12859-018-2483-9.
2
Efficiently counting all orbits of graphlets of any order in a graph using autogenerated equations.使用自动生成的方程高效地计算图中任意阶图元的所有轨道。
Bioinformatics. 2018 Apr 15;34(8):1372-1380. doi: 10.1093/bioinformatics/btx758.
3
A combinatorial approach to graphlet counting.图元计数的组合方法。
Bioinformatics. 2014 Feb 15;30(4):559-65. doi: 10.1093/bioinformatics/btt717. Epub 2013 Dec 11.
4
Combinatorial algorithm for counting small induced graphs and orbits.用于计数小诱导子图和轨道的组合算法。
PLoS One. 2017 Feb 9;12(2):e0171428. doi: 10.1371/journal.pone.0171428. eCollection 2017.
5
An Algorithm to Automatically Generate the Combinatorial Orbit Counting Equations.一种自动生成组合轨道计数方程的算法。
PLoS One. 2016 Jan 21;11(1):e0147078. doi: 10.1371/journal.pone.0147078. eCollection 2016.
6
A Cytoscape app for motif enumeration with ISMAGS.一个使用 ISMAGS 进行基序枚举的 Cytoscape 应用程序。
Bioinformatics. 2017 Feb 1;33(3):461-463. doi: 10.1093/bioinformatics/btw626.
7
Graphettes: Constant-time determination of graphlet and orbit identity including (possibly disconnected) graphlets up to size 8.图子:图子和轨道身份的常数时间确定,包括大小达8的(可能不连通的)图子。
PLoS One. 2017 Aug 23;12(8):e0181570. doi: 10.1371/journal.pone.0181570. eCollection 2017.
8
Revealing biological modules via graph summarization.通过图摘要揭示生物模块。
J Comput Biol. 2009 Feb;16(2):253-64. doi: 10.1089/cmb.2008.11TT.
9
Efficient estimation of graphlet frequency distributions in protein-protein interaction networks.蛋白质-蛋白质相互作用网络中图形频率分布的高效估计
Bioinformatics. 2006 Apr 15;22(8):974-80. doi: 10.1093/bioinformatics/btl030. Epub 2006 Feb 1.
10
Three-dimensional visualization of protein interaction networks.蛋白质相互作用网络的三维可视化
Comput Biol Med. 2004 Mar;34(2):127-39. doi: 10.1016/S0010-4825(03)00045-3.

本文引用的文献

1
IncGraph: Incremental graphlet counting for topology optimisation.IncGraph:用于拓扑优化的增量图元计数。
PLoS One. 2018 Apr 26;13(4):e0195997. doi: 10.1371/journal.pone.0195997. eCollection 2018.
2
Efficiently counting all orbits of graphlets of any order in a graph using autogenerated equations.使用自动生成的方程高效地计算图中任意阶图元的所有轨道。
Bioinformatics. 2018 Apr 15;34(8):1372-1380. doi: 10.1093/bioinformatics/btx758.
3
Graphettes: Constant-time determination of graphlet and orbit identity including (possibly disconnected) graphlets up to size 8.
图子:图子和轨道身份的常数时间确定,包括大小达8的(可能不连通的)图子。
PLoS One. 2017 Aug 23;12(8):e0181570. doi: 10.1371/journal.pone.0181570. eCollection 2017.
4
An Algorithm to Automatically Generate the Combinatorial Orbit Counting Equations.一种自动生成组合轨道计数方程的算法。
PLoS One. 2016 Jan 21;11(1):e0147078. doi: 10.1371/journal.pone.0147078. eCollection 2016.
5
A combinatorial approach to graphlet counting.图元计数的组合方法。
Bioinformatics. 2014 Feb 15;30(4):559-65. doi: 10.1093/bioinformatics/btt717. Epub 2013 Dec 11.
6
The MIntAct project--IntAct as a common curation platform for 11 molecular interaction databases.MIntAct 项目——将 IntAct 作为 11 个分子相互作用数据库的通用协同策展平台。
Nucleic Acids Res. 2014 Jan;42(Database issue):D358-63. doi: 10.1093/nar/gkt1115. Epub 2013 Nov 13.
7
Uncovering biological network function via graphlet degree signatures.通过图let度特征揭示生物网络功能
Cancer Inform. 2008;6:257-73. Epub 2008 Apr 14.
8
Biological network comparison using graphlet degree distribution.使用图let度分布进行生物网络比较。
Bioinformatics. 2007 Jan 15;23(2):e177-83. doi: 10.1093/bioinformatics/btl301.
9
FANMOD: a tool for fast network motif detection.FANMOD:一种用于快速网络基序检测的工具。
Bioinformatics. 2006 May 1;22(9):1152-3. doi: 10.1093/bioinformatics/btl038. Epub 2006 Feb 2.
10
Modeling interactome: scale-free or geometric?建模相互作用组:无标度还是几何?
Bioinformatics. 2004 Dec 12;20(18):3508-15. doi: 10.1093/bioinformatics/bth436. Epub 2004 Jul 29.