Biodesign Center, Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China.
National Technology Innovation Center of Synthetic Biology, Tianjin 300308, PR China.
Nucleic Acids Res. 2022 Jul 5;50(W1):W298-W304. doi: 10.1093/nar/gkac288.
Cellular regulation is inherently complex, and a single cellular function is often controlled by a cascade of different types of regulatory interactions. For example, a transcription factor (TF) regulates the expression level of downstream genes through transcriptional regulation, and its own activity can in turn be regulated by small molecules through compound-protein interactions. Identifying such multi-step regulatory cascades in traditional relational databases requires inefficient additional operations and is computationally expensive. In contrast, graph databases are purpose-built to execute such deep searches efficiently. Here, we present ERMer (E. coli Regulation Miner), the first cloud platform for mining the regulatory landscape of Escherichia coli based on graph databases. Combining the AWS Neptune graph database, AWS Lambda functions, and the G6 graph visualization engine enables quick search and visualization of complex regulatory cascades and patterns. Users can also interactively navigate the E. coli regulatory landscape through ERMer. Furthermore, a Q&A module is included to showcase the power of graph databases in answering complex biological questions through simple queries. The backend graph model can be easily extended as new data become available. In addition, the framework implemented in ERMer can be easily migrated to other applications or organisms. ERMer is available at https://ermer.biodesign.ac.cn/.
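To make the idea of a cascade search concrete, the following is a minimal in-memory sketch in Python. It is not ERMer's implementation, which runs on AWS Neptune; the relation labels (`binds`, `activates`, `represses`), the function names, and the handful of textbook E. coli interactions used as toy data are all assumptions chosen for illustration. The sketch finds every path that starts at a small molecule, follows a compound-protein edge to a TF, and then follows a transcriptional-regulation edge to a gene.

```python
from collections import defaultdict

# Toy edge list: (source, relation, target). Illustrative data only,
# not taken from the ERMer database.
EDGES = [
    ("allolactose", "binds", "LacI"),
    ("cAMP", "binds", "CRP"),
    ("LacI", "represses", "lacZ"),
    ("CRP", "activates", "lacZ"),
    ("CRP", "activates", "araB"),
]


def build_graph(edges):
    """Index edges by source node so each hop is a single dictionary lookup."""
    graph = defaultdict(list)
    for source, relation, target in edges:
        graph[source].append((relation, target))
    return graph


def find_cascades(graph, start, relation_pattern):
    """Return all paths from `start` whose edges match `relation_pattern`.

    `relation_pattern` is a list of sets; the i-th edge of a path must carry
    a relation contained in the i-th set.
    """
    paths = [[start]]
    for allowed in relation_pattern:
        extended = []
        for path in paths:
            for relation, target in graph.get(path[-1], []):
                if relation in allowed:
                    extended.append(path + [relation, target])
        paths = extended
    return paths


graph = build_graph(EDGES)
# Hop 1: compound-protein interaction; hop 2: transcriptional regulation.
pattern = [{"binds"}, {"activates", "represses"}]

for cascade in find_cascades(graph, "cAMP", pattern):
    print(" -> ".join(cascade))
```

Each hop here is a direct neighbor lookup, which is the access pattern graph databases are built around. In a relational schema the same two-hop question would typically be expressed as a chain of joins across interaction tables, and the number of joins grows with the depth of the cascade.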