State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, No. 1 Shizishan Street, Hongshan District, Wuhan 430070, Hubei, China.
College of Veterinary Medicine, Huazhong Agricultural University, No. 1 Shizishan Street, Hongshan District, Wuhan 430070, Hubei, China.
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae320.
Understanding the biological functions and processes of genes, particularly those not yet characterized, is crucial for advancing molecular biology and identifying therapeutic targets. The hypothesis guiding this study is that the 3D proximity of genes correlates with their functional interactions and relevance in prokaryotes. We introduced 3D-GeneNet, an innovative software tool that utilizes high-throughput sequencing data from chromosome conformation capture techniques and integrates topological metrics to construct gene association networks. Through a series of comparative analyses focused on spatial versus linear distances, we explored various dimensions such as topological structure, functional enrichment levels, distribution patterns of linear distances among gene pairs, and the area under the receiver operating characteristic curve by utilizing model organism Escherichia coli K-12. Furthermore, 3D-GeneNet was shown to maintain good accuracy compared to multiple algorithms (neighbourhood, co-occurrence, coexpression, and fusion) across multiple bacteria, including E. coli, Brucella abortus, and Vibrio cholerae. In addition, the accuracy of 3D-GeneNet's prediction of long-distance gene interactions was identified by bacterial two-hybrid assays on E. coli K-12 MG1655, where 3D-GeneNet not only increased the accuracy of linear genomic distance tripled but also achieved 60% accuracy by running alone. Finally, it can be concluded that the applicability of 3D-GeneNet will extend to various bacterial forms, including Gram-negative, Gram-positive, single-, and multi-chromosomal bacteria through Hi-C sequencing and analysis. Such findings highlight the broad applicability and significant promise of this method in the realm of gene association network. 3D-GeneNet is freely accessible at https://github.com/gaoyuanccc/3D-GeneNet.
理解基因的生物学功能和过程,特别是那些尚未被描述的基因,对于推进分子生物学和确定治疗靶点至关重要。本研究的假设是基因的三维接近度与它们在原核生物中的功能相互作用和相关性相关。我们引入了 3D-GeneNet,这是一种创新的软件工具,利用来自染色体构象捕获技术的高通量测序数据,并整合拓扑度量来构建基因关联网络。通过一系列专注于空间与线性距离的比较分析,我们探索了各种维度,如拓扑结构、功能富集水平、基因对之间线性距离的分布模式,以及利用模式生物大肠杆菌 K-12 进行的Receiver Operating Characteristic 曲线下面积。此外,与多个细菌(包括大肠杆菌、流产布鲁氏菌和霍乱弧菌)中的多个算法(邻居、共现、共表达和融合)相比,3D-GeneNet 表现出良好的准确性。此外,通过大肠杆菌 K-12 MG1655 的细菌双杂交实验,鉴定了 3D-GeneNet 预测长距离基因相互作用的准确性,其中 3D-GeneNet 不仅提高了三倍线性基因组距离的准确性,而且单独运行时达到了 60%的准确性。最后,可以得出结论,通过 Hi-C 测序和分析,3D-GeneNet 将扩展到各种细菌形式,包括革兰氏阴性、革兰氏阳性、单染色体和多染色体细菌。这些发现突显了该方法在基因关联网络领域的广泛适用性和重要前景。3D-GeneNet 可在 https://github.com/gaoyuanccc/3D-GeneNet 上免费获取。