Karaji Rezvan, Peña-Castillo Lourdes
Department of Computer Science, Memorial University of Newfoundland, St. John's, Newfoundland and Labrador, Canada.
Department of Biology, Memorial University of Newfoundland, St. John's, Newfoundland and Labrador, Canada.
PLoS One. 2025 Aug 1;20(8):e0329355. doi: 10.1371/journal.pone.0329355. eCollection 2025.
An operon refers to a group of neighbouring genes belonging to one or more overlapping transcription units that are transcribed in the same direction and have at least one gene in common. Operons are a characteristic of prokaryotic genomes. Identifying which genes belong to the same operon facilitates understanding of gene function and regulation. There are several computational approaches for operon detection; however, many of these computational approaches have been developed for a specific target bacterium or require information only available for a restricted number of bacterial species. Here, we introduce a general method, OpDetect, that directly utilizes RNA-sequencing (RNA-seq) reads as a signal over nucleotide bases in the genome. This representation enabled us to employ a convolutional and recurrent deep neural network architecture which demonstrated superior performance in terms of recall, F1-score and Area under the Receiver-Operating characteristic Curve (AUROC) compared to previous approaches. Additionally, OpDetect showcases species-agnostic capabilities, successfully detecting operons in a wide range of bacterial species and even in Caenorhabditis elegans, one of few eukaryotic organisms known to have operons. OpDetect is available at https://github.com/BioinformaticsLabAtMUN/OpDetect.
操纵子是指一组相邻的基因,它们属于一个或多个重叠的转录单元,这些转录单元沿相同方向转录且至少有一个共同基因。操纵子是原核生物基因组的一个特征。确定哪些基因属于同一个操纵子有助于理解基因功能和调控。有几种用于检测操纵子的计算方法;然而,这些计算方法中的许多是针对特定目标细菌开发的,或者只需要有限数量细菌物种可用的信息。在这里,我们介绍一种通用方法OpDetect,它直接将RNA测序(RNA-seq)读数用作基因组中核苷酸碱基上的信号。这种表示方式使我们能够采用卷积和循环深度神经网络架构,与以前的方法相比,该架构在召回率、F1分数和受试者操作特征曲线下面积(AUROC)方面表现出卓越性能。此外,OpDetect展示了与物种无关的能力,成功地在广泛的细菌物种甚至秀丽隐杆线虫(已知具有操纵子的少数真核生物之一)中检测到操纵子。可在https://github.com/BioinformaticsLabAtMUN/OpDetect获取OpDetect。