Ambrosino Luca, Bostan Hamed, di Salle Pasquale, Sangiovanni Mara, Vigilante Alessandra, Chiusano Maria L
Department of Agriculture, University of Naples Federico II, Portici, Italy.
Department of Electrical Engineering and Information Technology, University of Naples Federico II, Naples, Italy.
Evol Bioinform Online. 2016 Jan 11;12:1-7. doi: 10.4137/EBO.S32536. eCollection 2016.
Arabidopsis thaliana is widely accepted as a model species in plant biology. Its genome, due to its small size and diploidy, was the first to be sequenced among plants, making this species also a reference for plant comparative genomics. Nevertheless, the evolutionary mechanisms that shaped the Arabidopsis genome are still controversial. Indeed, duplications, translocations, inversions, and gene loss events that contributed to the current organization are difficult to be traced. A reliable identification of paralogs and single-copy genes is essential to understand these mechanisms. Therefore, we implemented a dedicated pipeline to identify paralog genes and classify single-copy genes into opportune categories. PATsi, a web-accessible database, was organized to allow the straightforward access to the paralogs organized into networks and to the classification of single-copy genes. This permits to efficiently explore the gene collection of Arabidopsis for evolutionary investigations and comparative genomics.
拟南芥被广泛认为是植物生物学中的模式物种。由于其基因组规模小且为二倍体,它是首个完成测序的植物基因组,这也使该物种成为植物比较基因组学的一个参考对象。然而,塑造拟南芥基因组的进化机制仍存在争议。实际上,导致当前基因组结构的重复、易位、倒位和基因丢失事件很难追溯。可靠地识别旁系同源基因和单拷贝基因对于理解这些机制至关重要。因此,我们实施了一个专门的流程来识别旁系同源基因,并将单拷贝基因分类到合适的类别中。我们构建了一个名为PATsi的可通过网络访问的数据库,以便能直接获取组织成网络的旁系同源基因以及单拷贝基因的分类信息。这使得能够高效地探索拟南芥的基因集,用于进化研究和比较基因组学。