Sanderson Theo, Rayner Julian C
Malaria Programme, Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK.
Wellcome Open Res. 2017 Jul 24;2:45. doi: 10.12688/wellcomeopenres.11896.2. eCollection 2017.
Two decades after the first transfection, attempts have been made to disrupt more than 3,151 genes in malaria parasites, across five species. While results from rodent malaria transfections have been curated and systematised, empowering large-scale analysis, phenotypic data from human malaria parasite transfections currently exists as individual reports scattered across a the literature. To facilitate systematic analysis of published experimental genetic data across species, we have built PhenoPlasm ( http://www.phenoplasm.org), a database of phenotypes generated by transfection experiments in all parasites. The site provides a simple interface linking citation-backed reverse-genetic phenotypes to gene IDs. The database has been populated with phenotypic data on 367 genes, curated from 176 individual publications, as well as existing data on rodent species from RMgmDB and PlasmoGEM. This is the first time that all available data on transfection experiments has been brought together in a single place. These data are presented using ortholog mapping to allow a researcher interested in a gene in one species to see results across other species. The collaborative nature of the database enables any researcher to add new phenotypes as they are discovered. As an example of database utility, we use the currently available datasets to identify RAP (RNA-binding domain abundant in Apicomplexa)-domain containing proteins as crucial to parasite survival.
首次转染二十年后,人们已尝试在五种疟原虫中破坏超过3151个基因。虽然啮齿类疟原虫转染的结果已被整理和系统化,有助于进行大规模分析,但人类疟原虫转染的表型数据目前以分散在文献中的个别报告形式存在。为便于对已发表的跨物种实验遗传数据进行系统分析,我们建立了PhenoPlasm(http://www.phenoplasm.org),这是一个关于所有寄生虫转染实验所产生表型的数据库。该网站提供了一个简单界面,将有文献支持的反向遗传表型与基因ID相链接。该数据库已收录了从176篇单独出版物中整理出的367个基因的表型数据,以及来自RMgmDB和PlasmoGEM的啮齿类物种的现有数据。这是首次将所有关于转染实验的可用数据集中在一处。这些数据通过直系同源映射呈现,以便对一个物种中的某个基因感兴趣的研究人员能够查看其他物种的结果。该数据库的协作性质使任何研究人员在发现新表型时都能添加进去。作为数据库实用性的一个例子,我们利用当前可用的数据集确定含有RAP(顶复门中丰富的RNA结合结构域)结构域的蛋白质对寄生虫生存至关重要。