Blaschke C, Andrade M A, Ouzounis C, Valencia A
Protein Design Group, CNB-CSIC, Madrid, Spain.
Proc Int Conf Intell Syst Mol Biol. 1999:60-7.
We describe the basic design of a system for automatic detection of protein-protein interactions extracted from scientific abstracts. By restricting the problem domain and imposing a number of strong assumptions which include pre-specified protein names and a limited set of verbs that represent actions, we show that it is possible to perform accurate information extraction. The performance of the system is evaluated with different cases of real-world interaction networks, including the Drosophila cell cycle control. The results obtained computationally are in good agreement with current biological knowledge and demonstrate the feasibility of developing a fully automated system able to describe networks of protein interactions with sufficient accuracy.
我们描述了一个用于自动检测从科学摘要中提取的蛋白质-蛋白质相互作用的系统的基本设计。通过限制问题域并施加一些强假设,包括预先指定的蛋白质名称和一组表示动作的有限动词,我们表明可以进行准确的信息提取。该系统的性能通过不同的真实世界相互作用网络案例进行评估,包括果蝇细胞周期控制。通过计算获得的结果与当前的生物学知识高度一致,并证明了开发一个能够以足够精度描述蛋白质相互作用网络的全自动系统的可行性。