Barker W C, Hunt L T
National Biomedical Research Foundation, Georgetown University Medical Center, Washington, D.C., USA.
J Protein Chem. 1997 Jul;16(5):459-62. doi: 10.1023/a:1026305309829.
Protein sequence data are as useful and valuable today as was envisioned by pioneering sequencers and by the organizers of the first sequence database. Sequence analysis was first the province of specialists who developed search, comparison, and tree-building methods. Microcomputers, communication satellites, and the Internet have made these methods accessible to any scientist. The rapid increase in the data has driven a succession of changes in how databases are compiled, distributed, and accessed. Large public databases have become international collaborations. Although they need to develop still more efficient ways to accumulate, organize, annotate, and standardize huge amounts of data, inadequate support is available for such efforts. Thus there will be greater reliance on direct input from the scientific community. The World Wide Web is essential but not sufficient for integrated access to related databases.
如今,蛋白质序列数据的有用性和价值,正如早期测序人员以及首个序列数据库的组织者所设想的那样。序列分析最初是由开发搜索、比较和建树方法的专家们负责的领域。微型计算机、通信卫星和互联网已使这些方法可供任何科学家使用。数据的迅速增长推动了数据库在编译、分发和访问方式上的一系列变革。大型公共数据库已成为国际合作项目。尽管它们需要开发更高效的方法来积累、组织、注释和标准化海量数据,但为此类工作提供的支持却不足。因此,将更依赖科学界的直接投入。万维网对于综合访问相关数据库至关重要,但还不够。