Leary Patrick R, Remsen David P, Norton Catherine N, Patterson David J, Sarkar Indra Neil
MBL Informatics, Marine Biological Laboratory, Woods Hole, MA 02543, USA.
Bioinformatics. 2007 Jun 1;23(11):1434-6. doi: 10.1093/bioinformatics/btm109. Epub 2007 Mar 28.
Web content syndication through standard formats such as RSS and ATOM has become an increasingly popular mechanism for publishers, news sources and blogs to disseminate regularly updated content. These standardized syndication formats deliver content directly to the subscriber, allowing them to locally aggregate content from a variety of sources instead of having to find the information on multiple websites. The uBioRSS application is a 'taxonomically intelligent' service customized for the biological sciences. It aggregates syndicated content from academic publishers and science news feeds, and then uses a taxonomic Named Entity Recognition algorithm to identify and index taxonomic names within those data streams. The resulting name index is cross-referenced to current global taxonomic datasets to provide context for browsing the publications by taxonomic group. This process, called taxonomic indexing, draws upon services developed specifically for biological sciences, collectively referred to as 'taxonomic intelligence'. Such value-added enhancements can provide biologists with accelerated and improved access to current biological content.
通过RSS和ATOM等标准格式进行网络内容聚合,已成为出版商、新闻源和博客定期传播更新内容的一种越来越流行的机制。这些标准化的聚合格式将内容直接传送给订阅者,使他们能够在本地聚合来自各种来源的内容,而不必在多个网站上查找信息。uBioRSS应用程序是一项为生物科学定制的“分类智能”服务。它聚合来自学术出版商和科学新闻源的聚合内容,然后使用分类命名实体识别算法来识别和索引这些数据流中的分类名称。生成的名称索引会与当前的全球分类数据集进行交叉引用,以便按分类组浏览出版物提供背景信息。这个过程称为分类索引,它利用专门为生物科学开发的服务,统称为“分类智能”。这种增值增强功能可以为生物学家提供更快、更好地获取当前生物内容的途径。