Gallagher Marie E
Lister Hill National Center for Biomedical Communications, U.S. National Library of Medicine; Bethesda, Maryland, USA.
Archiving. 2013;2013:74-79.
The Profiles in Science® digital library features digitized surrogates of historical items selected from the archival collections of the U.S. National Library of Medicine as well as collaborating institutions. In addition, it contains a database of descriptive, technical and administrative metadata. It also contains various software components that allow creation of the metadata, management of the digital items, and access to the items and metadata through the Profiles in Science Web site [1]. The choices made building the digital library were designed to maximize the sustainability and long-term survival of all of the components of the digital library [2]. For example, selecting standard and open digital file formats rather than proprietary formats increases the sustainability of the digital files [3]. Correspondingly, using non-proprietary software may improve the sustainability of the software--either through in-house expertise or through the open source community. Limiting our digital library software exclusively to open source software or to software developed in-house has not been feasible. For example, we have used proprietary operating systems, scanning software, a search engine, and office productivity software. We did this when either lack of essential capabilities or the cost-benefit trade-off favored using proprietary software. We also did so knowing that in the future we would need to replace or upgrade some of our proprietary software, analogous to migrating from an obsolete digital file format to a new format as the technological landscape changes. Since our digital library's start in 1998, all of its software has been upgraded or replaced, but the digitized items have not yet required migration to other formats. Technological changes that compelled us to replace proprietary software included the cost of product licensing, product support, incompatibility with other software, prohibited use due to evolving security policies, and product abandonment. Sometimes these changes happen on short notice, so we continually monitor our library's software for signs of endangerment. We have attempted to replace proprietary software with suitable in-house or open source software. When the replacement involves a standalone piece of software with a nearly equivalent version, such as replacing a commercial HTTP server with an open source HTTP server, the replacement is straightforward. Recently we replaced software that functioned not only as our search engine but also as the backbone of the architecture of our Web site. In this paper, we describe the lessons learned and the pros and cons of replacing this software with open source software.
“科学人物”数字图书馆的特色是数字化替代物,这些替代物选自美国国立医学图书馆以及合作机构的档案馆藏中的历史资料。此外,它还包含一个描述性、技术性和管理性元数据的数据库。它还包含各种软件组件,可用于创建元数据、管理数字资料,并通过“科学人物”网站访问这些资料和元数据[1]。构建数字图书馆时所做的选择旨在最大限度地提高数字图书馆所有组件的可持续性和长期存续性[2]。例如,选择标准和开放的数字文件格式而非专有格式可提高数字文件的可持续性[3]。相应地,使用非专有软件可能会提高软件的可持续性——要么通过内部专业知识,要么通过开源社区。将我们的数字图书馆软件完全限制为开源软件或内部开发的软件并不可行。例如,我们使用过专有操作系统、扫描软件、搜索引擎和办公生产力软件。当缺乏基本功能或成本效益权衡有利于使用专有软件时,我们就会这样做。我们这样做也是知道未来我们需要替换或升级我们的一些专有软件,类似于随着技术格局的变化从过时的数字文件格式迁移到新格式。自我们的数字图书馆于1998年启动以来,其所有软件都已升级或替换,但数字化资料尚未需要迁移到其他格式。迫使我们替换专有软件的技术变革包括产品许可成本、产品支持、与其他软件不兼容、由于不断演变的安全政策而被禁止使用以及产品被弃用。有时这些变化会在短时间内发生,所以我们不断监测我们图书馆软件是否有濒危迹象。我们试图用合适的内部或开源软件替换专有软件。当替换涉及具有几乎等效版本的独立软件时,例如用开源HTTP服务器替换商业HTTP服务器,替换很简单。最近我们替换了不仅用作我们的搜索引擎而且用作我们网站架构主干的软件。在本文中,我们描述了所学到的经验教训以及用开源软件替换此软件的利弊。