Thomas Dennis G, Klaessig Fred, Harper Stacey L, Fritts Martin, Hoover Mark D, Gaheen Sharon, Stokes Todd H, Reznik-Zellen Rebecca, Freund Elaine T, Klemm Juli D, Paik David S, Baker Nathan A
Knowledge Discovery and Informatics Group, Pacific Northwest National Laboratory, Richland, WA, USA.
Pennsylvania Bio Nano Systems, LLC, Doylestown, PA, USA.
Wiley Interdiscip Rev Nanomed Nanobiotechnol. 2011 Sep-Oct;3(5):511-532. doi: 10.1002/wnan.152. Epub 2011 Jun 30.
There are several issues to be addressed concerning the management and effective use of information (or data), generated from nanotechnology studies in biomedical research and medicine. These data are large in volume, diverse in content, and are beset with gaps and ambiguities in the description and characterization of nanomaterials. In this work, we have reviewed three areas of nanomedicine informatics: information resources; taxonomies, controlled vocabularies, and ontologies; and information standards. Informatics methods and standards in each of these areas are critical for enabling collaboration; data sharing; unambiguous representation and interpretation of data; semantic (meaningful) search and integration of data; and for ensuring data quality, reliability, and reproducibility. In particular, we have considered four types of information standards in this article, which are standard characterization protocols, common terminology standards, minimum information standards, and standard data communication (exchange) formats. Currently, because of gaps and ambiguities in the data, it is also difficult to apply computational methods and machine learning techniques to analyze, interpret, and recognize patterns in data that are high dimensional in nature, and also to relate variations in nanomaterial properties to variations in their chemical composition, synthesis, characterization protocols, and so on. Progress toward resolving the issues of information management in nanomedicine using informatics methods and standards discussed in this article will be essential to the rapidly growing field of nanomedicine informatics.
在生物医学研究和医学领域的纳米技术研究中,关于信息(或数据)的管理和有效利用存在几个需要解决的问题。这些数据量很大,内容多样,并且在纳米材料的描述和表征方面存在空白和模糊之处。在这项工作中,我们回顾了纳米医学信息学的三个领域:信息资源;分类法、受控词汇表和本体;以及信息标准。这些领域中的每一个领域的信息学方法和标准对于促进合作、数据共享、数据的明确表示和解释、语义(有意义)搜索和数据整合,以及确保数据质量、可靠性和可重复性都至关重要。特别是,我们在本文中考虑了四种类型的信息标准,即标准表征协议、通用术语标准、最小信息标准和标准数据通信(交换)格式。目前,由于数据中存在空白和模糊之处,也很难应用计算方法和机器学习技术来分析、解释和识别本质上是高维的数据中的模式,以及将纳米材料特性的变化与其化学成分、合成、表征协议等的变化联系起来。使用本文讨论的信息学方法和标准在解决纳米医学信息管理问题方面取得进展对于迅速发展的纳米医学信息学领域至关重要。