• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个带有纳米医学和药代动力学参数的注释语料库。

An annotated corpus with nanomedicine and pharmacokinetic parameters.

作者信息

Lewinski Nastassja A, Jimenez Ivan, McInnes Bridget T

机构信息

Department of Chemical and Life Science Engineering, Virginia Commonwealth University, Richmond, VA.

Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.

出版信息

Int J Nanomedicine. 2017 Oct 12;12:7519-7527. doi: 10.2147/IJN.S137117. eCollection 2017.

DOI:10.2147/IJN.S137117
PMID:29066897
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5644562/
Abstract

A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP) approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP) efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration's Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the evaluation of nanomedicine entity extraction. The data were manually annotated for 21 entity mentions consisting of nanomedicine physicochemical characterization, exposure, and biologic response information of 41 Food and Drug Administration-approved nanomedicines. We evaluate the reliability of the manual annotations and demonstrate the use of the corpus by evaluating two state-of-the-art named entity extraction systems, OpenNLP and Stanford NER. The annotated corpus is available open source and, based on these results, guidelines and suggestions for future development of additional nanomedicine corpora are provided.

摘要

关于纳米药物的大量数据正在产生并发表,自然语言处理(NLP)方法可以自动提取基于非结构化文本的数据。带注释的语料库是NLP和采用机器学习的信息提取方法的关键资源。虽然有针对药品的语料库,但纳米药物和纳米技术的资源仍然有限。为了促进纳米技术文本挖掘(NanoNLP)工作,我们构建了一个带注释的药品说明书语料库,这些说明书取自美国食品药品监督管理局(FDA)的Drugs@FDA在线数据库。在这项工作中,我们展示了工程纳米药物数据库语料库的开发,以支持纳米药物实体提取的评估。对41种美国食品药品监督管理局批准的纳米药物的21种实体提及进行了人工注释,这些实体提及包括纳米药物的物理化学特征、暴露情况和生物反应信息。我们评估了人工注释的可靠性,并通过评估两个最先进的命名实体提取系统OpenNLP和斯坦福命名实体识别器(Stanford NER)来展示该语料库的用途。该带注释的语料库以开源形式提供,并基于这些结果,为未来开发更多纳米药物语料库提供了指导方针和建议。

相似文献

1
An annotated corpus with nanomedicine and pharmacokinetic parameters.一个带有纳米医学和药代动力学参数的注释语料库。
Int J Nanomedicine. 2017 Oct 12;12:7519-7527. doi: 10.2147/IJN.S137117. eCollection 2017.
2
FoodBase corpus: a new resource of annotated food entities.FoodBase 语料库:一个新的带注释食物实体资源。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz121.
3
Europe PMC annotated full-text corpus for gene/proteins, diseases and organisms.欧洲 PMC 注释全文生物库,包含基因/蛋白质、疾病和生物信息。
Sci Data. 2023 Oct 19;10(1):722. doi: 10.1038/s41597-023-02617-x.
4
Concept annotation in the CRAFT corpus.概念标注在 CRAFT 语料库中。
BMC Bioinformatics. 2012 Jul 9;13:161. doi: 10.1186/1471-2105-13-161.
5
Exploiting and assessing multi-source data for supervised biomedical named entity recognition.利用和评估多源数据进行有监督的生物医学命名实体识别。
Bioinformatics. 2018 Jul 15;34(14):2474-2482. doi: 10.1093/bioinformatics/bty152.
6
NCBI disease corpus: a resource for disease name recognition and concept normalization.NCBI疾病语料库:一种用于疾病名称识别和概念规范化的资源。
J Biomed Inform. 2014 Feb;47:1-10. doi: 10.1016/j.jbi.2013.12.006. Epub 2014 Jan 3.
7
Portable automatic text classification for adverse drug reaction detection via multi-corpus training.通过多语料库训练实现用于药物不良反应检测的便携式自动文本分类
J Biomed Inform. 2015 Feb;53:196-207. doi: 10.1016/j.jbi.2014.11.002. Epub 2014 Nov 8.
8
An annotated corpus from biomedical articles to construct a drug-food interaction database.一个来自生物医学文章的带注释语料库,用于构建药物-食物相互作用数据库。
J Biomed Inform. 2022 Feb;126:103985. doi: 10.1016/j.jbi.2022.103985. Epub 2022 Jan 7.
9
NLM-Chem-BC7: manually annotated full-text resources for chemical entity annotation and indexing in biomedical articles.NLM-Chem-BC7:用于生物医学文章中化学实体注释和索引的人工标注全文资源。
Database (Oxford). 2022 Dec 1;2022. doi: 10.1093/database/baac102.
10
TaeC: A manually annotated text dataset for trait and phenotype extraction and entity linking in wheat breeding literature.TaeC:一个用于小麦育种文献中性状和表型提取以及实体链接的人工注释文本数据集。
PLoS One. 2024 Jun 13;19(6):e0305475. doi: 10.1371/journal.pone.0305475. eCollection 2024.

引用本文的文献

1
Expanding the Horizons of Machine Learning in Nanomaterials to Chiral Nanostructures.将机器学习在纳米材料领域的应用拓展至手性纳米结构
Adv Mater. 2024 May;36(18):e2308912. doi: 10.1002/adma.202308912. Epub 2024 Feb 3.
2
Annotation and detection of drug effects in text for pharmacovigilance.用于药物警戒的文本中药物效应的标注与检测。
J Cheminform. 2018 Aug 13;10(1):37. doi: 10.1186/s13321-018-0290-y.

本文引用的文献

1
Citizen Science for Mining the Biomedical Literature.用于挖掘生物医学文献的公民科学。
Citiz Sci. 2016;1(2). doi: 10.5334/cstp.56. Epub 2016 Dec 31.
2
Nanoparticles in the clinic.临床中的纳米颗粒。
Bioeng Transl Med. 2016 Jun 3;1(1):10-29. doi: 10.1002/btm2.10003. eCollection 2016 Mar.
3
The evolving landscape of drug products containing nanomaterials in the United States.美国含纳米材料药物产品的演变格局。
Nat Nanotechnol. 2017 Jul;12(6):523-529. doi: 10.1038/nnano.2017.67. Epub 2017 Apr 24.
4
Nanoparticle-Based Medicines: A Review of FDA-Approved Materials and Clinical Trials to Date.基于纳米颗粒的药物:对美国食品药品监督管理局(FDA)批准的材料及迄今临床试验的综述。
Pharm Res. 2016 Oct;33(10):2373-87. doi: 10.1007/s11095-016-1958-5. Epub 2016 Jun 14.
5
A study of active learning methods for named entity recognition in clinical text.临床文本中命名实体识别的主动学习方法研究
J Biomed Inform. 2015 Dec;58:11-18. doi: 10.1016/j.jbi.2015.09.010. Epub 2015 Sep 15.
6
Using natural language processing techniques to inform research on nanotechnology.利用自然语言处理技术为纳米技术研究提供信息。
Beilstein J Nanotechnol. 2015 Jul 1;6:1439-49. doi: 10.3762/bjnano.6.149. eCollection 2015.
7
Microtask crowdsourcing for disease mention annotation in PubMed abstracts.用于在PubMed摘要中进行疾病提及标注的微任务众包。
Pac Symp Biocomput. 2015:282-93.
8
Nanopharmaceuticals (part 1): products on the market.纳米药物(第1部分):市场上的产品。
Int J Nanomedicine. 2014 Sep 15;9:4357-73. doi: 10.2147/IJN.S46900. eCollection 2014.
9
Therapeutic nanoparticles in clinics and under clinical evaluation.临床应用及临床评估中的治疗性纳米颗粒。
Nanomedicine (Lond). 2013 Mar;8(3):449-67. doi: 10.2217/nnm.13.8.
10
BANNER: an executable survey of advances in biomedical named entity recognition.横幅:生物医学命名实体识别进展的可执行调查。
Pac Symp Biocomput. 2008:652-63.