• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PubMed 中的作者身份识别:国立卫生研究院资助科学家的权威性精确性与召回率证据

Author Disambiguation in PubMed: Evidence on the Precision and Recall of Author-ity among NIH-Funded Scientists.

作者信息

Lerchenmueller Marc J, Sorenson Olav

机构信息

Yale School of Management, Yale University, New Haven, CT, United States of America.

出版信息

PLoS One. 2016 Jul 1;11(7):e0158731. doi: 10.1371/journal.pone.0158731. eCollection 2016.

DOI:10.1371/journal.pone.0158731
PMID:27367860
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4930168/
Abstract

We examined the usefulness (precision) and completeness (recall) of the Author-ity author disambiguation for PubMed articles by associating articles with scientists funded by the National Institutes of Health (NIH). In doing so, we exploited established unique identifiers-Principal Investigator (PI) IDs-that the NIH assigns to funded scientists. Analyzing a set of 36,987 NIH scientists who received their first R01 grant between 1985 and 2009, we identified 355,921 articles appearing in PubMed that would allow us to evaluate the precision and recall of the Author-ity disambiguation. We found that Author-ity identified the NIH scientists with 99.51% precision across the articles. It had a corresponding recall of 99.64%. Precision and recall, moreover, appeared stable across common and uncommon last names, across ethnic backgrounds, and across levels of scientist productivity.

摘要

我们通过将文章与由美国国立卫生研究院(NIH)资助的科学家相关联,来检验用于PubMed文章的Author-ity作者消歧的有用性(精确率)和完整性(召回率)。在此过程中,我们利用了NIH分配给受资助科学家的既定唯一标识符——首席研究员(PI)ID。通过分析一组在1985年至2009年间获得首个R01资助的36,987名NIH科学家,我们确定了PubMed中出现的355,921篇文章,这些文章使我们能够评估Author-ity消歧的精确率和召回率。我们发现,Author-ity在这些文章中识别NIH科学家的精确率为99.51%。其相应的召回率为99.64%。此外,精确率和召回率在常见和不常见姓氏、不同种族背景以及不同科学家生产力水平之间似乎保持稳定。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/077c/4930168/55205668f657/pone.0158731.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/077c/4930168/25f6edf3061f/pone.0158731.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/077c/4930168/e62a1e504db9/pone.0158731.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/077c/4930168/d3f7ea85e929/pone.0158731.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/077c/4930168/55205668f657/pone.0158731.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/077c/4930168/25f6edf3061f/pone.0158731.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/077c/4930168/e62a1e504db9/pone.0158731.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/077c/4930168/d3f7ea85e929/pone.0158731.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/077c/4930168/55205668f657/pone.0158731.g004.jpg

相似文献

1
Author Disambiguation in PubMed: Evidence on the Precision and Recall of Author-ity among NIH-Funded Scientists.PubMed 中的作者身份识别:国立卫生研究院资助科学家的权威性精确性与召回率证据
PLoS One. 2016 Jul 1;11(7):e0158731. doi: 10.1371/journal.pone.0158731. eCollection 2016.
2
Research by pathologists not funded by external grant agencies: a success story.由非外部资助机构资助的病理学家的研究:一个成功案例。
Mod Pathol. 1992 Sep;5(5):577-9.
3
Examining the Impact of the National Institutes of Health Public Access Policy on the Citation Rates of Journal Articles.审视美国国立卫生研究院公共获取政策对期刊文章引用率的影响。
PLoS One. 2015 Oct 8;10(10):e0139951. doi: 10.1371/journal.pone.0139951. eCollection 2015.
4
Early career academic productivity among emergency physicians with R01 grant funding.R01 资助的急诊医师早期职业学术生产力。
Acad Emerg Med. 2011 Jul;18(7):759-62. doi: 10.1111/j.1553-2712.2011.01118.x.
5
Tracking publication outcomes of National Institutes of Health grants.追踪美国国立卫生研究院资助项目的发表成果。
Am J Med. 2005 Jun;118(6):658-63. doi: 10.1016/j.amjmed.2005.02.015.
6
Physician-scientists in neurology: Research contributions of a cohort of neurologists.神经科医师科学家:一组神经科医生的研究贡献。
Neurology. 2018 Sep 11;91(11):508-514. doi: 10.1212/01.wnl.0000544243.58941.11. Epub 2018 Aug 10.
7
Publication rates from biomedical and behavioral and social science R01s funded by the National Institutes of Health.美国国立卫生研究院资助的生物医学、行为和社会科学 R01 项目的出版物发表率。
PLoS One. 2020 Nov 13;15(11):e0242271. doi: 10.1371/journal.pone.0242271. eCollection 2020.
8
National Institutes of Health Funding to Departments of Orthopaedic Surgery at U.S. Medical Schools.美国国立卫生研究院对美国医学院校骨科手术科室的资助。
J Bone Joint Surg Am. 2017 Jan 18;99(2):e5. doi: 10.2106/JBJS.16.00088.
9
Perspective: is NIH funding the "best science by the best scientists"? A critique of the NIH R01 research grant review policies.观点:NIH 的资金是否用于“最优秀的科学家开展的最佳科学研究”?对 NIH R01 研究资助审查政策的批评。
Acad Med. 2010 May;85(5):775-9. doi: 10.1097/ACM.0b013e3181d74256.
10
Author Name Disambiguation in MEDLINE.医学在线数据库(MEDLINE)中的作者姓名消歧
ACM Trans Knowl Discov Data. 2009 Jul 1;3(3). doi: 10.1145/1552303.1552304.

引用本文的文献

1
Bridging the gap in author names: building an enhanced author name dataset for biomedical literature system.弥合作者姓名差异:构建生物医学文献系统的增强型作者姓名数据集。
J Am Med Inform Assoc. 2024 Aug 1;31(8):1648-1656. doi: 10.1093/jamia/ocae127.
2
Ethnicity-based name partitioning for author name disambiguation using supervised machine learning.使用监督式机器学习进行基于种族的姓名划分以消除作者姓名歧义
J Assoc Inf Sci Technol. 2021 Aug;72(8):979-994. doi: 10.1002/asi.24459. Epub 2021 Feb 23.
3
TrendyGenes, a computational pipeline for the detection of literature trends in academia and drug discovery.

本文引用的文献

1
Author Name Disambiguation for PubMed.PubMed的作者姓名消歧
J Assoc Inf Sci Technol. 2014 Apr;65(4):765-781. doi: 10.1002/asi.23063. Epub 2013 Nov 21.
2
'Seed + expand': a general methodology for detecting publication oeuvres of individual researchers.“种子+扩展”:一种检测个体研究人员发表作品全集的通用方法。
Scientometrics. 2014;101(2):1403-1417. doi: 10.1007/s11192-014-1256-0. Epub 2014 Mar 5.
3
Author Name Disambiguation in MEDLINE.医学在线数据库(MEDLINE)中的作者姓名消歧
TrendyGenes,一个用于检测学术界和药物发现文献趋势的计算管道。
Sci Rep. 2021 Aug 3;11(1):15747. doi: 10.1038/s41598-021-94897-9.
4
ReCiter: An open source, identity-driven, authorship prediction algorithm optimized for academic institutions.ReCiter:一种开源的、以身份为驱动的、针对学术机构进行优化的作者预测算法。
PLoS One. 2021 Apr 1;16(4):e0244641. doi: 10.1371/journal.pone.0244641. eCollection 2021.
5
Building a PubMed knowledge graph.构建 PubMed 知识图谱。
Sci Data. 2020 Jun 26;7(1):205. doi: 10.1038/s41597-020-0543-2.
6
Understanding Drug Repurposing From the Perspective of Biomedical Entities and Their Evolution: Bibliographic Research Using Aspirin.从生物医学实体及其演变的角度理解药物再利用:以阿司匹林为例的文献研究
JMIR Med Inform. 2020 Jun 16;8(6):e16739. doi: 10.2196/16739.
7
A new approach and gold standard toward author disambiguation in MEDLINE.一种新的方法和金标准,用于 MEDLINE 中的作者去重。
J Am Med Inform Assoc. 2019 Oct 1;26(10):1037-1045. doi: 10.1093/jamia/ocz028.
8
The Global Burden of Journal Peer Review in the Biomedical Literature: Strong Imbalance in the Collective Enterprise.生物医学文献中期刊同行评审的全球负担:集体事业中的严重不平衡
PLoS One. 2016 Nov 10;11(11):e0166387. doi: 10.1371/journal.pone.0166387. eCollection 2016.
9
How to organize science and technology information in Latin America?如何整理拉丁美洲的科技信息?
Colomb Med (Cali). 2016 Sep 30;47(3):131-132.
ACM Trans Knowl Discov Data. 2009 Jul 1;3(3). doi: 10.1145/1552303.1552304.