Suppr超能文献

一种用于Medline记录的概率相似性度量:作者姓名消歧模型。

A probabilistic similarity metric for Medline records: a model for author name disambiguation.

作者信息

Torvik Vetle I, Weeber Marc, Swanson Don R, Smalheiser Neil R

机构信息

Department of Psychiatry, University of Illinois at Chicago, IL, USA.

出版信息

AMIA Annu Symp Proc. 2003;2003:1033.

Abstract

We present a model for automatically generating training sets and estimating the probability that a pair of Medline records sharing a last and first name initial are authored by the same individual, based on shared title words, journal name, co-authors, medical subject headings, language, and affiliation, as well as distinctive features of the name itself (i.e., presence of middle initial, suffix, and prevalence in Medline).

摘要

我们提出了一个模型,用于自动生成训练集,并基于共享的标题词、期刊名称、共同作者、医学主题词、语言和机构,以及姓名本身的独特特征(即中间名首字母、后缀的存在情况以及在Medline中的出现频率),估计一对共享姓氏首字母和名字首字母的Medline记录是否由同一人撰写的概率。

相似文献

2
Author Name Disambiguation in MEDLINE.医学在线数据库(MEDLINE)中的作者姓名消歧
ACM Trans Knowl Discov Data. 2009 Jul 1;3(3). doi: 10.1145/1552303.1552304.
10
Author Name Disambiguation for PubMed.PubMed的作者姓名消歧
J Assoc Inf Sci Technol. 2014 Apr;65(4):765-781. doi: 10.1002/asi.23063. Epub 2013 Nov 21.

引用本文的文献

1
Slow convergence: Career impediments to interdisciplinary biomedical research.缓慢的融合:跨学科生物医学研究的职业障碍。
Proc Natl Acad Sci U S A. 2024 Aug 6;121(32):e2402646121. doi: 10.1073/pnas.2402646121. Epub 2024 Jul 29.
7
(Almost) all of entity resolution.(几乎)所有的实体解析。
Sci Adv. 2022 Mar 25;8(12):eabi8021. doi: 10.1126/sciadv.abi8021.
10
Self-citation is the hallmark of productive authors, of any gender.自引是高产作者的标志,无论性别。
PLoS One. 2018 Sep 26;13(9):e0195773. doi: 10.1371/journal.pone.0195773. eCollection 2018.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验