• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ReCiter:一种开源的、以身份为驱动的、针对学术机构进行优化的作者预测算法。

ReCiter: An open source, identity-driven, authorship prediction algorithm optimized for academic institutions.

机构信息

Samuel J. Wood Library and Information Technologies & Services, Weill Cornell Medicine, New York, New York, United States of America.

Information Technologies & Services, Weill Cornell Medicine, New York, New York, United States of America.

出版信息

PLoS One. 2021 Apr 1;16(4):e0244641. doi: 10.1371/journal.pone.0244641. eCollection 2021.

DOI:10.1371/journal.pone.0244641
PMID:33793563
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8016248/
Abstract

Academic institutions need to maintain publication lists for thousands of faculty and other scholars. Automated tools are essential to minimize the need for direct feedback from the scholars themselves who are practically unable to commit necessary effort to keep the data accurate. In relying exclusively on clustering techniques, author disambiguation applications fail to satisfy key use cases of academic institutions. Algorithms can perfectly group together a set of publications authored by a common individual, but, for them to be useful to an academic institution, they need to programmatically and recurrently map articles to thousands of scholars of interest en masse. Consistent with a savvy librarian's approach for generating a scholar's list of publications, identity-driven authorship prediction is the process of using information about a scholar to quantify the likelihood that person wrote certain articles. ReCiter is an application that attempts to do exactly that. ReCiter uses institutionally-maintained identity data such as name of department and year of terminal degree to predict which articles a given scholar has authored. To compute the overall score for a given candidate article from PubMed (and, optionally, Scopus), ReCiter uses: up to 12 types of commonly available, identity data; whether other members of a cluster have been accepted or rejected by a user; and the average score of a cluster. In addition, ReCiter provides scoring and qualitative evidence supporting why particular articles are suggested. This context and confidence scoring allows curators to more accurately provide feedback on behalf of scholars. To help users to more efficiently curate publication lists, we used a support vector machine analysis to optimize the scoring of the ReCiter algorithm. In our analysis of a diverse test group of 500 scholars at an academic private medical center, ReCiter correctly predicted 98% of their publications in PubMed.

摘要

学术机构需要维护成千上万的教职员工和其他学者的出版物清单。自动化工具对于减少直接从实际上无法投入必要精力来确保数据准确性的学者那里获取反馈的需求至关重要。仅依靠聚类技术,作者去重应用程序无法满足学术机构的关键用例。算法可以完美地将一组由共同作者撰写的出版物组合在一起,但为了对学术机构有用,它们需要以编程方式和定期地将文章大规模地映射到数千名感兴趣的学者。与精明的图书馆员生成学者出版物列表的方法一致,基于身份的作者预测是使用有关学者的信息来量化该人撰写某些文章的可能性的过程。ReCiter 是一个尝试做到这一点的应用程序。ReCiter 使用机构维护的身份数据(如部门名称和最高学位授予年份)来预测给定学者撰写的哪些文章。为了从 PubMed(和可选的 Scopus)计算给定候选文章的总体得分,ReCiter 使用:多达 12 种常用的身份数据类型;集群中的其他成员是否被用户接受或拒绝;以及集群的平均得分。此外,ReCiter 提供了支持特定文章被建议的原因的评分和定性证据。这种上下文和置信度评分允许策展人更准确地代表学者提供反馈。为了帮助用户更有效地管理出版物清单,我们使用支持向量机分析来优化 ReCiter 算法的评分。在对一个学术私立医疗中心的 500 名学者的多样化测试组进行的分析中,ReCiter 在 PubMed 中正确预测了 98%的他们的出版物。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/f49b7fe5aeb9/pone.0244641.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/40cfb62ddf9a/pone.0244641.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/3f0f42875c6d/pone.0244641.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/da6291ee79ef/pone.0244641.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/2ffa0ac1b1fd/pone.0244641.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/2e1ed091932c/pone.0244641.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/62d64009e3a2/pone.0244641.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/1c5e7ef04c1e/pone.0244641.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/825281c78347/pone.0244641.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/ed9da1e09906/pone.0244641.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/5d1c68080240/pone.0244641.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/695060148171/pone.0244641.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/9ed0f8a1c6a3/pone.0244641.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/f49b7fe5aeb9/pone.0244641.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/40cfb62ddf9a/pone.0244641.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/3f0f42875c6d/pone.0244641.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/da6291ee79ef/pone.0244641.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/2ffa0ac1b1fd/pone.0244641.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/2e1ed091932c/pone.0244641.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/62d64009e3a2/pone.0244641.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/1c5e7ef04c1e/pone.0244641.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/825281c78347/pone.0244641.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/ed9da1e09906/pone.0244641.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/5d1c68080240/pone.0244641.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/695060148171/pone.0244641.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/9ed0f8a1c6a3/pone.0244641.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2430/8016248/f49b7fe5aeb9/pone.0244641.g013.jpg

相似文献

1
ReCiter: An open source, identity-driven, authorship prediction algorithm optimized for academic institutions.ReCiter:一种开源的、以身份为驱动的、针对学术机构进行优化的作者预测算法。
PLoS One. 2021 Apr 1;16(4):e0244641. doi: 10.1371/journal.pone.0244641. eCollection 2021.
2
Automatic generation of investigator bibliographies for institutional research networking systems.为机构研究网络系统自动生成研究者文献目录。
J Biomed Inform. 2014 Oct;51:8-14. doi: 10.1016/j.jbi.2014.03.013. Epub 2014 Mar 30.
3
A need to accelerate health research productivity in an African University: the case of Makerere University College of Health Sciences.非洲一所大学加速健康研究生产力的必要性:以马凯雷雷大学健康科学学院为例。
Health Res Policy Syst. 2017 Apr 21;15(1):33. doi: 10.1186/s12961-017-0196-6.
4
Gender and Radiology Publication Productivity: An Examination of Academic Faculty From Four Health Systems in the United States.性别与放射学发表成果:对美国四个医疗系统学术教员的考察
J Am Coll Radiol. 2017 Aug;14(8):1100-1108. doi: 10.1016/j.jacr.2017.04.017.
5
Development and Validation of an Automated Tool to Retrieve and Curate Faculty Publications of Academic Departments.用于检索和整理学术部门教师出版物的自动化工具的开发与验证
Cureus. 2023 Oct 30;15(10):e47976. doi: 10.7759/cureus.47976. eCollection 2023 Oct.
6
Importance of First and Second Authorship in Assessing Citation-Based Scholarly Activity of US Radiation Oncology Residents and Subsequent Choice of Academic Versus Private Practice Career.评估美国放射肿瘤学住院医师的引文学术活动以及随后选择学术与私人执业职业的第一作者和第二作者的重要性。
J Am Coll Radiol. 2018 Sep;15(9):1322-1325. doi: 10.1016/j.jacr.2018.05.015. Epub 2018 Jun 20.
7
c-index and Subindices of the h-index: New Variants of the h-index to Account for Variations in Author Contribution.h指数的c指数及子指数:考虑作者贡献差异的h指数新变体
Cureus. 2018 May 15;10(5):e2629. doi: 10.7759/cureus.2629.
8
Academic Productivity in Psychiatry: Benchmarks for the H-Index.精神病学领域的学术生产力:H指数基准
Acad Psychiatry. 2017 Aug;41(4):452-454. doi: 10.1007/s40596-016-0656-2. Epub 2017 Apr 18.
9
An Analysis of Research from Faculty at U.S. Adult Reconstruction Fellowships.美国成人重建奖学金获得者的教师研究分析
J Arthroplasty. 2015 Dec;30(12):2376-9. doi: 10.1016/j.arth.2015.05.051. Epub 2015 Jun 3.
10
The Higher-Ed Organizational-Scholar Tension: How Scholarship Compatibility and the Alignment of Organizational and Faculty Skills, Values and Support Affects Scholar's Performance and Well-Being.高等教育组织学者张力:学术兼容性以及组织与教师技能、价值观和支持的一致性如何影响学者的绩效与福祉。
Front Psychol. 2017 Apr 13;8:450. doi: 10.3389/fpsyg.2017.00450. eCollection 2017.

引用本文的文献

1
The role of information science within the clinical translational science ecosystem.信息科学在临床转化科学生态系统中的作用。
J Clin Transl Sci. 2024 Nov 27;8(1):e224. doi: 10.1017/cts.2024.664. eCollection 2025.
2
ORCID coverage in research institutions-Readiness for partially automated research reporting.研究机构中的ORCID覆盖范围——部分自动化研究报告的准备情况。
Front Res Metr Anal. 2022 Nov 10;7:1010504. doi: 10.3389/frma.2022.1010504. eCollection 2022.
3
Transforming and extending library services by embracing technology and collaborations: A case study.

本文引用的文献

1
Dynamically generating T32 training documents using structured data.使用结构化数据动态生成 T32 培训文件。
J Med Libr Assoc. 2019 Jul;107(3):420-424. doi: 10.5195/jmla.2019.401. Epub 2019 Jul 1.
2
A new approach and gold standard toward author disambiguation in MEDLINE.一种新的方法和金标准,用于 MEDLINE 中的作者去重。
J Am Med Inform Assoc. 2019 Oct 1;26(10):1037-1045. doi: 10.1093/jamia/ocz028.
3
Author Name Disambiguation for PubMed.PubMed的作者姓名消歧
通过拥抱技术和合作来转变和扩展图书馆服务:案例研究。
Health Info Libr J. 2022 Sep;39(3):294-298. doi: 10.1111/hir.12439. Epub 2022 Jun 22.
4
TeamTree analysis: A new approach to evaluate scientific production.团队树分析:一种评估科研产出的新方法。
PLoS One. 2021 Jul 21;16(7):e0253847. doi: 10.1371/journal.pone.0253847. eCollection 2021.
J Assoc Inf Sci Technol. 2014 Apr;65(4):765-781. doi: 10.1002/asi.23063. Epub 2013 Nov 21.
4
Data sets for author name disambiguation: an empirical analysis and a new resource.用于消除作者姓名歧义的数据集:实证分析与新资源。
Scientometrics. 2017;111(3):1467-1500. doi: 10.1007/s11192-017-2363-5. Epub 2017 Mar 27.
5
Author Disambiguation in PubMed: Evidence on the Precision and Recall of Author-ity among NIH-Funded Scientists.PubMed 中的作者身份识别:国立卫生研究院资助科学家的权威性精确性与召回率证据
PLoS One. 2016 Jul 1;11(7):e0158731. doi: 10.1371/journal.pone.0158731. eCollection 2016.
6
Automatic generation of investigator bibliographies for institutional research networking systems.为机构研究网络系统自动生成研究者文献目录。
J Biomed Inform. 2014 Oct;51:8-14. doi: 10.1016/j.jbi.2014.03.013. Epub 2014 Mar 30.
7
Author Name Disambiguation in MEDLINE.医学在线数据库(MEDLINE)中的作者姓名消歧
ACM Trans Knowl Discov Data. 2009 Jul 1;3(3). doi: 10.1145/1552303.1552304.