• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用高分辨率地理位置数据对专利发明人及受让人进行去重。

Disambiguation of patent inventors and assignees using high-resolution geolocation data.

机构信息

Department of Physics, The University of Houston, Houston, Texas, USA.

IMT Institute for Advanced Studies, Lucca, Italy.

出版信息

Sci Data. 2017 May 16;4:170064. doi: 10.1038/sdata.2017.64.

DOI:10.1038/sdata.2017.64
PMID:28509897
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5433392/
Abstract

Patent data represent a significant source of information on innovation, knowledge production, and the evolution of technology through networks of citations, co-invention and co-assignment. A major obstacle to extracting useful information from this data is the problem of name disambiguation: linking alternate spellings of individuals or institutions to a single identifier to uniquely determine the parties involved in knowledge production and diffusion. In this paper, we describe a new algorithm that uses high-resolution geolocation to disambiguate both inventors and assignees on about 8.5 million patents found in the European Patent Office (EPO), under the Patent Cooperation Treaty (PCT), and in the US Patent and Trademark Office (USPTO). We show this disambiguation is consistent with a number of ground-truth benchmarks of both assignees and inventors, significantly outperforming the use of undisambiguated names to identify unique entities. A significant benefit of this work is the high quality assignee disambiguation with coverage across the world coupled with an inventor disambiguation (that is competitive with other state of the art approaches) in multiple patent offices.

摘要

专利数据是创新、知识生产和技术演变的重要信息来源,通过引用、共同发明和共同分配的网络来体现。从这些数据中提取有用信息的一个主要障碍是名称歧义问题:将个人或机构的不同拼写与单个标识符联系起来,以唯一确定参与知识生产和传播的各方。在本文中,我们描述了一种新算法,该算法使用高分辨率地理位置来区分欧洲专利局(EPO)、专利合作条约(PCT)下和美国专利商标局(USPTO)中约 850 万项专利中的发明人及受让人。我们表明,这种去歧义与受让人及发明人的许多真实基准一致,显著优于使用未去歧义的名称来识别唯一实体。这项工作的一个显著好处是,在全球范围内具有高质量的受让人去歧义功能,同时在多个专利局中具有与其他最先进方法竞争的发明人去歧义功能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4259/5433392/e31cbb9e66d7/sdata201764-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4259/5433392/65ac1e61c8a7/sdata201764-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4259/5433392/c2bc8e40dbde/sdata201764-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4259/5433392/6525bb6b9c5c/sdata201764-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4259/5433392/c312e3ab865d/sdata201764-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4259/5433392/e31cbb9e66d7/sdata201764-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4259/5433392/65ac1e61c8a7/sdata201764-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4259/5433392/c2bc8e40dbde/sdata201764-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4259/5433392/6525bb6b9c5c/sdata201764-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4259/5433392/c312e3ab865d/sdata201764-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4259/5433392/e31cbb9e66d7/sdata201764-f5.jpg

相似文献

1
Disambiguation of patent inventors and assignees using high-resolution geolocation data.利用高分辨率地理位置数据对专利发明人及受让人进行去重。
Sci Data. 2017 May 16;4:170064. doi: 10.1038/sdata.2017.64.
2
[Biotechnology research and generation of patents of interest to the health system].[生物技术研究及卫生系统相关专利的产生]
Med Clin (Barc). 2008 Dec;131 Suppl 5:55-9. doi: 10.1016/S0025-7753(08)76408-X.
3
Obstacles to prior art searching by the trilateral patent offices: empirical evidence from International Search Reports.三边专利局进行现有技术检索的障碍:来自国际检索报告的实证证据
Scientometrics. 2016;107:701-722. doi: 10.1007/s11192-016-1858-9. Epub 2016 Apr 7.
4
A study of inventiveness among Society of Interventional Radiology members and the impact of their social networks.一项关于介入放射学会成员创造力及其社交网络影响的研究。
J Vasc Interv Radiol. 2013 Jul;24(7):931-7. doi: 10.1016/j.jvir.2013.03.033.
5
Australian medical patents granted in the United States in 1984-1999.1984年至1999年在美国获得授权的澳大利亚医学专利。
Med J Aust. 2001 Jan 15;174(2):83-7. doi: 10.5694/j.1326-5377.2001.tb143160.x.
6
Nanotechnology and Protection of Intellectual Property: Emerging Trends.纳米技术与知识产权保护:新兴趋势。
Recent Pat Nanotechnol. 2020;14(4):307-327. doi: 10.2174/1872210514666200612174317.
7
Characteristics of the Most Recently Awarded Magnetic Resonance Imaging Patents in the United States.美国最近授予的磁共振成像专利的特点。
Curr Probl Diagn Radiol. 2018 Sep;47(5):302-304. doi: 10.1067/j.cpradiol.2017.07.008. Epub 2017 Jul 29.
8
Digital pathology: A systematic evaluation of the patent landscape.数字病理学:专利态势的系统评估
J Pathol Inform. 2014 May 26;5(1):16. doi: 10.4103/2153-3539.133112. eCollection 2014.
9
Gender Disparity in Surgical Device Patents: A Five-year Trend From Canada and the United States.性别差异在外科手术设备专利:来自加拿大和美国的五年趋势。
J Surg Res. 2022 Dec;280:248-257. doi: 10.1016/j.jss.2022.07.016. Epub 2022 Aug 23.
10
Exploring the topology and dynamic growth properties of co-invention networks and technology fields.探索共同发明网络和技术领域的拓扑结构和动态增长特性。
PLoS One. 2021 Sep 2;16(9):e0256956. doi: 10.1371/journal.pone.0256956. eCollection 2021.

引用本文的文献

1
SciSciNet: A large-scale open data lake for the science of science research.SciSciNet:科学学研究的大规模开放数据湖。
Sci Data. 2023 Jun 1;10(1):315. doi: 10.1038/s41597-023-02198-9.
2
Talent goes to global cities: The world network of scientists' mobility.人才流向全球城市:科学家流动的世界网络。
Res Policy. 2021 Jan;50(1):104127. doi: 10.1016/j.respol.2020.104127. Epub 2020 Sep 21.
3
Geocoding of worldwide patent data.全球专利数据的地理编码。

本文引用的文献

1
Quantifying the impact of weak, strong, and super ties in scientific careers.量化弱关系、强关系和超级关系在科学职业生涯中的影响。
Proc Natl Acad Sci U S A. 2015 Aug 25;112(34):E4671-80. doi: 10.1073/pnas.1501444112. Epub 2015 Aug 10.
2
Invention as a combinatorial process: evidence from US patents.作为组合过程的发明:来自美国专利的证据。
J R Soc Interface. 2015 May 6;12(106). doi: 10.1098/rsif.2015.0272.
3
Career on the move: geography, stratification, and scientific impact.职业变迁:地理学、分层与科学影响力。
Sci Data. 2019 Nov 6;6(1):260. doi: 10.1038/s41597-019-0264-6.
4
A dataset of publication records for Nobel laureates.诺贝尔奖得主的出版物记录数据集。
Sci Data. 2019 Apr 18;6(1):33. doi: 10.1038/s41597-019-0033-6.
Sci Rep. 2014 Apr 24;4:4770. doi: 10.1038/srep04770.
4
European policy. Is Europe evolving toward an integrated research area?欧洲政策。欧洲正在朝着一个一体化的研究领域发展吗?
Science. 2013 Feb 8;339(6120):650-1. doi: 10.1126/science.1227970.
5
Foreign-born scientists: mobility patterns for 16 countries.出生在国外的科学家:16个国家的流动模式。
Nat Biotechnol. 2012 Dec;30(12):1250-3. doi: 10.1038/nbt.2449.
6
Author Name Disambiguation in MEDLINE.医学在线数据库(MEDLINE)中的作者姓名消歧
ACM Trans Knowl Discov Data. 2009 Jul 1;3(3). doi: 10.1145/1552303.1552304.
7
The increasing dominance of teams in production of knowledge.团队在知识生产中日益占据主导地位。
Science. 2007 May 18;316(5827):1036-9. doi: 10.1126/science.1136099. Epub 2007 Apr 12.
8
Flows of knowledge from universities and federal laboratories: modeling the flow of patent citations over time and across institutional and geographic boundaries.来自大学和联邦实验室的知识流动:对专利引用随时间以及跨机构和地域界限的流动进行建模。
Proc Natl Acad Sci U S A. 1996 Nov 12;93(23):12671-7. doi: 10.1073/pnas.93.23.12671.