创新的语言。

The Language of Innovation.

机构信息

European Commission, Joint Research Centre (JRC), Seville, Spain.

Institute for Complex Systems, CNR, Rome, Italy.

出版信息

PLoS One. 2020 Apr 30;15(4):e0230107. doi: 10.1371/journal.pone.0230107. eCollection 2020.

DOI:10.1371/journal.pone.0230107

PMID:32352986

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7194493/

Abstract

Predicting innovation is a peculiar problem in data science. Following its definition, an innovation is always a never-seen-before event, leaving no room for traditional supervised learning approaches. Here we propose a strategy to address the problem in the context of innovative patents, by defining innovations as never-seen-before associations of technologies and exploiting self-supervised learning techniques. We think of technological codes present in patents as a vocabulary and the whole technological corpus as written in a specific, evolving language. We leverage such structure with techniques borrowed from Natural Language Processing by embedding technologies in a high dimensional euclidean space where relative positions are representative of learned semantics. Proximity in this space is an effective predictor of specific innovation events, that outperforms a wide range of standard link-prediction metrics. The success of patented innovations follows a complex dynamics characterized by different patterns which we analyze in details with specific examples. The methods proposed in this paper provide a completely new way of understanding and forecasting innovation, by tackling it from a revealing perspective and opening interesting scenarios for a number of applications and further analytic approaches.

摘要

预测创新是数据科学中的一个特殊问题。根据其定义，创新总是一个前所未有的事件，没有传统监督学习方法的空间。在这里，我们提出了一种在创新专利背景下解决该问题的策略，通过将创新定义为技术的前所未有的关联，并利用自监督学习技术。我们将专利中存在的技术代码视为词汇，将整个技术语料库视为用特定的、不断发展的语言书写的。我们通过将技术嵌入到高维欧几里得空间中来利用这些结构，在这个空间中，相对位置代表学习到的语义。在这个空间中的接近度是特定创新事件的有效预测指标，优于广泛的标准链接预测指标。专利创新的成功遵循一种复杂的动态，其特征是具有不同的模式，我们通过具体示例详细分析了这些模式。本文提出的方法通过从一个有启发性的角度处理创新问题，并为许多应用程序和进一步的分析方法开辟了有趣的场景，为理解和预测创新提供了一种全新的方式。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca97/7194493/12357aee92b6/pone.0230107.g001.jpg

相似文献

The Language of Innovation.创新的语言。

PLoS One. 2020 Apr 30;15(4):e0230107. doi: 10.1371/journal.pone.0230107. eCollection 2020.

Modeling Trajectories Obtained from External Sensors for Location Prediction via NLP Approaches.通过自然语言处理方法对来自外部传感器的轨迹建模以进行位置预测。

Sensors (Basel). 2022 Oct 2;22(19):7475. doi: 10.3390/s22197475.

Improve word embedding using both writing and pronunciation.利用写作和发音来改进单词嵌入。

PLoS One. 2018 Dec 10;13(12):e0208785. doi: 10.1371/journal.pone.0208785. eCollection 2018.

Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window.通过使用可扩展的地理空间上下文窗口来测量词的空间语义相似性，从而理解自然语言的空间维度。

PLoS One. 2020 Jul 23;15(7):e0236347. doi: 10.1371/journal.pone.0236347. eCollection 2020.

Open-Ended Technological Innovation.开放式技术创新。

Artif Life. 2019 Winter;25(1):33-49. doi: 10.1162/artl_a_00279.

Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象：化学与物理邂逅生物学（瑞士阿斯科纳，2012年6月10日至14日）

Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.

Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。

J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.

A patent survey case: how could technological forecasting help cosmetic chemists with product innovation?一个专利调查案例：技术预测如何助力化妆品化学家进行产品创新？

J Cosmet Sci. 2012 Nov-Dec;63(6):365-83.

Unsupervised embedding of trajectories captures the latent structure of scientific migration.无监督轨迹嵌入捕获了科学移民的潜在结构。

Proc Natl Acad Sci U S A. 2023 Dec 26;120(52):e2305414120. doi: 10.1073/pnas.2305414120. Epub 2023 Dec 22.

A Question-and-Answer System to Extract Data From Free-Text Oncological Pathology Reports (CancerBERT Network): Development Study.从自由文本肿瘤病理学报告（CancerBERT 网络）中提取数据的问答系统：开发研究。

J Med Internet Res. 2022 Mar 23;24(3):e27210. doi: 10.2196/27210.

引用本文的文献

Seeking innovation: The research protocol for SMEs' networking.寻求创新：中小企业网络的研究方案。

Heliyon. 2023 Mar 21;9(4):e14689. doi: 10.1016/j.heliyon.2023.e14689. eCollection 2023 Apr.

Urban economic fitness and complexity from patent data.基于专利数据的城市经济适应性和复杂性

Sci Rep. 2023 Mar 4;13(1):3655. doi: 10.1038/s41598-023-30649-1.

Strategy and additive technologies as the catalyst for outsourcing, process innovation and operational effectiveness.策略和添加剂技术作为推动外包、流程创新和运营效率的催化剂。

PLoS One. 2023 Feb 27;18(2):e0282366. doi: 10.1371/journal.pone.0282366. eCollection 2023.

Draw me Science: Multi-level and multi-scale reconstruction of knowledge dynamics with phylomemies.用谱系图谱进行科学绘图：知识动态的多层次和多尺度重建

Scientometrics. 2022;127(1):545-575. doi: 10.1007/s11192-021-04186-5. Epub 2021 Nov 22.

Innovation indicators based on firm websites-Which website characteristics predict firm-level innovation activity?基于企业网站的创新指标——哪些网站特征可预测企业层面的创新活动？

PLoS One. 2021 Apr 5;16(4):e0249583. doi: 10.1371/journal.pone.0249583. eCollection 2021.

Where is your field going? A machine learning approach to study the relative motion of the domains of physics.你的领域发展方向在哪里？一种机器学习方法研究物理领域的相对运动。

PLoS One. 2020 Jun 18;15(6):e0233997. doi: 10.1371/journal.pone.0233997. eCollection 2020.

本文引用的文献

Entropy (Basel). 2018 Oct 31;20(11):833. doi: 10.3390/e20110833.

Zipf's, Heaps' and Taylor's Laws are Determined by the Expansion into the Adjacent Possible.齐普夫定律、希普斯定律和泰勒定律由向邻接可能态的扩展所决定。

Entropy (Basel). 2018 Sep 30;20(10):752. doi: 10.3390/e20100752.

Network Dynamics of Innovation Processes.创新过程的网络动态。

Phys Rev Lett. 2018 Jan 26;120(4):048301. doi: 10.1103/PhysRevLett.120.048301.

Waves of novelties in the expansion into the adjacent possible.在向相邻可能性扩展过程中的新奇浪潮。

PLoS One. 2017 Jun 8;12(6):e0179303. doi: 10.1371/journal.pone.0179303. eCollection 2017.

Factors contributing to non-randomness in species Co-occurrences on Islands.导致岛屿上物种共现非随机性的因素。

Oecologia. 1982 Jan;52(1):75-84. doi: 10.1007/BF00349014.

Statistically validated network of portfolio overlaps and systemic risk.具有统计验证的投资组合重叠网络和系统风险。

Sci Rep. 2016 Dec 21;6:39467. doi: 10.1038/srep39467.

Invention as a combinatorial process: evidence from US patents.作为组合过程的发明：来自美国专利的证据。

J R Soc Interface. 2015 May 6;12(106). doi: 10.1098/rsif.2015.0272.

The heterogeneous dynamics of economic complexity.经济复杂性的异质性动态

PLoS One. 2015 Feb 11;10(2):e0117174. doi: 10.1371/journal.pone.0117174. eCollection 2015.

The Scientific Competitiveness of Nations.国家的科学竞争力。

PLoS One. 2014 Dec 10;9(12):e113470. doi: 10.1371/journal.pone.0113470. eCollection 2014.

How the taxonomy of products drives the economic development of countries.产品分类法如何推动各国的经济发展。

PLoS One. 2014 Dec 8;9(12):e113770. doi: 10.1371/journal.pone.0113770. eCollection 2014.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

创新的语言。

The Language of Innovation.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献