Wang Xin, Raj Anshu, Su Yanqing, Xu Shuozhi, Lu Kun
School of Library and Information Studies, University of Oklahoma, Norman, OK, 73019, USA.
School of Aerospace and Mechanical Engineering, University of Oklahoma, Norman, OK, 73019, USA.
Sci Rep. 2025 Jul 1;15(1):21219. doi: 10.1038/s41598-025-08356-w.
As global energy demands rise, the advancement of new energy technologies increasingly relies on the development of metals that can endure extreme pressures, temperatures, and fluxes of energetic particles and photons, as well as aggressive chemical reactions. One way to assist in the design and manufacturing of metals for the future is by learning from their past. Here we track the progress of metallic materials for extreme environments in the past 35 years using the text mining method, which allows us to discover patterns from a large scale of literature in the field. Specifically, we leverage transfer learning and dynamic word embeddings. Approximately one million relevant abstracts ranging from 1989 to 2023 were collected from the Web of Science. The literature was then mapped to a 200-dimensional vector space, generating time-series word embeddings across six time periods. Subsequent orthogonal Procrustes analysis was employed to align and compare vectors across these periods, overcoming challenges posed by training randomness and the non-uniqueness of singular value decomposition. This enabled the comparison of the semantic evolution of terms related to metals under extreme conditions. The model's performance was evaluated using inputs categorized into materials, properties, and applications, demonstrating its ability to identify relevant metallic materials to the three input categories. The study also revealed the temporal changes in keyword associations, indicating shifts in research focus or industrial interest towards high-performance alloys for applications in aerospace and biomedical engineering, among others. This showcases the model's capability to track the progress in metallic materials for extreme environments over time.
随着全球能源需求的增长,新能源技术的进步越来越依赖于能够承受极端压力、温度、高能粒子和光子通量以及剧烈化学反应的金属的开发。助力未来金属设计与制造的一种方法是借鉴其发展历程。在此,我们运用文本挖掘方法追踪了过去35年中用于极端环境的金属材料发展进程,这使我们能够从该领域的大量文献中发现规律。具体而言我们利用了迁移学习和动态词嵌入技术。从科学网收集了1989年至2022年约100万篇相关摘要,并将这些文献映射到200维向量空间,生成六个时间段的时间序列词嵌入向量。随后采用正交普罗克汝斯分析来对齐并比较这些时间段的向量,克服了训练随机性和奇异值分解非唯一性带来的挑战,从而实现了对极端条件下与金属相关术语语义演变的比较。使用分为材料、性能及应用的输入数据评估了该模型性能,证明其能够识别与这三个输入类别相关的金属材料。该研究还揭示了关键词关联随时间的变化,表明研究重点或行业兴趣已转向航空航天及生物医学工程等领域应用中的高性能合金。这展示了该模型随时间追踪极端环境下金属材料发展进程情况的能力。
原文中时间范围有误,应为1989至2022年,译文已修正。