He Yifan, Deng Kehui, Han Jiawei
Department of Humanities, Donghua University, Shanghai, China.
PLoS One. 2025 Apr 24;20(4):e0322182. doi: 10.1371/journal.pone.0322182. eCollection 2025.
Patent value prediction is essential for technology innovation management. This study aims to enhance technology innovation management in the field of biomedical textiles by processing complex biomedical patent information to improve the accuracy of predicting patent values. A patent value grading prediction method based on a fusion of machine learning models is proposed, utilizing 113,428 biomedical textile patents as the research sample. The method combines BERT (Bidirectional Encoder Representations from Transformers) and a stacking strategy to classify and predict the value class of biomedical textile patents using both textual information and structured patent features. We implemented this method for patent value prediction in biomedical textiles, leading to the development of BioTexVal-the first dedicated patent value prediction model for this domain. BioTexVal's innovation lies in employing a stacking strategy that integrates multiple machine learning models to enhance predictive accuracy while leveraging unstructured data during training. Results have shown that this approach significantly outperforms previous predictive methods. Validated on 113,428 biomedical textile patents spanning from 2003 to 2023, BioTexVal achieved an accuracy of 88.38%. This study uses average annual forward citations as an indicator for distinguishing patent value grades. The method may require adjustments based on data characteristics when applied to other research fields to ensure its effectiveness.
专利价值预测对于技术创新管理至关重要。本研究旨在通过处理复杂的生物医学专利信息来提高生物医学纺织品领域的技术创新管理水平,以提升专利价值预测的准确性。提出了一种基于机器学习模型融合的专利价值分级预测方法,以113428项生物医学纺织品专利作为研究样本。该方法结合了BERT(来自变换器的双向编码器表示)和堆叠策略,利用文本信息和结构化专利特征对生物医学纺织品专利的价值类别进行分类和预测。我们将此方法应用于生物医学纺织品的专利价值预测,从而开发出BioTexVal——该领域首个专门的专利价值预测模型。BioTexVal的创新之处在于采用了一种堆叠策略,该策略整合了多个机器学习模型,以提高预测准确性,同时在训练过程中利用非结构化数据。结果表明,这种方法明显优于以前的预测方法。在2003年至2023年的113428项生物医学纺织品专利上进行验证,BioTexVal的准确率达到了88.38%。本研究使用平均每年的向前引用次数作为区分专利价值等级的指标。该方法应用于其他研究领域时,可能需要根据数据特征进行调整,以确保其有效性。