• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

不同类型数字档案在文化运营管理中的关键信息抽取算法。

Key Information Extraction Algorithm of Different Types of Digital Archives for Cultural Operation and Management.

机构信息

University of Jinan, Jinan 250022, China.

出版信息

Comput Intell Neurosci. 2022 Aug 29;2022:3459605. doi: 10.1155/2022/3459605. eCollection 2022.

DOI:10.1155/2022/3459605
PMID:36072752
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9444367/
Abstract

In order to improve the effect of key information extraction from digital archives, a key information extraction algorithm for different types of digital archives is designed. Preprocess digital archive information, taking part of speech and marks as key information. Self-organizing feature mapping network is used to extract the key information features of digital archives, and the semantic similarity calculation results are obtained by combining the feature extraction results. Combine with mutual information collection, take that word with the highest mutual information value as the collection cent, traverse all keywords, and take the central word as the key information of digital archives to complete the extraction of key information. Experiments show that the recall rate of the algorithm ranges from 96% to 99%, the extraction accuracy of key information of digital archives is between 96 and 98%, and the average extraction time of key information of digital archives is 0.63 s. The practical application effect is good.

摘要

为了提高数字档案关键信息提取的效果,设计了一种针对不同类型数字档案的关键信息提取算法。对数字档案信息进行预处理,以词性和标记作为关键信息。采用自组织特征映射网络提取数字档案的关键信息特征,并结合特征提取结果得到语义相似性计算结果。结合互信息集合并取互信息值最高的词作为集核,遍历所有关键词,以中心词作为数字档案的关键信息,完成关键信息的提取。实验表明,该算法的召回率在 96%到 99%之间,数字档案关键信息的提取准确率在 96%到 98%之间,数字档案关键信息的平均提取时间为 0.63s。实际应用效果良好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/bb29c1a87e77/CIN2022-3459605.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/80f5eb258b71/CIN2022-3459605.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/65c69be08385/CIN2022-3459605.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/e6628067d795/CIN2022-3459605.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/6b900164c3a3/CIN2022-3459605.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/1d9ef5e78945/CIN2022-3459605.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/f972a9fde1a8/CIN2022-3459605.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/bb29c1a87e77/CIN2022-3459605.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/80f5eb258b71/CIN2022-3459605.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/65c69be08385/CIN2022-3459605.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/e6628067d795/CIN2022-3459605.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/6b900164c3a3/CIN2022-3459605.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/1d9ef5e78945/CIN2022-3459605.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/f972a9fde1a8/CIN2022-3459605.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e849/9444367/bb29c1a87e77/CIN2022-3459605.007.jpg

相似文献

1
Key Information Extraction Algorithm of Different Types of Digital Archives for Cultural Operation and Management.不同类型数字档案在文化运营管理中的关键信息抽取算法。
Comput Intell Neurosci. 2022 Aug 29;2022:3459605. doi: 10.1155/2022/3459605. eCollection 2022.
2
Construction of Digital Management Model of Personnel Archives Information Based on the Concept of Industry 4.0.基于工业 4.0 理念的人事档案信息数字化管理模式构建。
Comput Intell Neurosci. 2022 Jun 7;2022:9328566. doi: 10.1155/2022/9328566. eCollection 2022.
3
Value Exploration and Application of Digital Archive Information Resources under the Information Ecological Environment.信息生态环境下数字档案信息资源的价值挖掘与应用
J Environ Public Health. 2022 Sep 25;2022:7135132. doi: 10.1155/2022/7135132. eCollection 2022.
4
A semantic approach for digital long-term preservation of electronic health documents.一种用于电子健康文档数字长期保存的语义方法。
Stud Health Technol Inform. 2012;180:265-9.
5
[Recommendations for the selection of storage media for archiving digital radiological image data based on the comparison of retrieval times at different PACS archive levels].基于不同PACS存档级别检索时间比较的数字放射影像数据存档存储介质选择建议
Rofo. 2005 Feb;177(2):250-7. doi: 10.1055/s-2004-813811.
6
Support Vector Machine with Ensemble Tree Kernel for Relation Extraction.用于关系抽取的集成树核支持向量机
Comput Intell Neurosci. 2016;2016:8495754. doi: 10.1155/2016/8495754. Epub 2016 Mar 22.
7
Intelligent Classification Method of Archive Data Based on Multigranular Semantics.基于多粒度语义的档案数据智能分类方法。
Comput Intell Neurosci. 2022 May 14;2022:7559523. doi: 10.1155/2022/7559523. eCollection 2022.
8
Development of the digital retrieval system integrating intelligent information and improved genetic algorithm: A study based on art museums.开发集成智能信息和改进遗传算法的数字检索系统:基于美术馆的研究。
PLoS One. 2024 Jun 25;19(6):e0305690. doi: 10.1371/journal.pone.0305690. eCollection 2024.
9
On missing Data Treatment for degraded video and film archives: a survey and a new Bayesian approach.关于退化视频和电影档案的数据缺失处理:一项综述及一种新的贝叶斯方法
IEEE Trans Image Process. 2004 Mar;13(3):397-415. doi: 10.1109/tip.2004.823815.
10
Exploration of machine algorithms based on deep learning model and feature extraction.基于深度学习模型和特征提取的机器算法探索。
Math Biosci Eng. 2021 Sep 3;18(6):7602-7618. doi: 10.3934/mbe.2021376.

本文引用的文献

1
Analysis of Enterprise Human Resources Demand Forecast Model Based on SOM Neural Network.基于SOM神经网络的企业人力资源需求预测模型分析
Comput Intell Neurosci. 2021 Jun 21;2021:6596548. doi: 10.1155/2021/6596548. eCollection 2021.
2
A rainwater control optimization design approach for airports based on a self-organizing feature map neural network model.基于自组织特征映射神经网络模型的机场雨水控制优化设计方法。
PLoS One. 2020 Jan 21;15(1):e0227901. doi: 10.1371/journal.pone.0227901. eCollection 2020.