• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
FacetGist: Collective Extraction of Document Facets in Large Technical Corpora.方面要点:大型技术语料库中文档方面的集体提取
Proc ACM Int Conf Inf Knowl Manag. 2016 Oct;2016:871-880. doi: 10.1145/2983323.2983828.
2
Quantifying the informativeness for biomedical literature summarization: An itemset mining method.量化生物医学文献摘要的信息量:一种基于项集挖掘的方法。
Comput Methods Programs Biomed. 2017 Jul;146:77-89. doi: 10.1016/j.cmpb.2017.05.011. Epub 2017 May 27.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Can multilinguality improve Biomedical Word Sense Disambiguation?多语言能力能否改善生物医学词汇语义消歧?
J Biomed Inform. 2016 Dec;64:320-332. doi: 10.1016/j.jbi.2016.10.020. Epub 2016 Nov 2.
5
Document-Level Biomedical Relation Extraction Using Graph Convolutional Network and Multihead Attention: Algorithm Development and Validation.使用图卷积网络和多头注意力的文档级生物医学关系抽取:算法开发与验证
JMIR Med Inform. 2020 Jul 31;8(7):e17638. doi: 10.2196/17638.
6
Advancing document-level event extraction: Integration across texts and reciprocal feedback.
Math Biosci Eng. 2023 Nov 3;20(11):20050-20072. doi: 10.3934/mbe.2023888.
7
A concept-driven biomedical knowledge extraction and visualization framework for conceptualization of text corpora.面向文本语料概念化的概念驱动生物医学知识提取和可视化框架。
J Biomed Inform. 2010 Dec;43(6):1020-35. doi: 10.1016/j.jbi.2010.09.008. Epub 2010 Sep 24.
8
Literature Retrieval for Precision Medicine with Neural Matching and Faceted Summarization.基于神经匹配和分面摘要的精准医学文献检索
Proc Conf Empir Methods Nat Lang Process. 2020 Nov;2020:3389-3399. doi: 10.18653/v1/2020.findings-emnlp.304.
9
Graph-based biomedical text summarization: An itemset mining and sentence clustering approach.基于图的生物医学文本摘要:一种基于项集挖掘和句子聚类的方法。
J Biomed Inform. 2018 Aug;84:42-58. doi: 10.1016/j.jbi.2018.06.005. Epub 2018 Jun 15.
10
Large-Scale Multi-Document Summarization with Information Extraction and Compression.基于信息抽取与压缩的大规模多文档摘要生成
ArXiv. 2022 May 1:arXiv:2205.00548v1.

本文引用的文献

1
ClusType: Effective Entity Recognition and Typing by Relation Phrase-Based Clustering.ClusType:基于关系短语聚类的有效实体识别与分类
KDD. 2015 Aug;2015:995-1004. doi: 10.1145/2783258.2783362.
2
Mining Quality Phrases from Massive Text Corpora.从海量文本语料库中挖掘高质量短语。
Proc ACM SIGMOD Int Conf Manag Data. 2015 May-Jun;2015:1729-1744. doi: 10.1145/2723372.2751523.

方面要点:大型技术语料库中文档方面的集体提取

FacetGist: Collective Extraction of Document Facets in Large Technical Corpora.

作者信息

Siddiqui Tarique, Ren Xiang, Parameswaran Aditya, Han Jiawei

机构信息

University of Illinois at Urbana-Champaign, Urbana, IL, USA.

出版信息

Proc ACM Int Conf Inf Knowl Manag. 2016 Oct;2016:871-880. doi: 10.1145/2983323.2983828.

DOI:10.1145/2983323.2983828
PMID:28210517
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5308212/
Abstract

Given the large volume of technical documents available, it is crucial to automatically organize and categorize these documents to be able to understand and extract value from them. Towards this end, we introduce a new research problem called Facet Extraction. Given a collection of technical documents, the goal of Facet Extraction is to automatically label each document with a set of concepts for the key facets (, application, technique, evaluation metrics, and dataset) that people may be interested in. Facet Extraction has numerous applications, including document summarization, literature search, patent search and business intelligence. The major challenge in performing Facet Extraction arises from multiple sources: concept extraction, concept to facet matching, and facet disambiguation. To tackle these challenges, we develop FacetGist, a framework for facet extraction. Facet Extraction involves constructing a graph-based heterogeneous network to capture information available across multiple sentence-level features, as well as context features. We then formulate a joint optimization problem, and propose an efficient algorithm for graph-based label propagation to estimate the facet of each concept mention. Experimental results on technical corpora from two domains demonstrate that Facet Extraction can lead to an improvement of over 25% in both precision and recall over competing schemes.

摘要

鉴于现有大量技术文档,自动对这些文档进行组织和分类对于理解并从中提取价值至关重要。为此,我们引入了一个名为“方面提取”的新研究问题。给定一组技术文档,方面提取的目标是用一组人们可能感兴趣的关键方面(如应用、技术、评估指标和数据集)的概念自动标记每个文档。方面提取有许多应用,包括文档摘要、文献搜索、专利搜索和商业智能。执行方面提取的主要挑战来自多个方面:概念提取、概念到方面的匹配以及方面消歧。为应对这些挑战,我们开发了FacetGist,一个方面提取框架。方面提取涉及构建基于图的异构网络,以捕获跨多个句子级特征以及上下文特征的可用信息。然后,我们制定一个联合优化问题,并提出一种基于图的标签传播的高效算法,以估计每个概念提及的方面。来自两个领域的技术语料库的实验结果表明,与竞争方案相比,方面提取在精确率和召回率方面都能提高超过25%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d17b/5308212/141858975a48/nihms843631f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d17b/5308212/8be0ea6e95d2/nihms843631f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d17b/5308212/1b22d6f2fdef/nihms843631f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d17b/5308212/a76595ba6b64/nihms843631f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d17b/5308212/141858975a48/nihms843631f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d17b/5308212/8be0ea6e95d2/nihms843631f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d17b/5308212/1b22d6f2fdef/nihms843631f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d17b/5308212/a76595ba6b64/nihms843631f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d17b/5308212/141858975a48/nihms843631f4.jpg