• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
POCASUM : Policy Categorizer and Summarizer Based on Text Mining and Machine Learning.POCASUM:基于文本挖掘和机器学习的政策分类器与摘要器
Soft comput. 2021 Jul;25(14):9365-9375. doi: 10.1007/s00500-021-05916-w. Epub 2021 Jun 11.
2
Clinical Context-Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation.基于深度神经网络的临床相关生物医学文本摘要:模型开发与验证。
J Med Internet Res. 2020 Oct 23;22(10):e19810. doi: 10.2196/19810.
3
Quantifying the informativeness for biomedical literature summarization: An itemset mining method.量化生物医学文献摘要的信息量:一种基于项集挖掘的方法。
Comput Methods Programs Biomed. 2017 Jul;146:77-89. doi: 10.1016/j.cmpb.2017.05.011. Epub 2017 May 27.
4
Evaluating shallow and deep learning strategies for the 2018 n2c2 shared task on clinical text classification.评估浅层和深度学习策略在 2018 n2c2 临床文本分类共享任务中的应用。
J Am Med Inform Assoc. 2019 Nov 1;26(11):1247-1254. doi: 10.1093/jamia/ocz149.
5
Summarization of biomedical articles using domain-specific word embeddings and graph ranking.基于领域特定词嵌入和图排序的生物医学文章摘要。
J Biomed Inform. 2020 Jul;107:103452. doi: 10.1016/j.jbi.2020.103452. Epub 2020 May 19.
6
Different approaches for identifying important concepts in probabilistic biomedical text summarization.概率生物医学文本摘要中重要概念识别的不同方法。
Artif Intell Med. 2018 Jan;84:101-116. doi: 10.1016/j.artmed.2017.11.004. Epub 2017 Dec 6.
7
Using discourse analysis to improve text categorization in MEDLINE.运用话语分析改进医学在线数据库(MEDLINE)中的文本分类
Stud Health Technol Inform. 2007;129(Pt 1):710-5.
8
Novel Privacy Preserving Non-Invasive Sensing-Based Diagnoses of Pneumonia Disease Leveraging Deep Network Model.新型隐私保护的基于非侵入性传感的肺炎疾病诊断利用深度网络模型。
Sensors (Basel). 2022 Jan 8;22(2):461. doi: 10.3390/s22020461.
9
Deep contextualized embeddings for quantifying the informative content in biomedical text summarization.用于量化生物医学文本摘要是信息内容的深度语境化嵌入。
Comput Methods Programs Biomed. 2020 Feb;184:105117. doi: 10.1016/j.cmpb.2019.105117. Epub 2019 Oct 4.
10
Publicly available machine learning models for identifying opioid misuse from the clinical notes of hospitalized patients.可公开获取的机器学习模型,用于从住院患者的临床记录中识别阿片类药物滥用。
BMC Med Inform Decis Mak. 2020 Apr 29;20(1):79. doi: 10.1186/s12911-020-1099-y.

引用本文的文献

1
Construction of English and American Literature Corpus Based on Machine Learning Algorithm.基于机器学习算法的英美文学语料库构建。
Comput Intell Neurosci. 2022 Jun 2;2022:9773452. doi: 10.1155/2022/9773452. eCollection 2022.

本文引用的文献

1
Effects of Distance Measure Choice on K-Nearest Neighbor Classifier Performance: A Review.距离度量选择对 K-最近邻分类器性能的影响:综述
Big Data. 2019 Dec;7(4):221-248. doi: 10.1089/big.2018.0175. Epub 2019 Aug 14.

POCASUM:基于文本挖掘和机器学习的政策分类器与摘要器

POCASUM : Policy Categorizer and Summarizer Based on Text Mining and Machine Learning.

作者信息

Deotale Rushikesh, Rawat Shreyash, Vijayarajan V, Prasath V B Surya

机构信息

School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, India.

Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati OH 45229 USA. Departments of Pediatrics, Biomedical Informatics, Electrical Engineering and Computer Science, University of Cincinnati College of Medicine, Cincinnati, OH USA.

出版信息

Soft comput. 2021 Jul;25(14):9365-9375. doi: 10.1007/s00500-021-05916-w. Epub 2021 Jun 11.

DOI:10.1007/s00500-021-05916-w
PMID:35308599
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8932938/
Abstract

Having control over your data is a right and a duty that every citizen has in our digital society. It is often that users skip entire policies of applications or websites to save time and energy without realizing the potential sticky points in these policies. Due to obscure language and verbose explanations majority of users of hypermedia do not bother to read them. Further, sometimes digital media companies do not spend enough effort in stating their policies clearly which often time can also be incomplete. A summarized version of these privacy policies that can be categorized into the useful information can help the users. To solve this problem, in this work we propose to use machine learning based models for policy categorizer that classifies the policy paragraphs under the attributes proposed like security, contact etc. By benchmarking different machine learning based classifier models, we show that artificial neural network model performs with higher accuracy on a challenging dataset of textual privacy policies. We thus show that machine learning can help summarize the relevant paragraphs under the various attributes so that the user can get the gist of that topic within a few lines.

摘要

在我们的数字社会中,掌控自己的数据是每位公民的权利和义务。用户常常为节省时间和精力而跳过应用程序或网站的完整政策,却未意识到这些政策中潜在的关键问题。由于语言晦涩且解释冗长,大多数超媒体用户懒得去阅读它们。此外,有时数字媒体公司在清晰阐述其政策方面投入不足,这些政策往往也不完整。将这些隐私政策归纳为可分类的有用信息版本会对用户有所帮助。为解决这一问题,在这项工作中,我们提议使用基于机器学习的模型作为政策分类器,根据诸如安全、联系等提出的属性对政策段落进行分类。通过对不同的基于机器学习的分类器模型进行基准测试,我们表明人工神经网络模型在具有挑战性的文本隐私政策数据集上具有更高的准确率。因此,我们证明机器学习有助于总结各属性下的相关段落,以便用户能在几行内了解该主题的要点。