• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习从大规模评估日志数据中发现行为洞察。

Discovering action insights from large-scale assessment log data using machine learning.

作者信息

Yun Minyoung, Jeon Minjeong, Yang Heyoung

机构信息

Laboratory PIMM, Arts et Métieres Paris Tech, Paris, France.

IntelWind, Berkeley, USA.

出版信息

Sci Rep. 2025 Aug 19;15(1):30412. doi: 10.1038/s41598-025-14802-6.

DOI:10.1038/s41598-025-14802-6
PMID:40830159
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12365305/
Abstract

This study introduces a novel machine learning algorithm that combines natural language processing techniques, such as Word2Vec and Doc2Vec, with neural networks to identify and validate significant actions within human action sequences. Using the 2012 Program for the International Assessment of Adult Competencies dataset, the algorithm visualizes and analyzes action sequences in a 2D vector space to uncover high-impact behaviors that influence performance. The methodology, validated across two problem sets ("Party Invitation" and "Club Membership"), successfully distinguishes performance groups by focusing on critical actions, leading to enhanced classification accuracy (up to 94.6%) and clustering coherence (silhouette score of 0.491). This approach demonstrates potential applications in personalized education, healthcare diagnostics, and consumer behavior prediction, advancing the understanding of human behavior through digital footprints.

摘要

本研究引入了一种新颖的机器学习算法,该算法将诸如Word2Vec和Doc2Vec等自然语言处理技术与神经网络相结合,以识别和验证人类动作序列中的重要动作。利用2012年成人能力国际评估项目数据集,该算法在二维向量空间中可视化并分析动作序列,以发现影响表现的高影响力行为。该方法在两个问题集(“派对邀请”和“俱乐部会员资格”)上得到验证,通过关注关键动作成功区分了表现组,从而提高了分类准确率(高达94.6%)和聚类一致性(轮廓系数为0.491)。这种方法在个性化教育、医疗诊断和消费者行为预测中展示了潜在应用,通过数字足迹增进了对人类行为的理解。

相似文献

1
Discovering action insights from large-scale assessment log data using machine learning.利用机器学习从大规模评估日志数据中发现行为洞察。
Sci Rep. 2025 Aug 19;15(1):30412. doi: 10.1038/s41598-025-14802-6.
2
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
3
Comparing neural language models for medical concept representation and patient trajectory prediction.比较用于医学概念表示和患者轨迹预测的神经语言模型。
Artif Intell Med. 2025 May;163:103108. doi: 10.1016/j.artmed.2025.103108. Epub 2025 Mar 10.
4
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
5
Proposal for Using AI to Assess Clinical Data Integrity and Generate Metadata: Algorithm Development and Validation.关于使用人工智能评估临床数据完整性并生成元数据的提案:算法开发与验证
JMIR Med Inform. 2025 Jun 30;13:e60204. doi: 10.2196/60204.
6
DNA sequence classification for diabetes mellitus using NuSVC and XGBoost: A comparative.使用NuSVC和XGBoost对糖尿病进行DNA序列分类:一项比较研究。
PLoS One. 2025 Jul 18;20(7):e0328253. doi: 10.1371/journal.pone.0328253. eCollection 2025.
7
AI in Medical Questionnaires: Innovations, Diagnosis, and Implications.医学问卷中的人工智能:创新、诊断及影响
J Med Internet Res. 2025 Jun 23;27:e72398. doi: 10.2196/72398.
8
An open dataset and machine learning algorithms for Niacin Skin-Flushing Response based screening of psychiatric disorders.基于烟酸皮肤潮红反应筛查精神疾病的开放数据集和机器学习算法
BMC Psychiatry. 2025 Aug 4;25(1):757. doi: 10.1186/s12888-025-07196-2.
9
A study on the effectiveness of machine learning models for hepatitis prediction.关于机器学习模型用于肝炎预测有效性的研究。
Sci Rep. 2025 Aug 20;15(1):30659. doi: 10.1038/s41598-025-07104-4.
10
Sentiment Analysis Using a Large Language Model-Based Approach to Detect Opioids Mixed With Other Substances Via Social Media: Method Development and Validation.使用基于大语言模型的方法通过社交媒体检测与其他物质混合的阿片类药物的情感分析:方法开发与验证
JMIR Infodemiology. 2025 Jun 19;5:e70525. doi: 10.2196/70525.

本文引用的文献

1
A novel machine learning-based prediction method for patients at risk of developing depressive symptoms using a small data.一种基于新型机器学习的预测方法,可利用小数据预测有抑郁症状风险的患者。
PLoS One. 2024 May 22;19(5):e0303889. doi: 10.1371/journal.pone.0303889. eCollection 2024.
2
Systematic review and meta-analysis of performance of wearable artificial intelligence in detecting and predicting depression.可穿戴人工智能在检测和预测抑郁症方面性能的系统评价与荟萃分析
NPJ Digit Med. 2023 May 5;6(1):84. doi: 10.1038/s41746-023-00828-5.
3
A gradient boosting classifier for purchase intention prediction of online shoppers.
一种用于预测在线购物者购买意愿的梯度提升分类器。
Heliyon. 2023 Apr 3;9(4):e15163. doi: 10.1016/j.heliyon.2023.e15163. eCollection 2023 Apr.
4
Development and validation of a machine learning-based postpartum depression prediction model: A nationwide cohort study.基于机器学习的产后抑郁症预测模型的开发和验证:一项全国性队列研究。
Depress Anxiety. 2021 Apr;38(4):400-411. doi: 10.1002/da.23123. Epub 2020 Dec 7.
5
Predicting Symptoms of Depression and Anxiety Using Smartphone and Wearable Data.利用智能手机和可穿戴设备数据预测抑郁和焦虑症状
Front Psychiatry. 2021 Jan 28;12:625247. doi: 10.3389/fpsyt.2021.625247. eCollection 2021.
6
Social Media and Depression Symptoms: a Meta-Analysis.社交媒体与抑郁症状:一项元分析。
Res Child Adolesc Psychopathol. 2021 Feb;49(2):241-253. doi: 10.1007/s10802-020-00715-7. Epub 2021 Jan 6.
7
Shopper intent prediction from clickstream e-commerce data with minimal browsing information.基于点击流电子商务数据的购物者意图预测,仅使用最少的浏览信息。
Sci Rep. 2020 Oct 12;10(1):16983. doi: 10.1038/s41598-020-73622-y.
8
Neuroimaging Biomarkers for Predicting Treatment Response and Recurrence of Major Depressive Disorder.神经影像学生物标志物可预测重度抑郁症的治疗反应和复发。
Int J Mol Sci. 2020 Mar 20;21(6):2148. doi: 10.3390/ijms21062148.
9
Social media use and depression in adolescents: a scoping review.社交媒体使用与青少年抑郁:范围综述。
Int Rev Psychiatry. 2020 May;32(3):235-253. doi: 10.1080/09540261.2020.1720623. Epub 2020 Feb 17.
10
Predictive Feature Generation and Selection Using Process Data From PISA Interactive Problem-Solving Items: An Application of Random Forests.使用国际学生评估项目(PISA)交互式解决问题项目的过程数据进行预测性特征生成与选择:随机森林的应用
Front Psychol. 2019 Nov 21;10:2461. doi: 10.3389/fpsyg.2019.02461. eCollection 2019.