• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于单纯疱疹病毒患者登记中风险组识别和用户数据收集的机器学习:算法开发与验证研究

Machine Learning for Risk Group Identification and User Data Collection in a Herpes Simplex Virus Patient Registry: Algorithm Development and Validation Study.

作者信息

Surodina Svitlana, Lam Ching, Grbich Svetislav, Milne-Ives Madison, van Velthoven Michelle, Meinert Edward

机构信息

Skein Ltd, London, United Kingdom.

Department of Informatics, King's College London, London, United Kingdom.

出版信息

JMIRx Med. 2021 Jun 11;2(2):e25560. doi: 10.2196/25560.

DOI:10.2196/25560
PMID:37725536
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10414389/
Abstract

BACKGROUND

Researching people with herpes simplex virus (HSV) is challenging because of poor data quality, low user engagement, and concerns around stigma and anonymity.

OBJECTIVE

This project aimed to improve data collection for a real-world HSV registry by identifying predictors of HSV infection and selecting a limited number of relevant questions to ask new registry users to determine their level of HSV infection risk.

METHODS

The US National Health and Nutrition Examination Survey (NHANES, 2015-2016) database includes the confirmed HSV type 1 and type 2 (HSV-1 and HSV-2, respectively) status of American participants (14-49 years) and a wealth of demographic and health-related data. The questionnaires and data sets from this survey were used to form two data sets: one for HSV-1 and one for HSV-2. These data sets were used to train and test a model that used a random forest algorithm (devised using Python) to minimize the number of anonymous lifestyle-based questions needed to identify risk groups for HSV.

RESULTS

The model selected a reduced number of questions from the NHANES questionnaire that predicted HSV infection risk with high accuracy scores of 0.91 and 0.96 and high recall scores of 0.88 and 0.98 for the HSV-1 and HSV-2 data sets, respectively. The number of questions was reduced from 150 to an average of 40, depending on age and gender. The model, therefore, provided high predictability of risk of infection with minimal required input.

CONCLUSIONS

This machine learning algorithm can be used in a real-world evidence registry to collect relevant lifestyle data and identify individuals' levels of risk of HSV infection. A limitation is the absence of real user data and integration with electronic medical records, which would enable model learning and improvement. Future work will explore model adjustments, anonymization options, explicit permissions, and a standardized data schema that meet the General Data Protection Regulation, Health Insurance Portability and Accountability Act, and third-party interface connectivity requirements.

摘要

背景

由于数据质量差、用户参与度低以及对耻辱感和匿名性的担忧,对单纯疱疹病毒(HSV)感染者进行研究具有挑战性。

目的

本项目旨在通过识别HSV感染的预测因素并选择有限数量的相关问题来询问新的登记用户,以确定其HSV感染风险水平,从而改善真实世界HSV登记处的数据收集情况。

方法

美国国家健康与营养检查调查(NHANES,2015 - 2016)数据库包含美国参与者(14 - 49岁)确诊的1型和2型单纯疱疹病毒(分别为HSV - 1和HSV - 2)状态以及大量人口统计学和健康相关数据。本次调查的问卷和数据集用于形成两个数据集:一个用于HSV - 1,一个用于HSV - 2。这些数据集用于训练和测试一个使用随机森林算法(用Python设计)的模型,以尽量减少识别HSV风险群体所需的基于匿名生活方式的问题数量。

结果

该模型从NHANES问卷中选择了数量减少的问题,对于HSV - 1和HSV - 2数据集,预测HSV感染风险的准确率分别为0.91和0.96,召回率分别为0.88和0.98。问题数量从150个减少到平均40个,具体取决于年龄和性别。因此,该模型以最少的所需输入提供了高感染风险预测能力。

结论

这种机器学习算法可用于真实世界证据登记处,以收集相关生活方式数据并识别个体的HSV感染风险水平。一个局限性是缺乏真实用户数据以及与电子病历的整合,而这将有助于模型学习和改进。未来的工作将探索模型调整、匿名化选项、明确权限以及符合《通用数据保护条例》、《健康保险流通与责任法案》和第三方接口连接要求的标准化数据模式。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4b/10414389/e76d0fd7a5eb/xmed_v2i2e25560_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4b/10414389/5145be40e2b2/xmed_v2i2e25560_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4b/10414389/e76d0fd7a5eb/xmed_v2i2e25560_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4b/10414389/5145be40e2b2/xmed_v2i2e25560_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb4b/10414389/e76d0fd7a5eb/xmed_v2i2e25560_fig2.jpg

相似文献

1
Machine Learning for Risk Group Identification and User Data Collection in a Herpes Simplex Virus Patient Registry: Algorithm Development and Validation Study.用于单纯疱疹病毒患者登记中风险组识别和用户数据收集的机器学习:算法开发与验证研究
JMIRx Med. 2021 Jun 11;2(2):e25560. doi: 10.2196/25560.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Engineering Requirements of a Herpes Simplex Virus Patient Registry: Discovery Phase of a Real-World Evidence Platform to Advance Pharmacogenomics and Personalized Medicine.单纯疱疹病毒患者登记系统的工程要求:推进药物基因组学和个性化医疗的真实世界证据平台的发现阶段
Biomedicines. 2019 Dec 15;7(4):100. doi: 10.3390/biomedicines7040100.
4
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
5
Socio-demographic and behavioral correlates of herpes simplex virus type 1 and 2 infections and co-infections among adults in the USA.美国成年人单纯疱疹病毒 1 型和 2 型感染及合并感染的社会人口统计学和行为学相关因素。
Int J Infect Dis. 2010 Sep;14 Suppl 3:e154-60. doi: 10.1016/j.ijid.2009.12.007. Epub 2010 Apr 24.
6
Risk factors for herpes simplex virus type 2 infection among female commercial sex workers in Mexico City.墨西哥城女性商业性工作者中2型单纯疱疹病毒感染的危险因素。
Int J STD AIDS. 1999 Feb;10(2):105-11. doi: 10.1258/0956462991913727.
7
The association between Herpes simplex virus type 2 and asthma: A cross-sectional study from National Health and Nutrition Examination Survey 1999-2016.2型单纯疱疹病毒与哮喘之间的关联:一项基于1999 - 2016年美国国家健康与营养检查调查的横断面研究。
Front Med (Lausanne). 2022 Sep 14;9:943706. doi: 10.3389/fmed.2022.943706. eCollection 2022.
8
Herpes simplex virus type 2 infection of heterosexual men attending a sexual health centre.在一家性健康中心就诊的异性恋男性的2型单纯疱疹病毒感染
Med J Aust. 1994 Jun 6;160(11):697-700.
9
Using the H2O Automatic Machine Learning Algorithms to Identify Predictors of Web-Based Medical Record Nonuse Among Patients in a Data-Rich Environment: Mixed Methods Study.在数据丰富的环境中,使用H2O自动机器学习算法识别患者中基于网络的医疗记录未使用情况的预测因素:混合方法研究。
JMIR Med Inform. 2023 Jun 19;11:e41576. doi: 10.2196/41576.
10
Herpes simplex virus type 2 seropositivity and relationship status among U.S. adults age 20 to 49: a population-based analysis.美国 20 至 49 岁成年人中单纯疱疹病毒 2 型血清阳性率与关系状况:基于人群的分析。
BMC Infect Dis. 2010 Dec 22;10:359. doi: 10.1186/1471-2334-10-359.

引用本文的文献

1
Artificial Intelligence's Use in the Diagnosis of Mouth Ulcers: A Systematic Review.人工智能在口腔溃疡诊断中的应用:一项系统综述。
Cureus. 2023 Sep 13;15(9):e45187. doi: 10.7759/cureus.45187. eCollection 2023 Sep.
2
An Explainable Artificial Intelligence Software Tool for Weight Management Experts (PRIMO): Mixed Methods Study.用于体重管理专家的可解释人工智能软件工具(PRIMO):混合方法研究。
J Med Internet Res. 2023 Sep 6;25:e42047. doi: 10.2196/42047.

本文引用的文献

1
Machine-Learning Monitoring System for Predicting Mortality Among Patients With Noncancer End-Stage Liver Disease: Retrospective Study.用于预测非癌终末期肝病患者死亡率的机器学习监测系统:回顾性研究
JMIR Med Inform. 2020 Oct 30;8(10):e24305. doi: 10.2196/24305.
2
Machine Learning Models for the Prediction of Postpartum Depression: Application and Comparison Based on a Cohort Study.用于预测产后抑郁症的机器学习模型:基于队列研究的应用与比较
JMIR Med Inform. 2020 Apr 30;8(4):e15516. doi: 10.2196/15516.
3
Predicting Metabolic Syndrome With Machine Learning Models Using a Decision Tree Algorithm: Retrospective Cohort Study.
使用决策树算法的机器学习模型预测代谢综合征:回顾性队列研究
JMIR Med Inform. 2020 Mar 23;8(3):e17110. doi: 10.2196/17110.
4
Engineering Requirements of a Herpes Simplex Virus Patient Registry: Discovery Phase of a Real-World Evidence Platform to Advance Pharmacogenomics and Personalized Medicine.单纯疱疹病毒患者登记系统的工程要求:推进药物基因组学和个性化医疗的真实世界证据平台的发现阶段
Biomedicines. 2019 Dec 15;7(4):100. doi: 10.3390/biomedicines7040100.
5
Predicting risk for Alcohol Use Disorder using longitudinal data with multimodal biomarkers and family history: a machine learning study.利用具有多模态生物标志物和家族史的纵向数据预测酒精使用障碍风险:一项机器学习研究。
Mol Psychiatry. 2021 Apr;26(4):1133-1141. doi: 10.1038/s41380-019-0534-x. Epub 2019 Oct 8.
6
Artificial intelligence and machine learning in clinical development: a translational perspective.临床开发中的人工智能与机器学习:转化医学视角
NPJ Digit Med. 2019 Jul 26;2:69. doi: 10.1038/s41746-019-0148-3. eCollection 2019.
7
Optimized machine learning methods for prediction of cognitive outcome in Parkinson's disease.优化机器学习方法预测帕金森病认知结局。
Comput Biol Med. 2019 Aug;111:103347. doi: 10.1016/j.compbiomed.2019.103347. Epub 2019 Jun 28.
8
Deep Aging Clocks: The Emergence of AI-Based Biomarkers of Aging and Longevity.深度衰老时钟:基于人工智能的衰老和长寿生物标志物的出现。
Trends Pharmacol Sci. 2019 Aug;40(8):546-549. doi: 10.1016/j.tips.2019.05.004. Epub 2019 Jul 3.
9
Artificial intelligence in healthcare.人工智能在医疗保健领域的应用。
Nat Biomed Eng. 2018 Oct;2(10):719-731. doi: 10.1038/s41551-018-0305-z. Epub 2018 Oct 10.
10
Digital Interventions to Build a Patient Registry for Rheumatology Research.数字干预措施构建风湿病学研究患者注册库。
Rheum Dis Clin North Am. 2019 May;45(2):173-186. doi: 10.1016/j.rdc.2019.01.009.