• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用多种异构数据源开发机器学习模型来估计美国每周自杀死亡人数。

Development of a Machine Learning Model Using Multiple, Heterogeneous Data Sources to Estimate Weekly US Suicide Fatalities.

机构信息

Department of Computer Science and Engineering, Incheon National University, Incheon, South Korea.

Office of Strategy and Innovation, National Center for Injury Prevention and Control, Centers for Disease Control and Prevention, Atlanta, Georgia.

出版信息

JAMA Netw Open. 2020 Dec 1;3(12):e2030932. doi: 10.1001/jamanetworkopen.2020.30932.

DOI:10.1001/jamanetworkopen.2020.30932
PMID:33355678
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7758810/
Abstract

IMPORTANCE

Suicide is a leading cause of death in the US. However, official national statistics on suicide rates are delayed by 1 to 2 years, hampering evidence-based public health planning and decision-making.

OBJECTIVE

To estimate weekly suicide fatalities in the US in near real time.

DESIGN, SETTING, AND PARTICIPANTS: This cross-sectional national study used a machine learning pipeline to combine signals from several streams of real-time information to estimate weekly suicide fatalities in the US in near real time. This 2-phase approach first fits optimal machine learning models to each individual data stream and subsequently combines predictions made from each data stream via an artificial neural network. National-level US administrative data on suicide deaths, health services, and economic, meteorological, and online data were variously obtained from 2014 to 2017. Data were analyzed from January 1, 2014, to December 31, 2017.

EXPOSURES

Longitudinal data on suicide-related exposures were obtained from multiple, heterogeneous streams: emergency department visits for suicide ideation and attempts collected via the National Syndromic Surveillance Program (2015-2017); calls to the National Suicide Prevention Lifeline (2014-2017); calls to US poison control centers for intentional self-harm (2014-2017); consumer price index and seasonality-adjusted unemployment rate, hourly earnings, home price index, and 3-month and 10-year yield curves from the Federal Reserve Economic Data (2014-2017); weekly daylight hours (2014-2017); Google and YouTube search trends related to suicide (2014-2017); and public posts on suicide on Reddit (2 314 533 posts), Twitter (9 327 472 tweets; 2015-2017), and Tumblr (1 670 378 posts; 2014-2017).

MAIN OUTCOMES AND MEASURES

Weekly estimates of suicide fatalities in the US were obtained through a machine learning pipeline that integrated the above data sources. Estimates were compared statistically with actual fatalities recorded by the National Vital Statistics System.

RESULTS

Combining information from multiple data streams, the machine learning method yielded estimates of weekly suicide deaths with high correlation to actual counts and trends (Pearson correlation, 0.811; P < .001), while estimating annual suicide rates with low error (0.55%).

CONCLUSIONS AND RELEVANCE

The proposed ensemble machine learning framework reduces the error for annual suicide rate estimation to less than one-tenth of that of current forecasting approaches that use only historical information on suicide deaths. These findings establish a novel approach for tracking suicide fatalities in near real time and provide the potential for an effective public health response such as supporting budgetary decisions or deploying interventions.

摘要

重要性

自杀是美国的主要死亡原因之一。然而,官方的全国自杀率统计数据要延迟 1 到 2 年,这阻碍了基于证据的公共卫生规划和决策。

目的

实时估算美国每周的自杀死亡人数。

设计、地点和参与者:这项横断面全国性研究使用机器学习管道,结合来自多个实时信息源的信号,实时估算美国每周的自杀死亡人数。该 2 阶段方法首先为每个单独的数据流拟合最佳机器学习模型,然后通过人工神经网络合并每个数据流的预测。来自 2014 年至 2017 年的美国国家行政数据中关于自杀死亡、卫生服务以及经济、气象和在线数据的自杀死亡、自杀意念和企图的国家监测计划、全国自杀预防生命线电话、美国中毒控制中心关于故意自残的电话、联邦储备经济数据的消费者价格指数和季节性调整后的失业率、小时工资、房价指数、3 个月和 10 年期收益率曲线、2014 年至 2017 年的每周日照时间、与自杀相关的谷歌和 YouTube 搜索趋势以及 Reddit 上的自杀公开帖子(2314533 个帖子)、Twitter(2015-2017 年的 9327472 条推文)和 Tumblr(2014-2017 年的 1670378 个帖子)被用作输入数据。

暴露因素

来自多个异质数据流的与自杀相关的纵向数据被纳入:国家综合征监测计划收集的自杀意念和企图的急诊就诊情况(2015-2017 年);全国自杀预防生命线电话(2014-2017 年);美国中毒控制中心关于故意自残的电话(2014-2017 年);联邦储备经济数据的消费者价格指数和季节性调整后的失业率、小时工资、房价指数、3 个月和 10 年期收益率曲线(2014-2017 年);每周日照时间(2014-2017 年);与自杀相关的谷歌和 YouTube 搜索趋势(2014-2017 年);Reddit 上的自杀公开帖子(2314533 个帖子)、Twitter(2015-2017 年的 9327472 条推文)和 Tumblr(2014-2017 年的 1670378 个帖子)。

主要结果和措施

通过整合上述数据源的机器学习管道,获得了美国每周自杀死亡人数的估计值。这些估计值与国家生命统计系统记录的实际死亡人数进行了统计学比较。

结果

通过结合多个数据流的信息,机器学习方法对每周自杀死亡人数的估计与实际数据具有高度相关性(Pearson 相关系数,0.811;P<0.001),同时对年度自杀率的估计误差较低(0.55%)。

结论和相关性

提出的集成机器学习框架将年度自杀率估计的误差降低到目前仅使用自杀死亡历史信息的预测方法的十分之一以下。这些发现为实时跟踪自杀死亡人数建立了一种新方法,并为有效的公共卫生应对措施提供了潜力,例如支持预算决策或部署干预措施。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8eb/7758810/220fb916a93e/jamanetwopen-e2030932-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8eb/7758810/1ffed5b81f99/jamanetwopen-e2030932-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8eb/7758810/220fb916a93e/jamanetwopen-e2030932-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8eb/7758810/1ffed5b81f99/jamanetwopen-e2030932-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8eb/7758810/220fb916a93e/jamanetwopen-e2030932-g002.jpg

相似文献

1
Development of a Machine Learning Model Using Multiple, Heterogeneous Data Sources to Estimate Weekly US Suicide Fatalities.利用多种异构数据源开发机器学习模型来估计美国每周自杀死亡人数。
JAMA Netw Open. 2020 Dec 1;3(12):e2030932. doi: 10.1001/jamanetworkopen.2020.30932.
2
Development of a Machine Learning Model to Estimate US Firearm Homicides in Near Real Time.开发一个机器学习模型以实时估算美国的枪支凶杀事件。
JAMA Netw Open. 2023 Mar 1;6(3):e233413. doi: 10.1001/jamanetworkopen.2023.3413.
3
Estimating Weekly National Opioid Overdose Deaths in Near Real Time Using Multiple Proxy Data Sources.利用多个代理数据源实时估算每周全国阿片类药物过量死亡人数。
JAMA Netw Open. 2022 Jul 1;5(7):e2223033. doi: 10.1001/jamanetworkopen.2022.23033.
4
Predicting state level suicide fatalities in the united states with realtime data and machine learning.利用实时数据和机器学习预测美国各州的自杀死亡人数。
Npj Ment Health Res. 2024 Jan 16;3(1):3. doi: 10.1038/s44184-023-00045-8.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Association of 7 million+ tweets featuring suicide-related content with daily calls to the Suicide Prevention Lifeline and with suicides, United States, 2016-2018.2016-2018 年,与自杀预防生命线每日来电和自杀相关的包含 700 多万条与自杀相关内容的推文的关联。
Aust N Z J Psychiatry. 2023 Jul;57(7):994-1003. doi: 10.1177/00048674221126649. Epub 2022 Oct 14.
7
Influenza forecasting for French regions combining EHR, web and climatic data sources with a machine learning ensemble approach.利用电子健康记录、网络和气候数据源,结合机器学习集成方法对法国地区进行流感预测。
PLoS One. 2021 May 19;16(5):e0250890. doi: 10.1371/journal.pone.0250890. eCollection 2021.
8
Applying Machine Learning Models with An Ensemble Approach for Accurate Real-Time Influenza Forecasting in Taiwan: Development and Validation Study.应用集成方法的机器学习模型进行台湾地区流感的准确实时预测:开发和验证研究。
J Med Internet Res. 2020 Aug 5;22(8):e15394. doi: 10.2196/15394.
9
Emerging Trends of Self-Harm Using Sodium Nitrite in an Online Suicide Community: Observational Study Using Natural Language Processing Analysis.在线自杀社区中使用亚硝酸钠进行自我伤害的新兴趋势:使用自然语言处理分析的观察性研究。
JMIR Ment Health. 2024 May 2;11:e53730. doi: 10.2196/53730.
10
Detecting Potentially Harmful and Protective Suicide-Related Content on Twitter: Machine Learning Approach.在 Twitter 上检测潜在有害和保护自杀相关内容:机器学习方法。
J Med Internet Res. 2022 Aug 17;24(8):e34705. doi: 10.2196/34705.

引用本文的文献

1
Multimodal machine learning to predict surgical site infection with healthcare workload impact assessment.用于通过医疗保健工作量影响评估预测手术部位感染的多模态机器学习。
NPJ Digit Med. 2025 Feb 23;8(1):121. doi: 10.1038/s41746-024-01419-8.
2
Advancing Ethical Considerations for Data Science in Injury and Violence Prevention.推进伤害与暴力预防数据科学中的伦理考量。
Public Health Rep. 2025 Jan 20:333549241312055. doi: 10.1177/00333549241312055.
3
Automatically extracting social determinants of health for suicide: a narrative literature review.

本文引用的文献

1
Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries.社会数据:偏见、方法陷阱与伦理边界
Front Big Data. 2019 Jul 11;2:13. doi: 10.3389/fdata.2019.00013. eCollection 2019.
2
Increase in Suicide Mortality in the United States, 1999-2018.美国自杀死亡率在 1999-2018 年间上升。
NCHS Data Brief. 2020 Apr(362):1-8.
3
Syndromic Surveillance of Suicidal Ideation and Self-Directed Violence - United States, January 2017-December 2018.自杀意念和自我伤害行为的症状监测-美国,2017 年 1 月-2018 年 12 月。
自动提取自杀的健康社会决定因素:一项叙述性文献综述。
Npj Ment Health Res. 2024 Nov 6;3(1):51. doi: 10.1038/s44184-024-00087-6.
4
News Media Framing of Suicide Circumstances and Gender: Mixed Methods Analysis.新闻媒体对自杀情境和性别的框架构建:混合方法分析。
JMIR Ment Health. 2024 Jul 3;11:e49879. doi: 10.2196/49879.
5
Special Report from the CDC: Suicide rates, sodium nitrite-related suicides, and online content, United States.美国疾病预防控制中心特别报告:自杀率、与亚硝酸钠相关的自杀事件,以及网络内容。
J Safety Res. 2024 Jun;89:361-368. doi: 10.1016/j.jsr.2024.03.002. Epub 2024 Apr 14.
6
Current status and new experimental diagnostic methods of invasive fungal infections after hematopoietic stem cell transplantation.造血干细胞移植后侵袭性真菌感染的现状及新的实验诊断方法。
Arch Microbiol. 2024 Apr 28;206(5):237. doi: 10.1007/s00203-024-03905-9.
7
Predicting state level suicide fatalities in the united states with realtime data and machine learning.利用实时数据和机器学习预测美国各州的自杀死亡人数。
Npj Ment Health Res. 2024 Jan 16;3(1):3. doi: 10.1038/s44184-023-00045-8.
8
Estimating national and state-level suicide deaths using a novel online symptom search data source.利用新颖的在线症状搜索数据源估计全国和州级别的自杀死亡人数。
J Affect Disord. 2023 Dec 1;342:63-68. doi: 10.1016/j.jad.2023.08.141. Epub 2023 Sep 11.
9
Young people's mental and social distress in times of international crisis: evidence from helpline calls, 2019-2022.国际危机时期年轻人的心理和社会困扰:来自热线电话的证据,2019-2022。
Sci Rep. 2023 Jul 22;13(1):11858. doi: 10.1038/s41598-023-39064-y.
10
Development of a Machine Learning Model to Estimate US Firearm Homicides in Near Real Time.开发一个机器学习模型以实时估算美国的枪支凶杀事件。
JAMA Netw Open. 2023 Mar 1;6(3):e233413. doi: 10.1001/jamanetworkopen.2023.3413.
MMWR Morb Mortal Wkly Rep. 2020 Jan 31;69(4):103-108. doi: 10.15585/mmwr.mm6904a3.
4
Detecting relapse in youth with psychotic disorders utilizing patient-generated and patient-contributed digital data from Facebook.利用来自脸书的患者生成和患者提供的数字数据检测患有精神障碍的青少年的复发情况。
NPJ Schizophr. 2019 Oct 7;5(1):17. doi: 10.1038/s41537-019-0085-9.
5
Facebook language predicts depression in medical records.脸书的语言可预测病历中的抑郁症状。
Proc Natl Acad Sci U S A. 2018 Oct 30;115(44):11203-11208. doi: 10.1073/pnas.1802331115. Epub 2018 Oct 15.
6
Natural Language Processing of Social Media as Screening for Suicide Risk.社交媒体的自然语言处理用于自杀风险筛查。
Biomed Inform Insights. 2018 Aug 27;10:1178222618792860. doi: 10.1177/1178222618792860. eCollection 2018.
7
Advanced Daily Prediction Model for National Suicide Numbers with Social Media Data.基于社交媒体数据的全国自杀人数每日预测模型
Psychiatry Investig. 2018 Apr;15(4):344-354. doi: 10.30773/pi.2017.10.15. Epub 2018 Apr 5.
8
Discovering Shifts to Suicidal Ideation from Mental Health Content in Social Media.从社交媒体上的心理健康内容中发现向自杀意念的转变。
Proc SIGCHI Conf Hum Factor Comput Syst. 2016 May;2016:2098-2110. doi: 10.1145/2858036.2858207.
9
Deciphering Suicide and Other Manners of Death Associated with Drug Intoxication: A Centers for Disease Control and Prevention Consultation Meeting Summary.解读与药物中毒相关的自杀及其他死亡方式:美国疾病控制与预防中心咨询会议纪要
Am J Public Health. 2017 Aug;107(8):1233-1239. doi: 10.2105/AJPH.2017.303863. Epub 2017 Jun 22.
10
Tracking search engine queries for suicide in the United Kingdom, 2004-2013.2004年至2013年英国自杀相关搜索引擎查询记录追踪
Public Health. 2016 Aug;137:147-53. doi: 10.1016/j.puhe.2015.10.015. Epub 2016 Mar 11.