• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用搜索查询数据估算社区层面慢性呼吸道疾病的患病率。

Neighborhood level chronic respiratory disease prevalence estimation using search query data.

机构信息

New York University, New York, NY, United States of America.

Microsoft Research, Redmond, WA, United States of America.

出版信息

PLoS One. 2021 Jun 9;16(6):e0252383. doi: 10.1371/journal.pone.0252383. eCollection 2021.

DOI:10.1371/journal.pone.0252383
PMID:34106982
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8189491/
Abstract

Estimation of disease prevalence at sub-city neighborhood scale allows early and targeted interventions that can help save lives and reduce public health burdens. However, the cost-prohibitive nature of highly localized data collection and sparsity of representative signals, has made it challenging to identify neighborhood scale prevalence of disease. To overcome this challenge, we utilize alternative data sources, which are both less sparse and representative of localized disease prevalence: using query data from a large commercial search engine, we identify the prevalence of respiratory illness in the United States, localized to census tract geographic granularity. Focusing on asthma and Chronic Obstructive Pulmonary Disease (COPD), we construct a set of features based on searches for symptoms, medications, and disease-related information, and use these to identify illness rates in more than 23 thousand tracts in 500 cities across the United States. Out of sample model estimates from search data alone correlate with ground truth illness rate estimates from the CDC at 0.69 to 0.76, with simple additions to these models raising those correlations to as high as 0.84. We then show that in practice search query data can be added to other relevant data such as census or land cover data to boost results, with models that incorporate all data sources correlating with ground truth data at 0.91 for asthma and 0.88 for COPD.

摘要

在城市街区层面估算疾病的流行程度,可以实现早期、有针对性的干预,有助于拯救生命和减轻公共卫生负担。然而,由于高度本地化的数据收集成本过高,以及代表性信号稀疏,因此难以确定疾病在街区层面的流行程度。为了克服这一挑战,我们利用替代数据源,这些数据源不仅更稀疏,而且更能代表本地化的疾病流行程度:我们使用来自大型商业搜索引擎的查询数据,以普查地段的地理粒度来确定美国呼吸道疾病的流行程度。我们专注于哮喘和慢性阻塞性肺疾病(COPD),根据症状、药物和与疾病相关的信息搜索构建了一组特征,并利用这些特征来确定美国 500 个城市的 23000 多个地段的疾病发病率。仅从搜索数据中得出的样本外模型估计与 CDC 的实际疾病发病率估计相关系数为 0.69 至 0.76,而对这些模型进行简单的补充可将相关性提高到 0.84 之高。然后我们展示了在实践中,搜索查询数据可以与其他相关数据(如人口普查或土地覆盖数据)结合使用,以提高结果的准确性,同时包含所有数据源的模型与真实数据的相关性分别达到 0.91(哮喘)和 0.88(COPD)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c76/8189491/8ff6184497fb/pone.0252383.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c76/8189491/af969dc375c6/pone.0252383.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c76/8189491/477ab6a028ab/pone.0252383.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c76/8189491/864fe0133640/pone.0252383.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c76/8189491/8c367050d0fa/pone.0252383.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c76/8189491/974419d8b76a/pone.0252383.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c76/8189491/8ff6184497fb/pone.0252383.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c76/8189491/af969dc375c6/pone.0252383.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c76/8189491/477ab6a028ab/pone.0252383.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c76/8189491/864fe0133640/pone.0252383.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c76/8189491/8c367050d0fa/pone.0252383.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c76/8189491/974419d8b76a/pone.0252383.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c76/8189491/8ff6184497fb/pone.0252383.g006.jpg

相似文献

1
Neighborhood level chronic respiratory disease prevalence estimation using search query data.利用搜索查询数据估算社区层面慢性呼吸道疾病的患病率。
PLoS One. 2021 Jun 9;16(6):e0252383. doi: 10.1371/journal.pone.0252383. eCollection 2021.
2
Multilevel regression and poststratification for small-area estimation of population health outcomes: a case study of chronic obstructive pulmonary disease prevalence using the behavioral risk factor surveillance system.多水平回归和后分层法在小区域人群健康结果估计中的应用:以行为风险因素监测系统为例评估慢性阻塞性肺疾病的流行率。
Am J Epidemiol. 2014 Apr 15;179(8):1025-33. doi: 10.1093/aje/kwu018. Epub 2014 Mar 4.
3
Determining Chronic Disease Prevalence in Local Populations Using Emergency Department Surveillance.利用急诊科监测确定当地人群中的慢性病患病率。
Am J Public Health. 2015 Sep;105(9):e67-74. doi: 10.2105/AJPH.2015.302679. Epub 2015 Jul 16.
4
Practical utility of general practice data capture and spatial analysis for understanding COPD and asthma.利用全科医疗数据采集和空间分析来理解慢性阻塞性肺疾病和哮喘的实际效用。
BMC Health Serv Res. 2018 Nov 26;18(1):897. doi: 10.1186/s12913-018-3714-5.
5
Prevalence of self-reported sleep problems amongst adults with obstructive airway disease in the NHANES cohort in the United States.美国国家健康与营养检查调查(NHANES)队列中患有阻塞性气道疾病的成年人自我报告睡眠问题的患病率。
Sleep Breath. 2020 Sep;24(3):985-993. doi: 10.1007/s11325-019-01941-0. Epub 2019 Sep 13.
6
Rural Residence and Poverty Are Independent Risk Factors for Chronic Obstructive Pulmonary Disease in the United States.农村居住和贫困是美国慢性阻塞性肺疾病的独立危险因素。
Am J Respir Crit Care Med. 2019 Apr 15;199(8):961-969. doi: 10.1164/rccm.201807-1374OC.
7
Gender and asthma-chronic obstructive pulmonary disease overlap syndrome.性别与哮喘-慢性阻塞性肺疾病重叠综合征
J Asthma. 2016 Sep;53(7):720-31. doi: 10.3109/02770903.2016.1154072. Epub 2016 Apr 6.
8
Exploring neighborhood-level variation in asthma and other respiratory diseases: the contribution of neighborhood social context.探索哮喘及其他呼吸道疾病在社区层面的差异:社区社会环境的作用。
J Gen Intern Med. 2004 Mar;19(3):229-36. doi: 10.1111/j.1525-1497.2004.30359.x.
9
A cross-sectional study on prevalence of chronic obstructive pulmonary disease (COPD) in India: rationale and methods.印度慢性阻塞性肺疾病(COPD)患病率的横断面研究:基本原理与方法
BMJ Open. 2017 May 29;7(5):e015211. doi: 10.1136/bmjopen-2016-015211.
10
Prevalence and features of asthma-COPD overlap in the United States 2007-2012.2007 - 2012年美国哮喘 - 慢性阻塞性肺疾病重叠综合征的患病率及特征
Clin Respir J. 2018 Aug;12(8):2369-2377. doi: 10.1111/crj.12917.

引用本文的文献

1
Bias in machine learning applications to address non-communicable diseases at a population-level: a scoping review.机器学习应用于在人群层面解决非传染性疾病的偏差:一项范围综述
BMC Public Health. 2024 Dec 28;24(1):3599. doi: 10.1186/s12889-024-21081-9.

本文引用的文献

1
Use of Deep Learning to Examine the Association of the Built Environment With Prevalence of Neighborhood Adult Obesity.利用深度学习研究建筑环境与社区成年人肥胖患病率之间的关系。
JAMA Netw Open. 2018 Aug 3;1(4):e181535. doi: 10.1001/jamanetworkopen.2018.1535.
2
Place-Based Inequity in Smoking Prevalence in the Largest Cities in the United States.美国最大城市吸烟率的基于地点的不平等。
JAMA Intern Med. 2019 Mar 1;179(3):442-444. doi: 10.1001/jamainternmed.2018.5990.
3
Combining satellite imagery and machine learning to predict poverty.
结合卫星图像和机器学习预测贫困。
Science. 2016 Aug 19;353(6301):790-4. doi: 10.1126/science.aaf7894.
4
Fine-grained dengue forecasting using telephone triage services.利用电话分诊服务进行细粒度登革热预测。
Sci Adv. 2016 Jul 8;2(7):e1501215. doi: 10.1126/sciadv.1501215. eCollection 2016 Jul.
5
Childhood asthma prediction models: a systematic review.儿童哮喘预测模型:系统评价。
Lancet Respir Med. 2015 Dec;3(12):973-84. doi: 10.1016/S2213-2600(15)00428-2. Epub 2015 Nov 18.
6
Employment and activity limitations among adults with chronic obstructive pulmonary disease--United States, 2013.2013年美国慢性阻塞性肺疾病成年患者的就业及活动受限情况
MMWR Morb Mortal Wkly Rep. 2015 Mar 27;64(11):289-95.
7
Multilevel regression and poststratification for small-area estimation of population health outcomes: a case study of chronic obstructive pulmonary disease prevalence using the behavioral risk factor surveillance system.多水平回归和后分层法在小区域人群健康结果估计中的应用:以行为风险因素监测系统为例评估慢性阻塞性肺疾病的流行率。
Am J Epidemiol. 2014 Apr 15;179(8):1025-33. doi: 10.1093/aje/kwu018. Epub 2014 Mar 4.
8
Norovirus disease surveillance using Google Internet query share data.利用谷歌互联网查询份额数据进行诺如病毒病监测。
Clin Infect Dis. 2012 Oct;55(8):e75-8. doi: 10.1093/cid/cis579. Epub 2012 Jun 19.
9
Revealing the microscale spatial signature of dengue transmission and immunity in an urban population.揭示城市人群中登革热传播和免疫的微观空间特征。
Proc Natl Acad Sci U S A. 2012 Jun 12;109(24):9535-8. doi: 10.1073/pnas.1120621109. Epub 2012 May 29.
10
Internet search trends analysis tools can provide real-time data on kidney stone disease in the United States.互联网搜索趋势分析工具可以提供美国肾结石疾病的实时数据。
Urology. 2013 Jan;81(1):37-42. doi: 10.1016/j.urology.2011.04.024. Epub 2011 Jun 15.