Center for Human Genetics Research, Vanderbilt University School of Medicine, Nashville, Tennessee 37232, USA.
J Am Med Inform Assoc. 2013 Jul-Aug;20(4):652-8. doi: 10.1136/amiajnl-2012-001557. Epub 2013 Feb 9.
To evaluate the validity of, characterize the usage of, and propose potential research applications for International Classification of Diseases, Ninth Revision (ICD-9) tobacco codes in clinical populations.
Using data on cancer cases and cancer-free controls from Vanderbilt's biorepository, BioVU, we evaluated the utility of ICD-9 tobacco use codes to identify ever-smokers in general and high smoking prevalence (lung cancer) clinic populations. We assessed potential biases in documentation, and performed temporal analysis relating transitions between smoking codes to smoking cessation attempts. We also examined the suitability of these codes for use in genetic association analyses.
ICD-9 tobacco use codes can identify smokers in a general clinic population (specificity of 1, sensitivity of 0.32), and there is little evidence of documentation bias. Frequency of code transitions between 'current' and 'former' tobacco use was significantly correlated with initial success at smoking cessation (p<0.0001). Finally, code-based smoking status assignment is a comparable covariate to text-based smoking status for genetic association studies.
Our results support the use of ICD-9 tobacco use codes for identifying smokers in a clinical population. Furthermore, with some limitations, these codes are suitable for adjustment of smoking status in genetic studies utilizing electronic health records.
Researchers should not be deterred by the unavailability of full-text records to determine smoking status if they have ICD-9 code histories.
评估国际疾病分类第 9 版(ICD-9)烟草代码在临床人群中的有效性、描述其使用情况,并提出潜在的研究应用。
利用范德比尔特生物库 BioVU 中癌症病例和非癌症对照的数据,我们评估了 ICD-9 烟草使用代码在一般人群和高吸烟率(肺癌)临床人群中识别既往吸烟者的效用。我们评估了文档记录中的潜在偏差,并进行了时间分析,以评估吸烟代码之间的转换与戒烟尝试之间的关系。我们还检查了这些代码在遗传关联分析中的适用性。
ICD-9 烟草使用代码可识别一般临床人群中的吸烟者(特异性为 1,敏感性为 0.32),且文档记录无明显偏差。“当前”和“既往”烟草使用之间的代码转换频率与初始戒烟成功率显著相关(p<0.0001)。最后,基于代码的吸烟状况分配与基于文本的吸烟状况分配对于遗传关联研究是可比的协变量。
我们的研究结果支持在临床人群中使用 ICD-9 烟草使用代码来识别吸烟者。此外,这些代码具有一定的局限性,可用于调整基于电子健康记录的遗传研究中的吸烟状况。
如果研究人员有 ICD-9 代码记录,那么他们不应因无法获得全文记录而无法确定吸烟状况。