IEEE J Biomed Health Inform. 2021 Sep;25(9):3659-3667. doi: 10.1109/JBHI.2021.3073605. Epub 2021 Sep 3.
Previous works have shown that risk factors for some kinds of cancer depend on people's lifestyle (e.g. rural or urban residence). This article looks into this, seeking relationships between cancer, age group, gender and population in the region of Lleida (Catalonia, Spain) using Multiple Correspondence Analysis (MCA).
The dataset analysed was made up of 3408 cancer episodes between 2012 and 2014, extracted from the Population-based Cancer Registry (PCR) for Lleida province. The cancers studied were colon and rectal (1059 cases), lung (551 cases), urinary bladder (446 cases), prostate (609 cases) and breast (743 cases). The MCA technique was applied and used to search relationships among the main qualitative features. The basic statistics were the percentage explaining (variance), the inertia and the contribution of each qualitative variable.
General outcomes showed a low and moderate contribution of living in rural areas to colorectal and male prostate cancer. Males in urban areas were slightly and heavily affected by lung and urinary bladder cancer respectively. The analysis of each cancer provided additional information. Colorectal cancer greatly affected males aged <60, urban residents aged 70-79, and rural females aged ≥ 80. The impact of lung cancer was high among urban females <60, moderate among males aged 70-79 and high among rural females aged ≥ 80. The results for urinary bladder cancer results were similar to those for lung cancer. Prostate cancer affected both the <60 and ≥ 80 age groups significantly in rural areas. Breast cancer hit the 70-79 group significantly and, somewhat less so, rural females aged ≥ 80.
MCA was a significant help for detecting the contributions of qualitative variables and the associations between them. MCA has proven to be an effective technique for analyzing the incidence of cancer. The outcomes obtained help to corroborate suspected trends, as well as detecting and stimulating new hypotheses about the risk factors associated with a specific area and cancer. These findings will be helpful for encouraging new studies and prevention campaigns to highlight observed singularities.
先前的研究表明,某些癌症的风险因素与人们的生活方式有关(例如,农村或城市居住)。本文使用多元对应分析(MCA)研究了西班牙加泰罗尼亚莱里达地区癌症、年龄组、性别和人群之间的关系。
分析的数据来自莱里达省基于人群的癌症登记处(PCR)在 2012 年至 2014 年期间提取的 3408 例癌症病例。研究的癌症有结肠和直肠(1059 例)、肺(551 例)、膀胱(446 例)、前列腺(609 例)和乳房(743 例)。应用 MCA 技术寻找主要定性特征之间的关系。基本统计数据是解释(方差)、惯性和每个定性变量的贡献的百分比。
总体结果显示,农村地区对结直肠癌和男性前列腺癌的影响较低且中等。城市地区的男性分别受到轻度和重度影响,患有肺癌和膀胱癌。对每种癌症的分析提供了额外的信息。结直肠癌对<60 岁的男性、70-79 岁的城市居民和≥80 岁的农村女性影响较大。肺癌的影响在<60 岁的城市女性中较高,在 70-79 岁的男性中中等,在≥80 岁的农村女性中较高。膀胱癌的结果与肺癌相似。前列腺癌在农村地区对<60 岁和≥80 岁两个年龄组的影响显著。乳腺癌对 70-79 岁组的影响显著,对农村≥80 岁的女性影响稍小。
MCA 对检测定性变量的贡献和它们之间的关联有很大帮助。MCA 已被证明是分析癌症发病率的有效技术。研究结果有助于证实可疑趋势,并发现和激发与特定地区和癌症相关的风险因素的新假设。这些发现将有助于鼓励开展新的研究和预防活动,以突出观察到的特殊性。