School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore, Tamil Nadu, India.
Sci Rep. 2024 Jul 31;14(1):17709. doi: 10.1038/s41598-024-67848-3.
Diet influences life expectancy because it can contribute to deadly diseases such as colorectal cancer (CRC). In 2020, colorectal cancer accounted for one million fatalities globally, representing 10% of all cancer deaths. This analysis included 76,679 men and 78,213 women over the age of 59 from ten states in the United States. During follow-up, 1378 men and 981 women were diagnosed with colon cancer. This prospective cohort study used 231 food items and their variants as input features to identify CRC patients. Before any food is labelled as a cause of colorectal cancer, it is ethical to first analyse how many grams of it are consumed per day and how many times per week. This research evaluates five classification algorithms on real-time datasets: K-Nearest Neighbour (KNN), Decision Tree (DT), Random Forest (RF), Logistic Regression with Classifier Chain (LRCC), and Logistic Regression with Label Powerset (LRLC). The SMOTE algorithm is then applied to identify and handle class imbalance in the data. Our study shows that eating more than 10 g/d of low-fat butter in bread (RR 1.99, CI 0.91-4.39), or eating it more than twice a week (RR 1.49, CI 0.93-2.38), increases CRC risk. For beef, eating more than 74 g of beef steak daily (RR 0.88, CI 0.50-1.55) or having it more than once a week (RR 0.88, CI 0.62-1.23) decreases the risk of CRC. People who eat beef and dairy products as part of their daily diet should nonetheless be cautious about quantity; consuming these items regularly but in moderation will protect against CRC. Meanwhile, high intake of poultry (RR 0.2, CI 0.05-0.81), fish (RR 0.82, CI 0.31-2.16), and pork (RR 0.67, CI 0.17-2.65) is negatively correlated with CRC risk.
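The abstract reports its findings as relative risks (RR) with confidence intervals (CI). As a rough illustration of how such a figure is obtained, the sketch below computes a crude, unadjusted RR with a 95% CI using the standard log-scale (Katz) approximation. It is applied to the only raw counts the abstract gives, namely diagnosed cases among men and women. The function name `relative_risk` is hypothetical, and this is not necessarily the procedure the authors used; their food-level estimates cannot be reproduced from the abstract alone.

```python
import math


def relative_risk(cases_a, total_a, cases_b, total_b, z=1.96):
    """Crude relative risk of group A versus group B.

    Returns (rr, lower, upper), where the interval is a Wald-type
    confidence interval built on the log scale (z=1.96 gives ~95%).
    """
    risk_a = cases_a / total_a
    risk_b = cases_b / total_b
    rr = risk_a / risk_b
    # Standard error of ln(RR) for two independent binomial proportions.
    se_log_rr = math.sqrt(
        1 / cases_a - 1 / total_a + 1 / cases_b - 1 / total_b
    )
    lower = math.exp(math.log(rr) - z * se_log_rr)
    upper = math.exp(math.log(rr) + z * se_log_rr)
    return rr, lower, upper


# Counts quoted in the abstract: diagnosed cases / participants.
# Group A = men, group B = women (an illustrative comparison only).
rr, lower, upper = relative_risk(1378, 76679, 981, 78213)
print(f"RR (men vs. women) = {rr:.2f}, 95% CI {lower:.2f}-{upper:.2f}")
```

When reading any of the reported intervals, note that a CI containing 1 is compatible with no difference in risk between the two groups being compared.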