Banerjee Srijan, Sengupta Antara, Ghosh Shankar Kumar, Banerjee Raja
Department of Biotechnology, Maulana Abul Kalam Azad University of Technology, Nadia, West Bengal, India.
Department of Computer Science and Engineering, University of Calcutta, Kolkata, West Bengal, India.
J Biomol Struct Dyn. 2024 Feb 19:1-14. doi: 10.1080/07391102.2024.2316770.
Breast cancer is considered to be happened due to genetic aberration. Out of several genes expressed, it is found that cadherin 1, type 1 (CDH1) is responsible in several ways to control the metabolic order in human. Deregulation of the function of protein E-cadherin, expressed from CDH1 plays an important role in lobular breast cancer. In order to understand the root cause of this recent claim, we focus on CDH1 gene: whether the genetic information translated due to any deviation/alteration/modification in its sequence is related to the occurrence of the different types breast cancer. Towards this end, quantitative analysis of different biophysical and bio-chemical properties of CDH1 gene in genomic and proteomic levels from the available genomic (cDNA) sequences of CDH1 gene (obtained from the COSMIC Database for 78 patients, suffering from various types of breast cancer) clearly emphasizes that alternation/modification in the sequence of the CDH1 gene can be detrimental. Furthermore, Random forest, K-nearest neighbour and stochastic gradient descent (SGD) algorithms are applied on the derived dataset to classify the types of breast cancer, and to validate our hypothesis regarding the acute role of CDH1 as potential bio marker for breast cancer. Analysis of the mutated CDH1 gene sequences, and their related parameters using aforesaid machine learning techniques clearly establish that CDH1 gene can take the deterministic role in predicting the chances of occurrences of different types of breast cancer with an accuracy of Such an observation opens a new paradigm in diagnostic approach of breast cancer.
乳腺癌被认为是由基因畸变引起的。在表达的多个基因中,发现钙黏蛋白1型(CDH1)在多方面对控制人体代谢秩序起着作用。由CDH1表达的E-钙黏蛋白功能失调在小叶乳腺癌中起重要作用。为了理解这一最新论断的根本原因,我们聚焦于CDH1基因:其序列中因任何偏差/改变/修饰而翻译出的遗传信息是否与不同类型乳腺癌的发生有关。为此,从CDH1基因的可用基因组(cDNA)序列(从COSMIC数据库获取的78例患有各种类型乳腺癌患者的数据)在基因组和蛋白质组水平对CDH1基因的不同生物物理和生化特性进行定量分析清楚地表明,CDH1基因序列中的改变/修饰可能是有害的。此外,将随机森林、K近邻和随机梯度下降(SGD)算法应用于导出数据集以对乳腺癌类型进行分类,并验证我们关于CDH1作为乳腺癌潜在生物标志物所起关键作用的假设。使用上述机器学习技术对突变的CDH1基因序列及其相关参数进行分析清楚地表明,CDH1基因在预测不同类型乳腺癌发生几率方面可发挥决定性作用,准确率为 。这样的观察结果为乳腺癌的诊断方法开辟了新的范式。