Suppr超能文献

基于监督机器学习技术的负样本检测风险估计算法的新方法。

New Approach for Risk Estimation Algorithms of Negativeness Detection with Modelling Supervised Machine Learning Techniques.

机构信息

Istanbul University, Oncology Institute, Department of Basic Oncology, Division of Cancer Genetics, 34093 Fatih, Istanbul, Turkey.

Istanbul University-Cerrahpasa, Engineering Faculty, Computer Engineering Department, 34320 Avcilar, Istanbul, Turkey.

出版信息

Dis Markers. 2020 Dec 9;2020:8594090. doi: 10.1155/2020/8594090. eCollection 2020.

Abstract

gene testing is a difficult, expensive, and time-consuming test which requires excessive work load. The identification of the gene mutations is significantly important in the selection of treatment and the risk of secondary cancer. We aimed to develop an algorithm considering all the clinical, demographic, and genetic features of patients for identifying the negativity in the present study. An experimental dataset was created with the collection of the all clinical, demographic, and genetic features of breast cancer patients for 20 years. This dataset consisted of 125 features of 2070 high-risk breast cancer patients. All data were numeralized and normalized for detection of the negativity in the machine learning algorithm. The performance of the algorithm was identified by studying the machine learning model with the test data. nearest neighbours (KNN) and decision tree (DT) accuracy rates of 9 features involving Dataset 2 were found to be the most effective. The removal of the unnecessary data in the dataset by reducing the number of features was shown to increase the accuracy rate of algorithm compared with the DT. negativity was identified without performing the gene test with 92.88% accuracy within minutes in high-risk breast cancer patients with this algorithm, and the test associated result waiting stress, time, and money loss were prevented. That algorithm is suggested be useful in fast performing of the treatment plans of patients and accurately in addition to speeding up the clinical practice.

摘要

基因检测是一项困难、昂贵且耗时的测试,需要大量的工作负荷。基因突变的鉴定对于治疗方案的选择和二次癌症的风险具有重要意义。本研究旨在开发一种算法,该算法考虑了患者所有的临床、人口统计学和遗传特征,以确定当前研究中的阴性结果。通过收集 20 年来所有乳腺癌患者的临床、人口统计学和遗传特征,创建了一个实验数据集。该数据集包含 2070 名高危乳腺癌患者的 125 个特征。所有数据均经过数字化和归一化处理,以检测机器学习算法中的阴性结果。通过使用测试数据研究机器学习模型来确定算法的性能。在涉及数据集 2 的 9 个特征中,最近邻 (KNN) 和决策树 (DT) 的准确率最高。通过减少特征数量来去除数据集中的不必要数据,与 DT 相比,算法的准确率有所提高。该算法可在几分钟内以 92.88%的准确率识别高危乳腺癌患者的阴性结果,无需进行基因检测,避免了检测相关的等待压力、时间和金钱损失。该算法建议在快速制定患者治疗计划、准确判断以及加快临床实践方面具有一定的应用价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f6e/7787793/993a481cca34/DM2020-8594090.001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验