Genomic Testing Cooperative, Irvine, California.
Dow University of Health Sciences Karachi, Karachi City, Pakistan.
Am J Pathol. 2023 Jan;193(1):51-59. doi: 10.1016/j.ajpath.2022.09.006. Epub 2022 Oct 13.
Diagnosis and classification of tumors is increasingly dependent on biomarkers. RNA expression profiling using next-generation sequencing provides reliable and reproducible information on the biology of cancer. This study investigated targeted transcriptome and artificial intelligence for differential diagnosis of hematologic and solid tumors. RNA samples from hematologic neoplasms (N = 2606), solid tumors (N = 2038), normal bone marrow (N = 782), and lymph node control (N = 24) were sequenced using next-generation sequencing using a targeted 1408-gene panel. Twenty subtypes of hematologic neoplasms and 24 subtypes of solid tumors were identified. Machine learning was used for diagnosis between two classes. Geometric mean naïve Bayesian classifier was used for differential diagnosis across 45 diagnostic entities with assigned rankings. Machine learning showed high accuracy in distinguishing between two diagnoses, with area under the curve varying between 1 and 0.841. Geometric mean naïve Bayesian algorithm was trained using 3045 samples and tested on 1415 samples, and showed correct first-choice diagnosis in 100%, 88%, 85%, 82%, 88%, 72%, and 72% of acute lymphoblastic leukemia, acute myeloid leukemia, diffuse large B-cell lymphoma, colorectal cancer, lung cancer, chronic lymphocytic leukemia, and follicular lymphoma cases, respectively. The data indicate that targeted transcriptome combined with artificial intelligence are highly useful for diagnosis and classification of various cancers. Mutation profiles and clinical information can improve these algorithms and minimize errors in diagnoses.
诊断和分类肿瘤越来越依赖于生物标志物。使用下一代测序的 RNA 表达谱分析可提供有关癌症生物学的可靠且可重复的信息。本研究调查了靶向转录组和人工智能在血液和实体肿瘤的鉴别诊断中的应用。使用靶向 1408 个基因panel 对血液肿瘤(N=2606)、实体肿瘤(N=2038)、正常骨髓(N=782)和淋巴结对照(N=24)的 RNA 样本进行了下一代测序。确定了 20 种血液肿瘤亚型和 24 种实体肿瘤亚型。使用机器学习对两种疾病进行诊断。使用几何平均值朴素贝叶斯分类器对 45 个诊断实体进行差异诊断,并给出了分配的排名。机器学习在区分两种诊断方面表现出很高的准确性,曲线下面积在 1 到 0.841 之间变化。使用 3045 个样本训练几何平均值朴素贝叶斯算法,并在 1415 个样本上进行测试,在急性淋巴细胞白血病、急性髓细胞白血病、弥漫性大 B 细胞淋巴瘤、结直肠癌、肺癌、慢性淋巴细胞白血病和滤泡性淋巴瘤病例中,分别有 100%、88%、85%、82%、88%、72%和 72%的病例正确地做出了首选诊断。数据表明,靶向转录组结合人工智能对于各种癌症的诊断和分类非常有用。突变谱和临床信息可以改进这些算法并最小化诊断中的错误。