Computer, Electrical, and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia; Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia.
Department of Pediatric Endocrinology and Genetic Metabolism, Xin Hua Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai Institute for Pediatric Research, Room 801, Science and Education Building, Kongjiang Road 1665, Shanghai, China.
Comput Biol Med. 2024 Jul;177:108601. doi: 10.1016/j.compbiomed.2024.108601. Epub 2024 May 14.
Automated karyotyping is of great importance for cytogenetic research, as it speeds up the process for cytogeneticists through incorporating AI-driven automated segmentation and classification techniques. Existing frameworks confront two primary issues: Firstly the necessity for instance-level data annotation with either detection bounding boxes or semantic masks for training, and secondly, its poor robustness particularly when confronted with domain shifts. In this work, we first propose an accurate segmentation framework, namely KaryoXpert. This framework leverages the strengths of both morphology algorithms and deep learning models, allowing for efficient training that breaks the limit for the acquirement of manually labeled ground-truth mask annotations. Additionally, we present an accurate classification model based on metric learning, designed to overcome the challenges posed by inter-class similarity and batch effects. Our framework exhibits state-of-the-art performance with exceptional robustness in both chromosome segmentation and classification. The proposed KaryoXpert framework showcases its capacity for instance-level chromosome segmentation even in the absence of annotated data, offering novel insights into the research for automated chromosome segmentation. The proposed method has been successfully deployed to support clinical karyotype diagnosis.
自动化核型分析对于细胞遗传学研究具有重要意义,因为它通过结合人工智能驱动的自动化分割和分类技术,加快了细胞遗传学家的工作流程。现有的框架面临两个主要问题:首先,需要对实例级别的数据进行标注,无论是使用检测边界框还是语义掩码进行训练;其次,其在面对领域转移时的鲁棒性较差。在这项工作中,我们首先提出了一个准确的分割框架,即 KaryoXpert。该框架利用形态算法和深度学习模型的优势,实现了高效的训练,打破了获取手动标注的真实掩码注释的限制。此外,我们还提出了一种基于度量学习的准确分类模型,旨在克服类间相似性和批处理效应带来的挑战。我们的框架在染色体分割和分类方面均表现出卓越的性能和出色的鲁棒性,达到了最新水平。所提出的 KaryoXpert 框架展示了其在实例级别的染色体分割方面的能力,即使在没有标注数据的情况下也能实现,为自动化染色体分割的研究提供了新的思路。该方法已成功部署以支持临床核型诊断。