Ng-Kee-Kwong Julian, Philps Ben, Smith Fiona N C, Sobieska Aleksandra, Chen Naiming, Alabert Constance, Bilen Hakan, Buonomo Sara C B
Institute of Cell Biology, School of Biological Sciences, University of Edinburgh, Roger Land Building, Alexander Crum Brown Road, Edinburgh, EH9 3FF, UK.
School of Informatics, University of Edinburgh, Edinburgh, EH8 9AB, UK.
Commun Biol. 2025 Feb 26;8(1):311. doi: 10.1038/s42003-025-07744-2.
In eukaryotic cells, DNA replication is organised both spatially and temporally, as evidenced by the stage-specific spatial distribution of replication foci in the nucleus. Despite the genetic association of aberrant DNA replication with numerous human diseases, the labour-intensive methods employed to study DNA replication have hindered large-scale analyses of its roles in pathological processes. In this study, we employ two distinct methodologies. We first apply supervised machine learning, successfully classifying S-phase patterns in wild-type mouse embryonic stem cells (mESCs), while additionally identifying altered replication dynamics in Rif1-deficient mESCs. Given the constraints imposed by a classification-based approach, we then develop an unsupervised method for large-scale detection of aberrant S-phase cells. Such a method, which does not aim to classify patterns based on pre-defined categories but rather detects differences autonomously, closely recapitulates expected differences across genotypes. We therefore extend our approach to a well-characterised cellular model of inducible deregulated origin firing, involving cyclin E overexpression. Through parallel EdU- and PCNA-based analyses, we demonstrate the potential applicability of our method to patient samples, offering a means to identify the contribution of deregulated DNA replication to a plethora of pathogenic processes.
在真核细胞中,DNA复制在空间和时间上都是有组织的,细胞核中复制位点的阶段特异性空间分布证明了这一点。尽管异常DNA复制与多种人类疾病存在遗传关联,但用于研究DNA复制的劳动密集型方法阻碍了对其在病理过程中作用的大规模分析。在本研究中,我们采用了两种不同的方法。我们首先应用监督式机器学习,成功地对野生型小鼠胚胎干细胞(mESCs)中的S期模式进行了分类,同时还识别出了Rif1缺陷型mESCs中改变的复制动态。鉴于基于分类方法的局限性,我们随后开发了一种用于大规模检测异常S期细胞的无监督方法。这种方法不是基于预定义的类别对模式进行分类,而是自主检测差异,紧密概括了不同基因型之间预期的差异。因此,我们将我们的方法扩展到一个特征明确的可诱导的起始点失控的细胞模型,该模型涉及细胞周期蛋白E的过表达。通过基于EdU和PCNA的平行分析,我们证明了我们的方法对患者样本的潜在适用性,提供了一种手段来确定失控的DNA复制对众多致病过程的贡献。