Academy of Military Medical Sciences, Beijing 100850, China.
College of Medical Informatics, Chongqing Medical University, Chongqing 400016, China.
Nucleic Acids Res. 2024 Jul 22;52(13):7610-7626. doi: 10.1093/nar/gkae441.
Gene expression is temporally and spatially regulated by the interaction of transcription factors (TFs) and cis-regulatory elements (CREs). The uneven distribution of TF binding sites across the genome poses challenges in understanding how this distribution evolves to regulate spatio-temporal gene expression and the consequent heritable phenotypic variation. In this study, chromatin accessibility profiles and gene expression profiles were collected from several species, including mammals (human, mouse, and bovine), fish (zebrafish and medaka), and chicken. Transcription factor binding sites clustered regions (TFCRs) at different embryonic stages were characterized to investigate regulatory evolution. The study revealed dynamic changes in TFCR distribution during embryonic development and species evolution. The synchronization between TFCR complexity and gene expression was assessed across species using RegulatoryScore. Additionally, an explainable machine learning model highlighted the importance of the distance between a TFCR and the promoter in the coordinated regulation of gene expression by TFCRs. Our results revealed the developmental and evolutionary dynamics of TFCRs during embryonic development, from fish and chicken to mammals. These data provide valuable resources for exploring the relationship between transcriptional regulation and phenotypic differences during embryonic development.