Edward S. Rogers, Sr. Department of Electrical and Computer Engineering, University of Toronto, Toronto, Ontario, Canada.
Institute of Biomedical Engineering, University of Toronto, Toronto, Ontario, Canada.
PLoS One. 2024 Sep 18;19(9):e0307360. doi: 10.1371/journal.pone.0307360. eCollection 2024.
Neighboring genes within a shared promoter arrangement (i.e. opposite direction with the neighboring ends as the transcriptional start sites) are expected to have a high similarity in genotype tissue expression due to the potential overlap in the promoter region. This raises the question of whether similarity in expression profiles depends on orientation of the neighboring genes and whether there exist thresholds of locality where the similarity diminishes. Thus, in this work, we compared genotype tissue expression profiles at different genomic orientations and localities. Interestingly, there exist gene pairs in the human genome with very high or low expression similarity. Shorter chromosomes tend to have more similarly expressed genes. Also, a cluster of 3 adjacent genes within the average range of 20 to 60 kilobase pairs can have very similar expression profiles regardless of their orientations. However, when genes are nested and in opposite orientations, a lower than expected similarity was observed. Lastly, in cases where genotype tissue expression data does not exist or have low read counts (e.g. non-coding RNA), our identified influencing range can be a first estimate of the genotype tissue expression.
由于启动子区域可能存在重叠,因此在共享启动子排列(即相邻末端为转录起始位点的相反方向)内的相邻基因预计在基因型组织表达上具有高度相似性。这就提出了一个问题,即表达谱的相似性是否取决于相邻基因的方向,以及是否存在相似性降低的局部性阈值。因此,在这项工作中,我们比较了不同基因组方向和局部性的基因型组织表达谱。有趣的是,人类基因组中存在具有非常高或非常低表达相似性的基因对。较短的染色体往往具有更多相似表达的基因。此外,在平均范围为 20 到 60 千碱基对的 3 个相邻基因簇中,无论其方向如何,都可以具有非常相似的表达谱。然而,当基因嵌套且方向相反时,观察到的相似性低于预期。最后,在不存在基因型组织表达数据或读取计数较低的情况下(例如非编码 RNA),我们确定的影响范围可以作为基因型组织表达的初步估计。