Chen Jiabi, Chen Xiaoshu
Department of Immunology and Microbiology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China.
Key Laboratory of Tropical Disease Control, Ministry of Education, Sun Yat-sen University, Guangzhou, China.
Mol Biol Evol. 2025 Feb 3;42(2). doi: 10.1093/molbev/msaf004.
A controversy in evolutionary genetics is whether active dosage compensation is necessary to resolve the gene dosage imbalance between the X chromosome and autosomes. ScRNA-seq data could provide insight into this issue. However, it's crucial to carefully evaluate whether inherent characteristics of scRNA-seq, such as the sparsity of detected genes, might bias the X:AA expression ratio in mammals. This study evaluated two common strategies for selecting genes in the calculation of X:AA, namely, filter-by-expression and filter-by-fraction, with simulated scRNA-seq and bulk RNA-seq datasets. We found that both strategies produce an inflated X:AA, thus artifactually supporting dosage compensation. Analyzing empirical human Smart-seq2 data, results from the filter-by-expression strategy suggested that X-linked genes were more highly expressed than autosomal genes, a pattern that is neither predicted by dosage compensation nor explained by genes escaping X chromosome inactivation. However, the results of the filter-by-fraction strategy are consistent with the simulation. Furthermore, despite biasing for mean expression levels, we found that scRNA-seq data could be used to detect X-to-autosome expression noise differences as small as 10%, which enabled investigation into the distribution of genes that are more likely insensitive to gene dosage changes. Analysis of the empirical Smart-seq2 data revealed a 10% to 15% increase in expression noise for X chromosomes compared with autosomes and a significant depletion of dosage-sensitive genes on X chromosomes. Overall, these results highlight the need to be cautious when interpreting scRNA-seq data, particularly when comparing the expression of different genes, and provide additional evidence for the hypothesis of X chromosome insensitivity.
进化遗传学中的一个争议是,主动剂量补偿对于解决X染色体和常染色体之间的基因剂量失衡是否必要。单细胞RNA测序(scRNA-seq)数据可以为这个问题提供见解。然而,仔细评估scRNA-seq的固有特征(如检测到的基因的稀疏性)是否可能使哺乳动物中的X:AA表达比率产生偏差至关重要。本研究使用模拟的scRNA-seq和批量RNA测序(bulk RNA-seq)数据集,评估了在计算X:AA时选择基因的两种常见策略,即按表达过滤和按比例过滤。我们发现这两种策略都会使X:AA升高,从而人为地支持剂量补偿。分析人类经验性的Smart-seq2数据,按表达过滤策略的结果表明,X连锁基因比常染色体基因表达更高,这种模式既不是剂量补偿所预测的,也不能用逃避X染色体失活的基因来解释。然而,按比例过滤策略的结果与模拟结果一致。此外,尽管对平均表达水平存在偏差,但我们发现scRNA-seq数据可用于检测低至10%的X染色体与常染色体之间的表达噪声差异,这使得能够研究对基因剂量变化不太敏感基因的分布。对经验性Smart-seq2数据的分析显示,与常染色体相比,X染色体的表达噪声增加了10%至15%,并且X染色体上剂量敏感基因显著减少。总体而言,这些结果凸显了在解释scRNA-seq数据时需要谨慎,特别是在比较不同基因的表达时,并为X染色体不敏感假说提供了额外证据。