Yoon Joon, Kim Heebal
Department of Natural Science, Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, Korea.
Department of Agricultural Biotechnology, Animal Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul 08826, Korea.
Asian-Australas J Anim Sci. 2019 Jul;32(7):1044-1051. doi: 10.5713/ajas.18.0516. Epub 2018 Nov 28.
Recent studies indicate that gene expression is highly tissue-specific, so transcriptomic analyses should examine expression across a variety of tissues. In addition, as long non-coding RNA (lncRNA) annotation databases have gradually grown, the proportion of reads mapped to lncRNAs, and hence their importance, has increased accordingly.
We employed simple statistical models to detect sexually biased/dimorphic genes and their conjugate lncRNAs in 40 RNA-seq samples across two factors, sex and tissue. We used two quantification pipelines: mRNA annotation only and mRNA+lncRNA annotation.
The tissue-specific sexually dimorphic genes were affected by the addition of lncRNA annotation at a non-negligible level. In addition, many lncRNAs were expressed in a more tissue-specific fashion and with greater variation between tissues than protein-coding genes. Because of lncRNAs located in genic regions, the differentially expressed gene list changed, causing certain sexually biased genes to become ambiguous across tissues.
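The comparison of differentially expressed gene lists between the two quantification pipelines can be illustrated with a minimal sketch. It is not the authors' code: the function name `compare_de_lists`, the tissue names, and the gene identifiers are hypothetical placeholders, and the sketch assumes each pipeline has already produced one set of sex-biased genes per tissue.

```python
def compare_de_lists(mrna_only, with_lncrna):
    """Per tissue, split sex-biased genes into retained, lost, and gained sets.

    Both arguments map a tissue name to the set of genes called
    differentially expressed between sexes under one pipeline.
    """
    report = {}
    for tissue in sorted(set(mrna_only) | set(with_lncrna)):
        before = set(mrna_only.get(tissue, ()))
        after = set(with_lncrna.get(tissue, ()))
        report[tissue] = {
            "retained": before & after,  # called under both annotations
            "lost": before - after,      # called only with mRNA annotation
            "gained": after - before,    # called only with mRNA+lncRNA annotation
        }
    return report


# Hypothetical placeholder data, not results from the study.
mrna_only = {"liver": {"geneA", "geneB", "geneC"}, "muscle": {"geneA", "geneD"}}
with_lncrna = {"liver": {"geneA", "geneC", "lncX"}, "muscle": {"geneA"}}

report = compare_de_lists(mrna_only, with_lncrna)
for tissue, groups in report.items():
    print(tissue, {name: sorted(genes) for name, genes in groups.items()})
```

Genes in the "lost" group are the ones whose sex-biased status becomes ambiguous once lncRNA annotation is added, which is the effect described above.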
A previous study reported tissue-specific patterns among the genes differentially expressed between sexes in cattle. Using the same dataset, this study applied a more recent reference and added conjugate lncRNA information, which revealed alterations in the differentially expressed gene lists that lead to apparent differences in downstream analysis and interpretation. We firmly believe that such misquantification of genic lncRNAs can be critical in both future and past studies.