Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX, Liverpool, United Kingdom.
Key Laboratory of Ministry of Education of Gastrointestinal Cancer, School of Basic Medical Science, Fujian Medical University, Fuzhou, China.
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab088.
Motivation N6-methyladenosine (m6A) is the most prevalent RNA modification on mRNAs and lncRNAs. Evidence increasingly demonstrates its crucial importance in essential molecular mechanisms and various diseases. With recent advances in sequencing techniques, tens of thousands of m6A sites are identified in a typical high-throughput experiment, posing a key challenge to distinguish the functional m6A sites from the remaining 'passenger' (or 'silent') sites. Results: We performed a comparative conservation analysis of the human and mouse m6A epitranscriptomes at single site resolution. A novel scoring framework, ConsRM, was devised to quantitatively measure the degree of conservation of individual m6A sites. ConsRM integrates multiple information sources and a positive-unlabeled learning framework, which integrated genomic and sequence features to trace subtle hints of epitranscriptome layer conservation. With a series validation experiments in mouse, fly and zebrafish, we showed that ConsRM outperformed well-adopted conservation scores (phastCons and phyloP) in distinguishing the conserved and unconserved m6A sites. Additionally, the m6A sites with a higher ConsRM score are more likely to be functionally important. An online database was developed containing the conservation metrics of 177 998 distinct human m6A sites to support conservation analysis and functional prioritization of individual m6A sites. And it is freely accessible at: https://www.xjtlu.edu.cn/biologicalsciences/con.
动机 N6-甲基腺苷(m6A)是 mRNA 和 lncRNA 上最普遍的 RNA 修饰。越来越多的证据表明,它在重要的分子机制和各种疾病中起着至关重要的作用。随着测序技术的最新进展,在一个典型的高通量实验中可以鉴定出数万个 m6A 位点,这对区分功能 m6A 位点和剩余的“乘客”(或“沉默”)位点提出了关键挑战。
我们在单个位点分辨率上对人和小鼠的 m6A 转录后组进行了比较保守性分析。设计了一种新的评分框架 ConsRM,用于定量测量单个 m6A 位点的保守程度。ConsRM 整合了多种信息源和正无标签学习框架,该框架整合了基因组和序列特征,以追踪转录后组保守层的细微线索。通过在小鼠、果蝇和斑马鱼中的一系列验证实验,我们表明 ConsRM 在区分保守和非保守 m6A 位点方面优于经过充分验证的保守评分(phastCons 和 phyloP)。此外,具有更高 ConsRM 评分的 m6A 位点更有可能具有功能重要性。开发了一个在线数据库,其中包含 177998 个独特的人类 m6A 位点的保守性度量标准,以支持单个 m6A 位点的保守性分析和功能优先级排序。并且可以在以下网址免费访问:https://www.xjtlu.edu.cn/biologicalsciences/con.