Borel Christelle, Gagnebin Maryline, Gehrig Corinne, Kriventseva Evgenia V, Zdobnov Evgeny M, Antonarakis Stylianos E
Department of Genetic Medicine and Development, University of Geneva Medical School and University Hospitals of Geneva, Geneva 1211, Switzerland.
Am J Hum Genet. 2008 Apr;82(4):971-81. doi: 10.1016/j.ajhg.2008.02.016.
The elucidation of the largely unknown transcriptome of small RNAs is crucial for the understanding of genome and cellular function. We report here the results of the analysis of small RNAs (< 50 nt) in the ENCODE regions of the human genome. Size-fractionated RNAs from four different cell lines (HepG2, HelaS3, GM06990, SK-N-SH) were mapped with the forward and reverse ENCODE high-density resolution tiling arrays. The top 1% of hybridization signals are termed SmRfrags (Small RNA fragments). Eight percent of SmRfrags overlap the GENCODE genes (CDS), given that the majority map to intergenic regions (34%), intronic regions (53%), and untranslated regions (UTRs) (5%). In addition, 9.6% and 16.8% of SmRfrags in the 5' UTR regions overlap significantly with His/Pol II/TAF250 binding sites and DNase I Hypersensitive sites, respectively (compared to the 5.3% and 9% expected). Interestingly, 17%-24% (depending on the cell line) of SmRfrags are sense-antisense strand pairs that show evidence of overlapping transcription. Only 3.4% and 7.2% of SmRfrags in intergenic regions overlap transcribed fragments (Txfrags) in HeLa and GM06990 cell lines, respectively. We hypothesized that a fraction of the identified SmRfrags corresponded to microRNAs. We tested by Northern blot a set of 15 high-likelihood predictions of microRNA candidates that overlap with smRfrags and validated three potential microRNAs ( approximately 20 nt length). Notably, most of the remaining candidates showed a larger hybridizing band ( approximately 100 nt) that could be a microRNA precursor. The small RNA transcriptome is emerging as an important and abundant component of the genome function.
阐明大部分未知的小RNA转录组对于理解基因组和细胞功能至关重要。我们在此报告人类基因组ENCODE区域中小RNA(<50 nt)的分析结果。来自四种不同细胞系(HepG2、HelaS3、GM06990、SK-N-SH)的大小分级RNA与正向和反向ENCODE高密度分辨率平铺阵列进行了比对。杂交信号的前1%被称为SmRfrags(小RNA片段)。鉴于大多数SmRfrags定位于基因间区域(34%)、内含子区域(53%)和非翻译区域(UTR)(5%),8%的SmRfrags与GENCODE基因(CDS)重叠。此外,5'UTR区域中9.6%和16.8%的SmRfrags分别与His/Pol II/TAF250结合位点和DNase I超敏位点有显著重叠(与预期的5.3%和9%相比)。有趣的是,17%-24%(取决于细胞系)的SmRfrags是有义-反义链对,显示出重叠转录的证据。在HeLa和GM06990细胞系中,基因间区域中分别只有3.4%和7.2%的SmRfrags与转录片段(Txfrags)重叠。我们推测一部分已鉴定的SmRfrags对应于微小RNA。我们通过Northern印迹法测试了一组与smRfrags重叠的15个高可能性微小RNA候选物预测,并验证了三种潜在的微小RNA(长度约为20 nt)。值得注意的是,大多数其余候选物显示出更大的杂交条带(约100 nt),可能是微小RNA前体。小RNA转录组正成为基因组功能的一个重要且丰富的组成部分。