Ruan Yijun, Ooi Hong Sain, Choo Siew Woh, Chiu Kuo Ping, Zhao Xiao Dong, Srinivasan K G, Yao Fei, Choo Chiou Yu, Liu Jun, Ariyaratne Pramila, Bin Wilson G W, Kuznetsov Vladimir A, Shahab Atif, Sung Wing-Kin, Bourque Guillaume, Palanisamy Nallasivam, Wei Chia-Lin
Genome Technology and Biology Group, Genome Institute of Singapore, Singapore 138672, Singapore.
Genome Res. 2007 Jun;17(6):828-38. doi: 10.1101/gr.6018607.
Identification of unconventional functional features such as fusion transcripts is a challenging task in the effort to annotate all functional DNA elements in the human genome. Paired-End diTag (PET) analysis possesses a unique capability to accurately and efficiently characterize the two ends of DNA fragments, which may have either normal or unusual compositions. This unique nature of PET analysis makes it an ideal tool for uncovering unconventional features residing in the human genome. Using the PET approach for comprehensive transcriptome analysis, we were able to identify fusion transcripts derived from genome rearrangements and actively expressed retrotransposed pseudogenes, which would be difficult to capture by other means. Here, we demonstrate this unique capability through the analysis of 865,000 individual transcripts in two types of cancer cells. In addition to the characterization of a large number of differentially expressed alternative 5' and 3' transcript variants and novel transcriptional units, we identified 70 fusion transcript candidates in this study. One was validated as the product of a fusion gene between BCAS4 and BCAS3 resulting from an amplification followed by a translocation event between the two loci, chr20q13 and chr17q23. Through an examination of PETs that mapped to multiple genomic locations, we identified 4055 retrotransposed loci in the human genome, of which at least three were found to be transcriptionally active. The PET mapping strategy presented here promises to be a useful tool in annotating the human genome, especially aberrations in human cancer genomes.
在注释人类基因组中所有功能性DNA元件的过程中,识别诸如融合转录本等非常规功能特征是一项具有挑战性的任务。双末端二标签(PET)分析具有独特的能力,能够准确且高效地表征DNA片段的两端,这些片段可能具有正常或异常的组成。PET分析的这种独特性质使其成为揭示人类基因组中非常规特征的理想工具。使用PET方法进行全面的转录组分析,我们能够识别出源自基因组重排和活跃表达的逆转座假基因的融合转录本,而这些用其他方法很难捕获。在此,我们通过分析两种癌细胞中的865,000个个体转录本来证明这种独特能力。除了表征大量差异表达的可变5'和3'转录本变体以及新的转录单元外,我们在本研究中还识别出70个融合转录本候选物。其中一个被确认为是BCAS4和BCAS3之间融合基因的产物,该融合基因是由两个基因座chr20q13和chr17q23之间的扩增继之以易位事件产生的。通过检查映射到多个基因组位置的PET,我们在人类基因组中识别出4055个逆转座基因座,其中至少有三个被发现具有转录活性。本文提出的PET映射策略有望成为注释人类基因组的有用工具,尤其是注释人类癌症基因组中的畸变。