Ng Patrick, Wei Chia-Lin, Sung Wing-Kin, Chiu Kuo Ping, Lipovich Leonard, Ang Chin Chin, Gupta Sanjay, Shahab Atif, Ridwan Azmi, Wong Chee Hong, Liu Edison T, Ruan Yijun
Genome Institute of Singapore, 60 Biopolis Street, Genome 02-01, Singapore 138672.
Nat Methods. 2005 Feb;2(2):105-11. doi: 10.1038/nmeth733. Epub 2005 Jan 9.
We have developed a DNA tag sequencing and mapping strategy called gene identification signature (GIS) analysis, in which 5' and 3' signatures of full-length cDNAs are accurately extracted into paired-end ditags (PETs) that are concatenated for efficient sequencing and mapped to genome sequences to demarcate the transcription boundaries of every gene. GIS analysis is potentially 30-fold more efficient than standard cDNA sequencing approaches for transcriptome characterization. We demonstrated this approach with 116,252 PET sequences derived from mouse embryonic stem cells. Initial analysis of this dataset identified hundreds of previously uncharacterized transcripts, including alternative transcripts of known genes. We also uncovered several intergenically spliced and unusual fusion transcripts, one of which was confirmed as a trans-splicing event and was differentially expressed. The concept of paired-end ditagging described here for transcriptome analysis can also be applied to whole-genome analysis of cis-regulatory and other DNA elements and represents an important technological advance for genome annotation.
我们开发了一种名为基因识别标签(GIS)分析的DNA标签测序和定位策略,该策略可将全长cDNA的5'和3'标签准确提取为双端标签(PET),将这些双端标签串联起来以便高效测序,并将其定位到基因组序列上,从而划定每个基因的转录边界。对于转录组特征分析,GIS分析的效率可能比标准cDNA测序方法高30倍。我们用源自小鼠胚胎干细胞的116,252个PET序列证明了这种方法。对该数据集的初步分析鉴定出数百个以前未被表征的转录本,包括已知基因的可变转录本。我们还发现了几个基因间剪接和异常的融合转录本,其中一个被确认为反式剪接事件,并且存在差异表达。这里描述的用于转录组分析的双端标签概念也可应用于顺式调控和其他DNA元件的全基因组分析,代表了基因组注释的一项重要技术进步。