Fields C
Center for Advanced Computing in Molecular and Cellular Biology, New Mexico State University, Las Cruces 88003-0001.
Nucleic Acids Res. 1990 Mar 25;18(6):1509-12. doi: 10.1093/nar/18.6.1509.
A database of sequences of 139 introns from the nematode Caenorhabditis elegans was analyzed using the information measure of Schneider et al. (1986) J. Mol. Biol. 128: 415-431. Statistically significant information is encoded by at least the first 30 nt and last 20 nt of C. elegans introns. Both the quantity and the distribution of information in the 5' splice site sequences differs between the typical short (length less than 75 nt) and rarer long (length greater than 75 nt) introns, with the 5 sites of long introns containing approximately one bit more information. 3' splice site sequences of long and short C. elegans introns differ significantly in the region between -20 and -10 nt.
利用施奈德等人(1986年,《分子生物学杂志》128卷:415 - 431页)的信息度量方法,对来自秀丽隐杆线虫的139个内含子序列数据库进行了分析。秀丽隐杆线虫内含子的至少前30个核苷酸和后20个核苷酸编码着具有统计学意义的信息。典型的短内含子(长度小于75个核苷酸)和较少见的长内含子(长度大于75个核苷酸)在5'剪接位点序列中的信息量和信息分布均有所不同,长内含子的5'位点所含信息大约多一位。秀丽隐杆线虫长内含子和短内含子的3'剪接位点序列在 - 20至 - 10个核苷酸区域存在显著差异。