Housley Donna J E, Zalewski Zachary A, Beckett Stephanie E, Venta Patrick J
Small Animal Clinical Sciences, Michigan State University, East Lansing, MI 48824-1314, USA.
BMC Genomics. 2006 Oct 9;7:253. doi: 10.1186/1471-2164-7-253.
Cross-species primers have been used with moderate success to address a variety of questions concerning genome structure, evolution, and gene function. However, the factors affecting their success have never been adequately addressed, particularly with respect to producing a consistent method to achieve high throughput. Using 1,147 mammalian cross-species primer pairs (1089 not previously reported), we tested several factors to determine their influence on the probability that a given target will amplify in a given species under a single amplification condition. These factors included: number of mismatches between the two species (the index species) used to identify conserved regions to which the primers were designed, GC-content of the gene and amplified region, CpG dinucleotides in the primer region, degree of encoded protein conservation, length of the primers, and the degree of evolutionary distance between the target species and the two index species.
The amplification success rate for the cross-species primers was significantly influenced by the number of mismatches between the two index species (6-8% decrease per mismatch in a primer pair), the GC-content within the amplified region (for the dog, GC > or = 50%, 56.9% amplified; GC<50%, 74.2% amplified), the degree of protein conservation (R2 = 0.14) and the relatedness of the target species to the index species. For the dog, 598 products of 930 primer pairs (64.3%) (excluding primers in which dog was an index species) were sequenced and shown to be the expected product, with an additional three percent producing the incorrect sequence. When hamster DNA was used with the single amplification condition in a microtiter plate-based format, 510 of 1087 primer pairs (46.9%) produced amplified products. The primer pairs are spaced at an average distance of 2.3 Mb in the human genome and may be used to produce up to several hundred thousand bp of species-specific sequence.
The most important factors influencing the proportion of successful amplifications are the number of index species mismatches, GC-richness of the target amplimer, and the relatedness of the target species to the index species, at least under the single PCR condition used. The 1147 cross-species primer pairs can be used in a high throughput manner to generate data for studies on the genetics and genomics of non-sequenced mammalian genomes.
跨物种引物已被用于解决有关基因组结构、进化和基因功能的各种问题,并取得了一定的成功。然而,影响其成功的因素从未得到充分探讨,特别是在开发一种一致的高通量方法方面。我们使用了1147对哺乳动物跨物种引物对(其中1089对未曾报道过),测试了几个因素,以确定它们对在单一扩增条件下给定靶标在给定物种中扩增概率的影响。这些因素包括:用于识别设计引物的保守区域的两个物种(索引物种)之间的错配数、基因和扩增区域的GC含量、引物区域中的CpG二核苷酸、编码蛋白质的保守程度、引物长度以及靶标物种与两个索引物种之间的进化距离。
跨物种引物的扩增成功率受到两个索引物种之间错配数(引物对中每增加一个错配,成功率下降6 - 8%)、扩增区域内的GC含量(对于狗,GC≥50%时,56.9%的引物对扩增成功;GC<50%时,74.2%的引物对扩增成功)、蛋白质保守程度(R2 = 0.14)以及靶标物种与索引物种的亲缘关系的显著影响。对于狗,对930对引物中的598个产物(64.3%)(不包括以狗为索引物种的引物)进行了测序,结果显示为预期产物,另外3%产生了错误序列。当在基于微量滴定板的形式下使用单一扩增条件处理仓鼠DNA时,1087对引物中有510对(46.9%)产生了扩增产物。这些引物对在人类基因组中的平均间距为2.3 Mb,可用于产生多达几十万碱基对的物种特异性序列。
至少在所使用的单一PCR条件下,影响扩增成功比例的最重要因素是索引物种错配数、靶标扩增子的GC丰富度以及靶标物种与索引物种的亲缘关系。这1147对跨物种引物对可用于高通量方式,为未测序哺乳动物基因组的遗传学和基因组学研究生成数据。