Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA.
G3 (Bethesda). 2012 Aug;2(8):921-30. doi: 10.1534/g3.112.003392. Epub 2012 Aug 1.
Targeted sequence capture is a promising technology in many areas in biology. These methods enable efficient and relatively inexpensive sequencing of hundreds to thousands of genes or genomic regions from many more individuals than is practical using whole-genome sequencing approaches. Here, we demonstrate the feasibility of target enrichment using sequence capture in polyploid cotton. To capture and sequence both members of each gene pair (homeologs) of wild and domesticated Gossypium hirsutum, we created custom hybridization probes to target 1000 genes (500 pairs of homeologs) using information from the cotton transcriptome. Two widely divergent samples of G. hirsutum were hybridized to four custom NimbleGen capture arrays containing probes for targeted genes. We show that the two coresident homeologs in the allopolyploid nucleus were efficiently captured with high coverage. The capture efficiency was similar between the two accessions and independent of whether the samples were multiplexed. A significant amount of flanking, nontargeted sequence (untranslated regions and introns) was also captured and sequenced along with the targeted exons. Intraindividual heterozygosity is low in both wild and cultivated Upland cotton, as expected from the high level of inbreeding in natural G. hirsutum and bottlenecks accompanying domestication. In addition, levels of heterozygosity appeared asymmetrical with respect to genome (A(T) or D(T)) in cultivated cotton. The approach used here is general, scalable, and may be adapted for many different research inquiries involving polyploid plant genomes.
靶向序列捕获是生物学许多领域中一种很有前途的技术。这些方法能够高效、相对廉价地对数百到数千个基因或基因组区域进行测序,其适用范围比全基因组测序方法更广。在这里,我们展示了在多倍体棉花中使用序列捕获进行目标富集的可行性。为了捕获和测序野生和驯化棉属(Gossypium hirsutum)每个基因对(同源基因)的两个成员,我们使用棉花转录组的信息,创建了针对 1000 个基因(500 对同源基因)的定制杂交探针,用于目标基因的捕获。我们将两个广泛分歧的棉属样本杂交到四个包含目标基因探针的定制 NimbleGen 捕获阵列上。我们表明,多倍体细胞核中的两个共存同源基因能够被高效捕获,具有高覆盖率。两个品系之间的捕获效率相似,且与样本是否进行多重化无关。与靶向外显子一起,还捕获和测序了大量侧翼非靶向序列(非翻译区和内含子)。野生和栽培陆地棉个体内的杂合性均较低,这与自然棉属中高度自交和伴随驯化而来的瓶颈相一致。此外,在栽培棉花中,杂合性水平似乎与基因组(A(T) 或 D(T))不对称。这里使用的方法是通用的、可扩展的,并且可以适应涉及多倍体植物基因组的许多不同研究。