Lai Chun Wan J, Yu Qingyi, Hou Shaobin, Skelton Rachel L, Jones Meghan R, Lewis Kanako L T, Murray Jan, Eustice Moriah, Guan Peizhu, Agbayani Ricelle, Moore Paul H, Ming Ray, Presting Gernot G
Department of Molecular Biosciences and Bioengineering, University of Hawaii, 1955 East-West Road, Agricultural Sciences Building Room 218, Honolulu, HI, 96822, USA.
Mol Genet Genomics. 2006 Jul;276(1):1-12. doi: 10.1007/s00438-006-0122-z. Epub 2006 May 16.
Papaya (Carica papaya L.) is a major tree fruit crop of tropical and subtropical regions with an estimated genome size of 372 Mbp. We present the analysis of 4.7% of the papaya genome based on BAC end sequences (BESs) representing 17 million high-quality bases. Microsatellites discovered in 5,452 BESs and flanking primer sequences are available to papaya breeding programs at http://www.genomics.hawaii.edu/papaya/BES . Sixteen percent of BESs contain plant repeat elements, the vast majority (83.3%) of which are class I retrotransposons. Several novel papaya-specific repeats were identified. Approximately 19.1% of the BESs have homology to Arabidopsis cDNA. Increasing numbers of completely sequenced plant genomes and BES projects enable novel approaches to comparative plant genomics. Paired BESs of Carica, Arabidopsis, Populus, Brassica and Lycopersicon were mapped onto the completed genomes of Arabidopsis and Populus. In general the level of microsynteny was highest between closely related organisms. However, papaya revealed a higher degree of apparent synteny with the more distantly related poplar than with the more closely related Arabidopsis. This, as well as significant colinearity observed between peach and poplar genome sequences, support recent observations of frequent genome rearrangements in the Arabidopsis lineage and suggest that the poplar genome sequence may be more useful for elucidating the papaya and other rosid genomes. These insights will play a critical role in selecting species and sequencing strategies that will optimally represent crop genomes in sequence databases.
番木瓜(番木瓜属番木瓜)是热带和亚热带地区的一种主要果树作物,估计基因组大小为372兆碱基对。我们基于代表1700万个高质量碱基的细菌人工染色体末端序列(BESs),对4.7%的番木瓜基因组进行了分析。在5452个BESs中发现的微卫星及其侧翼引物序列可在http://www.genomics.hawaii.edu/papaya/BES上供番木瓜育种项目使用。16%的BESs含有植物重复元件,其中绝大多数(83.3%)是I类反转录转座子。鉴定出了几个新的番木瓜特异性重复序列。约19.1%的BESs与拟南芥cDNA具有同源性。越来越多已完成测序的植物基因组和BES项目为比较植物基因组学提供了新方法。将番木瓜、拟南芥、杨树、甘蓝和番茄的配对BESs定位到拟南芥和杨树的完整基因组上。一般来说,亲缘关系较近的生物体之间的微共线性水平最高。然而,番木瓜与亲缘关系较远的杨树之间的明显共线性程度高于与亲缘关系较近的拟南芥。这一点,以及桃和杨树基因组序列之间观察到的显著共线性,支持了最近对拟南芥谱系中频繁基因组重排的观察结果,并表明杨树基因组序列可能对阐明番木瓜和其他蔷薇类基因组更有用。这些见解将在选择能在序列数据库中最佳代表作物基因组的物种和测序策略方面发挥关键作用。