Song Xuhao, Shen Fujun, Huang Jie, Huang Yan, Du Lianming, Wang Chengdong, Fan Zhenxin, Hou Rong, Yue Bisong, Zhang Xiuyue
From the Key Laboratory of Bio-resources and Eco-environment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, Sichuan 610064, China (Song, Huang, Du, Fan, Yue, and Zhang); The Sichuan Key Laboratory for Conservation Biology of Endangered Wildlife, Chengdu Research Base of Giant Panda Breeding, Chengdu, Sichuan 610081, China (Shen, Wang, and Hou); and China Research and Conservation Center for the Giant Panda, Wenchuan 623006, China (Huang).
J Hered. 2016 Sep;107(5):423-30. doi: 10.1093/jhered/esw024. Epub 2016 Apr 25.
Recently, an increasing number of microsatellites or simple sequence repeats (SSRs) have been found and characterized from transcriptomes. Such SSRs can be employed as putative functional markers to easily tag corresponding genes, which play an important role in biomedical studies and genetic analysis. However, the transcriptome-derived SSRs for giant panda (Ailuropoda melanoleuca) are not yet available. In this work, we identified and characterized 20 tetranucleotide microsatellite loci from a transcript database generated from the blood of giant panda. Furthermore, we assigned their predicted transcriptome locations: 16 loci were assigned to untranslated regions (UTRs) and 4 loci were assigned to coding regions (CDSs). Gene identities of 14 transcripts contained corresponding microsatellites were determined, which provide useful information to study the potential contribution of SSRs to gene regulation in giant panda. The polymorphic information content (PIC) values ranged from 0.293 to 0.789 with an average of 0.603 for the 16 UTRs-derived SSRs. Interestingly, 4 CDS-derived microsatellites developed in our study were also polymorphic, and the instability of these 4 CDS-derived SSRs was further validated by re-genotyping and sequencing. The genes containing these 4 CDS-derived SSRs were embedded with various types of repeat motifs. The interaction of all the length-changing SSRs might provide a way against coding region frameshift caused by microsatellite instability. We hope these newly gene-associated biomarkers will pave the way for genetic and biomedical studies for giant panda in the future. In sum, this set of transcriptome-derived markers complements the genetic resources available for giant panda.
最近,越来越多的微卫星或简单序列重复(SSR)已从转录组中被发现并进行了特征描述。此类SSR可作为推定的功能标记,用于轻松标记相应基因,这在生物医学研究和遗传分析中发挥着重要作用。然而,大熊猫(Ailuropoda melanoleuca)转录组衍生的SSR尚未可得。在这项工作中,我们从大熊猫血液生成的转录数据库中鉴定并描述了20个四核苷酸微卫星位点。此外,我们确定了它们在转录组中的预测位置:16个位点被定位到非翻译区(UTR),4个位点被定位到编码区(CDS)。确定了14个含有相应微卫星的转录本的基因身份,这为研究SSR对大熊猫基因调控的潜在贡献提供了有用信息。16个源自UTR的SSR的多态信息含量(PIC)值范围为0.293至0.789,平均为0.603。有趣的是,我们研究中开发的4个源自CDS的微卫星也是多态的,并且通过重新基因分型和测序进一步验证了这4个源自CDS的SSR的不稳定性。含有这4个源自CDS的SSR的基因嵌入了各种类型的重复基序。所有长度变化的SSR之间的相互作用可能提供一种防止微卫星不稳定性导致编码区移码的方法。我们希望这些新的与基因相关的生物标志物将为未来大熊猫的遗传和生物医学研究铺平道路。总之,这组转录组衍生标记补充了大熊猫可用的遗传资源。