Fang Lu, Yang Yuchen, Guo Wuxia, Li Jianfang, Zhong Cairong, Huang Yelin, Zhou Renchao, Shi Suhua
State Key Laboratory of Biocontrol, Guangdong Provincial Key Laboratory of Plant Resources, Sun Yat-sen University, Guangzhou 510275, China.
Hainan Dongzhai Harbor National Nature Reserve, Haikou 571129, China.
Mar Genomics. 2016 Aug;28:49-52. doi: 10.1016/j.margen.2016.05.004. Epub 2016 Jun 11.
Aegiceras corniculatum (L.) Blanco is one of the most salt tolerant mangrove species and can thrive in 3% salinity at the seaward edge of mangrove forests. Here we sequenced the transcriptome of A. corniculatum used Illumina GA platform to develop its genomic resources for ecological and evolutionary studies. We obtained about 50 million high-quality paired-end reads with 75bp in length. Using the short read assembler Velvet, we yielded 49,437 contigs with the average length of 625bp. A total of 32,744 (66.23%) contigs showed significant similarity to the GenBank non-redundant (NR) protein database. 30,911 and 18,004 of these sequences were assigned to Gene Ontology and eukaryotic orthologous groups of proteins (KOG). A total of 4942 transcripts from our assemblies had significant similarity with KEGG Orthologs and were involved in 144 KEGG pathways, while 9899 unigenes had enzyme commission (EC) numbers. In addition, 9792 transcriptome-derived SSRs were identified from 7342 sequences. With our strict criteria, 4165 candidate SNPs were also identified from 2058 contigs. Some of these SNPs were further validated by Sanger sequencing. Genomic resources generated in this study should be valuable in ecological, evolutionary, and functional genomics studies for this mangrove species.
桐花树(Aegiceras corniculatum (L.) Blanco)是最耐盐的红树植物之一,能在红树林向海边缘3%盐度的环境中茁壮成长。在此,我们使用Illumina GA平台对桐花树的转录组进行测序,以开发其基因组资源用于生态和进化研究。我们获得了约5000万个长度为75bp的高质量双末端读数。使用短读组装软件Velvet,我们得到了49437个重叠群,平均长度为625bp。共有32744个(66.23%)重叠群与GenBank非冗余(NR)蛋白质数据库具有显著相似性。这些序列中的30911个和18004个被分别归类到基因本体论和真核生物直系同源蛋白质组(KOG)。我们组装的转录本中共有4942个与KEGG直系同源物具有显著相似性,并参与了144条KEGG通路,同时有9899个单基因具有酶委员会(EC)编号。此外,从7342个序列中鉴定出9792个转录组衍生的简单序列重复(SSR)。按照我们严格的标准,还从2058个重叠群中鉴定出4165个候选单核苷酸多态性(SNP)。其中一些SNP通过桑格测序进一步验证。本研究中产生的基因组资源对于该红树植物物种的生态、进化和功能基因组学研究应该具有重要价值。