Richardson Mark F, Sherman Craig D H
Deakin University, Geelong, Australia. School of Life and Environmental Sciences, Centre for Integrative Ecology, (Waurn Ponds Campus). 75 Pigdons Road. Locked Bag 20000, Geelong, VIC 3220, Australia.
PLoS One. 2015 Nov 3;10(11):e0142003. doi: 10.1371/journal.pone.0142003. eCollection 2015.
Invasive species are a major threat to global biodiversity but can also serve as valuable model systems to examine important evolutionary processes. While the ecological aspects of invasions have been well documented, the genetic basis of adaptive change during the invasion process has been hampered by a lack of genomic resources for the majority of invasive species. Here we report the first larval transcriptomic resource for the Northern Pacific Seastar, Asterias amurensis, an invasive marine predator in Australia. Approximately 117.5 million 100 base-pair (bp) paired-end reads were sequenced from a single RNA-Seq library from a pooled set of full-sibling A. amurensis bipinnaria larvae. We evaluated the efficacy of a pre-assembly error correction pipeline on subsequent de novo assembly. Error correction resulted in small but important improvements to the final assembly in terms of mapping statistics and core eukaryotic genes representation. The error-corrected de novo assembly resulted in 115,654 contigs after redundancy clustering. 41,667 assembled contigs were homologous to sequences from NCBI's non-redundant protein and UniProt databases. We assigned Gene Ontology, KEGG Orthology, Pfam protein domain terms and predicted protein-coding sequences to > 36,000 contigs. The final transcriptome dataset generated here provides functional information for 18,319 unique proteins, comprising at least 11,355 expressed genes. Furthermore, we identified 9,739 orthologs to P. miniata proteins, evaluated our annotation pipeline and generated a list of 150 candidate genes for responses to several environmental stressors that may be important for adaptation of A. amurensis in the invasive range. Our study has produced a large set of A. amurensis RNA contigs with functional annotations that can serve as a resource for future comparisons to other echinoderm transcriptomes and gene expression studies. Our data can be used to study the genetic basis of adaptive change and other important evolutionary processes during a successful invasion.
入侵物种是全球生物多样性的主要威胁,但也可以作为研究重要进化过程的宝贵模型系统。虽然入侵的生态方面已有充分记录,但由于大多数入侵物种缺乏基因组资源,入侵过程中适应性变化的遗传基础研究受到了阻碍。在此,我们报告了北太平洋海星(Asterias amurensis)的首个幼虫转录组资源,它是澳大利亚的一种入侵性海洋捕食者。从一组全同胞的A. amurensis双羽幼虫的单个RNA-Seq文库中,对约1.175亿个100碱基对(bp)的双端读段进行了测序。我们评估了预组装纠错流程对后续从头组装的效果。纠错在映射统计和核心真核基因代表性方面对最终组装产生了虽小但重要的改进。经过纠错的从头组装在冗余聚类后产生了115,654个重叠群。41,667个组装重叠群与NCBI的非冗余蛋白质和UniProt数据库中的序列同源。我们为超过36,000个重叠群分配了基因本体论、KEGG直系同源、Pfam蛋白质结构域术语并预测了蛋白质编码序列。此处生成的最终转录组数据集为18,319种独特蛋白质提供了功能信息,包括至少11,355个表达基因。此外,我们鉴定出了9,739个与P. miniata蛋白质的直系同源物,评估了我们的注释流程,并生成了一份150个候选基因的列表,这些基因与对几种环境应激源的反应有关,可能对A. amurensis在入侵范围内的适应很重要。我们的研究产生了大量带有功能注释的A. amurensis RNA重叠群,可作为未来与其他棘皮动物转录组和基因表达研究进行比较的资源。我们的数据可用于研究成功入侵过程中适应性变化和其他重要进化过程的遗传基础。