Suppr超能文献

中华扁跳甲高质量基因组草图:一种土壤模式生物(弹尾目)

A High-quality Draft Genome Assembly of Sinella curviseta: A Soil Model Organism (Collembola).

机构信息

Department of Entomology, College of Plant Protection, Nanjing Agricultural University.

Key Laboratory of the Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.

出版信息

Genome Biol Evol. 2019 Feb 1;11(2):521-530. doi: 10.1093/gbe/evz013.

Abstract

Sinella curviseta, among the most widespread springtails (Collembola) in Northern Hemisphere, has often been treated as a model organism in soil ecology and environmental toxicology. However, little information on its genetic knowledge severely hinders our understanding of its adaptations to the soil habitat. We present the largest genome assembly within Collembola using ∼44.86 Gb (118X) of single-molecule real-time Pacific Bioscience Sequel sequencing. The final assembly of 599 scaffolds was ∼381.46 Mb with a N50 length of 3.28 Mb, which captured 95.3% complete and 1.5% partial arthropod Benchmarking Universal Single-Copy Orthologs (n = 1066). Transcripts and circularized mitochondrial genome were also assembled. We predicted 23,943 protein-coding genes, of which 83.88% were supported by transcriptome-based evidence and 82.49% matched protein records in UniProt. In addition, we also identified 222,501 repeats and 881 noncoding RNAs. Phylogenetic reconstructions for Collembola support Tomoceridae sistered to the remaining Entomobryomorpha with the position of Symphypleona not fully resolved. Gene family evolution analyses identified 9,898 gene families, of which 156 experienced significant expansions or contractions. Our high-quality reference genome of S. curviseta provides the genetic basis for future investigations in evolutionary biology, soil ecology, and ecotoxicology.

摘要

狭跗小跳虫(Sinella curviseta)是北半球分布最广泛的跳虫(弹尾目)之一,常被作为土壤生态学和环境毒理学的模式生物。然而,有关其遗传知识的信息很少,严重阻碍了我们对其适应土壤生境的理解。我们使用单分子实时太平洋生物科学测序(SMRT)技术获得的约 44.86 Gb(118X)数据,生成了弹尾目最大的基因组组装。最终的组装结果由 599 个支架组成,大小约为 381.46 Mb,N50 长度为 3.28 Mb,捕获了 95.3%完整和 1.5%部分节肢动物基准通用单拷贝直系同源物(n=1066)。还组装了转录本和环状化的线粒体基因组。我们预测了 23943 个蛋白编码基因,其中 83.88%得到转录组证据的支持,82.49%与 UniProt 中的蛋白记录匹配。此外,还鉴定了 222501 个重复序列和 881 个非编码 RNA。弹尾目系统发育重建支持 Tomoceridae 与剩余的 Entomobryomorpha 姐妹群关系,而 Symphypleona 的位置尚未完全确定。基因家族进化分析确定了 9898 个基因家族,其中 156 个经历了显著的扩张或收缩。我们获得的狭跗小跳虫高质量参考基因组为进化生物学、土壤生态学和生态毒理学的未来研究提供了遗传基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8a3/6389355/1a28d4c6b57a/evz013f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验