Suppr超能文献

25种草原植物物种的全长转录组

Full-length transcriptomes of 25 grassland plant species.

作者信息

Jiang Chongyi, Huang Zixia, Meizoso Cynthia, Kumpfmüller Gaby, Wolf Jochen B W, Schielzeth Holger

机构信息

Population Ecology Group, Institute of Ecology and Evolution, Friedrich Schiller University, Jena, Germany.

School of Biology and Environmental Science, University College Dublin, Dublin, Ireland.

出版信息

Sci Data. 2025 Jun 2;12(1):922. doi: 10.1038/s41597-025-05280-6.

Abstract

Grasslands are essential, biodiverse ecosystems providing critical ecosystem services. Despite their ecological and economic value, transcriptomic resources for wild grassland species to support eco-evolutionary and functional genomic studies remain limited. Here, we present full-length transcriptomes for shoot tissue from 25 wild grassland plant species collected from a long-term biodiversity experiment (the Jena Experiment). Using PacBio Iso-Seq technology, we generated a total of 522.45 million subreads, which were assembled into unique transcripts for each species independently. This resulted in an average of 49,180 transcripts per species, of which 68.6% were successfully annotated using the Swiss-Prot database. Furthermore, 40.3% of the transcripts contained complete open reading frames (ORFs), while 31.4% had incomplete ORFs. More than 36.8% of the transcripts were identified as non-coding RNAs. On average, 5.08% of the bases across all transcriptomes were flagged as repetitive elements. This dataset offers a valuable full-length transcriptomic resource for studying gene expression, alternative splicing, and evolutionary patterns in grassland species, paving the way for future research in functional genomics and conservation.

摘要

草原是至关重要的、具有生物多样性的生态系统,提供关键的生态系统服务。尽管它们具有生态和经济价值,但用于支持生态进化和功能基因组学研究的野生草原物种的转录组资源仍然有限。在这里,我们展示了从一个长期生物多样性实验(耶拿实验)中收集的25种野生草原植物物种地上组织的全长转录组。使用PacBio Iso-Seq技术,我们总共生成了5.2245亿条子序列,将其分别组装成每个物种的独特转录本。这使得每个物种平均有49180个转录本,其中68.6%使用Swiss-Prot数据库成功注释。此外,40.3%的转录本包含完整的开放阅读框(ORF),而31.4%的转录本具有不完整的ORF。超过36.8%的转录本被鉴定为非编码RNA。所有转录组中平均有5.08%的碱基被标记为重复元件。该数据集为研究草原物种的基因表达、可变剪接和进化模式提供了宝贵的全长转录组资源,为未来功能基因组学和保护研究铺平了道路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f8a/12130298/7586b6ee4643/41597_2025_5280_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验