• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用长读长和双端测序数据构建茶树参考基因组并进行基因注释的改良。

The tea plant reference genome and improved gene annotation using long-read and paired-end sequencing data.

机构信息

State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, 230036, China.

BGI-Shenzhen, Shenzhen, 518083, China.

出版信息

Sci Data. 2019 Jul 15;6(1):122. doi: 10.1038/s41597-019-0127-1.

DOI:10.1038/s41597-019-0127-1
PMID:31308375
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6629666/
Abstract

Tea is a globally consumed non-alcohol beverage with great economic importance. However, lack of the reference genome has largely hampered the utilization of precious tea plant genetic resources towards breeding. To address this issue, we previously generated a high-quality reference genome of tea plant using Illumina and PacBio sequencing technology, which produced a total of 2,124 Gb short and 125 Gb long read data, respectively. A hybrid strategy was employed to assemble the tea genome that has been publicly released. We here described the data framework used to generate, annotate and validate the genome assembly. Besides, we re-predicted the protein-coding genes and annotated their putative functions using more comprehensive omics datasets with improved training models. We reassessed the assembly and annotation quality using the latest version of BUSCO. These data can be utilized to develop new methodologies/tools for better assembly of complex genomes, aid in finding of novel genes, variations and evolutionary clues associated with tea quality, thus help to breed new varieties with high yield and better quality in the future.

摘要

茶是一种全球消费的非酒精饮料,具有重要的经济意义。然而,由于缺乏参考基因组,极大地阻碍了宝贵的茶树遗传资源在培育方面的利用。为了解决这个问题,我们之前使用 Illumina 和 PacBio 测序技术生成了一个高质量的茶树参考基因组,分别产生了总计 2124GB 的短读和 125GB 的长读数据。我们采用了一种混合策略来组装已经公开的茶树基因组。在这里,我们描述了用于生成、注释和验证基因组组装的数据集框架。此外,我们使用更全面的组学数据集和改进的训练模型重新预测了蛋白质编码基因,并注释了它们的可能功能。我们使用最新版本的 BUSCO 重新评估了组装和注释的质量。这些数据可用于开发新的方法/工具,以更好地组装复杂基因组,帮助发现与茶叶质量相关的新基因、变异和进化线索,从而有助于未来培育出高产、优质的新品种。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4566/6629666/a7f4c6d361fd/41597_2019_127_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4566/6629666/4c8b43bc1da3/41597_2019_127_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4566/6629666/ddf06d7caa52/41597_2019_127_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4566/6629666/a7f4c6d361fd/41597_2019_127_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4566/6629666/4c8b43bc1da3/41597_2019_127_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4566/6629666/ddf06d7caa52/41597_2019_127_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4566/6629666/a7f4c6d361fd/41597_2019_127_Fig3_HTML.jpg

相似文献

1
The tea plant reference genome and improved gene annotation using long-read and paired-end sequencing data.利用长读长和双端测序数据构建茶树参考基因组并进行基因注释的改良。
Sci Data. 2019 Jul 15;6(1):122. doi: 10.1038/s41597-019-0127-1.
2
Optimized sequencing depth and de novo assembler for deeply reconstructing the transcriptome of the tea plant, an economically important plant species.优化测序深度和从头组装算法,以深度重建经济重要植物物种茶树的转录组。
BMC Bioinformatics. 2019 Nov 6;20(1):553. doi: 10.1186/s12859-019-3166-x.
3
Deep sequencing of the Camellia sinensis transcriptome revealed candidate genes for major metabolic pathways of tea-specific compounds.对茶树转录组进行深度测序,揭示了茶特异性化合物主要代谢途径的候选基因。
BMC Genomics. 2011 Feb 28;12:131. doi: 10.1186/1471-2164-12-131.
4
The Reference Genome of Tea Plant and Resequencing of 81 Diverse Accessions Provide Insights into Its Genome Evolution and Adaptation.茶树参考基因组和 81 个不同品种的重测序为其基因组进化和适应提供了新见解。
Mol Plant. 2020 Jul 6;13(7):1013-1026. doi: 10.1016/j.molp.2020.04.010. Epub 2020 Apr 27.
5
Draft genome sequence of var. provides insights into the evolution of the tea genome and tea quality.变种的基因组草图序列为研究茶树基因组和茶叶品质的演化提供了线索。
Proc Natl Acad Sci U S A. 2018 May 1;115(18):E4151-E4158. doi: 10.1073/pnas.1719622115. Epub 2018 Apr 20.
6
De novo transcriptome assembly of the wild relative of tea tree (Camellia taliensis) and comparative analysis with tea transcriptome identified putative genes associated with tea quality and stress response.茶树野生近缘种(大理茶)的从头转录组组装及与茶转录组的比较分析,鉴定出与茶叶品质和胁迫响应相关的假定基因。
BMC Genomics. 2015 Apr 15;16(1):298. doi: 10.1186/s12864-015-1494-4.
7
Identification, characterization and utilization of unigene derived microsatellite markers in tea (Camellia sinensis L.).茶树(Camellia sinensis L.)中基于单基因的微卫星标记的鉴定、特征分析及应用
BMC Plant Biol. 2009 May 11;9:53. doi: 10.1186/1471-2229-9-53.
8
Tea Plant Information Archive: a comprehensive genomics and bioinformatics platform for tea plant.茶树信息档案:一个全面的茶树基因组学和生物信息学平台。
Plant Biotechnol J. 2019 Oct;17(10):1938-1953. doi: 10.1111/pbi.13111. Epub 2019 Apr 11.
9
Deciphering tea tree chloroplast and mitochondrial genomes of Camellia sinensis var. assamica.破译阿萨姆种茶树的叶绿体和线粒体基因组。
Sci Data. 2019 Oct 17;6(1):209. doi: 10.1038/s41597-019-0201-8.
10
Genome-wide identification of conserved and novel microRNAs in one bud and two tender leaves of tea plant (Camellia sinensis) by small RNA sequencing, microarray-based hybridization and genome survey scaffold sequences.通过小 RNA 测序、基于微阵列杂交和基因组调查支架序列,在茶树(Camellia sinensis)的一个芽和两片嫩叶中进行全基因组保守和新型 microRNA 的鉴定。
BMC Plant Biol. 2017 Nov 21;17(1):212. doi: 10.1186/s12870-017-1169-1.

引用本文的文献

1
Genetic Diversity and Population Structure of Wild Ancient in Pu'an, Guizhou, China.中国贵州普安野生古茶树的遗传多样性与种群结构
Plants (Basel). 2025 Jun 4;14(11):1709. doi: 10.3390/plants14111709.
2
Transcriptomics in the era of long-read sequencing.长读长测序时代的转录组学
Nat Rev Genet. 2025 Mar 28. doi: 10.1038/s41576-025-00828-z.
3
Evaluation of strategies for evidence-driven genome annotation using long-read RNA-seq.使用长读长RNA测序对证据驱动的基因组注释策略进行评估。

本文引用的文献

1
Tea Plant Information Archive: a comprehensive genomics and bioinformatics platform for tea plant.茶树信息档案:一个全面的茶树基因组学和生物信息学平台。
Plant Biotechnol J. 2019 Oct;17(10):1938-1953. doi: 10.1111/pbi.13111. Epub 2019 Apr 11.
2
Draft genome sequence of var. provides insights into the evolution of the tea genome and tea quality.变种的基因组草图序列为研究茶树基因组和茶叶品质的演化提供了线索。
Proc Natl Acad Sci U S A. 2018 May 1;115(18):E4151-E4158. doi: 10.1073/pnas.1719622115. Epub 2018 Apr 20.
3
Transcriptome Profiling Using Single-Molecule Direct RNA Sequencing Approach for In-depth Understanding of Genes in Secondary Metabolism Pathways of .
Genome Res. 2025 Apr 14;35(4):1053-1064. doi: 10.1101/gr.279864.124.
4
Development of Genome-Wide Intron Length Polymorphism (ILP) Markers in Tea Plant () and Related Applications for Genetics Research.茶树全基因组内含子长度多态性(ILP)标记的开发及其在遗传学研究中的相关应用。
Int J Mol Sci. 2024 Mar 13;25(6):3241. doi: 10.3390/ijms25063241.
5
From comparative gene content and gene order to ancestral contigs, chromosomes and karyotypes.从比较基因含量和基因顺序到祖先的 contigs、染色体和核型。
Sci Rep. 2023 Apr 13;13(1):6095. doi: 10.1038/s41598-023-33029-x.
6
Genome-Wide Identification of AMT2-Type Ammonium Transporters Reveal That CsAMT2.2 and CsAMT2.3 Potentially Regulate NH Absorption among Three Different Cultivars of .全基因组鉴定 AMT2 型铵转运蛋白揭示 CsAMT2.2 和 CsAMT2.3 可能调节 三种不同栽培品种的 NH 吸收。
Int J Mol Sci. 2022 Dec 10;23(24):15661. doi: 10.3390/ijms232415661.
7
United States tea: A synopsis of ongoing tea research and solutions to United States tea production issues.美国茶叶:当前茶叶研究综述及美国茶叶生产问题的解决方案
Front Plant Sci. 2022 Sep 23;13:934651. doi: 10.3389/fpls.2022.934651. eCollection 2022.
8
Biosynthetic Pathway of Proanthocyanidins in Major Cash Crops.主要经济作物中原花青素的生物合成途径。
Plants (Basel). 2021 Aug 28;10(9):1792. doi: 10.3390/plants10091792.
9
Genome-Wide Identification and Expression Patterns of the C2H2-Zinc Finger Gene Family Related to Stress Responses and Catechins Accumulation in [L.] O. Kuntze.与[L.]O. Kuntze中胁迫响应和儿茶素积累相关的C2H2锌指基因家族的全基因组鉴定及表达模式
Int J Mol Sci. 2021 Apr 18;22(8):4197. doi: 10.3390/ijms22084197.
10
MAPK cascade gene family in Camellia sinensis: In-silico identification, expression profiles and regulatory network analysis.茶树中 MAPK 级联基因家族:计算机鉴定、表达谱分析及调控网络分析。
BMC Genomics. 2020 Sep 7;21(1):613. doi: 10.1186/s12864-020-07030-x.
使用单分子直接RNA测序方法进行转录组分析以深入了解[具体对象]次生代谢途径中的基因
Front Plant Sci. 2017 Jul 11;8:1205. doi: 10.3389/fpls.2017.01205. eCollection 2017.
4
The Tea Tree Genome Provides Insights into Tea Flavor and Independent Evolution of Caffeine Biosynthesis.茶树基因组为茶叶风味和咖啡因生物合成的独立进化提供了线索。
Mol Plant. 2017 Jun 5;10(6):866-877. doi: 10.1016/j.molp.2017.04.002. Epub 2017 May 2.
5
Genetic Divergence between Camellia sinensis and Its Wild Relatives Revealed via Genome-Wide SNPs from RAD Sequencing.通过RAD测序的全基因组单核苷酸多态性揭示茶树及其野生近缘种之间的遗传分化
PLoS One. 2016 Mar 10;11(3):e0151424. doi: 10.1371/journal.pone.0151424. eCollection 2016.
6
BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs.BUSCO:利用单拷贝同源基因评估基因组组装和注释的完整性。
Bioinformatics. 2015 Oct 1;31(19):3210-2. doi: 10.1093/bioinformatics/btv351. Epub 2015 Jun 9.
7
StringTie enables improved reconstruction of a transcriptome from RNA-seq reads.StringTie能够从RNA测序读数中更完善地重建转录组。
Nat Biotechnol. 2015 Mar;33(3):290-5. doi: 10.1038/nbt.3122. Epub 2015 Feb 18.
8
Efficient de novo assembly of highly heterozygous genomes from whole-genome shotgun short reads.利用全基因组鸟枪法短读长数据对高杂合基因组进行高效的从头组装。
Genome Res. 2014 Aug;24(8):1384-95. doi: 10.1101/gr.170720.113. Epub 2014 Apr 22.
9
Mind the gap: upgrading genomes with Pacific Biosciences RS long-read sequencing technology.注意差距:使用 Pacific Biosciences RS 长读测序技术升级基因组。
PLoS One. 2012;7(11):e47768. doi: 10.1371/journal.pone.0047768. Epub 2012 Nov 21.
10
HaploMerger: reconstructing allelic relationships for polymorphic diploid genome assemblies.HaploMerger:重构多态二倍体基因组组装的等位基因关系。
Genome Res. 2012 Aug;22(8):1581-8. doi: 10.1101/gr.133652.111. Epub 2012 May 3.