• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

两个重要高粱自交系Tx2783和RTx436的高质量染色体水平基因组组装。

High-quality chromosome scale genome assemblies of two important Sorghum inbred lines, Tx2783 and RTx436.

作者信息

Wang Bo, Chougule Kapeel, Jiao Yinping, Olson Andrew, Kumar Vivek, Gladman Nicholas, Huang Jian, Llaca Victor, Fengler Kevin, Wei Xuehong, Wang Liya, Wang Xiaofei, Regulski Michael, Drenkow Jorg, Gingeras Thomas, Hayes Chad, Armstrong J Scott, Huang Yinghua, Xin Zhanguo, Ware Doreen

机构信息

Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.

Texas Tech University, 1006 Canton Ave, Lubbock, TX 79409-2122, USA.

出版信息

NAR Genom Bioinform. 2024 Aug 9;6(3):lqae097. doi: 10.1093/nargab/lqae097. eCollection 2024 Sep.

DOI:10.1093/nargab/lqae097
PMID:39131819
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11310780/
Abstract

(L.) Moench is a significant grass crop globally, known for its genetic diversity. High quality genome sequences are needed to capture the diversity. We constructed high-quality, chromosome-level genome assemblies for two vital sorghum inbred lines, Tx2783 and RTx436. Through advanced single-molecule techniques, long-read sequencing and optical maps, we improved average sequence continuity 19-fold and 11-fold higher compared to existing Btx623 v3.0 reference genome and obtained 19 and 18 scaffolds (N50 of 25.6 and 14.4) for Tx2783 and RTx436, respectively. Our gene annotation efforts resulted in 29 612 protein-coding genes for the Tx2783 genome and 29 265 protein-coding genes for the RTx436 genome. Comparative analyses with 26 plant genomes which included 18 sorghum genomes and 8 outgroup species identified around 31 210 protein-coding gene families, with about 13 956 specific to sorghum. Using representative models from gene trees across the 18 sorghum genomes, a total of 72 579 pan-genes were identified, with 14% core, 60% softcore and 26% shell genes. We identified 99 genes in Tx2783 and 107 genes in RTx436 that showed functional enrichment specifically in binding and metabolic processes, as revealed by the GO enrichment Pearson Chi-Square test. We detected 36 potential large inversions in the comparison between the BTx623 Bionano map and the BTx623 v3.1 reference sequence. Strikingly, these inversions were notably absent when comparing Tx2783 or RTx436 with the BTx623 Bionano map. These inversion were mostly in the pericentromeric region which is known to have low complexity regions and harder to assemble and suggests the presence of potential artifacts in the public BTx623 reference assembly. Furthermore, in comparison to Tx2783, RTx436 exhibited 324 883 additional Single Nucleotide Polymorphisms (SNPs) and 16 506 more Insertions/Deletions (INDELs) when using BTx623 as the reference genome. We also characterized approximately 348 nucleotide-binding leucine-rich repeat (NLR) disease resistance genes in the two genomes. These high-quality genomes serve as valuable resources for discovering agronomic traits and structural variation studies.

摘要

高粱(L.)Moench是全球重要的禾本科作物,以其遗传多样性而闻名。需要高质量的基因组序列来捕捉这种多样性。我们为两个重要的高粱自交系Tx2783和RTx436构建了高质量的染色体水平基因组组装。通过先进的单分子技术、长读长测序和光学图谱,与现有的Btx623 v3.0参考基因组相比,我们将平均序列连续性提高了19倍和11倍,分别为Tx2783和RTx436获得了19个和18个支架(N50分别为25.6和14.4)。我们的基因注释工作为Tx2783基因组产生了29612个蛋白质编码基因,为RTx436基因组产生了29265个蛋白质编码基因。与26个植物基因组(包括18个高粱基因组和8个外类群物种)的比较分析确定了约31210个蛋白质编码基因家族,其中约13956个是高粱特有的。利用18个高粱基因组中基因树的代表性模型,共鉴定出72579个泛基因,其中14%为核心基因,60%为软核心基因,26%为外壳基因。通过GO富集Pearson卡方检验,我们在Tx2783中鉴定出99个基因,在RTx436中鉴定出107个基因,这些基因在结合和代谢过程中表现出功能富集。在BTx623 Bionano图谱与BTx623 v3.1参考序列的比较中,我们检测到36个潜在的大倒位。令人惊讶的是,在将Tx2783或RTx436与BTx623 Bionano图谱进行比较时,这些倒位明显不存在。这些倒位大多位于着丝粒周围区域,该区域已知具有低复杂性区域且难以组装,这表明公共BTx623参考组装中存在潜在的人工产物。此外,以BTx623作为参考基因组时,与Tx2783相比,RTx436表现出另外324883个单核苷酸多态性(SNP)和16506个更多的插入/缺失(INDEL)。我们还对这两个基因组中的约348个核苷酸结合富含亮氨酸重复序列(NLR)抗病基因进行了表征。这些高质量的基因组为发现农艺性状和结构变异研究提供了宝贵的资源。

相似文献

1
High-quality chromosome scale genome assemblies of two important Sorghum inbred lines, Tx2783 and RTx436.两个重要高粱自交系Tx2783和RTx436的高质量染色体水平基因组组装。
NAR Genom Bioinform. 2024 Aug 9;6(3):lqae097. doi: 10.1093/nargab/lqae097. eCollection 2024 Sep.
2
A chromosome-scale assembly of the sorghum genome using nanopore sequencing and optical mapping.利用纳米孔测序和光学作图技术实现高粱染色体水平的基因组组装。
Nat Commun. 2018 Nov 19;9(1):4844. doi: 10.1038/s41467-018-07271-1.
3
The Sorghum bicolor reference genome: improved assembly, gene annotations, a transcriptome atlas, and signatures of genome organization.高粱参考基因组:改进的组装、基因注释、转录组图谱和基因组结构特征。
Plant J. 2018 Jan;93(2):338-354. doi: 10.1111/tpj.13781. Epub 2017 Dec 28.
4
Digital genotyping of sorghum - a diverse plant species with a large repeat-rich genome.高粱的数字基因分型——一种具有丰富重复序列的大型基因组的多样化植物物种。
BMC Genomics. 2013 Jul 5;14:448. doi: 10.1186/1471-2164-14-448.
5
SorGSD: updating and expanding the sorghum genome science database with new contents and tools.高粱基因组科学数据库(SorGSD):用新内容和工具更新并扩展高粱基因组科学数据库
Biotechnol Biofuels. 2021 Aug 3;14(1):165. doi: 10.1186/s13068-021-02016-7.
6
Genome-Wide Association Study for Major Biofuel Traits in Sorghum Using Minicore Collection.利用核心种质资源对高粱主要生物燃料性状进行全基因组关联研究。
Protein Pept Lett. 2021;28(8):909-928. doi: 10.2174/0929866528666210215141243.
7
Telomere-to-telomere genome assembly of sorghum.高粱端粒到端粒基因组组装。
Sci Data. 2024 Aug 2;11(1):835. doi: 10.1038/s41597-024-03664-8.
8
RAD-seq-Based High-Density Linkage Map Construction and QTL Mapping of Biomass-Related Traits in Sorghum using the Japanese Landrace Takakibi NOG.基于 RAD-seq 的高粱日本地方品种高千穗高密度连锁图谱构建及与生物量相关性状的 QTL 定位
Plant Cell Physiol. 2020 Jul 1;61(7):1262-1272. doi: 10.1093/pcp/pcaa056.
9
Sorghum Pan-Genome Explores the Functional Utility for Genomic-Assisted Breeding to Accelerate the Genetic Gain.高粱泛基因组探索基因组辅助育种的功能效用以加速遗传增益。
Front Plant Sci. 2021 Jun 1;12:666342. doi: 10.3389/fpls.2021.666342. eCollection 2021.
10
Development of genome-wide simple sequence repeat markers using whole-genome shotgun sequences of sorghum (Sorghum bicolor (L.) Moench).利用高粱(双色高粱(L.)Moench)全基因组鸟枪法测序开发全基因组简单序列重复标记
DNA Res. 2009 Jun;16(3):187-93. doi: 10.1093/dnares/dsp005. Epub 2009 Apr 10.

引用本文的文献

1
 (Poaceae), a new species from Xizang, China.禾本科(Poaceae),中国西藏的一个新物种。
PhytoKeys. 2025 Jun 3;257:65-78. doi: 10.3897/phytokeys.257.151771. eCollection 2025.

本文引用的文献

1
A super pan-genomic landscape of rice.水稻的超级泛基因组景观。
Cell Res. 2022 Oct;32(10):878-896. doi: 10.1038/s41422-022-00685-z. Epub 2022 Jul 12.
2
TEsorter: an accurate and fast method to classify LTR-retrotransposons in plant genomes.TEsorter:一种准确快速的方法,用于对植物基因组中的长末端重复逆转录转座子进行分类。
Hortic Res. 2022 Feb 19;9. doi: 10.1093/hr/uhac017.
3
SorghumBase: a web-based portal for sorghum genetic information and community advancement.高粱数据库:一个基于网络的高粱遗传信息资源和社区服务平台。
Planta. 2022 Jan 11;255(2):35. doi: 10.1007/s00425-022-03821-6.
4
Foster thy young: enhanced prediction of orphan genes in assembled genomes.培育你的年轻人:在组装的基因组中增强预测孤儿基因。
Nucleic Acids Res. 2022 Apr 22;50(7):e37. doi: 10.1093/nar/gkab1238.
5
De novo assembly, annotation, and comparative analysis of 26 diverse maize genomes.从头组装、注释和 26 个不同玉米基因组的比较分析。
Science. 2021 Aug 6;373(6555):655-662. doi: 10.1126/science.abg5289.
6
Ranked choice voting for representative transcripts with TRaCE.用 TRaCE 进行代表转录的排名选择投票。
Bioinformatics. 2021 Dec 22;38(1):261-264. doi: 10.1093/bioinformatics/btab542.
7
Two gap-free reference genomes and a global view of the centromere architecture in rice.无间隙参考基因组揭示水稻着丝粒结构的整体特征。
Mol Plant. 2021 Oct 4;14(10):1757-1767. doi: 10.1016/j.molp.2021.06.018. Epub 2021 Jun 24.
8
Extensive variation within the pan-genome of cultivated and wild sorghum.栽培高粱和野生高粱泛基因组内的广泛变异。
Nat Plants. 2021 Jun;7(6):766-773. doi: 10.1038/s41477-021-00925-x. Epub 2021 May 20.
9
Gramene 2021: harnessing the power of comparative genomics and pathways for plant research.Gramene 2021:利用比较基因组学和途径为植物研究提供支持。
Nucleic Acids Res. 2021 Jan 8;49(D1):D1452-D1463. doi: 10.1093/nar/gkaa979.
10
A platinum standard pan-genome resource that represents the population structure of Asian rice.一个代表亚洲稻米群体结构的铂金标准泛基因组资源。
Sci Data. 2020 Apr 7;7(1):113. doi: 10.1038/s41597-020-0438-2.