• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

改良的染色体 12 重建和比较分析以纠正陆地棉的错误组装。

Improved reconstruction and comparative analysis of chromosome 12 to rectify Mis-assemblies in Gossypium arboreum.

机构信息

Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, 455000, China.

Genomics Lab, Department of Plant Breeding and Genetics, Faculty of Agricultural Sciences and Technology, Bahauddin Zakariya University, Multan, Punjab, 60000, Pakistan.

出版信息

BMC Genomics. 2020 Jul 8;21(1):470. doi: 10.1186/s12864-020-06814-5.

DOI:10.1186/s12864-020-06814-5
PMID:32640982
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7346634/
Abstract

BACKGROUND

Genome sequencing technologies have been improved at an exponential pace but precise chromosome-scale genome assembly still remains a great challenge. The draft genome of cultivated G. arboreum was sequenced and assembled with shotgun sequencing approach, however, it contains several misassemblies. To address this issue, we generated an improved reassembly of G. arboreum chromosome 12 using genetic mapping and reference-assisted approaches and evaluated this reconstruction by comparing with homologous chromosomes of G. raimondii and G. hirsutum.

RESULTS

In this study, we generated a high quality assembly of the 94.64 Mb length of G. arboreum chromosome 12 (A_A12) which comprised of 144 scaffolds and contained 3361 protein coding genes. Evaluation of results using syntenic and collinear analysis of reconstructed G. arboreum chromosome A_A12 with its homologous chromosomes of G. raimondii (D_D08) and G. hirsutum (AD_A12 and AD_D12) confirmed the significant improved quality of current reassembly as compared to previous one. We found major misassemblies in previously assembled chromosome 12 (A_Ca9) of G. arboreum particularly in anchoring and orienting of scaffolds into a pseudo-chromosome. Further, homologous chromosomes 12 of G. raimondii (D_D08) and G. arboreum (A_A12) contained almost equal number of transcription factor (TF) related genes, and showed good collinear relationship with each other. As well, a higher rate of gene loss was found in corresponding homologous chromosomes of tetraploid (AD_A12 and AD_D12) than diploid (A_A12 and D_D08) cotton, signifying that gene loss is likely a continuing process in chromosomal evolution of tetraploid cotton.

CONCLUSION

This study offers a more accurate strategy to correct misassemblies in sequenced draft genomes of cotton which will provide further insights towards its genome organization.

摘要

背景

基因组测序技术的发展速度呈指数级增长,但精确的染色体级基因组组装仍然是一个巨大的挑战。经过测序和组装,已经获得了栽培棉种亚洲棉的基因组草图,但它包含了一些错误的组装。为了解决这个问题,我们利用遗传图谱和参考辅助方法对亚洲棉第 12 号染色体进行了重新组装,并与雷蒙德氏棉和陆地棉的同源染色体进行了比较。

结果

本研究生成了亚洲棉第 12 号染色体(A_A12)的高质量组装,其长度为 94.64Mb,包含 144 个支架,包含 3361 个蛋白质编码基因。通过对重建的亚洲棉第 12 号染色体 A_A12 与其同源染色体雷蒙德氏棉(D_D08)和陆地棉(AD_A12 和 AD_D12)的同线性和共线性分析,评估结果表明,与之前的组装相比,当前的重新组装质量有了显著提高。我们发现,以前组装的亚洲棉第 12 号染色体(A_Ca9)中存在主要的错误组装,特别是在支架的锚定和定向到假染色体上。此外,雷蒙德氏棉(D_D08)和亚洲棉(A_A12)的第 12 号同源染色体包含几乎相同数量的转录因子(TF)相关基因,并且彼此之间存在良好的共线性关系。同样,在四倍体(AD_A12 和 AD_D12)的相应同源染色体中发现了更高的基因丢失率,而在二倍体(A_A12 和 D_D08)中则较低,这表明基因丢失很可能是四倍体棉花染色体进化过程中的一个持续过程。

结论

本研究为纠正棉花测序草图基因组中的错误组装提供了更准确的策略,这将为其基因组组织提供进一步的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06f4/7346634/0ba7591b166c/12864_2020_6814_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06f4/7346634/5c2c19293159/12864_2020_6814_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06f4/7346634/2e7f4d21aabc/12864_2020_6814_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06f4/7346634/1d4cd8881023/12864_2020_6814_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06f4/7346634/ea4c60814a4d/12864_2020_6814_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06f4/7346634/0b748b17f0b0/12864_2020_6814_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06f4/7346634/0ba7591b166c/12864_2020_6814_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06f4/7346634/5c2c19293159/12864_2020_6814_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06f4/7346634/2e7f4d21aabc/12864_2020_6814_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06f4/7346634/1d4cd8881023/12864_2020_6814_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06f4/7346634/ea4c60814a4d/12864_2020_6814_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06f4/7346634/0b748b17f0b0/12864_2020_6814_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06f4/7346634/0ba7591b166c/12864_2020_6814_Fig6_HTML.jpg

相似文献

1
Improved reconstruction and comparative analysis of chromosome 12 to rectify Mis-assemblies in Gossypium arboreum.改良的染色体 12 重建和比较分析以纠正陆地棉的错误组装。
BMC Genomics. 2020 Jul 8;21(1):470. doi: 10.1186/s12864-020-06814-5.
2
Genome-wide comparative analysis of NBS-encoding genes in four Gossypium species.四种棉属植物中NBS编码基因的全基因组比较分析。
BMC Genomics. 2017 Apr 12;18(1):292. doi: 10.1186/s12864-017-3682-x.
3
Development of chromosome-specific markers with high polymorphism for allotetraploid cotton based on genome-wide characterization of simple sequence repeats in diploid cottons (Gossypium arboreum L. and Gossypium raimondii Ulbrich).基于二倍体棉花(亚洲棉和雷蒙德氏棉)简单序列重复序列的全基因组特征,开发具有高多态性的异源四倍体棉花染色体特异性标记。
BMC Genomics. 2015 Feb 6;16(1):55. doi: 10.1186/s12864-015-1265-2.
4
Rapid evolutionary divergence of diploid and allotetraploid Gossypium mitochondrial genomes.二倍体和异源四倍体棉线粒体基因组的快速进化分歧。
BMC Genomics. 2017 Nov 13;18(1):876. doi: 10.1186/s12864-017-4282-5.
5
Chromosome structural changes in diploid and tetraploid A genomes of Gossypium.棉花二倍体和四倍体A基因组中的染色体结构变化
Genome. 2006 Apr;49(4):336-45. doi: 10.1139/g05-116.
6
Genome-wide analysis of the MADS-box gene family in polyploid cotton (Gossypium hirsutum) and in its diploid parental species (Gossypium arboreum and Gossypium raimondii).多倍体棉花(Gossypium hirsutum)及其二倍体亲种(Gossypium arboreum 和 Gossypium raimondii)的 MADS-box 基因家族的全基因组分析。
Plant Physiol Biochem. 2018 Jun;127:169-184. doi: 10.1016/j.plaphy.2018.03.019. Epub 2018 Mar 20.
7
The draft genome of a diploid cotton Gossypium raimondii.二倍体棉种雷蒙德氏棉的基因组草图。
Nat Genet. 2012 Oct;44(10):1098-103. doi: 10.1038/ng.2371. Epub 2012 Aug 26.
8
Genome-wide investigation and transcriptome analysis of the WRKY gene family in Gossypium.棉花中WRKY基因家族的全基因组研究与转录组分析
Mol Genet Genomics. 2015 Feb;290(1):151-71. doi: 10.1007/s00438-014-0904-7. Epub 2014 Sep 5.
9
Genome-Wide Analysis of the NF-YB Gene Family in Gossypium hirsutum L. and Characterization of the Role of GhDNF-YB22 in Embryogenesis.棉花 NF-YB 基因家族的全基因组分析及 GhDNF-YB22 在胚胎发生中的功能鉴定。
Int J Mol Sci. 2018 Feb 6;19(2):483. doi: 10.3390/ijms19020483.
10
Distribution of 5S and 18S-28S rDNA loci in a tetraploid cotton (Gossypium hirsutum L.) and its putative diploid ancestors.5S和18S - 28S核糖体DNA位点在四倍体棉花(陆地棉)及其假定二倍体祖先中的分布。
Chromosoma. 1996 Jul;105(1):55-61. doi: 10.1007/BF02510039.

本文引用的文献

1
Evolutionary superscaffolding and chromosome anchoring to improve Anopheles genome assemblies.进化超级支架和染色体锚定以改进按蚊基因组组装。
BMC Biol. 2020 Jan 2;18(1):1. doi: 10.1186/s12915-019-0728-3.
2
Systematic analysis of NAC transcription factors in uncovers their roles in response to Verticillium wilt.对NAC转录因子的系统分析揭示了它们在应对黄萎病中的作用。 (注:原英文文本似乎不完整,“in”后面缺少具体内容,这里是根据已有内容尽量通顺翻译)
PeerJ. 2019 Nov 5;7:e7995. doi: 10.7717/peerj.7995. eCollection 2019.
3
Re-sequencing and optical mapping reveals misassemblies and real inversions on Corynebacterium pseudotuberculosis genomes.
重测序和光学作图揭示了假结核棒状杆菌基因组中的错误组装和真实倒位。
Sci Rep. 2019 Nov 8;9(1):16387. doi: 10.1038/s41598-019-52695-4.
4
Integrating Hi-C links with assembly graphs for chromosome-scale assembly.将 Hi-C 链接与组装图整合用于染色体尺度的组装。
PLoS Comput Biol. 2019 Aug 21;15(8):e1007273. doi: 10.1371/journal.pcbi.1007273. eCollection 2019 Aug.
5
De novo phased assembly of the Vitis riparia grape genome.从头组装野生葡萄基因组。
Sci Data. 2019 Jul 19;6(1):127. doi: 10.1038/s41597-019-0133-3.
6
The tea plant reference genome and improved gene annotation using long-read and paired-end sequencing data.利用长读长和双端测序数据构建茶树参考基因组并进行基因注释的改良。
Sci Data. 2019 Jul 15;6(1):122. doi: 10.1038/s41597-019-0127-1.
7
The Third Revolution in Sequencing Technology.测序技术的第三次革命。
Trends Genet. 2018 Sep;34(9):666-681. doi: 10.1016/j.tig.2018.05.008. Epub 2018 Jun 22.
8
Genome-wide comparative and evolutionary analysis of Calmodulin-binding Transcription Activator (CAMTA) family in Gossypium species.基因组范围比较和进化分析棉属物种中的钙调素结合转录激活因子(CAMTA)家族。
Sci Rep. 2018 Apr 3;8(1):5573. doi: 10.1038/s41598-018-23846-w.
9
Genomic insights into divergence and dual domestication of cultivated allotetraploid cottons.栽培异源四倍体棉花分化与双重驯化的基因组学见解。
Genome Biol. 2017 Feb 20;18(1):33. doi: 10.1186/s13059-017-1167-5.
10
PlantTFDB 4.0: toward a central hub for transcription factors and regulatory interactions in plants.植物转录因子数据库4.0:迈向植物转录因子与调控互作的核心枢纽
Nucleic Acids Res. 2017 Jan 4;45(D1):D1040-D1045. doi: 10.1093/nar/gkw982. Epub 2016 Oct 24.