• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iLoci:对临时和成熟基因组组装的基因组内容与组织进行稳健评估。

iLoci: robust evaluation of genome content and organization for provisional and mature genome assemblies.

作者信息

Standage Daniel S, Lai Tim, Brendel Volker P

机构信息

Department of Biology, Indiana University, Bloomington, IN 47405, USA.

Department of Mathematics, Indiana University, Bloomington, IN 47405, USA.

出版信息

NAR Genom Bioinform. 2022 Feb 22;4(1):lqac013. doi: 10.1093/nargab/lqac013. eCollection 2022 Mar.

DOI:10.1093/nargab/lqac013
PMID:35211671
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8862717/
Abstract

We introduce a new framework for genome analyses based on parsing an annotated genome assembly into distinct interval loci (iLoci), available as open-source software as part of the AEGeAn Toolkit (https://github.com/BrendelGroup/AEGeAn). We demonstrate that iLoci provide an alternative coordinate system that is robust to changes in assembly and annotation versions and facilitates granular quality control of genome data. We discuss how statistics computed on iLoci reflect various characteristics of genome content and organization and illustrate how these statistics can be used to establish a baseline for assessment of the completeness and accuracy of the data. We also introduce a well-defined measure of relative genome compactness and compute other iLocus statistics that reveal genome-wide characteristics of gene arrangements in the whole genome context. Given the fast pace of assembly/annotation updates, our AEGeAn Toolkit fills a niche in computational genomics based on deriving persistent and species-specific genome statistics. Gene structure model-centric iLoci provide a precisely defined coordinate system that can be used to store assembly/annotation updates that reflect either stable or changed assessments. Large-scale application of the approach revealed species- and clade-specific genome organization in precisely defined computational terms, promising intriguing forays into the forces of shaping genome structure as more and more genome assemblies are being deposited.

摘要

我们引入了一种新的基因组分析框架,该框架基于将带注释的基因组组装解析为不同的区间位点(iLoci),作为AEGeAn工具包(https://github.com/BrendelGroup/AEGeAn)的一部分以开源软件形式提供。我们证明,iLoci提供了一种替代坐标系,该坐标系对组装和注释版本的变化具有鲁棒性,并有助于对基因组数据进行精细的质量控制。我们讨论了在iLoci上计算的统计数据如何反映基因组内容和组织的各种特征,并说明了如何使用这些统计数据来建立评估数据完整性和准确性的基线。我们还引入了一种定义明确的相对基因组紧凑度度量,并计算了其他iLocus统计数据,这些数据揭示了全基因组背景下基因排列的全基因组特征。鉴于组装/注释更新的快速步伐,我们的AEGeAn工具包基于推导持久且特定于物种的基因组统计数据,填补了计算基因组学的一个空白。以基因结构模型为中心的iLoci提供了一个精确定义的坐标系,可用于存储反映稳定或变化评估的组装/注释更新。该方法的大规模应用以精确的计算术语揭示了特定物种和进化枝的基因组组织,随着越来越多的基因组组装被存入,有望对塑造基因组结构的力量进行有趣的探索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4da7/8862717/6b12142a057f/lqac013fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4da7/8862717/67e3ae9fa085/lqac013fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4da7/8862717/7d1e68c12017/lqac013fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4da7/8862717/1cd9d75cf9f0/lqac013fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4da7/8862717/eb204340e3b4/lqac013fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4da7/8862717/1b1dd65af0c6/lqac013fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4da7/8862717/6b12142a057f/lqac013fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4da7/8862717/67e3ae9fa085/lqac013fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4da7/8862717/7d1e68c12017/lqac013fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4da7/8862717/1cd9d75cf9f0/lqac013fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4da7/8862717/eb204340e3b4/lqac013fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4da7/8862717/1b1dd65af0c6/lqac013fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4da7/8862717/6b12142a057f/lqac013fig6.jpg

相似文献

1
iLoci: robust evaluation of genome content and organization for provisional and mature genome assemblies.iLoci:对临时和成熟基因组组装的基因组内容与组织进行稳健评估。
NAR Genom Bioinform. 2022 Feb 22;4(1):lqac013. doi: 10.1093/nargab/lqac013. eCollection 2022 Mar.
2
GenomeQC: a quality assessment tool for genome assemblies and gene structure annotations.基因组 QC:基因组组装和基因结构注释的质量评估工具。
BMC Genomics. 2020 Mar 2;21(1):193. doi: 10.1186/s12864-020-6568-2.
3
SQUAT: a Sequencing Quality Assessment Tool for data quality assessments of genome assemblies.SQUAT:用于基因组组装数据质量评估的测序质量评估工具。
BMC Genomics. 2019 Apr 18;19(Suppl 9):238. doi: 10.1186/s12864-019-5445-3.
4
RGAAT: A Reference-based Genome Assembly and Annotation Tool for New Genomes and Upgrade of Known Genomes.RGAAT:一种基于参考的基因组组装和注释工具,用于新基因组和已知基因组的升级。
Genomics Proteomics Bioinformatics. 2018 Oct;16(5):373-381. doi: 10.1016/j.gpb.2018.03.006. Epub 2018 Dec 21.
5
miniBUSCO: a faster and more accurate reimplementation of BUSCO.迷你BUSCO:BUSCO的一个更快且更准确的重新实现版本。
bioRxiv. 2023 Jun 6:2023.06.03.543588. doi: 10.1101/2023.06.03.543588.
6
Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.对GRCh38和从头单倍体基因组组装的评估证明了参考组装的持久质量。
Genome Res. 2017 May;27(5):849-864. doi: 10.1101/gr.213611.116. Epub 2017 Apr 10.
7
iLOCi: a SNP interaction prioritization technique for detecting epistasis in genome-wide association studies.iLOCi:一种 SNP 相互作用优先级技术,用于检测全基因组关联研究中的上位性。
BMC Genomics. 2012;13 Suppl 7(Suppl 7):S2. doi: 10.1186/1471-2164-13-S7-S2. Epub 2012 Dec 13.
8
MitoZ: a toolkit for animal mitochondrial genome assembly, annotation and visualization.MitoZ:动物线粒体基因组组装、注释和可视化的工具包。
Nucleic Acids Res. 2019 Jun 20;47(11):e63. doi: 10.1093/nar/gkz173.
9
scanPAV: a pipeline for extracting presence-absence variations in genome pairs.scanPAV:用于提取基因组对中存在-缺失变异的管道。
Bioinformatics. 2018 Sep 1;34(17):3022-3024. doi: 10.1093/bioinformatics/bty189.
10
Improved annotation with de novo transcriptome assembly in four social amoeba species.通过对四种社会性变形虫物种进行从头转录组组装来改进注释。
BMC Genomics. 2017 Jan 31;18(1):120. doi: 10.1186/s12864-017-3505-0.

本文引用的文献

1
Repertoire-wide gene structure analyses: a case study comparing automatically predicted and manually annotated gene models.全面的基因结构分析:自动预测和手动注释基因模型的比较案例研究。
BMC Genomics. 2019 Oct 17;20(1):753. doi: 10.1186/s12864-019-6064-8.
2
A hybrid de novo genome assembly of the honeybee, Apis mellifera, with chromosome-length scaffolds.具有染色体级别的蜜蜂(Apis mellifera)从头杂交基因组组装。
BMC Genomics. 2019 Apr 8;20(1):275. doi: 10.1186/s12864-019-5642-0.
3
Araport11: a complete reannotation of the Arabidopsis thaliana reference genome.
Araport11:拟南芥参考基因组的完整重新注释。
Plant J. 2017 Feb;89(4):789-804. doi: 10.1111/tpj.13415. Epub 2017 Feb 10.
4
Genome, transcriptome and methylome sequencing of a primitively eusocial wasp reveal a greatly reduced DNA methylation system in a social insect.一种原始群居黄蜂的基因组、转录组和甲基化组测序揭示了一种群居昆虫中大幅简化的DNA甲基化系统。
Mol Ecol. 2016 Apr;25(8):1769-84. doi: 10.1111/mec.13578. Epub 2016 Mar 15.
5
BRAKER1: Unsupervised RNA-Seq-Based Genome Annotation with GeneMark-ET and AUGUSTUS.BRAKER1:基于RNA测序的无监督基因组注释,结合GeneMark-ET和AUGUSTUS
Bioinformatics. 2016 Mar 1;32(5):767-9. doi: 10.1093/bioinformatics/btv661. Epub 2015 Nov 11.
6
Comparative genomic data of the Avian Phylogenomics Project.《鸟类系统基因组学计划的比较基因组学数据》
Gigascience. 2014 Dec 11;3(1):26. doi: 10.1186/2047-217X-3-26. eCollection 2014.
7
Finding the missing honey bee genes: lessons learned from a genome upgrade.寻找缺失的蜜蜂基因:从基因组升级中吸取的教训。
BMC Genomics. 2014 Jan 30;15:86. doi: 10.1186/1471-2164-15-86.
8
MAKER-P: a tool kit for the rapid creation, management, and quality control of plant genome annotations.MAKER-P:一个用于快速创建、管理和质量控制植物基因组注释的工具包。
Plant Physiol. 2014 Feb;164(2):513-24. doi: 10.1104/pp.113.230144. Epub 2013 Dec 4.
9
GenomeTools: a comprehensive software library for efficient processing of structured genome annotations.基因组工具:一个用于高效处理结构化基因组注释的综合软件库。
IEEE/ACM Trans Comput Biol Bioinform. 2013 May-Jun;10(3):645-56. doi: 10.1109/TCBB.2013.68.
10
Evolution of Hox gene clusters in deuterostomes.后口动物中Hox基因簇的演化
BMC Dev Biol. 2013 Jul 2;13:26. doi: 10.1186/1471-213X-13-26.