• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

系统发现人类基因组中单核苷酸注释的保守状态。

Systematic discovery of conservation states for single-nucleotide annotation of the human genome.

机构信息

1Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095 USA.

2Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA 90095 USA.

出版信息

Commun Biol. 2019 Jul 2;2:248. doi: 10.1038/s42003-019-0488-1. eCollection 2019.

DOI:10.1038/s42003-019-0488-1
PMID:31286065
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6606595/
Abstract

Comparative genomics sequence data is an important source of information for interpreting genomes. Genome-wide annotations based on this data have largely focused on univariate scores or binary elements of evolutionary constraint. Here we present a complementary whole genome annotation approach, ConsHMM, which applies a multivariate hidden Markov model to learn de novo 'conservation states' based on the combinatorial and spatial patterns of which species align to and match a reference genome in a multiple species DNA sequence alignment. We applied ConsHMM to a 100-way vertebrate sequence alignment to annotate the human genome at single nucleotide resolution into 100 conservation states. These states have distinct enrichments for other genomic information including gene annotations, chromatin states, repeat families, and bases prioritized by various variant prioritization scores. Constrained elements have distinct heritability partitioning enrichments depending on their conservation state assignment. ConsHMM conservation states are a resource for analyzing genomes and genetic variants.

摘要

比较基因组序列数据是解释基因组的重要信息来源。基于这些数据的全基因组注释主要集中在单变量分数或进化约束的二进制元素上。在这里,我们提出了一种互补的全基因组注释方法 ConsHMM,它应用多变量隐马尔可夫模型来学习基于组合和空间模式的新的“保守状态”,这些模式基于多物种 DNA 序列比对中物种与参考基因组的对齐和匹配。我们将 ConsHMM 应用于 100 种脊椎动物序列比对,以单核苷酸分辨率将人类基因组注释为 100 种保守状态。这些状态在其他基因组信息(包括基因注释、染色质状态、重复家族和各种变体优先级得分优先的碱基)方面有明显的富集。根据保守状态分配,受约束的元素具有不同的遗传分割富集。ConsHMM 保守状态是分析基因组和遗传变异的资源。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c37d/6606595/9179a542c2c2/42003_2019_488_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c37d/6606595/e3aeedc03185/42003_2019_488_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c37d/6606595/4bcf20482a7c/42003_2019_488_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c37d/6606595/8d1d93c42d1c/42003_2019_488_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c37d/6606595/68da15eef857/42003_2019_488_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c37d/6606595/94e32db793fb/42003_2019_488_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c37d/6606595/9179a542c2c2/42003_2019_488_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c37d/6606595/e3aeedc03185/42003_2019_488_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c37d/6606595/4bcf20482a7c/42003_2019_488_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c37d/6606595/8d1d93c42d1c/42003_2019_488_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c37d/6606595/68da15eef857/42003_2019_488_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c37d/6606595/94e32db793fb/42003_2019_488_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c37d/6606595/9179a542c2c2/42003_2019_488_Fig6_HTML.jpg

相似文献

1
Systematic discovery of conservation states for single-nucleotide annotation of the human genome.系统发现人类基因组中单核苷酸注释的保守状态。
Commun Biol. 2019 Jul 2;2:248. doi: 10.1038/s42003-019-0488-1. eCollection 2019.
2
ConsHMM Atlas: conservation state annotations for major genomes and human genetic variation.保守隐马尔可夫模型图谱:主要基因组和人类遗传变异的保守状态注释
NAR Genom Bioinform. 2020 Dec 17;2(4):lqaa104. doi: 10.1093/nargab/lqaa104. eCollection 2020 Dec.
3
Chromatin-state discovery and genome annotation with ChromHMM.使用ChromHMM进行染色质状态发现和基因组注释。
Nat Protoc. 2017 Dec;12(12):2478-2492. doi: 10.1038/nprot.2017.124. Epub 2017 Nov 9.
4
Discovery and characterization of chromatin states for systematic annotation of the human genome.发现和描述染色质状态,用于系统注释人类基因组。
Nat Biotechnol. 2010 Aug;28(8):817-25. doi: 10.1038/nbt.1662. Epub 2010 Jul 25.
5
Systematic annotation of conservation states provides insights into regulatory regions in rice.对保守状态的系统注释为水稻中的调控区域提供了见解。
J Genet Genomics. 2022 Dec;49(12):1127-1137. doi: 10.1016/j.jgg.2022.04.003. Epub 2022 Apr 22.
6
In Silico Functional Annotation of Genomic Variation.基因组变异的计算机功能注释
Curr Protoc Hum Genet. 2016 Jan 1;88:6.15.1-6.15.17. doi: 10.1002/0471142905.hg0615s88.
7
Universal annotation of the human genome through integration of over a thousand epigenomic datasets.通过整合一千多个表观基因组数据集实现人类基因组的通用注释。
Genome Biol. 2022 Jan 6;23(1):9. doi: 10.1186/s13059-021-02572-z.
8
Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation.提高比对灵敏度可改善基因组比对在比较基因注释中的应用。
Nucleic Acids Res. 2017 Aug 21;45(14):8369-8377. doi: 10.1093/nar/gkx554.
9
A De-Novo Genome Analysis Pipeline (DeNoGAP) for large-scale comparative prokaryotic genomics studies.一种用于大规模比较原核生物基因组学研究的从头基因组分析流程(DeNoGAP)。
BMC Bioinformatics. 2016 Jun 30;17(1):260. doi: 10.1186/s12859-016-1142-2.
10
Joint annotation of chromatin state and chromatin conformation reveals relationships among domain types and identifies domains of cell-type-specific expression.染色质状态和染色质构象的联合注释揭示了结构域类型之间的关系,并识别出细胞类型特异性表达的结构域。
Genome Res. 2015 Apr;25(4):544-57. doi: 10.1101/gr.184341.114. Epub 2015 Feb 12.

引用本文的文献

1
Whole genome sequence-based association analysis of African American individuals with bipolar disorder and schizophrenia.基于全基因组序列的非裔美国双相情感障碍和精神分裂症患者关联分析。
medRxiv. 2025 Feb 19:2024.12.27.24319111. doi: 10.1101/2024.12.27.24319111.
2
Machine and Deep Learning Methods for Predicting 3D Genome Organization.机器和深度学习方法预测三维基因组结构。
Methods Mol Biol. 2025;2856:357-400. doi: 10.1007/978-1-0716-4136-1_22.
3
Where do obesity and male infertility collide?肥胖和男性不育症在哪里交汇?

本文引用的文献

1
The human noncoding genome defined by genetic diversity.遗传多样性定义的人类非编码基因组。
Nat Genet. 2018 Mar;50(3):333-337. doi: 10.1038/s41588-018-0062-7. Epub 2018 Feb 26.
2
Evolutionary Rewiring of Human Regulatory Networks by Waves of Genome Expansion.人类调控网络的进化重布线是通过基因组的扩张波实现的。
Am J Hum Genet. 2018 Feb 1;102(2):207-218. doi: 10.1016/j.ajhg.2017.12.014. Epub 2018 Jan 18.
3
FATHMM-XF: accurate prediction of pathogenic point mutations via extended features.FATHMM-XF:通过扩展特征准确预测致病性点突变。
BMC Med Genomics. 2024 May 10;17(1):128. doi: 10.1186/s12920-024-01897-5.
4
Machine and deep learning methods for predicting 3D genome organization.用于预测三维基因组组织的机器学习和深度学习方法。
ArXiv. 2024 Mar 4:arXiv:2403.03231v1.
5
Machine-learning of complex evolutionary signals improves classification of SNVs.复杂进化信号的机器学习可改善单核苷酸变异的分类。
NAR Genom Bioinform. 2022 Apr 7;4(2):lqac025. doi: 10.1093/nargab/lqac025. eCollection 2022 Jun.
6
A mammalian methylation array for profiling methylation levels at conserved sequences.一种用于分析保守序列甲基化水平的哺乳动物甲基化芯片。
Nat Commun. 2022 Feb 10;13(1):783. doi: 10.1038/s41467-022-28355-z.
7
Universal annotation of the human genome through integration of over a thousand epigenomic datasets.通过整合一千多个表观基因组数据集实现人类基因组的通用注释。
Genome Biol. 2022 Jan 6;23(1):9. doi: 10.1186/s13059-021-02572-z.
8
Segmentation and genome annotation algorithms for identifying chromatin state and other genomic patterns.用于识别染色质状态和其他基因组模式的分割和基因组注释算法。
PLoS Comput Biol. 2021 Oct 14;17(10):e1009423. doi: 10.1371/journal.pcbi.1009423. eCollection 2021 Oct.
9
NucHMM: a method for quantitative modeling of nucleosome organization identifying functional nucleosome states distinctly associated with splicing potentiality.NucHMM:一种定量建模核小体组织的方法,可识别与剪接潜能明显相关的功能核小体状态。
Genome Biol. 2021 Aug 26;22(1):250. doi: 10.1186/s13059-021-02465-1.
10
Single-nucleotide conservation state annotation of the SARS-CoV-2 genome.SARS-CoV-2 基因组单核苷酸保守状态注释。
Commun Biol. 2021 Jun 3;4(1):698. doi: 10.1038/s42003-021-02231-w.
Bioinformatics. 2018 Feb 1;34(3):511-513. doi: 10.1093/bioinformatics/btx536.
4
FIRE: functional inference of genetic variants that regulate gene expression.FIRE:调控基因表达的遗传变异的功能推断。
Bioinformatics. 2017 Dec 15;33(24):3895-3901. doi: 10.1093/bioinformatics/btx534.
5
Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data.基于功能和群体基因组数据对有害非编码变异进行快速、可扩展的预测。
Nat Genet. 2017 Apr;49(4):618-624. doi: 10.1038/ng.3810. Epub 2017 Mar 13.
6
Distal CpG islands can serve as alternative promoters to transcribe genes with silenced proximal promoters.远端CpG岛可作为替代启动子,用于转录近端启动子沉默的基因。
Genome Res. 2017 Apr;27(4):553-566. doi: 10.1101/gr.212050.116. Epub 2017 Feb 21.
7
M-CAP eliminates a majority of variants of uncertain significance in clinical exomes at high sensitivity.M-CAP 以高灵敏度消除临床外显子组中大多数意义不明的变异。
Nat Genet. 2016 Dec;48(12):1581-1586. doi: 10.1038/ng.3703. Epub 2016 Oct 24.
8
A Whole-Genome Analysis Framework for Effective Identification of Pathogenic Regulatory Variants in Mendelian Disease.一种用于有效识别孟德尔疾病中致病调控变异的全基因组分析框架。
Am J Hum Genet. 2016 Sep 1;99(3):595-606. doi: 10.1016/j.ajhg.2016.07.005. Epub 2016 Aug 25.
9
Ensembl comparative genomics resources.Ensembl比较基因组学资源。
Database (Oxford). 2016 Feb 20;2016. doi: 10.1093/database/bav096. Print 2016.
10
A spectral approach integrating functional genomic annotations for coding and noncoding variants.一种整合编码和非编码变异功能基因组注释的光谱方法。
Nat Genet. 2016 Feb;48(2):214-20. doi: 10.1038/ng.3477. Epub 2016 Jan 4.