• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类基因组中受约束的编码区域图谱。

A map of constrained coding regions in the human genome.

机构信息

Department of Human Genetics, University of Utah, Salt Lake City, UT, USA.

USTAR Center for Genetic Discovery, University of Utah, Salt Lake City, UT, USA.

出版信息

Nat Genet. 2019 Jan;51(1):88-95. doi: 10.1038/s41588-018-0294-6. Epub 2018 Dec 10.

DOI:10.1038/s41588-018-0294-6
PMID:30531870
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6589356/
Abstract

Deep catalogs of genetic variation from thousands of humans enable the detection of intraspecies constraint by identifying coding regions with a scarcity of variation. While existing techniques summarize constraint for entire genes, single gene-wide metrics conceal regional constraint variability within each gene. Therefore, we have created a detailed map of constrained coding regions (CCRs) by leveraging variation observed among 123,136 humans from the Genome Aggregation Database. The most constrained CCRs are enriched for pathogenic variants in ClinVar and mutations underlying developmental disorders. CCRs highlight protein domain families under high constraint and suggest unannotated or incomplete protein domains. The highest-percentile CCRs complement existing variant prioritization methods when evaluating de novo mutations in studies of autosomal dominant disease. Finally, we identify highly constrained CCRs within genes lacking known disease associations. This observation suggests that CCRs may identify regions under strong purifying selection that, when mutated, cause severe developmental phenotypes or embryonic lethality.

摘要

从数千名人类中获取的深度遗传变异目录可通过识别变异稀少的编码区域来检测种内约束。虽然现有技术可汇总整个基因的约束信息,但单个基因范围的指标却掩盖了每个基因内的区域约束变异性。因此,我们利用来自基因组聚合数据库的 123,136 名人类的观察变异,创建了一个受约束编码区域 (CCR) 的详细图谱。在 ClinVar 中最受约束的 CCR 富含致病性变异和发育障碍的突变。CCR 突出了高度约束的蛋白质结构域家族,并提示了未注释或不完整的蛋白质结构域。在研究常染色体显性疾病的新生突变时,最高百分位 CCR 可补充现有的变异优先级方法。最后,我们在缺乏已知疾病关联的基因内确定了高度受约束的 CCR。这一观察结果表明,CCR 可能识别出受到强烈纯化选择的区域,当这些区域发生突变时,会导致严重的发育表型或胚胎致死性。

相似文献

1
A map of constrained coding regions in the human genome.人类基因组中受约束的编码区域图谱。
Nat Genet. 2019 Jan;51(1):88-95. doi: 10.1038/s41588-018-0294-6. Epub 2018 Dec 10.
2
Mapping the Constrained Coding Regions in the Human Genome to Their Corresponding Proteins.绘制人类基因组中受限编码区域与其对应蛋白质的图谱。
J Mol Biol. 2023 Jan 30;435(2):167892. doi: 10.1016/j.jmb.2022.167892. Epub 2022 Nov 21.
3
A genomic mutational constraint map using variation in 76,156 human genomes.基于 76156 个人类基因组的变异,绘制出基因组突变约束图谱。
Nature. 2024 Jan;625(7993):92-100. doi: 10.1038/s41586-023-06045-0. Epub 2023 Dec 6.
4
Genetic constraint at single amino acid resolution in protein domains improves missense variant prioritisation and gene discovery.在蛋白质结构域中单氨基酸分辨率的遗传限制可提高错义变异体的优先级排序和基因发现。
Genome Med. 2024 Jul 11;16(1):88. doi: 10.1186/s13073-024-01358-9.
5
Scanning window analysis of non-coding regions within normal-tumor whole-genome sequence samples.正常肿瘤全基因组序列样本中非编码区的扫描窗口分析。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa203.
6
Exhaustive prediction of disease susceptibility to coding base changes in the human genome.对人类基因组编码碱基变化的疾病易感性进行详尽预测。
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S3. doi: 10.1186/1471-2105-9-S9-S3.
7
Exploring the function of genetic variants in the non-coding genomic regions: approaches for identifying human regulatory variants affecting gene expression.探索非编码基因组区域中基因变异的功能:鉴定影响基因表达的人类调控变异的方法。
Brief Bioinform. 2015 May;16(3):393-412. doi: 10.1093/bib/bbu018. Epub 2014 Jun 10.
8
Quantifying constraint in the human mitochondrial genome.量化人类线粒体基因组中的约束。
Nature. 2024 Nov;635(8038):390-397. doi: 10.1038/s41586-024-08048-x. Epub 2024 Oct 16.
9
A Whole-Genome Analysis Framework for Effective Identification of Pathogenic Regulatory Variants in Mendelian Disease.一种用于有效识别孟德尔疾病中致病调控变异的全基因组分析框架。
Am J Hum Genet. 2016 Sep 1;99(3):595-606. doi: 10.1016/j.ajhg.2016.07.005. Epub 2016 Aug 25.
10
The human noncoding genome defined by genetic diversity.遗传多样性定义的人类非编码基因组。
Nat Genet. 2018 Mar;50(3):333-337. doi: 10.1038/s41588-018-0062-7. Epub 2018 Feb 26.

引用本文的文献

1
A framework to infer exonic variants when parental genotypes are missing enhances association studies of autism.一种在亲本基因型缺失时推断外显子变异的框架增强了自闭症的关联研究。
bioRxiv. 2025 Jul 24:2025.07.24.666675. doi: 10.1101/2025.07.24.666675.
2
A probabilistic graphical model for estimating selection coefficients of nonsynonymous variants from human population sequence data.一种用于从人类群体序列数据中估计非同义变体选择系数的概率图模型。
Nat Commun. 2025 May 20;16(1):4670. doi: 10.1038/s41467-025-59937-2.
3
PRESCOTT: a population aware, epistatic, and structural model accurately predicts missense effects.普雷斯科特:一种群体感知、上位性和结构模型能准确预测错义效应。
Genome Biol. 2025 May 6;26(1):113. doi: 10.1186/s13059-025-03581-y.
4
Analysis of canine gene constraint identifies new variants for orofacial clefts and stature.犬类基因限制分析确定了口面部裂隙和身高的新变异。
Genome Res. 2025 May 2;35(5):1080-1093. doi: 10.1101/gr.280092.124.
5
Diverse ancestral representation improves genetic intolerance metrics.多样的祖先代表性可改善基因不耐受指标。
Nat Commun. 2025 Mar 18;16(1):2648. doi: 10.1038/s41467-025-57885-5.
6
Identifying deleterious noncoding variation through gain and loss of CTCF binding activity.通过CTCF结合活性的增减来识别有害的非编码变异。
Am J Hum Genet. 2025 Apr 3;112(4):892-902. doi: 10.1016/j.ajhg.2025.02.009. Epub 2025 Mar 5.
7
Rare disease gene association discovery in the 100,000 Genomes Project.在“十万基因组计划”中发现罕见病基因关联。
Nature. 2025 Feb 26. doi: 10.1038/s41586-025-08623-w.
8
Detailed tandem repeat allele profiling in 1,027 long-read genomes reveals genome-wide patterns of pathogenicity.对1027个长读长基因组进行详细的串联重复等位基因分析揭示了全基因组范围的致病性模式。
bioRxiv. 2025 Jan 20:2025.01.06.631535. doi: 10.1101/2025.01.06.631535.
9
Analyses of Human Genetic Data to Identify Clinically Relevant Domains of Neuroligins.分析人类遗传数据以确定神经连接蛋白的临床相关结构域。
Genes (Basel). 2024 Dec 14;15(12):1601. doi: 10.3390/genes15121601.
10
Acromesomelic Dysplasia With Homozygosity for a Likely Pathogenic BMPR1B Variant: Postaxial Polydactyly as a Novel Clinical Finding.肢端-中轴型骨发育不良伴 BMPR1B 杂合致病性变异:后轴多指作为一种新的临床发现。
Mol Genet Genomic Med. 2024 Oct;12(10):e70023. doi: 10.1002/mgg3.70023.

本文引用的文献

1
An analytical framework for whole-genome sequence association studies and its implications for autism spectrum disorder.全基因组序列关联研究的分析框架及其对自闭症谱系障碍的意义。
Nat Genet. 2018 Apr 26;50(5):727-736. doi: 10.1038/s41588-018-0107-y.
2
Genomic Patterns of De Novo Mutation in Simplex Autism.单纯性自闭症的新生突变基因组模式
Cell. 2017 Oct 19;171(3):710-722.e12. doi: 10.1016/j.cell.2017.08.047. Epub 2017 Sep 28.
3
Spatial Clustering of de Novo Missense Mutations Identifies Candidate Neurodevelopmental Disorder-Associated Genes.新生错义突变的空间聚类鉴定出候选神经发育障碍相关基因。
Am J Hum Genet. 2017 Sep 7;101(3):478-484. doi: 10.1016/j.ajhg.2017.08.004. Epub 2017 Aug 31.
4
Optimizing genomic medicine in epilepsy through a gene-customized approach to missense variant interpretation.通过基因定制的方法对错义变异进行解释,优化癫痫的基因组医学。
Genome Res. 2017 Oct;27(10):1715-1729. doi: 10.1101/gr.226589.117. Epub 2017 Sep 1.
5
Refining the role of de novo protein-truncating variants in neurodevelopmental disorders by using population reference samples.通过使用群体参考样本优化新生蛋白质截短变异体在神经发育障碍中的作用。
Nat Genet. 2017 Apr;49(4):504-510. doi: 10.1038/ng.3789. Epub 2017 Feb 13.
6
Properties of human disease genes and the role of genes linked to Mendelian disorders in complex disease aetiology.人类疾病基因的特性以及与孟德尔疾病相关的基因在复杂疾病病因学中的作用。
Hum Mol Genet. 2017 Feb 1;26(3):489-500. doi: 10.1093/hmg/ddw405.
7
Quantifying unobserved protein-coding variants in human populations provides a roadmap for large-scale sequencing projects.量化人类群体中未被观察到的蛋白质编码变异为大规模测序项目提供了路线图。
Nat Commun. 2016 Oct 31;7:13293. doi: 10.1038/ncomms13293.
8
REVEL: An Ensemble Method for Predicting the Pathogenicity of Rare Missense Variants.REVEL:一种预测罕见错义变异致病性的集成方法。
Am J Hum Genet. 2016 Oct 6;99(4):877-885. doi: 10.1016/j.ajhg.2016.08.016. Epub 2016 Sep 22.
9
Meta-analysis of 2,104 trios provides support for 10 new genes for intellectual disability.对 2104 个三核苷酸重复扩展家族的荟萃分析为智力障碍的 10 个新基因提供了支持。
Nat Neurosci. 2016 Sep;19(9):1194-6. doi: 10.1038/nn.4352. Epub 2016 Aug 1.
10
The Ensembl Variant Effect Predictor.Ensembl变异效应预测器。
Genome Biol. 2016 Jun 6;17(1):122. doi: 10.1186/s13059-016-0974-4.