• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于先验遗传学框架的无假设表型预测。

Hypothesis-free phenotype prediction within a genetics-first framework.

机构信息

MRC Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Avenue, Cambridge, CB2 0QH, UK.

Department of Computer Science, University of Bristol, Bristol, BS8 1UB, UK.

出版信息

Nat Commun. 2023 Feb 17;14(1):919. doi: 10.1038/s41467-023-36634-6.

DOI:10.1038/s41467-023-36634-6
PMID:36808136
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9938118/
Abstract

Cohort-wide sequencing studies have revealed that the largest category of variants is those deemed 'rare', even for the subset located in coding regions (99% of known coding variants are seen in less than 1% of the population. Associative methods give some understanding how rare genetic variants influence disease and organism-level phenotypes. But here we show that additional discoveries can be made through a knowledge-based approach using protein domains and ontologies (function and phenotype) that considers all coding variants regardless of allele frequency. We describe an ab initio, genetics-first method making molecular knowledge-based interpretations for exome-wide non-synonymous variants for phenotypes at the organism and cellular level. By using this reverse approach, we identify plausible genetic causes for developmental disorders that have eluded other established methods and present molecular hypotheses for the causal genetics of 40 phenotypes generated from a direct-to-consumer genotype cohort. This system offers a chance to extract further discovery from genetic data after standard tools have been applied.

摘要

全队列测序研究表明,最大的一类变体是那些被认为是“罕见的”,即使是位于编码区域的亚组(已知编码变体的 99%仅见于不到 1%的人群。关联方法使我们对罕见遗传变体如何影响疾病和机体水平表型有了一些了解。但在这里,我们通过使用基于知识的方法(利用蛋白质结构域和本体论(功能和表型))来展示,该方法考虑了所有编码变体,而不论等位基因频率如何,还可以进行其他发现。我们描述了一种从头开始、以遗传学为基础的方法,对机体和细胞水平的表型进行外显子全范围非 synonymous变体的分子知识解读。通过使用这种反向方法,我们为其他已建立的方法无法确定的发育障碍确定了合理的遗传原因,并为直接面向消费者的基因型队列生成的 40 种表型的因果遗传学提供了分子假说。该系统为在应用标准工具后从遗传数据中提取更多发现提供了机会。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c13/9938118/3bcec139984e/41467_2023_36634_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c13/9938118/1c1e0025dc0b/41467_2023_36634_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c13/9938118/ce516250061d/41467_2023_36634_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c13/9938118/69806e040875/41467_2023_36634_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c13/9938118/f8f6266858f0/41467_2023_36634_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c13/9938118/8f3aacacd1cf/41467_2023_36634_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c13/9938118/3bcec139984e/41467_2023_36634_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c13/9938118/1c1e0025dc0b/41467_2023_36634_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c13/9938118/ce516250061d/41467_2023_36634_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c13/9938118/69806e040875/41467_2023_36634_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c13/9938118/f8f6266858f0/41467_2023_36634_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c13/9938118/8f3aacacd1cf/41467_2023_36634_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c13/9938118/3bcec139984e/41467_2023_36634_Fig6_HTML.jpg

相似文献

1
Hypothesis-free phenotype prediction within a genetics-first framework.基于先验遗传学框架的无假设表型预测。
Nat Commun. 2023 Feb 17;14(1):919. doi: 10.1038/s41467-023-36634-6.
2
Exome Chip Meta-analysis Fine Maps Causal Variants and Elucidates the Genetic Architecture of Rare Coding Variants in Smoking and Alcohol Use.外显子组芯片荟萃分析精细定位因果变异,并阐明了吸烟和饮酒中罕见编码变异的遗传结构。
Biol Psychiatry. 2019 Jun 1;85(11):946-955. doi: 10.1016/j.biopsych.2018.11.024. Epub 2018 Dec 6.
3
Genome analysis and knowledge-driven variant interpretation with TGex.基因组分析和基于 TGex 的知识驱动的变异解释。
BMC Med Genomics. 2019 Dec 30;12(1):200. doi: 10.1186/s12920-019-0647-8.
4
Whole exome sequencing of extreme age-related macular degeneration phenotypes.极端年龄相关性黄斑变性表型的全外显子组测序
Mol Vis. 2016 Aug 29;22:1062-76. eCollection 2016.
5
High-frequency actionable pathogenic exome variants in an average-risk cohort.平均风险队列中的高频可操作致病外显子变异
Cold Spring Harb Mol Case Stud. 2018 Dec 17;4(6). doi: 10.1101/mcs.a003178. Print 2018 Dec.
6
Gene-based burden scores identify rare variant associations for 28 blood biomarkers.基于基因的负担评分可识别 28 种血液生物标志物的罕见变异关联。
BMC Genom Data. 2023 Sep 4;24(1):50. doi: 10.1186/s12863-023-01155-0.
7
Investigation of Recessive Effects of Coding Variants on Common Clinical Phenotypes in Exome-Sequenced UK Biobank Participants.外显子组测序英国生物库参与者常见临床表型的编码变异隐性效应研究。
Hum Hered. 2024;89(1):1-7. doi: 10.1159/000537771. Epub 2024 Feb 10.
8
Exome array analysis of rare and low frequency variants in amyotrophic lateral sclerosis.外显子组分析在肌萎缩侧索硬化症中的罕见和低频变异。
Sci Rep. 2019 Apr 11;9(1):5931. doi: 10.1038/s41598-019-42091-3.
9
Impact of variant-level batch effects on identification of genetic risk factors in large sequencing studies.变异水平批次效应在大型测序研究中对遗传风险因素识别的影响。
PLoS One. 2021 Apr 16;16(4):e0249305. doi: 10.1371/journal.pone.0249305. eCollection 2021.
10
Exome sequencing study in patients with multiple sclerosis reveals variants associated with disease course.外显子组测序研究多发性硬化症患者揭示了与疾病进程相关的变异。
J Neuroinflammation. 2018 Sep 14;15(1):265. doi: 10.1186/s12974-018-1307-1.

引用本文的文献

1
DeepAnnotation: A novel interpretable deep learning-based genomic selection model that integrates comprehensive functional annotations.深度注释:一种基于深度学习的新型可解释基因组选择模型,该模型整合了全面的功能注释。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf083.
2
Integrating Bioinformatics and Machine Learning for Genomic Prediction in Chickens.将生物信息学和机器学习整合用于鸡的基因组预测。
Genes (Basel). 2024 May 26;15(6):690. doi: 10.3390/genes15060690.
3
The dcGO Domain-Centric Ontology Database in 2023: New Website and Extended Annotations for Protein Structural Domains.

本文引用的文献

1
Rare variant contribution to human disease in 281,104 UK Biobank exomes.281104 名英国生物银行外显子组中罕见变异对人类疾病的贡献。
Nature. 2021 Sep;597(7877):527-532. doi: 10.1038/s41586-021-03855-y. Epub 2021 Aug 10.
2
A survey of direct-to-consumer genotype data, and quality control tool () for research.一项针对直接面向消费者的基因数据以及用于研究的质量控制工具()的调查。
Comput Struct Biotechnol J. 2021 Jun 27;19:3747-3754. doi: 10.1016/j.csbj.2021.06.040. eCollection 2021.
3
Testing for rare conditions.罕见病症检测
2023 年的 dcGO 基于领域的本体数据库:蛋白质结构域的新网站和扩展注释。
J Mol Biol. 2023 Jul 15;435(14):168093. doi: 10.1016/j.jmb.2023.168093. Epub 2023 Apr 13.
Nat Methods. 2021 Mar;18(3):224-225. doi: 10.1038/s41592-021-01089-4.
4
The Human Phenotype Ontology in 2021.2021 年人类表型本体论。
Nucleic Acids Res. 2021 Jan 8;49(D1):D1207-D1217. doi: 10.1093/nar/gkaa1043.
5
Pfam: The protein families database in 2021.Pfam:2021 年的蛋白质家族数据库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D412-D419. doi: 10.1093/nar/gkaa913.
6
Evaluating drug targets through human loss-of-function genetic variation.通过人类功能丧失性遗传变异评估药物靶点。
Nature. 2020 May;581(7809):459-464. doi: 10.1038/s41586-020-2267-z. Epub 2020 May 27.
7
An evaluation of machine-learning for predicting phenotype: studies in yeast, rice, and wheat.用于预测表型的机器学习评估:酵母、水稻和小麦的研究
Mach Learn. 2020;109(2):251-277. doi: 10.1007/s10994-019-05848-5. Epub 2019 Oct 23.
8
The International Genome Sample Resource (IGSR) collection of open human genomic variation resources.国际基因组样本资源(IGSR)汇集了开放的人类基因组变异资源。
Nucleic Acids Res. 2020 Jan 8;48(D1):D941-D947. doi: 10.1093/nar/gkz836.
9
Benefits and limitations of genome-wide association studies.全基因组关联研究的优势和局限性。
Nat Rev Genet. 2019 Aug;20(8):467-484. doi: 10.1038/s41576-019-0127-1.
10
Interpretable genotype-to-phenotype classifiers with performance guarantees.具有性能保证的可解释基因型到表型分类器。
Sci Rep. 2019 Mar 11;9(1):4071. doi: 10.1038/s41598-019-40561-2.