• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于密度的空间聚类在全基因组数据中识别与缺血性脑卒中相关的基因组位点的评估。

Evaluation of Density-Based Spatial Clustering for Identifying Genomic Loci Associated with Ischemic Stroke in Genome-Wide Data.

机构信息

National Research Centre "Kurchatov Institute", Kurchatov Sq. 2, Moscow 123182, Russia.

出版信息

Int J Mol Sci. 2023 Oct 19;24(20):15355. doi: 10.3390/ijms242015355.

DOI:10.3390/ijms242015355
PMID:37895035
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10607504/
Abstract

The genetic architecture of ischemic stroke (IS), which is one of the leading causes of death worldwide, is complex and underexplored. The traditional approach for associative gene mapping is genome-wide association studies (GWASs), testing individual single-nucleotide polymorphisms (SNPs) across the genomes of case and control groups. The purpose of this research is to develop an alternative approach in which groups of SNPs are examined rather than individual ones. We proposed, validated and applied to real data a new workflow consisting of three key stages: grouping SNPs in clusters, inferring the haplotypes in the clusters and testing haplotypes for the association with phenotype. To group SNPs, we applied the clustering algorithms DBSCAN and HDBSCAN to linkage disequilibrium (LD) matrices, representing pairwise r values between all genotyped SNPs. These clustering algorithms have never before been applied to genotype data as part of the workflow of associative studies. In total, 883,908 SNPs and insertion/deletion polymorphisms from people of European ancestry (4929 cases and 652 controls) were processed. The subsequent testing for frequencies of haplotypes restored in the clusters of SNPs revealed dozens of genes associated with IS and suggested the complex role that protocadherin molecules play in IS. The developed workflow was validated with the use of a simulated dataset of similar ancestry and the same sample sizes. The results of classic GWASs are also provided and discussed. The considered clustering algorithms can be applied to genotypic data to identify the genomic loci associated with different qualitative traits, using the workflow presented in this research.

摘要

缺血性中风(IS)的遗传结构复杂且尚未得到充分探索,IS 是全球主要死亡原因之一。关联基因映射的传统方法是全基因组关联研究(GWAS),测试病例和对照组基因组中个体单核苷酸多态性(SNP)。本研究旨在开发一种替代方法,即检查 SNP 组而不是单个 SNP。我们提出、验证并将一个新的工作流程应用于真实数据,该流程由三个关键阶段组成:SNP 聚类、聚类中单体型推断以及单体型与表型关联的检验。为了对 SNP 进行聚类,我们应用了 DBSCAN 和 HDBSCAN 聚类算法到连锁不平衡(LD)矩阵,代表所有基因分型 SNP 之间的成对 r 值。这些聚类算法以前从未作为关联研究工作流程的一部分应用于基因型数据。总共处理了来自欧洲血统人群的 883908 个 SNP 和插入/缺失多态性(4929 例病例和 652 例对照)。随后对 SNP 聚类中恢复的单体型频率进行检验,发现了数十个与 IS 相关的基因,并提示原钙黏蛋白分子在 IS 中发挥复杂作用。使用具有相似祖先和相同样本大小的模拟数据集验证了所开发的工作流程。还提供和讨论了经典 GWAS 的结果。所考虑的聚类算法可以应用于基因型数据,以使用本研究中提出的工作流程识别与不同定性特征相关的基因组位点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/74e04b181542/ijms-24-15355-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/5f3b64575633/ijms-24-15355-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/f0471ba22eeb/ijms-24-15355-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/381cd1f37a9b/ijms-24-15355-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/d2239fb1a67a/ijms-24-15355-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/0a0528c12373/ijms-24-15355-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/44bef30ed495/ijms-24-15355-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/74e04b181542/ijms-24-15355-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/5f3b64575633/ijms-24-15355-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/f0471ba22eeb/ijms-24-15355-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/381cd1f37a9b/ijms-24-15355-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/d2239fb1a67a/ijms-24-15355-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/0a0528c12373/ijms-24-15355-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/44bef30ed495/ijms-24-15355-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95f8/10607504/74e04b181542/ijms-24-15355-g007.jpg

相似文献

1
Evaluation of Density-Based Spatial Clustering for Identifying Genomic Loci Associated with Ischemic Stroke in Genome-Wide Data.基于密度的空间聚类在全基因组数据中识别与缺血性脑卒中相关的基因组位点的评估。
Int J Mol Sci. 2023 Oct 19;24(20):15355. doi: 10.3390/ijms242015355.
2
Performance of a blockwise approach in variable selection using linkage disequilibrium information.使用连锁不平衡信息进行变量选择时的分块方法性能。
BMC Bioinformatics. 2015 May 8;16:148. doi: 10.1186/s12859-015-0556-6.
3
Replication of genetic loci for ages at menarche and menopause in the multi-ethnic Population Architecture using Genomics and Epidemiology (PAGE) study.多民族人群基因与流行病学研究中遗传位点复制与初潮和绝经年龄的关联。
Hum Reprod. 2013 Jun;28(6):1695-706. doi: 10.1093/humrep/det071. Epub 2013 Mar 18.
4
Evaluation of clustering and genotype distribution for replication in genome wide association studies: the age-related eye disease study.全基因组关联研究中用于复制的聚类和基因型分布评估:年龄相关性眼病研究
PLoS One. 2008;3(11):e3813. doi: 10.1371/journal.pone.0003813. Epub 2008 Nov 26.
5
Haplotype-Based Single-Step GWAS for Yearling Temperament in American Angus Cattle.基于单体型的美国安格斯牛周岁行为学单步 GWAS 分析。
Genes (Basel). 2021 Dec 22;13(1):17. doi: 10.3390/genes13010017.
6
Increasing power of genome-wide association studies by collecting additional single-nucleotide polymorphisms.通过收集额外的单核苷酸多态性来提高全基因组关联研究的效力。
Genetics. 2011 Jun;188(2):449-60. doi: 10.1534/genetics.111.128595. Epub 2011 Apr 5.
7
Selecting Closely-Linked SNPs Based on Local Epistatic Effects for Haplotype Construction Improves Power of Association Mapping.基于局部上位效应选择紧密连锁 SNPs 进行单倍型构建可提高关联作图的功效。
G3 (Bethesda). 2019 Dec 3;9(12):4115-4126. doi: 10.1534/g3.119.400451.
8
Performance of single nucleotide polymorphisms versus haplotypes for genome-wide association analysis in barley.单核苷酸多态性与单体型在大麦全基因组关联分析中的性能比较。
PLoS One. 2010 Nov 22;5(11):e14079. doi: 10.1371/journal.pone.0014079.
9
The extent of linkage disequilibrium and computational challenges of single nucleotide polymorphisms in genome-wide association studies.全基因组关联研究中单核苷酸多态性的连锁不平衡程度和计算挑战。
Curr Drug Metab. 2011 Jun;12(5):498-506. doi: 10.2174/138920011795495312.
10
Genetic variants associated with idiopathic pulmonary fibrosis susceptibility and mortality: a genome-wide association study.与特发性肺纤维化易感性和死亡率相关的遗传变异:全基因组关联研究。
Lancet Respir Med. 2013 Jun;1(4):309-317. doi: 10.1016/S2213-2600(13)70045-6. Epub 2013 Apr 17.

引用本文的文献

1
The Use of AI for Phenotype-Genotype Mapping.人工智能在表型-基因型映射中的应用。
Methods Mol Biol. 2025;2952:369-410. doi: 10.1007/978-1-0716-4690-8_21.
2
Determinant-based grouping of SNPs and its application for detecting disease-associated genomic loci.基于决定因素的单核苷酸多态性分组及其在检测疾病相关基因组位点中的应用。
NAR Genom Bioinform. 2025 Mar 18;7(1):lqaf024. doi: 10.1093/nargab/lqaf024. eCollection 2025 Mar.
3
Genome-wide association studies of ischemic stroke based on interpretable machine learning.基于可解释机器学习的缺血性中风全基因组关联研究

本文引用的文献

1
GWAS Central: an expanding resource for finding and visualising genotype and phenotype data from genome-wide association studies.GWAS 中心:一个不断扩展的资源,用于发现和可视化全基因组关联研究中的基因型和表型数据。
Nucleic Acids Res. 2023 Jan 6;51(D1):D986-D993. doi: 10.1093/nar/gkac1017.
2
Stroke genetics informs drug discovery and risk prediction across ancestries.中风遗传学为药物发现和跨种族风险预测提供信息。
Nature. 2022 Nov;611(7934):115-123. doi: 10.1038/s41586-022-05165-3. Epub 2022 Sep 30.
3
Orphan GPR26 Counteracts Early Phases of Hyperglycemia-Mediated Monocyte Activation and Is Suppressed in Diabetic Patients.
PeerJ Comput Sci. 2024 Nov 6;10:e2454. doi: 10.7717/peerj-cs.2454. eCollection 2024.
孤儿G蛋白偶联受体26可对抗高血糖介导的单核细胞激活的早期阶段,并在糖尿病患者中受到抑制。
Biomedicines. 2022 Jul 19;10(7):1736. doi: 10.3390/biomedicines10071736.
4
Association of DNA Methylation Patterns in 7 Novel Genes With Ischemic Stroke in the Northern Chinese Population.中国北方人群中7个新基因的DNA甲基化模式与缺血性中风的关联
Front Genet. 2022 Apr 11;13:844141. doi: 10.3389/fgene.2022.844141. eCollection 2022.
5
DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update).DAVID:一个用于基因列表功能富集分析和功能注释的网络服务器(2021 更新)。
Nucleic Acids Res. 2022 Jul 5;50(W1):W216-W221. doi: 10.1093/nar/gkac194.
6
Genetic Contributors of Incident Stroke in 10,700 African Americans With Hypertension: A Meta-Analysis From the Genetics of Hypertension Associated Treatments and Reasons for Geographic and Racial Differences in Stroke Studies.10700名患有高血压的非裔美国人中风发病的遗传因素:高血压相关治疗遗传学及中风研究中地理和种族差异原因的荟萃分析
Front Genet. 2021 Dec 21;12:781451. doi: 10.3389/fgene.2021.781451. eCollection 2021.
7
Ischemic Heart Disease Selectively Modifies the Right Atrial Appendage Transcriptome.缺血性心脏病选择性地改变右心耳转录组。
Front Cardiovasc Med. 2021 Dec 2;8:728198. doi: 10.3389/fcvm.2021.728198. eCollection 2021.
8
Inflammation and Stroke Risk: A New Target for Prevention.炎症与卒中风险:预防的新靶点。
Stroke. 2021 Aug;52(8):2697-2706. doi: 10.1161/STROKEAHA.121.034388. Epub 2021 Jun 24.
9
Biological constraints on GWAS SNPs at suggestive significance thresholds reveal additional BMI loci.生物限制因素对 GWAS SNP 在提示性显著阈值下的研究揭示了额外的 BMI 位点。
Elife. 2021 Jan 18;10:e62206. doi: 10.7554/eLife.62206.
10
Right Place at the Right Time: How Changes in Protocadherins Affect Synaptic Connections Contributing to the Etiology of Neurodevelopmental Disorders.恰到好处的时机:原钙黏蛋白的变化如何影响突触连接,从而导致神经发育障碍的发病机制。
Cells. 2020 Dec 18;9(12):2711. doi: 10.3390/cells9122711.