• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Epi-SSA:一种基于多目标麻雀搜索算法的新的上位性检测方法。

Epi-SSA: A novel epistasis detection method based on a multi-objective sparrow search algorithm.

机构信息

College of Computer Science and Technology, Changchun University, Changchun City, Jilin Province, China.

School of Cultural and Media Studies, Changchun University of Science and Technology, Changchun City, Jilin Province, China.

出版信息

PLoS One. 2024 Oct 24;19(10):e0311223. doi: 10.1371/journal.pone.0311223. eCollection 2024.

DOI:10.1371/journal.pone.0311223
PMID:39446852
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11500897/
Abstract

Genome-wide association studies typically considers epistatic interactions as a crucial factor in exploring complex diseases. However, the current methods primarily concentrate on the detection of two-order epistatic interactions, with flaws in accuracy. In this work, we introduce a novel method called Epi-SSA, which can be better utilized to detect high-order epistatic interactions. Epi-SSA draws inspiration from the sparrow search algorithm and optimizes the population based on multiple objective functions in each iteration, in order to be able to more precisely identify epistatic interactions. To evaluate its performance, we conducted a comprehensive comparison between Epi-SSA and seven other methods using five simulation datasets: DME 100, DNME 100, DME 1000, DNME 1000 and DNME3 100. The DME 100 dataset encompasses eight second-order epistasis disease models with marginal effects, each comprising 100 simulated data instances, featuring 100 SNPs per instance, alongside 800 case and 800 control samples. The DNME 100 encompasses eight second-order epistasis disease models without marginal effects and retains other properties consistent with DME 100. Experiments on the DME 100 and DNME 100 datasets were designed to evaluate the algorithms' capacity to detect epistasis across varying disease models. The DME 1000 and DNME 1000 datasets extend the complexity with 1000 SNPs per simulated data instance, while retaining other properties consistent with DME 100 and DNME 100. These experiments aimed to gauge the algorithms' adaptability in detecting epistasis as the number of SNPs in the data increases. The DNME3 100 dataset introduces a higher level of complexity with six third-order epistasis disease models, otherwise paralleling the structure of DNME 100, serving to test the algorithms' proficiency in identifying higher-order epistasis. The highest average F-measures achieved by the seven other existing methods on the five datasets are 0.86, 0.86, 0.41, 0.56, and 0.79 respectively, while the average F-measures of Epi-SSA on the five datasets are 0.92, 0.97, 0.79, 0.86, and 0.97 respectively. The experimental results demonstrate that the Epi-SSA algorithm outperforms other methods in a variety of epistasis detection tasks. As the number of SNPs in the data set increases and the order of epistasis rises, the advantages of the Epi-SSA algorithm become increasingly pronounced. In addition, we applied Epi-SSA to the analysis of the WTCCC dataset, uncovering numerous genes and gene pairs that might play a significant role in the pathogenesis of seven complex diseases. It is worthy of note that some of these genes have been relatedly reported in the Comparative Toxicogenomics Database (CTD). Epi-SSA is a potent tool for detecting epistatic interactions, which aids us in further comprehending the pathogenesis of common and complex diseases. The source code of Epi-SSA can be obtained at https://osf.io/6sqwj/.

摘要

全基因组关联研究通常将上位性相互作用视为探索复杂疾病的关键因素。然而,目前的方法主要集中于检测二阶上位性相互作用,存在准确性方面的缺陷。在这项工作中,我们引入了一种名为 Epi-SSA 的新方法,该方法可以更好地用于检测高阶上位性相互作用。Epi-SSA 受到麻雀搜索算法的启发,在每次迭代中基于多个目标函数来优化种群,以便更准确地识别上位性相互作用。为了评估其性能,我们使用五个模拟数据集:DME 100、DNME 100、DME 1000、DNME 1000 和 DNME3 100,对 Epi-SSA 与其他七种方法进行了全面比较。DME 100 数据集包含八个具有边际效应的二阶上位性疾病模型,每个模型都由 100 个模拟数据实例组成,每个实例包含 100 个 SNPs,以及 800 个病例和 800 个对照样本。DNME 100 包含八个没有边际效应的二阶上位性疾病模型,保留了与 DME 100 相同的其他属性。DME 100 和 DNME 100 数据集的实验旨在评估算法在不同疾病模型中检测上位性的能力。DME 1000 和 DNME 1000 数据集通过每个模拟数据实例的 1000 个 SNPs 扩展了复杂性,同时保留了与 DME 100 和 DNME 100 相同的其他属性。这些实验旨在衡量算法在数据中 SNP 数量增加时检测上位性的适应性。DNME3 100 数据集引入了六个具有三阶上位性疾病模型的更高复杂性,其他方面与 DNME 100 结构平行,用于测试算法识别高阶上位性的能力。在五个数据集上,其他七种现有方法的最高平均 F 度量分别为 0.86、0.86、0.41、0.56 和 0.79,而 Epi-SSA 在五个数据集上的平均 F 度量分别为 0.92、0.97、0.79、0.86 和 0.97。实验结果表明,Epi-SSA 算法在各种上位性检测任务中均优于其他方法。随着数据集 SNP 数量的增加和上位性阶数的升高,Epi-SSA 算法的优势变得更加明显。此外,我们将 Epi-SSA 应用于 WTCCC 数据集的分析,发现了许多可能在七种复杂疾病发病机制中起重要作用的基因和基因对。值得注意的是,其中一些基因在比较毒理学基因组数据库 (CTD) 中有相关报道。Epi-SSA 是一种强大的检测上位性相互作用的工具,有助于我们进一步理解常见和复杂疾病的发病机制。Epi-SSA 的源代码可以在 https://osf.io/6sqwj/ 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/c8a44a55f726/pone.0311223.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/f14e06bc0795/pone.0311223.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/526b171d8060/pone.0311223.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/ac4cc3b9c1f6/pone.0311223.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/677a876fbbee/pone.0311223.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/b8070a087573/pone.0311223.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/a1070017c90d/pone.0311223.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/e8fa5dfe9a2c/pone.0311223.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/25c24002518a/pone.0311223.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/c8a44a55f726/pone.0311223.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/f14e06bc0795/pone.0311223.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/526b171d8060/pone.0311223.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/ac4cc3b9c1f6/pone.0311223.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/677a876fbbee/pone.0311223.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/b8070a087573/pone.0311223.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/a1070017c90d/pone.0311223.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/e8fa5dfe9a2c/pone.0311223.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/25c24002518a/pone.0311223.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e118/11500897/c8a44a55f726/pone.0311223.g009.jpg

相似文献

1
Epi-SSA: A novel epistasis detection method based on a multi-objective sparrow search algorithm.Epi-SSA:一种基于多目标麻雀搜索算法的新的上位性检测方法。
PLoS One. 2024 Oct 24;19(10):e0311223. doi: 10.1371/journal.pone.0311223. eCollection 2024.
2
GEP-EpiSeeker: a gene expression programming-based method for epistatic interaction detection in genome-wide association studies.GEP-EpiSeeker:一种基于基因表达式编程的全基因组关联研究中上位性相互作用检测方法。
BMC Genomics. 2021 Dec 20;22(Suppl 1):910. doi: 10.1186/s12864-021-08207-8.
3
SEEI: spherical evolution with feedback mechanism for identifying epistatic interactions.SEEI:用于识别上位相互作用的球形进化与反馈机制。
BMC Genomics. 2024 May 13;25(1):462. doi: 10.1186/s12864-024-10373-4.
4
Detecting purely epistatic multi-locus interactions by an omnibus permutation test on ensembles of two-locus analyses.通过对两基因座分析集合进行整体置换检验来检测纯上位多基因座相互作用。
BMC Bioinformatics. 2009 Sep 17;10:294. doi: 10.1186/1471-2105-10-294.
5
Fast detection of high-order epistatic interactions in genome-wide association studies using information theoretic measure.利用信息论方法在全基因组关联研究中快速检测高阶上位性相互作用
Comput Biol Chem. 2014 Jun;50:19-28. doi: 10.1016/j.compbiolchem.2014.01.005. Epub 2014 Jan 27.
6
Genetic studies of complex human diseases: characterizing SNP-disease associations using Bayesian networks.复杂人类疾病的遗传学研究:使用贝叶斯网络表征单核苷酸多态性与疾病的关联
BMC Syst Biol. 2012;6 Suppl 3(Suppl 3):S14. doi: 10.1186/1752-0509-6-S3-S14. Epub 2012 Dec 17.
7
A Markov blanket-based method for detecting causal SNPs in GWAS.基于马尔可夫毯的 GWAS 中因果 SNP 检测方法。
BMC Bioinformatics. 2010 Apr 29;11 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2105-11-S3-S5.
8
MDSN: A Module Detection Method for Identifying High-Order Epistatic Interactions.MDSN:一种用于识别高阶上位性相互作用的模块检测方法。
Genes (Basel). 2022 Dec 18;13(12):2403. doi: 10.3390/genes13122403.
9
Epi-GTBN: an approach of epistasis mining based on genetic Tabu algorithm and Bayesian network.Epi-GTBN:一种基于遗传禁忌搜索算法和贝叶斯网络的上位性挖掘方法。
BMC Bioinformatics. 2019 Aug 28;20(1):444. doi: 10.1186/s12859-019-3022-z.
10
EpiMOGA: An Epistasis Detection Method Based on a Multi-Objective Genetic Algorithm.EpiMOGA:一种基于多目标遗传算法的上位性检测方法。
Genes (Basel). 2021 Jan 28;12(2):191. doi: 10.3390/genes12020191.

本文引用的文献

1
A Novel Detection Method for High-Order SNP Epistatic Interactions Based on Explicit-Encoding-Based Multitasking Harmony Search.基于显式编码的多任务协同搜索的新型高阶 SNP 上位性互作检测方法。
Interdiscip Sci. 2024 Sep;16(3):688-711. doi: 10.1007/s12539-024-00621-2. Epub 2024 Jul 2.
2
Pleiotropy, epistasis and the genetic architecture of quantitative traits.数量性状的多效性、上位性和遗传结构。
Nat Rev Genet. 2024 Sep;25(9):639-657. doi: 10.1038/s41576-024-00711-3. Epub 2024 Apr 2.
3
Genetic Risk of Primary Aldosteronism and Its Contribution to Hypertension: A Cross-Ancestry Meta-Analysis of Genome-Wide Association Studies.
原发性醛固酮增多症的遗传风险及其对高血压的贡献:全基因组关联研究的跨祖先荟萃分析。
Circulation. 2023 Apr 4;147(14):1097-1109. doi: 10.1161/CIRCULATIONAHA.122.062349. Epub 2023 Feb 21.
4
Recent Versions and Applications of Sparrow Search Algorithm.麻雀搜索算法的最新版本与应用
Arch Comput Methods Eng. 2023;30(5):2831-2858. doi: 10.1007/s11831-023-09887-z. Epub 2023 Feb 7.
5
Advances in Sparrow Search Algorithm: A Comprehensive Survey.麻雀搜索算法的研究进展:全面综述
Arch Comput Methods Eng. 2023;30(1):427-455. doi: 10.1007/s11831-022-09804-w. Epub 2022 Aug 22.
6
A Novel Multitasking Ant Colony Optimization Method for Detecting Multiorder SNP Interactions.一种用于检测多阶 SNP 相互作用的新型多任务蚁群优化方法。
Interdiscip Sci. 2022 Dec;14(4):814-832. doi: 10.1007/s12539-022-00530-2. Epub 2022 Jul 5.
7
Multi-Objective Artificial Bee Colony Algorithm Based on Scale-Free Network for Epistasis Detection.基于无标度网络的多目标人工蜂群算法在连锁检测中的应用。
Genes (Basel). 2022 May 12;13(5):871. doi: 10.3390/genes13050871.
8
Fast and accurate exhaustive higher-order epistasis search with BitEpi.使用 BitEpi 进行快速准确的穷尽高阶上位性搜索。
Sci Rep. 2021 Aug 5;11(1):15923. doi: 10.1038/s41598-021-94959-y.
9
DeepCOMBI: explainable artificial intelligence for the analysis and discovery in genome-wide association studies.DeepCOMBI:用于全基因组关联研究分析与发现的可解释人工智能。
NAR Genom Bioinform. 2021 Jul 20;3(3):lqab065. doi: 10.1093/nargab/lqab065. eCollection 2021 Sep.
10
EpiMOGA: An Epistasis Detection Method Based on a Multi-Objective Genetic Algorithm.EpiMOGA:一种基于多目标遗传算法的上位性检测方法。
Genes (Basel). 2021 Jan 28;12(2):191. doi: 10.3390/genes12020191.