• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于单倍型的全基因组关联研究,使用一种新的 SNP 集方法。

RAINBOW: Haplotype-based genome-wide association study using a novel SNP-set method.

机构信息

Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan.

出版信息

PLoS Comput Biol. 2020 Feb 14;16(2):e1007663. doi: 10.1371/journal.pcbi.1007663. eCollection 2020 Feb.

DOI:10.1371/journal.pcbi.1007663
PMID:32059004
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7046296/
Abstract

Difficulty in detecting rare variants is one of the problems in conventional genome-wide association studies (GWAS). The problem is closely related to the complex gene compositions comprising multiple alleles, such as haplotypes. Several single nucleotide polymorphism (SNP) set approaches have been proposed to solve this problem. These methods, however, have been rarely discussed in connection with haplotypes. In this study, we developed a novel SNP-set method named "RAINBOW" and applied the method to haplotype-based GWAS by regarding a haplotype block as a SNP-set. Combining haplotype block estimation and SNP-set GWAS, haplotype-based GWAS can be conducted without prior information of haplotypes. We prepared 100 datasets of simulated phenotypic data and real marker genotype data of Oryza sativa subsp. indica, and performed GWAS of the datasets. We compared the power of our method, the conventional single-SNP GWAS, the conventional haplotype-based GWAS, and the conventional SNP-set GWAS. Our proposed method was shown to be superior to these in three aspects: (1) controlling false positives; (2) in detecting causal variants without relying on the linkage disequilibrium if causal variants were genotyped in the dataset; and (3) it showed greater power than the other methods, i.e., it was able to detect causal variants that were not detected by the others, primarily when the causal variants were located very close to each other, and the directions of their effects were opposite. By using the SNP-set approach as in this study, we expect that detecting not only rare variants but also genes with complex mechanisms, such as genes with multiple causal variants, can be realized. RAINBOW was implemented as an R package named "RAINBOWR" and is available from CRAN (https://cran.r-project.org/web/packages/RAINBOWR/index.html) and GitHub (https://github.com/KosukeHamazaki/RAINBOWR).

摘要

在传统的全基因组关联研究(GWAS)中,检测罕见变异是一个问题。这个问题与包含多个等位基因(如单倍型)的复杂基因组成密切相关。已经提出了几种单核苷酸多态性(SNP)集方法来解决这个问题。然而,这些方法很少与单倍型相关联讨论。在这项研究中,我们开发了一种名为“RAINBOW”的新 SNP 集方法,并通过将单倍型块视为 SNP 集将该方法应用于基于单倍型的 GWAS。通过组合单倍型块估计和 SNP 集 GWAS,可以在没有单倍型先验信息的情况下进行基于单倍型的 GWAS。我们准备了 100 个模拟表型数据和 Oryza sativa subsp. indica 的真实标记基因型数据集,并对这些数据集进行了 GWAS。我们比较了我们的方法、传统的单 SNP GWAS、传统的基于单倍型的 GWAS 和传统的 SNP 集 GWAS 的功效。我们的方法在三个方面表现出优越性:(1)控制假阳性;(2)在不依赖连锁不平衡的情况下检测因果变异,如果因果变异在数据集中被基因分型;(3)它比其他方法具有更大的功效,即能够检测到其他方法无法检测到的因果变异,主要是当因果变异彼此非常接近且它们的作用方向相反时。通过在这项研究中使用 SNP 集方法,我们期望不仅可以检测罕见变异,还可以检测具有复杂机制的基因,例如具有多个因果变异的基因。RAINBOW 被实现为一个名为“RAINBOWR”的 R 包,并可从 CRAN(https://cran.r-project.org/web/packages/RAINBOWR/index.html)和 GitHub(https://github.com/KosukeHamazaki/RAINBOWR)获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fb5/7046296/52cbbb2d4b02/pcbi.1007663.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fb5/7046296/016a0ce8813c/pcbi.1007663.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fb5/7046296/1a4f17f83d95/pcbi.1007663.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fb5/7046296/52cbbb2d4b02/pcbi.1007663.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fb5/7046296/016a0ce8813c/pcbi.1007663.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fb5/7046296/1a4f17f83d95/pcbi.1007663.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1fb5/7046296/52cbbb2d4b02/pcbi.1007663.g003.jpg

相似文献

1
RAINBOW: Haplotype-based genome-wide association study using a novel SNP-set method.基于单倍型的全基因组关联研究,使用一种新的 SNP 集方法。
PLoS Comput Biol. 2020 Feb 14;16(2):e1007663. doi: 10.1371/journal.pcbi.1007663. eCollection 2020 Feb.
2
Selecting Closely-Linked SNPs Based on Local Epistatic Effects for Haplotype Construction Improves Power of Association Mapping.基于局部上位效应选择紧密连锁 SNPs 进行单倍型构建可提高关联作图的功效。
G3 (Bethesda). 2019 Dec 3;9(12):4115-4126. doi: 10.1534/g3.119.400451.
3
SNP- and haplotype-based genome-wide association studies for growth, carcass, and meat quality traits in a Duroc multigenerational population.基于单核苷酸多态性(SNP)和单倍型的杜洛克多代群体生长、胴体和肉质性状全基因组关联研究。
BMC Genet. 2016 Apr 19;17:60. doi: 10.1186/s12863-016-0368-3.
4
The power comparison of the haplotype-based collapsing tests and the variant-based collapsing tests for detecting rare variants in pedigrees.基于单倍型的合并检验与基于变异的合并检验在系谱中检测罕见变异的效能比较。
BMC Genomics. 2014 Jul 28;15(1):632. doi: 10.1186/1471-2164-15-632.
5
Comparison of multimarker logistic regression models, with application to a genomewide scan of schizophrenia.多标志物逻辑回归模型的比较及其在精神分裂症全基因组扫描中的应用。
BMC Genet. 2010 Sep 9;11:80. doi: 10.1186/1471-2156-11-80.
6
CollapsABEL: an R library for detecting compound heterozygote alleles in genome-wide association studies.CollapsABEL:一个用于在全基因组关联研究中检测复合杂合子等位基因的R语言库。
BMC Bioinformatics. 2016 Apr 8;17:156. doi: 10.1186/s12859-016-1006-9.
7
Prioritized candidate causal haplotype blocks in plant genome-wide association studies.植物全基因组关联研究中优先考虑的候选因果单倍型块。
PLoS Genet. 2022 Oct 17;18(10):e1010437. doi: 10.1371/journal.pgen.1010437. eCollection 2022 Oct.
8
Multi-SNP Haplotype Analysis Methods for Association Analysis.用于关联分析的多单核苷酸多态性单倍型分析方法
Methods Mol Biol. 2017;1666:485-504. doi: 10.1007/978-1-4939-7274-6_24.
9
GWASinlps: non-local prior based iterative SNP selection tool for genome-wide association studies.GWASinlps:基于非局部先验的全基因组关联研究的迭代 SNP 选择工具。
Bioinformatics. 2019 Jan 1;35(1):1-11. doi: 10.1093/bioinformatics/bty472.
10
Genome-wide association study of rice grain width variation.全基因组关联研究水稻粒宽变异。
Genome. 2018 Apr;61(4):233-240. doi: 10.1139/gen-2017-0106. Epub 2017 Dec 1.

引用本文的文献

1
Multimodal analysis stratifies genetic susceptibility and reveals the pathogenic mechanism of kidney injury in diabetic nephropathy.多模态分析对遗传易感性进行分层,并揭示糖尿病肾病肾损伤的致病机制。
Cell Rep Med. 2025 Aug 19;6(8):102249. doi: 10.1016/j.xcrm.2025.102249. Epub 2025 Jul 24.
2
Dissecting the genetic basis of response to salmonid alphavirus in Atlantic salmon.剖析大西洋鲑对鲑鱼α病毒反应的遗传基础。
BMC Genomics. 2025 Jul 11;26(1):657. doi: 10.1186/s12864-025-11735-2.
3
Integrating multi-omics and machine learning for disease resistance prediction in legumes.

本文引用的文献

1
A One-Penny Imputed Genome from Next-Generation Reference Panels.基于新一代参考面板的单分钱估算基因组。
Am J Hum Genet. 2018 Sep 6;103(3):338-348. doi: 10.1016/j.ajhg.2018.07.015. Epub 2018 Aug 9.
2
Genomic variation in 3,010 diverse accessions of Asian cultivated rice.亚洲栽培稻 3010 份种质资源的基因组变异。
Nature. 2018 May;557(7703):43-49. doi: 10.1038/s41586-018-0063-9. Epub 2018 Apr 25.
3
Whole genome sequencing-based association study to unravel genetic architecture of cooked grain width and length traits in rice.
整合多组学和机器学习用于豆类抗病性预测
Theor Appl Genet. 2025 Jun 27;138(7):163. doi: 10.1007/s00122-025-04948-2.
4
Optimization of crossing strategy based on the usefulness criterion in interpopulation crosses considering different marker effects among populations.基于有用性标准,在考虑群体间不同标记效应的情况下,对群体间杂交的杂交策略进行优化。
Theor Appl Genet. 2025 Jun 20;138(7):155. doi: 10.1007/s00122-025-04935-7.
5
QTN detection and candidate gene identification for improved eating and cooking quality in rice using GWAS and PLS regression analysis.利用全基因组关联研究(GWAS)和偏最小二乘回归分析检测水稻优质食味和蒸煮品质的QTN并鉴定候选基因
Theor Appl Genet. 2025 Feb 26;138(3):58. doi: 10.1007/s00122-025-04850-x.
6
Haplotype breeding: fast-track the crop improvements.单倍型育种:加速作物改良。
Planta. 2025 Feb 1;261(3):51. doi: 10.1007/s00425-025-04622-3.
7
Resistance haplotypes to green rice leafhopper ( Uhler) estimated in genome-wide association study in Myanmar rice landraces.在缅甸水稻地方品种全基因组关联研究中估计的对绿稻叶蝉(乌勒)的抗性单倍型
Breed Sci. 2024 Sep;74(4):366-381. doi: 10.1270/jsbbs.23067. Epub 2024 Aug 23.
8
Phenotypic simulation for fruit-related traits in F progenies of chili peppers (Capsicum annuum) using genomic prediction based solely on parental information.仅基于亲本信息,利用基因组预测对辣椒(Capsicum annuum)F代子代果实相关性状进行表型模拟。
Mol Genet Genomics. 2025 Jan 21;300(1):15. doi: 10.1007/s00438-024-02224-4.
9
Cross potential selection: a proposal for optimizing crossing combinations in recurrent selection using the usefulness criterion of future inbred lines.交叉潜力选择:利用未来自交系的有用性标准优化轮回选择中杂交组合的建议。
G3 (Bethesda). 2024 Nov 6;14(11). doi: 10.1093/g3journal/jkae224.
10
High-Throughput Phenotyping of Soybean Biomass: Conventional Trait Estimation and Novel Latent Feature Extraction Using UAV Remote Sensing and Deep Learning Models.大豆生物量的高通量表型分析:使用无人机遥感和深度学习模型进行传统性状估计和新型潜在特征提取
Plant Phenomics. 2024 Sep 9;6:0244. doi: 10.34133/plantphenomics.0244. eCollection 2024.
基于全基因组测序的关联研究揭示水稻蒸煮谷粒长宽性状的遗传结构。
Sci Rep. 2017 Sep 29;7(1):12478. doi: 10.1038/s41598-017-12778-6.
4
Rice SNP-seek database update: new SNPs, indels, and queries.水稻SNP-seek数据库更新:新的单核苷酸多态性、插入缺失及查询内容。
Nucleic Acids Res. 2017 Jan 4;45(D1):D1075-D1081. doi: 10.1093/nar/gkw1135. Epub 2016 Nov 29.
5
Genome-wide association study using whole-genome sequencing rapidly identifies new genes influencing agronomic traits in rice.全基因组关联研究利用全基因组测序快速鉴定影响水稻农艺性状的新基因。
Nat Genet. 2016 Aug;48(8):927-34. doi: 10.1038/ng.3596. Epub 2016 Jun 20.
6
Genome-Assisted Prediction of Quantitative Traits Using the R Package sommer.使用R包sommer进行数量性状的基因组辅助预测。
PLoS One. 2016 Jun 6;11(6):e0156744. doi: 10.1371/journal.pone.0156744. eCollection 2016.
7
Genetic linkage analysis in the age of whole-genome sequencing.全基因组测序时代的基因连锁分析
Nat Rev Genet. 2015 May;16(5):275-84. doi: 10.1038/nrg3908. Epub 2015 Mar 31.
8
SNP-Seek database of SNPs derived from 3000 rice genomes.来自3000份水稻基因组的单核苷酸多态性(SNP)的SNP-Seek数据库。
Nucleic Acids Res. 2015 Jan;43(Database issue):D1023-7. doi: 10.1093/nar/gku1039. Epub 2014 Nov 27.
9
Greater power and computational efficiency for kernel-based association testing of sets of genetic variants.基于核的遗传变异集关联测试的更大的能力和计算效率。
Bioinformatics. 2014 Nov 15;30(22):3206-14. doi: 10.1093/bioinformatics/btu504. Epub 2014 Jul 29.
10
The 3,000 rice genomes project: new opportunities and challenges for future rice research.三千水稻基因组计划:未来水稻研究的新机遇和新挑战。
Gigascience. 2014 May 28;3:8. doi: 10.1186/2047-217X-3-8. eCollection 2014.