• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
A new haplotype block detection method for dense genome sequencing data based on interval graph modeling of clusters of highly correlated SNPs.基于高度相关 SNPs 簇的区间图建模的密集基因组测序数据新型单倍型块检测方法。
Bioinformatics. 2018 Feb 1;34(3):388-397. doi: 10.1093/bioinformatics/btx609.
2
Efficient haplotype block recognition of very long and dense genetic sequences.高效识别非常长且密集的遗传序列的单倍型块。
BMC Bioinformatics. 2014 Jan 14;15:10. doi: 10.1186/1471-2105-15-10.
3
HapBlock: haplotype block partitioning and tag SNP selection software using a set of dynamic programming algorithms.HapBlock:一种使用一组动态规划算法进行单倍型块划分和标签单核苷酸多态性选择的软件。
Bioinformatics. 2005 Jan 1;21(1):131-4. doi: 10.1093/bioinformatics/bth482. Epub 2004 Aug 27.
4
The impact of SNP density on fine-scale patterns of linkage disequilibrium.单核苷酸多态性(SNP)密度对连锁不平衡精细尺度模式的影响。
Hum Mol Genet. 2004 Mar 15;13(6):577-88. doi: 10.1093/hmg/ddh060. Epub 2004 Jan 20.
5
Fast Sampling-Based Whole-Genome Haplotype Block Recognition.基于快速抽样的全基因组单倍型块识别
IEEE/ACM Trans Comput Biol Bioinform. 2016 Mar-Apr;13(2):315-25. doi: 10.1109/TCBB.2015.2456897.
6
SNP identification, linkage disequilibrium, and haplotype analysis for a 200-kb genomic region in a Korean population.韩国人群中一个200千碱基基因组区域的单核苷酸多态性鉴定、连锁不平衡和单倍型分析。
Genomics. 2006 Nov;88(5):535-40. doi: 10.1016/j.ygeno.2006.03.003. Epub 2006 Aug 17.
7
Haplotype block partitioning as a tool for dimensionality reduction in SNP association studies.单倍型块划分作为SNP关联研究中降维的一种工具。
BMC Genomics. 2008 Aug 29;9:405. doi: 10.1186/1471-2164-9-405.
8
gpart: human genome partitioning and visualization of high-density SNP data by identifying haplotype blocks.gpart:通过识别单倍型块对高密度 SNP 数据进行人类基因组分区和可视化。
Bioinformatics. 2019 Nov 1;35(21):4419-4421. doi: 10.1093/bioinformatics/btz308.
9
Analysis of concordance of different haplotype block partitioning algorithms.不同单倍型块划分算法的一致性分析
BMC Bioinformatics. 2005 Dec 15;6:303. doi: 10.1186/1471-2105-6-303.
10
Similarity in recombination rate and linkage disequilibrium at CYP2C and CYP2D cytochrome P450 gene regions among Europeans indicates signs of selection and no advantage of using tagSNPs in population isolates.欧洲人群 CYP2C 和 CYP2D 细胞色素 P450 基因区域的重组率和连锁不平衡的相似性表明存在选择迹象,且在群体隔离中使用标签 SNP 没有优势。
Pharmacogenet Genomics. 2012 Dec;22(12):846-57. doi: 10.1097/FPC.0b013e32835a3a6d.

引用本文的文献

1
Admixed and single-continental genome segments of the same ancestry have distinct linkage disequilibrium patterns.具有相同祖先的混合和单一大陆基因组片段具有不同的连锁不平衡模式。
Genome Biol. 2025 Jul 11;26(1):201. doi: 10.1186/s13059-025-03672-w.
2
RegionScan: a comprehensive R package for region-level genome-wide association testing with integration and visualization of multiple-variant and single-variant hypothesis testing.RegionScan:一个用于区域水平全基因组关联测试的综合R包,集成并可视化多变异和单变异假设检验。
Bioinform Adv. 2025 Mar 13;5(1):vbaf052. doi: 10.1093/bioadv/vbaf052. eCollection 2025.
3
Determinant-based grouping of SNPs and its application for detecting disease-associated genomic loci.基于决定因素的单核苷酸多态性分组及其在检测疾病相关基因组位点中的应用。
NAR Genom Bioinform. 2025 Mar 18;7(1):lqaf024. doi: 10.1093/nargab/lqaf024. eCollection 2025 Mar.
4
ML-MAGES: A machine learning framework for multivariate genetic association analyses with genes and effect size shrinkage.ML-MAGES:一种用于基因多变量遗传关联分析及效应大小收缩的机器学习框架。
bioRxiv. 2025 Feb 12:2025.02.11.637655. doi: 10.1101/2025.02.11.637655.
5
TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data.TRIO RVEMVS:一种使用家系三联体数据进行罕见变异关联分析的贝叶斯框架,采用期望最大化变量选择方法。
PLoS One. 2024 Dec 4;19(12):e0314502. doi: 10.1371/journal.pone.0314502. eCollection 2024.
6
A SIMPLE AND FLEXIBLE TEST OF SAMPLE EXCHANGEABILITY WITH APPLICATIONS TO STATISTICAL GENOMICS.一种简单灵活的样本可交换性检验及其在统计基因组学中的应用
Ann Appl Stat. 2024 Mar;18(1):858-881. doi: 10.1214/23-aoas1817. Epub 2024 Jan 31.
7
Population dynamics of potentially harmful haplotypes: a pedigree analysis.潜在有害单倍型的种群动态:系谱分析。
BMC Genomics. 2024 May 16;25(1):487. doi: 10.1186/s12864-024-10407-x.
8
Variants of IL6, IL10, FCN2, RNASE3, IL12B and IL17B loci are associated with Schistosoma mansoni worm burden in the Albert Nile region of Uganda.IL6、IL10、FCN2、RNASE3、IL12B 和 IL17B 基因座的变异与乌干达阿尔伯特尼罗地区曼氏血吸虫虫荷有关。
PLoS Negl Trop Dis. 2023 Nov 30;17(11):e0011796. doi: 10.1371/journal.pntd.0011796. eCollection 2023 Nov.
9
Evaluation of Density-Based Spatial Clustering for Identifying Genomic Loci Associated with Ischemic Stroke in Genome-Wide Data.基于密度的空间聚类在全基因组数据中识别与缺血性脑卒中相关的基因组位点的评估。
Int J Mol Sci. 2023 Oct 19;24(20):15355. doi: 10.3390/ijms242015355.
10
An in-depth association analysis of genetic variants within nicotine-related loci: Meeting in middle of GWAS and genetic fine-mapping.深入分析尼古丁相关基因座内的遗传变异:GWAS 和遗传精细定位的交汇点。
Mol Cell Neurosci. 2023 Dec;127:103895. doi: 10.1016/j.mcn.2023.103895. Epub 2023 Aug 25.

本文引用的文献

1
Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure.针对适应连锁不平衡结构的常见变异的多重线性组合(MLC)回归检验。
Genet Epidemiol. 2017 Feb;41(2):108-121. doi: 10.1002/gepi.22024. Epub 2016 Nov 25.
2
Fast Sampling-Based Whole-Genome Haplotype Block Recognition.基于快速抽样的全基因组单倍型块识别
IEEE/ACM Trans Comput Biol Bioinform. 2016 Mar-Apr;13(2):315-25. doi: 10.1109/TCBB.2015.2456897.
3
Clique-Based Clustering of Correlated SNPs in a Gene Can Improve Performance of Gene-Based Multi-Bin Linear Combination Test.基于基因中相关单核苷酸多态性的团簇聚类可提高基于基因的多箱线性组合检验的性能。
Biomed Res Int. 2015;2015:852341. doi: 10.1155/2015/852341. Epub 2015 Aug 4.
4
A review of study designs and statistical methods for genomic epidemiology studies using next generation sequencing.使用下一代测序技术的基因组流行病学研究的研究设计与统计方法综述。
Front Genet. 2015 Apr 20;6:149. doi: 10.3389/fgene.2015.00149. eCollection 2015.
5
Efficient haplotype block recognition of very long and dense genetic sequences.高效识别非常长且密集的遗传序列的单倍型块。
BMC Bioinformatics. 2014 Jan 14;15:10. doi: 10.1186/1471-2105-15-10.
6
An integrated map of genetic variation from 1,092 human genomes.1092 个人类基因组遗传变异的综合图谱。
Nature. 2012 Nov 1;491(7422):56-65. doi: 10.1038/nature11632.
7
On the uses and applications of the most commonly used measures of linkage disequilibrium from the comparative analysis of their statistical properties.基于连锁不平衡最常用测度统计特性的比较分析,探讨其用途与应用。
Hum Hered. 2011;71(3):186-95. doi: 10.1159/000327732. Epub 2011 Jul 20.
8
Testing for an unusual distribution of rare variants.检测罕见变异的异常分布。
PLoS Genet. 2011 Mar;7(3):e1001322. doi: 10.1371/journal.pgen.1001322. Epub 2011 Mar 3.
9
Pathway-based analysis using reduced gene subsets in genome-wide association studies.基于基因组关联研究中基因子集的通路分析。
BMC Bioinformatics. 2011 Jan 12;12:17. doi: 10.1186/1471-2105-12-17.
10
Integrating common and rare genetic variation in diverse human populations.整合不同人类群体中的常见和罕见遗传变异。
Nature. 2010 Sep 2;467(7311):52-8. doi: 10.1038/nature09298.

基于高度相关 SNPs 簇的区间图建模的密集基因组测序数据新型单倍型块检测方法。

A new haplotype block detection method for dense genome sequencing data based on interval graph modeling of clusters of highly correlated SNPs.

机构信息

Department of Statistics.

Department of Mathematics Education, Seoul National University, Seoul 08826, South Korea.

出版信息

Bioinformatics. 2018 Feb 1;34(3):388-397. doi: 10.1093/bioinformatics/btx609.

DOI:10.1093/bioinformatics/btx609
PMID:29028986
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5860363/
Abstract

MOTIVATION

Linkage disequilibrium (LD) block construction is required for research in population genetics and genetic epidemiology, including specification of sets of single nucleotide polymorphisms (SNPs) for analysis of multi-SNP based association and identification of haplotype blocks in high density sequencing data. Existing methods based on a narrow sense definition do not allow intermediate regions of low LD between strongly associated SNP pairs and tend to split high density SNP data into small blocks having high between-block correlation.

RESULTS

We present Big-LD, a block partition method based on interval graph modeling of LD bins which are clusters of strong pairwise LD SNPs, not necessarily physically consecutive. Big-LD uses an agglomerative approach that starts by identifying small communities of SNPs, i.e. the SNPs in each LD bin region, and proceeds by merging these communities. We determine the number of blocks using a method to find maximum-weight independent set. Big-LD produces larger LD blocks compared to existing methods such as MATILDE, Haploview, MIG ++, or S-MIG ++ and the LD blocks better agree with recombination hotspot locations determined by sperm-typing experiments. The observed average runtime of Big-LD for 13 288 240 non-monomorphic SNPs from 1000 Genomes Project autosome data (286 East Asians) is about 5.83 h, which is a significant improvement over the existing methods.

AVAILABILITY AND IMPLEMENTATION

Source code and documentation are available for download at http://github.com/sunnyeesl/BigLD.

CONTACT

yyoo@snu.ac.kr.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

连锁不平衡 (LD) 块构建对于群体遗传学和遗传流行病学的研究是必需的,包括指定一组单核苷酸多态性 (SNP) 用于分析基于多 SNP 的关联以及在高密度测序数据中识别单倍型块。现有的基于狭义定义的方法不允许在强相关 SNP 对之间存在中间低 LD 区域,并且往往会将高密度 SNP 数据分割成具有高块间相关性的小块。

结果

我们提出了 Big-LD,这是一种基于 LD 箱的区间图建模的块划分方法,LD 箱是强成对 LD SNP 的聚类,不一定是物理上连续的。Big-LD 使用一种凝聚方法,从识别 SNP 小社区(即每个 LD 箱区域中的 SNP)开始,然后通过合并这些社区来进行。我们使用一种方法来确定块的数量,该方法用于找到最大权独立集。与 MATILDE、Haploview、MIG ⁇ ++ 或 S-MIG ⁇ ++ 等现有方法相比,Big-LD 生成的 LD 块更大,并且 LD 块与由精子分型实验确定的重组热点位置更好地一致。对于来自 1000 基因组计划常染色体数据(286 个东亚人)的 13288240 个非同义 SNPs,观察到的 Big-LD 的平均运行时间约为 5.83 小时,这与现有方法相比有了显著的改进。

可用性和实现

源代码和文档可在 http://github.com/sunnyeesl/BigLD 下载。

联系人

yyoo@snu.ac.kr。

补充信息

补充数据可在生物信息学在线获得。