selscan: an efficient multithreaded program to perform EHH-based scans for positive selection.

Authors

Szpiech Zachary A, Hernandez Ryan D

Affiliations

Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco

Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco; Institute for Human Genetics, University of California, San Francisco; Institute for Quantitative Biosciences (QB3), University of California, San Francisco.

Publication information

Mol Biol Evol. 2014 Oct;31(10):2824-7. doi: 10.1093/molbev/msu211. Epub 2014 Jul 10.

DOI: 10.1093/molbev/msu211
PMID: 25015648
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4166924/
Abstract

Haplotype-based scans to detect natural selection are useful to identify recent or ongoing positive selection in genomes. As both real and simulated genomic data sets grow larger, spanning thousands of samples and millions of markers, there is a need for a fast and efficient implementation of these scans for general use. Here, we present selscan, an efficient multithreaded application that implements Extended Haplotype Homozygosity (EHH), Integrated Haplotype Score (iHS), and Cross-population EHH (XPEHH). selscan accepts phased genotypes in multiple formats, including TPED, and performs extremely well on both simulated and real data and over an order of magnitude faster than existing available implementations. It calculates iHS on chromosome 22 (22,147 loci) across 204 CEU haplotypes in 353 s on one thread (33 s on 16 threads) and calculates XPEHH for the same data relative to 210 YRI haplotypes in 578 s on one thread (52 s on 16 threads). Source code and binaries (Windows, OSX, and Linux) are available at https://github.com/szpiech/selscan.
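
As an illustration of the statistic underlying all three scans named in the abstract, the following is a minimal Python sketch of EHH around a core site. It is not selscan's code (selscan is a compiled multithreaded program), and the function name `ehh`, its arguments, and the toy haplotypes are assumptions made for this example. It uses the standard definition: among the haplotypes carrying a given core allele, EHH out to a marker is the probability that two randomly chosen carriers are identical across the whole interval from the core to that marker.

```python
from collections import Counter
from math import comb


def ehh(haplotypes, core, allele):
    """Return {marker_index: EHH} for haplotypes carrying `allele` at `core`.

    haplotypes: list of equal-length strings of '0'/'1' (phased data).
    core:       index of the core marker.
    allele:     '0' or '1', the core allele to condition on.
    Hypothetical helper for illustration; not part of selscan.
    """
    carriers = [h for h in haplotypes if h[core] == allele]
    n = len(carriers)
    if n < 2:
        raise ValueError("need at least two haplotypes carrying the core allele")

    result = {}
    for end in range(len(carriers[0])):
        lo, hi = min(core, end), max(core, end)
        # Group carriers by their sequence over the interval [lo, hi].
        counts = Counter(h[lo:hi + 1] for h in carriers)
        # Fraction of carrier pairs that are identical over the interval.
        result[end] = sum(comb(c, 2) for c in counts.values()) / comb(n, 2)
    return result


# Toy data (made up): five haplotypes, four markers, core at index 1.
toy = ["0100", "0101", "0110", "1100", "0011"]
decay = ehh(toy, core=1, allele="1")
for marker, value in sorted(decay.items()):
    print(marker, round(value, 3))
```

EHH equals 1 at the core and decays with distance as recombination and mutation break up the shared haplotype. iHS and XPEHH are built from the area under such decay curves, compared between the two core alleles or between two populations respectively. This sketch recomputes the grouping from scratch for every marker and is meant only to make the definition concrete, not to be fast.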

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/455e/4166924/5b8a54514367/msu211f1p.jpg
