Suppr超能文献

PyPop:一个用于群体基因组学的软件框架:分析大规模多位点基因型数据。

PyPop: a software framework for population genomics: analyzing large-scale multi-locus genotype data.

作者信息

Lancaster Alex, Nelson Mark P, Meyer Diogo, Single Richard M, Thomson Glenys

机构信息

Department of Integrative Biology, University of California, Berkeley, 3060 Valley Life Sciences, Berkeley, CA 94720, USA.

出版信息

Pac Symp Biocomput. 2003:514-25.

Abstract

Software to analyze multi-locus genotype data for entire populations is useful for estimating haplotype frequencies, deviation from Hardy-Weinberg equilibrium and patterns of linkage disequilibrium. These statistical results are important to both those interested in human genome variation and disease predisposition as well as evolutionary genetics. As part of the 13th International Histocompatibility and Immunogenetics Working Group (IHWG), we have developed a software framework (PyPop). The primary novelty of this package is that it allows integration of statistics across large numbers of data-sets by heavily utilizing the XML file format and the R statistical package to view graphical output, while retaining the ability to inter-operate with existing software. Largely developed to address human population data, it can, however, be used for population based data for any organism. We tested our software on the data from the 13th IHWG which involved data sets from at least 50 laboratories each of up to 1000 individuals with 9 MHC loci (both class I and class II) and found that it scales to large numbers of data sets well.

摘要

用于分析整个人群多位点基因型数据的软件,对于估计单倍型频率、偏离哈迪-温伯格平衡的程度以及连锁不平衡模式很有用。这些统计结果对于关注人类基因组变异和疾病易感性的人以及进化遗传学领域的人都很重要。作为第13届国际组织相容性和免疫遗传学工作组(IHWG)的一部分,我们开发了一个软件框架(PyPop)。该软件包的主要新颖之处在于,它通过大量使用XML文件格式和R统计软件包来查看图形输出,从而允许整合大量数据集的统计信息,并保留了与现有软件进行互操作的能力。虽然它主要是为处理人类群体数据而开发的,但也可用于任何生物体的群体数据。我们使用第13届IHWG的数据对我们的软件进行了测试,这些数据涉及至少50个实验室的数据集,每个实验室有多达1000个个体,包含9个MHC基因座(I类和II类),结果发现它能很好地扩展到大量数据集。

相似文献

2
PyPop update--a software pipeline for large-scale multilocus population genomics.
Tissue Antigens. 2007 Apr;69 Suppl 1(0 1):192-7. doi: 10.1111/j.1399-0039.2006.00769.x.
3
PyPop: a mature open-source software pipeline for population genomics.
Front Immunol. 2024 Apr 2;15:1378512. doi: 10.3389/fimmu.2024.1378512. eCollection 2024.
4
mixIndependR: a R package for statistical independence testing of loci in database of multi-locus genotypes.
BMC Bioinformatics. 2021 Jan 6;22(1):12. doi: 10.1186/s12859-020-03945-0.
7
HLA class I (A, B, C) and class II (DRB1, DQA1, DQB1, DPB1) alleles and haplotypes in the Han from southern China.
Tissue Antigens. 2007 Dec;70(6):455-63. doi: 10.1111/j.1399-0039.2007.00932.x. Epub 2007 Sep 27.
8
14th International HLA and Immunogenetics Workshop: report of progress in methodology, data collection, and analyses.
Tissue Antigens. 2007 Apr;69 Suppl 1:185-7. doi: 10.1111/j.1399-0039.2006.00767.x.
9
Haplotype inference and block partitioning in mixed population samples.
J Bioinform Comput Biol. 2008 Dec;6(6):1177-92. doi: 10.1142/s0219720008003898.
10
htSNPer1.0: software for haplotype block partition and htSNPs selection.
BMC Bioinformatics. 2005 Mar 1;6:38. doi: 10.1186/1471-2105-6-38.

引用本文的文献

1
Natural Selection on HLA-DPB1 Amino Acids Operates Primarily on DP Serologic Categories.
Hum Immunol. 2024 Oct 25;85(6):111153. doi: 10.1016/j.humimm.2024.111153.
2
Population genetic dissection of HLA-DPB1 amino acid polymorphism to infer selection.
Hum Immunol. 2024 Nov;85(6):111151. doi: 10.1016/j.humimm.2024.111151. Epub 2024 Oct 15.
3
PyPop: a mature open-source software pipeline for population genomics.
Front Immunol. 2024 Apr 2;15:1378512. doi: 10.3389/fimmu.2024.1378512. eCollection 2024.
4
Analysis of the Origin of Emiratis as Inferred from a Family Study Based on , , , -, and Genes.
Genes (Basel). 2023 May 26;14(6):1159. doi: 10.3390/genes14061159.
5
Gene Polymorphisms Increase the Susceptibility to Tuberculosis.
Pharmgenomics Pers Med. 2023 Apr 13;16:325-336. doi: 10.2147/PGPM.S404339. eCollection 2023.
7
Exome Sequencing Reveals a Putative Role for HLA-C*03:02 in Control of HIV-1 in African Pediatric Populations.
Front Genet. 2021 Aug 26;12:720213. doi: 10.3389/fgene.2021.720213. eCollection 2021.
9
Mapping the Human Leukocyte Antigen Diversity among Croatian Regions: Implication in Transplantation.
J Immunol Res. 2021 Apr 7;2021:6670960. doi: 10.1155/2021/6670960. eCollection 2021.
10
Demographic history and selection at HLA loci in Native Americans.
PLoS One. 2020 Nov 4;15(11):e0241282. doi: 10.1371/journal.pone.0241282. eCollection 2020.

本文引用的文献

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验