• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SPSmart:使基于群体的单核苷酸多态性(SNP)基因型数据库适用于快速全面的网络访问。

SPSmart: adapting population based SNP genotype databases for fast and comprehensive web access.

作者信息

Amigo Jorge, Salas Antonio, Phillips Christopher, Carracedo Angel

机构信息

Spanish National Genotyping Center (CeGen), Genomic Medicine Group, CIBERER, University of Santiago de Compostela, Galicia, Spain.

出版信息

BMC Bioinformatics. 2008 Oct 10;9:428. doi: 10.1186/1471-2105-9-428.

DOI:10.1186/1471-2105-9-428
PMID:18847484
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2576268/
Abstract

BACKGROUND

In the last five years large online resources of human variability have appeared, notably HapMap, Perlegen and the CEPH foundation. These databases of genotypes with population information act as catalogues of human diversity, and are widely used as reference sources for population genetics studies. Although many useful conclusions may be extracted by querying databases individually, the lack of flexibility for combining data from within and between each database does not allow the calculation of key population variability statistics.

RESULTS

We have developed a novel tool for accessing and combining large-scale genomic databases of single nucleotide polymorphisms (SNPs) in widespread use in human population genetics: SPSmart (SNPs for Population Studies). A fast pipeline creates and maintains a data mart from the most commonly accessed databases of genotypes containing population information: data is mined, summarized into the standard statistical reference indices, and stored into a relational database that currently handles as many as 4 x 10(9) genotypes and that can be easily extended to new database initiatives. We have also built a web interface to the data mart that allows the browsing of underlying data indexed by population and the combining of populations, allowing intuitive and straightforward comparison of population groups. All the information served is optimized for web display, and most of the computations are already pre-processed in the data mart to speed up the data browsing and any computational treatment requested.

CONCLUSION

In practice, SPSmart allows populations to be combined into user-defined groups, while multiple databases can be accessed and compared in a few simple steps from a single query. It performs the queries rapidly and gives straightforward graphical summaries of SNP population variability through visual inspection of allele frequencies outlined in standard pie-chart format. In addition, full numerical description of the data is output in statistical results panels that include common population genetics metrics such as heterozygosity, Fst and In.

摘要

背景

在过去五年中,出现了大量关于人类变异性的在线资源,尤其是HapMap、Perlegen和CEPH基金会。这些包含群体信息的基因型数据库充当了人类多样性的目录,并被广泛用作群体遗传学研究的参考来源。尽管通过单独查询数据库可以提取许多有用的结论,但缺乏将每个数据库内部和之间的数据进行组合的灵活性,无法计算关键的群体变异性统计数据。

结果

我们开发了一种新颖的工具,用于访问和组合在人类群体遗传学中广泛使用的单核苷酸多态性(SNP)大规模基因组数据库:SPSmart(用于群体研究的SNP)。一个快速管道从最常用的包含群体信息的基因型数据库创建并维护一个数据集市:数据被挖掘、汇总为标准统计参考指标,并存储到一个关系数据库中,该数据库目前可处理多达4×10⁹个基因型,并且可以轻松扩展到新的数据库计划。我们还为数据集市构建了一个网络界面,允许按群体浏览基础数据并组合群体,从而直观、直接地比较群体。提供的所有信息都针对网络显示进行了优化,并且大多数计算已经在数据集市中进行了预处理,以加快数据浏览和任何所需的计算处理。

结论

在实践中,SPSmart允许将群体组合成用户定义的组,同时可以通过单个查询中的几个简单步骤访问和比较多个数据库。它能快速执行查询,并通过以标准饼图格式概述的等位基因频率的视觉检查,给出SNP群体变异性的直接图形摘要。此外,数据的完整数值描述在统计结果面板中输出,其中包括常见的群体遗传学指标,如杂合度、Fst和In。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/2576268/442c4cfa64cb/1471-2105-9-428-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/2576268/485ebb422015/1471-2105-9-428-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/2576268/4ee87c62f4ea/1471-2105-9-428-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/2576268/792938a40677/1471-2105-9-428-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/2576268/442c4cfa64cb/1471-2105-9-428-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/2576268/485ebb422015/1471-2105-9-428-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/2576268/4ee87c62f4ea/1471-2105-9-428-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/2576268/792938a40677/1471-2105-9-428-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce1a/2576268/442c4cfa64cb/1471-2105-9-428-4.jpg

相似文献

1
SPSmart: adapting population based SNP genotype databases for fast and comprehensive web access.SPSmart:使基于群体的单核苷酸多态性(SNP)基因型数据库适用于快速全面的网络访问。
BMC Bioinformatics. 2008 Oct 10;9:428. doi: 10.1186/1471-2105-9-428.
2
ENGINES: exploring single nucleotide variation in entire human genomes.引擎:探索整个人类基因组中的单核苷酸变异。
BMC Bioinformatics. 2011 Apr 19;12:105. doi: 10.1186/1471-2105-12-105.
3
Viability of in-house datamarting approaches for population genetics analysis of SNP genotypes.内部数据集市方法用于SNP基因型群体遗传学分析的可行性。
BMC Bioinformatics. 2009 Mar 19;10 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2105-10-S3-S5.
4
SNP Function Portal: a web database for exploring the function implication of SNP alleles.单核苷酸多态性(SNP)功能门户:一个用于探索SNP等位基因功能含义的网络数据库。
Bioinformatics. 2006 Jul 15;22(14):e523-9. doi: 10.1093/bioinformatics/btl241.
5
SNP-PHAGE--High throughput SNP discovery pipeline.SNP-噬菌体——高通量单核苷酸多态性发现流程
BMC Bioinformatics. 2006 Oct 23;7:468. doi: 10.1186/1471-2105-7-468.
6
The SNPforID browser: an online tool for query and display of frequency data from the SNPforID project.SNPforID浏览器:一个用于查询和显示SNPforID项目频率数据的在线工具。
Int J Legal Med. 2008 Sep;122(5):435-40. doi: 10.1007/s00414-008-0233-7. Epub 2008 May 20.
7
SNP@Evolution: a hierarchical database of positive selection on the human genome.SNP@进化:人类基因组正向选择的分层数据库。
BMC Evol Biol. 2009 Sep 5;9:221. doi: 10.1186/1471-2148-9-221.
8
Frequency Finder: a multi-source web application for collection of public allele frequencies of SNP markers.频率查找器:一个用于收集单核苷酸多态性(SNP)标记公共等位基因频率的多源网络应用程序。
Bioinformatics. 2004 Feb 12;20(3):439-43. doi: 10.1093/bioinformatics/btg446. Epub 2004 Jan 22.
9
A SNP-centric database for the investigation of the human genome.一个以单核苷酸多态性为中心的用于人类基因组研究的数据库。
BMC Bioinformatics. 2004 Mar 26;5:33. doi: 10.1186/1471-2105-5-33.
10
WASP: a Web-based Allele-Specific PCR assay designing tool for detecting SNPs and mutations.WASP:一种基于网络的用于检测单核苷酸多态性(SNP)和突变的等位基因特异性聚合酶链反应(PCR)分析设计工具。
BMC Genomics. 2007 Aug 14;8:275. doi: 10.1186/1471-2164-8-275.

引用本文的文献

1
HSD17B7 gene in self-renewal and oncogenicity of keratinocytes from Black versus White populations.HSD17B7 基因在黑种人与白种人角质形成细胞自我更新和致癌性中的作用。
EMBO Mol Med. 2021 Jul 7;13(7):e14133. doi: 10.15252/emmm.202114133. Epub 2021 Jun 29.
2
Inter-laboratory study on standardized MPS libraries: evaluation of performance, concordance, and sensitivity using mixtures and degraded DNA.标准化微卫星文库的实验室间研究:使用混合物和降解DNA评估性能、一致性和灵敏度
Int J Legal Med. 2020 Jan;134(1):185-198. doi: 10.1007/s00414-019-02201-2. Epub 2019 Nov 19.
3
A set of novel SNP loci for differentiating continental populations and three Chinese populations.

本文引用的文献

1
Worldwide human relationships inferred from genome-wide patterns of variation.从全基因组变异模式推断全球人类关系。
Science. 2008 Feb 22;319(5866):1100-4. doi: 10.1126/science.1153717.
2
Standardized subsets of the HGDP-CEPH Human Genome Diversity Cell Line Panel, accounting for atypical and duplicated samples and pairs of close relatives.HGDP-CEPH人类基因组多样性细胞系面板的标准化子集,包括非典型和重复样本以及近亲对。
Ann Hum Genet. 2006 Nov;70(Pt 6):841-7. doi: 10.1111/j.1469-1809.2006.00285.x.
3
A haplotype map of the human genome.
一组用于区分不同大陆人群和三个中国人群的新型单核苷酸多态性(SNP)位点。
PeerJ. 2019 Mar 29;7:e6508. doi: 10.7717/peerj.6508. eCollection 2019.
4
Pharmacogenetic association study on clopidogrel response in Puerto Rican Hispanics with cardiovascular disease: a novel characterization of a Caribbean population.波多黎各裔西班牙裔心血管疾病患者氯吡格雷反应的药物遗传学关联研究:加勒比人群的新特征
Pharmgenomics Pers Med. 2018 Jun 8;11:95-106. doi: 10.2147/PGPM.S165805. eCollection 2018.
5
Clinical Relevant Polymorphisms Affecting Clopidogrel Pharmacokinetics and Pharmacodynamics: Insights from the Puerto Rico Newborn Screening Program.临床相关的影响氯吡格雷药代动力学和药效学的多态性:来自波多黎各新生儿筛查计划的见解。
Int J Environ Res Public Health. 2018 May 30;15(6):1115. doi: 10.3390/ijerph15061115.
6
High burden of birthweight-lowering genetic variants in Africans and Asians.非洲人和亚洲人生育体重降低相关遗传变异的负担很高。
BMC Med. 2018 May 24;16(1):70. doi: 10.1186/s12916-018-1061-3.
7
Whole Exome Sequencing Identifies New Host Genomic Susceptibility Factors in Empyema Caused by Streptococcus pneumoniae in Children: A Pilot Study.全外显子组测序鉴定儿童肺炎链球菌所致脓胸新的宿主基因组易感性因素:一项初步研究
Genes (Basel). 2018 May 3;9(5):240. doi: 10.3390/genes9050240.
8
A SNP panel for identification of DNA and RNA specimens.用于鉴定 DNA 和 RNA 样本的 SNP 面板。
BMC Genomics. 2018 Jan 25;19(1):90. doi: 10.1186/s12864-018-4482-7.
9
Whole Exome Sequencing reveals new candidate genes in host genomic susceptibility to Respiratory Syncytial Virus Disease.全外显子组测序揭示了宿主基因组对呼吸道合胞病毒病易感性的新候选基因。
Sci Rep. 2017 Nov 21;7(1):15888. doi: 10.1038/s41598-017-15752-4.
10
SNP typing using the HID-Ion AmpliSeq™ Identity Panel in a southern Chinese population.在中国南方人群中使用HID-Ion AmpliSeq™身份鉴定试剂盒进行单核苷酸多态性分型。
Int J Legal Med. 2018 Jul;132(4):997-1006. doi: 10.1007/s00414-017-1706-3. Epub 2017 Oct 18.
人类基因组单倍型图谱。
Nature. 2005 Oct 27;437(7063):1299-320. doi: 10.1038/nature04226.
4
The International HapMap Project Web site.国际人类基因组单体型图计划网站。
Genome Res. 2005 Nov;15(11):1592-3. doi: 10.1101/gr.4413105.
5
Perlegen sciences, inc.珀勒根科学公司
Pharmacogenomics. 2005 Jun;6(4):439-42. doi: 10.1517/14622416.6.4.439.
6
Informativeness of genetic markers for inference of ancestry.用于推断血统的遗传标记的信息性。
Am J Hum Genet. 2003 Dec;73(6):1402-22. doi: 10.1086/380416. Epub 2003 Nov 20.
7
A human genome diversity cell line panel.一个人类基因组多样性细胞系面板。
Science. 2002 Apr 12;296(5566):261-2. doi: 10.1126/science.296.5566.261b.
8
The allelic correlation structure of Gainj- and Kalam-speaking people. I. The estimation and interpretation of Wright's F-statistics.讲盖因吉语和卡拉姆语人群的等位基因相关结构。I. 赖特F统计量的估计与解释。
Genetics. 1986 Mar;112(3):629-47. doi: 10.1093/genetics/112.3.629.