Packer Bernice R, Yeager Meredith, Burdett Laura, Welch Robert, Beerman Michael, Qi Liqun, Sicotte Hugues, Staats Brian, Acharya Mekhala, Crenshaw Andrew, Eckert Andrew, Puri Vinita, Gerhard Daniela S, Chanock Stephen J
Intramural Research Support Program, SAIC-Frederick, NCI-FCRDC, Frederick, MD, USA.
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D617-21. doi: 10.1093/nar/gkj151.
The SNP500Cancer database provides sequence and genotype assay information for candidate SNPs useful in mapping complex diseases, such as cancer. The database is an integral component of the NCI Cancer Genome Anatomy Project (http://cgap.nci.nih.gov). SNP500Cancer reports sequence analysis of anonymized control DNA samples (n = 102 Coriell samples representing four self-described ethnic groups: African/African-American, Caucasian, Hispanic and Pacific Rim). The website is searchable by gene, chromosome, gene ontology pathway, dbSNP ID and SNP500Cancer SNP ID. As of October 2005, the database contains >13 400 SNPs, 9124 of which have been sequenced in the SNP500Cancer population. For each analysed SNP, gene location and >200 bp of surrounding annotated sequence (including nearby SNPs) are provided, with frequency information in total and per subpopulation as well as calculation of Hardy-Weinberg equilibrium for each subpopulation. The website provides the conditions for validated sequencing and genotyping assays, as well as genotype results for the 102 samples, in both viewable and downloadable formats. A subset of sequence validated SNPs with minor allele frequency >5% are entered into a high-throughput pipeline for genotyping analysis to determine concordance for the same 102 samples. In addition, the results of genotype analysis for select validated SNP assays (defined as 100% concordance between sequence analysis and genotype results) are posted for an additional 280 samples drawn from the Human Diversity Panel (HDP). SNP500Cancer provides an invaluable resource for investigators to select SNPs for analysis, design genotyping assays using validated sequence data, choose selected assays already validated on one or more genotyping platforms, and select reference standards for genotyping assays. The SNP500Cancer database is freely accessible via the web page at http://snp500cancer.nci.nih.gov.
SNP500Cancer数据库提供了对绘制复杂疾病(如癌症)有用的候选单核苷酸多态性(SNP)的序列和基因型检测信息。该数据库是美国国立癌症研究所(NCI)癌症基因组解剖计划(http://cgap.nci.nih.gov)的一个组成部分。SNP500Cancer报告了匿名对照DNA样本(n = 102个Coriell样本,代表四个自我描述的种族群体:非洲/非裔美国人、白种人、西班牙裔和环太平洋地区人)的序列分析。该网站可通过基因、染色体、基因本体途径、dbSNP ID和SNP500Cancer SNP ID进行搜索。截至2005年10月,该数据库包含超过13400个SNP,其中9124个已在SNP500Cancer人群中进行了测序。对于每个分析的SNP,提供了基因位置和超过200 bp的周围注释序列(包括附近的SNP),以及总体和每个亚群的频率信息,以及每个亚群的哈迪-温伯格平衡计算。该网站以可查看和可下载的格式提供了经过验证的测序和基因分型检测的条件,以及102个样本的基因型结果。次要等位基因频率>5%的经过序列验证的SNP子集被输入高通量基因分型分析流程,以确定相同102个样本的一致性。此外,还公布了从人类多样性面板(HDP)抽取的另外280个样本的选定经过验证的SNP检测(定义为序列分析和基因型结果之间100%一致)的基因型分析结果。SNP500Cancer为研究人员提供了一个宝贵的资源,用于选择SNP进行分析、使用经过验证的序列数据设计基因分型检测、选择已经在一个或多个基因分型平台上经过验证的选定检测,以及选择基因分型检测的参考标准。可通过网页http://snp500cancer.nci.nih.gov免费访问SNP500Cancer数据库。