Halushka M K, Fan J B, Bentley K, Hsie L, Shen N, Weder A, Cooper R, Lipshutz R, Chakravarti A
Department of Genetics and Center for Human Genetics, Case Western Reserve University School of Medicine and University Hospitals of Cleveland, Ohio 44106, USA.
Nat Genet. 1999 Jul;22(3):239-47. doi: 10.1038/10297.
Sequence variation in human genes is largely confined to single-nucleotide polymorphisms (SNPs) and is valuable in tests of association with common diseases and pharmacogenetic traits. We performed a systematic and comprehensive survey of molecular variation to assess the nature, pattern and frequency of SNPs in 75 candidate human genes for blood-pressure homeostasis and hypertension. We assayed 28 Mb (190 kb in 148 alleles) of genomic sequence, comprising the 5' and 3' untranslated regions (UTRs), introns and coding sequence of these genes, for sequence differences in individuals of African and Northern European descent using high-density variant detection arrays (VDAs). We identified 874 candidate human SNPs, of which 22% were confirmed by DNA sequencing to reveal a discordancy rate of 21% for VDA detection. The SNPs detected have an average minor allele frequency of 11%, and 387 are within the coding sequence (cSNPs). Of all cSNPs, 54% lead to a predicted change in the protein sequence, implying a high level of human protein diversity. These protein-altering SNPs are 38% of the total number of such SNPs expected, are more likely to be population-specific and are rarer in the human population, directly demonstrating the effects of natural selection on human genes. Overall, the degree of nucleotide polymorphism across these human genes, and orthologous great ape sequences, is highly variable and is correlated with the effects of functional conservation on gene sequences.
人类基因中的序列变异主要局限于单核苷酸多态性(SNP),在与常见疾病和药物遗传学性状的关联测试中具有重要价值。我们对分子变异进行了系统而全面的调查,以评估75个参与血压稳态和高血压调节的候选人类基因中SNP的性质、模式和频率。我们使用高密度变异检测阵列(VDA)对28 Mb(148个等位基因中的190 kb)的基因组序列进行了检测,这些序列包括这些基因的5'和3'非翻译区(UTR)、内含子和编码序列,检测对象为非洲裔和北欧裔个体的序列差异。我们鉴定出874个候选人类SNP,其中22%经DNA测序确认,结果显示VDA检测的不一致率为21%。检测到的SNP平均次要等位基因频率为11%,其中387个位于编码序列内(cSNP)。在所有cSNP中,54%导致蛋白质序列发生预测性变化,这意味着人类蛋白质具有高度多样性。这些导致蛋白质改变的SNP占预期此类SNP总数的38%,更可能具有种群特异性,且在人类群体中较为罕见,这直接证明了自然选择对人类基因的影响。总体而言,这些人类基因以及直系同源的大猩猩序列中的核苷酸多态性程度高度可变,且与功能保守性对基因序列的影响相关。