
Silhouette scores for assessment of SNP genotype clusters.

Authors

Lovmar Lovisa, Ahlford Annika, Jonsson Mats, Syvänen Ann-Christine

Affiliation

Molecular Medicine, Department of Medical Sciences, Uppsala University, Uppsala, Sweden.

Publication

BMC Genomics. 2005 Mar 10;6:35. doi: 10.1186/1471-2164-6-35.

DOI:10.1186/1471-2164-6-35
PMID:15760469
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC555759/
Abstract

BACKGROUND

High-throughput genotyping of single nucleotide polymorphisms (SNPs) generates large amounts of data. In many SNP genotyping assays, the genotype assignment is based on scatter plots of signals corresponding to the two SNP alleles. In a robust assay, the three clusters that define the genotypes are well separated, and the distances between the data points within a cluster are short. "Silhouettes" is a graphical aid for the interpretation and validation of data clusters that provides a measure of how well a data point was classified when it was assigned to a cluster. Thus, "Silhouettes" can potentially be used as a quality measure for SNP genotyping results and for objective comparison of the performance of SNP assays under different conditions.
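
For orientation, the per-point measure described above is usually written as follows. This is the standard Silhouette definition from the cluster-analysis literature; the abstract does not spell out the formula, so whether ClusterA uses exactly this form is an assumption. The symbols $a(i)$, $b(i)$, $d(i, j)$, $A$ and $C$ are introduced here for illustration: $A$ is the cluster to which point $i$ was assigned, $C$ ranges over the other clusters, and $d$ is the distance between two points in the scatter plot.

```latex
\[
a(i) = \frac{1}{|A| - 1} \sum_{j \in A,\, j \neq i} d(i, j), \qquad
b(i) = \min_{C \neq A} \frac{1}{|C|} \sum_{j \in C} d(i, j),
\]
\[
s(i) = \frac{b(i) - a(i)}{\max\{a(i),\, b(i)\}}, \qquad -1 \le s(i) \le 1 .
\]
```

A point that lies close to the other members of its own cluster and far from the nearest other cluster has $a(i) \ll b(i)$, so $s(i)$ approaches 1; a point that lies closer to another cluster than to its own has $s(i) < 0$.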

RESULTS

We created a program (ClusterA) for calculating "Silhouette scores" and applied it to assess the quality of SNP genotype clusters obtained by single nucleotide primer extension ("minisequencing") in the Tag-microarray format. A Silhouette score condenses the quality of the genotype assignment for each SNP assay into a single numeric value, which ranges from 1.0, when the genotype assignment is unequivocal, down to -1.0, when the genotype assignment has been arbitrary. In the present study, we applied Silhouette scores to compare the performance of four DNA polymerases in our minisequencing system by analyzing 26 SNPs in both DNA polarities in 16 DNA samples. We found that Silhouettes provide a relevant measure of the quality of SNP assays under different reaction conditions, illustrated here by the four DNA polymerases. According to our results, the genotypes can be unequivocally assigned without manual inspection when the Silhouette score for an SNP assay is > 0.65. All four DNA polymerases performed satisfactorily in our Tag-array minisequencing system.
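
To make the scoring idea concrete, here is a minimal Python sketch of a per-assay Silhouette score. It is not the ClusterA program: it assumes the standard Silhouette definition with Euclidean distance, assumes that the per-assay score is the mean of the per-point values, and uses made-up signal coordinates. The function names (`silhouette_values`, `silhouette_score`, `needs_manual_inspection`) are hypothetical; only the 0.65 cutoff comes from the abstract.

```python
import math
from statistics import mean


def silhouette_values(points, labels):
    """Per-point Silhouette values s(i) = (b - a) / max(a, b)."""
    values = []
    for i, (p, own) in enumerate(zip(points, labels)):
        # Collect distances from point i to every other point, grouped by cluster.
        dists = {}
        for j, (q, lab) in enumerate(zip(points, labels)):
            if j != i:
                dists.setdefault(lab, []).append(math.dist(p, q))
        others = [mean(d) for lab, d in dists.items() if lab != own]
        if own not in dists or not others:
            # Assumed convention: a singleton cluster, or data with only
            # one cluster, contributes 0.
            values.append(0.0)
            continue
        a = mean(dists[own])  # mean distance within the point's own cluster
        b = min(others)       # mean distance to the nearest other cluster
        values.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return values


def silhouette_score(points, labels):
    """Assumed per-assay score: the mean of the per-point values."""
    return mean(silhouette_values(points, labels))


def needs_manual_inspection(score, threshold=0.65):
    """Flag assays at or below the 0.65 cutoff reported in the abstract."""
    return score <= threshold


# Hypothetical two-dimensional signal coordinates for six samples.
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 0.0), (5.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
clean_calls = ["AA", "AA", "AB", "AB", "BB", "BB"]  # calls follow the clusters
mixed_calls = ["AA", "AB", "BB", "AA", "AB", "BB"]  # calls ignore the clusters

clean = silhouette_score(points, clean_calls)
mixed = silhouette_score(points, mixed_calls)
print(f"clean: {clean:.2f}, manual inspection: {needs_manual_inspection(clean)}")
print(f"mixed: {mixed:.2f}, manual inspection: {needs_manual_inspection(mixed)}")
```

With these toy coordinates, the calls that follow the three tight clusters score well above 0.65, while the calls that ignore the cluster structure score below zero and would be flagged for inspection.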

CONCLUSION

"Silhouette scores" for assessing the quality of SNP genotyping clusters is convenient for evaluating the quality of SNP genotype assignment, and provides an objective, numeric measure for comparing the performance of SNP assays. The program we created for calculating Silhouette scores is freely available, and can be used for quality assessment of the results from all genotyping systems, where the genotypes are assigned by cluster analysis using scatter plots.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1363/555759/47ee76f8dd8e/1471-2164-6-35-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1363/555759/711cc5263991/1471-2164-6-35-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1363/555759/22e9c2004a73/1471-2164-6-35-2.jpg
