Suppr超能文献

G点会导致在Affymetrix微阵列中出现不正确的表达测量结果。

G-spots cause incorrect expression measurement in Affymetrix microarrays.

作者信息

Upton Graham Jg, Langdon William B, Harrison Andrew P

机构信息

Departments of Mathematical and Biological Sciences, University of Essex, Wivenhoe Park, Colchester, Essex CO43SQ, UK.

出版信息

BMC Genomics. 2008 Dec 18;9:613. doi: 10.1186/1471-2164-9-613.

Abstract

BACKGROUND

High Density Oligonucleotide arrays (HDONAs), such as the Affymetrix HG-U133A GeneChip, use sets of probes chosen to match specified genes, with the expectation that if a particular gene is highly expressed then all the probes in that gene's probe set will provide a consistent message signifying the gene's presence. However, probes that contain a G-spot (a sequence of four or more guanines) behave abnormally and it has been suggested that these probes are responding to some biochemical effect such as the formation of G-quadruplexes.

RESULTS

We have tested this expectation by examining the correlation coefficients between pairs of probes using the data on thousands of arrays that are available in the NCBI Gene Expression Omnibus (GEO) repository. We confirm the finding that G-spot probes are poorly correlated with others in their probesets and reveal that, by contrast, they are highly correlated with one another. We demonstrate that the correlation is most marked when the G-spot is at the 5' end of the probe.

CONCLUSION

Since these G-spot probes generally show little correlation with the other members of their probesets they are not fit for purpose and their values should be excluded when calculating gene expression values. This has serious implications, since more than 40% of the probesets in the HG-U133A GeneChip contain at least one such probe. Future array designs should avoid these untrustworthy probes.

摘要

背景

高密度寡核苷酸阵列(HDONAs),如Affymetrix HG-U133A基因芯片,使用经过选择以匹配特定基因的探针集,期望如果某个特定基因高度表达,那么该基因探针集中的所有探针将提供一致的信号以表明该基因的存在。然而,含有G位点(四个或更多鸟嘌呤的序列)的探针表现异常,有人认为这些探针是对某些生化效应做出反应,例如G-四链体的形成。

结果

我们通过使用NCBI基因表达综合数据库(GEO)中数千个阵列的数据检查探针之间的相关系数,来检验这一预期。我们证实了G位点探针与其探针集中的其他探针相关性较差这一发现,并揭示,相比之下,它们彼此之间高度相关。我们证明,当G位点位于探针的5'端时,相关性最为明显。

结论

由于这些G位点探针通常与其探针集中的其他成员相关性很小,它们不适合用于此目的,并且在计算基因表达值时应排除其值。这具有严重影响,因为HG-U133A基因芯片中超过40%的探针集至少包含一个这样的探针。未来的阵列设计应避免使用这些不可靠的探针。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d30b/2628396/f02fe40a9703/1471-2164-9-613-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验