Whistler Toni, Chiang Cheng-Feng, Lin Jin-Mann, Lonergan William, Reeves William C
Chronic Viral Diseases Branch, Division of Viral and Rickettsial Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia 30329, USA.
J Biomol Tech. 2010 Apr;21(1):44-53.
Understanding the biologic significance of alternative splicing has been impeded by the difficulty in systematically identifying and validating transcript isoforms. Current exon array workflows suggest several different filtration steps to reduce the number of tests and increase the detection of alternative splicing events. In this study, we examine the effects of the suggested pre-analysis filtration by detection above background P value or signal intensity. This is followed post-analytically by restriction of exon expression to a fivefold change between groups, limiting the analysis to known alternative splicing events, or using the intersection of the results from different algorithms. Combinations of the filters are also examined. We find that none of the filtering methods reduces the number of technical false-positive calls identified by visual inspection. These include edge effects, nonresponsive probe sets, and inclusion of intronic and untranslated region probe sets into transcript annotations. Modules for filtering the exon microarray data on the basis of annotation features are needed. We propose new approaches to data filtration that would reduce the number of technical false-positives and therefore, impact the time spent performing visual inspection of the exon arrays.
由于难以系统地识别和验证转录本异构体,对可变剪接生物学意义的理解受到了阻碍。当前的外显子芯片工作流程提出了几个不同的过滤步骤,以减少测试数量并增加可变剪接事件的检测。在本研究中,我们通过高于背景P值或信号强度的检测来检验建议的分析前过滤的效果。分析后,接着通过将外显子表达限制在组间五倍变化、将分析限于已知的可变剪接事件或使用不同算法结果的交集来进行。还检验了过滤器的组合。我们发现,没有一种过滤方法能减少通过目视检查识别出的技术假阳性调用数量。这些包括边缘效应、无反应的探针集以及将内含子和非翻译区探针集纳入转录本注释。需要基于注释特征对外显子芯片数据进行过滤的模块。我们提出了新的数据过滤方法,这将减少技术假阳性数量,从而影响对外显子芯片进行目视检查所花费的时间。