Department of Electronics and Communication Engineering, National Institute of Technology, Rourkela, India.
Genomics Proteomics Bioinformatics. 2011 Apr;9(1-2):45-55. doi: 10.1016/S1672-0229(11)60007-7.
Accurate identification of protein-coding regions (exons) in DNA sequences has been a challenging task in bioinformatics. Particularly the coding regions have a 3-base periodicity, which forms the basis of all exon identification methods. Many signal processing tools and techniques have been applied successfully for the identification task but still improvement in this direction is needed. In this paper, we have introduced a new promising model-independent time-frequency filtering technique based on S-transform for accurate identification of the coding regions. The S-transform is a powerful linear time-frequency representation useful for filtering in time-frequency domain. The potential of the proposed technique has been assessed through simulation study and the results obtained have been compared with the existing methods using standard datasets. The comparative study demonstrates that the proposed method outperforms its counterparts in identifying the coding regions.
在生物信息学中,准确识别 DNA 序列中的蛋白质编码区域(外显子)一直是一项具有挑战性的任务。特别是编码区域具有 3 个碱基的周期性,这构成了所有外显子识别方法的基础。许多信号处理工具和技术已成功应用于识别任务,但在这方面仍需要改进。在本文中,我们引入了一种基于 S 变换的新的有前途的、模型独立的时频滤波技术,用于准确识别编码区域。S 变换是一种强大的线性时频表示,可用于时频域滤波。通过模拟研究评估了所提出技术的潜力,并使用标准数据集将获得的结果与现有方法进行了比较。比较研究表明,该方法在识别编码区域方面优于其他方法。