• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因组序列的解释性时频分析。

Interpretive time-frequency analysis of genomic sequences.

作者信息

Hassani Saadi Hamed, Sameni Reza, Zollanvari Amin

机构信息

School of Electrical and Computer Engineering, Shiraz University, Shiraz, Iran.

Department of Electrical and Electronic Engineering, Nazarbayev University, Astana, Kazakhstan.

出版信息

BMC Bioinformatics. 2017 Mar 22;18(Suppl 4):154. doi: 10.1186/s12859-017-1524-0.

DOI:10.1186/s12859-017-1524-0
PMID:28361669
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5374637/
Abstract

BACKGROUND

Time-Frequency (TF) analysis has been extensively used for the analysis of non-stationary numeric signals in the past decade. At the same time, recent studies have statistically confirmed the non-stationarity of genomic non-numeric sequences and suggested the use of non-stationary analysis for these sequences. The conventional approach to analyze non-numeric genomic sequences using techniques specific to numerical data is to convert non-numerical data into numerical values in some way and then apply time or transform domain signal processing algorithms. Nevertheless, this approach raises questions regarding the relative magnitudes under numeric transforms, which can potentially lead to spurious patterns or misinterpretation of results.

RESULTS

In this paper, using the notion of interpretive signal processing (ISP) and by redefining correlation functions for non-numeric sequences, a general class of TF transforms are extended and applied to non-numerical genomic sequences. The technique has been successfully evaluated on synthetic and real DNA sequences.

CONCLUSION

The proposed framework is fairly generic and is believed to be useful for extracting quantitative and visual information regarding local and global periodicity, symmetry, (non-) stationarity and spectral color of genomic sequences. The notion of interpretive time-frequency analysis introduced in this work can be considered as the first step towards the development of a rigorous mathematical construct for genomic signal processing.

摘要

背景

在过去十年中,时频(TF)分析已被广泛用于分析非平稳数值信号。与此同时,最近的研究已从统计学上证实了基因组非数值序列的非平稳性,并建议对这些序列采用非平稳分析方法。使用数值数据特有的技术来分析非数值基因组序列的传统方法是,以某种方式将非数值数据转换为数值,然后应用时域或变换域信号处理算法。然而,这种方法引发了关于数值变换下相对大小的问题,这可能会导致虚假模式或结果的错误解读。

结果

在本文中,利用解释性信号处理(ISP)的概念并重新定义非数值序列的相关函数,一类通用的TF变换得到扩展并应用于非数值基因组序列。该技术已在合成DNA序列和真实DNA序列上成功进行了评估。

结论

所提出的框架相当通用,并且据信对于提取有关基因组序列的局部和全局周期性、对称性、(非)平稳性和频谱特征的定量和可视化信息很有用。本文中引入的解释性时频分析概念可被视为朝着构建用于基因组信号处理的严格数学结构迈出的第一步。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0706/5374637/389a3c069f29/12859_2017_1524_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0706/5374637/958ab545b6e1/12859_2017_1524_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0706/5374637/542207aa29a5/12859_2017_1524_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0706/5374637/5b3adeeac625/12859_2017_1524_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0706/5374637/cdba81db8a54/12859_2017_1524_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0706/5374637/e7b803a7686d/12859_2017_1524_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0706/5374637/389a3c069f29/12859_2017_1524_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0706/5374637/958ab545b6e1/12859_2017_1524_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0706/5374637/542207aa29a5/12859_2017_1524_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0706/5374637/5b3adeeac625/12859_2017_1524_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0706/5374637/cdba81db8a54/12859_2017_1524_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0706/5374637/e7b803a7686d/12859_2017_1524_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0706/5374637/389a3c069f29/12859_2017_1524_Fig6_HTML.jpg

相似文献

1
Interpretive time-frequency analysis of genomic sequences.基因组序列的解释性时频分析。
BMC Bioinformatics. 2017 Mar 22;18(Suppl 4):154. doi: 10.1186/s12859-017-1524-0.
2
Time-dependent ARMA modeling of genomic sequences.基因组序列的时间相关自回归滑动平均建模
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S14. doi: 10.1186/1471-2105-9-S9-S14.
3
On DNA numerical representations for genomic similarity computation.关于用于基因组相似性计算的DNA数值表示。
PLoS One. 2017 Mar 21;12(3):e0173288. doi: 10.1371/journal.pone.0173288. eCollection 2017.
4
Periodic power spectrum with applications in detection of latent periodicities in DNA sequences.具有检测DNA序列中潜在周期性应用的周期功率谱。
J Math Biol. 2016 Nov;73(5):1053-1079. doi: 10.1007/s00285-016-0982-8. Epub 2016 Mar 4.
5
An investigation into inter- and intragenomic variations of graphic genomic signatures.对图形基因组特征的基因组间和基因组内变异的调查。
BMC Bioinformatics. 2015 Aug 7;16:246. doi: 10.1186/s12859-015-0655-4.
6
Improved algorithm for analysis of DNA sequences using multiresolution transformation.使用多分辨率变换分析DNA序列的改进算法
ScientificWorldJournal. 2015;2015:786497. doi: 10.1155/2015/786497. Epub 2015 Apr 27.
7
Suite of tools for statistical N-gram language modeling for pattern mining in whole genome sequences.用于全基因组序列模式挖掘的统计N元语法语言建模的工具套件。
J Bioinform Comput Biol. 2012 Dec;10(6):1250016. doi: 10.1142/S0219720012500163. Epub 2012 Jul 22.
8
Genome analysis with inter-nucleotide distances.基于核苷酸间距离的基因组分析。
Bioinformatics. 2009 Dec 1;25(23):3064-70. doi: 10.1093/bioinformatics/btp546. Epub 2009 Sep 16.
9
Mapping genomic features to functional traits through microbial whole genome sequences.通过微生物全基因组序列将基因组特征映射到功能性状。
Int J Bioinform Res Appl. 2014;10(4-5):461-78. doi: 10.1504/IJBRA.2014.062995.
10
Frequency spectra characterization of noncoding human genomic sequences.非编码人类基因组序列的频域特征分析。
Genes Genomics. 2020 Oct;42(10):1215-1226. doi: 10.1007/s13258-020-00980-2. Epub 2020 Aug 31.

引用本文的文献

1
SpecGMM: Integrating Spectral analysis and Gaussian Mixture Models for taxonomic classification and identification of discriminative DNA regions.SpecGMM:将光谱分析与高斯混合模型相结合用于分类学分类和鉴别性DNA区域的识别。
Bioinform Adv. 2024 Nov 5;4(1):vbae171. doi: 10.1093/bioadv/vbae171. eCollection 2024.
2
Improving deep models of protein-coding potential with a Fourier-transform architecture and machine translation task.利用傅里叶变换架构和机器翻译任务改进蛋白质编码潜力的深度学习模型。
PLoS Comput Biol. 2023 Oct 12;19(10):e1011526. doi: 10.1371/journal.pcbi.1011526. eCollection 2023 Oct.
3
Time-Frequency Approach Applied to Finding Interaction Regions in Pathogenic Proteins.

本文引用的文献

1
Time-dependent ARMA modeling of genomic sequences.基因组序列的时间相关自回归滑动平均建模
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S14. doi: 10.1186/1471-2105-9-S9-S14.
2
Frequency-domain analysis of biomolecular sequences.生物分子序列的频域分析。
Bioinformatics. 2000 Dec;16(12):1073-81. doi: 10.1093/bioinformatics/16.12.1073.
3
Prediction of probable genes by Fourier analysis of genomic sequences.通过基因组序列的傅里叶分析预测可能的基因。
应用于寻找致病蛋白相互作用区域的时频方法。
Bioinform Biol Insights. 2019 Jun 4;13:1177932219850172. doi: 10.1177/1177932219850172. eCollection 2019.
4
Detecting proteine coding regions using a customized multi-scales splines construction.使用定制的多尺度样条构建来检测蛋白质编码区域。
Proc Int Symp Symb Numer Algorithms Sci Comput. 2017;19:397-400. doi: 10.1109/SYNASC.2017.00072.
5
Selected research articles from the 2016 International Workshop on Computational Network Biology: Modeling, Analysis, and Control (CNB-MAC).选自2016年计算网络生物学建模、分析与控制国际研讨会(CNB-MAC)的研究文章。
BMC Bioinformatics. 2017 Mar 22;18(Suppl 4):159. doi: 10.1186/s12859-017-1521-3.
Comput Appl Biosci. 1997 Jun;13(3):263-70. doi: 10.1093/bioinformatics/13.3.263.