• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

稀疏布尔序列中的周期性模式检测。

Periodic pattern detection in sparse boolean sequences.

作者信息

Junier Ivan, Hérisson Joan, Képès François

机构信息

Epigenomics Project, Genopole, CNRS UPS3201, UniverSud Paris, University of Evry, Genopole Campus 1 - Genavenir 6, 5 rue Henri Desbruères - F-91030 EVRY cedex, France.

出版信息

Algorithms Mol Biol. 2010 Sep 10;5:31. doi: 10.1186/1748-7188-5-31.

DOI:10.1186/1748-7188-5-31
PMID:20831781
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2949599/
Abstract

BACKGROUND

The specific position of functionally related genes along the DNA has been shown to reflect the interplay between chromosome structure and genetic regulation. By investigating the statistical properties of the distances separating such genes, several studies have highlighted various periodic trends. In many cases, however, groups built up from co-functional or co-regulated genes are small and contain wrong information (data contamination) so that the statistics is poorly exploitable. In addition, gene positions are not expected to satisfy a perfectly ordered pattern along the DNA. Within this scope, we present an algorithm that aims to highlight periodic patterns in sparse boolean sequences, i.e. sequences of the type 010011011010... where the ratio of the number of 1's (denoting here the transcription start of a gene) to 0's is small.

RESULTS

The algorithm is particularly robust with respect to strong signal distortions such as the addition of 1's at arbitrary positions (contaminated data), the deletion of existing 1's in the sequence (missing data) and the presence of disorder in the position of the 1's (noise). This robustness property stems from an appropriate exploitation of the remarkable alignment properties of periodic points in solenoidal coordinates.

CONCLUSIONS

The efficiency of the algorithm is demonstrated in situations where standard Fourier-based spectral methods are poorly adapted. We also show how the proposed framework allows to identify the 1's that participate in the periodic trends, i.e. how the framework allows to allocate a positional score to genes, in the same spirit of the sequence score. The software is available for public use at http://www.issb.genopole.fr/MEGA/Softwares/iSSB_SolenoidalApplication.zip.

摘要

背景

功能相关基因在DNA上的特定位置已被证明反映了染色体结构与基因调控之间的相互作用。通过研究分隔此类基因的距离的统计特性,多项研究突出了各种周期性趋势。然而,在许多情况下,由共功能或共调控基因组成的组规模较小且包含错误信息(数据污染),因此统计数据难以有效利用。此外,基因位置预计不会沿着DNA满足完美的有序模式。在此范围内,我们提出了一种算法,旨在突出稀疏布尔序列中的周期性模式,即010011011010... 这种类型的序列,其中1(在此表示基因的转录起始)的数量与0的数量之比很小。

结果

该算法对于强烈的信号失真具有特别强的鲁棒性,例如在任意位置添加1(污染数据)、删除序列中现有的1(缺失数据)以及1的位置存在无序(噪声)。这种鲁棒性源于对螺线管坐标中周期点显著对齐特性的适当利用。

结论

在基于傅里叶的标准谱方法不太适用的情况下,证明了该算法的效率。我们还展示了所提出的框架如何能够识别参与周期性趋势的1,即该框架如何能够以与序列得分相同的精神为基因分配位置得分。该软件可在http://www.issb.genopole.fr/MEGA/Softwares/iSSB_SolenoidalApplication.zip上供公众使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e024/2949599/7de180184852/1748-7188-5-31-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e024/2949599/4bbbc5b8a674/1748-7188-5-31-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e024/2949599/ab05acf6c39d/1748-7188-5-31-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e024/2949599/e1e29ec0c444/1748-7188-5-31-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e024/2949599/8392a2e812dd/1748-7188-5-31-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e024/2949599/7de180184852/1748-7188-5-31-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e024/2949599/4bbbc5b8a674/1748-7188-5-31-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e024/2949599/ab05acf6c39d/1748-7188-5-31-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e024/2949599/e1e29ec0c444/1748-7188-5-31-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e024/2949599/8392a2e812dd/1748-7188-5-31-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e024/2949599/7de180184852/1748-7188-5-31-5.jpg

相似文献

1
Periodic pattern detection in sparse boolean sequences.稀疏布尔序列中的周期性模式检测。
Algorithms Mol Biol. 2010 Sep 10;5:31. doi: 10.1186/1748-7188-5-31.
2
Isotetrandrine ameliorates tert-butyl hydroperoxide-induced oxidative stress through upregulation of heme oxygenase-1 expression.粉防己碱通过上调血红素加氧酶-1的表达减轻叔丁基过氧化氢诱导的氧化应激。
Exp Biol Med (Maywood). 2016 Aug;241(14):1568-76. doi: 10.1177/1535370216647122. Epub 2016 May 2.
3
Hurst entropy: A method to determine predictability in a binary series based on a fractal-related process.赫斯特熵:一种基于分形相关过程确定二元序列可预测性的方法。
Phys Rev E. 2019 Jun;99(6-1):062115. doi: 10.1103/PhysRevE.99.062115.
4
Transmembrane domains 1 and 2 of the latent membrane protein 1 of Epstein-Barr virus contain a lipid raft targeting signal and play a critical role in cytostasis.爱泼斯坦-巴尔病毒潜伏膜蛋白1的跨膜结构域1和2含有脂筏靶向信号,并在细胞生长停滞中起关键作用。
J Virol. 2003 Mar;77(6):3749-58. doi: 10.1128/jvi.77.6.3749-3758.2003.
5
Boolean modeling of neural systems with point-process inputs and outputs.具有点过程输入和输出的神经系统布尔建模。
Conf Proc IEEE Eng Med Biol Soc. 2006;2006:2114-7. doi: 10.1109/IEMBS.2006.260584.
6
Gastroprotective effects of phenylpropanoids from the rhizomes of Alpinia galanga in rats: structural requirements and mode of action.高良姜根茎中苯丙素类化合物对大鼠的胃保护作用:结构要求及作用模式
Eur J Pharmacol. 2003 Jun 13;471(1):59-67. doi: 10.1016/s0014-2999(03)01785-0.
7
LMP-1's transmembrane domains encode multiple functions required for LMP-1's efficient signaling.LMP-1的跨膜结构域编码LMP-1高效信号传导所需的多种功能。
J Virol. 2002 Nov;76(22):11551-60. doi: 10.1128/jvi.76.22.11551-11560.2002.
8
Novel purification of 1'S-1'-Acetoxychavicol acetate from Alpinia galanga and its cytotoxic plus antiproliferative activity in colorectal adenocarcinoma cell line SW480.从高良姜中新型纯化1'S-1'-乙酰氧基胡椒酚乙酸酯及其在结肠腺癌SW480细胞系中的细胞毒性和抗增殖活性
Biomed Pharmacother. 2017 Jul;91:485-493. doi: 10.1016/j.biopha.2017.04.114. Epub 2017 May 4.
9
Robust detection of periodic time series measured from biological systems.对从生物系统测量得到的周期性时间序列进行稳健检测。
BMC Bioinformatics. 2005 May 13;6:117. doi: 10.1186/1471-2105-6-117.
10
Electrostrictive cross-phase modulation of periodic pulse trains in optical fibers.光纤中周期性脉冲序列的电致伸缩交叉相位调制
Opt Lett. 1998 Sep 1;23(17):1357-9. doi: 10.1364/ol.23.001357.

引用本文的文献

1
Generative Models for Periodicity Detection in Noisy Signals.用于噪声信号中周期性检测的生成模型
Clocks Sleep. 2024 Jul 23;6(3):359-388. doi: 10.3390/clockssleep6030025.
2
Selection, periodicity and potential function for Highly Iterative Palindrome-1 (HIP1) in cyanobacterial genomes.蓝藻基因组中高度重复回文 1(HIP1)的选择、周期性和潜在功能。
Nucleic Acids Res. 2018 Mar 16;46(5):2265-2278. doi: 10.1093/nar/gky075.
3
WoPPER: Web server for Position Related data analysis of gene Expression in Prokaryotes.WoPPER:原核生物基因表达位置相关数据分析的 Web 服务器。

本文引用的文献

1
Spatial and topological organization of DNA chains induced by gene co-localization.基因共定位诱导的 DNA 链的空间和拓扑组织。
PLoS Comput Biol. 2010 Feb 12;6(2):e1000678. doi: 10.1371/journal.pcbi.1000678.
2
Robust discovery of periodically expressed genes using the laplace periodogram.使用拉普拉斯周期图稳健地发现周期性表达的基因。
BMC Bioinformatics. 2009 Jan 11;10:15. doi: 10.1186/1471-2105-10-15.
3
RegulonDB (version 6.0): gene regulation model of Escherichia coli K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation.
Nucleic Acids Res. 2017 Jul 3;45(W1):W109-W115. doi: 10.1093/nar/gkx329.
4
From multiple pathogenicity islands to a unique organized pathogenicity archipelago.从多个致病岛到一个独特的有组织的致病群岛。
Sci Rep. 2016 Jun 15;6:27978. doi: 10.1038/srep27978.
5
Analysis tools for the interplay between genome layout and regulation.基因组布局与调控之间相互作用的分析工具。
BMC Bioinformatics. 2016 Jun 6;17 Suppl 5(Suppl 5):191. doi: 10.1186/s12859-016-1047-0.
6
GREAT: a web portal for Genome Regulatory Architecture Tools.GREAT:一个用于基因组调控结构工具的网络门户。
Nucleic Acids Res. 2016 Jul 8;44(W1):W77-82. doi: 10.1093/nar/gkw384. Epub 2016 May 5.
7
Global genomic arrangement of bacterial genes is closely tied with the total transcriptional efficiency.细菌基因的全球基因组排列与总转录效率密切相关。
Genomics Proteomics Bioinformatics. 2013 Feb;11(1):66-71. doi: 10.1016/j.gpb.2013.01.004. Epub 2013 Jan 26.
8
PreCisIon: PREdiction of CIS-regulatory elements improved by gene's positION.精确性:通过基因位置提高 CIS 调控元件的预测。
Nucleic Acids Res. 2013 Feb 1;41(3):1406-15. doi: 10.1093/nar/gks1286. Epub 2012 Dec 14.
RegulonDB(版本6.0):大肠杆菌K-12超越转录的基因调控模型、活跃(实验性)注释启动子及Textpresso导航
Nucleic Acids Res. 2008 Jan;36(Database issue):D120-4. doi: 10.1093/nar/gkm994. Epub 2007 Dec 23.
4
How gene order is influenced by the biophysics of transcription regulation.基因顺序是如何受到转录调控生物物理学影响的。
Proc Natl Acad Sci U S A. 2007 Aug 28;104(35):13948-53. doi: 10.1073/pnas.0700672104. Epub 2007 Aug 20.
5
Robust regression for periodicity detection in non-uniformly sampled time-course gene expression data.用于非均匀采样时间序列基因表达数据中周期性检测的稳健回归
BMC Bioinformatics. 2007 Jul 2;8:233. doi: 10.1186/1471-2105-8-233.
6
Chromosomal periodicity of evolutionarily conserved gene pairs.进化保守基因对的染色体周期性
Proc Natl Acad Sci U S A. 2007 Jun 19;104(25):10559-64. doi: 10.1073/pnas.0610776104. Epub 2007 Jun 11.
7
Statistical analysis of the spatial distribution of operons in the transcriptional regulation network of Escherichia coli.大肠杆菌转录调控网络中操纵子空间分布的统计分析。
J Mol Biol. 2004 Oct 1;342(5):1379-90. doi: 10.1016/j.jmb.2004.07.074.
8
Periodic transcriptional organization of the E.coli genome.大肠杆菌基因组的周期性转录组织
J Mol Biol. 2004 Jul 23;340(5):957-64. doi: 10.1016/j.jmb.2004.05.039.
9
Analysis of genomic context: prediction of functional associations from conserved bidirectionally transcribed gene pairs.基因组背景分析:从保守的双向转录基因对预测功能关联。
Nat Biotechnol. 2004 Jul;22(7):911-7. doi: 10.1038/nbt988.
10
Statistical significance for genomewide studies.全基因组研究的统计学显著性
Proc Natl Acad Sci U S A. 2003 Aug 5;100(16):9440-5. doi: 10.1073/pnas.1530509100. Epub 2003 Jul 25.