• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因核苷酸组成中表达调控的探究指南。

Probing instructions for expression regulation in gene nucleotide compositions.

作者信息

Bessière Chloé, Taha May, Petitprez Florent, Vandel Jimmy, Marin Jean-Michel, Bréhélin Laurent, Lèbre Sophie, Lecellier Charles-Henri

机构信息

IBC, Univ. Montpellier, CNRS, Montpellier, France.

Institut de Génétique Moléculaire de Montpellier, University of Montpellier, CNRS, Montpellier, France.

出版信息

PLoS Comput Biol. 2018 Jan 2;14(1):e1005921. doi: 10.1371/journal.pcbi.1005921. eCollection 2018 Jan.

DOI:10.1371/journal.pcbi.1005921
PMID:29293496
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5766238/
Abstract

Gene expression is orchestrated by distinct regulatory regions to ensure a wide variety of cell types and functions. A challenge is to identify which regulatory regions are active, what are their associated features and how they work together in each cell type. Several approaches have tackled this problem by modeling gene expression based on epigenetic marks, with the ultimate goal of identifying driving regions and associated genomic variations that are clinically relevant in particular in precision medicine. However, these models rely on experimental data, which are limited to specific samples (even often to cell lines) and cannot be generated for all regulators and all patients. In addition, we show here that, although these approaches are accurate in predicting gene expression, inference of TF combinations from this type of models is not straightforward. Furthermore these methods are not designed to capture regulation instructions present at the sequence level, before the binding of regulators or the opening of the chromatin. Here, we probe sequence-level instructions for gene expression and develop a method to explain mRNA levels based solely on nucleotide features. Our method positions nucleotide composition as a critical component of gene expression. Moreover, our approach, able to rank regulatory regions according to their contribution, unveils a strong influence of the gene body sequence, in particular introns. We further provide evidence that the contribution of nucleotide content can be linked to co-regulations associated with genome 3D architecture and to associations of genes within topologically associated domains.

摘要

基因表达由不同的调控区域精心编排,以确保多种细胞类型和功能。一个挑战是确定哪些调控区域是活跃的,它们的相关特征是什么,以及它们在每种细胞类型中如何协同工作。有几种方法通过基于表观遗传标记对基因表达进行建模来解决这个问题,其最终目标是识别在精准医学中具有临床相关性的驱动区域和相关的基因组变异。然而,这些模型依赖于实验数据,这些数据仅限于特定样本(甚至通常是细胞系),并且无法针对所有调节因子和所有患者生成。此外,我们在此表明,尽管这些方法在预测基因表达方面很准确,但从这类模型推断转录因子组合并非易事。此外,这些方法并非旨在捕捉在调节因子结合或染色质开放之前序列水平上存在的调控指令。在这里,我们探究基因表达的序列水平指令,并开发一种仅基于核苷酸特征来解释mRNA水平的方法。我们的方法将核苷酸组成定位为基因表达的关键组成部分。此外,我们的方法能够根据调控区域的贡献对其进行排名,揭示了基因体序列,特别是内含子的强大影响。我们进一步提供证据表明,核苷酸含量的贡献可以与与基因组三维结构相关的共同调控以及拓扑相关结构域内基因的关联联系起来。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/1afa1eb909e0/pcbi.1005921.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/73c610a78347/pcbi.1005921.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/af0ddcbb0e58/pcbi.1005921.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/164e4aee3643/pcbi.1005921.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/b0ae4ae7d54b/pcbi.1005921.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/6e9fc7026222/pcbi.1005921.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/0b94acd96b2b/pcbi.1005921.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/1afa1eb909e0/pcbi.1005921.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/73c610a78347/pcbi.1005921.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/af0ddcbb0e58/pcbi.1005921.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/164e4aee3643/pcbi.1005921.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/b0ae4ae7d54b/pcbi.1005921.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/6e9fc7026222/pcbi.1005921.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/0b94acd96b2b/pcbi.1005921.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6dc8/5766238/1afa1eb909e0/pcbi.1005921.g007.jpg

相似文献

1
Probing instructions for expression regulation in gene nucleotide compositions.基因核苷酸组成中表达调控的探究指南。
PLoS Comput Biol. 2018 Jan 2;14(1):e1005921. doi: 10.1371/journal.pcbi.1005921. eCollection 2018 Jan.
2
Predicting cell-type-specific gene expression from regions of open chromatin.从开放染色质区域预测细胞类型特异性基因表达。
Genome Res. 2012 Sep;22(9):1711-22. doi: 10.1101/gr.135129.111.
3
Integrative modeling of eQTLs and cis-regulatory elements suggests mechanisms underlying cell type specificity of eQTLs.整合 eQTL 和顺式调控元件的建模提示了 eQTL 细胞类型特异性的潜在机制。
PLoS Genet. 2013;9(8):e1003649. doi: 10.1371/journal.pgen.1003649. Epub 2013 Aug 1.
4
Enhancer identification in mouse embryonic stem cells using integrative modeling of chromatin and genomic features.利用染色质和基因组特征的综合建模识别小鼠胚胎干细胞中的增强子。
BMC Genomics. 2012 Apr 26;13:152. doi: 10.1186/1471-2164-13-152.
5
Leveraging three-dimensional chromatin architecture for effective reconstruction of enhancer-target gene regulatory interactions.利用三维染色质结构有效重建增强子-靶基因调控相互作用。
Nucleic Acids Res. 2021 Sep 27;49(17):e97. doi: 10.1093/nar/gkab547.
6
Sequence and chromatin determinants of cell-type-specific transcription factor binding.细胞类型特异性转录因子结合的序列和染色质决定因素。
Genome Res. 2012 Sep;22(9):1723-34. doi: 10.1101/gr.127712.111.
7
GPMiner: an integrated system for mining combinatorial cis-regulatory elements in mammalian gene group.GPMiner:一个用于挖掘哺乳动物基因组合调控元件的集成系统。
BMC Genomics. 2012;13 Suppl 1(Suppl 1):S3. doi: 10.1186/1471-2164-13-S1-S3. Epub 2012 Jan 17.
8
Maps of open chromatin highlight cell type-restricted patterns of regulatory sequence variation at hematological trait loci.染色质开放性图谱突出了血液特征基因座调控序列变异的细胞类型特异性模式。
Genome Res. 2013 Jul;23(7):1130-41. doi: 10.1101/gr.155127.113. Epub 2013 Apr 9.
9
Diverse patterns of genomic targeting by transcriptional regulators in Drosophila melanogaster.果蝇中转录调节因子对基因组靶向的多种模式。
Genome Res. 2014 Jul;24(7):1224-35. doi: 10.1101/gr.168807.113.
10
Cis-regulatory variations: a study of SNPs around genes showing cis-linkage in segregating mouse populations.顺式调控变异:对在分离小鼠群体中显示顺式连锁的基因周围单核苷酸多态性的研究。
BMC Genomics. 2006 Sep 15;7:235. doi: 10.1186/1471-2164-7-235.

引用本文的文献

1
MiREx: mRNA levels prediction from gene sequence and miRNA target knowledge.MiREx:基于基因序列和 miRNA 靶知识的 mRNA 水平预测。
BMC Bioinformatics. 2023 Nov 22;24(1):443. doi: 10.1186/s12859-023-05560-1.
2
An investigation of codon usage pattern analysis in pancreatitis associated genes.对胰腺炎相关基因密码子使用模式分析的研究。
BMC Genom Data. 2022 Nov 25;23(1):81. doi: 10.1186/s12863-022-01089-z.
3
Identification of long regulatory elements in the genome of Plasmodium falciparum and other eukaryotes.鉴定疟原虫和其他真核生物基因组中的长调控元件。

本文引用的文献

1
Predicting enhancer-promoter interaction from genomic sequence with deep neural networks.利用深度神经网络从基因组序列预测增强子-启动子相互作用。
Quant Biol. 2019 Jun;7(2):122-137. doi: 10.1007/s40484-019-0154-0.
2
Genetic sequence-based prediction of long-range chromatin interactions suggests a potential role of short tandem repeat sequences in genome organization.基于遗传序列的长程染色质相互作用预测表明短串联重复序列在基因组组织中具有潜在作用。
BMC Bioinformatics. 2017 Apr 18;18(1):218. doi: 10.1186/s12859-017-1624-x.
3
An Isochore Framework Underlies Chromatin Architecture.
PLoS Comput Biol. 2021 Apr 16;17(4):e1008909. doi: 10.1371/journal.pcbi.1008909. eCollection 2021 Apr.
4
Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure.深度学习表明,基因表达是由共同进化的相互作用基因调控结构的所有部分编码的。
Nat Commun. 2020 Dec 1;11(1):6141. doi: 10.1038/s41467-020-19921-4.
5
Physicochemical Foundations of Life that Direct Evolution: Chance and Natural Selection are not Evolutionary Driving Forces.指导进化的生命的物理化学基础:偶然性和自然选择并非进化驱动力。
Life (Basel). 2020 Jan 21;10(2):7. doi: 10.3390/life10020007.
6
On the problem of confounders in modeling gene expression.关于基因表达建模中混杂因素的问题。
Bioinformatics. 2019 Feb 15;35(4):711-719. doi: 10.1093/bioinformatics/bty674.
等染色体框架构成染色质结构的基础。
PLoS One. 2017 Jan 6;12(1):e0168023. doi: 10.1371/journal.pone.0168023. eCollection 2017.
4
Combining transcription factor binding affinities with open-chromatin data for accurate gene expression prediction.结合转录因子结合亲和力与开放染色质数据以进行准确的基因表达预测。
Nucleic Acids Res. 2017 Jan 9;45(1):54-66. doi: 10.1093/nar/gkw1061. Epub 2016 Nov 29.
5
Multi-tiered Reorganization of the Genome during B Cell Affinity Maturation Anchored by a Germinal Center-Specific Locus Control Region.生发中心特异性基因座控制区锚定的B细胞亲和力成熟过程中基因组的多层次重组
Immunity. 2016 Sep 20;45(3):497-512. doi: 10.1016/j.immuni.2016.08.012. Epub 2016 Sep 13.
6
Maps of context-dependent putative regulatory regions and genomic signal interactions.上下文相关的假定调控区域和基因组信号相互作用图谱。
Nucleic Acids Res. 2016 Nov 2;44(19):9110-9120. doi: 10.1093/nar/gkw800. Epub 2016 Sep 12.
7
Codon identity regulates mRNA stability and translation efficiency during the maternal-to-zygotic transition.密码子特性在母源-合子转变过程中调节mRNA稳定性和翻译效率。
EMBO J. 2016 Oct 4;35(19):2087-2103. doi: 10.15252/embj.201694699. Epub 2016 Jul 19.
8
On TADs and LADs: Spatial Control Over Gene Expression.关于 TADs 和 LADs:基因表达的空间控制。
Trends Genet. 2016 Aug;32(8):485-495. doi: 10.1016/j.tig.2016.05.004. Epub 2016 Jun 13.
9
High-throughput functional comparison of promoter and enhancer activities.启动子和增强子活性的高通量功能比较。
Genome Res. 2016 Aug;26(8):1023-33. doi: 10.1101/gr.204834.116. Epub 2016 Jun 16.
10
TT-seq maps the human transient transcriptome.TT-seq 可绘制人类瞬时转录组图谱。
Science. 2016 Jun 3;352(6290):1225-8. doi: 10.1126/science.aad9841.