

G-NEST: a gene neighborhood scoring tool to identify co-conserved, co-expressed genes.

Affiliation

Genome Center, University of California Davis, 451 Health Science Dr, Davis, CA, 95616, United States of America.

Publication information

BMC Bioinformatics. 2012 Sep 28;13:253. doi: 10.1186/1471-2105-13-253.

DOI:10.1186/1471-2105-13-253
PMID:23020263
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3575404/
Abstract

BACKGROUND

In previous studies, gene neighborhoods (spatial clusters of co-expressed genes in the genome) have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Scoring Tool (G-NEST) which combines genomic location, gene expression, and evolutionary sequence conservation data to score putative gene neighborhoods across all possible window sizes simultaneously.
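To make the idea of scoring every window size concrete, here is a minimal Python sketch. It is not the G-NEST implementation: it covers only the expression component, and it assumes (hypothetically) that a window's score is the mean pairwise Pearson correlation of the expression profiles of the genes it contains. The function names `pearson` and `score_windows` are illustrative, and the conservation data and any significance testing are omitted.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # a flat profile carries no co-expression signal
    return cov / sqrt(vx * vy)


def score_windows(expression, min_size=2):
    """Score every contiguous window of genes on one chromosome.

    expression: list of per-gene expression profiles, ordered by
    chromosomal position (assumed input layout).
    Returns {(start_index, window_size): mean pairwise correlation}.
    """
    n = len(expression)
    # Compute each pairwise correlation once and reuse it across windows.
    corr = {
        (i, j): pearson(expression[i], expression[j])
        for i, j in combinations(range(n), 2)
    }
    scores = {}
    for size in range(min_size, n + 1):
        for start in range(n - size + 1):
            pairs = list(combinations(range(start, start + size), 2))
            scores[(start, size)] = sum(corr[p] for p in pairs) / len(pairs)
    return scores
```

Because all window sizes are scored in one pass, no single window size or minimum gene count has to be fixed in advance; candidate neighborhoods of different sizes can be compared by score afterwards.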

RESULTS

Using G-NEST on atlases of mouse and human tissue expression data, we found that large neighborhoods of ten or more genes are extremely rare in mammalian genomes. When they do occur, neighborhoods are typically composed of families of related genes. Both the highest scoring and the largest neighborhoods in mammalian genomes are formed by tandem gene duplication. Mammalian gene neighborhoods contain highly and variably expressed genes. Co-localized noisy gene pairs exhibit lower evolutionary conservation of their adjacent genome locations, suggesting that their shared transcriptional background may be disadvantageous. Genes that are essential to mammalian survival and reproduction are less likely to occur in neighborhoods, although neighborhoods are enriched with genes that function in mitosis. We also found that gene orientation and protein-protein interactions are partially responsible for maintenance of gene neighborhoods.

CONCLUSIONS

Our experiments using G-NEST confirm that tandem gene duplication is the primary driver of non-random gene order in mammalian genomes. Non-essentiality, co-functionality, gene orientation, and protein-protein interactions are additional forces that maintain gene neighborhoods, especially those formed by tandem duplicates. We expect G-NEST to be useful for other applications such as the identification of core regulatory modules, common transcriptional backgrounds, and chromatin domains. The software is available at http://docpollard.org/software.html.


