• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从数据驱动的角度综合分析人类广泛表达的基因。

Comprehensive Analysis of Ubiquitously Expressed Genes in Humans from A Data-driven Perspective.

机构信息

SJTU-Yale Joint Center for Biostatistics and Data Science, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China; Center for Biomedical Informatics, Shanghai Engineering Research Center for Big Data in Pediatric Precision Medicine, Shanghai Children's Hospital, Shanghai 200040, China; Department of Biostatistics, Yale University, New Haven, CT 06511, USA.

SJTU-Yale Joint Center for Biostatistics and Data Science, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China.

出版信息

Genomics Proteomics Bioinformatics. 2023 Feb;21(1):164-176. doi: 10.1016/j.gpb.2021.08.017. Epub 2022 May 13.

DOI:10.1016/j.gpb.2021.08.017
PMID:35569803
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10373092/
Abstract

Comprehensive characterization of spatial and temporal gene expression patterns in humans is critical for uncovering the regulatory codes of the human genome and understanding the molecular mechanisms of human diseases. Ubiquitously expressed genes (UEGs) refer to the genes expressed across a majority of, if not all, phenotypic and physiological conditions of an organism. It is known that many human genes are broadly expressed across tissues. However, most previous UEG studies have only focused on providing a list of UEGs without capturing their global expression patterns, thus limiting the potential use of UEG information. In this study, we proposed a novel data-driven framework to leverage the extensive collection of ∼ 40,000 human transcriptomes to derive a list of UEGs and their corresponding global expression patterns, which offers a valuable resource to further characterize human transcriptome. Our results suggest that about half (12,234; 49.01%) of the human genes are expressed in at least 80% of human transcriptomes, and the median size of the human transcriptome is 16,342 genes (65.44%). Through gene clustering, we identified a set of UEGs, named LoVarUEGs, which have stable expression across human transcriptomes and can be used as internal reference genes for expression measurement. To further demonstrate the usefulness of this resource, we evaluated the global expression patterns for 16 previously predicted disallowed genes in islet beta cells and found that seven of these genes showed relatively more varied expression patterns, suggesting that the repression of these genes may not be unique to islet beta cells.

摘要

全面描绘人类时空基因表达模式对于揭示人类基因组的调控密码和理解人类疾病的分子机制至关重要。普遍表达基因(UEGs)是指在生物体的大多数(如果不是全部)表型和生理条件下都表达的基因。已知许多人类基因在组织中广泛表达。然而,大多数先前的 UEG 研究仅侧重于提供 UEG 的列表,而没有捕捉到它们的全局表达模式,从而限制了 UEG 信息的潜在用途。在这项研究中,我们提出了一种新颖的数据驱动框架,利用大约 40,000 个人类转录组的广泛收集来得出 UEG 及其相应的全局表达模式列表,这为进一步描述人类转录组提供了有价值的资源。我们的结果表明,约一半(12,234 个;49.01%)的人类基因在至少 80%的人类转录组中表达,人类转录组的中位数大小为 16,342 个基因(65.44%)。通过基因聚类,我们鉴定了一组 UEGs,命名为 LoVarUEGs,它们在人类转录组中具有稳定的表达,可以用作表达测量的内部参考基因。为了进一步证明这个资源的有用性,我们评估了 16 个先前预测的胰岛β细胞中不允许的基因的全局表达模式,发现其中 7 个基因的表达模式相对更为多样化,这表明这些基因的抑制作用可能不是胰岛β细胞所特有的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/5b4e968f56be/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/20d53435d0fb/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/4e716d141e99/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/f4a98ad0486d/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/ae4a7cdb5dbc/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/f5bef18296f5/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/d7726cf754fa/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/5b4e968f56be/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/20d53435d0fb/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/4e716d141e99/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/f4a98ad0486d/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/ae4a7cdb5dbc/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/f5bef18296f5/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/d7726cf754fa/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454c/10373092/5b4e968f56be/gr7.jpg

相似文献

1
Comprehensive Analysis of Ubiquitously Expressed Genes in Humans from A Data-driven Perspective.从数据驱动的角度综合分析人类广泛表达的基因。
Genomics Proteomics Bioinformatics. 2023 Feb;21(1):164-176. doi: 10.1016/j.gpb.2021.08.017. Epub 2022 May 13.
2
How many human genes can be defined as housekeeping with current expression data?根据当前的表达数据,有多少人类基因可被定义为管家基因?
BMC Genomics. 2008 Apr 16;9:172. doi: 10.1186/1471-2164-9-172.
3
Defining diversity, specialization, and gene specificity in transcriptomes through information theory.通过信息论定义转录组中的多样性、特异性和基因特异性。
Proc Natl Acad Sci U S A. 2008 Jul 15;105(28):9709-14. doi: 10.1073/pnas.0803479105. Epub 2008 Jul 7.
4
A Comprehensive Mouse Transcriptomic BodyMap across 17 Tissues by RNA-seq.通过 RNA-seq 技术对 17 种组织的小鼠转录组进行全面的转录组体图谱绘制。
Sci Rep. 2017 Jun 23;7(1):4200. doi: 10.1038/s41598-017-04520-z.
5
Transcriptome sequencing and analysis for the pigmentation of scale and skin in common carp (Cyprinus carpio).转录组测序和分析鲤鱼鳞片和皮肤色素沉着的机制。
Mol Biol Rep. 2021 Mar;48(3):2399-2410. doi: 10.1007/s11033-021-06273-5. Epub 2021 Mar 19.
6
scNPF: an integrative framework assisted by network propagation and network fusion for preprocessing of single-cell RNA-seq data.scNPF:一种基于网络传播和网络融合的综合框架,用于单细胞 RNA-seq 数据的预处理。
BMC Genomics. 2019 May 8;20(1):347. doi: 10.1186/s12864-019-5747-5.
7
Deciphering the sex bias in housekeeping gene expression in adipose tissue: a comprehensive meta-analysis of transcriptomic studies.解读脂肪组织管家基因表达中的性别偏倚:转录组研究的综合荟萃分析。
Biol Sex Differ. 2023 Apr 18;14(1):20. doi: 10.1186/s13293-023-00506-x.
8
An abundance of ubiquitously expressed genes revealed by tissue transcriptome sequence data.组织转录组序列数据揭示了大量普遍表达的基因。
PLoS Comput Biol. 2009 Dec;5(12):e1000598. doi: 10.1371/journal.pcbi.1000598. Epub 2009 Dec 11.
9
Transcriptome-based identification of the optimal reference genes as internal controls for quantitative RT-PCR in razor clam (Sinonovacula constricta).基于转录组鉴定缢蛏定量RT-PCR内参的最佳参考基因
Genes Genomics. 2018 Jun;40(6):603-613. doi: 10.1007/s13258-018-0661-9. Epub 2018 Feb 3.
10
Integrating the interactome and the transcriptome of Drosophila.整合果蝇的相互作用组和转录组。
BMC Bioinformatics. 2014 Jun 10;15:177. doi: 10.1186/1471-2105-15-177.

引用本文的文献

1
Transcriptomic Analysis of Expression and Co-Expression Networks in the Cerebellum During Perinatal Development.围产期发育期间小脑基因表达及共表达网络的转录组学分析
Brain Sci. 2025 Jul 25;15(8):791. doi: 10.3390/brainsci15080791.
2
Annotating publicly-available samples and studies using interpretable modeling of unstructured metadata.使用非结构化元数据的可解释模型对公开可用的样本和研究进行注释。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae652.
3
From Genotype to Phenotype: How Enhancers Control Gene Expression and Cell Identity in Hematopoiesis.

本文引用的文献

1
Benchmarking of cell type deconvolution pipelines for transcriptomics data.基于转录组数据的细胞类型去卷积分析流水线的基准测试
Nat Commun. 2020 Nov 6;11(1):5650. doi: 10.1038/s41467-020-19015-1.
2
Leveraging functional annotation to identify genes associated with complex diseases.利用功能注释来识别与复杂疾病相关的基因。
PLoS Comput Biol. 2020 Nov 2;16(11):e1008315. doi: 10.1371/journal.pcbi.1008315. eCollection 2020 Nov.
3
Evaluating stably expressed genes in single cells.评估单细胞中稳定表达的基因。
从基因型到表型:增强子如何调控造血过程中的基因表达和细胞身份
Hemasphere. 2023 Nov 8;7(11):e969. doi: 10.1097/HS9.0000000000000969. eCollection 2023 Nov.
4
Expression Pattern of Trace Amine-Associated Receptors during Differentiation of Human Pluripotent Stem Cells to Dopaminergic Neurons.人多能干细胞向多巴胺能神经元分化过程中痕量胺相关受体的表达模式。
Int J Mol Sci. 2023 Oct 18;24(20):15313. doi: 10.3390/ijms242015313.
5
Prediction of transcript isoforms in 19 chicken tissues by Oxford Nanopore long-read sequencing.通过牛津纳米孔长读长测序预测19种鸡组织中的转录本异构体
Front Genet. 2022 Oct 3;13:997460. doi: 10.3389/fgene.2022.997460. eCollection 2022.
6
Using genome-wide expression compendia to study microorganisms.利用全基因组表达汇编研究微生物。
Comput Struct Biotechnol J. 2022 Aug 10;20:4315-4324. doi: 10.1016/j.csbj.2022.08.012. eCollection 2022.
Gigascience. 2019 Sep 1;8(9). doi: 10.1093/gigascience/giz106.
4
Tissue-specific genes as an underutilized resource in drug discovery.组织特异性基因是药物发现中未充分利用的资源。
Sci Rep. 2019 May 10;9(1):7233. doi: 10.1038/s41598-019-43829-9.
5
Digital expression explorer 2: a repository of uniformly processed RNA sequencing data.数字表达探索者 2:一个统一处理 RNA 测序数据的存储库。
Gigascience. 2019 Apr 1;8(4). doi: 10.1093/gigascience/giz022.
6
Validation of Common Housekeeping Genes as Reference for qPCR Gene Expression Analysis During iPS Reprogramming Process.验证看家基因作为 iPS 重编程过程中 qPCR 基因表达分析的参考。
Sci Rep. 2018 Jun 7;8(1):8716. doi: 10.1038/s41598-018-26707-8.
7
Digitizing omics profiles by divergence from a baseline.通过与基线的差异来数字化组学特征。
Proc Natl Acad Sci U S A. 2018 May 1;115(18):4545-4552. doi: 10.1073/pnas.1721628115. Epub 2018 Apr 16.
8
An accurate and robust imputation method scImpute for single-cell RNA-seq data.一种用于单细胞 RNA-seq 数据的准确稳健的插补方法 scImpute。
Nat Commun. 2018 Mar 8;9(1):997. doi: 10.1038/s41467-018-03405-7.
9
SEGreg: a database for human specifically expressed genes and their regulations in cancer and normal tissue.SEGreg:一个用于人类特异性表达基因及其在癌症和正常组织中调控的数据库。
Brief Bioinform. 2019 Jul 19;20(4):1322-1328. doi: 10.1093/bib/bbx173.
10
ACTB Loss-of-Function Mutations Result in a Pleiotropic Developmental Disorder.β-肌动蛋白功能丧失突变导致多效性发育障碍。
Am J Hum Genet. 2017 Dec 7;101(6):1021-1033. doi: 10.1016/j.ajhg.2017.11.006.