• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于贝叶斯岭回归的血液多组织基因表达推断。

Blood-based multi-tissue gene expression inference with Bayesian ridge regression.

机构信息

Beijing Key Laboratory for Genetics of Birth Defects, Beijing Pediatric Research Institute, MOE Key Laboratory of Major Diseases in Children, Genetics and Birth Defects Control Center, Beijing Children's Hospital, Capital Medical University, National Center for Children's Health, Beijing 100045, China.

出版信息

Bioinformatics. 2020 Jun 1;36(12):3788-3794. doi: 10.1093/bioinformatics/btaa239.

DOI:10.1093/bioinformatics/btaa239
PMID:32277818
Abstract

MOTIVATION

Gene expression profiling is widely used in basic and cancer research but still not feasible in many clinical applications because tissues, such as brain samples, are difficult and not ethnical to collect. Gene expression in uncollected tissues can be computationally inferred using genotype and expression quantitative trait loci. No methods can infer unmeasured gene expression of multiple tissues with single tissue gene expression profile as input.

RESULTS

Here, we present a Bayesian ridge regression-based method (B-GEX) to infer gene expression profiles of multiple tissues from blood gene expression profile. For each gene in a tissue, a low-dimensional feature vector was extracted from whole blood gene expression profile by feature selection. We used GTEx RNAseq data of 16 tissues to train inference models to capture the cross-tissue expression correlations between each target gene in a tissue and its preselected feature genes in peripheral blood. We compared B-GEX with least square regression, LASSO regression and ridge regression. B-GEX outperforms the other three models in most tissues in terms of mean absolute error, Pearson correlation coefficient and root-mean-squared error. Moreover, B-GEX infers expression level of tissue-specific genes as well as those of non-tissue-specific genes in all tissues. Unlike previous methods, which require genomic features or gene expression profiles of multiple tissues, our model only requires whole blood expression profile as input. B-GEX helps gain insights into gene expressions of uncollected tissues from more accessible data of blood.

AVAILABILITY AND IMPLEMENTATION

B-GEX is available at https://github.com/xuwenjian85/B-GEX.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

基因表达谱分析广泛应用于基础和癌症研究,但在许多临床应用中仍然不可行,因为组织(如脑样本)难以采集且不具伦理。未采集组织中的基因表达可以通过基因型和表达数量性状基因座进行计算推断。没有方法可以使用单个组织的基因表达谱作为输入来推断多个组织的未测量基因表达。

结果

在这里,我们提出了一种基于贝叶斯岭回归的方法(B-GEX),用于从血液基因表达谱推断多个组织的基因表达谱。对于组织中的每个基因,通过特征选择从全血基因表达谱中提取一个低维特征向量。我们使用 GTEx RNAseq 数据中的 16 个组织来训练推断模型,以捕获组织中每个目标基因与其外周血中预选特征基因之间的跨组织表达相关性。我们将 B-GEX 与最小二乘回归、LASSO 回归和岭回归进行了比较。在大多数组织中,B-GEX 在平均绝对误差、皮尔逊相关系数和均方根误差方面均优于其他三种模型。此外,B-GEX 可以推断所有组织中组织特异性基因和非组织特异性基因的表达水平。与以前需要基因组特征或多个组织的基因表达谱的方法不同,我们的模型仅需要全血表达谱作为输入。B-GEX 有助于从更易获得的血液数据中深入了解未采集组织中的基因表达。

可用性和实现

B-GEX 可在 https://github.com/xuwenjian85/B-GEX 上获得。

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
Blood-based multi-tissue gene expression inference with Bayesian ridge regression.基于贝叶斯岭回归的血液多组织基因表达推断。
Bioinformatics. 2020 Jun 1;36(12):3788-3794. doi: 10.1093/bioinformatics/btaa239.
2
Gene expression inference with deep learning.基于深度学习的基因表达推断
Bioinformatics. 2016 Jun 15;32(12):1832-9. doi: 10.1093/bioinformatics/btw074. Epub 2016 Feb 11.
3
EPS-LASSO: test for high-dimensional regression under extreme phenotype sampling of continuous traits.EPS-LASSO:连续性状极端表型抽样下的高维回归检验。
Bioinformatics. 2018 Jun 15;34(12):1996-2003. doi: 10.1093/bioinformatics/bty042.
4
Imputing Gene Expression in Uncollected Tissues Within and Beyond GTEx.推断GTEx内部和外部未采集组织中的基因表达。
Am J Hum Genet. 2016 Apr 7;98(4):697-708. doi: 10.1016/j.ajhg.2016.02.020. Epub 2016 Mar 31.
5
Model-based clustering of multi-tissue gene expression data.基于模型的多组织基因表达数据聚类。
Bioinformatics. 2020 Mar 1;36(6):1807-1813. doi: 10.1093/bioinformatics/btz805.
6
Bayesian integrative model for multi-omics data with missingness.贝叶斯综合模型在多组学数据缺失中的应用。
Bioinformatics. 2018 Nov 15;34(22):3801-3808. doi: 10.1093/bioinformatics/bty775.
7
How powerful are summary-based methods for identifying expression-trait associations under different genetic architectures?基于汇总数据的方法在不同遗传结构下识别表达性状关联的能力有多强?
Pac Symp Biocomput. 2018;23:228-239.
8
Using multiple measurements of tissue to estimate subject- and cell-type-specific gene expression.使用多种组织测量方法来估计个体和细胞类型特异性基因表达。
Bioinformatics. 2020 Feb 1;36(3):782-788. doi: 10.1093/bioinformatics/btz619.
9
A Comparative Study of Single-Trait and Multi-Trait Genomic Selection.单性状与多性状基因组选择的比较研究
J Comput Biol. 2019 Oct;26(10):1100-1112. doi: 10.1089/cmb.2019.0032. Epub 2019 Apr 17.
10
On transformative adaptive activation functions in neural networks for gene expression inference.关于神经网络中基因表达推断的转换自适应激活函数。
PLoS One. 2021 Jan 14;16(1):e0243915. doi: 10.1371/journal.pone.0243915. eCollection 2021.

引用本文的文献

1
Clinical symptoms and functional impairment in attention deficit hyperactivity disorder (ADHD) co-morbid tic disorder (TD) patients: a cluster-based investigation.注意缺陷多动障碍(ADHD)共病抽动障碍(TD)患者的临床症状与功能损害:一项基于聚类的调查
BMC Psychiatry. 2025 Feb 4;25(1):100. doi: 10.1186/s12888-025-06558-0.
2
Prediction of non-suicidal self-injury (NSSI) among rural Chinese junior high school students: a machine learning approach.中国农村初中生非自杀性自伤行为的预测:一种机器学习方法
Ann Gen Psychiatry. 2024 Dec 6;23(1):48. doi: 10.1186/s12991-024-00534-w.
3
Identification and validation of interferon-stimulated gene 15 as a biomarker for dermatomyositis by integrated bioinformatics analysis and machine learning.
通过整合生物信息学分析和机器学习鉴定和验证干扰素刺激基因 15 作为皮肌炎的生物标志物。
Front Immunol. 2024 Nov 4;15:1429817. doi: 10.3389/fimmu.2024.1429817. eCollection 2024.
4
A multi-modal framework improves prediction of tissue-specific gene expression from a surrogate tissue.多模态框架可提高从替代组织预测组织特异性基因表达的能力。
EBioMedicine. 2024 Sep;107:105305. doi: 10.1016/j.ebiom.2024.105305. Epub 2024 Aug 23.
5
Inferring Characteristics of the Tumor Immune Microenvironment of Patients with HNSCC from Single-Cell Transcriptomics of Peripheral Blood.从外周血单细胞转录组学推断头颈部鳞状细胞癌患者的肿瘤免疫微环境特征。
Cancer Res Commun. 2024 Sep 1;4(9):2335-2348. doi: 10.1158/2767-9764.CRC-24-0092.
6
Smart Biosensor for Breast Cancer Survival Prediction Based on Multi-View Multi-Way Graph Learning.基于多视图多向图学习的乳腺癌生存预测智能生物传感器
Sensors (Basel). 2024 May 21;24(11):3289. doi: 10.3390/s24113289.
7
Community cohesion looseness in gene networks reveals individualized drug targets and resistance.基因网络中的社区凝聚度松弛揭示了个体化的药物靶点和耐药性。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae175.
8
Predicting locus-specific DNA methylation levels in cancer and paracancer tissues.预测癌症组织和癌旁组织中位点特异性DNA甲基化水平。
Epigenomics. 2024 Mar 13;16(8):549-70. doi: 10.2217/epi-2023-0114.
9
Transformer-Based Multi-Modal Data Fusion Method for COPD Classification and Physiological and Biochemical Indicators Identification.基于Transformer的慢性阻塞性肺疾病分类及生理生化指标识别的多模态数据融合方法
Biomolecules. 2023 Sep 15;13(9):1391. doi: 10.3390/biom13091391.
10
Building an optimal predictive model for imputing tissue-specific gene expression by combining genotype and whole-blood transcriptome data.通过结合基因型和全血转录组数据构建优化的组织特异性基因表达推断预测模型。
HGG Adv. 2023 Jul 11;4(4):100223. doi: 10.1016/j.xhgg.2023.100223. eCollection 2023 Oct 12.