使用具有实体嵌入向量的混合高斯过程模型在细胞系之间进行知识转移。

Knowledge transfer across cell lines using hybrid Gaussian process models with entity embedding vectors.

机构信息

DataHow AG, Zurich, Switzerland.

Chair for Mathematical Information Science, ETH Zurich.

出版信息

Biotechnol Bioeng. 2021 Nov;118(11):4389-4401. doi: 10.1002/bit.27907. Epub 2021 Aug 12.

DOI:10.1002/bit.27907

PMID:34383309

Abstract

To date, a large number of experiments are performed to develop a biochemical process. The generated data is used only once, to take decisions for development. Could we exploit data of already developed processes to make predictions for a novel process, we could significantly reduce the number of experiments needed. Processes for different products exhibit differences in behaviour, typically only a subset behave similar. Therefore, effective learning on multiple product spanning process data requires a sensible representation of the product identity. We propose to represent the product identity (a categorical feature) by embedding vectors that serve as input to a Gaussian process regression model. We demonstrate how the embedding vectors can be learned from process data and show that they capture an interpretable notion of product similarity. The improvement in performance is compared to traditional one-hot encoding on a simulated cross product learning task. All in all, the proposed method could render possible significant reductions in wet-lab experiments.

摘要

迄今为止，已经进行了大量实验来开发生化过程。生成的数据仅使用一次，用于为开发做出决策。如果我们可以利用已经开发的过程的数据来对新的过程进行预测，我们就可以显著减少所需的实验数量。不同产品的过程表现出不同的行为，通常只有一部分表现出相似的行为。因此，对跨越多个产品的过程数据进行有效的学习需要对产品身份进行合理的表示。我们建议通过嵌入向量来表示产品身份（类别特征），将其作为高斯过程回归模型的输入。我们展示了如何从过程数据中学习嵌入向量，并表明它们捕获了产品相似性的可解释概念。在模拟的交叉产品学习任务上，与传统的独热编码相比，性能得到了提高。总之，该方法可以大大减少湿实验室实验。

相似文献

Knowledge transfer across cell lines using hybrid Gaussian process models with entity embedding vectors.

Biotechnol Bioeng. 2021 Nov;118(11):4389-4401. doi: 10.1002/bit.27907. Epub 2021 Aug 12.

Novel calibration design improves knowledge transfer across products for the characterization of pharmaceutical bioprocesses.

Biotechnol J. 2024 Jul;19(7):e2400080. doi: 10.1002/biot.202400080.

A Knowledge Graph Entity Disambiguation Method Based on Entity-Relationship Embedding and Graph Structure Embedding.

Comput Intell Neurosci. 2021 Sep 23;2021:2878189. doi: 10.1155/2021/2878189. eCollection 2021.

CDE++: Learning Categorical Data Embedding by Enhancing Heterogeneous Feature Value Coupling Relationships.

Entropy (Basel). 2020 Mar 29;22(4):391. doi: 10.3390/e22040391.

A bootstrap-aggregated hybrid semi-parametric modeling framework for bioprocess development.

Bioprocess Biosyst Eng. 2019 Nov;42(11):1853-1865. doi: 10.1007/s00449-019-02181-y. Epub 2019 Aug 2.

Resolution enhancement for lung 4D-CT based on transversal structures by using multiple Gaussian process regression learning.

Phys Med. 2020 Oct;78:187-194. doi: 10.1016/j.ejmp.2020.09.011. Epub 2020 Oct 7.

Domain Adaptation With Neural Embedding Matching.

IEEE Trans Neural Netw Learn Syst. 2020 Jul;31(7):2387-2397. doi: 10.1109/TNNLS.2019.2935608. Epub 2019 Sep 13.

Improving the Named Entity Recognition of Chinese Electronic Medical Records by Combining Domain Dictionary and Rules.

Int J Environ Res Public Health. 2020 Apr 14;17(8):2687. doi: 10.3390/ijerph17082687.

Learning Graph Embedding With Adversarial Training Methods.

IEEE Trans Cybern. 2020 Jun;50(6):2475-2487. doi: 10.1109/TCYB.2019.2932096. Epub 2019 Sep 2.

Patient Representation From Structured Electronic Medical Records Based on Embedding Technique: Development and Validation Study.

JMIR Med Inform. 2021 Jul 23;9(7):e19905. doi: 10.2196/19905.

引用本文的文献

Cross-disciplinary perspectives on the potential for artificial intelligence across chemistry.

Chem Soc Rev. 2025 Apr 25. doi: 10.1039/d5cs00146c.

Iterative hybrid model based optimization of rAAV production.

Biotechnol Prog. 2025 Mar 24:e70006. doi: 10.1002/btpr.70006.

Transfer learning Bayesian optimization for competitor DNA molecule design for use in diagnostic assays.

Biotechnol Bioeng. 2025 Jan;122(1):189-210. doi: 10.1002/bit.28854. Epub 2024 Oct 16.

A perspective-driven and technical evaluation of machine learning in bioreactor scale-up: A case-study for potential model developments.

Eng Life Sci. 2024 Mar 20;24(7):e2400023. doi: 10.1002/elsc.202400023. eCollection 2024 Jul.

Transfer Learning Bayesian Optimization to Design Competitor DNA Molecules for Use in Diagnostic Assays.

ArXiv. 2024 Oct 22:arXiv:2402.17704v2.

Hybrid deep modeling of a CHO-K1 fed-batch process: combining first-principles with deep neural networks.

Front Bioeng Biotechnol. 2023 Sep 8;11:1237963. doi: 10.3389/fbioe.2023.1237963. eCollection 2023.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用具有实体嵌入向量的混合高斯过程模型在细胞系之间进行知识转移。

Knowledge transfer across cell lines using hybrid Gaussian process models with entity embedding vectors.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献