• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生物工艺文献组学的文本挖掘:主题建模与文献分类。

Text mining of CHO bioprocess bibliome: Topic modeling and document classification.

机构信息

Department of Computer and Information Sciences, University of Delaware, Newark, Delaware, United States of America.

Center for Bioinformatics and Computational Biology, University of Delaware, Newark, Delaware, United States of America.

出版信息

PLoS One. 2023 Apr 6;18(4):e0274042. doi: 10.1371/journal.pone.0274042. eCollection 2023.

DOI:10.1371/journal.pone.0274042
PMID:37022994
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10079098/
Abstract

Chinese hamster ovary (CHO) cells are widely used for mass production of therapeutic proteins in the pharmaceutical industry. With the growing need in optimizing the performance of producer CHO cell lines, research on CHO cell line development and bioprocess continues to increase in recent decades. Bibliographic mapping and classification of relevant research studies will be essential for identifying research gaps and trends in literature. To qualitatively and quantitatively understand the CHO literature, we have conducted topic modeling using a CHO bioprocess bibliome manually compiled in 2016, and compared the topics uncovered by the Latent Dirichlet Allocation (LDA) models with the human labels of the CHO bibliome. The results show a significant overlap between the manually selected categories and computationally generated topics, and reveal the machine-generated topic-specific characteristics. To identify relevant CHO bioprocessing papers from new scientific literature, we have developed supervized models using Logistic Regression to identify specific article topics and evaluated the results using three CHO bibliome datasets, Bioprocessing set, Glycosylation set, and Phenotype set. The use of top terms as features supports the explainability of document classification results to yield insights on new CHO bioprocessing papers.

摘要

中国仓鼠卵巢(CHO)细胞被广泛用于制药行业的大量生产治疗性蛋白。随着对优化生产用 CHO 细胞系性能的需求不断增长,近年来 CHO 细胞系开发和生物工艺的研究持续增加。文献综述和相关研究的分类对于确定文献中的研究空白和趋势至关重要。为了定性和定量地理解 CHO 文献,我们使用 2016 年手动编制的 CHO 生物工艺生物组进行了主题建模,并将潜在狄利克雷分配(LDA)模型发现的主题与 CHO 生物组的人工标签进行了比较。结果表明,手动选择的类别和计算生成的主题之间存在显著重叠,并揭示了机器生成的主题特定特征。为了从新的科学文献中识别相关的 CHO 生物加工论文,我们使用逻辑回归开发了监督模型来识别特定的文章主题,并使用三个 CHO 生物组数据集(生物处理集、糖基化集和表型集)来评估结果。使用顶级术语作为特征支持文档分类结果的可解释性,从而为新的 CHO 生物加工论文提供见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5655/10079098/a2ffb26c8f0c/pone.0274042.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5655/10079098/3cae7d30655e/pone.0274042.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5655/10079098/00b9a863c315/pone.0274042.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5655/10079098/0bb63e7f8a4e/pone.0274042.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5655/10079098/a2ffb26c8f0c/pone.0274042.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5655/10079098/3cae7d30655e/pone.0274042.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5655/10079098/00b9a863c315/pone.0274042.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5655/10079098/0bb63e7f8a4e/pone.0274042.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5655/10079098/a2ffb26c8f0c/pone.0274042.g004.jpg

相似文献

1
Text mining of CHO bioprocess bibliome: Topic modeling and document classification.生物工艺文献组学的文本挖掘:主题建模与文献分类。
PLoS One. 2023 Apr 6;18(4):e0274042. doi: 10.1371/journal.pone.0274042. eCollection 2023.
2
Quantitative feature extraction from the Chinese hamster ovary bioprocess bibliome using a novel meta-analysis workflow.利用新颖的元分析工作流程从中国仓鼠卵巢生物工艺文献库中进行定量特征提取。
Biotechnol Adv. 2016 Sep-Oct;34(5):621-633. doi: 10.1016/j.biotechadv.2016.02.011. Epub 2016 Mar 3.
3
Perfusion culture of Chinese Hamster Ovary cells for bioprocessing applications.用于生物加工应用的中国仓鼠卵巢细胞灌流培养。
Crit Rev Biotechnol. 2022 Nov;42(7):1099-1115. doi: 10.1080/07388551.2021.1998821. Epub 2021 Nov 29.
4
Genomic landscapes of Chinese hamster ovary cell lines as revealed by the Cricetulus griseus draft genome.通过灰仓鼠草图基因组揭示的中国仓鼠卵巢细胞系的基因组景观。
Nat Biotechnol. 2013 Aug;31(8):759-65. doi: 10.1038/nbt.2624. Epub 2013 Jul 21.
5
Factors and Mechanisms Affecting the Secretion of Recombinant Protein in CHO Cells.影响重组蛋白在CHO细胞中分泌的因素及机制
Curr Pharm Biotechnol. 2023;24(3):391-400. doi: 10.2174/1389201023666220603121316.
6
A global RNA-seq-driven analysis of CHO host and production cell lines reveals distinct differential expression patterns of genes contributing to recombinant antibody glycosylation.一项基于全球RNA测序的对CHO宿主细胞系和生产细胞系的分析揭示了对重组抗体糖基化有贡献的基因的不同差异表达模式。
Biotechnol J. 2015 Sep;10(9):1412-23. doi: 10.1002/biot.201400652. Epub 2015 Aug 26.
7
CHO microRNA engineering is growing up: recent successes and future challenges.CHO 微小 RNA 工程正在兴起:近期的成功与未来的挑战。
Biotechnol Adv. 2013 Dec;31(8):1501-13. doi: 10.1016/j.biotechadv.2013.07.007. Epub 2013 Aug 2.
8
The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell line.中国仓鼠卵巢(CHO-K1)细胞系的基因组序列。
Nat Biotechnol. 2011 Jul 31;29(8):735-41. doi: 10.1038/nbt.1932.
9
Bioprocessing of Recombinant CHO-K1, CHO-DG44, and CHO-S: CHO Expression Hosts Favor Either mAb Production or Biomass Synthesis.重组 CHO-K1、CHO-DG44 和 CHO-S 的生物加工:CHO 表达宿主偏爱单抗生产或生物量合成。
Biotechnol J. 2019 Mar;14(3):e1700686. doi: 10.1002/biot.201700686. Epub 2018 Jun 11.
10
A reflection on the improvement of Chinese hamster ovary cell-based bioprocesses through advances in proteomic techniques.通过蛋白质组学技术的进步来反思中国仓鼠卵巢细胞基生物工艺的改进。
Biotechnol Adv. 2023 Jul-Aug;65:108141. doi: 10.1016/j.biotechadv.2023.108141. Epub 2023 Mar 29.

本文引用的文献

1
Strategies and Considerations for Improving Recombinant Antibody Production and Quality in Chinese Hamster Ovary Cells.提高中国仓鼠卵巢细胞中重组抗体产量和质量的策略与考量
Front Bioeng Biotechnol. 2022 Mar 4;10:856049. doi: 10.3389/fbioe.2022.856049. eCollection 2022.
2
Biopharmaceutical Manufacturing: Historical Perspectives and Future Directions.生物制药制造:历史视角与未来方向
Annu Rev Chem Biomol Eng. 2022 Jun 10;13:141-165. doi: 10.1146/annurev-chembioeng-092220-125832. Epub 2022 Mar 17.
3
A Chinese hamster transcription start site atlas that enables targeted editing of CHO cells.
一个能够实现对中国仓鼠卵巢细胞进行靶向编辑的中国仓鼠转录起始位点图谱。
NAR Genom Bioinform. 2021 Jul 13;3(3):lqab061. doi: 10.1093/nargab/lqab061. eCollection 2021 Sep.
4
Supporting topic modeling and trends analysis in biomedical literature.支持生物医学文献中的主题建模和趋势分析。
J Biomed Inform. 2020 Oct;110:103574. doi: 10.1016/j.jbi.2020.103574. Epub 2020 Sep 21.
5
A Review on the Current Methods of Chinese Hamster Ovary (CHO) Cells Cultivation for the Production of Therapeutic Protein.关于用于治疗性蛋白生产的中国仓鼠卵巢(CHO)细胞培养的当前方法的综述。
Curr Drug Discov Technol. 2021;18(3):354-364. doi: 10.2174/1570163817666200312102137.
6
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
7
PubTator central: automated concept annotation for biomedical full text articles.PubTator 中心:用于生物医学全文文章的自动概念标注。
Nucleic Acids Res. 2019 Jul 2;47(W1):W587-W593. doi: 10.1093/nar/gkz389.
8
An overview of topic modeling and its current applications in bioinformatics.主题建模概述及其在生物信息学中的当前应用。
Springerplus. 2016 Sep 20;5(1):1608. doi: 10.1186/s40064-016-3252-8. eCollection 2016.
9
Quantitative feature extraction from the Chinese hamster ovary bioprocess bibliome using a novel meta-analysis workflow.利用新颖的元分析工作流程从中国仓鼠卵巢生物工艺文献库中进行定量特征提取。
Biotechnol Adv. 2016 Sep-Oct;34(5):621-633. doi: 10.1016/j.biotechadv.2016.02.011. Epub 2016 Mar 3.
10
Survey of Natural Language Processing Techniques in Bioinformatics.生物信息学中的自然语言处理技术综述
Comput Math Methods Med. 2015;2015:674296. doi: 10.1155/2015/674296. Epub 2015 Oct 7.