• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于单细胞RNA测序数据,用机器学习方法识别体外培养的人肝细胞标志物

Identifying In Vitro Cultured Human Hepatocytes Markers with Machine Learning Methods Based on Single-Cell RNA-Seq Data.

作者信息

Li ZhanDong, Huang FeiMing, Chen Lei, Huang Tao, Cai Yu-Dong

机构信息

College of Biological and Food Engineering, Jilin Engineering Normal University, Changchun, China.

School of Life Sciences, Shanghai University, Shanghai, China.

出版信息

Front Bioeng Biotechnol. 2022 May 30;10:916309. doi: 10.3389/fbioe.2022.916309. eCollection 2022.

DOI:10.3389/fbioe.2022.916309
PMID:35706505
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9189284/
Abstract

Cell transplantation is an effective method for compensating for the loss of liver function and improve patient survival. However, given that hepatocytes cultivated have diverse developmental processes and physiological features, obtaining hepatocytes that can properly function is difficult. In the present study, we present an advanced computational analysis on single-cell transcriptional profiling to resolve the heterogeneity of the hepatocyte differentiation process and to mine biomarkers at different periods of differentiation. We obtained a batch of compressed and effective classification features with the Boruta method and ranked them using the Max-Relevance and Min-Redundancy method. Some key genes were identified during the culture of hepatocytes, including , which not only regulates terminally differentiated cells in the liver but also affects cell differentiation. , which encodes a CD147 ligand, also appeared in the identified gene list, and the combination of the two proteins mediated multiple biological pathways. Other genes, such as , , and , which are involved in the maturation and differentiation of hepatocytes and assist different hepatic cell types in performing their roles were also identified. Then, several classifiers were trained and evaluated to obtain optimal classifiers and optimal feature subsets, using three classification algorithms (random forest, k-nearest neighbor, and decision tree) and the incremental feature selection method. The best random forest classifier with a 0.940 Matthews correlation coefficient was constructed to distinguish different hepatic cell types. Finally, classification rules were created for quantitatively describing hepatic cell types. In summary, This study provided potential targets for cell transplantation associated liver disease treatment strategies by elucidating the process and mechanism of hepatocyte development at both qualitative and quantitative levels.

摘要

细胞移植是一种补偿肝功能丧失和提高患者生存率的有效方法。然而,鉴于培养的肝细胞具有不同的发育过程和生理特征,获得能够正常发挥功能的肝细胞是困难的。在本研究中,我们对单细胞转录谱进行了先进的计算分析,以解决肝细胞分化过程的异质性,并挖掘分化不同时期的生物标志物。我们使用Boruta方法获得了一批压缩且有效的分类特征,并使用最大相关和最小冗余方法对其进行排序。在肝细胞培养过程中鉴定出了一些关键基因,包括[具体基因1],它不仅调节肝脏中的终末分化细胞,还影响细胞分化。[具体基因2],其编码一种CD147配体,也出现在鉴定出的基因列表中,并且这两种蛋白质的组合介导了多种生物学途径。还鉴定出了其他一些基因,如[具体基因3]、[具体基因4]和[具体基因5],它们参与肝细胞的成熟和分化,并协助不同类型的肝细胞发挥其作用。然后,使用三种分类算法(随机森林、k近邻和决策树)和增量特征选择方法训练和评估了几个分类器,以获得最优分类器和最优特征子集。构建了具有0.940马修斯相关系数的最佳随机森林分类器,以区分不同类型的肝细胞。最后,创建了分类规则来定量描述肝细胞类型。总之,本研究通过在定性和定量水平上阐明肝细胞发育的过程和机制,为细胞移植相关肝病治疗策略提供了潜在靶点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de30/9189284/f523c9e173b1/fbioe-10-916309-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de30/9189284/e5a98d1e20e2/fbioe-10-916309-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de30/9189284/51a07c7697e6/fbioe-10-916309-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de30/9189284/321a015a01b8/fbioe-10-916309-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de30/9189284/f0549042aedb/fbioe-10-916309-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de30/9189284/57cc3d5ad787/fbioe-10-916309-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de30/9189284/f523c9e173b1/fbioe-10-916309-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de30/9189284/e5a98d1e20e2/fbioe-10-916309-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de30/9189284/51a07c7697e6/fbioe-10-916309-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de30/9189284/321a015a01b8/fbioe-10-916309-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de30/9189284/f0549042aedb/fbioe-10-916309-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de30/9189284/57cc3d5ad787/fbioe-10-916309-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de30/9189284/f523c9e173b1/fbioe-10-916309-g006.jpg

相似文献

1
Identifying In Vitro Cultured Human Hepatocytes Markers with Machine Learning Methods Based on Single-Cell RNA-Seq Data.基于单细胞RNA测序数据,用机器学习方法识别体外培养的人肝细胞标志物
Front Bioeng Biotechnol. 2022 May 30;10:916309. doi: 10.3389/fbioe.2022.916309. eCollection 2022.
2
Characterization of spleen and lymph node cell types CITE-seq and machine learning methods.脾脏和淋巴结细胞类型的表征:CITE-seq和机器学习方法
Front Mol Neurosci. 2022 Oct 13;15:1033159. doi: 10.3389/fnmol.2022.1033159. eCollection 2022.
3
Identification of genes related to immune enhancement caused by heterologous ChAdOx1-BNT162b2 vaccines in lymphocytes at single-cell resolution with machine learning methods.利用机器学习方法在单细胞分辨率下鉴定与异源 ChAdOx1-BNT162b2 疫苗引起的淋巴细胞免疫增强相关的基因。
Front Immunol. 2023 Mar 2;14:1131051. doi: 10.3389/fimmu.2023.1131051. eCollection 2023.
4
Detecting Blood Methylation Signatures in Response to Childhood Cancer Radiotherapy via Machine Learning Methods.通过机器学习方法检测儿童癌症放疗反应中的血液甲基化特征。
Biology (Basel). 2022 Apr 15;11(4):607. doi: 10.3390/biology11040607.
5
Identification of DNA Methylation Signature and Rules for SARS-CoV-2 Associated with Age.鉴定与年龄相关的 SARS-CoV-2 的 DNA 甲基化特征和规律。
Front Biosci (Landmark Ed). 2022 Jun 27;27(7):204. doi: 10.31083/j.fbl2707204.
6
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.机器学习算法在(放化疗)治疗结果预测中的应用:分类器的实证比较。
Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.
7
Identification of Cell Markers and Their Expression Patterns in Skin Based on Single-Cell RNA-Sequencing Profiles.基于单细胞RNA测序图谱鉴定皮肤中的细胞标志物及其表达模式。
Life (Basel). 2022 Apr 7;12(4):550. doi: 10.3390/life12040550.
8
Identifying Lung Cancer Cell Markers with Machine Learning Methods and Single-Cell RNA-Seq Data.利用机器学习方法和单细胞RNA测序数据识别肺癌细胞标志物
Life (Basel). 2021 Sep 9;11(9):940. doi: 10.3390/life11090940.
9
Characterization of chromatin accessibility patterns in different mouse cell types using machine learning methods at single-cell resolution.使用机器学习方法在单细胞分辨率下对不同小鼠细胞类型中的染色质可及性模式进行表征。
Front Genet. 2023 Mar 1;14:1145647. doi: 10.3389/fgene.2023.1145647. eCollection 2023.
10
Identifying Key MicroRNA Signatures for Neurodegenerative Diseases With Machine Learning Methods.运用机器学习方法识别神经退行性疾病的关键微小RNA特征
Front Genet. 2022 Apr 21;13:880997. doi: 10.3389/fgene.2022.880997. eCollection 2022.

引用本文的文献

1
Global trends in machine learning applications for single-cell transcriptomics research.单细胞转录组学研究中机器学习应用的全球趋势。
Hereditas. 2025 Aug 16;162(1):164. doi: 10.1186/s41065-025-00528-y.
2
TMSB10 drives prostate cancer aggressiveness via immune microenvironment regulation.TMSB10通过免疫微环境调节驱动前列腺癌的侵袭性。
Mol Med. 2025 Apr 30;31(1):160. doi: 10.1186/s10020-025-01211-8.
3
Patterns of Gene Expression Profiles Associated with Colorectal Cancer in Colorectal Mucosa by Using Machine Learning Methods.

本文引用的文献

1
Identification of Cell Markers and Their Expression Patterns in Skin Based on Single-Cell RNA-Sequencing Profiles.基于单细胞RNA测序图谱鉴定皮肤中的细胞标志物及其表达模式。
Life (Basel). 2022 Apr 7;12(4):550. doi: 10.3390/life12040550.
2
Similarity-Based Method with Multiple-Feature Sampling for Predicting Drug Side Effects.基于相似性的多特征采样方法预测药物副作用。
Comput Math Methods Med. 2022 Apr 1;2022:9547317. doi: 10.1155/2022/9547317. eCollection 2022.
3
Identification of protein functions in mouse with a label space partition method.
利用机器学习方法分析结直肠黏膜中与结直肠癌相关的基因表达谱模式。
Comb Chem High Throughput Screen. 2024;27(19):2921-2934. doi: 10.2174/0113862073266300231026103844.
4
Combining Global-Constrained Concept Factorization and a Regularized Gaussian Graphical Model for Clustering Single-Cell RNA-seq Data.基于全局约束概念分解和正则化高斯图模型的单细胞 RNA-seq 数据聚类分析。
Interdiscip Sci. 2024 Mar;16(1):1-15. doi: 10.1007/s12539-023-00587-7. Epub 2023 Oct 10.
5
Identifying anal and cervical tumorigenesis-associated methylation signaling with machine learning methods.运用机器学习方法识别肛门和宫颈肿瘤发生相关的甲基化信号。
Front Oncol. 2022 Sep 29;12:998032. doi: 10.3389/fonc.2022.998032. eCollection 2022.
用标签空间划分方法鉴定小鼠中的蛋白质功能。
Math Biosci Eng. 2022 Feb 10;19(4):3820-3842. doi: 10.3934/mbe.2022176.
4
Predicting Heart Cell Types by Using Transcriptome Profiles and a Machine Learning Method.利用转录组图谱和机器学习方法预测心脏细胞类型
Life (Basel). 2022 Jan 31;12(2):228. doi: 10.3390/life12020228.
5
Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions.基于关键序列特征和分布预测 RNA 5-甲基胞嘧啶位点
Biomed Res Int. 2022 Jan 13;2022:4035462. doi: 10.1155/2022/4035462. eCollection 2022.
6
Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines.基于癌细胞系基因表达谱的泛癌生物标志物鉴定
Front Cell Dev Biol. 2021 Nov 30;9:781285. doi: 10.3389/fcell.2021.781285. eCollection 2021.
7
iMPT-FDNPL: Identification of Membrane Protein Types with Functional Domains and a Natural Language Processing Approach.iMPT-FDNPL:基于功能域和自然语言处理方法识别膜蛋白类型。
Comput Math Methods Med. 2021 Oct 11;2021:7681497. doi: 10.1155/2021/7681497. eCollection 2021.
8
clusterProfiler 4.0: A universal enrichment tool for interpreting omics data.clusterProfiler 4.0:用于解释组学数据的通用富集工具。
Innovation (Camb). 2021 Jul 1;2(3):100141. doi: 10.1016/j.xinn.2021.100141. eCollection 2021 Aug 28.
9
Identifying COVID-19-Specific Transcriptomic Biomarkers with Machine Learning Methods.运用机器学习方法识别新冠病毒特异性转录组生物标志物
Biomed Res Int. 2021 Jul 6;2021:9939134. doi: 10.1155/2021/9939134. eCollection 2021.
10
Identification of Microbiota Biomarkers With Orthologous Gene Annotation for Type 2 Diabetes.通过直系同源基因注释鉴定2型糖尿病的微生物群生物标志物
Front Microbiol. 2021 Jul 9;12:711244. doi: 10.3389/fmicb.2021.711244. eCollection 2021.