• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

运用机器学习算法研究七个胚胎阶段细胞的基因表达谱。

Investigating the gene expression profiles of cells in seven embryonic stages with machine learning algorithms.

机构信息

School of Life Sciences, Shanghai University, Shanghai 200444, China; College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China; Shanghai Key Laboratory of PMMP, East China Normal University, Shanghai 200241, China.

Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Key Laboratory of System Control and Information Processing, Ministry of Education of China, 200240 Shanghai, China.

出版信息

Genomics. 2020 May;112(3):2524-2534. doi: 10.1016/j.ygeno.2020.02.004. Epub 2020 Feb 8.

DOI:10.1016/j.ygeno.2020.02.004
PMID:32045671
Abstract

The development of embryonic cells involves several continuous stages, and some genes are related to embryogenesis. To date, few studies have systematically investigated changes in gene expression profiles during mammalian embryogenesis. In this study, a computational analysis using machine learning algorithms was performed on the gene expression profiles of mouse embryonic cells at seven stages. First, the profiles were analyzed through a powerful Monte Carlo feature selection method for the generation of a feature list. Second, increment feature selection was applied on the list by incorporating two classification algorithms: support vector machine (SVM) and repeated incremental pruning to produce error reduction (RIPPER). Through SVM, we extracted several latent gene biomarkers, indicating the stages of embryonic cells, and constructed an optimal SVM classifier that produced a nearly perfect classification of embryonic cells. Furthermore, some interesting rules were accessed by the RIPPER algorithm, suggesting different expression patterns for different stages.

摘要

胚胎细胞的发育涉及几个连续的阶段,其中一些基因与胚胎发生有关。迄今为止,很少有研究系统地研究哺乳动物胚胎发生过程中基因表达谱的变化。在这项研究中,我们使用机器学习算法对来自 7 个阶段的小鼠胚胎细胞的基因表达谱进行了计算分析。首先,通过强大的蒙特卡罗特征选择方法对谱进行分析,以生成特征列表。其次,通过结合两种分类算法:支持向量机(SVM)和重复增量剪枝以产生错误减少(RIPPER),在列表上应用增量特征选择。通过 SVM,我们提取了几个潜在的基因生物标志物,指示胚胎细胞的阶段,并构建了一个最佳的 SVM 分类器,几乎可以完美地对胚胎细胞进行分类。此外,RIPPER 算法还获取了一些有趣的规则,表明不同阶段的表达模式不同。

相似文献

1
Investigating the gene expression profiles of cells in seven embryonic stages with machine learning algorithms.运用机器学习算法研究七个胚胎阶段细胞的基因表达谱。
Genomics. 2020 May;112(3):2524-2534. doi: 10.1016/j.ygeno.2020.02.004. Epub 2020 Feb 8.
2
Analysis of Expression Pattern of snoRNAs in Different Cancer Types with Machine Learning Algorithms.基于机器学习算法的 snoRNAs 在不同癌症类型中的表达模式分析。
Int J Mol Sci. 2019 May 2;20(9):2185. doi: 10.3390/ijms20092185.
3
Identification of leukemia stem cell expression signatures through Monte Carlo feature selection strategy and support vector machine.通过蒙特卡罗特征选择策略和支持向量机鉴定白血病干细胞表达特征。
Cancer Gene Ther. 2020 Feb;27(1-2):56-69. doi: 10.1038/s41417-019-0105-y. Epub 2019 May 29.
4
Analysis of gene expression profiles of lung cancer subtypes with machine learning algorithms.基于机器学习算法的肺癌亚型基因表达谱分析。
Biochim Biophys Acta Mol Basis Dis. 2020 Aug 1;1866(8):165822. doi: 10.1016/j.bbadis.2020.165822. Epub 2020 Apr 28.
5
HelPredictor models single-cell transcriptome to predict human embryo lineage allocation.HelPredictor模型通过单细胞转录组预测人类胚胎谱系分配。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab196.
6
An Efficient Feature Selection Strategy Based on Multiple Support Vector Machine Technology with Gene Expression Data.基于基因表达数据的多支持向量机技术的高效特征选择策略。
Biomed Res Int. 2018 Aug 30;2018:7538204. doi: 10.1155/2018/7538204. eCollection 2018.
7
Distinguishing three subtypes of hematopoietic cells based on gene expression profiles using a support vector machine.基于基因表达谱使用支持向量机区分三种造血细胞亚型。
Biochim Biophys Acta Mol Basis Dis. 2018 Jun;1864(6 Pt B):2255-2265. doi: 10.1016/j.bbadis.2017.12.003. Epub 2017 Dec 12.
8
A machine learning approach for the identification of key markers involved in brain development from single-cell transcriptomic data.一种从单细胞转录组数据中识别参与大脑发育的关键标志物的机器学习方法。
BMC Genomics. 2016 Dec 22;17(Suppl 13):1025. doi: 10.1186/s12864-016-3317-7.
9
Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles.基于基因表达谱鉴定区分化学污染物的生物标志物。
BMC Genomics. 2014 Mar 31;15:248. doi: 10.1186/1471-2164-15-248.
10
Identification of the Gene Expression Rules That Define the Subtypes in Glioma.确定定义神经胶质瘤亚型的基因表达规则。
J Clin Med. 2018 Oct 13;7(10):350. doi: 10.3390/jcm7100350.

引用本文的文献

1
Opportunities and Challenges in Applying AI to Evolutionary Morphology.将人工智能应用于进化形态学的机遇与挑战。
Integr Org Biol. 2024 Sep 23;6(1):obae036. doi: 10.1093/iob/obae036. eCollection 2024.
2
Machine learning-assisted imaging analysis of a human epiblast model.机器学习辅助的人类上胚层模型成像分析。
Integr Biol (Camb). 2021 Oct 15;13(9):221-229. doi: 10.1093/intbio/zyab014.
3
Identifying COVID-19-Specific Transcriptomic Biomarkers with Machine Learning Methods.运用机器学习方法识别新冠病毒特异性转录组生物标志物
Biomed Res Int. 2021 Jul 6;2021:9939134. doi: 10.1155/2021/9939134. eCollection 2021.
4
Analysis of the Sequence Characteristics of Antifreeze Protein.抗冻蛋白的序列特征分析
Life (Basel). 2021 Jun 3;11(6):520. doi: 10.3390/life11060520.
5
Identification of Common Genes and Pathways in Eight Fibrosis Diseases.八种纤维化疾病中常见基因和通路的鉴定
Front Genet. 2021 Jan 15;11:627396. doi: 10.3389/fgene.2020.627396. eCollection 2020.
6
Distinguishing Glioblastoma Subtypes by Methylation Signatures.通过甲基化特征区分胶质母细胞瘤亚型
Front Genet. 2020 Nov 24;11:604336. doi: 10.3389/fgene.2020.604336. eCollection 2020.
7
Multiple Feature Selection Strategies Identified Novel Cardiac Gene Expression Signature for Heart Failure.多种特征选择策略确定了心力衰竭的新型心脏基因表达特征。
Front Physiol. 2020 Nov 11;11:604241. doi: 10.3389/fphys.2020.604241. eCollection 2020.
8
Identification of Post-myocardial Infarction Blood Expression Signatures Using Multiple Feature Selection Strategies.使用多种特征选择策略识别心肌梗死后血液表达特征
Front Physiol. 2020 Jun 3;11:483. doi: 10.3389/fphys.2020.00483. eCollection 2020.
9
Identification and Analysis of Dysfunctional Genes and Pathways in CD8 T Cells of Non-Small Cell Lung Cancer Based on RNA Sequencing.基于RNA测序的非小细胞肺癌CD8 T细胞中功能失调基因和通路的鉴定与分析
Front Genet. 2020 May 8;11:352. doi: 10.3389/fgene.2020.00352. eCollection 2020.
10
The Serum MicroRNA Signatures for Pancreatic Cancer Detection and Operability Evaluation.用于胰腺癌检测和可切除性评估的血清微小RNA特征
Front Bioeng Biotechnol. 2020 Apr 29;8:379. doi: 10.3389/fbioe.2020.00379. eCollection 2020.