• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

胚胎毒性的预测生物标志物:一种减轻 RNA-Seq 中多重共线性的机器学习方法。

Predictive biomarkers for embryotoxicity: a machine learning approach to mitigating multicollinearity in RNA-Seq.

机构信息

Developmental and Reproductive Toxicology Research Group, Korea Institute of Toxicology, Daejeon, 34114, Republic of Korea.

Institute for Advanced Studies, Universiti Malaya, 50603, Kuala Lumpur, Malaysia.

出版信息

Arch Toxicol. 2024 Dec;98(12):4093-4105. doi: 10.1007/s00204-024-03852-w. Epub 2024 Sep 6.

DOI:10.1007/s00204-024-03852-w
PMID:39242367
Abstract

Multicollinearity, characterized by significant co-expression patterns among genes, often occurs in high-throughput expression data, potentially impacting the predictive model's reliability. This study examined multicollinearity among closely related genes, particularly in RNA-Seq data obtained from embryoid bodies (EB) exposed to 5-fluorouracil perturbation to identify genes associated with embryotoxicity. Six genes-Dppa5a, Gdf3, Zfp42, Meis1, Hoxa2, and Hoxb1-emerged as candidates based on domain knowledge and were validated using qPCR in EBs perturbed by 39 test substances. We conducted correlation studies and utilized the variance inflation factor (VIF) to examine the existence of multicollinearity among the genes. Recursive feature elimination with cross-validation (RFECV) ranked Zfp42 and Hoxb1 as the top two among the seven features considered, identifying them as potential early embryotoxicity assessment biomarkers. As a result, a t test assessing the statistical significance of this two-feature prediction model yielded a p value of 0.0044, confirming the successful reduction of redundancies and multicollinearity through RFECV. Our study presents a systematic methodology for using machine learning techniques in transcriptomics data analysis, enhancing the discovery of potential reporter gene candidates for embryotoxicity screening research, and improving the predictive model's predictive accuracy and feasibility while reducing financial and time constraints.

摘要

多线性,其特征是基因之间存在显著的共表达模式,经常出现在高通量表达数据中,可能会影响预测模型的可靠性。本研究检查了密切相关基因之间的多线性,特别是在胚胎体(EB)中暴露于 5-氟尿嘧啶扰动后获得的 RNA-Seq 数据中,以鉴定与胚胎毒性相关的基因。根据领域知识,六个基因-Dppa5a、Gdf3、Zfp42、Meis1、Hoxa2 和 Hoxb1-作为候选基因出现,并在 39 种测试物质扰动的 EB 中使用 qPCR 进行了验证。我们进行了相关性研究,并利用方差膨胀因子(VIF)来检查基因之间是否存在多线性。递归特征消除与交叉验证(RFECV)将 Zfp42 和 Hoxb1 排在考虑的七个特征中的前两位,将它们确定为潜在的早期胚胎毒性评估生物标志物。因此,评估该两特征预测模型统计显著性的 t 检验得出的 p 值为 0.0044,证实了通过 RFECV 成功减少了冗余和多线性。我们的研究提出了一种系统的方法,用于在转录组学数据分析中使用机器学习技术,增强了对胚胎毒性筛选研究中潜在报告基因候选物的发现,并提高了预测模型的预测准确性和可行性,同时减少了财务和时间限制。

相似文献

1
Predictive biomarkers for embryotoxicity: a machine learning approach to mitigating multicollinearity in RNA-Seq.胚胎毒性的预测生物标志物:一种减轻 RNA-Seq 中多重共线性的机器学习方法。
Arch Toxicol. 2024 Dec;98(12):4093-4105. doi: 10.1007/s00204-024-03852-w. Epub 2024 Sep 6.
2
Neuronal and cardiac toxicity of pharmacological compounds identified through transcriptomic analysis of human pluripotent stem cell-derived embryoid bodies.通过人类多能干细胞衍生的胚状体的转录组分析鉴定出的药物化合物的神经毒性和心脏毒性。
Toxicol Appl Pharmacol. 2021 Dec 15;433:115792. doi: 10.1016/j.taap.2021.115792. Epub 2021 Nov 3.
3
Rapid quantitative high-throughput mouse embryoid body model for embryotoxicity assessment.快速定量高通量小鼠胚胎体模型用于胚胎毒性评估。
Arch Toxicol. 2024 Nov;98(11):3897-3908. doi: 10.1007/s00204-024-03845-9. Epub 2024 Sep 5.
4
Identifying novel transcript biomarkers for hepatocellular carcinoma (HCC) using RNA-Seq datasets and machine learning.利用 RNA-Seq 数据集和机器学习技术鉴定肝细胞癌(HCC)的新型转录生物标志物。
BMC Cancer. 2021 Aug 27;21(1):962. doi: 10.1186/s12885-021-08704-9.
5
Morphological observation of embryoid bodies completes the in vitro evaluation of nanomaterial embryotoxicity in the embryonic stem cell test (EST).胚状体的形态学观察完善了胚胎干细胞试验(EST)中纳米材料胚胎毒性的体外评估。
Toxicol In Vitro. 2015 Oct;29(7):1587-96. doi: 10.1016/j.tiv.2015.06.015. Epub 2015 Jun 17.
6
Robust biomarker screening from gene expression data by stable machine learning-recursive feature elimination methods.基于稳健机器学习-递归特征消除方法的基因表达数据的稳健生物标志物筛选。
Comput Biol Chem. 2022 Oct;100:107747. doi: 10.1016/j.compbiolchem.2022.107747. Epub 2022 Jul 29.
7
Standardization and optimization of the hiPSC-based PluriLum assay for detection of embryonic and developmental toxicants.基于 hiPSC 的 PluriLum assay 用于检测胚胎和发育毒物的标准化和优化。
Arch Toxicol. 2024 Dec;98(12):4107-4116. doi: 10.1007/s00204-024-03870-8. Epub 2024 Oct 4.
8
Specific effect of 5-fluorouracil on alpha-fetoprotein gene expression during the in vitro mouse embryonic stem cell differentiation.5-氟尿嘧啶对体外小鼠胚胎干细胞分化过程中甲胎蛋白基因表达的特异性影响。
Int J Toxicol. 2010 May-Jun;29(3):297-304. doi: 10.1177/1091581810366312.
9
Enhancing diabetic foot ulcer prediction with machine learning: A focus on Localized examinations.利用机器学习增强糖尿病足溃疡预测:聚焦局部检查。
Heliyon. 2024 Sep 19;10(19):e37635. doi: 10.1016/j.heliyon.2024.e37635. eCollection 2024 Oct 15.
10
Advanced developmental toxicity test method based on embryoid body's area.基于类胚体面积的先进发育毒性测试方法。
Reprod Toxicol. 2017 Sep;72:74-85. doi: 10.1016/j.reprotox.2017.06.185. Epub 2017 Jun 30.

引用本文的文献

1
Developmental toxicity: artificial intelligence-powered assessments.发育毒性:人工智能驱动的评估
Trends Pharmacol Sci. 2025 Jun;46(6):486-502. doi: 10.1016/j.tips.2025.04.005. Epub 2025 May 15.
2
Interpretable Machine Learning Model for Predicting Postpartum Depression: Retrospective Study.用于预测产后抑郁症的可解释机器学习模型:回顾性研究
JMIR Med Inform. 2025 Jan 20;13:e58649. doi: 10.2196/58649.

本文引用的文献

1
Neuronally enriched microvesicle RNAs are differentially expressed in the serums of Parkinson's patients.神经元富集的微泡RNA在帕金森病患者血清中存在差异表达。
Front Neurosci. 2023 Jul 6;17:1145923. doi: 10.3389/fnins.2023.1145923. eCollection 2023.
2
Alterations in DNA Methylation in Orofacial Clefts.口腔颌面裂中 DNA 甲基化的改变。
Int J Mol Sci. 2022 Oct 22;23(21):12727. doi: 10.3390/ijms232112727.
3
Machine learning for cell type classification from single nucleus RNA sequencing data.基于单细胞 RNA 测序数据的细胞类型分类的机器学习方法。
PLoS One. 2022 Sep 23;17(9):e0275070. doi: 10.1371/journal.pone.0275070. eCollection 2022.
4
Stem cells differentiation into insulin-producing cells (IPCs): recent advances and current challenges.干细胞分化为胰岛素分泌细胞(IPCs):最新进展和当前挑战。
Stem Cell Res Ther. 2022 Jul 15;13(1):309. doi: 10.1186/s13287-022-02977-y.
5
Serum biomarker-based osteoporosis risk prediction and the systemic effects of Trifolium pratense ethanolic extract in a postmenopausal model.基于血清生物标志物的骨质疏松症风险预测及红车轴草乙醇提取物在绝经后模型中的全身效应
Chin Med. 2022 Jun 14;17(1):70. doi: 10.1186/s13020-022-00622-7.
6
Optimization of the TeraTox Assay for Preclinical Teratogenicity Assessment.优化 Teratox assay 用于临床前致畸性评估。
Toxicol Sci. 2022 Jun 28;188(1):17-33. doi: 10.1093/toxsci/kfac046.
7
Coexpression reveals conserved gene programs that co-vary with cell type across kingdoms.共表达揭示了与细胞类型在整个生物界中共同变化的保守基因程序。
Nucleic Acids Res. 2022 May 6;50(8):4302-4314. doi: 10.1093/nar/gkac276.
8
A novel human stem cell-based biomarker assay for in vitro assessment of developmental toxicity.一种新型基于人干细胞的生物标志物检测方法,用于体外发育毒性评估。
Birth Defects Res. 2022 Nov 15;114(19):1210-1228. doi: 10.1002/bdr2.2001. Epub 2022 Mar 14.
9
Global DNA methylation and chondrogenesis of rat limb buds in a three-dimensional organ culture system.三维器官培养系统中大鼠肢芽的全球 DNA 甲基化和软骨生成。
Bosn J Basic Med Sci. 2022 Jul 29;22(4):560-568. doi: 10.17305/bjbms.2021.6584.
10
Neuronal and cardiac toxicity of pharmacological compounds identified through transcriptomic analysis of human pluripotent stem cell-derived embryoid bodies.通过人类多能干细胞衍生的胚状体的转录组分析鉴定出的药物化合物的神经毒性和心脏毒性。
Toxicol Appl Pharmacol. 2021 Dec 15;433:115792. doi: 10.1016/j.taap.2021.115792. Epub 2021 Nov 3.