• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用逻辑概率隐变量模型进行稳健共聚类以发现化学化合物的毒理基因组生物标志物及其调节剂量

Robust Co-clustering to Discover Toxicogenomic Biomarkers and Their Regulatory Doses of Chemical Compounds Using Logistic Probabilistic Hidden Variable Model.

作者信息

Hasan Mohammad Nazmol, Rana Md Masud, Begum Anjuman Ara, Rahman Moizur, Mollah Md Nurul Haque

机构信息

Bioinformatics Laboratory, Department of Statistics, University of Rajshahi, Rajshahi, Bangladesh.

Department of Statistics, Bangabandhu Sheikh Mujibur Rahman Agricultural University, Gazipur, Bangladesh.

出版信息

Front Genet. 2018 Nov 1;9:516. doi: 10.3389/fgene.2018.00516. eCollection 2018.

DOI:10.3389/fgene.2018.00516
PMID:30450112
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6225736/
Abstract

Detection of biomarker genes and their regulatory doses of chemical compounds (DCCs) is one of the most important tasks in toxicogenomic studies as well as in drug design and development. There is an online computational platform "Toxygates" to identify biomarker genes and their regulatory DCCs by co-clustering approach. Nevertheless, the algorithm of that platform based on hierarchical clustering (HC) does not share gene-DCC two-way information simultaneously during co-clustering between genes and DCCs. Also it is sensitive to outlying observations. Thus, this platform may produce misleading results in some cases. The probabilistic hidden variable model (PHVM) is a more effective co-clustering approach that share two-way information simultaneously, but it is also sensitive to outlying observations. Therefore, in this paper we have proposed logistic probabilistic hidden variable model (LPHVM) for robust co-clustering between genes and DCCs, since gene expression data are often contaminated by outlying observations. We have investigated the performance of the proposed LPHVM co-clustering approach in a comparison with the conventional PHVM and Toxygates co-clustering approaches using simulated and real life TGP gene expression datasets, respectively. Simulation results show that the proposed method improved the performance over the conventional PHVM in presence of outliers; otherwise, it keeps equal performance. In the case of real life TGP data analysis, three DCCs (glibenclamide-low, perhexilline-low, and hexachlorobenzene-medium) for glutathione metabolism pathway dataset as well as two DCCs (acetaminophen-medium and methapyrilene-low) for PPAR signaling pathway dataset were incorrectly co-clustered by the Toxygates online platform, while only one DCC (hexachlorobenzene-low) for glutathione metabolism pathway was incorrectly co-clustered by the proposed LPHVM approach. Our findings from the real data analysis are also supported by the other findings in the literature.

摘要

检测生物标志物基因及其化学化合物(DCCs)的调控剂量是毒理基因组学研究以及药物设计与开发中最重要的任务之一。有一个在线计算平台“Toxygates”,通过共聚类方法来识别生物标志物基因及其调控的DCCs。然而,该平台基于层次聚类(HC)的算法在基因与DCCs的共聚类过程中不能同时共享基因 - DCC的双向信息。而且它对异常观测值很敏感。因此,该平台在某些情况下可能会产生误导性结果。概率隐变量模型(PHVM)是一种更有效的共聚类方法,能同时共享双向信息,但它同样对异常观测值敏感。所以,在本文中,我们提出了逻辑概率隐变量模型(LPHVM)用于基因与DCCs之间的稳健共聚类,因为基因表达数据常常受到异常观测值的污染。我们分别使用模拟的和真实的TGP基因表达数据集,将所提出的LPHVM共聚类方法与传统的PHVM和Toxygates共聚类方法进行比较,研究了其性能。模拟结果表明,在存在异常值的情况下,所提出的方法比传统的PHVM性能有所提高;否则,它们的性能相当。在实际的TGP数据分析中,Toxygates在线平台将谷胱甘肽代谢途径数据集中的三个DCCs(低剂量格列本脲、低剂量哌克昔林和中剂量六氯苯)以及PPAR信号通路数据集中的两个DCCs(中剂量对乙酰氨基酚和低剂量美吡拉敏)错误地共聚类,而所提出的LPHVM方法仅将谷胱甘肽代谢途径中的一个DCC(低剂量六氯苯)错误地共聚类。我们从实际数据分析中得到的结果也得到了文献中其他研究结果的支持。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45fb/6225736/09598876f504/fgene-09-00516-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45fb/6225736/f24248de1e37/fgene-09-00516-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45fb/6225736/46b9cc2a17c8/fgene-09-00516-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45fb/6225736/09598876f504/fgene-09-00516-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45fb/6225736/f24248de1e37/fgene-09-00516-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45fb/6225736/46b9cc2a17c8/fgene-09-00516-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45fb/6225736/09598876f504/fgene-09-00516-g0003.jpg

相似文献

1
Robust Co-clustering to Discover Toxicogenomic Biomarkers and Their Regulatory Doses of Chemical Compounds Using Logistic Probabilistic Hidden Variable Model.使用逻辑概率隐变量模型进行稳健共聚类以发现化学化合物的毒理基因组生物标志物及其调节剂量
Front Genet. 2018 Nov 1;9:516. doi: 10.3389/fgene.2018.00516. eCollection 2018.
2
Assessment of Drugs Toxicity and Associated Biomarker Genes Using Hierarchical Clustering.利用层次聚类评估药物毒性和相关生物标志物基因。
Medicina (Kaunas). 2019 Aug 8;55(8):451. doi: 10.3390/medicina55080451.
3
Robust identification of significant interactions between toxicogenomic biomarkers and their regulatory chemical compounds using logistic moving range chart.使用逻辑移动范围图稳健识别毒代动力学生物标志物与其调节化学化合物之间的显著相互作用。
Comput Biol Chem. 2019 Feb;78:375-381. doi: 10.1016/j.compbiolchem.2018.12.020. Epub 2018 Dec 26.
4
Toxic Dose prediction of Chemical Compounds to Biomarkers using an ANOVA based Gene Expression Analysis.基于方差分析的基因表达分析预测化学化合物对生物标志物的毒性剂量
Bioinformation. 2018 Jul 31;14(7):369-377. doi: 10.6026/97320630014369. eCollection 2018.
5
Improving the sensitivity of sample clustering by leveraging gene co-expression networks in variable selection.利用基因共表达网络进行变量选择来提高样本聚类的灵敏度。
BMC Bioinformatics. 2014 May 20;15:153. doi: 10.1186/1471-2105-15-153.
6
A Robust Approach for Identification of Cancer Biomarkers and Candidate Drugs.一种稳健的癌症生物标志物和候选药物鉴定方法。
Medicina (Kaunas). 2019 Jun 11;55(6):269. doi: 10.3390/medicina55060269.
7
Robust complementary hierarchical clustering for gene expression data analysis by β-divergence.基于β散度的稳健互补层次聚类在基因表达数据分析中的应用。
J Biosci Bioeng. 2013 Sep;116(3):397-407. doi: 10.1016/j.jbiosc.2013.03.010. Epub 2013 Apr 19.
8
Clustering and re-clustering for pattern discovery in gene expression data.用于基因表达数据中模式发现的聚类和再聚类。
J Bioinform Comput Biol. 2005 Apr;3(2):281-301. doi: 10.1142/s0219720005001053.
9
Robust multi-scale clustering of large DNA microarray datasets with the consensus algorithm.使用一致性算法对大型DNA微阵列数据集进行稳健的多尺度聚类
Bioinformatics. 2006 Jan 1;22(1):58-67. doi: 10.1093/bioinformatics/bti746. Epub 2005 Oct 27.
10
A Hybrid One-Way ANOVA Approach for the Robust and Efficient Estimation of Differential Gene Expression with Multiple Patterns.一种用于稳健高效估计具有多种模式的差异基因表达的混合单向方差分析方法。
PLoS One. 2015 Sep 28;10(9):e0138810. doi: 10.1371/journal.pone.0138810. eCollection 2015.

引用本文的文献

1
Impact of climate change on Boro rice production in Bangladesh: Evidence from time series modeling.气候变化对孟加拉国波萝稻产量的影响:来自时间序列建模的证据
PLoS One. 2025 Jul 23;20(7):e0328699. doi: 10.1371/journal.pone.0328699. eCollection 2025.
2
Robust hierarchical co-clustering for exploring toxicogenomic biomarkers and their chemical regulators.用于探索毒理基因组生物标志物及其化学调节剂的稳健分层共聚类
Sci Rep. 2025 May 14;15(1):16676. doi: 10.1038/s41598-025-99568-7.
3
An investigation of the pigments, antioxidants and free radical scavenging potential of twenty medicinal weeds found in the southern part of Bangladesh.

本文引用的文献

1
Interactive Toxicogenomics: Gene set discovery, clustering and analysis in Toxygates.交互式毒理学基因组学:在 Toxygates 中进行基因集发现、聚类和分析。
Sci Rep. 2017 May 3;7(1):1390. doi: 10.1038/s41598-017-01500-1.
2
Identifying dynamic pathway interactions based on clinical information.
Comput Biol Chem. 2017 Jun;68:260-265. doi: 10.1016/j.compbiolchem.2017.04.009. Epub 2017 Apr 24.
3
ToxDB: pathway-level interpretation of drug-treatment data.ToxDB:药物治疗数据的通路水平解读
调查孟加拉国南部地区的 20 种药用杂草中的色素、抗氧化剂和自由基清除能力。
PeerJ. 2024 Jul 23;12:e17698. doi: 10.7717/peerj.17698. eCollection 2024.
4
An introduction to new robust linear and monotonic correlation coefficients.新的稳健线性和单调相关系数简介。
BMC Bioinformatics. 2021 Mar 31;22(1):170. doi: 10.1186/s12859-021-04098-4.
5
Assessment of Drugs Toxicity and Associated Biomarker Genes Using Hierarchical Clustering.利用层次聚类评估药物毒性和相关生物标志物基因。
Medicina (Kaunas). 2019 Aug 8;55(8):451. doi: 10.3390/medicina55080451.
Database (Oxford). 2016 Apr 13;2016. doi: 10.1093/database/baw052. Print 2016.
4
Open TG-GATEs: a large-scale toxicogenomics database.开放TG-GATEs:一个大规模的毒理基因组学数据库。
Nucleic Acids Res. 2015 Jan;43(Database issue):D921-7. doi: 10.1093/nar/gku955. Epub 2014 Oct 13.
5
Toxygates: interactive toxicity analysis on a hybrid microarray and linked data platform.毒理学基因芯片数据库:基于混合基因芯片和链接数据平台的交互式毒性分析。
Bioinformatics. 2013 Dec 1;29(23):3080-6. doi: 10.1093/bioinformatics/btt531. Epub 2013 Sep 17.
6
Network-based stratification of tumor mutations.基于网络的肿瘤突变分层。
Nat Methods. 2013 Nov;10(11):1108-15. doi: 10.1038/nmeth.2651. Epub 2013 Sep 15.
7
Oxidative stress and antioxidant defense.氧化应激与抗氧化防御。
World Allergy Organ J. 2012 Jan;5(1):9-19. doi: 10.1097/WOX.0b013e3182439613. Epub 2012 Jan 13.
8
A decade of toxicogenomic research and its contribution to toxicological science.毒理基因组学研究十年及其对毒理学科学的贡献。
Toxicol Sci. 2012 Dec;130(2):217-28. doi: 10.1093/toxsci/kfs223. Epub 2012 Jul 12.
9
Human embryonic stem cell derived hepatocyte-like cells as a tool for in vitro hazard assessment of chemical carcinogenicity.人胚胎干细胞来源的肝细胞样细胞作为体外化学致癌性危害评估的工具。
Toxicol Sci. 2011 Dec;124(2):278-90. doi: 10.1093/toxsci/kfr225. Epub 2011 Aug 27.
10
The evolution of bioinformatics in toxicology: advancing toxicogenomics.毒理学中的生物信息学的发展:推进毒代动力学基因组学。
Toxicol Sci. 2011 Mar;120 Suppl 1(Suppl 1):S225-37. doi: 10.1093/toxsci/kfq373. Epub 2010 Dec 22.