• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在美国各县艾滋病流行病学和社会经济数据的复杂网络上绘制高效抗逆转录病毒治疗药物鸡尾酒的化学结构-活性信息。

Mapping chemical structure-activity information of HAART-drug cocktails over complex networks of AIDS epidemiology and socioeconomic data of U.S. counties.

作者信息

Herrera-Ibatá Diana María, Pazos Alejandro, Orbegozo-Medina Ricardo Alfredo, Romero-Durán Francisco Javier, González-Díaz Humberto

机构信息

Department of Information and Communication Technologies, University of A Coruña (UDC), 15071 A Coruña, Spain.

Department of Information and Communication Technologies, University of A Coruña (UDC), 15071 A Coruña, Spain.

出版信息

Biosystems. 2015 Jun;132-133:20-34. doi: 10.1016/j.biosystems.2015.04.007. Epub 2015 Apr 24.

DOI:10.1016/j.biosystems.2015.04.007
PMID:25916548
Abstract

Using computational algorithms to design tailored drug cocktails for highly active antiretroviral therapy (HAART) on specific populations is a goal of major importance for both pharmaceutical industry and public health policy institutions. New combinations of compounds need to be predicted in order to design HAART cocktails. On the one hand, there are the biomolecular factors related to the drugs in the cocktail (experimental measure, chemical structure, drug target, assay organisms, etc.); on the other hand, there are the socioeconomic factors of the specific population (income inequalities, employment levels, fiscal pressure, education, migration, population structure, etc.) to study the relationship between the socioeconomic status and the disease. In this context, machine learning algorithms, able to seek models for problems with multi-source data, have to be used. In this work, the first artificial neural network (ANN) model is proposed for the prediction of HAART cocktails, to halt AIDS on epidemic networks of U.S. counties using information indices that codify both biomolecular and several socioeconomic factors. The data was obtained from at least three major sources. The first dataset included assays of anti-HIV chemical compounds released to ChEMBL. The second dataset is the AIDSVu database of Emory University. AIDSVu compiled AIDS prevalence for >2300 U.S. counties. The third data set included socioeconomic data from the U.S. Census Bureau. Three scales or levels were employed to group the counties according to the location or population structure codes: state, rural urban continuum code (RUCC) and urban influence code (UIC). An analysis of >130,000 pairs (network links) was performed, corresponding to AIDS prevalence in 2310 counties in U.S. vs. drug cocktails made up of combinations of ChEMBL results for 21,582 unique drugs, 9 viral or human protein targets, 4856 protocols, and 10 possible experimental measures. The best model found with the original data was a linear neural network (LNN) with AUROC>0.80 and accuracy, specificity, and sensitivity≈77% in training and external validation series. The change of the spatial and population structure scale (State, UIC, or RUCC codes) does not affect the quality of the model. Unbalance was detected in all the models found comparing positive/negative cases and linear/non-linear model accuracy ratios. Using synthetic minority over-sampling technique (SMOTE), data pre-processing and machine-learning algorithms implemented into the WEKA software, more balanced models were found. In particular, a multilayer perceptron (MLP) with AUROC=97.4% and precision, recall, and F-measure >90% was found.

摘要

使用计算算法为特定人群设计用于高效抗逆转录病毒疗法(HAART)的定制药物鸡尾酒,这对制药行业和公共卫生政策机构而言都是极为重要的目标。为了设计HAART鸡尾酒,需要预测化合物的新组合。一方面,存在与鸡尾酒中的药物相关的生物分子因素(实验测量、化学结构、药物靶点、测定生物等);另一方面,存在特定人群的社会经济因素(收入不平等、就业水平、财政压力、教育、移民、人口结构等),以研究社会经济状况与疾病之间的关系。在这种背景下,必须使用能够为多源数据问题寻找模型的机器学习算法。在这项工作中,提出了第一个用于预测HAART鸡尾酒的人工神经网络(ANN)模型,以利用编码生物分子和多种社会经济因素的信息指数,在美国各县的流行网络上遏制艾滋病。数据至少来自三个主要来源。第一个数据集包括发布到ChEMBL的抗HIV化合物的测定。第二个数据集是埃默里大学的AIDSVu数据库。AIDSVu汇编了美国2300多个县的艾滋病患病率。第三个数据集包括来自美国人口普查局的社会经济数据。根据位置或人口结构代码,采用三个尺度或级别对县进行分组:州、农村城市连续体代码(RUCC)和城市影响代码(UIC)。对超过130,000对(网络链接)进行了分析,对应于美国2310个县的艾滋病患病率与由21,582种独特药物、9种病毒或人类蛋白质靶点、4856种方案和10种可能的实验测量结果组合而成的药物鸡尾酒。使用原始数据找到的最佳模型是线性神经网络(LNN),在训练和外部验证系列中,其曲线下面积(AUROC)>0.80,准确率、特异性和灵敏度约为77%。空间和人口结构尺度(州、UIC或RUCC代码)的变化不会影响模型的质量。在所有找到的模型中,比较阳性/阴性病例和线性/非线性模型准确率比率时检测到不平衡。使用合成少数过采样技术(SMOTE)、数据预处理以及在WEKA软件中实现的机器学习算法,找到了更平衡的模型。特别是,发现了一个多层感知器(MLP),其AUROC = 97.4%,精确率、召回率和F值>90%。

相似文献

1
Mapping chemical structure-activity information of HAART-drug cocktails over complex networks of AIDS epidemiology and socioeconomic data of U.S. counties.在美国各县艾滋病流行病学和社会经济数据的复杂网络上绘制高效抗逆转录病毒治疗药物鸡尾酒的化学结构-活性信息。
Biosystems. 2015 Jun;132-133:20-34. doi: 10.1016/j.biosystems.2015.04.007. Epub 2015 Apr 24.
2
ANN multiscale model of anti-HIV drugs activity vs AIDS prevalence in the US at county level based on information indices of molecular graphs and social networks.基于分子图谱和社会网络信息指数的美国县级抗艾滋病毒药物活性与艾滋病流行率的人工神经网络多尺度模型。
J Chem Inf Model. 2014 Mar 24;54(3):744-55. doi: 10.1021/ci400716y. Epub 2014 Feb 21.
3
Multioutput Perturbation-Theory Machine Learning (PTML) Model of ChEMBL Data for Antiretroviral Compounds.多输出扰断理论机器学习(PTML)模型的 CHEMBL 数据抗逆转录病毒化合物。
Mol Pharm. 2019 Oct 7;16(10):4200-4212. doi: 10.1021/acs.molpharmaceut.9b00538. Epub 2019 Aug 30.
4
2D MI-DRAGON: a new predictor for protein-ligands interactions and theoretic-experimental studies of US FDA drug-target network, oxoisoaporphine inhibitors for MAO-A and human parasite proteins.2D MI-DRAGON:一种新的蛋白配体相互作用预测因子,以及美国 FDA 药物靶点网络、MAO-A 抑制剂和人体寄生虫蛋白的理论-实验研究。
Eur J Med Chem. 2011 Dec;46(12):5838-51. doi: 10.1016/j.ejmech.2011.09.045. Epub 2011 Oct 1.
5
NL MIND-BEST: a web server for ligands and proteins discovery--theoretic-experimental study of proteins of Giardia lamblia and new compounds active against Plasmodium falciparum.NL MIND-BEST:一个用于配体和蛋白质发现的网络服务器——理论-实验研究蓝氏贾第鞭毛虫蛋白和新的抗疟化合物。
J Theor Biol. 2011 May 7;276(1):229-49. doi: 10.1016/j.jtbi.2011.01.010. Epub 2011 Jan 26.
6
[Development of antituberculous drugs: current status and future prospects].[抗结核药物的研发:现状与未来前景]
Kekkaku. 2006 Dec;81(12):753-74.
7
Study of the impact of HIV genotypic drug resistance testing on therapy efficacy.人类免疫缺陷病毒基因耐药性检测对治疗效果的影响研究。
Verh K Acad Geneeskd Belg. 2001;63(5):447-73.
8
AIDS mortality before and after the introduction of highly active antiretroviral therapy: does it vary with socioeconomic group in a country with a National Health System?高效抗逆转录病毒疗法引入前后的艾滋病死亡率:在一个拥有国家卫生系统的国家,其在社会经济群体中是否存在差异?
Eur J Public Health. 2006 Dec;16(6):601-8. doi: 10.1093/eurpub/ckl062. Epub 2006 May 12.
9
Using entropy of drug and protein graphs to predict FDA drug-target network: theoretic-experimental study of MAO inhibitors and hemoglobin peptides from Fasciola hepatica.利用药物和蛋白质网络图的熵预测 FDA 药物-靶标网络:单胺氧化酶抑制剂和来自 Fasciola hepatica 的血红蛋白肽的理论-实验研究。
Eur J Med Chem. 2011 Apr;46(4):1074-94. doi: 10.1016/j.ejmech.2011.01.023. Epub 2011 Jan 21.
10
PTML Model for Selection of Nanoparticles, Anticancer Drugs, and Vitamins in the Design of Drug-Vitamin Nanoparticle Release Systems for Cancer Cotherapy.用于癌症联合治疗的药物-维生素纳米颗粒释放系统设计中纳米颗粒、抗癌药物和维生素选择的 PTML 模型。
Mol Pharm. 2020 Jul 6;17(7):2612-2627. doi: 10.1021/acs.molpharmaceut.0c00308. Epub 2020 Jun 8.

引用本文的文献

1
Perturbation-Theory Machine Learning for Multi-Target Drug Discovery in Modern Anticancer Research.现代抗癌研究中用于多靶点药物发现的微扰理论机器学习
Curr Issues Mol Biol. 2025 Apr 25;47(5):301. doi: 10.3390/cimb47050301.
2
The unequivocal preponderance of biocomputation in clinical virology.生物计算在临床病毒学中占绝对优势。
RSC Adv. 2018 May 18;8(31):17334-17345. doi: 10.1039/c8ra00888d. eCollection 2018 May 9.
3
PTML Modeling for Pancreatic Cancer Research: In Silico Design of Simultaneous Multi-Protein and Multi-Cell Inhibitors.
用于胰腺癌研究的PTML建模:同时针对多种蛋白质和多种细胞的抑制剂的计算机模拟设计
Biomedicines. 2022 Feb 18;10(2):491. doi: 10.3390/biomedicines10020491.
4
In Silico Drug Repurposing for Anti-Inflammatory Therapy: Virtual Search for Dual Inhibitors of Caspase-1 and TNF-Alpha.计算机药物重定位用于抗炎治疗:靶向 Caspase-1 和 TNF-α 的双重抑制剂的虚拟筛选。
Biomolecules. 2021 Dec 4;11(12):1832. doi: 10.3390/biom11121832.
5
A scoping review on the use of machine learning in research on social determinants of health: Trends and research prospects.关于机器学习在健康社会决定因素研究中的应用的范围综述:趋势与研究前景
SSM Popul Health. 2021 Jun 5;15:100836. doi: 10.1016/j.ssmph.2021.100836. eCollection 2021 Sep.
6
Identification of DEP domain-containing proteins by a machine learning method and experimental analysis of their expression in human HCC tissues.通过机器学习方法鉴定 DEP 结构域蛋白,并通过实验分析其在人 HCC 组织中的表达。
Sci Rep. 2016 Dec 21;6:39655. doi: 10.1038/srep39655.