• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用凸优化创建的稀疏模型从逆转录酶和蛋白酶氨基酸序列准确预测HIV-1药物反应。

Accurate prediction of HIV-1 drug response from the reverse transcriptase and protease amino acid sequences using sparse models created by convex optimization.

作者信息

Rabinowitz Matthew, Myers Lance, Banjevic Milena, Chan Albert, Sweetkind-Singer Joshua, Haberer Jessica, McCann Kelly, Wolkowicz Roland

机构信息

Gene Security Network, Palo Alto, CA, USA.

出版信息

Bioinformatics. 2006 Mar 1;22(5):541-9. doi: 10.1093/bioinformatics/btk011. Epub 2005 Dec 20.

DOI:10.1093/bioinformatics/btk011
PMID:16368772
Abstract

MOTIVATION

Genotype-phenotype modeling problems are often overcomplete, or ill-posed, since the number of potential predictors-genes, proteins, mutations and their interactions-is large relative to the number of measured outcomes. Such datasets can still be used to train sparse parameter models that generalize accurately, by exerting a principle similar to Occam's Razor: When many possible theories can explain the observations, the most simple is most likely to be correct. We apply this philosophy to modeling the drug response of Type-1 Human Immunodeficiency Virus (HIV-1). Owing to the decreasing expense of genetic sequencing relative to in vitro phenotype testing, a statistical model that reliably predicts viral drug response from genetic data is an important tool in the selection of antiretroviral therapy (ART). The optimization techniques described will have application to many genotype-phenotype modeling problems for the purpose of enhancing clinical decisions.

RESULTS

We describe two regression techniques for predicting viral phenotype in response to ART from genetic sequence data. Both techniques employ convex optimization for the continuous subset selection of a sparse set of model parameters. The first technique, the least absolute shrinkage and selection operator, uses the l(1) norm loss function to create a sparse linear model; the second, the support vector machine with radial basis kernel functions, uses the epsilon-insensitive loss function to create a sparse non-linear model. The techniques are applied to predict the response of the HIV-1 virus to 10 reverse transcriptase inhibitor and 7 protease inhibitor drugs. The genetic data are derived from the HIV coding sequences for the reverse transcriptase and protease enzymes. When tested by cross-validation with actual laboratory measurements, these models predict drug response phenotype more accurately than models previously discussed in the literature, and other canonical techniques described here. Key features of the methods that enable this performance are the tendency to generate simple models where many of the parameters are zero, and the convexity of the cost function, which assures that we can find model parameters to globally minimize the cost function for a particular training dataset.

AVAILABILITY

Results, tables and figures are available at ftp://ftp.genesecurity.net.

SUPPLEMENTARY INFORMATION

An Appendix to accompany this article is available at Bioinformatics online.

摘要

动机

基因型 - 表型建模问题通常是超完备的,或者是不适定的,因为相对于测量结果的数量而言,潜在预测因子(基因、蛋白质、突变及其相互作用)的数量很大。通过运用类似于奥卡姆剃刀的原则,这样的数据集仍可用于训练能准确泛化的稀疏参数模型:当许多可能的理论都能解释观测结果时,最简单的理论最有可能是正确的。我们将这一理念应用于对1型人类免疫缺陷病毒(HIV - 1)药物反应的建模。由于相对于体外表型测试,基因测序成本不断降低,一个能从基因数据可靠预测病毒药物反应的统计模型是抗逆转录病毒疗法(ART)选择中的重要工具。所描述的优化技术将应用于许多基因型 - 表型建模问题,以增强临床决策。

结果

我们描述了两种从基因序列数据预测ART反应中病毒表型的回归技术。两种技术都采用凸优化来对稀疏的一组模型参数进行连续子集选择。第一种技术,即最小绝对收缩和选择算子,使用l(1)范数损失函数创建一个稀疏线性模型;第二种技术,即具有径向基核函数的支持向量机,使用ε - 不敏感损失函数创建一个稀疏非线性模型。这些技术被应用于预测HIV - 1病毒对10种逆转录酶抑制剂和7种蛋白酶抑制剂药物的反应。基因数据源自逆转录酶和蛋白酶的HIV编码序列。当通过与实际实验室测量进行交叉验证测试时,这些模型比文献中先前讨论的模型以及这里描述的其他经典技术更准确地预测药物反应表型。实现这种性能的方法的关键特征是倾向于生成许多参数为零的简单模型,以及成本函数的凸性,这确保我们能够找到模型参数以全局最小化特定训练数据集的成本函数。

可用性

结果、表格和图形可在ftp://ftp.genesecurity.net获取。

补充信息

本文的附录可在《生物信息学》在线版获取。

相似文献

1
Accurate prediction of HIV-1 drug response from the reverse transcriptase and protease amino acid sequences using sparse models created by convex optimization.利用凸优化创建的稀疏模型从逆转录酶和蛋白酶氨基酸序列准确预测HIV-1药物反应。
Bioinformatics. 2006 Mar 1;22(5):541-9. doi: 10.1093/bioinformatics/btk011. Epub 2005 Dec 20.
2
Why neural networks should not be used for HIV-1 protease cleavage site prediction.为何神经网络不应被用于预测HIV-1蛋白酶切割位点。
Bioinformatics. 2004 Jul 22;20(11):1702-9. doi: 10.1093/bioinformatics/bth144. Epub 2004 Feb 26.
3
Use of the l1 norm for selection of sparse parameter sets that accurately predict drug response phenotype from viral genetic sequences.使用L1范数来选择能够从病毒基因序列准确预测药物反应表型的稀疏参数集。
AMIA Annu Symp Proc. 2005;2005:505-9.
4
Mining HIV protease cleavage data using genetic programming with a sum-product function.使用带有和积函数的遗传编程挖掘HIV蛋白酶切割数据。
Bioinformatics. 2004 Dec 12;20(18):3398-405. doi: 10.1093/bioinformatics/bth414. Epub 2004 Jul 15.
5
Prediction of phenotypic susceptibility to antiretroviral drugs using physiochemical properties of the primary enzymatic structure combined with artificial neural networks.利用主要酶结构的物理化学性质结合人工神经网络预测对抗逆转录病毒药物的表型易感性。
HIV Med. 2008 Oct;9(8):642-52. doi: 10.1111/j.1468-1293.2008.00612.x. Epub 2008 Jul 8.
6
Predicting human immunodeficiency virus protease cleavage sites in proteins by a discriminant function method.用判别函数法预测蛋白质中的人类免疫缺陷病毒蛋白酶切割位点
Proteins. 1996 Jan;24(1):51-72. doi: 10.1002/(SICI)1097-0134(199601)24:1<51::AID-PROT4>3.0.CO;2-R.
7
Predicting hepatitis C virus protease cleavage sites using generalized linear indicator regression models.使用广义线性指标回归模型预测丙型肝炎病毒蛋白酶切割位点。
IEEE Trans Biomed Eng. 2006 Oct;53(10):2119-23. doi: 10.1109/TBME.2006.881779.
8
An integrative approach for predicting interactions of protein regions.一种预测蛋白质区域相互作用的综合方法。
Bioinformatics. 2008 Aug 15;24(16):i35-41. doi: 10.1093/bioinformatics/btn290.
9
Prediction of Ras-effector interactions using position energy matrices.使用位置能量矩阵预测Ras效应器相互作用。
Bioinformatics. 2007 Sep 1;23(17):2226-30. doi: 10.1093/bioinformatics/btm336. Epub 2007 Jun 28.
10
Prediction of HIV-1 drug susceptibility phenotype from the viral genotype using linear regression modeling.使用线性回归模型从病毒基因型预测HIV-1药物敏感性表型。
J Virol Methods. 2007 Oct;145(1):47-55. doi: 10.1016/j.jviromet.2007.05.009. Epub 2007 Jun 15.

引用本文的文献

1
Prediction of Antileishmanial Compounds: General Model, Preparation, and Evaluation of 2-Acylpyrrole Derivatives.抗利什曼原虫化合物的预测:2-酰基吡咯衍生物的通用模型、制备和评价。
J Chem Inf Model. 2022 Aug 22;62(16):3928-3940. doi: 10.1021/acs.jcim.2c00731. Epub 2022 Aug 10.
2
Lasso regularization for left-censored Gaussian outcome and high-dimensional predictors.左截断高斯结局和高维预测因子的套索正则化。
BMC Med Res Methodol. 2018 Dec 4;18(1):159. doi: 10.1186/s12874-018-0609-4.
3
Improving counterfactual reasoning with kernelised dynamic mixing models.
利用核化动态混合模型改进反事实推理。
PLoS One. 2018 Nov 12;13(11):e0205839. doi: 10.1371/journal.pone.0205839. eCollection 2018.
4
The individualized genetic barrier predicts treatment response in a large cohort of HIV-1 infected patients.个体化遗传屏障可预测大量 HIV-1 感染患者的治疗反应。
PLoS Comput Biol. 2013;9(8):e1003203. doi: 10.1371/journal.pcbi.1003203. Epub 2013 Aug 29.
5
Computational analysis of anti-HIV-1 antibody neutralization panel data to identify potential functional epitope residues.计算分析抗 HIV-1 抗体中和面板数据,以鉴定潜在的功能表位残基。
Proc Natl Acad Sci U S A. 2013 Jun 25;110(26):10598-603. doi: 10.1073/pnas.1309215110. Epub 2013 Jun 10.
6
Investigation of Super Learner Methodology on HIV-1 Small Sample: Application on Jaguar Trial Data.超级学习者方法在HIV-1小样本上的研究:在捷豹试验数据中的应用
AIDS Res Treat. 2012;2012:478467. doi: 10.1155/2012/478467. Epub 2012 Apr 3.
7
A multifaceted analysis of HIV-1 protease multidrug resistance phenotypes.HIV-1 蛋白酶多药耐药表型的多方面分析。
BMC Bioinformatics. 2011 Dec 15;12:477. doi: 10.1186/1471-2105-12-477.
8
Alternative methods to analyse the impact of HIV mutations on virological response to antiviral therapy.分析HIV突变对抗病毒治疗病毒学反应影响的替代方法。
BMC Med Res Methodol. 2008 Oct 22;8:68. doi: 10.1186/1471-2288-8-68.
9
Identification of a novel resistance (E40F) and compensatory (K43E) substitution in HIV-1 reverse transcriptase.在HIV-1逆转录酶中鉴定出一种新的耐药性(E40F)和补偿性(K43E)替代。
Retrovirology. 2008 Feb 13;5:20. doi: 10.1186/1742-4690-5-20.