• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer.非小细胞肺癌临床试验的强化学习策略
Biometrics. 2011 Dec;67(4):1422-33. doi: 10.1111/j.1541-0420.2011.01572.x. Epub 2011 Mar 8.
2
Reinforcement learning design for cancer clinical trials.强化学习在癌症临床试验中的设计。
Stat Med. 2009 Nov 20;28(26):3294-315. doi: 10.1002/sim.3720.
3
Dose-comparative monotherapy trials of ZD1839 in previously treated non-small cell lung cancer patients.ZD1839在既往接受过治疗的非小细胞肺癌患者中的剂量对比单药治疗试验。
Semin Oncol. 2003 Feb;30(1 Suppl 1):30-8. doi: 10.1053/sonc.2003.50030.
4
Second-line treatments after first-line gefitinib therapy in advanced nonsmall cell lung cancer.
Int J Cancer. 2010 Jan 1;126(1):247-55. doi: 10.1002/ijc.24657.
5
First-line treatment of advanced epidermal growth factor receptor (EGFR) mutation positive non-squamous non-small cell lung cancer.晚期表皮生长因子受体(EGFR)突变阳性非鳞状非小细胞肺癌的一线治疗。
Cochrane Database Syst Rev. 2021 Mar 18;3(3):CD010383. doi: 10.1002/14651858.CD010383.pub3.
6
Optimal pharmacotherapeutic strategies for elderly patients with advanced non-small cell lung cancer.老年晚期非小细胞肺癌患者的最佳药物治疗策略。
Drugs Aging. 2011 Nov 1;28(11):885-94. doi: 10.2165/11595100-000000000-00000.
7
Influence of trial duration on the bias of the estimated treatment effect in clinical trials when individual heterogeneity is ignored.当忽略个体异质性时,临床试验持续时间对估计治疗效果偏差的影响。
Biom J. 2015 May;57(3):371-83. doi: 10.1002/bimj.201400046. Epub 2015 Jan 16.
8
Bayesian network meta-comparison of maintenance treatments for stage IIIb/IV non-small-cell lung cancer (NSCLC) patients with good performance status not progressing after first-line induction chemotherapy: results by performance status, EGFR mutation, histology and response to previous induction.贝叶斯网络荟萃分析在一线诱导化疗后疾病未进展且体能状态良好的 IIIb/IV 期非小细胞肺癌(NSCLC)患者中维持治疗的疗效:基于体能状态、EGFR 突变、组织学和对前序诱导治疗的反应的结果
Eur J Cancer. 2015 Nov;51(16):2330-44. doi: 10.1016/j.ejca.2015.07.007. Epub 2015 Sep 10.
9
The novel and effective nonplatinum, nontaxane combination of gemcitabine and vinorelbine in advanced nonsmall cell lung carcinoma: potential for decreased toxicity and combination with biological therapy.吉西他滨与长春瑞滨这种新型且有效的非铂类、非紫杉烷类联合用药方案用于晚期非小细胞肺癌:降低毒性的潜力及与生物治疗联合应用的可能性
Cancer. 2002 Jul 15;95(2):340-53. doi: 10.1002/cncr.10629.
10
First-line treatment of advanced epidermal growth factor receptor (EGFR) mutation positive non-squamous non-small cell lung cancer.晚期表皮生长因子受体(EGFR)突变阳性非鳞状非小细胞肺癌的一线治疗
Cochrane Database Syst Rev. 2016 May 25(5):CD010383. doi: 10.1002/14651858.CD010383.pub2.

引用本文的文献

1
Optimal adaptive SMART designs with binary outcomes.具有二元结果的最优适应性序贯多重分配随机试验设计
Biometrics. 2024 Oct 3;80(4). doi: 10.1093/biomtc/ujae140.
2
Enhancing Medical Training Through Learning From Mistakes by Interacting With an Ill-Trained Reinforcement Learning Agent.通过与训练不佳的强化学习智能体交互从错误中学习来加强医学培训。
IEEE Trans Learn Technol. 2024;17:1248-1260. doi: 10.1109/tlt.2024.3372508. Epub 2024 Mar 4.
3
Reinforcement Learning to Optimize Ventilator Settings for Patients on Invasive Mechanical Ventilation: Retrospective Study.强化学习优化有创机械通气患者呼吸机设置:回顾性研究。
J Med Internet Res. 2024 Oct 16;26:e44494. doi: 10.2196/44494.
4
Enhancing wound healing through deep reinforcement learning for optimal therapeutics.通过深度强化学习实现最佳治疗以促进伤口愈合。
R Soc Open Sci. 2024 Jul 31;11(7):240228. doi: 10.1098/rsos.240228. eCollection 2024 Jul.
5
Using Pilot Data for Power Analysis of Observational Studies for the Estimation of Dynamic Treatment Regimes.利用试点数据进行观察性研究的功效分析以估计动态治疗方案
Obs Stud. 2023;9(4):25-48. doi: 10.1353/obs.2023.a906627.
6
Multiobjective tree-based reinforcement learning for estimating tolerant dynamic treatment regimes.基于多目标树的强化学习估计宽容动态治疗方案。
Biometrics. 2024 Jan 29;80(1). doi: 10.1093/biomtc/ujad017.
7
Estimation of optimal treatment regimes with electronic medical record data using the residual life value estimator.利用剩余寿命值估计器从电子病历数据中估计最佳治疗方案。
Biostatistics. 2024 Oct 1;25(4):933-946. doi: 10.1093/biostatistics/kxae002.
8
Multi-stage optimal dynamic treatment regimes for survival outcomes with dependent censoring.用于具有相依删失的生存结局的多阶段最优动态治疗方案
Biometrika. 2022 Aug 13;110(2):395-410. doi: 10.1093/biomet/asac047. eCollection 2023 Jun.
9
Estimating optimal individualized treatment rules with multistate processes.用多状态过程估计最优个体化治疗规则。
Biometrics. 2023 Dec;79(4):2830-2842. doi: 10.1111/biom.13864. Epub 2023 Apr 17.
10
Model selection for survival individualized treatment rules using the jackknife estimator.利用刀切估计量进行生存个体化治疗规则的模型选择。
BMC Med Res Methodol. 2022 Dec 22;22(1):328. doi: 10.1186/s12874-022-01811-6.

本文引用的文献

1
Reinforcement learning design for cancer clinical trials.强化学习在癌症临床试验中的设计。
Stat Med. 2009 Nov 20;28(26):3294-315. doi: 10.1002/sim.3720.
2
Phase III study of immediate compared with delayed docetaxel after front-line therapy with gemcitabine plus carboplatin in advanced non-small-cell lung cancer.吉西他滨联合卡铂一线治疗晚期非小细胞肺癌后,多西他赛立即给药与延迟给药的III期研究。
J Clin Oncol. 2009 Feb 1;27(4):591-8. doi: 10.1200/JCO.2008.17.1405. Epub 2008 Dec 15.
3
Considerations for second-line therapy of non-small cell lung cancer.非小细胞肺癌二线治疗的考量因素
Oncologist. 2008;13 Suppl 1:28-36. doi: 10.1634/theoncologist.13-S1-28.
4
Duration of first-line chemotherapy in advanced non small-cell lung cancer: less is more in the era of effective subsequent therapies.晚期非小细胞肺癌一线化疗的疗程:在有有效后续治疗手段的时代,疗程越短越好。
J Clin Oncol. 2007 Nov 20;25(33):5155-7. doi: 10.1200/JCO.2007.13.4015.
5
Treatment of non-small cell lung cancer, stage IV: ACCP evidence-based clinical practice guidelines (2nd edition).IV期非小细胞肺癌的治疗:美国胸科医师学会循证临床实践指南(第2版)
Chest. 2007 Sep;132(3 Suppl):277S-289S. doi: 10.1378/chest.07-1381.
6
Bayesian and frequentist two-stage treatment strategies based on sequential failure times subject to interval censoring.基于区间删失的序贯失效时间的贝叶斯和频率主义两阶段治疗策略。
Stat Med. 2007 Nov 20;26(26):4687-702. doi: 10.1002/sim.2894.
7
Paclitaxel-carboplatin alone or with bevacizumab for non-small-cell lung cancer.单独使用紫杉醇-卡铂或联合贝伐单抗治疗非小细胞肺癌。
N Engl J Med. 2006 Dec 14;355(24):2542-50. doi: 10.1056/NEJMoa061884.
8
A Generalization Error for Q-Learning.Q学习的泛化误差
J Mach Learn Res. 2005 Jul;6:1073-1097.
9
Erlotinib in previously treated non-small-cell lung cancer.厄洛替尼用于既往接受过治疗的非小细胞肺癌。
N Engl J Med. 2005 Jul 14;353(2):123-32. doi: 10.1056/NEJMoa050753.
10
An experimental design for the development of adaptive treatment strategies.一种用于制定适应性治疗策略的实验设计。
Stat Med. 2005 May 30;24(10):1455-81. doi: 10.1002/sim.2022.

非小细胞肺癌临床试验的强化学习策略

Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer.

作者信息

Zhao Yufan, Zeng Donglin, Socinski Mark A, Kosorok Michael R

机构信息

Global Biostatistics and Epidemiology, Amgen Inc., One Amgen Center Drive, Thousand Oaks, California 91320, USA.

出版信息

Biometrics. 2011 Dec;67(4):1422-33. doi: 10.1111/j.1541-0420.2011.01572.x. Epub 2011 Mar 8.

DOI:10.1111/j.1541-0420.2011.01572.x
PMID:21385164
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3138840/
Abstract

Typical regimens for advanced metastatic stage IIIB/IV nonsmall cell lung cancer (NSCLC) consist of multiple lines of treatment. We present an adaptive reinforcement learning approach to discover optimal individualized treatment regimens from a specially designed clinical trial (a "clinical reinforcement trial") of an experimental treatment for patients with advanced NSCLC who have not been treated previously with systemic therapy. In addition to the complexity of the problem of selecting optimal compounds for first- and second-line treatments based on prognostic factors, another primary goal is to determine the optimal time to initiate second-line therapy, either immediately or delayed after induction therapy, yielding the longest overall survival time. A reinforcement learning method called Q-learning is utilized, which involves learning an optimal regimen from patient data generated from the clinical reinforcement trial. Approximating the Q-function with time-indexed parameters can be achieved by using a modification of support vector regression that can utilize censored data. Within this framework, a simulation study shows that the procedure can extract optimal regimens for two lines of treatment directly from clinical data without prior knowledge of the treatment effect mechanism. In addition, we demonstrate that the design reliably selects the best initial time for second-line therapy while taking into account the heterogeneity of NSCLC across patients.

摘要

晚期转移性IIIB/IV期非小细胞肺癌(NSCLC)的典型治疗方案包括多线治疗。我们提出了一种自适应强化学习方法,以从一项专门设计的针对未接受过全身治疗的晚期NSCLC患者的实验性治疗临床试验(“临床强化试验”)中发现最佳个体化治疗方案。除了基于预后因素为一线和二线治疗选择最佳化合物这一问题的复杂性之外,另一个主要目标是确定开始二线治疗的最佳时间,是在诱导治疗后立即开始还是延迟开始,以获得最长的总生存时间。我们使用了一种名为Q学习的强化学习方法,该方法涉及从临床强化试验产生的患者数据中学习最佳治疗方案。通过使用一种可以利用截尾数据的支持向量回归的修改方法,可以用时间索引参数来近似Q函数。在这个框架内,一项模拟研究表明,该程序可以直接从临床数据中提取两线治疗的最佳治疗方案,而无需事先了解治疗效果机制。此外,我们证明,该设计在考虑到NSCLC患者间的异质性的同时,可靠地选择了二线治疗的最佳初始时间。