• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于提取最简约特征集并发现潜在抗菌药物耐药性(AMR)生物标志物的交叉验证特征选择(CVFS)方法。

A Cross-Validated Feature Selection (CVFS) approach for extracting the most parsimonious feature sets and discovering potential antimicrobial resistance (AMR) biomarkers.

作者信息

Yang Ming-Ren, Wu Yu-Wei

机构信息

Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei 110, Taiwan, ROC.

Department of Electrical Engineering, National Taiwan University of Science and Technology, Taipei 106, Taiwan, ROC.

出版信息

Comput Struct Biotechnol J. 2022 Dec 28;21:769-779. doi: 10.1016/j.csbj.2022.12.046. eCollection 2023.

DOI:10.1016/j.csbj.2022.12.046
PMID:36698972
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9842539/
Abstract

Understanding genes and their underlying mechanisms is critical in deciphering how antimicrobial-resistant (AMR) bacteria withstand detrimental effects of antibiotic drugs. At the same time the genes related to AMR phenotypes may also serve as biomarkers for predicting whether a microbial strain is resistant to certain antibiotic drugs. We developed a Cross-Validated Feature Selection (CVFS) approach for robustly selecting the most parsimonious gene sets for predicting AMR activities from bacterial pan-genomes. The core idea behind the CVFS approach is interrogating features among non-overlapping sub-parts of the datasets to ensure the representativeness of the features. By randomly splitting the dataset into disjoint sub-parts, conducting feature selection within each sub-part, and intersecting the features shared by all sub-parts, the CVFS approach is able to achieve the goal of extracting the most representative features for yielding satisfactory AMR activity prediction accuracy. By testing this idea on bacterial pan-genome datasets, we showed that this approach was able to extract the most succinct feature sets that predicted AMR activities very well, indicating the potential of these genes as AMR biomarkers. The functional analysis demonstrated that the CVFS approach was able to extract both known AMR genes and novel ones, suggesting the capabilities of the algorithm in selecting relevant features and highlighting the potential of the novel genes in expanding the antimicrobial resistance gene databases.

摘要

了解基因及其潜在机制对于破译抗微生物药物耐药性(AMR)细菌如何抵御抗生素药物的有害影响至关重要。同时,与AMR表型相关的基因也可作为生物标志物,用于预测微生物菌株是否对某些抗生素药物耐药。我们开发了一种交叉验证特征选择(CVFS)方法,用于从细菌泛基因组中稳健地选择最简约的基因集,以预测AMR活性。CVFS方法背后的核心思想是在数据集的非重叠子部分中询问特征,以确保特征的代表性。通过将数据集随机拆分为不相交的子部分,在每个子部分内进行特征选择,并交叉所有子部分共享的特征,CVFS方法能够实现提取最具代表性特征的目标,从而产生令人满意的AMR活性预测准确性。通过在细菌泛基因组数据集上测试这一想法,我们表明该方法能够提取出能很好预测AMR活性的最简洁特征集,这表明这些基因作为AMR生物标志物的潜力。功能分析表明,CVFS方法能够提取已知的AMR基因和新基因,这表明该算法在选择相关特征方面的能力,并突出了新基因在扩展抗微生物耐药基因数据库方面的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a92/9842539/118b2696517a/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a92/9842539/65f089035b24/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a92/9842539/35c37ec5e616/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a92/9842539/7c24c8d20dd7/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a92/9842539/4753a42639f4/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a92/9842539/73be44120cab/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a92/9842539/118b2696517a/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a92/9842539/65f089035b24/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a92/9842539/35c37ec5e616/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a92/9842539/7c24c8d20dd7/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a92/9842539/4753a42639f4/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a92/9842539/73be44120cab/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a92/9842539/118b2696517a/gr5.jpg

相似文献

1
A Cross-Validated Feature Selection (CVFS) approach for extracting the most parsimonious feature sets and discovering potential antimicrobial resistance (AMR) biomarkers.一种用于提取最简约特征集并发现潜在抗菌药物耐药性(AMR)生物标志物的交叉验证特征选择(CVFS)方法。
Comput Struct Biotechnol J. 2022 Dec 28;21:769-779. doi: 10.1016/j.csbj.2022.12.046. eCollection 2023.
2
Enhancing predictions of antimicrobial resistance of pathogens by expanding the potential resistance gene repertoire using a pan-genome-based feature selection approach.通过使用基于泛基因组的特征选择方法扩展潜在抗性基因库,增强对病原体抗菌抗性的预测。
BMC Bioinformatics. 2022 Apr 15;23(Suppl 4):131. doi: 10.1186/s12859-022-04666-2.
3
Unitig-centered pan-genome machine learning approach for predicting antibiotic resistance and discovering novel resistance genes in bacterial strains.基于重叠群的泛基因组机器学习方法用于预测细菌菌株的抗生素抗性并发现新的抗性基因。
Comput Struct Biotechnol J. 2024 Apr 16;23:1864-1876. doi: 10.1016/j.csbj.2024.04.035. eCollection 2024 Dec.
4
A pan-genome-based machine learning approach for predicting antimicrobial resistance activities of the Escherichia coli strains.基于泛基因组的机器学习方法预测大肠杆菌菌株的抗菌药物抗性活性。
Bioinformatics. 2018 Jul 1;34(13):i89-i95. doi: 10.1093/bioinformatics/bty276.
5
Using bacterial pan-genome-based feature selection approach to improve the prediction of minimum inhibitory concentration (MIC).使用基于细菌泛基因组的特征选择方法来改善最低抑菌浓度(MIC)的预测。
Front Genet. 2023 May 30;14:1054032. doi: 10.3389/fgene.2023.1054032. eCollection 2023.
6
PATRIC as a unique resource for studying antimicrobial resistance.PATRIC 作为研究抗菌药物耐药性的独特资源。
Brief Bioinform. 2019 Jul 19;20(4):1094-1102. doi: 10.1093/bib/bbx083.
7
PARMAP: A Pan-Genome-Based Computational Framework for Predicting Antimicrobial Resistance.PARMAP:一种基于泛基因组预测抗菌药物耐药性的计算框架。
Front Microbiol. 2020 Oct 22;11:578795. doi: 10.3389/fmicb.2020.578795. eCollection 2020.
8
Accurate and rapid prediction of tuberculosis drug resistance from genome sequence data using traditional machine learning algorithms and CNN.利用传统机器学习算法和 CNN 从基因组序列数据中准确快速预测结核病耐药性。
Sci Rep. 2022 Feb 14;12(1):2427. doi: 10.1038/s41598-022-06449-4.
9
Predicting future hospital antimicrobial resistance prevalence using machine learning.使用机器学习预测未来医院抗菌药物耐药性的流行情况。
Commun Med (Lond). 2024 Oct 10;4(1):197. doi: 10.1038/s43856-024-00606-8.
10
Antimicrobial resistance genetic factor identification from whole-genome sequence data using deep feature selection.基于全基因组序列数据的深度特征选择进行抗菌药物耐药性遗传因子鉴定。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 15):535. doi: 10.1186/s12859-019-3054-4.

引用本文的文献

1
Integrating deep learning and molecular dynamics simulations for FXR antagonist discovery.整合深度学习与分子动力学模拟以发现法尼醇X受体拮抗剂
Mol Divers. 2025 Apr 2. doi: 10.1007/s11030-025-11145-2.
2
Unitig-centered pan-genome machine learning approach for predicting antibiotic resistance and discovering novel resistance genes in bacterial strains.基于重叠群的泛基因组机器学习方法用于预测细菌菌株的抗生素抗性并发现新的抗性基因。
Comput Struct Biotechnol J. 2024 Apr 16;23:1864-1876. doi: 10.1016/j.csbj.2024.04.035. eCollection 2024 Dec.
3
Using bacterial pan-genome-based feature selection approach to improve the prediction of minimum inhibitory concentration (MIC).

本文引用的文献

1
Identifying interpretable gene-biomarker associations with functionally informed kernel-based tests in 190,000 exomes.在 190,000 个外显子组中,利用基于功能信息核的测试,识别可解释的基因 - 生物标志物关联。
Nat Commun. 2022 Sep 10;13(1):5332. doi: 10.1038/s41467-022-32864-2.
2
Enhancing predictions of antimicrobial resistance of pathogens by expanding the potential resistance gene repertoire using a pan-genome-based feature selection approach.通过使用基于泛基因组的特征选择方法扩展潜在抗性基因库,增强对病原体抗菌抗性的预测。
BMC Bioinformatics. 2022 Apr 15;23(Suppl 4):131. doi: 10.1186/s12859-022-04666-2.
3
Coupling Machine Learning and High Throughput Multiplex Digital PCR Enables Accurate Detection of Carbapenem-Resistant Genes in Clinical Isolates.
使用基于细菌泛基因组的特征选择方法来改善最低抑菌浓度(MIC)的预测。
Front Genet. 2023 May 30;14:1054032. doi: 10.3389/fgene.2023.1054032. eCollection 2023.
4
Deep Learning for Operating Performance Assessment of Industrial Processes with Layer Attention-Based Stacked Performance-Relevant Denoising Auto-Encoders.基于层注意力的堆叠性能相关去噪自动编码器的工业过程运行性能评估深度学习
ACS Omega. 2023 Apr 10;8(16):14583-14596. doi: 10.1021/acsomega.3c00414. eCollection 2023 Apr 25.
结合机器学习与高通量多重数字PCR可实现临床分离株中碳青霉烯耐药基因的准确检测。
Front Mol Biosci. 2021 Nov 23;8:775299. doi: 10.3389/fmolb.2021.775299. eCollection 2021.
4
PangenomeNet: a pan-genome-based network reveals functional modules on antimicrobial resistome for Escherichia coli strains.泛基因组网络:基于泛基因组的网络揭示了大肠杆菌菌株抗微生物耐药组的功能模块。
BMC Bioinformatics. 2021 Nov 10;22(1):548. doi: 10.1186/s12859-021-04459-z.
5
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
6
Integrative Analysis of Whole Genome Sequencing and Phenotypic Resistance Toward Prediction of Trimethoprim-Sulfamethoxazole Resistance in .全基因组测序与表型耐药性的综合分析用于预测……中对甲氧苄啶-磺胺甲恶唑的耐药性
Front Microbiol. 2021 Jan 14;11:607842. doi: 10.3389/fmicb.2020.607842. eCollection 2020.
7
Pan-Cancer Biomarkers: Changing the Landscape of Molecular Testing.泛癌生物标志物:改变分子检测的格局。
Arch Pathol Lab Med. 2021 Jun 1;145(6):692-698. doi: 10.5858/arpa.2020-0513-RA.
8
Identifying Gene Signatures for Cancer Drug Repositioning Based on Sample Clustering.基于样本聚类的癌症药物重定位基因特征识别。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Mar-Apr;19(2):953-965. doi: 10.1109/TCBB.2020.3019781. Epub 2022 Apr 1.
9
ResFinder 4.0 for predictions of phenotypes from genotypes.ResFinder 4.0 用于基因型到表型的预测。
J Antimicrob Chemother. 2020 Dec 1;75(12):3491-3500. doi: 10.1093/jac/dkaa345.
10
Discovery of a novel integron-borne aminoglycoside resistance gene present in clinical pathogens by screening environmental bacterial communities.通过筛选环境细菌群落,发现了一种新型整合子携带的氨基糖苷类抗生素抗性基因,该基因存在于临床病原体中。
Microbiome. 2020 Mar 20;8(1):41. doi: 10.1186/s40168-020-00814-z.