Marklevitz Jessica, Harris Laura K
Department of Science, Davenport University, Lansing, Michigan, United States of America.
Department of Health Informatics, Rutgers School of Health Professions, Newark, New Jersey, United States of America.
Bioinformation. 2017 Apr 30;13(4):104-110. doi: 10.6026/97320630013104. eCollection 2017.
Antibiotic resistant Staphylococcus aureus is a major public health concern effecting millions of people annually. Medical science has documented completely untreatable S. aureus infections. These strains are appearing in the community with increasing frequency. New diagnostic and therapeutic options are needed to combat this deadly infection. Interestingly, around 50% of the proteins in S. aureus are annotated as hypothetical. Methods to select hypothetical proteins related to antibiotic resistance have been inadequate. This study uses differential gene expression to identify hypothetical proteins related to antibiotic resistant phenotype strain variations. We apply computational tools to predict physiochemical properties, cellular location, sequence-based homologs, domains, 3D modeling, active site features, and binding partners. Nine of 23 hypothetical proteins were <100 residues, unlikely to be functional proteins based on size. Of the 14 differentially expressed hypothetical proteins examined, confident predictions on function could not be made. Most identified domains had unknown functions. Six hypothetical protein models had >50% confidence over >20% residues. These findings indicate the method of hypothetical protein identification is sufficient; however, current scientific knowledge is inadequate to properly annotate these proteins. This process should be repeated regularly until entire genomes are clearly and accurately annotated.
耐抗生素金黄色葡萄球菌是一个重大的公共卫生问题,每年影响数百万人。医学科学已记录到完全无法治疗的金黄色葡萄球菌感染。这些菌株在社区中出现的频率越来越高。需要新的诊断和治疗方法来对抗这种致命感染。有趣的是,金黄色葡萄球菌中约50%的蛋白质被注释为假想蛋白。选择与抗生素耐药性相关的假想蛋白的方法一直不够充分。本研究利用差异基因表达来鉴定与抗生素耐药表型菌株变异相关的假想蛋白。我们应用计算工具来预测物理化学性质、细胞定位、基于序列的同源物、结构域、三维建模、活性位点特征和结合伙伴。23个假想蛋白中有9个少于100个残基,基于大小不太可能是功能蛋白。在所研究的14个差异表达的假想蛋白中,无法对其功能做出可靠预测。大多数鉴定出的结构域功能未知。六个假想蛋白模型在超过20%的残基上具有超过50%的可信度。这些发现表明假想蛋白鉴定方法是充分的;然而,目前的科学知识不足以对这些蛋白进行恰当注释。这个过程应该定期重复,直到整个基因组得到清晰准确的注释。