Huang Hailiang, Maertens Alexandra M, Hyland Edel M, Dai Junbiao, Norris Anne, Boeke Jef D, Bader Joel S
Department of Biomedical Engineering, Johns Hopkins University and School of Medicine, Baltimore, Maryland 21218, USA.
Genome Res. 2009 Apr;19(4):674-81. doi: 10.1101/gr.083402.108. Epub 2009 Feb 13.
Histones are the basic protein components of nucleosomes. They are among the most conserved proteins and are subject to a plethora of post-translational modifications. Specific histone residues are important in establishing chromatin structure, regulating gene expression and silencing, and responding to DNA damage. Here we present HistoneHits, a database of phenotypes for systematic collections of histone mutants. This database combines assay results (phenotypes) with information about sequences, structures, post-translational modifications, and evolutionary conservation. The web interface presents the information through dynamic tables and figures. It calculates the availability of data for specific mutants and for nucleosome surfaces. The database currently includes 42 assays on 677 mutants multiply covering 405 of the 498 residues across yeast histones H3, H4, H2A, and H2B. We also provide an interface with an extensible controlled vocabulary for research groups to submit new data. Preliminary analyses confirm that mutations at highly conserved residues and modifiable residues are more likely to generate phenotypes. Buried residues and residues on the lateral surface tend to generate more phenotypes, while tail residues generate significantly fewer phenotypes than other residues. Yeast mutants are cross referenced with known human histone variants, identifying a position where a yeast mutant causes loss of ribosomal silencing and a human variant increases breast cancer susceptibility. All data sets are freely available for download.
组蛋白是核小体的基本蛋白质成分。它们是最保守的蛋白质之一,会经历大量的翻译后修饰。特定的组蛋白残基在建立染色质结构、调节基因表达和沉默以及对DNA损伤作出反应方面很重要。在此,我们展示了HistoneHits,这是一个用于系统收集组蛋白突变体表型的数据库。该数据库将检测结果(表型)与有关序列、结构、翻译后修饰和进化保守性的信息相结合。网络界面通过动态表格和图表展示信息。它计算特定突变体和核小体表面的数据可用性。该数据库目前包括对677个突变体的42项检测,这些突变体多重覆盖了酵母组蛋白H3、H4、H2A和H2B中498个残基中的405个。我们还为研究小组提供了一个带有可扩展控制词汇表的界面,以便提交新数据。初步分析证实,高度保守残基和可修饰残基处的突变更有可能产生表型。埋藏残基和侧面残基往往会产生更多表型,而尾部残基产生的表型明显少于其他残基。酵母突变体与已知的人类组蛋白变体相互参照,确定了一个酵母突变体导致核糖体沉默丧失而人类变体增加乳腺癌易感性的位置。所有数据集均可免费下载。