• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用人类胚胎干细胞系H1中的表观遗传特征和DNA序列组成进行基因表达分类。

Gene expression classification using epigenetic features and DNA sequence composition in the human embryonic stem cell line H1.

作者信息

Su Wen-Xia, Li Qian-Zhong, Zhang Lu-Qiang, Fan Guo-Liang, Wu Cheng-Yan, Yan Zhen-He, Zuo Yong-Chun

机构信息

Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China.

Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China.

出版信息

Gene. 2016 Oct 30;592(1):227-234. doi: 10.1016/j.gene.2016.07.059. Epub 2016 Jul 25.

DOI:10.1016/j.gene.2016.07.059
PMID:27468948
Abstract

Epigenetic factors are known to correlate with gene expression in the existing studies. However, quantitative models that accurately classify the highly and lowly expressed genes based on epigenetic factors are currently lacking. In this study, a new machine learning method combines histone modifications, DNA methylation, DNA accessibility, transcription factors, and trinucleotide composition with support vector machines (SVM) is developed in the context of human embryonic stem cell line (H1). The results indicate that the predictive accuracy will be markedly improved when the epigenetic features are considered. The predictive accuracy and Matthews correlation coefficient of the best model are as high as 95.96% and 0.92 for 10-fold cross-validation test, and 95.58% and 0.92 for independent dataset test, respectively. Our model provides a good way to judge a gene is either highly or lowly expressed gene by using genetic and epigenetic data, when the expression data of the gene is lacking. And a web-server GECES for our analysis method is established at http://202.207.14.87:8032/fuwu/GECES/index.asp, so that other scientists can easily get their desired results by our web-server, without going through the mathematical details.

摘要

在现有研究中,已知表观遗传因素与基因表达相关。然而,目前缺乏基于表观遗传因素准确分类高表达基因和低表达基因的定量模型。在本研究中,在人类胚胎干细胞系(H1)的背景下,开发了一种将组蛋白修饰、DNA甲基化、DNA可及性、转录因子和三核苷酸组成与支持向量机(SVM)相结合的新机器学习方法。结果表明,考虑表观遗传特征时,预测准确性将显著提高。最佳模型在10折交叉验证测试中的预测准确率和马修斯相关系数分别高达95.96%和0.92,在独立数据集测试中分别为95.58%和0.92。当缺乏基因表达数据时,我们的模型提供了一种利用遗传和表观遗传数据判断一个基因是高表达基因还是低表达基因的好方法。并且在http://202.207.14.87:8032/fuwu/GECES/index.asp建立了用于我们分析方法的网络服务器GECES,以便其他科学家无需了解数学细节,通过我们的网络服务器就能轻松获得他们想要的结果。

相似文献

1
Gene expression classification using epigenetic features and DNA sequence composition in the human embryonic stem cell line H1.利用人类胚胎干细胞系H1中的表观遗传特征和DNA序列组成进行基因表达分类。
Gene. 2016 Oct 30;592(1):227-234. doi: 10.1016/j.gene.2016.07.059. Epub 2016 Jul 25.
2
Association analysis between the distributions of histone modifications and gene expression in the human embryonic stem cell.人类胚胎干细胞中组蛋白修饰分布与基因表达之间的关联分析
Gene. 2016 Jan 1;575(1):90-100. doi: 10.1016/j.gene.2015.08.041. Epub 2015 Aug 22.
3
Genome-wide identification of Polycomb target genes in human embryonic stem cells.全基因组鉴定人类胚胎干细胞中的多梳靶基因。
Gene. 2013 Apr 15;518(2):425-30. doi: 10.1016/j.gene.2012.12.022. Epub 2013 Jan 9.
4
Genome-wide dynamic changes of DNA methylation of repetitive elements in human embryonic stem cells and fetal fibroblasts.人类胚胎干细胞和胎儿成纤维细胞中重复元件的 DNA 甲基化的全基因组动态变化。
Genomics. 2012 Jan;99(1):10-7. doi: 10.1016/j.ygeno.2011.10.004. Epub 2011 Oct 25.
5
Chromatin modifications and genomic contexts linked to dynamic DNA methylation patterns across human cell types.与人类不同细胞类型中动态DNA甲基化模式相关的染色质修饰和基因组背景。
Sci Rep. 2015 Feb 12;5:8410. doi: 10.1038/srep08410.
6
Predicting methylation status of human DNA sequences by pseudo-trinucleotide composition.基于伪三核苷酸组成预测人类 DNA 序列的甲基化状态。
Talanta. 2011 Aug 15;85(2):1143-7. doi: 10.1016/j.talanta.2011.05.043. Epub 2011 May 27.
7
Epigenetic modification of retinoic acid-treated human embryonic stem cells.维甲酸处理的人类胚胎干细胞的表观遗传学修饰。
BMB Rep. 2010 Dec;43(12):830-5. doi: 10.5483/BMBRep.2010.43.12.830.
8
Epigenetic regulation of Nanog gene in embryonic stem and trophoblast stem cells.胚胎干细胞和滋养层干细胞中Nanog基因的表观遗传调控
Genes Cells. 2007 Mar;12(3):387-96. doi: 10.1111/j.1365-2443.2007.01058.x.
9
Correlation and prediction of gene expression level from amino acid and dipeptide composition of its protein.基于蛋白质的氨基酸和二肽组成对基因表达水平进行相关性分析与预测。
BMC Bioinformatics. 2005 Mar 17;6:59. doi: 10.1186/1471-2105-6-59.
10
Epigenetics: the study of embryonic stem cells by restriction landmark genomic scanning.表观遗传学:通过限制性内切酶标记基因组扫描对胚胎干细胞的研究。
FEBS J. 2008 Apr;275(8):1624-30. doi: 10.1111/j.1742-4658.2008.06331.x. Epub 2008 Mar 7.

引用本文的文献

1
Study on the spatial distribution patterns of histone modifications in Hippo pathway genes.河马通路基因中组蛋白修饰的空间分布模式研究。
Biophys Rep. 2021 Feb 28;7(1):71-79. doi: 10.52601/bpr.2021.200042.
2
A Systematic Review of the Effects of High-Fat Diet Exposure on Oocyte and Follicular Quality: A Molecular Point of View.高脂肪饮食暴露对卵母细胞和卵泡质量影响的系统评价:从分子角度看。
Int J Mol Sci. 2022 Aug 10;23(16):8890. doi: 10.3390/ijms23168890.
3
Selective elimination of pluripotent stem cells by PIKfyve specific inhibitors.
PIKfyve 特异性抑制剂选择性消除多能干细胞。
Stem Cell Reports. 2022 Feb 8;17(2):397-412. doi: 10.1016/j.stemcr.2021.12.013. Epub 2022 Jan 20.
4
Machine Learning in Epigenomics: Insights into Cancer Biology and Medicine.机器学习在表观基因组学中的应用:癌症生物学和医学的新视角。
Biochim Biophys Acta Rev Cancer. 2021 Dec;1876(2):188588. doi: 10.1016/j.bbcan.2021.188588. Epub 2021 Jul 7.
5
Identifying Heat Shock Protein Families from Imbalanced Data by Using Combined Features.利用组合特征从不平衡数据中识别热休克蛋白家族。
Comput Math Methods Med. 2020 Sep 23;2020:8894478. doi: 10.1155/2020/8894478. eCollection 2020.
6
Revealing transcription factor and histone modification co-localization and dynamics across cell lines by integrating ChIP-seq and RNA-seq data.通过整合 ChIP-seq 和 RNA-seq 数据揭示转录因子和组蛋白修饰在细胞系中的共定位和动态变化。
BMC Genomics. 2018 Dec 31;19(Suppl 10):914. doi: 10.1186/s12864-018-5278-5.