A comprehensive review of computational prediction of genome-wide features.

Authors

Xu Tianlei, Zheng Xiaoqi, Li Ben, Jin Peng, Qin Zhaohui, Wu Hao

Affiliations

Department of Mathematics and Computer Science, Emory University, Atlanta, GA, USA.

Department of Mathematics, Shanghai Normal University, Shanghai, China.

Publication information

Brief Bioinform. 2020 Jan 17;21(1):120-134. doi: 10.1093/bib/bby110.

DOI:10.1093/bib/bby110

PMID:30462144

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10233247/

Abstract

There are significant correlations among different types of genetic, genomic and epigenomic features within the genome. These correlations make the in silico feature prediction possible through statistical or machine learning models. With the accumulation of a vast amount of high-throughput data, feature prediction has gained significant interest lately, and a plethora of papers have been published in the past few years. Here we provide a comprehensive review on these published works, categorized by the prediction targets, including protein binding site, enhancer, DNA methylation, chromatin structure and gene expression. We also provide discussions on some important points and possible future directions.
