Suppr超能文献

利用人类基因组学、转录组学和表观遗传学数据预测的哺乳动物基因组调控区域。

Mammalian genomic regulatory regions predicted by utilizing human genomics, transcriptomics, and epigenetics data.

机构信息

CSIRO Agriculture, 306 Carmody Road, St. Lucia, 4067, QLD, Australia.

Divisions of Genomics of Development and Disease, Institute for Molecular Bioscience, University of Queensland, 306 Carmody Road, St. Lucia, 4067, QLD, Australia.

出版信息

Gigascience. 2018 Mar 1;7(3):1-17. doi: 10.1093/gigascience/gix136.

Abstract

Genome sequences for hundreds of mammalian species are available, but an understanding of their genomic regulatory regions, which control gene expression, is only beginning. A comprehensive prediction of potential active regulatory regions is necessary to functionally study the roles of the majority of genomic variants in evolution, domestication, and animal production. We developed a computational method to predict regulatory DNA sequences (promoters, enhancers, and transcription factor binding sites) in production animals (cows and pigs) and extended its broad applicability to other mammals. The method utilizes human regulatory features identified from thousands of tissues, cell lines, and experimental assays to find homologous regions that are conserved in sequences and genome organization and are enriched for regulatory elements in the genome sequences of other mammalian species. Importantly, we developed a filtering strategy, including a machine learning classification method, to utilize a very small number of species-specific experimental datasets available to select for the likely active regulatory regions. The method finds the optimal combination of sensitivity and accuracy to unbiasedly predict regulatory regions in mammalian species. Furthermore, we demonstrated the utility of the predicted regulatory datasets in cattle for prioritizing variants associated with multiple production and climate change adaptation traits and identifying potential genome editing targets.

摘要

数百种哺乳动物的基因组序列已经可用,但对控制基因表达的基因组调控区域的理解才刚刚开始。全面预测潜在的活性调控区域对于功能研究进化、驯化和动物生产中大多数基因组变异的作用是必要的。我们开发了一种计算方法来预测生产动物(牛和猪)中的调控 DNA 序列(启动子、增强子和转录因子结合位点),并将其广泛适用性扩展到其他哺乳动物。该方法利用从数千种组织、细胞系和实验测定中鉴定出的人类调控特征,找到在序列和基因组组织中保守的同源区域,并且在其他哺乳动物的基因组序列中富含调控元件。重要的是,我们开发了一种过滤策略,包括机器学习分类方法,利用可用的极少数物种特异性实验数据集来选择可能的活性调控区域。该方法找到了最佳的灵敏度和准确性组合,可以在哺乳动物物种中无偏地预测调控区域。此外,我们还证明了预测的调控数据集在牛中的实用性,可用于优先考虑与多种生产和气候变化适应性状相关的变异,并识别潜在的基因组编辑靶标。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3931/5838836/5a5b0092dc4a/gix136fig1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验