SNPs2ChIP: Latent Factors of ChIP-seq to infer functions of non-coding SNPs.

Authors

Anand Shankara, Kalesinskas Laurynas, Smail Craig, Tanigawa Yosuke

Affiliation

Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, U.S.A. *These authors contributed equally to this work.

Publication

Pac Symp Biocomput. 2019;24:184-195.

PMID: 30864321
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6417821/
Abstract

Genetic variations of the human genome are linked to many disease phenotypes. While whole-genome sequencing and genome-wide association studies (GWAS) have uncovered a number of genotype-phenotype associations, their functional interpretation remains challenging because most single nucleotide polymorphisms (SNPs) fall into the non-coding region of the genome. Advances in chromatin immunoprecipitation sequencing (ChIP-seq) have made large-scale repositories of epigenetic data available, allowing investigation of coordinated mechanisms of epigenetic markers and transcriptional regulation and their influence on biological function. To address this challenge, we propose SNPs2ChIP, a method to infer biological functions of non-coding variants through unsupervised statistical learning methods applied to publicly available epigenetic datasets. We systematically characterized latent factors by applying singular value decomposition to ChIP-seq tracks of lymphoblastoid cell lines, and annotated the biological function of each latent factor using the genomic region enrichment analysis tool. Using these annotated latent factors as reference, we developed SNPs2ChIP, a pipeline that takes genomic region(s) as an input, identifies the relevant latent factors with quantitative scores, and returns them along with their inferred functions. As a case study, we focused on systemic lupus erythematosus and demonstrated our method's ability to infer relevant biological function. We systematically applied SNPs2ChIP on publicly available datasets, including known GWAS associations from the GWAS catalogue and ChIP-seq peaks from a previously published study. Our approach to leverage latent patterns across genome-wide epigenetic datasets to infer the biological function will advance understanding of the genetics of human diseases by accelerating the interpretation of non-coding genomes.
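
The two computational steps named in the abstract (singular value decomposition of ChIP-seq tracks, then scoring latent factors for a query region) can be illustrated with a small sketch. This is not the authors' implementation: the matrix layout (genomic regions as rows, ChIP-seq tracks as columns), the toy signal values, the function names `fit_latent_factors` and `score_regions`, and the squared-cosine-style score are all assumptions made for illustration, since the abstract does not specify how its quantitative scores are computed.

```python
import numpy as np


def fit_latent_factors(signal, n_factors):
    """Truncated SVD of a (regions x tracks) signal matrix.

    Returns the top `n_factors` left singular vectors, singular values,
    and right singular vectors. The row/column layout is an assumption.
    """
    u, s, vt = np.linalg.svd(signal, full_matrices=False)
    return u[:, :n_factors], s[:n_factors], vt[:n_factors, :]


def score_regions(u, s, region_idx):
    """Hypothetical relevance score of each latent factor for query regions.

    Each region is placed in latent space as its row of U scaled by the
    singular values. The score of a factor is its share of the total
    squared coordinates over the queried regions, so scores sum to 1.
    """
    coords = u[region_idx, :] * s          # shape: (n_query, n_factors)
    squared = (coords ** 2).sum(axis=0)    # per-factor squared contribution
    total = squared.sum()
    if total == 0:
        return np.zeros_like(squared)
    return squared / total


# Toy data (made up): 6 genomic regions x 4 ChIP-seq tracks.
# Regions 0-2 carry signal in tracks 0-1, regions 3-5 in tracks 2-3.
signal = np.array(
    [
        [5, 4, 0, 0],
        [4, 5, 0, 0],
        [5, 5, 0, 0],
        [0, 0, 3, 2],
        [0, 0, 2, 3],
        [0, 0, 3, 3],
    ],
    dtype=float,
)

u, s, vt = fit_latent_factors(signal, n_factors=2)

scores_a = score_regions(u, s, [0])  # query: a region from the first group
scores_b = score_regions(u, s, [3])  # query: a region from the second group

print("factor scores for region 0:", np.round(scores_a, 3))
print("factor scores for region 3:", np.round(scores_b, 3))
```

In this toy setting each group of regions loads on its own latent factor, so the top-scoring factor differs between the two queries. In a pipeline of the kind the abstract describes, the top-scoring factors would then be looked up against their previously annotated functions.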

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c33e/6417821/17cf5dcf8f2b/nihms-999789-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c33e/6417821/b5ac43e2df78/nihms-999789-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c33e/6417821/b370ac3dd284/nihms-999789-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c33e/6417821/296bc879eed8/nihms-999789-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c33e/6417821/84a2d6f573f8/nihms-999789-f0005.jpg
