• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

建立并验证用于变异注释和表达分析的调控区域。

Establishing and validating regulatory regions for variant annotation and expression analysis.

作者信息

Kaplun Alexander, Krull Mathias, Lakshman Karthick, Matys Volker, Lewicki Birgit, Hogan Jennifer D

机构信息

QIAGEN Bioinformatics, 35 Gatehouse Drive, Waltham, MA, 02451, USA.

出版信息

BMC Genomics. 2016 Jun 23;17 Suppl 2(Suppl 2):393. doi: 10.1186/s12864-016-2724-0.

DOI:10.1186/s12864-016-2724-0
PMID:27357948
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4928138/
Abstract

BACKGROUND

The regulatory effect of inherited or de novo genetic variants occurring in promoters as well as in transcribed or even coding gene regions is gaining greater recognition as a contributing factor to disease processes in addition to mutations affecting protein functionality. Thousands of such regulatory mutations are already recorded in HGMD, OMIM, ClinVar and other databases containing published disease causing and associated mutations. It is therefore important to properly annotate genetic variants occurring in experimentally verified and predicted transcription factor binding sites (TFBS) that could thus influence the factor binding event. Selection of the promoter sequence used is an important factor in the analysis as it directly influences the composition of the sequence available for transcription factor binding analysis.

RESULTS

In this study we first establish genomic regions likely to be involved in regulation of gene expression. TRANSFAC uses a method of virtual transcription start sites (vTSS) calculation to define the best supported promoter for a gene. We have performed a comparison of the virtually calculated promoters between the best supported and secondary promoters in hg19 and hg38 reference genomes to test and validate the approach. Next we create and utilize a workflow for systematic analysis of casual disease associated variants in TFBS using Genome Trax and TRANSFAC databases. A total of 841 and 736 experimentally verified TFBSs within best supported promoters were mapped over HGMD and ClinVar mutation sites respectively. Tens of thousands of predicted ChIP-Seq derived TFBSs were mapped over mutations as well. We have further analyzed some of these mutations for potential gain or loss in transcription factor binding.

CONCLUSIONS

We have confirmed the validity of TRANSFAC's approach to define the best supported promoters and established a workflow of their use in annotation of regulatory genetic variants.

摘要

背景

除了影响蛋白质功能的突变外,启动子以及转录甚至编码基因区域中发生的遗传变异的调控作用作为疾病发生过程的一个促成因素正得到越来越多的认可。HGMD、OMIM、ClinVar和其他包含已发表的致病和相关突变的数据库中已经记录了数千种此类调控突变。因此,正确注释实验验证和预测的转录因子结合位点(TFBS)中发生的遗传变异非常重要,因为这些变异可能会影响因子结合事件。所使用的启动子序列的选择是分析中的一个重要因素,因为它直接影响可用于转录因子结合分析的序列组成。

结果

在本研究中,我们首先确定可能参与基因表达调控的基因组区域。TRANSFAC使用虚拟转录起始位点(vTSS)计算方法来定义基因的最佳支持启动子。我们对hg19和hg38参考基因组中最佳支持启动子和次要启动子之间的虚拟计算启动子进行了比较,以测试和验证该方法。接下来,我们创建并利用了一个工作流程,使用Genome Trax和TRANSFAC数据库对TFBS中与疾病相关的偶然变异进行系统分析。分别在HGMD和ClinVar突变位点上定位了最佳支持启动子内总共841个和736个经实验验证的TFBS。数以万计的预测ChIP-Seq衍生TFBS也被定位到突变上。我们进一步分析了其中一些突变对转录因子结合的潜在增减情况。

结论

我们已经证实了TRANSFAC定义最佳支持启动子方法的有效性,并建立了在调控遗传变异注释中使用这些启动子的工作流程。

相似文献

1
Establishing and validating regulatory regions for variant annotation and expression analysis.建立并验证用于变异注释和表达分析的调控区域。
BMC Genomics. 2016 Jun 23;17 Suppl 2(Suppl 2):393. doi: 10.1186/s12864-016-2724-0.
2
Selective constraints in experimentally defined primate regulatory regions.实验定义的灵长类动物调控区域中的选择限制
PLoS Genet. 2008 Aug 15;4(8):e1000157. doi: 10.1371/journal.pgen.1000157.
3
Genome-wide transcription factor binding site/promoter databases for the analysis of gene sets and co-occurrence of transcription factor binding motifs.基因组范围的转录因子结合位点/启动子数据库,用于分析基因集和转录因子结合模体的共发生。
BMC Genomics. 2010 Mar 1;11:145. doi: 10.1186/1471-2164-11-145.
4
Bioinformatic prediction of transcription factor binding sites at promoter regions of genes for photoperiod and vernalization responses in model and temperate cereal plants.模式植物和温带谷类作物中光周期及春化反应相关基因启动子区域转录因子结合位点的生物信息学预测
BMC Genomics. 2016 Aug 8;17:573. doi: 10.1186/s12864-016-2916-7.
5
Systematic identification and annotation of multiple-variant compound effects at transcription factor binding sites in human genome.系统识别和注释人类基因组转录因子结合位点的多变量复合效应。
J Genet Genomics. 2018 Jul 20;45(7):373-379. doi: 10.1016/j.jgg.2018.05.005. Epub 2018 Jul 7.
6
Identifying functional transcription factor binding sites in yeast by considering their positional preference in the promoters.通过考虑启动子中位置偏好来识别酵母中的功能转录因子结合位点。
PLoS One. 2013 Dec 26;8(12):e83791. doi: 10.1371/journal.pone.0083791. eCollection 2013.
7
Disease-associated genetic variants in the regulatory regions of human genes: mechanisms of action on transcription and genomic resources for dissecting these mechanisms.人类基因调控区域中与疾病相关的遗传变异:对转录的作用机制及剖析这些机制的基因组资源
Vavilovskii Zhurnal Genet Selektsii. 2021 Feb;25(1):18-29. doi: 10.18699/VJ21.003.
8
Identification of estrogen-responsive genes using a genome-wide analysis of promoter elements for transcription factor binding sites.利用转录因子结合位点启动子元件的全基因组分析鉴定雌激素反应基因。
J Biol Chem. 2005 Jun 3;280(22):21491-7. doi: 10.1074/jbc.M409176200. Epub 2005 Mar 24.
9
MAPPER: a search engine for the computational identification of putative transcription factor binding sites in multiple genomes.MAPPER:一种用于在多个基因组中通过计算识别潜在转录因子结合位点的搜索引擎。
BMC Bioinformatics. 2005 Mar 30;6:79. doi: 10.1186/1471-2105-6-79.
10
The spatial distribution of cis regulatory elements in yeast promoters and its implications for transcriptional regulation.酵母启动子中顺式调控元件的空间分布及其对转录调控的影响。
BMC Genomics. 2010 Oct 19;11:581. doi: 10.1186/1471-2164-11-581.

引用本文的文献

1
Airborne fine particulate matter exposure induces transcriptomic alterations resembling asthmatic signatures: insights from integrated omics analysis.暴露于空气中的细颗粒物会引发类似于哮喘特征的转录组改变:综合组学分析的见解。
Environ Epigenet. 2025 Jan 2;11(1):dvae026. doi: 10.1093/eep/dvae026. eCollection 2024.
2
Polycomb protein RYBP activates transcription factor during cardiac differentiation of mouse embryonic stem cells.多梳蛋白 RYBP 在小鼠胚胎干细胞的心脏分化过程中激活转录因子。
Open Biol. 2023 Feb;13(2):220305. doi: 10.1098/rsob.220305. Epub 2023 Feb 8.
3
General Designs Reveal Distinct Codes in Protein-Coding and Non-Coding Human DNA.

本文引用的文献

1
PGMD: a comprehensive manually curated pharmacogenomic database.PGMD:一个全面的人工整理的药物基因组学数据库。
Pharmacogenomics J. 2016 Apr;16(2):124-8. doi: 10.1038/tpj.2015.32. Epub 2015 May 5.
2
Human gene-centered transcription factor networks for enhancers and disease variants.用于增强子和疾病变异体的以人类基因为中心的转录因子网络。
Cell. 2015 Apr 23;161(3):661-673. doi: 10.1016/j.cell.2015.03.003.
3
Ensembl 2015.Ensembl 2015.
一般设计揭示了蛋白质编码和非编码人类 DNA 中的不同代码。
Genes (Basel). 2022 Oct 28;13(11):1970. doi: 10.3390/genes13111970.
4
Identification of genes, pathways and transcription factor-miRNA-target gene networks and experimental verification in venous thromboembolism.静脉血栓栓塞症中基因、通路和转录因子-miRNA-靶基因网络的鉴定及实验验证。
Sci Rep. 2021 Aug 11;11(1):16352. doi: 10.1038/s41598-021-95909-4.
5
Motif scraper: a cross-platform, open-source tool for identifying degenerate nucleotide motif matches in FASTA files.基序提取器:一个跨平台的、开源的工具,用于在 FASTA 文件中识别简并核苷酸基序匹配。
Bioinformatics. 2018 Nov 15;34(22):3926-3928. doi: 10.1093/bioinformatics/bty437.
6
GeneHancer: genome-wide integration of enhancers and target genes in GeneCards.基因增强子:基因卡片中增强子与靶基因的全基因组整合
Database (Oxford). 2017 Jan 1;2017. doi: 10.1093/database/bax028.
7
VarI-SIG 2015: methods for personalized medicine - the role of variant interpretation in research and diagnostics.2015年变异体信号会议:个性化医疗方法——变异体解读在研究与诊断中的作用
BMC Genomics. 2016 Jun 23;17 Suppl 2(Suppl 2):425. doi: 10.1186/s12864-016-2721-3.
Nucleic Acids Res. 2015 Jan;43(Database issue):D662-9. doi: 10.1093/nar/gku1010. Epub 2014 Oct 28.
4
Exposing synonymous mutations.揭示同义突变。
Trends Genet. 2014 Jul;30(7):308-21. doi: 10.1016/j.tig.2014.04.006. Epub 2014 Jun 19.
5
Synonymous mutations frequently act as driver mutations in human cancers.同义突变经常在人类癌症中充当驱动突变。
Cell. 2014 Mar 13;156(6):1324-1335. doi: 10.1016/j.cell.2014.01.051.
6
ClinVar: public archive of relationships among sequence variation and human phenotype.ClinVar:序列变异与人类表型之间关系的公共档案。
Nucleic Acids Res. 2014 Jan;42(Database issue):D980-5. doi: 10.1093/nar/gkt1113. Epub 2013 Nov 14.
7
The Human Gene Mutation Database: building a comprehensive mutation repository for clinical and molecular genetics, diagnostic testing and personalized genomic medicine.人类基因突变数据库:为临床和分子遗传学、诊断测试以及个性化基因组医学构建全面的基因突变知识库。
Hum Genet. 2014 Jan;133(1):1-9. doi: 10.1007/s00439-013-1358-4.
8
Mutational analysis of the 5' non-coding region of GJB1 in a Taiwanese cohort with Charcot-Marie-Tooth neuropathy.GJB1 5'非编码区突变分析在台湾地区腓骨肌萎缩症患者中的应用。
J Neurol Sci. 2013 Sep 15;332(1-2):51-5. doi: 10.1016/j.jns.2013.06.011. Epub 2013 Jul 2.
9
PharmGKB: the Pharmacogenomics Knowledge Base.药物基因组学知识库(PharmGKB)
Methods Mol Biol. 2013;1015:311-20. doi: 10.1007/978-1-62703-435-7_20.
10
Dissecting the genotype in syndromic intellectual disability using whole exome sequencing in addition to genome-wide copy number analysis.采用全外显子组测序联合全基因组拷贝数分析技术对综合征性智力障碍进行基因型分析。
Hum Genet. 2013 Jul;132(7):825-41. doi: 10.1007/s00439-013-1296-1. Epub 2013 Apr 4.