• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

心脏转录因子数据库(CardioTF),一个用于解析心脏系统转录调控网络的数据库。

CardioTF, a database of deconstructing transcriptional circuits in the heart system.

作者信息

Zhen Yisong

机构信息

State Key Laboratory of Cardiovascular Disease, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College , Beijing , China.

出版信息

PeerJ. 2016 Aug 23;4:e2339. doi: 10.7717/peerj.2339. eCollection 2016.

DOI:10.7717/peerj.2339
PMID:27635320
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5012272/
Abstract

BACKGROUND

Information on cardiovascular gene transcription is fragmented and far behind the present requirements of the systems biology field. To create a comprehensive source of data for cardiovascular gene regulation and to facilitate a deeper understanding of genomic data, the CardioTF database was constructed. The purpose of this database is to collate information on cardiovascular transcription factors (TFs), position weight matrices (PWMs), and enhancer sequences discovered using the ChIP-seq method.

METHODS

The Naïve-Bayes algorithm was used to classify literature and identify all PubMed abstracts on cardiovascular development. The natural language learning tool GNAT was then used to identify corresponding gene names embedded within these abstracts. Local Perl scripts were used to integrate and dump data from public databases into the MariaDB management system (MySQL). In-house R scripts were written to analyze and visualize the results.

RESULTS

Known cardiovascular TFs from humans and human homologs from fly, Ciona, zebrafish, frog, chicken, and mouse were identified and deposited in the database. PWMs from Jaspar, hPDI, and UniPROBE databases were deposited in the database and can be retrieved using their corresponding TF names. Gene enhancer regions from various sources of ChIP-seq data were deposited into the database and were able to be visualized by graphical output. Besides biocuration, mouse homologs of the 81 core cardiac TFs were selected using a Naïve-Bayes approach and then by intersecting four independent data sources: RNA profiling, expert annotation, PubMed abstracts and phenotype.

DISCUSSION

The CardioTF database can be used as a portal to construct transcriptional network of cardiac development.

AVAILABILITY AND IMPLEMENTATION

Database URL: http://www.cardiosignal.org/database/cardiotf.html.

摘要

背景

心血管基因转录的信息零散,远远落后于系统生物学领域目前的要求。为了创建心血管基因调控的全面数据源,并促进对基因组数据的更深入理解,构建了心脏转录因子(CardioTF)数据库。该数据库的目的是整理有关心血管转录因子(TFs)、位置权重矩阵(PWMs)以及使用ChIP-seq方法发现的增强子序列的信息。

方法

使用朴素贝叶斯算法对文献进行分类,并识别所有关于心血管发育的PubMed摘要。然后使用自然语言学习工具GNAT识别这些摘要中嵌入的相应基因名称。使用本地Perl脚本将来自公共数据库的数据整合并转储到MariaDB管理系统(MySQL)中。编写内部R脚本以分析和可视化结果。

结果

识别出人类已知的心血管TFs以及果蝇(fly)、海鞘(Ciona)、斑马鱼、青蛙、鸡和小鼠的人类同源物,并将其存入数据库。来自Jaspar、hPDI和UniPROBE数据库的PWMs存入数据库,可使用其相应的TF名称进行检索。来自各种ChIP-seq数据源的基因增强子区域存入数据库,并能够通过图形输出进行可视化。除了生物编目外,还使用朴素贝叶斯方法,然后通过交叉四个独立数据源(RNA分析、专家注释、PubMed摘要和表型)选择了81个核心心脏TFs的小鼠同源物。

讨论

CardioTF数据库可作为构建心脏发育转录网络的门户。

可用性和实现方式

数据库网址:http://www.cardiosignal.org/database/cardiotf.html。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/342c/5012272/293f728f0a52/peerj-04-2339-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/342c/5012272/5088c475d3bb/peerj-04-2339-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/342c/5012272/46201195ca9b/peerj-04-2339-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/342c/5012272/a96caea48fd8/peerj-04-2339-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/342c/5012272/293f728f0a52/peerj-04-2339-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/342c/5012272/5088c475d3bb/peerj-04-2339-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/342c/5012272/46201195ca9b/peerj-04-2339-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/342c/5012272/a96caea48fd8/peerj-04-2339-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/342c/5012272/293f728f0a52/peerj-04-2339-g004.jpg

相似文献

1
CardioTF, a database of deconstructing transcriptional circuits in the heart system.心脏转录因子数据库(CardioTF),一个用于解析心脏系统转录调控网络的数据库。
PeerJ. 2016 Aug 23;4:e2339. doi: 10.7717/peerj.2339. eCollection 2016.
2
CardioSignal: a database of transcriptional regulation in cardiac development and hypertrophy.心脏信号:一个关于心脏发育和肥大中转录调控的数据库。
Int J Cardiol. 2007 Apr 4;116(3):338-47. doi: 10.1016/j.ijcard.2006.03.069. Epub 2006 Aug 2.
3
Inferring dynamic gene regulatory networks in cardiac differentiation through the integration of multi-dimensional data.通过整合多维度数据推断心脏分化过程中的动态基因调控网络。
BMC Bioinformatics. 2015 Mar 7;16:74. doi: 10.1186/s12859-015-0460-0.
4
Transcription Factor Information System (TFIS): A Tool for Detection of Transcription Factor Binding Sites.转录因子信息系统(TFIS):一种用于检测转录因子结合位点的工具。
Interdiscip Sci. 2017 Sep;9(3):378-391. doi: 10.1007/s12539-016-0168-5. Epub 2016 Apr 6.
5
Sequence homology in eukaryotes (SHOE): interactive visual tool for promoter analysis.真核生物序列同源性(SHOE):用于启动子分析的交互式可视化工具。
BMC Genomics. 2018 Sep 27;19(1):715. doi: 10.1186/s12864-018-5101-3.
6
Systematic target function annotation of human transcription factors.系统的人类转录因子靶功能注释。
BMC Biol. 2018 Jan 10;16(1):4. doi: 10.1186/s12915-017-0469-0.
7
NeisseriaBase: a specialised Neisseria genomic resource and analysis platform.奈瑟菌数据库:一个专门的奈瑟菌基因组资源与分析平台。
PeerJ. 2016 Mar 17;4:e1698. doi: 10.7717/peerj.1698. eCollection 2016.
8
RegNetwork: an integrated database of transcriptional and post-transcriptional regulatory networks in human and mouse.RegNetwork:人类和小鼠转录及转录后调控网络的综合数据库。
Database (Oxford). 2015 Sep 30;2015. doi: 10.1093/database/bav095. Print 2015.
9
: A Tool for Searching Putative Factors Regulating Gene Expression Using ChIP-seq Data.: 一种使用 ChIP-seq 数据搜索调控基因表达的潜在因子的工具。
Int J Biol Sci. 2018 Sep 7;14(12):1724-1731. doi: 10.7150/ijbs.28850. eCollection 2018.
10
CrusTF: a comprehensive resource of transcriptomes for evolutionary and functional studies of crustacean transcription factors.甲壳动物转录因子进化和功能研究的转录组综合资源——CrusTF
BMC Genomics. 2017 Nov 25;18(1):908. doi: 10.1186/s12864-017-4305-2.

引用本文的文献

1
Towards a gene regulatory network shaping the fins of the Princess cichlid.构建公主慈鲷鱼鳍的基因调控网络。
Sci Rep. 2018 Jun 25;8(1):9602. doi: 10.1038/s41598-018-27977-y.

本文引用的文献

1
Origins of cardiac fibroblasts.心脏成纤维细胞的起源。
J Mol Cell Cardiol. 2016 Feb;91:1-5. doi: 10.1016/j.yjmcc.2015.12.031. Epub 2015 Dec 31.
2
FlyBase: establishing a Gene Group resource for Drosophila melanogaster.果蝇数据库:为黑腹果蝇建立一个基因群组资源。
Nucleic Acids Res. 2016 Jan 4;44(D1):D786-92. doi: 10.1093/nar/gkv1046. Epub 2015 Oct 13.
3
Genome-wide association studies of late-onset cardiovascular disease.迟发性心血管疾病的全基因组关联研究。
J Mol Cell Cardiol. 2015 Jun;83:131-41. doi: 10.1016/j.yjmcc.2015.04.004. Epub 2015 Apr 11.
4
Epigenomics: Roadmap for regulation.表观基因组学:调控路线图
Nature. 2015 Feb 19;518(7539):314-6. doi: 10.1038/518314a.
5
InParanoid 8: orthology analysis between 273 proteomes, mostly eukaryotic.InParanoid 8:273个蛋白质组之间的直系同源分析,大部分为真核生物蛋白质组。
Nucleic Acids Res. 2015 Jan;43(Database issue):D234-9. doi: 10.1093/nar/gku1203. Epub 2014 Nov 27.
6
Database resources of the National Center for Biotechnology Information.美国国立生物技术信息中心的数据库资源。
Nucleic Acids Res. 2015 Jan;43(Database issue):D6-17. doi: 10.1093/nar/gku1130. Epub 2014 Nov 14.
7
UniPROBE, update 2015: new tools and content for the online database of protein-binding microarray data on protein-DNA interactions.UniPROBE 2015年更新:用于蛋白质 - DNA相互作用的蛋白质结合微阵列数据在线数据库的新工具和内容。
Nucleic Acids Res. 2015 Jan;43(Database issue):D117-22. doi: 10.1093/nar/gku1045. Epub 2014 Nov 5.
8
How difficult is inference of mammalian causal gene regulatory networks?推断哺乳动物因果基因调控网络的难度有多大?
PLoS One. 2014 Nov 4;9(11):e111661. doi: 10.1371/journal.pone.0111661. eCollection 2014.
9
Xenbase, the Xenopus model organism database; new virtualized system, data types and genomes.非洲爪蟾模式生物数据库Xenbase;新的虚拟化系统、数据类型和基因组。
Nucleic Acids Res. 2015 Jan;43(Database issue):D756-63. doi: 10.1093/nar/gku956. Epub 2014 Oct 13.
10
The UCSC Genome Browser: What Every Molecular Biologist Should Know.加州大学圣克鲁兹分校基因组浏览器:每个分子生物学家都应该了解的内容。
Curr Protoc Mol Biol. 2014 Jul 1;107:19.9.1-19.9.36. doi: 10.1002/0471142727.mb1909s107.