Suppr超能文献

PolyA_DB 3 目录编目了通过多种基因组的深度测序鉴定的剪接和多聚腺苷酸化位点。

PolyA_DB 3 catalogs cleavage and polyadenylation sites identified by deep sequencing in multiple genomes.

机构信息

Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School and Rutgers Cancer Institute of New Jersey, Newark, NJ 07103, USA.

Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA.

出版信息

Nucleic Acids Res. 2018 Jan 4;46(D1):D315-D319. doi: 10.1093/nar/gkx1000.

Abstract

PolyA_DB is a database cataloging cleavage and polyadenylation sites (PASs) in several genomes. Previous versions were based mainly on expressed sequence tags (ESTs), which had a limited amount and could lead to inaccurate PAS identification due to the presence of internal A-rich sequences in transcripts. Here, we present an updated version of the database based solely on deep sequencing data. First, PASs are mapped by the 3' region extraction and deep sequencing (3'READS) method, ensuring unequivocal PAS identification. Second, a large volume of data based on diverse biological samples increases PAS coverage by 3.5-fold over the EST-based version and provides PAS usage information. Third, strand-specific RNA-seq data are used to extend annotated 3' ends of genes to obtain more thorough annotations of alternative polyadenylation (APA) sites. Fourth, conservation information of PAS across mammals sheds light on significance of APA sites. The database (URL: http://www.polya-db.org/v3) currently holds PASs in human, mouse, rat and chicken, and has links to the UCSC genome browser for further visualization and for integration with other genomic data.

摘要

PolyA_DB 是一个数据库,其中包含了多个基因组中的切割和多聚腺苷酸化位点 (PAS)。以前的版本主要基于表达序列标签 (EST),这些标签数量有限,并且由于转录本中存在内部富含 A 的序列,可能导致 PAS 识别不准确。在这里,我们展示了一个基于深度测序数据的更新版本。首先,通过 3' 区域提取和深度测序 (3'READS) 方法来映射 PAS,从而确保 PAS 的识别是明确的。其次,大量基于不同生物样本的数据将 PAS 覆盖率比基于 EST 的版本提高了 3.5 倍,并提供了 PAS 使用信息。第三,使用链特异性 RNA-seq 数据来扩展基因的注释 3' 末端,从而获得对替代多聚腺苷酸化 (APA) 位点的更全面注释。第四,PAS 在哺乳动物中的保守信息揭示了 APA 位点的重要性。该数据库(网址:http://www.polya-db.org/v3)目前包含了人类、小鼠、大鼠和鸡中的 PAS,并与 UCSC 基因组浏览器链接,以便进一步可视化和与其他基因组数据集成。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e87a/5753232/831e1405d5a3/gkx1000fig1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验