School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester M13 9PT, UK.
Nucleic Acids Res. 2019 Jan 8;47(D1):D155-D162. doi: 10.1093/nar/gky1141.
miRBase catalogs, names and distributes microRNA gene sequences. The latest release of miRBase (v22) contains microRNA sequences from 271 organisms: 38 589 hairpin precursors and 48 860 mature microRNAs. We describe improvements to the database and website to provide more information about the quality of microRNA gene annotations, and the cellular functions of their products. We have collected 1493 small RNA deep sequencing datasets and mapped a total of 5.5 billion reads to microRNA sequences. The read mapping patterns provide strong support for the validity of between 20% and 65% of microRNA annotations in different well-studied animal genomes, and evidence for the removal of >200 sequences from the database. To improve the availability of microRNA functional information, we are disseminating Gene Ontology terms annotated against miRBase sequences. We have also used a text-mining approach to search for microRNA gene names in the full-text of open access articles. Over 500 000 sentences from 18 542 papers contain microRNA names. We score these sentences for functional information and link them with 12 519 microRNA entries. The sentences themselves, and word clouds built from them, provide effective summaries of the functional information about specific microRNAs. miRBase is publicly and freely available at http://mirbase.org/.
miRBase 对 microRNA 基因序列进行编目、命名和分发。miRBase 的最新版本(v22)包含了 271 种生物的 microRNA 序列:38589 个发夹前体和 48860 个成熟 microRNA。我们描述了对数据库和网站的改进,以提供有关 microRNA 基因注释质量以及其产物细胞功能的更多信息。我们收集了 1493 个小 RNA 深度测序数据集,并将总共 55 亿个读数映射到 microRNA 序列上。这些读取映射模式为不同研究充分的动物基因组中 20%至 65%的 microRNA 注释的有效性提供了强有力的支持,并证明了数据库中删除了>200 个序列。为了提高 microRNA 功能信息的可用性,我们正在传播针对 miRBase 序列注释的基因本体论术语。我们还使用文本挖掘方法在开放获取文章的全文中搜索 microRNA 基因名称。来自 18542 篇论文的超过 500000 个句子包含 microRNA 名称。我们对这些句子进行功能信息评分,并将它们与 12519 个 microRNA 条目相关联。这些句子本身以及从它们构建的词云,为特定 microRNA 的功能信息提供了有效的摘要。miRBase 可在 http://mirbase.org/ 上公开免费获取。