Thünen Institute of Fisheries Ecology, Herwigstraße 31, 27572, Bremerhaven, Germany.
BMC Genom Data. 2023 Mar 17;24(1):18. doi: 10.1186/s12863-023-01119-4.
Biodiversity assessment approaches based on molecular biology techniques such as metabarcoding, RAD-seq, or SNaPshot sequencing are increasingly applied to marine and aquatic ecosystems. Here we present a new reference database for fish metabarcoding based on mitochondrial genes. The Mare-MAGE database contains quality-checked sequences of the mitochondrial 12S ribosomal RNA (12S rRNA) and cytochrome c oxidase I (COI) genes. All sequences were obtained from the National Center for Biotechnology Information GenBank (NCBI-GenBank), the European Nucleotide Archive (ENA), the AquaGene Database, and the BOLD database, and have undergone intensive processing. They were checked for false annotations and non-target anomalies against the Integrated Taxonomic Information System (ITIS) and FishBase. The dataset is compiled in ARB-Home, FASTA, and Qiime2 formats and is publicly available from the Mare-MAGE database website (http://mare-mage.weebly.com/). In total, it includes 231,333 COI and 12S rRNA gene sequences of fish, covering 19,506 species in 4,058 genera and 586 families.
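Because the dataset is distributed in FASTA format, a basic sanity check after download could look like the following minimal sketch. It is not part of the Mare-MAGE tooling: the function names `read_fasta` and `summarize`, the example headers, and the choice to flag any base outside A, C, G, T are all assumptions made for illustration, and the actual header layout of the Mare-MAGE files is not described in the abstract.

```python
from typing import Dict, Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new record starts; emit the previous one first.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)  # sequences may wrap over several lines
    if header is not None:
        yield header, "".join(chunks)


def summarize(records: Iterable[Tuple[str, str]]) -> Dict[str, int]:
    """Count records, total bases, and records with non-ACGT symbols."""
    count = 0
    total_bases = 0
    with_ambiguous = 0
    for _, seq in records:
        count += 1
        total_bases += len(seq)
        # Assumption: anything other than A, C, G, T counts as ambiguous.
        if set(seq.upper()) - set("ACGT"):
            with_ambiguous += 1
    return {
        "sequences": count,
        "total_bases": total_bases,
        "with_ambiguous_bases": with_ambiguous,
    }


# Hypothetical two-record example; real headers will differ.
EXAMPLE = """>seq1 hypothetical header
ACGTACGT
ACGT
>seq2 hypothetical header
ACGNNACG
"""

summary = summarize(read_fasta(EXAMPLE.splitlines()))
print(summary)
```

To run the same check on a downloaded file, an open file handle can be passed to `read_fasta` in place of the example lines, since iterating over a text file yields one line at a time.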