Rodriguez-Ramos Luis E, Rios-Velazquez Carlos
Biology Department, University of Puerto Rico-Mayaguez, Puerto Rico.
Data Brief. 2018 Nov 9;21:1674-1677. doi: 10.1016/j.dib.2018.11.028. eCollection 2018 Dec.
Camuy River Cave Park (CRCP) is an underground cave system located at the subtropical karst carved by the Camuy River in the subtropical moist forest of northern Puerto Rico (Nieves-Rivera, 2003) [1]. This article contains a metagenomic dataset from the microbial and functional diversity of Clara Cave and Empalme Sinkhole water samples. The environmental DNA (eDNA) from the samples was extracted following direct Metagenomic DNA Isolation method, followed by Next-Generation-Sequencing technology (Illumina MiSeq). The sequences were submitted to MG-RAST online server for taxonomic profile generation and functional description of the samples. The data consisted of domain Bacteria (96.69%), followed up by Viruses (2.87%), Eukaryotes (0.37%), and Archaea (0.02%). The data distribution by phyla showed (92.61%), (1.66%), (1.12%), and (0.48%). The subsystem functional data showed that 12.97% of genes were related to clustering-based subsystems, 11.40% to carbohydrates, and 11.0% to amino acids and derivatives. The metagenome dataset generated will provide an understanding and comparison framework of the microbial composition and functional diversity present in caves.
卡穆伊河洞穴公园(CRCP)是一个地下洞穴系统,位于波多黎各北部亚热带湿润森林中由卡穆伊河雕刻而成的亚热带喀斯特地区(涅韦斯 - 里韦拉,2003年)[1]。本文包含来自克拉拉洞穴和恩帕尔梅落水洞水样的微生物和功能多样性的宏基因组数据集。按照直接宏基因组DNA分离方法提取样本中的环境DNA(eDNA),随后采用下一代测序技术(Illumina MiSeq)。将序列提交至MG-RAST在线服务器以生成样本的分类概况并进行功能描述。数据由细菌域(96.69%)组成,其次是病毒(2.87%)、真核生物(0.37%)和古菌(0.02%)。按门分类的数据分布显示(此处原文缺失具体门的信息)(92.61%)、(此处原文缺失具体门的信息)(1.66%)、(此处原文缺失具体门的信息)(1.12%)和(此处原文缺失具体门的信息)(0.48%)。子系统功能数据显示,12.97%的基因与基于聚类的子系统相关,11.40%与碳水化合物相关,11.0%与氨基酸及其衍生物相关。所生成的宏基因组数据集将为洞穴中存在的微生物组成和功能多样性提供一个理解和比较框架。