Ghosh Samrat, Sarangi Aditya Narayan, Mukherjee Mayuri, Singh Deeksha, Madhavi Madduluri, Tripathy Sucheta
Computational Genomics Lab, Structural Biology and Bioinformatics Division, CSIR Indian Institute of Chemical Biology, Kolkata, India.
Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India.
Bioinform Biol Insights. 2021 Jun 22;15:11779322211025332. doi: 10.1177/11779322211025332. eCollection 2021.
Rabindra Sarovar lake is an artificial freshwater lake in the arsenic infested eastern region of India. In this study, using the genome resolved metagenomics approach; we have deciphered the taxonomic diversity as well as the functional insights of the gene pools specific to this region. Initially, a total of 113 Metagenome Assembled Genomes (MAGs) were recovered from the two predominant seasons, that is, rainy (n = 50) and winter (n = 63). After bin refinement and de-replication, 27 MAGs (18 from Winter season and 9 from Rainy season) were reconstructed. These MAGs were either of high-quality (n = 10) or of medium quality (n = 17) that was determined based on genome completeness and contamination. These 27 MAGs spanning across 6 bacterial phyla and the most predominant ones were Proteobacteria, Bacteroidetes, and Cyanobacteria regardless of the season. Functional annotation across the MAGs suggested the existence of all known types of arsenic resistance and metabolism genes. Besides, important secondary metabolites such as zoocin_A, prochlorosin, and microcin were also abundantly present in these genomes. The metagenomic study of this lake provides the first insights into the microbiome composition and functional classification of the gene pools in two predominant seasons. The presence of arsenic metabolism and resistance genes in the recovered genomes is a sign of adaptation of the microbes to the arsenic contamination in this region. The presence of secondary metabolite genes in the lake microbiome has several implications including the potential use of these for the pharmaceutical industry.
拉宾德拉·萨罗瓦尔湖是印度东部砷污染地区的一个人工淡水湖。在本研究中,我们采用基因组解析宏基因组学方法,破译了该地区特定基因库的分类多样性以及功能见解。最初,从两个主要季节,即雨季(n = 50)和冬季(n = 63)共获得了113个宏基因组组装基因组(MAG)。经过分箱优化和去重复后,重建了27个MAG(18个来自冬季,9个来自雨季)。这些MAG要么是高质量的(n = 10),要么是中等质量的(n = 17),这是根据基因组完整性和污染情况确定的。这27个MAG跨越6个细菌门,无论季节如何,最主要的是变形菌门、拟杆菌门和蓝细菌门。对这些MAG的功能注释表明存在所有已知类型的抗砷和砷代谢基因。此外,这些基因组中还大量存在重要的次生代谢产物,如锌霉素_A、原绿球藻素和微菌素。对该湖的宏基因组研究首次揭示了两个主要季节微生物群落组成和基因库的功能分类。回收基因组中砷代谢和抗性基因的存在表明微生物对该地区砷污染的适应。湖泊微生物群中次生代谢产物基因的存在具有多种意义,包括这些基因在制药行业的潜在用途。