Chowdhury Labrechai Mog, Maurya Rajesh Kumar, Singh Rajeev Kumar, Mishra Shubhi, Chauhan Nishita, Jena J K, Mohindra Vindhya
ICAR-National Bureau of Fish Genetic Resources, Canal Ring Road, P.O. Dilkusha, Lucknow, 226 002, India.
Indian Council of Agricultural Research (ICAR), Krishi Anusandhan Bhawan - II, New Delhi, 110 012, India.
Mol Biol Rep. 2021 Nov;48(11):7333-7342. doi: 10.1007/s11033-021-06735-w. Epub 2021 Oct 13.
Full length transcriptomes, achieved through long-read sequencing, along with the isoform analysis can reveal complexities in the gene expression profiles, as well as annotate the transcriptomes of non-model organisms.
Full length transcripts of brain transcriptome of Tenualosa ilisha, Hilsa shad, were generated through PacBio single molecule real-time sequencing and were characterized. A total of 8.30 Gb clean reads were generated, with PacBio RSII, which resulted in 57,651 high quality consensus transcripts. After removing redundant reads, a total of 19,220 high-quality non-redundant transcripts and 17,341 full length ORF transcripts were classified to 7522 putative ortholog groups. Genes involved in various neural pathways were identified. In addition, isoform clusters and lncRNAs were discovered, along with Hilsa specific transcripts with coding frames and 29,147 SSRs in 944 transcripts (1141 annotated).
The present study provided, for the first time, a comprehensive view of the alternative isoforms of genes and transcriptome complexity in Hilsa shad brain and forms a rich resource for functional studies in brain of this anadromous fish.
通过长读长测序获得的全长转录组,以及异构体分析,可以揭示基因表达谱的复杂性,并对非模式生物的转录组进行注释。
通过PacBio单分子实时测序生成了鲥鱼(Tenualosa ilisha)脑转录组的全长转录本并进行了表征。使用PacBio RSII共产生了8.30 Gb的clean reads,得到了57,651个高质量的一致性转录本。去除冗余reads后,共19,220个高质量非冗余转录本和17,341个全长ORF转录本被分类到7522个假定的直系同源组中。鉴定出了参与各种神经通路的基因。此外,还发现了异构体簇和lncRNAs,以及具有编码框的鲥鱼特异性转录本和944个转录本(1141个注释)中的29,147个SSR。
本研究首次全面展示了鲥鱼脑内基因的可变异构体和转录组复杂性,为这种溯河产卵鱼类脑的功能研究提供了丰富资源。