Suppr超能文献

基于计算机的方法挖掘与鉴定大戟科部分农艺重要物种中的高质量 SNP/Indels

In Silico Mining and Characterization of High-Quality SNP/Indels in Some Agro-Economically Important Species Belonging to the Family Euphorbiaceae.

机构信息

Department of Zoology, Mariani College, Mariani 785634, India.

Department of Biochemistry and Agricultural Chemistry, Assam Agricultural University, Jorhat 785013, India.

出版信息

Genes (Basel). 2023 Jan 27;14(2):332. doi: 10.3390/genes14020332.

Abstract

(1) Background: To assess the genetic makeup among the agro-economically important members of Euphorbiaceae, the present study was conducted to identify and characterize high-quality single-nucleotide polymorphism (SNP) markers and their comparative distribution in exonic and intronic regions from the publicly available expressed sequence tags (ESTs). (2) Methods: Quality sequences obtained after pre-processing by an EG assembler were assembled into contigs using the CAP3 program at 95% identity; the mining of SNP was performed by QualitySNP; GENSCAN (standalone) was used for detecting the distribution of SNPs in the exonic and intronic regions. (3) Results: A total of 25,432 potential SNPs (pSNP) and 14,351 high-quality SNPs (qSNP), including 2276 indels, were detected from 260,479 EST sequences. The ratio of quality SNP to potential SNP ranged from 0.22 to 0.75. A higher frequency of transitions and transversions was observed more in the exonic than the intronic region, while indels were present more in the intronic region. C↔T (transition) was the most dominant nucleotide substitution, while in transversion, A↔T was the dominant nucleotide substitution, and in indel, A/- was dominant. (4) Conclusions: Detected SNP markers may be useful for linkage mapping; marker-assisted breeding; studying genetic diversity; mapping important phenotypic traits, such as adaptation or oil production; or disease resistance by targeting and screening mutations in important genes.

摘要

(1) 背景:为了评估经济上重要的大戟科成员的遗传构成,本研究旨在从公开的表达序列标签 (EST) 中鉴定和表征高质量的单核苷酸多态性 (SNP) 标记及其在编码区和非编码区的比较分布。

(2) 方法:通过 EG 组装器进行预处理后获得的高质量序列,使用 CAP3 程序在 95%的同一性下组装成 contigs;通过 QualitySNP 进行 SNP 挖掘;使用 GENSCAN(独立)检测 SNP 在编码区和非编码区的分布。

(3) 结果:从 260479 个 EST 序列中检测到了 25432 个潜在 SNP(pSNP)和 14351 个高质量 SNP(qSNP),包括 2276 个插入缺失。质量 SNP 与潜在 SNP 的比例从 0.22 到 0.75 不等。在编码区比非编码区观察到更多的转换和颠换,而插入缺失在非编码区更常见。C↔T(转换)是最主要的核苷酸取代,而在颠换中,A↔T 是主要的核苷酸取代,在插入缺失中,A/-是主要的。

(4) 结论:检测到的 SNP 标记可用于连锁作图、标记辅助育种、遗传多样性研究、重要表型性状(如适应或产油)的作图、或针对重要基因进行突变筛选以提高疾病抗性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c58/9956114/b8adf92833de/genes-14-00332-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验