从头构建并解析荠（Lepidium sativum L.）的基因组草图。

Construction and characterization of a de novo draft genome of garden cress (Lepidium sativum L.).

机构信息

Department of Molecular Biology and Genetics, Necmettin Erbakan University, Meram, Konya, 42090, Turkey.

Department of Biotechnology, Necmettin Erbakan University, Meram, Konya, 42090, Turkey.

出版信息

Funct Integr Genomics. 2022 Oct;22(5):879-889. doi: 10.1007/s10142-022-00866-4. Epub 2022 May 20.

DOI:10.1007/s10142-022-00866-4

PMID:35596045

Abstract

Garden cress (Lepidium sativum L.) is a Brassicaceae crop recognized as a healthy vegetable and a medicinal plant. Lepidium is one of the largest genera in Brassicaceae, yet, the genus has not been a focus of extensive genomic research. In the present work, garden cress genome was sequenced using the long read high-fidelity sequencing technology. A de novo, draft genome assembly that spans 336.5 Mb was produced, corresponding to 88.6% of the estimated genome size and representing 90% of the evolutionarily expected orthologous gene content. Protein coding gene content was structurally predicted and functionally annotated, resulting in the identification of 25,668 putative genes. A total of 599 candidate disease resistance genes were identified by predicting resistance gene domains in gene structures, and 37 genes were detected as orthologs of heavy metal associated protein coding genes. In addition, 4289 genes were assigned as "transcription factor coding." Six different machine learning algorithms were trained and tested for their performance in classifying miRNA coding genomic sequences. Logistic regression proved the best performing trained algorithm, thus utilized for pre-miRNA coding loci identification in the assembly. Repetitive DNA analysis involved the characterization of transposable element and microsatellite contents. L. sativum chloroplast genome was also assembled and functionally annotated. Data produced in the present work is expected to constitute a foundation for genomic research in garden cress and contribute to genomics-assisted crop improvement and genome evolution studies in the Brassicaceae family.

摘要

荠菜（Lepidium sativum L.）是十字花科的一种作物，被认为是一种健康的蔬菜和药用植物。荠属是十字花科中最大的属之一，但该属尚未成为广泛基因组研究的重点。在本工作中，使用长读高通量测序技术对荠菜基因组进行了测序。生成了一个跨越 336.5 Mb 的从头、草图基因组组装，对应于估计基因组大小的 88.6%，代表了进化上预期的同源基因含量的 90%。对蛋白质编码基因进行了结构预测和功能注释，鉴定出 25668 个假定基因。通过预测基因结构中的抗病基因结构域，共鉴定出 599 个候选抗病基因，检测到 37 个基因是与重金属相关蛋白编码基因的同源基因。此外，4289 个基因被归类为“转录因子编码”。使用六种不同的机器学习算法对其在分类 miRNA 编码基因组序列中的性能进行了训练和测试。逻辑回归被证明是表现最好的训练算法，因此用于组装中前体 miRNA 编码基因座的识别。重复 DNA 分析涉及转座元件和微卫星含量的特征描述。荠属叶绿体基因组也被组装并进行了功能注释。本工作产生的数据有望为荠菜的基因组研究奠定基础，并有助于十字花科作物的基因组辅助改良和基因组进化研究。

相似文献

Construction and characterization of a de novo draft genome of garden cress (Lepidium sativum L.).从头构建并解析荠（Lepidium sativum L.）的基因组草图。

Funct Integr Genomics. 2022 Oct;22(5):879-889. doi: 10.1007/s10142-022-00866-4. Epub 2022 May 20.

De novo assembly and characterization of the first draft genome of quince (Cydonia oblonga Mill.).重建并描述绵苹果（Cydonia oblonga Mill.）的首个草图基因组。

Sci Rep. 2021 Feb 15;11(1):3818. doi: 10.1038/s41598-021-83113-3.

The complete chloroplast genome sequence of garden cress ( L.) and its phylogenetic analysis in Brassicaceae family.独行菜（L.）的完整叶绿体基因组序列及其在十字花科中的系统发育分析。

Mitochondrial DNA B Resour. 2019 Oct 16;4(2):3601-3602. doi: 10.1080/23802359.2019.1677527.

In silico molecular modeling of cold pressed garden cress (Lepidium sativum L.) seed oil toward the binding pocket of antimicrobial resistance Staphylococcus aureus DNA-gyrase complexes.冷榨独行菜（Lepidium sativum L.）籽油对耐抗菌性金黄色葡萄球菌DNA促旋酶复合物结合口袋的计算机模拟分子建模

Eur Rev Med Pharmacol Sci. 2023 Feb;27(4):1238-1247. doi: 10.26355/eurrev_202302_31356.

Identification of genes regulating traits targeted for domestication of field cress (Lepidium campestre) as a biennial and perennial oilseed crop.鉴定调控野油菜（Lepidium campestre）作为二年生和多年生油料作物的驯化目标性状的基因。

BMC Genet. 2018 May 29;19(1):36. doi: 10.1186/s12863-018-0624-9.

Research Update on the Therapeutic Potential of Garden Cress ( Linn.) with Threatened Status.荠（ Linn.）的治疗潜力研究进展，该物种濒危。

Curr Drug Res Rev. 2024;16(3):369-380. doi: 10.2174/0125899775273877231023102011.

Cadmium at high dose perturbs growth, photosynthesis and nitrogen metabolism while at low dose it up regulates sulfur assimilation and antioxidant machinery in garden cress (Lepidium sativum L.).高剂量的镉会干扰生长、光合作用和氮代谢，而低剂量的镉会上调硫同化和抗氧化剂机制在花园水芹（Lepidium sativum L.）中。

Plant Sci. 2012 Jan;182:112-20. doi: 10.1016/j.plantsci.2011.04.018. Epub 2011 May 18.

Time study on the uptake of four different beta-blockers in garden cress (Lepidium sativum) as a model plant.不同β-阻滞剂在模型植物荠（Lepidium sativum）中的吸收的时间研究。

Environ Sci Pollut Res Int. 2021 Nov;28(42):59382-59390. doi: 10.1007/s11356-020-11610-5. Epub 2020 Nov 18.

Endosperm-limited Brassicaceae seed germination: abscisic acid inhibits embryo-induced endosperm weakening of Lepidium sativum (cress) and endosperm rupture of cress and Arabidopsis thaliana.胚乳受限的十字花科种子萌发：脱落酸抑制独行菜（水芹）胚诱导的胚乳弱化以及水芹和拟南芥的胚乳破裂。

Plant Cell Physiol. 2006 Jul;47(7):864-77. doi: 10.1093/pcp/pcj059. Epub 2006 May 16.

Induction of apoptosis in leukemic cells by the alkaloid extract of garden cress (Lepidium sativum L.).十字花科植物葶苈（Lepidium sativum L.）生物碱提取物诱导白血病细胞凋亡。

J Integr Med. 2019 May;17(3):221-228. doi: 10.1016/j.joim.2019.03.004. Epub 2019 Mar 23.

引用本文的文献

Remarkable mitochondrial genome heterogeneity in Meniocus linifolius (Brassicaceae).在山黧豆属（十字花科）中发现显著的线粒体基因组异质性。

Plant Cell Rep. 2024 Jan 11;43(2):36. doi: 10.1007/s00299-023-03102-w.

Pilot study of a comprehensive resource estimation method from environmental DNA using universal D-loop amplification primers.利用通用 D 环扩增引物进行环境 DNA 综合资源评估方法的初步研究。

Funct Integr Genomics. 2023 Mar 22;23(2):96. doi: 10.1007/s10142-023-01013-3.

本文引用的文献

Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm.使用带有 hifiasm 的相定装配图进行单体型解析从头组装。

Nat Methods. 2021 Feb;18(2):170-175. doi: 10.1038/s41592-020-01056-5. Epub 2021 Feb 1.

Genome Size Evolution Mediated by Gypsy Retrotransposons in Brassicaceae.gypsy 反转录转座子介导的芸薹科基因组大小进化

Genomics Proteomics Bioinformatics. 2020 Jun;18(3):321-332. doi: 10.1016/j.gpb.2018.07.009. Epub 2020 Oct 31.

Building near-complete plant genomes.构建近乎完整的植物基因组。

Curr Opin Plant Biol. 2020 Apr;54:26-33. doi: 10.1016/j.pbi.2019.12.009. Epub 2020 Jan 22.

PmiREN: a comprehensive encyclopedia of plant miRNAs.PmiREN：一个综合性的植物 miRNA 百科全书。

Nucleic Acids Res. 2020 Jan 8;48(D1):D1114-D1121. doi: 10.1093/nar/gkz894.

Assessment of genetic diversity in L. using inter simple sequence repeat (ISSR) marker.利用简单重复序列区间（ISSR）标记评估某植物（原文未明确写出植物名称，推测为L.代表的某种植物）的遗传多样性。

Physiol Mol Biol Plants. 2019 Mar;25(2):399-406. doi: 10.1007/s12298-018-0622-4. Epub 2018 Nov 20.

Improved Pre-miRNAs Identification Through Mutual Information of Pre-miRNA Sequences and Structures.通过前体miRNA序列和结构的互信息改进前体miRNA识别

Front Genet. 2019 Feb 25;10:119. doi: 10.3389/fgene.2019.00119. eCollection 2019.

The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update.Galaxy 平台：用于可访问、可重复和协作的生物医学分析：2018 年更新。

Nucleic Acids Res. 2018 Jul 2;46(W1):W537-W544. doi: 10.1093/nar/gky379.

PRGdb 3.0: a comprehensive platform for prediction and analysis of plant disease resistance genes.PRGdb 3.0：一个用于植物抗病基因预测和分析的综合性平台。

Nucleic Acids Res. 2018 Jan 4;46(D1):D1197-D1201. doi: 10.1093/nar/gkx1119.

Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper.通过eggNOG-Mapper进行直系同源物分配实现全基因组快速功能注释

Mol Biol Evol. 2017 Aug 1;34(8):2115-2122. doi: 10.1093/molbev/msx148.

MicroRNA categorization using sequence motifs and k-mers.使用序列基序和k-mer对微小RNA进行分类。

BMC Bioinformatics. 2017 Mar 14;18(1):170. doi: 10.1186/s12859-017-1584-1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

从头构建并解析荠（Lepidium sativum L.）的基因组草图。

Construction and characterization of a de novo draft genome of garden cress (Lepidium sativum L.).

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献