
Insights into the loblolly pine genome: characterization of BAC and fosmid sequences.

Affiliation

Department of Plant Sciences, University of California Davis, Davis, California, United States of America.

Publication information

PLoS One. 2013 Sep 4;8(9):e72439. doi: 10.1371/journal.pone.0072439. eCollection 2013.

DOI:10.1371/journal.pone.0072439
PMID:24023741
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3762812/
Abstract

Despite their prevalence and importance, the genome sequences of loblolly pine, Norway spruce, and white spruce, three ecologically and economically important conifer species, are just becoming available to the research community. Following the completion of these large assemblies, annotation efforts will be undertaken to characterize the reference sequences. Accurate annotation of these ancient genomes would be aided by a comprehensive repeat library; however, few studies have generated enough sequence to fully evaluate and catalog their non-genic content. In this paper, two sets of loblolly pine genomic sequence, 103 previously assembled BACs and 90,954 newly sequenced and assembled fosmid scaffolds, were analyzed. Together, this sequence represents 280 Mbp (roughly 1% of the loblolly pine genome) and one of the most comprehensive studies of repetitive elements and genes in a gymnosperm species. A combination of homology and de novo methodologies were applied to identify both conserved and novel repeats. Similarity analysis estimated a repetitive content of 27% that included both full and partial elements. When combined with the de novo investigation, the estimate increased to almost 86%. Over 60% of the repetitive sequence consists of full or partial LTR (long terminal repeat) retrotransposons. Through de novo approaches, 6,270 novel, full-length transposable element families and 9,415 sub-families were identified. Among those 6,270 families, 82% were annotated as single-copy. Several of the novel, high-copy families are described here, with the largest, PtPiedmont, comprising 133 full-length copies. In addition to repeats, analysis of the coding region reported 23 full-length eukaryotic orthologous proteins (KOGS) and another 29 novel or orthologous genes. These discoveries, along with other genomic resources, will be used to annotate conifer genomes and address long-standing questions about gymnosperm evolution.
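The headline figures in the abstract can be sanity-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the paper: the function names are hypothetical, and the 21.7 Gbp genome size is an assumption taken from the title of a related article (the abstract itself only says "roughly 1%").

```python
# Back-of-the-envelope check of figures quoted in the abstract.
# ASSUMPTION: the 21.7 Gbp genome size comes from the title of a related
# article, not from this abstract. All function names are hypothetical.

SAMPLED_MBP = 280             # BAC + fosmid sequence analyzed (abstract)
ASSUMED_GENOME_MBP = 21_700   # assumed loblolly pine genome size, 21.7 Gbp
NOVEL_FAMILIES = 6_270        # novel full-length TE families (abstract)
SINGLE_COPY_SHARE = 0.82      # share annotated as single-copy (abstract)
HOMOLOGY_REPEAT_SHARE = 0.27  # repetitive content from similarity analysis
COMBINED_REPEAT_SHARE = 0.86  # "almost 86%" once de novo results are added


def genome_fraction(sampled_mbp: float, genome_mbp: float) -> float:
    """Fraction of the genome covered by the sampled sequence."""
    return sampled_mbp / genome_mbp


def single_copy_families(total_families: int, single_copy_share: float) -> int:
    """Approximate number of families annotated as single-copy."""
    return round(total_families * single_copy_share)


def de_novo_gain(homology_share: float, combined_share: float) -> float:
    """Extra repetitive content found only after adding de novo methods."""
    return combined_share - homology_share


if __name__ == "__main__":
    frac = genome_fraction(SAMPLED_MBP, ASSUMED_GENOME_MBP)
    single = single_copy_families(NOVEL_FAMILIES, SINGLE_COPY_SHARE)
    gain = de_novo_gain(HOMOLOGY_REPEAT_SHARE, COMBINED_REPEAT_SHARE)
    print(f"Sampled fraction of genome: {frac:.1%}")
    print(f"Single-copy families: about {single} of {NOVEL_FAMILIES}")
    print(f"Multi-copy families: about {NOVEL_FAMILIES - single}")
    print(f"Repeat content added by de novo analysis: about {gain:.0%}")
```

Under the assumed genome size, the sampled sequence comes out slightly above 1%, consistent with the abstract's "roughly 1%". Because the abstract rounds its percentages, the derived counts are approximate.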


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cad/3762812/13204dca0572/pone.0072439.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cad/3762812/657ae253bc76/pone.0072439.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cad/3762812/1878056a1aee/pone.0072439.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cad/3762812/6b9b6f8cebb5/pone.0072439.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cad/3762812/1045d34c2847/pone.0072439.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3cad/3762812/70f9116c7abf/pone.0072439.g005.jpg

Similar articles

1
Insights into the loblolly pine genome: characterization of BAC and fosmid sequences.
PLoS One. 2013 Sep 4;8(9):e72439. doi: 10.1371/journal.pone.0072439. eCollection 2013.
2
Exploring the loblolly pine (Pinus taeda L.) genome by BAC sequencing and Cot analysis.
Gene. 2018 Jul 15;663:165-177. doi: 10.1016/j.gene.2018.04.024. Epub 2018 Apr 12.
3
The Pinus taeda genome is characterized by diverse and highly diverged repetitive sequences.
BMC Genomics. 2010 Jul 7;11:420. doi: 10.1186/1471-2164-11-420.
4
Unique features of the loblolly pine (Pinus taeda L.) megagenome revealed through sequence annotation.
Genetics. 2014 Mar;196(3):891-909. doi: 10.1534/genetics.113.159996.
5
Decoding the massive genome of loblolly pine using haploid DNA and novel assembly strategies.
Genome Biol. 2014 Mar 4;15(3):R59. doi: 10.1186/gb-2014-15-3-r59.
6
Adventures in the enormous: a 1.8 million clone BAC library for the 21.7 Gb genome of loblolly pine.
PLoS One. 2011 Jan 21;6(1):e16214. doi: 10.1371/journal.pone.0016214.
7
Sequencing and assembly of the 22-gb loblolly pine genome.
Genetics. 2014 Mar;196(3):875-90. doi: 10.1534/genetics.113.159715.
8
Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome.
BMC Plant Biol. 2009 Aug 6;9:106. doi: 10.1186/1471-2229-9-106.
9
A high-density gene map of loblolly pine (Pinus taeda L.) based on exome sequence capture genotyping.
G3 (Bethesda). 2014 Jan 10;4(1):29-37. doi: 10.1534/g3.113.008714.
10
Complete chloroplast genome sequence and comparative analysis of loblolly pine (Pinus taeda L.) with related species.
PLoS One. 2018 Mar 29;13(3):e0192966. doi: 10.1371/journal.pone.0192966. eCollection 2018.

Cited by

1
Annotation of Siberian Larch (Larix sibirica Ledeb.) Nuclear Genome-One of the Most Cold-Resistant Tree Species in the Only Deciduous GENUS in .
Plants (Basel). 2022 Aug 6;11(15):2062. doi: 10.3390/plants11152062.
2
Comparative Genomics Analysis of Repetitive Elements in Ten Gymnosperm Species: "Dark Repeatome" and Its Abundance in Conifer and Species.
Life (Basel). 2021 Nov 15;11(11):1234. doi: 10.3390/life11111234.
3
Insights from the first genome assembly of Onion (Allium cepa).
G3 (Bethesda). 2021 Sep 6;11(9). doi: 10.1093/g3journal/jkab243.
4
Comparative Repeat Profiling of Two Closely Related Conifers ( and ) Reveals High Genome Similarity With Only Few Fast-Evolving Satellite DNAs.
Front Genet. 2021 Jul 12;12:683668. doi: 10.3389/fgene.2021.683668. eCollection 2021.
5
Siberian larch (Larix sibirica Ledeb.) mitochondrial genome assembled using both short and long nucleotide sequence reads is currently the largest known mitogenome.
BMC Genomics. 2020 Sep 23;21(1):654. doi: 10.1186/s12864-020-07061-4.
6
A Field Guide to Eukaryotic Transposable Elements.
Annu Rev Genet. 2020 Nov 23;54:539-561. doi: 10.1146/annurev-genet-040620-022145. Epub 2020 Sep 21.
7
The transcriptome of Pinus pinaster under Fusarium circinatum challenge.
BMC Genomics. 2020 Jan 8;21(1):28. doi: 10.1186/s12864-019-6444-0.
8
A Reference Genome Sequence for the European Silver Fir (Abies alba Mill.): A Community-Generated Genomic Resource.
G3 (Bethesda). 2019 Jul 9;9(7):2039-2049. doi: 10.1534/g3.119.400083.
9
Novel Insights into Plant Genome Evolution and Adaptation as Revealed through Transposable Elements and Non-Coding RNAs in Conifers.
Genes (Basel). 2019 Mar 18;10(3):228. doi: 10.3390/genes10030228.
10
ConTEdb: a comprehensive database of transposable elements in conifers.
Database (Oxford). 2018 Jan 1;2018:bay131. doi: 10.1093/database/bay131.

References

1
A physical, genetic and functional sequence assembly of the barley genome.
Nature. 2012 Nov 29;491(7426):711-6. doi: 10.1038/nature11543. Epub 2012 Oct 17.
2
Towards decoding the conifer giga-genome.
Plant Mol Biol. 2012 Dec;80(6):555-69. doi: 10.1007/s11103-012-9961-7. Epub 2012 Sep 9.
3
New insights into nested long terminal repeat retrotransposons in Brassica species.
Mol Plant. 2013 Mar;6(2):470-82. doi: 10.1093/mp/sss081. Epub 2012 Aug 28.
4
Corky, a gypsy-like retrotransposon is differentially transcribed in Quercus suber tissues.
BMC Res Notes. 2012 Aug 13;5:432. doi: 10.1186/1756-0500-5-432.
5
MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects.
BMC Bioinformatics. 2011 Dec 22;12:491. doi: 10.1186/1471-2105-12-491.
6
Repetitive DNA and next-generation sequencing: computational challenges and solutions.
Nat Rev Genet. 2011 Nov 29;13(1):36-46. doi: 10.1038/nrg3117.
7
Phytozome: a comparative platform for green plant genomics.
Nucleic Acids Res. 2012 Jan;40(Database issue):D1178-86. doi: 10.1093/nar/gkr944. Epub 2011 Nov 22.
8
Characterizing the walnut genome through analyses of BAC end sequences.
Plant Mol Biol. 2012 Jan;78(1-2):95-107. doi: 10.1007/s11103-011-9849-y. Epub 2011 Nov 19.
9
Characterization of the genome of bald cypress.
BMC Genomics. 2011 Nov 11;12:553. doi: 10.1186/1471-2164-12-553.
10
Transcriptome profiling of wood maturation in Pinus radiata identifies differentially expressed genes with implications in juvenile and mature wood variation.
Gene. 2011 Nov 1;487(1):62-71. doi: 10.1016/j.gene.2011.07.028. Epub 2011 Aug 3.