从长序列 reads 重建的玉米染色体区域中串联基因拷贝的分析。

Analysis of tandem gene copies in maize chromosomal regions reconstructed from long sequence reads.

作者信息

Dong Jiaqiang, Feng Yaping, Kumar Dibyendu, Zhang Wei, Zhu Tingting, Luo Ming-Cheng, Messing Joachim

机构信息

Waksman Institute of Microbiology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854;

Department of Plant Sciences, University of California, Davis, CA 95616.

出版信息

Proc Natl Acad Sci U S A. 2016 Jul 19;113(29):7949-56. doi: 10.1073/pnas.1608775113. Epub 2016 Jun 27.

DOI:10.1073/pnas.1608775113

PMID:27354512

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4961126/

Abstract

Haplotype variation not only involves SNPs but also insertions and deletions, in particular gene copy number variations. However, comparisons of individual genomes have been difficult because traditional sequencing methods give too short reads to unambiguously reconstruct chromosomal regions containing repetitive DNA sequences. An example of such a case is the protein gene family in maize that acts as a sink for reduced nitrogen in the seed. Previously, 41-48 gene copies of the alpha zein gene family that spread over six loci spanning between 30- and 500-kb chromosomal regions have been described in two Iowa Stiff Stalk (SS) inbreds. Analyses of those regions were possible because of overlapping BAC clones, generated by an expensive and labor-intensive approach. Here we used single-molecule real-time (Pacific Biosciences) shotgun sequencing to assemble the six chromosomal regions from the Non-Stiff Stalk maize inbred W22 from a single DNA sequence dataset. To validate the reconstructed regions, we developed an optical map (BioNano genome map; BioNano Genomics) of W22 and found agreement between the two datasets. Using the sequences of full-length cDNAs from W22, we found that the error rate of PacBio sequencing seemed to be less than 0.1% after autocorrection and assembly. Expressed genes, some with premature stop codons, are interspersed with nonexpressed genes, giving rise to genotype-specific expression differences. Alignment of these regions with those from the previous analyzed regions of SS lines exhibits in part dramatic differences between these two heterotic groups.

摘要

单倍型变异不仅涉及单核苷酸多态性（SNP），还包括插入和缺失，特别是基因拷贝数变异。然而，个体基因组的比较一直很困难，因为传统测序方法获得的读长过短，无法明确重建包含重复DNA序列的染色体区域。玉米中的蛋白质基因家族就是这样一个例子，它在种子中作为还原态氮的储存库。此前，在两个爱荷华硬秆（SS）自交系中，已描述了α-玉米醇溶蛋白基因家族的41 - 48个基因拷贝，这些拷贝分布在跨越30 - 500 kb染色体区域的六个位点上。由于重叠细菌人工染色体（BAC）克隆，这些区域的分析才得以进行，而BAC克隆是通过一种昂贵且费力的方法产生的。在这里，我们使用单分子实时（Pacific Biosciences）鸟枪法测序，从单个DNA序列数据集中组装了非硬秆玉米自交系W22的六个染色体区域。为了验证重建区域，我们构建了W22的光学图谱（BioNano基因组图谱；BioNano Genomics），并发现两个数据集之间具有一致性。利用W22全长cDNA的序列，我们发现PacBio测序在自动校正和组装后的错误率似乎低于0.1%。表达的基因，有些带有提前终止密码子，与未表达的基因相间分布，导致基因型特异性的表达差异。将这些区域与之前分析的SS系区域进行比对，结果显示这两个杂种优势群之间部分存在显著差异。

相似文献

Analysis of tandem gene copies in maize chromosomal regions reconstructed from long sequence reads.从长序列 reads 重建的玉米染色体区域中串联基因拷贝的分析。

Proc Natl Acad Sci U S A. 2016 Jul 19;113(29):7949-56. doi: 10.1073/pnas.1608775113. Epub 2016 Jun 27.

The maize W22 genome provides a foundation for functional genomics and transposon biology.玉米 W22 基因组为功能基因组学和转座子生物学提供了基础。

Nat Genet. 2018 Sep;50(9):1282-1288. doi: 10.1038/s41588-018-0158-0. Epub 2018 Jul 30.

Enrichment of gene-coding sequences in maize by genome filtration.通过基因组过滤富集玉米中的基因编码序列。

Science. 2003 Dec 19;302(5653):2118-20. doi: 10.1126/science.1090047.

Differential gene expression and epiregulation of alpha zein gene copies in maize haplotypes.玉米单倍型中α-zein 基因拷贝的差异表达和表观调控。

PLoS Genet. 2011 Jun;7(6):e1002131. doi: 10.1371/journal.pgen.1002131. Epub 2011 Jun 23.

Methylation-sensitive linking libraries enhance gene-enriched sequencing of complex genomes and map DNA methylation domains.甲基化敏感连接文库增强了复杂基因组的基因富集测序并绘制DNA甲基化结构域图谱。

BMC Genomics. 2008 Dec 19;9:621. doi: 10.1186/1471-2164-9-621.

Maize haplotype with a helitron-amplified cytidine deaminase gene copy.具有一个通过Helitron转座子扩增的胞苷脱氨酶基因拷贝的玉米单倍型。

BMC Genet. 2006 Nov 9;7:52. doi: 10.1186/1471-2156-7-52.

Targeted analysis of orthologous phytochrome A regions of the sorghum, maize, and rice genomes using comparative gene-island sequencing.利用比较基因岛测序对高粱、玉米和水稻基因组的直系同源光敏色素A区域进行靶向分析。

Plant Physiol. 2002 Dec;130(4):1614-25. doi: 10.1104/pp.012567.

Genomic variation within the maize stiff-stalk heterotic germplasm pool.玉米硬秆杂种优势种质资源群体中的基因组变异。

Plant Genome. 2021 Nov;14(3):e20114. doi: 10.1002/tpg2.20114. Epub 2021 Jul 18.

Genomic organization of an alpha-zein gene cluster in maize.玉米中一个α-醇溶蛋白基因簇的基因组组织

Mol Gen Genet. 1992 Jan;231(2):304-12. doi: 10.1007/BF00279804.

Molecular analysis of high-copy insertion sites in maize.玉米中高拷贝插入位点的分子分析。

Nucleic Acids Res. 2004 Apr 1;32(6):e54. doi: 10.1093/nar/gnh052.

引用本文的文献

The Fie1-PRC2 complex regulates H3K27me3 deposition to balance endosperm filling and development in cereals.Fie1-PRC2复合体调控H3K27me3沉积，以平衡谷物胚乳的充实和发育。

Plant Commun. 2025 Jun 9;6(6):101343. doi: 10.1016/j.xplc.2025.101343. Epub 2025 Apr 22.

Editing the 19 kDa alpha-zein gene family generates non-opaque2-based quality protein maize.编辑 19 kDa α-zein 基因家族可产生基于非透明 2 的优质蛋白玉米。

Plant Biotechnol J. 2024 Apr;22(4):946-959. doi: 10.1111/pbi.14237. Epub 2023 Nov 21.

Copy Number Variation among Resistance Genes Analogues in .在. 中，抗性基因类似物的拷贝数变异。

Genes (Basel). 2022 Nov 4;13(11):2037. doi: 10.3390/genes13112037.

Tandem duplicate expression patterns are conserved between maize haplotypes of the -zein gene family.串联重复表达模式在玉米醇溶蛋白基因家族的单倍型之间是保守的。

Plant Direct. 2021 Sep 14;5(9):e346. doi: 10.1002/pld3.346. eCollection 2021 Sep.

Full-length transcript sequencing accelerates the transcriptome research of Gymnocypris namensis, an iconic fish of the Tibetan Plateau.全长转录本测序加速了高原标志性鱼类——花斑裸鲤的转录组研究。

Sci Rep. 2020 Jun 15;10(1):9668. doi: 10.1038/s41598-020-66582-w.

Genomics-Enabled Analysis of Genes Identifies New Alleles in Wheat and Related Species.基于基因组学的基因分析鉴定了小麦及相关物种中的新等位基因。

Int J Mol Sci. 2020 Feb 14;21(4):1304. doi: 10.3390/ijms21041304.

High frequency DNA rearrangement at creates a novel allele for Quality Protein Maize breeding.高频 DNA 重排在处产生了一个新的等位基因，用于优质蛋白玉米的培育。

Commun Biol. 2019 Dec 10;2:460. doi: 10.1038/s42003-019-0711-0. eCollection 2019.

Plant evolution and environmental adaptation unveiled by long-read whole-genome sequencing of .通过. 的长读全基因组测序揭示植物进化和环境适应

Proc Natl Acad Sci U S A. 2019 Sep 17;116(38):18893-18899. doi: 10.1073/pnas.1910401116. Epub 2019 Sep 4.

Nonallelic homologous recombination events responsible for copy number variation within an RNA silencing locus.导致RNA沉默位点内拷贝数变异的非等位基因同源重组事件。

Plant Direct. 2019 Aug 27;3(8):e00162. doi: 10.1002/pld3.162. eCollection 2019 Aug.

Altered nucleosome positions in maize haplotypes and mutants of a subset of SWI/SNF-like proteins.玉米单倍型和SWI/SNF样蛋白亚组突变体中核小体位置的改变。

Plant Direct. 2017 Oct 16;1(4):e00019. doi: 10.1002/pld3.19. eCollection 2017 Oct.

本文引用的文献

Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.耐旱草 Oropetium thomaeum 的单分子测序。

Nature. 2015 Nov 26;527(7579):508-11. doi: 10.1038/nature15714. Epub 2015 Nov 11.

Twenty years of bacterial genome sequencing.二十年的细菌基因组测序。

Nat Rev Microbiol. 2015 Dec;13(12):787-94. doi: 10.1038/nrmicro3565. Epub 2015 Nov 9.

Repbase Update, a database of repetitive elements in eukaryotic genomes.Repbase Update，一个真核生物基因组中重复元件的数据库。

Mob DNA. 2015 Jun 2;6:11. doi: 10.1186/s13100-015-0041-9. eCollection 2015.

High-resolution genetic mapping of maize pan-genome sequence anchors.玉米泛基因组序列锚定的高分辨率遗传图谱

Nat Commun. 2015 Apr 16;6:6914. doi: 10.1038/ncomms7914.

Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology.利用基于纳米通道的基因组作图技术快速检测人类基因组中的结构变异。

Gigascience. 2014 Dec 30;3(1):34. doi: 10.1186/2047-217X-3-34. eCollection 2014.

Dynamic transcriptome landscape of maize embryo and endosperm development.玉米胚和胚乳发育的动态转录组图谱

Plant Physiol. 2014 Sep;166(1):252-64. doi: 10.1104/pp.114.240689. Epub 2014 Jul 18.

Proteome balancing of the maize seed for higher nutritional value.调节玉米种子的蛋白质组以提高营养价值。

Front Plant Sci. 2014 May 30;5:240. doi: 10.3389/fpls.2014.00240. eCollection 2014.

PacBio sequencing of gene families - a case study with wheat gluten genes.PacBio 测序基因家族 - 以小麦醇溶蛋白基因为例。

Gene. 2014 Jan 10;533(2):541-6. doi: 10.1016/j.gene.2013.10.009. Epub 2013 Oct 19.

Gene tagging with engineered Ds elements in maize.利用工程化Ds元件在玉米中进行基因标记

Methods Mol Biol. 2013;1057:83-99. doi: 10.1007/978-1-62703-568-2_6.

Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.非杂交、基于长读长 SMRT 测序数据的完成微生物基因组组装。

Nat Methods. 2013 Jun;10(6):563-9. doi: 10.1038/nmeth.2474. Epub 2013 May 5.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验