鉴定人类基因组中独特的、复杂但有序的重复元件文库及其潜在参与病理生物学的可能性。

Identification of a unique library of complex, but ordered, arrays of repetitive elements in the human genome and implication of their potential involvement in pathobiology.

机构信息

Burn Research, Shriners Hospitals for Children Northern California and Department of Surgery, University of California-Davis, 2425 Stockton Blvd., Sacramento, CA 95817, USA.

出版信息

Exp Mol Pathol. 2011 Jun;90(3):300-11. doi: 10.1016/j.yexmp.2011.02.007. Epub 2011 Mar 1.

DOI:10.1016/j.yexmp.2011.02.007

PMID:21376035

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3092023/

Abstract

Approximately 2% of the human genome is reported to be occupied by genes. Various forms of repetitive elements (REs), both characterized and uncharacterized, are presumed to make up the vast majority of the rest of the genomes of human and other species. In conjunction with a comprehensive annotation of genes, information regarding components of genome biology, such as gene polymorphisms, non-coding RNAs, and certain REs, is found in human genome databases. However, the genome-wide profile of unique RE arrangements formed by different groups of REs has not been fully characterized yet. In this study, the entire human genome was subjected to an unbiased RE survey to establish a whole-genome profile of REs and their arrangements. Due to the limitation in query size within the bl2seq alignment program (National Center for Biotechnology Information [NCBI]) utilized for the RE survey, the entire NCBI reference human genome was fragmented into 6206 units of 0.5M nucleotides. A number of RE arrangements with varying complexities and patterns were identified throughout the genome. Each chromosome had unique profiles of RE arrangements and density, and high levels of RE density were measured near the centromere regions. Subsequently, 175 complex RE arrangements, which were selected throughout the genome, were subjected to a comparison analysis using five different human genome sequences. Interestingly, three of the five human genome databases shared the exactly same arrangement patterns and sequences for all 175 RE arrangement regions (a total of 12,765,625 nucleotides). The findings from this study demonstrate that a substantial fraction of REs in the human genome are clustered into various forms of ordered structures. Further investigations are needed to examine whether some of these ordered RE arrangements contribute to the human pathobiology as a functional genome unit.

摘要

据报道，人类基因组的大约 2%被基因占据。各种形式的重复元件（REs），包括已被描述和未被描述的，被认为构成了人类和其他物种基因组其余部分的绝大多数。结合对基因的全面注释，可以在人类基因组数据库中找到有关基因组生物学成分的信息，例如基因多态性、非编码 RNA 和某些 RE。然而，不同 RE 组形成的独特 RE 排列的全基因组图谱尚未得到充分描述。在这项研究中，对整个人类基因组进行了无偏 RE 调查，以建立 RE 及其排列的全基因组图谱。由于用于 RE 调查的 bl2seq 比对程序（国家生物技术信息中心 [NCBI]）中的查询大小限制，整个 NCBI 参考人类基因组被分割成 6206 个 0.5M 核苷酸的单位。在整个基因组中鉴定出具有不同复杂性和模式的多种 RE 排列。每条染色体都具有独特的 RE 排列和密度特征，在着丝粒区域附近测量到高水平的 RE 密度。随后，对整个基因组中选择的 175 个复杂 RE 排列进行了使用五个不同人类基因组序列的比较分析。有趣的是，五个人类基因组数据库中的三个共享了所有 175 个 RE 排列区域的完全相同的排列模式和序列（总共 12,765,625 个核苷酸）。这项研究的结果表明，人类基因组中的大量 RE 聚集在各种形式的有序结构中。需要进一步研究这些有序 RE 排列是否作为一个功能基因组单元有助于人类病理生物学。

相似文献

Identification of a unique library of complex, but ordered, arrays of repetitive elements in the human genome and implication of their potential involvement in pathobiology.

Exp Mol Pathol. 2011 Jun;90(3):300-11. doi: 10.1016/j.yexmp.2011.02.007. Epub 2011 Mar 1.

Unique profile of ordered arrangements of repetitive elements in the C57BL/6J mouse genome implicating their functional roles.

PLoS One. 2012;7(4):e35156. doi: 10.1371/journal.pone.0035156. Epub 2012 Apr 18.

Chromosome Res. 2013 Mar;21(1):15-26. doi: 10.1007/s10577-012-9334-8. Epub 2013 Jan 29.

REMiner: a tool for unbiased mining and analysis of repetitive elements and their arrangement structures of large chromosomes.

Genomics. 2011 Nov;98(5):381-9. doi: 10.1016/j.ygeno.2011.07.002. Epub 2011 Jul 22.

REViewer: a tool for linear visualization of repetitive elements within a sequence query.

Genomics. 2013 Oct;102(4):209-14. doi: 10.1016/j.ygeno.2013.07.008. Epub 2013 Jul 24.

REMiner-II: a tool for rapid identification and configuration of repetitive element arrays from large mammalian chromosomes as a single query.

Genomics. 2012 Sep;100(3):131-40. doi: 10.1016/j.ygeno.2012.06.006. Epub 2012 Jun 28.

[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].

Yi Chuan Xue Bao. 2004 May;31(5):431-43.

WindowMasker: window-based masker for sequenced genomes.

Bioinformatics. 2006 Jan 15;22(2):134-41. doi: 10.1093/bioinformatics/bti774. Epub 2005 Nov 15.

[Correction of five different types of errors of model REFSEQs appeared in NCBI human gene database only by using two novel human genes C17orf32 and ZNF362].

Yi Chuan Xue Bao. 2004 Apr;31(4):325-34.

Temporal and spatial rearrangements of a repetitive element array on C57BL/6J mouse genome.

Exp Mol Pathol. 2015 Jun;98(3):439-45. doi: 10.1016/j.yexmp.2015.03.037. Epub 2015 Mar 31.

引用本文的文献

Temporal and spatial rearrangements of a repetitive element array on C57BL/6J mouse genome.

Exp Mol Pathol. 2015 Jun;98(3):439-45. doi: 10.1016/j.yexmp.2015.03.037. Epub 2015 Mar 31.

REViewer: a tool for linear visualization of repetitive elements within a sequence query.

Genomics. 2013 Oct;102(4):209-14. doi: 10.1016/j.ygeno.2013.07.008. Epub 2013 Jul 24.

Chromosome Res. 2013 Mar;21(1):15-26. doi: 10.1007/s10577-012-9334-8. Epub 2013 Jan 29.

REMiner-II: a tool for rapid identification and configuration of repetitive element arrays from large mammalian chromosomes as a single query.

Genomics. 2012 Sep;100(3):131-40. doi: 10.1016/j.ygeno.2012.06.006. Epub 2012 Jun 28.

Unique profile of ordered arrangements of repetitive elements in the C57BL/6J mouse genome implicating their functional roles.

PLoS One. 2012;7(4):e35156. doi: 10.1371/journal.pone.0035156. Epub 2012 Apr 18.

本文引用的文献

The first Korean genome sequence and analysis: full genome sequencing for a socio-ethnic group.

Genome Res. 2009 Sep;19(9):1622-9. doi: 10.1101/gr.092197.109. Epub 2009 May 26.

Comparative genomics and molecular dynamics of DNA repeats in eukaryotes.

Microbiol Mol Biol Rev. 2008 Dec;72(4):686-727. doi: 10.1128/MMBR.00011-08.

The diploid genome sequence of an Asian individual.

Nature. 2008 Nov 6;456(7218):60-5. doi: 10.1038/nature07484.

The complete genome of an individual by massively parallel DNA sequencing.

Nature. 2008 Apr 17;452(7189):872-6. doi: 10.1038/nature06884.

Endogenous retroviruses in systemic response to stress signals.

Shock. 2008 Aug;30(2):105-16. doi: 10.1097/SHK.0b013e31816a363f.

Evol Dev. 2007 Nov-Dec;9(6):555-65. doi: 10.1111/j.1525-142X.2007.00196.x.

The diploid genome sequence of an individual human.

PLoS Biol. 2007 Sep 4;5(10):e254. doi: 10.1371/journal.pbio.0050254.

Expandable DNA repeats and human disease.

Nature. 2007 Jun 21;447(7147):932-40. doi: 10.1038/nature05977.

Useful 'junk': Alu RNAs in the human transcriptome.

Cell Mol Life Sci. 2007 Jul;64(14):1793-800. doi: 10.1007/s00018-007-7084-0.

Current topics in genome evolution: molecular mechanisms of new gene formation.

Cell Mol Life Sci. 2007 Mar;64(5):542-54. doi: 10.1007/s00018-006-6453-4.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

鉴定人类基因组中独特的、复杂但有序的重复元件文库及其潜在参与病理生物学的可能性。

Identification of a unique library of complex, but ordered, arrays of repetitive elements in the human genome and implication of their potential involvement in pathobiology.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献