• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

小鼠基因组中近期的片段和基因重复。

Recent segmental and gene duplications in the mouse genome.

作者信息

Cheung Joseph, Wilson Michael D, Zhang Junjun, Khaja Razi, MacDonald Jeffrey R, Heng Henry H Q, Koop Ben F, Scherer Stephen W

机构信息

Program in Genetics and Genomic Biology, Research Institute, The Hospital for Sick Children, Toronto, ON M5G 1X8, Canada.

出版信息

Genome Biol. 2003;4(8):R47. doi: 10.1186/gb-2003-4-8-r47. Epub 2003 Jul 9.

DOI:10.1186/gb-2003-4-8-r47
PMID:12914656
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC193640/
Abstract

BACKGROUND

The high quality of the mouse genome draft sequence and its associated annotations are an invaluable biological resource. Identifying recent duplications in the mouse genome, especially in regions containing genes, may highlight important events in recent murine evolution. In addition, detecting recent sequence duplications can reveal potentially problematic regions of the genome assembly. We use BLAST-based computational heuristics to identify large (>/= 5 kb) and recent (>/= 90% sequence identity) segmental duplications in the mouse genome sequence. Here we present a database of recently duplicated regions of the mouse genome found in the mouse genome sequencing consortium (MGSC) February 2002 and February 2003 assemblies.

RESULTS

We determined that 33.6 Mb of 2,695 Mb (1.2%) of sequence from the February 2003 mouse genome sequence assembly is involved in recent segmental duplications, which is less than that observed in the human genome (around 3.5-5%). From this dataset, 8.9 Mb (26%) of the duplication content consisted of 'unmapped' chromosome sequence. Moreover, we suspect that an additional 18.5 Mb of sequence is involved in duplication artifacts arising from sequence misassignment errors in this genome assembly. By searching for genes that are located within these regions, we identified 675 genes that mapped to duplicated regions of the mouse genome. Sixteen of these genes appear to have been duplicated independently in the human genome. From our dataset we further characterized a 42 kb recent segmental duplication of Mater, a maternal-effect gene essential for embryogenesis in mice.

CONCLUSION

Our results provide an initial analysis of the recently duplicated sequence and gene content of the mouse genome. Many of these duplicated loci, as well as regions identified to be involved in potential sequence misassignment errors, will require further mapping and sequencing to achieve accuracy. A Genome Browser database was set up to display the identified duplication content presented in this work. This data will also be relevant to the growing number of investigators who use the draft genome sequence for experimental design and analysis.

摘要

背景

高质量的小鼠基因组草图序列及其相关注释是一种宝贵的生物学资源。识别小鼠基因组中近期的重复片段,尤其是包含基因的区域,可能会突显小鼠近期进化中的重要事件。此外,检测近期的序列重复可以揭示基因组组装中潜在的问题区域。我们使用基于BLAST的计算启发式方法来识别小鼠基因组序列中大片段(≥5 kb)且近期(序列同一性≥90%)的片段重复。在此,我们展示了一个在2002年2月和2003年2月小鼠基因组测序联盟(MGSC)组装中发现的小鼠基因组近期重复区域的数据库。

结果

我们确定,2003年2月小鼠基因组序列组装的2695 Mb序列中有33.6 Mb(1.2%)参与了近期的片段重复,这一比例低于人类基因组中观察到的比例(约3.5 - 5%)。在这个数据集中,8.9 Mb(26%)的重复内容由“未定位”的染色体序列组成。此外,我们怀疑另有18.5 Mb的序列参与了因该基因组组装中序列错误分配导致的重复假象。通过搜索位于这些区域内的基因,我们鉴定出675个映射到小鼠基因组重复区域的基因。其中16个基因似乎在人类基因组中是独立重复的。从我们的数据集中,我们进一步对 Mater(小鼠胚胎发育必需的母源效应基因)的一个42 kb近期片段重复进行了特征分析。

结论

我们的结果提供了对小鼠基因组近期重复序列和基因内容的初步分析。许多这些重复位点以及被确定参与潜在序列错误分配错误的区域,将需要进一步的定位和测序以达到准确。我们建立了一个基因组浏览器数据库来展示本研究中鉴定出的重复内容。这些数据对于越来越多使用基因组草图序列进行实验设计和分析的研究人员也将具有相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcd7/193640/b8927c4c5523/gb-2003-4-8-r47-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcd7/193640/b28542fa9726/gb-2003-4-8-r47-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcd7/193640/290f256deae5/gb-2003-4-8-r47-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcd7/193640/b8927c4c5523/gb-2003-4-8-r47-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcd7/193640/b28542fa9726/gb-2003-4-8-r47-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcd7/193640/290f256deae5/gb-2003-4-8-r47-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcd7/193640/b8927c4c5523/gb-2003-4-8-r47-3.jpg

相似文献

1
Recent segmental and gene duplications in the mouse genome.小鼠基因组中近期的片段和基因重复。
Genome Biol. 2003;4(8):R47. doi: 10.1186/gb-2003-4-8-r47. Epub 2003 Jul 9.
2
Recent segmental duplications in the working draft assembly of the brown Norway rat.褐家鼠工作草图组装中的近期节段性重复。
Genome Res. 2004 Apr;14(4):493-506. doi: 10.1101/gr.1907504.
3
Segmental duplications: organization and impact within the current human genome project assembly.节段性重复:当前人类基因组计划组装中的组织与影响
Genome Res. 2001 Jun;11(6):1005-17. doi: 10.1101/gr.gr-1871r.
4
Genome-wide detection of segmental duplications and potential assembly errors in the human genome sequence.人类基因组序列中节段性重复和潜在组装错误的全基因组检测。
Genome Biol. 2003;4(4):R25. doi: 10.1186/gb-2003-4-4-r25. Epub 2003 Mar 17.
5
Segmental duplication density decrease with distance to human-mouse breaks of synteny.节段性重复密度随与人类-小鼠共线性断裂的距离而降低。
Eur J Hum Genet. 2006 Feb;14(2):216-21. doi: 10.1038/sj.ejhg.5201534.
6
Patterns of segmental duplication in the human genome.人类基因组中的片段重复模式。
Mol Biol Evol. 2005 Jan;22(1):135-41. doi: 10.1093/molbev/msh262. Epub 2004 Sep 15.
7
Analysis of segmental duplications and genome assembly in the mouse.小鼠中节段性重复与基因组组装的分析
Genome Res. 2004 May;14(5):789-801. doi: 10.1101/gr.2238404.
8
Enrichment of segmental duplications in regions of breaks of synteny between the human and mouse genomes suggest their involvement in evolutionary rearrangements.人类和小鼠基因组之间同线性断裂区域中节段性重复的富集表明它们参与了进化重排。
Hum Mol Genet. 2003 Sep 1;12(17):2201-8. doi: 10.1093/hmg/ddg223. Epub 2003 Jul 8.
9
Analysis of recent segmental duplications in the bovine genome.牛基因组中近期片段重复的分析。
BMC Genomics. 2009 Dec 1;10:571. doi: 10.1186/1471-2164-10-571.
10
Human-specific duplication and mosaic transcripts: the recent paralogous structure of chromosome 22.人类特有的重复和嵌合转录本:22号染色体的近期旁系同源结构。
Am J Hum Genet. 2002 Jan;70(1):83-100. doi: 10.1086/338458. Epub 2001 Nov 30.

引用本文的文献

1
Genome organization and botanical diversity.基因组组织与植物多样性。
Plant Cell. 2024 May 1;36(5):1186-1204. doi: 10.1093/plcell/koae045.
2
A High-Quality Blue Whale Genome, Segmental Duplications, and Historical Demography.高质量蓝鲸基因组、片段重复序列和历史人口动态。
Mol Biol Evol. 2024 Mar 1;41(3). doi: 10.1093/molbev/msae036.
3
Telomere-to-telomere pear () reference genome reveals segmental and whole genome duplication driving genome evolution.端粒到端粒的梨()参考基因组揭示了片段和全基因组重复驱动基因组进化。

本文引用的文献

1
Genome-wide detection of segmental duplications and potential assembly errors in the human genome sequence.人类基因组序列中节段性重复和潜在组装错误的全基因组检测。
Genome Biol. 2003;4(4):R25. doi: 10.1186/gb-2003-4-4-r25. Epub 2003 Mar 17.
2
Engineering the mouse genome with bacterial artificial chromosomes to create multipurpose alleles.利用细菌人工染色体构建小鼠基因组以创建多用途等位基因。
Nat Biotechnol. 2003 Apr;21(4):443-7. doi: 10.1038/nbt804. Epub 2003 Mar 10.
3
Whole-genome sequence assembly for mammalian genomes: Arachne 2.
Hortic Res. 2023 Oct 12;10(11):uhad201. doi: 10.1093/hr/uhad201. eCollection 2023 Nov.
4
A haplotype-resolved genome assembly of the Nile rat facilitates exploration of the genetic basis of diabetes.尼罗草原田鼠单倍型基因组组装图谱助力探索糖尿病的遗传基础。
BMC Biol. 2022 Nov 8;20(1):245. doi: 10.1186/s12915-022-01427-8.
5
Genomic Identification, Evolution, and Expression Analysis of Bromodomain Genes Family in Buffalo.水牛中溴结构域基因家族的基因组鉴定、进化和表达分析。
Genes (Basel). 2022 Jan 1;13(1):103. doi: 10.3390/genes13010103.
6
Gene Duplication Analysis Reveals No Ancient Whole Genome Duplication but Extensive Small-Scale Duplications during Genome Evolution and Adaptation of .基因复制分析显示, 在 的基因组进化和适应过程中,没有发生古老的全基因组复制,而是发生了广泛的小规模复制。
Front Cell Infect Microbiol. 2017 Sep 21;7:412. doi: 10.3389/fcimb.2017.00412. eCollection 2017.
7
Chromosomal Rearrangements in Cancer: Detection and potential causal mechanisms.癌症中的染色体重排:检测与潜在因果机制
Mol Cell Oncol. 2014 Jul;1(1). doi: 10.4161/mco.29904.
8
Sequence, structural and expression divergence of duplicate genes in the bovine genome.牛基因组中重复基因的序列、结构和表达差异
PLoS One. 2014 Jul 23;9(7):e102868. doi: 10.1371/journal.pone.0102868. eCollection 2014.
9
Functional conservation of Gsdma cluster genes specifically duplicated in the mouse genome.在小鼠基因组中特异性复制的Gsdma基因簇基因的功能保守性。
G3 (Bethesda). 2013 Oct 3;3(10):1843-50. doi: 10.1534/g3.113.007393.
10
Reconstruction of ancestral metabolic enzymes reveals molecular mechanisms underlying evolutionary innovation through gene duplication.重建祖先代谢酶揭示了基因复制导致进化创新的分子机制。
PLoS Biol. 2012;10(12):e1001446. doi: 10.1371/journal.pbio.1001446. Epub 2012 Dec 11.
哺乳动物基因组的全基因组序列组装:Arachne 2
Genome Res. 2003 Jan;13(1):91-6. doi: 10.1101/gr.828403.
4
Initial sequencing and comparative analysis of the mouse genome.小鼠基因组的初步测序与比较分析。
Nature. 2002 Dec 5;420(6915):520-62. doi: 10.1038/nature01262.
5
Gene content and function of the ancestral chromosome fusion site in human chromosome 2q13-2q14.1 and paralogous regions.人类2号染色体2q13 - 2q14.1区域及旁系同源区域中祖先染色体融合位点的基因内容与功能。
Genome Res. 2002 Nov;12(11):1663-72. doi: 10.1101/gr.338402.
6
Splitting pairs: the diverging fates of duplicated genes.基因对的分化:重复基因的不同命运
Nat Rev Genet. 2002 Nov;3(11):827-37. doi: 10.1038/nrg928.
7
The generic genome browser: a building block for a model organism system database.通用基因组浏览器:模式生物系统数据库的一个构建模块。
Genome Res. 2002 Oct;12(10):1599-610. doi: 10.1101/gr.403602.
8
Genetic sophistication of human complement components C4A and C4B and RP-C4-CYP21-TNX (RCCX) modules in the major histocompatibility complex.主要组织相容性复合体中人类补体成分C4A和C4B以及RP-C4-CYP21-TNX(RCCX)模块的基因复杂性。
Am J Hum Genet. 2002 Oct;71(4):823-37. doi: 10.1086/342777. Epub 2002 Sep 11.
9
Recent segmental duplications in the human genome.人类基因组中近期的节段性重复。
Science. 2002 Aug 9;297(5583):1003-7. doi: 10.1126/science.1072047.
10
Chromosomal regions containing high-density and ambiguously mapped putative single nucleotide polymorphisms (SNPs) correlate with segmental duplications in the human genome.包含高密度且定位模糊的假定单核苷酸多态性(SNP)的染色体区域与人类基因组中的片段重复相关。
Hum Mol Genet. 2002 Aug 15;11(17):1987-95. doi: 10.1093/hmg/11.17.1987.