• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用全基因组接触图谱填补酵母基因组中的注释空白。

Filling annotation gaps in yeast genomes using genome-wide contact maps.

机构信息

Institut Pasteur, Groupe Régulation Spatiale des Génomes, Department of Genomes and Genetics, CNRS, UMR 3525, Institut Pasteur, Unité Imagerie et Modélisation, Department of Cell Biology and Infection, CNRS, URA 2582, F-75015 Paris, France, Institute for Research on Cancer and Ageing of Nice (IRCAN), CNRS UMR 7284 - INSERM U108, Université de Nice Sophia Antipolis, 06107 Nice, France, CNRS, UMR7238, Biologie Computationnelle et Quantitative and Sorbonne Universités, UPMC Univ Paris 06, UMR7238, Biologie Computationnelle et Quantitative, F-75005, Paris, FranceInstitut Pasteur, Groupe Régulation Spatiale des Génomes, Department of Genomes and Genetics, CNRS, UMR 3525, Institut Pasteur, Unité Imagerie et Modélisation, Department of Cell Biology and Infection, CNRS, URA 2582, F-75015 Paris, France, Institute for Research on Cancer and Ageing of Nice (IRCAN), CNRS UMR 7284 - INSERM U108, Université de Nice Sophia Antipolis, 06107 Nice, France, CNRS, UMR7238, Biologie Computationnelle et Quantitative and Sorbonne Universités, UPMC Univ Paris 06, UMR7238, Biologie Computationnelle et Quantitative, F-75005, Paris, FranceInstitut Pasteur, Groupe Régulation Spatiale des Génomes, Department of Genomes and Genetics, CNRS, UMR 3525, Institut Pasteur, Unité Imagerie et Modélisation, Department of Cell Biology and Infection, CNRS, URA 2582, F-75015 Paris, France, Institute for Research on Cancer and Ageing of Nice (IRCAN), CNRS UMR 7284 - INSERM U108, Université de Nice Sophia Antipolis, 06107 Nice, France, CNRS, UMR7238, Biologie Computationnelle et Quantitative and Sorbonne Universités, UPMC Univ Paris 06, UMR7238, Biologie Computationnelle et Quantitative, F-75005, Paris, FranceInstitut Pasteur, Groupe Régulation Spatiale des Génomes, Department of Genomes and Genetics, CNRS, UMR 3525, Institut Pasteur, Unité Imagerie et Modélisation, Department of Cell Biology and Infection, CNRS, URA 2582, F-75015 Paris, France, Institute for Research on Cancer and Agein

Institut Pasteur, Groupe Régulation Spatiale des Génomes, Department of Genomes and Genetics, CNRS, UMR 3525, Institut Pasteur, Unité Imagerie et Modélisation, Department of Cell Biology and Infection, CNRS, URA 2582, F-75015 Paris, France, Institute for Research on Cancer and Ageing of Nice (IRCAN), CNRS UMR 7284 - INSERM U108, Université de Nice Sophia Antipolis, 06107 Nice, France, CNRS, UMR7238, Biologie Computationnelle et Quantitative and Sorbonne Universités, UPMC Univ Paris 06, UMR7238, Biologie Computationnelle et Quantitative, F-75005, Paris, FranceInstitut Pasteur, Groupe Régulation Spatiale des Génomes, Department of Genomes and Genetics, CNRS, UMR 3525, Institut Pasteur, Unité Imagerie et Modélisation, Department of Cell Biology and Infection, CNRS, URA 2582, F-75015 Paris, France, Institute for Research on Cancer and Ageing of Nice (IRCAN), CNRS UMR 7284 - INSERM U108, Université de Nice Sophia Antipolis, 06107 Nice, France, CNRS, UMR7238, Biologie Computationnelle et Quantitative and Sorbonne Universités, UPMC Univ Paris 06, UMR7238, Biologie Computationnelle et Quantitative, F-75005, Paris, France.

出版信息

Bioinformatics. 2014 Aug 1;30(15):2105-13. doi: 10.1093/bioinformatics/btu162. Epub 2014 Apr 7.

DOI:10.1093/bioinformatics/btu162
PMID:24711652
Abstract

MOTIVATIONS

De novo sequencing of genomes is followed by annotation analyses aiming at identifying functional genomic features such as genes, non-coding RNAs or regulatory sequences, taking advantage of diverse datasets. These steps sometimes fail at detecting non-coding functional sequences: for example, origins of replication, centromeres and rDNA positions have proven difficult to annotate with high confidence. Here, we demonstrate an unconventional application of Chromosome Conformation Capture (3C) technique, which typically aims at deciphering the average 3D organization of genomes, by showing how functional information about the sequence can be extracted solely from the chromosome contact map.

RESULTS

Specifically, we describe a combined experimental and bioinformatic procedure that determines the genomic positions of centromeres and ribosomal DNA clusters in yeasts, including species where classical computational approaches fail. For instance, we determined the centromere positions in Naumovozyma castellii, where these coordinates could not be obtained previously. Although computed centromere positions were characterized by conserved synteny with neighboring species, no consensus sequences could be found, suggesting that centromeric binding proteins or mechanisms have significantly diverged. We also used our approach to refine centromere positions in Kuraishia capsulata and to identify rDNA positions in Debaryomyces hansenii. Our study demonstrates how 3C data can be used to complete the functional annotation of eukaryotic genomes.

AVAILABILITY AND IMPLEMENTATION

The source code is provided in the Supplementary Material. This includes a zipped file with the Python code and a contact matrix of Saccharomyces cerevisiae.

CONTACT

romain.koszul@pasteur.fr

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

对基因组进行从头测序后,通常会进行注释分析,以利用各种数据集来识别基因、非编码 RNA 或调控序列等功能基因组特征。这些步骤有时无法检测到非编码功能序列:例如,复制起点、着丝粒和 rDNA 位置的注释置信度很难提高。在这里,我们展示了染色体构象捕获(3C)技术的一种非常规应用,该技术通常旨在解析基因组的平均 3D 结构,我们展示了如何仅从染色体接触图谱中提取有关序列的功能信息。

结果

具体来说,我们描述了一种组合的实验和生物信息学程序,该程序确定了酵母中的着丝粒和核糖体 DNA 簇的基因组位置,包括经典计算方法无法确定的物种。例如,我们确定了 Naumovozyma castellii 中的着丝粒位置,在此之前无法获得这些坐标。尽管计算出的着丝粒位置与相邻物种具有保守的同线性,但没有找到共识序列,这表明着丝粒结合蛋白或机制已经显著分化。我们还使用我们的方法来改进 Kuraishia capsulata 中的着丝粒位置,并确定 Debaryomyces hansenii 中的 rDNA 位置。我们的研究表明 3C 数据如何用于完成真核基因组的功能注释。

可用性和实施

源代码在补充材料中提供。这包括一个包含 Python 代码和酿酒酵母接触矩阵的压缩文件。

联系方式

romain.koszul@pasteur.fr

补充信息

补充数据可在 Bioinformatics 在线获取。

相似文献

1
Filling annotation gaps in yeast genomes using genome-wide contact maps.利用全基因组接触图谱填补酵母基因组中的注释空白。
Bioinformatics. 2014 Aug 1;30(15):2105-13. doi: 10.1093/bioinformatics/btu162. Epub 2014 Apr 7.
2
A pipeline for automated annotation of yeast genome sequences by a conserved-synteny approach.通过保守同线性方法对酵母基因组序列进行自动化注释的流水线。
BMC Bioinformatics. 2012 Sep 17;13:237. doi: 10.1186/1471-2105-13-237.
3
Complete DNA sequence of Kuraishia capsulata illustrates novel genomic features among budding yeasts (Saccharomycotina).菊头孢霉的完整 DNA 序列阐明了出芽酵母(子囊菌门)中的新型基因组特征。
Genome Biol Evol. 2013;5(12):2524-39. doi: 10.1093/gbe/evt201.
4
The Methylotroph Gene Order Browser (MGOB) reveals conserved synteny and ancestral centromere locations in the yeast family Pichiaceae.甲基营养菌基因顺序浏览器(MGOB)揭示了毕赤酵母科酵母中的保守同线性和祖先着丝粒位置。
FEMS Yeast Res. 2019 Sep 1;19(6). doi: 10.1093/femsyr/foz058.
5
Saccharomyces cerevisiae: gene annotation and genome variability, state of the art through comparative genomics.酿酒酵母:基因注释与基因组变异性,通过比较基因组学呈现的最新技术水平
Methods Mol Biol. 2011;759:31-40. doi: 10.1007/978-1-61779-173-4_2.
6
Discovery of an unconventional centromere in budding yeast redefines evolution of point centromeres.在芽殖酵母中发现非常规着丝粒重新定义了点着丝粒的进化。
Curr Biol. 2015 Aug 3;25(15):2026-33. doi: 10.1016/j.cub.2015.06.023. Epub 2015 Jul 9.
7
Accurate identification of centromere locations in yeast genomes using Hi-C.利用Hi-C技术准确识别酵母基因组中的着丝粒位置。
Nucleic Acids Res. 2015 Jun 23;43(11):5331-9. doi: 10.1093/nar/gkv424. Epub 2015 May 4.
8
Alpha-CENTAURI: assessing novel centromeric repeat sequence variation with long read sequencing.半人马座α星:利用长读长测序评估新型着丝粒重复序列变异
Bioinformatics. 2016 Jul 1;32(13):1921-1924. doi: 10.1093/bioinformatics/btw101. Epub 2016 Feb 24.
9
Only centromeres can supply the partition system required for ARS function in the yeast Yarrowia lipolytica.只有着丝粒能够提供解脂耶氏酵母中ARS功能所需的分配系统。
J Mol Biol. 2001 Jan 12;305(2):203-17. doi: 10.1006/jmbi.2000.4300.
10
Asymmetry indices for analysis and prediction of replication origins in eukaryotic genomes.用于分析和预测真核基因组复制起点的不对称指数。
PLoS One. 2012;7(9):e45050. doi: 10.1371/journal.pone.0045050. Epub 2012 Sep 27.

引用本文的文献

1
Convergent Patterns of Karyotype Evolution Underlying Karyotype Uniformity in Conifers.针叶树染色体组型一致性背后的染色体组型进化趋同模式。
Adv Sci (Weinh). 2025 Feb;12(7):e2411098. doi: 10.1002/advs.202411098. Epub 2024 Dec 25.
2
Chromosome-scale assembly of the streamlined picoeukaryote sp. SENEW3 genome reveals Rabl-like chromatin structure and potential for C photosynthesis.流线型微微型真核生物 sp. SENEW3 染色体水平基因组组装揭示了类 Rabl 染色质结构和 C 光合作用的潜力。
Microb Genom. 2024 Apr;10(4). doi: 10.1099/mgen.0.001223.
3
Chromosome-scale assembly and evolution of the tetraploid Salvia splendens (Lamiaceae) genome.
四倍体一串红(唇形科)基因组的染色体水平组装与进化
Hortic Res. 2021 Sep 1;8(1):177. doi: 10.1038/s41438-021-00614-y.
4
Computer vision for pattern detection in chromosome contact maps.计算机视觉在染色体接触图谱中的模式检测。
Nat Commun. 2020 Nov 16;11(1):5795. doi: 10.1038/s41467-020-19562-7.
5
MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries.MetaTOR:一种从哺乳动物肠道邻近连接(meta3C)文库中恢复高质量宏基因组分箱的计算流程。
Front Genet. 2019 Aug 20;10:753. doi: 10.3389/fgene.2019.00753. eCollection 2019.
6
The Methylotroph Gene Order Browser (MGOB) reveals conserved synteny and ancestral centromere locations in the yeast family Pichiaceae.甲基营养菌基因顺序浏览器(MGOB)揭示了毕赤酵母科酵母中的保守同线性和祖先着丝粒位置。
FEMS Yeast Res. 2019 Sep 1;19(6). doi: 10.1093/femsyr/foz058.
7
Characterizing meiotic chromosomes' structure and pairing using a designer sequence optimized for Hi-C.使用经过 Hi-C 优化的设计序列来描绘减数分裂染色体的结构和配对。
Mol Syst Biol. 2018 Jul 16;14(7):e8293. doi: 10.15252/msb.20188293.
8
From dynamic chromatin architecture to DNA damage repair and back.从动态染色质结构到 DNA 损伤修复,再回到动态染色质结构。
Nucleus. 2018 Jan 1;9(1):161-170. doi: 10.1080/19491034.2017.1419847.
9
Proximity ligation scaffolding and comparison of two strains genomes.邻近连接支架构建及两菌株基因组比较
Biotechnol Biofuels. 2017 Jun 12;10:151. doi: 10.1186/s13068-017-0837-6. eCollection 2017.
10
Inferring the physical properties of yeast chromatin through Bayesian analysis of whole nucleus simulations.通过对全细胞核模拟进行贝叶斯分析推断酵母染色质的物理性质。
Genome Biol. 2017 May 3;18(1):81. doi: 10.1186/s13059-017-1199-x.