New methodology for repetitive sequences identification in X and Y chromosomes.

Author information

Touati Rabeb, Tajouri Asma, Mesaoudi Imen, Oueslati Afef Elloumi, Lachiri Zied, Kharrat Maher

Affiliations

University of Tunis El Manar, LR99ES10 Human Genetics Laboratory, Faculty of Medicine of Tunis (FMT), Tunisia.

University of Tunis El Manar, SITI Laboratory, National School of Engineers of Tunis, BP 37, Le Belvédère, 1002, Tunis, Tunisia.

Publication information

Biomed Signal Process Control. 2021 Feb;64:102207. doi: 10.1016/j.bspc.2020.102207. Epub 2020 Oct 19.

DOI: 10.1016/j.bspc.2020.102207

PMID: 33101452

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7572123/

Abstract

Repetitive DNA sequences make up a major proportion of the human genome and of the genomes of other species. The importance of each type of repetitive DNA depends on many factors, including its structural and functional roles and the positions, lengths, and numbers of the repeats. Whether such sequences are conserved at different locations in a chromosome remains an open question for biologists. Locating them despite their great variability, and finding novel repetitive sequences, remains a challenging task. To address this problem, we developed a new method based on signal and image processing tools. With this method, we could find repetitive patterns in DNA images regardless of repeat length. The new technique appears to be more efficient at detecting new repetitive sequences than classical bioinformatics tools, whose performance is limited, especially in the presence of mutations (insertions or deletions). By contrast, modifying one or a few pixels in an image does not affect the overall form of the repetitive pattern. Using this method, we generated a new database of repetitive patterns containing tandem and dispersed repeats. The highly repetitive sequences we identified in the X and Y chromosomes are also found in other human chromosomes or in other genomes. The generated data were then used as input to a convolutional neural network classifier. The resulting system is efficient, with an average recognition score of 94.4%.
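
The abstract does not say how a DNA sequence is turned into an image, so the following is only an illustrative sketch and not the authors' method. It assumes a simple dot-plot (self-similarity) image, in which pixel (i, j) is set when bases i and j are equal, to show why an image-based view can tolerate a single insertion. The function names `dot_plot` and `dominant_period`, the repeat unit, and the insertion position are all hypothetical choices made for this example.

```python
import numpy as np


def dot_plot(seq: str) -> np.ndarray:
    """Binary self-similarity image: pixel (i, j) is 1 when seq[i] == seq[j]."""
    codes = np.frombuffer(seq.encode("ascii"), dtype=np.uint8)
    return (codes[:, None] == codes[None, :]).astype(np.uint8)


def dominant_period(image: np.ndarray, max_period: int) -> int:
    """Offset in 1..max_period whose diagonal has the highest fraction of matches.

    A tandem repeat of period p shows up as a bright diagonal stripe at
    offset p, so the brightest off-diagonal is taken as the repeat length.
    """
    scores = [np.diagonal(image, offset=k).mean() for k in range(1, max_period + 1)]
    return int(np.argmax(scores)) + 1


# Hypothetical 7-base repeat unit, repeated 12 times (a tandem repeat).
unit = "ACGGTCA"
clean = unit * 12

# The same repeat with one inserted base in the middle (an insertion mutation).
mutated = clean[:40] + "T" + clean[40:]

period_clean = dominant_period(dot_plot(clean), max_period=20)
period_mutated = dominant_period(dot_plot(mutated), max_period=20)

print(period_clean, period_mutated)
```

In this toy case, the insertion changes only a handful of pixels along the stripe at offset 7, so the stripe stays the brightest one and the estimated repeat length is the same for both sequences. An exact string comparison of the two sequences, by contrast, would fall out of register for every base after the insertion point.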

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/071b/7572123/5c5de8105248/fx1_lrg.jpg
