• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于创建猫基因组合成核酸序列的生成对抗网络。

Generative Adversarial Networks for Creating Synthetic Nucleic Acid Sequences of Cat Genome.

作者信息

Hazra Debapriya, Kim Mi-Ryung, Byun Yung-Cheol

机构信息

Department of Computer Engineering, Jeju National University, Jeju 63243, Korea.

Veterinary Internal Medicine, Kyungpook National University, Daegu 41566, Korea.

出版信息

Int J Mol Sci. 2022 Mar 28;23(7):3701. doi: 10.3390/ijms23073701.

DOI:10.3390/ijms23073701
PMID:35409058
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8998662/
Abstract

Nucleic acids are the basic units of deoxyribonucleic acid (DNA) sequencing. Every organism demonstrates different DNA sequences with specific nucleotides. It reveals the genetic information carried by a particular DNA segment. Nucleic acid sequencing expresses the evolutionary changes among organisms and revolutionizes disease diagnosis in animals. This paper proposes a generative adversarial networks (GAN) model to create synthetic nucleic acid sequences of the cat genome tuned to exhibit specific desired properties. We obtained the raw sequence data from Illumina next generation sequencing. Various data preprocessing steps were performed using Cutadapt and DADA2 tools. The processed data were fed to the GAN model that was designed following the architecture of Wasserstein GAN with gradient penalty (WGAN-GP). We introduced a predictor and an evaluator in our proposed GAN model to tune the synthetic sequences to acquire certain realistic properties. The predictor was built for extracting samples with a promoter sequence, and the evaluator was built for filtering samples that scored high for motif-matching. The filtered samples were then passed to the discriminator. We evaluated our model based on multiple metrics and demonstrated outputs for latent interpolation, latent complementation, and motif-matching. Evaluation results showed our proposed GAN model achieved 93.7% correlation with the original data and produced significant outcomes as compared to existing models for sequence generation.

摘要

核酸是脱氧核糖核酸(DNA)测序的基本单位。每个生物体都展示出具有特定核苷酸的不同DNA序列。它揭示了特定DNA片段所携带的遗传信息。核酸测序表达了生物体之间的进化变化,并彻底改变了动物疾病的诊断。本文提出了一种生成对抗网络(GAN)模型,以创建经过调整以展现特定所需特性的猫基因组合成核酸序列。我们从Illumina下一代测序中获得了原始序列数据。使用Cutadapt和DADA2工具执行了各种数据预处理步骤。将处理后的数据输入到基于带有梯度惩罚的 Wasserstein GAN(WGAN-GP)架构设计的GAN模型中。我们在提出的GAN模型中引入了一个预测器和一个评估器,以调整合成序列以获得某些现实特性。构建预测器用于提取具有启动子序列的样本,构建评估器用于过滤在基序匹配中得分高的样本。然后将经过过滤的样本传递给鉴别器。我们基于多个指标评估了我们的模型,并展示了潜在插值、潜在互补和基序匹配的输出。评估结果表明,我们提出的GAN模型与原始数据的相关性达到了93.7%,并且与现有的序列生成模型相比产生了显著的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/56e177fd2d8a/ijms-23-03701-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/7cce803f5ed8/ijms-23-03701-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/88c953d8bd10/ijms-23-03701-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/6c8c521b8e5e/ijms-23-03701-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/72514909def4/ijms-23-03701-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/2664a7b52c1d/ijms-23-03701-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/e907831f5f1e/ijms-23-03701-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/8f7d9e0f20e2/ijms-23-03701-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/3ba406ce5dfc/ijms-23-03701-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/56e177fd2d8a/ijms-23-03701-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/7cce803f5ed8/ijms-23-03701-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/88c953d8bd10/ijms-23-03701-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/6c8c521b8e5e/ijms-23-03701-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/72514909def4/ijms-23-03701-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/2664a7b52c1d/ijms-23-03701-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/e907831f5f1e/ijms-23-03701-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/8f7d9e0f20e2/ijms-23-03701-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/3ba406ce5dfc/ijms-23-03701-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8c8/8998662/56e177fd2d8a/ijms-23-03701-g009.jpg

相似文献

1
Generative Adversarial Networks for Creating Synthetic Nucleic Acid Sequences of Cat Genome.用于创建猫基因组合成核酸序列的生成对抗网络。
Int J Mol Sci. 2022 Mar 28;23(7):3701. doi: 10.3390/ijms23073701.
2
Enhancing classification of cells procured from bone marrow aspirate smears using generative adversarial networks and sequential convolutional neural network.利用生成对抗网络和序列卷积神经网络增强骨髓穿刺涂片获取的细胞分类。
Comput Methods Programs Biomed. 2022 Sep;224:107019. doi: 10.1016/j.cmpb.2022.107019. Epub 2022 Jul 10.
3
Synthesis of Microscopic Cell Images Obtained from Bone Marrow Aspirate Smears through Generative Adversarial Networks.通过生成对抗网络合成从骨髓穿刺涂片获得的微观细胞图像。
Biology (Basel). 2022 Feb 10;11(2):276. doi: 10.3390/biology11020276.
4
Parallel Connected Generative Adversarial Network with Quadratic Operation for SAR Image Generation and Application for Classification.并行连接生成对抗网络与二次运算在 SAR 图像生成中的应用及分类。
Sensors (Basel). 2019 Feb 19;19(4):871. doi: 10.3390/s19040871.
5
Synthesizing anonymized and labeled TOF-MRA patches for brain vessel segmentation using generative adversarial networks.使用生成对抗网络对 TOF-MRA 斑块进行匿名和标记,以进行脑部血管分割。
Comput Biol Med. 2021 Apr;131:104254. doi: 10.1016/j.compbiomed.2021.104254. Epub 2021 Feb 15.
6
iEnhancer-GAN: A Deep Learning Framework in Combination with Word Embedding and Sequence Generative Adversarial Net to Identify Enhancers and Their Strength.iEnhancer-GAN:一种结合词嵌入和序列生成对抗网络以识别增强子及其强度的深度学习框架。
Int J Mol Sci. 2021 Mar 30;22(7):3589. doi: 10.3390/ijms22073589.
7
Generative AI with WGAN-GP for boosting seizure detection accuracy.用于提高癫痫发作检测准确性的带有 Wasserstein 生成对抗网络梯度惩罚的生成式人工智能。
Front Artif Intell. 2024 Oct 2;7:1437315. doi: 10.3389/frai.2024.1437315. eCollection 2024.
8
A Hyperspectral Image Classification Method Based on Multi-Discriminator Generative Adversarial Networks.一种基于多判别器生成对抗网络的高光谱图像分类方法。
Sensors (Basel). 2019 Jul 25;19(15):3269. doi: 10.3390/s19153269.
9
Artifact correction in low-dose dental CT imaging using Wasserstein generative adversarial networks.基于 Wasserstein 生成对抗网络的低剂量牙科 CT 成像伪影校正。
Med Phys. 2019 Apr;46(4):1686-1696. doi: 10.1002/mp.13415. Epub 2019 Feb 14.
10
Image denoising by transfer learning of generative adversarial network for dental CT.基于生成对抗网络的迁移学习在牙科 CT 中的图像去噪。
Biomed Phys Eng Express. 2020 Sep 8;6(5):055024. doi: 10.1088/2057-1976/abb068.

引用本文的文献

1
The development of the generative adversarial supporting vector machine for molecular property generation.用于分子性质生成的生成对抗支持向量机的开发。
J Cheminform. 2025 Jul 7;17(1):100. doi: 10.1186/s13321-025-01052-x.
2
The Use of AI for Phenotype-Genotype Mapping.人工智能在表型-基因型映射中的应用。
Methods Mol Biol. 2025;2952:369-410. doi: 10.1007/978-1-0716-4690-8_21.
3
Differential Expression of tRNA-Derived Small RNA Markers of Antidepressant Response and Functional Forecast of Duloxetine in MDD Patients.抑郁症患者中抗抑郁反应的tRNA衍生小RNA标志物的差异表达及度洛西汀的功能预测

本文引用的文献

1
MichiGAN: sampling from disentangled representations of single-cell data using generative adversarial networks.MichiGAN:使用生成对抗网络从单细胞数据的解缠表示中进行采样。
Genome Biol. 2021 May 20;22(1):158. doi: 10.1186/s13059-021-02373-4.
2
Creating artificial human genomes using generative neural networks.使用生成式神经网络创建人工人类基因组。
PLoS Genet. 2021 Feb 4;17(2):e1009303. doi: 10.1371/journal.pgen.1009303. eCollection 2021 Feb.
3
pcPromoter-CNN: A CNN-Based Prediction and Classification of Promoters.pcPromoter-CNN:一种基于 CNN 的启动子预测和分类方法。
Genes (Basel). 2025 Jan 27;16(2):162. doi: 10.3390/genes16020162.
4
SFMBT2 regulates plumage color via serum metabolites in Chinese Anyi tile-like gray chickens.SFMBT2通过血清代谢物调控中国安义瓦灰鸡的羽色。
Poult Sci. 2024 Dec;103(12):104391. doi: 10.1016/j.psj.2024.104391. Epub 2024 Oct 9.
5
DeepBP: A transformer-based model for identifying blood-brain barrier penetrating peptides with data augmentation using feedback GAN.DeepBP:一种基于Transformer的模型,用于通过反馈生成对抗网络进行数据增强来识别血脑屏障穿透肽。
J Adv Res. 2024 Aug 5. doi: 10.1016/j.jare.2024.08.002.
6
Progress of the "Molecular Informatics" Section in 2022.2022 年“分子信息学”分会进展情况。
Int J Mol Sci. 2023 May 29;24(11):9442. doi: 10.3390/ijms24119442.
7
Editorial of Special Issue "Deep Learning and Machine Learning in Bioinformatics".专刊编辑寄语:深度学习与生物信息学中的机器学习
Int J Mol Sci. 2022 Jun 14;23(12):6610. doi: 10.3390/ijms23126610.
Genes (Basel). 2020 Dec 21;11(12):1529. doi: 10.3390/genes11121529.
4
Impact of DNA extraction on whole genome sequencing analysis for characterization and relatedness of Shiga toxin-producing Escherichia coli isolates.DNA 提取对产志贺毒素大肠杆菌分离株的特征分析和相关性的全基因组测序分析的影响。
Sci Rep. 2020 Sep 4;10(1):14649. doi: 10.1038/s41598-020-71207-3.
5
Artificial intelligence in clinical and genomic diagnostics.人工智能在临床和基因组诊断中的应用。
Genome Med. 2019 Nov 19;11(1):70. doi: 10.1186/s13073-019-0689-8.
6
Electrocardiogram generation with a bidirectional LSTM-CNN generative adversarial network.使用双向 LSTM-CNN 生成对抗网络生成心电图。
Sci Rep. 2019 May 1;9(1):6734. doi: 10.1038/s41598-019-42516-z.
7
MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters.MULTiPly:一种用于发现通用和特定类型启动子的新型多层预测器。
Bioinformatics. 2019 Sep 1;35(17):2957-2965. doi: 10.1093/bioinformatics/btz016.
8
iPromoter-2L: a two-layer predictor for identifying promoters and their types by multi-window-based PseKNC.iPromoter-2L:一种双层预测器,通过基于多窗口的 PseKNC 来识别启动子及其类型。
Bioinformatics. 2018 Jan 1;34(1):33-40. doi: 10.1093/bioinformatics/btx579.
9
Zseq: An Approach for Preprocessing Next-Generation Sequencing Data.Zseq:一种用于下一代测序数据预处理的方法。
J Comput Biol. 2017 Aug;24(8):746-755. doi: 10.1089/cmb.2017.0021. Epub 2017 Apr 17.
10
DADA2: High-resolution sample inference from Illumina amplicon data.DADA2:从Illumina扩增子数据进行高分辨率样本推断。
Nat Methods. 2016 Jul;13(7):581-3. doi: 10.1038/nmeth.3869. Epub 2016 May 23.