Suppr超能文献

在合成基因中嵌入永久性水印。

Embedding permanent watermarks in synthetic genes.

机构信息

Life Technologies/Geneart AG, Regensburg, Germany.

出版信息

PLoS One. 2012;7(8):e42465. doi: 10.1371/journal.pone.0042465. Epub 2012 Aug 8.

Abstract

As synthetic biology advances, labeling of genes or organisms, like other high-value products, will become important not only to pinpoint their identity, origin, or spread, but also for intellectual property, classification, bio-security or legal reasons. Ideally information should be inseparably interlaced into expressed genes. We describe a method for embedding messages within open reading frames of synthetic genes by adapting steganographic algorithms typically used for watermarking digital media files. Text messages are first translated into a binary string, and then represented in the reading frame by synonymous codon choice. To aim for good expression of the labeled gene in its host as well as retain a high degree of codon assignment flexibility for gene optimization, codon usage tables of the target organism are taken into account. Preferably amino acids with 4 or 6 synonymous codons are used to comprise binary digits. Several different messages were embedded into open reading frames of T7 RNA polymerase, GFP, human EMG1 and HIV gag, variously optimized for bacterial, yeast, mammalian or plant expression, without affecting their protein expression or function. We also introduced Vigenère polyalphabetic substitution to cipher text messages, and developed an identifier as a key to deciphering codon usage ranking stored for a specific organism within a sequence of 35 nucleotides.

摘要

随着合成生物学的发展,对基因或生物体进行标记,就像其他高价值产品一样,不仅对于确定其身份、来源或传播至关重要,而且对于知识产权、分类、生物安全或法律原因也是如此。理想情况下,信息应该不可分割地交织在表达基因中。我们描述了一种通过适应通常用于数字媒体文件水印的隐写算法在合成基因的开放阅读框中嵌入消息的方法。文本消息首先被翻译成二进制字符串,然后通过同义密码子选择在阅读框中表示。为了在宿主中良好表达标记基因,并保留基因优化的高度密码子分配灵活性,我们考虑了目标生物体的密码子使用表。最好使用具有 4 或 6 个同义密码子的氨基酸来组成二进制数字。我们将几个不同的消息嵌入到 T7 RNA 聚合酶、GFP、人 EMG1 和 HIV gag 的开放阅读框中,这些基因经过各种优化,可在细菌、酵母、哺乳动物或植物中表达,而不影响其蛋白质表达或功能。我们还引入了维吉尼亚多字母替换密码来加密文本消息,并开发了一种标识符作为对存储在 35 个核苷酸序列中特定生物体的密码子使用排名进行解码的密钥。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f829/3414517/d736e15a6deb/pone.0042465.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验