AntiBody Sequence Database.

Author information

Malesys Simon, Torchet Rachel, Saunier Bertrand, Maillet Nicolas

Affiliations

Institut Pasteur, Université Paris Cité, Bioinformatics and Biostatistics Hub, 28 rue du Dr Roux, F-75015 Paris, France.

Unité de Virologie Structurale, Institut Pasteur, Université Paris Cité, CNRS UMR 3569, 28 rue du Dr Roux, 75015 Paris, France.

Publication information

NAR Genom Bioinform. 2024 Dec 18;6(4):lqae171. doi: 10.1093/nargab/lqae171. eCollection 2024 Dec.

Abstract

Antibodies play a crucial role in the humoral immune response against health threats, such as viral infections. Although the theoretical number of human immunoglobulins is well over a trillion, the total number of unique antibody protein sequences accessible in databases is much lower than the number found in a single individual. Training AI (Artificial Intelligence) models, for example to assist in developing serodiagnoses or antibody-based therapies, requires building datasets according to strict criteria to include as many standardized antibody sequences as possible. However, the available sequences are scattered across partially redundant databases, making it difficult to compile them into single non-redundant datasets. Here, we introduce ABSD (AntiBody Sequence Database, https://absd.pasteur.cloud), which contains data from major publicly available resources, creating the largest standardized, automatically updated and non-redundant source of public antibody sequences. This user-friendly and open website enables users to generate lists of antibodies based on selected criteria and download the unique sequence pairs of their variable regions.
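The core difficulty the abstract describes, merging partially redundant sources into one non-redundant set of variable-region sequence pairs, can be illustrated with a minimal sketch. This is not ABSD's actual pipeline: the record keys (`source`, `heavy`, `light`), the source names, and the toy sequences are all assumptions made for illustration, and real standardization would involve more than whitespace and case normalization.

```python
def normalize(seq: str) -> str:
    """Remove whitespace and uppercase, so trivially different spellings compare equal."""
    return "".join(seq.split()).upper()


def merge_non_redundant(records):
    """Collapse records that share the same (heavy, light) variable-region pair.

    Each record is a dict with the hypothetical keys 'source', 'heavy', 'light'.
    Returns a dict mapping each unique (heavy, light) pair to the sorted list
    of sources in which it was seen.
    """
    merged = {}
    for rec in records:
        key = (normalize(rec["heavy"]), normalize(rec["light"]))
        merged.setdefault(key, set()).add(rec["source"])
    return {pair: sorted(sources) for pair, sources in merged.items()}


# Toy records: made-up sequence fragments and source names, not real data.
records = [
    {"source": "db_a", "heavy": "EVQLVESGG", "light": "DIQMTQSPS"},
    {"source": "db_b", "heavy": "evqlvesgg", "light": "DIQMTQ SPS"},
    {"source": "db_b", "heavy": "QVQLQESGP", "light": "EIVLTQSPG"},
]

unique_pairs = merge_non_redundant(records)
for (heavy, light), sources in unique_pairs.items():
    print(heavy, light, sources)
```

Keying on the pair rather than on each chain separately keeps two antibodies distinct when they share a heavy chain but differ in the light chain, and recording the sources preserves provenance after deduplication.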

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23ea/11655285/b5a22ab9e017/lqae171fig1.jpg
