• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从等位基因包含中学习抗体序列约束条件。

Learning antibody sequence constraints from allelic inclusion.

作者信息

Jagota Milind, Hsu Chloe, Mazumder Thomas, Sung Kevin, DeWitt William S, Listgarten Jennifer, Matsen Frederick A, Ye Chun Jimmie, Song Yun S

机构信息

Computer Science Division, UC Berkeley, Berkeley, CA USA.

Division of Rheumatology, Department of Medicine, UCSF, San Francisco, CA, USA.

出版信息

bioRxiv. 2024 Oct 25:2024.10.22.619760. doi: 10.1101/2024.10.22.619760.

DOI:10.1101/2024.10.22.619760
PMID:39484623
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11526943/
Abstract

Antibodies and B-cell receptors (BCRs) are produced by B cells, and are built of a heavy chain and a light chain. Although each B cell could express two different heavy chains and four different light chains, usually only a unique pair of heavy chain and light chain is expressed-a phenomenon known as . However, a small fraction of naive-B cells violate allelic exclusion by expressing two productive light chains, one of which has impaired function; this has been called . We demonstrate that these B cells can be used to learn constraints on antibody sequence. Using large-scale single-cell sequencing data from humans, we find examples of light chain allelic inclusion in thousands of naive-B cells, which is an order of magnitude larger than existing datasets. We train machine learning models to identify the abnormal sequences in these cells. The resulting models correlate with antibody properties that they were not trained on, including polyreactivity, surface expression, and mutation usage in affinity maturation. These correlations are larger than what is achieved by existing antibody modeling approaches, indicating that allelic inclusion data contains useful new information. We also investigate the impact of similar selection forces on the heavy chain in mouse, and observe that pairing with the surrogate light chain significantly restricts heavy chain diversity.

摘要

抗体和B细胞受体(BCR)由B细胞产生,由一条重链和一条轻链组成。虽然每个B细胞可以表达两种不同的重链和四种不同的轻链,但通常只表达一对独特的重链和轻链——这种现象称为等位基因排斥。然而,一小部分未成熟B细胞通过表达两条有功能的轻链来违反等位基因排斥,其中一条轻链功能受损;这被称为轻链等位基因包含。我们证明这些B细胞可用于了解抗体序列的限制因素。利用来自人类的大规模单细胞测序数据,我们在数千个未成熟B细胞中发现了轻链等位基因包含的例子,这比现有数据集大一个数量级。我们训练机器学习模型来识别这些细胞中的异常序列。所得模型与它们未接受训练的抗体特性相关,包括多反应性、表面表达和亲和力成熟中的突变使用情况。这些相关性比现有抗体建模方法所达到的相关性更大,表明等位基因包含数据包含有用的新信息。我们还研究了类似选择力对小鼠重链的影响,并观察到与替代轻链配对会显著限制重链多样性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f6d/11526943/9e1ff685fc77/nihpp-2024.10.22.619760v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f6d/11526943/76316417d1c8/nihpp-2024.10.22.619760v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f6d/11526943/17c8b0d77056/nihpp-2024.10.22.619760v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f6d/11526943/866914540d42/nihpp-2024.10.22.619760v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f6d/11526943/326470e67b80/nihpp-2024.10.22.619760v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f6d/11526943/9e1ff685fc77/nihpp-2024.10.22.619760v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f6d/11526943/76316417d1c8/nihpp-2024.10.22.619760v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f6d/11526943/17c8b0d77056/nihpp-2024.10.22.619760v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f6d/11526943/866914540d42/nihpp-2024.10.22.619760v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f6d/11526943/326470e67b80/nihpp-2024.10.22.619760v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f6d/11526943/9e1ff685fc77/nihpp-2024.10.22.619760v1-f0005.jpg

相似文献

1
Learning antibody sequence constraints from allelic inclusion.从等位基因包含中学习抗体序列约束条件。
bioRxiv. 2024 Oct 25:2024.10.22.619760. doi: 10.1101/2024.10.22.619760.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.人类常见且大多被称为“非致病性”的单细胞肠道寄生虫的遗传多样性、宿主特异性及公共卫生意义
APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.
4
Learning antibody sequence constraints from allelic inclusion.
Cell Syst. 2025 Sep 17;16(9):101368. doi: 10.1016/j.cels.2025.101368. Epub 2025 Aug 13.
5
Short-Term Memory Impairment短期记忆障碍
6
Genetic determinants of testicular sperm extraction outcomes: insights from a large multicentre study of men with non-obstructive azoospermia.睾丸精子提取结果的遗传决定因素:来自一项针对非梗阻性无精子症男性的大型多中心研究的见解
Hum Reprod Open. 2025 Aug 29;2025(3):hoaf049. doi: 10.1093/hropen/hoaf049. eCollection 2025.
7
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
8
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
9
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
10
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

本文引用的文献

1
Addressing the antibody germline bias and its effect on language models for improved antibody design.解决抗体种系偏倚及其对改善抗体设计的语言模型的影响。
Bioinformatics. 2024 Nov 1;40(11). doi: 10.1093/bioinformatics/btae618.
2
Deep learning predictions of TCR-epitope interactions reveal epitope-specific chains in dual alpha T cells.深度学习预测 TCR-表位相互作用揭示了双α T 细胞中表位特异性链。
Nat Commun. 2024 Apr 13;15(1):3211. doi: 10.1038/s41467-024-47461-8.
3
IgLM: Infilling language modeling for antibody sequence design.IgLM:抗体序列设计的填充语言模型。
Cell Syst. 2023 Nov 15;14(11):979-989.e4. doi: 10.1016/j.cels.2023.10.001. Epub 2023 Oct 30.
4
Efficient evolution of human antibodies from general protein language models.从通用蛋白质语言模型中高效进化出人类抗体。
Nat Biotechnol. 2024 Feb;42(2):275-283. doi: 10.1038/s41587-023-01763-2. Epub 2023 Apr 24.
5
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
6
Toward generalizable prediction of antibody thermostability using machine learning on sequence and structure features.利用序列和结构特征的机器学习方法实现抗体热稳定性的可推广预测。
MAbs. 2023 Jan-Dec;15(1):2163584. doi: 10.1080/19420862.2022.2163584.
7
Structure-based prediction of T cell receptor:peptide-MHC interactions.基于结构的 T 细胞受体-肽-MHC 相互作用预测。
Elife. 2023 Jan 20;12:e82813. doi: 10.7554/eLife.82813.
8
State-of-the-Art Estimation of Protein Model Accuracy Using AlphaFold.利用 AlphaFold 对蛋白质模型精度进行的最新评估。
Phys Rev Lett. 2022 Dec 2;129(23):238101. doi: 10.1103/PhysRevLett.129.238101.
9
An in silico method to assess antibody fragment polyreactivity.一种评估抗体片段多反应性的计算方法。
Nat Commun. 2022 Dec 7;13(1):7554. doi: 10.1038/s41467-022-35276-4.
10
Functional antibodies exhibit light chain coherence.功能性抗体表现出轻链一致性。
Nature. 2022 Nov;611(7935):352-357. doi: 10.1038/s41586-022-05371-z. Epub 2022 Oct 26.