Hemadou Audrey, Giudicelli Véronique, Smith Melissa Laird, Lefranc Marie-Paule, Duroux Patrice, Kossida Sofia, Heiner Cheryl, Hepler N Lance, Kuijpers John, Groppi Alexis, Korlach Jonas, Mondon Philippe, Ottones Florence, Jacobin-Valat Marie-Josée, Laroche-Traineau Jeanny, Clofent-Sanchez Gisèle
CRMSB, UMR 5536, CNRS, Bordeaux, France.
IMGT®, The International ImMunoGeneTics Information System®, Laboratoire d'ImmunoGénétique Moléculaire, LIGM, Institut de Génétique Humaine, IGH, UMR 9002, CNRS, Montpellier University, Montpellier, France.
Front Immunol. 2017 Dec 20;8:1796. doi: 10.3389/fimmu.2017.01796. eCollection 2017.
Phage-display selection of immunoglobulin (IG) or antibody single chain Fragment variable (scFv) from combinatorial libraries is widely used for identifying new antibodies for novel targets. Next-generation sequencing (NGS) has recently emerged as a new method for the high throughput characterization of IG and T cell receptor (TR) immune repertoires both and . However, challenges remain for the NGS sequencing of scFv from combinatorial libraries owing to the scFv length (>800 bp) and the presence of two variable domains [variable heavy (VH) and variable light (VL) for IG] associated by a peptide linker in a single chain. Here, we show that single-molecule real-time (SMRT) sequencing with the Pacific Biosciences RS II platform allows for the generation of full-length scFv reads obtained from an selection of scFv-phages in an animal model of atherosclerosis. We first amplified the DNA of the phagemid inserts from scFv-phages eluted from an aortic section at the third round of the selection. From this amplified DNA, 450,558 reads were obtained from 15 SMRT cells. Highly accurate circular consensus sequences from these reads were generated, filtered by quality and then analyzed by IMGT/HighV-QUEST with the functionality for scFv. Full-length scFv were identified and characterized in 348,659 reads. Full-length scFv sequencing is an absolute requirement for analyzing the associated VH and VL domains enriched during the panning rounds. In order to further validate the ability of SMRT sequencing to provide high quality, full-length scFv sequences, we tracked the reads of an scFv-phage clone P3 previously identified by biological assays and Sanger sequencing. Sixty P3 reads showed 100% identity with the full-length scFv of 767 bp, 53 of them covering the whole insert of 977 bp, which encompassed the primer sequences. The remaining seven reads were identical over a shortened length of 939 bp that excludes the vicinity of primers at both ends. Interestingly these reads were obtained from each of the 15 SMRT cells. Thus, the SMRT sequencing method and the IMGT/HighV-QUEST functionality for scFv provides a straightforward protocol for characterization of full-length scFv from combinatorial phage libraries.
从组合文库中通过噬菌体展示筛选免疫球蛋白(IG)或抗体单链可变片段(scFv)被广泛用于鉴定针对新靶点的新抗体。新一代测序(NGS)最近已成为一种用于高通量表征IG和T细胞受体(TR)免疫组库的新方法。然而,由于scFv长度(>800 bp)以及单链中由肽接头连接的两个可变结构域[IG的可变重链(VH)和可变轻链(VL)]的存在,对组合文库中的scFv进行NGS测序仍存在挑战。在此,我们表明,使用太平洋生物科学公司的RS II平台进行单分子实时(SMRT)测序能够从动脉粥样硬化动物模型中筛选的scFv噬菌体中生成全长scFv读数。我们首先从三轮筛选后从主动脉段洗脱的scFv噬菌体中扩增噬菌粒插入片段的DNA。从该扩增的DNA中,15个SMRT细胞获得了450,558条读数。从这些读数中生成了高度准确的环状一致序列,通过质量过滤,然后使用具有scFv功能的IMGT/HighV-QUEST进行分析。在348,659条读数中鉴定并表征了全长scFv。全长scFv测序是分析在淘选轮次中富集的相关VH和VL结构域的绝对必要条件。为了进一步验证SMRT测序提供高质量全长scFv序列的能力,我们追踪了先前通过生物学测定和桑格测序鉴定的scFv噬菌体克隆P3的读数。60条P3读数与767 bp的全长scFv显示100%的同一性,其中53条覆盖了977 bp的整个插入片段,包括引物序列。其余7条读数在缩短至939 bp的长度上相同,该长度排除了两端引物附近区域。有趣的是,这些读数来自15个SMRT细胞中的每一个。因此,SMRT测序方法和针对scFv的IMGT/HighV-QUEST功能为从组合噬菌体文库中表征全长scFv提供了一个直接的方案。