Shemiakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Science, Moscow, Russia.
Center of Molecular Medicine, CEITEC, Masaryk University, Brno, Czech Republic.
Nat Protoc. 2016 Sep;11(9):1599-616. doi: 10.1038/nprot.2016.093. Epub 2016 Aug 4.
High-throughput sequencing analysis of hypermutating immunoglobulin (IG) repertoires remains a challenging task. Here we present a robust protocol for the full-length profiling of human and mouse IG repertoires. This protocol uses unique molecular identifiers (UMIs) introduced in the course of cDNA synthesis to control bottlenecks and to eliminate PCR and sequencing errors. Using asymmetric 400+100-nt paired-end Illumina sequencing and UMI-based assembly with the new version of the MIGEC software, the protocol allows up to 750-nt lengths to be sequenced in an almost error-free manner. This sequencing approach should also be applicable to various tasks beyond immune repertoire studies. In IG profiling, the achieved length of high-quality sequence covers the variable region of even the longest chains, along with the fragment of a constant region carrying information on the antibody isotype. The whole protocol, including preparation of cells and libraries, sequencing and data analysis, takes 5 to 6 d.
高通量测序分析高突变免疫球蛋白 (IG) 库仍然是一项具有挑战性的任务。在这里,我们提出了一个用于全长人源和鼠源 IG 库分析的稳健方案。该方案在 cDNA 合成过程中引入独特分子标识符 (UMI) 来控制瓶颈,并消除 PCR 和测序错误。使用不对称的 400+100-nt 配对末端 Illumina 测序和基于 UMI 的新版本 MIGEC 软件组装,该方案可近乎无错误地对长达 750-nt 的序列进行测序。这种测序方法也应该适用于免疫库研究以外的各种任务。在 IG 分析中,所获得的高质量序列长度覆盖了即使是最长链的可变区,以及携带抗体同种型信息的恒定区片段。整个方案,包括细胞和文库的制备、测序和数据分析,需要 5 到 6 天。