Nanopore Group, Department of Biomolecular Engineering, University of California, Santa Cruz, CA 95064.
Proc Natl Acad Sci U S A. 2013 Nov 19;110(47):18910-5. doi: 10.1073/pnas.1310615110. Epub 2013 Oct 28.
Cytosine, 5-methylcytosine, and 5-hydroxymethylcytosine were identified during translocation of single DNA template strands through a modified Mycobacterium smegmatis porin A (M2MspA) nanopore under control of phi29 DNA polymerase. This identification was based on three consecutive ionic current states that correspond to passage of modified or unmodified CG dinucleotides and their immediate neighbors through the nanopore limiting aperture. To establish quality scores for these calls, we examined ~3,300 translocation events for 48 distinct DNA constructs. Each experiment analyzed a mixture of cytosine-, 5-methylcytosine-, and 5-hydroxymethylcytosine-bearing DNA strands that contained a marker that independently established the correct cytosine methylation status at the target CG of each molecule tested. To calculate error rates for these calls, we established decision boundaries using a variety of machine-learning methods. These error rates depended upon the identity of the bases immediately 5' and 3' of the targeted CG dinucleotide, and ranged from 1.7% to 12.2% for a single-pass read. We estimate that Q40 values (0.01% error rates) for methylation status calls could be achieved by reading single molecules 5-19 times depending upon sequence context.
在 phi29 DNA 聚合酶的控制下,单链 DNA 模板穿过改良的 Mycobacterium smegmatis porin A (M2MspA) 纳米孔时,鉴定出了胞嘧啶、5-甲基胞嘧啶和 5-羟甲基胞嘧啶。这一鉴定是基于三个连续的离子电流状态,分别对应于修饰或未修饰的 CG 二核苷酸及其紧邻核苷酸通过纳米孔限制孔径的通过。为了为这些检测结果建立质量评分,我们对 48 种不同的 DNA 构建体进行了约 3300 次转位事件的检测。每个实验分析了含有标记物的胞嘧啶、5-甲基胞嘧啶和 5-羟甲基胞嘧啶修饰的 DNA 链的混合物,该标记物可独立确定每个被测试分子的目标 CG 处的正确胞嘧啶甲基化状态。为了计算这些检测的错误率,我们使用各种机器学习方法建立了决策边界。这些错误率取决于目标 CG 二核苷酸 5' 和 3' 处碱基的身份,单轮读取的错误率范围为 1.7%至 12.2%。我们估计,根据序列上下文,通过读取单个分子 5-19 次,可实现 Q40 值(0.01%的错误率)的甲基化状态检测。