Institut Européen des Membranes, UMR5635, UM, ENSCM, CNRS, 34095 Montpellier, France.
Mécanismes Moléculaires dans les Démences Neurodégénératives, U1198, UM, EPHE, INSERM, 34095 Montpellier, France.
Biosensors (Basel). 2020 Oct 5;10(10):140. doi: 10.3390/bios10100140.
Single nanopore is a powerful platform to detect, discriminate and identify biomacromolecules. Among the different devices, the conical nanopores obtained by the track-etched technique on a polymer film are stable and easy to functionalize. However, these advantages are hampered by their high aspect ratio that avoids the discrimination of similar samples. Using machine learning, we demonstrate an improved resolution so that it can identify short single- and double-stranded DNA (10- and 40-mers). We have characterized each current blockade event by the relative intensity, dwell time, surface area and both the right and left slope. We show an overlap of the relative current blockade amplitudes and dwell time distributions that prevents their identification. We define the different parameters that characterize the events as features and the type of DNA sample as the target. By applying support-vector machines to discriminate each sample, we show accuracy between 50% and 72% by using two features that distinctly classify the data points. Finally, we achieved an increased accuracy (up to 82%) when five features were implemented.
单纳米孔是一种强大的平台,可以用于检测、区分和识别生物大分子。在不同的设备中,通过在聚合物膜上进行轨迹刻蚀技术获得的锥形纳米孔稳定且易于功能化。然而,这些优点受到其高纵横比的限制,这使得类似的样品难以区分。通过机器学习,我们展示了一种改进的分辨率,可以识别短的单链和双链 DNA(10 个和 40 个碱基对)。我们通过相对强度、停留时间、表面积以及左右斜率来描述每个电流阻断事件。我们发现相对电流阻断幅度和停留时间分布存在重叠,这使得它们无法被识别。我们将描述事件的不同参数定义为特征,将 DNA 样本的类型定义为目标。通过应用支持向量机来区分每个样本,我们使用两个明显分类数据点的特征,实现了 50%至 72%的准确率。最后,当使用五个特征时,我们实现了更高的准确率(高达 82%)。