Bac Jonathan, Zinovyev Andrei
Institut Curie, PSL Research University, Paris, France.
Institut National de la Santé et de la Recherche Médicale, U900, Paris, France.
Front Neurorobot. 2020 Jan 9;13:110. doi: 10.3389/fnbot.2019.00110. eCollection 2019.
Machine learning deals with datasets characterized by high dimensionality. However, in many cases, the intrinsic dimensionality of the datasets is surprisingly low. For example, the dimensionality of a robot's perception space can be large and multi-modal but its variables can have more or less complex non-linear interdependencies. Thus multidimensional data point clouds can be effectively located in the vicinity of principal varieties possessing locally small dimensionality, but having a globally complicated organization which is sometimes difficult to represent with regular mathematical objects (such as manifolds). We review modern machine learning approaches for extracting low-dimensional geometries from multi-dimensional data and their applications in various scientific fields.
机器学习处理具有高维特征的数据集。然而,在许多情况下,数据集的内在维度出奇地低。例如,机器人感知空间的维度可能很大且具有多模态,但它的变量可能具有或多或少复杂的非线性相互依赖关系。因此,多维数据点云可以有效地定位在具有局部低维度但全局组织复杂的主要变体附近,而这种全局组织有时难以用常规数学对象(如流形)来表示。我们回顾了从多维数据中提取低维几何结构的现代机器学习方法及其在各个科学领域的应用。