Bullen Robert J, Cornford Dan, Nabney Ian T
Neural Computing Research Group, School of Engineering and Applied Science, Aston University, Aston Triangle, B4 7ET, Birmingham, UK.
Neural Netw. 2003 Apr-May;16(3-4):419-26. doi: 10.1016/S0893-6080(03)00013-3.
Satellite-borne scatterometers are used to measure backscattered micro-wave radiation from the ocean surface. This data may be used to infer surface wind vectors where no direct measurements exist. Inherent in this data are outliers owing to aberrations on the water surface and measurement errors within the equipment. We present two techniques for identifying outliers using neural networks; the outliers may then be removed to improve models derived from the data. Firstly the generative topographic mapping (GTM) is used to create a probability density model; data with low probability under the model may be classed as outliers. In the second part of the paper, a sensor model with input-dependent noise is used and outliers are identified based on their probability under this model.GTM was successfully modified to incorporate prior knowledge of the shape of the observation manifold; however, GTM could not learn the double skinned nature of the observation manifold. To learn this double skinned manifold necessitated the use of a sensor model which imposes strong constraints on the mapping. The results using GTM with a fixed noise level suggested the noise level may vary as a function of wind speed. This was confirmed by experiments using a sensor model with input-dependent noise, where the variation in noise is most sensitive to the wind speed input. Both models successfully identified gross outliers with the largest differences between models occurring at low wind speeds.
星载散射计用于测量来自海洋表面的后向散射微波辐射。在没有直接测量数据的情况下,这些数据可用于推断表面风矢量。由于水面像差和设备内部测量误差,这些数据中存在异常值。我们提出了两种使用神经网络识别异常值的技术;然后可以去除异常值以改进从数据中得出的模型。首先,生成地形映射(GTM)用于创建概率密度模型;在该模型下概率较低的数据可归类为异常值。在论文的第二部分,使用了具有输入相关噪声的传感器模型,并根据数据在该模型下的概率来识别异常值。GTM已成功修改以纳入观测流形形状的先验知识;然而,GTM无法了解观测流形的双层性质。为了了解这种双层流形,需要使用对映射施加强约束的传感器模型。使用固定噪声水平的GTM得到的结果表明,噪声水平可能随风速变化。使用具有输入相关噪声的传感器模型进行的实验证实了这一点,其中噪声变化对风速输入最为敏感。两个模型都成功地识别出了明显的异常值,模型之间的最大差异出现在低风速时。