Software Engineering Department, Institute of Informatics, University of Szeged, Dugonics tér 13, Szeged, 6720, Hungary.
European XFEL GmbH, Holzkoppel 4, 22869, Schenefeld, Germany.
Sci Rep. 2023 Jun 9;13(1):9370. doi: 10.1038/s41598-023-36456-y.
Spectroscopy and X-ray diffraction techniques encode ample information on investigated samples. The ability of rapidly and accurately extracting these enhances the means to steer the experiment, as well as the understanding of the underlying processes governing the experiment. It improves the efficiency of the experiment, and maximizes the scientific outcome. To address this, we introduce and validate three frameworks based on self-supervised learning which are capable of classifying 1D spectral curves using data transformations preserving the scientific content and only a small amount of data labeled by domain experts. In particular, in this work we focus on the identification of phase transitions in samples investigated by x-ray powder diffraction. We demonstrate that the three frameworks, based either on relational reasoning, contrastive learning, or a combination of the two, are capable of accurately identifying phase transitions. Furthermore, we discuss in detail the selection of data augmentation techniques, crucial to ensure that scientifically meaningful information is retained.
光谱和 X 射线衍射技术对研究样本进行了充分的编码。快速准确地提取这些信息的能力提高了引导实验的手段,以及对控制实验的基本过程的理解。它提高了实验的效率,并最大限度地提高了科学成果。为了解决这个问题,我们引入并验证了三个基于自我监督学习的框架,这些框架能够使用数据转换对 1D 光谱曲线进行分类,这些数据转换保留了科学内容,并且只需要少量由领域专家标记的数据。特别是,在这项工作中,我们专注于识别 X 射线粉末衍射研究中样品的相变。我们证明,基于关系推理、对比学习或两者结合的三个框架能够准确识别相变。此外,我们详细讨论了数据增强技术的选择,这对于确保保留有意义的科学信息至关重要。