Institut Lumière Matière, UMR5306, Université Lyon 1-CNRS, Université de Lyon, Villeurbanne, France.
Laboratoire Matière et Systèmes Complexes, UMR7057, Université Paris Cité-CNRS, Paris, France.
PLoS One. 2023 Feb 16;18(2):e0281931. doi: 10.1371/journal.pone.0281931. eCollection 2023.
Mechanical cues such as stresses and strains are now recognized as essential regulators in many biological processes like cell division, gene expression or morphogenesis. Studying the interplay between these mechanical cues and biological responses requires experimental tools to measure these cues. In the context of large scale tissues, this can be achieved by segmenting individual cells to extract their shapes and deformations which in turn inform on their mechanical environment. Historically, this has been done by segmentation methods which are well known to be time consuming and error prone. In this context however, one doesn't necessarily require a cell-level description and a coarse-grained approach can be more efficient while using tools different from segmentation. The advent of machine learning and deep neural networks has revolutionized the field of image analysis in recent years, including in biomedical research. With the democratization of these techniques, more and more researchers are trying to apply them to their own biological systems. In this paper, we tackle a problem of cell shape measurement thanks to a large annotated dataset. We develop simple Convolutional Neural Networks (CNNs) which we thoroughly optimize in terms of architecture and complexity to question construction rules usually applied. We find that increasing the complexity of the networks rapidly no longer yields improvements in performance and that the number of kernels in each convolutional layer is the most important parameter to achieve good results. In addition, we compare our step-by-step approach with transfer learning and find that our simple, optimized CNNs give better predictions, are faster in training and analysis and don't require more technical knowledge to be implemented. Overall, we offer a roadmap to develop optimized models and argue that we should limit the complexity of such models. We conclude by illustrating this strategy on a similar problem and dataset.
机械线索,如应力和应变,现在被认为是许多生物学过程(如细胞分裂、基因表达或形态发生)的重要调节因素。研究这些机械线索与生物反应之间的相互作用需要实验工具来测量这些线索。在大规模组织的背景下,可以通过分割单个细胞来提取它们的形状和变形来实现这一点,这反过来又可以告知它们的机械环境。从历史上看,这是通过分割方法来实现的,这些方法众所周知是耗时且容易出错的。然而,在这种情况下,不一定需要细胞级别的描述,并且使用不同于分割的工具的粗粒度方法可以更有效。机器学习和深度神经网络的出现近年来彻底改变了图像分析领域,包括生物医学研究。随着这些技术的民主化,越来越多的研究人员试图将它们应用于自己的生物系统。在本文中,我们通过一个大型注释数据集解决了细胞形状测量的问题。我们开发了简单的卷积神经网络(CNN),我们根据架构和复杂性对其进行了彻底的优化,以质疑通常应用的构建规则。我们发现,增加网络的复杂性不再能迅速提高性能,并且每个卷积层中的核数是实现良好结果的最重要参数。此外,我们将我们的逐步方法与迁移学习进行了比较,发现我们简单、优化的 CNN 可以做出更好的预测,在训练和分析中更快,并且不需要更多的技术知识来实现。总体而言,我们提供了开发优化模型的路线图,并认为我们应该限制此类模型的复杂性。最后,我们通过在类似的问题和数据集上进行说明来总结这一策略。