Knecht Avi C, Campbell Malachy T, Caprez Adam, Swanson David R, Walia Harkamal
University of Nebraska-Lincoln, Holland Computing Center, Lincoln, NE 68583, USA.
University of Nebraska-Lincoln, Department of Agronomy and Horticulture, Lincoln, NE 68583, USA.
J Exp Bot. 2016 May;67(11):3587-99. doi: 10.1093/jxb/erw176. Epub 2016 May 3.
High-throughput plant phenotyping is an effective approach to bridge the genotype-to-phenotype gap in crops. Phenomics experiments typically result in large-scale image datasets, which are not amenable for processing on desktop computers, thus creating a bottleneck in the image-analysis pipeline. Here, we present an open-source, flexible image-analysis framework, called Image Harvest (IH), for processing images originating from high-throughput plant phenotyping platforms. Image Harvest is developed to perform parallel processing on computing grids and provides an integrated feature for metadata extraction from large-scale file organization. Moreover, the integration of IH with the Open Science Grid provides academic researchers with the computational resources required for processing large image datasets at no cost. Image Harvest also offers functionalities to extract digital traits from images to interpret plant architecture-related characteristics. To demonstrate the applications of these digital traits, a rice (Oryza sativa) diversity panel was phenotyped and genome-wide association mapping was performed using digital traits that are used to describe different plant ideotypes. Three major quantitative trait loci were identified on rice chromosomes 4 and 6, which co-localize with quantitative trait loci known to regulate agronomically important traits in rice. Image Harvest is an open-source software for high-throughput image processing that requires a minimal learning curve for plant biologists to analyzephenomics datasets.
高通量植物表型分析是弥合作物基因型与表型差距的有效方法。表型组学实验通常会产生大规模图像数据集,这些数据集不适合在台式计算机上进行处理,从而在图像分析流程中形成了瓶颈。在此,我们提出了一个名为图像采集(IH)的开源、灵活的图像分析框架,用于处理源自高通量植物表型分析平台的图像。图像采集旨在在计算网格上进行并行处理,并为从大规模文件组织中提取元数据提供了一个集成功能。此外,将图像采集与开放科学网格集成,为学术研究人员免费提供了处理大型图像数据集所需的计算资源。图像采集还提供了从图像中提取数字性状以解释与植物结构相关特征的功能。为了展示这些数字性状的应用,对一个水稻(Oryza sativa)多样性群体进行了表型分析,并使用用于描述不同植物理想型的数字性状进行了全基因组关联作图。在水稻第4和第6号染色体上鉴定出了三个主要数量性状位点,它们与已知调控水稻重要农艺性状的数量性状位点共定位。图像采集是一款用于高通量图像处理的开源软件,植物生物学家分析表型组学数据集时所需的学习曲线很平缓。