Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801;
Robotics Institute, Carnegie Mellon University, Pittsburgh, PA 15213.
Proc Natl Acad Sci U S A. 2020 Nov 10;117(45):28496-28505. doi: 10.1073/pnas.2007324117. Epub 2020 Oct 23.
Taxonomic resolution is a major challenge in palynology, largely limiting the ecological and evolutionary interpretations possible with deep-time fossil pollen data. We present an approach for fossil pollen analysis that uses optical superresolution microscopy and machine learning to create a quantitative and higher throughput workflow for producing palynological identifications and hypotheses of biological affinity. We developed three convolutional neural network (CNN) classification models: maximum projection (MPM), multislice (MSM), and fused (FM). We trained the models on the pollen of 16 genera of the legume tribe Amherstieae, and then used these models to constrain the biological classifications of 48 fossil specimens from the Paleocene, Eocene, and Miocene of western Africa and northern South America. All models achieved average accuracies of 83 to 90% in the classification of the extant genera, and the majority of fossil identifications (86%) showed consensus among at least two of the three models. Our fossil identifications support the paleobiogeographic hypothesis that Amherstieae originated in Paleocene Africa and dispersed to South America during the Paleocene-Eocene Thermal Maximum (56 Ma). They also raise the possibility that at least three Amherstieae genera (, , and ) may have diverged earlier in the Cenozoic than predicted by molecular phylogenies.
分类分辨率是孢粉学中的一个主要挑战,在很大程度上限制了利用古花粉数据进行生态和进化解释的可能性。我们提出了一种用于化石花粉分析的方法,该方法结合了光学超分辨率显微镜和机器学习,创建了一种定量的、高通量的工作流程,用于生成孢粉学鉴定和生物亲缘关系的假说。我们开发了三个卷积神经网络(CNN)分类模型:最大投影(MPM)、多切片(MSM)和融合(FM)。我们在豆科植物族 Amherstieae 的 16 个属的花粉上对模型进行了训练,然后使用这些模型来限制来自西非和南美洲北部古近纪、始新世和中新世的 48 个化石样本的生物分类。所有模型在分类现有属时的平均准确率达到 83%到 90%,并且大多数化石鉴定(86%)在三个模型中的至少两个中显示出共识。我们的化石鉴定支持 Amherstieae 起源于古新世非洲并在古新世-始新世极热事件期间(56 Ma)扩散到南美洲的古生物地理假说。它们还提出了这样一种可能性,即在新生代,至少有三个 Amherstieae 属(、和)的分化时间可能比分子系统发育预测的更早。