TY - JOUR
T1 - Airborne pollen grain detection from partially labelled data utilising semi-supervised learning
AU - Jin, Benjamin
AU - Milling, Manuel
AU - Plaza, Maria Pilar
AU - Brunner, Jens O
AU - Traidl-Hoffmann, Claudia
AU - Schuller, Björn W
AU - Damialis, Athanasios
N1 - Copyright © 2023 Elsevier B.V. All rights reserved.
PY - 2023/9/15
Y1 - 2023/9/15
N2 - Airborne pollen monitoring has been conducted for more than a century now, as knowledge of the quantity and periodicity of airborne pollen has diverse use cases, like reconstructing historic climates and tracking current climate change, forensic applications, and up to warning those affected by pollen-induced respiratory allergies. Hence, related work on automation of pollen classification already exists. In contrast, detection of pollen is still conducted manually, and it is the gold standard for accuracy. So, here we used a new-generation, automated, near-real-time pollen monitoring sampler, the BAA500, and we used data consisting of both raw and synthesised microscope images. Apart from the automatically generated, commercially-labelled data of all pollen taxa, we additionally used manual corrections to the pollen taxa, as well as a manually created test set of bounding boxes and pollen taxa, so as to more accurately evaluate the real-life performance. For the pollen detection, we employed two-stage deep neural network object detectors. We explored a semi-supervised training scheme to remedy the partial labelling. Using a teacher-student approach, the model can add pseudo-labels to complete the labelling during training. To evaluate the performance of our deep learning algorithms and to compare them to the commercial algorithm of the BAA500, we created a manual test set, in which an expert aerobiologist corrected automatically annotated labels. For the novel manual test set, both the supervised and semi-supervised approaches clearly outperform the commercial algorithm with an F1 score of up to 76.9 % compared to 61.3 %. On an automatically created and partially labelled test dataset, we obtain a maximum mAP of 92.7 %. Additional experiments on raw microscope images show comparable performance for the best models, which potentially justifies reducing the complexity of the image generation process. Our results bring automatic pollen monitoring a step forward, as they close the gap in pollen detection performance between manual and automated procedure.
AB - Airborne pollen monitoring has been conducted for more than a century now, as knowledge of the quantity and periodicity of airborne pollen has diverse use cases, like reconstructing historic climates and tracking current climate change, forensic applications, and up to warning those affected by pollen-induced respiratory allergies. Hence, related work on automation of pollen classification already exists. In contrast, detection of pollen is still conducted manually, and it is the gold standard for accuracy. So, here we used a new-generation, automated, near-real-time pollen monitoring sampler, the BAA500, and we used data consisting of both raw and synthesised microscope images. Apart from the automatically generated, commercially-labelled data of all pollen taxa, we additionally used manual corrections to the pollen taxa, as well as a manually created test set of bounding boxes and pollen taxa, so as to more accurately evaluate the real-life performance. For the pollen detection, we employed two-stage deep neural network object detectors. We explored a semi-supervised training scheme to remedy the partial labelling. Using a teacher-student approach, the model can add pseudo-labels to complete the labelling during training. To evaluate the performance of our deep learning algorithms and to compare them to the commercial algorithm of the BAA500, we created a manual test set, in which an expert aerobiologist corrected automatically annotated labels. For the novel manual test set, both the supervised and semi-supervised approaches clearly outperform the commercial algorithm with an F1 score of up to 76.9 % compared to 61.3 %. On an automatically created and partially labelled test dataset, we obtain a maximum mAP of 92.7 %. Additional experiments on raw microscope images show comparable performance for the best models, which potentially justifies reducing the complexity of the image generation process. Our results bring automatic pollen monitoring a step forward, as they close the gap in pollen detection performance between manual and automated procedure.
KW - Humans
KW - Pollen
KW - Supervised Machine Learning
KW - Rhinitis, Allergic, Seasonal
KW - Algorithms
KW - Climate Change
U2 - 10.1016/j.scitotenv.2023.164295
DO - 10.1016/j.scitotenv.2023.164295
M3 - Article
C2 - 37211136
SN - 0048-9697
VL - 891
JO - Science of the Total Environment
JF - Science of the Total Environment
M1 - 164295
ER -