Document of bibliographic reference 361942

BibliographicReference record

Type
Bibliographic resource
Type of document
Journal article
BibLvlCode
AS
Title
Towards operational phytoplankton recognition with automated high-throughput imaging, near-real-time data processing, and convolutional neural networks
Abstract
Plankton communities form the basis of aquatic ecosystems, and elucidating their role in increasingly important environmental issues is a persistent research question. Recent technological advances in automated microscopic imaging, together with cloud platforms for high-performance computing, have created possibilities for collecting and processing detailed high-frequency data on planktonic communities, opening new horizons for testing core hypotheses in aquatic ecosystems. Analyzing continuous streams of big data calls for the development and deployment of novel computer vision and machine learning systems. The implementation of these analysis systems is not always straightforward with regard to operationality, and issues regarding data flows, computing and data treatment need to be considered. We created a data pipeline for automated near-real-time classification of phytoplankton during remote deployment of an imaging flow cytometer (Imaging FlowCytobot, IFCB). A convolutional neural network (CNN) is used to classify continuous imaging data, with probability thresholds used to filter out images not belonging to our existing classes. The automated data flow and classification system were used to monitor dominating species of filamentous cyanobacteria on the coast of Finland during summer 2021. We demonstrate that good phytoplankton recognition can be achieved with transfer learning utilizing a relatively shallow, publicly available, pre-trained CNN model and fine-tuning it with community-specific phytoplankton images (overall F1-score of 0.95 for the test set of our labeled image data complemented with a 50% unclassifiable image portion). This enables both fast training and low computing resource requirements for model deployment, making the system easy to modify and applicable in a wide range of situations. The system performed well when used to classify a natural phytoplankton community over different seasons (overall F1-score 0.82 for our evaluation data set).
Furthermore, we address the key challenges of image classification for varying planktonic communities and analyze the practical implications of confused classes. We published our labeled image data set of the Baltic Sea phytoplankton community for the training of image recognition models (~63000 images in 50 classes) to accelerate implementation of imaging systems for other brackish and freshwater communities. Our evaluation data set, 59 fully annotated samples of natural communities throughout an annual cycle, is also available for model testing purposes (~150000 images).
WebOfScience code
https://www.webofscience.com/wos/woscc/full-record/WOS:000855100200001
Bibliographic citation
Kraft, K.; Velhonoja, O.; Eerola, T.; Suikkanen, S.; Tamminen, T.; Haraguchi, L.; Ylöstalo, P.; Kielosto, S.; Johansson, M.; Lensu, L.; Kälviäinen, H.; Haario, H.; Seppälä, J. (2022). Towards operational phytoplankton recognition with automated high-throughput imaging, near-real-time data processing, and convolutional neural networks. Front. Mar. Sci. 9: 867695. https://dx.doi.org/10.3389/fmars.2022.867695
Is peer reviewed
true
Access rights
open access
Is accessible for free
true

Authors

author
Name
Kaisa Kraft
author
Name
Otso Velhonoja
author
Name
Tuomas Eerola
author
Name
Sanna Suikkanen
author
Name
Timo Tamminen
author
Name
Lumi Haraguchi
author
Name
Pasi Ylöstalo
author
Name
Sami Kielosto
author
Name
Milla Johansson
author
Name
Lasse Lensu
author
Name
Heikki Kälviäinen
author
Name
Heikki Haario
author
Name
Jukka Seppälä

Links

referenced creativework
type
DOI
accessURL
https://dx.doi.org/10.3389/fmars.2022.867695

Document metadata

date created
2023-03-09
date modified
2023-05-23