Towards operational phytoplankton recognition with automated high-throughput imaging and compact convolutional neural networks

Eerola, Tuomas; Kraft, Kaisa; Grönberg, Osku; Lensu, Lasse; Suikkanen, Sanna; Seppälä, Jukka; Tamminen, Timo; Kälviäinen, Heikki; Haario, Heikki

doi:https://doi.org/10.5194/os-2020-62

Preprints

https://doi.org/10.5194/os-2020-62

Preprints

08 Jul 2020

| 08 Jul 2020

Status: this preprint was under review for the journal OS but the revision was not accepted.

Towards operational phytoplankton recognition with automated high-throughput imaging and compact convolutional neural networks

Tuomas Eerola, Kaisa Kraft, Osku Grönberg, Lasse Lensu, Sanna Suikkanen, Jukka Seppälä, Timo Tamminen, Heikki Kälviäinen, and Heikki Haario

Abstract. Plankton communities form the basis of aquatic ecosystems and elucidating their role in increasingly important environmental issues is a constantly present research question. The concealed plankton community dynamics reflect changes in environmental forcing, growth traits of competing species, and multiple food web interactions. Recent technological advances have led to the possibility of collecting real-time big data opening new horizons for testing core hypotheses in planktonic systems, derived from macroscopic realms, in community ecology, biodiversity research, and ecosystem functioning. Analyzing the big data calls for computer vision and machine learning methods capable of producing interoperable data across platforms and systems. In this paper we apply convolutional neural networks (CNN) to classify a brackish-water phytoplankton community in the Baltic Sea. For solving the classification task, we utilize compact CNN architectures requiring less computational capacity and creating an opportunity to quickly train the network. This makes it possible to (1) test various modifications to the classification method, and (2) repeat each experiment multiple times with different training and test set combinations to obtain reliable results. We further analyze the effect of large class imbalance to the CNN performance, and test relevant data augmentation techniques to improve the performance. Finally, we address the practical implications of the classification performance to aquatic research by analyzing the confused classes and their effect on the reliability of the automatic plankton recognition system, to guide further development of plankton recognition research. Our results show that it is possible to obtain good classification accuracy with relatively shallow architectures and a small amount of training data when using effective data augmentation methods even with a very unbalanced dataset.

Received: 18 Jun 2020 – Discussion started: 08 Jul 2020

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Tuomas Eerola, Kaisa Kraft, Osku Grönberg, Lasse Lensu, Sanna Suikkanen, Jukka Seppälä, Timo Tamminen, Heikki Kälviäinen, and Heikki Haario

Status: closed

AC: Author comment | RC: Referee comment | SC: Short comment | EC: Editor comment

- Printer-friendly version

- Supplement

RC1: 'Review for "Towards operational phytoplankton recognition with automated high-throughput imaging and compact convolutional neural networks"', Anonymous Referee #1, 04 Aug 2020
- AC1: 'Reply to the comments by Referee #1', Tuomas Eerola, 02 Sep 2020
RC2: 'review', Anonymous Referee #2, 14 Aug 2020
- AC2: 'Reply to the comments by Referee #2', Tuomas Eerola, 02 Sep 2020

Status: closed

AC: Author comment | RC: Referee comment | SC: Short comment | EC: Editor comment

- Printer-friendly version

- Supplement

RC1: 'Review for "Towards operational phytoplankton recognition with automated high-throughput imaging and compact convolutional neural networks"', Anonymous Referee #1, 04 Aug 2020
- AC1: 'Reply to the comments by Referee #1', Tuomas Eerola, 02 Sep 2020
RC2: 'review', Anonymous Referee #2, 14 Aug 2020
- AC2: 'Reply to the comments by Referee #2', Tuomas Eerola, 02 Sep 2020

Tuomas Eerola, Kaisa Kraft, Osku Grönberg, Lasse Lensu, Sanna Suikkanen, Jukka Seppälä, Timo Tamminen, Heikki Kälviäinen, and Heikki Haario

Viewed

Total article views: 1,880 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
1,332	461	87	1,880	113	123

HTML: 1,332
PDF: 461
XML: 87
Total: 1,880
BibTeX: 113
EndNote: 123

Views and downloads (calculated since 08 Jul 2020)

Month	HTML	PDF	XML	Total
Jul 2020	103	59	32	194
Aug 2020	40	9	1	50
Sep 2020	36	13	0	49
Oct 2020	25	6	0	31
Nov 2020	18	6	1	25
Dec 2020	21	5	1	27
Jan 2021	16	13	1	30
Feb 2021	14	3	0	17
Mar 2021	18	6	0	24
Apr 2021	6	5	0	11
May 2021	13	14	0	27
Jun 2021	11	7	0	18
Jul 2021	3	3	0	6
Aug 2021	8	6	0	14
Sep 2021	6	6	1	13
Oct 2021	12	30	0	42
Nov 2021	21	20	1	42
Dec 2021	14	7	0	21
Jan 2022	14	7	0	21
Feb 2022	11	2	0	13
Mar 2022	7	8	1	16
Apr 2022	8	2	0	10
May 2022	7	1	1	9
Jun 2022	12	12	4	28
Jul 2022	6	1	0	7
Aug 2022	4	4	0	8
Sep 2022	6	5	1	12
Oct 2022	12	2	2	16
Nov 2022	9	2	0	11
Dec 2022	14	5	0	19
Jan 2023	4	5	0	9
Feb 2023	12	7	0	19
Mar 2023	13	7	0	20
Apr 2023	5	6	0	11
May 2023	7	2	1	10
Jun 2023	6	6	1	13
Jul 2023	11	4	1	16
Aug 2023	4	2	0	6
Sep 2023	13	6	4	23
Oct 2023	23	10	1	34
Nov 2023	20	1	0	21
Dec 2023	35	5	3	43
Jan 2024	23	5	0	28
Feb 2024	30	7	1	38
Mar 2024	25	4	4	33
Apr 2024	14	4	1	19
May 2024	14	3	3	20
Jun 2024	31	5	2	38
Jul 2024	12	5	4	21
Aug 2024	15	8	1	24
Sep 2024	18	9	1	28
Oct 2024	9	9	1	19
Nov 2024	5	5	1	11
Dec 2024	5	5	0	10
Jan 2025	11	7	0	18
Feb 2025	15	3	0	18
Mar 2025	12	5	1	18
Apr 2025	8	3	0	11
May 2025	10	7	2	19
Jun 2025	22	8	2	32
Jul 2025	18	9	3	30
Aug 2025	50	11	0	61
Sep 2025	327	7	2	336
Oct 2025	10	2	0	12

Cumulative views and downloads (calculated since 08 Jul 2020)

Month	HTML	PDF	XML	Total
Jul 2020	103	59	32	194
Aug 2020	40	9	1	50
Sep 2020	36	13	0	49
Oct 2020	25	6	0	31
Nov 2020	18	6	1	25
Dec 2020	21	5	1	27
Jan 2021	16	13	1	30
Feb 2021	14	3	0	17
Mar 2021	18	6	0	24
Apr 2021	6	5	0	11
May 2021	13	14	0	27
Jun 2021	11	7	0	18
Jul 2021	3	3	0	6
Aug 2021	8	6	0	14
Sep 2021	6	6	1	13
Oct 2021	12	30	0	42
Nov 2021	21	20	1	42
Dec 2021	14	7	0	21
Jan 2022	14	7	0	21
Feb 2022	11	2	0	13
Mar 2022	7	8	1	16
Apr 2022	8	2	0	10
May 2022	7	1	1	9
Jun 2022	12	12	4	28
Jul 2022	6	1	0	7
Aug 2022	4	4	0	8
Sep 2022	6	5	1	12
Oct 2022	12	2	2	16
Nov 2022	9	2	0	11
Dec 2022	14	5	0	19
Jan 2023	4	5	0	9
Feb 2023	12	7	0	19
Mar 2023	13	7	0	20
Apr 2023	5	6	0	11
May 2023	7	2	1	10
Jun 2023	6	6	1	13
Jul 2023	11	4	1	16
Aug 2023	4	2	0	6
Sep 2023	13	6	4	23
Oct 2023	23	10	1	34
Nov 2023	20	1	0	21
Dec 2023	35	5	3	43
Jan 2024	23	5	0	28
Feb 2024	30	7	1	38
Mar 2024	25	4	4	33
Apr 2024	14	4	1	19
May 2024	14	3	3	20
Jun 2024	31	5	2	38
Jul 2024	12	5	4	21
Aug 2024	15	8	1	24
Sep 2024	18	9	1	28
Oct 2024	9	9	1	19
Nov 2024	5	5	1	11
Dec 2024	5	5	0	10
Jan 2025	11	7	0	18
Feb 2025	15	3	0	18
Mar 2025	12	5	1	18
Apr 2025	8	3	0	11
May 2025	10	7	2	19
Jun 2025	22	8	2	32
Jul 2025	18	9	3	30
Aug 2025	50	11	0	61
Sep 2025	327	7	2	336
Oct 2025	10	2	0	12

Viewed (geographical distribution)

Total article views: 1,746 (including HTML, PDF, and XML) Thereof 1,745 with geography defined and 1 with unknown origin.

Country	#	Views	%

Cited

Latest update: 17 Oct 2025

Short summary

The role of plankton communities in important environmental issues is an active research question. Large amounts of plankton images collected using modern devices call for automated analysis methods. We consider classification of phytoplanktons using compact convolutional neural networks allowing fast model training. We analyse the confused classes and their practical implications to aquatic research. We show that good accuracy can be obtained with a limited amount of unbalanced training data.


Total:	0
HTML:	0
PDF:	0
XML:	0

Towards operational phytoplankton recognition with automated high-throughput imaging and compact convolutional neural networks

Viewed

Viewed (geographical distribution)

Cited

3 citations as recorded by crossref.