    {"datasetrec":{"DasID":8904,"Acronym":null,"StandardTitle":"LifeWatch observatory data: phytoplankton annotated trainingset by FlowCam imaging in the Belgian Part of the North Sea","OrigTitle":null,"OrigTitleLangID":null,"OrigTitleLangCode":null,"OrigTitleLang":null,"OrigTitleLangNL":null,"VersionName":"v2","ContactEmail":null,"VersionDate":"Aug  4 2025 10:00PM","VersionDay":5,"VersionMonth":8,"VersionYear":2025,"SizeReference":null,"EngAbstract":"<h3>Training dataset</h3><p>The images were collected in the framework of the Belgian Lifewatch Research Infrastructure. During multidisciplinary campaigns, a number of fixed stations in the Belgian Part of the North Sea (BPNS) are visited on a monthly (onshore stations) or seasonal (offshore stations) basis. Samples are taken using a 55µm mesh size Apstein net and fixed in Lugol's iodine solution. In the lab, the samples are processed using a VS-4 FlowCAM model at 4X magnification targeting a particle size range of 55-300µm. The identification of the image data is done with the use of a CNN and followed by a manual validation step. Since May 2017, this dataset has provided micro- and phytoplankton observations, mainly covering diatoms, dinoflagellates and cilliates, for the Belgian Part of the North Sea (BPNS).</p><p>This dataset comprises a trainings datasplit of 337,514 images distributed across 95 classes, with each class containing a minimum of 100 and a maximum of 10,000 images. The goal of this dataset is to be able to facilitate model training, here we have organized the data into a standard split, with 80% allocated for training, 10% for validation, and another 10% for testing purposes. This dataset structure ensures a balanced representation and supports scientific rigor in subsequent analyses.</p>","EngDescr":"<h3>Technical details&nbsp;</h3><h4>Data preprocessing</h4><p>Raw FlowCam output data is fully processed using in-house datapipelines, the VisualSpreadsheet software is only used for data acquisition during the lab run of the sample. Raw images and binary images are never saved during the FlowCam run, we only work on the image collages saved at the end of the run. Single images are cut from these collages using each image coordinates width and height pulled from the .lst file using in-house python code. The background of the images is not removed. These images are then predicted and annotated in-house at VLIZ.</p><h4>Data splitting</h4><p>The training dataset is 80% used for training, 10% for validation and 10% for prediction.&nbsp;</p><h4>Classes, labels and annotations</h4><p>The dataset comprises 337,514 images distributed across 95 classes, with each class containing a minimum of 100 and a maximum of 10,000 images. Taxonomic coverage of the dataset comprises mainly of diatoms, dinoflagellates and cilliates, but to a lesser extent also zooplankton and other protists.</p><h4>Parameters</h4><p>The images are read using cv2.imread and the values are used as parameters.</p><p>Metadata Parameter Descriptions:</p><ul><li>&nbsp; &nbsp;image_path: The relative path to the image file showing the plankton or particle, usually structured by taxon or project folder.</li><li>&nbsp; &nbsp;sample_datetime: The exact date and time (in YYYY-MM-DD HH:MM:SS.sss format) when the sample was acquired using the imaging instrument.</li><li>&nbsp; &nbsp;flowcam_version: The software or hardware version of the FlowCAM instrument used to capture the image and process the sample.</li><li>&nbsp; &nbsp;station: The sampling location code where the image was taken. These station codes typically refer to predefined geographic or monitoring points in the field.</li><li>&nbsp; &nbsp;accepted_label: The final validated taxonomic label (e.g., genus or species) assigned to the organism or particle in the image, often based on expert review.</li><li>&nbsp; &nbsp;accepted_aphia_id: The unique identifier (AphiaID) corresponding to the accepted taxonomic label in the World Register of Marine Species (WoRMS) database, which ensures standardized taxonomic reference.</li><li>&nbsp; &nbsp;original_reference_id: A unique identifier (often a UUID) assigned to the image or sample in the original classification system (e.g., EcoTaxa), useful for traceability and linking back to the source record.</li></ul><h4>Data sources</h4><p>Images are collected during the monthly monitoring of phytoplankton communities in the Belgian Part of the North Sea during the LifeWatch multidisciplinary campaigns by FlowCam VS-4 benchmodel (Fluid Imaging Technologies, Yarmouth, Maine, U.S.A.).</p><h4>Data quality</h4><p>All images are predicted and subsequently manually validated to ensure the quality of the trainingset.</p><h4>Image resolution</h4><p>The size range imaged is 55-300µm. Images are acquired using a Sony XCD SC90 digital gray-scale camera. Images are during training of CNN resized to 100px by 100px.</p><h4>Spatial coverage&nbsp;</h4><p>The data comes from a number of fixed stations in the Belgian Part of the North Sea (BPNS).&nbsp;</p><p>Nine stations onshore are visited monthly:</p><figure class=\"table\"><table><tbody><tr><td><strong>Station</strong></td><td><strong>Longitude</strong></td><td><strong>Latitude</strong></td></tr><tr><td>130</td><td>2.90535</td><td>51.27055</td></tr><tr><td>780</td><td>3.057283</td><td>51.471367</td></tr><tr><td>330</td><td>2.809083</td><td>51.434117</td></tr><tr><td>230</td><td>2.85035</td><td>51.308683</td></tr><tr><td>710</td><td>3.138283</td><td>51.441217</td></tr><tr><td>215</td><td>2.61075</td><td>51.274867</td></tr><tr><td>ZG02</td><td>2.500717</td><td>51.33515</td></tr><tr><td>120</td><td>2.702483</td><td>51.186083</td></tr><tr><td>700</td><td>3.221017</td><td>51.377</td></tr></tbody></table></figure><p>Eight additional offshore stations are visited seasonally:</p><figure class=\"table\"><table><tbody><tr><td><strong>Station</strong></td><td><strong>Longitude</strong></td><td><strong>Latitude</strong></td></tr><tr><td>LW01</td><td>2.256</td><td>51.568667</td></tr><tr><td>LW02</td><td>2.556</td><td>51.8</td></tr><tr><td>435</td><td>2.790333</td><td>51.580667</td></tr><tr><td>W07bis</td><td>3.012517</td><td>51.588033</td></tr><tr><td>W08</td><td>2.35</td><td>51.458333</td></tr><tr><td>W09</td><td>2.7</td><td>51.75</td></tr><tr><td>W10</td><td>2.416667</td><td>51.683333</td></tr><tr><td>421</td><td>2.45</td><td>51.4805</td></tr></tbody></table></figure><p>&nbsp;</p>","OrigAbstract":null,"OrigDescr":null,"Comments":null,"ReleaseDate":null,"ReleaseDate0":null,"OrigDescrLang":null,"EmbargoDate":null,"OrigDescrLangNL":null,"OrigLangCode":null,"OrigLangCodeExtended":null,"OrigLangID":null,"DescrCompFlag":null,"DescrTransFlag":null,"Citation":"Decrop, W., Lagaisse, R., Mortelmans, J., Muyle, J., Amadei Martínez, L., &amp; Deneudt, K. (2025). LifeWatch observatory data: phytoplankton annotated trainingset by FlowCam imaging in the Belgian Part of the North Sea [Data set]. Zenodo.","AccessConstraints":null,"UDate":"2026-04-02","CDate":"2025-08-21","CurrencyDate":null,"RevisionDate":null,"DateLastModified":{"date":"2026-04-02 11:55:47.501543","timezone_type":1,"timezone":"+02:00"},"CheckedFlag":0,"PublicFlag":1,"VlizCoreFlag":1,"MarineFlag":null,"FreshFlag":0,"BrackishFlag":0,"TerrestrialFlag":0,"StatusID":1,"DasType":"Data products","DasTypeID":23,"DasOrigin":"Data collection","Progress":"Completed","AccessConstraint":"Attribution (CC BY)","AccConstrEN":"Attribution (CC BY)","AccConstrDisplay":"<a rel=\"license\" href=\"https://creativecommons.org/licenses/by/4.0/\" target=\"_blank\"><img alt=\"Creative Commons License\" style=\"border:0px;height:15px;width:80px;vertical-align:middle;\" src=\"https://www.marinespecies.org/aphia/images/cc/by.png\" /></a> This dataset is licensed under a <a rel=\"license\" href=\"https://creativecommons.org/licenses/by/4.0/\" target=\"_blank\">Creative Commons Attribution 4.0 International License</a>.","License":"https://creativecommons.org/licenses/by/4.0/","AccConstrDescription":"This license lets others distribute, remix, tweak, and build upon your work, even commercially, as long as they credit you for the original creation. This is the most accommodating of licenses offered. Recommended for maximum dissemination and use of licensed materials.","Lineage":null,"AccConID":21,"DOI":"10.5281/zenodo.16679297"},"dois":null,"spcols":null,"keywords":[{"ThesaurusTerm":"CNN weights","ThesTypID":0,"ThesType":null,"Code":null,"Description":null,"OrigThesTerm":"CNN weights","DutchTerm":"","URI":null,"DasKeywordDescr":null},{"ThesaurusTerm":"phytoplankton classifier","ThesTypID":0,"ThesType":null,"Code":null,"Description":null,"OrigThesTerm":"phytoplankton classifier","DutchTerm":"phytoplankton classifier","URI":null,"DasKeywordDescr":null},{"ThesaurusTerm":"Phytoplankton species","ThesTypID":0,"ThesType":null,"Code":null,"Description":null,"OrigThesTerm":"Phytoplankton species","DutchTerm":null,"URI":null,"DasKeywordDescr":null},{"ThesaurusTerm":"TensorFlow","ThesTypID":0,"ThesType":null,"Code":null,"Description":null,"OrigThesTerm":"TensorFlow","DutchTerm":"","URI":null,"DasKeywordDescr":null}],"parents":null,"children":null,"othrel":null,"othrelrev":null,"ownerships":[{"OrderNr":1,"Surname":"Decrop","Firstname":"Wout","Initials":"W.","PerPublicFlag":1,"AdrID":170200,"Email":"wout.decrop@vliz.be","InsPublicFlag":1,"Acronym":"VLIZ","OrigNameLangCode":"en","OrigNameLangID":15,"FullOrigName":"Flanders Marine Institute","InsOwnerCNT":10,"PersID":42199,"InsID":36,"FullInstitute":"Vlaams Instituut voor de Zee","RoleID":24,"Role":"Contact","OrigName":"Flanders Marine Institute","StandardName":"Vlaams Instituut voor de Zee","FullAcronym":"VLIZ","ORC":"https://orcid.org/0009-0001-7756-8310","ROR":"https://ror.org/0496vr396"},{"OrderNr":2,"Surname":"Decrop","Firstname":"Wout","Initials":"W.","PerPublicFlag":1,"AdrID":170200,"Email":"wout.decrop@vliz.be","InsPublicFlag":1,"Acronym":"VLIZ","OrigNameLangCode":"en","OrigNameLangID":15,"FullOrigName":"Flanders Marine Institute","InsOwnerCNT":10,"PersID":42199,"InsID":36,"FullInstitute":"Vlaams Instituut voor de Zee","RoleID":61,"Role":"Data creator","OrigName":"Flanders Marine Institute","StandardName":"Vlaams Instituut voor de Zee","FullAcronym":"VLIZ","ORC":"https://orcid.org/0009-0001-7756-8310","ROR":"https://ror.org/0496vr396"},{"OrderNr":3,"Surname":"Lagaisse","Firstname":"Rune","Initials":"R.","PerPublicFlag":1,"AdrID":169914,"Email":"rune.lagaisse@vliz.be","InsPublicFlag":1,"Acronym":"VLIZ","OrigNameLangCode":"en","OrigNameLangID":15,"FullOrigName":"Flanders Marine Institute","InsOwnerCNT":10,"PersID":39431,"InsID":36,"FullInstitute":"Vlaams Instituut voor de Zee","RoleID":61,"Role":"Data creator","OrigName":"Flanders Marine Institute","StandardName":"Vlaams Instituut voor de Zee","FullAcronym":"VLIZ","ORC":"https://orcid.org/0000-0001-9191-9140","ROR":"https://ror.org/0496vr396"},{"OrderNr":4,"Surname":"Mortelmans","Firstname":"Jonas","Initials":"J.","PerPublicFlag":1,"AdrID":167758,"Email":"jonas.mortelmans@vliz.be","InsPublicFlag":1,"Acronym":"VLIZ","OrigNameLangCode":"en","OrigNameLangID":15,"FullOrigName":"Flanders Marine Institute","InsOwnerCNT":10,"PersID":26622,"InsID":36,"FullInstitute":"Vlaams Instituut voor de Zee","RoleID":61,"Role":"Data creator","OrigName":"Flanders Marine Institute","StandardName":"Vlaams Instituut voor de Zee","FullAcronym":"VLIZ","ORC":"https://orcid.org/0000-0002-8781-7915","ROR":"https://ror.org/0496vr396"},{"OrderNr":5,"Surname":"Muyle","Firstname":"Julie","Initials":"J.","PerPublicFlag":1,"AdrID":171095,"Email":"julie.muyle@vliz.be","InsPublicFlag":1,"Acronym":"VLIZ","OrigNameLangCode":"en","OrigNameLangID":15,"FullOrigName":"Flanders Marine Institute","InsOwnerCNT":10,"PersID":38210,"InsID":36,"FullInstitute":"Vlaams Instituut voor de Zee","RoleID":61,"Role":"Data creator","OrigName":"Flanders Marine Institute","StandardName":"Vlaams Instituut voor de Zee","FullAcronym":"VLIZ","ORC":"https://orcid.org/0000-0002-7481-0626","ROR":"https://ror.org/0496vr396"},{"OrderNr":6,"Surname":"Mortelmans","Firstname":"Jonas","Initials":"J.","PerPublicFlag":1,"AdrID":167758,"Email":"jonas.mortelmans@vliz.be","InsPublicFlag":1,"Acronym":"VLIZ","OrigNameLangCode":"en","OrigNameLangID":15,"FullOrigName":"Flanders Marine Institute","InsOwnerCNT":10,"PersID":26622,"InsID":36,"FullInstitute":"Vlaams Instituut voor de Zee","RoleID":24,"Role":"Contact","OrigName":"Flanders Marine Institute","StandardName":"Vlaams Instituut voor de Zee","FullAcronym":"VLIZ","ORC":"https://orcid.org/0000-0002-8781-7915","ROR":"https://ror.org/0496vr396"},{"OrderNr":7,"Surname":"Amadei Martinez","Firstname":"Luz","Initials":"L.","PerPublicFlag":1,"AdrID":170719,"Email":"luz.amadei@vliz.be","InsPublicFlag":1,"Acronym":"VLIZ","OrigNameLangCode":"en","OrigNameLangID":15,"FullOrigName":"Flanders Marine Institute","InsOwnerCNT":10,"PersID":34178,"InsID":36,"FullInstitute":"Vlaams Instituut voor de Zee","RoleID":24,"Role":"Contact","OrigName":"Flanders Marine Institute","StandardName":"Vlaams Instituut voor de Zee","FullAcronym":"VLIZ","ORC":"https://orcid.org/0000-0001-5960-7972","ROR":"https://ror.org/0496vr396"},{"OrderNr":8,"Surname":"Amadei Martinez","Firstname":"Luz","Initials":"L.","PerPublicFlag":1,"AdrID":170719,"Email":"luz.amadei@vliz.be","InsPublicFlag":1,"Acronym":"VLIZ","OrigNameLangCode":"en","OrigNameLangID":15,"FullOrigName":"Flanders Marine Institute","InsOwnerCNT":10,"PersID":34178,"InsID":36,"FullInstitute":"Vlaams Instituut voor de Zee","RoleID":61,"Role":"Data creator","OrigName":"Flanders Marine Institute","StandardName":"Vlaams Instituut voor de Zee","FullAcronym":"VLIZ","ORC":"https://orcid.org/0000-0001-5960-7972","ROR":"https://ror.org/0496vr396"},{"OrderNr":9,"Surname":"Deneudt","Firstname":"Klaas","Initials":"K.","PerPublicFlag":1,"AdrID":171088,"Email":"klaas.deneudt@vliz.be","InsPublicFlag":1,"Acronym":"VLIZ","OrigNameLangCode":"en","OrigNameLangID":15,"FullOrigName":"Flanders Marine Institute","InsOwnerCNT":10,"PersID":3362,"InsID":36,"FullInstitute":"Vlaams Instituut voor de Zee","RoleID":61,"Role":"Data creator","OrigName":"Flanders Marine Institute","StandardName":"Vlaams Instituut voor de Zee","FullAcronym":"VLIZ","ORC":"https://orcid.org/0000-0002-8559-3508","ROR":"https://ror.org/0496vr396"},{"OrderNr":10,"Surname":"Deneudt","Firstname":"Klaas","Initials":"K.","PerPublicFlag":1,"AdrID":171088,"Email":"klaas.deneudt@vliz.be","InsPublicFlag":1,"Acronym":"VLIZ","OrigNameLangCode":"en","OrigNameLangID":15,"FullOrigName":"Flanders Marine Institute","InsOwnerCNT":10,"PersID":3362,"InsID":36,"FullInstitute":"Vlaams Instituut voor de Zee","RoleID":7,"Role":"Co-ordinator","OrigName":"Flanders Marine Institute","StandardName":"Vlaams Instituut voor de Zee","FullAcronym":"VLIZ","ORC":"https://orcid.org/0000-0002-8559-3508","ROR":"https://ror.org/0496vr396"}],"taxterms":null,"frameworks":null,"otherterms":[{"OtherTerm":"CNN weights"},{"OtherTerm":"phytoplankton classifier"},{"OtherTerm":"Phytoplankton species"},{"OtherTerm":"TensorFlow"}],"temporal":[{"DasDateID":6464,"StartYear":2017,"EndYear":null,"StartDay":1,"EndDay":null,"StartDate":"2017-05-01","EndDate":null,"DasDate":null,"Resolution":"Monthly","ResolutionNL":"Maandelijks","Notes":null,"StartMonth0":5,"StartMonth":"May","StartMonthNL":"Mei","EndMonth0":null,"EndMonth":null,"EndMonthNL":null,"Progress":"Completed","ProgressNL":"Afgelopen"}],"geographical":[{"GeoTerm":"Belgian part of the North Sea","DasGeoID":15107,"DasGeoTerm":null,"DasID":8904,"GeotID":9478,"X":null,"Y":null,"MaxX":null,"MaxY":null,"StationName":null,"Precision":null,"CoordSystID":null,"GeoDatumID":null,"OrigCoordMinX":null,"OrigCoordMinY":null,"OrigCoordMaxX":null,"OrigCoordMaxY":null,"OrderNr":null,"Projection":null,"GeoDatum":null,"GeoObjectID":26567,"OrigGeoTerm":"Belgian part of the North Sea","DutchTerm":null}],"meastypes":[{"MethID":null,"DasMeasTypID":20560,"DasMeasType":"images","Matrix":"Water","MatrixNL":"Water","Parameter":null,"ParameterNL":null,"Description":null,"Unit":null,"Methodology":null,"Protocol":null,"QualityControl":null,"Precision":null,"Detectionlimit":null,"Instrument":null,"InstrumentUrl":null,"MethFull":"","ParaCategoryTitle":null,"ParaCatCode":null},{"MethID":null,"DasMeasTypID":20561,"DasMeasType":"","Matrix":"Water","MatrixNL":"Water","Parameter":"Species diversity","ParameterNL":"soortenrijkdom/biodiversiteit","Description":null,"Unit":null,"Methodology":null,"Protocol":null,"QualityControl":null,"Precision":null,"Detectionlimit":null,"Instrument":null,"InstrumentUrl":null,"MethFull":"","ParaCategoryTitle":null,"ParaCatCode":null}],"dasthemes":null,"projects":[{"ProID":5263,"Acronym":"iMagine","Progress":"Completed","StandardTitle":"Imaging data and services for aquatic science","FP7Code":null,"GrantDOI":null,"FunderID":"101058625","FunderIDType":"EU contract id","FunderCodes":["Horizon Europe"]},{"ProID":4139,"Acronym":"LifeWatch","Progress":"Completed","StandardTitle":"Flemish contribution to LifeWatch.eu","FP7Code":null,"GrantDOI":null,"FunderID":"I002021N","FunderIDType":"FWO contract id","FunderCodes":["FWO International research infrastructure"]}],"refs":null,"urls":[{"URL":"https://doi.org/10.5281/zenodo.16679297","externalID":"10.5281/zenodo.16679297","URLTypeCode":"DOI","URLType":"DOI","URLTypID":13,"downloadURL":null,"FileName":null}],"pictures":[],"urlmaps":null,"spatreps":null,"fileformats":null,"resmessage":"","complete":1}
