Articles | Volume 20, issue 5
https://doi.org/10.5194/os-20-1149-2024
© Author(s) 2024. This work is distributed under the Creative Commons Attribution 4.0 License.
Special issue:
Predicting particle catchment areas of deep-ocean sediment traps using machine learning
Download
- Final revised paper (published on 19 Sep 2024)
- Preprint (discussion started on 05 Dec 2023)
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on egusphere-2023-2777', Anonymous Referee #1, 05 Jan 2024
- AC3: 'Reply on RC1', Théo Picard, 31 May 2024
- AC1: 'Comment on egusphere-2023-2777', Théo Picard, 31 Jan 2024
-
RC2: 'Comment on egusphere-2023-2777', Gael Forget, 04 Apr 2024
- AC1: 'Comment on egusphere-2023-2777', Théo Picard, 31 Jan 2024
- AC2: 'Reply on RC2', Théo Picard, 31 May 2024
Peer review completion
AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
AR by Théo Picard on behalf of the Authors (31 May 2024)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (03 Jun 2024) by Matjaz Licer
RR by Anonymous Referee #1 (26 Jun 2024)
RR by Anonymous Referee #3 (08 Jul 2024)
ED: Publish subject to technical corrections (22 Jul 2024) by Matjaz Licer
AR by Théo Picard on behalf of the Authors (24 Jul 2024)
Manuscript
The study is heavily based on the Lagrangian simulations of particle sinking based on circulation model results and described in Wang et al., 2022 (I think this should be stated more explicitly and include a brief summary of results. Best in Section 2). The authors use the results of these simulations to test the possibilities of using machine learning to locate the origin of particles caught in deep sea sediment traps. The aim is to use observed sea surface conditions (mainly satellite altimetry and SST) to deduce the source of particles.
The authors demonstrate that the machine learning algorithm vastly outperforms the baseline prediction, which uses the average distribution centred above the sediment trap. However, they should give a more detailed explanation of what are the benefits of using their algorithm over running the Lagrangian simulations. To put it in a anther way, why not simulate the particle paths using existing ocean reanalyses instead of ‘guessing’ the paths from surface observations? There are many regional model reanalyses of ocean currents available and e.g. CMEMS offers global reanalysis in roughly 8 km resolution which matches the one used in this study. It is clear that there are computational benefits, but one would think that with all the efforts invested in setting the sediment traps, dedicating some serious computing time shouldn’t be a problem. The answer might be obvious to the authors, but certainly not to the potential audience. The authors state in the introduction that there are uncertainties stemming from the ocean model errors. That is certainly true, but in such case I am missing at least a rough comparison of the accuracy of their method to the uncertainties in the prediction of the catchment areas using the Lagrangian backtracking.
I am not a native speaker, so my comments on the English grammar are merely suggestions.
Comments by lines:
L. 7: “surface conditions appear to be sufficient to accurately predict the source area” – Are they? What is considered sufficient?
L. 66: Why 50 m/day?
L. 76: “After spin-up, the chaotic evolution results in uncorrelated dynamics” – How are the initial and boundary conditions perturbed? Shouldn’t the same atmospheric conditions ensure similarity of both runs? Large distance from the boundary and same atmospheric forcing would imply very similar oceanographic conditions. This is very important. If the test conditions are related to the training conditions, the performance of the ML algorithm is overrated. This could also (at least partially) explain the results which are better “under weak/or stable dynamics”. Such conditions are likely also the situations when POLGYR1 and POLGYR2 currents would be most similar. This should be checked and thoroughly explained in the text.
L. 90: Why 10 km x 10 km patch? This paragraph is a bit confusing. 36 particles are released from 10 km x 10 km patches, but there are 36 STs in the model and they are 36 km apart. Is this right? So 36x36=1296 particles are released every 12 hours? Why not release them from 36 points instead of patches? To compensate for the lack of dispersion? The paragraph should be rewritten and clearer.
L. 104: I think the delta sign is not a good choice for the number of points as it implies the distance between points.
L. 108: The authors should explain better the 8 km resolution. I think that the simulations run at 2 km resolution, but the end results are downscaled and so are the surface fields used for the machine learning. This would also match roughly with the resolution of the satellite altimetry. Am I right? If so, this should be clearer from the text.
Figure 3 caption: remove “the” from “the origins”. “from the left panel” instead of “of the left panel.”
L. 142 (something wrong with the line numbering here): “the training criterion”. Batthacharyya coefficient is used in the equation as BC and should be marked as such.
The sum (epsilon) in the equation is too small and is missing the summation index.
L. 145: “Empirically, this method improves the performance of the trained model compared to experiments where the training loss is based only on BL200m.” - This is very interesting. On one hand, one would expect only the end location to matter, on the other hand, the path is important. It would be interesting to know more.
L. 160: The values of BL seem a bit arbitrary. A visual analysis is kind of a weak argument. What does it mean in the practical sense? Is it possible to relate how much would a certain value of BL affect the content of the sediment trap? L. 184: “analysis” instead of “analyse”.
L. 189: “16 in)” – the “in” is kind of out of place here.
L. 219: Why the 20th day? Shouldn’t these be averaged over the average travel time?
L. 256: Maybe “collected” instead of “measured”.
L. 275-285: The biogeochemical model results and satellite chlorophyll concentration measurements show the spatial variability of phytoplankton biomass. Maybe this could serve as a measure of needed accuracy of the method? On the other hand, this paragraph focuses on primary production only and neglects other sources of particles such as zooplankton. The latter could be obtained from the biogeochemical model as well.
L. 298: “analysis” instead of “analyse”.