Articles | Volume 22, issue 1
https://doi.org/10.5194/os-22-49-2026
© Author(s) 2026. This work is distributed under the Creative Commons Attribution 4.0 License.
Using surface drifters to characterise near-surface ocean dynamics in the southern North Sea: a data-driven approach
Download
- Final revised paper (published on 07 Jan 2026)
- Supplement to the final revised paper
- Preprint (discussion started on 22 Jul 2025)
- Supplement to the preprint
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on egusphere-2025-3287', Anonymous Referee #1, 29 Sep 2025
- AC1: 'Reply on RC1', Jimena Medina Rubio, 28 Oct 2025
-
RC2: 'Comment on egusphere-2025-3287', Anonymous Referee #2, 03 Oct 2025
- AC2: 'Reply on RC2', Jimena Medina Rubio, 28 Oct 2025
Peer review completion
AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
AR by Jimena Medina Rubio on behalf of the Authors (03 Nov 2025)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (10 Nov 2025) by Bernadette Sloyan
RR by Anonymous Referee #2 (30 Nov 2025)
ED: Publish as is (10 Dec 2025) by Bernadette Sloyan
AR by Jimena Medina Rubio on behalf of the Authors (12 Dec 2025)
In this paper the authors use different machine learning models to characterize near surface ocean dynamic. Tha authors launched several undrogued surface drifters in the North Sea released from the coast of Netherlands, tracking their position with GNSS. Then, several variables (including variables derived from wind, oceanic currents and waves) from different research products are used as inputs in three machine learning models (linear regression, random forest and support vector machine) to predict drifter velocities. Permutation feature importance and ALE plots are then used to explain the importance of the input variables in predicting the drifter velocities.
The authors claim two different results in the conclusions.
The first one is the efficacy of the proposed analysis method. The use of techniques of explainable machine learning to investigate surface ocean dynamic is interesting and sufficiently novel. I have no objections for this part.
The second one is the accuracy of the proposed method in inferring drifter trajectories. This is, in my opinion, the weakest part of the paper. Albeit the numerical results support the conclusions of the authors, the trajectory dataset is very small, consisting of twelve drifters, released the same day at 250 meters of distance. As can be seen from the figures in the paper, the trajectories are higly correlated, meaning that the dataset lacks the variety needed to ensure sufficient generalization. In this condition the risk of overfitting a model during training is very high, and this problem is neither mentioned nor addressed in the paper.
The reason why the trajectory integrated using the linear model outputs is much more different from the other might be because, due to being a simpler model, it overfitted less than random forest and support vector regression.
I still think that integrating the trajectories using the model outputs is a reasonable benchmark, if the scope of the models is to explain the correlations between input variables and predicted drifter velocities.
In order to claim that the model is able to generalize beyond the twelve drifters presented in the paper, a test using some other drifter release (from some other starting position, in some other period) should be necessary.
I understand that drifter release is a demanding task, and I am obviously not asking the authors to plan further releases. However, in order to better understand the generalization limits, if other surface drifter trajectories are available to the authors, I suggest to test the trained models to reproduce them. If this is not possible, I expect that these concerns are better addressed in the conclusions.
At the very least, the model-integrated trajectories should be compared with trajectories simulated using the ocean velocities given as input to the machine learning models, using some classical integration scheme such as RK4 or RK45.
As a last note, even if the models are actually overfitting the data, this is not an issue for the first scope of the paper (predictor-velocity analysis), since the analysis is focused on this particular dataset and has no claim of generalization. Some degree of overfitting might even be considered beneficial.