Articles | Volume 22, issue 1
https://doi.org/10.5194/os-22-49-2026
https://doi.org/10.5194/os-22-49-2026
Research article
 | 
07 Jan 2026
Research article |  | 07 Jan 2026

Using surface drifters to characterise near-surface ocean dynamics in the southern North Sea: a data-driven approach

Jimena Medina-Rubio, Madlene Nussbaum, Ton S. van den Bremer, and Erik van Sebille
Abstract

The large size of traditional drifters limits their ability to mimic the transport of buoyant objects at the ocean surface, which are subject to complex interactions among direct wind drag, fast-moving surface currents, and wave-induced transport. To better capture these dynamics, we track the trajectories of 12 novel, ultra-thin surface drifters deployed in the southern North Sea over 68 d. We adopt a data-driven approach to model drifter velocity using hydrodynamic and atmospheric data, applying both a linear leeway parameterisation and two machine learning models: random forest and support vector regression. Machine learning model-agnostic interpretation techniques reveal that tidal forcing predominantly drives zonal motion, whereas wind is the main driver in the meridional direction in this region. Notably, the wind exhibits a saturation effect, and its contribution to explaining the variance of the drifter velocity decreases at higher speeds. In trajectory prediction experiments, we find that machine learning models, particularly random forest, outperform linear models, with the latter achieving comparable accuracy only at short time scales. Using a hybrid approach and deriving a non-linear function of the wind from machine learning interpretable methods to include in the leeway parameterisation significantly improves the model prediction of the drifter trajectory. Finally, we test the generalisability of the North Sea-trained models using an independent drifter dataset from the Tyrrhenian Sea. Despite the differences in ocean dynamics between the regions, the machine learning models reproduce the observed trajectories with comparable accuracy to state-of-the-art studies, demonstrating robust explanatory skill and a low degree of overfitting in this instance.

Share
1 Introduction

Accurate predictions of the pathways and the fate of buoyant objects in the ocean rely on our understanding of surface ocean dynamics (Röhrs et al.2021). For example, search-and-rescue missions require a model that predicts how far a missing person has drifted at the ocean surface to define the search area (Breivik et al.2013). These models also benefit biological studies, where the degree of connectivity between different oceanic regions affects the population dynamics of species such as phytoplankton, larvae, and turtles (Nooteboom et al.2019; Lindo-Atichati et al.2020; Grimaldi et al.2022; Manral et al.2024). Similarly, oil spill detection and modelling depend on our knowledge of geophysical fluid dynamics to mitigate environmental damage (Pisano et al.2016; Jones et al.2016; Calzada et al.2021). Another pressing concern is plastic pollution, which threatens marine fauna through ingestion and entanglement (Kühn and van Franeker2020), and disrupts ecosystems via chemical contamination (Mato et al.2001) and the facilitation of biological invasions (Haram et al.2023). Marine debris has increased by 4 % on average each year since 1980, indicating that the current plastic standing stock of 3200 kt (kilotons) in the ocean will double within 2 decades, exacerbating the issue (Kaandorp et al.2023). To further advance these areas of operational oceanography and improve our fundamental understanding of near-surface ocean dynamics, modelling the physical mechanisms driving the transport of buoyant objects in the ocean is key.

The transport of buoyant objects at the ocean surface is governed by a complex interplay of winds, waves, and ocean currents at different spatio-temporal scales. Wind stress creates a friction force on floating objects and generates near-surface currents, resulting in a strong vertical velocity shear, following the well-known steady-state Ekman solution (Ekman1905). Surface gravity waves contribute to transport in the direction of wave propagation through Stokes drift, which arises from deviations in orbital motion (Stokes1847). The interaction between Stokes drift and the Coriolis force generates the Coriolis–Stokes force, which accelerates a current perpendicular to the Stokes drift (see van den Bremer and Breivik2018). In addition, buoyant objects are transported by low-frequency currents, including geostrophic flows that drive large-scale ocean circulation, topographic steering, mesoscale eddies, and high-frequency periodic currents such as tides and inertial oscillations (Röhrs et al.2021).

Buoyant drifters can be used to monitor how these different physical mechanisms combine to drive the transport at the ocean near-surface (Lumpkin et al.2016). Currently, the most commonly deployed drifters are transported with a current that is effectively integrated over part of the water column (over the vertical extent of the drifter) and do not directly capture the complex surface ocean dynamics (Davis et al.1982; Sybrandy and Niiler1992; Novelli et al.2017; MetOcean2017). Alternatively, the surface drifters used in this study (see MetOcean2020) have a thin disc shape with a height of 4.1 cm that enables them to follow the orbital velocities of the waves and drift with the uppermost centimetres of the ocean surface currents (Morey et al.2018; Calvert et al.2024; Pawlowicz et al.2024), which are characterised by higher speeds. Yet, the relevance of the Stokes drift, wind drag, or surface currents on these specific drifters is currently unclear.

Through the analysis of the trajectories of our disc-shaped surface drifters and their comparison with model simulations, we can estimate how each forcing mechanism contributes to the near-surface ocean transport. A prevalent data-driven methodology for quantifying these contributions involves regression models, which seek to establish a relationship between drifter velocity observations and the hydrodynamic and atmospheric conditions. As described by Breiman (2001), these regression models that aim to describe natural processes fall into two broad categories: traditional statistical models, also known as linear models (Bishop2006), or algorithmic models, also known as supervised Machine Learning (ML). Linear models require manually defining the model structure to approximate non-linearities and interdependencies of the near-surface ocean dynamics. This poses a challenge, as there are high-dimensional processes in the ocean with spatiotemporal scales spanning several orders of magnitude and involving many degrees of freedom (van Sebille et al.2020). In contrast, machine learning regression methods do not require explicit knowledge of the underlying transport mechanisms as they establish relationships driven by the algorithms, making them particularly useful for complex systems with multiscale variability and non-linear interdependence (Bracco et al.2025). For instance, Callies et al. (2017) demonstrated a consistent relationship between the leeway (i.e., wind contribution to surface friction) and Stokes drift over time in the same region, illustrating these interdependencies of the system. Furthermore, previous studies have highlighted the advantages in the prediction of the fate of buoyant objects at the ocean surface using different machine learning methods, such as tree-based models (Kaandorp et al.2022; O'Malley et al.2023) and neural networks (Aksamit et al.2020; Fajardo-Urbina et al.2024; Grossi et al.2025).

In this study, we analyse the trajectories of twelve disc-shaped surface drifters deployed in the southern North Sea to evaluate how different forcing mechanisms drive changes in their velocity, using data on the surface ocean currents, wave conditions, and wind patterns. We apply a widely used linear parametrisation and two machine learning regression algorithms (random forest and support vector regression) to model the velocity of the drifters. Using this data-driven approach, we aim to (i) identify the dominant forces driving drifter velocity changes and (ii) improve the prediction of drifter trajectories using only physical variables describing near-surface ocean dynamics.

2 Data

2.1 Surface drifters

2.1.1 Design and deployment

The drifters are thin disc-shaped buoys manufactured by MetOcean (MetOcean2020). Each drifter has a diameter of 24 cm, a height of 4.1 cm, and a weight of 900 g. These drifters are designed to track the uppermost centimetres of the ocean surface. They are equipped with Global Navigation Satellite System (GNSS) positioning (8 m positional accuracy), a sea surface temperature (SST) sensor, and Iridium satellite telemetry. Details on spatial coordinate error estimation are provided in Appendix A. To facilitate the retrieval of beached drifters after battery depletion, we attached AirTags (Apple Inc.2021), increasing the total mass by 1 %. Additionally, the drifters report which of their two antennas is transmitting, distinguishing between the upward-facing and submerged positions. This allows us to monitor their orientation and thus their flipping behaviour.

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f01

Figure 1Trajectories over 68 d of 12 colour-coded surface drifters in the southern North Sea. Drifters were deployed on 25 April 2024 in the Dutch Eastern Wadden Sea in three different clusters spaced 250 m apart. Starting and ending positions are marked with stars and triangles, respectively. Background colourmap shows the bathymetry of the southern North Sea from the NWS Ocean Physics Analysis and Forecast model with a horizontal resolution of 1.5 km (Tonani et al.2019). The study site location in the southern North Sea is highlighted by a red rectangle on the orthogonal projection of the Northern Hemisphere in the bottom right corner.

We released the drifters off the coast of the town of Moddergat, the Netherlands, in the Eastern Wadden Sea on 25 April 2024 (Fig. 1). We placed the drifters in three clusters spaced 250 m apart on mudflats at low tide, awaiting the incoming tide to transport them. Within each cluster, we arranged four drifters in a square formation with 50 m spacing between them. All drifters remained operational for at least 68 d, drifting across the German Bight region in the southern North Sea and reaching the North Frisian Islands. Initially, we set the drifters' sampling frequency to 5 min, but we lengthened the interval to 30 min after 6 d to extend battery life. After 26 d, we further lengthened it to 3 h.

2.1.2 Drifter data processing

To ensure the quality of the GNSS data, data points for which the time difference between measurements Δt is less than 2.5 min are eliminated. This threshold removes redundant measurements, which typically result from minor time synchronisation errors between the drifter’s internal time and the satellite time stamp or re-transmissions occurring within the intended sampling interval. Furthermore, the atmospheric and ocean datasets to be used for comparison (Sect. 2.2) are only available on a much coarser time resolution. We compute the (total) drifter velocity vd at position xn and time tn, corresponding to the nth observation, using the central difference scheme, which averages the arriving and departing instantaneous velocities (Elipot et al.2016). This vector is calculated as

(1) v d x n , t n = 1 2 x n + 1 - x n t n + 1 - t n + x n - x n - 1 t n - t n - 1 .

This method yields average speeds for the entire data set of 0.27±0.19 m s−1 in the zonal direction and 0.13±0.10 m s−1 in the meridional direction. Measurements with speeds exceeding 3 m s−1 are flagged as spatial coordinate errors, as sustained speeds above this threshold are considered physically unrealistic given the typical fluid motion timescales in the region (Otto et al.1990). While such high velocities may occur during wave breaking, these events would last less than a fraction of Δt. In total, 0.6 % of the original data points are identified as signal recording errors and removed from the time series. We also omit the first 24 h to reduce the importance of coastal and intertidal effects, focusing the analysis on mesoscale surface dynamics in the open basin.

The German Bight is characterised by strong tidal dynamics due to its shallowness (Otto et al.1990), so to isolate the residual kinematics from these dominant tidal effects, we estimate the net drifter velocity vd over the dominant tidal cycle T. This residual (Lagrangian) velocity is defined by Zimmerman (1979) as:

(2) v ̃ d ( x , t ) = 1 T t - T 2 t + T 2 v d ( x , τ ) d τ .

We identify the dominant tidal harmonic in the drifters' velocity data using Fast Fourier Transform (FFT) and Morlet wavelet analysis (Meyers et al.1993). Both methods show that the semi-diurnal M2 and S2 tidal constituents are the most significant, with no signal detected at the inertial period. These findings are consistent with prior Lagrangian observations in the region (Meyerjürgens et al.2019; Deyle et al.2024). Hence, drifter velocities are time-averaged over the period T= 24.83 h to smooth out the influence of both tidal signals, resulting in residual speeds that reach a maximum of 0.63 m s−1. Further details on the spectral analysis methodology and a visualisation of the frequency spectra can be found in Appendix B.

2.2 Atmospheric and hydrodynamic datasets

We use an atmospheric model and coupled ocean–wave models to quantify the effects of the different forcing mechanisms on drifter transport. We use data of the wind velocity field at 10 m above the ocean surface (uw) from the EMCWF reanalysis model ERA5, which assimilates satellite and in-situ measurements (Hersbach et al.2023). The surface ocean currents velocity field (uo) is provided by the North Western Shelf Ocean Physics Analysis and Forecast model, which is forced by the EMCWF wind field and assimilates SST, sea level anomalies from satellites, as well as in situ temperature and salinity profiles (Copernicus Marine Service2024a). The model uses a hybrid z*-σ terrain-following vertical coordinate system consisting of 51 levels, with the thickness of the surface cell set to ≤1 m (Tonani et al.2019). Due to the strong tidal ocean dynamics in the region, we isolate tidal contributions from the ocean surface velocity field using a low-pass filter with a time window T=24.83 h. The resulting low-pass ocean currents velocity vector is denoted as uoLP, while the remaining high-pass filtered ocean currents velocity will be referred to as uoHP. To characterise the wave conditions at the southern North Sea, we use the NWS Ocean-Wave Forecasting System (Copernicus Marine Service2024b), which is coupled to the mentioned ocean currents model and includes the effect of wave-induced fluxes in momentum and energy, and the Coriolis–Stokes force on the Eulerian current. Several studies have emphasised the importance of using coupled models (compared to non-coupled models) in the simulation of wave-induced surface transport of buoyant objects (Röhrs et al.2012; Cunningham et al.2022; Rühs et al.2025). Three key parameters used to describe the effect of ocean waves on the transport of buoyant objects are significant wave height, mean propagation direction, and mean frequency or period. These parameters are derived from the wave spectrum model output, which partitions the spectral significant wave height (Hs) and the mean wave direction (θ) into wind waves, and first and second swell components. The mean wave direction represents the energy-weighted average propagation direction within each spectral partition. In contrast, the bulk wave direction (θbulk) is computed from the full, unpartitioned directional spectrum and represents the overall propagation direction of the total wave field. Additionally, we consider the period at the spectral peak (Tp), and the Stokes drift velocity field at the surface (us). As in Bruciaferri et al. (2021), we calculate the Stokes drift at 0.5 m below the still–water level to align with the depth mid-point of the upper ocean model layer, and apply the parametrisation of Breivik et al. (2016) based on the Phillips wind-wave spectrum. While this approximation may underestimate the Stokes drift experienced by the drifters (Lenain and Pizzo2020), defining the Stokes drift at this depth ensures consistency, allowing a coherent analysis of the effects derived from the coupled ocean–wave model. Further details on the models' spatio-temporal resolution are presented in Table 1.

Hersbach et al. (2023)Tonani et al. (2019)Bruciaferri et al. (2021)

Table 1Specifications of the hydrodynamic and atmospheric model data at the drifters' spatiotemporal coordinates. Variables included are the wind velocity vector (uw), high-pass and low-pass ocean currents velocity using a 24.83 h filter (uoHP, uoLP), Stokes drift velocity in the ocean upper-layer (us), and properties derived from the wave spectrum, including significant wave height from the wind, first and second swell partitions (Hswind, Hs1pswell, Hs2pswell), wave direction from the wind, and first and second swell partitions (θwind, θ1p swell, θ2p swell), bulk wave direction (θbulk), and wave period at the spectral peak (Tp).

Download Print Version | Download XLSX

Where needed, we interpolate the atmospheric and hydrodynamic model data to the drifter measurements of location and timestamp to assess their instantaneous effect on their velocity using the bilinear interpolation scheme from Parcels (Delandmeter and van Sebille2019). To align with the models' 1 h temporal resolution while reducing high-frequency noise, drifter coordinates during the period with Δt=5 min are resampled to a coarser resolution of 30 min via linear interpolation.

3 Methodology

We manipulate the interpolated hydrodynamic and atmospheric model data to define physically relevant independent variables to model velocity changes along the trajectories of the drifters. We include the zonal (U) and meridional (V) components of all velocity vectors, including the surface ocean currents, wind, and Stokes drift. To account for wave directionality, we project the properties derived from the wave spectrum onto the zonal and meridional axes using wave direction data. Let A be one such property; we formulate its vector form by A=(Asinθ,Acosθ), where θ is the deviation angle from true north of the wave direction. We define the peak spectral period vector Tp=(Tpx,Tpy)=(Tpsinθbulk, Tpcos θbulk) using the bulk wave direction θbulk and, in doing so, assign a scalar to the two directions in an ad hoc fashion. Similarly, we compute three different significant wave height vectors, each associated with a specific wave partition (wind sea, first swell, and second swell). These are expressed as Hsi=(Hsi,x,Hsi,y)=(Hsisinθi,Hsicosθi), where i denotes the wave partition, and each vector is calculated using the corresponding wave direction for that partition. This ensures that the models incorporate wave effects on drifter motion in both directions, resulting in a feature matrix comprised of 16 variables measured over 18 696 time points. We do not include previous history of hydrodynamic and atmospheric conditions (i.e., we do not include variables with time lags or averages over a spatial radius of influence) because these variables would physically represent an inertial effect on the drifters, which has been found to be very small (Olascoaga et al.2020).

To infer the predominant forcing mechanisms and model the trajectory of the drifters, we analyse the outcome using the baseline linear regression model of the total drifter velocity components and compare it to two non-linear machine learning algorithms: random forest and support vector regression. Given the study region's stronger zonal than meridional tidal component (Kopte et al.2022), we model the zonal (Ud) and meridional (Vd) drifter velocity components separately in both the linear and machine learning models and assume that the interdependence between these two variables is negligible.

3.1 Linear regression

Unlike more complex non-linear models, linear regression offers a direct model interpretation by quantifying each feature's (i.e., independent variables) influence through interpretable coefficients or weights in the prediction equation of the target (i.e., dependent variable). The total drifter velocity ud is typically parameterised using a physics-based linear model as

(3) u d = u o + u s + γ u w + ε ,

where γ is a vector of model coefficients that scales each component of the wind velocity uw, ε is a disturbance term to be minimised by the linear regression algorithm, uo is the surface currents total velocity (i.e., the addition of the low-pass and high-pass surface ocean currents), us is the Stokes drift velocity, and  is the Hadamard or element-wise product. Note that this vector equation represents two independent scalar equations, one for the zonal and one for the meridional components.

Physically, the wind term weight represents the drag coefficient ratio above and below water, since our drifters are radially and axially symmetric (Dominicis et al.2016). Yet, downwind and crosswind components of the drifter velocity vector have been found to have distinct dependences on the wind speed (Allen2005). Several studies have hence divided the wind contribution into two perpendicular components, known in the literature as the “leeway method” (Breivik et al.2011), and quantified the wind slip vector represented by γ in Eq. (3). This approach yields values of γ ranging from 1 %–3 %, depending on the type of surface drifters and choice of hydrodynamic model (Sutherland et al.2020; Staneva et al.2021). Hence, we treat wind slip γ=(γx,γy) as a vector with zonal and meridional components to find the best-fit values for this new drifter design using the ordinary least squares method (Faraway2025). The accuracy of the linear model is evaluated based on its goodness-of-fit. Although this method is an empirical parametrisation of the wind contribution to the drifter velocity, its functional form aligns with theoretically derived models of the drift of spherical buoyant objects at the ocean surface (Beron-Vera et al.2019).

We also consider a linear regression model in terms of the relative wind, given that the friction velocity (i.e., the actual velocity acting at the ocean surface) is a function of the wind speed with a slope equal to the drag coefficient and a non-zero constant (Foreman and Emeis2010). Upon fitting the boundary conditions at the ocean surface, the drifter velocity is hence expressed as

(4) u d = u o + u s + γ u w - u o + ε .

Making assumptions on the magnitude and spatio-temporal scales of the wind and ocean current forcing simplifies this formulation back to Eq. (3) as shown by Duhaut and Straub (2006).

Nevertheless, despite the high interpretability of linear models, they may still oversimplify near-surface ocean dynamics by omitting non-linear behaviour in the parameterisation of drifter velocity components. For the case of highly correlated features, linear models also struggle to determine their contributions, leading to instability in coefficient estimation and reduced model reliability (Molnar2022). From a physical perspective, another limitation is that the surface current velocities used here are depth averaged over the model's upper layer (Tonani et al.2019). This means that wind-driven vertical shear in the upper centimetres may be underestimated and part of the shear effect effectively absorbed by the windage coefficient, potentially influencing the interpretation of the modelled surface currents (Callies et al.2017; Laxague et al.2018).

3.2 Machine learning regression

Machine learning algorithms offer an alternative regression approach particularly suited to climate variables, capturing non-linear interactions and multicollinearity without requiring prior structural knowledge (Breiman2001). We use a random forest model, as it has been found to perform well for spatio-temporal predictions due to its capacity to handle highly correlated features (Hengl et al.2018; Nussbaum et al.2018). To contrast these results, we also employ a support vector regression model and take advantage of its ability to handle high-dimensional feature spaces and its intrinsic algorithmic differences with the decision-tree structure of the random forest model. Both models are implemented using the scikit-learn Python package (Pedregosa et al.2011, version 1.6.1).

3.2.1 Random forest

Random forest is a machine learning method that predicts the values of the target variable from the average of the predictions from a collection of decision trees (Breiman2001). Each regression tree recursively partitions the data through binary splits based on feature thresholds. In this algorithm, each tree is fitted to a randomly drawn subset of the data and uses a randomly drawn subset of features to consider at each split. Single decision trees struggle with linear relationships, which must be approximated by step functions, and are sensitive to small input changes, sometimes producing non-smooth predictions (Molnar2022). By averaging over many trees, random forests reduce these limitations, exhibiting low sensitivity to hyperparameter choices including the number of trees ntree, the minimal number of observations at terminal nodes nmin (i.e., the last branch of each tree), and the number of randomly selected features to test at each split mtry (Probst and Boulesteix2017). Hence, we build a random forest model to fit the drifter velocity zonal and meridional components with scikit-learn defaults ntree=100, nmin=2, mtry=np (Geurts et al.2006) where np is the total number of variables in the feature matrix (sometimes called predictors) and np=16.

3.2.2 Support vector regression

Support vector regression is a model that applies principles of the support vector machine (Cortes and Vapnik1995) to regression tasks (Drucker et al.1996). In classification, support vector machines find the optimal hyperplane that separates data and maximises its distance to the closest data point (Vapnik1999). Instead, support vector regression identifies an optimal hyperplane with a margin (defined by support vectors) where prediction errors are tolerated. This model applies a transformation using non-linear kernel functions to project the data into a higher-dimensional space where a linear hyperplane can better approximate the non-linear relationships within the data (Smola and Schölkopf2004). However, the support vector regression model is sensitive to the choice of kernel and hyperparameters, which can strongly affect model performance. To address this, we use a radial basis function (RBF) and optimise the hyperparameters via grid search using cross-validation (see Appendix D for the exact values).

3.2.3 Evaluation of predictive performance

We evaluate the predictive performance of the machine learning models through cross-validation, where we calculate the expected extra-sample error (Hastie et al.2009). Validating the predictive performance of these models for drifter velocities is challenging due to the strong temporal autocorrelation present in trajectory data and the limited spatial dispersion in our dataset (Wadoux and Heuvelink2023). These factors could potentially lead to overly optimistic model performance estimates if training and testing sets are not sufficiently independent. To address this, we designed a spatio-temporal block cross-validation strategy to organise data into independent time blocks, regardless of the drifter, and then applied a k-fold cross-validation (k=5) by selecting a random set of blocks to be left out in each fold. Doing so ensures that the model is validated on data that is both temporally and spatially independent of the training set. The duration of the blocks corresponds to the autocorrelation time of the target variables, ensuring that any two points separated by more than this time range can be considered statistically independent and thus suitable for validation. This approach substantially reduces the risk of overfitting, as it prevents information leakage between training and validation sets and ensures that model performance is assessed on truly unseen, time-independent samples. For a detailed explanation of the block-cross validation strategy and the exact autocorrelation times, see Appendix C.

The zonal and meridional drifter velocity models' predictive accuracy are evaluated independently using the coefficient of determination R2 (Wilks2011), which quantifies the variance explained by the model, the root-mean-square error (RMSE), which measures the average prediction error magnitude, and the mean absolute error (MAE), which assesses the bias of the model. These metrics are defined as

(5) R 2 = 1 - i = 1 N y i - y ^ i 2 i = 1 N y i - y i 2 , RMSE = 1 N i = 1 N y i - y ^ i 2 , MAE = 1 N i = 1 N y i - y ^ i ,

where yi are the observed target values, y^i are the model predictions, yi is the mean of the observed values, and N is the number of observations. All metrics are applied for each fold using the observed and predicted data points from the test set and averaged across folds to assess the models' predictive capacities. Cross-validation plots of predicted against observed data points within each testing fold are included in Sect. S1 in the Supplement.

3.2.4 Model interpretation

A key limitation of machine learning models is their reduced interpretability compared to linear models, the so-called “black-box” abstractness (Lipton2017). As these models do not use a functional form to calculate the output, it becomes hard to understand the effects of individual features on the final prediction of the target variable (Molnar2022). To mitigate this, model-agnostic methods have been developed to characterise the overall behaviour of machine learning models.

One such method is permutation feature importance, which is used for identifying the variables that have the greatest impact on the target. This approach quantifies the increase in model prediction error, chosen to be the RMSE, when the values of a feature are randomly shuffled 10 times along the time dimension (i.e., the value for observation tn is reassigned to tk, where kn is a random index within the dataset). This assumes that features causing large prediction error increases when shuffled are more important in explaining the variance in the target data (Ewald et al.2024).

Another powerful tool for interpreting machine learning models post hoc are Accumulated Local Effects (ALE) plots. These ALE plots are model-agnostic methods for explaining individual predictions (Apley and Zhu2020). As described by Molnar (2022), ALE plots show how a model's predictions change with respect to a feature, considering only the data points within a local range (window) of that feature. To improve visualisation, the accumulated changes are computed across all such intervals covering the feature's range. An advantage of this method is its ability to visually reveal the functional relationship between the target variable and individual features. Unlike predecessors such as Partial Dependence Plots (PDPs) (Friedman2001), which estimate global average effects across the entire dataset and can be biased by correlated features, ALE plots capture local effects by focusing on small, localised changes in prediction within each interval. We estimate the ALE uncertainty by calculating the results from each feature using 100 bootstrapped resamples of the feature matrix and target variable. For each feature, we calculate the 95 % CI (Confidence Interval) based on the distribution of ALE estimates across these bootstrapped samples. These ALE plots are generated using the PyAle Python package (Jomar2020, version 1.2.0), a Python implementation of the R package ALEPlot (Apley and Zhu2020). After plotting the ALE, feature transformations are derived by visual inspection to approximate the functional form of the ALE curve to be able to obtain an interpretable model, i.e. making the linear model non-linear but retaining its high interpretability.

3.3 Modelling drifter trajectories

To assess model predictive accuracy, we reconstruct the change in location of the drifters over time using the models' predictions of their total velocity and employing a leave-one-drifter-out cross-validation strategy. We train each of the three models on data from 11 drifters, constituting the training feature matrix X, and obtain the mapping functions fu(X) and fv(X) which relate the hydrodynamic and atmospheric ocean conditions to the total drifter velocity components. The predicted drifter velocity vector u^d is expressed as:

(6) u ^ d = U ^ d V ^ d = f u ( X ) f v ( X ) ,

where U^d and V^d are the zonal and meridional predicted drifter velocity components, respectively.

We simulate the trajectory of the test case, i.e., the excluded drifter, by integrating the velocity predictions over time. Let X(x,t) contain interpolated hydrodynamic and atmospheric variables at position xd and time t. The predicted drifter position x^d(t), based on a given model, is defined as:

(7) x ^ d ( t + δ t ) = x ^ d ( t ) + t t + δ t u ^ d X x ^ d , τ d τ ,

where δt is the integration time-step, set to δt= 60 s. We use a forward Euler method to solve this integral and calculate the time advection of the drifter trajectories.

3.3.1 Trajectory prediction skill metrics

The accuracy of the modelled trajectories is measured using the mean cumulative separation distance D, which quantifies the difference between observed and modelled spatial coordinates at each time-step (Haza et al.2019; van der Mheen et al.2020; Moerman et al.2024). This metric is calculated as

(8) D = 1 M i = 0 M x ^ d t i - x d t i ,

where M is the total number of timesteps along the drifter trajectory, and x^d(ti) is the position vector of the drifter at the ith timestep. As reference, observed drifters travel on average a total of 1795 km over their measuring period.

We also evaluate the predicted trajectories using the Liu–Weisberg skill score (Liu and Weisberg2011), which is widely used in drifter studies (Liu et al.2014; van Sebille et al.2021; Pärn et al.2023). This metric is the average of the separation distance (i.e., the distance between the model prediction and the observed position) weighted by the length of the observed trajectory. The skill score “ss” is given by

(9) ss = 1 - s n , s n 0 , s > n for s = i = 0 M x ^ d t i - x d t i i = 0 M j = 0 i x d t j + 1 - x d t j ,

where n is a non-dimensional number that defines the threshold of no skill. Liu and Weisberg (2011) use a threshold of n=1 to calculate the accuracy of 3 d predictions. However, due to the long prediction period in our trajectory analysis (up to 60 d), a smaller threshold value n is needed. Otherwise, minor differences in separation distance between models would have little influence on the final Liu–Weisberg skill score, which would instead be dominated by the large cumulative trajectory length. We therefore set the threshold to n=0.05, preserving a consistent ratio between prediction period and threshold to ensure meaningful model comparisons.

3.3.2 Impact of non-dynamical variables on trajectory prediction

To draw physically meaningful conclusions about the dominant forces governing the transport of buoyant objects, our machine learning models are initially trained using only dynamic physical variables that change along the drifter trajectory and contrasted with the linear model. Nonetheless, prior studies in other fields have shown that including spatiotemporal features such as latitude, longitude, distance to reference points, time since release, or seasonal indicators can significantly enhance machine learning models' performance (Behrens et al.2018; Hengl et al.2018; O'Malley et al.2023). To assess this in our data, we train an additional random forest model to predict total drifter velocity, incorporating non-dynamic features and contrast the results with the original random forest model. These random forest models with an expanded dataset include features related to position (latitude and longitude) and local water depth, along with the original input variables. We also test whether explicitly accounting for the additional transport of these surface drifters due to possible wave surfing (Pizzo et al.2019) improves the random forest model predictive skill by introducing another feature: Flipping Index. This feature parameterises drifter flipping behaviour, a process used by Haza et al. (2018) for the identification of drogue loss and associated with storm conditions with strong winds and high wave steepness that induce wave-breaking. The Flipping Index quantifies the proportion of orientation changes (i.e., flips) based on successive antenna measurements (see Appendix E for full details). In order to obtain a Flipping Index along the drifters' trajectories, we first train a random forest model to predict it everywhere. Since only about 28 % of the dataset exhibits non-zero Flipping Index values (Fig. E2), we apply a hurdle modelling approach: first, a random forest classifier predicts the likelihood of flipping, and second, a random forest regressor estimates the flipping magnitude conditional on a positive event (Wadoux and Heuvelink2023).

4 Results and discussion

4.1 Inference of predominant forcing mechanisms

The linear regression, random forest, and support vector regression models collectively provide insight into the physical forcing mechanisms governing the transport of buoyant objects at the ocean surface. By analysing both the total and residual drifter velocities, we can infer the relative importance of wind, waves, and ocean currents in shaping measured trajectories of our surface drifters at subtidal and supertidal scales.

4.1.1 Total drifter velocity

Fitting the weights of the ordinary least-square linear regression given by Eq. (3) of the drifter total zonal and meridional velocity components yields γx=1.34 % (R2=0.26, RMSE = 0.11 m s−1, MAE = 0.12 m s−1) and γy=1.63 % (R2=0.40, RMSE = 0.08 m s−1, MAE = 0.10 m s−1) in the zonal and meridional directions, respectively. Both values fall within the theoretically predicted range of 1 %–2 % for objects with a density of approximately 0.7ρ, where ρ is the seawater density (Wagner et al.2022).

Focusing on the scale of each of the resulting terms in this linear regression, we observe a contrast between the predominant dynamics in the zonal and meridional direction. The mean zonal surface ocean current speed, Uo=0.3 m s−1, is higher than the resulting mean wind contribution γxUw=0.07 m s−1, and at least one order of magnitude larger than the mean zonal Stokes drift speed (Us=0.04 m s−1). In the meridional direction, however, mean surface ocean currents speed, Vo=0.08 m s−1, and wind effects γyVw=0.06 m s−1 are comparable, while the mean Stokes drift speed (Vs=0.03 m s−1) is still smaller.

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f02

Figure 2Permutation feature importance (RMSE increase) for a random forest and support vector regression models predicting zonal (Ud, blue) and meridional (Vd, red) total drifter velocity components calculated using central difference scheme. Bars represent the mean RMSE increase over 10 random permutations of each feature, which are ordered by decreasing importance. Error bars are omitted, as they account for less than 1 % of the RMSE increase. A larger RMSE increase indicates greater feature importance, as shuffling a feature significantly worsens the model's prediction. Note the differences in the scale of the y axis across the models. Features are shaded by type: ocean currents (blue), wind (beige), and waves (light green). Cross-validated metrics (coefficient of determination R2, RMSE, and MAE) are shown in the legend.

Download

The random forest and support vector regression models of the total drifter velocity exhibit similar R2, RMSE, and MAE from cross-validation, performing a better fit of the zonal drifter velocity than the meridional velocity (see the legend in Figs. 2, S1 and S2 in the Supplement). Model-agnostic interpretation methods consistently identify ocean currents and wind as the main drivers of the drifter's motion, approximating its total velocity as a linear combination of these forces. The permutation feature importance plots of the machine learning models (Fig. 2) show there is a prominent signature of the tidal current in the zonal direction for the prediction of the drifter zonal velocity Ud, represented by the high RMSE increase of UoHP (0.37 m s−1 in random forest model, 0.93 m s−1 in support vector regression), followed by a smaller contribution from the zonal wind U10 (0.14 m s−1, 0.23 m s−1 respectively). The roles of these two variables are reversed for the prediction of the meridional drifter velocity V10 (0.15 m s−1, 0.43 m s−1) compared to the contribution from the meridional high-pass ocean currents VoHP (0.05 m s−1, 0.13 m s−1).

ALE plots of the random forest model reveal the dependence between the drifter velocity components and the most influential features from the permutation feature importance plots: high-pass ocean currents and wind. We observe a linear relationship between the high-pass ocean currents and the parallel total drifter velocity component (Fig. 3). Yet, we also observe small nonlinearities in the ALE plot of the meridional high-pass ocean currents VoHP around zero and at the extremes of the feature distribution, where uncertainty increases due to sparse data.

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f03

Figure 3Accumulated Local Effect (ALE) plots for the zonal (blue, left y axis) and meridional (red, right y axis) total drifter velocity, calculated using central difference scheme, as a function of the corresponding parallel components of (a) the high-pass ocean currents and (b) the wind modelled by a random forest model. The shaded area represents the 95 % CI across 100 bootstrapped samples. The top histograms show feature distributions of the data. Negative velocity values for the zonal component represent westward motion, while negative meridional velocities represent southward motion.

Download

The ALE plots for the zonal wind velocity U10 show that the effect on the zonal total drifter velocity Ud becomes constant at extremes of the distribution, resembling a sigmoid function (Fig. 3). This indicates that higher values of the zonal wind might not contribute to the same extent to the variance of the zonal total drifter velocity. The meridional wind V10 exhibits a similar but weaker saturation effect on Vd for strong westward winds (<-7 m s−1), though data density is low in this regime. This could be related to the fact that the friction velocity does not scale linearly with wind speed at high values due to increasing surface roughness (Foreman and Emeis2010). However, experiments indicate that this nonlinearity typically occurs at wind speeds above the maximum observed in our data, which is 13.3 m s−1. Another possible explanation for the decoupling between winds and drifter velocity could be the transfer of kinetic energy from wind to the ocean, generating wind-driven currents. Yet this would require a high positive correlation between high wind speeds and surface currents, absent in our data (Fig. S7). A further consideration is the difference in spatial resolution between the atmospheric and hydrodynamic models. While this discrepancy might seem relevant, the Rossby radius of the ocean remains much smaller than the synoptic scale, meaning the resolution difference likely has little effect on drifter velocity parameterisation.

4.1.2 Residual drifter velocity

By calculating the drifter residual velocity using Eq. (2), we effectively filter out dominant high-frequency ocean surface currents (i.e., those shorter than the M2 and S2 tidal frequencies), allowing for a more focused analysis of the underlying net transport mechanisms. As seen in Fig. 4, the random forest and support vector regression models in that case assign the highest importance to the parallel components of the wind, Stokes drift velocity, and low-pass filtered ocean surface currents.

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f04

Figure 4As in Fig. 2 but for the random forest and support vector regression models predicting the zonal (Ũd, blue) and meridional (Ṽd, red) residual drifter velocity components.

Download

In the meridional direction, the wind V10 (RMSE increase of 0.10 m s−1 in random forest, 0.45 m s−1 in support vector regression) and the low-pass current VoLP (0.07 m s−1, 0.30 m s−1 respectively) are the most important features in both models for the drifter residual meridional velocity Ṽd. Yet, the contribution from the meridional Stokes drift Vs is only significant in the random forest model compared to the support vector regression (0.05, 0.01 m s−1). This disagreement could stem from the definition of the permutation feature importance. A feature can only imply a meaningful importance if it is not strongly correlated with other features that also influence the target (Ewald et al.2024). We find that meridional Stokes drift Vs and meridional wind V10 are highly correlated (Spearman correlation coefficient ρ=0.92 (Spearman1904); see Fig. S7). The reason this is only captured by the random forest model and not the support vector regression is that in random forest models, correlated features can also replace each other if randomly left out by mtry for splitting. Instead, support vector regression performs a general transformation of feature space, where correlated features go into the same dimension and hence are less sensitive to data points further away from the general distribution.

In the zonal direction, the wind U10 (RMSE increase of 0.11 m s−1, 0.53 m s−1) and the Stokes drift Us (0.06 m s−1, 0.18 m s−1) are the most important features for the zonal drifter residual velocity Ũd models. The contribution of the low-pass ocean currents UoLP is also comparable (0.06 m s−1, 0.11 m s−1).

From both the residual zonal and meridional drifter velocity models, we also find high permutation feature importance of variables that are not aligned with the velocity component, such as V10 for Ũd (RMSE increase of 0.02 m s−1, 0.09 m s−1) and Hs1stswell,x for Ṽd (0.09 m s−1 in the support vector regression model).

Residual drifter velocity ALE plots (Fig. 5) show analogous dependence to the wind speed as the total drifter velocity: a linear regime for low speeds but plateauing at higher speeds (Fig. S6). Low-pass currents likewise show a linear relationship with the parallel residual velocity components where data density is high for speeds <0.10 m s−1 and saturation effects at the extremes of the distribution. Furthermore, we find that Stokes drift influences residual velocity noticeably above 0.05 m s−1. Below this velocity threshold, the residual velocity remains largely unaffected, and the Stokes drift contribution to surface drifter transport is minimal. This suggests that in such a regime, the otherwise small influence of swell-induced Stokes drift may become relatively more significant and should not be neglected. At higher Stokes drift values, the drifter residual velocity increases approximately linearly in the zonal direction. A similar trend is observed in the meridional component, although with greater uncertainty due to the limited number of observations at high Stokes drift speed.

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f05

Figure 5Accumulated Local Effect (ALE) plots for the zonal (blue, left y axis) and meridional (red, right y axis) residual drifter velocity as a function of the corresponding parallel components of the (a) low-pass ocean currents and (b) Stokes drift modelled by a random forest model. The shaded area represents the 95 % CI across 100 bootstrapped samples. The top histograms show feature distributions of the data. Negative velocity values for the zonal component represent westward motion, while negative meridional velocities represent southward motion.

Download

4.2 Prediction of drifter trajectories

To evaluate the predictive skill of each model, we apply a leave-one-drifter-out strategy (see Sect. 3.3). The models are evaluated over a fixed prediction period of 60 d, after which the linear regression model typically predicts beaching. Repeating this process for each drifter yields 12 simulated trajectories per model. All the resulting trajectories succeed in reproducing tidal oscillations along the trajectory and large-scale patterns (e.g. the loop near 4° E longitude in Fig. 6).

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f06

Figure 6Example of a reconstructed drifter trajectory of over 60 d using linear regression (yellow), random forest (green) and support vector regression (blue) models using an integration timestep of dt=60 s. The true measured drifter trajectory is shown in black. The wind slip coefficients used for this linear regression model are γx=1.34 % and γy=1.64 % in the zonal and meridional direction, respectively.

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f07

Figure 7Mean cumulative separation distances using linear regression (yellow), random forest (green), and support vector regression (blue). Each model is trained on 11 trajectories and tested on the remaining one, with the process iterated so that each drifter is used once as the test set (leave-one-drifter-out cross-validation). Individual results for each drifter are shown as scatter points along with a median indicator.

Download

Comparing the mean cumulative distance of reconstructed trajectories from drifter velocity predictions in Fig. 7 demonstrates the advantages of using machine learning algorithms for prediction over linear regression for long-term predictions. Random forest achieves the lowest deviation from observations with D=10.8 km and an interquartile range of all distances (IQR) of 3.2 km, indicating superior accuracy and consistency across the test data. The support vector regression model shows a lower performance, with a median cumulative separation distance of D=14.1 km (IQR = 2.3 km), potentially due to the fact that this model does not model high-order interactions by sub-partitioning the dataset into small sections like the random forest does. However, there is a risk of over-adaptation of the random forest model to the current training data; hence, the advantage of comparing the results from both models. Meanwhile, the linear regression model has the poorest performance with a median of D=27.2 km and a higher IQR of 5.6 km, indicating it is highly sensitive to the training data. The comparison of model performances using the Liu–Weisberg skill score also captures the superior performance of the machine learning models compared to the linear regression model, assigning a skill score of 0.64±0.10 to the random forest model, 0.55±0.09 to the support vector regression model, and 0.10±0.16 to the linear regression (Fig. S8). Deviations between the predicted and observed trajectories likely arise from biases in the training data caused by strong spatial correlations. As a result, (minor) velocity prediction errors can accumulate during integration, causing the simulated trajectories to diverge into regions not adequately represented in the training set.

The time evolution analysis of the differences between the observed and modelled trajectories from each of the models reveals that linear regression outperforms machine learning models for time scales smaller than 4 d (Fig. 8). After that onset time, the cumulative separation distance with respect to the observations increases over time for the linear regression predictions, while the error from the machine learning models remains constant, yielding a lower cumulative separation distance considering the entire trajectory. In the reconstructed trajectories of the drifters (e.g. Fig. 6), we observe that the linear regression model overestimates the zonal displacement, which could be caused by the fact that this model does not account for the decrease in the zonal wind contribution for higher wind speeds seen in the random forest ALE plots (Fig. 3).

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f08

Figure 8Time series of the cumulative separation distances using linear regression (yellow), random forest (green), and support vector regression (blue). Each model is trained on 11 trajectories and tested on the remaining one, with the process iterated so that each drifter is used once as the test set (leave-one-drifter-out cross-validation). Individual results for each drifter are shown as shaded lines, along with the median time series across drifters shown as solid lines. The small panel shows a close-up view of the first week since release.

Download

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f09

Figure 9Mean cumulative separation distance metric applied to modelled trajectories from random forest models of the total drifter velocity with different sets of features: original model (green), including Flipping Index (grey), including latitude and longitude (magenta), and including bathymetry (dark purple). Each model is trained on 11 trajectories and tested on the remaining one, with the process iterated so that each drifter is used once as the test set (leave-one-drifter-out cross-validation). Individual results for each drifter are shown as scatter points along with a median indicator.

Download

Additionally, we apply the drifter trajectory model and evaluation procedure described in Sect. 3.3 to assess the effect of incorporating non-dynamical variables. Among the tested features, only the inclusion of the depth of the water column yields an improvement in velocity prediction (Fig. 9). As expected, adding spatial coordinates such as latitude and longitude does not enhance model performance because the absolute location is not a relevant property, as it does not carry any physical meaning in a non-stationary flow. The parameterisation of wave-surfing transport via the Flipping Index results in a model with the same median predictive skill across drifter samples as the original random forest, but with somewhat reduced variance in prediction error. The random forest model trained to predict this index from hydrodynamic and atmospheric variables reveals a high permutation importance for wind speed and Stokes drift (Figs. E3 and E4), highlighting their dominant role in the mechanisms associated with flipping events and, by extension, wave-driven transport.

4.2.1 Linear models with alternative structure

Furthermore, we explore how to improve the predictive performance of the linear regression of the total drifter velocity using insights from the machine learning models. From the results of the ALE plots, the zonal and meridional drifter velocity in our data shows a sigmoid-like shape as a function of the zonal wind. Hence, we test a total drifter velocity linear regression model where the contribution of each wind component is modelled by a sigmoid function of its velocity, so that

(10) u d = u o + u s + g u w + ε for g ( ζ ) = a 1 + e - b ζ - ζ 0 ,

yielding a=0.27 m s−1, b=0.31 m s−1, and ζ0=1.79 m s−1 (R2=0.28, RMSE = 0.11 m s−1, MAE = 0.07 m s−1) as best-fit parameters in the zonal direction and a=1.82 m s−1, b=0.04 m s−1, and ζ0=12.2 m s−1 (R2=0.40, RMSE = 0.08 m s−1, MAE = 0.07 m s−1) in the meridional. These fits improve upon the original linear model by reducing its bias, nearly halving the MAE.

The alternative approach using the relative wind yields best-fit coefficients of γx=1.39 % (R2=0.28, RMSE = 0.11 m s−1, MAE = 0.11 m s−1), and γy=1.66 % (R2=0.41, RMSE = 0.08 m s−1, MAE = 0.10 m s−1), also showing a small improvement in the fix with respect to the original linear model.

Following the method previously described for reconstructing the trajectories of the drifters from different models, we compare the predictive accuracy of the three linear regression models. From the resulting mean cumulative separation distance across drifter samples, we find that the sigmoid function parametrisation improves the linear regression predictions with D=15.8 km (IQR = 3.7 km) (Fig. 10). The relative wind parametrisation also shows an improvement in the predictions with D=25.8 km (IQR = 3.3 km). However, the analysis of the cumulative separation distance over time reveals that these differences between sigmoid and linear functions of the wind only emerge beyond 24 h after release (Fig. S10).

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f10

Figure 10Mean cumulative separation distance metric applied to modelled trajectories using linear regression models with the original wind formulation (yellow) as per Eq. (3), the relative wind parametrisation (fuchsia) using Eq. (4), and the alternative sigmoid function of the wind (teal) from Eq. (10). Each model is trained on 11 trajectories and tested on the remaining one, with the process iterated so that each drifter is used once as the test set (leave-one-drifter-out cross-validation). Individual results for each drifter are shown as scatter points along with a median indicator.

Download

These findings suggest that the linear parameterisation of the wind contribution to drifter velocities, while effective at shorter timescales, may oversimplify the underlying physics. Indeed, this linear form resembles the functional form derived by theoretical studies using the Maxey-Riley framework for buoyant spherical (Beron-Vera et al.2019) and non-spherical (Wagner et al.2022) particles. However, as noted by Bos et al. (2025), the particle Reynolds numbers for the surface drifters used in this study in the southern North Sea are above the Stokes drag regime. Therefore, it is important to consider that air and water have different viscosities when determining the functional form of the wind contribution to drifter velocity, which may no longer be linear.

4.2.2 Generalisability of the prediction results

We confirmed above, using a leave-one-drifter-out validation strategy, that machine learning models outperform linear equations in predicting drifters in the North Sea. To test the transferability of those models into other regions, we use an additional data set from a drifter campaign in the Tyrrhenian Sea that started on 26 June 2025. Six surface drifters from MetOcean (2020) were deployed off the coast of Napoli. One stopped transmitting after 1 d, while the remaining five drifted for at least 40 d. Full trajectories of these drifters are provided in Appendix F. We apply the North-Sea-trained models to predict the Tyrrhenian Sea drifter trajectories using the method and evaluation metrics described in Sect. 3.3. As predictor data, we use data from a coupled ocean–waves model of the Mediterranean Sea (Clementi et al.2023) and the same atmospheric model described in Sect. 2.2. We also test the generalisability of the leeway method by applying the same windage coefficients derived from the North Sea data.

Trajectories were reconstructed over 48 h, with predictions iteratively repeated along the full drifter trajectory to obtain 20 predictions per drifter. The choice of a 48 h prediction window is motivated by the aim of comparing its performance with that reported in other studies with different methodologies: Dagestad and Röhrs (2019) reported CODE and iSphere drifter trajectories in the Norwegian coast using a physics-based linear model with different hydrodynamic models and Stokes drift configurations, while Grossi et al. (2025) builds an artificial neural network for CODE drifter trajectory predictions in the Gulf of Mexico based solely on previous latitude and longitude data.

Table 2 presents a comparison of the evaluation metrics reported in the literature with those obtained in the current study. Among the models tested, the random forest achieves the highest predictive performance, followed closely by the support vector regression, while the linear regression performs slightly less well. Overall, all models demonstrate a predictive skill after 48 h consistent with previous studies. Figure G1 shows the predicted trajectory of a single drifter in the Tyrrhenian Sea to illustrate the results.

Dagestad and Röhrs (2019)Grossi et al. (2025)

Table 2Evaluation metrics of the predictive performance of different models reproducing 5 surface drifter trajectories in the Tyrrhenian Sea: mean cumulative separation distance (D), RMSE error between observations and predictions, separation distance between observation and prediction after 48 h, and skill score. Machine learning models from this work are trained on the North Sea drifter dataset, and the windage coefficient for the linear regression is tuned for this data. The reported errors are the interquartile range (IQR).

Download Print Version | Download XLSX

These findings indicate that the machine learning models trained on the North Sea data can largely reproduce the dynamics in the Tyrrhenian Sea despite the differences in the ocean dynamics between the two regions. This suggests a low degree of overfitting and over-adaptation to the specific conditions of the data of the campaign from the North Sea. The North Sea is a tidally dominated region, while the Tyrrhenian Sea is part of the Mediterranean Sea, a semi-enclosed basin with a thermohaline and wind-driven circulation with eddies as regular features (Rinaldi et al.2010; Buffett et al.2017).

5 Conclusions

This study analyses the effect of near-surface ocean currents, wave-induced motions, and wind drag on the trajectories of ultra-thin surface drifters to gain insight into the transport of buoyant objects at the ocean surface. We follow a data-driven approach and regress drifter velocity against instantaneous hydrodynamic and atmospheric conditions, and compare the established linear leeway model with two fundamentally different machine learning algorithms in terms of both inference of predominant forcing mechanisms and trajectory prediction performance.

At first order, machine learning models indicate that wind and ocean currents are linearly related to drifter velocity and are the most important features explaining the variability in velocity. These findings align with previous observational studies using leeway formulations (Breivik et al.2011; Dominicis et al.2016) and theoretical predictions from the Maxey–Riley framework (Beron-Vera et al.2019). Stokes drift also contributes notably for low values of the residual (non-tidal) drifter velocity.

Non-linear behaviour emerges under strong wind, as revealed by the ALE plots of the drifter velocity (Fig. 3). Incorporating this insight, we improve trajectory predictions by using a quasi-linear model with a non-linear wind term. While linear approaches remain advantageous in operational oceanography due to their simplicity and computational efficiency, we propose a hybrid framework that utilises interpretable machine learning methods to reveal functional relationships between drifter velocity and environmental forcing. These insights can guide the formulation of more accurate linear parameterisations.

When evaluating trajectory predictions from integrated modelled zonal and meridional velocities, the linear model performs reasonably well for predictions of less than 4 d but accumulates bias over longer periods. In contrast, machine learning models, especially the random forest, consistently outperform the linear baseline (Fig. 8). This suggests that more complex models might be needed to extend the forecasting horizon.

Feature engineering analysis shows that incorporating additional physically relevant information, such as water depth or a parameterisation of wave-surfing effects, further improves random forest performance. Meanwhile, adding non-physical features, such as longitude and latitude, degrades predictions. Finally, we test the generalisability of the models to a region with markedly different ocean dynamics, the Tyrrhenian Sea. Performing 48h-reconstructions of surface drifter trajectories demonstrates a predictive skill comparable to other state-of-the-art studies, indicating low overfitting to the North Sea training data and reinforcing the physical conclusions of the near-surface ocean dynamics of this study.

Appendix A: Estimation of spatial coordinate errors

We estimate the uncertainty in the spatial coordinates reported by the GPS system in the drifters from the errors in the distribution of measurements during an experiment. We position Stokes drifters, identical to the ones used in this study, over a flat surface on land. A total of 84 coordinate data points were measured from each drifter, derived from two distinct measurement rounds. The first round comprised a 24 h cycle with a 30 min transmission frequency, and the second was a 3 h cycle utilising a 5 min frequency (Schneiter and van Sebille2023). For each drifter, we compute the deviation of the longitude and latitude measurements with respect to their mean during the combined measuring periods and approximate their density distribution to continuous using the Kernel Density Estimation method. The resulting curves can be observed in Fig. A1, which highlights the mean standard deviation of the measurement distribution across drifters: 8.4 m in the latitudinal direction, and 6.5 m in the longitudinal direction.

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f11

Figure A1Distribution of the deviation of (a) longitude and (b) latitude measurements with respect to their mean from 24 colour-coded stationary surface drifters. Data were collected during a 24 h cycle (30 min frequency) and a 3 h cycle (5 min frequency). The distributions are estimated using Kernel Density Estimation (KDE). The mean standard error σ across all drifter distributions is included in the legend.

Download

Appendix B: Power spectral analysis of the drifters' velocity

We use power spectral analysis to identify the dominant tidal harmonic using two complementary techniques: Fast Fourier Transform (FFT) and Morlet Wavelet analysis. For FFT analysis, uniform time spacing of the measurements is required, so we perform the analysis to two different periods of time independently: from day 6–26, when the sampling period is 30 min, and from day 26 onwards, when the sampling period is 3 h. We also performed a Morlet Wavelet spectral analysis to investigate temporal variations in the frequency spectrum, as the time series spans more than one spring-neap tidal cycle (Meyers et al.1993). Unlike FFT, this approach does not require time resampling, allowing the detection of higher-frequency harmonics without compromising the integrity of the original temporal resolution. The analysis of the 5 min period (from the beginning of the time series to day 6) has not been included as the time interval between samples is more irregular (not exactly 300 s, with an average standard deviation of ±22 s). This irregular sampling complicates the alignment and averaging of results across drifters.

Apart from the predominant signals of the M2 and S2 tidal constituents, we also observe a weaker contribution from the high-frequency lunar tidal constituent M4 in the Morlet Wavelet graph when the sampling period satisfies Δt≤30 min (Fig. B1). This is due to the fact that for a sampling period of 3 h, the Nyquist period is 6 h, which closely matches the period of the M4 signal (6.2 h). Hence, during the period when the sampling frequency is coarsest, the time resolution of the observations is barely enough to detect this signal.

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f12

Figure B1Average power spectrum across all drifters' speed using Fast Fourier Transform (left panel) and Morlet Wavelet (right panel). The FFT shows two spectra corresponding to the analysis over a time period with sampling frequency of 30 min (blue) and 3 h (red) with their respective x axis. The Morlet Wavelet graph is a concatenation of the results for both periods, separated by a discontinued line The frequencies of the main tidal harmonics found in the German Bight region (M2, S2, M4) are highlighted in colours (yellow, purple, and light blue respectively) as well as the inertial frequency at 54° latitude (orange).

Download

Appendix C: Spatiotemporal block cross-validation strategy

We use a spatiotemporal block cross-validation strategy to mitigate the impact of temporal autocorrelation and spatial correlation in the dataset (Wadoux and Heuvelink2023). Data is first aligned in time and then segmented into blocks. Subsequently, a standard k-fold cross-validation approach is applied by splitting the shuffled blocks into five folds. During each iteration, four folds are used for training, and the remaining fold is used for validation, ensuring all blocks are eventually tested.

The duration of each block corresponds to the average autocorrelation time of the target variable across all drifters using the e-folding scheme. The autocorrelation functions are found by calculating the Pearson correlation at time lags ranging from 1–100 h using statsmodels Python package (Seabold and Perktold2010, version 0.14.2). The resulting correlograms are shown in Fig. C1, and the resulting autocorrelation times are summarised in Table C1.

Table C1Autocorrelation times for different drifter velocity components as a result of using e-folding scheme.

Download Print Version | Download XLSX

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f13

Figure C1Correlogram of the (a) total and (b) residual drifter velocity components across all drifters (zonal is shown in blue and meridional in red). The shaded region represents the standard deviation from the autocorrelation functions of each drifter velocity component.

Download

Appendix D: Support vector regression hyperparameters

Support vector regression models use kernel functions to transform the data into a higher-dimensional space. In this higher-dimensional space, the support vector regression algorithm attempts to find a hyperplane that best fits the data, while allowing for some deviations from the actual observations, controlled by the parameter ε, called the margin of tolerance (Hastie et al.2009). We choose the Radial Basis Function (RBF) kernel to build our models, which is commonly used for its ability to capture non-linear relationships between data points. The RBF kernel is defined as:

K(x,x)=e-γx-x2

where x-x2 is the squared Euclidean distance between two feature vectors x and x, and γ is a parameter that controls how much influence a single training point has (Pedregosa et al.2011). Additionally, the parameter C alters the decision surface's smoothness that fits the target data in the hyperplane. The best fit from the SVR models of the total and residual velocity components, and Flipping Index, is found for the hyperparameters included in Table D1 using GridSearchCV functionality (Pedregosa et al.2011).

Table D1Value of the hyperparameters C, γ, ε used in the Support Vector Regression models using a Radial Basis Function kernel to fit the total drifter zonal and meridional velocity (Ud, Vd), residual drifter zonal and meridional velocity (ŨdṼd).

Download Print Version | Download XLSX

Appendix E: Flipping Index model

We define a new metric named Flipping Index (F) to quantify the proportion of changes in the drifters' orientation or flips observed in subsequent measurements. To derive this index for each trajectory, the flips of the drifters are identified over time as the changes in the orientation signal, resulting in a binary variable f(t) that equals 1 if a flip is observed at time t and 0 otherwise. Then, these flips' time series are convolved with a sliding window of size n(t) that increases with the sampling frequency of the measurements. Hence, the Flipping Index is defined as:

(E1) F ( t , n ( t ) ) = i = - n ( t ) 2 n ( t ) 2 f ( t + i ) , n ( t ) = L Δ t ( t )

where L is the fixed length of the temporal window, and Δt(t) is the sampling frequency at time t, which increases along the drifters' trajectory. The Flipping Index is computed using a window size with L=3 h, which corresponds to the highest sampling frequency of the drifter dataset. This choice was based on an analysis of window sizes ranging from 1–8 h, which revealed only minor variations in the magnitude of the Flipping Index peaks (Fig. E1).

To ensure a continuous representation of the Flipping Index and reduce sensitivity to the non-uniform timestep, we apply a Gaussian smoothing filter. The standard deviation of the filter is set to σ=δt/2, where δt represents the mean time interval between consecutive flips. The index is subsequently normalised by the maximum number of flips observed across all drifters, resulting in a dimensionless measure ranging from 0 to 1. In this framework, a Flipping Index of 1 corresponds to 10 flips occurring within a window size with L=3 h. The Flipping Index is evaluated only for time periods with sampling frequency Δt≤30 min. At lower sampling frequencies, the number of detected flips is likely strongly underestimated, and the temporal resolution becomes insufficient to capture submesoscale variability (Essink et al.2022). Figure E2 presents the resulting time series of the mean Flipping Index across all drifters. The relatively small standard deviation indicates that drifters tend to flip simultaneously, which can be attributed to their spatial proximity and shared exposure to similar hydrodynamic conditions throughout most of their trajectories.

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f14

Figure E1Time series of the Flipping Index of a single example surface drifter using different choices of temporal window size ranging from 1–8 h. Close-up view between day 5, 12 h after release and day 8 shows small variations at the peak depending on the choice of the window size.

Download

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f15

Figure E2Time series of the Flipping Index from the 12 surface drifters (orange lines), along with the average across them (black line). The index is computed based on changes in drifter orientation relative to the ocean surface (i.e. flips) over a 26 d trajectory in the southern North Sea. Peaks indicate periods of increased flipping, likely caused by intense wave activity during storm conditions.

Download

To assess whether including non-physical variables improves the random forest model's predictive skill, we create an additional model that fits the Flipping Index for a given hydrodynamic and atmospheric conditions, and then include this index as a feature in the prediction of the total drifter velocity, which parametrises stormy conditions. However, only 28 % of the data points yield a non-zero Flipping index (i.e., it is a zero-inflated variable), so the standard random forest algorithm would have difficulties predicting these zeroes (Fig. S4). In order to solve this issue, we use a hurdle or two-step model that first creates a binary variable of the Flipping Index using a threshold, which we establish at F=0.05, and trains a random forest classifier to predict instances when the Flipping Index is non-zero (Prasad et al.2006). Then, we train a random forest regressor with the predicted non-zero Flipping Index data points to learn the relationship between the climate variables and the magnitude of the Flipping Index. The resulting permutation feature importance analysis reveals that most of the variance of the binary Flipping Index variable is explained by a combination of zonal wind, low-pass currents, and various wind-related variables (Fig. E3), while the highest importance for the model predicting the magnitude of this Flipping Index is assigned to the Stokes drift and the wind (Fig. E4).

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f16

Figure E3Permutation feature importance (RMSE increase) for a random forest classifier predicting the binary Flipping Index of the drifters in a hurdle model. Grey bars represent the mean RMSE increase over 10 random permutations of each feature, which are ordered by decreasing importance. Features are colour-coded by a shadow: blue indicates ocean current-related variables, beige corresponds to wind, and teal to wave parameters. Cross-validated model performance metrics (coefficient of determination R2, RMSE, and MAE) for each velocity component are shown in the legend.

Download

https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f17

Figure E4As in Fig. E3 for a random forest regressor predicting the magnitude of the Flipping Index of the drifters in a hurdle model for non-zero predictions from the classifier.

Download

This variable is then used in the test case as a feature to train a random forest model to predict the trajectories of the drifters. For a new location of the predicted drifter, this trained Flipping Index model takes as input the interpolated hydrodynamic and atmospheric features and predicts the amount of flipping at that location. Then, this information is also included as input for the total drifter velocity model that predicts the velocity of the drifter used in the advection scheme.

Appendix F: Drifter trajectories in the Tyrrhenian Sea
https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f18

Figure F1Trajectories over 49 d of 5 colour-coded surface drifters in the Tyrrhenian Sea. Drifters were deployed on 26 June 2025 in the Gulf of Naples in three different clusters spaced 1 km apart. Starting and ending positions are marked with stars and triangles, respectively. Background colourmap shows the bathymetry of the Tyrrhenian Sea from the Mediterranean Ocean Physics Analysis and Forecast model with a horizontal resolution of 0.042° (Clementi et al.2023). The study site location in the Tyrrhenian Sea is highlighted by a red rectangle on the orthogonal projection of the Northern Hemisphere in the bottom right corner.

Appendix G: Prediction results of trajectories in the Tyrrhenian Sea
https://os.copernicus.org/articles/22/49/2026/os-22-49-2026-f19

Figure G1Example of an iterated reconstructed surface drifter trajectory with a forecasting window of 48 h using linear regression (yellow), random forest (green) and support vector regression (blue) models using an integration timestep of dt=60 s. The true measured drifter trajectory is shown in a black solid line. The wind slip coefficients used for this linear regression model are γx=1.34 % and γy=1.64 % in the zonal and meridional direction, respectively. The beginning of each new prediction period is marked by a solid black dot.

Code and data availability

Code used to conduct the experiment is available at https://doi.org/10.5281/zenodo.17975392 (Medina Rubio2025a). Larger-sized machine learning models are stored in https://doi.org/10.5281/zenodo.17901303 (Medina Rubio2025b) (random forest models) and https://doi.org/10.5281/zenodo.17901907 (Medina Rubio2025c) (support vector regression models). Drifter data in the North Sea is available at https://doi.org/10.5281/zenodo.14198921 (van Sebille2024). Drifter data in the Tyrrhenian Sea is available at https://doi.org/10.5281/zenodo.17293098 (van Sebille2025).

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/os-22-49-2026-supplement.

Author contributions

JMR designed and conducted the study, with steering and discussion from EvS, TvdB, and MN. JMR wrote the manuscript. All co-authors contributed to the revision of the manuscript.

Competing interests

At least one of the (co-)authors is a member of the editorial board of Ocean Science. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

We would like to thank the team at IMAU who helped with the deployment of the drifters, including Bas Altena, Meike Bos, Michael Denes, Claudio Pierard, Daan Reijnders, Marc Schneiter, Jelle Soons, Margo van Asschenbergh, and Anna van Herwijnen.

Financial support

This research has been supported by the Nederlandse Organisatie voor Wetenschappelijk Onderzoek, Exacte en Natuurwetenschappen (VI.C.222.025) as part of the project “Tracing Marine Macroplastics by Unraveling the Ocean's Multiscale Transport Processes”. The Stokes drifters were purchased with support from the Paying-it-forward campaign of the Utrecht University Fund.

Review statement

This paper was edited by Bernadette Sloyan and reviewed by two anonymous referees.

References

Aksamit, N. O., Sapsis, T., and Haller, G.: Machine-Learning Mesoscale and Submesoscale Surface Dynamics from Lagrangian Ocean Drifter Trajectories, J. Phys. Oceanogr., 50, 1179–1196, https://doi.org/10.1175/JPO-D-19-0238.1, 2020. a

Allen, A. A.: Leeway Divergence, Tech. Rep. CG-D-05-05, US Coast Guard Research and Development Center, https://apps.dtic.mil/sti/pdfs/ADA435435.pdf (last access: 18 December 2025), 2005. a

Apley, D. W. and Zhu, J.: Visualizing the Effects of Predictor Variables in Black Box Supervised Learning Models, J. Roy. Stat. Soc. Ser. B, 82, 1059–1086, https://doi.org/10.1111/rssb.12377, 2020. a, b

Apple Inc.: AirTag – Technical Specifications, Apple Inc., https://support.apple.com/en-us/111847 (last access: 18 December 2025), 2021. a

Behrens, T., Schmidt, K., MacMillan, R. A., and Viscarra Rossel, R. A.: Multiscale contextual spatial modelling with the Gaussian scale space, Geoderma, 310, 128–137, https://doi.org/10.1016/j.geoderma.2017.09.015, 2018. a

Beron-Vera, F. J., Olascoaga, M. J., and Miron, P.: Building a Maxey–Riley framework for surface ocean inertial particle dynamics, Phys. Fluids, 31, 096602, https://doi.org/10.1063/1.5110731, 2019. a, b, c

Bishop, C. M.: Pattern recognition and machine learning, Information science and statistics, Springer, New York, ISBN 978-0-387-31073-2, 2006. a

Bos, M., Rypina, I. I., Pratt, L., and Van Sebille, E.: The Maxey-Riley-Gatignol equations for macroplastics in the North West European Shelf region, in preparation, 2025. a

Bracco, A., Brajard, J., Dijkstra, H. A., Hassanzadeh, P., Lessig, C., and Monteleoni, C.: Machine learning for the physics of climate, Nat. Rev. Phys., 7, 6–20, https://doi.org/10.1038/s42254-024-00776-3, 2025. a

Breiman, L.: Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author), Stat. Sci., 16, 199–231, https://doi.org/10.1214/ss/1009213726, 2001. a, b, c

Breivik, Ø., Allen, A. A., Maisondieu, C., and Roth, J. C.: Wind-induced drift of objects at sea: The leeway field method, Appl. Ocean Res., 33, 100–109, https://doi.org/10.1016/j.apor.2011.01.005, 2011. a, b

Breivik, Ø., Allen, A., Maisondieu, C., and others: Advances in search and rescue at sea, Ocean Dynam., 63, 83–88, https://doi.org/10.1007/s10236-012-0581-1, 2013.  a

Breivik, Ø., Bidlot, J.-R., and Janssen, P.: A Stokes drift approximation based on the Phillips spectrum, Ocean Model., 100, 49–56, https://doi.org/10.1016/j.ocemod.2016.01.005, 2016. a

Bruciaferri, D., Tonani, M., Lewis, H., Siddorn, J., Saulter, A., Castillo, J., Garcia Valiente, N., Conley, D., Sykes, P., Ascione, I., and McConnell, N.: The Impact of Ocean-Wave Coupling on the Upper Ocean Circulation During Storm Events, J. Geophys. Res.-Oceans, 126, https://doi.org/10.1029/2021JC017343, 2021. a, b

Buffett, G. G., Krahmann, G., Klaeschen, D., Schroeder, K., Sallarès, V., Papenberg, C., Ranero, C. R., and Zitellini, N.: Seismic Oceanography in the Tyrrhenian Sea: Thermohaline Staircases, Eddies, and Internal Waves, J. Geophys. Res.-Oceans, 122, 8503–8523, https://doi.org/10.1002/2017JC012726, 2017. a

Callies, U., Groll, N., Horstmann, J., Kapitza, H., Klein, H., Maßmann, S., and Schwichtenberg, F.: Surface drifters in the German Bight: model validation considering windage and Stokes drift, Ocean Sci., 13, 799–827, https://doi.org/10.5194/os-13-799-2017, 2017. a, b

Calvert, R., Peytavin, A., Pham, Y., Duhamel, A., van der Zanden, J., van Essen, S. M., Sainte-Rose, B., and van den Bremer, T. S.: A Laboratory Study of the Effects of Size, Density, and Shape on the Wave-Induced Transport of Floating Marine Litter, J. Geophys. Res.-Oceans, 129, e2023JC020661, https://doi.org/10.1029/2023JC020661, 2024. a

Calzada, A., Delgado, I., Ramos, C., Pérez, F., Reyes, D., Carracedo, D., Rodríguez, A., Chang, D., Cabrales, J., and Lobaina, A.: Lagrangian Model PETROMAR-3D to Describe Complex Processes in Marine Oil Spills, Open J. Mar. Sci., 11, 17–40, https://doi.org/10.4236/ojms.2021.111002, 2021. a

Clementi, E., Drudi, M., Aydogdu, A., Moulin, A., Grandi, A., Mariani, A., Goglio, A., Pistoia, J., Miraglio, P., Lecci, R., Palermo, F., Coppini, G., Masina, S., and Pinardi, N.: Mediterranean Sea Physical Analysis and Forecast (CMEMS MED-Physics, EAS8 system), cmcc, https://doi.org/10.25423/CMCC/MEDSEA_ANALYSISFORE, 2023. a, b

Copernicus Marine Service: Atlantic – European North West Shelf – Ocean Physics Analysis and Forecast, Marine Data Store (MDS) [data set] , https://doi.org/10.48670/moi-00054, 2024a. a

Copernicus Marine Service: Atlantic – European North West Shelf – Ocean Wave Analysis and Forecast, Marine Data Store (MDS) [data set], https://doi.org/10.48670/moi-00055, 2024b. a

Cortes, C. and Vapnik, V. N.: Support-Vector Networks, Mach. Learn., 20, 273–297, 1995. a

Cunningham, H. J., Higgins, C., and van den Bremer, T. S.: The Role of the Unsteady Surface Wave-Driven Ekman–Stokes Flow in the Accumulation of Floating Marine Litter, J. Geophys. Res.-Oceans, 127, e2021JC018106, https://doi.org/10.1029/2021JC018106, 2022. a

Dagestad, K.-F. and Röhrs, J.: Prediction of ocean surface trajectories using satellite derived vs. modeled ocean currents, Remote Sens. Environ., 223, 130–142, https://doi.org/10.1016/j.rse.2019.01.001, 2019. a, b

Davis, R. E., Dufour, J. E., Parks, G. J., and Perkins, M. R.: Two Inexpensive Current-Following Drifters, Scripps Institution of Oceanography, La Jolla, CA, Ref. 82-28, 55 pp., 1982. a

Delandmeter, P. and van Sebille, E.: The Parcels v2.0 Lagrangian framework: new field interpolation schemes, Geosci. Model Dev., 12, 3571–3584, https://doi.org/10.5194/gmd-12-3571-2019, 2019. a

Deyle, L., Badewien, T. H., Wurl, O., and Meyerjürgens, J.: Lagrangian surface drifter observations in the North Sea: an overview of high-resolution tidal dynamics and surface currents, Earth Syst. Sci. Data, 16, 2099–2112, https://doi.org/10.5194/essd-16-2099-2024, 2024. a

Dominicis, M. D., Bruciaferri, D., Gerin, R., Pinardi, N., Poulain, P. M., Garreau, P., Zodiatis, G., Perivoliotis, L., Fazioli, L., Sorgente, R., and Manganiello, C.: A multi-model assessment of the impact of currents, waves and wind in modelling surface drifters and oil spill, Deep-Sea Res. Pt. II, 133, 21–38, https://doi.org/10.1016/j.dsr2.2016.04.002, 2016. a, b

Drucker, H., Burges, C. J., Kaufman, L., Smola, A., and Vapnik, V.: Support vector regression machines, Adv. Neural Inf. Process. Syst., 9, 155–161, 1996. a

Duhaut, T. H. A. and Straub, D. N.: Wind Stress Dependence on Ocean Surface Velocity: Implications for Mechanical Energy Input to Ocean Circulation, J. Phys. Oceanogr., 36, 202–211, https://doi.org/10.1175/JPO2842.1, 2006. a

Ekman, V.: On the Influence of the Earth's Rotation on Ocean Currents, Arkiv för Matematik, Astronomy Och Fysik, 2, 1–53, 1905. a

Elipot, S., Lumpkin, R., Perez, R. C., Lilly, J. M., Early, J. J., and Sykulski, A. M.: A global surface drifter data set at hourly resolution, J. Geophys. Res.-Oceans, 121, 2937–2966, https://doi.org/10.1002/2016JC011716, 2016. a

Essink, S., Hormann, V., Centurioni, L. R., and Mahadevan, A.: On Characterizing Ocean Kinematics from Surface Drifters, J. Atmos. Ocean. Tech., 39, 1183–1198, https://doi.org/10.1175/JTECH-D-21-0068.1, 2022. a

Ewald, F. K., Bothmann, L., Wright, M. N., Bischl, B., Casalicchio, G., and König, G.: A Guide to Feature Importance Methods for Scientific Inference, in: Explainable Artificial Intelligence, Springer Nature Switzerland, 440–464, ISBN 978-3-031-63797-1, https://doi.org/10.1007/978-3-031-63797-1_22, 2024. a, b

Fajardo-Urbina, J. M., Liu, Y., Georgievska, S., Gräwe, U., Clercx, H. J. H., Gerkema, T., and Duran-Matute, M.: Efficient deep learning surrogate method for predicting the transport of particle patches in coastal environments, Mar. Pollut. Bull., 209, 117251, https://doi.org/10.1016/j.marpolbul.2024.117251, 2024. a

Faraway, J.: Linear Models with R, Chapman & Hall/CRC Texts in Statistical Science, CRC Press, ISBN 978-1-040-27585-6, 2025. a

Foreman, R. J. and Emeis, S.: Revisiting the Definition of the Drag Coefficient in the Marine Atmospheric Boundary Layer, J. Phys. Oceanogr., 40, 2325–2332, https://doi.org/10.1175/2010JPO4420.1, 2010. a, b

Friedman, J. H.: Greedy function approximation: A gradient boosting machine, Ann. Statist., 29, 1189–1232, https://doi.org/10.1214/aos/1013203451, 2001. a

Geurts, P., Ernst, D., and Wehenkel, L.: Extremely randomized trees, Mach. Learn., 63, 3–42, https://doi.org/10.1007/s10994-006-6226-1, 2006. a

Grimaldi, C. M., Lowe, R. J., Benthuysen, J. A., Cuttler, M. V. W., Green, R. H., Radford, B., Ryan, N., and Gilmour, J.: Hydrodynamic drivers of fine-scale connectivity within a coral reef atoll, Limnol. Oceanogr., 67, 2204–2217, https://doi.org/10.1002/lno.12198, 2022. a

Grossi, M. D., Jegelka, S., Lermusiaux, P. F. J., and Özgökmen, T. M.: Surface drifter trajectory prediction in the Gulf of Mexico using neural networks, Ocean Model., 196, 102543, https://doi.org/10.1016/j.ocemod.2025.102543, 2025. a, b, c

Haram, L. E., Carlton, J. T., Centurioni, L., Choong, H., Cornwell, B., Crowley, M., Egger, M., Hafner, J., Hormann, V., Lebreton, L., Maximenko, N., McCuller, M., Murray, C., Par, J., Shcherbina, A., Wright, C., and Ruiz, G. M.: Extent and reproduction of coastal species on plastic debris in the North Pacific Subtropical Gyre, Nat. Ecol. Evol., 7, 687–697, https://doi.org/10.1038/s41559-023-01997-y, 2023. a

Hastie, T., Tibshirani, R., and Friedman, J.: The elements of statistical learning: data mining, inference and prediction, 2nd edn., Springer, ISBN 978-0-387-84884-6, 2009. a, b

Haza, A. C., D'Asaro, E., Chang, H., Chen, S., Curcic, M., Guigand, C., Huntley, H. S., Jacobs, G., Novelli, G., Özgökmen, T. M., Poje, A. C., Ryan, E., and Shcherbina, A.: Drogue-Loss Detection for Surface Drifters during the Lagrangian Submesoscale Experiment (LASER), J. Atmos. Ocean. Tech., 35, 705–725, https://doi.org/10.1175/JTECH-D-17-0143.1, 2018. a

Haza, A. C., Paldor, N., Özgökmen, T. M., Curcic, M., Chen, S. S., and Jacobs, G.: Wind-Based Estimations of Ocean Surface Currents From Massive Clusters of Drifters in the Gulf of Mexico, J. Geophys. Res.-Oceans, 124, 5844–5869, https://doi.org/10.1029/2018JC014813, 2019. a

Hengl, T., Nussbaum, M., Wright, M. N., Heuvelink, G. B., and Gräler, B.: Random forest as a generic framework for predictive modeling of spatial and spatio-temporal variables, Peer J., 6, https://doi.org/10.7717/peerj.5518, 2018. a, b

Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J., Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., and Thépaut, J.-N.: ERA5 Hourly Data on Single Levels from 1940 to Present, Climate Data Store [data set], https://doi.org/10.24381/cds.adbb2d47, 2023. a, b

Jomar, D.: PyALE: ALE Plots with python, GitHub [code], https://github.com/DanaJomar/PyALE (last access: 18 December 2025), 2020. a

Jones, C. E., Dagestad, K.-F., Breivik, Ø., Holt, B., Röhrs, J., Christensen, K. H., Espeseth, M., Brekke, C., and Skrunes, S.: Measurement and modeling of oil slick transport, J. Geophys. Res.-Oceans, 121, 7759–7775, https://doi.org/10.1002/2016JC012113, 2016. a

Kaandorp, M., Lobelle, D., Kehl, C., Dijkstra, H. A., and van Sebille, E.: Global mass of buoyant marine plastics dominated by large long-lived debris, Nat. Geosci., 16, 689–694, https://doi.org/10.1038/s41561-023-01216-0, 2023. a

Kaandorp, M. L. A., Ypma, S. L., Boonstra, M., Dijkstra, H. A., and van Sebille, E.: Using machine learning and beach cleanup data to explain litter quantities along the Dutch North Sea coast, Ocean Sci., 18, 269–293, https://doi.org/10.5194/os-18-269-2022, 2022. a

Kopte, R., Becker, M., Holtermann, P., and Winter, C.: Tides, Stratification, and Counter Rotation: The German Bight ROFI in Comparison to Other Regions of Freshwater Influence, J. Geophys. Res.-Oceans, 127, e2021JC018236, https://doi.org/10.1029/2021JC018236, 2022. a

Kühn, S. and van Franeker, J. A.: Quantitative overview of marine debris ingested by marine megafauna, Mar. Pollut. Bull., 151, 110858, https://doi.org/10.1016/j.marpolbul.2019.110858, 2020. a

Laxague, N. J. M., Özgökmen, T. M., Haus, B. K., Novelli, G., Shcherbina, A., Sutherland, P., Guigand, C. M., Lund, B., Mehta, S., Alday, M., and Molemaker, J.: Relative dispersion at hte surface of the Gulf of Mexico, Geophys. Res. Lett., 45, 245–249, https://doi.org/10.1002/2017GL075891, 2018. a

Lenain, L. and Pizzo, N.: The Contribution of High-Frequency Wind-Generated Surface Waves to the Stokes Drift, J. Phys. Oceanogr., 50, 3455–3465, https://doi.org/10.1175/JPO-D-20-0116.1, 2020. a

Lindo-Atichati, D., Jia, Y., Wren, J. L. K., Antoniades, A., and Kobayashi, D. R.: Eddies in the Hawaiian Archipelago Region: Formation, Characterization, and Potential Implications on Larval Retention of Reef Fish, J. Geophys. Res.-Oceans, 125, e2019JC015348, https://doi.org/10.1029/2019JC015348, 2020. a

Lipton, Z. C.: The Mythos of Model Interpretability, arXiv [preprint], arXiv:1606.03490 [cs], https://doi.org/10.48550/arXiv.1606.03490, 2017. a

Liu, Y. and Weisberg, R. H.: Evaluation of trajectory modeling in different dynamic regions using normalized cumulative Lagrangian separation, J. Geophys. Res.-Oceans, 116, https://doi.org/10.1029/2010JC006837, 2011. a, b

Liu, Y., Weisberg, R. H., Vignudelli, S., and Mitchum, G. T.: Evaluation of altimetry-derived surface current products using Lagrangian drifter trajectories in the eastern Gulf of Mexico, J. Geophys. Res.-Oceans, 119, 2827–2842, https://doi.org/10.1002/2013JC009710, 2014. a

Lumpkin, R., Özgökmen, T., and Centurioni, L.: Advances in the Application of Surface Drifters, Annu. Rev. Mar. Sci., 9, https://doi.org/10.1146/annurev-marine-010816-060641, 2016. a

Manral, D., Bos, I., de Boer, M., and van Sebille, E.: Modelling drift of cold-stunned Kemp's ridley turtles stranding on the Dutch coast, Open Res. Eur., 4, 41, https://doi.org/10.12688/openreseurope.16913.3, 2024. a

Mato, Y., Isobe, T., Takada, H., Kanehiro, H., Ohtake, C., and Kaminuma, T.: Plastic Resin Pellets as a Transport Medium for Toxic Chemicals in the Marine Environment, Environ. Sci. Technol., 35, 318–324, https://doi.org/10.1021/es0010498, 2001. a

Medina Rubio, J.: jimena-medinarubio/ML_surface-drifters, Zenodo [code], https://doi.org/10.5281/zenodo.17975392, 2025a. a

Medina Rubio, J.: Random Forest models trained on surface drifter trajectories in the North Sea (April, 2024), Zenodo [code], https://doi.org/10.5281/zenodo.17901303, 2025b. a

Medina Rubio, J.: Support vector regression models trained on surface drifter trajectories in the North Sea (April, 2024), Zenodo [code], https://doi.org/10.5281/zenodo.17901907, 2025c. . a

MetOcean: iSphere, MetOcean, Dartmouth, Nova Scotia, https://metocean.com/products/isphere/ (last access: 18 December 2025), 2017. a

MetOcean: Stokes Drifter, MetOcean, Dartmouth, Nova Scotia, https://metocean.com/products/stokes-drifter/ (last access: 18 December 2025), 2020. a, b, c

Meyerjürgens, J., Badewien, T. H., Garaba, S. P., Wolff, J.-O., and Zielinski, O.: A State-of-the-Art Compact Surface Drifter Reveals Pathways of Floating Marine Litter in the German Bight, Front. Mar. Sci., 6, https://doi.org/10.3389/fmars.2019.00058, 2019. a

Meyers, S. D., Kelly, B. G., and O'Brien, J. J.: An Introduction to Wavelet Analysis in Oceanography and Meteorology: With Application to the Dispersion of Yanai Waves, Mon. Weather Rev., 121, 2858–2866, https://doi.org/10.1175/1520-0493(1993)121<2858:AITWAI>2.0.CO;2, 1993. a, b

Moerman, B., Breivik, Ø., Hole, L. R., Hope, G., Johannessen, J. A., and Rabault, J.: An analysis on OpenMetBuoy-v2021 drifter in-situ data and Lagrangian trajectory simulations in the Agulhas Current System, arXiv [preprint], https://doi.org/10.48550/arXiv.2409.20096, 2024. a

Molnar, C.: Interpretable Machine Learning: A Guide for Making Black Box Models Explainable, 2nd edn., https://christophm.github.io/interpretable-ml-book (last access: 18 December 2025), 2022. a, b, c, d

Morey, S. L., Wienders, N., Dukhovskoy, D. S., and Bourassa, M. A.: Measurement Characteristics of Near-Surface Currents from Ultra-Thin Drifters, Drogued Drifters, and HF Radar, Remote Sens., 10, 1633, https://doi.org/10.3390/rs10101633, 2018. a

Nooteboom, P. D., Bijl, P. K., van Sebille, E., von der Heydt, A. S., and Dijkstra, H. A.: Transport Bias by Ocean Currents in Sedimentary Microplankton Assemblages: Implications for Paleoceanographic Reconstructions, Paleoceanogr. Paleoclimatol., 34, 1178–1194, https://doi.org/10.1029/2019PA003606, 2019. a

Novelli, G., Guigand, C. M., Cousin, C., Ryan, E. H., Laxague, N. J. M., Dai, H., Haus, B. K., and Özgökmen, T. M.: A Biodegradable Surface Drifter for Ocean Sampling on a Massive Scale, J. Atmos. Ocean. Tech., 34, 2509–2532, https://doi.org/10.1175/JTECH-D-17-0055.1, 2017. a

Nussbaum, M., Spiess, K., Baltensweiler, A., Grob, U., Keller, A., Greiner, L., Schaepman, M. E., and Papritz, A.: Evaluation of digital soil mapping approaches with large sets of environmental covariates, SOIL, 4, 1–22, https://doi.org/10.5194/soil-4-1-2018, 2018. a

Olascoaga, M. J., Beron-Vera, F. J., Miron, P., Triñanes, J., Putman, N. F., Lumpkin, R., and Goni, G. J.: Observation and quantification of inertial effects on the drift of floating objects at the ocean surface, Phys. Fluids, 32, https://doi.org/10.1063/1.5139045, 2020. a

O'Malley, M., Sykulski, A. M., Lumpkin, R., and Schuler, A.: Probabilistic Prediction of Oceanographic Velocities with Multivariate Gaussian Natural Gradient Boosting, Environ. Data Sci., 2, e10, https://doi.org/10.1017/eds.2023.4, 2023. a, b

Otto, L., Zimmerman, J. T. F., Furnes, G. K., Mork, M., Saetre, R., and Becker, G.: Review of the physical oceanography of the North Sea, Neth. J. Sea Res., 26, 161–238, https://doi.org/10.1016/0077-7579(90)90091-T, 1990. a, b

Pärn, O., Davulienė, L., Macias Moy, D., Vahter, K., Stips, A., and Torsvik, T.: Effects of Eulerian current, Stokes drift and wind while simulating surface drifter trajectories in the Baltic Sea, Oceanologia, 65, 453–465, https://doi.org/10.1016/j.oceano.2023.02.001, 2023. a

Pawlowicz, R., Chavanne, C., and Dumont, D.: The Water-Following Performance of Various Lagrangian Surface Drifters Measured in a Dye Release Experiment, J. Atmos. Ocean. Tech., 41, 45–63, https://doi.org/10.1175/JTECH-D-23-0073.1, 2024. a

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.: Scikit-learn: Machine Learning in Python, J. Mach. Learn. Res., 12, 2825–2830, 2011. a, b, c

Pisano, A., De Dominicis, M., Biamino, W., Bignami, F., Gherardi, S., Colao, F., Coppini, G., Marullo, S., Sprovieri, M., Trivero, P., Zambianchi, E., and Santoleri, R.: An oceanographic survey for oil spill monitoring and model forecasting validation using remote sensing and in situ data in the Mediterranean Sea, Deep-Sea Res. Pt. II, 133, 132–145, https://doi.org/10.1016/j.dsr2.2016.02.013, 2016. a

Pizzo, N., Melville, W. K., and Deike, L.: Lagrangian Transport by Nonbreaking and Breaking Deep-Water Waves at the Ocean Surface, J. Phys. Oceanogr., 49, 983–992, https://doi.org/10.1175/JPO-D-18-0227.1, 2019. a

Prasad, A. M., Iverson, L. R., and Liaw, A.: Newer Classification and Regression Tree Techniques: Bagging and Random Forests for Ecological Prediction, Ecosystems, 9, 181–199, https://doi.org/10.1007/s10021-005-0054-1, 2006. a

Probst, P. and Boulesteix, A.-L.: To tune or not to tune the number of trees in random forest, J. Mach. Learn. Res., 18, 6673–6690, 2017. a

Rinaldi, E., Buongiorno Nardelli, B., Zambianchi, E., Santoleri, R., and Poulain, P.-M.: Lagrangian and Eulerian observations of the surface circulation in the Tyrrhenian Sea, J. Geophys. Res.- Oceans, 115, https://doi.org/10.1029/2009JC005535, 2010. a

Röhrs, J., Christensen, K. H., Hole, L. R., Broström, G., Drivdal, M., and Sundby, S.: Observation-based evaluation of surface wave effects on currents and trajectory forecasts, Ocean Dynam., 62, 1519–1533, https://doi.org/10.1007/s10236-012-0576-y, 2012. a

Röhrs, J., Sutherland, G., Jeans, G., Bedington, M., Sperrevik, A. K., Dagestad, K.-F., Gusdal, Y., Mauritzen, C., Dale, A., and LaCasce, J. H.: Surface Currents in Operational Oceanography: Key Applications, Mechanisms, and Methods, J. Oper. Oceanogr., 16, 60–88, https://doi.org/10.1080/1755876X.2021.1903221, 2021. a, b

Rühs, S., van den Bremer, T., Clementi, E., Denes, M. C., Moulin, A., and van Sebille, E.: Non-negligible impact of Stokes drift and wave-driven Eulerian currents on simulated surface particle dispersal in the Mediterranean Sea, Ocean Science, 21, 217–240, https://doi.org/10.5194/os-21-217-2025, 2025. a

Schneiter, M. and van Sebille, E.: Stokesdrifters_Wadden, GitHub [code], https://github.com/Parcels-code/Stokesdrifters_Wadden (last access: 17 October 2025), 2023. a

Seabold, S. and Perktold, J.: Statsmodels: Econometric and Statistical Modeling with Python, in: Proceedings of the 9th Python in Science Conference, edited by: v. d. Walt, S. and Millman, J., 92–96, https://doi.org/10.25080/Majora-92bf1922-011, 2010. a

Smola, A. J. and Schölkopf, B.: A Tutorial on Support Vector Regression, Stat. Comput., 14, 199–222, https://doi.org/10.1023/B:STCO.0000035301.49549.88, 2004. a

Spearman, C.: The Proof and Measurement of Association between Two Things, Am. J. Psychol., 15, 72–101, 1904. a

Staneva, J., Ricker, M., Carrasco Alvarez, R., Breivik, Ø., and Schrum, C.: Effects of Wave-Induced Processes in a Coupled Wave–Ocean Model on Particle Transport Simulations, Water, 13, 415, https://doi.org/10.3390/w13040415, 2021. a

Stokes, G. G.: On the theory of oscillatory waves, T. Cambridge Philos. Soc., 8, 441–455, 1847. a

Sutherland, G., Soontiens, N., Davidson, F., Smith, G. C., Bernier, N., Blanken, H., Schillinger, D., Marcotte, G., Röhrs, J., Dagestad, K.-F., Christensen, K. H., and Breivik, Ø.: Evaluating the Leeway Coefficient of Ocean Drifters Using Operational Marine Environmental Prediction Systems, J. Atmos. Ocean. Tech., 37, 1943–1954, https://doi.org/10.1175/JTECH-D-20-0013.1, 2020. a

Sybrandy, A. L. and Niiler, P. P.: WOCE/TOGA Lagrangian Drifter Construction Manual, Scripps Institution of Oceanography, La Jolla, CA, WOCE Report 63, SIO Ref. 91/6, 1992. a

Tonani, M., Sykes, P., King, R. R., McConnell, N., Péquignet, A. C., O'Dea, E., Graham, J. A., Polton, J., and Siddorn, J.: The impact of a new high-resolution ocean model on the Met Office North-West European Shelf forecasting system, Ocean Sci., 15, 1133–1158, https://doi.org/10.5194/os-15-1133-2019, 2019. a, b, c, d

van den Bremer, T. S. and Breivik, Ø.: Stokes drift, Philos. T. Roy. Soc. A, 376, https://doi.org/10.1098/rsta.2017.0104, 2018. a

van der Mheen, M., Pattiaratchi, C., Cosoli, S., and Wandres, M.: Depth-Dependent Correction for Wind-Driven Drift Current in Particle Tracking Applications, Front. Mar. Sci., 7, https://doi.org/10.3389/fmars.2020.00305, 2020. a

van Sebille, E., Aliani, S., Law, K. L., Maximenko, N., Alsina, J. M., Bagaev, A., Bergmann, M., Chapron, B., Chubarenko, I., Cózar, A., Delandmeter, P., Egger, M., Fox-Kemper, B., Garaba, S. P., Goddijn-Murphy, L., Hardesty, B. D., Hoffman, M. J., Isobe, A., Jongedijk, C. E., Kaandorp, M. L. A., Khatmullina, L., Koelmans, A. A., Kukulka, T., Laufkötter, C., Lebreton, L., Lobelle, D., Maes, C., Martinez-Vicente, V., Morales Maqueda, M. A., Poulain-Zarcos, M., Rodríguez, E., Ryan, P. G., Shanks, A. L., Shim, W. J., Suaria, G., Thiel, M., van den Bremer, T. S., and Wichmann, D.: The physical oceanography of the transport of floating marine debris, Environ. Res. Lett., 15, 023003, https://doi.org/10.1088/1748-9326/ab6d7d, 2020. a

van Sebille, E., Zettler, E., Wienders, N., Amaral-Zettler, L., Elipot, S., and Lumpkin, R.: Dispersion of Surface Drifters in the Tropical Atlantic, Front. Mar. Sci., 7, https://doi.org/10.3389/fmars.2020.607426, 2021. a

van Sebille, E.: North Sea drifter trajectories 2024, Zenodo [data set], https://doi.org/10.5281/zenodo.14198921, 2024. a

van Sebille, E.: Tyrrhenian Sea drifter trajectories 2025, Zenodo [data set], https://doi.org/10.5281/zenodo.17293098, 2025. a

Vapnik, V.: An overview of statistical learning theory, IEEE T. Neural Netw., 10, 988–999, https://doi.org/10.1109/72.788640, 1999. a

Wadoux, A. M. J.-C. and Heuvelink, G. B. M.: Uncertainty of spatial averages and totals of natural resource maps, Meth. Ecol. Evol., 14, 1320–1332, https://doi.org/10.1111/2041-210X.14106, 2023. a, b, c

Wagner, T. J. W., Eisenman, I., Ceroli, A. M., and Constantinou, N. C.: How Winds and Ocean Currents Influence the Drift of Floating Objects, J. Phys. Oceanogr., 52, 907–916, https://doi.org/10.1175/JPO-D-20-0275.1, 2022. a, b

Wilks, D. S.: Statistical Methods in the Atmospheric Sciences, in: 3rd edn., Academic Press, Oxford, ISBN 9780128165270, 2011. a

Zimmerman, J. T. F.: On the Euler-Lagrange transformation and the stokes' drift in the presence of oscillatory and residual currents, Deep-Sea Res. Pt. A, 26, 505–520, https://doi.org/10.1016/0198-0149(79)90093-1, 1979. a

Download
Short summary
We study how tides, wind, and waves interact at the ocean surface by tracking ultra-thin drifters released in the southern North Sea for two months. Using model data together with data-driven machine learning models, we determine the relative contribution of each forcing mechanism in driving the drifters' velocity and improve the prediction of their trajectories. We also test the generalisability of this method by applying it to the same drifters in the Tyrrhenian Sea.
Share