High-resolution stochastic downscaling method for ocean forecasting models and its application to the Red Sea dynamics

Shapiro, Georgy I.; Gonzalez-Ondina, Jose M.; Belokopytov, Vladimir N.

doi:https://doi.org/10.5194/os-17-891-2021

Articles | Volume 17, issue 4

https://doi.org/10.5194/os-17-891-2021

Articles | Volume 17, issue 4

Research article

06 Jul 2021

Research article |

| 06 Jul 2021

High-resolution stochastic downscaling method for ocean forecasting models and its application to the Red Sea dynamics

Georgy I. Shapiro, Jose M. Gonzalez-Ondina, and Vladimir N. Belokopytov

Abstract

High-resolution modelling of a large ocean domain requires significant computational resources. The main purpose of this study is to develop an efficient tool for downscaling the lower-resolution data such as those available from Copernicus Marine Environment Monitoring Service (CMEMS). Common methods of downscaling CMEMS ocean models utilise their lower-resolution output as boundary conditions for local, higher-resolution hydrodynamic ocean models. Such methods reveal greater details of spatial distribution of ocean variables; however, they increase the cost of computations and often reduce the model skill due to the so called “double penalty” effect. This effect is a common problem for many high-resolution models where predicted features are displaced in space or time. This paper presents a stochastic–deterministic downscaling (SDD) method, which is an efficient tool for downscaling of ocean models based on the combination of deterministic and stochastic approaches. The ability of the SDD method is first demonstrated in an idealised case when the true solution is known a priori. Then the method is applied to create an operational Stochastic Model of the Red Sea (SMORS), with the parent model being the Mercator Global Ocean Analysis and Forecast System at $1 / 12$ ^∘ resolution. The stochastic component of the model is data-driven rather than equation-driven, and it is applied to the areas smaller than the Rossby radius, within which distributions of ocean variables are more coherent than over a larger distance. The method, based on objective analysis, is similar to what is used for data assimilation in ocean models and stems from the philosophy of 2-D turbulence. SMORS produces finer-resolution ( $1 / 24$ ^∘ latitude mesh) oceanographic data using the output from a coarser-resolution ( $1 / 12$ ^∘ mesh) parent model available from CMEMS. The values on the fine-resolution mesh are computed under conditions of minimisation of the cost function, which represents the error between the model and true solution. SMORS has been validated against sea surface temperature and ARGO float observations. Comparisons show that the model and observations are in good agreement and SMORS is not subject to the “double penalty” effect. SMORS is very fast to run on a typical desktop PC and can be relocated to another area of the ocean.

Download & links

Article (PDF, 5237 KB)

Download & links

How to cite.

Received: 15 Dec 2020 – Discussion started: 11 Jan 2021 – Revised: 02 Jun 2021 – Accepted: 04 Jun 2021 – Published: 06 Jul 2021

1 Introduction

The main aim of this paper is to present an alternative, computationally efficient method of downscaling of ocean models, i.e. create finer-resolution outputs using a stochastic method while the coarser-resolution fields are obtained by traditional deterministic numerical ocean modelling. In order to reflect the dual nature of the algorithm, the term “stochastic–deterministic” is used. The suggested method may do best in going from eddy-permitting resolution where the desired features are “already” there embryonically and guided by assimilation, e.g. as in CMEMS (2020), to somewhat finer resolution so that the embryonic features can be properly represented. As usual, the method has its limitations which are discussed later. A deterministic approach in ocean modelling based on solving differential equations is capable of producing high-quality forecasts and hindcasts, both for research and operational needs, and is currently mainstream in numerical modelling of the ocean. Ocean models have matured through multiple improvements including better numerical schemes, spatial discretisation, parameterisations, and data assimilation. Modern ocean models do not solve the full Navier–Stokes or Reynolds equations, instead they tend to make the traditional and hydrostatic Boussinesq approximations and various parameterisations of unresolved processes (Miller, 2007; Fox-Kemper et al., 2019; Lindsay, 2017; Ezer and Mellor, 2004; Bruciaferri et al., 2019).

However, the enhancement of model resolution using such an approach involves a significant increase in the computational cost. For example, doubling the horizontal resolution in both directions requires approximately 10 times more calculations, taking into account the necessity of reducing the time step and increasing the overhead due to data exchange between the nodes of a high-performance computer (HPC). There is also an increased conceptual difficulty to deterministically resolve very small-scale processes due to the turbulent and chaotic nature of motion at a small scale.

In contrast to early ocean models which were applied to highly idealised cases and did not require any observational data (e.g. Bryan, 1963) modern models use real-world data in addition to the universal laws of physics. The data are used for model initialisation, tuning the numerical parameters such as diffusion and viscosity coefficients, validation, and data assimilation. Data assimilation improves the description of ocean state used as the initial condition for the forecasting step. There are many different forms of data assimilation including optimal interpolation (OI), Kalman filtering and variational methods; see for example Lorenc (1986) and references therein. One of the most efficient methods is optimal interpolation (Gandin, 1959, 1965; Fletcher, 2017), which uses statistical properties of real-world data rather than equations of motion or prescribed spatial dependences.

The term “optimal interpolation” may be confusing as it is of a very different nature than the usual deterministic interpolation methods (linear, polynomial, spline, inverse distance etc.) where the weighting coefficients are determined by the location of points, not by the data themselves. In contrast, the OI method calculates the weights based on statistical properties of the data and could be called “objective analysis”. However, the term “objective analysis” has already been occupied in the original publication by Cressman (1959) for his deterministic interpolation method. Therefore this paper follows the terminology from the original literature and uses the term “optimal interpolation” even though it is not strictly interpolation, but a minimum variance estimator that is algorithmically similar to Kalman filtering.

The philosophy of combining deterministic and stochastic (random) behaviour of fluids has a long history. For example the Reynolds equations and their modern versions are used in ocean modelling, based on simple decomposition of an actual instantaneous quantity into time-averaged and fluctuating quantities and taking the averages of non-linear terms (see for example Tennekes and Lumley, 1992). More advanced methods of describing the chaotic movements at smaller scales have been developed in the statistical theory of turbulence (see for example Kolmogorov, 1941; Monin and Yaglom, 1971; Frisch, 1995). The OI method further extends ideas originated in the theory of statistical turbulence and was the method of choice for operational numerical weather prediction centres in the 1980s and early 1990s. As shown by Lorenc (1986), more modern variational methods are closely linked to the original OI and they can be described using a common Bayesian analysis framework.

The basis of OI is the minimisation of a cost function which represents a measure of the difference between the estimated and true values. The OI considers the data fields as realisations of random processes, and it studies the statistical links represented by either structure functions or covariances between data points in a way similar to the theory of fully developed turbulence (Gandin and Kagan, 1976). An important feature of the method is that, in order to calculate the interpolating coefficients, it only requires the knowledge of statistical moments of the second order. It does not use any a priori hypothesis about the dependence of the weights on the distance from the interpolation points as it is used in alternative methods of objective analysis (Cressman, 1959; Vasquez, 2003). In those alternative methods the weighting coefficients are calculated as a prescribed analytical function of distance and hence do not require the knowledge of the statistical properties of the actual field of interest.

In this paper we have tested a hypothesis that a similar technique, hereafter called stochastic–deterministic downscaling, or SDD, based on the statistical properties of ocean parameters such as temperature, salinity and velocity, can be used to achieve a finer resolution in ocean modelling by downscaling the results of a parent deterministic model. Basically, the data are treated as having two components: a low-resolution, slowly varying component which is computed using deterministic equations and a high-resolution quickly varying component where the data are treated as random processes. As in the theory of turbulence, the statistical properties of the smaller-scale processes are often much more stable than the data themselves (see for example Monin and Yaglom, 1971, and Tennekes and Lumley, 1992).

The assimilation of observational data is widely used in operational ocean modelling (see for example Dobricic et al., 2007; Dobricic and Pinardi, 2008; Korotaev et al., 2011; Mirouze et al., 2016). However, the application of a similar approach for fine-resolution model downscaling should be considered as experimental at this stage. The SDD method, in common with other data assimilation techniques, can be used in both the attached and detached modes. In the attached mode the downscaling is carried out on the same computer which solves the equations of ocean dynamics at the same time as the forecast advances. Programmatically, in the attached mode the SDD is contained within the same executable module as all other elements of the model and is applied regularly as the model advances in time. On the other hand, in the detached mode, the SDD is applied after the forecast has been completed by the parent model. This mode was used in SMORS. In this case the SDD (or any data assimilation) can be considered as post-processing. The treatment of data assimilation as post-processing can be found in Delle Monache et al. (2011) and Dazhi (2019 and references therein). Due to its experimental nature, the SDD method is first tested and assessed by application to an idealised case of a region filled with multiple mesoscale eddies where the true solution is known.

While the proposed SDD method has a generic nature, the focus of this paper is on its application to the Red Sea. We use the Red Sea as a “difficult” case for the SDD method as the sea has a complicated coastline, multiple islands and a complex mesoscale-circulation structure; see for example Zhan et al. (2016) and Hoteit et al. (2021 and references therein). The main section of the paper describes the development and properties of an operational eddy-resolving stochastic model for the Red Sea (SMORS) at $1 / 24$ ^∘ resolution based on a parent eddy-permitting model at $1 / 12$ ^∘ resolution, the outputs of which are accessible via Copernicus Marine Environment Monitoring Service (CMEMS, 2020).

The paper is organised as follows. Section 2 describes materials and methods, including a detailed description of the algorithm used in SDD, application of the method for an idealised case, the treatment of noisy data, and a description of the operational Red Sea model (SMORS). Section 3 presents the results of SMORS validation, analyses of eddy and mean kinetic energy as well as analyses of vorticity and enstrophy produced by the parent and SDD models. Section 4 present the discussion of the results and Sect. 5 concludes the paper.

2 Materials and methods

2.1 The algorithm

The stochastic–deterministic downscaling (SDD) uses the methodology developed for the original version of the optimal interpolation technique (Gandin, 1959, 1963, 1965; Gandin and Kagan, 1976; Barth et al., 2008). The philosophy behind this technique is similar to what is used in assimilation of observational data to improve the quality of numerical models. The main differences are that instead of observational data, the SDD assimilates the data from a medium-resolution model, and the effect is the enhancement of model resolution rather than improvement of model skill. The SDD method considers all oceanographic fields as consisting of two components: (i) a relatively slowly varying part which can be described using a dynamic method (i.e. by solving deterministic equations) and (ii) a stochastic, turbulent part which can be described via its statistical properties. Then the statistical properties are linked to the properties of a slowly varying field similar to how a turbulent viscosity coefficient is estimated in ocean modelling via the knowledge of deterministically assessed larger-scale flows; see for example Smagorinsky (1963).

We treat the data from the parent model as “observations” and assimilate these onto a fine-resolution mesh of SMORS. Generally speaking, the OI method requires, among other parameters, the knowledge of the root mean square error (RMSE) of “observations” at each location to calculate the interpolating weights. As the errors of the parent models at each grid point are often not known, we assume that the medium-resolution forecast provides the values $f_{1} = f (r_{1}), \dots, f_{n} = f (r_{n})$ for a certain oceanographic parameter f at all points $r_{1}, \dots, r_{n}$ on the parent mesh with perfect accuracy (later, in Sect. 2.3 we shall see that this requirement can be relaxed). We are interested in finding the value of the parameter f at another location f₀=f(r₀), where r₀ is any point on a fine-resolution mesh. The SDD method is applied to the deviations $f_{i}^{'} = f^{'} (r_{i}) = f (r_{i}) - 〈 f (r_{i}) 〉$ of the parameter from its statistical mean, or “norm”, designated here as 〈f〉, rather than to the parameter f itself, in line with the approach used in Gandin (1965). We further assume that the field of deviations f^′ is statistically homogenous and isotropic. This assumption has been shown to be more applicable to the deviations than to the meteorological and oceanographic parameters themselves (Gandin and Kagan, 1976; Fletcher, 2017; Barth et al., 2008). Bretherton et al. (1976) have also recommended that for oceanographic applications an estimated mean should be subtracted from each observation at the outset and added back to the estimate of interpolated values. Climatic studies have also shown that fluctuations (a.k.a. anomalies) have better statistical properties than the data itself, and hence it is the statistics of fluctuations rather than full data that are usually used on oceanographic research; see for example Boyer et al. (2005).

The calculation of statistically mean values requires averaging over a statistical ensemble, which, as usual, was not available. The estimate of the statistical mean of a parameter 〈f〉 was calculated by computing the spatial average inside the Red Sea of the values of the parameter in the daily analysis data corresponding to one year (2016). Deviations from climatology would be a satisfactory alternative as well. These daily spatial averages were averaged in time to obtain monthly averages. This means that 〈f〉 is independent of the location but has a dependency on time since each month has a different norm.

According to Gandin (1965), an approximate estimate ${\tilde{f}}_{0}^{'}$ of the true deviation $f_{0}^{'} = f^{'} (r_{0})$ at a location r₀ can be found as a linear combination of deviations at other points as follows:

\begin{matrix} (1) & {\tilde{f}}_{0}^{'} = \sum_{i = 1}^{n} p_{i} f_{i}^{'}, \end{matrix}

where p_i denotes the weighting factors that must be determined. This is done by minimising the variance of the difference between the true and estimated values of deviations, also known as a cost function:

\begin{matrix} (2) & E = 〈{(f_{0}^{'} - {\tilde{f}}_{0}^{'})}^{2}〉 = 〈f_{0}^{'} - Σ_{i = 1}^{n} p_{i} f_{i}^{'}〉 \end{matrix}

The cost function given by Eq. (2) can be rewritten in terms of the autocorrelation matrix

\begin{matrix} (3) & R_{i j} = \frac{〈f_{i}^{'}〉 〈f_{j}^{'}〉}{〈(f_{0}^{'})^{2}〉}, \end{matrix}

also known as a background error correlation matrix as follows:

\begin{matrix} (4) & E = 〈{(f_{0}^{'})}^{2}〉 (1 - 2 R_{0}^{T} p + p^{T} R p), \end{matrix}

where p is the column vector composed of the unknown weighting coefficients p_i, i=1…n and R₀ is the column vector of correlations R_0i given by Eq. (3). The optimal values of weights p_i which minimise the cost function E given by Eq. (4) can be found by taking partial derivatives of E with respect to all the p_i and equalling them to zero, resulting in the following system of linear equations:

\begin{matrix} (5) & R p = R_{0} . \end{matrix}

These equations can be solved for the weights p_i if we know the background correlation matrix R. Background correlation describes the statistical structure of deviations f^′ in space and can be found as described below.

Following Gandin (1963), only those correlations which relate to the data located at the same depth level are taken into account, and the distribution of deviations f^′ is assumed to be statistically uniform and isotropic locally (i.e. within the search radius; see Eq. 8 below) in the horizontal plane. Therefore, the autocorrelation R matrix can be represented in the form

\begin{matrix} (6) & R_{i j} = C (∥r_{i} - r_{j}∥, z), \end{matrix}

where r_i, r_j are horizontal coordinates of the parent grid points, ∥r_i−r_j∥ is the distance between points r_i and r_j independently of the direction, and z is a vertical coordinate (depth). For three-dimensional fields, Fu et al. (2004) suggested approximating the correlation function defined by Eq. (6) using a Gaussian formula which can be written in a horizontally isotropic case as follows:

\begin{matrix} (7) & R_{i j} (∥r_{i} - r_{j}∥, z) = \exp (- \frac{{(r_{i} - r_{j})}^{2}}{L {(z)}^{2}}), \end{matrix}

where L(z) is the e-folding correlation radius, representing the scale which reflects the extent of spatial correlation, and z is the depth level where correlation is calculated. The use of a Gaussian function for the autocorrelation and associated difficulties has been discussed by Daley (1991). The downscaling process described by Eqs. (1)–(7) is repeated for every ocean parameter on every grid point of the fine-resolution mesh, to provide a fine-resolution output for deviations $f_{i}^{'}$ . The fine-resolution output of the actual values is calculated by adding the deviations to the “norms”.

To reduce the computational cost while solving multiple systems of Eq. (5), only those nodes which are relatively close to the point of interpolation r₀ are taken into account, so that the corresponding matrix elements are larger than a certain threshold. We use a correlation threshold R_cut suggested by Grigoriev et al (1996) for using the optimal interpolation technique in the analysis of ocean observations in the Black Sea. Our tests confirmed that this method provides accurate results in the downscaling of model outputs while avoiding numerical and computational problems. In order to further optimise the computational algorithm, the correlation threshold R_cut was converted into a maximum distance r_max, which is computed just once for each depth:

\begin{matrix} (8) & r_{\max} = L (z) \sqrt{- \ln (R_{cut})} . \end{matrix}

For computation of the correlation matrix R_ij using the expression in Eq. (5), it is only necessary to include the nodes in the medium mesh located at a distance smaller than r_max to the fine-resolution node being computed. It is worth noting that the SDD method honours the data on the coarse grid; i.e. it reproduces the coarse field data exactly (to within truncation errors) at those fine grid nodes which coincide with the parent grid points. If the fine grid contains the coarse model grid points then the values at this points are exactly the same (to within truncation errors) as in the parent model. Therefore, the spatial structure is anchored onto the coarse grid and no additional double penalty effect compared to the parent model is generated.

2.2 Idealised case

The SDD technique can be illustrated using an idealised case. Let us consider a rectangular domain, which is significantly larger than a typical size of a mesoscale eddy. In this numerical example we use an area of 1000 km × 1000 km. The parent model is assumed to produce no errors, with its only limitation being an insufficient resolution. Let the parent grid have a spatial resolution of $Δ x_{p} = Δ y_{p} = 10$ km and the true 2-D field of variable F consist of a number of anisotropic vortices which are modelled by the following formula:

\begin{matrix} (9) & F (x, y) = \sin (\frac{x}{a}) \sin (\frac{y}{b}), \end{matrix}

as shown in Fig. 1. The statistical norm of F is zero and hence the Eqs. (1)–(7) can be applied to the parameter F itself.

https://os.copernicus.org/articles/17/891/2021/os-17-891-2021-f01

Figure 1Model domain with a zoomed-in sub-map of idealised spatial distribution of parameter F according to Eq. (9) with a=4.1 km, which corresponds to the eddy size of 13 km in the x direction, and b=33.3 km, which corresponds to the eddy size of 105 km in the y direction.

High-resolution stochastic downscaling method for ocean forecasting models and its application to the Red Sea dynamics

2.1 The algorithm

2.2 Idealised case

2.3 Effect of noise in the input data

2.4 Stochastic Model of the Red Sea

3.1 Model validation

3.2 Eddy and mean kinetic energy

3.3 Analysis of vorticity and enstrophy