Articles | Volume 21, issue 4
https://doi.org/10.5194/os-21-1761-2025
https://doi.org/10.5194/os-21-1761-2025
Research article
 | 
21 Aug 2025
Research article |  | 21 Aug 2025

Assessing subseasonal forecast skill for use in predicting US coastal inundation risk

John R. Albers, Matthew Newman, Magdalena A. Balmaseda, William Sweet, Yan Wang, and Tongtong Xu
Abstract

Developing predictions of coastal flooding risk on subseasonal timescales (2–6 weeks in advance) is an emerging priority for the National Oceanic and Atmospheric Administration (NOAA). In this study, we assess the ability of two current operational forecast systems, the European Centre for Medium-Range Weather Forecasts Integrated Forecasting System (IFS) and the Centre National de Recherches Météorologiques climate model (CNRM), to make subseasonal ensemble predictions of the non-tidal residual component of coastal water levels at United States coastal gauge stations for the period 2000–2019. These models were chosen because they assimilate satellite altimetry at forecast initialization and attempt to predict the mean sea level, including a global mean component whose absence in other forecast systems complicates assessment of tide gauge reforecast skill. Both forecast systems have skill that exceeds damped persistence for forecast leads through 2–3 weeks, with IFS skill exceeding damped persistence for leads up to 6 weeks. Post-processing forecasts to include the inverse barometer effect, derived from mean sea level pressure forecasts, improves skill for relatively short forecast leads (1–3 weeks). Accounting for vertical land motion of each gauge primarily improves skill for longer leads (3–6 weeks), especially for the Alaskan and Gulf coasts; sea-level trends contribute to reforecast skill for both model and persistence forecasts, primarily for the East and Gulf coasts. Overall, we find that current forecast systems have sufficiently high levels of deterministic and probabilistic skill to be used in support of operational coastal flood guidance on subseasonal timescales.

Share
1 Introduction

Over the past several decades, nearly all United States coastal regions have experienced a steady increase in the frequency, extent, and duration of high tide flooding, particularly the East and Gulf coast regions (Sweet et al., 2018, 2022). The increased flooding frequency is associated with risks to trillions of dollars of property and infrastructure, as well as risks to coastal ecosystems (Fleming et al., 2018 and references therein). As a result, there is an emerging need to provide high tide flooding outlooks on subseasonal-to-annual timescales (Dusek et al., 2022; NOAA Coastal Inundation Framework, 2022).

High tide flooding (HTF) can be defined in terms of water levels exceeding gauge station water level thresholds, where the thresholds are typically determined based on local conditions, including topography, land-cover types, and risk to infrastructure (Sweet et al., 2018; Kavanaugh et al., 2023). The water levels, in turn, can be considered the sum of a tidal component, a sea-level trend component, and a non-tidal residual (Dusek et al., 2022; see also Widlansky et al., 2017). For the West Coast and parts of the East Coast, the tidal component dominates water level in terms of amplitude relative to the non-tidal residual, either because the tidal range is large or because the non-tidal residual is small, while in other regions, including the Gulf Coast, the non-tidal residual is comparable to the tidal component (Merrifield et al., 2013; Sweet et al., 2014). Still, predicting non-tidal residuals is important for all coastal regions because, as sea levels continue to rise, even small-amplitude non-tidal residual anomalies can push high tide water levels past flood thresholds (e.g., Sweet and Park, 2014).

Currently, NOAA's National Ocean Service issues a monthly HTF outlook for a wide range of stakeholders (NOAA Coastal Inundation Framework, 2022; Dusek et al., 2022) by using damped persistence of current gauge station monthly non-tidal residual anomalies as part of its predicted total water levels. If current operational models could in fact skillfully predict coastal sea level anomalies on subseasonal timescales, specifically by outperforming damped persistence, then incorporating those predictions in the HTF outlook could improve upon its forecast guidance. The non-tidal residual is driven by a multitude of processes operating on timescales ranging from minutes to decades (Sweet et al., 2018; Woodworth et al., 2019; and references therein). Many of these processes, including daily timescale wind forcing and storm surge, are largely governed by individual weather systems, which are generally not predictable beyond 10–14 d (Lorenz, 1963, 1969; Weber and Mass, 2017; Bauer et al., 2015; Simmons and Hollingsworth, 2002; Zhang et al., 2019). However, sea level anomalies are also correlated with several modes of large-scale climate variability, including the Madden–Julian Oscillation (MJO), the El Niño–Southern Oscillation (ENSO), and the North Atlantic Oscillation (NAO) (Enfield and Allen, 1980; Menéndez and Woodworth, 2010; Sweet and Zervas, 2011; Ezer and Atkinson, 2014; Sweet and Park, 2014; Valle-Levinson et al., 2017; Han et al., 2019; Amaya et al., 2022; Boucharel et al., 2023; Arcodia et al., 2024; Renkl et al., 2024), which are associated with potentially predictable signals on subseasonal-to-seasonal (S2S) timescales (e.g., Vitart and Molteni, 2010; Barnston et al., 2019; Albers and Newman, 2021, and references therein). For example, Amaya et al. (2022) recently demonstrated that coastal Kelvin waves related to ENSO are associated with sea surface height anomalies along the West Coast of the United States that can be skillfully predicted. Whether these climate modes also introduce predictable sea level anomalies along the other coasts of the United States via wind stress and atmospheric pressure anomalies associated with atmospheric teleconnections remains an open question.

In this study, we assess the skill of current forecast models for predicting non-tidal anomalies at gauge stations on subseasonal timescales. Of course, there is a clear difference in spatial scale between the model grid size, on the order of tens of kilometers, and the point locations of the gauge stations. Here, we will take a relatively simple approach, using the closest oceanic model grid point to a particular gauge station to make forecasts for that station, and do not consider the additional complication of how best to downscale the model output to much smaller scales (e.g., Long et al., 2023). This approach hinges in part on the assumption that open ocean or near-shore sea level subseasonal anomalies will be representative of the sea level subseasonal anomalies at gauge stations. This is a reasonably well-justified assumption, as many coastal gauge stations are fairly well-correlated with the nearby open ocean (Vinogradov and Ponte, 2011). However, the connection between open ocean and coastal sea level anomalies varies by region due to many factors, including shelf depth and extent, and eastern versus western boundary ocean dynamics (Hughes et al., 2019; Han et al., 2019). For example, for eastern boundary regions (e.g., the US West Coast), coastal sea level will be influenced by coastally trapped waves propagating poleward from equatorial regions and local direct forcing; for western boundary regions, on the other hand, models need to successfully simulate more processes, including wave signals from all latitudes (Hughes et al., 2019). Gauge stations located on islands, on the other hand, which typically have narrow shelves, should have sea level variability more closely matching the nearby open ocean (Vinogradov and Ponte, 2011). Coastal shelf depth also effects the relative importance of wind stress versus atmospheric pressure effects; for example, along coasts with shallow shelves, wind-stress-driven sea level variability is relatively more important than the inverse barometer effect (IBE) on daily to monthly timescales (Woodworth et al., 2019; and references therein).

The two forecast systems evaluated here, the European Centre for Medium Range Weather Forecasting Integrated Forecasting System reforecasts (IFS) and the Centre National de Recherches Météorologiques climate model (CNRM), were chosen in part because they both assimilate ocean altimetry data into their initial conditions, which previous studies have suggested is relevant to sea level forecast skill (Widlansky et al., 2023; Long et al., 2025). In addition, since the forecast systems are verified against tide gauge observations, we examine the impact of post-processing reforecasts by including both the IBE, computed from the corresponding mean sea level pressure reforecasts, and an estimate of local vertical land motion determined for each gauge location. Model and verification datasets and skill evaluation methods, including deterministic and probabilistic metrics, are outlined in Sect. 2. This is followed by an evaluation of open ocean sea surface height reforecast skill and coastal gauge station skill for the United States (Sect. 3.1–3.2), where gauge stations are broken down into four regions: the East Coast, Gulf Coast, West Coast, and Alaska. The seasonality of skill is briefly discussed at the end of Sect. 3.3, which is followed by a discussion of the implications of the results in the Conclusions.

2 Data and verification metrics

2.1 Forecast models and verification data

Two dynamical forecast models are considered, the European Centre for Medium Range Weather Forecasting Integrated Forecasting System reforecasts (IFS model versions CY 46R1-47R1 for the reforecast period 2000–2019, https://www.ecmwf.int/en/publications/ifs-documentation, last access: 13 March 2024) and the Centre National de Recherches Météorologiques climate model (CNRM model version CM6.1, for the reforecast period 1993–2017, Voldoire et al., 2019). Both models utilize the Nucleus for European Modeling of the Ocean (NEMO, Madec et al., 2017) ocean model run at 1/4° resolution, and assimilate ocean altimetry data into their initial conditions (ECMWF ocean analysis for the IFS and the Mercator-Ocean ocean and sea-ice analysis for the CNRM). Reforecast data, obtained from the S2S Prediction database (Vitart et al., 2017), was only available at 1° resolution, so all calculations, including identifying the closest model grid point to each gauge station, are based on this resolution.

For the open ocean, reforecasts are verified against sea surface height (SSH) from the Copernicus Marine Service Global Ocean Physics Reanalysis 12v1 (GLORYS, Lellouche et al., 2021), which has been extensively verified in North American waters (Amaya et al., 2022, 2023; Castillo-Trujillo et al., 2023; Feng et al., 2024). For coastal regions of the continental United States and Alaska, reforecasts are further verified against National Oceanic and Atmospheric Administration (NOAA) water level gauge stations (NOAA National Ocean Service, 2024). Only gauge stations that have at least 10 years of data during the reforecast records of the IFS and CNRM are considered, yielding 47 stations for the East Coast, 32 stations for the Gulf of Mexico, 35 stations for the West Coast, and 23 stations for Alaska (see Table 1 for a list of the gauge stations and the number of days in each verification time series). When comparing the reforecast skill of the IFS and CNRM (Sect. 3.1), we verify against their common reforecast periods, 2000–2017; when evaluating the reforecast skill of the IFS alone (Sect. 3.2–3.3), we use the full IFS reforecast period, 2000–2019.

As discussed in the Introduction, water levels at gauge stations are separated into three components for the HTF outlooks (Dusek et al., 2022): a 37-constituent tidal component (e.g., Sweet and Zervas, 2011), a linear sea level trend component, and a non-tidal residual. Here, for comparison with the reforecasts, we remove the local tidal components, but we do not remove the trend from the water level gauge data, so that the verification time series of non-tidal residuals (NTR) at each tide gauge includes the local trend component, in contrast with how it is defined for the HTF framework. Including the trend component in the reforecasts and verifications of the non-tidal residual allows for assessment of contribution of the linear trend to reforecast skill, for example, by comparing the difference in anomaly correlation skill with and without the linear trend included (Figs. 2d, 4d, 6d, and 8d), and comparing reliability with and without the linear trend included (Figs. 3, 5, 7, and 9).

IFS reforecasts are available from initialization out to forecast lead day 46, while CNRM reforecasts are available out to forecast lead day 47; both reforecast sets were obtained as daily averages. To create reforecast anomalies, the lead-dependent 20- or 25-year reforecast climatologies of the IFS and CNRM periods, respectively, are subtracted from the daily average data, which implicitly applies a mean bias correction; further details regarding the creation of the IFS and CNRM climatologies (which are different for each model) are in the Supplement (Sect. S1). Next, weekly averages are calculated from the daily reforecast data, where week 1 includes the average of forecast days 1–7, week 2 includes days 8–14, etc. To calculate the daily average GLORYS verification anomalies, we use a period spanning both the CNRM and IFS reforecast records (2000–2017), where a 365 d (plus 1 d for leap years) climatology is calculated by averaging over all years, and extracting the first four harmonics (plus the mean) via Fourier transform (e.g., Epstein, 1988), yielding the final climatology, which is then removed from the daily average GLORYS data. Weekly average GLORYS verification anomalies are then created by applying a 7 d running mean to the daily data. To calculate the NOAA gauge stations anomalies, we calculate a daily average climatology for the period over which the gauge station data are available, generally 20 years but slightly shorter for some stations (see Table 1), and remove that climatology from the daily averaged gauge station data. A 7 d running mean is then applied to the gauge station data.

Table 1List of NOAA gauge stations organized by four regions: East Coast, Gulf Coast, West Coast, and Alaska. The number of days listed for each gauge station corresponds to the number of reforecast days used when calculating IFS reforecast skill; the IFS is initialized twice weekly, which means that  2000 d equates to roughly  20 years. Gauge stations with an * next to them are stations that are included in the official NOAA High Tide Flooding Monthly Outlooks (Kavanaugh et al., 2023).

Download XLSX

2.2 Inverse barometer effect and vertical land motion

The satellite altimeter-derived sea level products used to initialize the IFS are processed so as to remove static and high-frequency ( 20 d cutoff) dynamic atmospheric pressure effects (Ponte and Ray, 2002; Centre national d'études spatiales, 2020). As a result, when the ocean model (e.g., NEMO in the case of the IFS) is run for forecasting purposes, it assumes an atmosphere with no mass (both because the assimilated observations are corrected to remove pressure effects, and because the “atmospheric pressure” subroutine in NEMO is turned off), so that the SSH reforecasts include neither static nor dynamic responses to atmospheric pressure fluctuations (Tai, 1993). However, the static effect of atmospheric pressure on the ocean surface can be approximated via the so-called inverse barometer effect (Ross, 1854; Doodson, 1923; Groves and Hannan, 1968; Tai, 1993; Arbic, 2005; Ponte, 1992, 2006; Oddo et al., 2014; Long et al., 2021; Feng et al., 2024), which assumes a static ocean response to atmospheric pressure forcing (Tai, 1993; Wunsch and Stammer, 1997). The IBE (ηibe) is written as

(1) η ibe = - p m . s . l . - p m . s . l . ρ ocean g

(e.g., Piecuch and Ponte, 2015), where pm.s.l. is atmospheric mean sea level pressure, pm.s.l. is the global mean sea level pressure (MSLP, ocean-only), ρocean is the ocean density (assumed to be a constant value of 1025 kg m−3), and g is the acceleration due to gravity. Although the assumptions inherent to the inverse barometer approximation are not always strictly valid, which can lead to deviations from a purely static response (e.g., Wunsch, 1991; Le Traon and Gauzelin, 1997), previous studies have suggested that including the IBE is important on subseasonal-to-seasonal timescales (e.g., Woodworth et al., 2019; Long et al., 2021; Feng et al., 2024). In this study, for the IFS we use the predicted pm.s.l. to predict the ηibe from Eq. (1) at different forecast leads, which is added to the corresponding SSH reforecasts as a post-processing step for each tide gauge (for simplicity, we do not display IBE-corrected CNRM reforecasts, as the IFS is generally more skillful).

Over the course of several decades, the sea level at local gauge stations can change from vertical land motion (VLM) due to a wide range of phenomena, including glacial isostatic rebound and groundwater and/or fossil fuel removal (Larsen et al., 2004; Hu and Freymueller, 2019; Sweet et al., 2022; Lindesy et al., 2021; Oelsmann et al., 2024). While the VLM rate is too small to impact an individual subseasonal forecast, it could be large enough in some regions to impact long-term skill assessment and is implicitly included in a persistence forecast. There exist many methods for estimating VLM (e.g., Kopp et al., 2014; Hammond et al., 2021; Oelsmann et al., 2024); however, here we use a relative simple approach that applies VLM rates as a trend correction to each IFS forecast anomaly time series. The VLM rates we use are provided on a 1° × 1° grid (NOAA National Ocean Service, 2021; Sweet et al., 2022), and the grid point nearest to each gauge station is used as the rate constant for the entire reforecast period.

In the remainder of the paper, we will refer to two types of reforecasts: (1) IFS and CNRM SSH reforecast anomalies, which are the original ocean model output and (2) IFS reforecast anomalies that are IBE- and VLM-corrected, which will be referred to as non-tidal residual reforecasts:

(2) NTR ^ = SSH + IBE + VLM ,

where in Eq. (2), the SSH and IBE are IFS quantities and the VLM is the trend correction described above. While we could have adjusted the tide gauge NTR using the VLM prior to evaluating skill, we prefer adding both the IFS-IBE and VLM to the IFS-based SSH reforecast, since NTR^ is the hindcast that should be verified directly against the NTR that is observed at the tide gauge. GLORYS does not include the IBE, so verifying the IFS and CNRM against GLORYS is done using SSH only.

2.3 Skill metrics

Reforecast skill is evaluated both deterministically, using anomaly correlation, and probabilistically, using reliability and sharpness (Atger, 1999; Jolliffe and Stephenson, 2011; Wilks, 2011). Reliability is computed for events that exceed the upper tercile of each gauge station's water level, where the tercile threshold is calculated separately for the water level (non-tidal residual) distributions of the reforecasts (NTR^) and verifications (NTR). Using the tercile from the reforecast ensemble sample distribution for computing the reforecast probability of an event (as opposed to using the observed tercile) amounts to an in-sample bias correction of the reforecast probability distribution (Weisheimer and Palmer, 2014), which ensures that the reforecast probabilities and observed frequencies of tercile events are the same for each gauge station; this correction is needed because, in general, the reforecast water level distributions are underdispersive (i.e., more narrow than observed). For all reliability diagrams, the observed distributions and relative forecast frequencies are split into 10 bins (0 %–10 %, 10 %–20 %,, 90 %–100 %).

To condense reliability metrics onto a single map, we use the slope of a regression line fitted to each gauge station's reliability curve, computed via a weighted least-squares regression fitted to the reliability bin values (i.e., the observed relative frequency–reforecast probability pairs), where the weights are determined by the number of events in each observed frequency–reforecast probability bin. Since we have applied the reforecast probability bias correction, the regression lines always intersect the tercile–tercile point (0.33, 0.33) on the reliability diagrams, so that a regression line with a slope greater than 0.5 contributes positively to the Brier skill score (e.g., Mason, 2004), though in general, any reliability regression line that is relatively close to 0.5 is probably “useful” for making forecast guidance (see, e.g., Weisheimer and Palmer, 2014, for a discussion on this point).

We also determine forecast sharpness, which measures the ability of a forecast system to issue more definitive guidance (i.e., beyond simply forecasting the climatological probability), as

(3) SHP = 1 n i = 1 n p i 1 1 - p i ( 1 400

(Daan 1984; Potts 2011), where pi(1) is the probability of exceeding the tercile threshold for each forecast, i, where each “i” refers to a forecast initialization and verification pair. This expression yields SHP = 0 for the sharpest forecasts (when all the forecasts are either 0 % or 100 %) and SHP = 100 for the most blurred forecasts (multiplying the expression in Eq. (3) by 400 is not strictly necessary but is done to provide a more intuitive range of SHP values, spanning 0 to 100 instead of 0 to 0.25). Since a forecast system can be perfectly sharp by always forecasting 0 %, which would not be terribly useful, we also report the percentage of forecasts that are in the 90 %–100 % probability bin relative to the 0 %–10 % bin.

Dynamical model reforecast skill is also compared to the skill of “damped” persistence of the observed coastal station anomalies. Each damped persistence forecast is calculated using the mean of the previous 7 d, multiplied by the lead-dependent autocorrelation value, where the autocorrelation function is calculated independently from the NTR time series for each gauge station. For all reforecasts, including from the CNRM, the IFS, and damped persistence, reforecasts are compared with and without a linear trend removed from the anomaly time series, where the trend is independently computed relative to the length of each time series being compared.

3 Results

We begin with a simple evaluation of the geographic distribution of year-round week 3 SSH prediction skill for the IFS and CNRM, verifying against GLORYS SSH anomalies, which also do not include the IBE (Sect. 3.1). Thereafter, we conduct a more detailed regional prediction skill assessment of NTR^ based on the IFS reforecasts, including evaluating the impact of the post-processed IBE- and VLM-based reforecast corrections, for gauge stations grouped into four broad regions: the East Coast (Maine to South Carolina), the Gulf Coast (Florida to Texas, and one station in Georgia), the West Coast (California to Washington), and Alaska (Sect. 3.2). Finally, we evaluate the seasonal dependence of NTR^ prediction skill (Sect. 3.3).

3.1 Multi-model comparison of coastal ocean and tide gauge SSH reforecast skill

For the oceans surrounding North America, the geographic distribution of week 3 SSH anomaly correlation skill for the IFS and CNRM are qualitatively similar, with both models exhibiting relatively higher skill in the eastern portion of the Pacific Ocean, along the southern coast of Alaska, and the Beaufort Sea, while exhibiting relatively lower skill along the East Coast of the US (Fig. 1). In general, the IFS is more skillful than the CNRM in most regions, with the exception of a small region northeast of the Bahamas, the central portion of the Gulf of Mexico, and portions of the Labrador Sea and Baffin Bay.

https://os.copernicus.org/articles/21/1761/2025/os-21-1761-2025-f01

Figure 1Year-round (2000–2017) week 3 anomaly correlation skill between GLORYS SSH anomalies and SSH reforecasts (color contours), and week 3 anomaly correlation skill between NOAA gauge station anomalies and SSH reforecasts (colored circle markers), for reforecasts from the (a) IFS and (b) CNRM. While the sample years used in the anomaly correlation calculation (2000–2017) are the same, the dates are slightly different because of different initialization dates. All dates are used in both datasets, regardless of whether they overlap with the other dataset, because otherwise there are too few samples.

For both models, SSH reforecast skill evaluated at the gauge stations tends to correspond reasonably well with nearby open ocean SSH reforecast skill evaluated using GLORYS gridded SSH anomalies for week 3 (see colored shading for the oceans surrounding North America and colored dots for the tide gauges used in this study in Fig. 1). This good comparison justifies the use of nearest neighbor open ocean SSH forecasts to predict tide gauge anomalies in this study. For example, for both tide gauges and the near-shore open ocean, reforecast skill for both models is relatively high in southern California and Maine and relatively low between Virginia and New York (see both Fig. 1a and b).

For the US coastlines nearest to the gauge stations that are of central interest here, the IFS has superior skill at all forecast leads (see Figs. S1–S6 in the Supplement). While the IFS and CNRM have qualitatively similar skill characteristics, only the IFS SSH has skill that is as good or better than persistence for nearly all contiguous US (CONUS) gauge stations (Figs. S1–S6) as well as many Alaskan stations (not shown), with the exception of those situated in regions far up inland rivers, including stations such as Vancouver, WA, or Berwick, Atchafalaya River, LA, which are all far removed (and physically disconnected) from the nearest IFS or CNRM open ocean grid points.

The biggest skill improvements of IFS week 3 reforecasts relative to persistence are seen in three regions: along the central East Coast, the entire West Coast, and along the southern coast and islands of Alaska (not shown). Reforecast skill is highest in southern California (e.g., Fig. S3), where skill from coastal Kelvin waves may provide a significant predictable signal (Amaya et al., 2022), particularly during strong El Niño events (see also Arcodia et al., 2024; and references therein). At week 3, and even at week 2 (both shown in the Supplement), the CNRM forecasts are uniformly (across all regions) less skillful than persistence, with one exception: the week 2 CNRM forecasts for the Pacific Northwest (Oregon and Washington) are more skillful than persistence (Fig. S2).

Scoring reforecasts without accounting for strong trends can make it difficult to differentiate between forecast skill associated with predicting subseasonal climate variations versus the spurious impact of trends on subseasonal forecast skill (Wulff et al., 2022). Similarly, prior studies on seasonal predictions of U.S. coastal sea-level anomalies have found that sea-level trends can substantially affect reforecast skill estimates, especially when using standard metrics benchmarked against a climatological forecast (Widlansky et al., 2017; Long et al., 2021; Shin and Newman, 2021; Long et al., 2025). After a linear trend is removed from the forecast and verification datasets, CONUS reforecast skill is reduced for most regions both for models and for persistence, with the largest effects occurring for leads of 3 weeks and beyond (Figs. S1–S12; see Sect. 3.2.1–3.2.4 for a detailed discussion of the effect of the linear trend on IFS reforecast skill). However, removing the linear trend from the model reforecasts and persistence also highlights the models' ability to skillfully predict SSH anomalies related to subseasonal climate variability. For example, linearly detrended CNRM reforecasts are more skillful than linearly detrended persistence for the entire West Coast and most East Coast stations at weeks 2 and 3 (Figs. S4 and S6, respectively). Thus, with the linear trend removed, it becomes clearer that the CNRM provides useful guidance for many regions for forecast leads out to at least 3 weeks. However, the IFS is still more skillful than the CNRM at nearly all gauge stations and at all lead times (Figs. S1–S12), so for the remainder of the article, we focus on results using the IFS.

3.2 Regional US coastal skill

We next assess deterministic and probabilistic NTR^ reforecast skill for each of the four US subregions, the West, Gulf, and East coasts, and Alaska. When comparing deterministic skill (anomaly correlation), four panels are shown: NTR^ skill, followed by a panel showing the difference between NTR^ skill and persistence skill, and then two panels isolating the contributions of the IBE and the linear trend to NTR^ skill; the contribution of VLM to NTR^ skill is discussed in the text when relevant with a figure included in the Supplement.

Probabilistic week 3 NTR^ reforecast skill is assessed at all gauge stations via three metrics that characterize reliability and sharpness (see Sect. 2.3 for details of the calculations): (1) the slope of the reliability regression line, where values greater than 0.5 indicate positive contributions to the Brier skill score; (2) forecast sharpness as measured by the “SHP” parameter ranging from 0–100, where smaller values represent sharper forecasts; and (3) the percentage of forecasts that are in the top forecast probability category versus the lowest category (0 %–10 % versus 90 %–100 %), where higher percentages indicate relatively more “certain” affirmative forecasts. In addition, for each sub-region, we show reliability diagrams and histograms of the forecast probability distributions for two representative gauge stations to provide a visual reference for the metric numbers reported on the maps; for these figures, skill of NTR^ reforecasts with the linear trend removed is also shown.

https://os.copernicus.org/articles/21/1761/2025/os-21-1761-2025-f02

Figure 2(a) Year-round (2000–2019) week 3 NTR^ anomaly correlation skill between nearest grid point NTR^ reforecasts anomalies (SSH + IBE + VLM) and West Coast NOAA gauge station NTR anomalies. (b) Difference between NTR^ skill and NTR persistence skill, (c) difference between NTR^ skill and (SSH + VLM)-only skill, and (d) difference between NTR^ skill and linearly detrended NTR^ skill.

Download

3.2.1 West Coast

In general, NTR^ skill is highest along the southern to central West Coast, often exceeding 0.5 for leads through week 6 (Fig. 2a). With the exception of a few gauge stations, notably those far up the Columbia River, West Coast reforecasts are also more skillful than persistence (Fig. 2b). Note that for many of the stations between central California and Oregon, IFS week 1 skill is only better than persistence with the IBE correction (see Fig. 2b and c). Removing the linear trend minimally impacts West Coast skill, primarily only for longer leads in California (Fig. 2d).

Reliability and sharpness along the West Coast tends to be largest in the south and decreases to the north (Fig. 3a). For example, reforecasts for gauge stations south of Arena Cove, CA, have excellent reliability (regression slopes between 0.6–0.9) and sharpness (SHP between 30–50), with most gauge stations having roughly a quarter of their forecasts in the highest forecast probability category (e.g., Santa Barbara, CA, in Fig. 3c). Even with the linear trend removed, most forecasts in central to southern California have excellent reliability and sharpness (e.g., Fig. 3c). Sharpness decreases roughly monotonically from south to north, but reliability has a minimum near the Oregon–California border (e.g., Crescent City, CA; Fig. 3b) with increasing values again from northern Oregon into Washington, apart from gauge stations extending well up the Columbia River. The IBE correction increases reliability for all West Coast gauge stations, with regression slopes generally increasing by 0.2 slope units (not shown). The IBE correction also mildly decreases sharpness (not shown); however, since sharpness for most gauge stations remains relatively high, this decrease is outweighed by the increased reliability realized with the IBE correction. Linearly detrending the reforecasts has a relatively small impact on overall reliability and sharpness (not shown).

https://os.copernicus.org/articles/21/1761/2025/os-21-1761-2025-f03

Figure 3(a) Year-round (2000–2019) West Coast NTR^ reliability (colored circles), sharpness (numbers), and percentage of forecasts that are in the top forecast probability category versus the lowest category (0 %–10 % versus 90 %–100 %). For reliability, slopes greater than 0.5 contribute to positive Brier skill scores; for sharpness, the scale spans 0–100, with smaller numbers representing sharper forecast distributions. Panels (b) and (c) show example reliability diagrams and sharpness distributions (ensemble probability counts) for two gauge stations, Crescent City, California, and Santa Barbara, California, respectively, where the red ovals in panel (a) highlight the location of each station.

3.2.2 East Coast

In contrast to the West Coast, NTR^ deterministic reforecast skill along the East Coast exceeds 0.5 only through weeks 2–3 (Fig. 4a), though skill still nearly always exceeds that of persistence for all locations and at all leads (Fig. 4b). The IBE correction improves skill for many East Coast gauge stations, particularly north of 35° N for weeks 1 and 2 (Fig. 4c), so that NTR^ skill exceeds persistence skill for the northernmost gauge stations for shorter lead times (see Fig. 4b and c). The linear trend also has a more significant impact on East Coast than West Coast skill, significantly contributing to weeks 3–6, particularly for the Carolinas and Georgia (Fig. 4d). When both NTR^ and persistence are linearly detrended, NTR^ is still more skillful than persistence at all leads (not shown), consistent with the SSH results from Sect. 3.1.

https://os.copernicus.org/articles/21/1761/2025/os-21-1761-2025-f04

Figure 4(a) Year-round (2000–2019) week 3 NTR^ anomaly correlation skill between nearest grid point NTR^ reforecasts anomalies (SSH + IBE + VLM) and East Coast NOAA gauge station NTR anomalies. (b) Difference between NTR^ skill and NTR persistence skill, (c) difference between NTR^ skill and (SSH + VLM)-only skill, and (d) difference between NTR^ skill and linearly detrended NTR^ skill.

Download

With the exception of two gauge stations (Burlington, Delaware River, NJ, and Bergen Point West Reach, NY), all of the NTR^ reforecasts for the East Coast are at least minimally reliable (regression slopes > 0.5), with some stations, particularly in the northeast, having quite high reliability (regression slopes > 0.7). Moreover, for many of the northernmost and southernmost gauge stations, the forecasts also have reasonably sharp forecast probability distributions, with SHP values somewhere between 50–68 (Fig. 5). In contrast, most of the mid-Atlantic stations have SHP values > 65 (that is, most reforecasts are near climatological probabilities), with only a small number of forecasts in the highest probability category.

https://os.copernicus.org/articles/21/1761/2025/os-21-1761-2025-f05

Figure 5(a) Year-round (2000–2019) East Coast NTR^ reliability (colored circles), sharpness (numbers), and percentage of forecasts that are in the top forecast probability category versus the lowest category (0 %–10 % versus 90 %–100 %). For reliability, slopes greater than 0.5 contribute to positive Brier skill scores; for sharpness, the scale spans 0–100, with smaller numbers representing sharper forecast distributions. Panels (b) and (c) show example reliability diagrams and sharpness distributions (ensemble probability counts) for two gauge stations, Eastport, Maine, and Oregon Inlet Marina, North Carolina, respectively, where the red ovals in panel (a) highlight the location of each station.

The IBE correction impacts reliability differently depending on the region, improving it for gauge stations roughly north of New York City and decreasing it for the middle Atlantic Bight stations with no discernible change in sharpness (not shown), while there is little impact further south in the Carolinas. Thus, on balance, the notable benefits of the IBE correction for the northern East Coast appear to outweigh the small reduction in reliability for the middle Atlantic Bight region, particularly because the middle Atlantic Bight gauge stations have marginal sharpness that weighs against their overall usefulness. With the linear trend removed, reforecasts at all East Coast gauge stations become more overconfident and less sharp (e.g., Fig. 5b and c).

3.2.3 Gulf Coast

NTR^ skill for many of the Gulf Coast stations remains at or above 0.5 until at least forecast week 6 (Fig. 6a), and for nearly all gauge stations, the NTR^ reforecasts are more skillful than persistence for weeks 2–6 (Fig. 6b). The IBE modestly improves weeks 1–2 reforecast skill for gauge stations between Mississippi and the southern tip of Florida (Fig. 6c), while the linear trend greatly increases reforecast skill for all stations for leads at and beyond week 2. While the steric and eustatic contributions to the linear trend are generally spatially uniform across the Gulf (e.g., Fig. 2.1 of Sweet et al., 2022), the effect of the VLM trend on reforecast skill is largely confined to gauge stations between Rockport, Texas, and Dauphin Island, Alabama (Fig. 10a), where VLM improves correlation skill by roughly 0.1–0.2, particularly for forecast leads beyond week 2 (Fig. S13c). Nevertheless, even when the linear trend is removed from the reforecasts, reforecast skill (i.e., IFS-only SSH + IBE) exceeds linearly detrended persistence skill for leads out to 2–3 weeks for Texas gauge stations and out to 6 weeks for stations between Louisiana and Florida (not shown).

https://os.copernicus.org/articles/21/1761/2025/os-21-1761-2025-f06

Figure 6(a) Year-round (2000–2019) week 3 NTR^ anomaly correlation skill between nearest grid point NTR^ reforecasts anomalies (SSH + IBE + VLM) and Gulf Coast NOAA gauge station NTR anomalies. (b) Difference between NTR^ skill and NTR persistence skill, (c) difference between NTR^ skill and (SSH + VLM)-only skill, and (d) difference between NTR^ skill and linearly detrended NTR^ skill.

Download

Gulf Coast forecasts have good reliability (Fig. 7a), with regression slopes in the 0.6–0.9 range. There is one outlier gauge station relatively far inland (Berwick, Atchafalaya River, LA, Fig. 7b), with poorer reliability and with relatively few forecasts in the highest probability category ( 10 %). Most of the Gulf Coast stations have reasonably decent sharpness quite similar to the St. Petersburg, FL, gauge station (Fig. 7c). The IBE correction increases reliability for gauge stations from Mississippi east to all of Florida (not shown) but appears not to impact reforecast skill in Louisiana or Texas. As for the East Coast, linearly detrending the reforecasts mildly decreases their reliability and sharpness (e.g., Fig. 7c).

https://os.copernicus.org/articles/21/1761/2025/os-21-1761-2025-f07

Figure 7(a) Year-round (2000–2019) Gulf Coast NTR^ reliability (colored circles), sharpness (numbers), and percentage of forecasts that are in the top forecast probability category versus the lowest category (0 %–10 % versus 90 %–100 %). For reliability, slopes greater than 0.5 contribute to positive Brier skill scores; for sharpness, the scale spans 0–100, with smaller numbers representing sharper forecast distributions. Panels (b) and (c) show example reliability diagrams and sharpness distributions (ensemble probability counts) for two gauge stations, Berwick, Atchafalaya River, Louisiana, and St. Petersburg, Florida, respectively, where the red ovals in panel (a) highlight the location of each station.

3.2.4 Alaska

Most Alaskan gauge stations have modest skill (anomaly correlation > 0.5) through week 3 (Fig. 8a), with skill exceeding persistence at all leads (Fig. 8b). The IBE notably contributes to reforecast skill for all stations out to lead times of 2–3 weeks, and for a handful of stations, out to even longer leads (Fig. 8c). Reliability is also quite good, with all but a few stations having regression slopes > 0.6 and moderate sharpness (Fig. 9). Except for Nome, Norton Sound, AK and Prudhoe Bay, AK, the IBE notably increases the slopes of the reliability regression lines (not shown), with many stations rising from poor reliability slopes (0.1–0.4) to quite useful slopes (0.5–0.8).

https://os.copernicus.org/articles/21/1761/2025/os-21-1761-2025-f08

Figure 8(a) Year-round (2000–2019) week 3 NTR^ anomaly correlation skill between nearest grid point NTR^ reforecasts anomalies (SSH + IBE + VLM) and Alaska NOAA gauge station NTR anomalies. (b) Difference between NTR^ skill and NTR persistence skill, (c) difference between NTR^ skill and (SSH + VLM)-only skill, and (d) difference between NTR^ skill and linearly detrended NTR^ skill.

Download

Removing the linear trend decreases overall skill for stations extending from roughly Kodiak Island southeast to Sitka (Fig. 8d). Along this section of the Alaskan coast, glacial isostatic rebound (see references in Sect. 2.2) is causing the land surface to rise, which is reflected in relatively large VLM rates over southeastern Alaska (Fig. 10a). Indeed, failing to account for VLM can lead to large errors in the NTR anomaly time series; see, for example, the NTR^ (blue lines) versus IFS-only (SSH + IBE) time series (orange lines) for Yakutat and Skagway, Alaska, in Fig. 10b and c, respectively. In general, derived VLM rates (Kopp et al., 2014; Sweet et al., 2022) successfully account for a large portion of the negative linear NTR trend, though the VLM-corrected time series (NTR^) at some gauge stations appear to more closely match the observed trend than for others (e.g., the VLM correction appears to underestimate the size of the land motion trend for Yakutat, Fig. 10b). Indeed, adjusting the IFS reforecasts with the predicted VLM rates does not completely resolve all trend issues, as evidenced by Port Alexander (Fig. 9c), where linearly detrending the reforecasts increases reliability. For stations in the Aleutian Islands and northwards to Prudhoe Bay, AK where isostatic rebound is either not occurring or not significant, accounting for VLM only mildly increases reliability. Nevertheless, adjusting the IFS reforecasts to account for the VLM trend notably improves both deterministic and probabilistic NTR^ reforecast skill between Kodiak Island and Sitka for all forecast leads, increasing anomaly correlations by 0.1–0.75 (Fig. S13a).

https://os.copernicus.org/articles/21/1761/2025/os-21-1761-2025-f09

Figure 9(a) Year-round (2000–2019) Alaska NTR^ reliability (colored circles), sharpness (numbers), and percentage of forecasts that are in the top forecast probability category versus the lowest category (0 %–10 % versus 90 %–100 %). For reliability, slopes greater than 0.5 contribute to positive Brier skill scores; for sharpness, the scale spans 0–100, with smaller numbers representing sharper forecast distributions. Panels (b) and (c) show example reliability diagrams and sharpness distributions (ensemble probability counts) for two gauge stations, Skagway, Taiya Inlet, Alaska, and Port Alexander, Alaska, respectively, where the red ovals in panel (a) highlight the location of each station.

https://os.copernicus.org/articles/21/1761/2025/os-21-1761-2025-f10

Figure 10(a) Vertical land motion rates (cm yr−1) shown as colored contours with the NOAA gauge station locations used in the study shown as filled circular markers, where the color of the filled marker denotes the improvement in anomaly correlation skill when VLM is included in the reforecast, i.e., skill of NTR^ reforecast minus skill of IFS-only reforecast (SSH + IBE). (b) Week 3 NTR^ reforecast time series (SSH + IBE + VLM, blue line), IFS-only reforecast (SSH + IBE, gray line) and NOAA gauge station NTR time series (orange line) at the Yakutat, Yakutat Bay, Alaska, gauge station. (c) Same as (b) but for the Skagway, Taiya Inlet, Alaska, gauge station NTR time series (orange line) at the Yakutat, Yakutat Bay, Alaska, gauge station.

3.3 Seasonality of deterministic skill

While coastal flooding can happen in any season, all four United States sub-regions tend to have peaks in water levels exceeding the 90th percentile of the observed NTR during the winter months (not shown). To understand how the IFS performs during these peak exceedance seasons, Figs. 11 and 12 group forecasts into four 3-month periods (JFM, AMJ, JAS, and OND), where for each season and gauge station, the latest forecast lead when NTR^ anomaly correlation skill exceeds 0.5 is listed, both with the linear trend included (left-hand columns) and without (right-hand columns).

https://os.copernicus.org/articles/21/1761/2025/os-21-1761-2025-f11

Figure 11Seasonal cycle of NTR^ reforecast skill (2000–2019) where the color bar (and number in each box) corresponds to the latest forecast lead (from week 1 to 6) when the anomaly correlation is greater than or equal to 0.5. In each panel, the seasonal cycle is split into four 3-month periods, where the panels correspond to correlations among (a) East Coast NOAA gauge station NTR anomalies and NTR^ reforecasts, (b) linearly detrended East Coast NOAA gauge station NTR anomalies and linearly detrended NTR^ reforecasts, (c) Gulf Coast NOAA gauge station NTR anomalies and NTR^ reforecasts, and (d) linearly detrended Gulf Coast NOAA gauge station NTR anomalies and linearly detrended NTR^ reforecasts.

Download

With the linear trend included, forecasts skill for the East remains above 0.5 for forecast leads out to at least 2 weeks for all seasons and gauge stations, with the exception of the East Coast during late fall to early winter when skill only exceeds 0.5 for leads of 1 week (Fig. 11a). For the Gulf Coast, reforecast skill exceeds 0.5 for most stations through at least 2–3 weeks but as far out as 6 weeks for many stations and seasons (Fig. 11c). With the linear trend removed, skill out to and beyond 2 week lead times is largely confined to late winter to early summer for the East Coast (Fig. 11b) and late fall to early spring for the Gulf Coast (Fig. 11d).

Reforecast skill is notably better for the central to southern portions of the West Coast and is relatively insensitive to a linear trend (see Fig. 12a and b), with reforecast skill exceeding 0.5 through forecast week 6 for nearly all California gauge stations throughout the year. For Oregon and Washington, skill exceeds 0.5 for forecast leads of 2–3 weeks. For Alaska, reforecast skill for most stations exceeds 0.5 through week 3 for the cold season (October–March) but only through week 2 for the warm season (April–September), with the exception of the stations between Yakutat and Juneau, where reforecast skill exceeds 0.5 at all lead times almost year-round (Fig. 12c and d).

https://os.copernicus.org/articles/21/1761/2025/os-21-1761-2025-f12

Figure 12Seasonal cycle of NTR^ reforecast skill (2000–2019) where the color bar (and number in each box) corresponds to the latest forecast lead (from week 1 to 6) when the anomaly correlation is greater than or equal to 0.5. In each panel, the seasonal cycle is split into four 3-month periods, where the panels correspond to correlations among (a) West Coast NOAA gauge station NTR anomalies and NTR^ reforecasts, (b) linearly detrended West Coast NOAA gauge station NTR anomalies and linearly detrended NTR^ reforecasts, (c) Alaska NOAA gauge station NTR anomalies and NTR^ reforecasts, and (d) linearly detrended Alaska NOAA gauge station NTR anomalies and linearly detrended NTR^ reforecasts.

Download

4 Conclusions

The primary goal of this paper is to assess the suitability of the current generation of forecast models for making coastal inundation forecasts on subseasonal timescales. Indeed, we find that the deterministic skill of the IFS and CNRM exceeds that of damped persistence for many US coastal regions for forecast leads extending out to 2–3 weeks, with the IFS continuing to have skill above damped persistence for longer leads through week 6. However, when reforecasts and persistence reforecasts are linearly detrended, the skill of both models over persistence increases, highlighting the ability of the models to successfully simulate water level anomalies related to subseasonal climate variability. When a simple bias correction is applied to the IFS's probability distribution, the IFS generally has “useful” reliability (Weisheimer and Palmer, 2014) that contributes positively to the Brier skill score (Mason, 2004), though the need for the probability bias correction highlights that the model ensemble spread is consistently underdispersive. Thus, our results suggest that the current generation of operational forecast models provide predictions of coastal inundation with sufficient skill to form the basis for improved forecast guidance of high tide flooding predictions on subseasonal timescales.

We have also demonstrated the regional and forecast lead time dependence of weekly NTR^ prediction skill. California has by far highest skill, with anomaly correlation skill above 0.6 through at least week 6. Many portions of Alaska also have anomaly correlation skill that reaches or exceeds 0.5 for leads through week 3, with some stations in southeastern Alaska having skill exceeding 0.5 through week 6 once past VLM trends are taken into account. On the East Coast, on the other hand, skill is quite low for the mid-Atlantic states but is relatively higher south of Cape Hatteras and further north along the New England coast (Figs. 1 and 4), where anomaly correlation skill generally ranges from 0.4 to 0.5 for forecast leads out to at least week 3. For many Gulf Coast stations, skill remains above 0.4 to 0.5 for leads out to at least week 6. However, for many Alaskan and East and Gulf Coast gauge stations, a large fraction of the reforecast skill can be attributed to the linear NTR^ trend, which includes steric and eustatic trends that are explicitly accounted for in the model reforecasts, as well as VLM trends that are added via post-processing.

How the predicted IBE impacts NTR^ reforecast skill also appears to strongly depend upon forecast lead time and region. Accounting for the IBE primarily improves reforecast skill during forecast weeks 1 and 2, which is consistent with predictable IBE signals being limited to weather timescale atmospheric pressure fluctuations (e.g., the timescales suggested in Fig. 2 of Woodworth et al., 2019). Still, the IBE is essential since, for many gauge stations, week 1 IFS reforecasts only have more skill than persistence when the IBE correction is applied. The regional dependence of IBE-related skill improvement is at least somewhat consistent with the idea that the IBE is more important at higher latitudes (Chelton and Davis, 1982), particularly in the Gulf of Alaska and the northern portion of the East Coast (Wunsch and Stammer, 1997; Ponte, 2006). However, it is likely that the geographical dependence of MSLP reforecast skill itself also plays a role. For example, the IFS at weeks 3–6 tends to have higher MSLP skill for the East Coast and the Aleutian Islands than for the West Coast and mainland Alaska (Albers and Newman, 2019; see their Figs. 1 and S2). Likewise, while the importance of the IBE seems to monotonically increase from south to north along the East Coast (Fig. 4), the middle of the West Coast appears to benefit more from including the IBE than do the northernmost West Coast gauge stations (Fig. 2). Overall, the skill improvement we find with a post-processed IBE correction suggests that future forecast model development would benefit from the explicit inclusion of the IBE.

The relatively low NTR^ reforecast skill for many regions suggests that usable NTR forecast guidance may benefit from identifying “forecasts of opportunity”; that is, when predictions are expected to have skill at the time of forecast issuance (e.g., Albers and Newman, 2019; Lang et al., 2020; Mariotti et al., 2020; and references therein). For example, when the MJO, ENSO, or the stratosphere generates more predictable oceanic waves or teleconnections in sea level pressure and surface winds (Barnston et al., 2019; DelSole et al., 2017; Kim et al., 2018; Tripathi et al., 2015; Vitart and Molteni, 2010; Albers and Newman, 2021) coastal NTR^ anomalies may also become more predictable, supporting more confident guidance for coastal communities and stakeholders. For the southern and central portion of California, the relatively high reforecast skill likely can be anticipated as a consequence of coastal Kelvin waves and remote wind variability from coupled modes of variability including ENSO (Menéndez and Woodworth, 2010; Arcodia et al., 2024; Amaya et al., 2022). For the East Coast, identifying predictable Gulf Stream subseasonal anomalies may lead to identifying times of higher NTR^ reforecast skill from Florida through the Gulf Stream separation point near Cape Hatteras, since Gulf Stream variability and coastal SLAs are known to be correlated (Ezer et al., 2013; Ezer, 2016; Chi et al., 2023). In contrast, the increase in coastal reforecast skill further north along the New England portion of the coast may be associated with the NAO, whose skill on subseasonal timescales can be anticipated from prior stratospheric and tropical conditions (e.g., Albers and Newman, 2021; and references therein), since the NAO has a stronger influence further north in New England versus the mid-Atlantic (Hurrell et al., 2003; Visbeck et al., 2003). Our study will form the basis for future work that uses dynamical models to better predict these subseasonal forecasts of opportunity.

Data availability

The data used in this study are available from the following locations: GLORYS (https://data.marine.copernicus.eu/product/GLOBAL_MULTIYEAR_PHY_001_030/description, last access: 30 January 2024, Lellouche et al., 2021), ECMWF IFS and CNRM reforecasts (https://apps.ecmwf.int/datasets/data/s2s/levtype=sfc/type=cf/, last access: 13 March 2024, Voldoire et al., 2019; Vitart et al., 2017 ), VLM (https://oceanservice.noaa.gov/hazards/sealevelrise/Sea_Level_Rise_Datasets_2022.zip, NOAA National Ocean Service, 2021; Sweet et al., 2022), and NOAA tide gauges (https://api.tidesandcurrents.noaa.gov/mdapi/prod/, NOAA National Ocean Service, 2024).

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/os-21-1761-2025-supplement.

Author contributions

Conceptualization, project administration, and supervision by JRA and MN. Data curation by JRA and YW. Formal analysis, investigation, methodology, visualization, and writing of original draft by JRA. Project administration by JRA and MN. Writing, review, and editing by JRA, MN, MAB, WS, TX, and YW.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

The authors would like to thank Matthew Widlansky, Chia-Wei Hsu, Dillon Amaya, and Paige Hovenga for discussions that improved the article.

Financial support

This research has been supported by the US Department of Commerce (Bipartisan Infrastructure Law (BIL) and NOAA cooperative agreement (grant no. NA22OAR4320151)).

Review statement

This paper was edited by John M. Huthnance and reviewed by two anonymous referees.

References

Albers, J. R. and Newman, M.: A priori identification of skillful extratropical subseasonal forecasts, Geophys. Res. Lett., 46, 12527–12536, https://doi.org/10.1029/2019GL085270, 2019. 

Albers, J. R. and Newman, M.: Subseasonal predictability of the North Atlantic Oscillation, Environ. Res. Lett., 16, 044024, https://doi.org/10.1088/1748-9326/abe781, 2021. 

Amaya, D. J., Jacox, M. G., Dias, J., Alexander, M. A., Karnauskas, K. B., Scott, J. D., and Gehne, M.: Subseasonal-to-seasonal forecast skill in the California Current System and its connection to coastal Kelvin waves, J. Geophys. Res.-Oceans, 127, e2021JC017892, https://doi.org/10.1029/2021JC017892, 2022. 

Amaya, D. J., Alexander, M. A., Scott, J. D., and Jacox, M. G.: An evaluation of high-resolution ocean reanalyses in the California current system, Prog. Oceanogr., 210, 102951, https://doi.org/10.1016/j.pocean.2022.102951, 2023. 

Arbic, B. K.: Atmospheric forcing of the oceanic semidiurnal tide, Geophys. Res. Lett., 32, L02610, https://doi.org/10.1029/2004GL021668, 2005. 

Arcodia, M. C., Becker, E., and Kirtman, B. P.: Subseasonal Variability of US Coastal Sea Level from MJO and ENSO Teleconnection Interference, Weather Forecast., 39, 441–458, https://doi.org/10.1175/WAF-D-23-0002.1, 2024. 

Atger, F.: The skill of ensemble prediction systems, Mon. Weather Rev., 127, 1941–1953, 1999. 

Barnston, A. G., Tippett, M. K., Ranganathan, M., and L'Heureux, M. L.: Deterministic skill of ENSO predictions from the North American Multimodel Ensemble, Clim. Dynam., 53, 7215–7234, https://doi.org/10.1007/s00382-017-3603-3, 2019. 

Bauer, P., Thorpe, A., and Brunet, G.: The quiet revolution of numerical weather prediction, Nature, 525, 47, https://doi.org/10.1038/nature14956, 2015. 

Boucharel, J., David, M., Almar, R., and Melet, A.: Contrasted influence of climate modes teleconnections to the interannual variability of coastal sea level components–implications for statistical forecasts, Clim. Dynam., 61, 4011–4032, https://doi.org/10.1007/s00382-023-06771-1, 2023. 

Castillo-Trujillo, A. C., Kwon, Y.-O., Fratantoni, P., Chen, K., Seo, H., Alexander, M. A., and Saba, V. S.: An evaluation of eight global ocean reanalyses for the Northeast US continental shelf, Prog. Oceanogr., 219, 103126, https://doi.org/10.1016/j.pocean.2023.103126, 2023. 

Centre national d'études spatiales: Along-track level-2+(L2P) SLA product handbook, Tech. rep., https://www.aviso.altimetry.fr/fileadmin/documents/data/tools/hdbk_L2P_all_missions_except_S3_S6.pdf (last access: 30 July 2025), 2020. 

Chelton, D. B. and Davis, R. E.: Monthly mean sea-level variability along the west coast of North America, J. Phys. Oceanogr., 12, 757–784, 1982. 

Chi, L., Wolfe, C. L., and Hameed, S.: Reconsidering the relationship between Gulf Stream transport and dynamic sea level at US East Coast, Geophys. Res. Lett., 50, e2022GL102018, https://doi.org/10.1029/2022GL102018, 2023. 

DelSole, T., Trenary, L., Tippett, M. K., and Pegion, K.: Predictability of week-3–4 average temperature and precipitation over the contiguous United States, J. Climate, 30, 3499–3512, https://doi.org/10.1175/JCLI-D-16-0567.1, 2017. 

Doodson, A. T.: Meteorological Perturbations of Sea-Level, Nature, 112, 765–766, https://doi.org/10.1038/112765a0, 1923. 

Dusek, G., Sweet, W. V., Widlansky, M. J., Thompson, P. R., and Marra, J. J.: A novel statistical approach to predict seasonal high tide flooding, Frontiers in Marine Science, 9, 1073792, https://doi.org/10.3389/fmars.2022.1073792, 2022. 

Enfield, D. B. and Allen, J.: On the structure and dynamics of monthly mean sea level anomalies along the Pacific coast of North and South America, J. Phys. Oceanogr., 10, 557–578, https://doi.org/10.1175/1520-0485(1980)010<0557:OTSADO>2.0.CO;2, 1980. 

Epstein, E. S.: A spectral climatology, J. Climate, 1, 88–107, https://doi.org/110.1175/1520-0442(1988)001<0088:ASC>2.0.CO;2, 1988. 

Ezer, T.: Can the Gulf Stream induce coherent short-term fluctuations in sea level along the US East Coast? A modeling study, Ocean Dynam., 66, 207–220, https://doi.org/10.1007/s10236-016-0928-0, 2016. 

Ezer, T. and Atkinson, L. P.: Accelerated flooding along the US East Coast: On the impact of sea-level rise, tides, storms, the Gulf Stream, and the North Atlantic Oscillations, Earth's Future, 2, 362–382, https://doi.org/10.1002/2014EF000252, 2014. 

Ezer, T., Atkinson, L. P., Corlett, W. B., and Blanco, J. L.: Gulf Stream's induced sea level rise and variability along the US mid-Atlantic coast, J. Geophys. Res.-Oceans, 118, 685–697, https://doi.org/10.1002/jgrc.20091, 2013. 

Feng, X., Widlansky, M. J., Balmaseda, M. A., Zuo, H., Spillman, C. M., Smith, G., Long, X., Thompson, P., Kumar, A., Dusek, G., and Sweet, W.: Improved capabilities of global ocean reanalyses for analysing sea level variability near the Atlantic and Gulf of Mexico Coastal US, Front. Mar. Sci., 11, 1338626, https://doi.org/10.3389/fmars.2024.1338626, 2024. 

Fleming, E., Payne, J., Sweet, W., Craghan, M., Haines, J., Hart, J. F., Stiller, H., and Sutton-Grier, A.: Coastal effects, in: Impacts, Risks, and Adaptation in the United States: Fourth National Climate Assessment, Volume II, edited by: Reidmiller, D. R., Avery, C. W., Easterling, D. R., Kunkel, K. E., Lewis, K. L. M., Maycock, T. K., and Stewart, B. C., U.S. Global Change Research Program, Washington, D.C., 322–352, https://doi.org/10.7930/NCA4.2018.CH8, 2018. 

Groves, G. W. and Hannan, E. J.: Time series regression of sea level on weather, Rev. Geophys., 6, 129–174, https://doi.org/10.1029/RG006i002p00129, 1968. 

Hammond, W. C., Blewitt, G., Kreemer, C., and Nerem, R. S.: GPS imaging of global vertical land motion for studies of sea level rise, J. Geophys. Res.-Sol. Ea., 126, e2021JB022355, https://doi.org/10.1029/2021JB022355, 2021. 

Han, W., Stammer, D., Thompson, P., Ezer, T., Palanisamy, H., Zhang, X., Domingues, C. M., Zhang, L., and Yuan, D.: Impacts of basin-scale climate modes on coastal sea level: a review, Surv. Geophys., 40, 1493–1541, https://doi.org/10.1007/s10712-019-09562-8, 2019. 

Hu, Y. and Freymueller, J. T.: Geodetic observations of time-variable glacial isostatic adjustment in Southeast Alaska and its implications for Earth rheology, J. Geophys. Res.-Sol. Ea., 124, 9870–9889, https://doi.org/10.1029/2018JB017028, 2019. 

Hughes, C. W., Fukumori, I., Griffies, S. M., Huthnance, J. M., Minobe, S., Spence, P., Thompson, K. R., and Wise, A.: Sea level and the role of coastal trapped waves in mediating the influence of the open ocean on the coast, Surv. Geophys., 40, 1467–1492, https://doi.org/10.1007/s10712-019-09535-x, 2019. 

Hurrell, J. W., Kushnir, Y., Ottersen, G., and Visbeck, M.: An overview of the North Atlantic Oscillation, Geophysical Monograph-American Geophysical Union, 134, 1–36, https://doi.org/10.1029/134GM01, 2003. 

Jolliffe, I. T. and Stephenson, D. B.: Forecast verification: a practitioner's guide in atmospheric science, John Wiley & Sons, ISBN 978-1-119-96000-3, 2011. 

Kavanaugh, K., Keeney, A., Callahan, J., and Sweet, W.: Notice of Methodology Update: NOAA High Tide Flooding Outlooks, NOAA Technical Service Publication NOS CO-OPS 001 NOAA Technical Service Publication NOS CO-OPS 001, https://repository.library.noaa.gov/view/noaa/70863 (last access: 30 July 2025), 2023. 

Kim, H., Vitart, F., and Waliser, D. E.: Prediction of the Madden–Julian oscillation: A review, J. Climate, 31, 9425–9443, https://doi.org/10.1175/JCLI-D-18-0210.1, 2018. 

Kopp, R. E., Horton, R. M., Little, C. M., Mitrovica, J. X., Oppenheimer, M., Rasmussen, D., Strauss, B. H., and Tebaldi, C.: Probabilistic 21st and 22nd century sea-level projections at a global network of tide-gauge sites, Earth's Future, 2, 383–406, https://doi.org/10.1002/2014EF000239, 2014. 

Lang, A. L., Pegion, K., and Barnes, E. A.: Introduction to Special Collection: “Bridging Weather and Climate: Subseasonal-to- Seasonal (S2S) Prediction”, J. Geophys. Res., 125, e2019JD031833, https://doi.org/10.1029/2019JD031833, 2020. 

Larsen, C. F., Motyka, R. J., Freymueller, J. T., Echelmeyer, K. A., and Ivins, E. R.: Rapid uplift of southern Alaska caused by recent ice loss, Geophys. J. Int., 158, 1118–1133, https://doi.org/10.1111/j.1365-246X.2004.02356.x, 2004. 

Le Traon, P.-Y. and Gauzelin, P.: Response of the Mediterranean mean sea level to atmospheric pressure forcing, J. Geophys. Res.-Oceans, 102, 973–984, https://doi.org/10.1029/96JC02777, 1997. 

Lellouche, J.-M., Greiner, E., Bourdallé-Badie, R., Garric, G., Melet, A., Drévillon, M., Bricaud, C., Hamon, M., Le Galloudec, O., Regnier, C., Candela, T., Testut, C.-E., Gasparin, F., Ruggiero, G., Benkiran, M., Drillet, Y., and Le Traon, P.-Y.: The Copernicus global 1/12 oceanic and sea ice GLORYS12 reanalysis, Frontiers in Earth Science, 9, 698876, https://doi.org/10.3389/feart.2021.698876, 2021. 

Lindsey, R., Engdahl, J., Murry, N., Keeney, A., Miller, A., Zervas, C., and Luscher, A.: Interactive map: How has local sea level in the United States changed over time?, Tech. rep., Climate.gov, https://www.climate.gov/news-features/features/interactive-map-how-has-local-sea-level-united-states-changed-over-time (last access: 30 July 2025), 2021. 

Long, X., Widlansky, M. J., Spillman, C. M., Kumar, A., Balmaseda, M., Thompson, P. R., Chikamoto, Y., Smith, G. A., Huang, B., Shin, C.-S., and Merrifield, W.: Seasonal forecasting skill of sea-level anomalies in a multi-model prediction framework, J. Geophys. Res.-Oceans, 126, e2020JC017060, https://doi.org/10.1029/2020JC017060, 2021. 

Long, X., Shin, S.-I., and Newman, M.: Statistical Downscaling of Seasonal Forecast of Sea Level Anomalies for US Coasts, Geophys. Res. Lett., 50, e2022GL100271, https://doi.org/10.1029/2022GL100271, 2023. 

Long, X., Newman, M., Shin, S.-I., Balmeseda, M., Callahan, J., Dusek, G., Jia, L., Kirtman, B., Krasting, J., Lee, C. C., and Lee, T..: Evaluating Current Statistical and Dynamical Forecasting Techniques for Seasonal Coastal Sea Level Prediction, J. Climate, 38, 1477–1503, https://doi.org/10.1175/JCLI-D-24-0214.1, 2025. 

Lorenz, E. N.: Deterministic nonperiodic flow, J. Atmos. Sci., 20, 130–141, 1963. 

Lorenz, E. N.: The predictability of a flow which possesses many scales of motion, Tellus, 21, 289–307, 1969. 

Madec, G., Bourdallé-Badie, R., Bouttier, P.-A., Bricaud, C., Bruciaferri, D., Calvert, D., Chanut, J., Clementi, E., Coward, A., Delrosso, D., Ethé, C., Flavoni, S., Graham, T., Harle, J., Iovino, D., Lea, D., Lévy, C., Lovato, T., Martin, N., Masson, S., Mocavero, S., Paul, J., Rousset, C., Storkey, D., Storto, A., and Vancoppenolle, M.: NEMO ocean engine, Notes du Pôle de modélisation de l'Institut Pierre-Simon Laplace (IPSL): (27), https://www.nemo-ocean.eu/doc/ (last access: 30 July 2025), 2017. 

Mariotti, A., Albers, J. R., Baggett, C., Barnes, E. A., Becker, E., Butler, A., Collins, D. C., Dirmeyer, P. A., Ferranti, L., Johnson, N. C., Jones, J., Kirtman, B. P., Lang, A. L., Molod, A. M., Newman, M., Robertson, A. W., Schubert, S. D., and Waliser, D. E.: Windows of Opportunity for Skillful Forecasts Subseasonal to Seasonal and Beyond, B. Am. Metereol. Soc., 101, E608–E625, https://doi.org/10.1175/BAMS-D-18-0326.1, 2020. 

Mason, S. J.: On using “climatology” as a reference strategy in the Brier and ranked probability skill scores, Mon. Weather Rev., 132, 1891–1895, 2004. 

Menéndez, M. and Woodworth, P. L.: Changes in extreme high water levels based on a quasi-global tide-gauge data set, J. Geophys. Res.-Oceans, 115, : C10011, https://doi.org/10.1029/2009JC005997, 2010. 

Merrifield, M. A., Genz, A. S., Kontoes, C. P., and Marra, J. J.: An- nual maximum water levels from tide gauges: Contributing factors and geographic patterns, J. Geophys. Res.-Oceans, 118, 2535–2546, https://doi.org/10.1002/jgrc.20173, 2013. 

NOAA National Ocean Service: Mean Sea Level Dataset for Global and Regional Sea Level Rise Scenarios for the United States: Updated Mean Projections and Extreme Water Level Probabilities Along U.S. Coastlines, Tech. rep., https://oceanservice.noaa.gov/hazards/sealevelrise/Sea_Level_Rise_Datasets_2022.zip (last access: 30 July 2025), 2021. 

NOAA National Ocean Service: A NOAA Capability for Coastal Flooding and Innundation Information and Services at Climate Timescales, Tech. rep., https://cpo.noaa.gov/wp-content/uploads/2023/08/NOAA-Coastal-Inundation-at-Climate-Timescales-Whitepaper.pdf (last access: 30 July 2025), 2022. 

NOAA National Ocean Service: NOAA Tide Predictions, Tech. rep., https://tidesandcurrents.noaa.gov/PageHelp.html (last access: 30 July 2025), 2024. 

Oddo, P., Bonaduce, A., Pinardi, N., and Guarnieri, A.: Sensitivity of the Mediterranean sea level to atmospheric pressure and free surface elevation numerical formulation in NEMO, Geosci. Model Dev., 7, 3001–3015, https://doi.org/10.5194/gmd-7-3001-2014, 2014. 

Oelsmann, J., Marcos, M., Passaro, M., Sanchez, L., Dettmering, D., Dangendorf, S., and Seitz, F.: Regional variations in relative sea-level changes influenced by nonlinear vertical land motion, Nat. Geosci., 17, 137–144, https://doi.org/10.1038/s41561-023-01357-2, 2024. 

Piecuch, C. G. and Ponte, R. M.: Inverted barometer contributions to recent sea level changes along the northeast coast of North America, Geophys. Res. Lett., 42, 5918–5925, https://doi.org/10.1002/2015GL064580, 2015. 

Ponte, R. M.: The sea level response of a stratified ocean to barometric pressure forcing, J. Phys. Oceanogr., 22, 109–113, https://doi.org/10.1002/2015GL064580, 1992. 

Ponte, R. M.: Low-frequency sea level variability and the inverted barometer effect, J. Atmos. Ocean. Tech., 23, 619–629, https://doi.org/10.1175/JTECH1864.1, 2006. 

Ponte, R. M. and Ray, R. D.: Atmospheric pressure corrections in geodesy and oceanography: A strategy for handling air tides, Geophys. Res. Lett., 29, 6–1, 2002. 

Renkl, C., Oliver, E. C., and Thompson, K. R.: Downscaling the ocean response to the Madden–Julian Oscillation in the Northwest Atlantic and adjacent shelf seas, Clim. Dynam., 62, 6719–6744, https://doi.org/10.1007/s00382-024-07233-y, 2024. 

Ross, J. C.: On the effect of the pressure of the atmosphere on the mean level of the ocean, Philos. T. R. Soc. Lond., 144, 285–296, https://doi.org/10.1098/rstl.1854.0013, 1854. 

Shin, S.-I. and Newman, M.: Seasonal predictability of global and North Amer- ican coastal sea surface temperature and height anomalies, Geophys. Res. Lett., 48, e2020GL091886, https://doi.org/10.1029/2020GL091886, 2021. 

Simmons, A. J. and Hollingsworth, A.: Some aspects of the improvement in skill of numerical weather prediction, Q. J. Roy. Meteor. Soc., 128, 647–677, https://doi.org/10.1256/003590002321042135, 2002. 

Sweet, W., Park, J., Marra, J., Zervas, C., and Gill, S.: Sea level rise and nuisance flood frequency changes around the United States, NOAA Technical Report NOS CO-OPS 073, https://repository.library.noaa.gov/view/noaa/30823 (last access: 30 July 2025), 2014. 

Sweet, W., Dusek, G., Obeysekera, J., and Marra, J. J.: Patterns and projections of high tide flooding along the US coastline using a common impact threshold, NOAA Technical Report NOS CO-OPS 086, https://repository.library.noaa.gov/view/noaa/17403 (last access: 30 July 2025), 2018. 

Sweet, W. V. and Park, J.: From the extreme to the mean: Acceleration and tipping points of coastal inundation from sea level rise, Earth's Future, 2, 579–600, https://doi.org/10.1002/2014EF000272, 2014. 

Sweet, W. V. and Zervas, C.: Cool-season sea level anomalies and storm surges along the US East Coast: Climatology and comparison with the 2009/10 El Niño, Mon. Weather Rev., 139, 2290–2299, 2011. 

Sweet, W. V., Hamlington, B. D., Kopp, R. E., Weaver, C. P., Barnard, P. L., Bekaert, D., Brooks, W., Craghan, M., Dusek, G., Frederikse, T., Garner, G., Genz, A. S., Krasting, J. P., Larour, E., Marcy, D., Marra, J. J., Obeysekera, J., Osler, M., Pendleton, M., Roman, D., Schmied, L., Veatch, W., White, K. D., and Zuzak, C.: Global and Regional Sea Level Rise Scenarios for the United States: Updated Mean Projections and Extreme Weather Level Probabilities along US Coastlines, NOAA Technical Report NOS 01. National Oceanic and Atmospheric Administration, National Ocean Service, Silver Spring, MD, 111 pp., https://earth.gov/sealevel/us/internal_resources/756/noaa-nos-techrpt01-global-regional-SLR-scenarios-US.pdf (last access: 30 July 2025), 2022. 

Tai, C.-K.: On the quasigeostrophic oceanic response to atmospheric pressure forcing: The inverted barometer pumping, NOAA Technical Memorandum NOS OES 005, https://repository.library.noaa.gov/view/noaa/2629 (last access: 30 July 2025), 1993. 

Tripathi, O. P., Charlton-Perez, A., Sigmond, M., and Vitart, F.: Enhanced long-range forecast skill in boreal winter following stratospheric strong vortex conditions, Environ. Res. Lett., 10, 104007, https://doi.org/10.1088/1748-9326/10/10/104007, 2015. 

Valle-Levinson, A., Dutton, A., and Martin, J. B.: Spatial and temporal vari- ability of sea level rise hot spots over the eastern United States, Geophys. Res. Lett., 44, 7876–7882, https://doi.org/10.1002/2017GL073926, 2017. 

Vinogradov, S. V. and Ponte, R. M.: Low-frequency variability in coastal sea level from tide gauges and altimetry, J. Geophys. Res.-Oceans, 116, C07006, https://doi.org/10.1029/2011JC007034, 2011. 

Visbeck, M., Chassignet, E. P., Curry, R. G., Delworth, T. L., Dickson, R. R., and Krahmann, G.: The ocean's response to North Atlantic Oscillation variability, Geophys. Monogr. Ser., 134, 113–146, https://doi.org/10.1029/134GM06, 2003. 

Vitart, F. and Molteni, F.: Simulation of the Madden–Julian oscillation and its teleconnections in the ECMWF forecast system, Q. J. Roy. Meteor. Soc., 136, 842–855, https://doi.org/10.1002/qj.623, 2010. 

Vitart, F., Ardilouze, C., Bonet, A., Brookshaw, A., Chen, M., Codorean, C., Déqué, M., Ferranti, L., Fucile, E., Fuentes, M., and Hendon, H.: The subseasonal to seasonal (S2S) prediction project database, B. Am. Meteorol. Soc., 98, 163–173, https://doi.org/10.1175/BAMS-D-16-0017.1, 2017. 

Voldoire, A., Saint-Martin, D., Sénési, S., Decharme, B., Alias, A., Chevallier, M., Colin, J., Gúerémy, J.-F., Michou, M., Moine, M.-P., and Nabat, P.: Evaluation of CMIP6 deck experiments with CNRM-CM6-1, J. Adv. Model. Earth Sy., 11, 2177–2213, https://doi.org/10.1029/2019MS001683, 2019. 

Weber, N. J. and Mass, C. F.: Evaluating CFSv2 subseasonal forecast skill with an emphasis on tropical convection, Mon. Weather Rev., 145, 3795–3815, https://doi.org/10.1175/MWR-D-17-0109.1, 2017. 

Weisheimer, A. and Palmer, T.: On the reliability of seasonal climate forecasts, J. R. Soc. Interface, 11, 20131162, https://doi.org/10.1098/rsif.2013.1162, 2014. 

Widlansky, M. J., Marra, J. J., Chowdhury, M. R., Stephens, S. A., Miles, E. R., Fauchereau, N., Spillman, C. M., Smith, G., Beard, G., and Wells, J.: Multimodel ensemble sea level forecasts for tropical Pacific islands, J. Appl. Meteorol. Clim., 56, 849–862, https://doi.org/10.1175/JAMC-D-16-0284.1, 2017. 

Widlansky, M. J., Long, X., Balmaseda, M. A., Spillman, C. M., Smith, G., Zuo, H., Yin, Y., Alves, O., and Kumar, A.: Quantifying the benefits of altimetry assimilation in seasonal forecasts of the upper ocean, J. Geophys. Res.-Oceans, 128, e2022JC019 342, https://doi.org/10.1029/2022JC019342, 2023. 

Wilks, D. S.: Statistical methods in the atmospheric sciences, vol. 100, Academic Press, ISBN 978-0-12-815823-4, 2011. 

Woodworth, P. L., Melet, A., Marcos, M., Ray, R. D., Wöppelmann, G., Sasaki, Y. N., Cirano, M., Hibbert, A., Huthnance, J. M., Monserrat, S., and Merrifield, W.: Forcing factors affecting sea level changes at the coast, Surv. Geophys., 40, 1351–1397, https://doi.org/10.1007/s10712-019-09531-1, 2019.  

Wulff, C. O., Vitart, F., and Domeisen, D. I.: Influence of trends on subseasonal temperature prediction skill, Q. J. Roy. Meteor. Soc., 148, 1280–1299, 2022. 

Wunsch, C.: Large-scale response of the ocean to atmospheric forcing at low frequencies, J. Geophys. Res.-Oceans, 96, 15083–15092, https://doi.org/10.1029/91JC01457, 1991. 

Wunsch, C. and Stammer, D.: Atmospheric loading and the oceanic “inverted barometer” effect, Rev. Geophys., 35, 79–107, https://doi.org/10.1029/96RG03037, 1997. 

Zhang, F., Sun, Y. Q., Magnusson, L., Buizza, R., Lin, S.-J., Chen, J.-H., and Emanuel, K.: What is the predictability limit of midlatitude weather?, J. Atmos. Sci., 76, 1077–1091, https://doi.org/10.1175/JAS-D-18-0269.1, 2019. 

Download
Short summary

Providing early warning of coastal flooding is an emerging priority for the National Oceanic and Atmospheric Administration. We assess whether current operational forecast models can provide the basis for predicting the risks of higher-than-normal coastal sea level values up to 6 weeks in advance. For many United States coastal locations, models have sufficient prediction skill to be used as the basis for the development of a high tide flooding prediction system on subseasonal timescales.

Share