Introduction

Ocean Science

Ocean Sci.

1812-0792

Copernicus Publications

Göttingen, Germany

10.5194/os-13-303-2017

Technical note: Evaluation of three machine learning models for surface ocean CO2 mapping

Jiye Zeng (zeng@nies.go.jp), Tsuneo Matsunaga (https://orcid.org/0000-0002-3380-5230), Nobuko Saigusa, Tomoko Shirai, Shin-ichiro Nakaoka, and Zheng-Hong Tan

1 Centre for Global Environmental Research, National Institute for Environmental Studies, Tsukuba, Ibaraki, Japan

2 Institute of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, China

Jiye Zeng (zeng@nies.go.jp)

19 April 2017

13, 2, 303–313; 6 September 2016; 25 October 2016; 16 March 2017; 24 March 2017

This work is licensed under a Creative Commons Attribution 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by/3.0/

This article is available from https://os.copernicus.org/articles/13/303/2017/os-13-303-2017.html

The full text article is available as a PDF file from https://os.copernicus.org/articles/13/303/2017/os-13-303-2017.pdf

Reconstructing surface ocean CO2 from scarce measurements plays an important role in estimating oceanic CO2 uptake. The 14 models included in the Surface Ocean CO2 Mapping (SOCOM) inter-comparison initiative, five of which used neural networks, differ from one another to varying degrees. This investigation evaluates two neural networks used in SOCOM, self-organizing maps and feedforward neural networks, and introduces a machine learning model called a support vector machine for ocean CO2 mapping. This technical note provides a practical guide to selecting among the models.

Introduction

The global ocean is a major sink for anthropogenic carbon and therefore an important contributor to slowing down human-induced global warming (Stocker et al., 2013). To calculate the oceanic CO2 uptake, various models have been used to interpolate scarce CO2 measurements in the surface ocean spatially and temporally to obtain basin-wide (e.g., Zeng et al., 2002; Lefèvre et al., 2005; Chierici et al., 2006; Sarma et al., 2006; Jamet et al., 2007; Friedrich and Oschlies, 2009; Telszewski et al., 2009; Takamura et al., 2010; Landschützer et al., 2013; Nakaoka et al., 2013; Iida et al., 2015; Goddijn-Murphy et al., 2015) and global ocean CO2 maps (Takahashi et al., 2002, 2009, 2014; Park et al., 2010; Rödenbeck et al., 2013; Sasse et al., 2013; Jones et al., 2015; Zeng et al., 2015). The Surface Ocean CO2 Mapping (SOCOM) inter-comparison initiative revealed varying degrees of differences among 14 models (Rödenbeck et al., 2015), of which 5 used neural networks. These include self-organizing maps (SOMs) and feedforward neural networks (FNNs). The SOM has a long history in CO2 mapping (Lefèvre et al., 2005; Friedrich and Oschlies, 2009; Telszewski et al., 2009; Nakaoka et al., 2013). Recently, the FNN has been gaining popularity in this field (Landschützer et al., 2015; Zeng et al., 2014, 2015). In this investigation we introduce a machine learning model called a support vector machine (SVM) for ocean CO2 mapping and compare the SVM with the SOM and FNN. We intend to provide a practical guide for using these machine learning models.

Model equations

The machine learning models included in this study cannot directly model the long-term trend of CO2. Therefore, we express the dependence of CO2 fugacity (fCO2) on year (YR), month (MON), latitude (LAT), and longitude (LON) as the sum of a nonlinear static component and a linear trend component: $f\mathrm{CO}_2 = F_{\mathrm{static}}(\mathrm{LAT}, \mathrm{LON}, \mathrm{MON}) + F_{\mathrm{trend}}(\mathrm{YR})$ (1). As available observations are scarce with respect to the biogeochemical properties of the surface ocean, we used sea surface temperature (SST), sea surface salinity (SSS), chlorophyll-a concentration (CHL), and mixed layer depth (MLD) as the proxy variables of space and time. These proxy variables were commonly used by models included in the SOCOM. The model equation becomes $f\mathrm{CO}_2 = F_{\mathrm{static}}(\mathrm{LAT}, \mathrm{SST}, \mathrm{SSS}, \mathrm{CHL}, \mathrm{MLD}, \mathrm{dSST}) + F_{\mathrm{trend}}(\mathrm{YR})$ (2), where dSST denotes the difference between the monthly and annual means of SST. Here we excluded LON and MON. They have a circular property and therefore cannot be used directly. For instance, longitude -180° is geographically connected to longitude 180°, but numerically they appear to the models as two extreme longitude values. Zeng et al. (2014, 2015) circumvented this problem by using sine and cosine transformed components. Their approach could unintentionally enhance the influence of LON and MON on fCO2, because each of them then enters the model as two derived variables instead of one. We excluded LON in the belief that the combination of SST, SSS, CHL, and MLD contains sufficient spatial information, but retained LAT for its different seasonal and geophysical meanings in the Northern and Southern hemispheres. Replacing MON with dSST also better expresses how the seasonal effect varies geographically.
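As an illustration of the two ideas above, the following minimal sketch computes dSST from twelve monthly SST values and shows why a sine and cosine transform removes the artificial jump between longitudes -180° and 180°. The function names and the SST values are hypothetical and are not taken from the software of this study.

```python
import numpy as np

def dsst_from_monthly(sst_monthly):
    """Return dSST: each monthly SST minus the annual mean of the 12 values."""
    sst_monthly = np.asarray(sst_monthly, dtype=float)
    return sst_monthly - sst_monthly.mean()

def circular_components(value, period):
    """Sine/cosine encoding of a circular variable (the alternative not used here)."""
    angle = 2.0 * np.pi * np.asarray(value, dtype=float) / period
    return np.sin(angle), np.cos(angle)

# Hypothetical monthly SST (deg C) at one grid cell, January to December.
sst = np.array([18.0, 18.5, 20.0, 22.0, 24.5, 27.0,
                28.5, 29.0, 27.5, 25.0, 22.0, 19.5])
dsst = dsst_from_monthly(sst)

# Longitudes -180 and 180 are numerically far apart, but both encode
# to (approximately) the same point on the unit circle.
s_west, c_west = circular_components(-180.0, 360.0)
s_east, c_east = circular_components(180.0, 360.0)
```

By construction, dSST sums to zero over the year at each grid cell, so it carries the seasonal phase without the circular discontinuity of MON.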

Data

We extracted monthly fCO2 from the track-gridded database of the Surface Ocean CO2 Atlas (SOCAT) version 3.0 (http://www.socat.info/; Pfeil et al., 2013; Sabine et al., 2013; Bakker et al., 2014). The database has a 1° × 1° spatial resolution and includes global measurements from 1970 to 2014. Similar to Zeng et al. (2014), we excluded data points that met any of these criteria: (i) fCO2 values smaller than 250 µatm or larger than 550 µatm, (ii) ocean depth smaller than 500 m, (iii) salinity smaller than 25.0, and (iv) year before 1990. A total of 158 052 data points were extracted with these conditions.
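The four exclusion criteria can be expressed as a single Boolean mask. The sketch below is only illustrative: the function name, the array layout, and the sample records are assumptions, not the data handling of the published package.

```python
import numpy as np

def socat_mask(fco2, depth, salinity, year):
    """Boolean mask of data points kept after exclusion criteria (i)-(iv)."""
    fco2, depth, salinity, year = map(np.asarray, (fco2, depth, salinity, year))
    return (
        (fco2 >= 250.0) & (fco2 <= 550.0)  # (i) drop fCO2 < 250 or > 550 uatm
        & (depth >= 500.0)                 # (ii) drop ocean depth < 500 m
        & (salinity >= 25.0)               # (iii) drop salinity < 25.0
        & (year >= 1990)                   # (iv) drop years before 1990
    )

# Hypothetical records; only the first one passes all four criteria.
fco2 = np.array([360.0, 240.0, 410.0, 380.0, 600.0])
depth = np.array([3000.0, 4000.0, 200.0, 1500.0, 2500.0])
salinity = np.array([35.0, 34.0, 33.0, 20.0, 36.0])
year = np.array([2005, 1995, 2010, 2001, 1985])

keep = socat_mask(fco2, depth, salinity, year)
```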

The monthly SST data of 1990 to 2015 were extracted from the Optimum Interpolation (OI) V2 product (http://www.esrl.noaa.gov/psd/data/gridded/data.noaa.oisst.v2.html) of NOAA (Reynolds et al., 2002). The monthly SSS climatology was extracted from the World Ocean Atlas 2013 (WOA13) product (https://www.nodc.noaa.gov/OC5/woa13/; Boyer et al., 2013), which contains the monthly mean SSS from 27 June 1896 to 25 December 2012. The monthly CHL climatology was calculated using the MODIS Aqua and SeaWiFS climatology (https://oceancolor.gsfc.nasa.gov/cgi/l3), which covers the period of 2012 to 2015. The mean of the two CHLs was used as the CHL climatology. The mixed layer data were derived from the Monthly Isopycnal and Mixed-layer Ocean Climatology (http://www.pmel.noaa.gov/mimoc/) of NOAA (Schmidtko et al., 2013), which includes the period of 1955 to 2012.

Machine learning models

The Appendix and Table 1 summarize the algorithms of the three models. Here we focus on discussing their usage in CO2 mapping.

Table 1. Feature comparison of the three machine learning models.

| Feature | SVM | FNN | SOM |
| --- | --- | --- | --- |
| Input space projection | Projects the input variable space to a high-dimensional space that is proportional to the number of training samples. | Projects the input space to a high-dimensional space that is proportional to the number of hidden neurons and input variables. | Projects the input space to a feature space whose size is determined by the number of neurons. |
| Prediction by | Continuous interpolation. | Continuous interpolation. | Picking up labeling samples that have the closest feature to the input. |
| Problems | May over-fit and over-interpolate. | May over-fit and over-interpolate. | Discrete interpolation leads to spatial discontinuity. |
| Data scaling | Helps in solving the linear equation, but has no effect on the result. | Helps the convergence of training, but has an insignificant effect on the result. | Significant effect on the result. |
| Results affected by | The parameter values for regularization and kernel function. | The number of hidden neurons. | The number of neurons and data scaling. |

The trend in Eq. (2) cannot be modeled directly by the models. One approach to dealing with the problem is to normalize the measurements to a reference year using a global rate and to model only the nonlinear component. Zeng et al. (2014) presented a method to model the linear component in Eq. (2). Instead of repeating the process, we used their annual rate of 1.5 µatm yr-1 to remove the trend from fCO2 and normalize it to the reference year 2005, i.e., $f\mathrm{CO}_2^{\mathrm{normalized}} = f\mathrm{CO}_2 - 1.5 \cdot (\mathrm{YR} - 2005)$ (3). Although Takahashi et al. (2014) obtained a global mean rate of 1.9 µatm yr-1, we used 1.5 µatm yr-1 as this rate was obtained by using the gridded fCO2 of SOCAT version 2. The normalized fCO2 was used to model the nonlinear component in Eq. (2). In later discussions, fCO2 means the normalized fCO2 unless explicitly stated. Similarly, we applied the log transform of Zeng et al. (2014) to CHL prior to the data scaling discussed below, i.e., $\mathrm{CHL} = \log_{10}(1.0 + \mathrm{CHL})$ (4).
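The normalization to the reference year and the CHL transform amount to two one-line functions. The sketch below uses the rate of 1.5 µatm yr-1 and the reference year 2005 stated in the text; the function names and the inverse helper are illustrative assumptions.

```python
import numpy as np

TREND_RATE = 1.5       # uatm per year, the global rate adopted in the text
REFERENCE_YEAR = 2005  # reference year for normalization

def normalize_fco2(fco2, year, rate=TREND_RATE, ref_year=REFERENCE_YEAR):
    """Remove the linear trend so that fCO2 refers to the reference year."""
    return np.asarray(fco2, dtype=float) - rate * (np.asarray(year, dtype=float) - ref_year)

def denormalize_fco2(fco2_norm, year, rate=TREND_RATE, ref_year=REFERENCE_YEAR):
    """Inverse of normalize_fco2: restore the trend for a target year."""
    return np.asarray(fco2_norm, dtype=float) + rate * (np.asarray(year, dtype=float) - ref_year)

def log_chl(chl):
    """Log transform applied to CHL before data scaling."""
    return np.log10(1.0 + np.asarray(chl, dtype=float))

# A value of 350 uatm measured in 1995 and one of 380 uatm measured in 2015
# both correspond to 365 uatm when referred to 2005.
normalized = normalize_fco2([350.0, 380.0], [1995, 2015])
```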

SVM

For a given dataset, the SVM requires a prior step to find the optimal value for the parameter σ in Eq. (A10) and the parameter γ in Eq. (A11). To shorten the training time, we randomly chose 10 % of the measurement data in this step and obtained 0.06 for σ and 10 for γ. Note that these values are dependent on data scaling, which is necessary in this case to avoid the overflow problem in solving Eq. (A18). We scaled all input variables LAT, SST, SSS, CHL, MLD, and dSST by their minimum and maximum to confine them in the range (0, 1), i.e., $v = \frac{v - v_{\min}}{v_{\max} - v_{\min}}$ (5).
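A minimal sketch of the minimum and maximum scaling follows. It assumes that the input variables are stored column-wise and that the extremes found in the training data are reused when scaling the prediction inputs; both the layout and the function names are assumptions made for illustration.

```python
import numpy as np

def minmax_fit(X):
    """Column-wise minimum and maximum of the training inputs."""
    X = np.asarray(X, dtype=float)
    return X.min(axis=0), X.max(axis=0)

def minmax_apply(X, vmin, vmax):
    """Scale each column with v = (v - vmin) / (vmax - vmin)."""
    return (np.asarray(X, dtype=float) - vmin) / (vmax - vmin)

# Hypothetical training inputs with columns LAT (deg) and SST (deg C).
X_train = np.array([[-60.0, 2.0], [0.0, 28.0], [45.0, 15.0], [70.0, 0.5]])
vmin, vmax = minmax_fit(X_train)
X_scaled = minmax_apply(X_train, vmin, vmax)

# New inputs are scaled with the training extremes, not with their own.
X_new_scaled = minmax_apply(np.array([[10.0, 20.0]]), vmin, vmax)
```

Because σ and γ depend on the scaling, any change to the scaling constants would call for repeating the parameter search.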

FNN

Data scaling is not necessary for the FNN, but can improve its performance. Following Zeng et al. (2014), we scaled the input variables by their mean $\bar{v}$ and standard deviation $s$ as $v = \frac{v - \bar{v}}{s}$ (6). The output variable fCO2 is scaled by $v = 0.1 + 0.8\,\frac{v - v_{\min}}{v_{\max} - v_{\min}}$ (7). This confines the scaled fCO2 to between 0.1 and 0.9 for better response to changes in input variables. The kernel function Eq. (A4) has the property that for any input in (-∞, +∞), the output varies between 0 and 1. For scaled fCO2 close to 0 or 1, a small change in fCO2 requires a very large adjustment of model parameters, which slows down the convergence of training.
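The two scalings used for the FNN, and the reason for keeping the scaled output away from 0 and 1, can be sketched as follows. The helper names are hypothetical, and the use of the sample standard deviation (ddof=1) is an assumption, since the text does not specify which estimator was used.

```python
import numpy as np

def zscore(v):
    """Input scaling: subtract the mean and divide by the standard deviation."""
    v = np.asarray(v, dtype=float)
    return (v - v.mean()) / v.std(ddof=1)  # ddof=1 is an assumption

def scale_output(y, ymin, ymax):
    """Output scaling that confines fCO2 to the interval [0.1, 0.9]."""
    return 0.1 + 0.8 * (np.asarray(y, dtype=float) - ymin) / (ymax - ymin)

def unscale_output(s, ymin, ymax):
    """Inverse of scale_output, used to convert FNN outputs back to uatm."""
    return ymin + (np.asarray(s, dtype=float) - 0.1) / 0.8 * (ymax - ymin)

def logit(p):
    """Pre-activation that the sigmoid kernel needs to produce output p."""
    return np.log(p / (1.0 - p))

# The pre-activation needed grows quickly as the target approaches 1,
# which is why targets are kept within [0.1, 0.9].
needed_for_0_9 = logit(0.9)
needed_for_0_999 = logit(0.999)
```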

We used 64 hidden neurons for the FNN as Zeng et al. (2014) did. The learning rate in Eq. (A6) was set to 0.25 by trial and error. A small value makes training slow, whereas a large value may cause training to diverge. The constant in Eq. (A8) was determined dynamically in each iterative training loop. It was taken as 10 times the mean of absolute differences between modeled and observed fCO2. In our experience, this method improves the performance of training.

SOM

Data scaling is critical for the SOM, as the distance defined by Eq. (A1) would be affected by variable units. We used Eq. (6) to scale input variables in training the SOM. Based on our preliminary correlation analysis, we applied a factor of 2 to enhance the influence of SST and CHL on the distance. Using such a subjective factor is the only way to make the correlations between the output and the input variables more in line with observed correlations.

From the labeling procedure of SOM described in the Appendix, it is not difficult to see that the number of neuron cells in SOM affects the labeling and hence the prediction. Unfortunately, there is no guideline for choosing the size. Based on previous studies (Telszewski et al., 2009; Nakaoka et al., 2013), we used 20 000 neuron cells, roughly one neuron cell for one 1° × 1° grid cell of sampled areas.

Model validation

We examined the goodness of fit by randomly selecting 10 to 50 % of the data points to train the FNN and SVM and to label the SOM, and then calculating the correlation coefficient between modeled and observed CO2 for the selected data points.

The SOM yields the best correlation when 10 % of the data points are randomly selected, and the correlation decreases as the number of data points increases (Fig. 1). The reason is that for a given number of neuron cells, the fewer the data points, the less likely a neuron cell will be labeled by multiple measurements and the more likely that the prediction will find the same CO2 value used for labeling. Therefore, a good fit does not necessarily mean good SOM modeling.

Figure 1. Correlation coefficient between modeled and observed fCO2 (µatm). The sample size is the number of data points randomly selected to train FNN and SVM and to label SOM.

The correlations obtained by the SVM and FNN do not vary much with the number of data points. While the SVM's correlation decreases monotonically, even though by only a little, with the number of data points, the FNN's correlation obtained with 75 000 data points is larger than that with 60 000 data points. The FNN is known for not being able to find the global optimum in training. This case could be an indication of imperfect training. The FNN appears inferior to the SVM in all cases. However, imperfect training does not account for all the differences. If we use the number of model parameters to be determined by the training as the indicator of the dimension of the model space, the FNN's dimension is far smaller than that of the SVM. The former is determined by the number of hidden neurons and input variables, whereas the latter is determined by the number of training data. For 6 input variables, 15 000 training data, and 64 hidden neurons, the number of parameters is 509 for the FNN and 15 001 for the SVM.

A better indicator of the performance of the models would be the goodness of prediction. To emulate the situation that the sampled area was only a small portion of the global ocean, we evaluated the goodness of prediction by training FNN and SVM and labeling SOM with 10 % of randomly selected data to make a prediction for the rest of the data. Figure 2 shows that the SVM yielded the best correlation ($R^2 = 0.72$), the FNN fell behind ($R^2 = 0.67$), and the SOM performed the worst ($R^2 = 0.54$). The differences between predicted and observed fCO2 are 0.1 ± 17.4 µatm for SVM, 0.1 ± 18.9 µatm for FNN, and 0.2 ± 23.3 µatm for SOM. Compared to the variation of fCO2 measurements, these differences are small and their uncertainties are on the same order of magnitude as the variation of measurements. Let us examine the standard deviation (SD) of fCO2 in those grids with at least three data points. The track-gridded fCO2 in SOCAT version 3.0 includes an SD ranging from 0.1 to 71.2 µatm and the mean is 5.2 µatm. Calculating the SD of normalized fCO2 in the same grids and in the same months of all years yielded a mean of 12.5 µatm in the range of 0.1 to 103.1 µatm. The normalization had little effect on the SD as the calculation for non-normalized fCO2 gives a mean SD of 14.6 µatm in the range of 0.1 to 107.5 µatm.
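The validation procedure, training on a random 10 % and predicting the remaining 90 %, can be sketched as below. The sketch assumes that R2 is the squared Pearson correlation coefficient and that the reported uncertainty is the sample standard deviation of the differences; the synthetic "observed" and "predicted" values only stand in for real model output.

```python
import numpy as np

def random_split(n, train_fraction, rng):
    """Indices of a random training subset and of the remaining validation set."""
    idx = rng.permutation(n)
    n_train = int(round(train_fraction * n))
    return idx[:n_train], idx[n_train:]

def prediction_stats(observed, predicted):
    """Squared correlation, mean difference, and SD of predicted minus observed."""
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    r = np.corrcoef(observed, predicted)[0, 1]
    diff = predicted - observed
    return r ** 2, diff.mean(), diff.std(ddof=1)

rng = np.random.default_rng(0)
n = 1000
train_idx, valid_idx = random_split(n, 0.10, rng)

# Synthetic stand-ins (uatm): a real test would use a trained model's output.
observed = rng.normal(360.0, 30.0, n)
predicted = observed + rng.normal(0.0, 15.0, n)
r2, mean_diff, sd_diff = prediction_stats(observed[valid_idx], predicted[valid_idx])
```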

Figure 2. Predicted vs. observed fCO2 (µatm). Ten percent of data points were selected randomly to train FNN and SVM and to label SOM, and the rest was used for validation.

From the algorithm of SOM in the Appendix, it is not difficult to see that the SOM does not extrapolate: the model always approximates new inputs by the measurements used for training and approximates fCO2 by the measurements used for labeling; therefore, the predicted fCO2 values are within the observed fCO2 range (Fig. 2a). Figure 2c shows that the fCO2 extrapolated by the SVM, if any, did not exceed the observed range. To investigate the extrapolation risk, we used 200 000 data points randomly generated for SST, dSST, SSS, MLD, and CHL in the range of (0, 40 °C), (-20, 20 °C), (20, 50), (1, 1500 m), and (0 log(mg m-3), 2 log(mg m-3)), respectively. These ranges are larger than the corresponding observed ranges of (0, 34 °C), (-13, 16 °C), (24, 40), (1, 1000 m), and (0 log(mg m-3), 1.2 log(mg m-3)). The SVM and FNN produced fCO2 in the range of (267, 468 µatm) and (199, 596 µatm), respectively, for the simulated samples. Compared to the observed fCO2 range of (240, 560 µatm), the experiment indicates that the over-extrapolation risk of the SVM is low.
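The extrapolation experiment can be set up as in the sketch below, which draws the simulated inputs and measures how much of a set of predictions falls outside the observed fCO2 range. A uniform distribution is assumed because the text states only that the points were randomly generated; the dictionary and function names are hypothetical, and the range for LAT is omitted because it is not given.

```python
import numpy as np

# Simulated input ranges from the text; CHL is in log-transformed units.
SIM_RANGES = {
    "SST": (0.0, 40.0),     # deg C
    "dSST": (-20.0, 20.0),  # deg C
    "SSS": (20.0, 50.0),
    "MLD": (1.0, 1500.0),   # m
    "CHL": (0.0, 2.0),      # log(mg m-3)
}
OBSERVED_FCO2_RANGE = (240.0, 560.0)  # uatm

def simulate_inputs(n, ranges, rng):
    """Draw n samples per variable, uniformly within each range (an assumption)."""
    return {name: rng.uniform(lo, hi, n) for name, (lo, hi) in ranges.items()}

def fraction_outside(pred, lo, hi):
    """Fraction of predictions that fall outside the range (lo, hi)."""
    pred = np.asarray(pred, dtype=float)
    return float(np.mean((pred < lo) | (pred > hi)))

rng = np.random.default_rng(1)
samples = simulate_inputs(200_000, SIM_RANGES, rng)

# Hypothetical predictions: two of the four lie outside the observed range.
share = fraction_outside([250.0, 600.0, 200.0, 300.0], *OBSERVED_FCO2_RANGE)
```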

Differences

Figure 3 shows fCO2 maps in February and July 2005, which is the reference year for normalization. In the mapping, we randomly selected 50 % of the data to train the FNN and SVM and to label the SOM. All models captured the major features of the observed fCO2 distribution. The SOM exhibits obvious discontinuity because of its discrete characteristic of picking up fCO2 values from the labeled SOM. For year 2005, the mean fCO2 difference is -0.05 ± 12.73 µatm for FNN-SVM and -0.6 ± 18.80 µatm for SOM-SVM. The uncertainty is the standard deviation of the mean difference between predicted and observed values. The statistics indicate that FNN agrees better with SVM than SOM does.

Figure 3. Distributions of modeled and observed fCO2. The composite map for observations includes fCO2 in 1990–2014. Half of randomly selected data points were used to train FNN and SVM and to label SOM to make predictions. (a) shows February and (b) shows July.

Although the differences among models might be on the order of 10 to 20 µatm, the effect on the global ocean CO2 flux estimate is small (Fig. 4). The fluxes are calculated using the wind speed from ECMWF's interim product (Dee et al., 2011). Our estimate for the oceanic uptake is on the higher end among those in Wanninkhof et al. (2013) and Le Quéré et al. (2015). For example, Wanninkhof et al. (2013) reported that the median sea–air anthropogenic CO2 fluxes centered on year 2000 ranged from 1.9 to 2.5 PgC yr-1 among the seven models. In comparison, our estimates by the three models are about 2.4 PgC yr-1. The mean difference of CO2 flux is 0.02 PgC yr-1 between the FNN and the SVM (FNN-SVM) and 0.06 PgC yr-1 between the SOM and the SVM (SOM-SVM). They are small in comparison with those differences among the models in Wanninkhof et al. (2013) and Le Quéré et al. (2015). Note that the flux estimate is highly dependent on wind products as shown by Wanninkhof et al. (2013) and Zeng et al. (2014).

Figure 4. Modeled global CO2 fluxes. A negative value indicates oceanic uptake.

On the spatial scale of tens of degrees, the three models agree well in their modeled fCO2 distributions. However, each model shows distinct fine structures, which are determined by the biogeochemical processes in the ocean, by model parameters obtained from training, and by the characteristics of the models. We believe that the modeled monthly fCO2 distributions are true to the degree given by the model validations.

Summary

The main features of the three machine learning models are listed in Table 1. The SVM is recommended when the computer has enough memory to store the matrix in Eq. (A18), whose size is proportional to the square of the number of training data. The SVM performs the best, but the training time could become very long when the number of training data is too large to be handled without using virtual memory. For any given dataset, using the SVM requires a prior step to find the optimal value for the parameter σ in Eq. (A10) and the parameter γ in Eq. (A11).
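To make the memory requirement concrete, the sketch below estimates the storage needed for the matrix of Eq. (A18), which has one row and one column per training sample plus one for the offset. Storage as a dense double-precision array (8 bytes per element) is an assumption; the function name is hypothetical.

```python
def lssvm_matrix_gib(n_train, bytes_per_value=8):
    """Memory in GiB for the dense (n_train + 1) x (n_train + 1) matrix of Eq. (A18)."""
    side = n_train + 1
    return side * side * bytes_per_value / 2 ** 30

# Sizes mentioned in the text: 15 000 training data and all 158 052 data points.
small = lssvm_matrix_gib(15_000)
full = lssvm_matrix_gib(158_052)
```

Under this assumption the matrix needs about 1.7 GiB for 15 000 training data and about 186 GiB for all 158 052 data points, which illustrates why the training subset has to be chosen with the available memory in mind.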

The FNN model does not perform as well as the SVM, but the number of training data does not affect its training as much as it affects the SVM's. The training time can become long when a large number of hidden neurons are used and many iterations are needed to achieve convergence. It takes longer to train the FNN than the SVM for a small number of data points. The FNN is simpler to use as it requires no prior step, but it carries a risk of over-extrapolation.

The SOM is recommended only when the other two models have over-fitting or over-interpolation problems. The SOM performs the worst and is not as straightforward as the others as its result depends too much on data scaling and the number of neurons. An advantage of the SOM is that once trained, re-labeling the SOM with new CO2 measurements and making a new prediction is fast. Although the SOM does not have the over-extrapolation problem of the FNN, it may produce nonsense predictions due to its strong dependence on data scaling.

In areas where no measurements exist over a large scale, predictions made by the models must be treated conservatively, as SVM and FNN may produce extrapolated results and SOM may extract CO2 from unexpected provinces. The modeled CO2 east of the African coast near the Equator in July 2005 (Fig. 3) appeared much higher than the nearby measurements, which were made in July 1995 and adjusted to 2005 using the global rate of 1.5 µatm yr-1. However, considering the large variations of the rate from region to region (Takahashi et al., 2014) and of the repeated measurements discussed in Sect. 5, the measurements were not sufficient to support rejecting the modeled CO2. Similar CO2 hotspots occurred in the Southern Ocean west of South America in February 2005, around the latitudinal zone of 50° S. The modeled CO2 distributions by Takahashi et al. (2014) also showed CO2 hotspots around the latitudinal zone of 30° S in the same month and region. Their model used a completely different interpolation scheme based on a diffusion–advection transport model for surface waters. In principle, this hotspot CO2 was produced by our models using measurements from elsewhere, where the biogeochemical properties were similar to those in the hotspot areas. As the SOM does not extrapolate, the SVM has a low possibility of over-extrapolation, and the hotspots appeared in all models, the risk of accepting them would not be high.

The software and data used by this study are available at https://figshare.com/s/38488b7003b03e2103c9.

The registered DOI of the package is 10.6084/m9.figshare.4877390.

Self-organizing map

A self-organizing map (SOM) is a type of artificial neural network that is trained using unsupervised learning (Kohonen, 1984). The SOM in our application comprises grid points on a two-dimensional plane. Each grid point, also called a neuron cell, has the same number of parameters as the input variables, which include LAT, SST, SSS, CHL, MLD, and dSST in our case. Training the SOM means using samples of input variables to adjust the parameters so that neighboring neuron cells have similar parameter values that reflect certain biogeochemical features of the surface ocean.

We used the batch learning algorithm (Abe et al., 2002) to train the SOM as the result does not depend on the sequential order of training samples. The parameters were initialized randomly in the range (-1, 1). In each iterative training loop, each training sample is associated with the neuron cell to which the distance defined as follows is smaller than to other neuron cells: $d = \lVert f(p - x) \rVert$ (A1), where p denotes the vector of neuron cell parameters, x the vector of input variables, and f the scale matrix that we introduced to change the influence of certain variables on the distance. The components of f are all 0 except for those on the diagonal, which are set to 1 by default. In our application, the data for each input variable were scaled to be unitless by its mean and standard deviation to eliminate the effect of units on the distance.

The associated neuron cell is called the best matching cell (BMC). After the BMCs for all training samples are found, the parameters are updated by $p_i = \frac{\sum_k h_{ik} x_k}{\sum_k h_{ik}}$ (A2), where i and k denote indexes of neuron cells and training samples, respectively. The neighborhood function that determines the weight factor h is defined as $h_{ik} = \exp\left(-\frac{r_{ik}}{q}\right)$ (A3), where $r_{ik}$ denotes the geographic distance between the ith neuron cell and the BMC of the kth training sample and q is a factor that decreases linearly with the iteration loop. In other words, the procedure adjusts the parameters of neuron cells toward those training samples whose BMCs are close to them and the amount of adjustment decreases exponentially with the geographic distance between neuron cells and linearly with the training loop.
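The batch training loop described above can be sketched in a few functions. The sketch makes several assumptions that are not stated in the text: the distance between neuron cells is taken as the Euclidean distance on the two-dimensional map plane, the neighborhood function is read as exp(-r/q), and the map size, the start and end values of q, and the number of iterations are arbitrary illustrative choices.

```python
import numpy as np

def find_bmc(params, samples, scale):
    """Best matching cell of each sample under d = ||f (p - x)|| with diagonal f."""
    diff = (params[None, :, :] - samples[:, None, :]) * scale
    return np.einsum("skv,skv->sk", diff, diff).argmin(axis=1)

def batch_update(params, samples, grid_xy, scale, q):
    """One batch update: weighted mean of samples with weights exp(-r / q)."""
    bmc = find_bmc(params, samples, scale)
    # r[i, k]: map-plane distance between cell i and the BMC of sample k.
    r = np.linalg.norm(grid_xy[:, None, :] - grid_xy[bmc][None, :, :], axis=2)
    h = np.exp(-r / q)
    return (h @ samples) / h.sum(axis=1, keepdims=True)

def train_som(samples, n_rows, n_cols, scale, n_iter, q_start, q_end, rng):
    """Train a small SOM; q decreases linearly from q_start to q_end."""
    gy, gx = np.meshgrid(np.arange(n_rows), np.arange(n_cols), indexing="ij")
    grid_xy = np.column_stack([gy.ravel(), gx.ravel()]).astype(float)
    params = rng.uniform(-1.0, 1.0, size=(n_rows * n_cols, samples.shape[1]))
    for q in np.linspace(q_start, q_end, n_iter):
        params = batch_update(params, samples, grid_xy, scale, q)
    return params, grid_xy

rng = np.random.default_rng(2)
# Synthetic, already standardized inputs: LAT, SST, SSS, CHL, MLD, dSST.
samples = rng.normal(size=(200, 6))
# Diagonal of the scale matrix f, with a factor of 2 on SST and CHL.
scale = np.array([1.0, 2.0, 1.0, 2.0, 1.0, 1.0])
params, grid_xy = train_som(samples, 5, 5, scale, n_iter=20,
                            q_start=3.0, q_end=0.5, rng=rng)
```

Because every updated parameter vector is a positively weighted mean of training samples, the trained cells always stay within the range spanned by the training data.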

The trained SOM needs to be labeled by fCO2 for making predictions. The values of fCO2 measurements are assigned to their BMC. Predicting fCO2 for a set of input variables is realized by finding the BMC labeled with fCO2 and extracting its mean fCO2 value.
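Labeling and prediction can be sketched as follows. The sketch assumes an unweighted Euclidean distance for brevity and searches only among labeled cells when predicting, which is one reading of "finding the BMC labeled with fCO2"; the tiny three-cell map and all names are hypothetical.

```python
import numpy as np

def bmc_index(params, samples):
    """Index of the closest neuron cell for each sample (unweighted distance)."""
    d2 = ((samples[:, None, :] - params[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def label_som(params, samples, fco2):
    """Assign to each cell the mean fCO2 of the measurements whose BMC it is."""
    bmc = bmc_index(params, samples)
    total = np.bincount(bmc, weights=fco2, minlength=len(params))
    count = np.bincount(bmc, minlength=len(params))
    labels = np.full(len(params), np.nan)  # NaN marks unlabeled cells
    labels[count > 0] = total[count > 0] / count[count > 0]
    return labels

def predict_som(params, labels, samples):
    """Predict fCO2 as the label of the closest labeled cell."""
    labeled = np.flatnonzero(~np.isnan(labels))
    nearest = bmc_index(params[labeled], samples)
    return labels[labeled][nearest]

# A tiny trained map with three cells and two input variables.
params = np.array([[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]])
label_inputs = np.array([[0.1, 0.0], [0.0, 0.2], [0.9, 1.0]])
label_fco2 = np.array([350.0, 360.0, 400.0])
labels = label_som(params, label_inputs, label_fco2)

# The input is closest to the unlabeled third cell, so the prediction falls
# back to the nearest labeled cell and stays within the observed values.
pred = predict_som(params, labels, np.array([[4.8, 5.0]]))
```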

Feedforward neural network

A feedforward neural network (FNN) is an artificial neural network that is trained using supervised learning. Our FNN comprises three layers (Zeng et al., 2014): an input layer, a hidden layer, and an output layer. The number of neurons in the input layer is determined by the number of input variables, i.e., LAT, SST, SSS, CHL, MLD, and dSST in our case. The output layer has only one neuron for fCO2. Each neuron in the hidden layer uses the following kernel function to transform all input variables: $y_h = \frac{1}{1 + \exp\left(-(b + w^{T} x)\right)}$ (A4), where w denotes the vector of weight parameters and b the offset parameter. The $y_h$ of all hidden neurons become the inputs of the output neuron, which uses the same kernel function to transform $y_h$ to produce fCO2.
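The forward pass of such a three-layer network is short enough to write out. The sketch below uses 6 inputs and 64 hidden neurons as in the text, with parameters drawn randomly in (-1, 1); the array names and the random inputs are illustrative assumptions, and the output is the scaled fCO2 rather than a value in µatm.

```python
import numpy as np

def sigmoid(z):
    """Kernel function of Eq. (A4): maps any real input to the interval (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def fnn_forward(X, W_h, b_h, w_o, b_o):
    """Forward pass: hidden layer then single output neuron, both sigmoid."""
    hidden = sigmoid(X @ W_h.T + b_h)   # shape (n_samples, n_hidden)
    return sigmoid(hidden @ w_o + b_o)  # shape (n_samples,), scaled fCO2

rng = np.random.default_rng(3)
n_inputs, n_hidden = 6, 64
W_h = rng.uniform(-1.0, 1.0, size=(n_hidden, n_inputs))  # hidden weights
b_h = rng.uniform(-1.0, 1.0, size=n_hidden)              # hidden offsets
w_o = rng.uniform(-1.0, 1.0, size=n_hidden)              # output weights
b_o = rng.uniform(-1.0, 1.0)                             # output offset

X = rng.normal(size=(5, n_inputs))  # five hypothetical standardized samples
y_scaled = fnn_forward(X, W_h, b_h, w_o, b_o)
```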

The training updates the offset and weight parameters, which are initialized randomly in the range (-1, 1), by minimizing the cost function $f(w') = \frac{1}{2} e^{T} e = \frac{1}{2} \lVert y_m - y_o \rVert^{2}$ (A5), where w′ is the extended vector that includes b and w; $y_m$ and $y_o$ stand for the vectors of modeled and observed fCO2, respectively. In the gradient descent training algorithm, updating w′ at the training iteration t can be expressed as $w'_t = w'_{t-1} - \alpha g$ (A6), where α is the learning rate (a positive number smaller than 1), and g the first-order derivative of the cost function $g = \nabla f(w') = J^{T} e$ (A7), where J is the Jacobian matrix whose components are derivatives of e with respect to w′, computed using the back propagation method. We used the efficient Levenberg–Marquardt algorithm (Wilamowski et al., 2010), which derives the gradient as $g = \left(J^{T} J + \mu I\right)^{-1} J^{T} e$ (A8), where μ is a constant.
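The Levenberg–Marquardt update can be illustrated without a full back propagation routine by applying it to a linear model, for which the Jacobian of e with respect to the parameters is simply the input matrix. This substitution is an assumption made to keep the example short; for the FNN, J would come from back propagation. The learning rate of 0.25 and the choice of μ as 10 times the mean absolute error follow the description in the main text.

```python
import numpy as np

def lm_direction(J, e, mu):
    """Levenberg-Marquardt direction g = (J^T J + mu I)^-1 J^T e."""
    n_params = J.shape[1]
    return np.linalg.solve(J.T @ J + mu * np.eye(n_params), J.T @ e)

def train_linear_lm(X, y_obs, alpha=0.25, n_iter=200):
    """Fit y = X w with damped LM steps (linear stand-in for the FNN)."""
    w = np.zeros(X.shape[1])
    for _ in range(n_iter):
        e = X @ w - y_obs               # modeled minus observed
        mu = 10.0 * np.mean(np.abs(e))  # constant set dynamically per loop
        w = w - alpha * lm_direction(X, e, mu)  # for a linear model, J = X
    return w

rng = np.random.default_rng(4)
X = rng.normal(size=(100, 3))
w_true = np.array([1.5, -2.0, 0.5])
y_obs = X @ w_true  # noise-free synthetic targets

w_fit = train_linear_lm(X, y_obs)
```

Tying μ to the current error gives strong damping early in training and lets the update approach a Gauss–Newton step as the error shrinks.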

Support vector machine

A support vector machine (SVM) is a supervised learning model that was conceptualized in the 1960s for classification problems and later extended to regression analysis (Basak et al., 2007). We used the so-called least-squares support vector machine for regression (Pelckmans et al., 2002) which, similar to the FNN, seeks to minimize the error between model outputs and measurements. The SVM models the dependence of fCO2 on LAT, SST, SSS, CHL, MLD, and dSST as $y = c^{T} \varphi(x) + b$ (A9), where x stands for a set of measurements of the input variables, c the vector of coefficients, b the offset parameter, and φ the kernel function. In this investigation, we used the radial basis kernel function, i.e., $\varphi(x_i)^{T} \varphi(x_j) = \exp\left(-\frac{\lVert x_i - x_j \rVert^{2}}{2\sigma^{2}}\right)$ (A10), where σ is a parameter whose optimal value depends on the data used for training. The subscript of x indicates a sample of input variables.

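Given a set of training samples $\{x_k, y_k\}_{k=1}^{N}$, the goal of training the SVM is to minimize the cost function $F(c) = \frac{1}{2}\left(c^{T} c + \gamma e^{T} e\right)$ (A11), where $e_k = y_k - c^{T} \varphi(x_k) - b$ (A12) and γ is a constant whose optimal value depends on the data used for training. The Lagrangian for the optimization problem of Eq. (A11) is given by $L(c, e, b, \alpha) = \frac{1}{2}\left(c^{T} c + \gamma e^{T} e\right) - \sum_{k=1}^{N} \alpha_k \left(c^{T} \varphi(x_k) + b + e_k - y_k\right)$ (A13), where $\alpha_k$ is a Lagrangian multiplier. The optimality conditions of Eq. (A13) are $\frac{\partial L}{\partial c} = 0 \rightarrow c = \sum_{k=1}^{N} \alpha_k \varphi(x_k)$ (A14), $\frac{\partial L}{\partial b} = 0 \rightarrow \sum_{k=1}^{N} \alpha_k = 0$ (A15), $\frac{\partial L}{\partial e_k} = 0 \rightarrow \alpha_k = \gamma e_k$ (A16), and $\frac{\partial L}{\partial \alpha_k} = 0 \rightarrow c^{T} \varphi(x_k) + b + e_k - y_k = 0$ (A17).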

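After eliminating c and e from the above conditions, the following equation is obtained: $\begin{bmatrix} 0 & u^{T} \\ u & \Omega + \gamma^{-1} I \end{bmatrix} \begin{bmatrix} b \\ \alpha \end{bmatrix} = \begin{bmatrix} 0 \\ y \end{bmatrix}$ (A18), where u is a vector with all components being 1, and the components of Ω are $\Omega_{ij} = \varphi(x_i)^{T} \varphi(x_j)$.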

Once Eq. (A18) is solved, making a prediction is done by $y(x) = \sum_{k=1}^{N} \alpha_k \varphi(x_k)^{T} \varphi(x) + b$.
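The complete training and prediction procedure of the least-squares SVM reduces to building and solving one linear system. The sketch below is a minimal NumPy version under stated assumptions: the radial basis kernel is taken in the Gaussian form with 2σ² in the denominator, the system is solved densely with a general solver, and the test function, σ = 0.3, and γ = 100 are arbitrary illustrative values, not the optimal parameters reported in the text.

```python
import numpy as np

def rbf_kernel(A, B, sigma):
    """Radial basis kernel matrix between the rows of A and the rows of B."""
    d2 = (A ** 2).sum(axis=1)[:, None] + (B ** 2).sum(axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-np.maximum(d2, 0.0) / (2.0 * sigma ** 2))

def lssvm_train(X, y, sigma, gamma):
    """Solve the linear system of Eq. (A18) for the offset b and multipliers alpha."""
    n = len(y)
    M = np.zeros((n + 1, n + 1))
    M[0, 1:] = 1.0                                      # u^T
    M[1:, 0] = 1.0                                      # u
    M[1:, 1:] = rbf_kernel(X, X, sigma) + np.eye(n) / gamma  # Omega + I / gamma
    rhs = np.concatenate([[0.0], y])
    solution = np.linalg.solve(M, rhs)
    return solution[0], solution[1:]

def lssvm_predict(X_new, X_train, b, alpha, sigma):
    """Prediction y(x) = sum_k alpha_k K(x_k, x) + b."""
    return rbf_kernel(X_new, X_train, sigma) @ alpha + b

rng = np.random.default_rng(5)
X = rng.uniform(0.0, 1.0, size=(200, 2))         # inputs scaled to (0, 1)
y = np.sin(2.0 * np.pi * X[:, 0]) + X[:, 1]      # synthetic smooth target

b, alpha = lssvm_train(X, y, sigma=0.3, gamma=100.0)
y_fit = lssvm_predict(X, X, b, alpha, sigma=0.3)
rmse = float(np.sqrt(np.mean((y_fit - y) ** 2)))
```

The first row of the system enforces that the multipliers sum to zero, and each multiplier equals γ times the training residual of its sample, so large multipliers point to poorly fitted measurements.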

The authors declare that they have no conflict of interest.

Acknowledgements

The Surface Ocean CO2 Atlas (SOCAT) is an international effort, endorsed by the International Ocean Carbon Coordination Project (IOCCP), the Surface Ocean Lower Atmosphere Study (SOLAS), and the Integrated Marine Biogeochemistry and Ecosystem Research program (IMBER), to deliver a uniformly quality-controlled surface ocean CO2 database. The many researchers and funding agencies responsible for the collection of data and quality control are thanked for their contributions to SOCAT.

Edited by: J. M. Huthnance

Reviewed by: three anonymous referees

References

Abe, T., Kanaya, S., Kinouchi, M., Ichiba, Y., Kozuki, T., and Ikemura, T.: A Novel Bioinformatic Strategy for Unveiling Hidden Genome Signatures of Eukaryotes: Self-Organizing Map of Oligonucleotide Frequency, Genom. Inform., 13, 12–20, 2002.

Bakker, D. C. E., Pfeil, B., Smith, K., Hankin, S., Olsen, A., Alin, S. R., Cosca, C., Harasawa, S., Kozyr, A., Nojiri, Y., O'Brien, K. M., Schuster, U., Telszewski, M., Tilbrook, B., Wada, C., Akl, J., Barbero, L., Bates, N. R., Boutin, J., Bozec, Y., Cai, W.-J., Castle, R. D., Chavez, F. P., Chen, L., Chierici, M., Currie, K., de Baar, H. J. W., Evans, W., Feely, R. A., Fransson, A., Gao, Z., Hales, B., Hardman-Mountford, N. J., Hoppema, M., Huang, W.-J., Hunt, C. W., Huss, B., Ichikawa, T., Johannessen, T., Jones, E. M., Jones, S. D., Jutterström, S., Kitidis, V., Körtzinger, A., Landschützer, P., Lauvset, S. K., Lefèvre, N., Manke, A. B., Mathis, J. T., Merlivat, L., Metzl, N., Murata, A., Newberger, T., Omar, A. M., Ono, T., Park, G.-H., Paterson, K., Pierrot, D., Ríos, A. F., Sabine, C. L., Saito, S., Salisbury, J., Sarma, V. V. S. S., Schlitzer, R., Sieger, R., Skjelvan, I., Steinhoff, T., Sullivan, K. F., Sun, H., Sutton, A. J., Suzuki, T., Sweeney, C., Takahashi, T., Tjiputra, J., Tsurushima, N., van Heuven, S. M. A. C., Vandemark, D., Vlahos, P., Wallace, D. W. R., Wanninkhof, R., and Watson, A. J.: An update to the Surface Ocean CO2 Atlas (SOCAT version 2), Earth Syst. Sci. Data, 6, 69–90, 10.5194/essd-6-69-2014, 2014.

Basak, D., Pal, S., and Patranabis, D. C.: Support vector regression, Neu. Inf. Pro.-Letters and Reviews, 11, 203–224, 2007.

Boyer, T. P., Antonov, J. I., Baranova, O. K., Coleman, C., Garcia, H. E., Grodsky, A., Johnson, D. R., Locarnini, R. A., Mishonov, A. V., O'Brien, T. D., Paver, C. R., Reagan, J. R., Seidov, D., Smolyar, I. V., and Zweng, M. M.: World Ocean Database 2013, NOAA Atlas NESDIS 72, edited by: Levitus, S. and Mishonov, A., Technical Ed., Silver Spring, MD, 209 pp. 2013.

Chierici, M., Fransson, A., and Nojiri, Y.: Biogeochemical processes as drivers of surface fCO2 in contrasting provinces in the subarctic North Pacific Ocean, Global Biogeochem. Cy., 20, GB1009, 10.1029/2004GB002356, 2006.

Dee, D. P., Uppala, S. M., Simmons, A. J., Berrisford, P., Poli, P., Kobayashi, S., Andrae, U., Balmaseda, M. A., Balsamo, G., Bauer, P., Bechtold, P., Beljaars, A. C. M., van de Berg, L., Bidlot, J., Bormann, N., Delsol, C., Dragani, R., Fuentes, M., Geer, A. J., Haimberger, L., Healy, S. B., Hersbach, H., Hólm, E. V., Isaksen, L., Kållberg, P., Köhler, M., Matricardi, M., McNally, A. P., Monge-Sanz, B. M., Morcrette, J.-J., Park, B.-K., Peubey, C., de Rosnay, P., Tavolato, C., Thépaut, J.-N., and Vitart, F.: The ERA-Interim reanalysis: configuration and performance of the data assimilation system, Q. J. Roy. Meteor. Soc., 137, 553–597, 2011.

Friedrich, T. and Oschlies, A.: Neural network-based estimates of North Atlantic surface pCO2 from satellite data: A methodological study, J. Geophys. Res., 114, C03020, 10.1029/2007JC004646, 2009.

Goddijn-Murphy, L. M., Woolf, D. K., Land, P. E., Shutler, J. D., and Donlon, C.: The OceanFlux Greenhouse Gases methodology for deriving a sea surface climatology of CO2 fugacity in support of air–sea gas flux studies, Ocean Sci., 11, 519–541, 10.5194/os-11-519-2015, 2015.

Iida, Y., Kojima, A., Takatani, Y., Nakano, T., Sugimoto, H., Midorikawa, T., and Ishii, M.: Trends in pCO2 and sea-air CO2 flux over the global open oceans for the last two decades, J. Oceanogr., 71, 637–661, 2015.

Jamet, C., Moulin, C., and Lefèvre, N.: Estimation of the oceanic pCO2 in the North Atlantic from VOS lines in-situ measurements: parameters needed to generate seasonally mean maps, Ann. Geophys., 25, 2247–2257, 10.5194/angeo-25-2247-2007, 2007.

Jones, S. D., Quéré, C. L., Rödenbeck, C., Manning, A. C., and Olsen, A.: A statistical gap-filling method to interpolate global monthly surface ocean carbon dioxide data, J. Adv. Model. Earth Syst., 7, 1942–2466, 2015.

Kohonen, T.: Self-Organization and Associative Memory, Springer, Berlin, 1984.

Landschützer, P., Gruber, N., Bakker, D. C. E., Schuster, U., Nakaoka, S., Payne, M. R., Sasse, T. P., and Zeng, J.: A neural network-based estimate of the seasonal to inter-annual variability of the Atlantic Ocean carbon sink, Biogeosciences, 10, 7793–7815, 10.5194/bg-10-7793-2013, 2013.

Landschützer, P., Gruber, N., Haumann, F., Rödenbeck, C., Bakker, D., van Heuven, S., Hoppema, M., Metzl, N., Sweeney, C., Takahashi, T., Tilbrook, B., and Wanninkhof, R.: The reinvigoration of the Southern Ocean carbon sink, Science, 349, 1221–1224, 2015.

Le Quéré, C., Moriarty, R., Andrew, R. M., Canadell, J. G., Sitch, S., Korsbakken, J. I., Friedlingstein, P., Peters, G. P., Andres, R. J., Boden, T. A., Houghton, R. A., House, J. I., Keeling, R. F., Tans, P., Arneth, A., Bakker, D. C. E., Barbero, L., Bopp, L., Chang, J., Chevallier, F., Chini, L. P., Ciais, P., Fader, M., Feely, R. A., Gkritzalis, T., Harris, I., Hauck, J., Ilyina, T., Jain, A. K., Kato, E., Kitidis, V., Klein Goldewijk, K., Koven, C., Landschützer, P., Lauvset, S. K., Lefèvre, N., Lenton, A., Lima, I. D., Metzl, N., Millero, F., Munro, D. R., Murata, A., Nabel, J. E. M. S., Nakaoka, S., Nojiri, Y., O'Brien, K., Olsen, A., Ono, T., Pérez, F. F., Pfeil, B., Pierrot, D., Poulter, B., Rehder, G., Rödenbeck, C., Saito, S., Schuster, U., Schwinger, J., Séférian, R., Steinhoff, T., Stocker, B. D., Sutton, A. J., Takahashi, T., Tilbrook, B., van der Laan-Luijkx, I. T., van der Werf, G. R., van Heuven, S., Vandemark, D., Viovy, N., Wiltshire, A., Zaehle, S., and Zeng, N.: Global Carbon Budget 2015, Earth Syst. Sci. Data, 7, 349–396, 10.5194/essd-7-349-2015, 2015.

Lefèvre, N., Watson, A. J., and Watson, A. R.: A comparison of multiple regression and neural network techniques for mapping in situ pCO2 data, Tellus B, 57, 375–384, 2005.

Nakaoka, S., Telszewski, M., Nojiri, Y., Yasunaka, S., Miyazaki, C., Mukai, H., and Usui, N.: Estimating temporal and spatial variation of ocean surface pCO2 in the North Pacific using a self-organizing map neural network technique, Biogeosciences, 10, 6093–6106, 10.5194/bg-10-6093-2013, 2013.

Park, G.-H., Wanninkhof, R., Doney, S. C., Takahashi, T., Lee, K., Feely, R. A., Sabine, C. L., Triñanes, J., and Lima, I. D.: Variability of global net sea-air CO2 fluxes over the last three decades using empirical relationships, Tellus B, 62, 352–368, 10.1111/j.1600-0889.2010.00498.x, 2010.

Pelckmans, K., Suykens, J. A. K., Gestel, T. V., Brabanter, J. D., Hamers, B., Moor, D., and Vandewalle, J.: LS-SVMlab: a MATLAB/C toolbox for Least Squares Support Vector Machines, presented at Neural Information Processing Systems (NIPS 2002), available at: http://www.esat.kuleuven.ac.be/sista/lssvmlab (last access: April 2017), 2002.

Pfeil, B., Olsen, A., Bakker, D. C. E., Hankin, S., Koyuk, H., Kozyr, A., Malczyk, J., Manke, A., Metzl, N., Sabine, C. L., Akl, J., Alin, S. R., Bates, N., Bellerby, R. G. J., Borges, A., Boutin, J., Brown, P. J., Cai, W.-J., Chavez, F. P., Chen, A., Cosca, C., Fassbender, A. J., Feely, R. A., González-Dávila, M., Goyet, C., Hales, B., Hardman-Mountford, N., Heinze, C., Hood, M., Hoppema, M., Hunt, C. W., Hydes, D., Ishii, M., Johannessen, T., Jones, S. D., Key, R. M., Körtzinger, A., Landschützer, P., Lauvset, S. K., Lefèvre, N., Lenton, A., Lourantou, A., Merlivat, L., Midorikawa, T., Mintrop, L., Miyazaki, C., Murata, A., Nakadate, A., Nakano, Y., Nakaoka, S., Nojiri, Y., Omar, A. M., Padin, X. A., Park, G.-H., Paterson, K., Perez, F. F., Pierrot, D., Poisson, A., Ríos, A. F., Santana-Casiano, J. M., Salisbury, J., Sarma, V. V. S. S., Schlitzer, R., Schneider, B., Schuster, U., Sieger, R., Skjelvan, I., Steinhoff, T., Suzuki, T., Takahashi, T., Tedesco, K., Telszewski, M., Thomas, H., Tilbrook, B., Tjiputra, J., Vandemark, D., Veness, T., Wanninkhof, R., Watson, A. J., Weiss, R., Wong, C. S., and Yoshikawa-Inoue, H.: A uniform, quality controlled Surface Ocean CO2 Atlas (SOCAT), Earth Syst. Sci. Data, 5, 125–143, 10.5194/essd-5-125-2013, 2013.

Reynolds, R. W., Rayner, N. A., Smith, T. M., Stokes, D. C., and Wang, W.: An Improved In Situ and Satellite SST Analysis for Climate, J. Climate, 15, 1609–1625, 2002.

Rödenbeck, C., Keeling, R. F., Bakker, D. C. E., Metzl, N., Olsen, A., Sabine, C., and Heimann, M.: Global surface-ocean pCO2 and sea–air CO2 flux variability from an observation-driven ocean mixed-layer scheme, Ocean Sci., 9, 193–216, 10.5194/os-9-193-2013, 2013.

Rödenbeck, C., Bakker, D. C. E., Gruber, N., Iida, Y., Jacobson, A. R., Jones, S., Landschützer, P., Metzl, N., Nakaoka, S., Olsen, A., Park, G.-H., Peylin, P., Rodgers, K. B., Sasse, T. P., Schuster, U., Shutler, J. D., Valsala, V., Wanninkhof, R., and Zeng, J.: Data-based estimates of the ocean carbon sink variability – first results of the Surface Ocean pCO2 Mapping intercomparison (SOCOM), Biogeosciences, 12, 7251–7278, 10.5194/bg-12-7251-2015, 2015.

Sabine, C. L., Hankin, S., Koyuk, H., Bakker, D. C. E., Pfeil, B., Olsen, A., Metzl, N., Kozyr, A., Fassbender, A., Manke, A., Malczyk, J., Akl, J., Alin, S. R., Bellerby, R. G. J., Borges, A., Boutin, J., Brown, P. J., Cai, W.-J., Chavez, F. P., Chen, A., Cosca, C., Feely, R. A., González-Dávila, M., Goyet, C., Hardman-Mountford, N., Heinze, C., Hoppema, M., Hunt, C. W., Hydes, D., Ishii, M., Johannessen, T., Key, R. M., Körtzinger, A., Landschützer, P., Lauvset, S. K., Lefèvre, N., Lenton, A., Lourantou, A., Merlivat, L., Midorikawa, T., Mintrop, L., Miyazaki, C., Murata, A., Nakadate, A., Nakano, Y., Nakaoka, S., Nojiri, Y., Omar, A. M., Padin, X. A., Park, G.-H., Paterson, K., Perez, F. F., Pierrot, D., Poisson, A., Ríos, A. F., Salisbury, J., Santana-Casiano, J. M., Sarma, V. V. S. S., Schlitzer, R., Schneider, B., Schuster, U., Sieger, R., Skjelvan, I., Steinhoff, T., Suzuki, T., Takahashi, T., Tedesco, K., Telszewski, M., Thomas, H., Tilbrook, B., Vandemark, D., Veness, T., Watson, A. J., Weiss, R., Wong, C. S., and Yoshikawa-Inoue, H.: Surface Ocean CO2 Atlas (SOCAT) gridded data products, Earth Syst. Sci. Data, 5, 145–153, 10.5194/essd-5-145-2013, 2013.

Sarma, V. V. S. S., Saino, T., Sasaoka, K., Nojiri, Y., Ono, T., Ishii, M., Inoue, H. Y., and Matsumoto, K.: Basin-scale pCO2 distribution using satellite sea surface temperature, Chla, and climatological salinity in the North Pacific in spring and summer, Global Biogeochem. Cy., 20, GB3005, 10.1029/2005GB002594, 2006.

Sasse, T. P., McNeil, B. I., and Abramowitz, G.: A new constraint on global air-sea CO2 fluxes using bottle carbon data, Geophys. Res. Lett., 40, 1594–1599, 2013.

Schmidtko, S., Johnson, G. C., and Lyman, J. M.: MIMOC: A global monthly isopycnal upper-ocean climatology with mixed layers, J. Geophys. Res., 118, 1658–1672, 10.1002/jgrc.20122, 2013.

Stocker, T., Qin, D., and Plattner, G.-K.: Climate Change 2013: The Physical Science Basis, Cambridge University Press, Cambridge, United Kingdom, 2013.

Takahashi, T., Sutherland, S. C., Sweeney, C., Poisson, A., Metzl, N., Tilbrook, B., Bates, N., Wanninkhof, R., Feely, R. A., Sabine, C., Olafsson, J., and Nojiri, Y.: Global sea-air CO2 flux based on climatological surface ocean pCO2, and seasonal biological and temperature effects, Deep-Sea Res. Pt. II, 49, 1601–1622, 2002.

Takahashi, T., Sutherland, S. C., Wanninkhof, R., Sweeney, C., Feely, R. A., Chipman, D. W., Hales, B., Friederich, G., Chavez, F., Sabine, C., Watson, A., Bakker, D. C. E., Schuster, U., Metzl, N., Yoshikawa-Inoue, H., Ishii, M., Midorikawa, T., Nojiri, Y., Körtzinger, A., Steinhoff, T., Hoppema, M., Olafsson, J., Arnarson, T. S., Tilbrook, B., Johannessen, T., Olsen, A., Bellerby, R., and Wong, C. S.: Climatological mean and decadal change in surface ocean pCO2, and net sea-air CO2 flux over the global oceans, Deep-Sea Res. Pt. II, 56, 554–577, 2009.

Takahashi, T., Sutherland, S. C., Chipman, D. W., Goddard, J. G., Ho, C., Newberger, T., Sweeney, C., and Munro, D. R.: Climatological distributions of pH, pCO2, total CO2, alkalinity, and CaCO3 saturation in the global surface ocean, and temporal changes at selected locations, Mar. Chem., 164, 95–145, 2014.

Takamura, T. R., Inoue, H. Y., Midorikawa, T., Ishii, M., and Nojiri, Y.: Seasonal and Inter-Annual Variations in pCO2 sea and Air-Sea CO2 Fluxes in Mid-Latitudes of the Western and Eastern North Pacific during 1999–2006: Recent Results Utilizing Voluntary Observation Ships, J. Meteorol. Soc. Jpn., 88, 883–898, 2010.

Telszewski, M., Chazottes, A., Schuster, U., Watson, A. J., Moulin, C., Bakker, D. C. E., González-Dávila, M., Johannessen, T., Körtzinger, A., Lüger, H., Olsen, A., Omar, A., Padin, X. A., Ríos, A. F., Steinhoff, T., Santana-Casiano, M., Wallace, D. W. R., and Wanninkhof, R.: Estimating the monthly pCO2 distribution in the North Atlantic using a self-organizing neural network, Biogeosciences, 6, 1405–1421, 10.5194/bg-6-1405-2009, 2009.

Wanninkhof, R., Park, G.-H., Takahashi, T., Sweeney, C., Feely, R., Nojiri, Y., Gruber, N., Doney, S. C., McKinley, G. A., Lenton, A., Le Quéré, C., Heinze, C., Schwinger, J., Graven, H., and Khatiwala, S.: Global ocean carbon uptake: magnitude, variability and trends, Biogeosciences, 10, 1983–2000, 10.5194/bg-10-1983-2013, 2013.

Wilamowski, B. M. and Yu, H.: Improved Computation for Levenberg-Marquardt Training, IEEE T. Neural Networ., 21, 930–937, 2010.

Zeng, J., Nojiri, Y., Landschützer, P., Telszewski, M., and Nakaoka, S.: A global surface ocean fCO2 climatology based on a feedforward neural network, J. Atmos. Ocean Tech., 31, 1838–1849, 2014.

Zeng, J., Nojiri, Y., Nakaoka, S., Nakajima, H., and Shirai, T.: Surface ocean CO2 in 1990–2011 modelled using a feed-forward neural network, Geoscience Data Journal, 2, 47–51, 2015.

Zeng, J. Y., Nojiri, Y., Murphy, P. P., Wong, C. S., and Fujinuma, Y.: A comparison of Delta pCO2 distributions in the northern North Pacific using results from a commercial vessel in 1995–1999, Deep-Sea Res. Pt. II, 49, 5303–5315, 2002.

</app></app-group></back> </article>