<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" dtd-version="3.0">
  <front>
    <journal-meta>
<journal-id journal-id-type="publisher">OS</journal-id>
<journal-title-group>
<journal-title>Ocean Science</journal-title>
<abbrev-journal-title abbrev-type="publisher">OS</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">Ocean Sci.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">1812-0792</issn>
<publisher><publisher-name>Copernicus Publications</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>

    <article-meta>
      <article-id pub-id-type="doi">10.5194/os-13-303-2017</article-id><title-group><article-title>Technical note: Evaluation of three machine learning models <?xmltex \hack{\newline}?> for surface
ocean CO<inline-formula><mml:math id="M1" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> mapping</article-title>
      </title-group><?xmltex \runningtitle{Evaluation of three machine learning models for surface
ocean CO${}_{{2}}$ mapping}?><?xmltex \runningauthor{J.~Zeng et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Zeng</surname><given-names>Jiye</given-names></name>
          <email>zeng@nies.go.jp</email>
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Matsunaga</surname><given-names>Tsuneo</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-3380-5230</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Saigusa</surname><given-names>Nobuko</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Shirai</surname><given-names>Tomoko</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Nakaoka</surname><given-names>Shin-ichiro</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Tan</surname><given-names>Zheng-Hong</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>Centre for Global Environmental Research, National Institute for Environmental Studies, Tsukuba, Ibaraki, Japan</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Institute of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, China</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Jiye Zeng (zeng@nies.go.jp)</corresp></author-notes><pub-date><day>19</day><month>April</month><year>2017</year></pub-date>
      
      <volume>13</volume>
      <issue>2</issue>
      <fpage>303</fpage><lpage>313</lpage>
      <history>
        <date date-type="received"><day>6</day><month>September</month><year>2016</year></date>
           <date date-type="rev-request"><day>25</day><month>October</month><year>2016</year></date>
           <date date-type="rev-recd"><day>16</day><month>March</month><year>2017</year></date>
           <date date-type="accepted"><day>24</day><month>March</month><year>2017</year></date>
      </history>
      <permissions>
<license license-type="open-access">
<license-p>This work is licensed under a Creative Commons Attribution 3.0 Unported License. To view a copy of this license, visit <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/3.0/">http://creativecommons.org/licenses/by/3.0/</ext-link></license-p>
</license>
</permissions><self-uri xlink:href="https://os.copernicus.org/articles/13/303/2017/os-13-303-2017.html">This article is available from https://os.copernicus.org/articles/13/303/2017/os-13-303-2017.html</self-uri>
<self-uri xlink:href="https://os.copernicus.org/articles/13/303/2017/os-13-303-2017.pdf">The full text article is available as a PDF file from https://os.copernicus.org/articles/13/303/2017/os-13-303-2017.pdf</self-uri>


      <abstract>
    <p>Reconstructing surface ocean CO<inline-formula><mml:math id="M2" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> from scarce measurements plays an
important role in estimating oceanic CO<inline-formula><mml:math id="M3" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> uptake. There are varying
degrees of differences among the 14 models included in the Surface Ocean
CO<inline-formula><mml:math id="M4" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> Mapping (SOCOM) inter-comparison initiative, in which five models
used neural networks. This investigation evaluates two neural networks used
in SOCOM, self-organizing maps and feedforward neural networks, and
introduces a machine learning model called a support vector machine for ocean
CO<inline-formula><mml:math id="M5" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> mapping. The technique note provides a practical guide to selecting
the models.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <title>Introduction</title>
      <p>The global ocean is a major sink for anthropogenic carbon and therefore an
important contributor to slowing down the human-induced global warming
(Stocker et al., 2013). To calculate the oceanic CO<inline-formula><mml:math id="M6" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> uptake, various
models have been used to interpolate scarce CO<inline-formula><mml:math id="M7" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> measurements in the
surface ocean spatially and temporarily to obtain basin-wide
(e.g., Zeng et al., 2002; Lefèvre
et al., 2005; Chierici et al., 2006; Sarma et al., 2006; Jamet et al., 2007;
Friedrich and Oschlies, 2009; Telszewski et al., 2009; Takamura et al., 2010;
Landschützer et al., 2013; Nakaoka et al., 2013; Iida et al., 2015;
Goddijn-Murphy et al, 2015) and global ocean CO<inline-formula><mml:math id="M8" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> maps (Takahashi et al.,
2002, 2009, 2014; Park et al., 2010; Rödenbeck et al., 2013; Sasse et
al., 2013; Jones et al., 2015; Zeng et al., 2015). The Surface Ocean CO<inline-formula><mml:math id="M9" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>
Mapping (SOCOM) inter-comparison initiative revealed varying degrees of
differences among 14 models (Rödenbeck et al., 2015), of which 5 used
neural networks. They include self-organizing maps (SOMs) and feedforward
neural networks (FNNs). The SOM has a long history in CO<inline-formula><mml:math id="M10" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> mapping
(Lefèvre et al., 2005; Friedrich and Oschlies, 2009; Telszewski et al.,
2009; Nakaoka et al., 2013). Recently, the FNN has been gaining popularity in
this field (Landschützer et al., 2015; Zeng et al., 2014, 2015). In this
investigation we introduce a machine learning model called a support vector
machine (SVM) for ocean CO<inline-formula><mml:math id="M11" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> mapping and compare the SVM with the SOM and
FNN. We intend to provide a practical guide for using these machine learning
models.</p>
</sec>
<sec id="Ch1.S2">
  <title>Model equations</title>
      <p>The machine learning models included in this study cannot directly model the
long-term trend of CO<inline-formula><mml:math id="M12" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>. Therefore, we express the dependence of
CO<inline-formula><mml:math id="M13" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> fugacity (<inline-formula><mml:math id="M14" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> on year (YR), month (MON), latitude (LAT), and
longitude (LON) as the sum of a nonlinear static component and a linear trend
component:
          <disp-formula id="Ch1.E1" content-type="numbered"><mml:math id="M16" display="block"><mml:mrow><mml:mi>f</mml:mi><mml:msub><mml:mrow class="chem"><mml:mi mathvariant="normal">CO</mml:mi></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mi mathvariant="normal">static</mml:mi></mml:msub><mml:mfenced open="(" close=")"><mml:mi mathvariant="normal">LAT</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">LON</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">MON</mml:mi></mml:mfenced><mml:mo>+</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mi mathvariant="normal">trend</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="normal">YR</mml:mi><mml:mo>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
        As available observations are scarce with respect to the biogeochemical
properties of the surface ocean, we used sea surface temperature (SST), sea
surface salinity (SSS), chlorophyll-<inline-formula><mml:math id="M17" display="inline"><mml:mi>a</mml:mi></mml:math></inline-formula> concentration (CHL), and mixed layer depth
(MLD) as the proxy variables of space and time. These proxy variables were
commonly used by models included in the SOCOM. The model equation becomes
          <disp-formula id="Ch1.E2" content-type="numbered"><mml:math id="M18" display="block"><mml:mrow><?xmltex \hack{\hbox\bgroup\fontsize{8.5}{8.5}\selectfont$\displaystyle}?><mml:mi>f</mml:mi><mml:msub><mml:mrow class="chem"><mml:mi mathvariant="normal">CO</mml:mi></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mi mathvariant="normal">static</mml:mi></mml:msub><mml:mfenced close=")" open="("><mml:mi mathvariant="normal">LAT</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">SST</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">SSS</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">CHL</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">MLD</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">dSST</mml:mi></mml:mfenced><mml:mo>+</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mi mathvariant="normal">trend</mml:mi></mml:msub><mml:mfenced close=")" open="("><mml:mi mathvariant="normal">YR</mml:mi></mml:mfenced><?xmltex \hack{$\egroup}?></mml:mrow></mml:math></disp-formula>
        where dSST denotes the difference between the monthly and annual means of
SST. Here we excluded LON and MON. They have a circular property and
therefore cannot be used directly. For instance, longitude <inline-formula><mml:math id="M19" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>180<inline-formula><mml:math id="M20" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> is
geographically connected to longitude 180<inline-formula><mml:math id="M21" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>, but numerically they
appear to be two extreme longitude values to the models. Zeng et al. (2014,
2015) circumvented this problem by using sine and cosine transformed
components. Their approach could unintentionally enhance the influence of
LON and MON on <inline-formula><mml:math id="M22" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M23" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> as one more derived variable from each of them
was added to the model. We excluded LON in the belief that the combination
of SST, SSS, CHL, and MLD contains sufficient spatial information,
but retained LAT for its different seasonal and geophysical meanings in the
Northern and Southern hemispheres. Replacing MON with dSST also improves
the expression of the effect of season geographically.</p>
</sec>
<sec id="Ch1.S3">
  <title>Data</title>
      <p>We extracted monthly <inline-formula><mml:math id="M24" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M25" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> from the track-gridded database of the Surface
Ocean CO<inline-formula><mml:math id="M26" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> Atlas (SOCAT) version 3.0<fn id="Ch1.Footn1"><p><uri>http://www.socat.info/</uri></p></fn>
(Pfeil et al., 2013; Sabine et al., 2013; Bakker
et al., 2014). The database has a 1<inline-formula><mml:math id="M27" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> <inline-formula><mml:math id="M28" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 1<inline-formula><mml:math id="M29" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>
spatial resolution and includes global measurements from 1970 to 2014.
Similar to Zeng et al. (2014), we excluded some data points by these
criteria: (i) <inline-formula><mml:math id="M30" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M31" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> values smaller than 250 <inline-formula><mml:math id="M32" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm or larger than
550 <inline-formula><mml:math id="M33" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm, (ii) ocean depth smaller than 500 m, (iii) salinity smaller
than 25.0, and (iv) year before 1990. A total of 158 052 data points were
extracted with these conditions.</p>
      <p>The monthly SST data of 1990 to 2015 were extracted from the Optimum
Interpolation (OI) V2 product<fn id="Ch1.Footn2"><p><uri>http://www.esrl.noaa.gov/psd/data/gridded/data.noaa.oisst.v2.html</uri></p></fn> of NOAA
(Reynolds et al., 2002). The monthly SSS climatology was extracted from the
World Ocean Atlas 2013 (WOA13) product<fn id="Ch1.Footn3"><p><uri>https://www.nodc.noaa.gov/OC5/woa13/</uri></p></fn>
(Boyer et al., 2013), which
contains the monthly mean SSS from 27 June 1896 to 25 December 2012. The
monthly CHL climatology was calculated using the MODIS Aqua and SeaWiFS
climatology<fn id="Ch1.Footn4"><p><uri>https://oceancolor.gsfc.nasa.gov/cgi/l3</uri></p></fn>,
which covers the period of 2012 to 2015. The mean of the two CHLs was used
as the CHL climatology. The mixed layer data were derived from the Monthly
Isopycnal and Mixed-layer Ocean Climatology<fn id="Ch1.Footn5"><p><uri>http://www.pmel.noaa.gov/mimoc/</uri></p></fn>
of NOAA (Schmidtko et al., 2013), which
includes the period of 1955 to 2012.</p>
</sec>
<sec id="Ch1.S4">
  <title>Machine learning models</title>
      <p>The Appendix and Table 1 summarize the algorithms of the three models. Here
we focus on discussing their usage in CO<inline-formula><mml:math id="M34" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> mapping.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1" specific-use="star"><caption><p>Feature comparison of the three machine learning models.</p></caption><oasis:table frame="topbot"><?xmltex \begin{scaleboxenv}{.9}[.9]?><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="justify" colwidth="142.26378pt"/>
     <oasis:colspec colnum="3" colname="col3" align="justify" colwidth="142.26378pt"/>
     <oasis:colspec colnum="4" colname="col4" align="justify" colwidth="142.26378pt"/>
     <oasis:thead>
       <oasis:row rowsep="1">  
         <oasis:entry colname="col1">Feature</oasis:entry>  
         <oasis:entry colname="col2">SVM</oasis:entry>  
         <oasis:entry colname="col3">FNN</oasis:entry>  
         <oasis:entry colname="col4">SOM</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">  
         <oasis:entry colname="col1">Input space projection</oasis:entry>  
         <oasis:entry colname="col2">Projects the input variable space to a high-dimensional space that is proportional to the number of training samples.</oasis:entry>  
         <oasis:entry colname="col3">Projects the input space to a high-dimensional space that is proportional to the number of hidden neurons and input variables.</oasis:entry>  
         <oasis:entry colname="col4">Projects the input space to a feature space whose size is determined by the number of neurons.</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">  
         <oasis:entry colname="col1">Prediction by</oasis:entry>  
         <oasis:entry colname="col2">Continuous interpolation.</oasis:entry>  
         <oasis:entry colname="col3">Continuous interpolation.</oasis:entry>  
         <oasis:entry colname="col4">Picking up labeling samples that have the closest feature to the input.</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">  
         <oasis:entry colname="col1">Problems</oasis:entry>  
         <oasis:entry colname="col2">May over-fit and over-interpolate.</oasis:entry>  
         <oasis:entry colname="col3">May over-fit and over-interpolate.</oasis:entry>  
         <oasis:entry colname="col4">Discrete interpolation leads to spatial discontinuity.</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">  
         <oasis:entry colname="col1">Data scaling</oasis:entry>  
         <oasis:entry colname="col2">Helps in solving the linear equation, but has no effect on the result.</oasis:entry>  
         <oasis:entry colname="col3">Helps the convergence of training, but has an insignificant effect on the result.</oasis:entry>  
         <oasis:entry colname="col4">Significant effect on the result.</oasis:entry>
       </oasis:row>
       <oasis:row>  
         <oasis:entry colname="col1">Results affected by</oasis:entry>  
         <oasis:entry colname="col2">The parameter values for regularization and kernel function.</oasis:entry>  
         <oasis:entry colname="col3">The number of hidden neurons.</oasis:entry>  
         <oasis:entry colname="col4">The number of neurons and data scaling.</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup><?xmltex \end{scaleboxenv}?></oasis:table></table-wrap>

      <p><?xmltex \hack{\newpage}?>The trend in Eq. (2) cannot be modeled directly by the models. One approach
to dealing with the problem is to normalize the measurements to a reference
year using a global rate and to only model the nonlinear component. Zeng
et al. (2014) presented a method to model the linear component in Eq. (2).
Instead of repeating the process, we used their annual rate of
1.5 <inline-formula><mml:math id="M35" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm to remove the trend from <inline-formula><mml:math id="M36" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M37" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> to normalize it to
the reference year 2005, i.e.,
          <disp-formula id="Ch1.E3" content-type="numbered"><mml:math id="M38" display="block"><mml:mrow><mml:mi>f</mml:mi><mml:msubsup><mml:mrow class="chem"><mml:mi mathvariant="normal">CO</mml:mi></mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="normal">normalized</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mi>f</mml:mi><mml:msub><mml:mrow class="chem"><mml:mi mathvariant="normal">CO</mml:mi></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1.5</mml:mn><mml:mo>⋅</mml:mo><mml:mo>(</mml:mo><mml:mtext>YR-2005</mml:mtext><mml:mo>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
        Although Takahashi et al. (2014) obtained a global mean rate of 1.9 <inline-formula><mml:math id="M39" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm yr<inline-formula><mml:math id="M40" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>,
we used 1.5 <inline-formula><mml:math id="M41" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm yr<inline-formula><mml:math id="M42" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> as this rate was obtained by
using the gridded <inline-formula><mml:math id="M43" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M44" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> of SOCAT version 2. The normalized <inline-formula><mml:math id="M45" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M46" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> was
used to model the nonlinear component in Eq. (2). In later discussions,
<inline-formula><mml:math id="M47" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M48" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> means the normalized <inline-formula><mml:math id="M49" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M50" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> unless explicitly stated. Similarly,
we applied the log transform of Zeng et  al. (2014) to  CHL prior to data
scaling discussed below, i.e.,
          <disp-formula id="Ch1.E4" content-type="numbered"><mml:math id="M51" display="block"><mml:mrow><mml:mrow class="chem"><mml:mi mathvariant="normal">CHL</mml:mi></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi>log⁡</mml:mi><mml:mn mathvariant="normal">10</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1.0</mml:mn><mml:mo>+</mml:mo><mml:mi mathvariant="normal">CHL</mml:mi><mml:mo>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p>
<sec id="Ch1.S4.SS1">
  <title>SMV</title>
      <p>For a given dataset, the SVM requires a prior step to find the optimal value
for the parameter <inline-formula><mml:math id="M52" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> in Eq. (A10) and the parameter <inline-formula><mml:math id="M53" display="inline"><mml:mi mathvariant="italic">γ</mml:mi></mml:math></inline-formula> in
Eq. (A11). To shorten the training time, we randomly chose 10 % of the
measurement data in this step and obtained 0.06 for <inline-formula><mml:math id="M54" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> and 10 for
<inline-formula><mml:math id="M55" display="inline"><mml:mi mathvariant="italic">γ</mml:mi></mml:math></inline-formula>. Note that these values are dependent on data scaling, which is
necessary in this case to avoid the overflow problem in solving Eq. (A18). We
scaled all input variables LAT, SST, SSS, CHL, MLD, and dSST by their minimum
and maximum to confine them in the range (0, 1), i.e.,
            <disp-formula id="Ch1.E5" content-type="numbered"><mml:math id="M56" display="block"><mml:mrow><mml:mi>v</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi>v</mml:mi><mml:mo>-</mml:mo><mml:msub><mml:mi>v</mml:mi><mml:mi mathvariant="normal">min</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>v</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>v</mml:mi><mml:mi mathvariant="normal">min</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p>
</sec>
<sec id="Ch1.S4.SS2">
  <title>FNN</title>
      <p>Data scaling is not necessary for the FNN, but can improve its performance.
Following Zeng et al. (2014), we scaled the input variables by their mean
and standard deviation as
            <disp-formula id="Ch1.E6" content-type="numbered"><mml:math id="M57" display="block"><mml:mrow><mml:mi>v</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi>v</mml:mi><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>v</mml:mi><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mrow><mml:mi>s</mml:mi></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          The output variable <inline-formula><mml:math id="M58" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M59" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> is scaled by
            <disp-formula id="Ch1.E7" content-type="numbered"><mml:math id="M60" display="block"><mml:mrow><mml:mi>v</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0.8</mml:mn><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi>v</mml:mi><mml:mo>-</mml:mo><mml:msub><mml:mi>v</mml:mi><mml:mi mathvariant="normal">min</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>v</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>v</mml:mi><mml:mi mathvariant="normal">min</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          This confines the scaled <inline-formula><mml:math id="M61" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M62" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> to between 0.1 and 0.9 for better
response to changes in input variables. The kernel function Eq. (A4) has the
property that for any input in (<inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mi mathvariant="normal">∞</mml:mi></mml:mrow></mml:math></inline-formula>), the output varies
between 0 and 1. For <inline-formula><mml:math id="M65" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M66" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> close to 0 or 1, a small change in
<inline-formula><mml:math id="M67" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M68" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> requires very large adjustment of model parameters, which slows
down the convergence of training.</p>
      <p><?xmltex \hack{\newpage}?>We used 64 hidden neurons for the FNN as Zeng et al. (2014) did. The learning
rate in Eq. (A6) was set to 0.25 by trial-and-error. A small value makes
training slow, whereas a large value may make a training diverge. The
constant in Eq. (A8) was determined dynamically in each iterative training
loop. It was taken as 10 times the mean of absolute differences between
modeled and observed <inline-formula><mml:math id="M69" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M70" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>. We experienced that this method improves
the performance of training.</p>
</sec>
<sec id="Ch1.S4.SS3">
  <title>SOM</title>
      <p>Data scaling is critical for the SOM, as the distance defined by Eq. (A1)
would be affected by variable units. We used Eq. (6) to scale input variables
in training the SOM. Based on our preliminary correlation analysis, we
applied a factor of 2 to enhance the influence of  SST and  CHL on the distance.
Using such a subjective factor is the only way to make the correlations
between the output and the input variables more in line with observed
correlations.</p>
      <p>From the labeling procedure of SOM described in the Appendix, it is not
difficult to see that the number of neuron cells in SOM affects the labeling
and hence the prediction. Unfortunately, there is no guideline for choosing
the size. Based on previous studies (Telszewski et al., 2009; Nakaoka et al.,
2013), we used 20 000 neuron cells, roughly one neuron cell for one
1 <inline-formula><mml:math id="M71" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 1 grid cell of sampled areas.</p>
</sec>
</sec>
<sec id="Ch1.S5">
  <title>Model validation</title>
      <p>We examined the goodness of fit by randomly selecting 10 to 50 % of the
data points to train the FNN and SVM, and to label the SOM; and then
calculated the correlation coefficient between modeled and observed CO<inline-formula><mml:math id="M72" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>
of the selected data points.</p>
      <p>The SOM yields the best correlation in the case of 10 % of randomly
selected data points and the correlation decreases with the number of data
points (Fig. 1). The reason is that for a given number of neuron cells, the
fewer the data points, the less likely a neuron cell will be labeled by
multiple measurements and the more likely that the prediction will find the
same CO<inline-formula><mml:math id="M73" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> value used for labeling. Therefore, the goodness of fit does
not necessary mean good SOM modeling.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1"><caption><p>Correlation coefficient between modeled and observed <inline-formula><mml:math id="M74" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M75" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>
(uatm). The sample size is the number of data points randomly selected to
train FFN and SVM and to label SOM.</p></caption>
        <?xmltex \igopts{width=199.169291pt}?><graphic xlink:href="https://os.copernicus.org/articles/13/303/2017/os-13-303-2017-f01.png"/>

      </fig>

      <p>The correlations obtained by the SVM and FNN do not vary much with the number
of data points. While the SVM's correlation decreases monotonically, even
though by only a little, with the number of data points, the FNN's
correlation obtained with 75 000 data points is larger than that with
60 000 data points. The FNN is known for not being able to find the global
optimum in training. This case could be an indication of an imperfect
training. The FNN appears inferior to SVM in all cases. However, imperfect
training does not account for all the differences. If we use the number of
model parameters to be determined by the training as the indicator of the
dimension of the model space, the FNN's dimension is far smaller than that of
the SVM. The former is determined by the number of hidden neurons and input
variables, whereas the latter is determined by the number of training data.
For 6 input variables, 15 000 training data, and 64 hidden neurons, the
number of parameters is 509 for the FNN and 15001 for the SVM.</p>
      <p>A better indicator of the performance of the models would be the goodness of
prediction. To emulate the situation that the sampled area was only a small
portion of the global ocean, we evaluated the goodness of prediction by
training FNN and SVM and labeling SOM with 10 % of randomly selected data
to make a prediction for the rest of the data. Figure 2 shows that the SVM
yielded the best correlation (<inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo></mml:mrow></mml:math></inline-formula> 0.72), the FNN fell behind
(<inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo></mml:mrow></mml:math></inline-formula> 0.67), and the SOM performed the worst (<inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo></mml:mrow></mml:math></inline-formula> 0.54). The
differences between predicted and observed <inline-formula><mml:math id="M79" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M80" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> are
0.1 <inline-formula><mml:math id="M81" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 17.4 <inline-formula><mml:math id="M82" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm for SVM, 0.1 <inline-formula><mml:math id="M83" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 18.9 <inline-formula><mml:math id="M84" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm for
FNN, and 0.2 <inline-formula><mml:math id="M85" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 23.3 <inline-formula><mml:math id="M86" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm for SOM. Compared to the variation
of <inline-formula><mml:math id="M87" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M88" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> measurements, these differences are small and their
uncertainties are on the same order of magnitude as
the variation of measurements. Let us
examine the standard deviation (SD) of <inline-formula><mml:math id="M89" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M90" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> in
those grids with at least three data points. The track-gridded <inline-formula><mml:math id="M91" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M92" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> in
SOCAT version 3.0 includes an SD ranging from 0.1 to 71.2 <inline-formula><mml:math id="M93" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm and
the mean is 5.2 <inline-formula><mml:math id="M94" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm. Calculating the SD of normalized <inline-formula><mml:math id="M95" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M96" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>
in the same grids and in the same months of all years yielded a mean of
12.5 <inline-formula><mml:math id="M97" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm in the range of 0.1 to 103.1 <inline-formula><mml:math id="M98" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm. The
normalization had little effect on the SD as the calculation for
non-normalized <inline-formula><mml:math id="M99" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M100" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> gives a mean SD of 14.6 <inline-formula><mml:math id="M101" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm in the
range of 0.1 to 107.5 <inline-formula><mml:math id="M102" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2"><caption><p>Predicted vs. observed <inline-formula><mml:math id="M103" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M104" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> (<inline-formula><mml:math id="M105" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm). Ten percent of
data points were selected randomly to train FNN and SVM and to label SOM, and
the rest was used for validation.</p></caption>
        <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://os.copernicus.org/articles/13/303/2017/os-13-303-2017-f02.png"/>

      </fig>

      <p>From the algorithm of SOM in the Appendix, it is not difficult to see that
the SOM does not make extrapolation – the model always approximates new
inputs by the measurements used for training and approximates <inline-formula><mml:math id="M106" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M107" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> by
the measurements used for labeling; therefore, the predicted <inline-formula><mml:math id="M108" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M109" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>
values are within the observed <inline-formula><mml:math id="M110" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M111" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> range (Fig. 2a). Figure 2c shows
that the extrapolated <inline-formula><mml:math id="M112" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M113" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> by the SVM, if any, did not exceed the
observed range. To investigate the extrapolation risk, we used 200 000 data
points randomly generated for SST, dSST, SSS, MLD, and CHL in the range of
(0, 40 <inline-formula><mml:math id="M114" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C), (<inline-formula><mml:math id="M115" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>20, 20 <inline-formula><mml:math id="M116" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C), (20, 50), (1, 1500 m), and (0
log(mg m<inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, 2 log(mg m<inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, respectively. These ranges are
larger than the corresponding observed ranges of (0, 34 <inline-formula><mml:math id="M119" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C), (<inline-formula><mml:math id="M120" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>13,
16 <inline-formula><mml:math id="M121" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C), (24, 40), (1, 1000 m), and (0 log(mg m<inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, 1.2
log(mg m<inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. The SVM and FNN produced <inline-formula><mml:math id="M124" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M125" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> in the range of
(267, 468 <inline-formula><mml:math id="M126" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm) and (199, 596 <inline-formula><mml:math id="M127" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm), respectively, for
the simulated samples. Compared to the observed <inline-formula><mml:math id="M128" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M129" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> range of (240,
560 <inline-formula><mml:math id="M130" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm), the experiment indicates that the over-extrapolation
risk of the SVM is low.</p>
</sec>
<sec id="Ch1.S6">
  <title>Differences</title>
      <p>Figure 3 shows <inline-formula><mml:math id="M131" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M132" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> maps in February and July 2005, which is the
reference year for normalization. In the mapping, we randomly selected
50 % of the data to train the FNN and SVM and to label the SOM. All
models captured the major features of observed <inline-formula><mml:math id="M133" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M134" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> distribution. The
SOM exhibits obvious discontinuity because of its discrete characteristics of
picking up <inline-formula><mml:math id="M135" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M136" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> values from the labeled SOM. For year 2005, the mean
<inline-formula><mml:math id="M137" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M138" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> difference is <inline-formula><mml:math id="M139" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.05 <inline-formula><mml:math id="M140" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 12.73 <inline-formula><mml:math id="M141" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm for FNN<inline-formula><mml:math id="M142" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>SVM
and <inline-formula><mml:math id="M143" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.6 <inline-formula><mml:math id="M144" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 18.80 for SOM<inline-formula><mml:math id="M145" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>SVM. The uncertainty is the standard
deviation of the mean difference between predicted and observed values. The
statistics indicates that FNN agrees better with SVM than SOM does.</p>

      <?xmltex \floatpos{p}?><fig id="Ch1.F3" specific-use="star"><caption><p>Distributions of modeled and
observed <inline-formula><mml:math id="M146" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M147" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>. The composite map for observations includes <inline-formula><mml:math id="M148" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M149" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>
in 1990–2014. Half of randomly selected data points were used to train FNN
and SVM and to label SOM to make predictions. <bold>(a)</bold> shows February and
<bold>(b)</bold> shows July.</p></caption>
        <?xmltex \igopts{width=497.923228pt}?><graphic xlink:href="https://os.copernicus.org/articles/13/303/2017/os-13-303-2017-f03.png"/>

      </fig>

      <p>Although the differences among models might be on the order of 10 to
20 <inline-formula><mml:math id="M150" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm, the effect on the global ocean CO<inline-formula><mml:math id="M151" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> flux estimate is
small (Fig. 4). The fluxes are calculated using the wind speed from ECMWF's
interim product (Dee et al., 2011). Our estimate for the oceanic uptake is on
the higher end among those in Wanninkhof et al. (2013) and Le Quéré
et al. (2015). For example, Wanninkhof et al. (2013) reported that the median
sea–air anthropogenic CO<inline-formula><mml:math id="M152" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> fluxes centered on year 2000 ranged from 1.9
to 2.5 PgC yr<inline-formula><mml:math id="M153" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> among the seven models. In comparison, our estimates
by the three models are about 2.4 PgC yr<inline-formula><mml:math id="M154" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. The mean difference of
CO<inline-formula><mml:math id="M155" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> flux is 0.02 PgC yr<inline-formula><mml:math id="M156" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> between the FNN and the SVM (FNN<inline-formula><mml:math id="M157" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>SVM)
and 0.06 PgC yr<inline-formula><mml:math id="M158" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> between the SOM and the SVM (SOM<inline-formula><mml:math id="M159" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>SVM). They are
small in comparison with those differences among the models in Wanninkhof et
al. (2013) and Le Quéré et al. (2015). Note that the flux estimate is
highly dependent on wind products as shown by Wanninkhof et al. (2013) and
Zeng et al. (2014).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4"><caption><p>Modeled global CO<inline-formula><mml:math id="M160" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> fluxes. A negative value indicates oceanic
uptake.</p></caption>
        <?xmltex \igopts{width=199.169291pt}?><graphic xlink:href="https://os.copernicus.org/articles/13/303/2017/os-13-303-2017-f04.png"/>

      </fig>

      <p><?xmltex \hack{\newpage}?>On the spatial scale of tens of degrees, the three models show good mutual
agreement for modeled <inline-formula><mml:math id="M161" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M162" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> distributions among them. However, each
model shows distinguished fine structures, which are determined by the
biogeochemical processes in the ocean, by model parameters obtained from
training, and by the characteristics of the models. We believe that the
modeled monthly <inline-formula><mml:math id="M163" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M164" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> distributions are true to the degree given by the
model validations.</p>
</sec>
<sec id="Ch1.S7" sec-type="conclusions">
  <title>Summary</title>
      <p>The main features of the three machine models are listed in Table 1. The SVM
is recommended when the computer has enough memory to store the matrix in
Eq. (A18), which is proportional to the square of the number of training
data. The SVM performs the best, but the training time could become very
long when the number of training data is too large to be handled by a
computer without using virtual memory. For any given dataset, using the SVM
requires a prior step to find the optimal value for the parameter <inline-formula><mml:math id="M165" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>
in Eq. (A10) and the parameter <inline-formula><mml:math id="M166" display="inline"><mml:mi mathvariant="italic">γ</mml:mi></mml:math></inline-formula> in Eq. (A11).</p>
      <p>The FNN model does not perform as well as the SVM, but the number of training
data does not affect its training as much as the SVM's. The training time can
become long when a large number of hidden neurons are used and many
iterations are needed to achieve convergence. It takes a longer time to train
the FNN than the SVM for a small number of data points. However, the FNN is
simpler to use as it requires no prior step. However, it may have the risk of
over-extrapolation.</p>
      <p><?xmltex \hack{\newpage}?>The SOM is recommended only when the other two models have over-fitting or
over-interpolation problems. The SOM performs the worst and is not as
straightforward as the others as its result depends too much on data scaling
and the number of neurons. An advantage of the SOM is that once trained,
re-labeling the SOM with new CO<inline-formula><mml:math id="M167" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> measurements and making a new
prediction is fast. Although the SOM does not have the over-extrapolation
problem of the FNN, it may produce nonsense predictions due to its strong
dependence on data scaling.</p>
      <p>In areas where there was no measurement on a large scale, predictions made by
the models must be treated conservatively, as SVM and FNN may produce
extrapolated results and SOM may extract CO<inline-formula><mml:math id="M168" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> from unexpected provinces.
Figure 3 shows that the modeled CO<inline-formula><mml:math id="M169" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> east of the African coast near the
Equator in July 2005 (Fig. 3) appeared much higher than the nearby
measurements, which were made in July 1995 and adjusted to 2005 using the
global rate of 1.5 <inline-formula><mml:math id="M170" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>atm yr<inline-formula><mml:math id="M171" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. However, considering the large
variations of the rate from region to region (Takahashi et al., 2014) and of
the repeated measurements discussed in Sect. 5, the measurements were not
sufficient to support rejecting the modeled CO<inline-formula><mml:math id="M172" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>. Similar CO<inline-formula><mml:math id="M173" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>
hotspots occurred in the Southern Ocean west of South America in February
2005, around the latitudinal zone of 50<inline-formula><mml:math id="M174" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> S. The modeled CO<inline-formula><mml:math id="M175" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>
distributions by Takahashi et al. (2014) also showed CO<inline-formula><mml:math id="M176" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> hotspots around
the latitudinal zone of 30<inline-formula><mml:math id="M177" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> S in the same month and region. Their
model used a completely different interpolation scheme based on a
diffusion–advection transport model for surface waters. In principle, this
hotspot CO<inline-formula><mml:math id="M178" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> was produced by our models using measurements somewhere else
where the biogeochemical properties were similar to those in the hotspot
areas. As the SOM does not make extrapolation, the SVM has a low possibility
of over-extrapolation, and the hotspots appeared in all models, the risk of
accepting them would not be high.</p>
</sec>

      
      </body>
    <back><notes notes-type="dataavailability">

      <p>The software and data used by this study are available at
<uri>https://figshare.com/s/38488b7003b03e2103c9</uri>.</p>

      <p>The registered DOI of the package is <ext-link xlink:href="http://dx.doi.org/10.6084/m9.figshare.4877390" ext-link-type="DOI">10.6084/m9.figshare.4877390</ext-link>.</p>
  </notes><?xmltex \hack{\clearpage}?><app-group>

<app id="App1.Ch1.S1">
  <title>Self-organizing map</title>
      <p>A self-organizing map (SOM) is a type of artificial neural
network that is trained using unsupervised learning (Kohonen, 1984). The SOM
in our application comprises grid points on a two-dimensional plane. Each
grid point, also called a neuron cell, has the same number of parameters as
the input variables, which include LAT, SST, SSS, CHL, MLD, and dSST in our
case. Training the SOM is to use samples of input variables to adjust the
parameters to make neighborhood neuron
cells with similar parameter values that reflect certain biogeochemical
features of the surface ocean.</p>
      <p>We used the batch learning algorithm (Abe et al., 2002) to train the SOM as
the result does not depend on the sequential order of training samples. The
parameters were initialized randomly in the range (<inline-formula><mml:math id="M179" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>1, 1). In each iterative training loop, each training
sample is associated with a neuron cell to which the distance defined as
follows is smaller than to other neuron cells:
          <disp-formula id="App1.Ch1.E1" content-type="numbered"><mml:math id="M180" display="block"><mml:mrow><mml:mi>d</mml:mi><mml:mo>=</mml:mo><mml:mfenced close="|" open="|"><mml:mi mathvariant="bold">f</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>)</mml:mo></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
        where <inline-formula><mml:math id="M181" display="inline"><mml:mi mathvariant="bold-italic">p</mml:mi></mml:math></inline-formula> denotes the vector of neuron
cell parameters, <inline-formula><mml:math id="M182" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> the vector of input variables, and <inline-formula><mml:math id="M183" display="inline"><mml:mi mathvariant="bold">f</mml:mi></mml:math></inline-formula>
the scale matrix that we introduced to change the influence of certain
variables on the distance. The components of <inline-formula><mml:math id="M184" display="inline"><mml:mi mathvariant="bold">f</mml:mi></mml:math></inline-formula> are all 0 except
for those on the diagonal, which are set to 1 by default. In our application,
the data for each input variable were scaled to be unitless by its mean and
standard deviation to eliminate the effect of units on the distance.</p>
      <p>The associated neuron cell is called the best matching cell (BMC). After the
BMCs for all training samples are found, the parameters are updated by
          <disp-formula id="App1.Ch1.E2" content-type="numbered"><mml:math id="M185" display="block"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mo>∑</mml:mo><mml:mi>k</mml:mi></mml:msub><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mo>∑</mml:mo><mml:mi>k</mml:mi></mml:msub><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
        where <inline-formula><mml:math id="M186" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M187" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> denote indexes of neuron cells and training samples,
respectively. The neighborhood function that determines the weight factor <inline-formula><mml:math id="M188" display="inline"><mml:mi>h</mml:mi></mml:math></inline-formula>
is defined as
          <disp-formula id="App1.Ch1.E3" content-type="numbered"><mml:math id="M189" display="block"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>exp⁡</mml:mi><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mfenced close="|" open="|"><mml:msub><mml:mi mathvariant="bold-italic">r</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mfenced></mml:mrow><mml:mi>q</mml:mi></mml:mfrac></mml:mstyle><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
        where <inline-formula><mml:math id="M190" display="inline"><mml:mrow><mml:mfenced close="|" open="|"><mml:msub><mml:mi mathvariant="bold-italic">r</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mfenced></mml:mrow></mml:math></inline-formula> denotes the geographic distance between the
<inline-formula><mml:math id="M191" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th neuron cell and the BMC of the <inline-formula><mml:math id="M192" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>th training sample and <inline-formula><mml:math id="M193" display="inline"><mml:mi>q</mml:mi></mml:math></inline-formula> is a
factor that decreases linearly with the iteration loop. In other words, the
procedure adjusts the parameters of neuron cells toward those training
samples whose BMCs are close to them and the amount of adjustment decreases
exponentially with the geographic distance between neuron cells and linearly
with the training loop.</p>
      <p>The trained SOM needs to be labeled by <inline-formula><mml:math id="M194" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M195" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> for making predictions.
The values of <inline-formula><mml:math id="M196" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M197" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> measurements are assigned to their BMC. Predicting
<inline-formula><mml:math id="M198" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M199" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> for a set of input variables is realized by finding the BMC
labeled with <inline-formula><mml:math id="M200" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M201" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> and extracting its mean <inline-formula><mml:math id="M202" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M203" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> value.</p><?xmltex \hack{\newpage}?>
<sec id="App1.Ch1.S1.SS1">
  <title>Feedforward neural network</title>
      <p>A feedforward neural network (FNN) is an artificial neural network that is
trained using supervised learning. Our FNN comprises three layers (Zeng et
al., 2014): an input layer, a hidden layer, and an output layer. The number
of neurons in the input layers is determined by the number of input
variables, i.e., LAT, SST, SSS, CHL, MLD, and dSST in our case. The output
layer has only one neuron for <inline-formula><mml:math id="M204" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M205" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>. Each neuron in the hidden layer
uses the following kernel function to transform all input variables:
            <disp-formula id="App1.Ch1.E4" content-type="numbered"><mml:math id="M206" display="block"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>h</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi>exp⁡</mml:mi><mml:mfenced close=")" open="("><mml:mo>-</mml:mo><mml:mo>(</mml:mo><mml:mi>b</mml:mi><mml:mo>+</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>)</mml:mo></mml:mfenced></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M207" display="inline"><mml:mi mathvariant="bold-italic">w</mml:mi></mml:math></inline-formula> denotes the vector of weight parameters and <inline-formula><mml:math id="M208" display="inline"><mml:mi>b</mml:mi></mml:math></inline-formula> the offset
parameter. The <inline-formula><mml:math id="M209" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi mathvariant="normal">h</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> of all hidden neurons become the inputs of the output
neuron, which uses the same kernel function to transform <inline-formula><mml:math id="M210" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi mathvariant="normal">h</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> to produce
<inline-formula><mml:math id="M211" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M212" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>.</p>
      <p>The training updates the offset and weight parameters, which are initialized
randomly in the range (<inline-formula><mml:math id="M213" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>1, 1), by minimizing the cost function
            <disp-formula id="App1.Ch1.E5" content-type="numbered"><mml:math id="M214" display="block"><mml:mrow><mml:mi>f</mml:mi><mml:mfenced close=")" open="("><mml:msup><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mo>′</mml:mo></mml:msup></mml:mfenced><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:msup><mml:mi mathvariant="bold-italic">e</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="bold-italic">e</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:msup><mml:mfenced close="|" open="|"><mml:msub><mml:mi mathvariant="bold">y</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="bold">y</mml:mi><mml:mi>o</mml:mi></mml:msub></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M215" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mo>′</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> is the extended vector that includes <inline-formula><mml:math id="M216" display="inline"><mml:mi>b</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M217" display="inline"><mml:mi mathvariant="bold-italic">w</mml:mi></mml:math></inline-formula>;
<inline-formula><mml:math id="M218" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi mathvariant="normal">m</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M219" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi mathvariant="normal">o</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> stand for the vectors of
modeled and observed <inline-formula><mml:math id="M220" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M221" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, respectively. In the gradient descent
training algorithm, updating <inline-formula><mml:math id="M222" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mo>′</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> at the training iteration <inline-formula><mml:math id="M223" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> can be
expressed as
            <disp-formula id="App1.Ch1.E6" content-type="numbered"><mml:math id="M224" display="block"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mo>′</mml:mo></mml:msup><mml:mfenced close=")" open="("><mml:mi>t</mml:mi></mml:mfenced><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mo>′</mml:mo></mml:msup><mml:mfenced close=")" open="("><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mfenced><mml:mo>-</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mi mathvariant="bold-italic">g</mml:mi></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M225" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> is the learning rate (a positive number smaller than 1), and
<inline-formula><mml:math id="M226" display="inline"><mml:mi mathvariant="bold-italic">g</mml:mi></mml:math></inline-formula> the first-order derivative of the cost function
            <disp-formula id="App1.Ch1.E7" content-type="numbered"><mml:math id="M227" display="block"><mml:mrow><mml:mi mathvariant="bold-italic">g</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="normal">∇</mml:mi><mml:mi>f</mml:mi><mml:mfenced close=")" open="("><mml:msup><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mo>′</mml:mo></mml:msup></mml:mfenced><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="bold">J</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="bold-italic">e</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M228" display="inline"><mml:mi mathvariant="bold">J</mml:mi></mml:math></inline-formula> is the Jacobian matrix whose components are derivatives of
<inline-formula><mml:math id="M229" display="inline"><mml:mi mathvariant="bold-italic">e</mml:mi></mml:math></inline-formula> with respect to <inline-formula><mml:math id="M230" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mo>′</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> using the back propagation method. We used the
efficient Levenberg–Marquardt algorithm (Wilamowski et al., 2010), which
derives the gradient as
            <disp-formula id="App1.Ch1.E8" content-type="numbered"><mml:math id="M231" display="block"><mml:mrow><mml:mi mathvariant="bold-italic">g</mml:mi><mml:mo mathvariant="bold">=</mml:mo><mml:msup><mml:mfenced open="(" close=")"><mml:msup><mml:mi mathvariant="bold">J</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="bold">J</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mi>I</mml:mi></mml:mfenced><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msup><mml:mi mathvariant="bold">J</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="bold-italic">e</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where  <inline-formula><mml:math id="M232" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> is a constant.</p>
</sec>
<sec id="App1.Ch1.S1.SS2">
  <title>Support vector machine</title>
      <p>A support vector machine (SVM) is a supervised learning model that was
conceptualized in the 1960s for classification problems and later extended to
regression analysis (Basak et al., 2007). We used the so-called least-square
support vector machine for regression (Pelckmans et al., 2002) which, similar
to FNN, seeks to minimize the error between model outputs and measurements.
The SVM models the dependence of <inline-formula><mml:math id="M233" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M234" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> on LAT, SST, SSS, CHL, MLD, and
dSST as
            <disp-formula id="App1.Ch1.E9" content-type="numbered"><mml:math id="M235" display="block"><mml:mrow><mml:mi>y</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">c</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="italic">φ</mml:mi><mml:mfenced close=")" open="("><mml:mi>x</mml:mi></mml:mfenced><mml:mo>+</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M236" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>  stands for a set of measurements of the input variables,
<inline-formula><mml:math id="M237" display="inline"><mml:mi mathvariant="bold-italic">c</mml:mi></mml:math></inline-formula> the vector of coefficients, <inline-formula><mml:math id="M238" display="inline"><mml:mi>b</mml:mi></mml:math></inline-formula> the offset parameter, and <inline-formula><mml:math id="M239" display="inline"><mml:mi mathvariant="italic">ϕ</mml:mi></mml:math></inline-formula> the kernel
function. In this investigation, we used the radial basis kernel
function, i.e.,
            <disp-formula id="App1.Ch1.E10" content-type="numbered"><mml:math id="M240" display="block"><mml:mrow><mml:mi mathvariant="italic">φ</mml:mi><mml:msup><mml:mfenced close=")" open="("><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mfenced><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="italic">φ</mml:mi><mml:mfenced close=")" open="("><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mfenced><mml:mo>=</mml:mo><mml:mi>exp⁡</mml:mi><mml:mfenced close=")" open="("><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msup><mml:mfenced open="|" close="|"><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mstyle></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M241" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> is a parameter whose optimal value depends on the data used
for training. The subscription of <inline-formula><mml:math id="M242" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>  indicates a sample of input
variables.</p>
      <p>Given a set of training samples <inline-formula><mml:math id="M243" display="inline"><mml:mrow><mml:msubsup><mml:mfenced close="}" open="{"><mml:msub><mml:mi>x</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mfenced><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>,
the goal of training SVM is to minimize the cost function
            <disp-formula id="App1.Ch1.E11" content-type="numbered"><mml:math id="M244" display="block"><mml:mrow><mml:mi>F</mml:mi><mml:mfenced close=")" open="("><mml:mi mathvariant="bold-italic">c</mml:mi></mml:mfenced><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mfenced open="(" close=")"><mml:msup><mml:mi mathvariant="bold-italic">c</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="bold-italic">c</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="italic">γ</mml:mi><mml:msup><mml:mi mathvariant="bold-italic">e</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="bold-italic">e</mml:mi></mml:mfenced></mml:mrow></mml:math></disp-formula>
          where
            <disp-formula id="App1.Ch1.E12" content-type="numbered"><mml:math id="M245" display="block"><mml:mrow><mml:msub><mml:mi>e</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">c</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="italic">φ</mml:mi><mml:mfenced close=")" open="("><mml:msub><mml:mi>x</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mfenced><mml:mo>-</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:math></disp-formula>
          and <inline-formula><mml:math id="M246" display="inline"><mml:mi mathvariant="italic">γ</mml:mi></mml:math></inline-formula> is a constant whose optimal value depends on the data used for
training. The Lagrangian solution for the optimization problem of Eq. (A11)
is given by

                <disp-formula specific-use="align" content-type="numbered"><mml:math id="M247" display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi>L</mml:mi><mml:mfenced close=")" open="("><mml:mi>c</mml:mi><mml:mo>,</mml:mo><mml:mi>e</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">c</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="bold-italic">c</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="italic">γ</mml:mi><mml:mfenced close="|" open="|"><mml:mi mathvariant="bold-italic">e</mml:mi></mml:mfenced><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="App1.Ch1.E13"><mml:mtd/><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mo>-</mml:mo><mml:msubsup><mml:mo>∑</mml:mo><mml:mi>k</mml:mi><mml:mi>N</mml:mi></mml:msubsup><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mfenced open="{" close="}"><mml:msup><mml:mi mathvariant="bold-italic">c</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="italic">φ</mml:mi><mml:mfenced open="(" close=")"><mml:msub><mml:mi>x</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mfenced><mml:mo>+</mml:mo><mml:mi>b</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi>e</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            <?xmltex \hack{\newpage}?><?xmltex \hack{\noindent}?>where <inline-formula><mml:math id="M248" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is a Lagrangian multiplier. The optimal conditions of
Eq. (A13) are

                <disp-formula specific-use="align" content-type="numbered"><mml:math id="M249" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="App1.Ch1.E14"><mml:mtd/><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>→</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mi mathvariant="italic">φ</mml:mi><mml:mfenced close=")" open="("><mml:msub><mml:mi>x</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="App1.Ch1.E15"><mml:mtd/><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>→</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msubsup><mml:mo>∑</mml:mo><mml:mi>k</mml:mi><mml:mi>N</mml:mi></mml:msubsup><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="App1.Ch1.E16"><mml:mtd/><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi>e</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>→</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="italic">γ</mml:mi><mml:msub><mml:mi>e</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="App1.Ch1.E17"><mml:mtd/><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>→</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi>c</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mi mathvariant="italic">φ</mml:mi><mml:mfenced close=")" open="("><mml:msub><mml:mi>x</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mfenced><mml:mo>+</mml:mo><mml:mi>b</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi>e</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula></p>
      <p>After eliminating <inline-formula><mml:math id="M250" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M251" display="inline"><mml:mi>e</mml:mi></mml:math></inline-formula> from the above conditions, the following equation is
obtained:
            <disp-formula id="App1.Ch1.E18" content-type="numbered"><mml:math id="M252" display="block"><mml:mrow><mml:mfenced close="]" open="["><mml:mtable class="array" columnalign="center center"><mml:mtr><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi mathvariant="bold-italic">u</mml:mi></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="normal">Ω</mml:mi><mml:mo>+</mml:mo><mml:msup><mml:mi mathvariant="italic">γ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mi>I</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mfenced open="[" close="]"><mml:mtable class="array" columnalign="center"><mml:mtr><mml:mtd><mml:mi>b</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi mathvariant="italic">α</mml:mi></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mo>=</mml:mo><mml:mfenced close="]" open="["><mml:mtable class="array" columnalign="center"><mml:mtr><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>y</mml:mi></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M253" display="inline"><mml:mi mathvariant="bold-italic">u</mml:mi></mml:math></inline-formula> is a vector with all components being 1, and the components
of <inline-formula><mml:math id="M254" display="inline"><mml:mi mathvariant="normal">Ω</mml:mi></mml:math></inline-formula> are
            <disp-formula id="App1.Ch1.E19" content-type="numbered"><mml:math id="M255" display="block"><mml:mrow><mml:msub><mml:mi mathvariant="normal">Ω</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="italic">φ</mml:mi><mml:msup><mml:mfenced open="(" close=")"><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mfenced><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="italic">φ</mml:mi><mml:mfenced open="(" close=")"><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p>
      <p>Once Eq. (A18) is solved, making a prediction is done by
            <disp-formula id="App1.Ch1.E20" content-type="numbered"><mml:math id="M256" display="block"><mml:mrow><mml:mi>y</mml:mi><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mo>∑</mml:mo><mml:mi>k</mml:mi><mml:mi>N</mml:mi></mml:msubsup><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mi mathvariant="italic">φ</mml:mi><mml:msup><mml:mfenced open="(" close=")"><mml:msub><mml:mi>x</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mfenced><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="italic">φ</mml:mi><mml:mfenced open="(" close=")"><mml:mi>x</mml:mi></mml:mfenced><mml:mo>+</mml:mo><mml:mi>b</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p><?xmltex \hack{\clearpage}?>
</sec>
</app>
  </app-group><notes notes-type="competinginterests">

      <p>The authors declare that they have no conflict of interest.</p>
  </notes><ack><title>Acknowledgements</title><p>The Surface Ocean CO<inline-formula><mml:math id="M257" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> Atlas (SOCAT) is an international effort, endorsed
by the International Ocean Carbon Coordination Project (IOCCP), the Surface
Ocean Lower Atmosphere Study (SOLAS), and the Integrated Marine
Biogeochemistry and Ecosystem Research program (IMBER), to deliver a
uniformly quality-controlled surface ocean CO<inline-formula><mml:math id="M258" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> database. The many
researchers and funding agencies responsible for the collection of data and
quality control are thanked for their contributions to SOCAT.<?xmltex \hack{\newline}?><?xmltex \hack{\newline}?> Edited by: J. M. Huthnance<?xmltex \hack{\newline}?> Reviewed by: three
anonymous referees</p></ack><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><mixed-citation>
Abe, T., Kanaya, S., Kinouchi, M., Ichiba, Y., Kozuki, T., and Ikemura, T.: A
Novel Bioinformatic Strategy for Unveiling Hidden Genome Signatures of Eukaryotes:
Self-Organizing Map of Oligonucleotide Frequency, Genom. Inform., 13, 12–20, 2002.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><mixed-citation>Bakker, D. C. E., Pfeil, B., Smith, K., Hankin, S., Olsen, A., Alin, S. R.,
Cosca, C., Harasawa, S., Kozyr, A., Nojiri, Y., O'Brien, K. M., Schuster, U.,
Telszewski, M., Tilbrook, B., Wada, C., Akl, J., Barbero, L., Bates, N. R.,
Boutin, J., Bozec, Y., Cai, W.-J., Castle, R. D., Chavez, F. P., Chen, L.,
Chierici, M., Currie, K., de Baar, H. J. W., Evans, W., Feely, R. A.,
Fransson, A., Gao, Z., Hales, B., Hardman-Mountford, N. J., Hoppema, M.,
Huang, W.-J., Hunt, C. W., Huss, B., Ichikawa, T., Johannessen, T., Jones, E.
M., Jones, S. D., Jutterström, S., Kitidis, V., Körtzinger, A., Landschützer,
P., Lauvset, S. K., Lefèvre, N., Manke, A. B., Mathis, J. T., Merlivat, L.,
Metzl, N., Murata, A., Newberger, T., Omar, A. M., Ono, T., Park, G.-H.,
Paterson, K., Pierrot, D., Ríos, A. F., Sabine, C. L., Saito, S., Salisbury,
J., Sarma, V. V. S. S., Schlitzer, R., Sieger, R., Skjelvan, I., Steinhoff,
T., Sullivan, K. F., Sun, H., Sutton, A. J., Suzuki, T., Sweeney, C.,
Takahashi, T., Tjiputra, J., Tsurushima, N., van Heuven, S. M. A. C.,
Vandemark, D., Vlahos, P., Wallace, D. W. R., Wanninkhof, R., and Watson, A.
J.: An update to the Surface Ocean CO<inline-formula><mml:math id="M259" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> Atlas (SOCAT version 2), Earth Syst.
Sci. Data, 6, 69–90, <ext-link xlink:href="http://dx.doi.org/10.5194/essd-6-69-2014" ext-link-type="DOI">10.5194/essd-6-69-2014</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><mixed-citation>
Basak, D., Pal, S., and Patranabis, D. C.:  Support vector regression, Neu. Inf.
Pro.-Letters and Reviews, 11, 203–224, 2007.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><mixed-citation>
Boyer, T. P., Antonov, J. I., Baranova, O. K., Coleman, C., Garcia, H. E.,
Grodsky, A., Johnson, D. R., Locarnini, R. A., Mishonov, A. V., O'Brien, T.
D., Paver, C. R., Reagan, J. R., Seidov, D., Smolyar, I. V., and Zweng, M.
M.: World Ocean Database 2013, NOAA Atlas NESDIS 72, edited by: Levitus, S.
and Mishonov, A., Technical Ed., Silver Spring, MD, 209 pp. 2013.</mixed-citation></ref>
      <ref id="bib1.bib5"><label>5</label><mixed-citation>Chierici, M., Fransson, A., and Nojiri, Y.: Biogeochemical processes as
drivers of surface <inline-formula><mml:math id="M260" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M261" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> in contrasting provinces in the subarctic North
Pacific Ocean, Global Biogeochem. Cy., 20, GB1009, <ext-link xlink:href="http://dx.doi.org/10.1029/2004GB002356" ext-link-type="DOI">10.1029/2004GB002356</ext-link>,
2006.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><mixed-citation>
Dee, D. P., Uppala, S. M., Simmons, A. J., Berrisford, P., Poli, P.,
Kobayashi, S., Andrae, U., Balmaseda, M. A., Balsamo, G., Bauer, P.,
Bechtold, P., Beljaars, A. C. M., van de Berg, L., Bidlot, J., Bormann, N.,
Delsol, C., Dragani, R., Fuentes, M., Geer, A. J., Haimberger, L., Healy, S.
B., Hersbach, H., Hólm, E. V., Isaksen, L., Kållberg, P., Köhler,
M., Matricardi, M., McNally, A. P., Monge-Sanz, B. M., Morcrette, J.-J.,
Park, B.-K., Peubey, C., de Rosnay, P., Tavolato, C., Thépaut, J.-N., and
Vitart, F.: The ERA-Interim reanalysis: configuration and performance of
the data assimilation system, Q. J. Roy. Meteor. Soc., 137, 553–597, 2011.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><mixed-citation>Friedrich, T. and Oschlies, A.: Neural network-based estimates of North
Atlantic surface <inline-formula><mml:math id="M262" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M263" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> from satellite data: A methodological study, J.
Geophys. Res., 114, C03020, <ext-link xlink:href="http://dx.doi.org/10.1029/2007JC004646" ext-link-type="DOI">10.1029/2007JC004646</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><mixed-citation>Goddijn-Murphy, L. M., Woolf, D. K., Land, P. E., Shutler, J. D., and Donlon,
C.: The OceanFlux Greenhouse Gases methodology for deriving a sea surface
climatology of CO<inline-formula><mml:math id="M264" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> fugacity in support of air–sea gas flux studies,
Ocean Sci., 11, 519–541, <ext-link xlink:href="http://dx.doi.org/10.5194/os-11-519-2015" ext-link-type="DOI">10.5194/os-11-519-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><mixed-citation>Iida, Y., Kojima, A., Takatani, Y., Nakano, T., Sugimoto, H., Midorikawa, T.,
and Ishii, M.: Trends in <inline-formula><mml:math id="M265" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M266" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> and sea-air CO<inline-formula><mml:math id="M267" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> flux over the
global open oceans for the last two decades, J. Oceanogr., 71, 637–661, 2015.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><mixed-citation>Jamet, C., Moulin, C., and Lefèvre, N.: Estimation of the oceanic <inline-formula><mml:math id="M268" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M269" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>
in the North Atlantic from VOS lines in-situ measurements: parameters needed
to generate seasonally mean maps, Ann. Geophys., 25, 2247–2257,
<ext-link xlink:href="http://dx.doi.org/10.5194/angeo-25-2247-2007" ext-link-type="DOI">10.5194/angeo-25-2247-2007</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><mixed-citation>
Jones, S. D., Quéré, C. L., Rödenbeck, C., Manning, A. C., and
Olsen, A.: A statistical gap-filling method to interpolate global monthly
surface ocean carbon dioxide data, J. Adv. Model. Earth Syst., 7, 1942–2466,
2015.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><mixed-citation>
Kohonen, T.: Self-Organization and Associative Memory, Springer, Berlin,
1984.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><mixed-citation>Landschützer, P., Gruber, N., Bakker, D. C. E., Schuster, U., Nakaoka, S.,
Payne, M. R., Sasse, T. P., and Zeng, J.: A neural network-based estimate of
the seasonal to inter-annual variability of the Atlantic Ocean carbon sink,
Biogeosciences, 10, 7793–7815, <ext-link xlink:href="http://dx.doi.org/10.5194/bg-10-7793-2013" ext-link-type="DOI">10.5194/bg-10-7793-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><mixed-citation>
Landschützer, P., Gruber, N., Haumann, F., Rödenbeck, C., Bakker, D.,
van Heuven, S., Hoppema, M., Metzl, N., Sweeney, C., Takahashi, T.,
Tilbrook, B.,
and Wanninkhof, R.: The reinvigoration of the Southern Ocean carbon sink,
Science, 349, 1221–1224, 2015.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><mixed-citation>Le Quéré, C., Moriarty, R., Andrew, R. M., Canadell, J. G., Sitch, S.,
Korsbakken, J. I., Friedlingstein, P., Peters, G. P., Andres, R. J., Boden,
T. A., Houghton, R. A., House, J. I., Keeling, R. F., Tans, P., Arneth, A.,
Bakker, D. C. E., Barbero, L., Bopp, L., Chang, J., Chevallier, F., Chini, L.
P., Ciais, P., Fader, M., Feely, R. A., Gkritzalis, T., Harris, I., Hauck,
J., Ilyina, T., Jain, A. K., Kato, E., Kitidis, V., Klein Goldewijk, K.,
Koven, C., Landschützer, P., Lauvset, S. K., Lefèvre, N., Lenton, A., Lima,
I. D., Metzl, N., Millero, F., Munro, D. R., Murata, A., Nabel, J. E. M. S.,
Nakaoka, S., Nojiri, Y., O'Brien, K., Olsen, A., Ono, T., Pérez, F. F.,
Pfeil, B., Pierrot, D., Poulter, B., Rehder, G., Rödenbeck, C., Saito, S.,
Schuster, U., Schwinger, J., Séférian, R., Steinhoff, T., Stocker, B. D.,
Sutton, A. J., Takahashi, T., Tilbrook, B., van der Laan-Luijkx, I. T., van
der Werf, G. R., van Heuven, S., Vandemark, D., Viovy, N., Wiltshire, A.,
Zaehle, S., and Zeng, N.: Global Carbon Budget 2015, Earth Syst. Sci. Data,
7, 349–396, <ext-link xlink:href="http://dx.doi.org/10.5194/essd-7-349-2015" ext-link-type="DOI">10.5194/essd-7-349-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><mixed-citation>Lefèvre, N., Watson, A. J., and Watson, A. R.: A comparison of multiple
regression and neural network techniques for mapping in situ <inline-formula><mml:math id="M270" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M271" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>
data, Tellus B, 57, 375–384, 2005.</mixed-citation></ref>
      <ref id="bib1.bib17"><label>17</label><mixed-citation>Nakaoka, S., Telszewski, M., Nojiri, Y., Yasunaka, S., Miyazaki, C., Mukai,
H., and Usui, N.: Estimating temporal and spatial variation of ocean surface
<inline-formula><mml:math id="M272" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M273" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> in the North Pacific using a self-organizing map neural network
technique, Biogeosciences, 10, 6093–6106, <ext-link xlink:href="http://dx.doi.org/10.5194/bg-10-6093-2013" ext-link-type="DOI">10.5194/bg-10-6093-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><mixed-citation>Park, G.-H., Wanninkhof, R., Doney, S. C., Takahashi, T., Lee, K., Feely, R.
A., Sabine, C. L., Triñanes, J., and Lima, I. D.: Variability of global
net sea-air CO<inline-formula><mml:math id="M274" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> fluxes over the last three decades using empirical
relationships, Tellus B, 62, 352–368, <ext-link xlink:href="http://dx.doi.org/10.1111/j.1600-0889.2010.00498.x" ext-link-type="DOI">10.1111/j.1600-0889.2010.00498.x</ext-link>,
2010.</mixed-citation></ref>
      <ref id="bib1.bib19"><label>19</label><mixed-citation>Pelckmans, K., Suykens, J. A. K., Gestel, T. V., Brabanter, J. D., Hamers,
B., Moor, D., and Vandewalle, J.: LS-SVMlab: a MATLAB/C toolbox for Least
Squares Support Vector Machines,
<uri>http://www.esat.kuleuven.ac.be/sista/lssvmlab</uri>, last access: April 2017,
presented at Neural Information Processing Systems (NIPS 2002), 2002.</mixed-citation></ref>
      <ref id="bib1.bib20"><label>20</label><mixed-citation>Pfeil, B., Olsen, A., Bakker, D. C. E., Hankin, S., Koyuk, H., Kozyr, A.,
Malczyk, J., Manke, A., Metzl, N., Sabine, C. L., Akl, J., Alin, S. R.,
Bates, N., Bellerby, R. G. J., Borges, A., Boutin, J., Brown, P. J., Cai,
W.-J., Chavez, F. P., Chen, A., Cosca, C., Fassbender, A. J., Feely, R. A.,
González-Dávila, M., Goyet, C., Hales, B., Hardman-Mountford, N., Heinze, C.,
Hood, M., Hoppema, M., Hunt, C. W., Hydes, D., Ishii, M., Johannessen, T.,
Jones, S. D., Key, R. M., Körtzinger, A., Landschützer, P., Lauvset, S. K.,
Lefèvre, N., Lenton, A., Lourantou, A., Merlivat, L., Midorikawa, T.,
Mintrop, L., Miyazaki, C., Murata, A., Nakadate, A., Nakano, Y., Nakaoka, S.,
Nojiri, Y., Omar, A. M., Padin, X. A., Park, G.-H., Paterson, K., Perez, F.
F., Pierrot, D., Poisson, A., Ríos, A. F., Santana-Casiano, J. M., Salisbury,
J., Sarma, V. V. S. S., Schlitzer, R., Schneider, B., Schuster, U., Sieger,
R., Skjelvan, I., Steinhoff, T., Suzuki, T., Takahashi, T., Tedesco, K.,
Telszewski, M., Thomas, H., Tilbrook, B., Tjiputra, J., Vandemark, D.,
Veness, T., Wanninkhof, R., Watson, A. J., Weiss, R., Wong, C. S., and
Yoshikawa-Inoue, H.: A uniform, quality controlled Surface Ocean CO<inline-formula><mml:math id="M275" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> Atlas
(SOCAT), Earth Syst. Sci. Data, 5, 125–143, <ext-link xlink:href="http://dx.doi.org/10.5194/essd-5-125-2013" ext-link-type="DOI">10.5194/essd-5-125-2013</ext-link>,
2013.</mixed-citation></ref>
      <ref id="bib1.bib21"><label>21</label><mixed-citation>
Reynolds, R. W., Rayner, N. A., Smith, T. M., Stokes, D. C., and Wang, W.:
An Improved In Situ and Satellite SST Analysis for Climate, J. Climate,
15, 1609–1625, 2002.</mixed-citation></ref>
      <ref id="bib1.bib22"><label>22</label><mixed-citation>Rödenbeck, C., Keeling, R. F., Bakker, D. C. E., Metzl, N., Olsen, A.,
Sabine, C., and Heimann, M.: Global surface-ocean <inline-formula><mml:math id="M276" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M277" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> and sea–air
CO<inline-formula><mml:math id="M278" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> flux variability from an observation-driven ocean mixed-layer
scheme, Ocean Sci., 9, 193–216, <ext-link xlink:href="http://dx.doi.org/10.5194/os-9-193-2013" ext-link-type="DOI">10.5194/os-9-193-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib23"><label>23</label><mixed-citation>Rödenbeck, C., Bakker, D. C. E., Gruber, N., Iida, Y., Jacobson, A. R.,
Jones, S., Landschützer, P., Metzl, N., Nakaoka, S., Olsen, A., Park, G.-H.,
Peylin, P., Rodgers, K. B., Sasse, T. P., Schuster, U., Shutler, J. D.,
Valsala, V., Wanninkhof, R., and Zeng, J.: Data-based estimates of the ocean
carbon sink variability – first results of the Surface Ocean <inline-formula><mml:math id="M279" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M280" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>
Mapping intercomparison (SOCOM), Biogeosciences, 12, 7251–7278,
<ext-link xlink:href="http://dx.doi.org/10.5194/bg-12-7251-2015" ext-link-type="DOI">10.5194/bg-12-7251-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib24"><label>24</label><mixed-citation>Sabine, C. L., Hankin, S., Koyuk, H., Bakker, D. C. E., Pfeil, B., Olsen, A.,
Metzl, N., Kozyr, A., Fassbender, A., Manke, A., Malczyk, J., Akl, J., Alin,
S. R., Bellerby, R. G. J., Borges, A., Boutin, J., Brown, P. J., Cai, W.-J.,
Chavez, F. P., Chen, A., Cosca, C., Feely, R. A., González-Dávila, M., Goyet,
C., Hardman-Mountford, N., Heinze, C., Hoppema, M., Hunt, C. W., Hydes, D.,
Ishii, M., Johannessen, T., Key, R. M., Körtzinger, A., Landschützer, P.,
Lauvset, S. K., Lefèvre, N., Lenton, A., Lourantou, A., Merlivat, L.,
Midorikawa, T., Mintrop, L., Miyazaki, C., Murata, A., Nakadate, A., Nakano,
Y., Nakaoka, S., Nojiri, Y., Omar, A. M., Padin, X. A., Park, G.-H.,
Paterson, K., Perez, F. F., Pierrot, D., Poisson, A., Ríos, A. F., Salisbury,
J., Santana-Casiano, J. M., Sarma, V. V. S. S., Schlitzer, R., Schneider, B.,
Schuster, U., Sieger, R., Skjelvan, I., Steinhoff, T., Suzuki, T., Takahashi,
T., Tedesco, K., Telszewski, M., Thomas, H., Tilbrook, B., Vandemark, D.,
Veness, T., Watson, A. J., Weiss, R., Wong, C. S., and Yoshikawa-Inoue, H.:
Surface Ocean CO<inline-formula><mml:math id="M281" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> Atlas (SOCAT) gridded data products, Earth Syst. Sci.
Data, 5, 145–153, <ext-link xlink:href="http://dx.doi.org/10.5194/essd-5-145-2013" ext-link-type="DOI">10.5194/essd-5-145-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib25"><label>25</label><mixed-citation>Sarma, V. V. S. S., Saino, T., Sasaoka, K., Nojiri, Y., Ono, T., Ishii, M.,
Inoue, H. Y., and Matsumoto, K.: Basin-scale <inline-formula><mml:math id="M282" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M283" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> distribution using
satellite sea surface temperature, Chla, and climatological salinity in the
North Pacific in spring and summer, Global Biogeochem. Cy., 20, GB3005,
<ext-link xlink:href="http://dx.doi.org/10.1029/2005GB002594" ext-link-type="DOI">10.1029/2005GB002594</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bib26"><label>26</label><mixed-citation>Sasse, T. P., McNeil, B. I., and Abramowitz, G.: A new constraint on global
air-sea CO<inline-formula><mml:math id="M284" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> fluxes using bottle carbon data, Geophys. Res. Lett., 40,
1594–1599, 2013.</mixed-citation></ref>
      <ref id="bib1.bib27"><label>27</label><mixed-citation>Schmidtko, S., Johnson, G. C., and Lyman, J. M.: MIMOC: A global monthly
isopycnal upper-ocean climatology with mixed layers, J. Geophys. Res., 118,
1658–1672, <ext-link xlink:href="http://dx.doi.org/10.1002/jgrc.20122" ext-link-type="DOI">10.1002/jgrc.20122</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib28"><label>28</label><mixed-citation>
Stocker, T., Qin, D., and Platner, G.-K.: Climate Change 2013 The Physical
Science Basis, Cambridge University Press, Cambridge, United Kingdom, 2013.</mixed-citation></ref>
      <ref id="bib1.bib29"><label>29</label><mixed-citation>Takahashi, T., Sutherland, S. C., Sweeney, C., Poisson, A., Metzl, N.,
Tilbrook, B., Bates, N., Wanninkhof, R., Feely, R. A., Sabine, C., Olafsson,
J., and Nojiri, Y.: Global sea-air CO<inline-formula><mml:math id="M285" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> flux based on climatological
surface ocean <inline-formula><mml:math id="M286" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M287" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, and seasonal biological and temperature effects,
Deep-Sea Res. Pt. II, 49, 1601–1622, 2002.</mixed-citation></ref>
      <ref id="bib1.bib30"><label>30</label><mixed-citation>Takahashi, T., Sutherland, S. C., Wanninkhof, R., Sweeney, C., Feely, R. A.,
Chipman, D. W., Hales, B., Friederich, G., Chavez, F., Sabine, C., Watson,
A., Bakker, D. C. E., Schuster, U., Metzl, N., Yoshikawa-Inoue, H., Ishii,
M., Midorikawa, T., Nojiri, Y., Körtzinger, A., Steinhoff, T., Hoppema,
M., Olafsson, J., Arnarson, T. S., Tilbrook, B., Johannessen, T., Olsen, A.,
Bellerby, R., and Wong, C. S.: Climatological mean and decadal change in
surface ocean <inline-formula><mml:math id="M288" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M289" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, and net sea-air CO<inline-formula><mml:math id="M290" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> flux over the global
oceans, Deep-Sea Res. Pt. II, 56, 554–577, 2009.</mixed-citation></ref>
      <ref id="bib1.bib31"><label>31</label><mixed-citation>Takahashi, T., Sutherland, S. C., Chipman, D. W., Goddard, J. G., Ho, C.,
Newberger, T., Sweeney, C., and Munro, D. R.: Climatological distributions of pH, <inline-formula><mml:math id="M291" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M292" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, total
CO<inline-formula><mml:math id="M293" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula>, alkalinity, and CaCO<inline-formula><mml:math id="M294" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:math></inline-formula> saturation in the global surface ocean, and
temporal changes at selected locations, Mar. Chem., 164, 95–145, 2014.</mixed-citation></ref>
      <ref id="bib1.bib32"><label>32</label><mixed-citation>Takamura, T. R., Inoue, H. Y., Midorikawa, T., Ishii, M., and Nojiri, Y.:
Seasonal and Inter-Annual Variations in <inline-formula><mml:math id="M295" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M296" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> sea and Air-Sea CO<inline-formula><mml:math id="M297" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> Fluxes
in Mid-Latitudes of the Western and Eastern North Pacific during 1999–2006:
Recent Results Utilizing Voluntary Observation Ships, J. Meteorol. Soc. Jpn.,
88, 883–898, 2010.</mixed-citation></ref>
      <ref id="bib1.bib33"><label>33</label><mixed-citation>Telszewski, M., Chazottes, A., Schuster, U., Watson, A. J., Moulin, C.,
Bakker, D. C. E., González-Dávila, M., Johannessen, T., Körtzinger, A.,
Lüger, H., Olsen, A., Omar, A., Padin, X. A., Ríos, A. F., Steinhoff, T.,
Santana-Casiano, M., Wallace, D. W. R., and Wanninkhof, R.: Estimating the
monthly <inline-formula><mml:math id="M298" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M299" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> distribution in the North Atlantic using a
self-organizing neural network, Biogeosciences, 6, 1405–1421,
<ext-link xlink:href="http://dx.doi.org/10.5194/bg-6-1405-2009" ext-link-type="DOI">10.5194/bg-6-1405-2009</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bib34"><label>34</label><mixed-citation>Wanninkhof, R., Park, G.-H., Takahashi, T., Sweeney, C., Feely, R., Nojiri,
Y., Gruber, N., Doney, S. C., McKinley, G. A., Lenton, A., Le Quéré, C.,
Heinze, C., Schwinger, J., Graven, H., and Khatiwala, S.: Global ocean carbon
uptake: magnitude, variability and trends, Biogeosciences, 10, 1983–2000,
<ext-link xlink:href="http://dx.doi.org/10.5194/bg-10-1983-2013" ext-link-type="DOI">10.5194/bg-10-1983-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib35"><label>35</label><mixed-citation>Wilamowski, B. M. and Yu, H.: Improved Computation for Levenberg-Marquardt
Training, IEEE T. Neural Networ., 21, 930–937, 2010.
 </mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bib36"><label>36</label><mixed-citation>Zeng, J., Nojiri, Y., Landschützer, P., Telszewski, M., and Nakaoka, S.:
A global surface ocean <inline-formula><mml:math id="M300" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M301" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> climatology based on a feedforward neural
network, J. Atmos. Ocean Tech., 31, 1838–1849, 2014.</mixed-citation></ref>
      <ref id="bib1.bib37"><label>37</label><mixed-citation>Zeng, J., Nojiri, Y., Nakaoka, S., Nakajima, H., and Shirai, T.: Surface
ocean CO<inline-formula><mml:math id="M302" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> in 1990-2011 modelled using a feed-forward neural network,
Geoscience Data Journal, 2, 47–51, 2015.</mixed-citation></ref>
      <ref id="bib1.bib38"><label>38</label><mixed-citation>Zeng, J. Y., Nojiri, Y., Murphy, P. P., Wong, C. S., and Fujinuma, Y.: A
comparison of Delta <inline-formula><mml:math id="M303" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>CO<inline-formula><mml:math id="M304" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> distributions in the northern North Pacific
using results from a commercial vessel in 1995–1999, Deep-Sea Res. Pt. II,
49, 5303–5315, 2002.</mixed-citation></ref>

  </ref-list><app-group content-type="float"><app><title/>

    </app></app-group></back>
    <!--<article-title-html>Technical note: Evaluation of three machine learning models  for surface ocean CO<sub>2</sub> mapping</article-title-html>
<abstract-html><p class="p">Reconstructing surface ocean CO<sub>2</sub> from scarce measurements plays an
important role in estimating oceanic CO<sub>2</sub> uptake. There are varying
degrees of differences among the 14 models included in the Surface Ocean
CO<sub>2</sub> Mapping (SOCOM) inter-comparison initiative, in which five models
used neural networks. This investigation evaluates two neural networks used
in SOCOM, self-organizing maps and feedforward neural networks, and
introduces a machine learning model called a support vector machine for ocean
CO<sub>2</sub> mapping. The technique note provides a practical guide to selecting
the models.</p></abstract-html>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
Abe, T., Kanaya, S., Kinouchi, M., Ichiba, Y., Kozuki, T., and Ikemura, T.: A
Novel Bioinformatic Strategy for Unveiling Hidden Genome Signatures of Eukaryotes:
Self-Organizing Map of Oligonucleotide Frequency, Genom. Inform., 13, 12–20, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
Bakker, D. C. E., Pfeil, B., Smith, K., Hankin, S., Olsen, A., Alin, S. R.,
Cosca, C., Harasawa, S., Kozyr, A., Nojiri, Y., O'Brien, K. M., Schuster, U.,
Telszewski, M., Tilbrook, B., Wada, C., Akl, J., Barbero, L., Bates, N. R.,
Boutin, J., Bozec, Y., Cai, W.-J., Castle, R. D., Chavez, F. P., Chen, L.,
Chierici, M., Currie, K., de Baar, H. J. W., Evans, W., Feely, R. A.,
Fransson, A., Gao, Z., Hales, B., Hardman-Mountford, N. J., Hoppema, M.,
Huang, W.-J., Hunt, C. W., Huss, B., Ichikawa, T., Johannessen, T., Jones, E.
M., Jones, S. D., Jutterström, S., Kitidis, V., Körtzinger, A., Landschützer,
P., Lauvset, S. K., Lefèvre, N., Manke, A. B., Mathis, J. T., Merlivat, L.,
Metzl, N., Murata, A., Newberger, T., Omar, A. M., Ono, T., Park, G.-H.,
Paterson, K., Pierrot, D., Ríos, A. F., Sabine, C. L., Saito, S., Salisbury,
J., Sarma, V. V. S. S., Schlitzer, R., Sieger, R., Skjelvan, I., Steinhoff,
T., Sullivan, K. F., Sun, H., Sutton, A. J., Suzuki, T., Sweeney, C.,
Takahashi, T., Tjiputra, J., Tsurushima, N., van Heuven, S. M. A. C.,
Vandemark, D., Vlahos, P., Wallace, D. W. R., Wanninkhof, R., and Watson, A.
J.: An update to the Surface Ocean CO<sub>2</sub> Atlas (SOCAT version 2), Earth Syst.
Sci. Data, 6, 69–90, <a href="http://dx.doi.org/10.5194/essd-6-69-2014" target="_blank">doi:10.5194/essd-6-69-2014</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
Basak, D., Pal, S., and Patranabis, D. C.:  Support vector regression, Neu. Inf.
Pro.-Letters and Reviews, 11, 203–224, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
Boyer, T. P., Antonov, J. I., Baranova, O. K., Coleman, C., Garcia, H. E.,
Grodsky, A., Johnson, D. R., Locarnini, R. A., Mishonov, A. V., O'Brien, T.
D., Paver, C. R., Reagan, J. R., Seidov, D., Smolyar, I. V., and Zweng, M.
M.: World Ocean Database 2013, NOAA Atlas NESDIS 72, edited by: Levitus, S.
and Mishonov, A., Technical Ed., Silver Spring, MD, 209 pp. 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
Chierici, M., Fransson, A., and Nojiri, Y.: Biogeochemical processes as
drivers of surface <i>f</i>CO<sub>2</sub> in contrasting provinces in the subarctic North
Pacific Ocean, Global Biogeochem. Cy., 20, GB1009, <a href="http://dx.doi.org/10.1029/2004GB002356" target="_blank">doi:10.1029/2004GB002356</a>,
2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
Dee, D. P., Uppala, S. M., Simmons, A. J., Berrisford, P., Poli, P.,
Kobayashi, S., Andrae, U., Balmaseda, M. A., Balsamo, G., Bauer, P.,
Bechtold, P., Beljaars, A. C. M., van de Berg, L., Bidlot, J., Bormann, N.,
Delsol, C., Dragani, R., Fuentes, M., Geer, A. J., Haimberger, L., Healy, S.
B., Hersbach, H., Hólm, E. V., Isaksen, L., Kållberg, P., Köhler,
M., Matricardi, M., McNally, A. P., Monge-Sanz, B. M., Morcrette, J.-J.,
Park, B.-K., Peubey, C., de Rosnay, P., Tavolato, C., Thépaut, J.-N., and
Vitart, F.: The ERA-Interim reanalysis: configuration and performance of
the data assimilation system, Q. J. Roy. Meteor. Soc., 137, 553–597, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
Friedrich, T. and Oschlies, A.: Neural network-based estimates of North
Atlantic surface <i>p</i>CO<sub>2</sub> from satellite data: A methodological study, J.
Geophys. Res., 114, C03020, <a href="http://dx.doi.org/10.1029/2007JC004646" target="_blank">doi:10.1029/2007JC004646</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
Goddijn-Murphy, L. M., Woolf, D. K., Land, P. E., Shutler, J. D., and Donlon,
C.: The OceanFlux Greenhouse Gases methodology for deriving a sea surface
climatology of CO<sub>2</sub> fugacity in support of air–sea gas flux studies,
Ocean Sci., 11, 519–541, <a href="http://dx.doi.org/10.5194/os-11-519-2015" target="_blank">doi:10.5194/os-11-519-2015</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
Iida, Y., Kojima, A., Takatani, Y., Nakano, T., Sugimoto, H., Midorikawa, T.,
and Ishii, M.: Trends in <i>p</i>CO<sub>2</sub> and sea-air CO<sub>2</sub> flux over the
global open oceans for the last two decades, J. Oceanogr., 71, 637–661, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
Jamet, C., Moulin, C., and Lefèvre, N.: Estimation of the oceanic <i>p</i>CO<sub>2</sub>
in the North Atlantic from VOS lines in-situ measurements: parameters needed
to generate seasonally mean maps, Ann. Geophys., 25, 2247–2257,
<a href="http://dx.doi.org/10.5194/angeo-25-2247-2007" target="_blank">doi:10.5194/angeo-25-2247-2007</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
Jones, S. D., Quéré, C. L., Rödenbeck, C., Manning, A. C., and
Olsen, A.: A statistical gap-filling method to interpolate global monthly
surface ocean carbon dioxide data, J. Adv. Model. Earth Syst., 7, 1942–2466,
2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
Kohonen, T.: Self-Organization and Associative Memory, Springer, Berlin,
1984.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
Landschützer, P., Gruber, N., Bakker, D. C. E., Schuster, U., Nakaoka, S.,
Payne, M. R., Sasse, T. P., and Zeng, J.: A neural network-based estimate of
the seasonal to inter-annual variability of the Atlantic Ocean carbon sink,
Biogeosciences, 10, 7793–7815, <a href="http://dx.doi.org/10.5194/bg-10-7793-2013" target="_blank">doi:10.5194/bg-10-7793-2013</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
Landschützer, P., Gruber, N., Haumann, F., Rödenbeck, C., Bakker, D.,
van Heuven, S., Hoppema, M., Metzl, N., Sweeney, C., Takahashi, T.,
Tilbrook, B.,
and Wanninkhof, R.: The reinvigoration of the Southern Ocean carbon sink,
Science, 349, 1221–1224, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
Le Quéré, C., Moriarty, R., Andrew, R. M., Canadell, J. G., Sitch, S.,
Korsbakken, J. I., Friedlingstein, P., Peters, G. P., Andres, R. J., Boden,
T. A., Houghton, R. A., House, J. I., Keeling, R. F., Tans, P., Arneth, A.,
Bakker, D. C. E., Barbero, L., Bopp, L., Chang, J., Chevallier, F., Chini, L.
P., Ciais, P., Fader, M., Feely, R. A., Gkritzalis, T., Harris, I., Hauck,
J., Ilyina, T., Jain, A. K., Kato, E., Kitidis, V., Klein Goldewijk, K.,
Koven, C., Landschützer, P., Lauvset, S. K., Lefèvre, N., Lenton, A., Lima,
I. D., Metzl, N., Millero, F., Munro, D. R., Murata, A., Nabel, J. E. M. S.,
Nakaoka, S., Nojiri, Y., O'Brien, K., Olsen, A., Ono, T., Pérez, F. F.,
Pfeil, B., Pierrot, D., Poulter, B., Rehder, G., Rödenbeck, C., Saito, S.,
Schuster, U., Schwinger, J., Séférian, R., Steinhoff, T., Stocker, B. D.,
Sutton, A. J., Takahashi, T., Tilbrook, B., van der Laan-Luijkx, I. T., van
der Werf, G. R., van Heuven, S., Vandemark, D., Viovy, N., Wiltshire, A.,
Zaehle, S., and Zeng, N.: Global Carbon Budget 2015, Earth Syst. Sci. Data,
7, 349–396, <a href="http://dx.doi.org/10.5194/essd-7-349-2015" target="_blank">doi:10.5194/essd-7-349-2015</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
Lefèvre, N., Watson, A. J., and Watson, A. R.: A comparison of multiple
regression and neural network techniques for mapping in situ <i>p</i>CO<sub>2</sub>
data, Tellus B, 57, 375–384, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
Nakaoka, S., Telszewski, M., Nojiri, Y., Yasunaka, S., Miyazaki, C., Mukai,
H., and Usui, N.: Estimating temporal and spatial variation of ocean surface
<i>p</i>CO<sub>2</sub> in the North Pacific using a self-organizing map neural network
technique, Biogeosciences, 10, 6093–6106, <a href="http://dx.doi.org/10.5194/bg-10-6093-2013" target="_blank">doi:10.5194/bg-10-6093-2013</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
Park, G.-H., Wanninkhof, R., Doney, S. C., Takahashi, T., Lee, K., Feely, R.
A., Sabine, C. L., Triñanes, J., and Lima, I. D.: Variability of global
net sea-air CO<sub>2</sub> fluxes over the last three decades using empirical
relationships, Tellus B, 62, 352–368, <a href="http://dx.doi.org/10.1111/j.1600-0889.2010.00498.x" target="_blank">doi:10.1111/j.1600-0889.2010.00498.x</a>,
2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>19</label><mixed-citation>
Pelckmans, K., Suykens, J. A. K., Gestel, T. V., Brabanter, J. D., Hamers,
B., Moor, D., and Vandewalle, J.: LS-SVMlab: a MATLAB/C toolbox for Least
Squares Support Vector Machines,
<a href="http://www.esat.kuleuven.ac.be/sista/lssvmlab" target="_blank">http://www.esat.kuleuven.ac.be/sista/lssvmlab</a>, last access: April 2017,
presented at Neural Information Processing Systems (NIPS 2002), 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>20</label><mixed-citation>
Pfeil, B., Olsen, A., Bakker, D. C. E., Hankin, S., Koyuk, H., Kozyr, A.,
Malczyk, J., Manke, A., Metzl, N., Sabine, C. L., Akl, J., Alin, S. R.,
Bates, N., Bellerby, R. G. J., Borges, A., Boutin, J., Brown, P. J., Cai,
W.-J., Chavez, F. P., Chen, A., Cosca, C., Fassbender, A. J., Feely, R. A.,
González-Dávila, M., Goyet, C., Hales, B., Hardman-Mountford, N., Heinze, C.,
Hood, M., Hoppema, M., Hunt, C. W., Hydes, D., Ishii, M., Johannessen, T.,
Jones, S. D., Key, R. M., Körtzinger, A., Landschützer, P., Lauvset, S. K.,
Lefèvre, N., Lenton, A., Lourantou, A., Merlivat, L., Midorikawa, T.,
Mintrop, L., Miyazaki, C., Murata, A., Nakadate, A., Nakano, Y., Nakaoka, S.,
Nojiri, Y., Omar, A. M., Padin, X. A., Park, G.-H., Paterson, K., Perez, F.
F., Pierrot, D., Poisson, A., Ríos, A. F., Santana-Casiano, J. M., Salisbury,
J., Sarma, V. V. S. S., Schlitzer, R., Schneider, B., Schuster, U., Sieger,
R., Skjelvan, I., Steinhoff, T., Suzuki, T., Takahashi, T., Tedesco, K.,
Telszewski, M., Thomas, H., Tilbrook, B., Tjiputra, J., Vandemark, D.,
Veness, T., Wanninkhof, R., Watson, A. J., Weiss, R., Wong, C. S., and
Yoshikawa-Inoue, H.: A uniform, quality controlled Surface Ocean CO<sub>2</sub> Atlas
(SOCAT), Earth Syst. Sci. Data, 5, 125–143, <a href="http://dx.doi.org/10.5194/essd-5-125-2013" target="_blank">doi:10.5194/essd-5-125-2013</a>,
2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>21</label><mixed-citation>
Reynolds, R. W., Rayner, N. A., Smith, T. M., Stokes, D. C., and Wang, W.:
An Improved In Situ and Satellite SST Analysis for Climate, J. Climate,
15, 1609–1625, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>22</label><mixed-citation>
Rödenbeck, C., Keeling, R. F., Bakker, D. C. E., Metzl, N., Olsen, A.,
Sabine, C., and Heimann, M.: Global surface-ocean <i>p</i>CO<sub>2</sub> and sea–air
CO<sub>2</sub> flux variability from an observation-driven ocean mixed-layer
scheme, Ocean Sci., 9, 193–216, <a href="http://dx.doi.org/10.5194/os-9-193-2013" target="_blank">doi:10.5194/os-9-193-2013</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>23</label><mixed-citation>
Rödenbeck, C., Bakker, D. C. E., Gruber, N., Iida, Y., Jacobson, A. R.,
Jones, S., Landschützer, P., Metzl, N., Nakaoka, S., Olsen, A., Park, G.-H.,
Peylin, P., Rodgers, K. B., Sasse, T. P., Schuster, U., Shutler, J. D.,
Valsala, V., Wanninkhof, R., and Zeng, J.: Data-based estimates of the ocean
carbon sink variability – first results of the Surface Ocean <i>p</i>CO<sub>2</sub>
Mapping intercomparison (SOCOM), Biogeosciences, 12, 7251–7278,
<a href="http://dx.doi.org/10.5194/bg-12-7251-2015" target="_blank">doi:10.5194/bg-12-7251-2015</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>24</label><mixed-citation>
Sabine, C. L., Hankin, S., Koyuk, H., Bakker, D. C. E., Pfeil, B., Olsen, A.,
Metzl, N., Kozyr, A., Fassbender, A., Manke, A., Malczyk, J., Akl, J., Alin,
S. R., Bellerby, R. G. J., Borges, A., Boutin, J., Brown, P. J., Cai, W.-J.,
Chavez, F. P., Chen, A., Cosca, C., Feely, R. A., González-Dávila, M., Goyet,
C., Hardman-Mountford, N., Heinze, C., Hoppema, M., Hunt, C. W., Hydes, D.,
Ishii, M., Johannessen, T., Key, R. M., Körtzinger, A., Landschützer, P.,
Lauvset, S. K., Lefèvre, N., Lenton, A., Lourantou, A., Merlivat, L.,
Midorikawa, T., Mintrop, L., Miyazaki, C., Murata, A., Nakadate, A., Nakano,
Y., Nakaoka, S., Nojiri, Y., Omar, A. M., Padin, X. A., Park, G.-H.,
Paterson, K., Perez, F. F., Pierrot, D., Poisson, A., Ríos, A. F., Salisbury,
J., Santana-Casiano, J. M., Sarma, V. V. S. S., Schlitzer, R., Schneider, B.,
Schuster, U., Sieger, R., Skjelvan, I., Steinhoff, T., Suzuki, T., Takahashi,
T., Tedesco, K., Telszewski, M., Thomas, H., Tilbrook, B., Vandemark, D.,
Veness, T., Watson, A. J., Weiss, R., Wong, C. S., and Yoshikawa-Inoue, H.:
Surface Ocean CO<sub>2</sub> Atlas (SOCAT) gridded data products, Earth Syst. Sci.
Data, 5, 145–153, <a href="http://dx.doi.org/10.5194/essd-5-145-2013" target="_blank">doi:10.5194/essd-5-145-2013</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>25</label><mixed-citation>
Sarma, V. V. S. S., Saino, T., Sasaoka, K., Nojiri, Y., Ono, T., Ishii, M.,
Inoue, H. Y., and Matsumoto, K.: Basin-scale <i>p</i>CO<sub>2</sub> distribution using
satellite sea surface temperature, Chla, and climatological salinity in the
North Pacific in spring and summer, Global Biogeochem. Cy., 20, GB3005,
<a href="http://dx.doi.org/10.1029/2005GB002594" target="_blank">doi:10.1029/2005GB002594</a>, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>26</label><mixed-citation>
Sasse, T. P., McNeil, B. I., and Abramowitz, G.: A new constraint on global
air-sea CO<sub>2</sub> fluxes using bottle carbon data, Geophys. Res. Lett., 40,
1594–1599, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>27</label><mixed-citation>
Schmidtko, S., Johnson, G. C., and Lyman, J. M.: MIMOC: A global monthly
isopycnal upper-ocean climatology with mixed layers, J. Geophys. Res., 118,
1658–1672, <a href="http://dx.doi.org/10.1002/jgrc.20122" target="_blank">doi:10.1002/jgrc.20122</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>28</label><mixed-citation>
Stocker, T., Qin, D., and Platner, G.-K.: Climate Change 2013 The Physical
Science Basis, Cambridge University Press, Cambridge, United Kingdom, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>29</label><mixed-citation>
Takahashi, T., Sutherland, S. C., Sweeney, C., Poisson, A., Metzl, N.,
Tilbrook, B., Bates, N., Wanninkhof, R., Feely, R. A., Sabine, C., Olafsson,
J., and Nojiri, Y.: Global sea-air CO<sub>2</sub> flux based on climatological
surface ocean <i>p</i>CO<sub>2</sub>, and seasonal biological and temperature effects,
Deep-Sea Res. Pt. II, 49, 1601–1622, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>30</label><mixed-citation>
Takahashi, T., Sutherland, S. C., Wanninkhof, R., Sweeney, C., Feely, R. A.,
Chipman, D. W., Hales, B., Friederich, G., Chavez, F., Sabine, C., Watson,
A., Bakker, D. C. E., Schuster, U., Metzl, N., Yoshikawa-Inoue, H., Ishii,
M., Midorikawa, T., Nojiri, Y., Körtzinger, A., Steinhoff, T., Hoppema,
M., Olafsson, J., Arnarson, T. S., Tilbrook, B., Johannessen, T., Olsen, A.,
Bellerby, R., and Wong, C. S.: Climatological mean and decadal change in
surface ocean <i>p</i>CO<sub>2</sub>, and net sea-air CO<sub>2</sub> flux over the global
oceans, Deep-Sea Res. Pt. II, 56, 554–577, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>31</label><mixed-citation>
Takahashi, T., Sutherland, S. C., Chipman, D. W., Goddard, J. G., Ho, C.,
Newberger, T., Sweeney, C., and Munro, D. R.: Climatological distributions of pH, <i>p</i>CO<sub>2</sub>, total
CO<sub>2</sub>, alkalinity, and CaCO<sub>3</sub> saturation in the global surface ocean, and
temporal changes at selected locations, Mar. Chem., 164, 95–145, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>32</label><mixed-citation>
Takamura, T. R., Inoue, H. Y., Midorikawa, T., Ishii, M., and Nojiri, Y.:
Seasonal and Inter-Annual Variations in <i>p</i>CO<sub>2</sub> sea and Air-Sea CO<sub>2</sub> Fluxes
in Mid-Latitudes of the Western and Eastern North Pacific during 1999–2006:
Recent Results Utilizing Voluntary Observation Ships, J. Meteorol. Soc. Jpn.,
88, 883–898, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>33</label><mixed-citation>
Telszewski, M., Chazottes, A., Schuster, U., Watson, A. J., Moulin, C.,
Bakker, D. C. E., González-Dávila, M., Johannessen, T., Körtzinger, A.,
Lüger, H., Olsen, A., Omar, A., Padin, X. A., Ríos, A. F., Steinhoff, T.,
Santana-Casiano, M., Wallace, D. W. R., and Wanninkhof, R.: Estimating the
monthly <i>p</i>CO<sub>2</sub> distribution in the North Atlantic using a
self-organizing neural network, Biogeosciences, 6, 1405–1421,
<a href="http://dx.doi.org/10.5194/bg-6-1405-2009" target="_blank">doi:10.5194/bg-6-1405-2009</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>34</label><mixed-citation>
Wanninkhof, R., Park, G.-H., Takahashi, T., Sweeney, C., Feely, R., Nojiri,
Y., Gruber, N., Doney, S. C., McKinley, G. A., Lenton, A., Le Quéré, C.,
Heinze, C., Schwinger, J., Graven, H., and Khatiwala, S.: Global ocean carbon
uptake: magnitude, variability and trends, Biogeosciences, 10, 1983–2000,
<a href="http://dx.doi.org/10.5194/bg-10-1983-2013" target="_blank">doi:10.5194/bg-10-1983-2013</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>35</label><mixed-citation>
Wilamowski, B. M. and Yu, H.: Improved Computation for Levenberg-Marquardt
Training, IEEE T. Neural Networ., 21, 930–937, 2010.

</mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>36</label><mixed-citation>
Zeng, J., Nojiri, Y., Landschützer, P., Telszewski, M., and Nakaoka, S.:
A global surface ocean <i>f</i>CO<sub>2</sub> climatology based on a feedforward neural
network, J. Atmos. Ocean Tech., 31, 1838–1849, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>37</label><mixed-citation>
Zeng, J., Nojiri, Y., Nakaoka, S., Nakajima, H., and Shirai, T.: Surface
ocean CO<sub>2</sub> in 1990-2011 modelled using a feed-forward neural network,
Geoscience Data Journal, 2, 47–51, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>38</label><mixed-citation>
Zeng, J. Y., Nojiri, Y., Murphy, P. P., Wong, C. S., and Fujinuma, Y.: A
comparison of Delta <i>p</i>CO<sub>2</sub> distributions in the northern North Pacific
using results from a commercial vessel in 1995–1999, Deep-Sea Res. Pt. II,
49, 5303–5315, 2002.
</mixed-citation></ref-html>--></article>
