<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">OS</journal-id><journal-title-group>
    <journal-title>Ocean Science</journal-title>
    <abbrev-journal-title abbrev-type="publisher">OS</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Ocean Sci.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1812-0792</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/os-17-1545-2021</article-id><title-group><article-title>Defining Southern Ocean fronts using unsupervised classification</article-title><alt-title>Southern Ocean front detection</alt-title>
      </title-group><?xmltex \runningtitle{Southern Ocean front detection}?><?xmltex \runningauthor{S. D. A. Thomas et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1 aff2">
          <name><surname>Thomas</surname><given-names>Simon D. A.</given-names></name>
          <email>sithom@bas.ac.uk</email>
        <ext-link>https://orcid.org/0000-0001-7911-1659</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Jones</surname><given-names>Daniel C.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-8701-4506</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Faul</surname><given-names>Anita</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3 aff1">
          <name><surname>Mackie</surname><given-names>Erik</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff4">
          <name><surname>Pauthenet</surname><given-names>Etienne</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>British Antarctic Survey, NERC, UKRI, Cambridge, UK</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Department of Physics, University of Cambridge, Cambridge, UK</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Cambridge Zero, University of Cambridge, Cambridge, UK</institution>
        </aff>
        <aff id="aff4"><label>4</label><institution>LOCEAN-IPSL, UPMC Université, Sorbonne Universités, Paris, France</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Simon D. A. Thomas (sithom@bas.ac.uk)</corresp></author-notes><pub-date><day>2</day><month>November</month><year>2021</year></pub-date>
      
      <volume>17</volume>
      <issue>6</issue>
      <fpage>1545</fpage><lpage>1562</lpage>
      <history>
        <date date-type="received"><day>7</day><month>May</month><year>2021</year></date>
           <date date-type="rev-request"><day>20</day><month>May</month><year>2021</year></date>
           <date date-type="rev-recd"><day>11</day><month>September</month><year>2021</year></date>
           <date date-type="accepted"><day>16</day><month>September</month><year>2021</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2021 </copyright-statement>
        <copyright-year>2021</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://os.copernicus.org/articles/.html">This article is available from https://os.copernicus.org/articles/.html</self-uri><self-uri xlink:href="https://os.copernicus.org/articles/.pdf">The full text article is available as a PDF file from https://os.copernicus.org/articles/.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d1e138">Oceanographic fronts are transitions between thermohaline structures with different characteristics. Such transitions are ubiquitous, and their locations and properties affect how the ocean operates as part of the global climate system. In the Southern Ocean, fronts have classically been defined using a small number of continuous, circumpolar features in sea surface height or dynamic height. Modern observational and theoretical developments are challenging and expanding this traditional framework to accommodate a more complex view of fronts. Here, we present a complementary new approach for calculating fronts using an unsupervised classification method called Gaussian mixture modelling (GMM) and a novel inter-class parameter called the <inline-formula><mml:math id="M1" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric. The <inline-formula><mml:math id="M2" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric approach produces a probabilistic view of front location, emphasising the fact that the boundaries between water masses are not uniformly sharp across the entire Southern Ocean. The <inline-formula><mml:math id="M3" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric approach uses thermohaline information from a range of depth levels, making it more general than approaches that only use near-surface properties. We train the GMM using an observationally constrained state estimate in order to have more uniform spatial and temporal data coverage. The probabilistic boundaries defined by the <inline-formula><mml:math id="M4" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric roughly coincide with several classically defined fronts, offering a novel view of this structure. The <inline-formula><mml:math id="M5" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric fronts appear to be relatively sharp in the open ocean and somewhat diffuse near large topographic features, possibly highlighting the importance of topographically induced mixing. For comparison with a more localised method, we also use an edge detection approach for identifying fronts. We find a strong correlation between the edge field of the leading principal component and the zonal velocity; the edge detection method highlights the presence of jets, which are supported by thermal wind balance. This more localised method highlights the complex, multiscale structure of Southern Ocean fronts, complementing and contrasting with the more domain-wide view offered by the <inline-formula><mml:math id="M6" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric. The Sobel edge detection method may be useful for defining and tracking smaller-scale fronts and jets in model or reanalysis data. The <inline-formula><mml:math id="M7" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric approach may prove to be a useful method for inter-model comparison, as it uses the thermohaline structure of those models instead of tracking somewhat ad hoc values of sea surface height and/or dynamic height, which can vary considerably between models. In addition, the general <inline-formula><mml:math id="M8" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric approach allows front definitions to shift with changing temperature and salinity structures, which may be useful for characterising fronts in a changing climate.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <?pagebreak page1546?><p id="d1e207">The Southern Ocean (SO) is at the centre of the global thermohaline circulation,
joining the Indian, Pacific, and Atlantic oceans into a single planetary-scale heat
and carbon transport system <xref ref-type="bibr" rid="bib1.bibx32 bib1.bibx60" id="paren.1"/>.
In the SO, upwelling and downwelling branches of the overturning circulation
transport water and tracers (e.g. heat, carbon) between
the surface and subsurface oceans <xref ref-type="bibr" rid="bib1.bibx52 bib1.bibx53" id="paren.2"/>.
The steeply tilted isopycnals associated with the overturning circulation also support
the powerful Antarctic Circumpolar Current (ACC),
with a mean combined barotropic and
baroclinic volume transport of roughly <inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:mn mathvariant="normal">173.3</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">10.7</mml:mn></mml:mrow></mml:math></inline-formula> Sv, driven by a combination of
the westerly winds and air–sea buoyancy forcing
<xref ref-type="bibr" rid="bib1.bibx50 bib1.bibx41 bib1.bibx9" id="paren.3"/>.
In part because of its unique structure, the SO is a critical regulator of global
climate, having thus far absorbed more than 75 % of the excess energy and 50 % of the excess
carbon added to the climate system from anthropogenic emissions
<xref ref-type="bibr" rid="bib1.bibx40 bib1.bibx14" id="paren.4"/>.
As such, the thermohaline structure of the Southern Ocean may be
considered an important climate system parameter, as it affects how heat and carbon
are partitioned between the atmosphere and ocean.</p>
      <p id="d1e234">Through decades of observational and theoretical effort, the global oceanographic community has
curated a detailed theoretical understanding of the structure of the Southern Ocean.
One of the hallmarks of this view is the presence of fronts,
i.e. transitions in temperature, salinity, and/or biogeochemical properties <xref ref-type="bibr" rid="bib1.bibx7 bib1.bibx43" id="paren.5"/>. Although fronts are not identical to the sharp jets found in the SO, fronts and jets at the mesoscale share a close relationship partly due to thermal wind balance <xref ref-type="bibr" rid="bib1.bibx55 bib1.bibx56" id="paren.6"/>. Traditionally, oceanographers have defined SO fronts using a small number of continuous, circumpolar features that follow contours of sea surface height or dynamic height <xref ref-type="bibr" rid="bib1.bibx27" id="paren.7"/>. However, satellite altimetry shows that the ACC features a braided and meandering structure that is not necessarily reflected in the traditional, time-averaged view of fronts as continuous property contours <xref ref-type="bibr" rid="bib1.bibx3 bib1.bibx31" id="paren.8"/>. Using individual property contours to define fronts, for example, contours of temperature or sea surface height, is somewhat limited by the fact that such contours do not always line up with the locations of strong gradients  <xref ref-type="bibr" rid="bib1.bibx63 bib1.bibx62 bib1.bibx16 bib1.bibx3" id="paren.9"/>. In response to more detailed SO observations, the global oceanographic community has been developing a variety of new approaches for defining and tracking fronts in more application-specific ways <xref ref-type="bibr" rid="bib1.bibx4" id="paren.10"/>. For example, coastal applications and open-ocean applications may benefit from conceptually different treatments of ocean fronts, which are characterised by different spatial and temporal scales. For a historical view and summary of advances in the area of front definition and detection, see the recent review article by <xref ref-type="bibr" rid="bib1.bibx4" id="text.11"/>.</p>
      <p id="d1e259">In order to help us broaden our view of Southern Ocean fronts, we look to a branch of machine learning called unsupervised classification (also known as clustering). Broadly speaking, unsupervised classification attempts to identify subpopulations in data distributions that have not already been labelled or sorted. Although such methods have existed for decades, the amount of SO data has only in recent years become large enough for clustering approaches to be suitable; the application of unsupervised classification to oceanographic data is in its infancy. Several recent studies have used unsupervised classification to identify coherent regimes of thermohaline structure and the transitions between them, specifically in the North Atlantic <xref ref-type="bibr" rid="bib1.bibx36" id="paren.12"/>, Southern Ocean <xref ref-type="bibr" rid="bib1.bibx25" id="paren.13"/>, and Indian sector of the Southern Ocean <xref ref-type="bibr" rid="bib1.bibx51" id="paren.14"/>. These methods have also been used to define coherent dynamical and biogeochemical regimes from depth-averaged ocean structure <xref ref-type="bibr" rid="bib1.bibx57 bib1.bibx29 bib1.bibx23" id="paren.15"/>. Recently, unsupervised classification has been used to define coherent ecological regimes from physical and biogeochemical data <xref ref-type="bibr" rid="bib1.bibx58" id="paren.16"/>. Researchers are also exploring potential connections between changes in class properties and large-scale climate phenomena. For example, a recent study tied evolution in the longitudinal extent of an algorithmically defined class to the onset of El Niño, suggesting that unsupervised classification methods could complement existing index-based assessments of large-scale climate modes <xref ref-type="bibr" rid="bib1.bibx21" id="paren.17"/>.</p>
      <p id="d1e281">Unsupervised classification does not use specific property contours to define boundaries
between thermohaline structures, so it avoids one of the fundamental limitations of many
traditional front definition approaches. Given the required information,
unsupervised classification methods can use more detailed thermohaline data from
throughout the water column to define classes and their boundaries. Across a given front,
one might expect to find not only a transition in surface values but also a change in the thermohaline structure,
as indicated by a change of profile class with latitude and/or longitude. In this work,
we use an unsupervised classification technique called Gaussian mixture modelling (GMM),
which attempts to represent subpopulations in the data distribution using multidimensional Gaussian functions.
Because GMM is a probabilistic method, in addition to automatically clustering the thermohaline profiles into classes,
it returns a set of weights across the different classes for each data point. That is,
it returns a probability distribution that can be exploited to define boundaries between coherent regimes in a novel way.
In this paper, we propose that GMM can be used to represent the boundaries as “fuzzy” regions,
which reflects the fact that not all transitions in the SO are uniformly sharp.</p>
      <p id="d1e285">In Sect. <xref ref-type="sec" rid="Ch1.S2"/>, we introduce the observationally constrained state estimate
from which we draw our temperature and salinity data (Sect. <xref ref-type="sec" rid="Ch1.S2.SS1"/>),
discuss principal component analysis (PCA) for dimensionality reduction (Sect. <xref ref-type="sec" rid="Ch1.S2.SS2"/>),
and cover our application of GMM (Sect. <xref ref-type="sec" rid="Ch1.S2.SS3"/>). We then define the inter-class comparison metric
(i.e. the <inline-formula><mml:math id="M10" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric) that we use to quantify water mass boundaries (Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/>).
Next, we apply the <inline-formula><mml:math id="M11" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric to the reduced-dimension state estimate data (Sect. <xref ref-type="sec" rid="Ch1.S3.SS2"/>).
For comparison, we contrast this method with a more local front detection approach (Sect. <xref ref-type="sec" rid="Ch1.S3.SS4"/>).
Finally, we discuss some caveats (Sect. <xref ref-type="sec" rid="Ch1.S4"/>) and offer our summary and conclusions
(Sect. <xref ref-type="sec" rid="Ch1.S5"/>).</p>
</sec>
<?pagebreak page1547?><sec id="Ch1.S2">
  <label>2</label><title>State estimate data, PCA, and unsupervised classification</title>
      <p id="d1e329">Our front identification method uses a combination of principal component analysis, unsupervised classification,
and a new probabilistic metric to quantify the boundaries between coherent thermohaline structures.
First, we describe the dataset that we used for developing and training our method.</p>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>The Southern Ocean State Estimate</title>
      <p id="d1e339">We developed our method using the Biogeochemical Southern Ocean State Estimate (B-SOSE) <xref ref-type="bibr" rid="bib1.bibx66" id="paren.18"/>.
B-SOSE is an observationally constrained numerical simulation created using MITgcm (<uri>https://mitgcm.org/</uri>, last access: 10 August 2021)
<xref ref-type="bibr" rid="bib1.bibx33 bib1.bibx34" id="paren.19"/> and a suite of Southern Ocean observations, including Argo float data, ship track data, and satellite data. B-SOSE is part of the Estimating the Circulation and Climate of the Ocean (ECCO) suite of state estimates (<uri>https://www.ecco-group.org/</uri>, last access: 10 August 2021), which includes a variety of global and regional products covering a range of multi-year to multi-decadal time periods. Examples of other state estimates include the physics-only Southern Ocean State Estimate (SOSE) and the global ECCOv4 state estimate <xref ref-type="bibr" rid="bib1.bibx37 bib1.bibx12" id="paren.20"/>. We chose to develop our method using a state estimate because such products offer (1) uniform coverage in latitude, longitude, and time as well as (2) relatively high fidelity with respect to observations. We chose B-SOSE, in particular, because it represents the Southern Ocean using a spatial resolution of <inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">6</mml:mn><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>, which is eddy-permitting in the latitude range of the ACC, enabling a realistic representation of mesoscale eddy structure <xref ref-type="bibr" rid="bib1.bibx17" id="paren.21"/>. We expect that training our model on the physics-only SOSE would produce similar results, although we did not attempt that here. In principle, our methods can be readily applied to any gridded temperature and salinity profile dataset. It may be possible to apply these methods to in situ data as well, if the user addresses the problem of non-uniform spatial and temporal sampling. In this paper, we focus only on applications to gridded datasets.</p>
      <p id="d1e377">To construct a state estimate, researchers bring a numerical simulation into better consistency with an observational dataset using the 4D-Var method. This method uses adjoint sensitivities to calculate the required changes in the “controls” (e.g. initial conditions, mixing parameters, boundary conditions) needed to improve the agreement between the simulation and the observational dataset <xref ref-type="bibr" rid="bib1.bibx59 bib1.bibx70" id="paren.22"/>.</p>
      <p id="d1e383">The B-SOSE domain extends from the Equator to 78<inline-formula><mml:math id="M13" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> S,
but we only use data south of
30<inline-formula><mml:math id="M14" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> S to focus on the Southern Ocean and to avoid the model boundary.
It uses bathymetry and coastline based on <xref ref-type="bibr" rid="bib1.bibx1" id="text.23"/>.
B-SOSE solves the heat, salt, and momentum equations using a third-order direct space
and time advection scheme with a 1 h time step. The time-evolving atmospheric
boundary conditions use bulk formulae to solve for fluxes of heat, freshwater,
and momentum, with 6-hourly atmospheric state variables as inputs
<xref ref-type="bibr" rid="bib1.bibx28 bib1.bibx8" id="paren.24"/>. The state estimation process
iteratively adjusts the atmospheric state variables and oceanic initial conditions to improve model–data agreement.
B-SOSE uses dynamic sea ice <xref ref-type="bibr" rid="bib1.bibx30 bib1.bibx11" id="paren.25"/>. For vertical mixing,
it uses the GLL90 mixed layer parameterisation <xref ref-type="bibr" rid="bib1.bibx15" id="paren.26"/>.
It also uses horizontal and vertical viscosity and diffusivity.
River runoff comes from the product of <xref ref-type="bibr" rid="bib1.bibx6" id="text.27"/> augmented with an estimate of
Antarctic freshwater input from iceberg and ice sheet melting <xref ref-type="bibr" rid="bib1.bibx18" id="paren.28"/>.
It does not include mesoscale eddy parameterisation, as this particular configuration falls
into the horizontal resolution range wherein mesoscale parameterisation may actually worsen
the representation of the mesoscale <xref ref-type="bibr" rid="bib1.bibx17" id="paren.29"/>.
Because we are interested in quantifying physical, large-scale fronts,
we only used monthly mean temperature and salinity data. Also, because we are not interested
in the surface seasonal cycle at present, we only used temperature and salinity data between 300 and 2000 m,
following <xref ref-type="bibr" rid="bib1.bibx51" id="text.30"/>. We used the whole period of iteration 106 of this state estimate,
which covers January 2008 to December 2012.
Some key properties of B-SOSE iteration 106 are listed in Table <xref ref-type="table" rid="Ch1.T1"/>.
For further details, see <xref ref-type="bibr" rid="bib1.bibx66" id="text.31"/>.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e439">Selected properties of B-SOSE iteration 106.
Output frequency refers to the output selected for this study.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="2">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Property</oasis:entry>
         <oasis:entry colname="col2">Value</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">State estimate iteration number</oasis:entry>
         <oasis:entry colname="col2">106</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Horizontal resolution</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">6</mml:mn><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Vertical resolution (variable)</oasis:entry>
         <oasis:entry colname="col2">4.2  to 400 m</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Number of vertical levels</oasis:entry>
         <oasis:entry colname="col2">52</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Output frequency</oasis:entry>
         <oasis:entry colname="col2">Monthly averaged</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Horizontal viscosity</oasis:entry>
         <oasis:entry colname="col2">10 m<inline-formula><mml:math id="M16" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:math></inline-formula> s<inline-formula><mml:math id="M17" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Vertical viscosity</oasis:entry>
         <oasis:entry colname="col2">10<inline-formula><mml:math id="M18" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M19" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:math></inline-formula> s<inline-formula><mml:math id="M20" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Horizontal diffusivity</oasis:entry>
         <oasis:entry colname="col2">10 m<inline-formula><mml:math id="M21" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:math></inline-formula> s<inline-formula><mml:math id="M22" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Vertical diffusivity</oasis:entry>
         <oasis:entry colname="col2">10<inline-formula><mml:math id="M23" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> m<inline-formula><mml:math id="M24" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:math></inline-formula> s<inline-formula><mml:math id="M25" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Principal component analysis</title>
      <p id="d1e674">Each vertical profile in the full B-SOSE dataset is comprised of temperature and salinity values at multiple depth levels, at every grid cell and every output month. Values close to each other in the water column are correlated to some degree. Therefore, we do not necessarily need values of potential temperature (<inline-formula><mml:math id="M26" display="inline"><mml:mi mathvariant="italic">θ</mml:mi></mml:math></inline-formula>) and salinity (<inline-formula><mml:math id="M27" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula>) at every depth level to capture most of the variability, and reducing the dimensionality of the data can improve the convergence of the training process. One specific dimension reduction technique is<?pagebreak page1548?> principal component analysis (PCA), which identifies the functions that capture most of the variability with depth in the dataset. The result is a representation of the dataset as a linear combination of eigenfunctions (i.e. principal components), sometimes called a principal component expansion or principal component decomposition. Using this procedure, we can describe each profile using a small set of eigenvalues (i.e. coefficients of the principal component expansion) instead of a full set of temperature and salinity values. In addition to improving the speed and efficiency of the GMM algorithm, PCA reveals potential physical structures that may be useful for understanding the stratification of the SO <xref ref-type="bibr" rid="bib1.bibx44" id="paren.32"/>. We choose the number of principal components such that the percentage of variability explained (in a statistical sense) by the PCA expansion is sufficiently high for our purposes.</p>
      <p id="d1e694">Following <xref ref-type="bibr" rid="bib1.bibx51" id="text.33"/>, we only keep values between 300 and 2000 m
to exclude most of the surface seasonal variability from the dataset.
Because the data are spaced on an irregular grid
in the vertical direction, we first interpolate
the temperature and salinity profiles onto
a regular grid with 10 m cells in the vertical.
Following <xref ref-type="bibr" rid="bib1.bibx44" id="text.34"/>, at each grid cell and time
we concatenate the temperature and salinity profiles into a single vector.
We normalise each depth level for both temperature and salinity separately:
subtracting the mean and dividing by the standard deviation calculated for
all time periods on that particular depth level and variable.
That is, we standardise the temperature values at each level using the
distribution of temperatures at that same depth level,
and we standardise the salinity values using the
distribution of salinities at that same depth level.
This is a slightly different approach from <xref ref-type="bibr" rid="bib1.bibx44" id="text.35"/>,
in which the authors standardise across the entire dataset.
We found that, for the work shown in this paper,
the choice of normalisation approach does not make a large difference
in the results (not shown). After normalisation, we carry out PCA
expansion. We keep the first three principal components (PCs),
which together statistically explain 98 % of the
variability across the thermohaline dataset. For completeness, we show the structure of the principal components in Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/>.</p>
      <p id="d1e708">The coefficients associated with PC1 indicate a broad division between polar,
high-latitude Southern Ocean waters and the subtropics (Fig. <xref ref-type="fig" rid="Ch1.F1"/>a).
The most negative PC1 coefficients are found in the Weddell Gyre,
and we also see the imprints of the South Pacific Gyre and the ACC
<xref ref-type="bibr" rid="bib1.bibx68" id="paren.36"/>.
The coefficients of PC2 bear the imprint of the ACC
and of its northward flow along the eastern Pacific Basin (Fig. <xref ref-type="fig" rid="Ch1.F1"/>b).
This northward flow is associated with the formation and export of Subantarctic Mode Water
and Antarctic Intermediate Water <xref ref-type="bibr" rid="bib1.bibx22 bib1.bibx52 bib1.bibx24" id="paren.37"/>.
PC2 also has the imprint of the Agulhas Current around South Africa.
Finally, PC3 has strong negative values in the Weddell Gyre and over most of the Pacific,
with a band of circumpolar positive values that somewhat mirrors the southward drift of
the ACC when considered from west to east.
The spatial structure of PC1 and PC2 are largely consistent with
those of <xref ref-type="bibr" rid="bib1.bibx44" id="text.38"/>, but the structure of PC3 is
somewhat different from theirs, particularly in the subtropics.
These differences are possibly a result of our
choice of a different depth range.
Given that PC3 explains a small fraction of the
variability (7 % of the variance explained),
we do not expect these differences to impact our results.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><?xmltex \currentcnt{1}?><?xmltex \def\figurename{Figure}?><label>Figure 1</label><caption><p id="d1e727">Each combined temperature and salinity profile can
be approximated using a three-term PC expansion.
Above are monthly mean coefficients of the PC expansion from June 2011.
In order to limit the influence of seasonal variability,
we use temperature and salinity profiles between 300 and 2000 m.
The first three PCs explain <bold>(a)</bold> 75 %, <bold>(b)</bold> 16 %, and
<bold>(c)</bold> 7 % of the variance respectively, together explaining
a total of 98 % of the variability. The white space represents
bathymetry shallower than 2000 m, and its boundary is marked by a grey line.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://os.copernicus.org/articles/17/1545/2021/os-17-1545-2021-f01.png"/>

        </fig>

      <p id="d1e745">After we perform dimensionality reduction, each monthly
output at each model grid cell in latitude and longitude
is represented using the first three
coefficients of the PC expansion.
The three PC values contain combined information about both
temperature and salinity, simplifying our analysis.
This approach defines an abstract three-dimensional space
in which we can perform unsupervised classification.
In typical machine learning terminology, this abstract
three-dimensional space can be called the “feature space”,
in which each PC axis is a “feature”.
To be explicit, we can say that each combined
temperature–salinity profile in latitude, longitude,
and time is represented by a three-dimensional vector of
PC values. Each three-dimensional
PC vector derived from B-SOSE is an “observation”.
In the next section, we use unsupervised classification
to identify subpopulations
in the three-dimensional distribution of PC values.</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Gaussian mixture modelling</title>
      <p id="d1e756">Unsupervised classification attempts to identify subpopulations within a data distribution,
without the assistance of any predefined labels. In our application,
we attempt to identify data subpopulations in the abstract three-dimensional
space defined by the PC coefficients (i.e. the “feature space”). Here, we use GMM,
an algorithm that attempts to fit a set of multidimensional Gaussian functions
to the data by iteratively adjusting the means and
covariances of the Gaussians (<xref ref-type="bibr" rid="bib1.bibx38" id="altparen.39"/>;
see Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/> for more detail).
This method has recently been used to classify Argo
temperature profiles in the top 2 km of the North Atlantic
Ocean and the Southern Ocean <xref ref-type="bibr" rid="bib1.bibx36 bib1.bibx25" id="paren.40"/>.
GMM is well-suited to ocean applications because it offers
a probabilistic measure of classification
in the form of posterior probabilities, which is useful when working with a highly correlated dataset.
Because GMM-derived clusters will likely feature some overlap
due to the highly correlated nature of ocean data,
such posterior probabilities offer an important complement
to the GMM-derived class labels. In this application,
we use the posterior probabilities to define coherent thermohaline regimes and their boundaries.</p>
      <?pagebreak page1549?><p id="d1e767">The GMM method attempts to represent the underlying data
distribution using a set of
<inline-formula><mml:math id="M28" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> Gaussian functions in <inline-formula><mml:math id="M29" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> dimensions (in our case <inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:mi>D</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula>):
            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M31" display="block"><mml:mrow><mml:mi mathvariant="script">N</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi>exp⁡</mml:mi><mml:mfenced close="]" open="["><mml:mrow><mml:mo>-</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mi>T</mml:mi></mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msup><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfenced><mml:mfenced open="(" close=")"><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow><mml:msqrt><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">π</mml:mi><mml:msup><mml:mo>)</mml:mo><mml:mi>D</mml:mi></mml:msup><mml:mfenced open="∥" close="∥"><mml:mrow><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:msqrt></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>∈</mml:mo><mml:msup><mml:mi mathvariant="double-struck">R</mml:mi><mml:mrow><mml:mi>D</mml:mi><mml:mo>×</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> is a vector in the PC space,
<inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mo>∈</mml:mo><mml:msup><mml:mi mathvariant="double-struck">R</mml:mi><mml:mrow><mml:mi>D</mml:mi><mml:mo>×</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> is the centre of the
Gaussian distribution expressed in vector form,
<inline-formula><mml:math id="M34" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>∈</mml:mo><mml:msup><mml:mi mathvariant="double-struck">R</mml:mi><mml:mrow><mml:mi>D</mml:mi><mml:mo>×</mml:mo><mml:mi>D</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> is the covariance matrix,
and <inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula> is its determinant.
The covariance matrix determines the orientation of the
Gaussian ellipsoids in PC space.
We model the dataset, in the statistical sense of
representing the dataset using a probability distribution,
as a weighted sum of Gaussians:
            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M36" display="block"><mml:mrow><mml:mi mathvariant="double-struck">P</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>)</mml:mo><mml:mo>≈</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:munderover><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="script">N</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>;</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>,</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M37" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the weight associated with the <inline-formula><mml:math id="M38" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>th Gaussian.
The process of fitting the GMM uses expectation maximisation (EM), which consists of
iteratively adjusting <inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>,
and <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> to decrease the model–data misfit.
For additional details, see Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/>.</p>
      <p id="d1e1092">Once the weights, means, and covariances are fitted, each data vector <inline-formula><mml:math id="M42" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula>
is associated with a posterior probability distribution across all of the <inline-formula><mml:math id="M43" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> classes.
Although we kept the random seed used in the initial guess fixed for this paper, our results are robust to the choice of random seed (not shown).
This distribution is the set of likelihoods that the data vector belongs
to any particular class, and the probabilities sum to one.
GMM assigns each data vector to the class with the maximum posterior probability.
We will now use this distribution to define an inter-class metric,
which gives us a novel perspective on fronts as transitions in thermohaline structures.</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><?xmltex \opttitle{The inter-class comparison metric (the $I$-metric)}?><title>The inter-class comparison metric (the <inline-formula><mml:math id="M44" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric)</title>
      <p id="d1e1126">First, we examine the structure of our profile data in PC space and introduce the <inline-formula><mml:math id="M45" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric
for identifying boundaries between coherent hydrographic regimes (Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/>).
Next, we examine the <inline-formula><mml:math id="M46" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric in both a monthly averaged and multi-year averaged view in
latitude–longitude space, and we explore the class structure in more detail by examining the associated
coherent regions and vertical profile types (Sect. <xref ref-type="sec" rid="Ch1.S3.SS2"/>).
Following that, we compare our results with a local edge detection method (Sect. <xref ref-type="sec" rid="Ch1.S3.SS4"/>).</p>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><?xmltex \opttitle{Defining the $I$-metric}?><title>Defining the <inline-formula><mml:math id="M47" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric</title>
      <?pagebreak page1550?><p id="d1e1164">For each combined temperature–salinity profile, GMM returns a probability distribution across all of the <inline-formula><mml:math id="M48" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> classes. This distribution is called the <italic>posterior probability</italic> distribution, and it quantifies the probability that a particular profile is in a particular class. If the posterior probability is close to 1.0 for class <inline-formula><mml:math id="M49" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> and very small for the other classes, then within the context of the Gaussian statistical model (i.e. GMM), the classification of the profile into class <inline-formula><mml:math id="M50" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> is unambiguous and clear. However, if the posterior probability is close in value for the two classes with highest probabilities, then the classification is ambiguous and less clear. With this in mind, we can use the difference between the highest probability and the second-highest probability to quantify how clearly the profile has been classified. If the classification is unambiguous, then the profile is less likely to be associated with a boundary between coherent thermohaline regimes. If the classification is ambiguous, then the profile is more likely to be associated with a boundary. With this in mind, we propose a probabilistic inter-class comparison metric of the following form:
            <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M51" display="block"><mml:mrow><mml:mi>I</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi mathvariant="bold">x</mml:mi><mml:mi mathvariant="bold">n</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mfenced close="]" open="["><mml:mrow><mml:mi mathvariant="double-struck">P</mml:mi><mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>c</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mi mathvariant="normal">highest</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mi mathvariant="double-struck">P</mml:mi><mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>c</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mtext>runner-up</mml:mtext></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the
<inline-formula><mml:math id="M53" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>th profile's PC values, and
<inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:mi mathvariant="double-struck">P</mml:mi><mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>c</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mi mathvariant="normal">highest</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the
highest posterior probability that GMM has assigned the <inline-formula><mml:math id="M55" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>th profile as belonging to class <inline-formula><mml:math id="M56" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>.
The term <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:mi mathvariant="double-struck">P</mml:mi><mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>c</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mtext>runner-up</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>
is the second-highest posterior probability belonging to class <inline-formula><mml:math id="M58" display="inline"><mml:mi>l</mml:mi></mml:math></inline-formula>.
If the difference between the highest and runner-up posterior probabilities is close to one,
then <inline-formula><mml:math id="M59" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula> is small. This would indicate that the profile is not
likely to be associated with a boundary between thermohaline regimes.
If the difference between the highest and runner-up posterior probabilities is small,
then <inline-formula><mml:math id="M60" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula> is close to one, indicating that the profile is
likely to be associated with a boundary between different thermohaline regimes.
The <inline-formula><mml:math id="M61" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric offers an alternative method for defining boundaries as
fuzzy transitions between coherent regimes. In general, some regions will feature
sharp transitions across boundaries, whereas other regions will feature more gradual transitions.
The relative sharpness of a transition is influenced by the processes that form,
mix, and destroy water masses. In contrast with approaches that define fronts as sharp transitions
located along property contours or local gradients, the <inline-formula><mml:math id="M62" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric approach
allows for a wider variety of transition types between regimes.</p>
      <p id="d1e1363">In our <inline-formula><mml:math id="M63" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric application, GMM clusters the profiles in feature space (Fig. <xref ref-type="fig" rid="Ch1.F2"/>a).
The structure of the data shown in PC space is broadly consistent with that found in other studies
(e.g. <xref ref-type="bibr" rid="bib1.bibx44 bib1.bibx45 bib1.bibx46" id="altparen.41"/>).
The data distribution is reasonably well represented by a linear combination of multidimensional Gaussian functions
(Fig. <xref ref-type="fig" rid="Ch1.F2"/>). The <inline-formula><mml:math id="M64" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric values indicate transition regions between classes,
where the class labelling is relatively ambiguous (Fig. <xref ref-type="fig" rid="Ch1.F2"/>b).
We choose <inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> to represent the general, large-scale pattern of the data;
we explore the sensitivity of our results to <inline-formula><mml:math id="M66" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> in Sect. <xref ref-type="sec" rid="Ch1.S4.SS4"/>.
In the next section, we examine the <inline-formula><mml:math id="M67" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric and class structure in physical space.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2"><?xmltex \currentcnt{2}?><?xmltex \def\figurename{Figure}?><label>Figure 2</label><caption><p id="d1e1420"><bold>(a)</bold> The classification analysis takes place in the abstract PC space.
Each point represents a three-dimensional vector of principal component values that describe
a single combined temperature and salinity profile.
The three axes are the three principal components.
Class assignments are indicated using colours.
<bold>(b)</bold> The <inline-formula><mml:math id="M68" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric highlights transitions
between classes in the abstract PC space.
The Gaussian ellipsoids of the GMM are shown in red,
and the <inline-formula><mml:math id="M69" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric values associated with each point
are shown using six different colour scales.
Each colour scale corresponds to a particular transition between classes.
Points with low <inline-formula><mml:math id="M70" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric values are not shown.
The above is a subset of data taken from 12 months of monthly averaged
B-SOSE data, between August 2011 and July 2012 inclusively.</p></caption>
          <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://os.copernicus.org/articles/17/1545/2021/os-17-1545-2021-f02.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><?xmltex \opttitle{Geographic view of the $I$-metric}?><title>Geographic view of the <inline-formula><mml:math id="M71" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric</title>
      <p id="d1e1471">The <inline-formula><mml:math id="M72" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric viewed in latitude–longitude space illustrates the rich variety of transition types found in the Southern Ocean (Fig. <xref ref-type="fig" rid="Ch1.F3"/>). In all sectors of the SO, we see sharp transitions where the regions of high <inline-formula><mml:math id="M73" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula> values are narrow and more gradual transitions where the regions of high <inline-formula><mml:math id="M74" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula> values are more spread out. Some features are circumpolar, which is consistent with the view of SO fronts as continuous lines that encircle Antarctica. However, we also see regions where the continuity and circumpolar nature of the fronts is not as clear, suggesting that a broader view may be appropriate <xref ref-type="bibr" rid="bib1.bibx4" id="paren.42"/>. The fronts are not uniformly sharp across all longitudes; for example, the northernmost transition is broad and gradual in the Atlantic sector, sharp in the Indian sector, and relatively broad in the Pacific sector. The southernmost band of high <inline-formula><mml:math id="M75" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric values is relatively sharp in the Atlantic sector, becoming increasingly broad as we follow it into the Indian and Pacific sectors. In the Pacific sector, it extends into an especially broad region in the Amundsen Sea, which is consistent with the intersection of the classically defined southern boundary (SBdy) with the Antarctic continent <xref ref-type="bibr" rid="bib1.bibx27" id="paren.43"/>. Upstream of Kerguelen Plateau, there is a region where the <inline-formula><mml:math id="M76" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric is spread-out and diffuse between classes 2 and 3; this region also features a standing meander associated with enhanced eddy kinetic energy <xref ref-type="bibr" rid="bib1.bibx13 bib1.bibx54" id="paren.44"/>. The enhanced mesoscale eddy kinetic energy associated with the meander is consistent with increased lateral mixing and the spread-out pattern in the <inline-formula><mml:math id="M77" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric found in the same region. Closer to the Antarctic continent, we also see the imprints of both the Weddell Gyre and the Ross Gyre, in regions of coherent structures with low <inline-formula><mml:math id="M78" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric values, in part enforced by the gyre circulation.</p>
      <p id="d1e1535">The monthly mean <inline-formula><mml:math id="M79" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric (Fig. <xref ref-type="fig" rid="Ch1.F3"/>a) also highlights individual ring-like eddies; although these features are not typically considered fronts, they are small-scale transition regions between different hydrographic structures. We do expect the <inline-formula><mml:math id="M80" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric to be non-zero across these features. The monthly view also features mesoscale meanders, highlighting the detailed structure of the SO, which is partly a result of the energetic mesoscale eddy field. The <inline-formula><mml:math id="M81" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric does feature some month-to-month variability: in some locations, the fronts meander in their north–south extent, whereas in others, they are relatively stationary, likely due to bathymetric constraints (see animations in <xref ref-type="bibr" rid="bib1.bibx61" id="altparen.45"/>, in the “gifs” directory).</p>
      <p id="d1e1564">By averaging the 4 years worth of monthly means, we obtain a map of the climatological <inline-formula><mml:math id="M82" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric, which is averaged over many eddy lifetimes (Fig. <xref ref-type="fig" rid="Ch1.F3"/>b). Comparing an example monthly field with the climatological field, we can examine the imprint of eddy spatial variability and the meandering of the fronts on the <inline-formula><mml:math id="M83" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric pattern. Most of our observations<?pagebreak page1551?> about the metric are unchanged by this averaging; we identify three roughly circumpolar bands of high <inline-formula><mml:math id="M84" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric values, with significant spatial variability and some overlap. The three bands are fairly distinct in the Atlantic sector, with the northernmost transition being the broadest. Upstream of Kerguelen Plateau, the two northernmost bands become somewhat hard to distinguish. This is possibly a consequence of the eddy mixing and upwelling hotspot in that region, which tends to spread out hydrographic features in latitude–longitude space, increasing the degree of spatial correlation found there. Upstream of Kerguelen Plateau, the Polar Front features strong seasonal variability <xref ref-type="bibr" rid="bib1.bibx45" id="paren.46"/>. Note that the <inline-formula><mml:math id="M85" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric band aligned roughly with the Polar Front only passes south of the plateau (e.g. south of Heard Island), which is consistent with other studies of the subsurface component of the Polar Front (e.g. <xref ref-type="bibr" rid="bib1.bibx45" id="altparen.47"/>).</p>
      <p id="d1e1605">The three bands of higher <inline-formula><mml:math id="M86" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric values are distinct downstream of the Kerguelen Plateau in the Indian sector; notably, the southernmost band features especially high <inline-formula><mml:math id="M87" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric values in this sector. This pattern is associated with the transition between the Antarctic Circumpolar Current and the Antarctic Slope Current (ASC), which tend to flow in opposite directions <xref ref-type="bibr" rid="bib1.bibx64 bib1.bibx47" id="paren.48"/>. In the Pacific sector, we see the southernmost band turn into the Amundsen Sea and intersect with the Antarctic continental slope, spreading out in a diffuse region that is consistent with the behaviour of the southernmost extent of the ACC, the eastern boundary of the Ross Gyre, and the eastward shelf circulation along the West Antarctic Peninsula <xref ref-type="bibr" rid="bib1.bibx42" id="paren.49"/>.
In this same sector, two large regions of low <inline-formula><mml:math id="M88" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric values spatially correspond to export pathways of Subantarctic Mode Water and Antarctic Intermediate Water <xref ref-type="bibr" rid="bib1.bibx22 bib1.bibx52 bib1.bibx53 bib1.bibx24" id="paren.50"/>. The higher I-metric values delimit the edges of these more coherent thermohaline regimes (Fig. <xref ref-type="fig" rid="Ch1.F3"/>b), which are influenced by basin-scale stratification and the structure of the South Pacific Gyre.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3"><?xmltex \currentcnt{3}?><?xmltex \def\figurename{Figure}?><label>Figure 3</label><caption><p id="d1e1643">The magnitude of the <inline-formula><mml:math id="M89" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric highlights
transitions between coherent thermohaline regimes.
Panel <bold>(a)</bold> is the <inline-formula><mml:math id="M90" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric for a single month (April 2012), and panel
<bold>(b)</bold> is for the time average of the B-SOSE iteration 106 dataset (60 months).
Latitude lines are shown between 80 and 40<inline-formula><mml:math id="M91" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> S
every 10<inline-formula><mml:math id="M92" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>, and longitude lines are shown every 60<inline-formula><mml:math id="M93" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>.
Animations showing month-to-month and interannual variability
are available in the software release <xref ref-type="bibr" rid="bib1.bibx61" id="paren.51"/>.</p></caption>
          <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://os.copernicus.org/articles/17/1545/2021/os-17-1545-2021-f03.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Properties of the thermohaline regimes</title>
      <p id="d1e1712">In order to better understand the coherent thermohaline regimes underlying our <inline-formula><mml:math id="M94" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric results, we examine their lateral extents and their vertical properties. Despite not being given any latitude or longitude information, the underlying GMM captures several coherent, large-scale features of Southern Ocean thermohaline structure (Fig. <xref ref-type="fig" rid="Ch1.F4"/>a). Class 1 contains the coldest waters in the SO, covering both the Weddell and Ross gyres near Antarctica. The mean profile in this class features cold temperatures that are nearly uniform with depth; in general, they are salt stratified in that the near-surface waters are fresher than the subsurface waters, ensuring that the density profile is stable overall (Fig. <xref ref-type="fig" rid="Ch1.F5"/>). The boundary between class 1 and class 2 broadly lies between the classically defined southern ACC Front (SACCF) and the southern boundary (SBDY) <xref ref-type="bibr" rid="bib1.bibx27" id="paren.52"/>, including the turn of the SBDY towards almost being perpendicular with the Antarctic continent in the Pacific sector of the SO. Class 2 is circumpolar, with excursions into the Amundsen Sea and the area just south of Kerguelen Plateau. It features salt stabilisation, with a fresh layer near 300 m (Fig. <xref ref-type="fig" rid="Ch1.F5"/>). Class 3 is also circumpolar, with a northward excursion in the Atlantic sector. The boundary between classes 2 and 3 roughly follows the Polar Front (PF), separating the colder, fresher Antarctic waters from the warmer, saltier subtropical waters further north. Finally, there are two subtropical classes: class 4 represents the Atlantic and Indian sectors of the subtropics, and class 5 represents the large-scale South Pacific Gyre. The boundary between classes 3 and 4 roughly aligns with the Subantarctic Front (SAF), particularly over large portions of the Indian and Pacific sectors (Fig. <xref ref-type="fig" rid="Ch1.F4"/>). The mean of class 5 has a salinity minimum around 700 m, corresponding to the presence of the Antarctic Intermediate Water layer <xref ref-type="bibr" rid="bib1.bibx22" id="paren.53"/>.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4"><?xmltex \currentcnt{4}?><?xmltex \def\figurename{Figure}?><label>Figure 4</label><caption><p id="d1e1739"><bold>(a)</bold> The cluster assignments with <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> and <bold>(b)</bold> the <inline-formula><mml:math id="M96" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric for all present class transitions.
This view highlights the transitions between specific classes. The transitions in panel <bold>(b)</bold>
have some similarities to the altimetric fronts from <xref ref-type="bibr" rid="bib1.bibx27" id="text.54"/>.
These fronts are shown overlain on panel <bold>(b)</bold>: SBDY – southern boundary; SACCF – southern ACC Front;
PF – Polar Front; SAF – Subantarctic Front. Data are from June 2011 as a representative month.</p></caption>
          <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://os.copernicus.org/articles/17/1545/2021/os-17-1545-2021-f04.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5"><?xmltex \currentcnt{5}?><?xmltex \def\figurename{Figure}?><label>Figure 5</label><caption><p id="d1e1784">Profiles of the five GMM clusters between 300 and 1800 m
in <bold>(a)</bold> temperature and <bold>(b)</bold> salinity. This is calculated from
the profiles classified using the statistical model fitted on the training data itself.
The central line is the mean, and the envelope on either side indicates 1 standard deviation.</p></caption>
          <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://os.copernicus.org/articles/17/1545/2021/os-17-1545-2021-f05.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><title>An edge detection approach towards identifying fronts</title>
      <p id="d1e1807">For comparison with the GMM method, which uses properties of an entire training dataset to detect
changes in thermohaline structure, we use a more local front detection method implemented
by <xref ref-type="bibr" rid="bib1.bibx20" id="text.55"/> in the North Atlantic. This method, called the Sobel method,
directly examines spatial gradients in the principal component fields using a Sobel operator <xref ref-type="bibr" rid="bib1.bibx10" id="paren.56"/>.
To do this, the PCs of each grid point are placed onto a rectangular grid with the same spacing as the data sampling,
where points without data are masked.
The strength of an edge at a point is found by the two dimensional
convolution (represented by <inline-formula><mml:math id="M97" display="inline"><mml:msup><mml:mi/><mml:mo>*</mml:mo></mml:msup></mml:math></inline-formula>) of the gridded PCs and the following two matrices.
In the <inline-formula><mml:math id="M98" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> direction the Sobel operator is
            <disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M99" display="block"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mfenced close="]" open="["><mml:mtable class="array" columnalign="left left left"><mml:mtr><mml:mtd><mml:mn mathvariant="normal">1</mml:mn></mml:mtd><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn mathvariant="normal">2</mml:mn></mml:mtd><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn mathvariant="normal">1</mml:mn></mml:mtd><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          and in the <inline-formula><mml:math id="M100" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> direction the Sobel operator is
            <disp-formula id="Ch1.E5" content-type="numbered"><label>5</label><mml:math id="M101" display="block"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mfenced close="]" open="["><mml:mtable class="array" columnalign="left left left"><mml:mtr><mml:mtd><mml:mn mathvariant="normal">1</mml:mn></mml:mtd><mml:mtd><mml:mn mathvariant="normal">2</mml:mn></mml:mtd><mml:mtd><mml:mn mathvariant="normal">1</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
         <?pagebreak page1552?> The effect of this operator is similar to a gradient operator with some smoothing.
There is a correlation coefficient of 0.99 between <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mi mathvariant="normal">PC</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> and the <inline-formula><mml:math id="M103" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> gradient,
and there is a correlation coefficient of 0.999 between <inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mi mathvariant="normal">PC</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> and the <inline-formula><mml:math id="M105" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> gradient of PC1.
The motivation for using the Sobel operator rather than the gradient operator
is principally that it can reduce the noise in data,
as shown by application to photographs <xref ref-type="bibr" rid="bib1.bibx69" id="paren.57"/>.
<xref ref-type="bibr" rid="bib1.bibx20" id="text.58"/> used the magnitude of the <inline-formula><mml:math id="M106" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M107" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> Sobel operators,
which approximates the magnitude of the gradient,
to examine fronts in the Arctic and North Atlantic.
They show that the magnitude of the Sobel gradient
can be thresholded to highlight features such as the Gulf Stream.</p>
      <p id="d1e2021">Rather than working with the gradient magnitude,
Fig. <xref ref-type="fig" rid="Ch1.F6"/> shows <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mi mathvariant="normal">PC</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mi mathvariant="normal">PC</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mi mathvariant="normal">PC</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula>
alone. This is more interpretable, as the <inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mi mathvariant="normal">PC</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> component is strongly correlated with the zonal velocity <inline-formula><mml:math id="M112" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula> (<inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.85</mml:mn></mml:mrow></mml:math></inline-formula>).
<xref ref-type="bibr" rid="bib1.bibx20" id="text.59"/> use a threshold value to define fronts,
but we plot the gradient directly as a colourmap for each PC instead,
which is useful as it does not obscure any information about the fronts themselves.
Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/> shows that the correlation between the Sobel <inline-formula><mml:math id="M114" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>y</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
gradient of PC1 and the meridional velocity, <inline-formula><mml:math id="M115" display="inline"><mml:mi>V</mml:mi></mml:math></inline-formula>,
and the correlation between <inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mi mathvariant="normal">PC</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> and zonal velocity <inline-formula><mml:math id="M117" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula>
increase for roughly the first 2 years of B-SOSE iteration 106,
suggesting that the model is still spinning up to geostrophic balance.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6" specific-use="star"><?xmltex \currentcnt{6}?><?xmltex \def\figurename{Figure}?><label>Figure 6</label><caption><p id="d1e2164"><inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>y</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> Sobel edge operator convolved in two dimensions (*) with the principal component
coefficient fields for the month of June 2011.
The correlation coefficient between panel <bold>(a)</bold> for PC1 and the zonal velocity <inline-formula><mml:math id="M119" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula>
for the same month in B-SOSE iteration 106 is 0.85, showing that the structure it highlights
is substantially similar to the ACC. Panels <bold>(b)</bold> and <bold>(c)</bold> for PC2 and PC3
are also related to the ACC, (correlation coefficients of 0.18 and <inline-formula><mml:math id="M120" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.18 respectively).
The grey line is the 2000 m isobath.</p></caption>
          <?xmltex \igopts{width=369.885827pt}?><graphic xlink:href="https://os.copernicus.org/articles/17/1545/2021/os-17-1545-2021-f06.png"/>

        </fig>

      <p id="d1e2207">The GMM and Sobel methods are complementary. GMM reveals the large-scale temperature and salinity structure associated with changes in stratification, which has traditionally been used to define the fronts, whereas edge detection methods like the Sobel method used here reveals the smaller-scale structure of multiple jets, which can merge and separate. As such, both approaches may be useful ways of characterising ocean structure without making ad hoc assumptions related to particular property values or strict requirements that the structures be circumpolar and continuous. The present proliferation of front definition and analysis methods is driven by the need to expand how the oceanographic community deals with ocean structure across a wide variety of spatial and temporal scales <xref ref-type="bibr" rid="bib1.bibx4" id="paren.60"/>.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Discussion</title>
      <p id="d1e2222">In this section, we discuss the sensitivity of our results to our choice of dataset (Sect. <xref ref-type="sec" rid="Ch1.S4.SS1"/>),
touch on the temporal variability in our results (Sect. <xref ref-type="sec" rid="Ch1.S4.SS2"/>),
discuss a possible connection with the Antarctic Slope Current (Sect. <xref ref-type="sec" rid="Ch1.S4.SS3"/>),
examine the sensitivity of the results to the choice of the number of classes <inline-formula><mml:math id="M121" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> (Sect. <xref ref-type="sec" rid="Ch1.S4.SS4"/>),
and discuss the interpretation of posterior probabilities (Sect. <xref ref-type="sec" rid="Ch1.S4.SS5"/>).</p>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Sensitivity to choice of dataset</title>
      <p id="d1e2250">We chose to use B-SOSE data for this study in order to (1) work with a dataset that features relatively uniform latitude–longitude coverage and (2) to allow us to examine temporal variability as well as spatial variability. B-SOSE is<?pagebreak page1553?> an observationally constrained estimate of the hydrographic structure of the Southern Ocean, so it accurately captures many features of large-scale and mesoscale structure <xref ref-type="bibr" rid="bib1.bibx66" id="paren.61"/>. However, because B-SOSE is a numerical model run, it will no doubt have some biases with respect to observations, particularly on smaller scales. We expect that our results would not change dramatically on basin-wide scales across different state estimate and reanalysis products.</p>
      <p id="d1e2256">To examine the differences of this bias on the class structure and the structure of the inter-class comparison metric, this study could be repeated with a purely observational dataset such as Argo. One trade-off for such a study would be the fact that observational datasets are relatively sparse in terms of both spatial and temporal coverage relative to a state estimate or other numerical model run. One could attempt to use the same GMM trained on B-SOSE with Argo data, but possible biases between B-SOSE and the Argo dataset could make this challenging. It might be possible to adjust for those biases in the data cleaning and preparation step of the analysis; the standardisation process, which is already a part of the analysis presented here, is a step towards this bias removal and correction that may facilitate comparisons between models and observations. Alternatively, one could attempt to re-train the GMM using Argo data alone. This has been done in other studies, so it should be possible in principle (e.g. <xref ref-type="bibr" rid="bib1.bibx36 bib1.bibx25 bib1.bibx51" id="altparen.62"/>).</p>
</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Temporal variability of the fronts</title>
      <p id="d1e2270">We found that the class structure and boundary positions did not
feature large temporal variability with respect to the mean state,
but much more work could be done to examine this variability and its
connection to the processes that determine thermohaline structure
(e.g. surface forcing, subsurface mixing, and advection).
This is outside the scope of the present study, which is focused
on proposing a new metric for identifying and tracking boundaries
in Southern Ocean structure.</p>
</sec>
<sec id="Ch1.S4.SS3">
  <label>4.3</label><title>The Antarctic Slope Current</title>
      <p id="d1e2281">The Antarctic Slope Current (ASC) that separates warmer open-ocean waters from the colder waters on the Antarctic continental shelf is an important component of heat transport in the Southern Ocean. It acts to control the flow of warm water onto the continental shelf and eventually under the floating ice shelves. In a recent paper, <xref ref-type="bibr" rid="bib1.bibx65" id="text.63"/> suggest that if the source of the Antarctic Slope Current (ASC) intersects with the ACC in the Bellingshausen Sea, then the ASC source would be considered a major component of the overturning circulation. In our study, we found a diffuse boundary between classes in the Bellingshausen Sea region, which may be relevant for the physical context of the ASC, which is still under investigation (Fig. <xref ref-type="fig" rid="Ch1.F4"/>b).</p>
</sec>
<sec id="Ch1.S4.SS4">
  <label>4.4</label><title>Sensitivity to the maximum number of classes</title>
      <p id="d1e2298">In this study, we chose <inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> as the number of classes based on sensitivity
tests and also based on a priori knowledge. Specifically, previous studies
used a front structure with five broad regions, delineated by four fronts,
so we might expect a value of around <inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> based on this (e.g.
<xref ref-type="bibr" rid="bib1.bibx43 bib1.bibx49 bib1.bibx27" id="altparen.64"/>).</p>
      <p id="d1e2328">Generally, the choice of the maximum number of classes <inline-formula><mml:math id="M124" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> can be thought of as a way to select models of varying degrees of complexity. Statistical models with lower <inline-formula><mml:math id="M125" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> values are potentially easier to interpret, only capturing the most dominant structures in the dataset. For example, the probabilistic boundary between the two classes in a <inline-formula><mml:math id="M126" display="inline"><mml:mrow><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula> statistical model roughly separates colder, fresher Antarctic waters from the warmer, saltier subtropical waters (Fig. <xref ref-type="fig" rid="Ch1.F7"/>a). Notably, in this case, the magnitude of the <inline-formula><mml:math id="M127" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric appears to largely decrease as we follow it from the Atlantic and Indian basins and into the Pacific Basin, indicating that the boundary becomes less sharp with longitude. This possibly reflects the fact that the Pacific Basin hosts some of the dominant<?pagebreak page1554?> northward export pathways of Subantarctic Mode Water and Antarctic Intermediate Water, consistent with a less sharp transition between polar and subtropical waters <xref ref-type="bibr" rid="bib1.bibx22 bib1.bibx19 bib1.bibx24" id="paren.65"/>. A statistical model with <inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:math></inline-formula> retains most of the features of our analysis with <inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula>, but the transition region closest to Antarctica in <inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> is no longer present.</p>
      <p id="d1e2406">The <inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula> statistical model that we used in this work captures near-Antarctic and circumpolar structure, as well as some subtropical structure. A more complex statistical model with higher <inline-formula><mml:math id="M132" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> would capture more of the subtropical structure (not shown). This is consistent with sensitivity studies using temperature-only Argo data, where increasing <inline-formula><mml:math id="M133" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> added details to the subtropical class structure while leaving the circumpolar class structure largely unchanged <xref ref-type="bibr" rid="bib1.bibx25" id="paren.66"/>. Statistical models with much higher <inline-formula><mml:math id="M134" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> values may capture more structure in the data, but increasing <inline-formula><mml:math id="M135" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> also risks overfitting. That is, if we tune the GMM statistical model to match an increasing number of structures in PC space, we risk losing generality; the goal is to represent the dominant structures of the dataset without overfitting every small variation, some of which could represent noise in the data. This has a direct analogue with overfitting in terms of simple statistical models; it is unwise to use a 10th-order polynomial when a quadratic captures the dominant features of the dataset, because the higher-order polynomial is less likely to generalise to other similar datasets. In addition, statistical models with very high <inline-formula><mml:math id="M136" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> values are increasingly difficult to interpret in terms of our current physical and biogeochemical understanding.
Note that regional studies, such as those carried out in specific sectors of the SO, may find it useful to increase <inline-formula><mml:math id="M137" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> based on local structure (e.g. <xref ref-type="bibr" rid="bib1.bibx51" id="altparen.67"/>). This is consistent with the suggestion by <xref ref-type="bibr" rid="bib1.bibx4" id="text.68"/> that front definitions may need to be more flexible and region-specific, as opposed to expecting a particular definition to apply globally (or even across a single ocean basin).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7"><?xmltex \currentcnt{7}?><?xmltex \def\figurename{Figure}?><label>Figure 7</label><caption><p id="d1e2476">Decreasing <inline-formula><mml:math id="M138" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> removes details from the
statistical description of Southern Ocean thermohaline structure.
Shown is the GMM-derived <inline-formula><mml:math id="M139" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric, using <bold>(a)</bold> <inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula> and <bold>(b)</bold> <inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:math></inline-formula>,
for a monthly average over June 2011. The grey line is 2000 m isobath.</p></caption>
          <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://os.copernicus.org/articles/17/1545/2021/os-17-1545-2021-f07.png"/>

        </fig>

</sec>
<sec id="Ch1.S4.SS5">
  <label>4.5</label><title>Interpreting posterior probabilities</title>
      <p id="d1e2538">The posterior probabilities returned by a Gaussian mixture model are affected by our choice of <inline-formula><mml:math id="M142" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula>.
We should be careful not to over-interpret the posterior probabilities as confidences in the correctness
of the assigned labels. Notably, GMM does not indicate the probability that a given profile belongs
to <italic>none</italic> of the classes in a given statistical model. With that in mind, we can interpret
the posterior probability as a measure of unambiguity in the context of a given statistical model.
When one probability is larger than all others with some margin, the profile is unambiguously classified,
while probabilities of similar magnitude indicate that the profile cannot be unambiguously classified in
the current statistical model with the specified number of classes. In this study, we used the posterior
probability distribution to identify boundaries between coherent thermohaline regimes,
taking advantage of this property of GMM.</p>
</sec>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Conclusions</title>
      <p id="d1e2561">In this study, we proposed a new metric for defining and identifying boundaries between coherent regimes of temperature and salinity structure. Our method uses Gaussian mixture modelling, a type of unsupervised machine learning, to establish a statistical model of thermohaline structure that is intended to capture the large-scale features of the dataset in both PC space and in geographic space. We developed our method in the Southern Ocean due to the presence of circumpolar structures and relatively clear fronts, but our approach could be applied to other regions or even to the global ocean as a whole. The <inline-formula><mml:math id="M143" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric provides a flexible, probabilistic method to define and identify boundaries in an oceanographic dataset without using ad hoc property contours; the boundaries are derived in a generalised method that reflects the structure of the dataset. The <inline-formula><mml:math id="M144" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric has potential as a method for comparing different observational and numerical modelling datasets in a robust, algorithmic way that is not heavily affected by biases in the mean state between datasets. It features a parameter <inline-formula><mml:math id="M145" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> that allows users to increase and decrease the level of complexity of the statistical model; the optimal value of <inline-formula><mml:math id="M146" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> will vary between applications. The Sobel edge detection method may be useful for defining and tracking smaller-scale fronts and jets in model or reanalysis data. As discussed in <xref ref-type="bibr" rid="bib1.bibx4" id="text.69"/>, the field of oceanography needs to consider fronts and boundaries in a more general, application-specific way, due in part to the richness of ocean structure on different spatial scales. The <inline-formula><mml:math id="M147" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula>-metric was designed with this problem in mind; it is intended to be a complementary addition to the oceanographic toolbox as opposed to a replacement for any particular method.</p>
</sec>

      
      </body>
    <back><app-group>

<?pagebreak page1555?><app id="App1.Ch1.S1">
  <?xmltex \currentcnt{A}?><label>Appendix A</label><title>The relation between edge detection and the velocity field</title>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S1.F8"><?xmltex \currentcnt{A1}?><?xmltex \def\figurename{Figure}?><label>Figure A1</label><caption><p id="d1e2616">A comparison between the <inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mi mathvariant="normal">PC</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> Sobel edge detection field
and the zonal velocity, <inline-formula><mml:math id="M149" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula>, at 2 m.
Panels <bold>(a)</bold> and <bold>(b)</bold> are from the monthly mean over June 2011,
whereas panels <bold>(c)</bold> and <bold>(d)</bold> are the mean over all of the monthly means in the dataset.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://os.copernicus.org/articles/17/1545/2021/os-17-1545-2021-f08.png"/>

      </fig>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S1.F9"><?xmltex \currentcnt{A2}?><?xmltex \def\figurename{Figure}?><label>Figure A2</label><caption><p id="d1e2666">The correlation between <inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mi mathvariant="normal">PC</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> and the zonal velocity, <inline-formula><mml:math id="M151" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula>, at 2 m
for each monthly mean in the B-SOSE iteration 106 dataset.
The increase in the correlation over the first 2 years
could be interpreted as the spin-up.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=497.923228pt}?><graphic xlink:href="https://os.copernicus.org/articles/17/1545/2021/os-17-1545-2021-f09.png"/>

      </fig>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S1.F10"><?xmltex \currentcnt{A3}?><?xmltex \def\figurename{Figure}?><label>Figure A3</label><caption><p id="d1e2704">A comparison between the <inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mi mathvariant="normal">PC</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> Sobel edge detection field
and the meridional velocity, <inline-formula><mml:math id="M153" display="inline"><mml:mi>V</mml:mi></mml:math></inline-formula>, at 2 m. Panels <bold>(a)</bold> and <bold>(b)</bold>
are from the monthly mean over June 2011,
whereas panels <bold>(c)</bold> and <bold>(d)</bold> are the mean over all of the
monthly means in the dataset.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://os.copernicus.org/articles/17/1545/2021/os-17-1545-2021-f10.png"/>

      </fig>

<?xmltex \hack{\clearpage}?><?xmltex \floatpos{h!}?><fig id="App1.Ch1.S1.F11"><?xmltex \currentcnt{A4}?><?xmltex \def\figurename{Figure}?><label>Figure A4</label><caption><p id="d1e2755">The correlation between the <inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mi mathvariant="normal">PC</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> and the meridional
velocity, <inline-formula><mml:math id="M155" display="inline"><mml:mi>V</mml:mi></mml:math></inline-formula>, at 2 m for each monthly mean in the B-SOSE iteration 106 dataset.
The increase in the correlation coefficient over the first
2 years could be interpreted as the spin-up, as in Fig. <xref ref-type="fig" rid="App1.Ch1.S1.F9"/></p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=497.923228pt}?><graphic xlink:href="https://os.copernicus.org/articles/17/1545/2021/os-17-1545-2021-f11.png"/>

      </fig>

      <p id="d1e2791">Figures <xref ref-type="fig" rid="App1.Ch1.S1.F8"/> and  <xref ref-type="fig" rid="App1.Ch1.S1.F10"/>
illustrate the spatial resemblance between
<inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mi mathvariant="normal">PC</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M157" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula> and between <inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:msub><mml:mi>G</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mi mathvariant="normal">PC</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M159" display="inline"><mml:mi>V</mml:mi></mml:math></inline-formula> respectively, compared over June 2011 or
as an average over the full B-SOSE period.
The domain-averaged correlation is shown quantitatively
in Figs. <xref ref-type="fig" rid="App1.Ch1.S1.F9"/> and  <xref ref-type="fig" rid="App1.Ch1.S1.F11"/>, where there
is an especially high correlation between the two in the last
couple of years of the reanalysis product.
That Figs. <xref ref-type="fig" rid="App1.Ch1.S1.F9"/> and  <xref ref-type="fig" rid="App1.Ch1.S1.F11"/> show opposite signs in
the correlation is equivalent to the reversal in sign
that we would expect <xref ref-type="bibr" rid="bib1.bibx5" id="paren.70"><named-content content-type="post">chap. 15 and 18</named-content></xref>.
Those figures also show that the magnitude of the correlation
between the respective variables increases during the first 2 years of the dataset before flattening off.
This is suggestive of the model spinning up towards geostrophic balance.
As the first principal component statistically explains the first-order structure
in the ocean, it primarily represents the density contrast
produced by the thermohaline structure from the tropics to the
poles.</p><?xmltex \hack{\clearpage}?>
</app>

<?pagebreak page1557?><app id="App1.Ch1.S2">
  <?xmltex \currentcnt{B}?><label>Appendix B</label><title>Principal component structure</title>
      <p id="d1e2869">In this work, we use principal component expansion for dimension reduction and to examine the structure of the Southern Ocean. In this appendix, we display the principal components (i.e. eigenvectors) used in this expansion. First, we examine the means of the temperature and salinity structure across the entire dataset (Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F12"/>). The temperature decreases with depth, whilst the salinity has a minimum around 750 m, in part associated with the presence of Antarctic Bottom Water (AAIW).</p>
      <p id="d1e2874">The structure of the first three principal components (i.e. eigenvectors) reflects small variations on the mean structure (Fig. <xref ref-type="fig" rid="App1.Ch1.S2.F13"/>). The mean profiles are similar, but the variation associated with an increase or decrease in the principal component value changes with depth. The first principal component (PC1) explains 76 % of the variability in the dataset, notably consisting of variations throughout the mid-range of the profiles in temperature and throughout nearly the entire depth range in salinity. The second principal component explains 16 % of the variability and consists of larger changes in the upper part of the profile, above roughly 1000 m. The third principal component (PC3) explains 7 % of the variance and exhibits variations above and below a somewhat fixed mid-point. After principal component expansion is applied, each profile is represented by just three numbers, i.e. the eigenvalues of the principal component expansion.</p>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S2.F12"><?xmltex \currentcnt{B1}?><?xmltex \def\figurename{Figure}?><label>Figure B1</label><caption><p id="d1e2881">The means and standard deviation of samples
taken from B-SOSE iteration 106.</p></caption>
        <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://os.copernicus.org/articles/17/1545/2021/os-17-1545-2021-f12.png"/>

      </fig>

<?xmltex \hack{\newpage}?><?xmltex \floatpos{h!}?><fig id="App1.Ch1.S2.F13"><?xmltex \currentcnt{B2}?><?xmltex \def\figurename{Figure}?><label>Figure B2</label><caption><p id="d1e2894">The temperature and salinity components of the three retained principal
components that were used in this work. Specifically, they are PC1 (left column),
PC2 (middle column), and PC3 (right column). Shown are the mean structures (black lines)
with the effect of adding (red line) or removing (blue line) one unit of
a principal component as a deviation from the
mean profile, after Fig. 4 in <xref ref-type="bibr" rid="bib1.bibx44" id="text.71"/>.
When compared to  Fig. <xref ref-type="fig" rid="Ch1.F1"/> we can see that PC1 corresponds to the hot–cold north–south contrast.</p></caption>
        <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://os.copernicus.org/articles/17/1545/2021/os-17-1545-2021-f13.png"/>

      </fig>

<?xmltex \hack{\clearpage}?>
</app>

<?pagebreak page1558?><app id="App1.Ch1.S3">
  <?xmltex \currentcnt{C}?><label>Appendix C</label><title>Gaussian mixture modelling</title>
      <p id="d1e2918">A Gaussian mixture model (GMM) attempts to represent a dataset
using a linear combination of multidimensional Gaussian distributions.
A multidimensional Gaussian (Eq. <xref ref-type="disp-formula" rid="App1.Ch1.S3.E6"/>) is a simple generalisation
of a Gaussian to <inline-formula><mml:math id="M160" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> dimensions.
          <disp-formula id="App1.Ch1.S3.E6" content-type="numbered"><label>C1</label><mml:math id="M161" display="block"><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="script">N</mml:mi></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi>exp⁡</mml:mi><mml:mfenced close="]" open="["><mml:mrow><mml:mo>-</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mi>T</mml:mi></mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow><mml:msqrt><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">π</mml:mi><mml:msup><mml:mo>)</mml:mo><mml:mi>D</mml:mi></mml:msup><mml:mfenced close="∥" open="∥"><mml:mrow><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:msqrt></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
        where <inline-formula><mml:math id="M162" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> is the index for the <inline-formula><mml:math id="M163" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>th cluster of <inline-formula><mml:math id="M164" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> clusters,
<inline-formula><mml:math id="M165" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> is the index for the <inline-formula><mml:math id="M166" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>th data point of <inline-formula><mml:math id="M167" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> data points,
and <inline-formula><mml:math id="M168" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> corresponds to the three principal components of the data.</p>
      <p id="d1e3100">We make the assumption that the probability distribution
that generated the dataset can be approximated by a
set of multivariate Gaussians
(Eq. <xref ref-type="disp-formula" rid="App1.Ch1.S3.E7"/>):
          <disp-formula id="App1.Ch1.S3.E7" content-type="numbered"><label>C2</label><mml:math id="M169" display="block"><mml:mrow><mml:mover accent="true"><mml:mi mathvariant="double-struck">P</mml:mi><mml:mo mathvariant="normal" stretchy="true">̃</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>≃</mml:mo><mml:mi mathvariant="double-struck">P</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:munderover><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="script">N</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>;</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p>
      <p id="d1e3187">Any probability distribution function (PDF) could be described by
an arbitrarily large number of Gaussians (Eq. <xref ref-type="disp-formula" rid="App1.Ch1.S3.E8"/>),
but to be a good method of describing the data, this should be a manageable number.
          <disp-formula id="App1.Ch1.S3.E8" content-type="numbered"><label>C3</label><mml:math id="M170" display="block"><mml:mrow><mml:mover accent="true"><mml:mi mathvariant="double-struck">P</mml:mi><mml:mo stretchy="true" mathvariant="normal">̃</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:munder><mml:mo movablelimits="false">lim⁡</mml:mo><mml:mrow><mml:mi>K</mml:mi><mml:mo>→</mml:mo><mml:mi mathvariant="normal">∞</mml:mi></mml:mrow></mml:munder><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:munderover><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="script">N</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>;</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></disp-formula></p>
      <p id="d1e3268">In this paper, we showed that
our Southern Ocean thermohaline dataset can
be fairly represented as a series
of plateau-like regions in PC variable
space; thus, it can be approximated
by a PDF made from a set of multivariate
Gaussians, where the boundaries between these Gaussians
correspond to the fronts (Fig. <xref ref-type="fig" rid="Ch1.F2"/>).</p>
<sec id="App1.Ch1.S3.SS1">
  <label>C1</label><title>Expectation maximisation</title>
      <p id="d1e3281">To initialise the method, the first <inline-formula><mml:math id="M171" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> clusters are created randomly.
Next, the set of Gaussians is iteratively adjusted
(Eqs. <xref ref-type="disp-formula" rid="App1.Ch1.S3.E9"/>, <xref ref-type="disp-formula" rid="App1.Ch1.S3.E10"/>, and <xref ref-type="disp-formula" rid="App1.Ch1.S3.E11"/>)
until it reaches a local minimum in the cost function <xref ref-type="bibr" rid="bib1.bibx36" id="paren.72"/>.
It is generally expected that reducing the number of dimensions in the preprocessing
step helps improve the convergence.
The following section draws heavily from <xref ref-type="bibr" rid="bib1.bibx36" id="text.73"/>.</p>
      <p id="d1e3304">The expectation of the model given the data is increased by updating
the weights <inline-formula><mml:math id="M172" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, means <inline-formula><mml:math id="M173" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, and covariance matrices
<inline-formula><mml:math id="M174" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in the following way:

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M175" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="App1.Ch1.S3.E9"><mml:mtd><mml:mtext>C4</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msup><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>N</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover><mml:mi mathvariant="double-struck">P</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>k</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>|</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>;</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msup><mml:mfenced open="{" close="}"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>,</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="App1.Ch1.S3.E10"><mml:mtd><mml:mtext>C5</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><?xmltex \hack{\hbox\bgroup\fontsize{9.5}{9.5}\selectfont$\displaystyle}?><mml:msup><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup><mml:mi mathvariant="double-struck">P</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>k</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>|</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>;</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msup><mml:mfenced close="}" open="{"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:mfenced><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup><mml:mi mathvariant="double-struck">P</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>k</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>|</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>;</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msup><mml:mfenced open="{" close="}"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo><?xmltex \hack{$\egroup}?></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="App1.Ch1.S3.E11"><mml:mtd><mml:mtext>C6</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:msup><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><?xmltex \hack{\hbox\bgroup\fontsize{6.6}{6.6}\selectfont$\displaystyle}?><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup><mml:mi mathvariant="double-struck">P</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msup><mml:mi>c</mml:mi><mml:mi>n</mml:mi></mml:msup><mml:mo>=</mml:mo><mml:mi>k</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>|</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msup><mml:mo>;</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msup><mml:mfenced open="{" close="}"><mml:mrow><mml:msup><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msup><mml:mo>,</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msup><mml:munder><mml:mi mathvariant="italic">μ</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:munder><mml:mi>k</mml:mi></mml:msup><mml:mo>,</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msup></mml:mrow></mml:mfenced><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:mfenced><mml:mo>⋅</mml:mo><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msup><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:mfenced><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mi>T</mml:mi></mml:msup></mml:mrow><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup><mml:mi mathvariant="double-struck">P</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>k</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>|</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>;</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msup><mml:mfenced close="}" open="{"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo><?xmltex \hack{$\egroup}?></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            where <inline-formula><mml:math id="M176" display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the classification of the <inline-formula><mml:math id="M177" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>th cluster which could be any
one of the <inline-formula><mml:math id="M178" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> clusters. This is repeated until the parameters of the
model have converged.</p>
</sec>
<sec id="App1.Ch1.S3.SS2">
  <label>C2</label><title>Information criterion</title>
      <p id="d1e3904">GMM needs an input hyperparameter <inline-formula><mml:math id="M179" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> that sets the
number of clusters that will be fitted to the data.
GMM is relatively cheap to run, and so it
is reasonable to run it with a large range of
<inline-formula><mml:math id="M180" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> and choose the <inline-formula><mml:math id="M181" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> which best describes them.
The commonly used criterions are the Bayesian information
criterion (BIC) (Eq. <xref ref-type="disp-formula" rid="App1.Ch1.S3.E12"/>) and the
Akaike information criterion (AIC) (Eq. <xref ref-type="disp-formula" rid="App1.Ch1.S3.E13"/>).
They both essentially contain a term that measures the
agreement of the model to the data, and they have a penalty term
for the number of parameters that have been used to achieve
this (related to <inline-formula><mml:math id="M182" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula>).
Thus, we are looking for minima in AIC/BIC to guide our choice of <inline-formula><mml:math id="M183" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula>. There
is no clear minimum for this dataset in <inline-formula><mml:math id="M184" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> for <inline-formula><mml:math id="M185" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>≤</mml:mo><mml:mi>K</mml:mi><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula>, which is typical of
oceanographic applications due in part to the highly correlated nature of the data
(e.g. <xref ref-type="bibr" rid="bib1.bibx57 bib1.bibx25" id="altparen.74"/>).
Because <inline-formula><mml:math id="M186" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> is weakly constrained, we are able to select a lower value
of <inline-formula><mml:math id="M187" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> for ease of interpretation, having verified that it captures the
large-scale structure of the data in PC space, which is suitable for our
purposes. BIC and AIC take the following forms:

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M188" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="App1.Ch1.S3.E12"><mml:mtd><mml:mtext>C7</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi mathvariant="normal">BIC</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="script">L</mml:mi><mml:mo>(</mml:mo><mml:mi>K</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">η</mml:mi><mml:mi>f</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo><mml:mi>log⁡</mml:mi><mml:mo>(</mml:mo><mml:mi>N</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="App1.Ch1.S3.E13"><mml:mtd><mml:mtext>C8</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mtext>with</mml:mtext></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="italic">η</mml:mi><mml:mi>f</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>K</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi>K</mml:mi><mml:mi>D</mml:mi><mml:mo>+</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi>K</mml:mi><mml:mi>D</mml:mi><mml:mo>(</mml:mo><mml:mi>D</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mtext>and</mml:mtext></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="normal">AIC</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>K</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="script">L</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            where the log-likelihood is expressed as
            <disp-formula id="App1.Ch1.S3.E14" content-type="numbered"><label>C9</label><mml:math id="M189" display="block"><mml:mrow><?xmltex \hack{\hbox\bgroup\fontsize{8.8}{8.8}\selectfont$\displaystyle}?><mml:mi mathvariant="script">L</mml:mi><mml:mo>=</mml:mo><mml:mi>log⁡</mml:mi><mml:mo>[</mml:mo><mml:mi mathvariant="double-struck">P</mml:mi><mml:mo>(</mml:mo><mml:mi>X</mml:mi><mml:mo>)</mml:mo><mml:mo>]</mml:mo><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:munderover><mml:mi>log⁡</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:munderover><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="script">N</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>;</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>.</mml:mo><?xmltex \hack{$\egroup}?></mml:mrow></mml:math></disp-formula></p>
</sec>
<sec id="App1.Ch1.S3.SS3">
  <label>C3</label><title>Labelling the dataset</title>
      <?pagebreak page1559?><p id="d1e4237">Each data point is assigned a posterior probability
distribution across the <inline-formula><mml:math id="M190" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> clusters (Eq. <xref ref-type="disp-formula" rid="App1.Ch1.S3.E15"/>).
This uncertainty information is one of the useful features of GMM.
The probability takes the following form:
            <disp-formula id="App1.Ch1.S3.E15" content-type="numbered"><label>C10</label><mml:math id="M191" display="block"><mml:mrow><?xmltex \hack{\hbox\bgroup\fontsize{8.5}{8.5}\selectfont$\displaystyle}?><mml:mi mathvariant="double-struck">P</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>k</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>|</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>;</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>,</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="script">N</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>;</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:msub><mml:mo>∑</mml:mo><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:msubsup><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="script">N</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>;</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo><?xmltex \hack{$\egroup}?></mml:mrow></mml:math></disp-formula></p>
      <p id="d1e4390">To label a dataset, each data point is assigned
a label from the cluster that it would be the most
likely to be generated by,
in a statistical sense (Eq. <xref ref-type="disp-formula" rid="App1.Ch1.S3.E16"/>).
            <disp-formula id="App1.Ch1.S3.E16" content-type="numbered"><label>C11</label><mml:math id="M192" display="block"><mml:mrow><mml:mi mathvariant="script">C</mml:mi><mml:mo>=</mml:mo><mml:mi>arg⁡</mml:mi><mml:munder><mml:mo movablelimits="false">max⁡</mml:mo><mml:mi>k</mml:mi></mml:munder><mml:mfenced close=")" open="("><mml:mrow><mml:mi mathvariant="double-struck">P</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>k</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>|</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>;</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="normal">Σ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:math></disp-formula></p>
</sec>
</app>
  </app-group><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d1e4477">B-SOSE iteration 106 state estimate data are available from the Scripps Institution of Oceanography
(<uri>http://sose.ucsd.edu/BSOSE6_iter106_solution.html</uri>, <xref ref-type="bibr" rid="bib1.bibx67" id="altparen.75"/>). The MITgcm source code that was used to create B-SOSE is available on GitHub (<uri>https://github.com/MITgcm/MITgcm/tree/checkpoint67z</uri>, last access: 16 June 2021, <ext-link xlink:href="https://doi.org/10.5281/zenodo.4968496" ext-link-type="DOI">10.5281/zenodo.4968496</ext-link>, <xref ref-type="bibr" rid="bib1.bibx2" id="altparen.76"/>). Original climatological front positions from <xref ref-type="bibr" rid="bib1.bibx27" id="text.77"/> are available on ResearchGate (<uri>https://www.researchgate.net/publication/338420242_ACC_fronts</uri>,  <xref ref-type="bibr" rid="bib1.bibx26" id="altparen.78"/>). The code used to carry out the analysis and figure creation for this paper is available via Zenodo <xref ref-type="bibr" rid="bib1.bibx61" id="paren.79"/> (up-to-date repository: <uri>https://github.com/so-wise/so-fronts</uri>, last access: 4 May 2021, <ext-link xlink:href="https://doi.org/10.5281/zenodo.5500666" ext-link-type="DOI">10.5281/zenodo.5500666</ext-link>, <xref ref-type="bibr" rid="bib1.bibx61" id="altparen.80"/>). This software uses scikit-learn <xref ref-type="bibr" rid="bib1.bibx48" id="paren.81"/> and pyxpcm <xref ref-type="bibr" rid="bib1.bibx35" id="paren.82"/> as foundations. We used Cartopy for mapping <xref ref-type="bibr" rid="bib1.bibx39" id="paren.83"/>.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e4530">DCJ designed the initial project, and SDAT and DCJ developed it further.
SDAT wrote the software <xref ref-type="bibr" rid="bib1.bibx61" id="paren.84"/>, performed the analysis, and created the figures.
AF proposed a significant improvement to the inter-class metric.
EM and EP provided expert guidance on fronts, Southern Ocean structure,
and dynamics. SDAT and DCJ wrote the initial manuscript, and all authors assisted with edits.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e4539">The contact author has declared that neither they nor their co-authors have any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d1e4545">Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e4551">This work originated as a Natural Environment Research Council
(NERC) Research Experience Placement (REP) project funded by
the SPITFIRE Doctoral Training Partnership (grant no. NE/S007210/1).
Simon Thomas is supported by studentship 2413578 from the UKRI Centre for Doctoral Training in
Application of Artificial Intelligence to the study of Environmental Risks (grant no. EP/S022961/1).
Daniel Jones is supported by a UKRI Future Leaders Fellowship (grant no. MR/T020822/1).
Daniel Jones and Simon Thomas also received funding from the NERC ACSIS project (grant no. NE/N018028/1).
Etienne Pauthenet received funding from the European Research Council (ERC) under the European
Union’s Horizon 2020 Research and Innovation programme (grant no. 637770).
The authors would like to thank Emma Boland, Peter Haynes, Guillaume Maze,
and John Taylor for comments that improved the quality of this work.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e4556">This research has been supported by the Natural Environment Research Council (grant nos. NE/S007210/1 and NE/N018028/1), UK Research and Innovation (grant nos. MR/T020822/1 and EP/S022961/1), and the European Research Council H2020 programme (grant no. 637770; WAPITI).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e4562">This paper was edited by Katsuro Katsumata and reviewed by three anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><?xmltex \def\ref@label{{Amante and Eakins(2009)}}?><label>Amante and Eakins(2009)</label><?label amante_etopo1_2009?><mixed-citation>Amante, C. and Eakins, B. W.: ETOPO1 Global Relief Model converted to
PanMap layer format, PANGAEA [data set], <ext-link xlink:href="https://doi.org/10.1594/PANGAEA.769615" ext-link-type="DOI">10.1594/PANGAEA.769615</ext-link>,  2009.</mixed-citation></ref>
      <ref id="bib1.bibx2"><?xmltex \def\ref@label{Campin et al.(2021)}?><label>Campin et al.(2021)</label><?label data?><mixed-citation>Campin, J.-M., Heimbach, P., Losch, M., Forget, G.,
edhill3; Adcroft, A., amolod, Menemenlis, D., dfer22, Hill, C., Jahn, O.,
Scott, J., stephdut, Mazloff, M., Fox-Kemper, B., antnguyen13, Doddridge, E.,
Fenty, I., Bates, M., AndrewEichmann-NOAA, Smith, T., Martin, T.,
Lauderdale, J., Abernathey, R., samarkhatiwala, hongandyan,
Deremble, B., dngoldberg, Bourgault, P., and  Dussin, R.: MITgcm/MITgcm: checkpoint67z, Zenodo [code],
<ext-link xlink:href="https://doi.org/10.5281/zenodo.4968496" ext-link-type="DOI">10.5281/zenodo.4968496</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx3"><?xmltex \def\ref@label{{Chapman(2017)}}?><label>Chapman(2017)</label><?label chapman2017new?><mixed-citation>Chapman, C. C.: New perspectives on frontal variability in the Southern Ocean,
J. Phys. Oceanogr., 47, 1151–1168,
<ext-link xlink:href="https://doi.org/10.1175/JPO-D-16-0222.1" ext-link-type="DOI">10.1175/JPO-D-16-0222.1</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx4"><?xmltex \def\ref@label{{Chapman et~al.(2020)Chapman, Lea, Meyer, Sall{\'{e}}e, and
Hindell}}?><label>Chapman et al.(2020)Chapman, Lea, Meyer, Sallée, and
Hindell</label><?label chapman2020defining?><mixed-citation>Chapman, C. C., Lea, M.-A., Meyer, A., Sallée, J.-B., and Hindell, M.:
Defining Southern Ocean fronts and their influence on biological and physical
processes in a changing climate, Nat. Clim. Change, 10, 209–219,
<ext-link xlink:href="https://doi.org/10.1038/s41558-020-0705-4" ext-link-type="DOI">10.1038/s41558-020-0705-4</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx5"><?xmltex \def\ref@label{{Cushman-Roisin and Beckers(2011)}}?><label>Cushman-Roisin and Beckers(2011)</label><?label cushman2011introduction?><mixed-citation>
Cushman-Roisin, B. and Beckers, J.-M.: Introduction to geophysical fluid
dynamics: physical and numerical aspects, Academic Press, ISBN 9780120887590, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx6"><?xmltex \def\ref@label{{Dai and Trenberth(2002)}}?><label>Dai and Trenberth(2002)</label><?label dai_estimates_2002?><mixed-citation>Dai, A. and Trenberth, K. E.: Estimates of Freshwater Discharge from
Continents: Latitudinal and Seasonal Variations, J.
Hydrometeorol., 3, 660–687,
<ext-link xlink:href="https://doi.org/10.1175/1525-7541(2002)003&lt;0660:EOFDFC&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1525-7541(2002)003&lt;0660:EOFDFC&gt;2.0.CO;2</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx7"><?xmltex \def\ref@label{{Deacon(1937)}}?><label>Deacon(1937)</label><?label deacon_hydrology_1937?><mixed-citation>
Deacon, G.: The hydrology of the Southern Ocean, Cambridge University Press, Macmillan, London, New York, 1937.</mixed-citation></ref>
      <ref id="bib1.bibx8"><?xmltex \def\ref@label{{Dee et~al.(2011)}}?><label>Dee et al.(2011)</label><?label dee_era-interim_2011?><mixed-citation>Dee, D. P., Uppala, S. M., Simmons, A. J., Berrisford, P., Poli, P., Kobayashi,
S., Andrae, U., Balmaseda, M. A., Balsamo, G., Bauer, P., Bechtold, P.,
Beljaars, A. C. M., van de Berg, L., Bidlot, J., Bormann, N., Delsol, C.,
Dragani, R., Fuentes, M., Geer, A. J., Haimberger, L., Healy, S. B.,
Hersbach, H., Halm, E. V., Isaksen, L.,  Kållberg, P.,
Köhler, M., Matricardi, M., McNally, A. P., Monge Sanz, B. M., Morcrette,
J. J., Park, B. K., Peubey, C., de Rosnay, P., Tavolato, C., Thépaut,
J. N., and Vitart, F.: The ERA-In<?pagebreak page1560?>terim reanalysis: configuration and
performance of the data assimilation system, Q. J. Roy.
Meteor. Soc., 137, 553–597, <ext-link xlink:href="https://doi.org/10.1002/qj.828" ext-link-type="DOI">10.1002/qj.828</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx9"><?xmltex \def\ref@label{{Donohue et~al.(2016)Donohue, Tracey, Watts, Chidichimo, and
Chereskin}}?><label>Donohue et al.(2016)Donohue, Tracey, Watts, Chidichimo, and
Chereskin</label><?label donohue_mean_2016?><mixed-citation>Donohue, K. A., Tracey, K. L., Watts, D. R., Chidichimo, M. P., and Chereskin,
T. K.: Mean Antarctic Circumpolar Current transport measured in Drake
Passage: MEAN ACC TRANSPORT, Geophys. Res. Lett., 43,
11760–11767, <ext-link xlink:href="https://doi.org/10.1002/2016GL070319" ext-link-type="DOI">10.1002/2016GL070319</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx10"><?xmltex \def\ref@label{{Duda and Hart(1973)}}?><label>Duda and Hart(1973)</label><?label sobel19683x3?><mixed-citation>
Duda, R. and Hart P.: Pattern Classification and Scene Analysis (1st
Edition), John Wiley and Sons, New York, ISBN: 9780471223610,
271–272, 1973.</mixed-citation></ref>
      <ref id="bib1.bibx11"><?xmltex \def\ref@label{{Fenty and Heimbach(2013)}}?><label>Fenty and Heimbach(2013)</label><?label fenty_coupled_2013?><mixed-citation>Fenty, I. and Heimbach, P.: Coupled sea ice–ocean-state estimation in the
Labrador Sea and Baffin Bay, J. Phys. Oceanogr., 43, 884–904,
<ext-link xlink:href="https://doi.org/10.1175/JPO-D-12-065.1" ext-link-type="DOI">10.1175/JPO-D-12-065.1</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx12"><?xmltex \def\ref@label{{Forget et~al.(2015)Forget, Campin, Heimbach, Hill, Ponte, and
Wunsch}}?><label>Forget et al.(2015)Forget, Campin, Heimbach, Hill, Ponte, and
Wunsch</label><?label forget_ecco_2015?><mixed-citation>Forget, G., Campin, J.-M., Heimbach, P., Hill, C. N., Ponte, R. M., and Wunsch, C.: ECCO version 4: an integrated framework for non-linear inverse modeling and global ocean state estimation, Geosci. Model Dev., 8, 3071–3104, <ext-link xlink:href="https://doi.org/10.5194/gmd-8-3071-2015" ext-link-type="DOI">10.5194/gmd-8-3071-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx13"><?xmltex \def\ref@label{{Frenger et~al.(2015)Frenger, Munnich, Gruber, and
Knutti}}?><label>Frenger et al.(2015)Frenger, Munnich, Gruber, and
Knutti</label><?label frenger_southern_2015?><mixed-citation>Frenger, I., Munnich, M., Gruber, N., and Knutti, R.: Southern Ocean eddy
phenomenology, J. Geophys. Res.-Oceans, 120, 7413–7449,
<ext-link xlink:href="https://doi.org/10.1002/2015JC011047" ext-link-type="DOI">10.1002/2015JC011047</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx14"><?xmltex \def\ref@label{{Frolicher et~al.(2015)Frolicher, Sarmiento, Paynter, Dunne, Krasting,
and Winton}}?><label>Frolicher et al.(2015)Frolicher, Sarmiento, Paynter, Dunne, Krasting,
and Winton</label><?label frolicher_dominance_2015?><mixed-citation>Frolicher, T. L., Sarmiento, J. L., Paynter, D. J., Dunne, J. P., Krasting,
J. P., and Winton, M.: Dominance of the Southern Ocean in Anthropogenic
Carbon and Heat Uptake in CMIP5 Models, J. Climate, 28,
862–886, <ext-link xlink:href="https://doi.org/10.1175/JCLI-D-14-00117.1" ext-link-type="DOI">10.1175/JCLI-D-14-00117.1</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx15"><?xmltex \def\ref@label{{Gaspar et~al.(1990)Gaspar, Gr{\'{e}}goris, and
Lefevre}}?><label>Gaspar et al.(1990)Gaspar, Grégoris, and
Lefevre</label><?label gaspar_simple_1990?><mixed-citation>Gaspar, P., Grégoris, Y., and Lefevre, J. M.: A simple eddy kinetic energy
model for simulations of the oceanic vertical mixing: Tests at station
Papa and long-term upper ocean study site, J. Geophys. Res.-Atmos., 95, 16179–16193, <ext-link xlink:href="https://doi.org/10.1029/JC095iC09p16179" ext-link-type="DOI">10.1029/JC095iC09p16179</ext-link>, 1990.</mixed-citation></ref>
      <ref id="bib1.bibx16"><?xmltex \def\ref@label{{Graham et~al.(2012)Graham, Boer, Heywood, Chapman, and
Stevens}}?><label>Graham et al.(2012)Graham, Boer, Heywood, Chapman, and
Stevens</label><?label graham_southern_2012?><mixed-citation>Graham, R. M., Boer, A. M. d., Heywood, K. J., Chapman, M. R., and Stevens,
D. P.: Southern Ocean fronts: Controlled by wind or topography?, J.
Geophys. Res.-Oceans, 117, 2156–2202, <ext-link xlink:href="https://doi.org/10.1029/2012JC007887" ext-link-type="DOI">10.1029/2012JC007887</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx17"><?xmltex \def\ref@label{{Hallberg(2013)}}?><label>Hallberg(2013)</label><?label hallberg?><mixed-citation>Hallberg, R.: Using a resolution function to regulate parameterizations of
oceanic mesoscale eddy effects, Ocean Model., 72, 92–103,
<ext-link xlink:href="https://doi.org/10.1016/j.ocemod.2013.08.007" ext-link-type="DOI">10.1016/j.ocemod.2013.08.007</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx18"><?xmltex \def\ref@label{{Hammond and Jones(2016)}}?><label>Hammond and Jones(2016)</label><?label hammond_freshwater_2016?><mixed-citation>Hammond, M. D. and Jones, D. C.: Freshwater flux from ice sheet melting and
iceberg calving in the Southern Ocean, Geosci. Data J., 3,
60–62, <ext-link xlink:href="https://doi.org/10.1002/gdj3.43" ext-link-type="DOI">10.1002/gdj3.43</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx19"><?xmltex \def\ref@label{{Herraiz-Borreguero and
Rintoul(2011)}}?><label>Herraiz-Borreguero and
Rintoul(2011)</label><?label herraiz-borreguero_subantarctic_2011?><mixed-citation>Herraiz-Borreguero, L. and Rintoul, S. R.: Subantarctic mode water:
distribution and circulation, Ocean Dynam., 61, 103–126,
<ext-link xlink:href="https://doi.org/10.1007/s10236-010-0352-9" ext-link-type="DOI">10.1007/s10236-010-0352-9</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx20"><?xmltex \def\ref@label{{Hjelmervik and Hjelmervik(2019)}}?><label>Hjelmervik and Hjelmervik(2019)</label><?label hjelmervik2019detection?><mixed-citation>Hjelmervik, K. B. and Hjelmervik, K. T.: Detection of oceanographic fronts on
variable water depths using empirical orthogonal functions, IEEE J.
Oceanic Eng., 45, 915–926, <ext-link xlink:href="https://doi.org/10.1109/JOE.2019.2917456" ext-link-type="DOI">10.1109/JOE.2019.2917456</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx21"><?xmltex \def\ref@label{{Houghton and Wilson(2020)}}?><label>Houghton and Wilson(2020)</label><?label houghton_nino_2020?><mixed-citation>Houghton, I. A. and Wilson, J. D.: El Niño Detection Via Unsupervised
Clustering of Argo Temperature Profiles, J. Geophys.
Res.-Oceans, 125, e2019JC015947, <ext-link xlink:href="https://doi.org/10.1029/2019JC015947" ext-link-type="DOI">10.1029/2019JC015947</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx22"><?xmltex \def\ref@label{{Iudicone et~al.(2007)Iudicone, Rodgers, Schopp, and
Madec}}?><label>Iudicone et al.(2007)Iudicone, Rodgers, Schopp, and
Madec</label><?label iudicone_exchange_2007?><mixed-citation>Iudicone, D., Rodgers, K., Schopp, R., and Madec, G.: An exchange window for
the injection of Antarctic Intermediate Water into the South
Pacific, J. Phys. Oceanogr., 37, 31–49,
<ext-link xlink:href="https://doi.org/10.1175/JPO2985.1" ext-link-type="DOI">10.1175/JPO2985.1</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx23"><?xmltex \def\ref@label{{Jones and Ito(2019)}}?><label>Jones and Ito(2019)</label><?label jones:19:ci?><mixed-citation>Jones, D. C. and Ito, T.: Gaussian mixture modeling describes the geography of
the surface ocean carbon budget, NCAR, 108–113, <ext-link xlink:href="https://doi.org/10.5065/y82j-f154" ext-link-type="DOI">10.5065/y82j-f154</ext-link>,
2019.</mixed-citation></ref>
      <ref id="bib1.bibx24"><?xmltex \def\ref@label{{Jones et~al.(2016)Jones, Meijers, Shuckburgh, Sall{\'{e}}e, Haynes,
McAufield, and Mazloff}}?><label>Jones et al.(2016)Jones, Meijers, Shuckburgh, Sallée, Haynes,
McAufield, and Mazloff</label><?label jones_how_2016?><mixed-citation>Jones, D. C., Meijers, A. J. S., Shuckburgh, E., Sallée, J.-B., Haynes, P.,
McAufield, E. K., and Mazloff, M. R.: How does Subantarctic Mode Water
ventilate the Southern Hemisphere subtropics?, J. Geophys.  Res.-Oceans, 121, 6558–6582, <ext-link xlink:href="https://doi.org/10.1002/2016jc011680" ext-link-type="DOI">10.1002/2016jc011680</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx25"><?xmltex \def\ref@label{{Jones et~al.(2019)Jones, Holt, Meijers, and
Shuckburgh}}?><label>Jones et al.(2019)Jones, Holt, Meijers, and
Shuckburgh</label><?label jones2019unsupervised?><mixed-citation>Jones, D. C., Holt, H. J., Meijers, A. J., and Shuckburgh, E.: Unsupervised
clustering of Southern Ocean Argo float temperature profiles, J.
Geophys. Res.-Oceans, 124, 390–402, <ext-link xlink:href="https://doi.org/10.1029/2018JC014629" ext-link-type="DOI">10.1029/2018JC014629</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx26"><?xmltex \def\ref@label{Kim(2021)}?><label>Kim(2021)</label><?label Kim?><mixed-citation>Kim, Y. S.: ACC fronts, ResearchGate [data set],
available at: <uri>https://www.researchgate.net/publication/338420242_ACC_fronts</uri>, last
access:   4 May 2021.</mixed-citation></ref>
      <ref id="bib1.bibx27"><?xmltex \def\ref@label{{Kim and Orsi(2014)}}?><label>Kim and Orsi(2014)</label><?label kim2014variability?><mixed-citation>Kim, Y. S. and Orsi, A. H.: On the variability of Antarctic Circumpolar Current
fronts inferred from 1992–2011 altimetry, J. Phys. Oceanogr., 44, 3054–3071,
<ext-link xlink:href="https://doi.org/10.1175/JPO-D-13-0217.1" ext-link-type="DOI">10.1175/JPO-D-13-0217.1</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx28"><?xmltex \def\ref@label{{Large and Yeager(2009)}}?><label>Large and Yeager(2009)</label><?label large_global_2009?><mixed-citation>Large, W. and Yeager, S.: The global climatology of an interannually varying
air‚Äìsea flux data set, Clim. Dynam., 33, 341–364,
<ext-link xlink:href="https://doi.org/10.1007/s00382-008-0441-3" ext-link-type="DOI">10.1007/s00382-008-0441-3</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx29"><?xmltex \def\ref@label{{Le~Bras et~al.(2019)Le~Bras, Sonnewald, and
Toole}}?><label>Le Bras et al.(2019)Le Bras, Sonnewald, and
Toole</label><?label le_bras_barotropic_2019?><mixed-citation>Le Bras, I. A.-A., Sonnewald, M., and Toole, J. M.: A Barotropic Vorticity
Budget for the Subtropical North Atlantic Based on Observations,
J. Phys. Oceanogr., 49, 2781–2797,
<ext-link xlink:href="https://doi.org/10.1175/JPO-D-19-0111.1" ext-link-type="DOI">10.1175/JPO-D-19-0111.1</ext-link>,
2019.</mixed-citation></ref>
      <ref id="bib1.bibx30"><?xmltex \def\ref@label{{Losch et~al.(2010)Losch, Menemenlis, Campin, Heimbach, and
Hill}}?><label>Losch et al.(2010)Losch, Menemenlis, Campin, Heimbach, and
Hill</label><?label losch_formulation_2010?><mixed-citation>Losch, M., Menemenlis, D., Campin, J.-M., Heimbach, P., and Hill, C.: On the
formulation of sea-ice models. Part 1: Effects of different solver
implementations and parameterizations, Ocean Model., 33, 129–144,
<ext-link xlink:href="https://doi.org/10.1016/j.ocemod.2009.12.008" ext-link-type="DOI">10.1016/j.ocemod.2009.12.008</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx31"><?xmltex \def\ref@label{{Mackie(2018)}}?><label>Mackie(2018)</label><?label Mackie2018SOCirc?><mixed-citation>Mackie, E. J. B.: Southern Ocean Circulation and Frontal Dynamics from
Cryosat-2 Along-track Radar Altimetry, PhD thesis, University of Bristol,
School of Geographical Sciences, available at: <ext-link xlink:href="https://research-information.bris.ac.uk/en/studentTheses/southern-ocean-circulation-and-frontal-dynamics-from-cryosat-2-al">https://research-information.bris.ac.uk/en/studentTheses/southern-ocean-circulati
on-and-frontal-dynamics-from-cryosat-2-al</ext-link> (last access: 4 May 2021), 2018.</mixed-citation></ref>
      <ref id="bib1.bibx32"><?xmltex \def\ref@label{{Marshall and Speer(2012)}}?><label>Marshall and Speer(2012)</label><?label marshall_closure_2012?><mixed-citation>Marshall, J. and Speer, K.: Closure of the meridional overturning circulation
through Southern Ocean upwelling, Nat. Geosci., 5, 171–180,
<ext-link xlink:href="https://doi.org/10.1038/ngeo1391" ext-link-type="DOI">10.1038/ngeo1391</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx33"><?xmltex \def\ref@label{{Marshall et~al.(1997{\natexlab{a}})Marshall, Adcroft, Hill, and
Perelman}}?><label>Marshall et al.(1997a)Marshall, Adcroft, Hill, and
Perelman</label><?label marshall_finite-volume_1997?><mixed-citation>Marshall, J., Adcroft, A., Hill, C., and Perelman, L.: A finite-volume,
incompressible Navier Stokes model for studies of the ocean on parallel
computers, J. Geophys. Res., 102, 5753–5766,
<ext-link xlink:href="https://doi.org/10.1029/96JC02775" ext-link-type="DOI">10.1029/96JC02775</ext-link>, 1997a.</mixed-citation></ref>
      <ref id="bib1.bibx34"><?xmltex \def\ref@label{{Marshall et~al.(1997{\natexlab{b}})Marshall, Hill, Perelman, and
Adcroft}}?><label>Marshall et al.(1997b)Marshall, Hill, Perelman, and
Adcroft</label><?label marshall_hydrostatic_1997?><mixed-citation>Marshall, J., Hill, C., Perelman, L., and Adcroft, A.: Hydrostatic,
quasi-hydrostatic, and nonhydrostatic ocean modeling, J. Geophys.
Res., 102, 5733–5752, <ext-link xlink:href="https://doi.org/10.1029/96JC02776" ext-link-type="DOI">10.1029/96JC02776</ext-link>, 1997b.</mixed-citation></ref>
      <ref id="bib1.bibx35"><?xmltex \def\ref@label{{Maze(2020)}}?><label>Maze(2020)</label><?label maze_guillaume_2020_3906236?><mixed-citation>Maze, G.: Ocean Profile Classification Model in python, Zenodo [code],
<ext-link xlink:href="https://doi.org/10.5281/zenodo.3906236" ext-link-type="DOI">10.5281/zenodo.3906236</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx36"><?xmltex \def\ref@label{{Maze et~al.(2017)Maze, Mercier, Fablet, Tandeo, Radcenco, Lenca,
Feucher, and Le~Goff}}?><label>Maze et al.(2017)Maze, Mercier, Fablet, Tandeo, Radcenco, Lenca,
Feucher, and Le Goff</label><?label maze2017coherent?><mixed-citation>Maze, G., Mercier, H., Fablet, R., Tandeo, P., Radcenco, M. L., Lenca, P.,
Feucher, C., and Le Goff, C.: Coherent heat patterns revealed by unsupervised
classification of Argo temperature profiles in the North Atlantic Ocean,
Prog. Oceanogr., 151, 275–292, <ext-link xlink:href="https://doi.org/10.1016/j.pocean.2016.12.008" ext-link-type="DOI">10.1016/j.pocean.2016.12.008</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx37"><?xmltex \def\ref@label{{Mazloff et~al.(2010)Mazloff, Heimbach, and
Wunsch}}?><label>Mazloff et al.(2010)Mazloff, Heimbach, and
Wunsch</label><?label mazloff_eddy-permitting_2010?><mixed-citation>Mazloff, M. R., Heimbach, P., and Wunsch, C.: An Eddy-Permitting Southern
Ocean State Estimate, J. Phys. Oceanogr., 40, 880–899,
<ext-link xlink:href="https://doi.org/10.1175/2009jpo4236.1" ext-link-type="DOI">10.1175/2009jpo4236.1</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx38"><?xmltex \def\ref@label{{McLachlan and Basford(1988)}}?><label>McLachlan and Basford(1988)</label><?label mclachlan1988mixture?><mixed-citation>
McLachlan, G. J. and Basford, K. E.: Mixture models: Inference and applications
to clustering, vol. 38, M. Dekker,  New York, 1988.</mixed-citation></ref>
      <?pagebreak page1561?><ref id="bib1.bibx39"><?xmltex \def\ref@label{{{Met Office}(2010--2021)}}?><label>Met Office(2010–2021)</label><?label Cartopy?><mixed-citation>Met Office: Cartopy: a cartographic python library with a matplotlib
interface, Exeter, Devon,  Cartopy [code], available at: <uri>https://scitools.org.uk/cartopy/docs/latest/</uri> (last access:
5 May 2021), 2010–2021.</mixed-citation></ref>
      <ref id="bib1.bibx40"><?xmltex \def\ref@label{{Mikaloff-Fletcher et~al.(2006)Mikaloff-Fletcher, Gruber, Jacobson,
Doney, Dutkiewicz, Gerber, Follows, Joos, Lindsay, Menemenlis, Mouchet,
M\"{u}ller, and Sarmiento}}?><label>Mikaloff-Fletcher et al.(2006)Mikaloff-Fletcher, Gruber, Jacobson,
Doney, Dutkiewicz, Gerber, Follows, Joos, Lindsay, Menemenlis, Mouchet,
Müller, and Sarmiento</label><?label fletcher_inverse_2006?><mixed-citation>Mikaloff-Fletcher, S., Gruber, N., Jacobson, A. R., Doney, S. C., Dutkiewicz,
S., Gerber, M., Follows, M., Joos, F., Lindsay, K., Menemenlis, D., Mouchet,
A., Müller, S. A., and Sarmiento, J. L.: Inverse estimates of anthropogenic
CO<inline-formula><mml:math id="M193" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:math></inline-formula> uptake, transport, and storage by the ocean, Global Biogeochem.
Cy., 20, 1–16, <ext-link xlink:href="https://doi.org/10.1029/2005gb002530" ext-link-type="DOI">10.1029/2005gb002530</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx41"><?xmltex \def\ref@label{{Morrison et~al.(2015)Morrison, Fr{\"{o}}licher, and
Sarmiento}}?><label>Morrison et al.(2015)Morrison, Frölicher, and
Sarmiento</label><?label morrison_upwelling_2015?><mixed-citation>Morrison, A. K., Frölicher, T. L., and Sarmiento, J. L.: Upwelling in the
Southern Ocean, Phys. Today, 68, 27–32, <ext-link xlink:href="https://doi.org/10.1063/PT.3.2654" ext-link-type="DOI">10.1063/PT.3.2654</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx42"><?xmltex \def\ref@label{{Nakayama et~al.(2018)Nakayama, Menemenlis, Zhang, Schodlok, and
Rignot}}?><label>Nakayama et al.(2018)Nakayama, Menemenlis, Zhang, Schodlok, and
Rignot</label><?label nakayama_origin_2018?><mixed-citation>Nakayama, Y., Menemenlis, D., Zhang, H., Schodlok, M., and Rignot, E.: Origin
of Circumpolar Deep Water intruding onto the Amundsen and
Bellingshausen Sea continental shelves, Nat. Commun., 9, 3403,
<ext-link xlink:href="https://doi.org/10.1038/s41467-018-05813-1" ext-link-type="DOI">10.1038/s41467-018-05813-1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx43"><?xmltex \def\ref@label{{Orsi et~al.(1995)Orsi, Whitworth, and Nowlin}}?><label>Orsi et al.(1995)Orsi, Whitworth, and Nowlin</label><?label orsi_meridional_1995?><mixed-citation>Orsi, A., Whitworth, T., and Nowlin, W.: On the meridional extent and fronts of
the Antarctic Circumpolar Current, Deep-Sea Res. Pt. I, 42,
641–673, <ext-link xlink:href="https://doi.org/10.1016/0967-0637(95)00021-W" ext-link-type="DOI">10.1016/0967-0637(95)00021-W</ext-link>, 1995.</mixed-citation></ref>
      <ref id="bib1.bibx44"><?xmltex \def\ref@label{{Pauthenet et~al.(2017)Pauthenet, Roquet, Madec, and
Nerini}}?><label>Pauthenet et al.(2017)Pauthenet, Roquet, Madec, and
Nerini</label><?label pauthenet2017linear?><mixed-citation>Pauthenet, E., Roquet, F., Madec, G., and Nerini, D.: A linear decomposition of
the Southern Ocean thermohaline structure, J. Phys.
Oceanogr., 47, 29–47, <ext-link xlink:href="https://doi.org/10.1175/JPO-D-16-0083.s1" ext-link-type="DOI">10.1175/JPO-D-16-0083.s1</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx45"><?xmltex \def\ref@label{{Pauthenet et~al.(2018)Pauthenet, Roquet, Madec, Guinet, Hindell,
McMahon, Harcourt, and Nerini}}?><label>Pauthenet et al.(2018)Pauthenet, Roquet, Madec, Guinet, Hindell,
McMahon, Harcourt, and Nerini</label><?label pauthenet_seasonal_2018?><mixed-citation>Pauthenet, E., Roquet, F., Madec, G., Guinet, C., Hindell, M., McMahon, C. R.,
Harcourt, R., and Nerini, D.: Seasonal Meandering of the Polar Front
Upstream of the Kerguelen Plateau, Geophys. Res. Lett., 45,
9774–9781, <ext-link xlink:href="https://doi.org/10.1029/2018GL079614" ext-link-type="DOI">10.1029/2018GL079614</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx46"><?xmltex \def\ref@label{{Pauthenet et~al.(2019)Pauthenet, Roquet, Madec, Sallee, and
Nerini}}?><label>Pauthenet et al.(2019)Pauthenet, Roquet, Madec, Sallee, and
Nerini</label><?label pauthenet_thermohaline_2019?><mixed-citation>Pauthenet, E., Roquet, F., Madec, G., Sallee, J.-B., and Nerini, D.: The
Thermohaline Modes of the Global Ocean, J. Phys.
Oceanogr., 49, 2535–2552, <ext-link xlink:href="https://doi.org/10.1175/JPO-D-19-0120.1" ext-link-type="DOI">10.1175/JPO-D-19-0120.1</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx47"><?xmltex \def\ref@label{{Pauthenet et~al.(2021)Pauthenet, Sallee, Schmidtko, and
Nerini}}?><label>Pauthenet et al.(2021)Pauthenet, Sallee, Schmidtko, and
Nerini</label><?label pauthenet_seasonal_2021?><mixed-citation>Pauthenet, E., Sallee, J.-B., Schmidtko, S., and Nerini, D.: Seasonal
Variation of the Antarctic Slope Front Occurrence and Position
Estimated from an Interpolated Hydrographic Climatology, J.
Phys. Oceanogr., 51, 1539–1557, <ext-link xlink:href="https://doi.org/10.1175/JPO-D-20-0186.1" ext-link-type="DOI">10.1175/JPO-D-20-0186.1</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx48"><?xmltex \def\ref@label{{Pedregosa et~al.(2011)Pedregosa, Varoquaux, Gramfort, Michel,
Thirion, Grisel, Blondel, Prettenhofer, Weiss, Dubourg, Vanderplas, Passos,
Cournapeau, Brucher, Perrot, and Duchesnay}}?><label>Pedregosa et al.(2011)Pedregosa, Varoquaux, Gramfort, Michel,
Thirion, Grisel, Blondel, Prettenhofer, Weiss, Dubourg, Vanderplas, Passos,
Cournapeau, Brucher, Perrot, and Duchesnay</label><?label scikit-learn?><mixed-citation>
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel,
O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J.,
Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.:
Scikit-learn: Machine Learning in Python, J. Mach. Learn.
Res., 12, 2825–2830, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx49"><?xmltex \def\ref@label{{Pollard et~al.(2002)Pollard, Lucas, and J}}?><label>Pollard et al.(2002)Pollard, Lucas, and J</label><?label pollard2002physical?><mixed-citation>Pollard, R. T., Lucas, M. I., and Read, J. F.: Physical controls on biogeochemical zonation
in the Southern Ocean, Deep-Sea Res. Pt. II, 49, 3289–3305,
<ext-link xlink:href="https://doi.org/10.1016/S0967-0645(02)00084-X" ext-link-type="DOI">10.1016/S0967-0645(02)00084-X</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx50"><?xmltex \def\ref@label{{Rintoul et~al.(2001)Rintoul, Hughes, and
Olbers}}?><label>Rintoul et al.(2001)Rintoul, Hughes, and
Olbers</label><?label rintoul_antarctic_2001?><mixed-citation>
Rintoul, S., Hughes, C., and Olbers, D.: The Antarctic Circumpolar
Current system, International Geophysics Series,  77, 271–302, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx51"><?xmltex \def\ref@label{{Rosso et~al.(2020)Rosso, Mazloff, Talley, Purkey, Freeman, and
Maze}}?><label>Rosso et al.(2020)Rosso, Mazloff, Talley, Purkey, Freeman, and
Maze</label><?label rosso2020water?><mixed-citation>Rosso, I., Mazloff, M. R., Talley, L. D., Purkey, S. G., Freeman, N. M., and
Maze, G.: Water Mass and Biogeochemical Variability in the Kerguelen Sector
of the Southern Ocean: A Machine Learning Approach for a Mixing Hot Spot,
J. Geophys. Res.-Oceans, 125, e2019JC015877, <ext-link xlink:href="https://doi.org/10.1029/2019JC015877" ext-link-type="DOI">10.1029/2019JC015877</ext-link>,
2020.</mixed-citation></ref>
      <ref id="bib1.bibx52"><?xmltex \def\ref@label{{Sallee et~al.(2010)Sallee, Speer, Rintoul, and
Wijffels}}?><label>Sallee et al.(2010)Sallee, Speer, Rintoul, and
Wijffels</label><?label sallee_southern_2010?><mixed-citation>Sallee, J., Speer, K., Rintoul, S., and Wijffels, S.: Southern Ocean
thermocline ventilation, J. Phys. Oceanogr., 40, 509–529,
<ext-link xlink:href="https://doi.org/10.1175/2009JPO4291.1" ext-link-type="DOI">10.1175/2009JPO4291.1</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx53"><?xmltex \def\ref@label{{Sallee et~al.(2012)Sallee, Matear, Rintoul, and
Lenton}}?><label>Sallee et al.(2012)Sallee, Matear, Rintoul, and
Lenton</label><?label sallee_localized_2012?><mixed-citation>Sallee, J.-B., Matear, R. J., Rintoul, S. R., and Lenton, A.: Localized
subduction of anthropogenic carbon dioxide in the Southern Hemisphere
oceans, Nat. Geosci., 5, 579–584, <ext-link xlink:href="https://doi.org/10.1038/ngeo1523" ext-link-type="DOI">10.1038/ngeo1523</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx54"><?xmltex \def\ref@label{{Siegelman et~al.(2019)Siegelman, OToole, Flexas, Riviere, and
Klein}}?><label>Siegelman et al.(2019)Siegelman, OToole, Flexas, Riviere, and
Klein</label><?label siegelman_submesoscale_2019?><mixed-citation>Siegelman, L., OToole, M., Flexas, M., Riviere, P., and Klein, P.: Submesoscale
ocean fronts act as biological hotspot for southern elephant seal, Sci.
Rep.-UK, 9, 5588, <ext-link xlink:href="https://doi.org/10.1038/s41598-019-42117-w" ext-link-type="DOI">10.1038/s41598-019-42117-w</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx55"><?xmltex \def\ref@label{{Sokolov and Rintoul(2002)}}?><label>Sokolov and Rintoul(2002)</label><?label sokolov_structure_2002?><mixed-citation>Sokolov, S. and Rintoul, S. R.: Structure of Southern Ocean fronts at
140<inline-formula><mml:math id="M194" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> E, J. Marine Syst., 37, 151–184,
<ext-link xlink:href="https://doi.org/10.1016/S0924-7963(02)00200-2" ext-link-type="DOI">10.1016/S0924-7963(02)00200-2</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx56"><?xmltex \def\ref@label{{Sokolov and Rintoul(2009)}}?><label>Sokolov and Rintoul(2009)</label><?label sokolov_circumpolar_2009?><mixed-citation>Sokolov, S. and Rintoul, S. R.: Circumpolar structure and distribution of the
Antarctic Circumpolar Current fronts: 1. Mean circumpolar paths,
J. Geophys. Res.-Atmos., 114, 3675,
<ext-link xlink:href="https://doi.org/10.1029/2008JC005108" ext-link-type="DOI">10.1029/2008JC005108</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx57"><?xmltex \def\ref@label{{Sonnewald et~al.(2019)Sonnewald, Wunsch, and
Heimbach}}?><label>Sonnewald et al.(2019)Sonnewald, Wunsch, and
Heimbach</label><?label sonnewald_unsupervised_2019?><mixed-citation>Sonnewald, M., Wunsch, C., and Heimbach, P.: Unsupervised Learning Reveals
Geography of Global Ocean Dynamical Regions, Earth   Space
Sci., 6, 784–794, <ext-link xlink:href="https://doi.org/10.1029/2018EA000519" ext-link-type="DOI">10.1029/2018EA000519</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx58"><?xmltex \def\ref@label{{Sonnewald et~al.(2020)Sonnewald, Dutkiewicz, Hill, and
Forget}}?><label>Sonnewald et al.(2020)Sonnewald, Dutkiewicz, Hill, and
Forget</label><?label sonnewald_elucidating_2020?><mixed-citation>Sonnewald, M., Dutkiewicz, S., Hill, C., and Forget, G.: Elucidating ecological
complexity: Unsupervised learning determines global marine eco-provinces,
Science Advances, 6, eaay4740, <ext-link xlink:href="https://doi.org/10.1126/sciadv.aay4740" ext-link-type="DOI">10.1126/sciadv.aay4740</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx59"><?xmltex \def\ref@label{{Stammer et~al.(2002)Stammer, Wunsch, Giering, Eckert, Heimbach,
Marotzke, Adcroft, Hill, and Marshall}}?><label>Stammer et al.(2002)Stammer, Wunsch, Giering, Eckert, Heimbach,
Marotzke, Adcroft, Hill, and Marshall</label><?label stammer_global_2002?><mixed-citation>Stammer, D., Wunsch, C., Giering, R., Eckert, C., Heimbach, P., Marotzke, J.,
Adcroft, A., Hill, C. N., and Marshall, J.: Global ocean circulation during
1992–1997, estimated from ocean observations and a general circulation model,
J. Geophys. Res.-Oceans, 107, 1-1–1-27, <ext-link xlink:href="https://doi.org/10.1029/2001JC000888" ext-link-type="DOI">10.1029/2001JC000888</ext-link>,
2002.</mixed-citation></ref>
      <ref id="bib1.bibx60"><?xmltex \def\ref@label{{Talley(2013)}}?><label>Talley(2013)</label><?label talley_closure_2013?><mixed-citation>Talley, L.: Closure of the Global Overturning Circulation Through the
Indian, Pacific, and Southern Oceans: Schematics and Transports,
Oceanography, 26, 80–97, <ext-link xlink:href="https://doi.org/10.5670/oceanog.2013.07" ext-link-type="DOI">10.5670/oceanog.2013.07</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx61"><?xmltex \def\ref@label{{Thomas(2021)}}?><label>Thomas(2021)</label><?label so-fronts?><mixed-citation>Thomas, S. D. A.: so-wise/so-fronts: Defining Southern Ocean fronts using
unsupervised classification, Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/zenodo.5500666" ext-link-type="DOI">10.5281/zenodo.5500666</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx62"><?xmltex \def\ref@label{{Thompson and Sall{\'{e}}e(2012)}}?><label>Thompson and Sallée(2012)</label><?label thompson2012jets?><mixed-citation>Thompson, A. F. and Sallée, J.-B.: Jets and topography: Jet transitions and
the impact on transport in the Antarctic Circumpolar Current, J.
Phys. Oceanogr., 42, 956–972, <ext-link xlink:href="https://doi.org/10.1175/JPO-D-11-0135.1" ext-link-type="DOI">10.1175/JPO-D-11-0135.1</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx63"><?xmltex \def\ref@label{{Thompson et~al.(2010)Thompson, Haynes, Wilson, and
Richards}}?><label>Thompson et al.(2010)Thompson, Haynes, Wilson, and
Richards</label><?label thompson_rapid_2010?><mixed-citation>Thompson, A. F., Haynes, P. H., Wilson, C., and Richards, K. J.: Rapid
Southern Ocean front transitions in an eddy-resolving ocean GCM,
Geophys. Res. Lett., 37, L23602, <ext-link xlink:href="https://doi.org/10.1029/2010GL045386" ext-link-type="DOI">10.1029/2010GL045386</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx64"><?xmltex \def\ref@label{{Thompson et~al.(2018)Thompson, Stewart, Spence, and
Heywood}}?><label>Thompson et al.(2018)Thompson, Stewart, Spence, and
Heywood</label><?label thompson_antarctic_2018?><mixed-citation>Thompson, A. F., Stewart, A. L., Spence, P., and Heywood, K. J.: The
Antarctic Slope Current in a Changing Climate, Rev.
Geophys., 56, 741–770, <ext-link xlink:href="https://doi.org/10.1029/2018RG000624" ext-link-type="DOI">10.1029/2018RG000624</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx65"><?xmltex \def\ref@label{{Thompson et~al.(2020)Thompson, Speer, and
Schulze~Chretien}}?><label>Thompson et al.(2020)Thompson, Speer, and
Schulze Chretien</label><?label thompson2020?><mixed-citation>Thompson, A. F., Speer, K. G., and Schulze Chretien, L. M.: Genesis of the
Antarctic Slope Current in West Antarctica, Geophys. Res. Lett., 47, e2020GL087802,
<ext-link xlink:href="https://doi.org/10.1029/2020GL087802" ext-link-type="DOI">10.1029/2020GL087802</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx66"><?xmltex \def\ref@label{{Verdy and Mazloff(2017)}}?><label>Verdy and Mazloff(2017)</label><?label verdy2017data?><mixed-citation>Verdy, A. and Mazloff, M. R.: A data assimilating model for estimating Southern
Ocean biogeochemistry, J. Geophys. Res.-Oceans, 122,
6968–6988, <ext-link xlink:href="https://doi.org/10.1002/2016jc012650" ext-link-type="DOI">10.1002/2016jc012650</ext-link>, 2017.</mixed-citation></ref>
      <?pagebreak page1562?><ref id="bib1.bibx67"><?xmltex \def\ref@label{Verdy and Mazloff(2021)}?><label>Verdy and Mazloff(2021)</label><?label Verdy?><mixed-citation>Verdy, A. and Mazloff, M.:
B-SOSE iteration 106, [data set], available at:
<uri>http://sose.ucsd.edu/BSOSE6_iter106_solution.html</uri>,
last access: 4 May 2021.</mixed-citation></ref>
      <ref id="bib1.bibx68"><?xmltex \def\ref@label{{Vernet et~al.(2019)Vernet, Geibert, Hoppema, Brown, Haas, Hellmer,
Jokat, Jullion, Mazloff, Bakker, Brearley, Croot, Hattermann, Hauck,
Hillenbrand, Hoppe, Huhn, Koch, Lechtenfeld, Meredith, Naveira~Garabato,
Nothig, Peeken, Rutgers~van~der Loeff, Schmidtko, Schroder, Strass,
Torres-Valdes, and Verdy}}?><label>Vernet et al.(2019)Vernet, Geibert, Hoppema, Brown, Haas, Hellmer,
Jokat, Jullion, Mazloff, Bakker, Brearley, Croot, Hattermann, Hauck,
Hillenbrand, Hoppe, Huhn, Koch, Lechtenfeld, Meredith, Naveira Garabato,
Nothig, Peeken, Rutgers van der Loeff, Schmidtko, Schroder, Strass,
Torres-Valdes, and Verdy</label><?label vernet_weddell_2019?><mixed-citation>Vernet, M., Geibert, W., Hoppema, M., Brown, P. J., Haas, C., Hellmer, H. H.,
Jokat, W., Jullion, L., Mazloff, M., Bakker, D. C. E., Brearley, J. A.,
Croot, P., Hattermann, T., Hauck, J., Hillenbrand, C.-D., Hoppe, C. J. M.,
Huhn, O., Koch, B. P., Lechtenfeld, O. J., Meredith, M. P., Naveira Garabato,
A. C., Nothig, E.-M., Peeken, I., Rutgers van der Loeff, M. M., Schmidtko,
S., Schroder, M., Strass, V. H., Torres-Valdes, S., and Verdy, A.: The
Weddell Gyre, Southern Ocean: present knowledge and future challenges,
Rev. Geophys., 57, 623–708, <ext-link xlink:href="https://doi.org/10.1029/2018RG000604" ext-link-type="DOI">10.1029/2018RG000604</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx69"><?xmltex \def\ref@label{{Vincent and Folorunso(2009)Vincent, Folorunso
et~al.}}?><label>Vincent and Folorunso(2009)Vincent, Folorunso
et al.</label><?label vincent2009descriptive?><mixed-citation>Vincent, O. R. and Folorunso, O.: A descriptive algorithm for sobel image
edge detection, in: Proceedings of informing science &amp; IT education
conference (InSITE), Informing Science Institute
California, 40,   97–107, <ext-link xlink:href="https://doi.org/10.28945/3351" ext-link-type="DOI">10.28945/3351</ext-link>, 2009.
</mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bibx70"><?xmltex \def\ref@label{{Wunsch and Heimbach(2007)}}?><label>Wunsch and Heimbach(2007)</label><?label wunsch_practical_2007?><mixed-citation>Wunsch, C. and Heimbach, P.: Practical global oceanic state estimation, Physica D, 230, 197–208, <ext-link xlink:href="https://doi.org/10.1016/j.physd.2006.09.040" ext-link-type="DOI">10.1016/j.physd.2006.09.040</ext-link>,
2007.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Defining Southern Ocean fronts using unsupervised classification</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Amante and Eakins(2009)</label><mixed-citation>
Amante, C. and Eakins, B. W.: ETOPO1 Global Relief Model converted to
PanMap layer format, PANGAEA [data set], <a href="https://doi.org/10.1594/PANGAEA.769615" target="_blank">https://doi.org/10.1594/PANGAEA.769615</a>,  2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Campin et al.(2021)</label><mixed-citation>
Campin, J.-M., Heimbach, P., Losch, M., Forget, G.,
edhill3; Adcroft, A., amolod, Menemenlis, D., dfer22, Hill, C., Jahn, O.,
Scott, J., stephdut, Mazloff, M., Fox-Kemper, B., antnguyen13, Doddridge, E.,
Fenty, I., Bates, M., AndrewEichmann-NOAA, Smith, T., Martin, T.,
Lauderdale, J., Abernathey, R., samarkhatiwala, hongandyan,
Deremble, B., dngoldberg, Bourgault, P., and  Dussin, R.: MITgcm/MITgcm: checkpoint67z, Zenodo [code],
<a href="https://doi.org/10.5281/zenodo.4968496" target="_blank">https://doi.org/10.5281/zenodo.4968496</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Chapman(2017)</label><mixed-citation>
Chapman, C. C.: New perspectives on frontal variability in the Southern Ocean,
J. Phys. Oceanogr., 47, 1151–1168,
<a href="https://doi.org/10.1175/JPO-D-16-0222.1" target="_blank">https://doi.org/10.1175/JPO-D-16-0222.1</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Chapman et al.(2020)Chapman, Lea, Meyer, Sallée, and
Hindell</label><mixed-citation>
Chapman, C. C., Lea, M.-A., Meyer, A., Sallée, J.-B., and Hindell, M.:
Defining Southern Ocean fronts and their influence on biological and physical
processes in a changing climate, Nat. Clim. Change, 10, 209–219,
<a href="https://doi.org/10.1038/s41558-020-0705-4" target="_blank">https://doi.org/10.1038/s41558-020-0705-4</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Cushman-Roisin and Beckers(2011)</label><mixed-citation>
Cushman-Roisin, B. and Beckers, J.-M.: Introduction to geophysical fluid
dynamics: physical and numerical aspects, Academic Press, ISBN 9780120887590, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Dai and Trenberth(2002)</label><mixed-citation>
Dai, A. and Trenberth, K. E.: Estimates of Freshwater Discharge from
Continents: Latitudinal and Seasonal Variations, J.
Hydrometeorol., 3, 660–687,
<a href="https://doi.org/10.1175/1525-7541(2002)003&lt;0660:EOFDFC&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1525-7541(2002)003&lt;0660:EOFDFC&gt;2.0.CO;2</a>, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Deacon(1937)</label><mixed-citation>
Deacon, G.: The hydrology of the Southern Ocean, Cambridge University Press, Macmillan, London, New York, 1937.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Dee et al.(2011)</label><mixed-citation>
Dee, D. P., Uppala, S. M., Simmons, A. J., Berrisford, P., Poli, P., Kobayashi,
S., Andrae, U., Balmaseda, M. A., Balsamo, G., Bauer, P., Bechtold, P.,
Beljaars, A. C. M., van de Berg, L., Bidlot, J., Bormann, N., Delsol, C.,
Dragani, R., Fuentes, M., Geer, A. J., Haimberger, L., Healy, S. B.,
Hersbach, H., Halm, E. V., Isaksen, L.,  Kållberg, P.,
Köhler, M., Matricardi, M., McNally, A. P., Monge Sanz, B. M., Morcrette,
J. J., Park, B. K., Peubey, C., de Rosnay, P., Tavolato, C., Thépaut,
J. N., and Vitart, F.: The ERA-Interim reanalysis: configuration and
performance of the data assimilation system, Q. J. Roy.
Meteor. Soc., 137, 553–597, <a href="https://doi.org/10.1002/qj.828" target="_blank">https://doi.org/10.1002/qj.828</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Donohue et al.(2016)Donohue, Tracey, Watts, Chidichimo, and
Chereskin</label><mixed-citation>
Donohue, K. A., Tracey, K. L., Watts, D. R., Chidichimo, M. P., and Chereskin,
T. K.: Mean Antarctic Circumpolar Current transport measured in Drake
Passage: MEAN ACC TRANSPORT, Geophys. Res. Lett., 43,
11760–11767, <a href="https://doi.org/10.1002/2016GL070319" target="_blank">https://doi.org/10.1002/2016GL070319</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Duda and Hart(1973)</label><mixed-citation>
Duda, R. and Hart P.: Pattern Classification and Scene Analysis (1st
Edition), John Wiley and Sons, New York, ISBN: 9780471223610,
271–272, 1973.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Fenty and Heimbach(2013)</label><mixed-citation>
Fenty, I. and Heimbach, P.: Coupled sea ice–ocean-state estimation in the
Labrador Sea and Baffin Bay, J. Phys. Oceanogr., 43, 884–904,
<a href="https://doi.org/10.1175/JPO-D-12-065.1" target="_blank">https://doi.org/10.1175/JPO-D-12-065.1</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Forget et al.(2015)Forget, Campin, Heimbach, Hill, Ponte, and
Wunsch</label><mixed-citation>
Forget, G., Campin, J.-M., Heimbach, P., Hill, C. N., Ponte, R. M., and Wunsch, C.: ECCO version 4: an integrated framework for non-linear inverse modeling and global ocean state estimation, Geosci. Model Dev., 8, 3071–3104, <a href="https://doi.org/10.5194/gmd-8-3071-2015" target="_blank">https://doi.org/10.5194/gmd-8-3071-2015</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Frenger et al.(2015)Frenger, Munnich, Gruber, and
Knutti</label><mixed-citation>
Frenger, I., Munnich, M., Gruber, N., and Knutti, R.: Southern Ocean eddy
phenomenology, J. Geophys. Res.-Oceans, 120, 7413–7449,
<a href="https://doi.org/10.1002/2015JC011047" target="_blank">https://doi.org/10.1002/2015JC011047</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Frolicher et al.(2015)Frolicher, Sarmiento, Paynter, Dunne, Krasting,
and Winton</label><mixed-citation>
Frolicher, T. L., Sarmiento, J. L., Paynter, D. J., Dunne, J. P., Krasting,
J. P., and Winton, M.: Dominance of the Southern Ocean in Anthropogenic
Carbon and Heat Uptake in CMIP5 Models, J. Climate, 28,
862–886, <a href="https://doi.org/10.1175/JCLI-D-14-00117.1" target="_blank">https://doi.org/10.1175/JCLI-D-14-00117.1</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Gaspar et al.(1990)Gaspar, Grégoris, and
Lefevre</label><mixed-citation>
Gaspar, P., Grégoris, Y., and Lefevre, J. M.: A simple eddy kinetic energy
model for simulations of the oceanic vertical mixing: Tests at station
Papa and long-term upper ocean study site, J. Geophys. Res.-Atmos., 95, 16179–16193, <a href="https://doi.org/10.1029/JC095iC09p16179" target="_blank">https://doi.org/10.1029/JC095iC09p16179</a>, 1990.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Graham et al.(2012)Graham, Boer, Heywood, Chapman, and
Stevens</label><mixed-citation>
Graham, R. M., Boer, A. M. d., Heywood, K. J., Chapman, M. R., and Stevens,
D. P.: Southern Ocean fronts: Controlled by wind or topography?, J.
Geophys. Res.-Oceans, 117, 2156–2202, <a href="https://doi.org/10.1029/2012JC007887" target="_blank">https://doi.org/10.1029/2012JC007887</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Hallberg(2013)</label><mixed-citation>
Hallberg, R.: Using a resolution function to regulate parameterizations of
oceanic mesoscale eddy effects, Ocean Model., 72, 92–103,
<a href="https://doi.org/10.1016/j.ocemod.2013.08.007" target="_blank">https://doi.org/10.1016/j.ocemod.2013.08.007</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Hammond and Jones(2016)</label><mixed-citation>
Hammond, M. D. and Jones, D. C.: Freshwater flux from ice sheet melting and
iceberg calving in the Southern Ocean, Geosci. Data J., 3,
60–62, <a href="https://doi.org/10.1002/gdj3.43" target="_blank">https://doi.org/10.1002/gdj3.43</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Herraiz-Borreguero and
Rintoul(2011)</label><mixed-citation>
Herraiz-Borreguero, L. and Rintoul, S. R.: Subantarctic mode water:
distribution and circulation, Ocean Dynam., 61, 103–126,
<a href="https://doi.org/10.1007/s10236-010-0352-9" target="_blank">https://doi.org/10.1007/s10236-010-0352-9</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Hjelmervik and Hjelmervik(2019)</label><mixed-citation>
Hjelmervik, K. B. and Hjelmervik, K. T.: Detection of oceanographic fronts on
variable water depths using empirical orthogonal functions, IEEE J.
Oceanic Eng., 45, 915–926, <a href="https://doi.org/10.1109/JOE.2019.2917456" target="_blank">https://doi.org/10.1109/JOE.2019.2917456</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Houghton and Wilson(2020)</label><mixed-citation>
Houghton, I. A. and Wilson, J. D.: El Niño Detection Via Unsupervised
Clustering of Argo Temperature Profiles, J. Geophys.
Res.-Oceans, 125, e2019JC015947, <a href="https://doi.org/10.1029/2019JC015947" target="_blank">https://doi.org/10.1029/2019JC015947</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Iudicone et al.(2007)Iudicone, Rodgers, Schopp, and
Madec</label><mixed-citation>
Iudicone, D., Rodgers, K., Schopp, R., and Madec, G.: An exchange window for
the injection of Antarctic Intermediate Water into the South
Pacific, J. Phys. Oceanogr., 37, 31–49,
<a href="https://doi.org/10.1175/JPO2985.1" target="_blank">https://doi.org/10.1175/JPO2985.1</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Jones and Ito(2019)</label><mixed-citation>
Jones, D. C. and Ito, T.: Gaussian mixture modeling describes the geography of
the surface ocean carbon budget, NCAR, 108–113, <a href="https://doi.org/10.5065/y82j-f154" target="_blank">https://doi.org/10.5065/y82j-f154</a>,
2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Jones et al.(2016)Jones, Meijers, Shuckburgh, Sallée, Haynes,
McAufield, and Mazloff</label><mixed-citation>
Jones, D. C., Meijers, A. J. S., Shuckburgh, E., Sallée, J.-B., Haynes, P.,
McAufield, E. K., and Mazloff, M. R.: How does Subantarctic Mode Water
ventilate the Southern Hemisphere subtropics?, J. Geophys.  Res.-Oceans, 121, 6558–6582, <a href="https://doi.org/10.1002/2016jc011680" target="_blank">https://doi.org/10.1002/2016jc011680</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Jones et al.(2019)Jones, Holt, Meijers, and
Shuckburgh</label><mixed-citation>
Jones, D. C., Holt, H. J., Meijers, A. J., and Shuckburgh, E.: Unsupervised
clustering of Southern Ocean Argo float temperature profiles, J.
Geophys. Res.-Oceans, 124, 390–402, <a href="https://doi.org/10.1029/2018JC014629" target="_blank">https://doi.org/10.1029/2018JC014629</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Kim(2021)</label><mixed-citation>
Kim, Y. S.: ACC fronts, ResearchGate [data set],
available at: <a href="https://www.researchgate.net/publication/338420242_ACC_fronts" target="_blank"/>, last
access:   4 May 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Kim and Orsi(2014)</label><mixed-citation>
Kim, Y. S. and Orsi, A. H.: On the variability of Antarctic Circumpolar Current
fronts inferred from 1992–2011 altimetry, J. Phys. Oceanogr., 44, 3054–3071,
<a href="https://doi.org/10.1175/JPO-D-13-0217.1" target="_blank">https://doi.org/10.1175/JPO-D-13-0217.1</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Large and Yeager(2009)</label><mixed-citation>
Large, W. and Yeager, S.: The global climatology of an interannually varying
air‚Äìsea flux data set, Clim. Dynam., 33, 341–364,
<a href="https://doi.org/10.1007/s00382-008-0441-3" target="_blank">https://doi.org/10.1007/s00382-008-0441-3</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Le Bras et al.(2019)Le Bras, Sonnewald, and
Toole</label><mixed-citation>
Le Bras, I. A.-A., Sonnewald, M., and Toole, J. M.: A Barotropic Vorticity
Budget for the Subtropical North Atlantic Based on Observations,
J. Phys. Oceanogr., 49, 2781–2797,
<a href="https://doi.org/10.1175/JPO-D-19-0111.1" target="_blank">https://doi.org/10.1175/JPO-D-19-0111.1</a>,
2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Losch et al.(2010)Losch, Menemenlis, Campin, Heimbach, and
Hill</label><mixed-citation>
Losch, M., Menemenlis, D., Campin, J.-M., Heimbach, P., and Hill, C.: On the
formulation of sea-ice models. Part 1: Effects of different solver
implementations and parameterizations, Ocean Model., 33, 129–144,
<a href="https://doi.org/10.1016/j.ocemod.2009.12.008" target="_blank">https://doi.org/10.1016/j.ocemod.2009.12.008</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Mackie(2018)</label><mixed-citation>
Mackie, E. J. B.: Southern Ocean Circulation and Frontal Dynamics from
Cryosat-2 Along-track Radar Altimetry, PhD thesis, University of Bristol,
School of Geographical Sciences, available at: <a href="https://research-information.bris.ac.uk/en/studentTheses/southern-ocean-circulati&#xA;on-and-frontal-dynamics-from-cryosat-2-al" target="_blank">https://research-information.bris.ac.uk/en/studentTheses/southern-ocean-circulati
on-and-frontal-dynamics-from-cryosat-2-al</a> (last access: 4 May 2021), 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Marshall and Speer(2012)</label><mixed-citation>
Marshall, J. and Speer, K.: Closure of the meridional overturning circulation
through Southern Ocean upwelling, Nat. Geosci., 5, 171–180,
<a href="https://doi.org/10.1038/ngeo1391" target="_blank">https://doi.org/10.1038/ngeo1391</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Marshall et al.(1997a)Marshall, Adcroft, Hill, and
Perelman</label><mixed-citation>
Marshall, J., Adcroft, A., Hill, C., and Perelman, L.: A finite-volume,
incompressible Navier Stokes model for studies of the ocean on parallel
computers, J. Geophys. Res., 102, 5753–5766,
<a href="https://doi.org/10.1029/96JC02775" target="_blank">https://doi.org/10.1029/96JC02775</a>, 1997a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Marshall et al.(1997b)Marshall, Hill, Perelman, and
Adcroft</label><mixed-citation>
Marshall, J., Hill, C., Perelman, L., and Adcroft, A.: Hydrostatic,
quasi-hydrostatic, and nonhydrostatic ocean modeling, J. Geophys.
Res., 102, 5733–5752, <a href="https://doi.org/10.1029/96JC02776" target="_blank">https://doi.org/10.1029/96JC02776</a>, 1997b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Maze(2020)</label><mixed-citation>
Maze, G.: Ocean Profile Classification Model in python, Zenodo [code],
<a href="https://doi.org/10.5281/zenodo.3906236" target="_blank">https://doi.org/10.5281/zenodo.3906236</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Maze et al.(2017)Maze, Mercier, Fablet, Tandeo, Radcenco, Lenca,
Feucher, and Le Goff</label><mixed-citation>
Maze, G., Mercier, H., Fablet, R., Tandeo, P., Radcenco, M. L., Lenca, P.,
Feucher, C., and Le Goff, C.: Coherent heat patterns revealed by unsupervised
classification of Argo temperature profiles in the North Atlantic Ocean,
Prog. Oceanogr., 151, 275–292, <a href="https://doi.org/10.1016/j.pocean.2016.12.008" target="_blank">https://doi.org/10.1016/j.pocean.2016.12.008</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Mazloff et al.(2010)Mazloff, Heimbach, and
Wunsch</label><mixed-citation>
Mazloff, M. R., Heimbach, P., and Wunsch, C.: An Eddy-Permitting Southern
Ocean State Estimate, J. Phys. Oceanogr., 40, 880–899,
<a href="https://doi.org/10.1175/2009jpo4236.1" target="_blank">https://doi.org/10.1175/2009jpo4236.1</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>McLachlan and Basford(1988)</label><mixed-citation>
McLachlan, G. J. and Basford, K. E.: Mixture models: Inference and applications
to clustering, vol. 38, M. Dekker,  New York, 1988.
</mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Met Office(2010–2021)</label><mixed-citation>
Met Office: Cartopy: a cartographic python library with a matplotlib
interface, Exeter, Devon,  Cartopy [code], available at: <a href="https://scitools.org.uk/cartopy/docs/latest/" target="_blank"/> (last access:
5 May 2021), 2010–2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Mikaloff-Fletcher et al.(2006)Mikaloff-Fletcher, Gruber, Jacobson,
Doney, Dutkiewicz, Gerber, Follows, Joos, Lindsay, Menemenlis, Mouchet,
Müller, and Sarmiento</label><mixed-citation>
Mikaloff-Fletcher, S., Gruber, N., Jacobson, A. R., Doney, S. C., Dutkiewicz,
S., Gerber, M., Follows, M., Joos, F., Lindsay, K., Menemenlis, D., Mouchet,
A., Müller, S. A., and Sarmiento, J. L.: Inverse estimates of anthropogenic
CO<sub>2</sub> uptake, transport, and storage by the ocean, Global Biogeochem.
Cy., 20, 1–16, <a href="https://doi.org/10.1029/2005gb002530" target="_blank">https://doi.org/10.1029/2005gb002530</a>, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Morrison et al.(2015)Morrison, Frölicher, and
Sarmiento</label><mixed-citation>
Morrison, A. K., Frölicher, T. L., and Sarmiento, J. L.: Upwelling in the
Southern Ocean, Phys. Today, 68, 27–32, <a href="https://doi.org/10.1063/PT.3.2654" target="_blank">https://doi.org/10.1063/PT.3.2654</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Nakayama et al.(2018)Nakayama, Menemenlis, Zhang, Schodlok, and
Rignot</label><mixed-citation>
Nakayama, Y., Menemenlis, D., Zhang, H., Schodlok, M., and Rignot, E.: Origin
of Circumpolar Deep Water intruding onto the Amundsen and
Bellingshausen Sea continental shelves, Nat. Commun., 9, 3403,
<a href="https://doi.org/10.1038/s41467-018-05813-1" target="_blank">https://doi.org/10.1038/s41467-018-05813-1</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Orsi et al.(1995)Orsi, Whitworth, and Nowlin</label><mixed-citation>
Orsi, A., Whitworth, T., and Nowlin, W.: On the meridional extent and fronts of
the Antarctic Circumpolar Current, Deep-Sea Res. Pt. I, 42,
641–673, <a href="https://doi.org/10.1016/0967-0637(95)00021-W" target="_blank">https://doi.org/10.1016/0967-0637(95)00021-W</a>, 1995.
</mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Pauthenet et al.(2017)Pauthenet, Roquet, Madec, and
Nerini</label><mixed-citation>
Pauthenet, E., Roquet, F., Madec, G., and Nerini, D.: A linear decomposition of
the Southern Ocean thermohaline structure, J. Phys.
Oceanogr., 47, 29–47, <a href="https://doi.org/10.1175/JPO-D-16-0083.s1" target="_blank">https://doi.org/10.1175/JPO-D-16-0083.s1</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Pauthenet et al.(2018)Pauthenet, Roquet, Madec, Guinet, Hindell,
McMahon, Harcourt, and Nerini</label><mixed-citation>
Pauthenet, E., Roquet, F., Madec, G., Guinet, C., Hindell, M., McMahon, C. R.,
Harcourt, R., and Nerini, D.: Seasonal Meandering of the Polar Front
Upstream of the Kerguelen Plateau, Geophys. Res. Lett., 45,
9774–9781, <a href="https://doi.org/10.1029/2018GL079614" target="_blank">https://doi.org/10.1029/2018GL079614</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Pauthenet et al.(2019)Pauthenet, Roquet, Madec, Sallee, and
Nerini</label><mixed-citation>
Pauthenet, E., Roquet, F., Madec, G., Sallee, J.-B., and Nerini, D.: The
Thermohaline Modes of the Global Ocean, J. Phys.
Oceanogr., 49, 2535–2552, <a href="https://doi.org/10.1175/JPO-D-19-0120.1" target="_blank">https://doi.org/10.1175/JPO-D-19-0120.1</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Pauthenet et al.(2021)Pauthenet, Sallee, Schmidtko, and
Nerini</label><mixed-citation>
Pauthenet, E., Sallee, J.-B., Schmidtko, S., and Nerini, D.: Seasonal
Variation of the Antarctic Slope Front Occurrence and Position
Estimated from an Interpolated Hydrographic Climatology, J.
Phys. Oceanogr., 51, 1539–1557, <a href="https://doi.org/10.1175/JPO-D-20-0186.1" target="_blank">https://doi.org/10.1175/JPO-D-20-0186.1</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Pedregosa et al.(2011)Pedregosa, Varoquaux, Gramfort, Michel,
Thirion, Grisel, Blondel, Prettenhofer, Weiss, Dubourg, Vanderplas, Passos,
Cournapeau, Brucher, Perrot, and Duchesnay</label><mixed-citation>
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel,
O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J.,
Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.:
Scikit-learn: Machine Learning in Python, J. Mach. Learn.
Res., 12, 2825–2830, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Pollard et al.(2002)Pollard, Lucas, and J</label><mixed-citation>
Pollard, R. T., Lucas, M. I., and Read, J. F.: Physical controls on biogeochemical zonation
in the Southern Ocean, Deep-Sea Res. Pt. II, 49, 3289–3305,
<a href="https://doi.org/10.1016/S0967-0645(02)00084-X" target="_blank">https://doi.org/10.1016/S0967-0645(02)00084-X</a>, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>Rintoul et al.(2001)Rintoul, Hughes, and
Olbers</label><mixed-citation>
Rintoul, S., Hughes, C., and Olbers, D.: The Antarctic Circumpolar
Current system, International Geophysics Series,  77, 271–302, 2001.
</mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Rosso et al.(2020)Rosso, Mazloff, Talley, Purkey, Freeman, and
Maze</label><mixed-citation>
Rosso, I., Mazloff, M. R., Talley, L. D., Purkey, S. G., Freeman, N. M., and
Maze, G.: Water Mass and Biogeochemical Variability in the Kerguelen Sector
of the Southern Ocean: A Machine Learning Approach for a Mixing Hot Spot,
J. Geophys. Res.-Oceans, 125, e2019JC015877, <a href="https://doi.org/10.1029/2019JC015877" target="_blank">https://doi.org/10.1029/2019JC015877</a>,
2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Sallee et al.(2010)Sallee, Speer, Rintoul, and
Wijffels</label><mixed-citation>
Sallee, J., Speer, K., Rintoul, S., and Wijffels, S.: Southern Ocean
thermocline ventilation, J. Phys. Oceanogr., 40, 509–529,
<a href="https://doi.org/10.1175/2009JPO4291.1" target="_blank">https://doi.org/10.1175/2009JPO4291.1</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Sallee et al.(2012)Sallee, Matear, Rintoul, and
Lenton</label><mixed-citation>
Sallee, J.-B., Matear, R. J., Rintoul, S. R., and Lenton, A.: Localized
subduction of anthropogenic carbon dioxide in the Southern Hemisphere
oceans, Nat. Geosci., 5, 579–584, <a href="https://doi.org/10.1038/ngeo1523" target="_blank">https://doi.org/10.1038/ngeo1523</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Siegelman et al.(2019)Siegelman, OToole, Flexas, Riviere, and
Klein</label><mixed-citation>
Siegelman, L., OToole, M., Flexas, M., Riviere, P., and Klein, P.: Submesoscale
ocean fronts act as biological hotspot for southern elephant seal, Sci.
Rep.-UK, 9, 5588, <a href="https://doi.org/10.1038/s41598-019-42117-w" target="_blank">https://doi.org/10.1038/s41598-019-42117-w</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>Sokolov and Rintoul(2002)</label><mixed-citation>
Sokolov, S. and Rintoul, S. R.: Structure of Southern Ocean fronts at
140°&thinsp;E, J. Marine Syst., 37, 151–184,
<a href="https://doi.org/10.1016/S0924-7963(02)00200-2" target="_blank">https://doi.org/10.1016/S0924-7963(02)00200-2</a>, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Sokolov and Rintoul(2009)</label><mixed-citation>
Sokolov, S. and Rintoul, S. R.: Circumpolar structure and distribution of the
Antarctic Circumpolar Current fronts: 1. Mean circumpolar paths,
J. Geophys. Res.-Atmos., 114, 3675,
<a href="https://doi.org/10.1029/2008JC005108" target="_blank">https://doi.org/10.1029/2008JC005108</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Sonnewald et al.(2019)Sonnewald, Wunsch, and
Heimbach</label><mixed-citation>
Sonnewald, M., Wunsch, C., and Heimbach, P.: Unsupervised Learning Reveals
Geography of Global Ocean Dynamical Regions, Earth   Space
Sci., 6, 784–794, <a href="https://doi.org/10.1029/2018EA000519" target="_blank">https://doi.org/10.1029/2018EA000519</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Sonnewald et al.(2020)Sonnewald, Dutkiewicz, Hill, and
Forget</label><mixed-citation>
Sonnewald, M., Dutkiewicz, S., Hill, C., and Forget, G.: Elucidating ecological
complexity: Unsupervised learning determines global marine eco-provinces,
Science Advances, 6, eaay4740, <a href="https://doi.org/10.1126/sciadv.aay4740" target="_blank">https://doi.org/10.1126/sciadv.aay4740</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Stammer et al.(2002)Stammer, Wunsch, Giering, Eckert, Heimbach,
Marotzke, Adcroft, Hill, and Marshall</label><mixed-citation>
Stammer, D., Wunsch, C., Giering, R., Eckert, C., Heimbach, P., Marotzke, J.,
Adcroft, A., Hill, C. N., and Marshall, J.: Global ocean circulation during
1992–1997, estimated from ocean observations and a general circulation model,
J. Geophys. Res.-Oceans, 107, 1-1–1-27, <a href="https://doi.org/10.1029/2001JC000888" target="_blank">https://doi.org/10.1029/2001JC000888</a>,
2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Talley(2013)</label><mixed-citation>
Talley, L.: Closure of the Global Overturning Circulation Through the
Indian, Pacific, and Southern Oceans: Schematics and Transports,
Oceanography, 26, 80–97, <a href="https://doi.org/10.5670/oceanog.2013.07" target="_blank">https://doi.org/10.5670/oceanog.2013.07</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>Thomas(2021)</label><mixed-citation>
Thomas, S. D. A.: so-wise/so-fronts: Defining Southern Ocean fronts using
unsupervised classification, Zenodo [code], <a href="https://doi.org/10.5281/zenodo.5500666" target="_blank">https://doi.org/10.5281/zenodo.5500666</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>Thompson and Sallée(2012)</label><mixed-citation>
Thompson, A. F. and Sallée, J.-B.: Jets and topography: Jet transitions and
the impact on transport in the Antarctic Circumpolar Current, J.
Phys. Oceanogr., 42, 956–972, <a href="https://doi.org/10.1175/JPO-D-11-0135.1" target="_blank">https://doi.org/10.1175/JPO-D-11-0135.1</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>Thompson et al.(2010)Thompson, Haynes, Wilson, and
Richards</label><mixed-citation>
Thompson, A. F., Haynes, P. H., Wilson, C., and Richards, K. J.: Rapid
Southern Ocean front transitions in an eddy-resolving ocean GCM,
Geophys. Res. Lett., 37, L23602, <a href="https://doi.org/10.1029/2010GL045386" target="_blank">https://doi.org/10.1029/2010GL045386</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>Thompson et al.(2018)Thompson, Stewart, Spence, and
Heywood</label><mixed-citation>
Thompson, A. F., Stewart, A. L., Spence, P., and Heywood, K. J.: The
Antarctic Slope Current in a Changing Climate, Rev.
Geophys., 56, 741–770, <a href="https://doi.org/10.1029/2018RG000624" target="_blank">https://doi.org/10.1029/2018RG000624</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>Thompson et al.(2020)Thompson, Speer, and
Schulze Chretien</label><mixed-citation>
Thompson, A. F., Speer, K. G., and Schulze Chretien, L. M.: Genesis of the
Antarctic Slope Current in West Antarctica, Geophys. Res. Lett., 47, e2020GL087802,
<a href="https://doi.org/10.1029/2020GL087802" target="_blank">https://doi.org/10.1029/2020GL087802</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>Verdy and Mazloff(2017)</label><mixed-citation>
Verdy, A. and Mazloff, M. R.: A data assimilating model for estimating Southern
Ocean biogeochemistry, J. Geophys. Res.-Oceans, 122,
6968–6988, <a href="https://doi.org/10.1002/2016jc012650" target="_blank">https://doi.org/10.1002/2016jc012650</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>Verdy and Mazloff(2021)</label><mixed-citation>
Verdy, A. and Mazloff, M.:
B-SOSE iteration 106, [data set], available at:
<a href="http://sose.ucsd.edu/BSOSE6_iter106_solution.html" target="_blank"/>,
last access: 4 May 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>Vernet et al.(2019)Vernet, Geibert, Hoppema, Brown, Haas, Hellmer,
Jokat, Jullion, Mazloff, Bakker, Brearley, Croot, Hattermann, Hauck,
Hillenbrand, Hoppe, Huhn, Koch, Lechtenfeld, Meredith, Naveira Garabato,
Nothig, Peeken, Rutgers van der Loeff, Schmidtko, Schroder, Strass,
Torres-Valdes, and Verdy</label><mixed-citation>
Vernet, M., Geibert, W., Hoppema, M., Brown, P. J., Haas, C., Hellmer, H. H.,
Jokat, W., Jullion, L., Mazloff, M., Bakker, D. C. E., Brearley, J. A.,
Croot, P., Hattermann, T., Hauck, J., Hillenbrand, C.-D., Hoppe, C. J. M.,
Huhn, O., Koch, B. P., Lechtenfeld, O. J., Meredith, M. P., Naveira Garabato,
A. C., Nothig, E.-M., Peeken, I., Rutgers van der Loeff, M. M., Schmidtko,
S., Schroder, M., Strass, V. H., Torres-Valdes, S., and Verdy, A.: The
Weddell Gyre, Southern Ocean: present knowledge and future challenges,
Rev. Geophys., 57, 623–708, <a href="https://doi.org/10.1029/2018RG000604" target="_blank">https://doi.org/10.1029/2018RG000604</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>Vincent and Folorunso(2009)Vincent, Folorunso
et al.</label><mixed-citation>
Vincent, O. R. and Folorunso, O.: A descriptive algorithm for sobel image
edge detection, in: Proceedings of informing science &amp; IT education
conference (InSITE), Informing Science Institute
California, 40,   97–107, <a href="https://doi.org/10.28945/3351" target="_blank">https://doi.org/10.28945/3351</a>, 2009.

</mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>Wunsch and Heimbach(2007)</label><mixed-citation>
Wunsch, C. and Heimbach, P.: Practical global oceanic state estimation, Physica D, 230, 197–208, <a href="https://doi.org/10.1016/j.physd.2006.09.040" target="_blank">https://doi.org/10.1016/j.physd.2006.09.040</a>,
2007.
</mixed-citation></ref-html>--></article>
