<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">NEJSDS</journal-id>
<journal-title-group><journal-title>The New England Journal of Statistics in Data Science</journal-title></journal-title-group>
<issn pub-type="ppub">2693-7166</issn><issn-l>2693-7166</issn-l>
<publisher>
<publisher-name>New England Statistical Society</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">NEJSDS72</article-id>
<article-id pub-id-type="doi">10.51387/24-NEJSDS72</article-id>
<article-categories><subj-group subj-group-type="area">
<subject>Statistical Methodology</subject></subj-group><subj-group subj-group-type="heading">
<subject>Methodology Article</subject></subj-group></article-categories>
<title-group>
<article-title>Contrastive Inverse Regression for Dimension Reduction</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Hawke</surname><given-names>Sam</given-names></name><email xlink:href="mailto:shawke@unc.edu">shawke@unc.edu</email><xref ref-type="aff" rid="j_nejsds72_aff_001"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Ma</surname><given-names>Yueen</given-names></name><email xlink:href="mailto:myueen@unc.edu">myueen@unc.edu</email><xref ref-type="aff" rid="j_nejsds72_aff_002"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Luo</surname><given-names>Hengrui</given-names></name><email xlink:href="mailto:hrluo@lbl.gov">hrluo@lbl.gov</email><xref ref-type="aff" rid="j_nejsds72_aff_003"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Li</surname><given-names>Didong</given-names></name><email xlink:href="mailto:didongli@unc.edu">didongli@unc.edu</email><xref ref-type="aff" rid="j_nejsds72_aff_004"/><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<aff id="j_nejsds72_aff_001">Department of Biostatistics, <institution>University of North Carolina at Chapel Hill</institution>, <country>USA</country>. E-mail address: <email xlink:href="mailto:shawke@unc.edu">shawke@unc.edu</email></aff>
<aff id="j_nejsds72_aff_002">Department of Statistics and Operations Research, <institution>University of North Carolina at Chapel Hill</institution>, <country>USA</country>. E-mail address: <email xlink:href="mailto:myueen@unc.edu">myueen@unc.edu</email></aff>
<aff id="j_nejsds72_aff_003">Computational Research Division, <institution>Lawrence Berkeley Laboratory</institution>, <country>USA</country>. Department of Statistics, <institution>Rice University</institution>, <country>USA</country>. E-mail address: <email xlink:href="mailto:hrluo@lbl.gov">hrluo@lbl.gov</email>E-mail address: <email xlink:href="mailto:hrluo@rice.edu">hrluo@rice.edu</email></aff>
<aff id="j_nejsds72_aff_004">Department of Biostatistics, <institution>University of North Carolina at Chapel Hill</institution>, <country>USA</country>. E-mail address: <email xlink:href="mailto:didongli@unc.edu">didongli@unc.edu</email></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Data and code available in <uri>https://github.com/myueen/contrastive-inverse-regression</uri>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2025</year></pub-date><pub-date pub-type="epub"><day>19</day><month>11</month><year>2024</year></pub-date><volume>3</volume><issue>1</issue><fpage>106</fpage><lpage>118</lpage><supplementary-material id="S1" content-type="document" xlink:href="nejsds72_s001.pdf" mimetype="application" mime-subtype="pdf">
<caption>
<title>Supplementary Material</title>
<p>Additional experimental details are included in the supplementary material.</p>
</caption>
</supplementary-material><history><date date-type="accepted"><day>3</day><month>10</month><year>2024</year></date></history>
<permissions><copyright-statement>© 2025 New England Statistical Society</copyright-statement><copyright-year>2025</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>Supervised dimension reduction (SDR) has been a topic of growing interest in data science, as it enables the reduction of high-dimensional covariates while preserving the functional relation with certain response variables of interest. However, existing SDR methods are not suitable for analyzing datasets collected from case-control studies. In this setting, the goal is to learn and exploit the low-dimensional structure unique to or enriched by the case group, also known as the foreground group. While some unsupervised techniques such as the contrastive latent variable model and its variants have been developed for this purpose, they fail to preserve the functional relationship between the dimension-reduced covariates and the response variable. In this paper, we propose a supervised dimension reduction method called contrastive inverse regression (CIR) specifically designed for the contrastive setting. CIR introduces an optimization problem defined on the Stiefel manifold with a non-standard loss function. We prove the convergence of CIR to a local optimum using a gradient descent-based algorithm, and our numerical study empirically demonstrates the improved performance over competing methods for high-dimensional data.</p>
</abstract>
<kwd-group>
<label>Keywords and phrases</label>
<kwd>Case-control studies</kwd>
<kwd>Supervised dimension reduction</kwd>
<kwd>Optimization on Stiefel manifold</kwd>
</kwd-group>
<funding-group><funding-statement>SH was supported by NIH grants T32ES007018 and UM1 TR004406; DL was supported by NIH grants R01 AG079291, R56 LM013784, R01 HL149683, and UM1 TR004406, R01 LM014407, P30 ES010126.</funding-statement></funding-group>
</article-meta>
</front>
<body>
<sec id="j_nejsds72_s_001">
<label>1</label>
<title>Introduction</title>
<p>The field of data science has seen the growing importance of dimension reduction (DR) techniques as a preliminary step in processing large-scale biological datasets, such as single-cell RNA sequencing data. These techniques aid in tasks like data visualization, structural pattern discovery, and subsequent biological analyses. Within this broader context, Supervised Dimension Reduction (SDR, also known as sufficient dimension reduction) methodologies have gained significant attention [<xref ref-type="bibr" rid="j_nejsds72_ref_016">16</xref>, <xref ref-type="bibr" rid="j_nejsds72_ref_037">37</xref>]. In the study by [<xref ref-type="bibr" rid="j_nejsds72_ref_061">61</xref>], a comparison is drawn between SDR techniques and unsupervised counterparts like Principal Component Analysis (PCA), Spherelets [<xref ref-type="bibr" rid="j_nejsds72_ref_040">40</xref>], and Spherical Rotation Component Analysis [<xref ref-type="bibr" rid="j_nejsds72_ref_046">46</xref>], emphasizing the increasing prominence and utility of SDR in contemporary data science.</p>
<p>Given paired observations <inline-formula id="j_nejsds72_ineq_001"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">y</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="double-struck">R</mml:mi></mml:math><tex-math><![CDATA[$(x,y)\in {\mathbb{R}^{p}}\times \mathbb{R}$]]></tex-math></alternatives></inline-formula>, where <italic>x</italic> consists of <italic>p</italic> covariates, and <italic>y</italic> is the corresponding response or output variable, the common assumption in SDR is that 
<disp-formula id="j_nejsds72_eq_001">
<label>(1.1)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="italic">y</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">φ</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ϵ</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mtext>for some function</mml:mtext>
<mml:mspace width="2.5pt"/>
<mml:mi mathvariant="italic">φ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ y=\varphi \big({V^{\top }}x,\epsilon \big),\hspace{2.5pt}\text{for some function}\hspace{2.5pt}\varphi ,\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds72_ineq_002"><alternatives><mml:math>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$V\in {\mathbb{R}^{p\times d}}$]]></tex-math></alternatives></inline-formula> with <inline-formula id="j_nejsds72_ineq_003"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo stretchy="false">≪</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi></mml:math><tex-math><![CDATA[$d\ll p$]]></tex-math></alternatives></inline-formula> is the projection matrix from a high-dimensional to a low-dimensional space, <italic>ϵ</italic> is the measurement error independent of <italic>x</italic>, and <italic>φ</italic> is an arbitrary unknown function. For example, in a single-cell RNA sequencing dataset, <italic>x</italic> could be the expression of genes for a cell and <italic>y</italic> could be the cell type.</p>
<p>Under assumption (<xref rid="j_nejsds72_eq_001">1.1</xref>), although the low-dimensional representation <inline-formula id="j_nejsds72_ineq_004"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">x</mml:mi></mml:math><tex-math><![CDATA[${V^{\top }}x$]]></tex-math></alternatives></inline-formula> is determined by a linear transformation, the function <italic>φ</italic> is an arbitrary unknown function. In this paper, we stick to the assumption in (<xref rid="j_nejsds72_eq_001">1.1</xref>) to focus on linear DR methods for two reasons. First, linear methods are computationally more efficient, particularly for large <italic>p</italic> and large <italic>n</italic>. Second, linear methods are more interpretable, which is an essential characteristic in scientific applications. For instance, in the example above, each column of <inline-formula id="j_nejsds72_ineq_005"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">x</mml:mi></mml:math><tex-math><![CDATA[${V^{\top }}x$]]></tex-math></alternatives></inline-formula> is often considered as a genetic pathway [<xref ref-type="bibr" rid="j_nejsds72_ref_004">4</xref>]. Although our proposed method can be extended to nonlinear cases by the kernel method, we will leave this for future work.</p>
<p>Sliced Inverse Regression (SIR, [<xref ref-type="bibr" rid="j_nejsds72_ref_041">41</xref>]) is a well-established technique for supervised dimension reduction that is widely applicable in multiple scenarios due to its roots in regression analysis. It has been shown to have strong consistency results in both fixed dimensional [<xref ref-type="bibr" rid="j_nejsds72_ref_032">32</xref>] and high-dimensional [<xref ref-type="bibr" rid="j_nejsds72_ref_045">45</xref>] settings. The goal of SIR is to capture the most relevant low-dimensional linear subspace without any parametric or nonparametric model-fitting process for <italic>φ</italic>.</p>
<p>Moreover, SIR offers a geometric interpretation by conditioning on the sufficient statistics of the input distribution [<xref ref-type="bibr" rid="j_nejsds72_ref_041">41</xref>, <xref ref-type="bibr" rid="j_nejsds72_ref_018">18</xref>]. SIR incorporates the idea of linear dimension reduction with statistical sufficiency. In SIR, given a pair of features <inline-formula id="j_nejsds72_ineq_006"><alternatives><mml:math>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$x\in {\mathbb{R}^{p}}$]]></tex-math></alternatives></inline-formula> and univariate response <inline-formula id="j_nejsds72_ineq_007"><alternatives><mml:math>
<mml:mi mathvariant="italic">y</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mi mathvariant="double-struck">R</mml:mi></mml:math><tex-math><![CDATA[$y\in \mathbb{R}$]]></tex-math></alternatives></inline-formula>, the goal is to find a matrix <inline-formula id="j_nejsds72_ineq_008"><alternatives><mml:math>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$V\in {\mathbb{R}^{p\times d}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_009"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi></mml:math><tex-math><![CDATA[$d\lt p$]]></tex-math></alternatives></inline-formula> such that <italic>y</italic> is conditionally independent of <italic>x</italic> given <inline-formula id="j_nejsds72_ineq_010"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">x</mml:mi></mml:math><tex-math><![CDATA[${V^{\top }}x$]]></tex-math></alternatives></inline-formula>. Although the matrix <italic>V</italic> is not identifiable, the column space of <italic>V</italic>, denoted <inline-formula id="j_nejsds72_ineq_011"><alternatives><mml:math>
<mml:mi mathvariant="script">C</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{C}(V)$]]></tex-math></alternatives></inline-formula>, is identifiable.</p>
<p>Motivated by emerging modern high-dimensional [<xref ref-type="bibr" rid="j_nejsds72_ref_025">25</xref>, <xref ref-type="bibr" rid="j_nejsds72_ref_044">44</xref>, <xref ref-type="bibr" rid="j_nejsds72_ref_064">64</xref>] and biological datasets [<xref ref-type="bibr" rid="j_nejsds72_ref_028">28</xref>, <xref ref-type="bibr" rid="j_nejsds72_ref_042">42</xref>], SIR evolved and admitted several generalizations, including localized SIR [<xref ref-type="bibr" rid="j_nejsds72_ref_068">68</xref>], kernel SIR [<xref ref-type="bibr" rid="j_nejsds72_ref_067">67</xref>], SIR with regularization [<xref ref-type="bibr" rid="j_nejsds72_ref_042">42</xref>], SIR for longitudinal data [<xref ref-type="bibr" rid="j_nejsds72_ref_034">34</xref>, <xref ref-type="bibr" rid="j_nejsds72_ref_043">43</xref>], metric response values [<xref ref-type="bibr" rid="j_nejsds72_ref_060">60</xref>], and online SIR [<xref ref-type="bibr" rid="j_nejsds72_ref_010">10</xref>].</p>
<p>In this article, we focus on a specific type of high-dimensional biological data, where the dataset consists of two groups — a foreground group, also known as treatment group or case group, and a background group, also known as control group. The goal is to identify the low-dimensional structure, variation, and information unique to the foreground group for downstream analysis. This situation arises naturally in many scientific experiments with two subpopulations. For example, in Electronic Health Record (EHR) data, the foreground data could be health-related covariates from patients who received certain medical treatment, while the background data could be measurements from healthy patients who did not receive any treatment. In this case, the goal is to identify a unique structure in patients who received the treatment that can predict future outcomes. In a genomics context, the foreground data could be gene expression measurements from patients with a disease, and the background data could be measurements from healthy people. In this case, the goal is to predict a certain phenotype for the diseased patient for disease analysis and future therapy.</p>
<p>Previous contrastive models, such as the contrastive latent variable model (CLVM, [<xref ref-type="bibr" rid="j_nejsds72_ref_073">73</xref>]), contrastive principal component analysis (CPCA, [<xref ref-type="bibr" rid="j_nejsds72_ref_001">1</xref>]), probabilistic contrastive principal component analysis (PCPCA, [<xref ref-type="bibr" rid="j_nejsds72_ref_038">38</xref>]), and the contrastive Poisson latent variable model (CPLVM, [<xref ref-type="bibr" rid="j_nejsds72_ref_035">35</xref>]), have shown that using the case-control structure between foreground and background groups can greatly improve the effectiveness of dimension reduction over standard DR methods such as PCA and its variants. However, to the best of our knowledge, none of these unsupervised contrastive dimension reduction methods is directly applicable to the SDR setting.</p>
<p>In this work, we move from these unsupervised contrastive dimension reduction methods to a supervised contrastive dimension reduction setting. By combining the idea of contrastive loss function and the sufficient dimension reduction considered in the SIR model, we propose the Contrastive Inverse Regression (CIR) model, which exactly recovers SIR when a certain parameter is zero. The CIR model sheds light on how to explore and exploit the contrastive structures in supervised dimension reduction.</p>
<table-wrap id="j_nejsds72_tab_001">
<label>Table 1</label>
<caption>
<p>Categorization of DR methods by whether they are supervised or contrastive.</p>
</caption>
<table>
<thead>
<tr>
<td style="vertical-align: top; text-align: center; border-top: double; border-bottom: solid thin"><inline-formula id="j_nejsds72_ineq_012"><inline-graphic xlink:href="nejsds72_g001.jpg" id="j_nejsds72_ingr_001"/></inline-formula></td>
<td style="vertical-align: top; text-align: center; border-top: double; border-bottom: solid thin">No</td>
<td style="vertical-align: top; text-align: center; border-top: double; border-bottom: solid thin">Yes</td>
</tr>
</thead>
<tbody>
<tr>
<td style="vertical-align: top; text-align: center">No</td>
<td style="vertical-align: top; text-align: center">PCA, CCA</td>
<td style="vertical-align: top; text-align: center">SIR, LDA, LASSO</td>
</tr>
<tr>
<td style="vertical-align: top; text-align: center; border-bottom: solid thin">Yes</td>
<td style="vertical-align: top; text-align: center; border-bottom: solid thin">CPCA, PCPCA</td>
<td style="vertical-align: top; text-align: center; border-bottom: solid thin"><bold>CIR</bold></td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Table <xref rid="j_nejsds72_tab_001">1</xref> lists several popular DR methods and their properties. The table categorizes these methods as “contrastive” and “supervised”, based on whether they are designed for case-control data and able to identify low-dimensional structure unique to the case group, and if they take the response variable <italic>y</italic> into consideration and use <inline-formula id="j_nejsds72_ineq_013"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">x</mml:mi></mml:math><tex-math><![CDATA[${V^{\top }}x$]]></tex-math></alternatives></inline-formula> to predict <italic>y</italic>. For example, PCA, the most well known DR method, is neither contrastive nor supervised. Similarly, canonical correlation analysis (CCA, [<xref ref-type="bibr" rid="j_nejsds72_ref_031">31</xref>]) does not utilize <italic>y</italic> or the unique information of one group, which makes it neither contrastive nor supervised. Methods such as CLVM, CPCA, PCPCA, and CPLVM are contrastive but not supervised. On the other hand, classical supervised DR methods including SIR [<xref ref-type="bibr" rid="j_nejsds72_ref_041">41</xref>], linear discriminant analysis (LDA, [<xref ref-type="bibr" rid="j_nejsds72_ref_026">26</xref>]), and the least absolute shrinkage and selection operator (LASSO, [<xref ref-type="bibr" rid="j_nejsds72_ref_058">58</xref>]) are supervised but not contrastive. Our proposed method, CIR, combines both contrastive and supervised features by utilizing both the response <italic>y</italic> and the case-control structure.</p>
<p>It is important to note that the assumption (<xref rid="j_nejsds72_eq_001">1.1</xref>) does not limit the response variable <italic>y</italic> to be continuous or categorical, and thus we do not distinguish between regression and classification. However, some methods listed in Table <xref rid="j_nejsds72_tab_001">1</xref> are specifically designed for either continuous <italic>y</italic> (regression, LASSO) or categorical <italic>y</italic> (classification, LDA). CIR handles both scenarios with the only difference being in the choice of slices, as explained in Section <xref rid="j_nejsds72_s_002">2</xref>. Furthermore, not all existing DR methods are included in this table. For example, the recently proposed linear optimal low-rank projection (LOL, [<xref ref-type="bibr" rid="j_nejsds72_ref_061">61</xref>]) is designed for the classification setting and requires the number of classes to be smaller than the reduced dimension <italic>d</italic>. This can be restrictive, for example, when applied to a single-cell RNA sequencing dataset, where <italic>d</italic> is required to be greater than the number of cell types. In contrast, CIR does not require such data-dependent constraints on the reduced dimension <italic>d</italic>. Similarly, data visualization algorithms that require <inline-formula id="j_nejsds72_ineq_014"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn></mml:math><tex-math><![CDATA[$d=2$]]></tex-math></alternatives></inline-formula> such as the t-distributed stochastic neighbor embedding (tSNE, [<xref ref-type="bibr" rid="j_nejsds72_ref_059">59</xref>]) and uniform manifold approximation and projection (UMAP, [<xref ref-type="bibr" rid="j_nejsds72_ref_005">5</xref>]) are not listed in the table.</p>
<p>We now present our proposed methodology, including an algorithm for solving the associated nonconvex optimization problem on the Stiefel manifold. We also provide analysis of the convergence of the algorithm, and conduct extensive experiments to demonstrate its superior performance on high-dimensional biomedical datasets when compared to existing DR methods. All proofs are provided in the appendix, and additional experimental details are in the supplement material.</p>
</sec>
<sec id="j_nejsds72_s_002" sec-type="methods">
<label>2</label>
<title>Method</title>
<p>To maintain consistency, we will use the terms “foreground group” and “background group” instead of “case-control” or “treatment-control” in the remaining sections. First, we briefly review SIR as our motivation.</p><statement id="j_nejsds72_stat_001"><label>Definition 1</label>
<title>(Stiefel manifold).</title>
<p><inline-formula id="j_nejsds72_ineq_015"><alternatives><mml:math>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>:</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mo>:</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mo movablelimits="false">I</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$\operatorname{St}(p,d):=\{V\in {\mathbb{R}^{p\times d}}:{V^{\top }}V={\operatorname{I}_{d}}\}$]]></tex-math></alternatives></inline-formula> admits a smooth manifold structure equipped with a Riemannian metric, called the Stiefel manifold.</p></statement>
<p>Recall that under the assumption in Equation (<xref rid="j_nejsds72_eq_001">1.1</xref>), the centered inverse regression curve, <inline-formula id="j_nejsds72_ineq_016"><alternatives><mml:math>
<mml:mi mathvariant="double-struck">E</mml:mi>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">y</mml:mi>
<mml:mo fence="true" stretchy="false">]</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="double-struck">E</mml:mi>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo fence="true" stretchy="false">]</mml:mo></mml:math><tex-math><![CDATA[$\mathbb{E}[x|y]-\mathbb{E}[x]$]]></tex-math></alternatives></inline-formula>, lies exactly in the linear space spanned by columns of <inline-formula id="j_nejsds72_ineq_017"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi></mml:math><tex-math><![CDATA[${\Sigma _{xx}}V$]]></tex-math></alternatives></inline-formula>, denoted by <inline-formula id="j_nejsds72_ineq_018"><alternatives><mml:math>
<mml:mi mathvariant="script">C</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{C}({\Sigma _{xx}}V)$]]></tex-math></alternatives></inline-formula>, where <inline-formula id="j_nejsds72_ineq_019"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{xx}}$]]></tex-math></alternatives></inline-formula> is the covariance matrix of <italic>x</italic>. This linear subspace is called the <italic>effective dimension reduced</italic> (e.d.r.) space [<xref ref-type="bibr" rid="j_nejsds72_ref_041">41</xref>]. As a result, the objective of SIR is to minimize the expected squared distance between <inline-formula id="j_nejsds72_ineq_020"><alternatives><mml:math>
<mml:mi mathvariant="double-struck">E</mml:mi>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">y</mml:mi>
<mml:mo fence="true" stretchy="false">]</mml:mo></mml:math><tex-math><![CDATA[$\mathbb{E}[x|y]$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_021"><alternatives><mml:math>
<mml:mi mathvariant="script">C</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{C}({\Sigma _{xx}}V)$]]></tex-math></alternatives></inline-formula>: 
<disp-formula id="j_nejsds72_eq_002">
<label>(2.1)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:munder>
<mml:mrow>
<mml:mo movablelimits="false">min</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:munder>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="double-struck">E</mml:mi>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">y</mml:mi>
<mml:mo fence="true" stretchy="false">]</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="double-struck">E</mml:mi>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo fence="true" stretchy="false">]</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="script">C</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ \underset{V\in \operatorname{St}(p,d)}{\min }{\mathbb{E}_{y}}\big[{d^{2}}(\mathbb{E}[x|y]-\mathbb{E}[x],\mathcal{C}({\Sigma _{xx}}V)\big]\]]]></tex-math></alternatives>
</disp-formula> 
where <italic>d</italic> is the Euclidean distance.</p>
<p>In the contrastive setting, denote foreground data by <inline-formula id="j_nejsds72_ineq_022"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">y</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="double-struck">R</mml:mi></mml:math><tex-math><![CDATA[$(x,y)\in {\mathbb{R}^{p}}\times \mathbb{R}$]]></tex-math></alternatives></inline-formula> and background data by <inline-formula id="j_nejsds72_ineq_023"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="double-struck">R</mml:mi></mml:math><tex-math><![CDATA[$(\widetilde{x},\widetilde{y})\in {\mathbb{R}^{p}}\times \mathbb{R}$]]></tex-math></alternatives></inline-formula>. For convenience, we assume that <italic>x</italic> and <inline-formula id="j_nejsds72_ineq_024"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{x}$]]></tex-math></alternatives></inline-formula> are centered at the origin so that <inline-formula id="j_nejsds72_ineq_025"><alternatives><mml:math>
<mml:mi mathvariant="double-struck">E</mml:mi>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo fence="true" stretchy="false">]</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="double-struck">E</mml:mi>
<mml:mo fence="true" stretchy="false">[</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo fence="true" stretchy="false">]</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\mathbb{E}[x]=\mathbb{E}[\widetilde{x}]=0$]]></tex-math></alternatives></inline-formula>.</p>
<p>Our goal is to find a low-dimensional representation of <italic>x</italic>, denoted by <inline-formula id="j_nejsds72_ineq_026"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">x</mml:mi></mml:math><tex-math><![CDATA[${V^{\top }}x$]]></tex-math></alternatives></inline-formula>, such that <italic>y</italic> is determined by <inline-formula id="j_nejsds72_ineq_027"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">x</mml:mi></mml:math><tex-math><![CDATA[${V^{\top }}x$]]></tex-math></alternatives></inline-formula> while <inline-formula id="j_nejsds72_ineq_028"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{y}$]]></tex-math></alternatives></inline-formula> is not determined by <inline-formula id="j_nejsds72_ineq_029"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[${V^{\top }}\widetilde{x}$]]></tex-math></alternatives></inline-formula>. The goal of CIR is to find a low-dimensional subspace represented by <italic>V</italic> such that 
<disp-formula id="j_nejsds72_eq_003">
<label>(2.2)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right">
<mml:mtr>
<mml:mtd class="align-odd">
<mml:mrow class="cases">
<mml:mfenced separators="" open="{" close="">
<mml:mrow>
<mml:mtable columnspacing="10.0pt" equalrows="false" columnlines="none" equalcolumns="false" columnalign="left left">
<mml:mtr>
<mml:mtd class="array">
<mml:mi mathvariant="italic">y</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">φ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ϵ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd class="array">
<mml:mtext>for some function</mml:mtext>
<mml:mspace width="2.5pt"/>
<mml:mi mathvariant="italic">φ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="array">
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo stretchy="false">≠</mml:mo>
<mml:mi mathvariant="italic">ψ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ϵ</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd class="array">
<mml:mtext>for any function</mml:mtext>
<mml:mspace width="2.5pt"/>
<mml:mi mathvariant="italic">ψ</mml:mi>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ \left\{\begin{array}{l@{\hskip10.0pt}l}y=\varphi ({V^{\top }}x,\epsilon ),\hspace{1em}& \text{for some function}\hspace{2.5pt}\varphi ,\\ {} \widetilde{y}\ne \psi ({V^{\top }}\widetilde{x},\widetilde{\epsilon }),\hspace{1em}& \text{for any function}\hspace{2.5pt}\psi .\end{array}\right.\]]]></tex-math></alternatives>
</disp-formula> 
That is, the column space of <italic>V</italic> captures the low-dimensional information unique to the foreground group so that we can use <inline-formula id="j_nejsds72_ineq_030"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">x</mml:mi></mml:math><tex-math><![CDATA[${V^{\top }}x$]]></tex-math></alternatives></inline-formula> to predict <italic>y</italic> through <italic>φ</italic>, but cannot use <inline-formula id="j_nejsds72_ineq_031"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[${V^{\top }}\widetilde{x}$]]></tex-math></alternatives></inline-formula> to predict <inline-formula id="j_nejsds72_ineq_032"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{y}$]]></tex-math></alternatives></inline-formula> through any function <italic>ψ</italic>. Instead of optimizing a single loss similar to SIR, CIR aims at optimizing the subspace <inline-formula id="j_nejsds72_ineq_033"><alternatives><mml:math>
<mml:mi mathvariant="script">C</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{C}({\Sigma _{xx}}V)$]]></tex-math></alternatives></inline-formula> to minimize the “contrastive loss function”: 
<disp-formula id="j_nejsds72_eq_004">
<label>(2.3)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd">
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
<mml:mtd class="align-even">
<mml:mo>:</mml:mo>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="double-struck">E</mml:mi>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mspace width="2.5pt"/>
<mml:mo stretchy="false">|</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mi mathvariant="italic">y</mml:mi>
<mml:mo fence="true" stretchy="false">]</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="script">C</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="2.5pt"/>
<mml:mspace width="2.5pt"/>
<mml:mspace width="2.5pt"/>
<mml:mspace width="2.5pt"/>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="double-struck">E</mml:mi>
<mml:mo fence="true" stretchy="false">[</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mspace width="2.5pt"/>
<mml:mo stretchy="false">|</mml:mo>
<mml:mspace width="2.5pt"/><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo fence="true" stretchy="false">]</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="script">C</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}f(V)& :={\mathbb{E}_{y}}\big[{d^{2}}(\mathbb{E}[x\hspace{2.5pt}|\hspace{2.5pt}y],\mathcal{C}({\Sigma _{xx}}V)\big]\\ {} & \hspace{2.5pt}\hspace{2.5pt}\hspace{2.5pt}\hspace{2.5pt}-\alpha {\mathbb{E}_{\widetilde{y}}}\big[{d^{2}}(\mathbb{E}[\widetilde{x}\hspace{2.5pt}|\hspace{2.5pt}\widetilde{y}],\mathcal{C}({\Sigma _{\widetilde{x}\widetilde{x}}}V)\big],\end{aligned}\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds72_ineq_034"><alternatives><mml:math>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo stretchy="false">≥</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\alpha \ge 0$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_035"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo movablelimits="false">Cov</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mtext>and</mml:mtext>
<mml:mspace width="2.5pt"/>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo movablelimits="false">Cov</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${\Sigma _{xx}}=\operatorname{Cov}(X)\hspace{2.5pt}\text{and}\hspace{2.5pt}{\Sigma _{\widetilde{x}\widetilde{x}}}=\operatorname{Cov}(\widetilde{X})\in {\mathbb{R}^{p\times p}}$]]></tex-math></alternatives></inline-formula>, and <italic>d</italic> is the Euclidean distance. Define the following notation: 
<disp-formula id="j_nejsds72_eq_005">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="double-struck">E</mml:mi>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mspace width="2.5pt"/>
<mml:mo stretchy="false">|</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mi mathvariant="italic">y</mml:mi>
<mml:mo fence="true" stretchy="false">]</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="double-struck">E</mml:mi>
<mml:mo fence="true" stretchy="false">[</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mspace width="2.5pt"/>
<mml:mo stretchy="false">|</mml:mo>
<mml:mspace width="2.5pt"/><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo fence="true" stretchy="false">]</mml:mo>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo movablelimits="false">Cov</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo movablelimits="false">Cov</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{array}{l}\displaystyle {v_{y}}=\mathbb{E}[x\hspace{2.5pt}|\hspace{2.5pt}y],\hspace{2.5pt}{v_{\widetilde{y}}}=\mathbb{E}[\widetilde{x}\hspace{2.5pt}|\hspace{2.5pt}\widetilde{y}]\in {\mathbb{R}^{p}}\\ {} \displaystyle {\Sigma _{x}}=\operatorname{Cov}({v_{y}}),\hspace{2.5pt}{\Sigma _{\widetilde{x}}}=\operatorname{Cov}({v_{\widetilde{y}}})\in {\mathbb{R}^{p\times p}}\end{array}\]]]></tex-math></alternatives>
</disp-formula> 
<inline-formula id="j_nejsds72_ineq_036"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${v_{y}}$]]></tex-math></alternatives></inline-formula> (and <inline-formula id="j_nejsds72_ineq_037"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${v_{\widetilde{y}}}$]]></tex-math></alternatives></inline-formula>) are called the centered inverse regression curves [<xref ref-type="bibr" rid="j_nejsds72_ref_041">41</xref>, <xref ref-type="bibr" rid="j_nejsds72_ref_060">60</xref>]. The resulting loss function <italic>f</italic> balances the effectiveness of dimension reduction between the foreground and background groups. We can adjust the hyperparameter <italic>α</italic> to express our belief in the importance of the background group. Note that the parameter <italic>α</italic> appears naturally in other contrastive DR methods, including CPCA and PCPCA.</p><statement id="j_nejsds72_stat_002"><label>Proposition 1.</label>
<p>The objective function <italic>f</italic> given by Equation (<xref rid="j_nejsds72_eq_004">2.3</xref>) is simplified as 
<disp-formula id="j_nejsds72_eq_006">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right">
<mml:mtr>
<mml:mtd class="align-odd">
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:mo movablelimits="false">tr</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo movablelimits="false">tr</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ f(V)=-\operatorname{tr}\big({V^{\top }}AV{\big({V^{\top }}BV\big)^{-1}}\big)+\alpha \operatorname{tr}\big({V^{\top }}\widetilde{A}V{\big({V^{\top }}\widetilde{B}V\big)^{-1}}\big)\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds72_ineq_038"><alternatives><mml:math>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[$A={\Sigma _{xx}}{\Sigma _{x}}{\Sigma _{xx}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_039"><alternatives><mml:math>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[$B={\Sigma _{xx}^{2}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_040"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[$\widetilde{A}={\Sigma _{\widetilde{x}\widetilde{x}}}{\Sigma _{\widetilde{x}}}{\Sigma _{\widetilde{x}\widetilde{x}}}$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds72_ineq_041"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[$\widetilde{B}={\Sigma _{\widetilde{x}\widetilde{x}}^{2}}$]]></tex-math></alternatives></inline-formula>.</p></statement>
<p>Note that <italic>V</italic> is not identifiable, and is identifiable only up to a <italic>d</italic>-dimensional rotation. However, the contrastive loss <italic>f</italic>, determined by <inline-formula id="j_nejsds72_ineq_042"><alternatives><mml:math>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$V{V^{\top }}$]]></tex-math></alternatives></inline-formula>, the projection matrix to the column space of <italic>V</italic>, is invariant under such rotations. This nonidentifiability issue is common in other DR methods, including PCA, CPCA, SIR, etc, where the convention is to refer to the column space of <italic>V</italic>. Therefore, this nonidentifiability is consistent with standard practices and does not impact the validity of our results.</p>
<p>Observe that in the case where <inline-formula id="j_nejsds72_ineq_043"><alternatives><mml:math>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\alpha =0$]]></tex-math></alternatives></inline-formula>, CIR reduces to SIR. In this case, the problem can be reparameterized by <inline-formula id="j_nejsds72_ineq_044"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">V</mml:mi></mml:math><tex-math><![CDATA[${V^{\ast }}={B^{1/2}}V$]]></tex-math></alternatives></inline-formula> so that the columns are orthogonal, which reduces the loss function to a quadratic form, yielding a closed-form solution (as a generalized eigenproblem). In the case where <inline-formula id="j_nejsds72_ineq_045"><alternatives><mml:math>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\alpha \gt 0$]]></tex-math></alternatives></inline-formula>, however, we cannot perform the same trick for both <italic>B</italic> and <inline-formula id="j_nejsds72_ineq_046"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{B}$]]></tex-math></alternatives></inline-formula>, so we must resort to numerical approximations. We adopt gradient-based optimization algorithms on <inline-formula id="j_nejsds72_ineq_047"><alternatives><mml:math>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\operatorname{St}(p,d)$]]></tex-math></alternatives></inline-formula>, which are based on the gradient of <italic>f</italic> given by the following lemma.</p><statement id="j_nejsds72_stat_003"><label>Lemma 1.</label>
<p>The gradient of <italic>f</italic> is given by 
<disp-formula id="j_nejsds72_eq_007">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mo movablelimits="false">grad</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mspace width="-0.1667em"/>
<mml:mo>−</mml:mo>
<mml:mspace width="-0.1667em"/>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>+</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>−</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}& \operatorname{grad}f(V)\\ {} & =-2\big(AV{\big({V^{\top }}BV\big)^{-1}}\hspace{-0.1667em}-\hspace{-0.1667em}BV{\big({V^{\top }}BV\big)^{-1}}{V^{\top }}AV{\big({V^{\top }}BV\big)^{-1}}\big)\\ {} & \hspace{1em}+2\alpha \big(\widetilde{A}V{\big({V^{\top }}\widetilde{B}V\big)^{-1}}\\ {} & \hspace{1em}-\widetilde{B}V{\big({V^{\top }}\widetilde{B}V\big)^{-1}}{V^{\top }}\widetilde{A}V{\big({V^{\top }}\widetilde{B}V\big)^{-1}}\big).\end{aligned}\]]]></tex-math></alternatives>
</disp-formula>
</p></statement>
<p>Note that the gradient <inline-formula id="j_nejsds72_ineq_048"><alternatives><mml:math>
<mml:mo movablelimits="false">grad</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi></mml:math><tex-math><![CDATA[$\operatorname{grad}f$]]></tex-math></alternatives></inline-formula> is different from the standard gradient in Euclidean space, denoted by <inline-formula id="j_nejsds72_ineq_049"><alternatives><mml:math>
<mml:mi mathvariant="italic">D</mml:mi>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="false">
<mml:mfrac>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle></mml:math><tex-math><![CDATA[$Df=\frac{\partial f}{\partial V}$]]></tex-math></alternatives></inline-formula>. The difference is that <inline-formula id="j_nejsds72_ineq_050"><alternatives><mml:math>
<mml:mo movablelimits="false">grad</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi></mml:math><tex-math><![CDATA[$\operatorname{grad}f$]]></tex-math></alternatives></inline-formula> lies in the tangent space of <inline-formula id="j_nejsds72_ineq_051"><alternatives><mml:math>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\operatorname{St}(p,d)$]]></tex-math></alternatives></inline-formula> at <italic>V</italic>, while the Euclidean version may escape from the tangent space.</p><statement id="j_nejsds72_stat_004"><label>Theorem 1.</label>
<p>If <italic>V</italic> is a local minimizer of the optimization problem (<xref rid="j_nejsds72_eq_004">2.3</xref>), then 
<disp-formula id="j_nejsds72_eq_008">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right">
<mml:mtr>
<mml:mtd class="align-odd">
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ AVE(V)-\alpha \widetilde{A}V\widetilde{E}(V)=BVF(V)-\alpha \widetilde{B}V\widetilde{F}(V),\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds72_ineq_052"><alternatives><mml:math>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$E(V)={({V^{\top }}BV)^{-1}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_053"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$\widetilde{E}(V)={({V^{\top }}\widetilde{B}V)^{-1}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_054"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$F(V)={({V^{\top }}BV)^{-1}}{V^{\top }}AV{({V^{\top }}BV)^{-1}}$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds72_ineq_055"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$\widetilde{F}(V)={({V^{\top }}\widetilde{B}V)^{-1}}{V^{\top }}\widetilde{A}V{({V^{\top }}\widetilde{B}V)^{-1}}$]]></tex-math></alternatives></inline-formula>.</p></statement>
<p>Let <inline-formula id="j_nejsds72_ineq_056"><alternatives><mml:math>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi></mml:math><tex-math><![CDATA[$G(V)={V^{\top }}AV$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_057"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">G</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi></mml:math><tex-math><![CDATA[$\widetilde{G}(V)={V^{\top }}\widetilde{A}V$]]></tex-math></alternatives></inline-formula>. Note, then, that the local optimality condition is equivalent to 
<disp-formula id="j_nejsds72_eq_009">
<label>(2.4)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">G</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}& AVE(V)-\alpha \widetilde{A}V\widetilde{E}(V)\\ {} & \hspace{1em}=BVE(V)G(V)E(V)-\alpha \widetilde{B}V\widetilde{E}(V)\widetilde{G}(V)\widetilde{E}(V).\end{aligned}\]]]></tex-math></alternatives>
</disp-formula>
</p>
<p>In Appendix <xref rid="j_nejsds72_app_007">G</xref>, we discuss how Equation (<xref rid="j_nejsds72_eq_009">2.4</xref>) may lead to a gradient-free algorithm that involves solving an asymmetric algebraic Ricatti equation.</p>
<p>So far, we have discussed the population version, which relies on the distributions of <italic>x</italic>, <inline-formula id="j_nejsds72_ineq_058"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{x}$]]></tex-math></alternatives></inline-formula>, <italic>y</italic>, and <inline-formula id="j_nejsds72_ineq_059"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{y}$]]></tex-math></alternatives></inline-formula> that are unknown in practice. In real applications, we observe finite samples <inline-formula id="j_nejsds72_ineq_060"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${({x_{i}},{y_{i}})_{i=1}^{n}}$]]></tex-math></alternatives></inline-formula> as foreground data and <inline-formula id="j_nejsds72_ineq_061"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${({\widetilde{x}_{j}},{\widetilde{y}_{j}})_{j=1}^{m}}$]]></tex-math></alternatives></inline-formula> as background data. We denote <inline-formula id="j_nejsds72_ineq_062"><alternatives><mml:math>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$X\in {\mathbb{R}^{n\times p}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_063"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$\widetilde{X}\in {\mathbb{R}^{m\times p}}$]]></tex-math></alternatives></inline-formula> where each row represents a sample; similarly, each entry of <inline-formula id="j_nejsds72_ineq_064"><alternatives><mml:math>
<mml:mi mathvariant="italic">Y</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$Y\in {\mathbb{R}^{n}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_065"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$\widetilde{Y}\in {\mathbb{R}^{m}}$]]></tex-math></alternatives></inline-formula> represents a response value. In this case, we can replace the expectation by the sample mean to get estimates of <inline-formula id="j_nejsds72_ineq_066"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{x}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_067"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{\widetilde{x}}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_068"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{xx}}$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds72_ineq_069"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{\widetilde{x}\widetilde{x}}}$]]></tex-math></alternatives></inline-formula> and have the corresponding plug-in estimates for <italic>A</italic>, <italic>B</italic>, <inline-formula id="j_nejsds72_ineq_070"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{A}$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds72_ineq_071"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{B}$]]></tex-math></alternatives></inline-formula>. Throughout this paper, we assume <inline-formula id="j_nejsds72_ineq_072"><alternatives><mml:math>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi></mml:math><tex-math><![CDATA[$p\lt n$]]></tex-math></alternatives></inline-formula> so that all covariance matrices are nonsingular. The extension to <inline-formula id="j_nejsds72_ineq_073"><alternatives><mml:math>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi></mml:math><tex-math><![CDATA[$p\gt n$]]></tex-math></alternatives></inline-formula> is discussed in Section <xref rid="j_nejsds72_s_009">5</xref>. After computing these matrices, the problem is reduced to a manifold optimization problem [<xref ref-type="bibr" rid="j_nejsds72_ref_002">2</xref>]. The estimates of <inline-formula id="j_nejsds72_ineq_074"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{x}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_075"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{\widetilde{x}}}$]]></tex-math></alternatives></inline-formula> deserve further discussion. As shown by [<xref ref-type="bibr" rid="j_nejsds72_ref_041">41</xref>] and [<xref ref-type="bibr" rid="j_nejsds72_ref_010">10</xref>] among others, for continuous response <italic>y</italic>, the observed support of response <italic>y</italic> can be discretized into <italic>slices</italic> <inline-formula id="j_nejsds72_ineq_076"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">I</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">]</mml:mo></mml:math><tex-math><![CDATA[${I_{h}}=({q_{h-1}},{q_{h}}]$]]></tex-math></alternatives></inline-formula>, for <inline-formula id="j_nejsds72_ineq_077"><alternatives><mml:math>
<mml:mi mathvariant="italic">h</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">H</mml:mi></mml:math><tex-math><![CDATA[$h=1,\dots ,H$]]></tex-math></alternatives></inline-formula>. An estimate of <inline-formula id="j_nejsds72_ineq_078"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{x}}$]]></tex-math></alternatives></inline-formula> is given by <inline-formula id="j_nejsds72_ineq_079"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mo largeop="false" movablelimits="false">∑</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${\textstyle\sum _{h=1}^{H}}{m_{h}}{m_{h}^{\top }}$]]></tex-math></alternatives></inline-formula> where <inline-formula id="j_nejsds72_ineq_080"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="false">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:msub>
<mml:mrow>
<mml:mo largeop="false" movablelimits="false">∑</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">I</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${m_{h}}=\frac{1}{n{p_{h}}}{\textstyle\sum _{{y_{i}}\in {I_{h}}}}{x_{i}}$]]></tex-math></alternatives></inline-formula> with <inline-formula id="j_nejsds72_ineq_081"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="false">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:msubsup>
<mml:mrow>
<mml:mo largeop="false" movablelimits="false">∑</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mi mathvariant="italic">I</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">I</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${p_{h}}=\frac{1}{n}{\textstyle\sum _{i=1}^{n}}I({y_{i}}\in {I_{h}})$]]></tex-math></alternatives></inline-formula>. On the other hand, if <italic>y</italic> and <inline-formula id="j_nejsds72_ineq_082"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{y}$]]></tex-math></alternatives></inline-formula> are categorical, the slices are naturally chosen as all possible values of <italic>y</italic> and <inline-formula id="j_nejsds72_ineq_083"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{y}$]]></tex-math></alternatives></inline-formula>. Combining these pieces, we present our empirical version of the CIR algorithm in Algorithm <xref rid="j_nejsds72_fig_001">1</xref>.</p>
<p>It is worth noting that our optimization of <italic>f</italic> as a function of <inline-formula id="j_nejsds72_ineq_084"><alternatives><mml:math>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$V\in \operatorname{St}(p,d)$]]></tex-math></alternatives></inline-formula> cannot be considered as an optimization problem in <inline-formula id="j_nejsds72_ineq_085"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${\mathbb{R}^{p\times d}}$]]></tex-math></alternatives></inline-formula> with orthogonality constraints <inline-formula id="j_nejsds72_ineq_086"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mo movablelimits="false">I</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V^{\top }}V={\operatorname{I}_{d}}$]]></tex-math></alternatives></inline-formula> [<xref ref-type="bibr" rid="j_nejsds72_ref_023">23</xref>, <xref ref-type="bibr" rid="j_nejsds72_ref_007">7</xref>]. Because the term <inline-formula id="j_nejsds72_ineq_087"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${({V^{\top }}BV)^{-1}}$]]></tex-math></alternatives></inline-formula> in <italic>f</italic> is not well defined unless <italic>V</italic> is full rank, our loss function <italic>f</italic> cannot be extended to the full Euclidean space <inline-formula id="j_nejsds72_ineq_088"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${\mathbb{R}^{p\times d}}$]]></tex-math></alternatives></inline-formula>. We consider it as an optimization problem <italic>intrinsically defined</italic> on <inline-formula id="j_nejsds72_ineq_089"><alternatives><mml:math>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\operatorname{St}(p,d)$]]></tex-math></alternatives></inline-formula> as laid out by [<xref ref-type="bibr" rid="j_nejsds72_ref_002">2</xref>]. This key property excludes some commonly used optimizers on manifolds, and we will discuss more details in the next section.</p>
<fig id="j_nejsds72_fig_001">
<label>Algorithm 1:</label>
<caption>
<p>CIR.</p>
</caption>
<graphic xlink:href="nejsds72_g002.jpg"/>
</fig>
</sec>
<sec id="j_nejsds72_s_003">
<label>3</label>
<title>Theory</title>
<p>In this section, we discuss two concrete optimization algorithms for the last step in Algorithm <xref rid="j_nejsds72_fig_001">1</xref> to find <inline-formula id="j_nejsds72_ineq_090"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${V^{\ast }}$]]></tex-math></alternatives></inline-formula> and show their convergence. The optimization problem outlined in Equation (<xref rid="j_nejsds72_eq_004">2.3</xref>) does not follow the classic Li-Duan theorem for regression-based dimension reduction due to its nonconvex nature (see, e.g., [<xref ref-type="bibr" rid="j_nejsds72_ref_017">17</xref>, Prop. 8.1]). The convergence of the algorithm is discussed in detail below.</p>
<p>The first algorithm we consider is the scaled gradient projection method (SGPM) specifically designed for optimization on the Stiefel manifold [<xref ref-type="bibr" rid="j_nejsds72_ref_050">50</xref>]. We first define an analog to Lagrangian multiplier <inline-formula id="j_nejsds72_ineq_091"><alternatives><mml:math>
<mml:mi mathvariant="script">L</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="normal">Λ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>:</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo><mml:mstyle displaystyle="false">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo fence="true" stretchy="false">⟨</mml:mo>
<mml:mi mathvariant="normal">Λ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mo movablelimits="false">I</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">⟩</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{L}(V,\Lambda ):=f(V)-\frac{1}{2}\langle \Lambda ,{V^{\top }}V-{\operatorname{I}_{d}}\rangle $]]></tex-math></alternatives></inline-formula>, then the SGPM algorithm is summarized in Algorithm <xref rid="j_nejsds72_fig_002">2</xref>, where <inline-formula id="j_nejsds72_ineq_092"><alternatives><mml:math>
<mml:mi mathvariant="italic">π</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo movablelimits="false">arg</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mo movablelimits="false">min</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">Q</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">|</mml:mo>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">Q</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[$\pi (X)=\arg {\min _{Q\in \operatorname{St}(p,d)}}||X-Q|{|_{F}}$]]></tex-math></alternatives></inline-formula> is the orthogonal projection to the Stiefel manifold. Note for Algorithm <xref rid="j_nejsds72_fig_002">2</xref> that the update for <italic>μ</italic> and <inline-formula id="j_nejsds72_ineq_093"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">C</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${C_{k+1}}$]]></tex-math></alternatives></inline-formula> is intricate; see [<xref ref-type="bibr" rid="j_nejsds72_ref_050">50</xref>] for more details.</p>
<fig id="j_nejsds72_fig_002">
<label>Algorithm 2:</label>
<caption>
<p>SGPM [<xref ref-type="bibr" rid="j_nejsds72_ref_050">50</xref>].</p>
</caption>
<graphic xlink:href="nejsds72_g003.jpg"/>
</fig>
<p>To study the convergence of Algorithm <xref rid="j_nejsds72_fig_002">2</xref>, we need to study the Karush-Kuhn-Tucker (KKT) conditions for CIR:</p><statement id="j_nejsds72_stat_005"><label>Lemma 2</label>
<title>([<xref ref-type="bibr" rid="j_nejsds72_ref_050">50</xref>]).</title>
<p>The KKT conditions are given by 
<disp-formula id="j_nejsds72_eq_010">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd">
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">D</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="script">L</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="normal">Λ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
<mml:mtd class="align-even">
<mml:mo>=</mml:mo>
<mml:mo>∇</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="normal">Λ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd">
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">D</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="normal">Λ</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="script">L</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="normal">Λ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
<mml:mtd class="align-even">
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mo movablelimits="false">I</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}{D_{V}}\mathcal{L}(V,\Lambda )& =\nabla f-V\Lambda =0\\ {} {D_{\Lambda }}\mathcal{L}(V,\Lambda )& ={V^{\top }}V-{\operatorname{I}_{d}}.\end{aligned}\]]]></tex-math></alternatives>
</disp-formula>
</p></statement>
<p>Now we can state the convergence theorem of Algorithm <xref rid="j_nejsds72_fig_002">2</xref>:</p><statement id="j_nejsds72_stat_006"><label>Theorem 2.</label>
<p>Let <inline-formula id="j_nejsds72_ineq_094"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">}</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>∞</mml:mi>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${\{{V_{k}}\}_{k=1}^{\infty }}$]]></tex-math></alternatives></inline-formula> be an infinite sequence generated by Algorithm <xref rid="j_nejsds72_fig_002">2</xref>, then any accumulation point <inline-formula id="j_nejsds72_ineq_095"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{\ast }}$]]></tex-math></alternatives></inline-formula> of <inline-formula id="j_nejsds72_ineq_096"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">}</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>∞</mml:mi>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${\{{V_{k}}\}_{k=1}^{\infty }}$]]></tex-math></alternatives></inline-formula> satisfies the KKT conditions in Lemma <xref rid="j_nejsds72_stat_005">2</xref>, and <inline-formula id="j_nejsds72_ineq_097"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mo movablelimits="false">lim</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo stretchy="false">→</mml:mo>
<mml:mi>∞</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">‖</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">D</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="script">L</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">‖</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[${\lim \nolimits_{k\to \infty }}\| {D_{V}}\mathcal{L}({V_{k}})\| =0$]]></tex-math></alternatives></inline-formula>.</p></statement>
<fig id="j_nejsds72_fig_003">
<label>Algorithm 3:</label>
<caption>
<p>ALS [<xref ref-type="bibr" rid="j_nejsds72_ref_002">2</xref>].</p>
</caption>
<graphic xlink:href="nejsds72_g004.jpg"/>
</fig>
<p>Although Algorithm <xref rid="j_nejsds72_fig_002">2</xref> is guaranteed to converge, there are two drawbacks. First, the accumulation point <inline-formula id="j_nejsds72_ineq_098"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{\ast }}$]]></tex-math></alternatives></inline-formula> is only guaranteed to satisfy the KKT conditions, but not necessarily a critical point. Second, we do not know how fast <inline-formula id="j_nejsds72_ineq_099"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{k}}$]]></tex-math></alternatives></inline-formula> will converge to <inline-formula id="j_nejsds72_ineq_100"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{\ast }}$]]></tex-math></alternatives></inline-formula>. Next, we introduce an accelerated line search (ALS) algorithm as an alternative to SGPM that converges to a critical point with a known convergence rate. ALS is summarized by Algorithm <xref rid="j_nejsds72_fig_003">3</xref>, where <inline-formula id="j_nejsds72_ineq_101"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">t</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${t_{k}^{A}}$]]></tex-math></alternatives></inline-formula> is the step size, called the Armijo step size for given <inline-formula id="j_nejsds72_ineq_102"><alternatives><mml:math><mml:mover accent="false">
<mml:mrow>
<mml:mi mathvariant="italic">α</mml:mi>
</mml:mrow>
<mml:mo accent="true">‾</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\overline{\alpha }$]]></tex-math></alternatives></inline-formula>, <italic>β</italic>, <italic>σ</italic>, <inline-formula id="j_nejsds72_ineq_103"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">η</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\eta _{k}}$]]></tex-math></alternatives></inline-formula> and <italic>R</italic> is a retraction to <inline-formula id="j_nejsds72_ineq_104"><alternatives><mml:math>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\operatorname{St}(p,d)$]]></tex-math></alternatives></inline-formula>, see [<xref ref-type="bibr" rid="j_nejsds72_ref_002">2</xref>] for more details.</p>
<p>Algorithm <xref rid="j_nejsds72_fig_003">3</xref> can be shown to have linear convergence to critical points if the hyperparameters are chosen properly. For other choices of <inline-formula id="j_nejsds72_ineq_105"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">η</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\eta _{k}}$]]></tex-math></alternatives></inline-formula>, see Appendix <xref rid="j_nejsds72_app_005">E</xref> for more details. The following adaptation of Theorem 4.5.6 in [<xref ref-type="bibr" rid="j_nejsds72_ref_002">2</xref>] indicates linear convergence to stationary points.</p><statement id="j_nejsds72_stat_007"><label>Theorem 3.</label>
<p>Let <inline-formula id="j_nejsds72_ineq_106"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">}</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>∞</mml:mi>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${\{{V_{k}}\}_{k=1}^{\infty }}$]]></tex-math></alternatives></inline-formula> be an infinite sequence generated by Algorithm <xref rid="j_nejsds72_fig_003">3</xref> with <inline-formula id="j_nejsds72_ineq_107"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">η</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:mo movablelimits="false">grad</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${\eta _{k}}=-\operatorname{grad}\hspace{2.5pt}f({V_{k}})$]]></tex-math></alternatives></inline-formula> converging to an accumulation point <inline-formula id="j_nejsds72_ineq_108"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{\ast }}$]]></tex-math></alternatives></inline-formula> of <inline-formula id="j_nejsds72_ineq_109"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">}</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>∞</mml:mi>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${\{{V_{k}}\}_{k=1}^{\infty }}$]]></tex-math></alternatives></inline-formula>, then <inline-formula id="j_nejsds72_ineq_110"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{\ast }}$]]></tex-math></alternatives></inline-formula> is a critical point of <italic>f</italic>, and <inline-formula id="j_nejsds72_ineq_111"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mo movablelimits="false">lim</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo stretchy="false">→</mml:mo>
<mml:mi>∞</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">‖</mml:mo>
<mml:mo movablelimits="false">grad</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">‖</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[${\lim \nolimits_{k\to \infty }}\| \operatorname{grad}f({V_{k}})\| =0$]]></tex-math></alternatives></inline-formula>.</p>
<p>Furthermore, assuming <inline-formula id="j_nejsds72_ineq_112"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{\ast }}$]]></tex-math></alternatives></inline-formula> is a local minimizer of <italic>f</italic> with <inline-formula id="j_nejsds72_ineq_113"><alternatives><mml:math>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">λ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">l</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>:</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo movablelimits="false">min</mml:mo>
<mml:mo movablelimits="false">eig</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo movablelimits="false">Hess</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$0\lt {\lambda _{l}}:=\min \operatorname{eig}(\operatorname{Hess}(f)({V_{\ast }}))$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_114"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">λ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">u</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>:</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo movablelimits="false">max</mml:mo>
<mml:mo movablelimits="false">eig</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo movablelimits="false">Hess</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${\lambda _{u}}:=\max \operatorname{eig}(\operatorname{Hess}(f)({V_{\ast }}))$]]></tex-math></alternatives></inline-formula>, then, for any <inline-formula id="j_nejsds72_ineq_115"><alternatives><mml:math>
<mml:mi mathvariant="italic">r</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">r</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$r\in ({r_{\ast }},1)$]]></tex-math></alternatives></inline-formula> where 
<disp-formula id="j_nejsds72_eq_011">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">r</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mo>:</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mo movablelimits="false">min</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="2.03em" minsize="2.03em">(</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi mathvariant="italic">σ</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">α</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">¯</mml:mo></mml:mover>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">λ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">l</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>4</mml:mn>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">β</mml:mi><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">λ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">l</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">λ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">u</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo mathvariant="normal" fence="true" maxsize="2.03em" minsize="2.03em">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ {r_{\ast }}:=1-\min \bigg(2\sigma \bar{\alpha }{\lambda _{l}},4\sigma (1-\sigma )\beta \frac{{\lambda _{l}}}{{\lambda _{u}}}\bigg),\]]]></tex-math></alternatives>
</disp-formula> 
there exists an integer <inline-formula id="j_nejsds72_ineq_116"><alternatives><mml:math>
<mml:mi mathvariant="italic">K</mml:mi>
<mml:mo stretchy="false">≠</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$K\ne 0$]]></tex-math></alternatives></inline-formula> such that 
<disp-formula id="j_nejsds72_eq_012">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">≤</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:mi mathvariant="italic">r</mml:mi>
<mml:mo>+</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">r</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">c</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ f({V_{k+1}})-f({V_{\ast }})\le \big(r+(1-r)(1-c)\big)\big(f({V_{k}})-f({V_{\ast }})\big),\]]]></tex-math></alternatives>
</disp-formula> 
for all <inline-formula id="j_nejsds72_ineq_117"><alternatives><mml:math>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo stretchy="false">≥</mml:mo>
<mml:mi mathvariant="italic">K</mml:mi></mml:math><tex-math><![CDATA[$k\ge K$]]></tex-math></alternatives></inline-formula>, where <inline-formula id="j_nejsds72_ineq_118"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">α</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">¯</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\bar{\alpha }$]]></tex-math></alternatives></inline-formula>, <italic>β</italic>, <italic>σ</italic>, <italic>c</italic> are the hyperparameters in Algorithm <xref rid="j_nejsds72_fig_003">3</xref>.</p></statement>
<p>The difference between Algorithm <xref rid="j_nejsds72_fig_002">2</xref> and <xref rid="j_nejsds72_fig_003">3</xref> deserves further comment. While we empirically observe that Algorithm <xref rid="j_nejsds72_fig_002">2</xref> often converges faster, Algorithm <xref rid="j_nejsds72_fig_003">3</xref> has theoretical properties which allow for a proof of linear convergence in terms of an upper bound on the number of iterations. In practice, for smaller datasets we suggest running Algorithm <xref rid="j_nejsds72_fig_003">3</xref>, while for larger datasets we recommend using Algorithm <xref rid="j_nejsds72_fig_002">2</xref> for efficiency.</p>
<p>The computational complexity of CIR for both the SGPM-based optimization and the ALS-based optimization is compared to various competitors in the table below. Here, we assume that <inline-formula id="j_nejsds72_ineq_119"><alternatives><mml:math>
<mml:mn>1</mml:mn>
<mml:mo stretchy="false">≤</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mi mathvariant="italic">m</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi></mml:math><tex-math><![CDATA[$1\le d\lt p\lt m,n$]]></tex-math></alternatives></inline-formula>, where the background and foreground data have <italic>m</italic> and <italic>n</italic> samples, respectively. Specifically, we assume that <italic>ϵ</italic> denotes the stopping error such that <inline-formula id="j_nejsds72_ineq_120"><alternatives><mml:math>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">≤</mml:mo>
<mml:mi mathvariant="italic">ϵ</mml:mi></mml:math><tex-math><![CDATA[$f({V_{k}})-f({V_{\ast }})\le \epsilon $]]></tex-math></alternatives></inline-formula>. It is noteworthy that a <italic>p</italic>-dimensional singular value decomposition can be achieved within <inline-formula id="j_nejsds72_ineq_121"><alternatives><mml:math>
<mml:mi mathvariant="script">O</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>3</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{O}({p^{3}})$]]></tex-math></alternatives></inline-formula>. We present the comparison in the table below.</p>
<table-wrap id="j_nejsds72_tab_002">
<label>Table 2</label>
<caption>
<p>Computational time-complexity of CIR and competitors.</p>
</caption>
<table>
<thead>
<tr>
<td style="vertical-align: top; text-align: center; border-top: double; border-bottom: solid thin">Algorithm</td>
<td style="vertical-align: top; text-align: center; border-top: double; border-bottom: solid thin">Theoretical Algorithmic Complexity</td>
</tr>
</thead>
<tbody>
<tr>
<td style="vertical-align: top; text-align: center">CIR, SGPM-based</td>
<td style="vertical-align: top; text-align: center"><inline-formula id="j_nejsds72_ineq_122"><alternatives><mml:math>
<mml:mi mathvariant="script">O</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">m</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{O}((m+n){p^{2}})$]]></tex-math></alternatives></inline-formula></td>
</tr>
<tr>
<td style="vertical-align: top; text-align: center">CIR, ALS-based</td>
<td style="vertical-align: top; text-align: center"><inline-formula id="j_nejsds72_ineq_123"><alternatives><mml:math>
<mml:mi mathvariant="script">O</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo>−</mml:mo>
<mml:mo movablelimits="false">log</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">ϵ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>3</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>+</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">m</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{O}(-\log (\epsilon ){p^{3}}+(m+n){p^{2}})$]]></tex-math></alternatives></inline-formula></td>
</tr>
<tr>
<td style="vertical-align: top; text-align: center">LDA</td>
<td style="vertical-align: top; text-align: center"><inline-formula id="j_nejsds72_ineq_124"><alternatives><mml:math>
<mml:mi mathvariant="script">O</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{O}(n{p^{2}})$]]></tex-math></alternatives></inline-formula></td>
</tr>
<tr>
<td style="vertical-align: top; text-align: center">PCA</td>
<td style="vertical-align: top; text-align: center"><inline-formula id="j_nejsds72_ineq_125"><alternatives><mml:math>
<mml:mi mathvariant="script">O</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{O}(n{p^{2}})$]]></tex-math></alternatives></inline-formula></td>
</tr>
<tr>
<td style="vertical-align: top; text-align: center">CPCA</td>
<td style="vertical-align: top; text-align: center"><inline-formula id="j_nejsds72_ineq_126"><alternatives><mml:math>
<mml:mi mathvariant="script">O</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">m</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{O}((m+n){p^{2}})$]]></tex-math></alternatives></inline-formula></td>
</tr>
<tr>
<td style="vertical-align: top; text-align: center; border-bottom: solid thin">SIR</td>
<td style="vertical-align: top; text-align: center; border-bottom: solid thin"><inline-formula id="j_nejsds72_ineq_127"><alternatives><mml:math>
<mml:mi mathvariant="script">O</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{O}(n{p^{2}})$]]></tex-math></alternatives></inline-formula></td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="j_nejsds72_s_004">
<label>4</label>
<title>Application</title>
<p>When applying CIR, several hyperparameters must be tuned, such as the weight <italic>α</italic>, the reduced dimension <italic>d</italic>, and the slices <inline-formula id="j_nejsds72_ineq_128"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">I</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${I_{h}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_129"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">I</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">h</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${I_{\widetilde{h}}}$]]></tex-math></alternatives></inline-formula> for estimation of <inline-formula id="j_nejsds72_ineq_130"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{x}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_131"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{\widetilde{x}}}$]]></tex-math></alternatives></inline-formula>. In some cases, it may also be necessary to determine the definition of the foreground and background groups and to assign background labels <inline-formula id="j_nejsds72_ineq_132"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{Y}$]]></tex-math></alternatives></inline-formula>.</p>
<p>The value of <inline-formula id="j_nejsds72_ineq_133"><alternatives><mml:math>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo stretchy="false">≥</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\alpha \ge 0$]]></tex-math></alternatives></inline-formula> can be determined by cross-validation. In particular, we suggest the following 2-step method. First, identify the rough range of <italic>α</italic> at the logarithmic scale. Because CIR coincides with SIR when <inline-formula id="j_nejsds72_ineq_134"><alternatives><mml:math>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\alpha =0$]]></tex-math></alternatives></inline-formula>, running SIR initially can provide insights: better performance of SIR suggests a smaller <italic>α</italic>, and vice versa. Once a rough range is identified, standard cross-validation can be used within this range. Our numerical experiments show that CIR is robust to the choice of <italic>α</italic>; that is, the performance of the method changes continuously with <italic>α</italic>. Tables supporting this observation are provided in the supplement material.</p>
<p>Additionally, the choice of reduced dimension <italic>d</italic> may depend on the goal of the investigator. If visualization is considered important, <inline-formula id="j_nejsds72_ineq_135"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn></mml:math><tex-math><![CDATA[$d=2$]]></tex-math></alternatives></inline-formula> is appropriate. If the goal is prediction, the elbow point of the <italic>d</italic> versus prediction error plot may suggest an optimal <italic>d</italic>. However, as with other DR methods, determining the optimal value of <italic>d</italic> is still a topic of ongoing research [<xref ref-type="bibr" rid="j_nejsds72_ref_012">12</xref>, <xref ref-type="bibr" rid="j_nejsds72_ref_013">13</xref>].</p>
<p>The definition of foreground data <italic>X</italic> and <italic>Y</italic> should be the data and the target variable of interest, while the choices of background data <inline-formula id="j_nejsds72_ineq_136"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{X}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_137"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{Y}$]]></tex-math></alternatives></inline-formula> may not be as straightforward. These data are intended to represent “noise” that is “subtracted” from the foreground data. For example, in the biomedical context, if the population of interest is a group of sick patients, the background dataset may include observations of healthy individuals. In other contexts, however, it may be appropriate to use <inline-formula id="j_nejsds72_ineq_138"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi></mml:math><tex-math><![CDATA[$\widetilde{X}=X$]]></tex-math></alternatives></inline-formula>. In this case, the choice of background label <inline-formula id="j_nejsds72_ineq_139"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{Y}$]]></tex-math></alternatives></inline-formula> may be unclear. If another outcome variable was collected, it could be used as <inline-formula id="j_nejsds72_ineq_140"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{Y}$]]></tex-math></alternatives></inline-formula>; otherwise, randomly selected values in the support of <italic>Y</italic> could be used to represent “noise”.</p>
<p>The estimates for <inline-formula id="j_nejsds72_ineq_141"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{x}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_142"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{\widetilde{x}}}$]]></tex-math></alternatives></inline-formula> are partly determined by whether <italic>y</italic> and <inline-formula id="j_nejsds72_ineq_143"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{y}$]]></tex-math></alternatives></inline-formula> are categorical or continuous. If these variables are categorical, then each value of <italic>y</italic> (or <inline-formula id="j_nejsds72_ineq_144"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{y}$]]></tex-math></alternatives></inline-formula>) can be considered as a separate slice, resulting in <inline-formula id="j_nejsds72_ineq_145"><alternatives><mml:math>
<mml:mo stretchy="false">|</mml:mo>
<mml:mtext>supp</mml:mtext>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">Y</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">|</mml:mo></mml:math><tex-math><![CDATA[$|\text{supp}(Y)|$]]></tex-math></alternatives></inline-formula> (or <inline-formula id="j_nejsds72_ineq_146"><alternatives><mml:math>
<mml:mo stretchy="false">|</mml:mo>
<mml:mtext>supp</mml:mtext>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">|</mml:mo></mml:math><tex-math><![CDATA[$|\text{supp}(\widetilde{Y})|$]]></tex-math></alternatives></inline-formula>) total slices. On the other hand, if these variables are continuous, slices can be taken to represent an equally spaced partition of the range of <italic>Y</italic> (or <inline-formula id="j_nejsds72_ineq_147"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{Y}$]]></tex-math></alternatives></inline-formula>), with the number of slices being tunable hyperparameters.</p>
<sec id="j_nejsds72_s_005">
<label>4.1</label>
<title>Mouse Protein Expression</title>
<p>The first dataset we consider was collected for the purpose of identifying proteins critical to learning in a mouse model of Down syndrome [<xref ref-type="bibr" rid="j_nejsds72_ref_027">27</xref>]. The data contain 1095 observations of expression levels of 77 different proteins, along with genotype (t=Ts65Dn, c=control), behavior (CS=context-shock, SC=shock-context), and treatment (m=memantine, s=saline). The behavior of CS corresponds to the scenario in which the mouse was first placed in a new cage and permitted to explore for a few minutes before being exposed to a brief electric shock; conversely, SC corresponds to mice immediately given an electric shock upon being placed in a new cage, and then being permitted to explore. Of the data, 543 samples contain at least one missing value. Taking into account the relatively large sample size, we consider only the 552 observations with complete data. We do not perform any normalization or any other type of preprocessing to the raw data prior to analysis.</p>
<p>In this example, <inline-formula id="j_nejsds72_ineq_148"><alternatives><mml:math>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>552</mml:mn>
<mml:mo>×</mml:mo>
<mml:mn>77</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$X\in {\mathbb{R}^{552\times 77}}$]]></tex-math></alternatives></inline-formula> represents the expression of 77 proteins of all mice without a missing value, while <inline-formula id="j_nejsds72_ineq_149"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>7</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[${y_{i}}\in \{0,1,\dots ,7\}$]]></tex-math></alternatives></inline-formula> represents the combination of 3 binary variables: genotype, treatment, and behavior. For example, <inline-formula id="j_nejsds72_ineq_150"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:math><tex-math><![CDATA[${y_{i}}=1$]]></tex-math></alternatives></inline-formula> means that the <italic>i</italic>-th mouse received memantine, was exposed to context-shock, and has genotype Ts65Dn. To visualize the data, we apply unsupervised DR algorithms PCA, tSNE and UMAP to <italic>X</italic> and supervised DR methods LDA, LASSO and SIR to <inline-formula id="j_nejsds72_ineq_151"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">Y</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(X,Y)$]]></tex-math></alternatives></inline-formula>, with <inline-formula id="j_nejsds72_ineq_152"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn></mml:math><tex-math><![CDATA[$d=2$]]></tex-math></alternatives></inline-formula> for all algorithms. The 2-dimensional representation is given in Figure <xref rid="j_nejsds72_fig_004">1</xref>, where each color represents a class of mice among 8 total classes.</p>
<fig id="j_nejsds72_fig_004">
<label>Figure 1</label>
<caption>
<p>2-d representation of mouse protein data. Silhouette scores: (PCA, <inline-formula id="j_nejsds72_ineq_153"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.20</mml:mn></mml:math><tex-math><![CDATA[$-.20$]]></tex-math></alternatives></inline-formula>), (CPCA, <inline-formula id="j_nejsds72_ineq_154"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.13</mml:mn></mml:math><tex-math><![CDATA[$-.13$]]></tex-math></alternatives></inline-formula>), (LDA, .42), (LASSO, <inline-formula id="j_nejsds72_ineq_155"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.17</mml:mn></mml:math><tex-math><![CDATA[$-.17$]]></tex-math></alternatives></inline-formula>), (SIR, .03), (CIR, .29), (tSNE, <inline-formula id="j_nejsds72_ineq_156"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.14</mml:mn></mml:math><tex-math><![CDATA[$-.14$]]></tex-math></alternatives></inline-formula>), (UMAP, <inline-formula id="j_nejsds72_ineq_157"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.00</mml:mn></mml:math><tex-math><![CDATA[$-.00$]]></tex-math></alternatives></inline-formula>).</p>
</caption>
<graphic xlink:href="nejsds72_g005.jpg"/>
</fig>
<p>PCA, LASSO, SIR, tSNE, and UMAP fail to distinguish between classes, whereas LDA successfully separates 5 classes but with 3 classes (c-CS-m, c-CS-s, t-CS-m) mixed together. Now we take advantage of the background data. We let <inline-formula id="j_nejsds72_ineq_158"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{X}$]]></tex-math></alternatives></inline-formula> be the protein expression from mice with genotype = control, which coincides with the background group used in previous studies of this application [<xref ref-type="bibr" rid="j_nejsds72_ref_001">1</xref>, <xref ref-type="bibr" rid="j_nejsds72_ref_038">38</xref>]. We set <inline-formula id="j_nejsds72_ineq_159"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{Y}$]]></tex-math></alternatives></inline-formula> as the binary variable representing behavior and apply CPCA to <inline-formula id="j_nejsds72_ineq_160"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(X,\widetilde{X})$]]></tex-math></alternatives></inline-formula> and CIR to <inline-formula id="j_nejsds72_ineq_161"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">Y</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(X,Y,\widetilde{X},\widetilde{Y})$]]></tex-math></alternatives></inline-formula> with <inline-formula id="j_nejsds72_ineq_162"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn></mml:math><tex-math><![CDATA[$d=2$]]></tex-math></alternatives></inline-formula> as well. The 2-dimensional representations with their Silhouette scores [<xref ref-type="bibr" rid="j_nejsds72_ref_053">53</xref>] are shown in Figure <xref rid="j_nejsds72_fig_004">1</xref>, which indicates that CIR outperforms all other competitors except LDA in terms of the Silhouette score. In particular, the three classes that were not separated in LDA are less mixed in CIR, supported by the Silhouette scores for these three classes: <inline-formula id="j_nejsds72_ineq_163"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>0.10</mml:mn></mml:math><tex-math><![CDATA[$-0.10$]]></tex-math></alternatives></inline-formula> for LDA and 0.23 for CIR. We provide other objective scores [<xref ref-type="bibr" rid="j_nejsds72_ref_011">11</xref>, <xref ref-type="bibr" rid="j_nejsds72_ref_022">22</xref>] in the supplementary material.</p>
<p>Next, we show the classification accuracy based on <inline-formula id="j_nejsds72_ineq_164"><alternatives><mml:math>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi></mml:math><tex-math><![CDATA[$XV$]]></tex-math></alternatives></inline-formula>, the dimension-reduced data. Here, we vary <italic>d</italic> from 2 to 7 because for higher <italic>d</italic>, the accuracy is close to 1. The mean prediction accuracy of KNN, the best classifier for this example, over 10 replicates versus the reduced dimension <italic>d</italic> is shown in Figure <xref rid="j_nejsds72_fig_005">2</xref>, clearly indicating that CIR outperforms all competitors especially when <italic>d</italic> is small. We present the accuracy of other classifiers and their standard deviations in the supplement material.</p>
<fig id="j_nejsds72_fig_005">
<label>Figure 2</label>
<caption>
<p>Classification accuracy by KNN for mouse protein data.</p>
</caption>
<graphic xlink:href="nejsds72_g006.jpg"/>
</fig>
</sec>
<sec id="j_nejsds72_s_006">
<label>4.2</label>
<title>Single Cell RNA Sequencing</title>
<p>The second dataset we consider is from a study of single-cell RNA sequencing used to classify cells into cell types based on their transcriptional profile [<xref ref-type="bibr" rid="j_nejsds72_ref_003">3</xref>]. The data include 3500 observations of expression levels of 32838 different genes, along with cell labels as one of the 9 different cell types, namely CD8 T cell, CD4 T cell, classical monocyte, B cell, NK cell, plasmacytoid dendritic cell, non-classical monocyte, classic dendritic cell, and plasma cell. We select the top 100 most variable genes for our analysis to be consistent with previous analyses of these data [<xref ref-type="bibr" rid="j_nejsds72_ref_071">71</xref>, <xref ref-type="bibr" rid="j_nejsds72_ref_001">1</xref>]. In this example, <inline-formula id="j_nejsds72_ineq_165"><alternatives><mml:math>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>3500</mml:mn>
<mml:mo>×</mml:mo>
<mml:mn>100</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$X\in {\mathbb{R}^{3500\times 100}}$]]></tex-math></alternatives></inline-formula> represents the expression of 100 genes, while <inline-formula id="j_nejsds72_ineq_166"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>8</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[${y_{i}}\in \{0,1,\dots ,8\}$]]></tex-math></alternatives></inline-formula> represents the cell type. For example, <inline-formula id="j_nejsds72_ineq_167"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:math><tex-math><![CDATA[${y_{i}}=1$]]></tex-math></alternatives></inline-formula> means that the <italic>i</italic>-th cell is a CD4 T cell.</p>
<p>To visualize the data, we apply unsupervised DR algorithms PCA, tSNE, and UMAP to <italic>X</italic> and supervised DR methods LDA, LASSO, and SIR to <inline-formula id="j_nejsds72_ineq_168"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">Y</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(X,Y)$]]></tex-math></alternatives></inline-formula>, for <inline-formula id="j_nejsds72_ineq_169"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn></mml:math><tex-math><![CDATA[$d=2$]]></tex-math></alternatives></inline-formula> for all algorithms. In this case, there is no obvious choice of background data. So, we use <inline-formula id="j_nejsds72_ineq_170"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi></mml:math><tex-math><![CDATA[$\widetilde{X}=X$]]></tex-math></alternatives></inline-formula> and randomly draw independent and identically distributed samples <inline-formula id="j_nejsds72_ineq_171"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo stretchy="false">∼</mml:mo>
<mml:mtext>uniform</mml:mtext>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>8</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$\widetilde{Y}\sim \text{uniform}\{0,\dots ,8\}$]]></tex-math></alternatives></inline-formula> in order to apply CPCA and CIR. The 2-dimensional representations with their Silhouette scores are given in Figure <xref rid="j_nejsds72_fig_006">3</xref>, which indicates that CIR has the best performance.</p>
<p>For each <inline-formula id="j_nejsds72_ineq_172"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>10</mml:mn></mml:math><tex-math><![CDATA[$d=2,\dots ,10$]]></tex-math></alternatives></inline-formula>, we compare the accuracy of a KNN classifier based on dimension-reduced data among various methods, with the raw data as the baseline. We repeat this process 10 times to reduce the impact of random split in cross-validation. The prediction accuracy versus reduced dimension <italic>d</italic> is shown in Figure <xref rid="j_nejsds72_fig_007">4</xref>, where CIR has the best overall performance especially when <inline-formula id="j_nejsds72_ineq_173"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>3</mml:mn></mml:math><tex-math><![CDATA[$d=2,3$]]></tex-math></alternatives></inline-formula>. We show the accuracy of other classifiers and their standard deviations in the supplement material.</p>
<fig id="j_nejsds72_fig_006">
<label>Figure 3</label>
<caption>
<p>2-d representation of single-cell RNA sequencing data. Silhouette scores: (PCA, <inline-formula id="j_nejsds72_ineq_174"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.25</mml:mn></mml:math><tex-math><![CDATA[$-.25$]]></tex-math></alternatives></inline-formula>), (CPCA, <inline-formula id="j_nejsds72_ineq_175"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.25</mml:mn></mml:math><tex-math><![CDATA[$-.25$]]></tex-math></alternatives></inline-formula>), (LDA, .11), (LASSO, <inline-formula id="j_nejsds72_ineq_176"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.30</mml:mn></mml:math><tex-math><![CDATA[$-.30$]]></tex-math></alternatives></inline-formula>), (SIR, <inline-formula id="j_nejsds72_ineq_177"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.10</mml:mn></mml:math><tex-math><![CDATA[$-.10$]]></tex-math></alternatives></inline-formula>), (CIR, .15), (tSNE, <inline-formula id="j_nejsds72_ineq_178"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.17</mml:mn></mml:math><tex-math><![CDATA[$-.17$]]></tex-math></alternatives></inline-formula>), (UMAP, <inline-formula id="j_nejsds72_ineq_179"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.17</mml:mn></mml:math><tex-math><![CDATA[$-.17$]]></tex-math></alternatives></inline-formula>).</p>
</caption>
<graphic xlink:href="nejsds72_g007.jpg"/>
</fig>
<p>The improved performance of CIR over SIR deserves further comment. While the background data and labels <inline-formula id="j_nejsds72_ineq_180"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\widetilde{X},\widetilde{Y})$]]></tex-math></alternatives></inline-formula> used in CIR do not add new information beyond what SIR used, because the background label is chosen randomly, we attribute the improved performance to CIR “denoising” the foreground data.</p>
<fig id="j_nejsds72_fig_007">
<label>Figure 4</label>
<caption>
<p>Classification accuracy by KNN for single-cell RNA sequencing data.</p>
</caption>
<graphic xlink:href="nejsds72_g008.jpg"/>
</fig>
</sec>
<sec id="j_nejsds72_s_007">
<label>4.3</label>
<title>COVID-19 Cell States</title>
<p>The third dataset we consider is also a single-cell RNA sequencing dataset, with samples from 90 patients with COVID-19 and 23 healthy volunteers [<xref ref-type="bibr" rid="j_nejsds72_ref_056">56</xref>]. In total, the dataset contains 48083 cells from diseased patients and 14426 cells from healthy volunteers. We treat the cells from the patients with disease as foreground and the cells from the healthy volunteers as background. On each cell, RNA expression levels on 24727 different genes were measured. For the features, we selected the 500 genes with the largest variances in RNA expression, in accordance with prior analysis with this dataset [<xref ref-type="bibr" rid="j_nejsds72_ref_021">21</xref>].</p>
<p>For each cell in the dataset, its cell type is identified, which we use as the labels. As recommended in a previous analysis [<xref ref-type="bibr" rid="j_nejsds72_ref_021">21</xref>], we consider only the cells for which at least 250 observations were available. This filtering resulted in 14 distinct cell types, with 40411 observations in the foreground and 13041 in the background. As in the previous example, we have <inline-formula id="j_nejsds72_ineq_181"><alternatives><mml:math>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>40411</mml:mn>
<mml:mo>×</mml:mo>
<mml:mn>500</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$X\in {\mathbb{R}^{40411\times 500}}$]]></tex-math></alternatives></inline-formula> to represent the expressions of 500 genes in the cells of patients with COVID-19 and <inline-formula id="j_nejsds72_ineq_182"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>13041</mml:mn>
<mml:mo>×</mml:mo>
<mml:mn>500</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$\widetilde{X}\in {\mathbb{R}^{13041\times 500}}$]]></tex-math></alternatives></inline-formula> to represent the gene expressions in the healthy volunteers, while <inline-formula id="j_nejsds72_ineq_183"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>13</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[${y_{i}},{\widetilde{y}_{j}}\in \{0,1,\dots ,13\}$]]></tex-math></alternatives></inline-formula> represents the cell type.</p>
<p>As with the previous two examples, we first apply DR methods to visualize the data. With <inline-formula id="j_nejsds72_ineq_184"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn></mml:math><tex-math><![CDATA[$d=2$]]></tex-math></alternatives></inline-formula>, we apply PCA, tSNE, and UMAP to <italic>X</italic>, and we apply LDA, LASSO, and SIR to <inline-formula id="j_nejsds72_ineq_185"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">Y</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(X,Y)$]]></tex-math></alternatives></inline-formula>. We also apply CPCA to <inline-formula id="j_nejsds72_ineq_186"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(X,\widetilde{X})$]]></tex-math></alternatives></inline-formula> and CIR to <inline-formula id="j_nejsds72_ineq_187"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">Y</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(X,Y,\widetilde{X},\widetilde{Y})$]]></tex-math></alternatives></inline-formula>. The 2-dimensional representations with their Silhouette scores are provided in Figure <xref rid="j_nejsds72_fig_008">5</xref>. Although CIR is not the best overall in terms of the Silhouette score, it outperforms its direct competitors, CPCA and SIR.</p>
<fig id="j_nejsds72_fig_008">
<label>Figure 5</label>
<caption>
<p>2-d representation of COVID-19 data. Silhouette scores: (PCA, <inline-formula id="j_nejsds72_ineq_188"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.29</mml:mn></mml:math><tex-math><![CDATA[$-.29$]]></tex-math></alternatives></inline-formula>), (CPCA, <inline-formula id="j_nejsds72_ineq_189"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.29</mml:mn></mml:math><tex-math><![CDATA[$-.29$]]></tex-math></alternatives></inline-formula>), (LDA, .02), (LASSO, <inline-formula id="j_nejsds72_ineq_190"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.48</mml:mn></mml:math><tex-math><![CDATA[$-.48$]]></tex-math></alternatives></inline-formula>), (SIR, <inline-formula id="j_nejsds72_ineq_191"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.27</mml:mn></mml:math><tex-math><![CDATA[$-.27$]]></tex-math></alternatives></inline-formula>), (CIR, <inline-formula id="j_nejsds72_ineq_192"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.05</mml:mn></mml:math><tex-math><![CDATA[$-.05$]]></tex-math></alternatives></inline-formula>), (tSNE, <inline-formula id="j_nejsds72_ineq_193"><alternatives><mml:math>
<mml:mo>−</mml:mo>
<mml:mn>.03</mml:mn></mml:math><tex-math><![CDATA[$-.03$]]></tex-math></alternatives></inline-formula>), (UMAP, .11).</p>
</caption>
<graphic xlink:href="nejsds72_g009.jpg"/>
</fig>
<p>For <inline-formula id="j_nejsds72_ineq_194"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>7</mml:mn></mml:math><tex-math><![CDATA[$d=2,\dots ,7$]]></tex-math></alternatives></inline-formula>, we compare the accuracy of a KNN classifier based on the dimension-reduced data among various DR methods, with the raw data as baseline. As with the previous examples, we repeat this process 10 times for each method to reduce the effect of the cross-validation random split on the results. The prediction accuracy for each reduced dimension <italic>d</italic> is shown in Figure <xref rid="j_nejsds72_fig_009">6</xref>, where we see that CIR is an improvement over all other methods for <inline-formula id="j_nejsds72_ineq_195"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>3</mml:mn></mml:math><tex-math><![CDATA[$d=2,3$]]></tex-math></alternatives></inline-formula>. We show the accuracy of both the KNN-based classifier and the tree-based classifier in the supplementary material.</p>
<fig id="j_nejsds72_fig_009">
<label>Figure 6</label>
<caption>
<p>Classification accuracy by KNN for COVID-19 data.</p>
</caption>
<graphic xlink:href="nejsds72_g010.jpg"/>
</fig>
</sec>
<sec id="j_nejsds72_s_008">
<label>4.4</label>
<title>Plasma Retinol</title>
<p>The final dataset we consider is the plasma retinol dataset [<xref ref-type="bibr" rid="j_nejsds72_ref_049">49</xref>]. The dataset contains 315 observations of 14 variables, including age, sex, smoking status, BMI, vitamin use, calories, fat, fiber, cholesterol, dietary beta-carotene, dietary retinol consumed per day, number of alcoholic drinks consumed per week, and levels of plasma beta-carotene and plasma retinol.</p>
<p>In this example, <inline-formula id="j_nejsds72_ineq_196"><alternatives><mml:math>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>315</mml:mn>
<mml:mo>×</mml:mo>
<mml:mn>12</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$X\in {\mathbb{R}^{315\times 12}}$]]></tex-math></alternatives></inline-formula> represents measurements of the first 12 variables listed for all subjects, while <inline-formula id="j_nejsds72_ineq_197"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${y_{i}}$]]></tex-math></alternatives></inline-formula> represents the measurement of plasma beta-carotene, a variable of particular interest to scientists [<xref ref-type="bibr" rid="j_nejsds72_ref_049">49</xref>]. In contrast to the previous two examples, note that here <inline-formula id="j_nejsds72_ineq_198"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${y_{i}}$]]></tex-math></alternatives></inline-formula> is continuous, not categorical.</p>
<p>We apply unsupervised DR algorithms PCA, tSNE, and UMAP to <italic>X</italic> and supervised DR algorithms LDA, LASSO, and SIR to <inline-formula id="j_nejsds72_ineq_199"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">Y</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(X,Y)$]]></tex-math></alternatives></inline-formula> for <inline-formula id="j_nejsds72_ineq_200"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>8</mml:mn></mml:math><tex-math><![CDATA[$d=1,\dots ,8$]]></tex-math></alternatives></inline-formula>. Similarly to the single-cell RNA sequencing application, we let <inline-formula id="j_nejsds72_ineq_201"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi></mml:math><tex-math><![CDATA[$\widetilde{X}=X$]]></tex-math></alternatives></inline-formula> because there is no natural choice of background data. For the background label, we set <inline-formula id="j_nejsds72_ineq_202"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{Y}$]]></tex-math></alternatives></inline-formula> as the continuous variable representing the level of plasma retinol, which shares certain information with <inline-formula id="j_nejsds72_ineq_203"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${y_{i}}$]]></tex-math></alternatives></inline-formula>, and apply CPCA to <inline-formula id="j_nejsds72_ineq_204"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(X,\widetilde{X})$]]></tex-math></alternatives></inline-formula> and CIR to <inline-formula id="j_nejsds72_ineq_205"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">Y</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(X,Y,\widetilde{X},\widetilde{Y})$]]></tex-math></alternatives></inline-formula> for <inline-formula id="j_nejsds72_ineq_206"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>8</mml:mn></mml:math><tex-math><![CDATA[$d=1,\dots ,8$]]></tex-math></alternatives></inline-formula>. We skip the visualization step in this case due to the poor visibility in terms of <inline-formula id="j_nejsds72_ineq_207"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${y_{i}}$]]></tex-math></alternatives></inline-formula>.</p>
<p>After trying a few regression methods, namely linear regression [<xref ref-type="bibr" rid="j_nejsds72_ref_024">24</xref>], regression trees [<xref ref-type="bibr" rid="j_nejsds72_ref_009">9</xref>], Gaussian process regression [<xref ref-type="bibr" rid="j_nejsds72_ref_014">14</xref>], and neural networks [<xref ref-type="bibr" rid="j_nejsds72_ref_030">30</xref>], we present the prediction mean squared error (MSE) for the best method for this dataset, linear regression. That is, for each <italic>d</italic> and the output <italic>V</italic> from each DR algorithm, we fit a linear regression model to <inline-formula id="j_nejsds72_ineq_208"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">Y</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(XV,Y)$]]></tex-math></alternatives></inline-formula>. We also compare to a linear regression model based on raw data <inline-formula id="j_nejsds72_ineq_209"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">Y</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(X,Y)$]]></tex-math></alternatives></inline-formula> as the baseline. Figure <xref rid="j_nejsds72_fig_010">7</xref> demonstrates that CIR outperforms all other competitors, but matches SIR when <inline-formula id="j_nejsds72_ineq_210"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo stretchy="false">≥</mml:mo>
<mml:mn>3</mml:mn></mml:math><tex-math><![CDATA[$d\ge 3$]]></tex-math></alternatives></inline-formula>.</p>
<p>Note that because <italic>Y</italic> and <inline-formula id="j_nejsds72_ineq_211"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{Y}$]]></tex-math></alternatives></inline-formula> are continuous, the number of slices to estimate <inline-formula id="j_nejsds72_ineq_212"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{x}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_213"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\Sigma _{\widetilde{x}}}$]]></tex-math></alternatives></inline-formula> needs to be carefully chosen and adjusted to ensure optimal performance. We use cross-validation to select 4 equally spaced partitions for the support of <italic>Y</italic> and <inline-formula id="j_nejsds72_ineq_214"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{Y}$]]></tex-math></alternatives></inline-formula>. In the three applications presented above, CIR demonstrates superior overall performance over its supervised, unsupervised, contrastive, and non-contrastive competitors, especially in low dimension, i.e., <inline-formula id="j_nejsds72_ineq_215"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>3</mml:mn></mml:math><tex-math><![CDATA[$d=2,3$]]></tex-math></alternatives></inline-formula>, which are the most crucial dimensions for visualization purposes.</p>
<fig id="j_nejsds72_fig_010">
<label>Figure 7</label>
<caption>
<p>MSE of linear regression for plasma retinol data.</p>
</caption>
<graphic xlink:href="nejsds72_g011.jpg"/>
</fig>
<p>In all four examples we considered, we consistently observed that CIR is the best among all methods when <inline-formula id="j_nejsds72_ineq_216"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>3</mml:mn></mml:math><tex-math><![CDATA[$d=2,3$]]></tex-math></alternatives></inline-formula>. However, the gain is not obvious for higher dimensions. A possible reason for this is that when <italic>d</italic> is relatively large, methods that use only the foreground data, such as SIR or LDA, capture both shared information and unique information in the foreground. Consequently, the improvement from incorporating the background group, or any contrastive model, becomes incremental. Fortunately, <inline-formula id="j_nejsds72_ineq_217"><alternatives><mml:math>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo stretchy="false">≤</mml:mo>
<mml:mn>3</mml:mn></mml:math><tex-math><![CDATA[$d\le 3$]]></tex-math></alternatives></inline-formula> are often the most important dimensions because they allow for visualization and interpretation.</p>
</sec>
</sec>
<sec id="j_nejsds72_s_009">
<label>5</label>
<title>Discussion and Future Work</title>
<p>In this work, we propose the CIR model and the associated optimization algorithm for supervised dimension reduction for datasets that are split into foreground and background groups. We provide a theoretical guarantee of the convergence of the CIR algorithm under mild conditions. We have shown that our CIR model outperforms competitors in multiple biomedical datasets, including mouse protein expression data, single-cell RNA sequencing data, and plasma retinol data. However, there are several important future directions that remain unaddressed in this paper, as outlined below.</p>
<p><italic>Multi-Treatment</italic>  It is natural to consider how our model can be extended to studies with multiple treatments. For example, in medical treatment, there might be more than one treatment for patients with certain disease. In [<xref ref-type="bibr" rid="j_nejsds72_ref_061">61</xref>], it has been shown that the number of treatment groups puts a hard constraint on the target dimension. It is interesting to generalize from a single-treatment structure to a multi-treatment structure (e.g., [<xref ref-type="bibr" rid="j_nejsds72_ref_047">47</xref>]), where the loss function needs more sophisticated design.</p>
<p>Another direction in multi-group scenario is to combine multiple CIR models trained on different pairs of bi-groups. As pointed out by [<xref ref-type="bibr" rid="j_nejsds72_ref_065">65</xref>], the generalization error in the contrastive regression model stacking needs to be controlled, and one possible way is to follow divergence mixing as proposed by [<xref ref-type="bibr" rid="j_nejsds72_ref_029">29</xref>], with a careful normalization. The major difficulty in training such stacking model is how to devise a sequential optimization for model training.</p>
<p><italic>Consistency and Sufficient Dimension Reduction</italic>  The consistency of the proposed CIR model remains open. Theorems <xref rid="j_nejsds72_stat_006">2</xref> and <xref rid="j_nejsds72_stat_007">3</xref> ensure that the resulting solution must be a stationary point, but we did not discuss whether these stationary points are consistent estimates. The consistency of the estimates is also affected by the choice of <italic>α</italic>, because <inline-formula id="j_nejsds72_ineq_218"><alternatives><mml:math>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\alpha =0$]]></tex-math></alternatives></inline-formula> will render this CIR into a classical SIR for the foreground group. This consistency problem also has practical importance, as it explicitly expresses the trade-off between the expressive contrastiveness and the emphasis on the effective lower-dimensional structures. The group information and statistical sufficiency compete against each other, as we observed in the experiments, thus a range of <italic>α</italic> that balances between these two factors are of interest and might be answered by the consistency result.</p>
<p>Furthermore, SIR has the drawback of missing the totality central subspace when the symmetry assumption in <italic>x</italic> is lost [<xref ref-type="bibr" rid="j_nejsds72_ref_037">37</xref>]. [<xref ref-type="bibr" rid="j_nejsds72_ref_018">18</xref>] proposed the sliced average variance estimator (SAVE) estimator for addressing this problem, which raises the natural question of how to generalize this high-moment SDR method to the contrastive setting.</p>
<p><italic>Loss Function</italic>  Our loss function (<xref rid="j_nejsds72_eq_004">2.3</xref>) is nonstandard, which raises many questions. For example, the relation between number of local minima and <italic>A</italic>, <italic>B</italic>, <inline-formula id="j_nejsds72_ineq_219"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{A}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_220"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{B}$]]></tex-math></alternatives></inline-formula>, <italic>α</italic> remains open. Moreover, although <italic>f</italic> cannot be continuously extended to the full Euclidean space <inline-formula id="j_nejsds72_ineq_221"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="double-struck">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo>×</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${\mathbb{R}^{p\times d}}$]]></tex-math></alternatives></inline-formula>, if we restrict the domain to be a submanifold of <inline-formula id="j_nejsds72_ineq_222"><alternatives><mml:math>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\operatorname{St}(p,d)$]]></tex-math></alternatives></inline-formula>, <italic>f</italic> might be extended to the convex hull of the submanifold. This extension will enable us to apply some other efficient optimization algorithms with strong theoretical guarantee [<xref ref-type="bibr" rid="j_nejsds72_ref_007">7</xref>]. Furthermore, Appendix <xref rid="j_nejsds72_app_007">G</xref> raises the question of the validity of a fixed-point algorithm based on Ricatti equations that may lead to a more efficient algorithm to minimize <italic>f</italic> without involving the gradient.</p>
<p><italic>Scalability</italic>  The method we presented in this paper does not handle high-dimensional data in the sense of <inline-formula id="j_nejsds72_ineq_223"><alternatives><mml:math>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">m</mml:mi></mml:math><tex-math><![CDATA[$p\gt n,m$]]></tex-math></alternatives></inline-formula>, because matrices <italic>B</italic> and <inline-formula id="j_nejsds72_ineq_224"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{B}$]]></tex-math></alternatives></inline-formula> are singular in this situation. A possible extension of CIR to <inline-formula id="j_nejsds72_ineq_225"><alternatives><mml:math>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi></mml:math><tex-math><![CDATA[$p\gt n$]]></tex-math></alternatives></inline-formula> is to use the same technique as sparse PCA [<xref ref-type="bibr" rid="j_nejsds72_ref_072">72</xref>], which introduces a penalty term to enforce sparsity.</p>
</sec>
</body>
<back>
<app-group>
<app id="j_nejsds72_app_001"><label>Appendix A</label>
<title>Proof to Proposition <xref rid="j_nejsds72_stat_002">1</xref></title>
<p>We need to simplify the loss function <inline-formula id="j_nejsds72_ineq_226"><alternatives><mml:math>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$f(V)$]]></tex-math></alternatives></inline-formula> for subsequent analyses. Recall that the projection to the subspace <inline-formula id="j_nejsds72_ineq_227"><alternatives><mml:math>
<mml:mi mathvariant="script">C</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{C}({\Sigma _{xx}}V)$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_228"><alternatives><mml:math>
<mml:mi mathvariant="script">C</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{C}({\Sigma _{\widetilde{x}\widetilde{x}}}V)$]]></tex-math></alternatives></inline-formula> is given by the following projection matrices: 
<disp-formula id="j_nejsds72_eq_013">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd">
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mtd>
<mml:mtd class="align-even">
<mml:mo>:</mml:mo>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd">
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mtd>
<mml:mtd class="align-even">
<mml:mo>:</mml:mo>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}{P_{{\Sigma _{xx}}V}}& :={\Sigma _{xx}}V{\big[{V^{\top }}{\Sigma _{xx}^{2}}V\big]^{-1}}{V^{\top }}{\Sigma _{xx}}\\ {} {P_{{\Sigma _{\widetilde{x}\widetilde{x}}}V}}& :={\Sigma _{\widetilde{x}\widetilde{x}}}V{\big[{V^{\top }}{\Sigma _{\widetilde{x}\widetilde{x}}^{2}}V\big]^{-1}}{V^{\top }}{\Sigma _{\widetilde{x}\widetilde{x}}}.\end{aligned}\]]]></tex-math></alternatives>
</disp-formula> 
Because projection matrices are idempotent, that is, <inline-formula id="j_nejsds72_ineq_229"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${P_{{\Sigma _{xx}}V}^{2}}={P_{{\Sigma _{xx}}V}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_230"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${P_{{\Sigma _{\widetilde{x}\widetilde{x}}}V}^{2}}={P_{{\Sigma _{\widetilde{x}\widetilde{x}}}V}}$]]></tex-math></alternatives></inline-formula>, we can rewrite the loss function as follows: 
<disp-formula id="j_nejsds72_eq_014">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd">
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
<mml:mtd class="align-even">
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:mi mathvariant="double-struck">E</mml:mi>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mspace width="2.5pt"/>
<mml:mo stretchy="false">|</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mi mathvariant="italic">y</mml:mi>
<mml:mo fence="true" stretchy="false">]</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="script">C</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="2.5pt"/>
<mml:mspace width="2.5pt"/>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:mi mathvariant="double-struck">E</mml:mi>
<mml:mo fence="true" stretchy="false">[</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mspace width="2.5pt"/>
<mml:mo stretchy="false">|</mml:mo>
<mml:mspace width="2.5pt"/><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo fence="true" stretchy="false">]</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="script">C</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:mo stretchy="false">‖</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">‖</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:mo stretchy="false">‖</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">‖</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>−</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo>−</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>−</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo>−</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}f(V)& ={\mathbb{E}_{y}}\big[{d^{2}}\big(\mathbb{E}[x\hspace{2.5pt}|\hspace{2.5pt}y],\mathcal{C}({\Sigma _{xx}}V)\big)\big]\\ {} & \hspace{2.5pt}\hspace{2.5pt}-\alpha {\mathbb{E}_{\widetilde{y}}}\big[{d^{2}}\big(\mathbb{E}[\widetilde{x}\hspace{2.5pt}|\hspace{2.5pt}\widetilde{y}],\mathcal{C}({\Sigma _{\widetilde{x}\widetilde{x}}}V)\big)\big]\\ {} & ={\mathbb{E}_{y}}\big[\| {v_{y}}-{P_{{\Sigma _{xx}}V}}{v_{y}}{\| ^{2}}\big]-\alpha {\mathbb{E}_{\widetilde{y}}}\big[\| {v_{\widetilde{y}}}-{P_{{\Sigma _{\widetilde{x}\widetilde{x}}}V}}{v_{\widetilde{y}}}{\| ^{2}}\big]\\ {} & ={\mathbb{E}_{y}}\big[{v_{y}^{\top }}{v_{y}}-{v_{y}^{\top }}{P_{{\Sigma _{xx}}V}^{2}}{v_{y}}\big]-\alpha {\mathbb{E}_{\widetilde{y}}}\big[{v_{\widetilde{y}}^{\top }}{v_{\widetilde{y}}}-{v_{\widetilde{y}}^{\top }}{P_{{\Sigma _{\widetilde{x}\widetilde{x}}}V}}{v_{\widetilde{y}}}\big]\\ {} & ={\mathbb{E}_{y}}\big[{v_{y}^{\top }}{v_{y}}-{v_{y}^{\top }}{P_{{\Sigma _{xx}}V}}{v_{y}}\big]-\alpha {\mathbb{E}_{\widetilde{y}}}\big[{v_{\widetilde{y}}^{\top }}{v_{\widetilde{y}}}-{v_{\widetilde{y}}^{\top }}{P_{{\Sigma _{\widetilde{x}\widetilde{x}}}V}}{v_{\widetilde{y}}}\big].\end{aligned}\]]]></tex-math></alternatives>
</disp-formula> 
The solution to the optimization problem defined by this loss function, if it exists, leads to our CIR model.</p>
<p>We remove the constant terms <inline-formula id="j_nejsds72_ineq_231"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">]</mml:mo></mml:math><tex-math><![CDATA[${\mathbb{E}_{y}}[{v_{y}^{\top }}{v_{y}}]$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_232"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">]</mml:mo></mml:math><tex-math><![CDATA[${\mathbb{E}_{\widetilde{y}}}[{v_{\widetilde{y}}^{\top }}{v_{\widetilde{y}}}]$]]></tex-math></alternatives></inline-formula> that are independent of <italic>V</italic> and continue to simplify <inline-formula id="j_nejsds72_ineq_233"><alternatives><mml:math>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$f(V)$]]></tex-math></alternatives></inline-formula>: 
<disp-formula id="j_nejsds72_eq_015">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd">
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
<mml:mtd class="align-even">
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:mo movablelimits="false">tr</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="double-struck">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:mo movablelimits="false">tr</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:mo movablelimits="false">tr</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo movablelimits="false">tr</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:mo movablelimits="false">tr</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mphantom>
<mml:mo>=</mml:mo></mml:mphantom>
<mml:mspace width="2.5pt"/>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo movablelimits="false">tr</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:mo movablelimits="false">tr</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mspace width="-0.1667em"/>
<mml:mo>+</mml:mo>
<mml:mspace width="-0.1667em"/>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo movablelimits="false">tr</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}f(V)& =-{\mathbb{E}_{y}}\big[{v_{y}^{\top }}{P_{{\Sigma _{xx}}V}}{v_{y}}\big]+\alpha {\mathbb{E}_{\widetilde{y}}}\big[{v_{\widetilde{y}}^{\top }}{P_{{\Sigma _{\widetilde{x}\widetilde{x}}}V}}{v_{\widetilde{y}}}\big]\\ {} & =-{\mathbb{E}_{y}}\big[\operatorname{tr}\big({v_{y}^{\top }}{P_{{\Sigma _{xx}}V}}{v_{y}}\big)\big]+\alpha {\mathbb{E}_{\widetilde{y}}}\big[\operatorname{tr}\big({v_{\widetilde{y}}^{\top }}{P_{{\Sigma _{\widetilde{x}\widetilde{x}}}V}}{v_{\widetilde{y}}}\big)\big]\\ {} & =-\operatorname{tr}({\Sigma _{x}}{P_{{\Sigma _{xx}}V}})+\alpha \operatorname{tr}({\Sigma _{\widetilde{x}}}{P_{{\Sigma _{\widetilde{x}\widetilde{x}}}V}})\\ {} & =-\operatorname{tr}\big({V^{\top }}{\Sigma _{xx}}{\Sigma _{x}}{\Sigma _{xx}}V{\big[{V^{\top }}{\Sigma _{xx}^{2}}V\big]^{-1}}\big)\\ {} & \phantom{=}\hspace{2.5pt}+\alpha \operatorname{tr}\big({V^{\top }}{\Sigma _{\widetilde{x}\widetilde{x}}}{\Sigma _{\widetilde{x}}}{\Sigma _{\widetilde{x}\widetilde{x}}}V{\big[{V^{\top }}{\Sigma _{\widetilde{x}\widetilde{x}}^{2}}V\big]^{-1}}\big)\\ {} & =-\operatorname{tr}\big({V^{\top }}AV{\big({V^{\top }}BV\big)^{-1}}\big)\hspace{-0.1667em}+\hspace{-0.1667em}\alpha \operatorname{tr}\big({V^{\top }}\widetilde{A}V{\big({V^{\top }}\widetilde{B}V\big)^{-1}}\big),\end{aligned}\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds72_ineq_234"><alternatives><mml:math>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[$A={\Sigma _{xx}}{\Sigma _{x}}{\Sigma _{xx}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_235"><alternatives><mml:math>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[$B={\Sigma _{xx}^{2}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_236"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[$\widetilde{A}={\Sigma _{\widetilde{x}\widetilde{x}}}{\Sigma _{\widetilde{x}}}{\Sigma _{\widetilde{x}\widetilde{x}}}$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds72_ineq_237"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="normal">Σ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[$\widetilde{B}={\Sigma _{\widetilde{x}\widetilde{x}}^{2}}$]]></tex-math></alternatives></inline-formula>.</p></app>
<app id="j_nejsds72_app_002"><label>Appendix B</label>
<title>Proof to Lemma <xref rid="j_nejsds72_stat_003">1</xref></title>
<disp-formula id="j_nejsds72_eq_016">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd">
<mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
</mml:mtd>
<mml:mtd class="align-even">
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>−</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">{</mml:mo>
<mml:mn>2</mml:mn><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>−</mml:mo>
<mml:mn>2</mml:mn><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">}</mml:mo>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}\frac{\partial f}{\partial V}& =-2AV{\big({V^{\top }}BV\big)^{-1}}\\ {} & \hspace{1em}-2BV{\big({V^{\top }}BV\big)^{-1}}{V^{\top }}AV{\big({V^{\top }}BV\big)^{-1}}\\ {} & \hspace{1em}+\alpha \big\{2\widetilde{A}V{\big({V^{\top }}\widetilde{B}V\big)^{-1}}\\ {} & \hspace{1em}-2\widetilde{B}V{\big({V^{\top }}\widetilde{B}V\big)^{-1}}{V^{\top }}\widetilde{A}V{\big({V^{\top }}\widetilde{B}V\big)^{-1}}\big\}.\end{aligned}\]]]></tex-math></alternatives>
</disp-formula> 
<p>Recall that the projection to the tangent space of the Stiefel manifold <inline-formula id="j_nejsds72_ineq_238"><alternatives><mml:math>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\operatorname{St}(p,d)$]]></tex-math></alternatives></inline-formula> at <italic>V</italic> is given by 
<disp-formula id="j_nejsds72_eq_017">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mo movablelimits="false">Proj</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">Z</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">Z</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo movablelimits="false">Sym</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">Z</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mo>∀</mml:mo>
<mml:mi mathvariant="italic">Z</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ {\operatorname{Proj}_{V}}(Z)=Z-V\operatorname{Sym}\big({V^{\top }}Z\big),\hspace{2.5pt}\forall Z\in {T_{V}}\operatorname{St}(p,d),\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds72_ineq_239"><alternatives><mml:math>
<mml:mo movablelimits="false">Sym</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>:</mml:mo>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="false">
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
<mml:mo>+</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mstyle></mml:math><tex-math><![CDATA[$\operatorname{Sym}(X):=\frac{X+{X^{\top }}}{2}$]]></tex-math></alternatives></inline-formula> is the symmetrizer. Then observe that the following equations involving the pair <italic>A</italic>, <italic>B</italic> and the pair <inline-formula id="j_nejsds72_ineq_240"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{A}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_241"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widetilde{B}$]]></tex-math></alternatives></inline-formula> have to satisfy the following equations: 
<disp-formula id="j_nejsds72_eq_018">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mspace width="-0.1667em"/>
<mml:mo>−</mml:mo>
<mml:mspace width="-0.1667em"/>
<mml:mn>2</mml:mn>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:mn>2</mml:mn><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mspace width="-0.1667em"/>
<mml:mo>−</mml:mo>
<mml:mspace width="-0.1667em"/>
<mml:mn>2</mml:mn><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}& {V^{\top }}\big(2AV{\big({V^{\top }}BV\big)^{-1}}\hspace{-0.1667em}-\hspace{-0.1667em}2BV{\big({V^{\top }}BV\big)^{-1}}{V^{\top }}AV{\big({V^{\top }}BV\big)^{-1}}\big)\\ {} & \hspace{1em}=0\\ {} & {V^{\top }}\big(2\widetilde{A}V{\big({V^{\top }}\widetilde{B}V\big)^{-1}}\hspace{-0.1667em}-\hspace{-0.1667em}2\widetilde{B}V{\big({V^{\top }}\widetilde{B}V\big)^{-1}}{V^{\top }}\widetilde{A}V{\big({V^{\top }}\widetilde{B}V\big)^{-1}}\big)\\ {} & \hspace{1em}=0.\end{aligned}\]]]></tex-math></alternatives>
</disp-formula>
</p>
<p>That is, <inline-formula id="j_nejsds72_ineq_242"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mstyle displaystyle="false">
<mml:mfrac>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[${V^{\top }}\frac{\partial f}{\partial V}=0$]]></tex-math></alternatives></inline-formula>. As a result, the gradient of <italic>f</italic> is given by 
<disp-formula id="j_nejsds72_eq_019">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right">
<mml:mtr>
<mml:mtd class="align-odd">
<mml:mo movablelimits="false">grad</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mspace width="-0.1667em"/>
<mml:mo>=</mml:mo>
<mml:mspace width="-0.1667em"/>
<mml:msub>
<mml:mrow>
<mml:mo movablelimits="false">Proj</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" maxsize="2.03em" minsize="2.03em">(</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo mathvariant="normal" fence="true" maxsize="2.03em" minsize="2.03em">)</mml:mo>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo movablelimits="false">Sym</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="2.03em" minsize="2.03em">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo mathvariant="normal" fence="true" maxsize="2.03em" minsize="2.03em">)</mml:mo>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ \operatorname{grad}f(V)\hspace{-0.1667em}=\hspace{-0.1667em}{\operatorname{Proj}_{V}}\bigg(\frac{\partial f}{\partial V}\bigg)=\frac{\partial f}{\partial V}-V\operatorname{Sym}\bigg({V^{\top }}\frac{\partial f}{\partial V}\bigg)=\frac{\partial f}{\partial V}.\]]]></tex-math></alternatives>
</disp-formula>
</p></app>
<app id="j_nejsds72_app_003"><label>Appendix C</label>
<title>Proof of Theorem <xref rid="j_nejsds72_stat_004">1</xref></title>
<p>If <italic>V</italic> is a local minimizer (i.e., a stationary point for the optimization problem (<xref rid="j_nejsds72_eq_004">2.3</xref>)), then <inline-formula id="j_nejsds72_ineq_243"><alternatives><mml:math>
<mml:mo movablelimits="false">grad</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\operatorname{grad}f(V)=0$]]></tex-math></alternatives></inline-formula>, from Lemma <xref rid="j_nejsds72_stat_003">1</xref> we have 
<disp-formula id="j_nejsds72_eq_020">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right">
<mml:mtr>
<mml:mtd class="align-odd">
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ AVE(V)-\alpha \widetilde{A}V\widetilde{E}(V)=BVF(V)-\alpha \widetilde{B}V\widetilde{F}(V),\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds72_ineq_244"><alternatives><mml:math>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$E(V)={({V^{\top }}BV)^{-1}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_245"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$\widetilde{E}(V)={({V^{\top }}\widetilde{B}V)^{-1}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_246"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$F(V)={({V^{\top }}BV)^{-1}}{V^{\top }}AV{({V^{\top }}BV)^{-1}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_247"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$\widetilde{F}(V)={({V^{\top }}\widetilde{B}V)^{-1}}{V^{\top }}\widetilde{A}V{({V^{\top }}\widetilde{B}V)^{-1}}$]]></tex-math></alternatives></inline-formula>.</p></app>
<app id="j_nejsds72_app_004"><label>Appendix D</label>
<title>Proof of Theorem <xref rid="j_nejsds72_stat_006">2</xref></title>
<p>By Theorem 1 and Corollary 1 in [<xref ref-type="bibr" rid="j_nejsds72_ref_050">50</xref>], it suffices to show <italic>f</italic> is continuously differentiable, which is a direct corollary of Lemma <xref rid="j_nejsds72_stat_003">1</xref>.</p></app>
<app id="j_nejsds72_app_005"><label>Appendix E</label>
<title>Options for <inline-formula id="j_nejsds72_ineq_248"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">η</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\eta _{k}}$]]></tex-math></alternatives></inline-formula></title>
<p>To introduce other options for <inline-formula id="j_nejsds72_ineq_249"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">η</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\eta _{k}}$]]></tex-math></alternatives></inline-formula>, we need the following definition.</p><statement id="j_nejsds72_stat_008"><label>Definition 2</label>
<title>(Gradient-related sequence, see [<xref ref-type="bibr" rid="j_nejsds72_ref_002">2</xref>, p. 62, Definition 4.2.1]).</title>
<p>Given a function <italic>f</italic> on a Riemannian manifold <italic>M</italic>, a sequence in tangent space <inline-formula id="j_nejsds72_ineq_250"><alternatives><mml:math>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">η</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">}</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">η</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">M</mml:mi></mml:math><tex-math><![CDATA[$\{{\eta _{k}}\},{\eta _{k}}\in {T_{{V_{k}}}}M$]]></tex-math></alternatives></inline-formula>, where <inline-formula id="j_nejsds72_ineq_251"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{k}}$]]></tex-math></alternatives></inline-formula> are defined through the iterative formula <inline-formula id="j_nejsds72_ineq_252"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">t</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">η</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${V_{k+1}}={R_{{V_{k}}}}({t_{k}}{\eta _{k}})$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds72_ineq_253"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${R_{{x_{k}}}}$]]></tex-math></alternatives></inline-formula> can be any retraction (e.g., global retraction mapping <inline-formula id="j_nejsds72_ineq_254"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mtext>Retr</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>:</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">M</mml:mi>
<mml:mo stretchy="false">→</mml:mo>
<mml:mi mathvariant="italic">M</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo stretchy="false">↦</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">I</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${\text{Retr}_{V}}:{T_{V}}M\to M,\xi \mapsto (V+\xi ){({I_{d}}+{\xi ^{\top }}\xi )^{-1/2}}$]]></tex-math></alternatives></inline-formula> on <inline-formula id="j_nejsds72_ineq_255"><alternatives><mml:math>
<mml:mi mathvariant="italic">S</mml:mi>
<mml:mi mathvariant="italic">t</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$St(n,p)$]]></tex-math></alternatives></inline-formula>), is called <bold>gradient-related</bold> if, for any subsequence of <inline-formula id="j_nejsds72_ineq_256"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">}</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mi mathvariant="italic">K</mml:mi>
<mml:mo stretchy="false">⊂</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo fence="true" stretchy="false">}</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\{{V_{k}}\}_{k\in K\subset \{1,2,\dots ,n\}}}$]]></tex-math></alternatives></inline-formula> that converges to a non-critical point of <italic>f</italic>, the corresponding subsequence <inline-formula id="j_nejsds72_ineq_257"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">η</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" stretchy="false">}</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mi mathvariant="italic">K</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\{{\eta _{k}}\}_{k\in K}}$]]></tex-math></alternatives></inline-formula> is bounded and satisfies 
<disp-formula id="j_nejsds72_eq_021">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd">
<mml:mo movablelimits="false">lim</mml:mo>
<mml:munder>
<mml:mrow>
<mml:mo movablelimits="false">sup</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo stretchy="false">→</mml:mo>
<mml:mi>∞</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mi mathvariant="italic">K</mml:mi>
</mml:mrow>
</mml:munder>
<mml:msub>
<mml:mrow>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">⟨</mml:mo>
<mml:mo movablelimits="false">grad</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">η</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">⟩</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">M</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mtd>
<mml:mtd class="align-even">
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}\lim \underset{k\to \infty ,k\in K}{\sup }{\big\langle \operatorname{grad}f({V_{k}}),{\eta _{k}}\big\rangle _{M}}& \lt 0.\end{aligned}\]]]></tex-math></alternatives>
</disp-formula>
</p></statement>
<p>This means that the cosine of gradient and update <inline-formula id="j_nejsds72_ineq_258"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">η</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\eta _{k}}$]]></tex-math></alternatives></inline-formula> needs to form an acute angle for only critical points. Note that a naive Newton step is not necessarily gradient-related (see p. 122 in [<xref ref-type="bibr" rid="j_nejsds72_ref_002">2</xref>]). In particular, <inline-formula id="j_nejsds72_ineq_259"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">η</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>−</mml:mo>
<mml:mo movablelimits="false">grad</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${\eta _{k}}=-\operatorname{grad}f({V_{k}})$]]></tex-math></alternatives></inline-formula> results in a gradient-related sequence, and is suggested by [<xref ref-type="bibr" rid="j_nejsds72_ref_002">2</xref>] as a natural choice.</p></app>
<app id="j_nejsds72_app_006"><label>Appendix F</label>
<title>Proof of Theorem <xref rid="j_nejsds72_stat_007">3</xref></title>
<p>The first assertion regarding consistency is from Theorem 4.3.1 in [<xref ref-type="bibr" rid="j_nejsds72_ref_002">2</xref>], which requires our loss function <italic>f</italic> to be continuously differentiable, a direct corollary of Lemma <xref rid="j_nejsds72_stat_003">1</xref>.</p>
<p>By the compactness of <inline-formula id="j_nejsds72_ineq_260"><alternatives><mml:math>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\operatorname{St}(p,d)$]]></tex-math></alternatives></inline-formula>, the level set <inline-formula id="j_nejsds72_ineq_261"><alternatives><mml:math>
<mml:mi mathvariant="script">L</mml:mi>
<mml:mo>:</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>:</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">≤</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$\mathcal{L}:=\{V\in \operatorname{St}(p,d):\hspace{2.5pt}f(V)\le f({V_{0}})\}$]]></tex-math></alternatives></inline-formula> is compact for any <inline-formula id="j_nejsds72_ineq_262"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${V_{0}}\in \operatorname{St}(p,d)$]]></tex-math></alternatives></inline-formula>, the second assertion follows Corollary 4.3.2 in [<xref ref-type="bibr" rid="j_nejsds72_ref_002">2</xref>].</p>
<p>The third assertion regarding the convergence rate involves second-order conditions, i.e., the Hessian of <italic>f</italic>. Let <inline-formula id="j_nejsds72_ineq_263"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">D</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">f</mml:mi></mml:math><tex-math><![CDATA[${D^{2}}f$]]></tex-math></alternatives></inline-formula> be the Hessian computed in Euclidean coordinates, that is, <inline-formula id="j_nejsds72_ineq_264"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">D</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mo stretchy="false">|</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mi mathvariant="italic">j</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mi mathvariant="italic">l</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>:</mml:mo>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="false">
<mml:mfrac>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:mi mathvariant="italic">f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>∂</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mi mathvariant="italic">j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi>∂</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mi mathvariant="italic">l</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mstyle></mml:math><tex-math><![CDATA[${({D^{2}}f{|_{V}})_{ij,kl}}:=\frac{\partial f}{\partial {V_{ij}}\partial {V_{kl}}}$]]></tex-math></alternatives></inline-formula>, then for tangent vectors <inline-formula id="j_nejsds72_ineq_265"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Ω</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">Ω</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${\Omega _{1}},{\Omega _{2}}\in {T_{V}}\operatorname{St}(p,d)$]]></tex-math></alternatives></inline-formula>, the Hessian is given by [<xref ref-type="bibr" rid="j_nejsds72_ref_002">2</xref>]</p><graphic xlink:href="nejsds72_g012.jpg"/> 
<p>By the definition of <italic>f</italic>, <inline-graphic xlink:href="nejsds72_g013.jpg" id="j_nejsds72_ingr_002"/> is <inline-formula id="j_nejsds72_ineq_266"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">C</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>∞</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${C^{\infty }}$]]></tex-math></alternatives></inline-formula> in the Euclidean sense, so is continuous. By the continuity of <inline-formula id="j_nejsds72_ineq_267"><alternatives><mml:math>
<mml:mo movablelimits="false">grad</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi></mml:math><tex-math><![CDATA[$\operatorname{grad}f$]]></tex-math></alternatives></inline-formula>, <inline-graphic xlink:href="nejsds72_g014.jpg" id="j_nejsds72_ingr_003"/> and <inline-graphic xlink:href="nejsds72_g015.jpg" id="j_nejsds72_ingr_004"/> are also continuous since they are product or summation of continuous functions. Then the convergence rate follows Theorem 4.5.6 in [<xref ref-type="bibr" rid="j_nejsds72_ref_002">2</xref>].</p></app>
<app id="j_nejsds72_app_007"><label>Appendix G</label>
<title>Fixed-Point Approach to Optimization</title>
<p>Motivated by the first order optimality condition for the loss function (<xref rid="j_nejsds72_eq_004">2.3</xref>), we seek a fixed-point method as an alternative to a gradient descent-based algorithm. Instead of solving equation (<xref rid="j_nejsds72_eq_009">2.4</xref>) in one algebraic step, we separate the problem into the following 8 equations, which can be solved cyclically. Recall that <inline-formula id="j_nejsds72_ineq_268"><alternatives><mml:math>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$E(V)={({V^{\top }}BV)^{-1}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_269"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$\widetilde{E}(V)={({V^{\top }}\widetilde{B}V)^{-1}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds72_ineq_270"><alternatives><mml:math>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi></mml:math><tex-math><![CDATA[$G(V)={V^{\top }}AV$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds72_ineq_271"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">G</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi></mml:math><tex-math><![CDATA[$\widetilde{G}(V)={V^{\top }}\widetilde{A}V$]]></tex-math></alternatives></inline-formula> and suppress the index of <inline-formula id="j_nejsds72_ineq_272"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{k}}$]]></tex-math></alternatives></inline-formula>, i.e., <inline-formula id="j_nejsds72_ineq_273"><alternatives><mml:math>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[$V={V_{k}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds72_ineq_274"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{1}}={V_{k+1}}$]]></tex-math></alternatives></inline-formula> for now for legibility: 
<disp-formula id="j_nejsds72_eq_022">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mi mathvariant="italic">A</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">G</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">G</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">G</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">G</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">G</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">G</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">G</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">G</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}& A{V_{1}}E(V)-\alpha \widetilde{A}V\widetilde{E}(V)\\ {} & \hspace{1em}=BVE(V)G(V)E(V)-\alpha \widetilde{B}V\widetilde{E}(V)\widetilde{G}(V)\widetilde{E}(V)\\ {} & AVE(V)-\alpha \widetilde{A}{V_{1}}\widetilde{E}(V)\\ {} & \hspace{1em}=BVE(V)G(V)E(V)-\alpha \widetilde{B}V\widetilde{E}(V)\widetilde{G}(V)\widetilde{E}(V)\\ {} & AVE(V)-\alpha \widetilde{A}V\widetilde{E}(V)\\ {} & \hspace{1em}=B{V_{1}}E(V)G(V)E(V)-\alpha \widetilde{B}V\widetilde{E}(V)\widetilde{G}(V)\widetilde{E}(V)\\ {} & AVE(V)-\alpha \widetilde{A}V\widetilde{E}(V)\\ {} & \hspace{1em}=BVE(V)G(V)E(V)-\alpha \widetilde{B}{V_{1}}\widetilde{E}(V)\widetilde{G}(V)\widetilde{E}(V)\\ {} & AVE({V_{1}})-\alpha \widetilde{A}V\widetilde{E}(V)\\ {} & \hspace{1em}=BVE({V_{1}})G(V)E({V_{1}})-\alpha \widetilde{B}V\widetilde{E}(V)\widetilde{G}(V)\widetilde{E}(V)\\ {} & AVE(V)-\alpha \widetilde{A}V\widetilde{E}({V_{1}})\\ {} & \hspace{1em}=BVE(V)G(V)E(V)-\alpha \widetilde{B}V\widetilde{E}({V_{1}})\widetilde{G}(V)\widetilde{E}({V_{1}})\\ {} & AVE(V)-\alpha \widetilde{A}V\widetilde{E}(V)\\ {} & \hspace{1em}=BVE(V)G({V_{1}})E(V)-\alpha \widetilde{B}V\widetilde{E}(V)\widetilde{G}(V)\widetilde{E}(V)\\ {} & AVE(V)-\alpha \widetilde{A}V\widetilde{E}(V)\\ {} & \hspace{1em}=BVE(V)G(V)E(V)-\alpha \widetilde{B}V\widetilde{E}(V)\widetilde{G}({V_{1}})\widetilde{E}(V)\end{aligned}\]]]></tex-math></alternatives>
</disp-formula> 
In each of the first four of these equations, <inline-formula id="j_nejsds72_ineq_275"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{k+1}}$]]></tex-math></alternatives></inline-formula> can change independently, suggesting a convenient corresponding update rule. For the next two equations, we can premultiply by <inline-formula id="j_nejsds72_ineq_276"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo fence="true" stretchy="false">]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${[{(BV)^{\top }}(BV)]^{-1}}{(BV)^{\top }}$]]></tex-math></alternatives></inline-formula> (and <inline-formula id="j_nejsds72_ineq_277"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo fence="true" stretchy="false">]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${[{(\widetilde{B}V)^{\top }}(\widetilde{B}V)]^{-1}}{(\widetilde{B}V)^{\top }}$]]></tex-math></alternatives></inline-formula>, respectively) to obtain the following equation: 
<disp-formula id="j_nejsds72_eq_023">
<label>(G.1)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:msup>
<mml:mrow>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mspace width="-0.1667em"/>
<mml:mo>−</mml:mo>
<mml:mspace width="-0.1667em"/>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}& {\big[{(BV)^{\top }}(BV)\big]^{-1}}{(BV)^{\top }}AVE({V_{k+1}})\hspace{-0.1667em}-\hspace{-0.1667em}E({V_{k+1}})G(V)E({V_{k+1}})\\ {} & \hspace{1em}=\alpha {\big[{(BV)^{\top }}(BV)\big]^{-1}}{(BV)^{\top }}{H_{1}},\end{aligned}\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds72_ineq_278"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">G</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${H_{1}}=(\widetilde{A}V\widetilde{E}(V)-\widetilde{B}V\widetilde{E}(V)\widetilde{G}(V)\widetilde{E}(V))$]]></tex-math></alternatives></inline-formula>.</p>
<p>In practice, the cyclic update may not converge to stationary points of the optimization problem (<xref rid="j_nejsds72_eq_004">2.3</xref>). However, when the designated cyclic update converges, it can be shown that equation (<xref rid="j_nejsds72_eq_023">G.1</xref>) is in the form of an asymmetric algebraic Riccati equation in <inline-formula id="j_nejsds72_ineq_279"><alternatives><mml:math>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$E({V_{k+1}})$]]></tex-math></alternatives></inline-formula> [<xref ref-type="bibr" rid="j_nejsds72_ref_006">6</xref>]. When we obtain a solution <inline-formula id="j_nejsds72_ineq_280"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${E^{\ast }}=E({V_{k+1}})$]]></tex-math></alternatives></inline-formula> where <inline-formula id="j_nejsds72_ineq_281"><alternatives><mml:math>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[$V={V_{k}}$]]></tex-math></alternatives></inline-formula> is not a local optimum, the <inline-formula id="j_nejsds72_ineq_282"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${E^{\ast }}$]]></tex-math></alternatives></inline-formula> is not in <inline-formula id="j_nejsds72_ineq_283"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>+</mml:mo>
<mml:mo>+</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${S_{++}^{d}}$]]></tex-math></alternatives></inline-formula>, which means we cannot use the Cholesky decomposition to solve for <inline-formula id="j_nejsds72_ineq_284"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{k+1}}$]]></tex-math></alternatives></inline-formula> in the next update.</p>
<p>For the final two equations, we can write 
<disp-formula id="j_nejsds72_eq_024">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mspace width="1em"/>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mo fence="true" maxsize="1.19em" minsize="1.19em">]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}& {V_{k+1}^{\top }}A{V_{k+1}}\\ {} & \hspace{1em}={\big[{\big(VE(V)\big)^{\top }}\big(VE(V)\big)\big]^{-1}}{\big(VE(V)\big)^{\top }}{B^{-1}}{H_{2}}E{(V)^{-1}},\end{aligned}\]]]></tex-math></alternatives>
</disp-formula> 
where 
<disp-formula id="j_nejsds72_eq_025">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mi mathvariant="italic">E</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">G</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>−</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">A</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mi mathvariant="italic">V</mml:mi><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" maxsize="1.19em" minsize="1.19em">)</mml:mo>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ {H_{2}}=AVE(V)+\alpha \big(\widetilde{B}V\widetilde{E}(V)\widetilde{G}(V)\widetilde{E}(V)-\widetilde{A}V\widetilde{E}(V)\big).\]]]></tex-math></alternatives>
</disp-formula> 
However, when <inline-formula id="j_nejsds72_ineq_285"><alternatives><mml:math>
<mml:mi mathvariant="italic">V</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[$V={V_{k}}$]]></tex-math></alternatives></inline-formula> is not a local optimum, again the right-hand side is not symmetric positive-definite, and so we cannot use the Cholesky decomposition to solve for <inline-formula id="j_nejsds72_ineq_286"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{k+1}}$]]></tex-math></alternatives></inline-formula> in the next update.</p>
<p>Note that in order to require <inline-formula id="j_nejsds72_ineq_287"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${V_{k+1}}\in \operatorname{St}(p,d)$]]></tex-math></alternatives></inline-formula>, the final step of each update rule should project the solution for <inline-formula id="j_nejsds72_ineq_288"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${V_{k+1}}$]]></tex-math></alternatives></inline-formula> onto <inline-formula id="j_nejsds72_ineq_289"><alternatives><mml:math>
<mml:mo movablelimits="false">St</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\operatorname{St}(p,d)$]]></tex-math></alternatives></inline-formula>, which can be done by SVD; if <inline-formula id="j_nejsds72_ineq_290"><alternatives><mml:math>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">U</mml:mi>
<mml:mi mathvariant="normal">Σ</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$A=U\Sigma {V^{\top }}$]]></tex-math></alternatives></inline-formula>, then <inline-formula id="j_nejsds72_ineq_291"><alternatives><mml:math>
<mml:mi mathvariant="italic">π</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">A</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">U</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>⊤</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[$\pi (A)=U{V^{\top }}$]]></tex-math></alternatives></inline-formula>.</p>
<p>Although this cyclic update regime does not immediately lead to a practical fixed-point optimization algorithm, it shows that our loss function has the classical link to a Ricatti equation (<xref rid="j_nejsds72_eq_023">G.1</xref>), indicating that more efficient algorithms are possible.</p></app></app-group>
<ref-list id="j_nejsds72_reflist_001">
<title>References</title>
<ref id="j_nejsds72_ref_001">
<label>[1]</label><mixed-citation publication-type="journal"><string-name><surname>Abid</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Zhang</surname>, <given-names>M. J.</given-names></string-name>, <string-name><surname>Bagaria</surname>, <given-names>V. K.</given-names></string-name> and <string-name><surname>Zou</surname>, <given-names>J.</given-names></string-name> (<year>2018</year>). <article-title>Exploring patterns enriched in a dataset with contrastive principal component analysis</article-title>. <source>Nature Communications</source> <volume>9</volume>(<issue>1</issue>) <fpage>1</fpage>–<lpage>7</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_002">
<label>[2]</label><mixed-citation publication-type="chapter"><string-name><surname>Absil</surname>, <given-names>P.-A.</given-names></string-name>, <string-name><surname>Mahony</surname>, <given-names>R.</given-names></string-name> and <string-name><surname>Sepulchre</surname>, <given-names>R.</given-names></string-name> (<year>2009</year>). <chapter-title>Optimization algorithms on matrix manifolds</chapter-title>. In <source>Optimization Algorithms on Matrix Manifolds</source> <publisher-name>Princeton University Press</publisher-name>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1515/9781400830244" xlink:type="simple">https://doi.org/10.1515/9781400830244</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2364186">MR2364186</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_003">
<label>[3]</label><mixed-citation publication-type="journal"><string-name><surname>Alquicira-Hernandez</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Sathe</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Ji</surname>, <given-names>H. P.</given-names></string-name>, <string-name><surname>Nguyen</surname>, <given-names>Q.</given-names></string-name> and <string-name><surname>Powell</surname>, <given-names>J. E.</given-names></string-name> (<year>2019</year>). <article-title>scPred: accurate supervised method for cell-type classification from single-cell RNA-seq data</article-title>. <source>Genome Biology</source> <volume>20</volume>(<issue>1</issue>) <fpage>1</fpage>–<lpage>17</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_004">
<label>[4]</label><mixed-citation publication-type="journal"><string-name><surname>Bader</surname>, <given-names>G. D.</given-names></string-name>, <string-name><surname>Cary</surname>, <given-names>M. P.</given-names></string-name> and <string-name><surname>Sander</surname>, <given-names>C.</given-names></string-name> (<year>2006</year>). <article-title>Pathguide: a pathway resource list</article-title>. <source>Nucleic Acids Research</source> <volume>34</volume> <fpage>504</fpage>–<lpage>506</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_005">
<label>[5]</label><mixed-citation publication-type="journal"><string-name><surname>Becht</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>McInnes</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Healy</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Dutertre</surname>, <given-names>C.-A.</given-names></string-name>, <string-name><surname>Kwok</surname>, <given-names>I. W.</given-names></string-name>, <string-name><surname>Ng</surname>, <given-names>L. G.</given-names></string-name>, <string-name><surname>Ginhoux</surname>, <given-names>F.</given-names></string-name> and <string-name><surname>Newell</surname>, <given-names>E. W.</given-names></string-name> (<year>2019</year>). <article-title>Dimensionality reduction for visualizing single-cell data using UMAP</article-title>. <source>Nature Biotechnology</source> <volume>37</volume>(<issue>1</issue>) <fpage>38</fpage>–<lpage>44</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_006">
<label>[6]</label><mixed-citation publication-type="chapter"><string-name><surname>Bini</surname>, <given-names>D. A.</given-names></string-name>, <string-name><surname>Iannazzo</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Meini</surname>, <given-names>B.</given-names></string-name> and <string-name><surname>Poloni</surname>, <given-names>F.</given-names></string-name> (<year>2008</year>). <chapter-title>Nonsymmetric algebraic Riccati equations associated with an M-matrix: recent advances and algorithms</chapter-title>. In <source>Dagstuhl Seminar Proceedings</source>. <publisher-name>Schloss Dagstuhl-Leibniz-Zentrum für Informatik</publisher-name>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_007">
<label>[7]</label><mixed-citation publication-type="journal"><string-name><surname>Boumal</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Absil</surname>, <given-names>P.-A.</given-names></string-name> and <string-name><surname>Cartis</surname>, <given-names>C.</given-names></string-name> (<year>2019</year>). <article-title>Global rates of convergence for nonconvex optimization on manifolds</article-title>. <source>IMA Journal of Numerical Analysis</source> <volume>39</volume>(<issue>1</issue>) <fpage>1</fpage>–<lpage>33</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1093/imanum/drx080" xlink:type="simple">https://doi.org/10.1093/imanum/drx080</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4023745">MR4023745</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_008">
<label>[8]</label><mixed-citation publication-type="other"><string-name><surname>Breiman</surname>, <given-names>L.</given-names></string-name> (1996). Bias, variance, and arcing classifiers. Technical Report, Tech. Rep. 460, Statistics Department, University of California, Berkeley …. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/aos/1024691079" xlink:type="simple">https://doi.org/10.1214/aos/1024691079</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1635406">MR1635406</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_009">
<label>[9]</label><mixed-citation publication-type="book"><string-name><surname>Breiman</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Friedman</surname>, <given-names>J. H.</given-names></string-name>, <string-name><surname>Olshen</surname>, <given-names>R. A.</given-names></string-name> and <string-name><surname>Stone</surname>, <given-names>C. J.</given-names></string-name> (<year>2017</year>) <source>Classification and regression trees</source>. <publisher-name>Routledge</publisher-name>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0726392">MR0726392</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_010">
<label>[10]</label><mixed-citation publication-type="journal"><string-name><surname>Cai</surname>, <given-names>Z.</given-names></string-name>, <string-name><surname>Li</surname>, <given-names>R.</given-names></string-name> and <string-name><surname>Zhu</surname>, <given-names>L.</given-names></string-name> (<year>2020</year>). <article-title>Online sufficient dimension reduction through sliced inverse regression.</article-title> <source>J. Mach. Learn. Res.</source> <volume>21</volume>(<issue>10</issue>) <fpage>1</fpage>–<lpage>25</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4071193">MR4071193</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_011">
<label>[11]</label><mixed-citation publication-type="journal"><string-name><surname>Caliski</surname>, <given-names>T.</given-names></string-name> and <string-name><surname>Harabasz</surname>, <given-names>J.</given-names></string-name> (<year>1974</year>). <article-title>A dendrite method for cluster analysis</article-title>. <source>Communications in Statistics – Theory and Methods</source> <volume>3</volume>(<issue>1</issue>) <fpage>1</fpage>–<lpage>27</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/03610927408827101" xlink:type="simple">https://doi.org/10.1080/03610927408827101</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0375641">MR0375641</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_012">
<label>[12]</label><mixed-citation publication-type="journal"><string-name><surname>Camastra</surname>, <given-names>F.</given-names></string-name> and <string-name><surname>Staiano</surname>, <given-names>A.</given-names></string-name> (<year>2016</year>). <article-title>Intrinsic dimension estimation: Advances and open problems</article-title>. <source>Information Sciences</source> <volume>328</volume> <fpage>26</fpage>–<lpage>41</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_013">
<label>[13]</label><mixed-citation publication-type="journal"><string-name><surname>Campadelli</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Casiraghi</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Ceruti</surname>, <given-names>C.</given-names></string-name> and <string-name><surname>Rozza</surname>, <given-names>A.</given-names></string-name> (<year>2015</year>). <article-title>Intrinsic dimension estimation: Relevant techniques and a benchmark framework</article-title>. <source>Mathematical Problems in Engineering</source> <volume>2015</volume> <elocation-id>759567</elocation-id>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1155/2015/759567" xlink:type="simple">https://doi.org/10.1155/2015/759567</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3417646">MR3417646</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_014">
<label>[14]</label><mixed-citation publication-type="journal"><string-name><surname>Chen</surname>, <given-names>Z.</given-names></string-name>, <string-name><surname>Wang</surname>, <given-names>B.</given-names></string-name> and <string-name><surname>Gorban</surname>, <given-names>A. N.</given-names></string-name> (<year>2020</year>). <article-title>Multivariate Gaussian and Student-t process regression for multi-output prediction</article-title>. <source>Neural Computing and Applications</source> <volume>32</volume>(<issue>8</issue>) <fpage>3005</fpage>–<lpage>3028</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_015">
<label>[15]</label><mixed-citation publication-type="chapter"><string-name><surname>Chopra</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Hadsell</surname>, <given-names>R.</given-names></string-name> and <string-name><surname>LeCun</surname>, <given-names>Y.</given-names></string-name> (<year>2005</year>). <chapter-title>Learning a similarity metric discriminatively, with application to face verification</chapter-title>. In <source>2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05)</source> <volume>1</volume> <fpage>539</fpage>–<lpage>546</lpage>. <publisher-name>IEEE</publisher-name>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_016">
<label>[16]</label><mixed-citation publication-type="journal"><string-name><surname>Cook</surname>, <given-names>R. D.</given-names></string-name> (<year>1996</year>). <article-title>Graphics for regressions with a binary response</article-title>. <source>Journal of the American Statistical Association</source> <volume>91</volume>(<issue>435</issue>) <fpage>983</fpage>–<lpage>992</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.2307/2291717" xlink:type="simple">https://doi.org/10.2307/2291717</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1424601">MR1424601</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_017">
<label>[17]</label><mixed-citation publication-type="book"><string-name><surname>Cook</surname>, <given-names>R. D.</given-names></string-name> (<year>2009</year>) <source>Regression graphics: Ideas for studying regressions through graphics</source>. <publisher-name>John Wiley &amp; Sons</publisher-name>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1002/9780470316931" xlink:type="simple">https://doi.org/10.1002/9780470316931</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1645673">MR1645673</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_018">
<label>[18]</label><mixed-citation publication-type="journal"><string-name><surname>Cook</surname>, <given-names>R. D.</given-names></string-name> and <string-name><surname>Weisberg</surname>, <given-names>S.</given-names></string-name> (<year>1991</year>). <article-title>Sliced inverse regression for dimension reduction: Comment</article-title>. <source>Journal of the American Statistical Association</source> <volume>86</volume>(<issue>414</issue>) <fpage>328</fpage>–<lpage>332</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1137117">MR1137117</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_019">
<label>[19]</label><mixed-citation publication-type="journal"><string-name><surname>Cortes</surname>, <given-names>C.</given-names></string-name> and <string-name><surname>Vapnik</surname>, <given-names>V.</given-names></string-name> (<year>1995</year>). <article-title>Support-vector networks</article-title>. <source>Machine Learning</source> <volume>20</volume>(<issue>3</issue>) <fpage>273</fpage>–<lpage>297</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_020">
<label>[20]</label><mixed-citation publication-type="journal"><string-name><surname>Cover</surname>, <given-names>T.</given-names></string-name> and <string-name><surname>Hart</surname>, <given-names>P.</given-names></string-name> (<year>1967</year>). <article-title>Nearest neighbor pattern classification</article-title>. <source>IEEE Transactions on Information Theory</source> <volume>13</volume>(<issue>1</issue>) <fpage>21</fpage>–<lpage>27</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_021">
<label>[21]</label><mixed-citation publication-type="journal"><string-name><surname>Dann</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Cujba</surname>, <given-names>A.-M.</given-names></string-name>, <string-name><surname>Oliver</surname>, <given-names>A. J.</given-names></string-name>, <string-name><surname>Meyer</surname>, <given-names>K. B.</given-names></string-name>, <string-name><surname>Teichmann</surname>, <given-names>S. A.</given-names></string-name> and <string-name><surname>Marioni</surname>, <given-names>J. C.</given-names></string-name> (<year>2023</year>). <article-title>Precise identification of cell states altered in disease using healthy single-cell references</article-title>. <source>Nature Genetics</source> <volume>55</volume>(<issue>11</issue>) <fpage>1998</fpage>–<lpage>2008</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_022">
<label>[22]</label><mixed-citation publication-type="other"><string-name><surname>Davies</surname>, <given-names>D. L.</given-names></string-name> and <string-name><surname>Bouldin</surname>, <given-names>D. W.</given-names></string-name> (1979). A cluster separation measure. <italic>IEEE Transactions on Pattern Analysis and Machine Intelligence</italic> (2) 224–227.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_023">
<label>[23]</label><mixed-citation publication-type="journal"><string-name><surname>Edelman</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Arias</surname>, <given-names>T. A.</given-names></string-name> and <string-name><surname>Smith</surname>, <given-names>S. T.</given-names></string-name> (<year>1998</year>). <article-title>The geometry of algorithms with orthogonality constraints</article-title>. <source>SIAM Journal on Matrix Analysis and Applications</source> <volume>20</volume>(<issue>2</issue>) <fpage>303</fpage>–<lpage>353</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1137/S0895479895290954" xlink:type="simple">https://doi.org/10.1137/S0895479895290954</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1646856">MR1646856</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_024">
<label>[24]</label><mixed-citation publication-type="book"><string-name><surname>Freedman</surname>, <given-names>D. A.</given-names></string-name> (<year>2009</year>) <source>Statistical models: theory and practice</source>. <publisher-name>cambridge university press</publisher-name>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1017/CBO9780511815867" xlink:type="simple">https://doi.org/10.1017/CBO9780511815867</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2489600">MR2489600</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_025">
<label>[25]</label><mixed-citation publication-type="journal"><string-name><surname>Girard</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Lorenzo</surname>, <given-names>H.</given-names></string-name> and <string-name><surname>Saracco</surname>, <given-names>J.</given-names></string-name> (<year>2022</year>). <article-title>Advanced topics in sliced inverse regression</article-title>. <source>Journal of Multivariate Analysis</source> <volume>188</volume> <fpage>104852</fpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.jmva.2021.104852" xlink:type="simple">https://doi.org/10.1016/j.jmva.2021.104852</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4353861">MR4353861</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_026">
<label>[26]</label><mixed-citation publication-type="journal"><string-name><surname>Hastie</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Tibshirani</surname>, <given-names>R.</given-names></string-name> and <string-name><surname>Buja</surname>, <given-names>A.</given-names></string-name> (<year>1994</year>). <article-title>Flexible discriminant analysis by optimal scoring</article-title>. <source>Journal of the American Statistical Association</source> <volume>89</volume>(<issue>428</issue>) <fpage>1255</fpage>–<lpage>1270</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1310220">MR1310220</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_027">
<label>[27]</label><mixed-citation publication-type="journal"><string-name><surname>Higuera</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Gardiner</surname>, <given-names>K. J.</given-names></string-name> and <string-name><surname>Cios</surname>, <given-names>K. J.</given-names></string-name> (<year>2015</year>). <article-title>Self-organizing feature maps identify proteins critical to learning in a mouse model of Down syndrome</article-title>. <source>PloS One</source> <volume>10</volume>(<issue>6</issue>) <fpage>0129126</fpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_028">
<label>[28]</label><mixed-citation publication-type="journal"><string-name><surname>Hilafu</surname>, <given-names>H.</given-names></string-name> and <string-name><surname>Safo</surname>, <given-names>S. E.</given-names></string-name> (<year>2022</year>). <article-title>Sparse sliced inverse regression for high dimensional data analysis</article-title>. <source>BMC Bioinformatics</source> <volume>23</volume>(<issue>1</issue>) <fpage>1</fpage>–<lpage>19</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_029">
<label>[29]</label><mixed-citation publication-type="journal"><string-name><surname>Hinton</surname>, <given-names>G. E.</given-names></string-name> (<year>2002</year>). <article-title>Training products of experts by minimizing contrastive divergence</article-title>. <source>Neural Computation</source> <volume>14</volume>(<issue>8</issue>) <fpage>1771</fpage>–<lpage>1800</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_030">
<label>[30]</label><mixed-citation publication-type="journal"><string-name><surname>Hopfield</surname>, <given-names>J. J.</given-names></string-name> (<year>1982</year>). <article-title>Neural networks and physical systems with emergent collective computational abilities.</article-title> <source>Proceedings of the National Academy of Sciences</source> <volume>79</volume>(<issue>8</issue>) <fpage>2554</fpage>–<lpage>2558</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1073/pnas.79.8.2554" xlink:type="simple">https://doi.org/10.1073/pnas.79.8.2554</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0652033">MR0652033</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_031">
<label>[31]</label><mixed-citation publication-type="chapter"><string-name><surname>Hotelling</surname>, <given-names>H.</given-names></string-name> (<year>1992</year>). <chapter-title>Relations between two sets of variates</chapter-title>. In <source>Breakthroughs in Statistics</source> <fpage>162</fpage>–<lpage>190</lpage> <publisher-name>Springer</publisher-name>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_032">
<label>[32]</label><mixed-citation publication-type="journal"><string-name><surname>Hsing</surname>, <given-names>T.</given-names></string-name> and <string-name><surname>Carroll</surname>, <given-names>R. J.</given-names></string-name> (1992). <article-title>An asymptotic theory for sliced inverse regression</article-title>. <source>The Annals of Statistics</source> <volume>20</volume>(<issue>2</issue>) <fpage>1040</fpage>–<lpage>1061</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/aos/1176348669" xlink:type="simple">https://doi.org/10.1214/aos/1176348669</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1165605">MR1165605</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_033">
<label>[33]</label><mixed-citation publication-type="journal"><string-name><surname>Jiang</surname>, <given-names>B.</given-names></string-name> and <string-name><surname>Liu</surname>, <given-names>J. S.</given-names></string-name> (<year>2014</year>). <article-title>Variable selection for general index models via sliced inverse regression</article-title>. <source>The Annals of Statistics</source> <volume>42</volume>(<issue>5</issue>) <fpage>1751</fpage>–<lpage>1786</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/14-AOS1233" xlink:type="simple">https://doi.org/10.1214/14-AOS1233</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3262467">MR3262467</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_034">
<label>[34]</label><mixed-citation publication-type="journal"><string-name><surname>Jiang</surname>, <given-names>C.-R.</given-names></string-name>, <string-name><surname>Yu</surname>, <given-names>W.</given-names></string-name> and <string-name><surname>Wang</surname>, <given-names>J.-L.</given-names></string-name> (<year>2014</year>). <article-title>Inverse regression for longitudinal data</article-title>. <source>The Annals of Statistics</source> <volume>42</volume>(<issue>2</issue>) <fpage>563</fpage>–<lpage>591</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/13-AOS1193" xlink:type="simple">https://doi.org/10.1214/13-AOS1193</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3210979">MR3210979</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_035">
<label>[35]</label><mixed-citation publication-type="journal"><string-name><surname>Jones</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Townes</surname>, <given-names>F. W.</given-names></string-name>, <string-name><surname>Li</surname>, <given-names>D.</given-names></string-name> and <string-name><surname>Engelhardt</surname>, <given-names>B. E.</given-names></string-name> (<year>2022</year>). <article-title>Contrastive latent variable modeling with application to case-control sequencing experiments</article-title>. <source>The Annals of Applied Statistics</source> <volume>16</volume>(<issue>3</issue>) <fpage>1268</fpage>–<lpage>1291</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/21-aoas1534" xlink:type="simple">https://doi.org/10.1214/21-aoas1534</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4455880">MR4455880</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_036">
<label>[36]</label><mixed-citation publication-type="journal"><string-name><surname>Journée</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Nesterov</surname>, <given-names>Y.</given-names></string-name>, <string-name><surname>Richtárik</surname>, <given-names>P.</given-names></string-name> and <string-name><surname>Sepulchre</surname>, <given-names>R.</given-names></string-name> (2010). <article-title>Generalized power method for sparse principal component analysis</article-title>. <source>Journal of Machine Learning Research</source> <volume>11</volume>(<issue>2</issue>) <fpage>517</fpage>–<lpage>553</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2600619">MR2600619</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_037">
<label>[37]</label><mixed-citation publication-type="book"><string-name><surname>Li</surname>, <given-names>B.</given-names></string-name> (<year>2018</year>) <source>Sufficient dimension reduction: Methods and applications with R</source>. <publisher-name>Chapman and Hall/CRC</publisher-name>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_038">
<label>[38]</label><mixed-citation publication-type="other"><string-name><surname>Li</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Jones</surname>, <given-names>A.</given-names></string-name> and <string-name><surname>Engelhardt</surname>, <given-names>B.</given-names></string-name> (2020). Probabilistic contrastive principal component analysis. <italic>arXiv preprint arXiv:</italic><ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2012.07977"><italic>2012.07977</italic></ext-link>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_039">
<label>[39]</label><mixed-citation publication-type="other"><string-name><surname>Li</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Mukhopadhyay</surname>, <given-names>M.</given-names></string-name> and <string-name><surname>Dunson</surname>, <given-names>D. B.</given-names></string-name> (2017). Efficient manifold and subspace approximations with spherelets. <italic>arXiv preprint arXiv:</italic><ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/1706.08263"><italic>1706.08263</italic></ext-link>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/rssb.12508" xlink:type="simple">https://doi.org/10.1111/rssb.12508</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4494155">MR4494155</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_040">
<label>[40]</label><mixed-citation publication-type="journal"><string-name><surname>Li</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Mukhopadhyay</surname>, <given-names>M.</given-names></string-name> and <string-name><surname>Dunson</surname>, <given-names>D. B.</given-names></string-name> (<year>2022</year>). <article-title>Efficient manifold approximation with spherelets</article-title>. <source>Journal of the Royal Statistical Society Series B</source> <volume>84</volume>(<issue>4</issue>) <fpage>1129</fpage>–<lpage>1149</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/rssb.12508" xlink:type="simple">https://doi.org/10.1111/rssb.12508</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4494155">MR4494155</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_041">
<label>[41]</label><mixed-citation publication-type="journal"><string-name><surname>Li</surname>, <given-names>K.-C.</given-names></string-name> (<year>1991</year>). <article-title>Sliced inverse regression for dimension reduction</article-title>. <source>Journal of the American Statistical Association</source> <volume>86</volume>(<issue>414</issue>) <fpage>316</fpage>–<lpage>327</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1137117">MR1137117</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_042">
<label>[42]</label><mixed-citation publication-type="journal"><string-name><surname>Li</surname>, <given-names>L.</given-names></string-name> and <string-name><surname>Yin</surname>, <given-names>X.</given-names></string-name> (<year>2008</year>). <article-title>Sliced inverse regression with regularizations</article-title>. <source>Biometrics</source> <volume>64</volume>(<issue>1</issue>) <fpage>124</fpage>–<lpage>131</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/j.1541-0420.2007.00836.x" xlink:type="simple">https://doi.org/10.1111/j.1541-0420.2007.00836.x</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2422826">MR2422826</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_043">
<label>[43]</label><mixed-citation publication-type="journal"><string-name><surname>Li</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Simonoff</surname>, <given-names>J. S.</given-names></string-name> and <string-name><surname>Tsai</surname>, <given-names>C.-L.</given-names></string-name> (<year>2007</year>). <article-title>Tobit model estimation and sliced inverse regression</article-title>. <source>Statistical Modelling</source> <volume>7</volume>(<issue>2</issue>) <fpage>107</fpage>–<lpage>123</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1177/1471082X0700700201" xlink:type="simple">https://doi.org/10.1177/1471082X0700700201</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2749982">MR2749982</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_044">
<label>[44]</label><mixed-citation publication-type="other"><string-name><surname>Liao</surname>, <given-names>Y.-t.</given-names></string-name>, <string-name><surname>Luo</surname>, <given-names>H.</given-names></string-name> and <string-name><surname>Ma</surname>, <given-names>A.</given-names></string-name> (2023). Efficient Bayesian selection of hyper-parameters for dimension reduction: Case studies for t-SNE and UMAP. <italic>In preparation</italic>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_045">
<label>[45]</label><mixed-citation publication-type="journal"><string-name><surname>Lin</surname>, <given-names>Q.</given-names></string-name>, <string-name><surname>Zhao</surname>, <given-names>Z.</given-names></string-name> and <string-name><surname>Liu</surname>, <given-names>J. S.</given-names></string-name> (<year>2018</year>). <article-title>On consistency and sparsity for sliced inverse regression in high dimensions</article-title>. <source>The Annals of Statistics</source> <volume>46</volume>(<issue>2</issue>) <fpage>580</fpage>–<lpage>610</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/17-AOS1561" xlink:type="simple">https://doi.org/10.1214/17-AOS1561</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3782378">MR3782378</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_046">
<label>[46]</label><mixed-citation publication-type="other"><string-name><surname>Luo</surname>, <given-names>H.</given-names></string-name> and <string-name><surname>Li</surname>, <given-names>D.</given-names></string-name> (2022). Spherical rotation dimension reduction with geometric loss functions, 1–56. arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2204.10975">2204.10975</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4777417">MR4777417</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_047">
<label>[47]</label><mixed-citation publication-type="other"><string-name><surname>Luo</surname>, <given-names>H.</given-names></string-name> and <string-name><surname>Strait</surname>, <given-names>J. D.</given-names></string-name> (2022). Nonparametric multi-shape modeling with uncertainty quantification. <italic>arXiv preprint arXiv:</italic><ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2206.09127"><italic>2206.09127</italic></ext-link>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_048">
<label>[48]</label><mixed-citation publication-type="other"><string-name><surname>Murray</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Demmel</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Mahoney</surname>, <given-names>M. W.</given-names></string-name>, <string-name><surname>Erichson</surname>, <given-names>N. B.</given-names></string-name>, <string-name><surname>Melnichenko</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Malik</surname>, <given-names>O. A.</given-names></string-name>, <string-name><surname>Grigori</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Dereziski</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Lopes</surname>, <given-names>M. E.</given-names></string-name>, <string-name><surname>Liang</surname>, <given-names>T.</given-names></string-name> and <string-name><surname>Luo</surname>, <given-names>H.</given-names></string-name> (2022). <italic>Randomized Numerical Linear Algebra: a perspective on the field with an eye to software</italic>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_049">
<label>[49]</label><mixed-citation publication-type="journal"><string-name><surname>Nierenberg</surname>, <given-names>D. W.</given-names></string-name>, <string-name><surname>Stukel</surname>, <given-names>T. A.</given-names></string-name>, <string-name><surname>Baron</surname>, <given-names>J. A.</given-names></string-name>, <string-name><surname>Dain</surname>, <given-names>B. J.</given-names></string-name>, <string-name><surname>Greenberg</surname>, <given-names>E. R.</given-names></string-name> and <string-name><surname>Group</surname>, <given-names>S. C. P. S.</given-names></string-name> (<year>1989</year>). <article-title>Determinants of plasma levels of beta-carotene and retinol</article-title>. <source>American Journal of Epidemiology</source> <volume>130</volume>(<issue>3</issue>) <fpage>511</fpage>–<lpage>521</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0210418">MR0210418</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_050">
<label>[50]</label><mixed-citation publication-type="chapter"><string-name><surname>Oviedo</surname>, <given-names>H.</given-names></string-name> and <string-name><surname>Dalmau</surname>, <given-names>O.</given-names></string-name> (<year>2019</year>). <chapter-title>A scaled gradient projection method for minimization over the Stiefel manifold</chapter-title>. In <source>Mexican International Conference on Artificial Intelligence</source> <fpage>239</fpage>–<lpage>250</lpage>. <publisher-name>Springer</publisher-name>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/s11075-020-01001-9" xlink:type="simple">https://doi.org/10.1007/s11075-020-01001-9</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4269662">MR4269662</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_051">
<label>[51]</label><mixed-citation publication-type="journal"><string-name><surname>Oviedo</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Dalmau</surname>, <given-names>O.</given-names></string-name> and <string-name><surname>Lara</surname>, <given-names>H.</given-names></string-name> (<year>2021</year>). <article-title>Two adaptive scaled gradient projection methods for Stiefel manifold constrained optimization</article-title>. <source>Numerical Algorithms</source> <volume>87</volume>(<issue>3</issue>) <fpage>1107</fpage>–<lpage>1127</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/s11075-020-01001-9" xlink:type="simple">https://doi.org/10.1007/s11075-020-01001-9</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4269662">MR4269662</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_052">
<label>[52]</label><mixed-citation publication-type="journal"><string-name><surname>Quinlan</surname>, <given-names>J. R.</given-names></string-name> (<year>1987</year>). <article-title>Simplifying decision trees</article-title>. <source>International Journal of Man-Machine Studies</source> <volume>27</volume>(<issue>3</issue>) <fpage>221</fpage>–<lpage>234</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_053">
<label>[53]</label><mixed-citation publication-type="journal"><string-name><surname>Rousseeuw</surname>, <given-names>P. J.</given-names></string-name> (<year>1987</year>). <article-title>Silhouettes: a graphical aid to the interpretation and validation of cluster analysis</article-title>. <source>Journal of Computational and Applied Mathematics</source> <volume>20</volume> <fpage>53</fpage>–<lpage>65</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_054">
<label>[54]</label><mixed-citation publication-type="journal"><string-name><surname>Ruhe</surname>, <given-names>A.</given-names></string-name> (<year>1970</year>). <article-title>Perturbation bounds for means of eigenvalues and invariant subspaces</article-title>. <source>BIT Numerical Mathematics</source> <volume>10</volume>(<issue>3</issue>) <fpage>343</fpage>–<lpage>354</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/bf01934203" xlink:type="simple">https://doi.org/10.1007/bf01934203</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0273802">MR0273802</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_055">
<label>[55]</label><mixed-citation publication-type="chapter"><string-name><surname>Severson</surname>, <given-names>K. A.</given-names></string-name>, <string-name><surname>Ghosh</surname>, <given-names>S.</given-names></string-name> and <string-name><surname>Ng</surname>, <given-names>K.</given-names></string-name> (<year>2019</year>). <chapter-title>Unsupervised learning with contrastive latent variable models</chapter-title>. In <source>Proceedings of the AAAI Conference on Artificial Intelligence</source> <volume>33</volume> <fpage>4862</fpage>–<lpage>4869</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_056">
<label>[56]</label><mixed-citation publication-type="journal"><string-name><surname>Stephenson</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Reynolds</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Botting</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Calero-Nieto</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Morgan</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Tuong</surname>, <given-names>Z.</given-names></string-name>, <string-name><surname>Bach</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Sungnak</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>Worlock</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Yoshida</surname>, <given-names>M.</given-names></string-name> <etal>et al.</etal> (<year>2021</year>). <article-title>Cambridge Institute of therapeutic immunology and infectious disease-national Institute of health research (CITIID-NIHR) COVID-19 BioResource collaboration, single-cell multi-omics analysis of the immune response in COVID-19</article-title>. <source>Nat. Med</source> <volume>27</volume>(<issue>5</issue>) <fpage>904</fpage>–<lpage>916</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_057">
<label>[57]</label><mixed-citation publication-type="other"><string-name><surname>Tagare</surname>, <given-names>H. D.</given-names></string-name> (2011). Notes on optimization on Stiefel manifolds. <italic>Yale University, New Haven</italic>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_058">
<label>[58]</label><mixed-citation publication-type="journal"><string-name><surname>Tibshirani</surname>, <given-names>R.</given-names></string-name> (<year>1996</year>). <article-title>Regression shrinkage and selection via the lasso</article-title>. <source>Journal of the Royal Statistical Society: Series B (Methodological)</source> <volume>58</volume>(<issue>1</issue>) <fpage>267</fpage>–<lpage>288</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1379242">MR1379242</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_059">
<label>[59]</label><mixed-citation publication-type="journal"><string-name><surname>Van der Maaten</surname>, <given-names>L.</given-names></string-name> and <string-name><surname>Hinton</surname>, <given-names>G.</given-names></string-name> (2008). <article-title>Visualizing data using t-SNE</article-title>. <source>Journal of Machine Learning Research</source> <volume>9</volume>(<issue>11</issue>).</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_060">
<label>[60]</label><mixed-citation publication-type="other"><string-name><surname>Virta</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Lee</surname>, <given-names>K.-Y.</given-names></string-name> and <string-name><surname>Li</surname>, <given-names>L.</given-names></string-name> (2022). Sliced inverse regression in metric spaces. <italic>arXiv preprint arXiv:</italic><ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2206.11511"><italic>2206.11511</italic></ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4485085">MR4485085</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_061">
<label>[61]</label><mixed-citation publication-type="journal"><string-name><surname>Vogelstein</surname>, <given-names>J. T.</given-names></string-name>, <string-name><surname>Bridgeford</surname>, <given-names>E. W.</given-names></string-name>, <string-name><surname>Tang</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Zheng</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Douville</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Burns</surname>, <given-names>R.</given-names></string-name> and <string-name><surname>Maggioni</surname>, <given-names>M.</given-names></string-name> (<year>2021</year>). <article-title>Supervised dimensionality reduction for big data</article-title>. <source>Nature Communications</source> <volume>12</volume>(<issue>1</issue>) <fpage>1</fpage>–<lpage>9</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_062">
<label>[62]</label><mixed-citation publication-type="journal"><string-name><surname>Weinberger</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Lin</surname>, <given-names>C.</given-names></string-name> and <string-name><surname>Lee</surname>, <given-names>S.-I.</given-names></string-name> (<year>2023</year>). <article-title>Isolating salient variations of interest in single-cell data with contrastiveVI</article-title>. <source>Nature Methods</source> <volume>20</volume>(<issue>9</issue>) <fpage>1336</fpage>–<lpage>1345</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_063">
<label>[63]</label><mixed-citation publication-type="chapter"><string-name><surname>Weiss</surname>, <given-names>M.</given-names></string-name> and <string-name><surname>Tonella</surname>, <given-names>P.</given-names></string-name> (<year>2022</year>). <chapter-title>Simple techniques work surprisingly well for neural network test prioritization and active learning (replicability study)</chapter-title>. In <source>Proceedings of the 31st ACM SIGSOFT International Symposium on Software Testing and Analysis</source> <fpage>139</fpage>–<lpage>150</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_064">
<label>[64]</label><mixed-citation publication-type="journal"><string-name><surname>Wilkinson</surname>, <given-names>L.</given-names></string-name> and <string-name><surname>Luo</surname>, <given-names>H.</given-names></string-name> (2022). <article-title>A distance-preserving matrix sketch</article-title>. <source>Journal of Computational and Graphical Statistics</source> <volume>31</volume> <fpage>945</fpage>–<lpage>959</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/10618600.2022.2050246" xlink:type="simple">https://doi.org/10.1080/10618600.2022.2050246</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4513361">MR4513361</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_065">
<label>[65]</label><mixed-citation publication-type="journal"><string-name><surname>Wolpert</surname>, <given-names>D. H.</given-names></string-name> (<year>1992</year>). <article-title>Stacked generalization</article-title>. <source>Neural Networks</source> <volume>5</volume>(<issue>2</issue>) <fpage>241</fpage>–<lpage>259</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_066">
<label>[66]</label><mixed-citation publication-type="journal"><string-name><surname>Wu</surname>, <given-names>H.-M.</given-names></string-name>, <string-name><surname>Kao</surname>, <given-names>C.-H.</given-names></string-name> and <string-name><surname>Chen</surname>, <given-names>C.-h.</given-names></string-name> (<year>2020</year>). <article-title>Dimension reduction and visualization of symbolic interval-valued data using sliced inverse regression</article-title>. <source>Advances in Data Science: Symbolic, Complex and Network Data</source> <volume>4</volume> <fpage>49</fpage>–<lpage>77</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_067">
<label>[67]</label><mixed-citation publication-type="chapter"><string-name><surname>Wu</surname>, <given-names>Q.</given-names></string-name>, <string-name><surname>Liang</surname>, <given-names>F.</given-names></string-name> and <string-name><surname>Mukherjee</surname>, <given-names>S.</given-names></string-name> (<year>2013</year>). <chapter-title>Kernel sliced inverse regression: Regularization and consistency</chapter-title>. In <source>Abstract and Applied Analysis</source> <volume>2013</volume>. <publisher-name>Hindawi</publisher-name>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1155/2013/540725" xlink:type="simple">https://doi.org/10.1155/2013/540725</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3081598">MR3081598</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_068">
<label>[68]</label><mixed-citation publication-type="other"><string-name><surname>Wu</surname>, <given-names>Q.</given-names></string-name>, <string-name><surname>Mukherjee</surname>, <given-names>S.</given-names></string-name> and <string-name><surname>Liang</surname>, <given-names>F.</given-names></string-name> (2008). Localized sliced inverse regression. <italic>Advances in Neural Information Processing Systems</italic> <bold>21</bold>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1198/jcgs.2010.08080" xlink:type="simple">https://doi.org/10.1198/jcgs.2010.08080</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2791260">MR2791260</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_069">
<label>[69]</label><mixed-citation publication-type="other"><string-name><surname>Xiao</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Rasul</surname>, <given-names>K.</given-names></string-name> and <string-name><surname>Vollgraf</surname>, <given-names>R.</given-names></string-name> (2017). Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms. <italic>arXiv preprint arXiv:</italic><ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/1708.07747"><italic>1708.07747</italic></ext-link>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_070">
<label>[70]</label><mixed-citation publication-type="journal"><string-name><surname>Young</surname>, <given-names>M. D.</given-names></string-name>, <string-name><surname>Mitchell</surname>, <given-names>T. J.</given-names></string-name>, <string-name><surname>Vieira Braga</surname>, <given-names>F. A.</given-names></string-name>, <string-name><surname>Tran</surname>, <given-names>M. G.</given-names></string-name>, <string-name><surname>Stewart</surname>, <given-names>B. J.</given-names></string-name>, <string-name><surname>Ferdinand</surname>, <given-names>J. R.</given-names></string-name>, <string-name><surname>Collord</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Botting</surname>, <given-names>R. A.</given-names></string-name>, <string-name><surname>Popescu</surname>, <given-names>D.-M.</given-names></string-name>, <string-name><surname>Loudon</surname>, <given-names>K. W.</given-names></string-name> <etal>et al.</etal> (<year>2018</year>). <article-title>Single-cell transcriptomes from human kidneys reveal the cellular identity of renal tumors</article-title>. <source>Science</source> <volume>361</volume>(<issue>6402</issue>) <fpage>594</fpage>–<lpage>599</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_071">
<label>[71]</label><mixed-citation publication-type="journal"><string-name><surname>Zheng</surname>, <given-names>G. X.</given-names></string-name>, <string-name><surname>Terry</surname>, <given-names>J. M.</given-names></string-name>, <string-name><surname>Belgrader</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Ryvkin</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Bent</surname>, <given-names>Z. W.</given-names></string-name>, <string-name><surname>Wilson</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Ziraldo</surname>, <given-names>S. B.</given-names></string-name>, <string-name><surname>Wheeler</surname>, <given-names>T. D.</given-names></string-name>, <string-name><surname>McDermott</surname>, <given-names>G. P.</given-names></string-name>, <string-name><surname>Zhu</surname>, <given-names>J.</given-names></string-name> <etal>et al.</etal> (<year>2017</year>). <article-title>Massively parallel digital transcriptional profiling of single cells</article-title>. <source>Nature Communications</source> <volume>8</volume>(<issue>1</issue>) <fpage>1</fpage>–<lpage>12</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds72_ref_072">
<label>[72]</label><mixed-citation publication-type="journal"><string-name><surname>Zou</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Hastie</surname>, <given-names>T.</given-names></string-name> and <string-name><surname>Tibshirani</surname>, <given-names>R.</given-names></string-name> (<year>2006</year>). <article-title>Sparse principal component analysis</article-title>. <source>Journal of Computational and Graphical Statistics</source> <volume>15</volume>(<issue>2</issue>) <fpage>265</fpage>–<lpage>286</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1198/106186006X113430" xlink:type="simple">https://doi.org/10.1198/106186006X113430</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2252527">MR2252527</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds72_ref_073">
<label>[73]</label><mixed-citation publication-type="other"><string-name><surname>Zou</surname>, <given-names>J. Y.</given-names></string-name>, <string-name><surname>Hsu</surname>, <given-names>D. J.</given-names></string-name>, <string-name><surname>Parkes</surname>, <given-names>D. C.</given-names></string-name> and <string-name><surname>Adams</surname>, <given-names>R. P.</given-names></string-name> (2013). Contrastive learning using spectral methods. <italic>Advances in Neural Information Processing Systems</italic> <bold>26</bold>.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
