<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">NEJSDS</journal-id>
<journal-title-group><journal-title>The New England Journal of Statistics in Data Science</journal-title></journal-title-group>
<issn pub-type="ppub">2693-7166</issn>
<issn-l>2693-7166</issn-l>
<publisher>
<publisher-name>New England Statistical Society</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">NEJSDS11</article-id>
<article-id pub-id-type="doi">10.51387/22-NEJSDS11</article-id>
<article-categories>
<subj-group subj-group-type="heading"><subject>Commentary and/or Historical Perspective</subject></subj-group>
<subj-group subj-group-type="area"><subject>Spatial and Environmental Statistics</subject></subj-group>
</article-categories>
<title-group>
<article-title>Some Noteworthy Issues in Joint Species Distribution Modeling for Plant Data</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Gelfand</surname><given-names>Alan E.</given-names></name><email xlink:href="mailto:alan@duke.edu">alan@duke.edu</email><xref ref-type="aff" rid="j_nejsds11_aff_001"/>
</contrib>
<aff id="j_nejsds11_aff_001">Department of Statistical Science, <institution>Duke University</institution>, <country>USA</country>. E-mail address: <email xlink:href="mailto:alan@duke.edu">alan@duke.edu</email></aff>
</contrib-group>
<pub-date pub-type="ppub"><year>2023</year></pub-date><pub-date pub-type="epub"><day>19</day><month>10</month><year>2022</year></pub-date><volume>1</volume><issue>1</issue><fpage>102</fpage><lpage>109</lpage>
<history>
<date date-type="accepted"><day>21</day><month>7</month><year>2022</year></date>
</history>
<permissions><copyright-statement>© 2023 New England Statistical Society</copyright-statement><copyright-year>2023</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>Joint species distribution modeling is attracting increasing attention in the literature these days, recognizing the fact that single species modeling fails to take into account expected dependence/interaction between species. This short paper offers discussion that attempts to illuminate five noteworthy technical issues associated with such modeling in the context of plant data. In this setting, the joint species distribution work in the literature considers several types of species data collection. For convenience of discussion, we focus on joint modeling of presence/absence data. For such data, the primary modeling strategy has been through introduction of latent multivariate normal random variables.</p>
<p>These issues address the following: (i) how the observed presence/absence data is linked to the latent normal variables as well as the resulting implications with regard to modeling the data sites as independent or spatially dependent, (ii) the incompatibility of point referenced and areal referenced presence/absence data in spatial modeling of species distribution, (iii) the effect of modeling species independently/marginally rather than jointly within site, with regard to assessing species distribution, (iv) the interpretation of species dependence under the use of latent multivariate normal specification, and (v) the interpretation of clustering of species associated with specific joint species distribution modeling specifications.</p>
<p>It is hoped that, by attempting to clarify these issues, ecological modelers and quantitative ecologists will be able to better appreciate some subtleties that are implicit in this growing collection of modeling ideas. In this regard, this paper can serve as a useful companion piece to the recent survey/comparison article by [<xref ref-type="bibr" rid="j_nejsds11_ref_033">33</xref>] in <italic>Methods in Ecology and Evolution</italic>.</p>
</abstract>
<kwd-group>
<label>Keywords and phrases</label>
<kwd>Dirichlet process</kwd>
<kwd>Gaussian process</kwd>
<kwd>Latent factor analysis</kwd>
<kwd>Latent variables</kwd>
<kwd>Model-based clustering</kwd>
<kwd>Odds ratios</kwd>
<kwd>Spatial dependence</kwd>
<kwd>Species richness</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="j_nejsds11_s_001">
<label>1</label>
<title>Introduction</title>
<p>Recently, in the context of plants, there has been a flood of publication on joint species distribution modeling (JSDM) in the literature [<xref ref-type="bibr" rid="j_nejsds11_ref_022">22</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_029">29</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_019">19</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_009">9</xref>]. A useful comparison of such modeling has been presented in [<xref ref-type="bibr" rid="j_nejsds11_ref_033">33</xref>]. Such effort reflects the realization that observation of a community at a site anticipates dependence between the species present at that site. That is, so-called stacked species distribution modeling [<xref ref-type="bibr" rid="j_nejsds11_ref_013">13</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_006">6</xref>], modeling the species marginally but looking at the results jointly, need not perform well. For example, with presence/absence data, such modeling tends to overestimate probability of presence for each species at a site, hence the number of presences at a site [<xref ref-type="bibr" rid="j_nejsds11_ref_009">9</xref>]. Below, we will elaborate this issue further.</p>
<p>Joint species distribution modeling has been developed for presence/absence data, for count (abundance) data, and for composition data [<xref ref-type="bibr" rid="j_nejsds11_ref_009">9</xref>]. Here, for simplicity in discussing the challenges of interest, we focus solely on presence/absence data. We have primary concern with the setting consisting of a large number <italic>S</italic> of species and a large number <italic>n</italic> of sites. Site <inline-formula id="j_nejsds11_ineq_001"><alternatives>
<mml:math><mml:mi mathvariant="italic">i</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:mi mathvariant="italic">i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mn>2</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mo>…</mml:mo><mml:mo mathvariant="normal">,</mml:mo><mml:mi mathvariant="italic">n</mml:mi></mml:math>
<tex-math><![CDATA[$i,i=1,2,\dots ,n$]]></tex-math></alternatives></inline-formula> provides an <inline-formula id="j_nejsds11_ineq_002"><alternatives>
<mml:math><mml:mi mathvariant="italic">S</mml:mi><mml:mo>×</mml:mo><mml:mn>1</mml:mn></mml:math>
<tex-math><![CDATA[$S\times 1$]]></tex-math></alternatives></inline-formula>, vector, <inline-formula id="j_nejsds11_ineq_003"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{Y}_{i}}$]]></tex-math></alternatives></inline-formula> with entries 1 (presence) or 0 (absence). The sites may be viewed, hence modeled, as independent or spatially dependent, as appropriate. The joint species distribution modeling challenge is the need to model the set of <inline-formula id="j_nejsds11_ineq_004"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">S</mml:mi></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${2^{S}}$]]></tex-math></alternatives></inline-formula> probabilities associated with the set of possible realizations of <inline-formula id="j_nejsds11_ineq_005"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{Y}_{i}}$]]></tex-math></alternatives></inline-formula>. Direct modeling of these probabilities is clearly infeasible even for relatively small <italic>S</italic> while we imagine <italic>S</italic> of order <inline-formula id="j_nejsds11_ineq_006"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mn>10</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${10^{2}}$]]></tex-math></alternatives></inline-formula> or even <inline-formula id="j_nejsds11_ineq_007"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mn>10</mml:mn></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${10^{3}}$]]></tex-math></alternatives></inline-formula>. The common solution that has been adopted in the literature is to introduce latent variables, <inline-formula id="j_nejsds11_ineq_008"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{Z}_{i}}$]]></tex-math></alternatives></inline-formula> which drive the responses <inline-formula id="j_nejsds11_ineq_009"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{Y}_{i}}$]]></tex-math></alternatives></inline-formula>. The <inline-formula id="j_nejsds11_ineq_010"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{Z}_{i}}$]]></tex-math></alternatives></inline-formula> are modeled as multivariate normal vectors which enables tractable model specification though still computationally demanding model fitting. There is increasing literature on this demanding model fitting when <italic>n</italic> and <italic>S</italic> are large [<xref ref-type="bibr" rid="j_nejsds11_ref_027">27</xref>] and, further, when we introduce spatial dependence [<xref ref-type="bibr" rid="j_nejsds11_ref_026">26</xref>]. However, we do not consider the computational challenge here. Rather, our focus is on issues associated with model specification.</p>
<p>This takes us to the specific contribution of this note. We attempt to illuminate five consequential technical issues associated with joint species distribution modeling, presented in the context of presence-absence data. We address the following: (i) how the observed presence/absence data is linked to the latent normal variables as well as the resulting implications with regard to modeling the data sites as independent or spatially dependent, (ii) the incompatibility of point referenced and areal referenced presence/absence data in spatial modeling of species distribution, (iii) the effect of modeling species independently rather than jointly within site, with regard to assessing species distribution, (iv) the interpretation of species dependence under the use of latent multivariate normal specification, and (v) the interpretation of clustering of species associated with specific joint species distribution modeling specifications.</p>
<p>Species distribution modeling for animals offers a much more difficult challenge due to animal movement and the scale of animal range. There is a very active literature on animal movement, almost all of it at the single species level; there is certainly nothing at the order <inline-formula id="j_nejsds11_ineq_011"><alternatives>
<mml:math><mml:mi mathvariant="italic">O</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mn>10</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$O({10^{2}})$]]></tex-math></alternatives></inline-formula> species which we find in the plant literature. The proposed modeling is in a very different spirit from that for plant data, both dynamic and at larger spatial scale, (see, e.g., [<xref ref-type="bibr" rid="j_nejsds11_ref_015">15</xref>]) and we do not pursue it further here.</p>
<p>So, the format for the paper is simple. We devote a section to each of the five foregoing issues and then present a brief concluding section.</p>
</sec>
<sec id="j_nejsds11_s_002">
<label>2</label>
<title>Issue (i): Linking Binary Responses to Latent Normal Variables</title>
<p>This issue concerns how the observed presence/absence data is linked to the latent normal variables as well as the resulting implications with regard to modeling the data sites as independent or spatially dependent. Starting with the nonspatial case, let <inline-formula id="j_nejsds11_ineq_012"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Y_{ij}}$]]></tex-math></alternatives></inline-formula> denote the response of species <italic>j</italic> at site <italic>i</italic> and let <inline-formula id="j_nejsds11_ineq_013"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Z_{ij}}$]]></tex-math></alternatives></inline-formula> denote the associated Gaussian variable. We adopt notation in the spirit of [<xref ref-type="bibr" rid="j_nejsds11_ref_020">20</xref>], letting <inline-formula id="j_nejsds11_ineq_014"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${L_{ij}^{F}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_015"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${L_{ij}^{R}}$]]></tex-math></alternatives></inline-formula> denote the fixed and random effects contributions which are included additively in the modeling of <inline-formula id="j_nejsds11_ineq_016"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Z_{ij}}$]]></tex-math></alternatives></inline-formula>. More will be said about the forms of these <italic>L</italic>’s below. However, as the definitions suggest, we will view <inline-formula id="j_nejsds11_ineq_017"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${L_{ij}^{F}}$]]></tex-math></alternatives></inline-formula> as a nonrandom component in the specification for <inline-formula id="j_nejsds11_ineq_018"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Z_{ij}}$]]></tex-math></alternatives></inline-formula> (though it will have parameters – regression coefficients – in it and, in a Bayesian hierarchical modeling framework, these coefficients would be viewed and modeled as random). We will view <inline-formula id="j_nejsds11_ineq_019"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="bold">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${\mathbf{L}_{i}^{R}}$]]></tex-math></alternatives></inline-formula> as an <inline-formula id="j_nejsds11_ineq_020"><alternatives>
<mml:math><mml:mi mathvariant="italic">S</mml:mi><mml:mo>×</mml:mo><mml:mn>1</mml:mn></mml:math>
<tex-math><![CDATA[$S\times 1$]]></tex-math></alternatives></inline-formula> multivariate normal random variable with mean <inline-formula id="j_nejsds11_ineq_021"><alternatives>
<mml:math><mml:mn mathvariant="bold">0</mml:mn></mml:math>
<tex-math><![CDATA[$\mathbf{0}$]]></tex-math></alternatives></inline-formula> and dependence structure given by an <inline-formula id="j_nejsds11_ineq_022"><alternatives>
<mml:math><mml:mi mathvariant="italic">S</mml:mi><mml:mo>×</mml:mo><mml:mi mathvariant="italic">S</mml:mi></mml:math>
<tex-math><![CDATA[$S\times S$]]></tex-math></alternatives></inline-formula> correlation matrix, <italic>H</italic>. Then, marginally, <inline-formula id="j_nejsds11_ineq_023"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo stretchy="false">∼</mml:mo><mml:mi mathvariant="italic">N</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${L_{ij}^{R}}\sim N(0,1)$]]></tex-math></alternatives></inline-formula>.</p>
<p>Should we model <inline-formula id="j_nejsds11_ineq_024"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Y_{ij}}$]]></tex-math></alternatives></inline-formula> as a <italic>function</italic> of <inline-formula id="j_nejsds11_ineq_025"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Z_{ij}}$]]></tex-math></alternatives></inline-formula>, i.e., <inline-formula id="j_nejsds11_ineq_026"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="italic">I</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">&gt;</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Y_{ij}}=I({Z_{ij}}>0)$]]></tex-math></alternatives></inline-formula> where <italic>I</italic> is the indicator function [as in, e.g., <xref ref-type="bibr" rid="j_nejsds11_ref_022">22</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_009">9</xref>] or should we model <inline-formula id="j_nejsds11_ineq_027"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Y_{ij}}$]]></tex-math></alternatives></inline-formula> using a <italic>conditional</italic> distribution, <inline-formula id="j_nejsds11_ineq_028"><alternatives>
<mml:math><mml:mo fence="true" stretchy="false">[</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo fence="true" stretchy="false">]</mml:mo></mml:math>
<tex-math><![CDATA[$[{Y_{ij}}|{Z_{ij}}]$]]></tex-math></alternatives></inline-formula> [as in <xref ref-type="bibr" rid="j_nejsds11_ref_019">19</xref>]? Does it matter? We now clarify that the answer is NO if we view the sites as independent but YES if we view the sites as spatially dependent.</p>
<p>Under the <italic>functional</italic> relationship between <inline-formula id="j_nejsds11_ineq_029"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Y_{ij}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_030"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Z_{ij}}$]]></tex-math></alternatives></inline-formula>, since <inline-formula id="j_nejsds11_ineq_031"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="italic">I</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">&gt;</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Y_{ij}}=I({Z_{ij}}>0)$]]></tex-math></alternatives></inline-formula>, we have <inline-formula id="j_nejsds11_ineq_032"><alternatives>
<mml:math><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">&gt;</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$P({Y_{ij}}=1)=P({Z_{ij}}>0)$]]></tex-math></alternatives></inline-formula>. Consider the following two specifications for <inline-formula id="j_nejsds11_ineq_033"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Z_{ij}}$]]></tex-math></alternatives></inline-formula>: 
<list>
<list-item id="j_nejsds11_li_001">
<label>(i)</label>
<p><inline-formula id="j_nejsds11_ineq_034"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${Z_{ij}}={L_{ij}^{F}}+{L_{ij}^{R}}$]]></tex-math></alternatives></inline-formula></p>
</list-item>
<list-item id="j_nejsds11_li_002">
<label>(ii)</label>
<p><inline-formula id="j_nejsds11_ineq_035"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">ϵ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Z_{ij}}={L_{ij}^{F}}+{L_{ij}^{R}}+{\epsilon _{ij}}$]]></tex-math></alternatives></inline-formula></p>
</list-item>
</list> 
where, in (ii), the <inline-formula id="j_nejsds11_ineq_036"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">ϵ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\epsilon _{ij}}$]]></tex-math></alternatives></inline-formula> are pure error terms, i.e., independent and identically distributed normal random variables with mean 0 and variance 1. Then, with Φ denoting the standard normal cumulative distribution function, under (i), given <inline-formula id="j_nejsds11_ineq_037"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${L_{ij}^{F}}$]]></tex-math></alternatives></inline-formula>, 
<disp-formula id="j_nejsds11_eq_001">
<label>(2.1)</label><alternatives>
<mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">&gt;</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math>
<tex-math><![CDATA[\[ P({Y_{ij}}=1)=P({Z_{ij}}>0)=\Phi ({L_{ij}^{F}}).\]]]></tex-math></alternatives>
</disp-formula> 
Under (ii), given <inline-formula id="j_nejsds11_ineq_038"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${L_{ij}^{F}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_039"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${L_{ij}^{R}}$]]></tex-math></alternatives></inline-formula>, 
<disp-formula id="j_nejsds11_eq_002">
<label>(2.2)</label><alternatives>
<mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">&gt;</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math>
<tex-math><![CDATA[\[ P({Y_{ij}}=1)=P({Z_{ij}}>0)=\Phi ({L_{ij}^{F}}+{L_{ij}^{R}}).\]]]></tex-math></alternatives>
</disp-formula>
</p>
<p>Working with model (i) for <inline-formula id="j_nejsds11_ineq_040"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Z_{ij}}$]]></tex-math></alternatives></inline-formula>, we have dependence at the first stage specification. The <inline-formula id="j_nejsds11_ineq_041"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Z_{ij}}$]]></tex-math></alternatives></inline-formula> are dependent, <inline-formula id="j_nejsds11_ineq_042"><alternatives>
<mml:math><mml:mtext>corr</mml:mtext><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mtext>corr</mml:mtext><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal">,</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\text{corr}({Z_{ij}},{Z_{i{j^{\prime }}}})=\text{corr}({L_{ij}^{R}},{L_{i{j^{\prime }}}^{R}})$]]></tex-math></alternatives></inline-formula>, and therefore, so are the <inline-formula id="j_nejsds11_ineq_043"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Y_{ij}}$]]></tex-math></alternatives></inline-formula>. Indeed, this direct dependence approach is advocated in [<xref ref-type="bibr" rid="j_nejsds11_ref_009">9</xref>]. The concern here is that now the probability of presence has no random effects in it; simply, <inline-formula id="j_nejsds11_ineq_044"><alternatives>
<mml:math><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$P({Y_{ij}}=1)=\Phi ({L_{ij}^{F}})$]]></tex-math></alternatives></inline-formula> is entirely driven by covariates. We have a basic probit regression for every species. Moreover, how do we usefully interpret a correlation between normal random variables with regard to the association between the binary variables, <inline-formula id="j_nejsds11_ineq_045"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Y_{ij}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_046"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Y_{i{j^{\prime }}}}$]]></tex-math></alternatives></inline-formula>? We take up this question under challenge (iv) in Section 5 below.</p>
<p>If we adopt (ii) above, we model <inline-formula id="j_nejsds11_ineq_047"><alternatives>
<mml:math><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$P({Y_{ij}}=1)$]]></tex-math></alternatives></inline-formula> to include both fixed and random effects. It is clear that dependence is introduced through the specification for <inline-formula id="j_nejsds11_ineq_048"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${L_{ij}^{R}}$]]></tex-math></alternatives></inline-formula>. Under this specification, dependence between species is captured in the probability of presence, the so-called second stage of a hierarchical model, as we see from (<xref rid="j_nejsds11_eq_002">2.2</xref>). In fact, it is the correlation between <inline-formula id="j_nejsds11_ineq_049"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="normal">Φ</mml:mi></mml:mrow><mml:mrow><mml:mo>−</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${\Phi ^{-1}}(P({Y_{ij}}=1))$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_050"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="normal">Φ</mml:mi></mml:mrow><mml:mrow><mml:mo>−</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${\Phi ^{-1}}(P({Y_{i{j^{\prime }}}}=1))$]]></tex-math></alternatives></inline-formula>. In different words, <inline-formula id="j_nejsds11_ineq_051"><alternatives>
<mml:math><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$P({Y_{ij}}=1)$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_052"><alternatives>
<mml:math><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$P({Y_{i{j^{\prime }}}}=1)$]]></tex-math></alternatives></inline-formula> are dependent but the events <inline-formula id="j_nejsds11_ineq_053"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math>
<tex-math><![CDATA[${Y_{ij}}=1$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_054"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Y_{i{j^{\prime }}}}$]]></tex-math></alternatives></inline-formula> are conditionally independent given these probabilities.</p>
<p>Furthermore, stochastic dependence between probability of presence replaces explicit modeling of interaction [<xref ref-type="bibr" rid="j_nejsds11_ref_009">9</xref>]. This raises the question of what the resulting correlation means. In this regard, it is associated with <italic>residuals</italic> as (ii) reveals, i.e., adjusted for the <italic>mean</italic>, <inline-formula id="j_nejsds11_ineq_055"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${L_{ij}^{F}}$]]></tex-math></alternatives></inline-formula>. Moreover, at any site, we will find only a small subset of the <italic>S</italic> species present. That is, <inline-formula id="j_nejsds11_ineq_056"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{Y}_{i}}$]]></tex-math></alternatives></inline-formula> will be predominantly comprised of 0’s. Nonetheless, we create pairwise associations for all pairs of species. So, it is evident that these associations have little to do with the actual realization of <inline-formula id="j_nejsds11_ineq_057"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{Y}_{i}}$]]></tex-math></alternatives></inline-formula> at site <italic>i</italic>.</p>
<p>Furthermore, is a positive association suggestive of encouraging co-occurrence or of a potential substitution effect, i.e., a particular species is present but another, say similar one, could equally well have been successful there? This leads to discussion presented in, e.g., [<xref ref-type="bibr" rid="j_nejsds11_ref_035">35</xref>] and [<xref ref-type="bibr" rid="j_nejsds11_ref_020">20</xref>] regarding the global species pool (all existing species), the regional species pool (those able to colonize an area), and the local species pool (those found at the finest scale considered). Recent discussion clarifying that species co-occurrences from JSDMs are not able to be interpreted directly as species interactions appears in [<xref ref-type="bibr" rid="j_nejsds11_ref_003">3</xref>] and in [<xref ref-type="bibr" rid="j_nejsds11_ref_005">5</xref>]. However, further ecological elaboration of species interaction/dependence is beyond our interest here. In the sequel, under the foregoing modeling, we view pairwise correlation/dependence between species as a surrogate for species interaction.</p>
<p>Now, turn to the <italic>conditional</italic> specification, again, <inline-formula id="j_nejsds11_ineq_058"><alternatives>
<mml:math><mml:mo fence="true" stretchy="false">[</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo fence="true" stretchy="false">]</mml:mo></mml:math>
<tex-math><![CDATA[$[{Y_{ij}}|{Z_{ij}}]$]]></tex-math></alternatives></inline-formula>. Under a probit link function, <inline-formula id="j_nejsds11_ineq_059"><alternatives>
<mml:math><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$P({Y_{ij}}=1)=\Phi ({Z_{ij}})$]]></tex-math></alternatives></inline-formula>. So, under (i), we obtain <inline-formula id="j_nejsds11_ineq_060"><alternatives>
<mml:math><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\Phi ({L_{ij}^{F}}+{L_{ij}^{R}})$]]></tex-math></alternatives></inline-formula>, the form in [<xref ref-type="bibr" rid="j_nejsds11_ref_020">20</xref>]. We can conclude that using (ii) under the functional specification or (i) under the conditional specification produces the same probability of presence. Therefore, if our goal is merely to obtain <inline-formula id="j_nejsds11_ineq_061"><alternatives>
<mml:math><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\Phi ({L_{ij}^{F}}+{L_{ij}^{R}})$]]></tex-math></alternatives></inline-formula> as the probability of presence, we can achieve this under either specification. However, if the functional specification is used, we must adopt (ii) above for the <italic>Z</italic>’s. This distinction seems muddled in, e.g., [<xref ref-type="bibr" rid="j_nejsds11_ref_033">33</xref>].</p>
<p>Next, suppose we bring in space and spatial dependence. For plant data, we assume that the spatial scale of the study region of interest is large enough so that we can view plots as geo-coded locations. Hence, we modify notation by attaching location <inline-formula id="j_nejsds11_ineq_062"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{s}_{i}}$]]></tex-math></alternatives></inline-formula> to site <italic>i</italic> and writing <inline-formula id="j_nejsds11_ineq_063"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">≡</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Y_{ij}}\equiv {Y_{j}}({\mathbf{s}_{i}})$]]></tex-math></alternatives></inline-formula>. Now we conceptualize a presence/absence variable, <inline-formula id="j_nejsds11_ineq_064"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Y_{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> for species <italic>j</italic> at every location, <bold>s</bold>, in the study region, say <italic>D</italic>, and, in fact, a realization of a presence/absence (binary) surface for species <italic>j</italic>, <inline-formula id="j_nejsds11_ineq_065"><alternatives>
<mml:math><mml:mo fence="true" stretchy="false">{</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>:</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo stretchy="false">∈</mml:mo><mml:mi mathvariant="italic">D</mml:mi><mml:mo fence="true" stretchy="false">}</mml:mo></mml:math>
<tex-math><![CDATA[$\{{Y_{j}}(\mathbf{s}):\mathbf{s}\in D\}$]]></tex-math></alternatives></inline-formula>. This surface is observed at <inline-formula id="j_nejsds11_ineq_066"><alternatives>
<mml:math><mml:mo fence="true" stretchy="false">{</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">,</mml:mo><mml:mi mathvariant="italic">i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mn>2</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mo>…</mml:mo><mml:mo mathvariant="normal">,</mml:mo><mml:mi mathvariant="italic">n</mml:mi><mml:mo fence="true" stretchy="false">}</mml:mo></mml:math>
<tex-math><![CDATA[$\{{\mathbf{s}_{i}},i=1,2,\dots ,n\}$]]></tex-math></alternatives></inline-formula>. With regard to the <italic>Z</italic>’s, now we have:</p>
<list>
<list-item id="j_nejsds11_li_003">
<label>(i)</label>
<p><inline-formula id="j_nejsds11_ineq_067"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Z_{j}}(\mathbf{s})={L_{j}^{F}}(\mathbf{s})+{L_{j}^{R}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> or</p>
</list-item>
<list-item id="j_nejsds11_li_004">
<label>(ii)</label>
<p><inline-formula id="j_nejsds11_ineq_068"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">ϵ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Z_{j}}(\mathbf{s})={L_{j}^{F}}(\mathbf{s})+{L_{j}^{R}}(\mathbf{s})+{\epsilon _{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula>.</p>
</list-item>
</list>
<p>Here, <inline-formula id="j_nejsds11_ineq_069"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">ϵ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${\epsilon _{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> is pure error, so called white noise. That is, at each <bold>s</bold>, we have an associated independent normal error random variable.</p>
<p>Suppose <inline-formula id="j_nejsds11_ineq_070"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${L_{j}^{F}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> is a surface which is continuous except for a set of measure 0 over <italic>D</italic>. What this means here is that typically, environmental regressors are available at areal scales making <inline-formula id="j_nejsds11_ineq_071"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${L_{j}^{F}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> continuous over <italic>D</italic> except for the boundaries between areas. The total area of these boundaries is 0 relative to the area of <italic>D</italic>. Further, suppose <inline-formula id="j_nejsds11_ineq_072"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${L_{j}^{R}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> is a realization of a Gaussian process [<xref ref-type="bibr" rid="j_nejsds11_ref_002">2</xref>] which produces mean square continuous realizations.<xref ref-type="fn" rid="j_nejsds11_fn_001">1</xref><fn id="j_nejsds11_fn_001"><label><sup>1</sup></label>
<p>A sufficient condition is that the correlation function of the Gaussian process be continuous at <inline-formula id="j_nejsds11_ineq_073"><alternatives>
<mml:math><mml:mn mathvariant="bold">0</mml:mn></mml:math>
<tex-math><![CDATA[$\mathbf{0}$]]></tex-math></alternatives></inline-formula>.</p></fn> Then, under (i), <inline-formula id="j_nejsds11_ineq_074"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Z_{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> is a continuous surface except for a set of measure 0 while under (ii) <inline-formula id="j_nejsds11_ineq_075"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Z_{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> is everywhere discontinuous because the pure error <inline-formula id="j_nejsds11_ineq_076"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">ϵ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${\epsilon _{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> surface is.</p>
<p>Again, consider the functional specification, now <inline-formula id="j_nejsds11_ineq_077"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">I</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal">&gt;</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Y_{j}}(\mathbf{s})=I({Z_{j}}(\mathbf{s})>0)$]]></tex-math></alternatives></inline-formula> (which is referred to as a clipped Gaussian field in the literature [e.g., <xref ref-type="bibr" rid="j_nejsds11_ref_010">10</xref>]), and the conditional specification, now <inline-formula id="j_nejsds11_ineq_078"><alternatives>
<mml:math><mml:mo fence="true" stretchy="false">[</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo fence="true" stretchy="false">]</mml:mo></mml:math>
<tex-math><![CDATA[$[{Y_{j}}(\mathbf{s})|{Z_{j}}(\mathbf{s})]$]]></tex-math></alternatives></inline-formula>. Suppose, we work with (ii) yielding a probability of presence surface, <inline-formula id="j_nejsds11_ineq_079"><alternatives>
<mml:math><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$P({Y_{j}}(\mathbf{s})=1)=\Phi ({L_{j}^{F}}(\mathbf{s})+{L_{j}^{R}}(\mathbf{s}))$]]></tex-math></alternatives></inline-formula>. Then, following the previous paragraph, for species <italic>j</italic>, the probability of presence surface is a.e. continuous over <italic>D</italic>. However, under the conditional specification, each <inline-formula id="j_nejsds11_ineq_080"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Y_{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> is drawn as a conditionally independent Bernoulli variable given its probability of presence. Hence, the realized presence/absence surface, <inline-formula id="j_nejsds11_ineq_081"><alternatives>
<mml:math><mml:mo fence="true" stretchy="false">{</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>:</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo stretchy="false">∈</mml:mo><mml:mi mathvariant="italic">D</mml:mi><mml:mo fence="true" stretchy="false">}</mml:mo></mml:math>
<tex-math><![CDATA[$\{{Y_{j}}(\mathbf{s}):\mathbf{s}\in D\}$]]></tex-math></alternatives></inline-formula> here is everywhere discontinuous. This seems unsatisfying; the realized presence/absence surface should manifest <italic>local</italic> smoothness, local subregions where it is 0, local subregions where it is 1.</p>
<p>Back to the functional specification, under (ii), since <inline-formula id="j_nejsds11_ineq_082"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Z_{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> is everywhere discontinuous, we can not obtain local continuity for the <inline-formula id="j_nejsds11_ineq_083"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Y_{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> surface. However, under (i), if the <inline-formula id="j_nejsds11_ineq_084"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Z_{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> surface is continuous, with the functional specification, we can obtain local continuity for the <inline-formula id="j_nejsds11_ineq_085"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Y_{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> surface. The point here is that, with spatial modeling, if we value local smoothness in the realized presence/absence surface, if we think that such smoothness more appropriately captures real world behavior of process realizations, then we should work with the functional specification since this smoothness can never be achieved with the conditional specification.</p>
<p>However, to work with the functional specification under (i), we encounter a technical problem. Suppose we define <inline-formula id="j_nejsds11_ineq_086"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold-italic">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">s</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${L_{ij}^{F}}={\boldsymbol{X}^{T}}({\boldsymbol{s}_{i}}){\boldsymbol{\beta }_{j}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_087"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">w</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${L_{ij}^{R}}={w_{j}}({\mathbf{s}_{i}})$]]></tex-math></alternatives></inline-formula>. The problem concerns the difference between the probability of presence surface under (i) vs. under (ii). Because of the spatial dependence imposed on the presence/absence surface under (i), the realized presence surface, <inline-formula id="j_nejsds11_ineq_088"><alternatives>
<mml:math><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold-italic">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\Phi ({\boldsymbol{X}^{T}}(\boldsymbol{s}){\boldsymbol{\beta }_{j}})$]]></tex-math></alternatives></inline-formula> has to “agree” with the observed presences and absences. Under (ii), smoothness is imposed on the probability of presence surface, i.e., <inline-formula id="j_nejsds11_ineq_089"><alternatives>
<mml:math><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold-italic">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">w</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\Phi ({\boldsymbol{X}^{T}}(\boldsymbol{s}){\boldsymbol{\beta }_{j}}+{w_{j}}(\mathbf{s}))$]]></tex-math></alternatives></inline-formula> but not on the realized presence/absence surface. With the latter, we can observe a presence that has small probability of occurring or an absence that has a small probability of occurring. As a result, the probability of presence surface does not have to work as hard to fit the data. Under the functional model, the GP has to react strongly to observed presences and absences. Under the conditional modeling, it has to react less so. Therefore, when fitting the functional model, the <inline-formula id="j_nejsds11_ineq_090"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">w</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${w_{j}}(\boldsymbol{s})$]]></tex-math></alternatives></inline-formula> surface becomes spiky in the neighborhood of a presence in order to explain well the observed presence. The flexibility of the GP produces a posterior which is too sensitive to the data.</p>
<p>A potential solution is to replace <inline-formula id="j_nejsds11_ineq_091"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">ϵ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${\epsilon _{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> with <inline-formula id="j_nejsds11_ineq_092"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">v</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${v_{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula>, a second spatial Gaussian process, exchanging the discontinuity everywhere of the former with the spatial continuity of the latter. That is, still using the functional form, <inline-formula id="j_nejsds11_ineq_093"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">s</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mn>0</mml:mn></mml:math>
<tex-math><![CDATA[${Y_{j}}({\boldsymbol{s}_{i}})=1,0$]]></tex-math></alternatives></inline-formula> according to <inline-formula id="j_nejsds11_ineq_094"><alternatives>
<mml:math><mml:mi mathvariant="italic">Z</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">s</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo stretchy="false">≥</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mo mathvariant="normal">&lt;</mml:mo><mml:mn>0</mml:mn></mml:math>
<tex-math><![CDATA[$Z({\boldsymbol{s}_{i}})\ge 0,<0$]]></tex-math></alternatives></inline-formula>, we have two GP’s in specifying <inline-formula id="j_nejsds11_ineq_095"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Z_{j}}(\boldsymbol{s})$]]></tex-math></alternatives></inline-formula>, i.e., <inline-formula id="j_nejsds11_ineq_096"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold-italic">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">w</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">v</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Z_{j}}(\boldsymbol{s})={\boldsymbol{X}^{T}}(\boldsymbol{s}){\boldsymbol{\beta }_{j}}+{w_{j}}(\boldsymbol{s})+{v_{j}}(\boldsymbol{s})$]]></tex-math></alternatives></inline-formula>. Here, <inline-formula id="j_nejsds11_ineq_097"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">w</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${w_{j}}(\boldsymbol{s})$]]></tex-math></alternatives></inline-formula> has a larger range, a smaller decay parameter while <inline-formula id="j_nejsds11_ineq_098"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">v</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${v_{j}}(\boldsymbol{s})$]]></tex-math></alternatives></inline-formula> has a smaller range with a larger decay parameter. That is, the <italic>w</italic> process seeks to capture the spatial dependence in the process while the <italic>v</italic> process only serves as a device to introduce smoothness.</p>
</sec>
<sec id="j_nejsds11_s_003">
<label>3</label>
<title>Issue (ii): Incompatibility of Point-Referenced and Areal Unit Presence/Absence Data</title>
<p>This issue concerns the incompatibility of point referenced and areal referenced presence/absence data in spatial modeling of species distribution. To do so requires explicit discussion regarding what an observed presence means along with the associated implications. The problem is whether presence/absence is viewed as an event at point level or at areal level. Is it a Bernoulli trial at say location <bold>s</bold> or is it the event that the number of individuals of a species in a set, say <italic>A</italic>, is <inline-formula id="j_nejsds11_ineq_099"><alternatives>
<mml:math><mml:mo stretchy="false">≥</mml:mo><mml:mn>1</mml:mn></mml:math>
<tex-math><![CDATA[$\ge 1$]]></tex-math></alternatives></inline-formula>?</p>
<p>If we model presence/absence at point level, then <inline-formula id="j_nejsds11_ineq_100"><alternatives>
<mml:math><mml:mi mathvariant="italic">Y</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math>
<tex-math><![CDATA[$Y(\mathbf{s})=1$]]></tex-math></alternatives></inline-formula> is the result of a Bernoulli trial at location <bold>s</bold>. However, under point level modeling, what does <inline-formula id="j_nejsds11_ineq_101"><alternatives>
<mml:math><mml:mi mathvariant="italic">Y</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$Y(A)$]]></tex-math></alternatives></inline-formula> mean? A coherent probabilistic definition specifies it as a block average, i.e., a realization of <inline-formula id="j_nejsds11_ineq_102"><alternatives>
<mml:math><mml:mi mathvariant="italic">Y</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$Y(A)$]]></tex-math></alternatives></inline-formula> is <inline-formula id="j_nejsds11_ineq_103"><alternatives>
<mml:math><mml:mi mathvariant="italic">Y</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mo largeop="false" movablelimits="false">∫</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">A</mml:mi></mml:mrow></mml:msub><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">Y</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mi mathvariant="italic">d</mml:mi><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" stretchy="false">/</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo stretchy="false">|</mml:mo></mml:math>
<tex-math><![CDATA[$Y(A)={\textstyle\int _{A}}1(Y(\mathbf{s})=1)d\mathbf{s}/|A|$]]></tex-math></alternatives></inline-formula> (where <inline-formula id="j_nejsds11_ineq_104"><alternatives>
<mml:math><mml:mo stretchy="false">|</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo stretchy="false">|</mml:mo></mml:math>
<tex-math><![CDATA[$|A|$]]></tex-math></alternatives></inline-formula> is the area of <italic>A</italic>). It is the proportion of the <inline-formula id="j_nejsds11_ineq_105"><alternatives>
<mml:math><mml:mi mathvariant="italic">Y</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$Y(\mathbf{s})$]]></tex-math></alternatives></inline-formula> in <italic>A</italic> that equal 1; it is not a Bernoulli trial and <inline-formula id="j_nejsds11_ineq_106"><alternatives>
<mml:math><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">Y</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math>
<tex-math><![CDATA[$P(Y(A)=1)=0$]]></tex-math></alternatives></inline-formula> since the probability that almost every Bernoulli trial in <italic>A</italic> results in a 1 equals 0. We can calculate <inline-formula id="j_nejsds11_ineq_107"><alternatives>
<mml:math><mml:mi mathvariant="italic">E</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">Y</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mo largeop="false" movablelimits="false">∫</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">A</mml:mi></mml:mrow></mml:msub><mml:mi mathvariant="italic">p</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mi mathvariant="italic">d</mml:mi><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" stretchy="false">/</mml:mo><mml:mo stretchy="false">|</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo stretchy="false">|</mml:mo></mml:math>
<tex-math><![CDATA[$E(Y(A))={\textstyle\int _{A}}p(\mathbf{s})d\mathbf{s}/|A|$]]></tex-math></alternatives></inline-formula> with <inline-formula id="j_nejsds11_ineq_108"><alternatives>
<mml:math><mml:mi mathvariant="italic">p</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$p(\mathbf{s})$]]></tex-math></alternatives></inline-formula> specified as in the previous section. That is, <inline-formula id="j_nejsds11_ineq_109"><alternatives>
<mml:math><mml:mi mathvariant="italic">E</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">Y</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$E(Y(A))$]]></tex-math></alternatives></inline-formula> becomes the average probability of presence over <italic>A</italic>. It is the probability that, at a randomly selected location in <italic>A</italic>, the species is present. If <inline-formula id="j_nejsds11_ineq_110"><alternatives>
<mml:math><mml:mi mathvariant="italic">p</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$p(\mathbf{s})$]]></tex-math></alternatives></inline-formula> is constant over <italic>A</italic> then <inline-formula id="j_nejsds11_ineq_111"><alternatives>
<mml:math><mml:mi mathvariant="italic">E</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">Y</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$E(Y(A))$]]></tex-math></alternatives></inline-formula> is this constant probability. It is interpreted at point level; it is the probability of presence at any site in <italic>A</italic>.</p>
<p>Now, suppose we consider the locations of all individuals in a study region as a random point pattern. Then, if <inline-formula id="j_nejsds11_ineq_112"><alternatives>
<mml:math><mml:mi mathvariant="italic">N</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$N(A)$]]></tex-math></alternatives></inline-formula> is the number of individuals in set <italic>A</italic>, <inline-formula id="j_nejsds11_ineq_113"><alternatives>
<mml:math><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mtext>presence</mml:mtext><mml:mspace width="5.69046pt"/><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">n</mml:mi><mml:mspace width="5.69046pt"/><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">N</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo stretchy="false">≥</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$P(\text{presence}\hspace{5.69046pt}in\hspace{5.69046pt}A)=P(N(A)\ge 1)$]]></tex-math></alternatives></inline-formula>. Here, assuming say, a nonhomogeneous Poisson process (NHPP) or, more generally a log Gaussian Cox process (LGCP) with intensity <inline-formula id="j_nejsds11_ineq_114"><alternatives>
<mml:math><mml:mi mathvariant="italic">λ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\lambda (\mathbf{s})$]]></tex-math></alternatives></inline-formula> (see Illian et al. (2008) for a full discussion of NHPPs and LGCPs), <inline-formula id="j_nejsds11_ineq_115"><alternatives>
<mml:math><mml:mi mathvariant="italic">N</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo stretchy="false">∼</mml:mo><mml:mi mathvariant="italic">P</mml:mi><mml:mi mathvariant="italic">o</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">λ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$N(A)\sim Po(\lambda (A))$]]></tex-math></alternatives></inline-formula> where <inline-formula id="j_nejsds11_ineq_116"><alternatives>
<mml:math><mml:mi mathvariant="italic">λ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mo largeop="false" movablelimits="false">∫</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">A</mml:mi></mml:mrow></mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mi mathvariant="italic">d</mml:mi><mml:mi mathvariant="italic">s</mml:mi></mml:math>
<tex-math><![CDATA[$\lambda (A)={\textstyle\int _{A}}\lambda (s)ds$]]></tex-math></alternatives></inline-formula>. Then, taking the areal unit definition of a presence in <italic>A</italic>, we seek <inline-formula id="j_nejsds11_ineq_117"><alternatives>
<mml:math><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">Y</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">N</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo stretchy="false">≥</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>−</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">e</mml:mi></mml:mrow><mml:mrow><mml:mo>−</mml:mo><mml:mi mathvariant="italic">λ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[$P(Y(A)=1)=P(N(A)\ge 1)=1-{e^{-\lambda (A)}}$]]></tex-math></alternatives></inline-formula>. Viewing the data as a collection of observed presences imagines the data as presence-only; there are no absences [<xref ref-type="bibr" rid="j_nejsds11_ref_032">32</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_007">7</xref>]. This conceptualization enables the foregoing definition of presence/absence. However, the probability of a presence is only defined given <italic>A</italic> and, evidently, will depend on the size/scale of <italic>A</italic>. As a result, it is unclear how to specify a meaningful probability of presence surface. Perhaps the best option would be a <italic>gridded</italic> surface for some choice of <italic>A</italic>? Furthermore, the definition of probability of presence as “one or more” observations of the species in <italic>A</italic> yields local distortion to any such surface; <inline-formula id="j_nejsds11_ineq_118"><alternatives>
<mml:math><mml:mi mathvariant="italic">N</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math>
<tex-math><![CDATA[$N(A)=1$]]></tex-math></alternatives></inline-formula> or <inline-formula id="j_nejsds11_ineq_119"><alternatives>
<mml:math><mml:mi mathvariant="italic">N</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">A</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>11</mml:mn></mml:math>
<tex-math><![CDATA[$N(A)=11$]]></tex-math></alternatives></inline-formula> are treated the same with regard to probability of presence in <italic>A</italic> [<xref ref-type="bibr" rid="j_nejsds11_ref_001">1</xref>].</p>
<p>The two foregoing definitions associated with presence/absence are incompatible and the fundamental difference between them seems to have been missed in the literature (though see [<xref ref-type="bibr" rid="j_nejsds11_ref_011">11</xref>]). The conceptualization for the first choice is that we go to fixed “point” locations and see what is there; we are not sampling a point pattern. We model a surface over a domain <italic>D</italic> which captures the probability of presence at every location in <italic>D</italic>. The conceptualization for the second is that we identify an area of interest <italic>D</italic> and, theoretically, we census it completely for all of the occurrences of the point pattern (though in practice we never have the sampling effort to a study region completely). We model an intensity which, using the definition above, provides a probability of presence for a given <italic>A</italic>. The intensity surface can be normalized to a density surface under which the probability of an event at a “dimensionless” point is 0. That is, this density has nothing to do with modeling a Bernoulli trial at a point by specifying a probability of presence at the point, hence a probability of presence surface.</p>
<p>Furthermore, if presented with a collection of plots and observed presence/absence for those plots, one would not model the data as a point pattern. No point pattern was observed; there is no way to model an intensity. We would treat the plots as points in space and use a version of the foregoing presence/absence regression models. To reconcile the differences above it may be useful to think more carefully about what the distribution of a species looks like within a specified region, <italic>D</italic> and the associated implications. See [<xref ref-type="bibr" rid="j_nejsds11_ref_011">11</xref>] for further discussion in this regard.</p>
</sec>
<sec id="j_nejsds11_s_004">
<label>4</label>
<title>Issue (iii): the Effect of Dependence Vs. Independence at Site Level</title>
<p>With regard to assessing species distribution, this issue concerns the effect of modeling species independently rather than jointly within site. For example, with presence/absence data, stacking may tend to overestimate probability of presence for each species at a site. Hence the number of presences, the richness, at a site [<xref ref-type="bibr" rid="j_nejsds11_ref_013">13</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_009">9</xref>] may be overestimated. This can be potentially more problematic when a large number of species are examined.</p>
<p>Specifically, [<xref ref-type="bibr" rid="j_nejsds11_ref_013">13</xref>] offer the following criticisms of stacked species distribution modeling: (i) without adding a dispersal filter (e.g., seed dispersal pathways) it may incorrectly predict species in areas that appear environmentally suitable but that are outside their colonizable or historical range; (ii) it does not consider any constraints based on the carrying capacity of the local environment which determine the maximum number of species that may co-occur; and (iii) it does not explicitly consider any rules based on biotic interactions that control species co-occurrences and can exclude species from a community. As a result, it is anticipated that too many species can be predicted to occur in a geographical unit by stacked species distribution models.</p>
<p>We offer a stochastic perspective through formalization of species richness. Species richness records the number of distinct species present at a site and is commonly used to characterize species distributions at sites. With the foregoing notation, the observed richness at site <inline-formula id="j_nejsds11_ineq_120"><alternatives>
<mml:math><mml:mi mathvariant="bold-italic">s</mml:mi></mml:math>
<tex-math><![CDATA[$\boldsymbol{s}$]]></tex-math></alternatives></inline-formula> is <inline-formula id="j_nejsds11_ineq_121"><alternatives>
<mml:math><mml:mtext mathvariant="monospace">Rich</mml:mtext><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mo largeop="false" movablelimits="false">∑</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">S</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\texttt{Rich}(\boldsymbol{s})={\textstyle\sum _{j=1}^{S}}{1_{j}}(\boldsymbol{s})$]]></tex-math></alternatives></inline-formula>. To be clear, <inline-formula id="j_nejsds11_ineq_122"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${1_{j}}(\boldsymbol{s})$]]></tex-math></alternatives></inline-formula> is the indicator of whether species <italic>j</italic> is present at location <inline-formula id="j_nejsds11_ineq_123"><alternatives>
<mml:math><mml:mi mathvariant="bold-italic">s</mml:mi></mml:math>
<tex-math><![CDATA[$\boldsymbol{s}$]]></tex-math></alternatives></inline-formula>. Further, <inline-formula id="j_nejsds11_ineq_124"><alternatives>
<mml:math><mml:mo fence="true" stretchy="false">{</mml:mo><mml:msub><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal">,</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mn>2</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mo>…</mml:mo><mml:mo mathvariant="normal">,</mml:mo><mml:mi mathvariant="italic">S</mml:mi><mml:mo fence="true" stretchy="false">}</mml:mo></mml:math>
<tex-math><![CDATA[$\{{1_{j}}(\boldsymbol{s}),j=1,2,\dots ,S\}$]]></tex-math></alternatives></inline-formula> does not constitute a multinomial trial but, rather, a set of dependent Bernoulli trials. Whether we model species independently using stacked species distribution models or dependently using JSDM’s, <inline-formula id="j_nejsds11_ineq_125"><alternatives>
<mml:math><mml:mi mathvariant="italic">E</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mtext mathvariant="monospace">Rich</mml:mtext><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mo largeop="false" movablelimits="false">∑</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">S</mml:mi></mml:mrow></mml:msubsup><mml:mi mathvariant="italic">E</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mo largeop="false" movablelimits="false">∑</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">S</mml:mi></mml:mrow></mml:msubsup><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mo largeop="false" movablelimits="false">∑</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">S</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi mathvariant="italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$E(\texttt{Rich}(\boldsymbol{s}))={\textstyle\sum _{j=1}^{S}}E({1_{j}}(\boldsymbol{s}))={\textstyle\sum _{j=1}^{S}}P({Y_{j}}(\boldsymbol{s})=1)={\textstyle\sum _{j=1}^{S}}{p_{j}}(\boldsymbol{s})$]]></tex-math></alternatives></inline-formula> with forms for <inline-formula id="j_nejsds11_ineq_126"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${p_{j}}(\boldsymbol{s})$]]></tex-math></alternatives></inline-formula> supplied above. Though the forms are the same, these expectations need not agree since, following the argument of the previous paragraph, the <inline-formula id="j_nejsds11_ineq_127"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${p_{j}}(\boldsymbol{s})$]]></tex-math></alternatives></inline-formula> are expected to be different under an independence model vs. a JSDM. Probabilistically, because the joint model considers the data for all of the species at a site while the individual models consider the data only for the individual species at the site, unconstrained by the overall presence/absence at the site, intuitively, we might anticipate the latter expectations to be larger, suggesting prediction of higher richness using a stacked species distribution model.</p>
<p>Turning to the second moments, we expect to incorrectly estimate uncertainty in richness when the indicator variables in the sum are not independent. That is, Var<inline-formula id="j_nejsds11_ineq_128"><alternatives>
<mml:math><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mtext mathvariant="monospace">Rich</mml:mtext><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mtext>Var</mml:mtext><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mo largeop="false" movablelimits="false">∑</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">S</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$(\texttt{Rich}(\boldsymbol{s}))=\text{Var}({\textstyle\sum _{j=1}^{S}}{1_{j}}(\boldsymbol{s}))$]]></tex-math></alternatives></inline-formula> should reflect the chance of joint presence or absence. Formally, Var<inline-formula id="j_nejsds11_ineq_129"><alternatives>
<mml:math><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mtext mathvariant="monospace">Rich</mml:mtext><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mtext>Var</mml:mtext><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mo largeop="false" movablelimits="false">∑</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">S</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mo largeop="false" movablelimits="false">∑</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">S</mml:mi></mml:mrow></mml:msubsup><mml:mtext>Var</mml:mtext><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:mn>2</mml:mn><mml:msub><mml:mrow><mml:mo largeop="false" movablelimits="false">∑</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">&lt;</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mtext>Cov</mml:mtext><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$(\texttt{Rich}(\boldsymbol{s}))=\text{Var}({\textstyle\sum _{j=1}^{S}}{1_{j}}(\boldsymbol{s}))={\textstyle\sum _{j=1}^{S}}\text{Var}({1_{j}}(\boldsymbol{s}))+2{\textstyle\sum _{j<{j^{\prime }}}}\text{Cov}({1_{j}}(\boldsymbol{s}),{1_{{j^{\prime }}}}(\boldsymbol{s}))$]]></tex-math></alternatives></inline-formula>. However, in obvious notation, <inline-formula id="j_nejsds11_ineq_130"><alternatives>
<mml:math><mml:mtext>Cov</mml:mtext><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">p</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>−</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">p</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\text{Cov}({1_{j}}(\boldsymbol{s}),{1_{{j^{\prime }}}}(\boldsymbol{s}))={p_{(j,{j^{\prime }})}}(\boldsymbol{s})-{p_{j}}(\boldsymbol{s}){p_{{j^{\prime }}}}(\boldsymbol{s})$]]></tex-math></alternatives></inline-formula>. Departure from independence will affect this term and there how departure from independence can affect the variance in observed richness. Finally, the above is all in the context of a single site so the same conclusions apply whether we are building a spatial or a nonspatial specification.</p>
<p>As a last thought here, perhaps the most direct way to demonstrate the benefit of the joint modeling is with conditional prediction. This strategy does not depend upon whether or not the model fitting was done spatially. At a site, suppose we attempt to predict presence/absence for a species, given we know the presence/absence state of some other species at that site. The conditional prediction probabilities will be suitably adjusted given this information. The model for the species under stacking will ignore this information. See [<xref ref-type="bibr" rid="j_nejsds11_ref_034">34</xref>] in this regard.</p>
</sec>
<sec id="j_nejsds11_s_005">
<label>5</label>
<title>Issue (iv): Interpretation of Dependence Under Latent Multivariate Normal Distributions</title>
<p>This issue concerns the interpretation of species dependence under the use of latent multivariate normal specification. As pointed out in Section 2, the pairwise associations arising under the latent multivariate normal have little to do with the actual realization of <inline-formula id="j_nejsds11_ineq_131"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Y_{i}}$]]></tex-math></alternatives></inline-formula> at site <italic>i</italic>. Perhaps more importantly, the pairwise correlations between species arising under the normal model provide little understanding of the nature of/strength of dependence between species. For a pair of species, envisioning a <inline-formula id="j_nejsds11_ineq_132"><alternatives>
<mml:math><mml:mn>2</mml:mn><mml:mo>×</mml:mo><mml:mn>2</mml:mn></mml:math>
<tex-math><![CDATA[$2\times 2$]]></tex-math></alternatives></inline-formula> table for presence/absence, the odds ratio provides a useful tool for learning about species dependence with regard to presence/absence. Specifically, a positive log odds ratio captures sympatry, i.e., encouraging joint occurrence or joint absence. A negative log odds ratio captures allopatry, i.e., discouragement of co-occurrence. As an aside, since independence modeling underlies stacked species distribution models, such models will not be able to capture sympatric or allopatric behavior for pairs of species. [<xref ref-type="bibr" rid="j_nejsds11_ref_012">12</xref>] provide a full discussion of the role of odds ratios in interpretation of species dependence in JSDMs. Here, we extract a few thoughts.</p>
<p>For the JSDMs above, again, dependence across species is captured through the pairwise correlation between species in the latent bivariate normal distribution. We do not model the <inline-formula id="j_nejsds11_ineq_133"><alternatives>
<mml:math><mml:mn>2</mml:mn><mml:mo>×</mml:mo><mml:mn>2</mml:mn></mml:math>
<tex-math><![CDATA[$2\times 2$]]></tex-math></alternatives></inline-formula> table of probabilities, <inline-formula id="j_nejsds11_ineq_134"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">a</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:mi mathvariant="italic">b</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup><mml:mo mathvariant="normal">,</mml:mo><mml:mi mathvariant="italic">a</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:mi mathvariant="italic">b</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mn>1</mml:mn></mml:math>
<tex-math><![CDATA[${p_{a,b}^{(j,{j^{\prime }})}},a,b=0,1$]]></tex-math></alternatives></inline-formula> associated with species <italic>j</italic> and <inline-formula id="j_nejsds11_ineq_135"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${j^{\prime }}$]]></tex-math></alternatives></inline-formula> directly but, rather, we model the parameters in the latent multivariate normal distribution and, as a result, each of these probabilities is a function of these parameters.</p>
<p>However, there is no direct connection between say <inline-formula id="j_nejsds11_ineq_136"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${\rho ^{(j,{j^{\prime }})}}$]]></tex-math></alternatives></inline-formula>, the correlation in the latent multivariate normal between species <italic>j</italic> and species <inline-formula id="j_nejsds11_ineq_137"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${j^{\prime }}$]]></tex-math></alternatives></inline-formula>, and the odds ratio associated with the induced <inline-formula id="j_nejsds11_ineq_138"><alternatives>
<mml:math><mml:mn>2</mml:mn><mml:mo>×</mml:mo><mml:mn>2</mml:mn></mml:math>
<tex-math><![CDATA[$2\times 2$]]></tex-math></alternatives></inline-formula> table of joint probabilities for the species pair, <inline-formula id="j_nejsds11_ineq_139"><alternatives>
<mml:math><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$j,{j^{\prime }})$]]></tex-math></alternatives></inline-formula> at site <italic>i</italic>. Specifically, suppose the latent bivariate normal distribution for <inline-formula id="j_nejsds11_ineq_140"><alternatives>
<mml:math><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mtable equalrows="false" equalcolumns="false" columnalign="center"><mml:mtr><mml:mtd class="array"><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="array"><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mfenced></mml:math>
<tex-math><![CDATA[$\left(\begin{array}{c}{Z_{ij}}\\ {} {Z_{i{j^{\prime }}}}\end{array}\right)$]]></tex-math></alternatives></inline-formula> has mean <inline-formula id="j_nejsds11_ineq_141"><alternatives>
<mml:math><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mtable equalrows="false" equalcolumns="false" columnalign="center"><mml:mtr><mml:mtd class="array"><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="array"><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mfenced></mml:math>
<tex-math><![CDATA[$\left(\begin{array}{c}{\mu _{i}^{(j)}}\\ {} {\mu _{i}^{({j^{\prime }})}}\end{array}\right)$]]></tex-math></alternatives></inline-formula> and correlation matrix <inline-formula id="j_nejsds11_ineq_142"><alternatives>
<mml:math><mml:mfenced separators="" open="(" close=")"><mml:mrow><mml:mtable columnspacing="10.0pt" equalrows="false" columnlines="none" equalcolumns="false" columnalign="center center"><mml:mtr><mml:mtd class="array"><mml:mn>1</mml:mn></mml:mtd><mml:mtd class="array"><mml:msup><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msup></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="array"><mml:msup><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msup></mml:mtd><mml:mtd class="array"><mml:mn>1</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mfenced></mml:math>
<tex-math><![CDATA[$\left(\begin{array}{c@{\hskip10.0pt}c}1& {\rho ^{(j,{j^{\prime }})}}\\ {} {\rho ^{(j,{j^{\prime }})}}& 1\end{array}\right)$]]></tex-math></alternatives></inline-formula>. Then, the odds ratio for species <italic>j</italic> and <inline-formula id="j_nejsds11_ineq_143"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${j^{\prime }}$]]></tex-math></alternatives></inline-formula> at site <italic>i</italic>, 
<disp-formula id="j_nejsds11_eq_003">
<label>(5.1)</label><alternatives>
<mml:math display="block"><mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt"><mml:mtr><mml:mtd class="align-odd"><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">θ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:mtd><mml:mtd class="align-even"><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac><mml:mrow><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:mn>00</mml:mn></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:mn>11</mml:mn></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:mn>10</mml:mn></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:mn>01</mml:mn></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:mfrac></mml:mstyle></mml:mtd></mml:mtr><mml:mtr><mml:mtd class="align-odd"/><mml:mtd class="align-even"><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac><mml:mrow><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">&lt;</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="normal">&lt;</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">≥</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo stretchy="false">≥</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">≥</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="normal">&lt;</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">&lt;</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo stretchy="false">≥</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math>
<tex-math><![CDATA[\[\begin{aligned}{}{\theta _{i}^{(j,{j^{\prime }})}}& =\frac{{p_{i,00}^{(j,{j^{\prime }})}}{p_{i,11}^{(j,{j^{\prime }})}}}{{p_{i,10}^{(j,{j^{\prime }})}}{p_{i,01}^{(j,{j^{\prime }})}}}\\ {} & =\frac{P({Z_{ij}}<0,{Z_{i{j^{\prime }}}}<0)P({Z_{ij}}\ge 0,{Z_{i{j^{\prime }}}}\ge 0)}{P({Z_{ij}}\ge 0,{Z_{i{j^{\prime }}}}<0)P({Z_{ij}}<0,{Z_{i{j^{\prime }}}}\ge 0)}.\end{aligned}\]]]></tex-math></alternatives>
</disp-formula> 
The expressions for the double integrals in (<xref rid="j_nejsds11_eq_003">5.1</xref>) show that each probability is a function of <inline-formula id="j_nejsds11_ineq_144"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${\mu _{i}^{(j)}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds11_ineq_145"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${\mu _{i}^{({j^{\prime }})}}$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds11_ineq_146"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${\rho ^{(j,{j^{\prime }})}}$]]></tex-math></alternatives></inline-formula>. [<xref ref-type="bibr" rid="j_nejsds11_ref_012">12</xref>] prove that <inline-formula id="j_nejsds11_ineq_147"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">θ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${\theta _{i}^{(j,{j^{\prime }})}}$]]></tex-math></alternatives></inline-formula> is non-decreasing in <inline-formula id="j_nejsds11_ineq_148"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${\rho ^{(j,{j^{\prime }})}}$]]></tex-math></alternatives></inline-formula> for fixed <inline-formula id="j_nejsds11_ineq_149"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${\mu _{i}^{(j)}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_150"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${\mu _{i}^{({j^{\prime }})}}$]]></tex-math></alternatives></inline-formula>. However, in the presence of <inline-formula id="j_nejsds11_ineq_151"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${\mu _{i}^{(j)}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds11_ineq_152"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${\mu _{i}^{({j^{\prime }})}}$]]></tex-math></alternatives></inline-formula>, the latent correlations do not determine the strength/magnitude of the odds ratios.</p>
<p>Specifically, this result should be applied to <inline-formula id="j_nejsds11_ineq_153"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">W</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>−</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${W_{ij}}={Z_{ij}}-{\mu _{i}^{(j)}}$]]></tex-math></alternatives></inline-formula> where say, <inline-formula id="j_nejsds11_ineq_154"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="bold">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mu _{i}^{(j)}}={\mathbf{X}_{i}^{T}}{\boldsymbol{\beta }_{j}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_155"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">W</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo>−</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${W_{i{j^{\prime }}}}={Z_{i{j^{\prime }}}}-{\mu _{i}^{({j^{\prime }})}}$]]></tex-math></alternatives></inline-formula> where again, <inline-formula id="j_nejsds11_ineq_156"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="bold">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mu _{i}^{({j^{\prime }})}}={\mathbf{X}_{i}^{T}}{\boldsymbol{\beta }_{{j^{\prime }}}}$]]></tex-math></alternatives></inline-formula>. As a result, <inline-formula id="j_nejsds11_ineq_157"><alternatives>
<mml:math><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">&lt;</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="normal">&lt;</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">P</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">W</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">&lt;</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">c</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">W</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="normal">&lt;</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">c</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$P({Z_{ij}}<0,{Z_{i{j^{\prime }}}}<0)=P({W_{ij}}<{c_{ij}},{W_{i{j^{\prime }}}}<{c_{i{j^{\prime }}}})$]]></tex-math></alternatives></inline-formula>, where <inline-formula id="j_nejsds11_ineq_158"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">c</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>−</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="bold">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${c_{ij}}=-{\mathbf{X}_{i}^{T}}{\boldsymbol{\beta }_{j}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_159"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">c</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>−</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="bold">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${c_{i{j^{\prime }}}}=-{\mathbf{X}_{i}^{T}}{\boldsymbol{\beta }_{{j^{\prime }}}}$]]></tex-math></alternatives></inline-formula>, is non-decreasing in <inline-formula id="j_nejsds11_ineq_160"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${\rho ^{(j,{j^{\prime }})}}$]]></tex-math></alternatives></inline-formula> for any <inline-formula id="j_nejsds11_ineq_161"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{X}_{i}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds11_ineq_162"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\boldsymbol{\beta }_{j}}$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds11_ineq_163"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\boldsymbol{\beta }_{{j^{\prime }}}}$]]></tex-math></alternatives></inline-formula> and therefore so is the associated odds ratio, <inline-formula id="j_nejsds11_ineq_164"><alternatives>
<mml:math><mml:mi mathvariant="italic">θ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\theta ({\mathbf{X}_{i}},{\boldsymbol{\beta }_{j}},{\boldsymbol{\beta }_{{j^{\prime }}}})$]]></tex-math></alternatives></inline-formula>. As a result, for a given <inline-formula id="j_nejsds11_ineq_165"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi></mml:mrow><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${\rho ^{(j,{j^{\prime }})}}$]]></tex-math></alternatives></inline-formula>, we can see the response of <inline-formula id="j_nejsds11_ineq_166"><alternatives>
<mml:math><mml:mi mathvariant="italic">θ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\theta ({\mathbf{X}_{i}},{\boldsymbol{\beta }_{j}},{\boldsymbol{\beta }_{{j^{\prime }}}})$]]></tex-math></alternatives></inline-formula> to changes in <inline-formula id="j_nejsds11_ineq_167"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{X}_{i}}$]]></tex-math></alternatives></inline-formula> for given coefficient vectors; we can understand how the odds ratio varies across environmental niches. In different words, JSDMs <italic>disentangle</italic> the role of the environment from the role of biotic interactions in the model specification. With these models, odds ratios provide a measure of association that <italic>unifies</italic> the effects of the biotic and abiotic conditions while enabling assessment of the effect of each on the association.</p>
</sec>
<sec id="j_nejsds11_s_006">
<label>6</label>
<title>Issue (v): Interpretation of Clustering of Species</title>
<p>This issue concerns interpretation for joint species distribution modeling specifications which impose clustering of species. When <italic>S</italic> is large, it is natural to attempt to cluster the species, here seeking data-driven clustering rather than say taxonomic or morphological clustering. Further, we seek model-based clustering rather than ad hoc clustering. With independent sites, such clustering has been proposed by [<xref ref-type="bibr" rid="j_nejsds11_ref_027">27</xref>]. Can we attach useful interpretation to the resulting clustering? Suppose we include spatial dependence between sites and again seek model-based clustering. An approach for such clustering has been proposed by [<xref ref-type="bibr" rid="j_nejsds11_ref_026">26</xref>]. Again, can we attach useful interpretation to the resulting clustering?</p>
<p>Continuing our notation for <inline-formula id="j_nejsds11_ineq_168"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${L_{ij}^{F}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_169"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${L_{ij}^{R}}$]]></tex-math></alternatives></inline-formula> above, we have <inline-formula id="j_nejsds11_ineq_170"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="bold">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${L_{ij}^{F}}={\mathbf{X}_{i}^{T}}{\boldsymbol{\beta }_{j}}$]]></tex-math></alternatives></inline-formula> where <inline-formula id="j_nejsds11_ineq_171"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{X}_{i}}$]]></tex-math></alternatives></inline-formula> denotes the vector of environmental covariates associated with site <italic>i</italic> and <inline-formula id="j_nejsds11_ineq_172"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\boldsymbol{\beta }_{j}}$]]></tex-math></alternatives></inline-formula> is a species-specific coefficient vector. Collecting to a vector for site <italic>i</italic>, we can write <inline-formula id="j_nejsds11_ineq_173"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="bold">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mi mathvariant="bold">B</mml:mi><mml:msub><mml:mrow><mml:mi mathvariant="bold">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{L}_{i}^{F}}=\mathbf{B}{\mathbf{X}_{i}}$]]></tex-math></alternatives></inline-formula> where <bold>B</bold> is an <inline-formula id="j_nejsds11_ineq_174"><alternatives>
<mml:math><mml:mi mathvariant="italic">S</mml:mi><mml:mo>×</mml:mo><mml:mi mathvariant="italic">p</mml:mi></mml:math>
<tex-math><![CDATA[$S\times p$]]></tex-math></alternatives></inline-formula> matrix whose <italic>j</italic>th row provides the regression coefficients for species <italic>j</italic>. Similarly, collect the <inline-formula id="j_nejsds11_ineq_175"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${L_{ij}^{R}}$]]></tex-math></alternatives></inline-formula> to a vector <inline-formula id="j_nejsds11_ineq_176"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="bold">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${\mathbf{L}_{i}^{R}}$]]></tex-math></alternatives></inline-formula> which is to be modeled as an <inline-formula id="j_nejsds11_ineq_177"><alternatives>
<mml:math><mml:mi mathvariant="italic">S</mml:mi><mml:mo>×</mml:mo><mml:mn>1</mml:mn></mml:math>
<tex-math><![CDATA[$S\times 1$]]></tex-math></alternatives></inline-formula> vector of random effects. Under independence of sites, these vectors are independent and identically distributed as multivariate normals, say <inline-formula id="j_nejsds11_ineq_178"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="bold">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo stretchy="false">∼</mml:mo><mml:mi mathvariant="italic">M</mml:mi><mml:mi mathvariant="italic">V</mml:mi><mml:mi mathvariant="italic">N</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mn mathvariant="bold">0</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mi mathvariant="bold">Σ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${\mathbf{L}_{i}^{R}}\sim MVN(\mathbf{0},\boldsymbol{\Sigma })$]]></tex-math></alternatives></inline-formula> where <bold>Σ</bold> is an <inline-formula id="j_nejsds11_ineq_179"><alternatives>
<mml:math><mml:mi mathvariant="italic">S</mml:mi><mml:mo>×</mml:mo><mml:mi mathvariant="italic">S</mml:mi></mml:math>
<tex-math><![CDATA[$S\times S$]]></tex-math></alternatives></inline-formula> covariance matrix.</p>
<p>In working with binary responses, to be able to identify the coefficients in <bold>B</bold>, we need to work with a correlation matrix rather than a covariance matrix. So, in model fitting we would set <inline-formula id="j_nejsds11_ineq_180"><alternatives>
<mml:math><mml:mi mathvariant="bold">R</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">D</mml:mi></mml:mrow><mml:mrow><mml:mo>−</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" stretchy="false">/</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mi mathvariant="bold">Σ</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="bold">D</mml:mi></mml:mrow><mml:mrow><mml:mo>−</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" stretchy="false">/</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[$\mathbf{R}={\mathbf{D}^{-1/2}}\boldsymbol{\Sigma }{\mathbf{D}^{-1/2}}$]]></tex-math></alternatives></inline-formula>, where <bold>D</bold> is the diagonal matrix consisting of the diagonal elements of <bold>Σ</bold>. As an aside, with regard to model fitting, this enables adaptation of the data-augmentation algorithm proposed by [<xref ref-type="bibr" rid="j_nejsds11_ref_008">8</xref>] for multivariate probit regression and is known in the literature as the parameter-expansion data-augmentation (PX-DA) algorithm [<xref ref-type="bibr" rid="j_nejsds11_ref_018">18</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_017">17</xref>].</p>
<p><bold>Σ</bold> has <inline-formula id="j_nejsds11_ineq_181"><alternatives>
<mml:math><mml:mi mathvariant="italic">S</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">S</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" stretchy="false">/</mml:mo><mml:mn>2</mml:mn></mml:math>
<tex-math><![CDATA[$S(S+1)/2$]]></tex-math></alternatives></inline-formula> distinct entries and with <italic>S</italic> on the order of <inline-formula id="j_nejsds11_ineq_182"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mn>10</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${10^{2}}$]]></tex-math></alternatives></inline-formula> or <inline-formula id="j_nejsds11_ineq_183"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mn>10</mml:mn></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${10^{3}}$]]></tex-math></alternatives></inline-formula> as above, it becomes infeasible to infer about <bold>Σ</bold>. The solution that has been adopted in the fully model-based JSDM literature is to employ a dimension reduction in the form of a so-called latent factor analysis [<xref ref-type="bibr" rid="j_nejsds11_ref_004">4</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_023">23</xref>]. We write <inline-formula id="j_nejsds11_ineq_184"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${L_{ij}^{R}}$]]></tex-math></alternatives></inline-formula> in the form <inline-formula id="j_nejsds11_ineq_185"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="bold-italic">λ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">η</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\boldsymbol{\lambda }_{j}^{T}}{\boldsymbol{\eta }_{i}}$]]></tex-math></alternatives></inline-formula> where each of these vectors is <inline-formula id="j_nejsds11_ineq_186"><alternatives>
<mml:math><mml:mi mathvariant="italic">r</mml:mi><mml:mo>×</mml:mo><mml:mn>1</mml:mn></mml:math>
<tex-math><![CDATA[$r\times 1$]]></tex-math></alternatives></inline-formula> with <inline-formula id="j_nejsds11_ineq_187"><alternatives>
<mml:math><mml:mi mathvariant="italic">r</mml:mi><mml:mo mathvariant="normal">&lt;</mml:mo><mml:mo mathvariant="normal">&lt;</mml:mo><mml:mi mathvariant="italic">S</mml:mi></mml:math>
<tex-math><![CDATA[$r<<S$]]></tex-math></alternatives></inline-formula>. (In applications typically <italic>r</italic> is at least 3 but at most 10.) As a result, we can write <inline-formula id="j_nejsds11_ineq_188"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="bold">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mi mathvariant="bold">Λ</mml:mi><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">η</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{L}_{i}^{R}}=\boldsymbol{\Lambda }{\boldsymbol{\eta }_{i}}$]]></tex-math></alternatives></inline-formula> where <bold>Λ</bold> is <inline-formula id="j_nejsds11_ineq_189"><alternatives>
<mml:math><mml:mi mathvariant="italic">S</mml:mi><mml:mo>×</mml:mo><mml:mi mathvariant="italic">r</mml:mi></mml:math>
<tex-math><![CDATA[$S\times r$]]></tex-math></alternatives></inline-formula>. We envision <italic>r</italic> latent factors where the entries in <inline-formula id="j_nejsds11_ineq_190"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">η</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\boldsymbol{\eta }_{i}}$]]></tex-math></alternatives></inline-formula> are <italic>r</italic> independent <inline-formula id="j_nejsds11_ineq_191"><alternatives>
<mml:math><mml:mi mathvariant="italic">N</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$N(0,1)$]]></tex-math></alternatives></inline-formula> variables and the rows of <bold>Λ</bold> provide the so-called factor loadings for the collection of species. The induced covariance matrix for <inline-formula id="j_nejsds11_ineq_192"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="bold">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${\mathbf{L}_{i}^{R}}$]]></tex-math></alternatives></inline-formula> is <inline-formula id="j_nejsds11_ineq_193"><alternatives>
<mml:math><mml:mi mathvariant="bold">Λ</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="bold">Λ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[$\boldsymbol{\Lambda }{\boldsymbol{\Lambda }^{T}}$]]></tex-math></alternatives></inline-formula>, creating the dependence structure between the species, that is <inline-formula id="j_nejsds11_ineq_194"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">Λ</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="bold">Λ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${(\boldsymbol{\Lambda }{\boldsymbol{\Lambda }^{T}})_{j{j^{\prime }}}}$]]></tex-math></alternatives></inline-formula> is the covariance between species <italic>j</italic> and <inline-formula id="j_nejsds11_ineq_195"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${j^{\prime }}$]]></tex-math></alternatives></inline-formula>. Since <inline-formula id="j_nejsds11_ineq_196"><alternatives>
<mml:math><mml:mi mathvariant="bold">Λ</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="bold">Λ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[$\boldsymbol{\Lambda }{\boldsymbol{\Lambda }^{T}}$]]></tex-math></alternatives></inline-formula> is of rank <italic>r</italic>, not full rank, a diagonal matrix <bold>V</bold> with positive entries <inline-formula id="j_nejsds11_ineq_197"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">V</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math>
<tex-math><![CDATA[${V_{jj}}={\sigma _{j}^{2}}$]]></tex-math></alternatives></inline-formula> is added, yielding the diagonally dominant matrix, <inline-formula id="j_nejsds11_ineq_198"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="bold">Σ</mml:mi></mml:mrow><mml:mrow><mml:mo>∗</mml:mo></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mi mathvariant="bold">Λ</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="bold">Λ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="bold">V</mml:mi></mml:math>
<tex-math><![CDATA[${\boldsymbol{\Sigma }^{\ast }}=\boldsymbol{\Lambda }{\boldsymbol{\Lambda }^{T}}+\mathbf{V}$]]></tex-math></alternatives></inline-formula> as the full rank approximation to <bold>Σ</bold>. This corresponds to adding pure error to the model for the <italic>Z</italic>’s, that is, to adopting model (ii) in Section 2 for the <inline-formula id="j_nejsds11_ineq_199"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Z_{ij}}$]]></tex-math></alternatives></inline-formula>’s.</p>
<p>We now have <inline-formula id="j_nejsds11_ineq_200"><alternatives>
<mml:math><mml:mi mathvariant="italic">r</mml:mi><mml:mi mathvariant="italic">S</mml:mi></mml:math>
<tex-math><![CDATA[$rS$]]></tex-math></alternatives></inline-formula> unknowns in <bold>Λ</bold> with <italic>S</italic> more in the <bold>V</bold> matrix.<xref ref-type="fn" rid="j_nejsds11_fn_002">2</xref><fn id="j_nejsds11_fn_002"><label><sup>2</sup></label>
<p>In practice, setting <inline-formula id="j_nejsds11_ineq_201"><alternatives>
<mml:math><mml:mi mathvariant="bold">V</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mi mathvariant="bold">I</mml:mi></mml:math>
<tex-math><![CDATA[$\mathbf{V}={\sigma ^{2}}\mathbf{I}$]]></tex-math></alternatives></inline-formula> typically provides an adequate approximation.</p></fn> So, the number of unknowns is now order <italic>S</italic> rather than order <inline-formula id="j_nejsds11_ineq_202"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">S</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${S^{2}}$]]></tex-math></alternatives></inline-formula>, achieving the desired dimension reduction. It is well known that the <bold>Λ</bold> matrix is not identified. Various strategies have been proposed in the literature to deal with this issue; [see e.g., <xref ref-type="bibr" rid="j_nejsds11_ref_023">23</xref>]. However, [<xref ref-type="bibr" rid="j_nejsds11_ref_027">27</xref>] introduce model based clustering in the specification of <bold>Λ</bold> to address the identifiability problem. It is implemented through the stick-breaking representation of the Dirichlet process [<xref ref-type="bibr" rid="j_nejsds11_ref_025">25</xref>] which provides specification of random discrete distributions and, therefore, results in a tie when two observations take on the same discrete value. To be clear, this approach enables ties in the <inline-formula id="j_nejsds11_ineq_203"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">λ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\boldsymbol{\lambda }_{j}}$]]></tex-math></alternatives></inline-formula> vectors. It means that the <italic>S</italic> rows in <bold>Λ</bold> will not all be unique. Specifically, in a Markov chain Monte Carlo model fitting implementation, at each iteration the Dirichlet process yields a random number (<inline-formula id="j_nejsds11_ineq_204"><alternatives>
<mml:math><mml:mo mathvariant="normal">&lt;</mml:mo><mml:mi mathvariant="italic">S</mml:mi></mml:math>
<tex-math><![CDATA[$<S$]]></tex-math></alternatives></inline-formula>) of unique choices for the rows of <bold>Λ</bold>. As a result, ties are created in the random effects structure and, for each iteration, the number of distinct rows in <bold>Λ</bold> is the number of clusters for the species associated with that iteration.</p>
<p>So, the clustering resulting from modeling the rows of <bold>Λ</bold> through a Dirichlet process is not clustering the species by their means since each species gets its own vector of regression coefficients from <bold>B</bold>. Rather, it is clustering on the residual covariance structure. If <inline-formula id="j_nejsds11_ineq_205"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">λ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">λ</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\boldsymbol{\lambda }_{j}}={\boldsymbol{\lambda }_{{j^{\prime }}}}$]]></tex-math></alternatives></inline-formula>, then the row entries for <inline-formula id="j_nejsds11_ineq_206"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{Z}_{j}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_207"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">Z</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{Z}_{{j^{\prime }}}}$]]></tex-math></alternatives></inline-formula> in <inline-formula id="j_nejsds11_ineq_208"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="bold">Σ</mml:mi></mml:mrow><mml:mrow><mml:mo>∗</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${\boldsymbol{\Sigma }^{\ast }}$]]></tex-math></alternatives></inline-formula> are identical. In other words, species <italic>j</italic> and <inline-formula id="j_nejsds11_ineq_209"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${j^{\prime }}$]]></tex-math></alternatives></inline-formula> have the same dependence structure with all other species, adjusted for the regressors. If pairwise residual dependence is viewed as a surrogate for pairwise species interaction, then, we might view common dependence with other species as a surrogate for common interaction with other species.</p>
<p>There has been some recent work to cluster the <inline-formula id="j_nejsds11_ineq_210"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\boldsymbol{\beta }_{j}}$]]></tex-math></alternatives></inline-formula>’s, i.e., to cluster the coefficient vectors across species [<xref ref-type="bibr" rid="j_nejsds11_ref_016">16</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_014">14</xref>]. The Dirichlet process modeling that has been employed for the second order dependence structure can be adopted for the first order mean structure if such clustering is of interest. However, since no identifiability challenges are raised with regard to the <inline-formula id="j_nejsds11_ineq_211"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\boldsymbol{\beta }_{j}}$]]></tex-math></alternatives></inline-formula>’s, such clustering is not <italic>needed</italic> for the fitting the means.<xref ref-type="fn" rid="j_nejsds11_fn_003">3</xref><fn id="j_nejsds11_fn_003"><label><sup>3</sup></label>
<p>There is no identifiability problem for the <italic>β</italic>s because the <italic>X</italic>s are observed unlike with the <italic>λ</italic>s where the <italic>η</italic>s are not observed.</p></fn> In the hierarchical setting, an <inline-formula id="j_nejsds11_ineq_212"><alternatives>
<mml:math><mml:mi mathvariant="italic">S</mml:mi><mml:mo>×</mml:mo><mml:mi mathvariant="italic">p</mml:mi></mml:math>
<tex-math><![CDATA[$S\times p$]]></tex-math></alternatives></inline-formula> matrix variate normal distribution is adopted for <bold>B</bold>. In fact, it is greatly simplified to provide a vague independent normal distribution for each of the entries in <bold>B</bold> [see <xref ref-type="bibr" rid="j_nejsds11_ref_027">27</xref>, for details].</p>
<p>Next, we bring in space and spatial dependence to the clustering problem, following the notation of Section 2. Again, we envision a presence/absence variable, <inline-formula id="j_nejsds11_ineq_213"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Y_{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> for species <italic>j</italic> at every location, <bold>s</bold>, in study region <italic>D</italic>. With regard to the <italic>Z</italic>’s, again we have: (i) <inline-formula id="j_nejsds11_ineq_214"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Z_{j}}(\mathbf{s})={L_{j}^{F}}(\mathbf{s})+{L_{j}^{R}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> and (ii) <inline-formula id="j_nejsds11_ineq_215"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">ϵ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Z_{j}}(\mathbf{s})={L_{j}^{F}}(\mathbf{s})+{L_{j}^{R}}(\mathbf{s})+{\epsilon _{j}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula>. Under either (i) or (ii), we envision a multivariate spatial process for <inline-formula id="j_nejsds11_ineq_216"><alternatives>
<mml:math><mml:mi mathvariant="bold">Z</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\mathbf{Z}(\mathbf{s})$]]></tex-math></alternatives></inline-formula>. That is, we envision dependence within the components/species at a given <bold>s</bold> but also, spatial dependence between <inline-formula id="j_nejsds11_ineq_217"><alternatives>
<mml:math><mml:mi mathvariant="bold">Z</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\mathbf{Z}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_218"><alternatives>
<mml:math><mml:mi mathvariant="bold">Z</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\mathbf{Z}({\mathbf{s}^{\prime }})$]]></tex-math></alternatives></inline-formula>. We use the functional specification, <inline-formula id="j_nejsds11_ineq_219"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">I</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal">&gt;</mml:mo><mml:mn>0</mml:mn><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${Y_{j}}(\mathbf{s})=I({Z_{j}}(\mathbf{s})>0)$]]></tex-math></alternatives></inline-formula> to impart spatial dependence for the <italic>Z</italic>’s to the <italic>Y</italic>’s. Such specification requires an <inline-formula id="j_nejsds11_ineq_220"><alternatives>
<mml:math><mml:mi mathvariant="italic">S</mml:mi><mml:mo>×</mml:mo><mml:mi mathvariant="italic">S</mml:mi></mml:math>
<tex-math><![CDATA[$S\times S$]]></tex-math></alternatives></inline-formula> cross-covariance function, say <inline-formula id="j_nejsds11_ineq_221"><alternatives>
<mml:math><mml:mi mathvariant="italic">C</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$C(\mathbf{s},{\mathbf{s}^{\prime }})$]]></tex-math></alternatives></inline-formula> which is such that <inline-formula id="j_nejsds11_ineq_222"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="italic">C</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mtext>cov</mml:mtext><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${(C(\mathbf{s},{\mathbf{s}^{\prime }}))_{j{j^{\prime }}}}=\text{cov}({Z_{j}}(\mathbf{s}),{Z_{{j^{\prime }}}}({\mathbf{s}^{\prime }}))$]]></tex-math></alternatives></inline-formula> [<xref ref-type="bibr" rid="j_nejsds11_ref_002">2</xref>]. Under (i) or (ii) it becomes <inline-formula id="j_nejsds11_ineq_223"><alternatives>
<mml:math><mml:mtext>cov</mml:mtext><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal">,</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\text{cov}({L_{j}^{R}}(\mathbf{s}),{L_{{j^{\prime }}}^{R}}({\mathbf{s}^{\prime }}))$]]></tex-math></alternatives></inline-formula>.</p>
<p>So, <inline-formula id="j_nejsds11_ineq_224"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="bold">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${\mathbf{L}^{R}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> becomes an <italic>S</italic>-dimensional Gaussian process over <italic>D</italic> where, again, we consider <italic>S</italic> to be large. Coregionalization [<xref ref-type="bibr" rid="j_nejsds11_ref_031">31</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_002">2</xref>] is a convenient way to introduce dimension reduction here; we write <inline-formula id="j_nejsds11_ineq_225"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="bold">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${\mathbf{L}^{R}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> as a linear transformation of a lower dimensional (say <italic>r</italic>) process. Analogous to the nonspatial case, we write <inline-formula id="j_nejsds11_ineq_226"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="bold">Λ</mml:mi><mml:mi mathvariant="bold-italic">η</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${L^{R}}(\mathbf{s})=\boldsymbol{\Lambda }\boldsymbol{\eta }(\mathbf{s})$]]></tex-math></alternatives></inline-formula> where <bold>Λ</bold> is again, <inline-formula id="j_nejsds11_ineq_227"><alternatives>
<mml:math><mml:mi mathvariant="italic">s</mml:mi><mml:mo>×</mml:mo><mml:mi mathvariant="italic">r</mml:mi></mml:math>
<tex-math><![CDATA[$s\times r$]]></tex-math></alternatives></inline-formula> (with <inline-formula id="j_nejsds11_ineq_228"><alternatives>
<mml:math><mml:mi mathvariant="italic">r</mml:mi><mml:mo mathvariant="normal">&lt;</mml:mo><mml:mo mathvariant="normal">&lt;</mml:mo><mml:mi mathvariant="italic">S</mml:mi></mml:math>
<tex-math><![CDATA[$r<<S$]]></tex-math></alternatives></inline-formula> and now <inline-formula id="j_nejsds11_ineq_229"><alternatives>
<mml:math><mml:mi mathvariant="bold-italic">η</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\boldsymbol{\eta }(\mathbf{s})$]]></tex-math></alternatives></inline-formula> is an <italic>r</italic>-dimensional Gaussian process with its own <inline-formula id="j_nejsds11_ineq_230"><alternatives>
<mml:math><mml:mi mathvariant="italic">r</mml:mi><mml:mo>×</mml:mo><mml:mi mathvariant="italic">r</mml:mi></mml:math>
<tex-math><![CDATA[$r\times r$]]></tex-math></alternatives></inline-formula> cross-covariance function, say <inline-formula id="j_nejsds11_ineq_231"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">C</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">η</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${C_{\boldsymbol{\eta }}}(\mathbf{s},{\mathbf{s}^{\prime }})$]]></tex-math></alternatives></inline-formula>. The induced cross-covariance function for <inline-formula id="j_nejsds11_ineq_232"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="bold">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${\mathbf{L}^{R}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula>, hence for <inline-formula id="j_nejsds11_ineq_233"><alternatives>
<mml:math><mml:mi mathvariant="bold">Z</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\mathbf{Z}(\mathbf{s})$]]></tex-math></alternatives></inline-formula>, is <inline-formula id="j_nejsds11_ineq_234"><alternatives>
<mml:math><mml:mi mathvariant="bold">Λ</mml:mi><mml:msub><mml:mrow><mml:mi mathvariant="italic">C</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">η</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">Λ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[$\boldsymbol{\Lambda }{C_{\boldsymbol{\eta }}}(\mathbf{s},{\mathbf{s}^{\prime }}){\boldsymbol{\Lambda }^{T}}$]]></tex-math></alternatives></inline-formula>.</p>
<p>Choices for <inline-formula id="j_nejsds11_ineq_235"><alternatives>
<mml:math><mml:mi mathvariant="bold-italic">η</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\boldsymbol{\eta }(\mathbf{s})$]]></tex-math></alternatives></inline-formula> include supplying an <italic>r</italic>-dimensional cross-covariance function but, with dependence induced between species through <bold>Λ</bold>, independent components in <inline-formula id="j_nejsds11_ineq_236"><alternatives>
<mml:math><mml:mi mathvariant="bold-italic">η</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\boldsymbol{\eta }(\mathbf{s})$]]></tex-math></alternatives></inline-formula> are sufficient. In fact, as noted in [<xref ref-type="bibr" rid="j_nejsds11_ref_026">26</xref>], the components can be <italic>r</italic> independent replicates of a Gaussian process with common correlation function <inline-formula id="j_nejsds11_ineq_237"><alternatives>
<mml:math><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo>;</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">θ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">η</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\rho (\mathbf{s},{\mathbf{s}^{\prime }};{\theta _{\eta }})$]]></tex-math></alternatives></inline-formula>. As a result, the induced cross-covariance function for <inline-formula id="j_nejsds11_ineq_238"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="bold">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[${\mathbf{L}^{R}}(\mathbf{s})$]]></tex-math></alternatives></inline-formula>, hence for <inline-formula id="j_nejsds11_ineq_239"><alternatives>
<mml:math><mml:mi mathvariant="bold">Z</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\mathbf{Z}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> simplifies to <inline-formula id="j_nejsds11_ineq_240"><alternatives>
<mml:math><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo>;</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">θ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">η</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mi mathvariant="bold">Λ</mml:mi><mml:msup><mml:mrow><mml:mi mathvariant="bold">Λ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[$\rho (\mathbf{s},{\mathbf{s}^{\prime }};{\theta _{\eta }})\boldsymbol{\Lambda }{\boldsymbol{\Lambda }^{T}}$]]></tex-math></alternatives></inline-formula>. As far as specification for <bold>Λ</bold>, with interest in clustering, the same specification for <bold>Λ</bold>, as in the nonspatial case, using a Dirichlet process for the rows, can be employed.</p>
<p>Returning to interpretation, again we are clustering on the rows of <bold>Λ</bold>; we are clustering on the residual covariance structure. If <inline-formula id="j_nejsds11_ineq_241"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">λ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">λ</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\boldsymbol{\lambda }_{j}}={\boldsymbol{\lambda }_{{j^{\prime }}}}$]]></tex-math></alternatives></inline-formula>, then the row entries for <inline-formula id="j_nejsds11_ineq_242"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{Z}_{j}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_243"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="bold">Z</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${\mathbf{Z}_{{j^{\prime }}}}$]]></tex-math></alternatives></inline-formula> in <inline-formula id="j_nejsds11_ineq_244"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="bold">Σ</mml:mi></mml:mrow><mml:mrow><mml:mo>∗</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${\boldsymbol{\Sigma }^{\ast }}$]]></tex-math></alternatives></inline-formula> are identical; species <italic>j</italic> and <inline-formula id="j_nejsds11_ineq_245"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${j^{\prime }}$]]></tex-math></alternatives></inline-formula> have the same <italic>local</italic> dependence structure with all other species. In addition, under the dimension-reduced cross-covariance function for <inline-formula id="j_nejsds11_ineq_246"><alternatives>
<mml:math><mml:mi mathvariant="bold">Z</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\mathbf{Z}(\mathbf{s})$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds11_ineq_247"><alternatives>
<mml:math><mml:mi mathvariant="bold">Z</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$\mathbf{Z}({\mathbf{s}^{\prime }})$]]></tex-math></alternatives></inline-formula>, cov<inline-formula id="j_nejsds11_ineq_248"><alternatives>
<mml:math><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">h</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$({Z_{j}}(\mathbf{s}),{Z_{h}}({\mathbf{s}^{\prime }}))$]]></tex-math></alternatives></inline-formula> = cov<inline-formula id="j_nejsds11_ineq_249"><alternatives>
<mml:math><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mi mathvariant="bold">s</mml:mi><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">h</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="bold">s</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo><mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math>
<tex-math><![CDATA[$({Z_{{j^{\prime }}}}(\mathbf{s}),{Z_{h}}({\mathbf{s}^{\prime }}))$]]></tex-math></alternatives></inline-formula> for all <inline-formula id="j_nejsds11_ineq_250"><alternatives>
<mml:math><mml:mi mathvariant="italic">h</mml:mi><mml:mo stretchy="false">≠</mml:mo><mml:mi mathvariant="italic">j</mml:mi><mml:mo mathvariant="normal">,</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[$h\ne j,{j^{\prime }}$]]></tex-math></alternatives></inline-formula>. The spatial modeling adds the further interpretation that decay in spatial dependence, in terms of distance, for species <italic>j</italic> with all other species is identical to that for species <inline-formula id="j_nejsds11_ineq_251"><alternatives>
<mml:math><mml:msup><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:math>
<tex-math><![CDATA[${j^{\prime }}$]]></tex-math></alternatives></inline-formula> with all other species.</p>
</sec>
<sec id="j_nejsds11_s_007">
<label>7</label>
<title>Closing Comments</title>
<p>It is useful to note that the study design may add an extra level to the data, e.g., species occur on trees with trees located within sites as in [<xref ref-type="bibr" rid="j_nejsds11_ref_020">20</xref>]. Suppose then we add a subscript to the <italic>Y</italic>’s, i.e., <inline-formula id="j_nejsds11_ineq_252"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Y</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">k</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Y_{ikj}}$]]></tex-math></alternatives></inline-formula> with design level <italic>k nested</italic> within design level <italic>i</italic>. The design need not be balanced, we can have <inline-formula id="j_nejsds11_ineq_253"><alternatives>
<mml:math><mml:mi mathvariant="italic">k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mn>2</mml:mn><mml:mo mathvariant="normal">,</mml:mo><mml:mo>…</mml:mo><mml:mo mathvariant="normal">,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="italic">n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[$k=1,2,\dots ,{n_{i}}$]]></tex-math></alternatives></inline-formula>. Now, the latent <italic>Z</italic> process becomes <inline-formula id="j_nejsds11_ineq_254"><alternatives>
<mml:math><mml:msub><mml:mrow><mml:mi mathvariant="italic">Z</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">k</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${Z_{ikj}}$]]></tex-math></alternatives></inline-formula>, with analogy to (i) and (ii) above. Now, we can have <inline-formula id="j_nejsds11_ineq_255"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">k</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">F</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="bold">X</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">β</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="bold">W</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">k</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">γ</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">j</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${L_{ikj}^{F}}={\mathbf{X}_{i}^{T}}{\boldsymbol{\beta }_{j}}+{\mathbf{W}_{ik}^{T}}{\boldsymbol{\gamma }_{j}}$]]></tex-math></alternatives></inline-formula>, incorporating both design level <italic>i</italic> and design level <italic>k</italic> fixed effects. Similarly, we can have <inline-formula id="j_nejsds11_ineq_256"><alternatives>
<mml:math><mml:msubsup><mml:mrow><mml:mi mathvariant="italic">L</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">k</mml:mi><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">R</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="bold-italic">λ</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">η</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="bold-italic">λ</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi mathvariant="italic">j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">T</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">ω</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">k</mml:mi></mml:mrow></mml:msub></mml:math>
<tex-math><![CDATA[${L_{ikj}^{R}}={\boldsymbol{\lambda }_{1j}^{T}}{\boldsymbol{\eta }_{i}}+{\boldsymbol{\lambda }_{2j}^{T}}{\boldsymbol{\omega }_{ik}}$]]></tex-math></alternatives></inline-formula>, incorporating design level <italic>i</italic> and design level <italic>k</italic> random effects. All of the foregoing discussion involving issues (i)–(v) above, both spatial and nonspatial, can be elaborated to this setting.</p>
<p>We anticipate that areas for future work will include further development for: (i) issues of missed detection or mis-classification [<xref ref-type="bibr" rid="j_nejsds11_ref_030">30</xref>], (ii) bringing in trait information including intra-specific trait variation [<xref ref-type="bibr" rid="j_nejsds11_ref_024">24</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_019">19</xref>], (iii) introduction of dynamics, i.e., data over time as well as over space [<xref ref-type="bibr" rid="j_nejsds11_ref_028">28</xref>], (iv) data types beyond presence/absence, e.g., abundance or composition data [<xref ref-type="bibr" rid="j_nejsds11_ref_009">9</xref>], and, perhaps most importantly, (v) faster and more efficient computation [<xref ref-type="bibr" rid="j_nejsds11_ref_033">33</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_028">28</xref>, <xref ref-type="bibr" rid="j_nejsds11_ref_021">21</xref>]. Evidently, there is still much life in attempting to explain joint distribution of plant life.</p>
</sec>
</body>
<back>
<ack id="j_nejsds11_ack_001">
<title>Acknowledgements</title>
<p>Thanks go to Jim Clark, Otso Ovaskainen, Erin Schliep, Shinichiro Shirota, John Silander, Daniel Taylor-Rodriguez, and Gleb Tikhonov for numerous conversations which opened up the issues taken up here and led to the attempt to bring some further clarity to them.</p></ack>
<ref-list id="j_nejsds11_reflist_001">
<title>References</title>
<ref id="j_nejsds11_ref_001">
<label>[1]</label><mixed-citation publication-type="journal"> <string-name><surname>Aarts</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Fieberg</surname>, <given-names>J.</given-names></string-name> and <string-name><surname>Matthiopoulos</surname>, <given-names>J.</given-names></string-name> (<year>2012</year>). <article-title>Comparative interpretation of count, presence-absence and point methods for species distribution models</article-title>. <source>Methods in Ecology and Evolution</source> <volume>3</volume>, <fpage>177</fpage>–<lpage>187</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_002">
<label>[2]</label><mixed-citation publication-type="book"> <string-name><surname>Banerjee</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Carlin</surname>, <given-names>B. P.</given-names></string-name> and <string-name><surname>Gelfand</surname>, <given-names>A. E.</given-names></string-name> (<year>2014</year>). <source>Hierarchical modeling and analysis for spatial data</source>. <edition>2</edition>nd edn. <publisher-name>Chapman &amp; Hall/CRC</publisher-name>, <publisher-loc>Boca Raton, FL, USA</publisher-loc>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3362184">MR3362184</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds11_ref_003">
<label>[3]</label><mixed-citation publication-type="journal"> <string-name><surname>Barner</surname>, <given-names>A. K.</given-names></string-name>, <string-name><surname>Coblentz</surname>, <given-names>K. E.</given-names></string-name>, <string-name><surname>Hacker</surname>, <given-names>S. D.</given-names></string-name> and <string-name><surname>Menge</surname>, <given-names>B. A.</given-names></string-name> (<year>2018</year>). <article-title>Fundamental contradictions among observational and experimental estimates of non-trophic species interactions</article-title>. <source>Ecology</source> <volume>99</volume>, <fpage>557</fpage>–<lpage>566</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_004">
<label>[4]</label><mixed-citation publication-type="journal"> <string-name><surname>Bhattacharya</surname>, <given-names>A.</given-names></string-name> and <string-name><surname>Dunson</surname>, <given-names>D. B.</given-names></string-name> (<year>2011</year>). <article-title>Sparse Bayesian infinite factor models</article-title>. <source>Biometrika</source> <fpage>291</fpage>–<lpage>306</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1093/biomet/asr013" xlink:type="simple">https://doi.org/10.1093/biomet/asr013</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2806429">MR2806429</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds11_ref_005">
<label>[5]</label><mixed-citation publication-type="journal"> <string-name><surname>Blanchet</surname>, <given-names>F. G.</given-names></string-name>, <string-name><surname>Cazelles</surname>, <given-names>K.</given-names></string-name> and <string-name><surname>Gravel</surname>, <given-names>D.</given-names></string-name> (<year>2020</year>). <article-title>Co-occurrence is not evidence of ecological interactions</article-title>. <source>Ecology Letters</source> <volume>23</volume>, <fpage>1050</fpage>–<lpage>1063</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_006">
<label>[6]</label><mixed-citation publication-type="journal"> <string-name><surname>Calabrese</surname>, <given-names>J. M.</given-names></string-name>, <string-name><surname>Certain</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Kraan</surname>, <given-names>C.</given-names></string-name> and <string-name><surname>Dormann</surname>, <given-names>C. F.</given-names></string-name> (<year>2014</year>). <article-title>Stacking species distribution models and adjusting bias by linking them to macroecological models</article-title>. <source>Global Ecology and Biogeography</source> <volume>23</volume>(<issue>1</issue>) <fpage>99</fpage>–<lpage>112</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_007">
<label>[7]</label><mixed-citation publication-type="journal"> <string-name><surname>Chakraborty</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Gelfand</surname>, <given-names>A. E.</given-names></string-name>, <string-name><surname>Wilson</surname>, <given-names>A. M.</given-names></string-name>, <string-name><surname>Latimer</surname>, <given-names>A. M.</given-names></string-name> and <string-name><surname>Silander</surname>, <given-names>J. A.</given-names></string-name> (<year>2011</year>). <article-title>Point pattern modelling for degraded presence-only data over large regions</article-title>. <source>JRSS-C</source> <volume>60</volume>, <fpage>757</fpage>–<lpage>776</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/j.1467-9876.2011.00769.x" xlink:type="simple">https://doi.org/10.1111/j.1467-9876.2011.00769.x</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2844854">MR2844854</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds11_ref_008">
<label>[8]</label><mixed-citation publication-type="journal"> <string-name><surname>Chib</surname>, <given-names>S.</given-names></string-name> and <string-name><surname>Greenberg</surname>, <given-names>E.</given-names></string-name> (<year>1998</year>). <article-title>Analysis of multivariate probit models</article-title>. <source>Biometrika</source> <volume>85</volume>(<issue>2</issue>) <fpage>347</fpage>–<lpage>361</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_009">
<label>[9]</label><mixed-citation publication-type="journal"> <string-name><surname>Clark</surname>, <given-names>J. S.</given-names></string-name>, <string-name><surname>Nemergut</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Seyednasrollah</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Turner</surname>, <given-names>P. J.</given-names></string-name> and <string-name><surname>Zhang</surname>, <given-names>S.</given-names></string-name> (<year>2017</year>). <article-title>Generalized joint attribute modeling for biodiversity analysis: Median-zero, multivariate, multifarious data</article-title>. <source>Ecological Monographs</source> <volume>87</volume>(<issue>1</issue>) <fpage>34</fpage>–<lpage>56</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_010">
<label>[10]</label><mixed-citation publication-type="journal"> <string-name><surname>De Oliveira</surname>, <given-names>V.</given-names></string-name> (<year>2000</year>). <article-title>Bayesian prediction of clipped Gaussian random fields</article-title>. <source>CompStatDataAnal</source> <volume>34</volume>, <fpage>299</fpage>–<lpage>314</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_011">
<label>[11]</label><mixed-citation publication-type="journal"> <string-name><surname>Gelfand</surname>, <given-names>A. E.</given-names></string-name> and <string-name><surname>Shirota</surname>, <given-names>S.</given-names></string-name> (<year>2019</year>). <article-title>Preferential sampling for presence/absence data and for fusion of presence/absence data with presence–only data</article-title>. <source>Ecological Monographs</source> <volume>89</volume>(<issue>3</issue>).</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_012">
<label>[12]</label><mixed-citation publication-type="journal"> <string-name><surname>Gelfand</surname>, <given-names>A. E.</given-names></string-name> and <string-name><surname>Shirota</surname>, <given-names>S.</given-names></string-name> (<year>2021</year>). <article-title>On the role of odds ratios in joint species distribution modeling</article-title>. <source>Environmental and Ecological Statistics</source> (forthcoming).</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_013">
<label>[13]</label><mixed-citation publication-type="journal"> <string-name><surname>Guisan</surname>, <given-names>A.</given-names></string-name> and <string-name><surname>Rahbek</surname>, <given-names>C.</given-names></string-name> (<year>2011</year>). <article-title>SESAM–a new framework integrating macroecological and species distribution models for predicting spatio-temporal patterns of species assemblages</article-title>. <source>Journal of Biogeography</source> <volume>38</volume>(<issue>8</issue>) <fpage>1433</fpage>–<lpage>1444</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_014">
<label>[14]</label><mixed-citation publication-type="other"> <string-name><surname>Hefley</surname>, <given-names>T.</given-names></string-name> (2020). Model selection for ecological community data using tree shrinkage priors. ArXiv preprint, <ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2005.14303">2005.14303</ext-link>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_015">
<label>[15]</label><mixed-citation publication-type="book"> <string-name><surname>Hooten</surname>, <given-names>M. B.</given-names></string-name>, <string-name><surname>Johnson</surname>, <given-names>D. S.</given-names></string-name>, <string-name><surname>McClintock</surname>, <given-names>B. T.</given-names></string-name> and <string-name><surname>Morales</surname>, <given-names>J. M.</given-names></string-name> (<year>2017</year>). <source>Animal Movement: Statistical Models for Telemetry Data</source>. <publisher-name>CRC Press</publisher-name>, <publisher-loc>Boca Raton</publisher-loc>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3889901">MR3889901</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds11_ref_016">
<label>[16]</label><mixed-citation publication-type="journal"> <string-name><surname>Johnson</surname>, <given-names>D. S.</given-names></string-name> and <string-name><surname>Sinclair</surname>, <given-names>E. H.</given-names></string-name> (<year>2017</year>). <article-title>Modeling joint abundance of multiple species using Dirichlet process mixtures</article-title>. <source>Environmetrics</source> <volume>28</volume>, <fpage>e2440</fpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1002/env.2440" xlink:type="simple">https://doi.org/10.1002/env.2440</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3634110">MR3634110</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds11_ref_017">
<label>[17]</label><mixed-citation publication-type="journal"> <string-name><surname>Lawrence</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Bingham</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Liu</surname>, <given-names>C.</given-names></string-name> and <string-name><surname>Nair</surname>, <given-names>V. N.</given-names></string-name> (<year>2008</year>). <article-title>Bayesian inference for multivariate ordinal data using parameter expansion</article-title>. <source>Technometrics</source> <volume>50</volume>(<issue>2</issue>) <fpage>182</fpage>–<lpage>191</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1198/004017008000000064" xlink:type="simple">https://doi.org/10.1198/004017008000000064</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2439877">MR2439877</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds11_ref_018">
<label>[18]</label><mixed-citation publication-type="journal"> <string-name><surname>Liu</surname>, <given-names>J. S.</given-names></string-name> and <string-name><surname>Wu</surname>, <given-names>Y. N.</given-names></string-name> (<year>1999</year>). <article-title>Parameter expansion for data augmentation</article-title>. <source>Journal of the American Statistical Association</source> <volume>94</volume>(<issue>448</issue>) <fpage>1264</fpage>–<lpage>1274</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.2307/2669940" xlink:type="simple">https://doi.org/10.2307/2669940</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1731488">MR1731488</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds11_ref_019">
<label>[19]</label><mixed-citation publication-type="journal"> <string-name><surname>Ovaskainen</surname>, <given-names>O.</given-names></string-name>, <string-name><surname>Roy</surname>, <given-names>D. B.</given-names></string-name>, <string-name><surname>Fox</surname>, <given-names>R.</given-names></string-name> and <string-name><surname>Anderson</surname>, <given-names>B. J.</given-names></string-name> (<year>2016</year>). <article-title>Uncovering hidden spatial structure in species communities with spatially explicit joint species distribution models</article-title>. <source>Methods in Ecology and Evolution</source> <volume>7</volume>(<issue>4</issue>) <fpage>428</fpage>–<lpage>436</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_020">
<label>[20]</label><mixed-citation publication-type="journal"> <string-name><surname>Ovaskainen</surname>, <given-names>O.</given-names></string-name>, <string-name><surname>Tikhonov</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Norberg</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Guillaume Blanchet</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Duan</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Dunson</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Roslin</surname>, <given-names>T.</given-names></string-name> and <string-name><surname>Abrego</surname>, <given-names>N.</given-names></string-name> (<year>2017</year>). <article-title>How to make more out of community data? A conceptual framework and its implementation as models and software</article-title>. <source>Ecology Letters</source> <volume>20</volume>(<issue>5</issue>) <fpage>561</fpage>–<lpage>576</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_021">
<label>[21]</label><mixed-citation publication-type="journal"> <string-name><surname>Pichler</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Boreux</surname>, <given-names>V.</given-names></string-name>, <string-name><surname>Klein</surname>, <given-names>A. M.</given-names></string-name>, <string-name><surname>Schleuning</surname>, <given-names>M.</given-names></string-name> and <string-name><surname>Hartig</surname>, <given-names>F.</given-names></string-name> (<year>2019</year>). <article-title>Machine learning algorithms to infer trait-matching and predict species interactions in ecological networks</article-title>. <source>Methods in Ecology and Evolution</source> <volume>11</volume>, <fpage>281</fpage>–<lpage>293</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_022">
<label>[22]</label><mixed-citation publication-type="journal"> <string-name><surname>Pollock</surname>, <given-names>L. J.</given-names></string-name>, <string-name><surname>Tingley</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Morris</surname>, <given-names>W. K.</given-names></string-name>, <string-name><surname>Golding</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>O’Hara</surname>, <given-names>R. B.</given-names></string-name>, <string-name><surname>Parris</surname>, <given-names>K. M.</given-names></string-name>, <string-name><surname>Vesk</surname>, <given-names>P. A.</given-names></string-name> and <string-name><surname>McCarthy</surname>, <given-names>M. A.</given-names></string-name> (<year>2014</year>). <article-title>Understanding co-occurrence by modelling species simultaneously with a Joint Species Distribution Model (JSDM)</article-title>. <source>Methods in Ecology and Evolution</source> <volume>5</volume>(<issue>5</issue>) <fpage>397</fpage>–<lpage>406</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_023">
<label>[23]</label><mixed-citation publication-type="journal"> <string-name><surname>Ren</surname>, <given-names>Q.</given-names></string-name> and <string-name><surname>Banerjee</surname>, <given-names>S.</given-names></string-name> (<year>2013</year>). <article-title>Hierarchical factor models for large spatially misaligned data: a low-rank predictive process approach</article-title>. <source>Biometrics</source> <volume>69</volume>(<issue>1</issue>) <fpage>19</fpage>–<lpage>30</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/j.1541-0420.2012.01832.x" xlink:type="simple">https://doi.org/10.1111/j.1541-0420.2012.01832.x</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3058048">MR3058048</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds11_ref_024">
<label>[24]</label><mixed-citation publication-type="journal"> <string-name><surname>Schliep</surname>, <given-names>E. M.</given-names></string-name>, <string-name><surname>Gelfand</surname>, <given-names>A. E.</given-names></string-name>, <string-name><surname>Mitchell</surname>, <given-names>R. M.</given-names></string-name>, <string-name><surname>Aiello-Lammens</surname>, <given-names>M. E.</given-names></string-name> and <string-name><surname>Silander Jr</surname>, <given-names>J. A.</given-names></string-name> (<year>2018</year>). <article-title>Assessing the joint behaviour of species traits as filtered by environment</article-title>. <source>Methods in Ecology and Evolution</source> <volume>9</volume>(<issue>3</issue>) <fpage>716</fpage>–<lpage>727</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_025">
<label>[25]</label><mixed-citation publication-type="journal"> <string-name><surname>Sethuraman</surname>, <given-names>J.</given-names></string-name> (<year>1994</year>). <article-title>A constructive definition of Dirichlet priors</article-title>. <source>Statistica sinica</source> <fpage>639</fpage>–<lpage>650</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1309433">MR1309433</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds11_ref_026">
<label>[26]</label><mixed-citation publication-type="journal"> <string-name><surname>Shirota</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Gelfand</surname>, <given-names>A. E.</given-names></string-name> and <string-name><surname>Banerjee</surname>, <given-names>S.</given-names></string-name> (<year>2018</year>). <article-title>Spatial Joint Species Distribution Modeling using Dirichlet Processes</article-title>. <source>Statistica Sinica</source>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3932512">MR3932512</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds11_ref_027">
<label>[27]</label><mixed-citation publication-type="journal"> <string-name><surname>Taylor-Rodriguez</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Kaufeld</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Schliep</surname>, <given-names>E. M.</given-names></string-name>, <string-name><surname>Clark</surname>, <given-names>J. S.</given-names></string-name> and <string-name><surname>Gelfand</surname>, <given-names>A. E.</given-names></string-name> (<year>2017</year>). <article-title>Joint species distribution modeling: dimension reduction using Dirichlet processes</article-title>. <source>Bayesian Analysis</source> <volume>12</volume>(<issue>4</issue>) <fpage>939</fpage>–<lpage>967</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/16-BA1031" xlink:type="simple">https://doi.org/10.1214/16-BA1031</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3724974">MR3724974</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds11_ref_028">
<label>[28]</label><mixed-citation publication-type="other"> <string-name><surname>Tikhonov</surname>, <given-names>G.</given-names></string-name> (2018). Bayesian latent factor approaches for modeling ecological species communities. Technical Report, Ph.D. Thesis, Faculty of Biological and Environmental Sciences, University of Helsinki.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_029">
<label>[29]</label><mixed-citation publication-type="journal"> <string-name><surname>Thorson</surname>, <given-names>J. T.</given-names></string-name>, <string-name><surname>Scheuerell</surname>, <given-names>M. D.</given-names></string-name>, <string-name><surname>Shelton</surname>, <given-names>A. O.</given-names></string-name>, <string-name><surname>See</surname>, <given-names>K. E.</given-names></string-name>, <string-name><surname>Skaug</surname>, <given-names>H. J.</given-names></string-name> and <string-name><surname>Kristensen</surname>, <given-names>K.</given-names></string-name> (<year>2015</year>). <article-title>Spatial factor analysis: a new tool for estimating joint species distributions and correlations in species range</article-title>. <source>Methods in Ecology and Evolution</source> <volume>6</volume>(<issue>6</issue>) <fpage>627</fpage>–<lpage>637</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_030">
<label>[30]</label><mixed-citation publication-type="journal"> <string-name><surname>Tobler</surname>, <given-names>M. W.</given-names></string-name>, <string-name><surname>Kery</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Hui</surname>, <given-names>F. K. C.</given-names></string-name>, <string-name><surname>Guillera-Arroita</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Knaus</surname>, <given-names>P.</given-names></string-name> and <string-name><surname>Sattler</surname>, <given-names>T.</given-names></string-name> (<year>2019</year>). <article-title>Joint species distribution models with species correlations and imperfect detection</article-title>. <source>Ecology</source> <volume>100</volume>, <fpage>e02754</fpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_031">
<label>[31]</label><mixed-citation publication-type="book"> <string-name><surname>Wackernagel</surname>, <given-names>H.</given-names></string-name> (<year>2003</year>). <source>Multivariate Geostatistics: An Introduction with Applications</source>. <edition>3</edition>rd edn. <publisher-name>Springer-Verlag</publisher-name>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_032">
<label>[32]</label><mixed-citation publication-type="journal"> <string-name><surname>Warton</surname>, <given-names>D. I.</given-names></string-name> and <string-name><surname>Shepherd</surname>, <given-names>L. C.</given-names></string-name> (<year>2010</year>). <article-title>Poisson point process models solve the pseudo-absence problem for presence-only data in ecology</article-title>. <source>AnnalsAppldStats</source> <volume>4</volume>, <fpage>1383</fpage>–<lpage>1402</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/10-AOAS331" xlink:type="simple">https://doi.org/10.1214/10-AOAS331</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2758333">MR2758333</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds11_ref_033">
<label>[33]</label><mixed-citation publication-type="journal"> <string-name><surname>Wilkinson</surname>, <given-names>D. P.</given-names></string-name>, <string-name><surname>Golding</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Guillera-Arroita</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Tingley</surname>, <given-names>R.</given-names></string-name> and <string-name><surname>McCarthy</surname>, <given-names>M. A.</given-names></string-name> (<year>2018</year>). <article-title>A comparison of joint species distribution models for presence–absence data</article-title>. <source>Methods in Ecology and Evolution</source>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_034">
<label>[34]</label><mixed-citation publication-type="journal"> <string-name><surname>Wilkinson</surname>, <given-names>D. P.</given-names></string-name>, <string-name><surname>Golding</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Guillera-Arroita</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Tingley</surname>, <given-names>R.</given-names></string-name> and <string-name><surname>McCarthy</surname>, <given-names>M. A.</given-names></string-name> (<year>2021</year>). <article-title>Defining and evaluating predictions of joint species distribution models</article-title>. <source>Methods in Ecology and Evolution</source> <volume>12</volume>, <fpage>394</fpage>–<lpage>404</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds11_ref_035">
<label>[35]</label><mixed-citation publication-type="journal"> <string-name><surname>Zobel</surname>, <given-names>M.</given-names></string-name> (<year>1997</year>). <article-title>The relative of species pools in determining plant species richness: an alternative explanation of species coexistence?</article-title> <source>Trends in Ecology &amp; Evolution</source> <volume>12</volume>(<issue>7</issue>) <fpage>266</fpage>–<lpage>269</lpage>.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
