Some Noteworthy Issues in Joint Species Distribution Modeling for Plant Data
Volume 1, Issue 1 (2023), pp. 102–109
Pub. online: 19 October 2022
Type: Spatial And Environmental Statistics
Open Access
Accepted
21 July 2022
21 July 2022
Published
19 October 2022
19 October 2022
Abstract
Joint species distribution modeling is attracting increasing attention in the literature these days, recognizing the fact that single species modeling fails to take into account expected dependence/interaction between species. This short paper offers discussion that attempts to illuminate five noteworthy technical issues associated with such modeling in the context of plant data. In this setting, the joint species distribution work in the literature considers several types of species data collection. For convenience of discussion, we focus on joint modeling of presence/absence data. For such data, the primary modeling strategy has been through introduction of latent multivariate normal random variables.
These issues address the following: (i) how the observed presence/absence data is linked to the latent normal variables as well as the resulting implications with regard to modeling the data sites as independent or spatially dependent, (ii) the incompatibility of point referenced and areal referenced presence/absence data in spatial modeling of species distribution, (iii) the effect of modeling species independently/marginally rather than jointly within site, with regard to assessing species distribution, (iv) the interpretation of species dependence under the use of latent multivariate normal specification, and (v) the interpretation of clustering of species associated with specific joint species distribution modeling specifications.
It is hoped that, by attempting to clarify these issues, ecological modelers and quantitative ecologists will be able to better appreciate some subtleties that are implicit in this growing collection of modeling ideas. In this regard, this paper can serve as a useful companion piece to the recent survey/comparison article by [33] in Methods in Ecology and Evolution.
References
Banerjee, S., Carlin, B. P. and Gelfand, A. E. (2014). Hierarchical modeling and analysis for spatial data. 2nd edn. Chapman & Hall/CRC, Boca Raton, FL, USA. MR3362184
Bhattacharya, A. and Dunson, D. B. (2011). Sparse Bayesian infinite factor models. Biometrika 291–306. https://doi.org/10.1093/biomet/asr013. MR2806429
Chakraborty, A., Gelfand, A. E., Wilson, A. M., Latimer, A. M. and Silander, J. A. (2011). Point pattern modelling for degraded presence-only data over large regions. JRSS-C 60, 757–776. https://doi.org/10.1111/j.1467-9876.2011.00769.x. MR2844854
Hefley, T. (2020). Model selection for ecological community data using tree shrinkage priors. ArXiv preprint, 2005.14303.
Hooten, M. B., Johnson, D. S., McClintock, B. T. and Morales, J. M. (2017). Animal Movement: Statistical Models for Telemetry Data. CRC Press, Boca Raton. MR3889901
Johnson, D. S. and Sinclair, E. H. (2017). Modeling joint abundance of multiple species using Dirichlet process mixtures. Environmetrics 28, e2440. https://doi.org/10.1002/env.2440. MR3634110
Lawrence, E., Bingham, D., Liu, C. and Nair, V. N. (2008). Bayesian inference for multivariate ordinal data using parameter expansion. Technometrics 50(2) 182–191. https://doi.org/10.1198/004017008000000064. MR2439877
Liu, J. S. and Wu, Y. N. (1999). Parameter expansion for data augmentation. Journal of the American Statistical Association 94(448) 1264–1274. https://doi.org/10.2307/2669940. MR1731488
Ren, Q. and Banerjee, S. (2013). Hierarchical factor models for large spatially misaligned data: a low-rank predictive process approach. Biometrics 69(1) 19–30. https://doi.org/10.1111/j.1541-0420.2012.01832.x. MR3058048
Sethuraman, J. (1994). A constructive definition of Dirichlet priors. Statistica sinica 639–650. MR1309433
Shirota, S., Gelfand, A. E. and Banerjee, S. (2018). Spatial Joint Species Distribution Modeling using Dirichlet Processes. Statistica Sinica. MR3932512
Taylor-Rodriguez, D., Kaufeld, K., Schliep, E. M., Clark, J. S. and Gelfand, A. E. (2017). Joint species distribution modeling: dimension reduction using Dirichlet processes. Bayesian Analysis 12(4) 939–967. https://doi.org/10.1214/16-BA1031. MR3724974
Warton, D. I. and Shepherd, L. C. (2010). Poisson point process models solve the pseudo-absence problem for presence-only data in ecology. AnnalsAppldStats 4, 1383–1402. https://doi.org/10.1214/10-AOAS331. MR2758333