The occARU model • occARU

occARU implements a Bayesian multispecies, multiseason occupancy model with a count observation model for autonomous recording unit (ARU) data such as camera traps and passive acoustic monitoring (PAM). Sites are defined as individual ARUs, each producing time-stamped images or recordings with species labels and individual counts. These counts are then binned into survey occasions spanning an arbitrary number of days, with detections thinned to appropriate intervals. The model is implemented in Stan (Carpenter et al. 2017) via cmdstanr (Gabry et al. 2025), using Hamiltonian Monte Carlo (HMC) to sample from the posterior. The model automatically detects the number of species and seasons (per site), and selects the appropriate model structure accordingly. The following sections describe each model component in turn; Stan programs and associated functions are available on GitHub.

Occupancy

To accommodate structural zeros arising from species absences at the site level, we model the occupancy of species s \in 1, \dots, S at site i \in 1, \dots, I as a latent Bernoulli variable, where z_{is} = 1 indicates occupancy:

\begin{aligned} z_{is} &\sim \mathrm{Bernoulli} \left( \psi_{is} \right) \\ \psi_{is} &= \operatorname{logit}^{-1} \left( {\alpha_\psi}_{[s]} + {\boldsymbol{X_1}}_{[i]} {\boldsymbol{\beta_\psi}}_{[s]} \right), \end{aligned} \tag{1}

where \boldsymbol{\alpha_\psi} are species-level intercepts, \boldsymbol{\beta_\psi} are species-level coefficients and \boldsymbol{X_1} represents a design matrix. occARU accommodates three types of predictors: continuous, categorical, and ordinal (ordered categorical). Categorical coefficients are modeled as zero-sum Gaussian variables, and ordinal coefficients are parameterised via a simplex decomposition of the category differences (Bürkner and Charpentier 2020), where the coefficient corresponds to the effect of the highest category.

Species-specific coefficients are modeled as multivariate normal, facilitating inference on correlations in predictor responses across species. Spatial effects on occupancy in single season models are not included by design because they are weakly identified in the absence of repeated measures of occupancy at the site level (Doser et al. 2022). Future versions of occARU will include multi-season occupancy models, where spatial random effects are more appropriate.

Stan does not accommodate discrete parameters (z_{is}) and they are marginalised out of the likelihood; however, posterior inference on underlying occupancy states is performed through the forward-backward sampling algorithm by setting the argument latent = TRUE in occARU().

Detection

The species counts during surveys j \in 1, \dots, J produce a multi-species time series \boldsymbol{y}_{ij} = (y_{ij1}, \dots, y_{ijS}). Rather than reducing counts to binary detection/non-detection as in traditional occupancy models (MacKenzie et al. 2002), occARU models the counts directly, capitalising on the magnitude of detections. Conditional on z_{is} = 1, detections are modeled as Poisson by default:

\begin{aligned} y_{ijs} \mid z_{is} = 1 &\sim \mathrm{Poisson} \left( \mu_{ijs} \right) \\ \mu_{ijs} &= \exp \Bigl( {\alpha_\mu}_{[s]} + {\boldsymbol{X_2}}_{[i]} {\boldsymbol{\beta_\mu}}_{[s]} + {\boldsymbol{X_3}}_{[ij]} \boldsymbol{\gamma}_{[s]} + \iota_{is} +\kappa_{js} + \log \Delta_{ij}\Bigr). \end{aligned} \tag{2}

Here, \Delta_{ij} are the proportion of each survey that the site was actively sampling, taking on values between 0 and 1, used as offsets for the detection rates \boldsymbol{\mu}_{ij}. The species-specific intercepts \boldsymbol{\alpha_\mu} are modeled jointly with occupancy intercepts as bivariate normal to facilitate estimating the occupancy/detection correlation across species:

\begin{pmatrix} {\alpha_\psi}_{[s]} \\ {\alpha_\mu}_{[s]} \end{pmatrix} \sim \mathrm{MVNormal} \left[ \begin{pmatrix} \operatorname{logit} \bar{\psi} \\ \log \bar{\mu} \end{pmatrix}, \begin{pmatrix} \tau_\psi^2 & \rho \tau_\psi \tau_\mu \\ \rho \tau_\psi \tau_\mu & \tau_\mu^2 \end{pmatrix} \right]. \tag{3}

As with occupancy (Equation 1), \boldsymbol{\beta_\mu} are species-level coefficients and \boldsymbol{X_2} represents a site-level design matrix. Additionally, \boldsymbol{\gamma} are species-level coefficients for the site-by-survey design matrix \boldsymbol{X_3}, used for predictors that vary spatio-temporally. Again, \boldsymbol{\beta_\mu} and \boldsymbol{\gamma} are modeled as multivariate normal to explore interspecific correlations in detection rate responses to site-level and site-by-survey-level predictors. Both \boldsymbol{X_2} and \boldsymbol{X_3} accommodate continuous, categorical, and ordinal predictors.

Site and survey random effects

To handle repeated measures at the level of sites and surveys, site (\boldsymbol{\iota}_i) and survey (\boldsymbol{\kappa}_j) random effects are included by default. These are modeled as matrix normal distributions, where the row covariances are Gaussian process (GP) kernels capturing spatial and temporal structure, and the column covariances are interspecific covariance matrices capturing correlations across species. This formulation accommodates effects that may differ in magnitude across species but are nevertheless correlated:

\begin{aligned} \boldsymbol{\iota}_i &\sim \mathrm{MatrixNormal} \left( \bar{\iota}_i, \boldsymbol{K_{\iota}}, \Sigma_{\iota} \right) \\ \boldsymbol{\kappa}_j &\sim \mathrm{MatrixNormal} \left( \bar{\kappa}_j, \boldsymbol{K_{\kappa}}, \Sigma_{\kappa} \right). \end{aligned}

Here, \bar{\boldsymbol{\iota}} and \bar{\boldsymbol{\kappa}} are mean site and survey effects, themselves modeled as GPs; \boldsymbol{K_{\iota}} and \boldsymbol{K_{\kappa}} are I \times I and J \times J GP kernels; and \Sigma_\iota and \Sigma_\kappa are S \times S interspecific covariance matrices, respectively. Users can specify whether to use exponentiated quadratic, Matern 3/2, or Matern 5/2 kernels, where the default exponentiated quadratic kernel function is:

K_{n, n'} = \tau^2 \exp \left(- \frac{\lVert \boldsymbol{x}_n - \boldsymbol{x}_{n'} \rVert^2}{2\ell^2} \right), \tag{4}

where \lVert \boldsymbol{x}_n - \boldsymbol{x}_{n'} \rVert is the Euclidean distance between inputs (site UTMs in kms or survey index), \tau is the marginal scale of the GP, and \ell is the length scale controlling the rate of decay. occARU facilitates fitting species-specific GP kernels; however, this requires S additional Cholesky decompositions for that GP. By default, the same kernel is used for both the mean and species-specific GP realisations, with realisations differing only in magnitude through species-specific scales, requiring only one Cholesky decomposition per GP.

occARU has the option to include a periodic kernel in the temporal GP kernel \boldsymbol{K_\kappa} (by setting random = list(survey = gp(periodic = TRUE)) in occARU()):

K_{n, n'} = \tau^2 \exp \left(- \frac{2 \sin^2 \left( \pi \lvert x_n - x_{n'} \rvert / p \right)}{\ell^2} \right), \tag{5}

where \tau and \ell have the same interpretation as above and p is the period in survey units, which is by default automatically determined assuming an annual cycle (e.g., p = 52 with weekly surveys). Because GP kernels are closed under addition, a single Cholesky decomposition is required. The variance partition of the two GP kernels is handled via a simplex \boldsymbol{K_\phi} = \left({K_\phi}_{[1]}, {K_\phi}_{[2]}\right), \sum_{g=1}^2 {K_\phi}_{[g]} = 1.

Site and survey random effects can be fully disabled or fitted without GPs, collapsing to multivariate normals, with the argument random = list(site = c("mvn", "none"), survey = c("mvn", "none")) in occARU(). Identifiability of the matrix normal scales is ensured by fixing the marginal scale of the GP kernels to 1.

Confounding of fixed and random effects

When predictors are included alongside random site or survey effects, the fixed and random effects can be confounded because the random effects absorb part of the variance that the fixed effects are trying to explain (Hodges and Reich 2010). In practice, fixed effect coefficients are often absorbed into the random effects, which capture most of the variance. In addition to the “conditional” fixed effect coefficients (conditioned on the presence of the random effect), an alternative quantity reported by occARU are the “unconditional” coefficients (Hanks et al. 2015), e.g., \boldsymbol{\beta_\mu} + \left( \boldsymbol{X_2}^\prime \boldsymbol{X_2} \right)^{-1} \boldsymbol{X_2}^\prime \boldsymbol{\iota}. Unconditional coefficients are available for both site and survey predictors when random effects are included, and can be plotted with plot_coefficients(fit, unconditional = TRUE).

Overdispersion and residual species interactions

Two extensions beyond the basic Poisson observation model are available via the overdispersion = c("none", "nb", "olre") argument in occARU():

Negative binomial (overdispersion = "nb"): a species-specific overdispersion parameter \phi_s is estimated, so that y_{ijs} \mid z_{is} = 1 \sim \mathrm{NegBin}(\mu_{ijs}, \phi_s).
Observation-level random effects (OLREs) (overdispersion = "olre"): multivariate normal residuals \boldsymbol{\epsilon}_{ij} = (\epsilon_{ij1}, \dots, \epsilon_{ijS}) are added to the linear predictor, yielding a Poisson-lognormal model. The OLREs are modeled hierarchically \boldsymbol{\epsilon}_{ij} \sim \mathrm{MVN} \left( \bar\epsilon_{ij}, \Sigma_\epsilon \right), where \boldsymbol{\bar\epsilon} are mean residuals shared across species at each site-survey combination, capturing detection anomalies common to all species, and the covariance matrix \Sigma_\epsilon captures residual interspecific correlations at the observation level—the site-by-survey-specific detection counts—which may reflect direct species interactions such as predator–prey dynamics.

Variance decomposition

The model uses global-local shrinkage priors for variance components of both occupancy and detection submodels (Aguilar and Bürkner 2023). This accommodates a wide range of model complexity (i.e., various number of covariates with different random effects) in a principled unified prior framework. occARU places half-Student-t priors on the variance of the linear predictor of occupancy log odds (W_\psi) and log detection rates (W_\mu). These variances are then simplex-decomposed between the different model components (e.g., species-level intercepts, hierarchical coefficients, random site and survey effects, and potentially Poisson OLREs). The occARU model automatically determines the number of variance partitions for each submodel (V_\psi and V_\mu) and produces the prior scales using the V-length simplexes \boldsymbol{\phi_\psi} and \boldsymbol{\phi_\mu} as, e.g., \boldsymbol{\tau_\mu} := \left(\tau_1, \dots, \tau_{V_\mu} \right) = \sqrt{W_\mu \cdot \boldsymbol{\phi_\mu}}.

Variance decomposition can be done with either Dirichlet or zero-sum logistic-normal distributions for \boldsymbol{\phi}, set via the variance_decomposition = c("dirichlet", "logistic-normal") argument in occARU(). In either case, the hyperparameters \theta_\psi and \theta_\mu control the concentration or sparsity of the simplex: in Dirichlet models, \theta is the inverse concentration, and in logistic-normal models, it is the scale. For both underlying distributions, larger values of \theta correspond to a more sparse simplex (i.e., where a small number of components account for most of the variance), with smaller values corresponding to more concentrated simplexes (i.e., where different model components more evenly account for the variance).

Variance decomposition is also used for species-specific scales in site and survey effects (and potentially OLREs). The scales vector \boldsymbol{\phi_\mu} contains one partition for the species-level deviations from the mean function for each, which are then simplex decomposed to produce the species-specific scales. Parameter names in the fit objects are iota_t, kappa_t, and epsilon_t, respectively.

Multiseason models

Deployments may be spread across multiple distinct seasons k \in 1, \dots, K, typically with the sites sampled repeatedly. occARU accommodates two types of multiseason data:

When sites are sampled once, but deployments are spread across seasons. This is modeled with a single season occupancy model, except that survey predictors and random effects are now spread across multiple seasons.
When sites are sampled more than once, for an arbitrary subset of the total number of seasons.

Continuous time dynamic occupancy

occARU automatically detects the deployment history of each site and sites sampled in more than one season are modelled with dynamic occupancy (MacKenzie et al. 2003) with a continuous time transition process (with subscripts for i, k, and s dropped on \boldsymbol{P} and \boldsymbol{Q}):

\begin{aligned} z_{1is} &\sim \mathrm{Categorical} \left( 1 - \psi_{is}, \psi_{is} \right) \\ z_{kis} &\sim \mathrm{Categorical} \left( {\boldsymbol{P}}_{z_{[k-1]is}} \right), \ k \in 2, \dots, K \\ \boldsymbol{P} &= \exp \left(\boldsymbol{Q} \tau_{i[k-1]} \right), \end{aligned} \tag{6}

where \boldsymbol{Q} is a 2 \times 2 transition rate matrix (subscripts for i and s dropped):

\boldsymbol{Q}_{[k-1]} = \begin{bmatrix} -{q_1}_{[k-1]} & {q_1}_{[k-1]} \\ {q_2}_{[k-1]} & -{q_2}_{[k-1]} \end{bmatrix}, \ k \in 2, \dots, K, \tag{7}

\tau_{ik} gives the elapsed time (expressed in years) from the end of deployment in season k-1 to the start of deployment in season k at site i, and the transition rates \boldsymbol{q} are the (annual) colonisation and emigration rates, respectively. The transition probability matrix \boldsymbol{P} is obtained by taking the matrix exponential of \boldsymbol{Q}\tau, which accounts for (potentially) unequal time periods between deployments.

Dimension reduction through structural equation modeling

Initial occupancy probabilities (\boldsymbol{\psi}) and colonisation (\boldsymbol{q_1}) and emigration (\boldsymbol{q_2}) rates may be expected to be highly correlated: sites that are suitable for a species should have high initial occupancy and colonisation rates and low emigration rates. Therefore, expressing linear predictors for all three parameters is likely redundant and makes interpretation more complex, especially when the primary interest is not on the dynamics of the occupancy process. As a remedy, occARU uses a structural equation modeling (SEM) approach and models a latent “site suitability” \eta_{kis}, which are then mapped onto the ecological parameters using species-specific factor loadings \boldsymbol{\lambda}:

\begin{aligned} \eta_{kis} &= {\alpha_\eta}_{[s]} + {\boldsymbol{X_1}}_{[ki]} {\boldsymbol{\beta_\psi}}_{[s]} + {\iota_\psi}_{[is]} + {\nu_\psi}_{[ks]}, \ k \in f_i, \dots, l_i \\ \psi_{is} &= \operatorname{logit}^{-1} \left( \bar{\alpha}_\psi + {\lambda_1}_{[s]} \eta_{f_{[i]}is} \right) \\ {q_1}_{[k-1is]} &= \exp \left( \bar{\alpha}_{q_1} + {\lambda_2}_{[s]} \eta_{kis} \right) \\ {q_2}_{[k-1is]} &= \exp \left( \bar{\alpha}_{q_2} - {\lambda_3}_{[s]} \eta_{kis} \right), \end{aligned} \tag{8}

where f_i and l_i are the first and last season of deployment for site i; \boldsymbol{\alpha_\eta} are zero-centered species-level intercepts; \boldsymbol{\iota_\psi} and \boldsymbol{\nu_\psi} are random site and season effects on suitability, respectively (which can now be estimated due to repeated measures); and \boldsymbol{\bar{\alpha}} are overall intercepts for initial occupancy and transition rates. This construction means that only one set of fixed effect coefficients and random effects are estimated for occupancy, irrespective of the number of seasons. In multiseason models, however, site-level predictors must be provided for each season that a site was sampled, which accommodates potentially time-varying site-level predictors.

For identifiability, the signs of factor loadings are fixed to be positive for initial occupancy and colonisation and negative for emigration, so that the interpretation of \boldsymbol{\eta} remains “suitability”. Additionally, the variance of the factor loadings is fixed to 1. Note that in single season models where \lambda_1 = 1, we simply recover initial occupancy as in Equation 1, without random site and season effects.

Observation model

The observation model for dynamic models remains as in Equation 2, except that random season effects \boldsymbol{\nu} are included by default to capture unexplained seasonal differences by default and random survey effects are estimated separately in each season. As with occupancy, site-level and survey-level predictors must be provided for each season that a site was sampled. The optional season random effects (on both detection and occupancy) are modeled analogously to site and season random effects with a choice of GP kernel (using the dates of each season’s first deployment as input) or unstructured multivariate normal.

Sites nested in regions

occARU accommodates groups of ARUs nested in regions, for example when multiple independent grids are deployed. When multiple regions are included, occARU includes region-level intercepts for both occupancy (or “site suitability” in dynamic models) and detection. Random (spatial) site effects are also computed separately per-region, which can significantly increase speed of model fits.

Identifiability and sum-to-zero constraints

All random effects (species-level intercepts, fixed effect coefficients, site effects, survey effects, and OLRE residuals) are fit with sum-to-zero constraints for parameter identifiability (Ogle and Barber 2020). Without these constraints, the overall intercepts (\bar{\psi} and \bar{\mu}), mean effects (i.e., \bar{\boldsymbol{\beta}}, \bar{\boldsymbol{\gamma}}, \boldsymbol{\bar{\iota}}, \boldsymbol{\bar{\kappa}}, and \boldsymbol{\bar{\epsilon}}) and the species-level deviations are weakly identified. The constraints are implemented directly in the Stan parameterisation using sum_to_zero_vector and sum_to_zero_matrix types, which induce the constraint with a favourable posterior geometry leading to increased computational efficiency. Note that technically, the resulting distributions are no longer (multivariate/matrix) normal, but rather the zero-sum variants.

Assessing model fit

occARU produces site-by-species level log likelihoods (log_lik) as generated quantities; these values can be used with Pareto-smoothed importance sampling leave-one-out cross-validation (PSIS-LOO-CV, Vehtari et al. 2017) as implemented in the loo package (Vehtari et al. 2025). PSIS-LOO-CV is known to be problematic when there are observation-level parameters, i.e. \iota_{is} and, when OLREs are included, \epsilon_{ijs}. As a remedy, Monte Carlo integration of these random effects is used to produce a second log likelihood object (log_lik2), where the user can set the number of Monte Carlo draws to perform per HMC iteration (argument loo_draws in occARU()).

Posterior predictions can be produced for both y_{ijs} and Q_{is} := \sum_{j=1}^J y_{ijs}, by setting the argument ppc = c("Q", "y", "both", "none") in occARU(). This facilitates posterior predictive checking, i.e. via the bayesplot package (Gabry et al. 2025), and an occARU-specific pp_check() function plots “rootograms” by site or region (Kleiber and Zeileis 2016). Additionally, the log prior density of the prior hyperparameters is stored (lprior), facilitating prior predictive checking via the priorsense package (Kallioinen et al. 2023).

Priors

Default priors for hyperparameters are chosen to be weakly informative on the natural scale of the outcome (species counts). Prior families of parameters are fixed, but hyperparameters can be configured. They can be inspected and modified with set_priors(), which produces an occARU_priors object to be passed to occARU().

Parameter	Interpretation	Default Prior
\bar{\psi}	Mean (initial) occupancy probability	\mathrm{Beta} \left( 1, 1 \right)
\bar{\mu}	Mean detection rate	\mathrm{Gamma} \left( 1, 1 \right)
\boldsymbol{\bar{q}}	Mean colonisation and emigration rates	\mathrm{Gamma} \left( 1, 3 \right)
W_{\psi}	Variance of occupancy log odds linear predictor	\mathit{t}^+ \left( 3, 0, 1 \right)
W_{\mu}	Variance of log detection rate linear predictor	\mathit{t}^+ \left( 3, 0, 2.5 \right)
\theta_{\psi}, \theta_{\mu}	Variance partition simplex sparsity	\mathrm{Gamma} \left( 1, 1 \right)
\ell_\iota, \boldsymbol{\ell_\kappa}, \ell_\nu, \ell_{\iota_\psi}, \ell_{\nu_\psi}	GP length scales	\mathrm{InvGamma} \left( 1, 1 \right)
\phi_s	Negative binomial overdispersion	\mathrm{InvGamma} \left( 0.4, 0.3 \right)
Cholesky factor of correlation matrices	Intercepts and coefficient/random effect correlations	\mathrm{LKJ} \left( 1 \right)

References

Aguilar, Javier Enrique, and Paul-Christian Bürkner. 2023. “Intuitive Joint Priors for Bayesian Linear Multilevel Models: The R2D2M2 Prior.” Electronic Journal of Statistics 17 (1): 1711–67. https://doi.org/10.1214/23-EJS2136.

Bürkner, Paul-Christian, and Emmanuel Charpentier. 2020. “Modelling Monotonic Effects of Ordinal Predictors in Bayesian Regression Models.” British Journal of Mathematical & Statistical Psychology 73 (3): 420–51. https://doi.org/10.1111/bmsp.12195.

Carpenter, Bob, Andrew Gelman, Matthew D. Hoffman, et al. 2017. “Stan: A Probabilistic Programming Language.” Journal of Statistical Software 76: 1. https://doi.org/10.18637/jss.v076.i01.

Doser, Jeffrey W., Andrew O. Finley, Marc Kéry, and Elise F. Zipkin. 2022. “spOccupancy: An R Package for Single-Species, Multi-Species, and Integrated Spatial Occupancy Models.” Methods in Ecology and Evolution 13 (8): 1670–78. https://doi.org/10.1111/2041-210X.13897.

Gabry, Jonah, Rok Češnovar, Andrew Johnson, and Steve Bronder. 2025. cmdstanr: R Interface to ’CmdStan’. Manual.

Hanks, Ephraim M., Erin M. Schliep, Mevin B. Hooten, and Jennifer A. Hoeting. 2015. “Restricted Spatial Regression in Practice: Geostatistical Models, Confounding, and Robustness Under Model Misspecification.” Environmetrics 26 (4): 243–54. https://doi.org/10.1002/env.2331.

Hodges, James S., and Brian J. Reich. 2010. “Adding Spatially-Correlated Errors Can Mess up the Fixed Effect You Love.” The American Statistician 64 (4): 325–34. https://doi.org/10.1198/tast.2010.10052.

Kallioinen, Noa, Topi Paananen, Paul-Christian Bürkner, and Aki Vehtari. 2023. “Detecting and Diagnosing Prior and Likelihood Sensitivity with Power-Scaling.” Statistics and Computing 34 (57). https://doi.org/10.1007/s11222-023-10366-5.

Kleiber, Christian, and Achim Zeileis. 2016. “Visualizing Count Data Regressions Using Rootograms.” The American Statistician 70 (3): 296–303. https://doi.org/10.1080/00031305.2016.1173590.

MacKenzie, Darryl I., James D. Nichols, James E. Hines, Melinda G. Knutson, and Alan B. Franklin. 2003. “Estimating Site Occupancy, Colonization, and Local Extinction When a Species Is Detected Imperfectly.” Ecology 84 (8): 2200–2207. https://doi.org/10.1890/02-3090.

MacKenzie, Darryl I., James D. Nichols, Gideon B. Lachman, Sam Droege, J. Andrew Royle, and Catherine A. Langtimm. 2002. “Estimating Site Occupancy Rates When Detection Probabilities Are Less Than One.” Ecology 83 (8): 2248–55. https://doi.org/10.1890/0012-9658(2002)083[2248:ESORWD]2.0.CO;2.

Ogle, Kiona, and Jarrett J. Barber. 2020. “Ensuring Identifiability in Hierarchical Mixed Effects Bayesian Models.” Ecological Applications 30 (7): e02159. https://doi.org/10.1002/eap.2159.

Vehtari, Aki, Jonah Gabry, Måns Magnusson, et al. 2025. loo: Efficient Leave-One-Out Cross-Validation and WAIC for Bayesian Models.

Vehtari, Aki, Andrew Gelman, and Jonah Gabry. 2017. “Practical Bayesian Model Evaluation Using Leave-One-Out Cross-Validation and WAIC.” Statistics and Computing 27 (5): 1413–32. https://doi.org/10.1007/s11222-016-9696-4.