Nc-mixture occupancy model
Abstract
A class of occupancy models for detection/non-detection data is proposed to relax the closure assumption of Nmixture models.
We introduce a community parameter , ranging from to , which characterizes a certain portion of individuals being fixed across multiple visits.
As a result, when equals , the model reduces to the Nmixture model; this reduced model is shown to overestimate abundance when the closure assumption is not fully satisfied.
Additionally, by including a zero-inflated component, the proposed model can bridge the standard occupancy model () and the zero-inflated Nmixture model (). We then study the behavior of the estimators for the two extreme models as varies from to .
An interesting finding is that the zero-inflated Nmixture model can consistently estimate the zero-inflated probability (occupancy) as approaches , but the bias can be positive, negative, or unbiased when depending on other parameters. We also demonstrate these results through simulation studies and data analysis.
Keywords: Community parameter; Nmixture models; Zero-inflated model
1 Introduction
Estimating the occupancy and abundance of species in a specific region, despite imperfect detection, is a crucial problem in ecology conservation and management. In reality, a species may exist at a survey site but go undetected due to limitations in the survey method or timing. Zero counts in a site sampling survey can be caused by the species not being present or by detection errors. If this issue is not addressed, the occupancy rate will be underestimated. Site occupancy models (MacKenzie et al., 2002), based on multiple-visit detection/non-detection (occurrence, presence/absence) data, can estimate the occurrence rate by accounting for detection errors. These models, which can be temporal or spatial replication surveys, are cost-effective as they do not require individual identification or marking. They are widely used in species distribution modeling, with various extensions such as multi-season open models, multi-species models, dynamic models, and spatial-temporal models (MacKenzie et al., 2017; Hogg et al., 2021; MacKenzie et al., 2009; Johnson et al., 2013) having been developed. This study focuses on single-species, single-season occupancy models.
In the standard occupancy model (MacKenzie et al., 2002), detection probability and species abundance are confounded and cannot be distinguished from one another. To overcome this limitation, Royle and Nichols (2003); Royle (2004) introduced Nmixture occupancy models that allow for the separation of detection probability and species abundance. These models enable the estimation of species abundance through multiple-visit occurrence or count data. Nmixture models have received a lot of attention in the literature (Haines, 2016a, b; Joseph et al., 2009; Gomez et al., 2018) as they can estimate population size like capture-recapture models, but without the need for capturing and marking individuals. However, the performance of these models depends on the assumptions made (Dennis et al., 2015; Link et al., 2018; Barker et al., 2018). Therefore, our goal is to improve these models by relaxing the closure model assumption, which is often violated even in single-season surveys (Kendall et al., 2013; Otto et al., 2013).
The closure assumption states that the number of individuals at a site remains constant during multiple visits. Surveys are often conducted through temporal or spatial replication, and sometimes with the use of multiple detectors or a combination of these methods (MacKenzie et al., 2017; Kendall and White, 2009). Temporal replication involves surveying the same sites at different times, while spatial replication involves selecting random sampling units within a larger area at a single site. While this closure assumption is reasonable for surveys conducted in the same locations over a short period, it may not be appropriate for highly mobile species (Hayes and Monfils, 2015). We also note that the study in Royle (2004) is an example of temporal replication, despite the paper’s title referencing spatial replication to indicate the distribution of sites.
The inference of the Nmixture occupancy model has been shown to produce biased point estimates and incorrect interval estimates when the closure assumption is violated. While a few studies have highlighted these effects (Dénes et al., 2015; Duarte et al., 2018; Dail and Madsen, 2011; Ke et al., 2022), most evidence is derived from simulation studies. To the best of our knowledge, there is a lack of theoretical results that explain the behavior of estimation bias under the Nmixture model. However, we have recently provided theoretical evidence under the proposed Nmixture model, which is an extension of the Nmixture model. It is important to note that Dail and Madsen (2011) also proposes a class of generalized Nmixture models, which allows for the immigration and emigration of species populations and estimates year-to-year immigration and emigration rates, providing valuable insights for conservation managers. However, this model is a multi-season open population model that goes in a different direction than our extension.
The Nmixture model is designed to create a framework that can unify both temporal and spatial replicating surveys. For example, consider a triple-visit survey conducted at a site where , , and represent the number of observable individuals during each visit. These variables are assumed to be identically distributed random variables. In the case of temporal replication, where the surveys are conducted in a short period, the three variables can be considered equal and meet the closure assumption of the Nmixture model. In contrast, for spatial replication, the variables are treated as independent. To account for both scenarios, we decompose the variables into two components: for , where represents the number of common individuals during the triple-visit survey, and represents the number of non-common individuals. We assume that and are independent, with for some . This parameter , referred to as the community parameter, indicates the proportion of individuals who are residents and remain fixed during the triple-visit. It also allows us to easily let degenerate to if , or let degenerate to if . Figure 1 provides an illustration of this decomposition.

Under the Nmixture model, we are able to demonstrate that if the community parameter is incorrectly specified as (i.e., the Nmixture model is used instead), the mean abundance will be overestimated. Additionally, our results indicate that the bias increases as the community parameter decreases, reaching infinity as approaches .
We propose an extension to the Nmixture model by incorporating a zero-inflated component. This allows us to bridge the standard occupancy model when (MacKenzie et al., 2002) and the zero-inflated Nmixture model when (Haines, 2016a). We then investigate the behavior of estimators for these two extreme models as the community parameter ranges from to . Our findings reveal that the standard occupancy model underestimates the zero-inflated probability (occupancy), and the bias increases as the community parameter increases. However, an interesting finding is that the zero-inflated Nmixture model can estimate the occupancy consistently as approaches , but the bias can be positive or negative depending on the other parameters.
The paper is organized as follows: In Section 2, we develop the Nmixture model and present estimation methods. In Section 3, we consider the zero-inflated Nmixture model. Section 4 includes simulation studies to evaluate estimator performance. Two real data examples are presented in Section 5, and a discussion is in Section 6. Web Appendix A includes proofs for all propositions.
2 Nmixture model
2.1 Notation and estimation
Consider a multiple-visit sampling survey consisting of sites and visits. Let be the number of individuals at site during the -th visit. Following the motivation in the Introduction (see Figure 1), we decompose into two independent Poisson distributed random variables: and . The expected value of is and the expected value of is , where and are constant parameters. Therefore, the number of individuals at each site, , follows an identical Poisson distribution with mean , representing the abundance parameter over sites (per each visit). However, the numbers of individuals from multiple visits at a site, , are not independent as they share a common variable . The community parameter characterizes the degree of dependence between visits, as it also represents the correlation between each visit of . Note that and are degenerate to 0 when and , respectively.
To model the data observation process, we assume that each individual is independently detectable with a probability of detection . If the species is detected at site during visit , let , otherwise . We denote the probability of detection at site during visit as , then when is known. This forms an Nmixture model.
It is clear that , are exchangeable variables, though not independent unless . Let be the frequency of occurrence at site . In Web Appendix A, we show that the probability function of under the Nmixture model is given by
(1) |
where is the vector of model parameters and . Equation (1) is also referred to as the Nmixture model.
When the community parameter , the model in equation (1) simplifies to
(2) |
where with . This is the probability function of the Nmixture model (Royle and Nichols, 2003), but the explicit form (2) is firstly given in Haines (2016a).
Similarly, when the community parameter , the model (1) reduces to
(3) |
where with . The reduced model (3) is a binomial distribution denoted as , where . We also note that the binary variables are independently and identically distributed with a mean of when . A calculation using the Poisson moment generating function shows that , which leads to the same result as (3). However, the model parameters and in (3) cannot be separated, making the model unidentifiable, and only their product is identifiable.
When the community parameter , the Nmixture model (1) is identifiable for . The log-likelihood function of the parameter vector is given by , and the maximum likelihood estimation is straightforward. For later use, the likelihood function can be represented in the framework of the multinomial model. Let for , where is the indicator function. These statistics can be viewed as a result of the multinomial model with cell probabilities . Therefore, we can write , and the score function for is
(4) |
2.2 Occupancy rate
Royle and Nichols (2003) define the occupancy rate as a derived parameter under the Nmixture model. Specifically, in terms of our notations. Under the Nmixture framework, the occupancy rate per each visit can be defined as . Alternatively, the rate can be defined as , where if some , the site is considered occupied by the species. In this way, the occupancy rate is , which depends on the number of visits and converges to when increases infinitely. Both definitions, therefore, differ from the current concept of site occupancy. The problem is that the number of individuals at site may vary from visit to visit because are random in the Nmixture model. Therefore, rather than directly defining occupancy, we would like to include a zero-inflated parameter to determine site occupancy under the Nmixture model. This extension will be addressed in Section 3.
2.3 Behavior of the Nmixture model estimators
In this subsection, we examine the behavior of Nmixture model estimators for and when the number of individuals at a site during multiple visits is not a fixed constant (i.e., ).
First, we examine the scenario where the community parameter is known and (double-visit). In this scenario, we use the notation and to represent the (restricted) maximum likelihood estimators (MLEs) of and , respectively. As shown in Web Lemma 1 of Web Appendix A, the MLEs have closed forms when , which are given by and , where and . These closed forms allow us to easily determine the limits of the MLEs of the Nmixture model, and , as given in the following proposition.
Proposition 1.
Under the Nmixture model with a double-visit survey, as the number of sites increases to infinity, the estimators and converge to and respectively with probability one, for all .
Proposition 1 delivers insights into the behavior of Nmixture model estimators when the community parameter is less than . The results show that the MLEs are consistent when , which is a common property of the maximum likelihood approach. Additionally, the results indicate that the Nmixture model overestimates the abundance parameter , and the bias increases as the community parameter decreases. In contrast, the Nmixture model exhibits the opposite bias behavior for the detection probability . Despite these biases, the estimator can consistently estimate under the framework of Nmixture model, which is noteworthy.
We initially believed that the results of Proposition 1 would hold for surveys with more than two visits (), but further investigation revealed that this is only partially true. The correct part is that there are moment estimators of the Nmixture model that exhibit similar behaviors to those described in Proposition 1. To demonstrate this, we derived the moments of under the Nmixture model and defined the resulting estimators as and . The following proposition summarizes the results of this analysis.
Proposition 2.
The method of moment estimators of the Nmixture model, and , are obtained by solving the equations
where is the sample average and . Furthermore, if the Nmixture model is true, as the number of sites increases to infinity, the estimators and converge to and respectively with probability one for all .
In the case of multiple-visit surveys, our simulation study (Section 4) found that the limit of the MLE of the Nmixture model, , exhibits a similar pattern to when . However, there is a discrepancy between them, and we can only determine the behavior when the community parameter is close to .
Proposition 3.
Under the Nmixture model with approaching zero, as the number of sites increases to infinity, the estimators and converge such that , , and with probability one.
Based on the results presented, we suggest that when , the MLE of the Nmixture model tends to overestimate the parameter and underestimate the parameter , and the bias increases as moves further away from . However, the estimator may still be able to consistently estimate at certain ranges of , such as when is close to 0, within the framework of the Nmixture model.
3 Zero-inflated Nmixture model
To account for the species occupancy, we extend the model (1) by incorporating a zero-inflation component. Following MacKenzie et al. (2002), let be the site occupancy probability, then the probability of a zero count () is . The likelihood function thus has an additional parameter so that we may write
(5) |
We refer the model (5) as a zero-inflated Nmixture (ZINc) model.
When , the model (5) becomes a zero-inflated Nmixture (ZIN) model, as previously described in Haines (2016a). In the case of , it simplifies to a zero-inflated binomial (ZIB) model with the detection probability , which is also known as the first occupancy model (MacKenzie et al., 2002). Thus, the ZINc model unifies both the ZIB and ZIN models into a single framework, with the special cases of and corresponding to ZIB and ZIN, respectively.
3.1 Estimation
When the community parameter , the ZINc model is identifiable for . By defining and , the likelihood function can be written as
where and Note that reflects the probability function of while is the conditional likelihood based on . As the conditional likelihood function is independent of the occupancy probability , it allows us to estimate without the confounding of (Karavarsamis and Huggins, 2020). Specifically, we can find the conditional MLE of , denoted as , by solving the score function of . Additionally, by taking the MLE of the profile likelihood we can find . As shown in Web Appendix A, Web Lemma 2 confirms that the estimators and resulting from this method are also the usual MLEs based on (5). The asymptotic variances of and can be derived in the usual way.
In the zero-inflated type models, the main focus is on the estimation of the occupancy probability. Let be the MLE of for the ZINc model, given that the community parameter is known. We next examine the behavior of and , which correspond to the occupancy estimators for the ZIB and ZIN models, respectively.
3.2 Behaviors of and
Like the Nmixture model, the ZINc model also has an identifiability issue for and when . As a result, under the ZIB model, only the parameter (i.e., ) and the occupancy probability can be estimated. In practice, to fit ZIB models, it is common to set to estimate with the resulting abundance estimator ().
Proposition 4.
Under the ZINc model with , the ZIB occupancy estimator shows an underestimation of with probability one as increases to infinity. A linear approximation of this underestimation can be represented as where
(6) |
and increases as increases, when then .
We notice that as either or increases, the value of decreases to zero. This result is expected, as when there are more visits or when the species abundance is high, the observed occupancy approaches . The linear approximation bias provides a reasonable representation of the trend of in various aspects, depicting that as moves away from (or as the correlation between visits increases), the underestimation becomes more significant.
Remark 1. As a direct consequence of Proposition 4, we can also see that the corresponding abundance estimator increases as increases, and that at is a consistent estimator of the product of species abundance and detection probability ().
Estimators of the ZIN model tend to have bias when the community parameter is less than . However, the behavior of these biases is complex. For example, the bias of the occupancy estimator does not vary monotonically with decreasing . Despite this, when , the estimators of the ZIN model (except ) behave similarly to those of the Nmixture model, as shown in Proposition 3. Interestingly, can consistently estimate at , as shown in the next proposition. For clarity, if there is no confusion, and in this proposition also refer to the MLE of the ZIN model (or the restricted MLE of the ZINc model).
Proposition 5.
Under the Nmixture model with approaching zero, as the number of sites increases, the estimators and converge such that , , and with probability one. Additionally, is a consistent estimator for when .
The estimator is also a consistent estimator under the ZIN (or ZINc with ) model. Therefore, Proposition 5 shows that behaves like a bridge with both ends (at and 1) at the same level; however, it is not clear whether the bridge deck is always above or below this level, or if it varies in different sections. We will further investigate this behavior through a simulation study.
3.3 Tests for ZIN and ZIB
The null hypothesis of the ZIN or ZIB model can be tested within the framework of the ZINc model, as it is equivalent to testing the values or . A simple method to justify the hypothesis is to use a Wald-type confidence interval based on the estimate of . A more formal approach is to use the likelihood ratio test, as the null hypothesis is a submodel of the full ZINc model. However, it is important to note that the asymptotic distribution of the likelihood ratio test under the null hypothesis is a mixture of and chi-square distributions, rather than the usual chi-square distribution (Self and Liang, 1987). In practice, we also suggest generating bootstrap samples under the null hypothesis to find the value. This is similar to the conclusion of Dail and Madsen (2011).
4 Simulation study
We conducted simulations to evaluate the performance of proposed models and estimators. We considered two scenarios: the Nmixture model and the ZINc model. In the first scenario, we computed the maximum likelihood estimators for both the Nmixture model and Nmixture model. In the second scenario, we calculated the maximum likelihood estimators for the ZIB, ZIN, and ZINc models. All estimates were calculated by using the optim function in the R software (R Core Team, 2022).
We specified the true parameter values as , , and for the simulations. For both scenarios, the number of sites was set at , and the number of visits was set at . In the second scenario, we also set the occupancy probability to . We generated 1,000 data sets for each parameter setting and calculated the estimates and associated standard error estimates for each data set.
To account for outliers in some of the simulated data sets, we present the median of the parameter estimates (Med), the median of the estimated standard error (Med.se), and the median absolute deviation (MAD) scaled to align with the normal distribution. Additionally, we report the coverage percentage (CP) of the nominal Wald-type confidence intervals. We note that in some instances, the numerical methods utilized for estimating the parameters for each model did not converge, with a higher frequency of non-convergence observed for the ZIN model and a lower frequency for the ZINc model. However, in most cases, the percentage of failures was minor and not reported.
4.1 Simulation study A: Nmixture model
The median estimates of for and are displayed in Figure 2. A comprehensive examination of the simulation results for can be found in Web Tables 1-6 in Web Appendix B.

Figure 2 illustrates the behavior of the Nmixture abundance estimator as a function of the community parameter . The figure shows that the estimator exhibits a positive bias for all values of less than . This bias follows a monotonically decreasing pattern and only approaches the true value when equals . When is close to , the bias can be substantial but has been removed from the figure for ease of visualization. Additionally, the figure shows that is greater than for all values of less than 1, although the difference has not been explicitly explored. In a separate simulation study (not reported), we calculated the moment estimator for the Nmixture model and found that its pattern closely resembled the reference curve . These simulation results agree with the findings outlined in Propositions 1-3. Corresponding conclusions can also be drawn from the results of the estimator shown in Web Figure 1 of Web Appendix B. The MLE of the Nmixture model can consistently estimate , but it also frequently exhibits bias around . This is mainly because the parameters of and are almost unidentifiable when . The bias is more pronounced when is large and is small, but it becomes smaller when or increases; see Web Table 6 (for and cases).
In Web Tables 1-6, it can be seen that the Nmixture abundance estimator has a relative bias ranging from to when varies from to . The bias is more pronounced when or increases but less severe when increases. On the other hand, the Nmixture model shows nearly unbiased estimates in all cases, except when , and , where the relative bias can reach up to - when varies from to ; see Web Tables 1 and 4.
In terms of mean absolute deviation (MAD), both models show a decrease in variation as increases, with the Nmixture model showing a much steeper decrease compared to the Nmixture model. Specifically, when , the MAD of can be to times that of , but the former is usually smaller than the latter when . The asymptotic standard error estimates, as measured by Med.se, generally match the results of the corresponding MAD, making them reasonably reliable for the scenarios considered. The Wald-type confidence interval estimator of the Nmixture model performs well when data information is sufficient, with a close match to the nominal confidence level at and . However, in some cases with small and , the coverage probability (CP) can be lower than (as seen in Web Tables 1, 4, and 5), indicating an unsatisfactory performance. The Nmixture model estimator often reaches for the coverage probability due to the severe bias problem of when .
The results from Web Tables 1-6 on the detection probability parameter are similar to , with the only difference being that the Nmixture model estimator’s bias is in the opposite direction. Finally, it is found that the median of the product presents nearly unbiased results for estimating in all cases. Note that this property is only proven for the range (Proposition 5), but the simulation results suggest that the range of can be extended somewhere.
4.2 Simulation study B: ZINc model
In Figures 3 and 4, we present the median estimate results for the parameters and when and . The detailed simulation results for can be found in Web Tables 7-12 of the Web Appendix B. In the ZIB model, we have fixed the value of . This is because the parameter is non-separable in this case.

In Figure 3, we can see that the ZIB estimator consistently underestimates . In fact, is approximately equal to when is close to 0 and increases as increases (as stated in Remark 1). The ZIN estimator has a similar trend as the Nmixture model when is close to , but it falls off quickly, rises slowly, and becomes consistent with at . In general, mainly exhibits positive bias but occasionally also shows negative bias at some . The ZINc estimator typically exhibits some bias at both ends of the range of , which can be very large at when is small. As expected, increasing and can mitigate bias issues of the maximum likelihood estimator.

In Figure 4, the ZIB estimator was found to underestimate when , with a greater bias observed when compared to . The relative bias reached a maximum of when and . The approximate formula of Proposition 4 was found to be consistent with the behavior trend of , although it is not shown in the figure. The ZIN occupancy estimator was found to fit well with at both ends of the range, as stated in Proposition 5. A positive bias was observed around and a negative bias around . The partial derivative of was proven to be positive when , indicating overestimation of in that region. However, at , the partial derivative was found to be positive in most situations, with some negative signs observed in some instances, such as when and . In this case, the plot of against exhibited behavior similar to an arch bridge, with only positive biases observed for all . The ZINc estimator was found to be unbiased in general, except when . The bias was found to be smaller when or is increased.
Next, we present a more detailed summary of the performance of the three occupancy estimators based on the results in Web Tables 7-12. The upper half of Web Table 7 ( and ) shows that the ZIB occupancy estimator has a relative bias of at , which becomes more pronounced as increases, reaching a maximum of at . However, increasing or can reduce the bias of as seen in Web Tables 7-12, which is consistent with Proposition 4. In Web Tables 7-9 (), the ZIN occupancy estimator has a relative bias of around - at . Specifically, the ZIN model often estimated as one at when and (Web Tables 8-9). The bias of decreases as increases, showing a small negative bias at . When is increased to , Web Tables 10-12 show that overestimates in all cases, with the most significant bias around - at , and small or even negligible bias at . Web Tables 7-12 also reveal that the bias of the ZINc occupancy estimator is generally close to , except for a few cases of , , and (Web Table 7).
The ZIB occupancy estimator has the lowest MAD, indicating that it is the most stable of the three methods. When , the MAD of the ZIN estimator decreases with increasing ; however, when , the MAD is more prominent at due to the significant bias of the ZIN estimator at this point. In contrast, the MAD of the ZINc estimator increases with increasing values; when , the MAD at can be to times the MAD at , but the increase is much smaller when . Generally, the stabilities of all three estimators improve with increasing values of , , and .
The Med.se of the ZIB occupancy estimator fits the corresponding MAD reasonably well, except for some negative biases observed at . However, the resulting interval estimator is only reliable when and ; in other cases, the corresponding CP may even reach zero showing very poor performance. For the ZIN occupancy estimator, Med.se often overestimates MAD, particularly with increasing , but this is less of an issue when or are increased. The resulting CP also shows many abnormal values, even when it appears to be close to the nominal level in some cases. For example, Web Tables 8-9 show that the CP can reach about in the upper panels but may drop to zero in the bottom panels. As a result, the interval estimates for the ZIN model are not considered reliable. The ZINc estimator, being a maximum likelihood estimate, can perform well on Med.se and CP measures when the data information is sufficient, but its performance is not guaranteed otherwise.
Lastly, we note that abundance estimators can be similarly affected by issues such as violation of model assumptions or a lack of sufficient data, which can result in substantial bias in some instances, such as when and . In general, compared to abundance estimators, the corresponding occupancy estimators are relatively less affected by these issues and their performance is relatively robust.
5 Examples
5.1 Example 1. Fisher data
The fisher is a carnivorous mammal native to the boreal forests of North America. In 2000, a survey program using noninvasive methods was conducted to collect data on fisher species distribution in northern and central California (Zielinski et al., 2005; Royle and Dorazio, 2008). The data consists of multiple-visit occurrences at sites, each visited times. See Web Table 13 for the data. Out of the eight visits, sites had zero counts, resulting in a sample occupancy rate of . We analyzed the fisher data using the models presented in Sections 2 and 3. The results are summarized in the top panel of Table 1, where we also calculated the Akaike information criterion (AIC) value for each model to evaluate model fit.
The AIC values for the ZIN and ZINc models were significantly lower than those of the Nmixture and Nmixture models, respectively, indicating that including zero-inflated probabilities improves the fit of the model to the fisher data. Among the three zero-inflated models, the ZINc had the smallest AIC value. Furthermore, the ZINc model estimated the community parameter to be in the middle of and the associated Wald-type confidence interval (Web Table 14) provided strong evidence against the null hypotheses and . The likelihood ratio statistics for the tests and were and respectively, and the values based on 10,000 bootstrap samples were and , respectively, leading to the same conclusion for rejecting ZIB and ZIN models. To sum up, the ZINc model fits the fisher data better than the other models, however, the occupancy estimates for the three zero-inflated models are similar.
It is worth noting that the ZIN model estimated to be , which appears larger than the other models, but the ZIN model has an unoccupied probability of . Therefore, to compare occupancy rates with other models, we should use with an estimate of (called the occupied probability of the ZIN model), which is similar to other estimates of . In contrast, the resulting abundance estimates are quite different, with ZIN reporting a higher value and ZIB having the lowest value. The standard error estimates for the three models also follow the same order, with the parameter estimates for ZIN showing the greatest estimated variation. This phenomenon is similar to what was observed in Web Table 8 of Simulation Study B.
Model | AIC | ||||
Fisher | |||||
ZINc | 0.93 (0.21) | 0.49 (0.10) | 0.51 (0.13) | 0.154 (0.020) | 626.9 |
ZIN | 1.79 (0.68) | 0.22 (0.05) | 0.175 (0.030) | 633.2 | |
ZIB | 0.51 (0.04) | 0.140 (0.016) | 663.1 | ||
Nmix. | 0.13 (0.02) | 0.46 (0.03) | 0.93 (0.02) | 636.8 | |
Nmix. | 0.16 (0.02) | 0.37 (0.02) | 0.150 (0.017) | 650.6 | |
Blue Jay | |||||
ZIN | 48.96 (427.97) | 0 (0.04) | 0.729 (0.091) | 170.1 | |
ZIB | 0.22 (0.03) | 0.723 (0.077) | 168.1 | ||
Nmix. | 1.64 (0.56) | 0.09 (0.03) | 0.806 (0.109) | 169.5 | |
Catbird | |||||
ZINc | 0.60 (0.32) | 0.42 (0.21) | 0.19 (0.14) | 0.421 (0.079) | 139.1 |
ZIN | 0.85 (1.31) | 0.16 (0.09) | 0.740 (0.691) | 137.8 | |
ZIB | 0.26 (0.04) | 0.403 (0.074) | 138.5 | ||
Nmix. | 0.52 (0.17) | 0.19 (0.06) | 0.97 (0.12) | 137.9 | |
Nmix. | 0.55 (0.14) | 0.18 (0.04) | 0.421 (0.079) | 135.9 | |
Common Yellow-Throat | |||||
ZINc | 0.93 (0.23) | 0.47 (0.09) | 0.56 (0.10) | 0.775 (0.075) | 222.9 |
ZIN | 2.06 (0.81) | 0.20 (0.05) | 0.851 (0.123) | 227.8 | |
ZIB | 0.49 (0.04) | 0.723 (0.064) | 254.5 | ||
Nmix. | 1.09 (0.38) | 0.31 (0.09) | 0.88 (0.15) | 226.7 | |
Nmix. | 1.45 (0.25) | 0.23 (0.03) | 0.765 (0.059) | 226.8 | |
Tree Swallow | |||||
ZINc | 0.75 (0.22) | 0.45 (0.08) | 0.61 (0.12) | 0.682 (0.106) | 196.4 |
ZIN | 1.81 (0.75) | 0.18 (0.05) | 0.726 (0.135) | 204.0 | |
ZIB | 0.40 (0.04) | 0.587 (0.071) | 228.0 | ||
Nmix. | 0.48 (0.11) | 0.46 (0.07) | 0.72 (0.08) | 198.4 | |
Nmix. | 1.02 (0.19) | 0.22 (0.03) | 0.641 (0.069) | 203.9 | |
Song Sparrow | |||||
ZINc | 1.01 (0.59) | 0.37 (0.08) | 0.90 (0.09) | 0.678 (0.240) | 184.7 |
ZIN | 2.23 (0.98) | 0.21 (0.06) | 0.573 (0.105) | 187.9 | |
ZIB | 0.54 (0.05) | 0.501 (0.072) | 210.5 | ||
Nmix. | 0.58 (0.13) | 0.42 (0.05) | 0.93 (0.04) | 183.4 | |
Nmix. | 0.81 (0.16) | 0.31 (0.04) | 0.555 (0.071) | 190.5 |
5.2 Example 2: Breeding Bird Survey (BBS) data
This example aims to understand how the community parameter may vary among different species in data from the same survey. We consider the data used in Royle (2006), originally from the North American Bird Breeding Survey (BBS) program, which includes records of occurrence data for five species of birds -- blue jay (), catbird (), common yellow-throat (), tree swallow (), and song sparrow ()-- from locations with repeated visits. The sample occupancy rates for the five species (in the above order) are , , , , and ; see Web Table 13 for the data.
Results are reported in Table 1, where the order of the species was actually sorted according to the estimated value of in the ZINc model. The first two species show small estimates of , with the ZINc model even degenerating to the ZIB model for the blue jay data due to an estimate of . All the considered models showed comparable AIC values for both species, indicating a similar level of model fit. However, the parameter estimates of , and for each model produced conflicting results. We believe this is a problem of model identifiability, similar to the one described Royle (2006) and Link (2003). Royle (2006) concluded that this problem is more pronounced in occupancy models when the probability of detection is low, which is the case for these two species.
The next two species, common yellow-throat and tree sparrow, show middle estimates of , where both Wald type confidence interval does not include or (Web Table 14 ). The results for model comparisons here can be summarized as similar to Example 1, particularly for zero-inflated models. In terms of AIC, the ZINc model performs better than other models.
The last species, song sparrow, has a high estimate of of , which is close to . In fact, the upper limit of its confidence interval exceeds the upper bound of . In this case, the standard errors of and for the ZINc model are relatively high, indicating that estimation uncertainty increases as approaches . This phenomenon was also observed in Simulation Study B.
In Web Table 14, we also report bootstrap values for the likelihood ratio tests to the hypotheses of ZIB and ZIN models, which suggest significant evidence against the null hypotheses for the last three species of BBS data.
6 Discussion
The Nmixture model has been extended to allow for the number of individuals at a sample site to vary with each visit. In this extension, the number of individuals at a site per visit is treated as a latent variable that is decomposed into two components, with only one of these components being considered in the original Nmixture model. It should be noted that the extended Nmixture model is only applicable to closed populations (single season) due to the assumption of constant . While it is possible to relax this assumption using regression models with the site and/or visit covariates, it is unclear whether this approach is effective for some multiseasonal data.
Unlike the Nmixture model, the occupancy of the Nmixture model may not be well defined by the abundance parameter. To address this, we propose a zero-inflated Nmixture model that uses a zero-inflated probability to define occupancy explicitly. This new model unifies the commonly used standard occupancy and zero-inflated Nmixture models into one framework. As a result, we discover interesting properties of both models, such as the use of standard occupancy estimators under the zero-inflated Nmixture model or vice versa.
Our extension models offer greater flexibility in fitting data, but they also increase the complexity of the model. These models may also experience issues such as numerical instability, parameter identifiability, and model identification problems, particularly when data is sparse, such as in cases of low detection probability, low abundance, few replicates of visits, or a small sample of sites. These issues have been observed in both simulation studies and data analysis. Indeed, these problems were initially raised in Nmixture models (Dennis et al., 2015; Barker et al., 2018; Link et al., 2018) as a caution against their use. Our extension models require an additional community parameter, which may exacerbate these problems. However, these issues may be mitigated by incorporating additional capture-recapture data (Barker et al., 2018) or including covariates in the models (Link et al., 2018). Further research is necessary to improve the estimation in these contexts.
There are several potential avenues for extending this work, each of which would require further implementation and investigation.
-
•
Using the negative binomial distribution to model abundance instead of the Poisson distribution. This distribution is often more appropriate for biological and ecological data analysis, but it also includes an additional aggregation parameter, increasing model complexity and estimation difficulty.
-
•
Examining count data instead of occurrence data. This would require a likelihood function without an explicit form, making estimation computationally intensive, particularly for large datasets.
-
•
Incorporating covariates into the models. Including covariates is important for real-world applications. This can be done by using a log link for abundance and a logit link for detection and occupancy probability. Incorporating species covariates for the community parameter in a joint species model may also be meaningful.
-
•
Extending to time-to-detection data. Time-to-detection occupancy models have been developed recently, which also show some advantages over occurrence data models (Priyadarshani et al., 2022). Strebel et al. (2021) propose an Nmixture time-to-detection occupancy model that enables estimating the abundance without marking individuals. An analogous Nmixture occupancy model for time-to-detection data is currently under development.
Acknowledgements
This work was supported by the National Science and Technology Council of Taiwan.
References
- Barker et al. (2018) Barker, R.J., Schofield, M.R., Link, W.A., and Sauer, J.R. (2018). On the reliability of mixture models for count data. Biometrics, 74, 369--377.
- Dail and Madsen (2011) Dail, D. and Madsen, L. (2011). Models for estimating abundance from repeated counts of an open metapopulation. Biometrics, 67, 577--587.
- Dénes et al. (2015) Dénes, F.V., Silveira, L.F., and Beissinger, S.R. (2015). Estimating abundance of unmarked animal populations: accounting for imperfect detection and other sources of zero inflation. Methods in Ecology and Evolution, 6, 543--556.
- Dennis et al. (2015) Dennis, E.B., Morgan, B.J., and Ridout, M.S. (2015). Computational aspects of Nmixture models. Biometrics, 71, 237--246.
- Duarte et al. (2018) Duarte, A., Adams, M.J., and Peterson, J.T. (2018). Fitting Nmixture models to count data with unmodeled heterogeneity: Bias, diagnostics, and alternative approaches. Ecological Modelling, 374, 51--59.
- Gomez et al. (2018) Gomez, J.P., Robinson, S.K., Blackburn, J.K., and Ponciano, J.M. (2018). An efficient extension of Nmixture models for multi-species abundance estimation. Methods in Ecology and Evolution, 9, 340--353.
- Haines (2016a) Haines, L.M. (2016a). A note on the Royle--Nichols model for repeated detection--nondetection data. Journal of Agricultural, Biological, and Environmental Statistics, 21, 588--598.
- Haines (2016b) Haines, L.M. (2016b). Maximum likelihood estimation for Nmixture models. Biometrics, 72, 1235--1245.
- Hayes and Monfils (2015) Hayes, D.B. and Monfils, M.J. (2015). Occupancy modeling of bird point counts: Implications of mobile animals. The Journal of Wildlife Management, 79, 1361--1368.
- Hogg et al. (2021) Hogg, S.E., Wang, Y., and Stone, L. (2021). Effectiveness of joint species distribution models in the presence of imperfect detection. Methods in Ecology and Evolution, 12, 1458--1474.
- Johnson et al. (2013) Johnson, D.S., Conn, P.B., Hooten, M.B., Ray, J.C., and Pond, B.A. (2013). Spatial occupancy models for large data sets. Ecology, 94, 801--808.
- Joseph et al. (2009) Joseph, L.N., Elkin, C., Martin, T.G, and Possingham, H.P (2009). Modeling abundance using Nmixture models: the importance of considering ecological mechanisms. Ecological Applications, 19, 631--642.
- Karavarsamis and Huggins (2020) Karavarsamis, N. and Huggins, R. (2020). Two-stage approaches to the analysis of occupancy data I: The homogeneous case. Communications in Statistics - Theory and Methods, 49, 4751--4761.
- Ke et al. (2022) Ke, A., Sollmann, R., Frishkoff, L.O., and Karp, D.S. (2022). A hierarchical Nmixture model to estimate behavioral variation and a case study of neotropical birds. Ecological Applications, 32, e2632.
- Kendall and White (2009) Kendall, W.L. and While, G.C. (2009). A cautionary note on substituting spatial subunits for repeated temporal sampling in studies of site occupancy. Journal of Applied Ecology, 46, 1182--1188.
- Kendall et al. (2013) Kendall, W.L., Hines, J.E., Nichols, J.D., and Grant, E.H.C. (2013). Relaxing the closure assumption in occupancy models: staggered arrival and departure times. Ecology, 94, 610--617.
- Link (2003) Link, W.A. (2003). Nonidentifiability of population size from capture-recapture data with heterogeneous detection probabilities. Biometrics, 59, 1123--1130
- Link et al. (2018) Link, W.A., Schofield, M.R., Barker, R.J., and Sauer, J.R. (2018). On the robustness of Nmixture models. Ecology, 99, 1547--1551.
- MacKenzie et al. (2002) MacKenzie, D.I., Nichols, J.D., Lachman, G.B., Droege, S., Royle, J.A., and Langtimm, C.A. (2002). Estimating site occupancy rates when detection probabilities are less than one. Ecology, 83, 2248--2255.
- MacKenzie et al. (2009) MacKenzie, D.I., Nichols, J.D., Seamans, M.E., and Gutiérrez, R.J. (2009). Modeling species occurrence dynamics with multiple states and imperfect detection. Ecology, 90, 823--835.
- MacKenzie et al. (2017) MacKenzie, D.I., Nichols, J.D., Royle, J.A., Pollock, K.H., Bailey, L.L., and Hines, J. E. (2017). Occupancy estimation and modeling: inferring patterns and dynamics of species occurrence, 2nd Edition. London: Elsevier/Academic Press.
- Otto et al. (2013) Otto, C.R.V., Bailey, L.L., and Roloff, G.J. (2013). Improving species occupancy estimation when sampling violates the closure assumption. Ecography, 36, 1299--1309.
- Priyadarshani et al. (2022) Priyadarshani, D., Altwegg, R., Lee, A.T.K., and Hwang, W.H. (2022). What can occupancy models gain from time-to-detection data? Ecology, 103, e3832.
- R Core Team (2022) R Core Team (2022). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.
- Royle and Nichols (2003) Royle, J.A. and Nichols, J.D. (2003). Estimating abundance from repeated presence-absence data or point counts. Ecology, 84, 777--790.
- Royle (2004) Royle, J.A. (2004). Nmixture models for estimating population size from spatially replicated counts. Biometrics, 60, 108--115.
- Royle (2006) Royle, J.A. (2006). Site occupancy models with heterogeneous detection probabilities. Biometrics, 62, 97--102.
- Royle and Dorazio (2008) Royle, J.A. and Dorazio, R.M. (2008). Hierarchical modeling and inference in ecology: The analysis of data from populations, metapopulations and communities. Elsevier/Academic Press.
- Self and Liang (1987) Self, S.G., and Liang, K.L. (1987). Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. Journal of the American Statistical Association, 82, 605--610.
- Strebel et al. (2021) Strebel, N., Fiss, C.J., Kellner, K.F., Larkin, J.L., Kéry, M., and Cohen, J. (2021). Estimating abundance based on time-to-detection data. Methods in Ecology and Evolution, 12, 909--920.
- Zielinski et al. (2005) Zielinski, W.J., Truex, R.L., Schlexer, F.V., Campbell, L.A., and Carroll, C. (2005). Historical and contemporary distributions of carnivores in forests of the Sierra Nevada, California, USA. Journal of Biogeography, 32, 1385--1407.