Modelling binary qPCR data

This project was part-funded by NERC (project NE/T010045/1) as part of the Landscape Decisions theme.

Summary

This work developeds hierarchical occupancy models for qPCR detection/non-detection data that explicitly account for imperfect observation in DNA-based surveys. The models separate the ecological process of species presence from the observation process, allowing for both false negative errors (failure to detect DNA when present) and false positive errors (detection due to contamination or other sources).

A key contribution is the formulation of a two-stage observation model that distinguishes between errors arising during sample collection and those arising during laboratory analysis. This framework allows the use of replicate qPCR measurements to quantify detection probabilities and error rates, rather than relying on arbitrary thresholds for declaring presence.

The approach also incorporates environmental covariates and Bayesian variable selection to identify drivers of species occurrence, and provides a principled basis for estimating occupancy at large spatial scales. Subsequent work extends the framework to improve survey design, assess reliability of national-scale monitoring programmes, and provide accessible implementation through an RShiny application.

Papers

Code