Abstract:
|
Capture-Recapture (CRC), also know as Multiple System Estimation (MSE) is a statistical methodology using multiple independent samples to estimate the size of an entire population. It is typically employed to estimate the size of hard-to-count populations. Originally developed by ecologists to estimate animal populations, capture-recapture / MSE has become an important analytic tool for social justice. In recent years, applications have grown to estimate the number of people affected by a wide variety of problems, including human trafficking, victims of discrimination, bias in machine learning algorithms, the growth of hate speech in social media, and natural disasters. In this method, careful design, development, and management of the underlying database are critical tasks. This paper demonstrates development and management of databases for capture-recapture / MSE analysis, including database organization, integrating additional data sources, addressing privacy issues, and database management and governance. The presentation then describes a fully Bayesian approach to Capture-recapture / MSE to improve estimation accuracy by leveraging multiple data sources.
|