Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 3 - Data Privacy: Statisticians’ Perspective
Type: Invited
Date/Time: Sunday, August 8, 2021 : 1:30 PM to 3:20 PM
Sponsor: SSC (Statistical Society of Canada)
Abstract #315519
Title: Methods for Synthesizing Clinical Data
Author(s): Khaled El Emam* and Lucy Mosquera
Companies: University of Ottawa and Replica Analytics Ltd.
Keywords: data synthesis; synthetic data; data privacy; data protection; hipaa; gdpr

Synthetic data is increasingly being generated to meet the rising demand for clinical data, especially in the context of the COVID-19 pandemic. In this presentation we will discuss various generative methods that can scale down to small datasets, as well as scale up to large databases, and present a framework for evaluating the utility of synthetic data. The methods will be demonstrated using real world datasets covering different clinical and public health domains. Generative methods that will be discussed include statistical machine learning and deep learning models suitable for cross-sectional and longitudinal datasets, with comparisons of their various strengths and weaknesses. In practice, multiple models need to be used to accommodate the heterogeneity of real world data. In terms of applications, generative models enable the sharing of data in a privacy protective manner, and also can be used as data simulators to augment small datasets and for complex predictions.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program