Authors
Joerg Drechsler, Anna-Carolina Haensch
Publication date
2024
Journal
Statistical Science
Publisher
https://www.e-publications.org/ims/submission/STS/user/submissionFile/57906?confirm=1907e523
Description
The idea to generate synthetic data as a tool for broadening access to sensitive microdata has been proposed for the first time three decades ago. While first applications of the idea emerged around the turn of the century, the approach really gained momentum over the last ten years, stimulated at least in parts by some recent developments in computer science. We consider the 30th jubilee of Rubin’s seminal paper on synthetic data (J. Off. Stat. 9 (1993) 462–468) as an opportunity to look back at the historical developments but also to offer a review of the diverse approaches and methodological underpinnings proposed over the years. We will also discuss the various strategies that have been suggested to measure the utility and remaining risk of disclosure of the generated data.
Total citations
2023202415
Scholar articles
J Drechsler, AC Haensch - Statistical Science, 2024