UnlearnCanvas: A Stylized Image Dataset to Benchmark
Machine Unlearning for Diffusion Models

Code: https://github.com/OPTML-Group/UnlearnCanvas

The rapid advancement of diffusion models (DMs) has not only transformed various real- world industries but has also introduced negative societal concerns, including the generation of harmful content, copyright disputes, and the rise of stereotypes and biases. To mitigate these issues, machine unlearning (MU) has emerged as a potential solution, demonstrating its ability to remove undesired generative capabilities of DMs in various applications. However, by examining existing MU evaluation methods, we uncover several key challenges that can result in incomplete, inaccurate, or biased evaluations for MU in DMs.

To address them, we enhance the evaluation metrics for MU, including the introduction of an often-overlooked retainability measurement for DMs post-unlearning. Additionally, we introduce UnlearnCanvas, a comprehensive high-resolution stylized image dataset that facilitates us to evaluate the unlearning of artistic painting styles in conjunction with associated image objects.

We show that this dataset plays a pivotal role in establishing a standardized and automated evaluation framework for MU techniques on DMs, featuring 7 quantitative metrics to address various aspects of unlearning effectiveness. Through extensive experiments, we benchmark 5 state-of- the-art MU methods, revealing novel insights into their pros and cons, and the underlying unlearning mechanisms. Furthermore, we demonstrate the potential of UnlearnCanvas to benchmark other generative modeling tasks, such as style transfer.

[Other Related Benchmarks]

UnlearnDiff Benchmark: an evaluation benchmark built upon adversarial attacks (also referred to as adversarial prompts), in order to discern the trustworthiness of these safety-driven unlearned DMs.

Method	Style-UA	Style-IRA	Style-CRA	Object-UA	Object-IRA	Object-CRA	FID	Time (s)	Memory (GB)	Storage (GB)
SalUn	98.58%	80.97%	93.96%	92.15%	55.78%	44.23%	131.37	6163	17.8	4.3

Method	Style-UA	Style-IRA	Style-CRA	Object-UA	Object-IRA	Object-CRA	FID	Time (s)	Memory (GB)	Storage (GB)
ESD	98.58%	80.97%	93.96%	92.15%	55.78%	44.23%	65.55	6163	17.8	4.3
FMN	88.48%	56.77%	46.60%	45.64%	90.63%	73.46%	131.37	350	17.9	4.2
UCE	98.40%	60.22%	47.71%	94.31%	39.35%	34.67%	182.01	434	5.1	1.7
CA	60.82%	96.01%	92.70%	46.67%	90.11%	81.97%	54.21	734	10.1	4.2
SalUn	86.26%	90.39%	95.08%	86.91%	96.35%	99.59%	61.05	667	30.8	4

Method	Style-UA	Style-IRA	Style-CRA	Object-UA	Object-IRA	Object-CRA	FID	Time (s)	Memory (GB)	Storage (GB)
ESD	98.58%	80.97%	93.96%	92.15%	55.78%	44.23%	65.55	6163	17.8	4.3
FMN	88.48%	56.77%	46.60%	45.64%	90.63%	73.46%	131.37	350	17.9	4.2
UCE	98.40%	60.22%	47.71%	94.31%	39.35%	34.67%	182.01	434	5.1	1.7
CA	60.82%	96.01%	92.70%	46.67%	90.11%	81.97%	54.21	734	10.1	4.2
SalUn	86.26%	90.39%	95.08%	86.91%	96.35%	99.59%	61.05	667	30.8	4

Copy the following snippet to cite these results

UnlearnCanvas: A Stylized Image Dataset to Benchmark
Machine Unlearning for Diffusion Models

Evaluation Queue for the UnlearnCanvas Benchmark.

Context

Evaluated MU Methods

Metrics

Impact Statement

Other Related Benchmarks

Contact

UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models

Evaluation Queue for the UnlearnCanvas Benchmark.

Context

Evaluated MU Methods

Metrics

Impact Statement

Other Related Benchmarks

Contact

UnlearnCanvas: A Stylized Image Dataset to Benchmark
Machine Unlearning for Diffusion Models