Optimizing DDPM Sampling with Shortcut Fine-Tuning

Fan, Ying; Lee, Kangwook

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2301

Change to browse by:

Computer Science > Machine Learning

Title: Optimizing DDPM Sampling with Shortcut Fine-Tuning

Authors: Ying Fan, Kangwook Lee

(Submitted on 31 Jan 2023 (v1), last revised 24 May 2023 (this version, v3))

Abstract: In this study, we propose Shortcut Fine-Tuning (SFT), a new approach for addressing the challenge of fast sampling of pretrained Denoising Diffusion Probabilistic Models (DDPMs). SFT advocates for the fine-tuning of DDPM samplers through the direct minimization of Integral Probability Metrics (IPM), instead of learning the backward diffusion process. This enables samplers to discover an alternative and more efficient sampling shortcut, deviating from the backward diffusion process. Inspired by a control perspective, we propose a new algorithm SFT-PG: Shortcut Fine-Tuning with Policy Gradient, and prove that under certain assumptions, gradient descent of diffusion models with respect to IPM is equivalent to performing policy gradient. To our best knowledge, this is the first attempt to utilize reinforcement learning (RL) methods to train diffusion models. Through empirical evaluation, we demonstrate that our fine-tuning method can further enhance existing fast DDPM samplers, resulting in sample quality comparable to or even surpassing that of the full-step model across various datasets.

Comments:	ICML 2023
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2301.13362 [cs.LG]
	(or arXiv:2301.13362v3 [cs.LG] for this version)

Submission history

From: Ying Fan [view email]
[v1] Tue, 31 Jan 2023 01:37:48 GMT (1104kb,D)
[v2] Wed, 1 Feb 2023 22:16:04 GMT (1105kb,D)
[v3] Wed, 24 May 2023 08:28:13 GMT (6740kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2301.13362

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Optimizing DDPM Sampling with Shortcut Fine-Tuning

Submission history