Denoising Diffusion Gamma Models

Nachmani, Eliya; Roman, Robin San; Wolf, Lior

Full-text links:

Download:

Current browse context:

eess.SP

< prev | next >

new | recent | 2110

Electrical Engineering and Systems Science > Signal Processing

Title: Denoising Diffusion Gamma Models

Authors: Eliya Nachmani, Robin San Roman, Lior Wolf

(Submitted on 10 Oct 2021)

Abstract: Generative diffusion processes are an emerging and effective tool for image and speech generation. In the existing methods, the underlying noise distribution of the diffusion process is Gaussian noise. However, fitting distributions with more degrees of freedom could improve the performance of such generative models. In this work, we investigate other types of noise distribution for the diffusion process. Specifically, we introduce the Denoising Diffusion Gamma Model (DDGM) and show that noise from Gamma distribution provides improved results for image and speech generation. Our approach preserves the ability to efficiently sample state in the training diffusion process while using Gamma noise.

Comments:	arXiv admin note: substantial text overlap with arXiv:2106.07582
Subjects:	Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
Cite as:	arXiv:2110.05948 [eess.SP]
	(or arXiv:2110.05948v1 [eess.SP] for this version)

Submission history

From: Eliya Nachmani [view email]
[v1] Sun, 10 Oct 2021 10:46:31 GMT (1556kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2110.05948

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Signal Processing

Title: Denoising Diffusion Gamma Models

Submission history