Title: Soft Truncation: A Universal Training Technique of Score-based Diffusion Model for High Precision Score Estimation

Abstract: Recent advances in diffusion models have brought state-of-the-art performance on image generation tasks. However, empirical results from previous research on diffusion models imply an inverse correlation between density estimation performance and sample generation performance. This paper provides empirical evidence that this inverse correlation arises because density estimation is dominated by small diffusion times, whereas sample generation mainly depends on large diffusion times. Training a score network well across the entire diffusion time range is demanding, however, because the loss scale is significantly imbalanced across diffusion times. For successful training, we therefore introduce Soft Truncation, a universally applicable training technique for diffusion models that softens the fixed, static truncation hyperparameter into a random variable. In experiments, Soft Truncation achieves state-of-the-art performance on the CIFAR-10, CelebA, CelebA-HQ 256x256, and STL-10 datasets.
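
The core idea of replacing a fixed truncation hyperparameter with a random variable can be illustrated with a minimal Python sketch. This is not the authors' implementation. The abstract does not state how the truncation is distributed, so the log-uniform draw below, the bounds `eps_min` and `eps_max`, and all function names are assumptions made purely for illustration.

```python
import math
import random


def sample_truncation(eps_min, eps_max, rng):
    """Draw a truncation level from [eps_min, eps_max].

    Assumption: a log-uniform distribution, chosen only for illustration.
    """
    log_eps = rng.uniform(math.log(eps_min), math.log(eps_max))
    # Clamp to guard against floating-point round-off in exp(log(x)).
    return min(max(math.exp(log_eps), eps_min), eps_max)


def sample_times_fixed(batch_size, eps, T, rng):
    """Baseline: diffusion times drawn from [eps, T] with one fixed eps."""
    return [rng.uniform(eps, T) for _ in range(batch_size)]


def sample_times_soft(batch_size, eps_min, eps_max, T, rng):
    """Softened variant: draw a fresh truncation level for each mini-batch,
    then draw that batch's diffusion times from [eps, T]."""
    eps = sample_truncation(eps_min, eps_max, rng)
    return eps, [rng.uniform(eps, T) for _ in range(batch_size)]


rng = random.Random(0)
eps, times = sample_times_soft(8, 1e-5, 1e-1, 1.0, rng)
```

Under this sketch, some mini-batches see a small truncation level and emphasize small diffusion times, while others see a larger one and concentrate on larger diffusion times. The score network and loss weighting are deliberately left out, since the abstract gives no detail on them.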
Comments: 28 pages, 16 figures, 15 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Journal reference: International Conference on Machine Learning (ICML) 2022
Cite as: arXiv:2106.05527 [cs.LG]
  (or arXiv:2106.05527v5 [cs.LG] for this version)

Submission history

From: Dongjun Kim
[v1] Thu, 10 Jun 2021 06:30:16 GMT (21348kb,D)
[v2] Sun, 8 Aug 2021 01:25:58 GMT (22050kb,D)
[v3] Thu, 16 Sep 2021 01:55:45 GMT (22049kb,D)
[v4] Fri, 15 Apr 2022 10:55:05 GMT (19299kb,D)
[v5] Sat, 11 Jun 2022 02:36:37 GMT (19166kb,D)