We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess.IV

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Electrical Engineering and Systems Science > Image and Video Processing

Title: Interpreting CNN for Low Complexity Learned Sub-pixel Motion Compensation in Video Coding

Abstract: Deep learning has shown great potential in image and video compression tasks. However, it brings bit savings at the cost of significant increases in coding complexity, which limits its potential for implementation within practical applications. In this paper, a novel neural network-based tool is presented which improves the interpolation of reference samples needed for fractional precision motion compensation. Contrary to previous efforts, the proposed approach focuses on complexity reduction achieved by interpreting the interpolation filters learned by the networks. When the approach is implemented in the Versatile Video Coding (VVC) test model, up to 4.5% BD-rate saving for individual sequences is achieved compared with the baseline VVC, while the complexity of learned interpolation is significantly reduced compared to the application of full neural network.
Comments: 27th IEEE International Conference on Image Processing, 25-28 Oct 2020, Abu Dhabi, United Arab Emirates
Subjects: Image and Video Processing (eess.IV); Computational Complexity (cs.CC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
Journal reference: 2020 IEEE International Conference on Image Processing (ICIP), 2020, pp. 798-802
DOI: 10.1109/ICIP40778.2020.9191193
Cite as: arXiv:2006.06392 [eess.IV]
  (or arXiv:2006.06392v1 [eess.IV] for this version)

Submission history

From: Luka Murn [view email]
[v1] Thu, 11 Jun 2020 13:10:20 GMT (1487kb,D)

Link back to: arXiv, form interface, contact.