We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess.IV

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Electrical Engineering and Systems Science > Image and Video Processing

Title: Robust Deep Neural Object Detection and Segmentation for Automotive Driving Scenario with Compressed Image Data

Abstract: Deep neural object detection or segmentation networks are commonly trained with pristine, uncompressed data. However, in practical applications the input images are usually deteriorated by compression that is applied to efficiently transmit the data. Thus, we propose to add deteriorated images to the training process in order to increase the robustness of the two state-of-the-art networks Faster and Mask R-CNN. Throughout our paper, we investigate an autonomous driving scenario by evaluating the newly trained models on the Cityscapes dataset that has been compressed with the upcoming video coding standard Versatile Video Coding (VVC). When employing the models that have been trained with the proposed method, the weighted average precision of the R-CNNs can be increased by up to 3.68 percentage points for compressed input images, which corresponds to bitrate savings of nearly 48 %.
Comments: Originally submitted at IEEE ISCAS 2021
Subjects: Image and Video Processing (eess.IV)
ACM classes: I.4.2
Journal reference: IEEE International Symposium on Circuits and Systems (ISCAS) 2021
DOI: 10.1109/ISCAS51556.2021.9401621
Cite as: arXiv:2205.06501 [eess.IV]
  (or arXiv:2205.06501v1 [eess.IV] for this version)

Submission history

From: Kristian Fischer [view email]
[v1] Fri, 13 May 2022 08:17:52 GMT (7524kb,D)

Link back to: arXiv, form interface, contact.