We gratefully acknowledge support from
the Simons Foundation and member institutions.

Multimedia

Authors and titles for recent submissions

[ total of 19 entries: 1-19 ]
[ showing up to 25 entries per page: fewer | more ]

Tue, 12 Nov 2019

[1]  arXiv:1911.04139 [pdf, other]
Title: Pano: Optimizing 360° Video Streaming with a Better Understanding of Quality Perception
Comments: 16 pages, 18 figures, Sigcomm conference
Subjects: Multimedia (cs.MM)
[2]  arXiv:1911.03974 [pdf, other]
Title: A Multimodal CNN-based Tool to Censure Inappropriate Video Scenes
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI)
[3]  arXiv:1911.03793 [pdf, other]
Title: A Robust Blind 3-D Mesh Watermarking based on Wavelet Transform for Copyright Protection
Comments: 6 pages, 3 figures, International Conference on Advanced Technologies for Signal and Image Processing (ATSIP'2017)
Subjects: Multimedia (cs.MM)

Mon, 11 Nov 2019

[4]  arXiv:1911.03100 (cross-list from cs.CV) [pdf, other]
Title: Extracting temporal features into a spatial domain using autoencoders for sperm video analysis
Comments: 3 pages, 1 figure, MediaEval 19, 27-29 October 2019, Sophia Antipolis, France
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Image and Video Processing (eess.IV)

Thu, 7 Nov 2019

[5]  arXiv:1911.02360 [pdf, other]
Title: Reversible Adversarial Examples based on Reversible Image Transformation
Authors: Hua Wang, Zhaoxia Yin
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[6]  arXiv:1911.02475 (cross-list from eess.IV) [pdf, other]
Title: Unimodal-uniform Constrained Wasserstein Training for Medical Diagnosis
Comments: ICCV VRMI workshop Oral. arXiv admin note: text overlap with arXiv:1911.00962
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[7]  arXiv:1911.02172 (cross-list from cs.CV) [pdf]
Title: Interpretable Self-Attention Temporal Reasoning for Driving Behavior Understanding
Comments: Submitted to IEEE ICASSP 2020; Pytorch code will be released soon
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[8]  arXiv:1911.02103 (cross-list from cs.CV) [pdf, other]
Title: Recurrent Instance Segmentation using Sequences of Referring Expressions
Comments: 3rd NeurIPS Workshop on Visually Grounded Interaction and Language (ViGIL, 2019)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)

Wed, 6 Nov 2019

[9]  arXiv:1911.01699 [pdf, other]
Title: Reversible Data Hiding in Encrypted Images based on Pixel Prediction and Bit-plane Compression
Subjects: Multimedia (cs.MM)
[10]  arXiv:1911.01840 (cross-list from eess.AS) [pdf, other]
Title: Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems
Subjects: Audio and Speech Processing (eess.AS); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD)
[11]  arXiv:1911.01806 (cross-list from eess.AS) [pdf, other]
Title: Mixture factorized auto-encoder for unsupervised hierarchical deep factorization of speech signal
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD)

Tue, 5 Nov 2019

[12]  arXiv:1911.01355 [pdf, ps, other]
Title: Video-based compression for plenoptic point clouds
Comments: 10 pages, 4 figures
Subjects: Image and Video Processing (eess.IV); Multimedia (cs.MM)
[13]  arXiv:1911.00812 [pdf, other]
Title: Adaptive Rate Allocation for View-Aware Point-Cloud Streaming
Comments: Technical Report, University of Illinois at Urbana-Champaign (UIUC), September 2017, 5 pages
Subjects: Multimedia (cs.MM)
[14]  arXiv:1911.00772 [pdf]
Title: Robustness and Imperceptibility Enhancement in Watermarked Images by Color Transformation
Comments: 5 pages 3 figures
Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[15]  arXiv:1911.00753 [pdf, other]
Title: Hybrid blind robust image watermarking technique based on DFT-DCT and Arnold transform
Comments: 34 page, 17 figures, published in Multimedia Tools and Applications Springer, 2018
Journal-ref: Multimedia Tools and Applications, 77(20), 27181-27214 (2018)
Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR)
[16]  arXiv:1911.00682 [pdf, other]
Title: FCEM: A Novel Fast Correlation Extract Model For Real Time Steganalysis of VoIP Stream via Multi-head Attention
Comments: 5 pages, 2 figures
Subjects: Multimedia (cs.MM)
[17]  arXiv:1911.00639 [pdf, ps, other]
Title: A Generalized Rate-Distortion-$λ$ Model Based HEVC Rate Control Algorithm
Subjects: Multimedia (cs.MM)
[18]  arXiv:1911.00962 (cross-list from cs.CV) [pdf, other]
Title: Conservative Wasserstein Training for Pose Estimation
Comments: ICCV 2019
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[19]  arXiv:1911.00713 (cross-list from cs.CV) [pdf, other]
Title: Visual Relationship Detection with Relative Location Mining
Comments: Accepted to ACM MM 2019
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[ total of 19 entries: 1-19 ]
[ showing up to 25 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 1911, contact, help  (Access key information)