We gratefully acknowledge support from
the Simons Foundation and member institutions.

Multimedia

Authors and titles for cs.MM in May 2022

[ total of 61 entries: 1-25 | 26-50 | 51-61 ]
[ showing 25 entries per page: fewer | more | all ]
[1]  arXiv:2205.00132 [pdf, other]
Title: Learn to Understand Negation in Video Retrieval
Comments: Accepted by ACMMM2022
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[2]  arXiv:2205.01583 [pdf, other]
Title: An Explore of Virtual Reality for Awareness of the Climate Change Crisis: A Simulation of Sea Level Rise
Comments: Published in 8th International Conference of the Immersive Learning Research Network (iLRN 2022)
Subjects: Multimedia (cs.MM); Human-Computer Interaction (cs.HC)
[3]  arXiv:2205.03595 [pdf, ps, other]
Title: $λ$-domain VVC Rate Control Based on Game Theory
Subjects: Multimedia (cs.MM); Multiagent Systems (cs.MA)
[4]  arXiv:2205.03684 [pdf, other]
Title: Timestamp-independent Haptic-Visual Synchronization
Subjects: Multimedia (cs.MM)
[5]  arXiv:2205.03782 [pdf, ps, other]
Title: SSIM-Variation-Based Complexity Optimization for Versatile Video Coding
Subjects: Multimedia (cs.MM); Multiagent Systems (cs.MA)
[6]  arXiv:2205.04906 [pdf, other]
Title: Evaluating the Impact of Tiled User-Adaptive Real-Time Point Cloud Streaming on VR Remote Communication
Subjects: Multimedia (cs.MM)
[7]  arXiv:2205.05177 [pdf, other]
Title: ConfLab: A Rich Multimodal Multisensor Dataset of Free-Standing Social Interactions in the Wild
Comments: v2 is the version submitted to Neurips 2022 Datasets and Benchmarks Track
Subjects: Multimedia (cs.MM); Machine Learning (cs.LG)
[8]  arXiv:2205.05880 [pdf, other]
Title: Deep Decomposition and Bilinear Pooling Network for Blind Night-Time Image Quality Evaluation
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[9]  arXiv:2205.08007 [pdf, other]
Title: Perceptual Evaluation on Audio-visual Dataset of 360 Content
Comments: 6 pages, 5 figures, International Conference on Multimedia and Expo 2022
Subjects: Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
[10]  arXiv:2205.08738 [pdf, other]
Title: Passive Defense Against 3D Adversarial Point Clouds Through the Lens of 3D Steganalysis
Authors: Jiahao Zhu
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[11]  arXiv:2205.08866 [pdf, other]
Title: Seeing Sounds, Hearing Shapes: a gamified study to evaluate sound-sketches
Comments: Accepted at International Computer Music Conference (ICMC) 2022
Subjects: Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[12]  arXiv:2205.10649 [pdf, other]
Title: Towards the Effects of Alignment Edits on the Quality of Experience of 360 Videos
Comments: 14 pages, 13 figures, 4 tables
Subjects: Multimedia (cs.MM)
[13]  arXiv:2205.10815 [pdf, other]
Title: Recent Advances in Rate Control: From Optimization to Implementation and Beyond
Subjects: Multimedia (cs.MM)
[14]  arXiv:2205.11825 [pdf, ps, other]
Title: A Rate Control Algorithm for Video-based Point Cloud Compression
Authors: Fangyu Shen, Wei Gao
Comments: 5 pages, 3 figures, 4 tables
Journal-ref: 2021 International Conference on Visual Communications and Image Processing (VCIP)
Subjects: Multimedia (cs.MM)
[15]  arXiv:2205.00694 (cross-list from cs.CV) [pdf, other]
Title: A Multi-stage deep architecture for summary generation of soccer videos
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[16]  arXiv:2205.00941 (cross-list from cs.SD) [pdf]
Title: Music Interpretation Analysis. A Multimodal Approach To Score-Informed Resynthesis of Piano Recordings
Comments: PhD Thesis. Author: F. Simonetta; tutor: S. Ntalampiras; co-tutor: F. Avanzini; Universit\`a degli studi di Milano - Dipartimento di Informatica "Giovanni Degli Antoni", 2022 Apr 22
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[17]  arXiv:2205.01155 (cross-list from cs.CV) [pdf, other]
Title: Emotion-Controllable Generalized Talking Face Generation
Comments: Accepted at IJCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[18]  arXiv:2205.01917 (cross-list from cs.CV) [pdf, other]
Title: CoCa: Contrastive Captioners are Image-Text Foundation Models
Comments: Preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[19]  arXiv:2205.01989 (cross-list from cs.CL) [pdf, other]
Title: MM-Claims: A Dataset for Multimodal Claim Detection in Social Media
Comments: Accepted to Findings of NAACL 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Social and Information Networks (cs.SI)
[20]  arXiv:2205.02456 (cross-list from cs.CV) [pdf, other]
Title: Declaration-based Prompt Tuning for Visual Question Answering
Comments: Accepted to IJCAI2022, data and codes are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[21]  arXiv:2205.02538 (cross-list from cs.CV) [pdf, other]
Title: Parametric Reshaping of Portraits in Videos
Journal-ref: MM'21: Proceedings of the 29th ACM International Conference on MultimediaOctober 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[22]  arXiv:2205.03297 (cross-list from cs.IR) [pdf, other]
Title: Implicit semantic-based personalized micro-videos recommendation
Authors: Bo Liu
Subjects: Information Retrieval (cs.IR); Multimedia (cs.MM)
[23]  arXiv:2205.03534 (cross-list from cs.CL) [pdf, other]
Title: Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[24]  arXiv:2205.03923 (cross-list from cs.CV) [pdf, other]
Title: Unsupervised Discovery and Composition of Object Light Fields
Comments: Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[25]  arXiv:2205.04029 (cross-list from cs.SD) [pdf, other]
Title: Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis
Comments: Accepted by Interspeech
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[ total of 61 entries: 1-25 | 26-50 | 51-61 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2210, contact, help  (Access key information)