We gratefully acknowledge support from
the Simons Foundation and member institutions.

Multimedia

Authors and titles for recent submissions

[ total of 23 entries: 1-10 | 11-20 | 21-23 ]
[ showing 10 entries per page: fewer | more | all ]

Thu, 9 May 2024

[1]  arXiv:2405.05170 [pdf, other]
Title: Picking watermarks from noise (PWFN): an improved robust watermarking model against intensive distortions
Comments: Accepted by ICME2024
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[2]  arXiv:2405.04963 [pdf, other]
Title: Audio Matters Too! Enhancing Markerless Motion Capture with Audio Signals for String Performance Capture
Comments: SIGGRAPH2024
Subjects: Multimedia (cs.MM)
[3]  arXiv:2405.05244 (cross-list from eess.AS) [pdf, other]
Title: SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan
Comments: Evaluation plan of the SVDD Challenge @ SLT 2024
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Sound (cs.SD)
[4]  arXiv:2405.05130 (cross-list from cs.CV) [pdf, other]
Title: Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence Detection
Comments: Accepted by ICME 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[5]  arXiv:2405.05039 (cross-list from cs.CV) [pdf, other]
Title: Reviewing Intelligent Cinematography: AI research for camera-based video production
Comments: For researchers and cinematographers. 43 pages including Table of Contents, List of Figures and Tables. We obtained permission to use Figures 5 and 11. All other Figures have been drawn by us
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)

Wed, 8 May 2024

[6]  arXiv:2405.04279 [pdf, other]
Title: Task Presentation and Human Perception in Interactive Video Retrieval
Subjects: Multimedia (cs.MM)
[7]  arXiv:2405.04097 (cross-list from cs.CV) [pdf, other]
Title: Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG); Multimedia (cs.MM)
[8]  arXiv:2405.03920 (cross-list from cs.CL) [pdf, other]
Title: A Roadmap for Multilingual, Multimodal Domain Independent Deception Detection
Comments: 6 pages, 1 figure, shorter version in SIAM International Conference on Data Mining (SDM) 2024
Journal-ref: Proc. SDM 2024, 396-399
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multimedia (cs.MM)

Tue, 7 May 2024 (showing first 2 of 3 entries)

[9]  arXiv:2405.03500 [pdf, other]
Title: A Rate-Distortion-Classification Approach for Lossy Image Compression
Authors: Yuefeng Zhang
Comments: 15 pages
Journal-ref: Digital Signal Processing Volume 141, September 2023, 104163
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[10]  arXiv:2405.03436 (cross-list from cs.CV) [pdf, other]
Title: DBDH: A Dual-Branch Dual-Head Neural Network for Invisible Embedded Regions Localization
Comments: 7 pages, 6 figures (Have been accepted by IJCNN 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[ total of 23 entries: 1-10 | 11-20 | 21-23 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)