Multimedia

Authors and titles for recent submissions

[ total of 12 entries: 1-12 ]

Wed, 1 Dec 2021

[1]  arXiv:2111.15078 (cross-list from cs.CV) [pdf, other]
Title: SketchEdit: Mask-Free Local Image Manipulation with Partial Sketches
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)

Tue, 30 Nov 2021

[2]  arXiv:2111.14448 (cross-list from cs.CV) [pdf, other]
Title: AVA-AVD: Audio-visual Speaker Diarization in the Wild
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[3]  arXiv:2111.14267 (cross-list from cs.CV) [pdf, other]
Title: Explore the Potential Performance of Vision-and-Language Navigation Model: a Snapshot Ensemble Method
Comments: 7 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[4]  arXiv:2111.14237 (cross-list from eess.IV) [pdf, other]
Title: Data-independent Low-complexity KLT Approximations for Image and Video Coding
Comments: 18 pages, 15 figures, 4 tables
Subjects: Image and Video Processing (eess.IV); Multimedia (cs.MM); Signal Processing (eess.SP); Numerical Analysis (math.NA); Computation (stat.CO)
[5]  arXiv:2111.14080 (cross-list from cs.NI) [pdf, other]
Title: Empirical Conditional Mean: A New Method of Predicting Throughput in Uplink Data Network
Authors: Weijia Zheng
Comments: 5 pages, 7 figures
Subjects: Networking and Internet Architecture (cs.NI); Multimedia (cs.MM)
[6]  arXiv:2111.13945 (cross-list from cs.CV) [pdf, other]
Title: Calibrated Feature Decomposition for Generalizable Person Re-Identification
Comments: Technical report, code available
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)

Mon, 29 Nov 2021

[7]  arXiv:2111.13486 (cross-list from cs.CY) [pdf, other]
Title: When Creators Meet the Metaverse: A Survey on Computational Arts
Comments: Submitted to ACM Computing Surveys, 36 pages
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[8]  arXiv:2111.13042 (cross-list from eess.IV) [pdf, other]
Title: DeepJSCC-Q: Channel Input Constrained Deep Joint Source-Channel Coding
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Multimedia (cs.MM); Signal Processing (eess.SP)
[9]  arXiv:2111.13034 (cross-list from eess.IV) [pdf, other]
Title: DeepWiVe: Deep-Learning-Aided Wireless Video Transmission
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Multimedia (cs.MM); Signal Processing (eess.SP)
[10]  arXiv:2111.12727 (cross-list from cs.CV) [pdf, other]
Title: Universal Captioner: Long-Tail Vision-and-Language Model Training through Content-Style Separation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)

Thu, 25 Nov 2021

[11]  arXiv:2111.12663 [pdf, ps, other]
Title: PointPCA: Point Cloud Objective Quality Assessment Using PCA-Based Descriptors
Comments: 14 pages, 9 figures, 6 tables
Subjects: Multimedia (cs.MM)

Wed, 24 Nov 2021

[12]  arXiv:2111.11952 (cross-list from cs.CV) [pdf, other]
Title: Leveraging Selective Prediction for Reliable Image Geolocation
Comments: Accepted to the 28th International Conference on MultiMedia Modeling (MMM' 22)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
