We gratefully acknowledge support from
the Simons Foundation and member institutions.

Multimedia

Authors and titles for recent submissions

[ total of 14 entries: 1-14 ]
[ showing up to 25 entries per page: fewer | more ]

Fri, 9 Jun 2023

[1]  arXiv:2306.05241 [pdf, other]
Title: Two Heads Are Better Than One: Improving Fake News Video Detection by Correlating with Neighbors
Comments: To appear in ACL 2023 Findings
Subjects: Multimedia (cs.MM)
[2]  arXiv:2306.05268 (cross-list from cs.LG) [pdf, other]
Title: Factorized Contrastive Learning: Going Beyond Multi-view Redundancy
Comments: Code available at: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)

Thu, 8 Jun 2023

[3]  arXiv:2306.04202 [pdf, other]
Title: Video Compression with Arbitrary Rescaling Network
Comments: Accepted as a one-page poster by 2023 Data Compression Conference (DCC). This is the full paper
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[4]  arXiv:2306.04628 (cross-list from cs.SD) [pdf, other]
Title: Systematic Analysis of Music Representations from BERT
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[5]  arXiv:2306.04345 (cross-list from cs.CV) [pdf, other]
Title: An Overview of Challenges in Egocentric Text-Video Retrieval
Comments: 4 pages, CVPR 2023 Joint Ego4D&EPIC Workshop, Extended Abstract
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Multimedia (cs.MM)

Wed, 7 Jun 2023

[6]  arXiv:2306.03873 [pdf]
Title: Pivotuner: automatic real-time pure intonation and microtonal modulation
Authors: Dmitri Volkov
Comments: 5 pages, associated files and additional information available at this https URL
Subjects: Multimedia (cs.MM)
[7]  arXiv:2306.03395 [pdf, other]
Title: Computational Technologies for Fashion Recommendation: A Survey
Subjects: Multimedia (cs.MM); Information Retrieval (cs.IR)
[8]  arXiv:2306.03718 (cross-list from cs.SD) [pdf, other]
Title: Emotion-Conditioned Melody Harmonization with Hierarchical Variational Autoencoder
Authors: Shulei Ji, Xinyu Yang
Comments: Accepted by IEEE SMC 2023
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[9]  arXiv:2306.03403 (cross-list from cs.CV) [pdf, other]
Title: SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation
Comments: Accepted by IJCAI 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)

Tue, 6 Jun 2023

[10]  arXiv:2306.02898 (cross-list from cs.CV) [pdf, other]
Title: Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[11]  arXiv:2306.02623 (cross-list from cs.CV) [pdf, other]
Title: Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding Models
Comments: SIGIR 2023. The code and datasets for our Do-GOOD benchmark can be found at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)

Mon, 5 Jun 2023

[12]  arXiv:2306.01304 (cross-list from cs.SD) [pdf, other]
Title: JEPOO: Highly Accurate Joint Estimation of Pitch, Onset and Offset for Music Information Retrieval
Comments: This paper has been accepted by IJCAI 2023; 11 pages, 6 figures
Subjects: Sound (cs.SD); Information Retrieval (cs.IR); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[13]  arXiv:2306.01081 (cross-list from cs.CV) [pdf, other]
Title: 4DSR-GCN: 4D Video Point Cloud Upsampling using Graph Convolutional Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[14]  arXiv:2306.01016 (cross-list from cs.CL) [pdf, other]
Title: PV2TEA: Patching Visual Modality to Textual-Established Information Extraction
Comments: ACL 2023 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[ total of 14 entries: 1-14 ]
[ showing up to 25 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2306, contact, help  (Access key information)