Multimedia

Authors and titles for cs.MM in Aug 2021

[ total of 73 entries; showing 1-25 ]
[1]  arXiv:2108.00054 [pdf, ps, other]
Title: A Point-to-Distribution Joint Geometry and Color Metric for Point Cloud Quality Assessment
Comments: This paper has been accepted for publication in IEEE Workshop on Multimedia Signal Processing
Subjects: Multimedia (cs.MM); Image and Video Processing (eess.IV)
[2]  arXiv:2108.00262 [pdf, other]
Title: Speech2AffectiveGestures: Synthesizing Co-Speech Gestures with Generative Adversarial Affective Expression Learning
Comments: 11 pages, 4 figures, 2 tables. Proceedings of the 29th ACM International Conference on Multimedia, October 20-24, 2021, Virtual Event, China
Subjects: Multimedia (cs.MM); Machine Learning (cs.LG)
[3]  arXiv:2108.00970 [pdf, other]
Title: Is there a "language of music-video clips" ? A qualitative and quantitative study
Subjects: Multimedia (cs.MM)
[4]  arXiv:2108.01612 [pdf, ps, other]
Title: An Efficient Digital Watermarking Algorithm Based on DCT and BCH Error Correcting Code
Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR); Information Theory (cs.IT)
[5]  arXiv:2108.01809 [pdf, other]
Title: What's Wrong with the Bottom-up Methods in Arbitrary-shape Scene Text Detection
Comments: Accepted by Trans. on Multimedia
Subjects: Multimedia (cs.MM)
[6]  arXiv:2108.02515 [pdf, other]
Title: Multi-clue reconstruction of sharing chains for social media images
Subjects: Multimedia (cs.MM)
[7]  arXiv:2108.03914 [pdf, other]
Title: Two-pronged Strategy: Lightweight Augmented Graph Network Hashing for Scalable Image Retrieval
Subjects: Multimedia (cs.MM)
[8]  arXiv:2108.04187 [pdf, other]
Title: Scaling New Peaks: A Viewership-centric Approach to Automated Content Curation
Subjects: Multimedia (cs.MM); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[9]  arXiv:2108.08083 [pdf, other]
Title: Promoting Mental Well-Being for Audiences in a Live-Streaming Game by Highlight-Based Bullet Comments
Journal-ref: 2021 IEEE 10th Global Conference on Consumer Electronics (GCCE 2021)
Subjects: Multimedia (cs.MM)
[10]  arXiv:2108.08112 [pdf, other]
Title: Fighting Game Commentator with Pitch and Loudness Adjustment Utilizing Highlight Cues
Journal-ref: 2021 IEEE 10th Global Conference on Consumer Electronics (GCCE 2021)
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI)
[11]  arXiv:2108.08985 [pdf, other]
Title: Metaverse for Social Good: A University Campus Prototype
Journal-ref: Proceedings of the 29th ACM International Conference on Multimedia (MM '21), October 20--24, 2021, Virtual Event, China
Subjects: Multimedia (cs.MM); Human-Computer Interaction (cs.HC)
[12]  arXiv:2108.09479 [pdf, other]
Title: Grid-VLP: Revisiting Grid Features for Vision-Language Pre-training
Subjects: Multimedia (cs.MM); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[13]  arXiv:2108.10509 [pdf, other]
Title: Improving Fake News Detection by Using an Entity-enhanced Framework to Fuse Diverse Multimodal Clues
Comments: To appear in MM 2021 industrial track (long paper)
Subjects: Multimedia (cs.MM)
[14]  arXiv:2108.11627 [pdf, ps, other]
Title: Towards Robust Mispronunciation Detection and Diagnosis for L2 English Learners with Accent-Modulating Methods
Comments: Accepted by ASRU 2021
Subjects: Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[15]  arXiv:2108.00139 (cross-list from cs.CV) [pdf, ps, other]
Title: Pose-Guided Feature Learning with Knowledge Distillation for Occluded Person Re-Identification
Comments: ACM MM 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[16]  arXiv:2108.00378 (cross-list from cs.SD) [pdf, other]
Title: SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours
Comments: Proceedings of the 22nd International Society for Music Information Retrieval Conference, ISMIR 2021
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[17]  arXiv:2108.00500 (cross-list from cs.SD) [pdf, other]
Title: End to End Bangla Speech Synthesis
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[18]  arXiv:2108.00679 (cross-list from cs.CV) [pdf, other]
Title: Multimodal Feature Fusion for Video Advertisements Tagging Via Stacking Ensemble
Comments: 1st place in ACM Multimedia Multimodal Video Ads Tagging Competition (2021 Tencent Advertising Algorithm Competition)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[19]  arXiv:2108.00705 (cross-list from cs.CV) [pdf, other]
Title: Efficient Deep Feature Calibration for Cross-Modal Joint Embedding Learning
Comments: accepted by ACM ICMI 2021 conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[20]  arXiv:2108.00871 (cross-list from cs.CV) [pdf, other]
Title: Constrained Graphic Layout Generation via Latent Optimization
Comments: Accepted by ACM Multimedia 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[21]  arXiv:2108.01056 (cross-list from cs.CV) [pdf, other]
Title: Distributed Attention for Grounded Image Captioning
Comments: mm21
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[22]  arXiv:2108.01374 (cross-list from cs.SD) [pdf, other]
Title: EMOPIA: A Multi-Modal Pop Piano Dataset For Emotion Recognition and Emotion-based Music Generation
Comments: The paper has been accepted for publication at ISMIR 2021
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[23]  arXiv:2108.02050 (cross-list from cs.CV) [pdf, other]
Title: ICECAP: Information Concentrated Entity-aware Image Captioning
Comments: 9 pages, 7 figures, ACM MM 2020
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[24]  arXiv:2108.02059 (cross-list from cs.CV) [pdf, other]
Title: Question-controlled Text-aware Image Captioning
Comments: 10 pages, 8 figures, to appear in ACM MM 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[25]  arXiv:2108.02432 (cross-list from cs.CV) [pdf, ps, other]
Title: Token Shift Transformer for Video Classification
Comments: ACM Multimedia 2021, 9 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)