Multimedia

Authors and titles for cs.MM in Jan 2023

[ total of 55 entries: 1-25 | 26-50 | 51-55 ]
[ showing 25 entries per page: fewer | more | all ]

[1] arXiv:2301.00254 [pdf, other]: Title: Depression Diagnosis and Analysis via Multimodal Multi-order Factor Fusion

Authors: Chengbo Yuan, Qianhui Xu, Yong Luo

Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2] arXiv:2301.00726 [pdf, other]: Title: 3-D Markerless Tracking of Human Gait by Geometric Trilateration of Multiple Kinects

Authors: Lin Yang, Bowen Yang, Haiwei Dong, Abdulmotaleb El Saddik

Journal-ref: IEEE Systems Journal, vol. 12, no. 2, pp. 1393-1403, 2018

Subjects: Multimedia (cs.MM)
[3] arXiv:2301.01134 [pdf, other]: Title: Ring That Bell: A Corpus and Method for Multimodal Metaphor Detection in Videos

Authors: Khalid Alnajjar, Mika Hämäläinen, Shuo Zhang

Comments: Figlang 2022

Subjects: Multimedia (cs.MM); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[4] arXiv:2301.01420 [pdf, ps, other]: Title: Improved CNN Prediction Based Reversible Data Hiding

Authors: Yingqiang Qiu, Wanli Peng, Xiaodan Lin, Huanqiang Zeng, Zhenxing Qian

Subjects: Multimedia (cs.MM); Image and Video Processing (eess.IV)
[5] arXiv:2301.02363 [pdf, other]: Title: Text2Poster: Laying out Stylized Texts on Retrieved Images

Authors: Chuhao Jin, Hongteng Xu, Ruihua Song, Zhiwu Lu

Comments: 5 pages, Accepted to ICASSP 2022

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[6] arXiv:2301.05541 [pdf, other]: Title: From Ember to Blaze: Swift Interactive Video Adaptation via Meta-Reinforcement Learning

Authors: Xuedou Xiao, Mingxuan Yan, Yingying Zuo, Boxi Liu, Paul Ruan, Yang Cao, Wei Wang

Comments: 9 pages, 13 figures

Subjects: Multimedia (cs.MM)
[7] arXiv:2301.06375 [pdf, other]: Title: OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset

Authors: Jeongkyun Park, Jung-Wook Hwang, Kwanghee Choi, Seung-Hyun Lee, Jun Hwan Ahn, Rae-Hong Park, Hyung-Min Park

Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD)
[8] arXiv:2301.06876 [pdf, other]: Title: CS-lol: a Dataset of Viewer Comment with Scene in E-sports Live-streaming

Authors: Junjie H. Xu, Yu Nakano, Lingrong Kong, Kojiro Iizuka

Comments: 5 pages, 3 figures, In ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR 23)

Subjects: Multimedia (cs.MM); Machine Learning (cs.LG)
[9] arXiv:2301.07681 [pdf, other]: Title: Reduced-Reference Quality Assessment of Point Clouds via Content-Oriented Saliency Projection

Authors: Wei Zhou, Guanghui Yue, Ruizeng Zhang, Yipeng Qin, Hantao Liu

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[10] arXiv:2301.07740 [pdf, other]: Title: The Metaverse from a Multimedia Communications Perspective

Authors: Haiwei Dong, Jeannie S. A. Lee

Journal-ref: IEEE Multimedia Magazine, vol. 29, no. 4, pp. 123-127, 2022

Subjects: Multimedia (cs.MM); Networking and Internet Architecture (cs.NI)
[11] arXiv:2301.09080 [pdf, other]: Title: Dance2MIDI: Dance-driven multi-instruments music generation

Authors: Bo Han, Yuheng Li, Yixuan Shen, Yi Ren, Feilin Han

Comments: has been accepted by Computational Visual Media Journal

Subjects: Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[12] arXiv:2301.11648 [pdf, ps, other]: Title: Top-down and bottom-up approaches to video Quality of Experience studies; overview and proposal of a new model

Authors: Kamil Koniuch, Sabina Baraković, Jasmina Baraković Husić, Katrien De Moor, Lucjan Janowski, Michał Wierzchoń

Comments: 35 pages, 2 figures, preprint submitted to review

Subjects: Multimedia (cs.MM)
[13] arXiv:2301.12191 [pdf, other]: Title: Multi-resolution encoding and optimization for next generation video compression

Authors: Vignesh V Menon

Comments: Degree project in Electrical Engineering, Second Cycle, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology (16 October 2020)

Subjects: Multimedia (cs.MM)
[14] arXiv:2301.12831 [pdf, other]: Title: M3FAS: An Accurate and Robust MultiModal Mobile Face Anti-Spoofing System

Authors: Chenqi Kong, Kexin Zheng, Yibing Liu, Shiqi Wang, Anderson Rocha, Haoliang Li

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[15] arXiv:2301.13523 [pdf, other]: Title: Towards Better Quality of Experience in HTTP Adaptive Streaming

Authors: Babak Taraghi, Selina Zoë Haack, Christian Timmerer

Subjects: Multimedia (cs.MM)
[16] arXiv:2301.13617 [pdf, other]: Title: A Closer Look into Recent Video-based Learning Research: A Comprehensive Review of Video Characteristics, Tools, Technologies, and Learning Effectiveness

Authors: Evelyn Navarrete, Andreas Nehring, Sascha Schanze, Ralph Ewerth, Anett Hoppe

Subjects: Multimedia (cs.MM)
[17] arXiv:2301.00965 (cross-list from cs.CV) [pdf, other]: Title: OccluMix: Towards De-Occlusion Virtual Try-on by Semantically-Guided Mixup

Authors: Zhijing Yang, Junyang Chen, Yukai Shi, Hao Li, Tianshui Chen, Liang Lin

Comments: To be published in IEEE T-MM; Code is available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[18] arXiv:2301.01904 (cross-list from cs.CY) [pdf, ps, other]: Title: Piloting Virtual Reality Photo-Based Tours among Students of a Filipino Language Class: A Case of Emergency Remote Teaching in Japan

Authors: Roberto Bacani Figueroa Jr., Florinda Amparo Adarayan Palma Gil, Hiroshi Taniguchi

Comments: 25 pages including appendices

Journal-ref: Avant: trends in interdisciplinary studies 13(1) (2022)

Subjects: Computers and Society (cs.CY); Multimedia (cs.MM)
[19] arXiv:2301.01949 (cross-list from cs.CL) [pdf, other]: Title: SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout Graph

Authors: Yuxing Long, Binyuan Hui, Fulong Ye, Yanyang Li, Zhuoxin Han, Caixia Yuan, Yongbin Li, Xiaojie Wang

Comments: AAAI 2023

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[20] arXiv:2301.03127 (cross-list from cs.CL) [pdf, other]: Title: Logically at Factify 2: A Multi-Modal Fact Checking System Based on Evidence Retrieval techniques and Transformer Encoder Architecture

Authors: Pim Jordi Verschuuren, Jie Gao, Adelize van Eeden, Stylianos Oikonomou, Anil Bandhakavi

Comments: Accepted in AAAI'23: Second Workshop on Multimodal Fact-Checking and Hate Speech Detection, February 2023, Washington, DC, USA

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[21] arXiv:2301.03829 (cross-list from cs.LG) [pdf, other]: Title: From Plate to Prevention: A Dietary Nutrient-aided Platform for Health Promotion in Singapore

Authors: Kaiping Zheng, Thao Nguyen, Jesslyn Hwei Sing Chong, Charlene Enhui Goh, Melanie Herschel, Hee Hoon Lee, Changshuo Liu, Beng Chin Ooi, Wei Wang, James Yip

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Multimedia (cs.MM)
[22] arXiv:2301.03992 (cross-list from cs.CV) [pdf, other]: Title: Vision Transformers Are Good Mask Auto-Labelers

Authors: Shiyi Lan, Xitong Yang, Zhiding Yu, Zuxuan Wu, Jose M. Alvarez, Anima Anandkumar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[23] arXiv:2301.04366 (cross-list from cs.CL) [pdf, other]: Title: Multimodal Inverse Cloze Task for Knowledge-based Visual Question Answering

Authors: Paul Lerner, Olivier Ferret, Camille Guinaudeau

Comments: Accepted at ECIR 2023

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG); Multimedia (cs.MM)
[24] arXiv:2301.05174 (cross-list from cs.IR) [pdf, other]: Title: Scene-centric vs. Object-centric Image-Text Cross-modal Retrieval: A Reproducibility Study

Authors: Mariya Hendriksen, Svitlana Vakulenko, Ernst Kuiper, Maarten de Rijke

Comments: 18 pages, accepted as a reproducibility paper at ECIR 2023

Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[25] arXiv:2301.05908 (cross-list from cs.SD) [pdf, other]: Title: An Order-Complexity Model for Aesthetic Quality Assessment of Symbolic Homophony Music Scores

Authors: Xin Jin, Wu Zhou, Jinyu Wang, Duo Xu, Yiqing Rong, Shuai Cui

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)

[ total of 55 entries: 1-25 | 26-50 | 51-55 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2404, contact, help (Access key information)

> cs > cs.MM

Multimedia

Authors and titles for cs.MM in Jan 2023