Computer Vision and Pattern Recognition

Authors and titles for cs.CV in Dec 2021

[ total of 1570 entries: 1-1570 ]

[1] arXiv:2112.00011 [pdf, other]: Title: Predicting Poverty Level from Satellite Imagery using Deep Neural Networks

Authors: Varun Chitturi, Zaid Nabulsi

Comments: 14 pages, 5 Figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[2] arXiv:2112.00050 [pdf, other]: Title: Pattern-Aware Data Augmentation for LiDAR 3D Object Detection

Authors: Jordan S.K. Hu, Steven L. Waslander

Comments: Published paper in the IEEE Intelligent Transportation Systems Conference - ITSC 2021

Journal-ref: 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), 2021, pp. 2703-2710

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[3] arXiv:2112.00054 [pdf, other]: Title: Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data

Authors: Samarth Mishra, Rameswar Panda, Cheng Perng Phoo, Chun-Fu Chen, Leonid Karlinsky, Kate Saenko, Venkatesh Saligrama, Rogerio S. Feris

Comments: Accepted to CVPR'22

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[4] arXiv:2112.00061 [pdf, other]: Title: Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources

Authors: Sahar Abdelnabi, Rakibul Hasan, Mario Fritz

Comments: CVPR'22

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[5] arXiv:2112.00065 [pdf, other]: Title: Boosting EfficientNets Ensemble Performance via Pseudo-Labels and Synthetic Images by pix2pixHD for Infection and Ischaemia Classification in Diabetic Foot Ulcers

Authors: Louise Bloch, Raphael Brüngel, Christoph M. Friedrich

Comments: Accepted for Workshop Proceedings of the Diabetic Foot Ulcers Challenge (DFUC) as part of the 2021 24th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[6] arXiv:2112.00113 [pdf, other]: Title: Beyond Flatland: Pre-training with a Strong 3D Inductive Bias

Authors: Shubhaankar Gupta, Thomas P. O'Connell, Bernhard Egger

Comments: NeurIPS 2021 pre-registration workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[7] arXiv:2112.00166 [pdf, ps, other]: Title: TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information

Authors: Suraj Kothawade, Saikat Ghosh, Sumit Shekhar, Yu Xiang, Rishabh Iyer

Comments: To Appear In European Conference on Computer Vision (ECCV) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[8] arXiv:2112.00167 [pdf, other]: Title: Event-Based Fusion for Motion Deblurring with Cross-modal Attention

Authors: Lei Sun, Christos Sakaridis, Jingyun Liang, Qi Jiang, Kailun Yang, Peng Sun, Yaozu Ye, Kaiwei Wang, Luc Van Gool

Comments: Accepted by ECCV 2022 as oral presentation

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[9] arXiv:2112.00169 [pdf, other]: Title: 3D Photo Stylization: Learning to Generate Stylized Novel Views from a Single Image

Authors: Fangzhou Mu, Jian Wang, Yicheng Wu, Yin Li

Comments: Project page: this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[10] arXiv:2112.00180 [pdf, other]: Title: SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Editing

Authors: Jing Shi, Ning Xu, Haitian Zheng, Alex Smith, Jiebo Luo, Chenliang Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[11] arXiv:2112.00185 [pdf, other]: Title: Light Field Implicit Representation for Flexible Resolution Reconstruction

Authors: Paramanand Chandramouli, Hendrik Sommerhoff, Andreas Kolb

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[12] arXiv:2112.00202 [pdf, other]: Title: 3DVNet: Multi-View Depth Prediction and Volumetric Refinement

Authors: Alexander Rich, Noah Stier, Pradeep Sen, Tobias Höllerer

Comments: 10 pages, 6 figures, 3 tables. Accepted to 3DV 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[13] arXiv:2112.00206 [pdf, other]: Title: Querying Labelled Data with Scenario Programs for Sim-to-Real Validation

Authors: Edward Kim, Jay Shenoy, Sebastian Junges, Daniel Fremont, Alberto Sangiovanni-Vincentelli, Sanjit Seshia

Comments: pre-print

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Programming Languages (cs.PL); Robotics (cs.RO)
[14] arXiv:2112.00207 [pdf, ps, other]: Title: Improved sparse PCA method for face and image recognition

Authors: Loc Hoang Tran, Tuan Tran, An Mai

Comments: 11 pages. arXiv admin note: substantial text overlap with arXiv:1904.08496

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[15] arXiv:2112.00216 [pdf, other]: Title: PoseKernelLifter: Metric Lifting of 3D Human Pose using Sound

Authors: Zhijian Yang, Xiaoran Fan, Volkan Isler, Hyun Soo Park

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[16] arXiv:2112.00219 [pdf, other]: Title: Scalable Primitives for Generalized Sensor Fusion in Autonomous Vehicles

Authors: Sammy Sidhu, Linda Wang, Tayyab Naseer, Ashish Malhotra, Jay Chia, Aayush Ahuja, Ella Rasmussen, Qiangui Huang, Ray Gao

Comments: Presented in Machine Learning for Autonomous Driving Workshop at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia. 11 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[17] arXiv:2112.00234 [pdf, other]: Title: MC-Blur: A Comprehensive Benchmark for Image Deblurring

Authors: Kaihao Zhang, Tao Wang, Wenhan Luo, Boheng Chen, Wenqi Ren, Bjorn Stenger, Wei Liu, Hongdong Li, Ming-Hsuan Yang

Comments: To appear in IEEE TCSVT

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[18] arXiv:2112.00236 [pdf, other]: Title: VoRTX: Volumetric 3D Reconstruction With Transformers for Voxelwise View Selection and Fusion

Authors: Noah Stier, Alexander Rich, Pradeep Sen, Tobias Höllerer

Comments: 3DV 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[19] arXiv:2112.00246 [pdf, other]: Title: AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-shot Interactions

Authors: Yian Wang, Ruihai Wu, Kaichun Mo, Jiaqi Ke, Qingnan Fan, Leonidas Guibas, Hao Dong

Comments: ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[20] arXiv:2112.00250 [pdf, ps, other]: Title: Shallow Network Based on Depthwise Over-Parameterized Convolution for Hyperspectral Image Classification

Authors: Hongmin Gao, Zhonghao Chen, Chenming Li

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[21] arXiv:2112.00260 [pdf, other]: Title: Ranking Distance Calibration for Cross-Domain Few-Shot Learning

Authors: Pan Li, Shaogang Gong, Chengjie Wang, Yanwei Fu

Comments: Accepted at CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[22] arXiv:2112.00263 [pdf, other]: Title: GLocal: Global Graph Reasoning and Local Structure Transfer for Person Image Generation

Authors: Liyuan Ma, Kejie Huang, Dongxu Wei, Haibin Shen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[23] arXiv:2112.00281 [pdf, other]: Title: FDA-GAN: Flow-based Dual Attention GAN for Human Pose Transfer

Authors: Liyuan Ma, Kejie Huang, Dongxu Wei, Zhaoyan Ming, Haibin Shen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[24] arXiv:2112.00289 [pdf, other]: Title: Point Cloud Segmentation Using Sparse Temporal Local Attention

Authors: Joshua Knights, Peyman Moghadam, Clinton Fookes, Sridha Sridharan

Comments: 8 pages, 3 figures. Published at the Australasian Conference on Robotics and Automation (ACRA) 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[25] arXiv:2112.00290 [pdf, other]: Title: Unsupervised Statistical Learning for Die Analysis in Ancient Numismatics

Authors: Andreas Heinecke, Emanuel Mayer, Abhinav Natarajan, Yoonju Jung

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[26] arXiv:2112.00295 [pdf, other]: Title: Multiple Fusion Adaptation: A Strong Framework for Unsupervised Semantic Segmentation Adaptation

Authors: Kai Zhang, Yifan Sun, Rui Wang, Haichang Li, Xiaohui Hu

Comments: 13 pages, 2 figures, submitted to BMVC2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[27] arXiv:2112.00302 [pdf, other]: Title: Graph Convolutional Module for Temporal Action Localization in Videos

Authors: Runhao Zeng, Wenbing Huang, Mingkui Tan, Yu Rong, Peilin Zhao, Junzhou Huang, Chuang Gan

Comments: Accepted by T-PAMI

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[28] arXiv:2112.00317 [pdf, other]: Title: Unleashing the Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification

Authors: Zizheng Yang, Xin Jin, Kecheng Zheng, Feng Zhao

Comments: Technical report, code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[29] arXiv:2112.00319 [pdf, other]: Title: Object-Aware Cropping for Self-Supervised Learning

Authors: Shlok Mishra, Anshul Shah, Ankan Bansal, Abhyuday Jagannatha, Janit Anjaria, Abhishek Sharma, David Jacobs, Dilip Krishnan

Journal-ref: Transactions on Machine Learning Research 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[30] arXiv:2112.00322 [pdf, other]: Title: FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection

Authors: Danila Rukhovich, Anna Vorontsova, Anton Konushin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[31] arXiv:2112.00323 [pdf, other]: Title: Push Stricter to Decide Better: A Class-Conditional Feature Adaptive Framework for Improving Adversarial Robustness

Authors: Jia-Li Yin, Lehui Xie, Wanqing Zhu, Ximeng Liu, Bo-Hao Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[32] arXiv:2112.00336 [pdf, other]: Title: Multi-View Stereo with Transformer

Authors: Jie Zhu, Bo Peng, Wanqing Li, Haifeng Shen, Zhe Zhang, Jianjun Lei

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[33] arXiv:2112.00337 [pdf, other]: Title: A Unified Benchmark for the Unknown Detection Capability of Deep Neural Networks

Authors: Jihyo Kim, Jiin Koo, Sangheum Hwang

Comments: Published in ESWA (this https URL)

Journal-ref: Expert Systems with Applications (2023), Vol. 229, Part A, 120461

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[34] arXiv:2112.00342 [pdf, other]: Title: Confidence Propagation Cluster: Unleash Full Potential of Object Detectors

Authors: Yichun Shen, Wanli Jiang, Zhen Xu, Rundong Li, Junghyun Kwon, Siyi Li

Comments: Accepted by CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[35] arXiv:2112.00343 [pdf, other]: Title: Camera Motion Agnostic 3D Human Pose Estimation

Authors: Seong Hyun Kim, Sunwon Jeong, Sungbum Park, Ju Yong Chang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[36] arXiv:2112.00348 [pdf, other]: Title: Automatic travel pattern extraction from visa page stamps using CNN models

Authors: Eimantas Ledinauskas, Julius Ruseckas, Julius Marozas, Kasparas Karlauskas, Justas Terentjevas, Augustas Mačijauskas, Alfonsas Juršėnas

Comments: 15 pages, 13 figures, 4 tables, submitted for peer review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[37] arXiv:2112.00374 [pdf, other]: Title: CLIPstyler: Image Style Transfer with a Single Text Condition

Authors: Gihyun Kwon, Jong Chul Ye

Comments: CVPR 2022 camera ready

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Image and Video Processing (eess.IV)
[38] arXiv:2112.00380 [pdf, other]: Title: Deep Measurement Updates for Bayes Filters

Authors: Johannes Pankert, Maria Vittoria Minniti, Lorenz Wellhausen, Marco Hutter

Journal-ref: IEEE Robotics and Automation Letters, vol. 7, no. 1, pp. 414-421, Jan. 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[39] arXiv:2112.00384 [pdf, other]: Title: Exploration into Translation-Equivariant Image Quantization

Authors: Woncheol Shin, Gyubok Lee, Jiyoung Lee, Eunyi Lyou, Joonseok Lee, Edward Choi

Comments: ICASSP 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[40] arXiv:2112.00390 [pdf, other]: Title: SegDiff: Image Segmentation with Diffusion Probabilistic Models

Authors: Tomer Amit, Tal Shaharbany, Eliya Nachmani, Lior Wolf

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[41] arXiv:2112.00396 [pdf, other]: Title: Dyadic Human Motion Prediction

Authors: Isinsu Katircioglu, Costa Georgantas, Mathieu Salzmann, Pascal Fua

Comments: added reference for section 2

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[42] arXiv:2112.00410 [pdf, other]: Title: Rethink, Revisit, Revise: A Spiral Reinforced Self-Revised Network for Zero-Shot Learning

Authors: Zhe Liu, Yun Li, Lina Yao, Julian McAuley, Sam Dixon

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[43] arXiv:2112.00412 [pdf, other]: Title: The Majority Can Help The Minority: Context-rich Minority Oversampling for Long-tailed Classification

Authors: Seulki Park, Youngkyu Hong, Byeongho Heo, Sangdoo Yun, Jin Young Choi

Comments: Accepted by CVPR 2022, 14 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[44] arXiv:2112.00428 [pdf, other]: Title: Adv-4-Adv: Thwarting Changing Adversarial Perturbations via Adversarial Domain Adaptation

Authors: Tianyue Zheng, Zhe Chen, Shuya Ding, Chao Cai, Jun Luo

Comments: 22 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[45] arXiv:2112.00431 [pdf, other]: Title: MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions

Authors: Mattia Soldan, Alejandro Pardo, Juan León Alcázar, Fabian Caba Heilbron, Chen Zhao, Silvio Giancola, Bernard Ghanem

Comments: 12 Pages, 6 Figures, 7 Tables

Journal-ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[46] arXiv:2112.00432 [pdf, other]: Title: A benchmark with decomposed distribution shifts for 360 monocular depth estimation

Authors: Georgios Albanis, Nikolaos Zioulis, Petros Drakoulis, Federico Alvarez, Dimitrios Zarpalas, Petros Daras

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[47] arXiv:2112.00448 [pdf, other]: Title: On-Device Spatial Attention based Sequence Learning Approach for Scene Text Script Identification

Authors: Rutika Moharir, Arun D Prabhu, Sukumar Moharana, Gopi Ramena, Rachit S Munjal

Comments: Accepted for publication in CVIP 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[48] arXiv:2112.00459 [pdf, other]: Title: Information Theoretic Representation Distillation

Authors: Roy Miles, Adrian Lopez Rodriguez, Krystian Mikolajczyk

Comments: BMVC 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[49] arXiv:2112.00463 [pdf, other]: Title: The Norm Must Go On: Dynamic Unsupervised Domain Adaptation by Normalization

Authors: M. Jehanzeb Mirza, Jakub Micorek, Horst Possegger, Horst Bischof

Comments: Accepted to CVPR 2022 - Camera Ready Version - Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50] arXiv:2112.00475 [pdf, other]: Title: Weakly-Supervised Video Object Grounding via Causal Intervention

Authors: Wei Wang, Junyu Gao, Changsheng Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[51] arXiv:2112.00484 [pdf, other]: Title: Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding

Authors: Xianzheng Ma, Zhixiang Wang, Yacheng Zhan, Yinqiang Zheng, Zheng Wang, Dengxin Dai, Chia-Wen Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[52] arXiv:2112.00485 [pdf, other]: Title: Learning Transformer Features for Image Quality Assessment

Authors: Chao Zeng, Sam Kwong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[53] arXiv:2112.00492 [pdf, other]: Title: Human-Object Interaction Detection via Weak Supervision

Authors: Mert Kilickaya, Arnold Smeulders

Comments: Accepted at BMVC'21

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54] arXiv:2112.00496 [pdf, other]: Title: Revisiting the Transferability of Supervised Pretraining: an MLP Perspective

Authors: Yizhou Wang, Shixiang Tang, Feng Zhu, Lei Bai, Rui Zhao, Donglian Qi, Wanli Ouyang

Comments: Accepted by CVPR 2022. [camera ready with supplement]

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[55] arXiv:2112.00504 [pdf, other]: Title: Learning Oriented Remote Sensing Object Detection via Naive Geometric Computing

Authors: Yanjie Wang, Xu Zou, Zhijun Zhang, Wenhui Xu, Liqun Chen, Sheng Zhong, Luxin Yan, Guodong Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[56] arXiv:2112.00510 [pdf, other]: Title: Trimap-guided Feature Mining and Fusion Network for Natural Image Matting

Authors: Weihao Jiang, Dongdong Yu, Zhaozhi Xie, Yaoyi Li, Zehuan Yuan, Hongtao Lu

Comments: Accepted to Computer Vision and Image Understanding

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[57] arXiv:2112.00527 [pdf, other]: Title: Subtask-dominated Transfer Learning for Long-tail Person Search

Authors: Chuang Liu, Hua Yang, Qin Zhou, Shibao Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[58] arXiv:2112.00532 [pdf, other]: Title: FaceTuneGAN: Face Autoencoder for Convolutional Expression Transfer Using Neural Generative Adversarial Networks

Authors: Nicolas Olivier, Kelian Baert, Fabien Danieau, Franck Multon, Quentin Avril

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[59] arXiv:2112.00556 [pdf, other]: Title: Semi-Supervised Surface Anomaly Detection of Composite Wind Turbine Blades From Drone Imagery

Authors: Jack. W. Barker, Neelanjan Bhowmik, Toby. P. Breckon

Comments: In-proceedings at 2022 17th International Conference on Computer Vision Theory and Applications (VISAPP)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[60] arXiv:2112.00557 [pdf, ps, other]: Title: 3D Reconstruction Using a Linear Laser Scanner and a Camera

Authors: Rui Wang

Comments: 8 pages, 16 figures, published in The 2nd International Conference on Artificial Intelligence and Computer Engineering (ICAICE2021)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[61] arXiv:2112.00560 [pdf, other]: Title: Attribute Artifacts Removal for Geometry-based Point Cloud Compression

Authors: Xihua Sheng, Li Li, Dong Liu, Zhiwei Xiong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[62] arXiv:2112.00568 [pdf, other]: Title: Dual Spoof Disentanglement Generation for Face Anti-spoofing with Depth Uncertainty Learning

Authors: Hangtong Wu, Dan Zen, Yibo Hu, Hailin Shi, Tao Mei

Comments: Accepted to TCSVT, arXiv version. The codes are available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[63] arXiv:2112.00580 [pdf, other]: Title: Background Activation Suppression for Weakly Supervised Object Localization

Authors: Pingyu Wu, Wei Zhai, Yang Cao

Comments: Accepted by CVPR 2022. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[64] arXiv:2112.00582 [pdf, other]: Title: Transformer-based Network for RGB-D Saliency Detection

Authors: Yue Wang, Xu Jia, Lu Zhang, Yuke Li, James Elder, Huchuan Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[65] arXiv:2112.00585 [pdf, other]: Title: Neural Emotion Director: Speech-preserving semantic control of facial expressions in "in-the-wild" videos

Authors: Foivos Paraperas Papantoniou, Panagiotis P. Filntisis, Petros Maragos, Anastasios Roussos

Comments: CVPR 2022 (oral). Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[66] arXiv:2112.00599 [pdf, other]: Title: An implementation of the "Guess who?" game using CLIP

Authors: Arnau Martí Sarri, Victor Rodriguez-Fernandez

Comments: Code available at this https URL

Journal-ref: Intelligent Data Engineering and Automated Learning (IDEAL 2021). Lecture Notes in Computer Science, vol 13113

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[67] arXiv:2112.00627 [pdf, other]: Title: DeepSportLab: a Unified Framework for Ball Detection, Player Instance Segmentation and Pose Estimation in Team Sports Scenes

Authors: Seyed Abolfazl Ghasemzadeh, Gabriel Van Zandycke, Maxime Istasse, Niels Sayez, Amirafshar Moshtaghpour, Christophe De Vleeschouwer

Comments: 13 pages, 5 figures, BMVC, BMVC2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[68] arXiv:2112.00639 [pdf, other]: Title: A Systematic Review of Robustness in Deep Learning for Computer Vision: Mind the gap?

Authors: Nathan Drenkow, Numair Sani, Ilya Shpitser, Mathias Unberath

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[69] arXiv:2112.00656 [pdf, other]: Title: Object-aware Video-language Pre-training for Retrieval

Authors: Alex Jinpeng Wang, Yixiao Ge, Guanyu Cai, Rui Yan, Xudong Lin, Ying Shan, Xiaohu Qie, Mike Zheng Shou

Comments: CVPR2022; Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[70] arXiv:2112.00665 [pdf, other]: Title: Iterative Saliency Enhancement using Superpixel Similarity

Authors: Leonardo de Melo Joao, Alexandre Xavier Falcao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[71] arXiv:2112.00686 [pdf, other]: Title: CYBORG: Blending Human Saliency Into the Loss Improves Deep Learning

Authors: Aidan Boyd, Patrick Tinsley, Kevin Bowyer, Adam Czajka

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72] arXiv:2112.00690 [pdf, other]: Title: MDFM: Multi-Decision Fusing Model for Few-Shot Learning

Authors: Shuai Shao, Lei Xing, Rui Xu, Weifeng Liu, Yan-Jiang Wang, Bao-Di Liu

Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT). arXiv admin note: text overlap with arXiv:2109.07785

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[73] arXiv:2112.00694 [pdf, other]: Title: Label-Free Model Evaluation with Semi-Structured Dataset Representations

Authors: Xiaoxiao Sun, Yunzhong Hou, Hongdong Li, Liang Zheng

Comments: 10 pages, 8 figures, 3 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74] arXiv:2112.00698 [pdf, ps, other]: Title: CondenseNeXt: An Ultra-Efficient Deep Neural Network for Embedded Systems

Authors: Priyank Kalgaonkar, Mohamed El-Sharkawy

Comments: 5 pages, 3 figures, published in an IEEE Conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[75] arXiv:2112.00718 [pdf, other]: Title: Improving GAN Equilibrium by Raising Spatial Awareness

Authors: Jianyuan Wang, Ceyuan Yang, Yinghao Xu, Yujun Shen, Hongdong Li, Bolei Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[76] arXiv:2112.00719 [pdf, other]: Title: HyperInverter: Improving StyleGAN Inversion via Hypernetwork

Authors: Tan M. Dinh, Anh Tuan Tran, Rang Nguyen, Binh-Son Hua

Comments: Accepted to CVPR 2022; Project page is located at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77] arXiv:2112.00724 [pdf, other]: Title: RegNeRF: Regularizing Neural Radiance Fields for View Synthesis from Sparse Inputs

Authors: Michael Niemeyer, Jonathan T. Barron, Ben Mildenhall, Mehdi S. M. Sajjadi, Andreas Geiger, Noha Radwan

Comments: Project page available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[78] arXiv:2112.00725 [pdf, other]: Title: The Augmented Image Prior: Distilling 1000 Classes by Extrapolating from a Single Image

Authors: Yuki M. Asano, Aaqib Saeed

Comments: Accepted at ICLR'23. Webpage: this https URL, code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[79] arXiv:2112.00726 [pdf, other]: Title: MonoScene: Monocular 3D Semantic Scene Completion

Authors: Anh-Quan Cao, Raoul de Charette

Comments: Accepted at CVPR 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[80] arXiv:2112.00775 [pdf, other]: Title: Routing with Self-Attention for Multimodal Capsule Networks

Authors: Kevin Duarte, Brian Chen, Nina Shvetsova, Andrew Rouditchenko, Samuel Thomas, Alexander Liu, David Harwath, James Glass, Hilde Kuehne, Mubarak Shah

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[81] arXiv:2112.00793 [pdf, other]: Title: Using Deep Image Prior to Assist Variational Selective Segmentation Deep Learning Algorithms

Authors: Liam Burrows, Ke Chen, Francesco Torella

Comments: Presented at SIPAIM 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[82] arXiv:2112.00804 [pdf, other]: Title: PreViTS: Contrastive Pretraining with Video Tracking Supervision

Authors: Brian Chen, Ramprasaath R. Selvaraju, Shih-Fu Chang, Juan Carlos Niebles, Nikhil Naik

Comments: To be presented at WACV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83] arXiv:2112.00821 [pdf, other]: Title: FaSS-MVS -- Fast Multi-View Stereo with Surface-Aware Semi-Global Matching from UAV-borne Monocular Imagery

Authors: Boitumelo Ruf, Martin Weinmann, Stefan Hinz

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[84] arXiv:2112.00847 [pdf, other]: Title: CLAWS: Contrastive Learning with hard Attention and Weak Supervision

Authors: Jansel Herrera-Gerena, Ramakrishnan Sundareswaran, John Just, Matthew Darr, Ali Jannesari

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[85] arXiv:2112.00849 [pdf, ps, other]: Title: Interpretable Deep Learning-Based Forensic Iris Segmentation and Recognition

Authors: Andrey Kuehlkamp, Aidan Boyd, Adam Czajka, Kevin Bowyer, Patrick Flynn, Dennis Chute, Eric Benjamin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[86] arXiv:2112.00854 [pdf, other]: Title: GANORCON: Are Generative Models Useful for Few-shot Segmentation?

Authors: Oindrila Saha, Zezhou Cheng, Subhransu Maji

Comments: CVPR 2022 Camera Ready Version

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[87] arXiv:2112.00879 [pdf, other]: Title: Generating Diverse 3D Reconstructions from a Single Occluded Face Image

Authors: Rahul Dey, Vishnu Naresh Boddeti

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88] arXiv:2112.00891 [pdf, other]: Title: Event Neural Networks

Authors: Matthew Dutson, Yin Li, Mohit Gupta

Comments: Accepted to ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89] arXiv:2112.00933 [pdf, other]: Title: PartImageNet: A Large, High-Quality Dataset of Parts

Authors: Ju He, Shuo Yang, Shaokang Yang, Adam Kortylewski, Xiaoding Yuan, Jie-Neng Chen, Shuai Liu, Cheng Yang, Qihang Yu, Alan Yuille

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90] arXiv:2112.00941 [pdf, other]: Title: Generalized Closed-form Formulae for Feature-based Subpixel Alignment in Patch-based Matching

Authors: Laurent Valentin Jospin, Farid Boussaid, Hamid Laga, Mohammed Bennamoun

Comments: 29 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[91] arXiv:2112.00942 [pdf, other]: Title: On Salience-Sensitive Sign Classification in Autonomous Vehicle Path Planning: Experimental Explorations with a Novel Dataset

Authors: Ross Greer, Jason Isa, Nachiket Deo, Akshay Rangesh, Mohan M. Trivedi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[92] arXiv:2112.00948 [pdf, other]: Title: Visual-Semantic Transformer for Scene Text Recognition

Authors: Xin Tang, Yongquan Lai, Ying Liu, Yuanyuan Fu, Rui Fang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93] arXiv:2112.00953 [pdf, other]: Title: Maximum Consensus by Weighted Influences of Monotone Boolean Functions

Authors: Erchuan Zhang, David Suter, Ruwan Tennakoon, Tat-Jun Chin, Alireza Bab-Hadiashar, Giang Truong, Syed Zulqarnain Gilani

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[94] arXiv:2112.00954 [pdf, other]: Title: Temporally Resolution Decrement: Utilizing the Shape Consistency for Higher Computational Efficiency

Authors: Tianshu Xie, Xuan Cheng, Minghui Liu, Jiali Deng, Xiaomin Wang, Ming Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95] arXiv:2112.00958 [pdf, other]: Title: Hierarchical Neural Implicit Pose Network for Animation and Motion Retargeting

Authors: Sourav Biswas, Kangxue Yin, Maria Shugrina, Sanja Fidler, Sameh Khamis

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[96] arXiv:2112.00965 [pdf, other]: Title: Vision Pair Learning: An Efficient Training Framework for Image Classification

Authors: Bei Tong, Xiaoyuan Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[97] arXiv:2112.00967 [pdf, other]: Title: Relational Graph Learning for Grounded Video Description Generation

Authors: Wenqiao Zhang, Xin Eric Wang, Siliang Tang, Haizhou Shi, Haocheng Shi, Jun Xiao, Yueting Zhuang, William Yang Wang

Comments: 10 pages, 5 figures, ACM MM 2020

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[98] arXiv:2112.00969 [pdf, other]: Title: Object-Centric Unsupervised Image Captioning

Authors: Zihang Meng, David Yang, Xuefei Cao, Ashish Shah, Ser-Nam Lim

Comments: ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[99] arXiv:2112.00974 [pdf, other]: Title: Consensus Graph Representation Learning for Better Grounded Image Captioning

Authors: Wenqiao Zhang, Haochen Shi, Siliang Tang, Jun Xiao, Qiang Yu, Yueting Zhuang

Comments: 9 pages, 5 figures, AAAI 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[100] arXiv:2112.00995 [pdf, other]: Title: SwinTrack: A Simple and Strong Baseline for Transformer Tracking

Authors: Liting Lin, Heng Fan, Zhipeng Zhang, Yong Xu, Haibin Ling

Comments: 22 pages, 10 figures

Journal-ref: Advances in Neural Information Processing Systems, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[101] arXiv:2112.01001 [pdf, other]: Title: SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency

Authors: Devendra Singh Chaplot, Murtaza Dalal, Saurabh Gupta, Jitendra Malik, Ruslan Salakhutdinov

Comments: Published at NeurIPS 2021. See project webpage at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[102] arXiv:2112.01011 [pdf, other]: Title: Local Similarity Pattern and Cost Self-Reassembling for Deep Stereo Matching Networks

Authors: Biyang Liu, Huimin Yu, Yangqi Long

Comments: Accepted by AAAI-2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[103] arXiv:2112.01019 [pdf, other]: Title: Unconstrained Face Sketch Synthesis via Perception-Adaptive Network and A New Benchmark

Authors: Lin Nie, Lingbo Liu, Zhengtao Wu, Wenxiong Kang

Comments: We proposed the first medium-scale benchmark for unconstrained face sketch synthesis

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[104] arXiv:2112.01030 [pdf, other]: Title: TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework using Self-Supervised Multi-Task Learning

Authors: Linhao Qu, Shaolei Liu, Manning Wang, Zhijian Song

Comments: Accepted by the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[105] arXiv:2112.01033 [pdf, other]: Title: TBN-ViT: Temporal Bilateral Network with Vision Transformer for Video Scene Parsing

Authors: Bo Yan, Leilei Cao, Hongbin Wang

Comments: The sixth place solution for ICCV2021 VSPW Challenge

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[106] arXiv:2112.01034 [pdf, other]: Title: Leveraging Human Selective Attention for Medical Image Analysis with Limited Training Data

Authors: Yifei Huang, Xiaoxiao Li, Lijin Yang, Lin Gu, Yingying Zhu, Hirofumi Seo, Qiuming Meng, Tatsuya Harada, Yoichi Sato

Comments: BMVC 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[107] arXiv:2112.01036 [pdf, other]: Title: GANSeg: Learning to Segment by Unsupervised Hierarchical Image Generation

Authors: Xingzhe He, Bastian Wandt, Helge Rhodin

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[108] arXiv:2112.01037 [pdf, other]: Title: Inferring Prototypes for Multi-Label Few-Shot Image Classification with Word Vector Guided Attention

Authors: Kun Yan, Chenbin Zhang, Jun Hou, Ping Wang, Zied Bouraoui, Shoaib Jameel, Steven Schockaert

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[109] arXiv:2112.01038 [pdf, other]: Title: Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips

Authors: Lijin Yang, Yifei Huang, Yusuke Sugano, Yoichi Sato

Comments: BMVC 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[110] arXiv:2112.01041 [pdf, other]: Title: N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras

Authors: Junho Kim, Jaehyeok Bae, Gangin Park, Dongsu Zhang, Young Min Kim

Comments: Accepted to ICCV 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[111] arXiv:2112.01050 [pdf, other]: Title: CloudWalker: Random walks for 3D point cloud shape analysis

Authors: Adi Mesika, Yizhak Ben-Shabat, Ayellet Tal

Journal-ref: Computers & Graphics Volume 106, August 2022, Pages 110-118

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[112] arXiv:2112.01059 [pdf, other]: Title: Stronger Baseline for Person Re-Identification

Authors: Fengliang Qi, Bo Yan, Leilei Cao, Hongbin Wang

Comments: The third-place solution for ICCV2021 VIPriors Re-identification Challenge

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113] arXiv:2112.01062 [pdf, other]: Title: Syntax Customized Video Captioning by Imitating Exemplar Sentences

Authors: Yitian Yuan, Lin Ma, Wenwu Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[114] arXiv:2112.01063 [pdf, other]: Title: Automatic deforestation detectors based on frequentist statistics and their extensions for other spatial objects

Authors: Jesper Muren, Vilhelm Niklasson, Dmitry Otryakhin, Maxim Romashin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP); Methodology (stat.ME)
[115] arXiv:2112.01071 [pdf, other]: Title: Extract Free Dense Labels from CLIP

Authors: Chong Zhou, Chen Change Loy, Bo Dai

Comments: ECCV 2022 oral, project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[116] arXiv:2112.01072 [pdf, other]: Title: The Second Place Solution for ICCV2021 VIPriors Instance Segmentation Challenge

Authors: Bo Yan, Fengliang Qi, Leilei Cao, Hongbin Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[117] arXiv:2112.01073 [pdf, other]: Title: Controllable Video Captioning with an Exemplar Sentence

Authors: Yitian Yuan, Lin Ma, Jingwen Wang, Wenwu Zhu

Journal-ref: Proceedings of the 28th ACM International Conference on Multimedia, 2020, pp. 1085-1093

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[118] arXiv:2112.01085 [pdf, other]: Title: PTCT: Patches with 3D-Temporal Convolutional Transformer Network for Precipitation Nowcasting

Authors: Ziao Yang, Xiangrui Yang, Qifeng Lin

Comments: 9 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[119] arXiv:2112.01098 [pdf, other]: Title: Attention based Occlusion Removal for Hybrid Telepresence Systems

Authors: Surabhi Gupta, Ashwath Shetty, Avinash Sharma

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[120] arXiv:2112.01121 [pdf, other]: Title: "Just Drive": Colour Bias Mitigation for Semantic Segmentation in the Context of Urban Driving

Authors: Jack Stelling, Amir Atapour-Abarghouei

Comments: 2021 IEEE International Conference on Big Data (IEEE BigData 2021)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121] arXiv:2112.01135 [pdf, other]: Title: Open-set 3D Object Detection

Authors: Jun Cen, Peng Yun, Junhao Cai, Michael Yu Wang, Ming Liu

Comments: Received by 3DV 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[122] arXiv:2112.01148 [pdf, other]: Title: FIBA: Frequency-Injection based Backdoor Attack in Medical Image Analysis

Authors: Yu Feng, Benteng Ma, Jing Zhang, Shanshan Zhao, Yong Xia, Dacheng Tao

Comments: Accepted by CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[123] arXiv:2112.01155 [pdf, other]: Title: Batch Normalization Tells You Which Filter is Important

Authors: Junghun Oh, Heewon Kim, Sungyong Baik, Cheeun Hong, Kyoung Mu Lee

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[124] arXiv:2112.01161 [pdf, other]: Title: Video Frame Interpolation without Temporal Priors

Authors: Youjian Zhang, Chaoyue Wang, Dacheng Tao

Comments: Accepted by Neural Information Processing Systems (NeurIPS) 2020

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[125] arXiv:2112.01176 [pdf, other]: Title: Overcoming the Domain Gap in Neural Action Representations

Authors: Semih Günel, Florian Aymanns, Sina Honari, Pavan Ramdya, Pascal Fua

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126] arXiv:2112.01177 [pdf, other]: Title: MutualFormer: Multi-Modality Representation Learning via Cross-Diffusion Attention

Authors: Xixi Wang, Xiao Wang, Bo Jiang, Jin Tang, Bin Luo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127] arXiv:2112.01194 [pdf, other]: Title: Video-Text Pre-training with Learned Regions

Authors: Rui Yan, Mike Zheng Shou, Yixiao Ge, Alex Jinpeng Wang, Xudong Lin, Guanyu Cai, Jinhui Tang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[128] arXiv:2112.01197 [pdf, other]: Title: Sample Prior Guided Robust Model Learning to Suppress Noisy Labels

Authors: Wenkai Chen, Chuang Zhu, Yi Chen, Mengting Li, Tiejun Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[129] arXiv:2112.01314 [pdf, other]: Title: SIDNet: Learning Shading-aware Illumination Descriptor for Image Harmonization

Authors: Zhongyun Hu, Ntumba Elie Nsampi, Xue Wang, Qing Wang

Comments: Accepted by IEEE TETCI 2023. Project website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[130] arXiv:2112.01316 [pdf, other]: Title: Putting 3D Spatially Sparse Networks on a Diet

Authors: Junha Lee, Christopher Choy, Jaesik Park

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131] arXiv:2112.01330 [pdf, other]: Title: CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer

Authors: Moein Sorkhei, Yue Liu, Hossein Azizpour, Edward Azavedo, Karin Dembrower, Dimitra Ntoula, Athanasios Zouzos, Fredrik Strand, Kevin Smith

Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[132] arXiv:2112.01335 [pdf, other]: Title: Semantic-Sparse Colorization Network for Deep Exemplar-based Colorization

Authors: Yunpeng Bai, Chao Dong, Zenghao Chai, Andong Wang, Zhengzhuo Xu, Chun Yuan

Comments: Accepted by ECCV2022; 14 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133] arXiv:2112.01348 [pdf, other]: Title: 3rd Place Solution for NeurIPS 2021 Shifts Challenge: Vehicle Motion Prediction

Authors: Ching-Yu Tseng, Po-Shao Lin, Yu-Jia Liou, Kuan-Chih Huang, Winston H. Hsu

Journal-ref: Bayesian Deep Learning Workshop, NeurIPS 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[134] arXiv:2112.01349 [pdf, other]: Title: MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment

Authors: Jie Ren, Wenteng Liang, Ran Yan, Luo Mai, Shiwen Liu, Xiao Liu

Comments: accepted by ECCV2022

Journal-ref: European Conference on Computer Vision (2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[135] arXiv:2112.01360 [pdf, other]: Title: Probabilistic Approach for Road-Users Detection

Authors: G. Melotti, W. Lu, P. Conde, D. Zhao, A. Asvadi, N. Gonçalves, C. Premebida

Comments: This work has been accepted for publication as a REGULAR PAPER in the Transactions on Intelligent Transportation Systems-ITS

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[136] arXiv:2112.01390 [pdf, other]: Title: InsCLR: Improving Instance Retrieval with Self-Supervision

Authors: Zelu Deng, Yujie Zhong, Sheng Guo, Weilin Huang

Comments: Accepted by AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137] arXiv:2112.01398 [pdf, other]: Title: TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation

Authors: Tan M. Dinh, Rang Nguyen, Binh-Son Hua

Comments: Accepted to ECCV 2022; TISE toolbox is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[138] arXiv:2112.01402 [pdf, other]: Title: Iterative Contrast-Classify For Semi-supervised Temporal Action Segmentation

Authors: Dipika Singhania, Rahul Rahaman, Angela Yao

Comments: AAAI-2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[139] arXiv:2112.01422 [pdf, other]: Title: 3D-Aware Semantic-Guided Generative Model for Human Synthesis

Authors: Jichao Zhang, Enver Sangineto, Hao Tang, Aliaksandr Siarohin, Zhun Zhong, Nicu Sebe, Wei Wang

Comments: ECCV 2022. 29 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140] arXiv:2112.01426 [pdf, other]: Title: SCNet: A Generalized Attention-based Model for Crack Fault Segmentation

Authors: Hrishikesh Sharma, Prakhar Pradhan, Balamuralidhar P

Comments: Accepted at ICVGIP 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[141] arXiv:2112.01454 [pdf, other]: Title: Altering Facial Expression Based on Textual Emotion

Authors: Mohammad Imrul Jubair, Md. Masud Rana, Md. Amir Hamza, Mohsena Ashraf, Fahim Ahsan Khan, Ahnaf Tahseen Prince

Comments: Accepted in VISAPP2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[142] arXiv:2112.01455 [pdf, other]: Title: Zero-Shot Text-Guided Object Generation with Dream Fields

Authors: Ajay Jain, Ben Mildenhall, Jonathan T. Barron, Pieter Abbeel, Ben Poole

Comments: CVPR 2022. 13 pages. Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[143] arXiv:2112.01473 [pdf, other]: Title: Neural Point Light Fields

Authors: Julian Ost, Issam Laradji, Alejandro Newell, Yuval Bahat, Felix Heide

Comments: 9 pages, replacement changed font of equations

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[144] arXiv:2112.01479 [pdf, other]: Title: Learning Spatial-Temporal Graphs for Active Speaker Detection

Authors: Sourya Roy, Kyle Min, Subarna Tripathi, Tanaya Guha, Somdeb Majumdar

Comments: 10 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[145] arXiv:2112.01502 [pdf, other]: Title: Dimensions of Motion: Monocular Prediction through Flow Subspaces

Authors: Richard Strong Bowen, Richard Tucker, Ramin Zabih, Noah Snavely

Comments: Project page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[146] arXiv:2112.01503 [pdf, ps, other]: Title: Machine Learning-Based Classification Algorithms for the Prediction of Coronary Heart Diseases

Authors: Kelvin Kwakye, Emmanuel Dadzie

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[147] arXiv:2112.01504 [pdf, other]: Title: Neural Weight Step Video Compression

Authors: Mikolaj Czerkawski, Javier Cardona, Robert Atkinson, Craig Michie, Ivan Andonovic, Carmine Clemente, Christos Tachtatzis

Comments: Accepted to the pre-registration workshop at NeurIPS 2021

Journal-ref: NeurIPS 2021 workshop in pre-registration

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[148] arXiv:2112.01513 [pdf, other]: Title: OW-DETR: Open-world Detection Transformer

Authors: Akshita Gupta, Sanath Narayan, K J Joseph, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah

Comments: 16 pages, CVPR 2022 accepted

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[149] arXiv:2112.01514 [pdf, other]: Title: Self-supervised Video Transformer

Authors: Kanchana Ranasinghe, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, Michael Ryoo

Comments: Accepted to CVPR '22

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[150] arXiv:2112.01515 [pdf, other]: Title: TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Authors: Zhaoyuan Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao Li, Rong Jin

Comments: Accepted by ECCV 2022, Oral, open-sourced

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[151] arXiv:2112.01517 [pdf, other]: Title: Efficient Neural Radiance Fields for Interactive Free-viewpoint Video

Authors: Haotong Lin, Sida Peng, Zhen Xu, Yunzhi Yan, Qing Shuai, Hujun Bao, Xiaowei Zhou

Comments: SIGGRAPH Asia 2022; Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[152] arXiv:2112.01518 [pdf, other]: Title: DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

Authors: Yongming Rao, Wenliang Zhao, Guangyi Chen, Yansong Tang, Zheng Zhu, Guan Huang, Jie Zhou, Jiwen Lu

Comments: Accepted to CVPR2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[153] arXiv:2112.01520 [pdf, other]: Title: Recognizing Scenes from Novel Viewpoints

Authors: Shengyi Qian, Alexander Kirillov, Nikhila Ravi, Devendra Singh Chaplot, Justin Johnson, David F. Fouhey, Georgia Gkioxari

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[154] arXiv:2112.01521 [pdf, other]: Title: Object-aware Monocular Depth Prediction with Instance Convolutions

Authors: Enis Simsar, Evin Pınar Örnek, Fabian Manhardt, Helisa Dhamo, Nassir Navab, Federico Tombari

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[155] arXiv:2112.01522 [pdf, other]: Title: Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks

Authors: Xizhou Zhu, Jinguo Zhu, Hao Li, Xiaoshi Wu, Xiaogang Wang, Hongsheng Li, Xiaohua Wang, Jifeng Dai

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[156] arXiv:2112.01523 [pdf, other]: Title: Learning Neural Light Fields with Ray-Space Embedding Networks

Authors: Benjamin Attal, Jia-Bin Huang, Michael Zollhoefer, Johannes Kopf, Changil Kim

Comments: CVPR 2022 camera ready revision. Major changes include: 1. Additional comparison to NeX on Stanford, RealFF, Shiny datasets 2. Experiment on 360 degree lego bulldozer scene in the appendix, using Pluecker parameterization 3. Moving student-teacher results to the appendix 4. Clarity edits -- in particular, making it clear that our Stanford evaluation *does not* use subdivision

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157] arXiv:2112.01524 [pdf, other]: Title: GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras

Authors: Ye Yuan, Umar Iqbal, Pavlo Molchanov, Kris Kitani, Jan Kautz

Comments: CVPR 2022 (Oral). Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
[158] arXiv:2112.01525 [pdf, other]: Title: Co-domain Symmetry for Complex-Valued Deep Learning

Authors: Utkarsh Singhal, Yifei Xing, Stella X. Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[159] arXiv:2112.01526 [pdf, other]: Title: MViTv2: Improved Multiscale Vision Transformers for Classification and Detection

Authors: Yanghao Li, Chao-Yuan Wu, Haoqi Fan, Karttikeya Mangalam, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer

Comments: CVPR 2022 Camera Ready

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160] arXiv:2112.01527 [pdf, other]: Title: Masked-attention Mask Transformer for Universal Image Segmentation

Authors: Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar

Comments: CVPR 2022. Project page/code/models: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[161] arXiv:2112.01528 [pdf, other]: Title: A Fast Knowledge Distillation Framework for Visual Recognition

Authors: Zhiqiang Shen, Eric Xing

Comments: Our project page: this http URL, code and models are available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[162] arXiv:2112.01529 [pdf, other]: Title: BEVT: BERT Pretraining of Video Transformers

Authors: Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Yu-Gang Jiang, Luowei Zhou, Lu Yuan

Comments: To Appear at CVPR 2022, code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[163] arXiv:2112.01530 [pdf, other]: Title: StyleMesh: Style Transfer for Indoor 3D Scene Reconstructions

Authors: Lukas Höllein, Justin Johnson, Matthias Nießner

Comments: Accepted to CVPR2022; project page: this https URL ; video: this https URL ; code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164] arXiv:2112.01551 [pdf, other]: Title: D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding

Authors: Dave Zhenyu Chen, Qirui Wu, Matthias Nießner, Angel X. Chang

Comments: Project website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[165] arXiv:2112.01554 [pdf, other]: Title: Neural Head Avatars from Monocular RGB Videos

Authors: Philip-William Grassal (1), Malte Prinzler (1), Titus Leistner (1), Carsten Rother (1), Matthias Nießner (2), Justus Thies (3) ((1) Heidelberg University, (2) Technical University of Munich, (3) Max Planck Institute for Intelligent Systems)

Comments: Camera-ready revision - Video: this https URL Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[166] arXiv:2112.01573 [pdf, other]: Title: FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization

Authors: Xingchao Liu, Chengyue Gong, Lemeng Wu, Shujian Zhang, Hao Su, Qiang Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[167] arXiv:2112.01601 [pdf, other]: Title: Is RobustBench/AutoAttack a suitable Benchmark for Adversarial Robustness?

Authors: Peter Lorenz, Dominik Strassel, Margret Keuper, Janis Keuper

Comments: AAAI-22 AdvML Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[168] arXiv:2112.01609 [pdf, other]: Title: Probabilistic Tracking with Deep Factors

Authors: Fan Jiang, Andrew Marmon, Ildebrando De Courten, Marc Rasi, Frank Dellaert

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[169] arXiv:2112.01641 [pdf, other]: Title: Hamiltonian latent operators for content and motion disentanglement in image sequences

Authors: Asif Khan, Amos Storkey

Comments: Conference paper at NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[170] arXiv:2112.01646 [pdf, other]: Title: Investigating the usefulness of Quantum Blur

Authors: James R. Wootton, Marcel Pfaffhauser

Journal-ref: Proc. ISQCMC 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET); Quantum Physics (quant-ph)
[171] arXiv:2112.01651 [pdf, other]: Title: Multi-modal application: Image Memes Generation

Authors: Zhiyuan Liu, Chuanzheng Sun, Yuxin Jiang, Shiqi Jiang, Mei Ming

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[172] arXiv:2112.01683 [pdf, other]: Title: TransZero: Attribute-guided Transformer for Zero-Shot Learning

Authors: Shiming Chen, Ziming Hong, Yang Liu, Guo-Sen Xie, Baigui Sun, Hao Li, Qinmu Peng, Ke Lu, Xinge You

Comments: Accepted to AAAI'22

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[173] arXiv:2112.01686 [pdf, other]: Title: Make A Long Image Short: Adaptive Token Length for Vision Transformers

Authors: Yichen Zhu, Yuqin Zhu, Jie Du, Yi Wang, Zhicai Ou, Feifei Feng, Jian Tang

Comments: 10 pages, Technical report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174] arXiv:2112.01695 [pdf, other]: Title: Hybrid Instance-aware Temporal Fusion for Online Video Instance Segmentation

Authors: Xiang Li, Jinglu Wang, Xiao Li, Yan Lu

Comments: AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175] arXiv:2112.01697 [pdf, other]: Title: LMR-CBT: Learning Modality-fused Representations with CB-Transformer for Multimodal Emotion Recognition from Unaligned Multimodal Sequences

Authors: Ziwang Fu, Feng Liu, Hanyang Wang, Siyuan Shen, Jiahao Zhang, Jiayin Qi, Xiangling Fu, Aimin Zhou

Comments: 9 pages, Figure 2, Table 5

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[176] arXiv:2112.01698 [pdf, other]: Title: Learning to Detect Every Thing in an Open World

Authors: Kuniaki Saito, Ping Hu, Trevor Darrell, Kate Saenko

Comments: Project page is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177] arXiv:2112.01712 [pdf, other]: Title: Deep Depth from Focus with Differential Focus Volume

Authors: Fengting Yang, Xiaolei Huang, Zihan Zhou

Comments: 17 pages; CVPR2022 accepted

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178] arXiv:2112.01714 [pdf, other]: Title: Structure-Aware Multi-Hop Graph Convolution for Graph Neural Networks

Authors: Yang Li, Yuichi Tanaka

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[179] arXiv:2112.01715 [pdf, other]: Title: Self-Supervised Material and Texture Representation Learning for Remote Sensing Tasks

Authors: Peri Akiva, Matthew Purri, Matthew Leotta

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[180] arXiv:2112.01719 [pdf, other]: Title: Adaptive Poincaré Point to Set Distance for Few-Shot Classification

Authors: Rongkai Ma, Pengfei Fang, Tom Drummond, Mehrtash Harandi

Comments: Accepted at AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[181] arXiv:2112.01723 [pdf, other]: Title: Adversarial Attacks against a Satellite-borne Multispectral Cloud Detector

Authors: Andrew Du, Yee Wei Law, Michele Sasdelli, Bo Chen, Ken Clarke, Michael Brown, Tat-Jun Chin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[182] arXiv:2112.01730 [pdf, other]: Title: How to Synthesize a Large-Scale and Trainable Micro-Expression Dataset?

Authors: Yuchi Liu, Zhongdao Wang, Tom Gedeon, Liang Zheng

Comments: European Conference on Computer Vision 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183] arXiv:2112.01732 [pdf, other]: Title: MFNet: Multi-filter Directive Network for Weakly Supervised Salient Object Detection

Authors: Yongri Piao, Jian Wang, Miao Zhang, Huchuan Lu

Comments: accepted by ICCV-2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[184] arXiv:2112.01736 [pdf, other]: Title: Gesture Recognition with a Skeleton-Based Keyframe Selection Module

Authors: Yunsoo Kim, Hyun Myung

Comments: 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185] arXiv:2112.01740 [pdf, other]: Title: AirDet: Few-Shot Detection without Fine-tuning for Autonomous Exploration

Authors: Bowen Li, Chen Wang, Pranay Reddy, Seungchan Kim, Sebastian Scherer

Comments: 23 pages, 9 figures

Journal-ref: 2022 17th European Conference on Computer Vision (ECCV)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[186] arXiv:2112.01741 [pdf, other]: Title: Frame Averaging for Equivariant Shape Space Learning

Authors: Matan Atzmon, Koki Nagano, Sanja Fidler, Sameh Khamis, Yaron Lipman

Comments: Accepted to CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[187] arXiv:2112.01746 [pdf, other]: Title: MSP : Refine Boundary Segmentation via Multiscale Superpixel

Authors: Jie Zhu, Huabin Huang, Banghuai Li, Yong Liu, Leye Wang

Comments: under review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[188] arXiv:2112.01759 [pdf, other]: Title: NeRF-SR: High-Quality Neural Radiance Fields using Supersampling

Authors: Chen Wang, Xian Wu, Yuan-Chen Guo, Song-Hai Zhang, Yu-Wing Tai, Shi-Min Hu

Comments: Accepted to MM 2022. Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[189] arXiv:2112.01766 [pdf, other]: Title: Unsupervised Low-Light Image Enhancement via Histogram Equalization Prior

Authors: Feng Zhang, Yuanjie Shao, Yishi Sun, Kai Zhu, Changxin Gao, Nong Sang

Comments: submitted to IEEE Transactions on Image Processing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[190] arXiv:2112.01787 [pdf, other]: Title: Detect Faces Efficiently: A Survey and Evaluations

Authors: Yuantao Feng, Shiqi Yu, Hanyang Peng, Yan-Ran Li, Jianguo Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[191] arXiv:2112.01793 [pdf, other]: Title: A Systematic IoU-Related Method: Beyond Simplified Regression for Better Localization

Authors: Hanyang Peng, Shiqi Yu

Journal-ref: IEEE Transactions on Image Processing, Volume 30, pages 5032-5044, 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[192] arXiv:2112.01799 [pdf, other]: Title: Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation

Authors: Minghui Hu, Yujie Wang, Tat-Jen Cham, Jianfei Yang, P. N. Suganthan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[193] arXiv:2112.01800 [pdf, other]: Title: A Survey: Deep Learning for Hyperspectral Image Classification with Few Labeled Samples

Authors: Sen Jia, Shuguo Jiang, Zhijie Lin, Nanying Li, Meng Xu, Shiqi Yu

Journal-ref: Neurocomputing, Volume 448, 2021, Pages 179-204

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[194] arXiv:2112.01801 [pdf, other]: Title: Mesh Convolution with Continuous Filters for 3D Surface Parsing

Authors: Huan Lei, Naveed Akhtar, Mubarak Shah, Ajmal Mian

Comments: Accepted to TNNLS

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[195] arXiv:2112.01838 [pdf, other]: Title: Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer

Authors: Frederic Z. Zhang, Dylan Campbell, Stephen Gould

Comments: Accepted to CVPR2022. 14 pages, 14 figures and 5 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[196] arXiv:2112.01839 [src]: Title: Mind Your Clever Neighbours: Unsupervised Person Re-identification via Adaptive Clustering Relationship Modeling

Authors: Lianjie Jia, Chenyang Yu, Xiehao Ye, Tianyu Yan, Yinjie Lei, Pingping Zhang

Comments: The experimental results are not sufficient

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG); Multimedia (cs.MM)
[197] arXiv:2112.01845 [pdf, other]: Title: Semantic Map Injected GAN Training for Image-to-Image Translation

Authors: Balaram Singh Kshatriya, Shiv Ram Dubey, Himangshu Sarma, Kunal Chaudhary, Meva Ram Gurjar, Rahul Rai, Sunny Manchanda

Comments: Accepted in Fourth Workshop on Computer Vision Applications (WCVA) at ICVGIP 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[198] arXiv:2112.01873 [pdf, other]: Title: Image-to-image Translation as a Unique Source of Knowledge

Authors: Alejandro D. Mousist

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[199] arXiv:2112.01882 [pdf, other]: Title: Incremental Learning in Semantic Segmentation from Image Labels

Authors: Fabio Cermelli, Dario Fontanel, Antonio Tavera, Marco Ciccone, Barbara Caputo

Comments: To appear in CVPR 22

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[200] arXiv:2112.01900 [pdf, other]: Title: Novel Class Discovery in Semantic Segmentation

Authors: Yuyang Zhao, Zhun Zhong, Nicu Sebe, Gim Hee Lee

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[201] arXiv:2112.01901 [pdf, other]: Title: The Box Size Confidence Bias Harms Your Object Detector

Authors: Johannes Gilg, Torben Teepe, Fabian Herzog, Gerhard Rigoll

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[202] arXiv:2112.01914 [pdf, other]: Title: SGM3D: Stereo Guided Monocular 3D Object Detection

Authors: Zheyuan Zhou, Liang Du, Xiaoqing Ye, Zhikang Zou, Xiao Tan, Li Zhang, Xiangyang Xue, Jianfeng Feng

Comments: 8 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[203] arXiv:2112.01924 [pdf, other]: Title: TRNR: Task-Driven Image Rain and Noise Removal with a Few Images Based on Patch Analysis

Authors: Wu Ran, Bohong Yang, Peirong Ma, Hong Lu

Comments: 16 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[204] arXiv:2112.01926 [pdf, other]: Title: Panoptic-aware Image-to-Image Translation

Authors: Liyun Zhang, Photchara Ratsamee, Bowen Wang, Zhaojie Luo, Yuki Uranishi, Manabu Higashida, Haruo Takemura

Comments: In 2023 IEEE Winter Conference on Applications of Computer Vision (WACV)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[205] arXiv:2112.01932 [pdf, other]: Title: Multi-Content Complementation Network for Salient Object Detection in Optical Remote Sensing Images

Authors: Gongyang Li, Zhi Liu, Weisi Lin, Haibin Ling

Comments: 12 pages, 7 figures, Accepted by IEEE Transactions on Geoscience and Remote Sensing 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[206] arXiv:2112.01933 [pdf, other]: Title: Bio-inspired Polarization Event Camera

Authors: Germain Haessig, Damien Joubert, Justin Haque, Yingkai Chen, Moritz Milde, Tobi Delbruck, Viktor Gruev

Subjects: Computer Vision and Pattern Recognition (cs.CV); Instrumentation and Detectors (physics.ins-det); Optics (physics.optics)
[207] arXiv:2112.01948 [pdf, ps, other]: Title: Boosting Unsupervised Domain Adaptation with Soft Pseudo-label and Curriculum Learning

Authors: Shengjia Zhang, Tiancheng Lin, Yi Xu

Comments: 28 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[208] arXiv:2112.01970 [pdf, ps, other]: Title: Optimization of phase-only holograms calculated with scaled diffraction calculation through deep neural networks

Authors: Yoshiyuki Ishii, Tomoyoshi Shimobaba, David Blinder, Tobias Birnbaum, Peter Schelkens, Takashi Kakue, Tomoyoshi Ito

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Optics (physics.optics)
[209] arXiv:2112.01983 [pdf, other]: Title: CoNeRF: Controllable Neural Radiance Fields

Authors: Kacper Kania, Kwang Moo Yi, Marek Kowalski, Tomasz Trzciński, Andrea Tagliasacchi

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[210] arXiv:2112.01988 [pdf, other]: Title: ROCA: Robust CAD Model Retrieval and Alignment from a Single Image

Authors: Can Gümeli, Angela Dai, Matthias Nießner

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[211] arXiv:2112.02039 [pdf, other]: Title: Bridging the Gap: Point Clouds for Merging Neurons in Connectomics

Authors: Jules Berman, Dmitri B. Chklovskii, Jingpeng Wu

Comments: 10 pages, 6 figures, MIDL 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[212] arXiv:2112.02073 [pdf, other]: Title: Hierarchical Optimal Transport for Unsupervised Domain Adaptation

Authors: Mourad El Hamri, Younès Bennani, Issam Falih, Hamid Ahaggach

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[213] arXiv:2112.02082 [pdf, other]: Title: Geometry-aware Two-scale PIFu Representation for Human Reconstruction

Authors: Zheng Dong, Ke Xu, Ziheng Duan, Hujun Bao, Weiwei Xu, Rynson W.H. Lau

Comments: Accepted by NeurIPS 2022. 20 pages, 20 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[214] arXiv:2112.02091 [pdf, other]: Title: Class-agnostic Reconstruction of Dynamic Objects from Videos

Authors: Zhongzheng Ren, Xiaoming Zhao, Alexander G. Schwing

Comments: NeurIPS 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[215] arXiv:2112.02139 [pdf, other]: Title: Face Reconstruction with Variational Autoencoder and Face Masks

Authors: Rafael S. Toledo, Eric A. Antonelo

Comments: 12 pages, 7 figures, 18th Encontro Nacional de Inteligência Artificial e Computacional (ENIAC)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[216] arXiv:2112.02205 [pdf, other]: Title: Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Authors: Qiangeng Xu, Yiqi Zhong, Ulrich Neumann

Journal-ref: AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[217] arXiv:2112.02214 [pdf, other]: Title: Joint Audio-Text Model for Expressive Speech-Driven 3D Facial Animation

Authors: Yingruo Fan, Zhaojiang Lin, Jun Saito, Wenping Wang, Taku Komura

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[218] arXiv:2112.02219 [pdf, other]: Title: Transferring Unconditional to Conditional GANs with Hyper-Modulation

Authors: Héctor Laria, Yaxing Wang, Joost van de Weijer, Bogdan Raducanu

Comments: 19 pages, 20 figures, to be published in CVPRW 2022. Code at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[219] arXiv:2112.02221 [pdf, other]: Title: Orientation Aware Weapons Detection In Visual Data : A Benchmark Dataset

Authors: Nazeef Ul Haq, Muhammad Moazam Fraz, Tufail Sajjad Shah Hashmi, Muhammad Shahzad

Comments: Submitted to a journal

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[220] arXiv:2112.02225 [pdf, other]: Title: HHF: Hashing-guided Hinge Function for Deep Hashing Retrieval

Authors: Chengyin Xu, Zenghao Chai, Zhengzhuo Xu, Hongjia Li, Qiruyi Zuo, Lingyu Yang, Chun Yuan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[221] arXiv:2112.02236 [pdf, other]: Title: SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing

Authors: Yichun Shi, Xiao Yang, Yangyue Wan, Xiaohui Shen

Comments: Camera-ready for CVPR 2022. Project page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[222] arXiv:2112.02237 [pdf, other]: Title: A Triple-Double Convolutional Neural Network for Panchromatic Sharpening

Authors: Tian-Jing Zhang, Liang-Jian Deng, Ting-Zhu Huang, Jocelyn Chanussot, Gemine Vivone

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[223] arXiv:2112.02238 [pdf, other]: Title: Sphere Face Model: A 3D Morphable Model with Hypersphere Manifold Latent Space

Authors: Diqiong Jiang, Yiwei Jin, Fanglue Zhang, Zhe Zhu, Yun Zhang, Ruofeng Tong, Min Tang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[224] arXiv:2112.02244 [pdf, other]: Title: LAVT: Language-Aware Vision Transformer for Referring Image Segmentation

Authors: Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H.S. Torr

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[225] arXiv:2112.02249 [src]: Title: Dual-Flow Transformation Network for Deformable Image Registration with Region Consistency Constraint

Authors: Xinke Ma, Yibo Yang, Yong Xia, Dacheng Tao

Comments: This paper has some errors in its experimental results, so we are withdrawing it. We will upload a revised version. This paper has not been published in any journal or conference

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[226] arXiv:2112.02250 [pdf, other]: Title: Dense Extreme Inception Network for Edge Detection

Authors: Xavier Soria, Angel Sappa, Patricio Humanante, Arash Akbarinia

Comments: Manuscript published by Pattern Recognition journal in 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[227] arXiv:2112.02252 [pdf, other]: Title: Channel Exchanging Networks for Multimodal and Multitask Dense Image Prediction

Authors: Yikai Wang, Fuchun Sun, Wenbing Huang, Fengxiang He, Dacheng Tao

Comments: Accepted by TPAMI 2022. Code is available at this https URL. arXiv admin note: text overlap with arXiv:2011.05005

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[228] arXiv:2112.02259 [pdf, other]: Title: Construct Informative Triplet with Two-stage Hard-sample Generation

Authors: Chuang Zhu, Zheng Hu, Huihui Dong, Gang He, Zekuan Yu, Shangshang Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[229] arXiv:2112.02270 [pdf, ps, other]: Title: Feature-based Recognition Framework for Super-resolution Images

Authors: Jing Hu, Meiqi Zhang, Rui Zhang (School of Artificial Intelligence and Automation, HUST)

Comments: 7 pages, 2 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[230] arXiv:2112.02277 [pdf, other]: Title: BAANet: Learning Bi-directional Adaptive Attention Gates for Multispectral Pedestrian Detection

Authors: Xiaoxiao Yang, Yeqian Qiang, Huijie Zhu, Chunxiang Wang, Ming Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[231] arXiv:2112.02279 [pdf, other]: Title: U2-Former: A Nested U-shaped Transformer for Image Restoration

Authors: Haobo Ji, Xin Feng, Wenjie Pei, Jinxing Li, Guangming Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[232] arXiv:2112.02290 [pdf, other]: Title: Interactive Disentanglement: Learning Concepts by Interacting with their Prototype Representations

Authors: Wolfgang Stammer, Marius Memmel, Patrick Schramowski, Kristian Kersting

Comments: To be published in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[233] arXiv:2112.02297 [pdf, other]: Title: Ablation study of self-supervised learning for image classification

Authors: Ilias Papastratis

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[234] arXiv:2112.02300 [pdf, other]: Title: Unsupervised Domain Generalization by Learning a Bridge Across Domains

Authors: Sivan Harary, Eli Schwartz, Assaf Arbelle, Peter Staar, Shady Abu-Hussein, Elad Amrani, Roei Herzig, Amit Alfassy, Raja Giryes, Hilde Kuehne, Dina Katabi, Kate Saenko, Rogerio Feris, Leonid Karlinsky

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[235] arXiv:2112.02303 [pdf, other]: Title: An Annotated Video Dataset for Computing Video Memorability

Authors: Rukiye Savran Kiziltepe, Lorin Sweeney, Mihai Gabriel Constantin, Faiyaz Doctor, Alba Garcia Seco de Herrera, Claire-Helene Demarty, Graham Healy, Bogdan Ionescu, Alan F. Smeaton

Comments: 11 pages

Journal-ref: Data in Brief, Volume 39, 107671, (2021), ISSN 2352-3409

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[236] arXiv:2112.02306 [pdf, other]: Title: Toward Practical Monocular Indoor Depth Estimation

Authors: Cho-Ying Wu, Jialiang Wang, Michael Hall, Ulrich Neumann, Shuochen Su

Comments: Accepted to CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[237] arXiv:2112.02308 [pdf, other]: Title: MoFaNeRF: Morphable Facial Neural Radiance Field

Authors: Yiyu Zhuang, Hao Zhu, Xusen Sun, Xun Cao

Comments: accepted to ECCV2022; code available at this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[238] arXiv:2112.02338 [pdf, other]: Title: Generalized Binary Search Network for Highly-Efficient Multi-View Stereo

Authors: Zhenxing Mi, Di Chang, Dan Xu

Comments: 16 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[239] arXiv:2112.02340 [pdf, other]: Title: Scanpath Prediction on Information Visualisations

Authors: Yao Wang, Mihai Bâce, Andreas Bulling

Comments: 11 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[240] arXiv:2112.02353 [pdf, other]: Title: Label Hierarchy Transition: Delving into Class Hierarchies to Enhance Deep Classifiers

Authors: Renzhen Wang, De Cai, Kaiwen Xiao, Xixi Jia, Xiao Han, Deyu Meng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[241] arXiv:2112.02355 [pdf, other]: Title: SITA: Single Image Test-time Adaptation

Authors: Ansh Khurana, Sujoy Paul, Piyush Rai, Soma Biswas, Gaurav Aggarwal

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[242] arXiv:2112.02359 [pdf, other]: Title: Unsupervised Adaptation of Semantic Segmentation Models without Source Data

Authors: Sujoy Paul, Ansh Khurana, Gaurav Aggarwal

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[243] arXiv:2112.02363 [pdf, other]: Title: CAVER: Cross-Modal View-Mixed Transformer for Bi-Modal Salient Object Detection

Authors: Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

Comments: Accepted by TIP-2023. Add more details and update the weight illustration

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[244] arXiv:2112.02373 [pdf, other]: Title: 3rd Place: A Global and Local Dual Retrieval Solution to Facebook AI Image Similarity Challenge

Authors: Xinlong Sun, Yangyang Qin, Xuyuan Xu, Guoping Gong, Yang Fang, Yexin Wang

Comments: This is the 3rd place solution for Facebook Image Similarity Challenge and NIPS2021 Workshop. The current first draft version will be updated later

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[245] arXiv:2112.02379 [pdf, other]: Title: LTT-GAN: Looking Through Turbulence by Inverting GANs

Authors: Kangfu Mei, Vishal M. Patel

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[246] arXiv:2112.02399 [pdf, other]: Title: VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts

Authors: Longtian Qiu, Renrui Zhang, Ziyu Guo, Ziyao Zeng, Zilu Guo, Yafeng Li, Guangnan Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[247] arXiv:2112.02413 [pdf, other]: Title: PointCLIP: Point Cloud Understanding by CLIP

Authors: Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li

Comments: Open sourced, Code and Model Available

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[248] arXiv:2112.02416 [pdf, other]: Title: Gated2Gated: Self-Supervised Depth Estimation from Gated Images

Authors: Amanpreet Walia, Stefanie Walz, Mario Bijelic, Fahim Mannan, Frank Julca-Aguilar, Michael Langer, Werner Ritter, Felix Heide

Comments: 11 pages, 6 Figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[249] arXiv:2112.02447 [pdf, other]: Title: Next Day Wildfire Spread: A Machine Learning Data Set to Predict Wildfire Spreading from Remote-Sensing Data

Authors: Fantine Huot, R. Lily Hu, Nita Goyal, Tharun Sankar, Matthias Ihme, Yi-Fan Chen

Comments: submitted to IEEE Transactions on Geoscience and Remote Sensing

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[250] arXiv:2112.02450 [pdf, other]: Title: Adaptive Feature Interpolation for Low-Shot Image Generation

Authors: Mengyu Dai, Haibin Hang, Xiaoyang Guo

Comments: ECCV'22. Code available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[251] arXiv:2112.02459 [pdf, other]: Title: SSAGCN: Social Soft Attention Graph Convolution Network for Pedestrian Trajectory Prediction

Authors: Pei Lv, Wentong Wang, Yunxin Wang, Yuzhen Zhang, Mingliang Xu, Changsheng Xu

Comments: 14 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[252] arXiv:2112.02466 [pdf, ps, other]: Title: Pose-guided Feature Disentangling for Occluded Person Re-identification Based on Transformer

Authors: Tao Wang, Hong Liu, Pinhao Song, Tianyu Guo, Wei Shi

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[253] arXiv:2112.02469 [pdf, other]: Title: RADA: Robust Adversarial Data Augmentation for Camera Localization in Challenging Weather

Authors: Jialu Wang, Muhamad Risqi U. Saputra, Chris Xiaoxuan Lu, Niki Trigoni, Andrew Markham

Subjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[254] arXiv:2112.02475 [pdf, other]: Title: Deblurring via Stochastic Refinement

Authors: Jay Whang, Mauricio Delbracio, Hossein Talebi, Chitwan Saharia, Alexandros G. Dimakis, Peyman Milanfar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[255] arXiv:2112.02487 [pdf, other]: Title: Face Trees for Expression Recognition

Authors: Mojtaba Kolahdouzi, Alireza Sepas-Moghaddam, Ali Etemad

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[256] arXiv:2112.02494 [pdf, other]: Title: Implicit Neural Deformation for Sparse-View Face Reconstruction

Authors: Moran Li, Haibin Huang, Yi Zheng, Mengtian Li, Nong Sang, Chongyang Ma

Comments: 10 pages, 6 figures, The 30th Pacific Conference on Computer Graphics and Applications. Pacific Graphics (PG) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[257] arXiv:2112.02500 [pdf, other]: Title: MovieNet-PS: A Large-Scale Person Search Dataset in the Wild

Authors: Jie Qin, Peng Zheng, Yichao Yan, Rong Quan, Xiaogang Cheng, Bingbing Ni

Comments: ICASSP 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[258] arXiv:2112.02507 [pdf, ps, other]: Title: Adaptive Channel Encoding Transformer for Point Cloud Analysis

Authors: Guoquan Xu, Hezhi Cao, Yifan Zhang, Yanxin Ma, Jianwei Wan, Ke Xu

Comments: ICANN2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[259] arXiv:2112.02509 [pdf, ps, other]: Title: Adaptive Channel Encoding for Point Cloud Analysis

Authors: Guoquan Xu, Hezhi Cao, Yifan Zhang, Jianwei Wan, Ke Xu, Yanxin Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[260] arXiv:2112.02520 [pdf, other]: Title: Neural Photometry-guided Visual Attribute Transfer

Authors: Carlos Rodriguez-Pardo, Elena Garces

Comments: 13 pages. To be published in Transactions on Visualizations and Computer Graphics. Project website: this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[261] arXiv:2112.02523 [pdf, other]: Title: STSM: Spatio-Temporal Shift Module for Efficient Action Recognition

Authors: Zhaoqilin Yang, Gaoyun An

Comments: 9 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[262] arXiv:2112.02535 [pdf, other]: Title: End-to-End Segmentation via Patch-wise Polygons Prediction

Authors: Tal Shaharabany, Lior Wolf

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[263] arXiv:2112.02571 [pdf, other]: Title: Learning Tracking Representations via Dual-Branch Fully Transformer Networks

Authors: Fei Xie, Chunyu Wang, Guangting Wang, Wankou Yang, Wenjun Zeng

Comments: ICCV21 Workshops

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[264] arXiv:2112.02582 [pdf, other]: Title: PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation

Authors: Haobo Yuan, Xiangtai Li, Yibo Yang, Guangliang Cheng, Jing Zhang, Yunhai Tong, Lefei Zhang, Dacheng Tao

Comments: Accepted by ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[265] arXiv:2112.02597 [pdf, other]: Title: Constrained Adaptive Projection with Pretrained Features for Anomaly Detection

Authors: Xingtai Gui, Di Wu, Yang Chang, Shicai Fan

Comments: Accepted to IJCAI 2022 Main Track. This version includes 6 pages of main paper, 2 pages of Appendix

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[266] arXiv:2112.02604 [pdf, other]: Title: PSI: A Pedestrian Behavior Dataset for Socially Intelligent Autonomous Car

Authors: Tina Chen, Taotao Jing, Renran Tian, Yaobin Chen, Joshua Domeyer, Heishiro Toyoda, Rini Sherony, Zhengming Ding

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[267] arXiv:2112.02624 [pdf, other]: Title: Dynamic Token Normalization Improves Vision Transformers

Authors: Wenqi Shao, Yixiao Ge, Zhaoyang Zhang, Xuyuan Xu, Xiaogang Wang, Ying Shan, Ping Luo

Comments: Published at ICLR'22; 18 pages, 12 Tables, 9 Figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[268] arXiv:2112.02644 [pdf, other]: Title: Boosting Mobile CNN Inference through Semantic Memory

Authors: Yun Li, Chen Zhang, Shihao Han, Li Lyna Zhang, Baoqun Yin, Yunxin Liu, Mengwei Xu

Comments: 13 pages, 13 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[269] arXiv:2112.02666 [pdf, other]: Title: Learning Query Expansion over the Nearest Neighbor Graph

Authors: Benjamin Klein, Lior Wolf

Comments: BMVC 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[270] arXiv:2112.02713 [pdf, other]: Title: Joint Symmetry Detection and Shape Matching for Non-Rigid Point Cloud

Authors: Abhishek Sharma, Maks Ovsjanikov

Comments: Under Review. arXiv admin note: substantial text overlap with arXiv:2110.02994

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[271] arXiv:2112.02719 [pdf, other]: Title: A Survey on Deep learning based Document Image Enhancement

Authors: Zahra Anvari, Vassilis Athitsos

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[272] arXiv:2112.02725 [pdf, other]: Title: A hybrid convolutional neural network/active contour approach to segmenting dead trees in aerial imagery

Authors: Jacquelyn A. Shelton, Przemyslaw Polewski, Wei Yao, Marco Heurich

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[273] arXiv:2112.02729 [pdf, other]: Title: Facial Emotion Characterization and Detection using Fourier Transform and Machine Learning

Authors: Aishwarya Gouru, Shan Suthaharan

Comments: 8 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[274] arXiv:2112.02747 [pdf, other]: Title: Making a Bird AI Expert Work for You and Me

Authors: Dongliang Chang, Kaiyue Pang, Ruoyi Du, Zhanyu Ma, Yi-Zhe Song, Jun Guo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[275] arXiv:2112.02749 [pdf, other]: Title: One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning

Authors: Suzhen Wang, Lincheng Li, Yu Ding, Xin Yu

Comments: Accepted by AAAI 2022

Journal-ref: AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[276] arXiv:2112.02753 [pdf, other]: Title: MobRecon: Mobile-Friendly Hand Mesh Reconstruction from Monocular Image

Authors: Xingyu Chen, Yufeng Liu, Yajiao Dong, Xiong Zhang, Chongyang Ma, Yanmin Xiong, Yuan Zhang, Xiaoyan Guo

Journal-ref: CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[277] arXiv:2112.02763 [pdf, other]: Title: MetaCloth: Learning Unseen Tasks of Dense Fashion Landmark Detection from a Few Samples

Authors: Yuying Ge, Ruimao Zhang, Ping Luo

Comments: Accepted by IEEE Transactions on Image Processing

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[278] arXiv:2112.02772 [pdf, other]: Title: ActiveZero: Mixed Domain Learning for Active Stereovision with Zero Annotation

Authors: Isabella Liu, Edward Yang, Jianyu Tao, Rui Chen, Xiaoshuai Zhang, Qing Ran, Zhu Liu, Hao Su

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[279] arXiv:2112.02779 [pdf, other]: Title: Revisiting LiDAR Registration and Reconstruction: A Range Image Perspective

Authors: Wei Dong, Kwonyoung Ryu, Michael Kaess, Jaesik Park

Comments: 14 pages, 9 figures. This paper is under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[280] arXiv:2112.02781 [pdf, other]: Title: Adjusting the Ground Truth Annotations for Connectivity-Based Learning to Delineate

Authors: Doruk Oner, Leonardo Citraro, Mateusz Koziński, Pascal Fua

Journal-ref: IEEE Transactions on Medical Imaging (Volume: 41, Issue: 12, December 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[281] arXiv:2112.02788 [pdf, other]: Title: Texture Reformer: Towards Fast and Universal Interactive Texture Transfer

Authors: Zhizhong Wang, Lei Zhao, Haibo Chen, Ailin Li, Zhiwen Zuo, Wei Xing, Dongming Lu

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[282] arXiv:2112.02789 [pdf, other]: Title: HumanNeRF: Efficiently Generated Human Radiance Field from Sparse Inputs

Authors: Fuqiang Zhao, Wei Yang, Jiakai Zhang, Pei Lin, Yingliang Zhang, Jingyi Yu, Lan Xu

Comments: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[283] arXiv:2112.02805 [pdf, other]: Title: Forward Compatible Training for Large-Scale Embedding Retrieval Systems

Authors: Vivek Ramanujan, Pavan Kumar Anasosalu Vasu, Ali Farhadi, Oncel Tuzel, Hadi Pouransari

Comments: 14 pages with appendix. In proceedings at the conference on Computer Vision and Pattern Recognition 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[284] arXiv:2112.02814 [pdf, other]: Title: A Survey of Deep Learning for Low-Shot Object Detection

Authors: Qihan Huang, Haofei Zhang, Mengqi Xue, Jie Song, Mingli Song

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[285] arXiv:2112.02815 [pdf, other]: Title: Make It Move: Controllable Image-to-Video Generation with Text Descriptions

Authors: Yaosi Hu, Chong Luo, Zhenzhong Chen

Comments: Accepted by CVPR'2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[286] arXiv:2112.02824 [pdf, ps, other]: Title: Letter-level Online Writer Identification

Authors: Zelin Chen, Hong-Xing Yu, Ancong Wu, Wei-Shi Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[287] arXiv:2112.02825 [pdf, other]: Title: Clue Me In: Semi-Supervised FGVC with Out-of-Distribution Data

Authors: Ruoyi Du, Dongliang Chang, Zhanyu Ma, Yi-Zhe Song, Jun Guo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[288] arXiv:2112.02828 [pdf, other]: Title: PP-MSVSR: Multi-Stage Video Super-Resolution

Authors: Lielin Jiang, Na Wang, Qingqing Dang, Rui Liu, Baohua Lai

Comments: 8 pages, 6 figures, 3 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[289] arXiv:2112.02829 [pdf, other]: Title: SyntEO: Synthetic Data Set Generation for Earth Observation and Deep Learning -- Demonstrated for Offshore Wind Farm Detection

Authors: Thorsten Hoeser, Claudia Kuenzer

Comments: 29 pages, 13 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[290] arXiv:2112.02834 [pdf, other]: Title: A Generalized Zero-Shot Quantization of Deep Convolutional Neural Networks via Learned Weights Statistics

Authors: Prasen Kumar Sharma, Arun Abraham, Vikram Nelvoy Rajendiran

Comments: Accepted by IEEE Transactions on Multimedia

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[291] arXiv:2112.02838 [pdf, other]: Title: Visual Object Tracking with Discriminative Filters and Siamese Networks: A Survey and Outlook

Authors: Sajid Javed, Martin Danelljan, Fahad Shahbaz Khan, Muhammad Haris Khan, Michael Felsberg, Jiri Matas

Comments: Tracking Survey

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[292] arXiv:2112.02841 [pdf, other]: Title: GETAM: Gradient-weighted Element-wise Transformer Attention Map for Weakly-supervised Semantic segmentation

Authors: Weixuan Sun, Jing Zhang, Zheyuan Liu, Yiran Zhong, Nick Barnes

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[293] arXiv:2112.02851 [pdf, other]: Title: No-Reference Point Cloud Quality Assessment via Domain Adaptation

Authors: Qi Yang, Yipeng Liu, Siheng Chen, Yiling Xu, Jun Sun

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[294] arXiv:2112.02853 [pdf, other]: Title: Reliable Propagation-Correction Modulation for Video Object Segmentation

Authors: Xiaohao Xu, Jinglu Wang, Xiao Li, Yan Lu

Comments: 13 pages, 8 figures, AAAI 2022 Accepted

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[295] arXiv:2112.02857 [pdf, other]: Title: PTTR: Relational 3D Point Cloud Object Tracking with Transformer

Authors: Changqing Zhou, Zhipeng Luo, Yueru Luo, Tianrui Liu, Liang Pan, Zhongang Cai, Haiyu Zhao, Shijian Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[296] arXiv:2112.02862 [pdf, other]: Title: SelectAugment: Hierarchical Deterministic Sample Selection for Data Augmentation

Authors: Shiqi Lin, Zhizheng Zhang, Xin Li, Wenjun Zeng, Zhibo Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[297] arXiv:2112.02869 [pdf, ps, other]: Title: Physics Driven Deep Retinex Fusion for Adaptive Infrared and Visible Image Fusion

Authors: Yuanjie Gu, Zhibo Xiao, Yinghan Guan, Haoran Dai, Cheng Liu, Liang Xue, Shouyu Wang

Comments: 20 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[298] arXiv:2112.02889 [pdf, other]: Title: Joint Learning of Localized Representations from Medical Images and Reports

Authors: Philip Müller, Georgios Kaissis, Congyu Zou, Daniel Rueckert

Comments: Accepted at ECCV 2022

Journal-ref: Computer Vision - ECCV 2022, pp. 685-701

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[299] arXiv:2112.02891 [pdf, other]: Title: Seeing Objects in dark with Continual Contrastive Learning

Authors: Ujjal Kr Dutta

Comments: Accepted in European Conference on Computer Vision (ECCV) 2022 Workshops: IWDSC

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[300] arXiv:2112.02902 [pdf, other]: Title: Interpretable Image Classification with Differentiable Prototypes Assignment

Authors: Dawid Rymarczyk, Łukasz Struski, Michał Górszczak, Koryna Lewandowska, Jacek Tabor, Bartosz Zieliński

Comments: Accepted to ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[301] arXiv:2112.02906 [pdf, other]: Title: ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction

Authors: Xiaoming Zhao, Xingming Wu, Jinyu Miao, Weihai Chen, Peter C. Y. Chen, Zhengguo Li

Comments: 11 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[302] arXiv:2112.02910 [pdf, other]: Title: A Tale of Color Variants: Representation and Self-Supervised Learning in Fashion E-Commerce

Authors: Ujjal Kr Dutta, Sandeep Repakula, Maulik Parmar, Abhinav Ravi

Comments: In Annual Conference on Innovative Applications of Artificial Intelligence (IAAI)/ AAAI Conference on Artificial Intelligence (AAAI) 2022. arXiv admin note: substantial text overlap with arXiv:2104.08581

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[303] arXiv:2112.02922 [pdf, other]: Title: Anomaly Detection in IR Images of PV Modules using Supervised Contrastive Learning

Authors: Lukas Bommes, Mathis Hoffmann, Claudia Buerhop-Lutz, Tobias Pickel, Jens Hauch, Christoph Brabec, Andreas Maier, Ian Marius Peters

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[304] arXiv:2112.02953 [pdf, ps, other]: Title: The artificial synesthete: Image-melody translations with variational autoencoders

Authors: Karl Wienand, Wolfgang M. Heckl

Comments: 7 pages, 4 figures, supplementary media can be downloaded at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[305] arXiv:2112.02990 [pdf, other]: Title: 4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding

Authors: Yujin Chen, Matthias Nießner, Angela Dai

Comments: Accepted by ECCV 2022, Video: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[306] arXiv:2112.02991 [pdf, other]: Title: Cross-Modality Attentive Feature Fusion for Object Detection in Multispectral Remote Sensing Imagery

Authors: Qingyun Fang, Zhaokui Wang

Comments: 23 pages, 11 figures, under consideration at Pattern Recognition

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[307] arXiv:2112.03020 [pdf, other]: Title: Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning

Authors: Wenjie Shi, Gao Huang, Shiji Song, Cheng Wu

Comments: Accepted as a Regular Paper in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[308] arXiv:2112.03044 [pdf, other]: Title: Fusion Detection via Distance-Decay IoU and weighted Dempster-Shafer Evidence Theory

Authors: Fang Qingyun, Wang Zhaokui

Comments: 18 pages, 7 pages, under consideration at Journal of Aerospace Information Systems

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[309] arXiv:2112.03045 [pdf, other]: Title: 3D Hierarchical Refinement and Augmentation for Unsupervised Learning of Depth and Pose from Monocular Video

Authors: Guangming Wang, Jiquan Zhong, Shijie Zhao, Wenhua Wu, Zhe Liu, Hesheng Wang

Comments: 10 pages, 7 figures, under review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[310] arXiv:2112.03051 [pdf, other]: Title: Controllable Animation of Fluid Elements in Still Images

Authors: Aniruddha Mahapatra, Kuldeep Kulkarni

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[311] arXiv:2112.03109 [pdf, other]: Title: General Facial Representation Learning in a Visual-Linguistic Manner

Authors: Yinglin Zheng, Hao Yang, Ting Zhang, Jianmin Bao, Dongdong Chen, Yangyu Huang, Lu Yuan, Dong Chen, Ming Zeng, Fang Wen

Comments: CVPR2022 Oral; 16 pages, 6 figures, 14 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[312] arXiv:2112.03111 [pdf, ps, other]: Title: Ethics and Creativity in Computer Vision

Authors: Negar Rostamzadeh, Emily Denton, Linda Petrini

Comments: Neural Information Processing Systems (NeurIPS) 2021 workshop on Machine Learning for Creativity and Design

Journal-ref: NeurIPS 2021 workshop on Machine Learning for Creativity and Design

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[313] arXiv:2112.03126 [pdf, other]: Title: Label-Efficient Semantic Segmentation with Diffusion Models

Authors: Dmitry Baranchuk, Ivan Rubachev, Andrey Voynov, Valentin Khrulkov, Artem Babenko

Comments: ICLR'2022; v3: camera ready

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[314] arXiv:2112.03145 [pdf, other]: Title: Diffusion Models for Implicit Image Segmentation Ensembles

Authors: Julia Wolleb, Robin Sandkühler, Florentin Bieder, Philippe Valmaggia, Philippe C. Cattin

Comments: In this version, we updated the results section with more detailed evaluations

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[315] arXiv:2112.03162 [pdf, other]: Title: Embedding Arithmetic of Multimodal Queries for Image Retrieval

Authors: Guillaume Couairon, Matthieu Cord, Matthijs Douze, Holger Schwenk

Comments: accepted at O-DRUM (CVPR workshop 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[316] arXiv:2112.03163 [pdf, other]: Title: Encouraging Disentangled and Convex Representation with Controllable Interpolation Regularization

Authors: Yunhao Ge, Zhi Xu, Yao Xiao, Gan Xin, Yunkui Pang, Laurent Itti

Comments: 17 pages, 19 figures (including appendix)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[317] arXiv:2112.03184 [pdf, other]: Title: HIVE: Evaluating the Human Interpretability of Visual Explanations

Authors: Sunnie S. Y. Kim, Nicole Meister, Vikram V. Ramaswamy, Ruth Fong, Olga Russakovsky

Comments: ECCV 2022. Code and supplementary material are at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[318] arXiv:2112.03185 [pdf, other]: Title: Semantic Segmentation In-the-Wild Without Seeing Any Segmentation Examples

Authors: Nir Zabari, Yedid Hoshen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319] arXiv:2112.03205 [pdf, other]: Title: Simultaneously Predicting Multiple Plant Traits from Multiple Sensors via Deformable CNN Regression

Authors: Pranav Raja, Alex Olenskyj, Hamid Kamangir, Mason Earles

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[320] arXiv:2112.03221 [pdf, other]: Title: Text2Mesh: Text-Driven Neural Stylization for Meshes

Authors: Oscar Michel, Roi Bar-On, Richard Liu, Sagie Benaim, Rana Hanocka

Comments: project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Graphics (cs.GR)
[321] arXiv:2112.03223 [pdf, other]: Title: Context-Aware Transfer Attacks for Object Detection

Authors: Zikui Cai, Xinxin Xie, Shasha Li, Mingjun Yin, Chengyu Song, Srikanth V. Krishnamurthy, Amit K. Roy-Chowdhury, M. Salman Asif

Comments: accepted to AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[322] arXiv:2112.03237 [pdf, other]: Title: From Coarse to Fine-grained Concept based Discrimination for Phrase Detection

Authors: Maan Qraitem, Bryan A. Plummer

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[323] arXiv:2112.03241 [pdf, other]: Title: Unsupervised Domain Adaptation for Semantic Image Segmentation: a Comprehensive Survey

Authors: Gabriela Csurka, Riccardo Volpi, Boris Chidlovskii

Comments: 33 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[324] arXiv:2112.03243 [pdf, other]: Title: Input-level Inductive Biases for 3D Reconstruction

Authors: Wang Yifan, Carl Doersch, Relja Arandjelović, João Carreira, Andrew Zisserman

Comments: CVPR 2022, including supplemental material

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[325] arXiv:2112.03252 [pdf, other]: Title: CSG0: Continual Urban Scene Generation with Zero Forgetting

Authors: Himalaya Jain, Tuan-Hung Vu, Patrick Pérez, Matthieu Cord

Comments: Published at the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022 Workshop on Continual Learning

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[326] arXiv:2112.03258 [pdf, other]: Title: DoodleFormer: Creative Sketch Drawing with Transformers

Authors: Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Jorma Laaksonen, Michael Felsberg

Comments: Accepted to ECCV-2022. Project webpage: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[327] arXiv:2112.03288 [pdf, other]: Title: Dense Depth Priors for Neural Radiance Fields from Sparse Input Views

Authors: Barbara Roessle, Jonathan T. Barron, Ben Mildenhall, Pratul P. Srinivasan, Matthias Nießner

Comments: CVPR 2022, project page: this https URL, video: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[328] arXiv:2112.03325 [pdf, other]: Title: Self-Supervised Camera Self-Calibration from Video

Authors: Jiading Fang, Igor Vasiljevic, Vitor Guizilini, Rares Ambrus, Greg Shakhnarovich, Adrien Gaidon, Matthew R. Walter

Comments: The project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[329] arXiv:2112.03328 [pdf, other]: Title: Learning Connectivity with Graph Convolutional Networks for Skeleton-based Action Recognition

Authors: Hichem Sahbi

Comments: arXiv admin note: text overlap with arXiv:2104.04255, arXiv:2104.05482

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[330] arXiv:2112.03340 [pdf, other]: Title: Label Hallucination for Few-Shot Classification

Authors: Yiren Jian, Lorenzo Torresani

Comments: Accepted by AAAI 2022. Code is available: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[331] arXiv:2112.03415 [pdf, other]: Title: Producing augmentation-invariant embeddings from real-life imagery

Authors: Sergio Manuel Papadakis, Sanjay Addicam

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[332] arXiv:2112.03423 [pdf, other]: Title: Hybrid SNN-ANN: Energy-Efficient Classification and Object Detection for Event-Based Vision

Authors: Alexander Kugele, Thomas Pfeil, Michael Pfeiffer, Elisabetta Chicca

Comments: Accepted at DAGM German Conference on Pattern Recognition (GCPR 2021)

Journal-ref: Pattern Recognition. DAGM GCPR 2021. Lecture Notes in Computer Science, vol 13024. Springer, Cham., pp. 297-312

Subjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[333] arXiv:2112.03424 [pdf, other]: Title: Learning to Solve Hard Minimal Problems

Authors: Petr Hruby, Timothy Duff, Anton Leykin, Tomas Pajdla

Comments: 24 pages total: 14 pages main paper and 10 pages supplementary

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[334] arXiv:2112.03444 [pdf, other]: Title: GPU-Based Homotopy Continuation for Minimal Problems in Computer Vision

Authors: Chiang-Heng Chien, Hongyi Fan, Ahmad Abdelfattah, Elias Tsigaridas, Stanimire Tomov, Benjamin Kimia

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[335] arXiv:2112.03451 [pdf, other]: Title: Deep Level Set for Box-supervised Instance Segmentation in Aerial Images

Authors: Wentong Li, Yijie Chen, Wenyu Liu, Jianke Zhu

Comments: 10 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[336] arXiv:2112.03471 [pdf, other]: Title: Voxelized 3D Feature Aggregation for Multiview Detection

Authors: Jiahao Ma, Jinguang Tong, Shan Wang, Wei Zhao, Zicheng Duan, Chuong Nguyen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[337] arXiv:2112.03485 [pdf, other]: Title: VizExtract: Automatic Relation Extraction from Data Visualizations

Authors: Dale Decatur, Sanjay Krishnan

Comments: 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[338] arXiv:2112.03492 [pdf, other]: Title: Decision-based Black-box Attack Against Vision Transformers via Patch-wise Adversarial Removal

Authors: Yucheng Shi, Yahong Han, Yu-an Tan, Xiaohui Kuang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[339] arXiv:2112.03494 [pdf, other]: Title: Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning

Authors: Rongkai Ma, Pengfei Fang, Gil Avraham, Yan Zuo, Tianyu Zhu, Tom Drummond, Mehrtash Harandi

Comments: ECCV2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[340] arXiv:2112.03517 [pdf, other]: Title: CG-NeRF: Conditional Generative Neural Radiance Fields

Authors: Kyungmin Jo, Gyumin Shim, Sanghun Jung, Soyoung Yang, Jaegul Choo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[341] arXiv:2112.03530 [pdf, other]: Title: A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion

Authors: Zhaoyang Lyu, Zhifeng Kong, Xudong Xu, Liang Pan, Dahua Lin

Comments: Accepted to ICLR 2022. Code is released at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[342] arXiv:2112.03549 [pdf, other]: Title: GaTector: A Unified Framework for Gaze Object Prediction

Authors: Binglu Wang, Tao Hu, Baoshan Li, Xiaojuan Chen, Zhijie Zhang

Comments: CVPR 2022, camera ready

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[343] arXiv:2112.03552 [pdf, other]: Title: Bootstrapping ViTs: Towards Liberating Vision Transformers from Pre-training

Authors: Haofei Zhang, Jiarui Duan, Mengqi Xue, Jie Song, Li Sun, Mingli Song

Comments: Accepted as a conference paper by CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[344] arXiv:2112.03553 [pdf, other]: Title: ADD: Frequency Attention and Multi-View based Knowledge Distillation to Detect Low-Quality Compressed Deepfake Images

Authors: Binh M. Le, Simon S. Woo

Journal-ref: Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[345] arXiv:2112.03562 [pdf, other]: Title: CMA-CLIP: Cross-Modality Attention CLIP for Image-Text Classification

Authors: Huidong Liu, Shaoyuan Xu, Jinmiao Fu, Yang Liu, Ning Xie, Chien-Chih Wang, Bryan Wang, Yi Sun

Comments: 9 pages, 2 figures, 6 tables, 1 algorithm

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[346] arXiv:2112.03568 [pdf, other]: Title: Unsupervised Learning of Compositional Scene Representations from Multiple Unspecified Viewpoints

Authors: Jinyang Yuan, Bin Li, Xiangyang Xue

Comments: AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[347] arXiv:2112.03587 [pdf, other]: Title: TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning

Authors: Yang Liu, Keze Wang, Lingbo Liu, Haoyuan Lan, Liang Lin

Comments: This work has been published in IEEE Transactions on Image Processing. The code is publicly available at this https URL. arXiv admin note: substantial text overlap with arXiv:2101.00820

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[348] arXiv:2112.03590 [pdf, other]: Title: Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition

Authors: Tianyu Guo, Hong Liu, Zhan Chen, Mengyuan Liu, Tao Wang, Runwei Ding

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[349] arXiv:2112.03592 [pdf, other]: Title: Parallel Discrete Convolutions on Adaptive Particle Representations of Images

Authors: Joel Jonsson, Bevan L. Cheeseman, Suryanarayana Maddu, Krzysztof Gonciarz, Ivo F. Sbalzarini

Comments: 18 pages, 13 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF); Image and Video Processing (eess.IV)
[350] arXiv:2112.03596 [pdf, other]: Title: E$^2$(GO)MOTION: Motion Augmented Event Stream for Egocentric Action Recognition

Authors: Chiara Plizzari, Mirco Planamente, Gabriele Goletto, Marco Cannici, Emanuele Gusso, Matteo Matteucci, Barbara Caputo

Comments: To be presented at CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[351] arXiv:2112.03603 [pdf, other]: Title: Handwritten Mathematical Expression Recognition via Attention Aggregation based Bi-directional Mutual Learning

Authors: Xiaohang Bian, Bo Qin, Xiaozhe Xin, Jianwu Li, Xuefeng Su, Yanfeng Wang

Comments: 9 pages, 5 figures, accepted as an oral presentation at AAAI 2022

Journal-ref: AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[352] arXiv:2112.03612 [pdf, other]: Title: DCAN: Improving Temporal Action Detection via Dual Context Aggregation

Authors: Guo Chen, Yin-Dong Zheng, Limin Wang, Tong Lu

Comments: AAAI 2022 camera ready version

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353] arXiv:2112.03615 [pdf, other]: Title: Saliency Diversified Deep Ensemble for Robustness to Adversaries

Authors: Alex Bogun, Dimche Kostadinov, Damian Borth

Comments: Accepted to AAAI Workshop on Adversarial Machine Learning and Beyond 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[354] arXiv:2112.03624 [pdf, other]: Title: Time-Equivariant Contrastive Video Representation Learning

Authors: Simon Jenni, Hailin Jin

Comments: ICCV 2021 (oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[355] arXiv:2112.03631 [pdf, other]: Title: SSAT: A Symmetric Semantic-Aware Transformer Network for Makeup Transfer and Removal

Authors: Zhaoyang Sun, Yaxiong Chen, Shengwu Xiong

Comments: Accepted to AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[356] arXiv:2112.03632 [pdf, other]: Title: Generation of Non-Deterministic Synthetic Face Datasets Guided by Identity Priors

Authors: Marcel Grimmer, Haoyu Zhang, Raghavendra Ramachandra, Kiran Raja, Christoph Busch

Journal-ref: https://www.ntnu.edu/nikt2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[357] arXiv:2112.03641 [pdf, other]: Title: Gram-SLD: Automatic Self-labeling and Detection for Instance Objects

Authors: Rui Wang, Chengtun Wu, Jiawen Xin, Liang Zhang

Comments: 37 pages with 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[358] arXiv:2112.03649 [pdf, other]: Title: Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection

Authors: Shoubin Yu, Zhongyin Zhao, Haoshu Fang, Andong Deng, Haisheng Su, Dongliang Wang, Weihao Gan, Cewu Lu, Wei Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[359] arXiv:2112.03650 [pdf, other]: Title: Activation to Saliency: Forming High-Quality Labels for Completely Unsupervised Salient Object Detection

Authors: Huajun Zhou, Peijia Chen, Lingxiao Yang, Jianhuang Lai, Xiaohua Xie

Comments: 11 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[360] arXiv:2112.03690 [pdf, other]: Title: Low-rank Tensor Decomposition for Compression of Convolutional Neural Networks Using Funnel Regularization

Authors: Bo-Shiuan Chu, Che-Rung Lee

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)
[361] arXiv:2112.03728 [pdf, other]: Title: Flexible Networks for Learning Physical Dynamics of Deformable Objects

Authors: Jinhyung Park, DoHae Lee, In-Kwon Lee

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[362] arXiv:2112.03731 [pdf, other]: Title: SalFBNet: Learning Pseudo-Saliency Distribution via Feedback Convolutional Networks

Authors: Guanqun Ding, Nevrez Imamoglu, Ali Caglayan, Masahiro Murakawa, Ryosuke Nakamura

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[363] arXiv:2112.03736 [pdf, other]: Title: Gaussian map predictions for 3D surface feature localisation and counting

Authors: Justin Le Louëdec, Grzegorz Cielniak

Comments: BMVC 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[364] arXiv:2112.03740 [pdf, other]: Title: Dilated convolution with learnable spacings

Authors: Ismail Khalfaoui-Hassani, Thomas Pellegrini, Timothée Masquelier

Comments: Published in The Eleventh International Conference on Learning Representations (ICLR) 2023. (this https URL)

Journal-ref: The Eleventh International Conference on Learning Representations ICLR 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[365] arXiv:2112.03750 [pdf, other]: Title: Wild ToFu: Improving Range and Quality of Indirect Time-of-Flight Depth with RGB Fusion in Challenging Environments

Authors: HyunJun Jung, Nikolas Brasch, Ales Leonardis, Nassir Navab, Benjamin Busam

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[366] arXiv:2112.03777 [pdf, other]: Title: Variance-Aware Weight Initialization for Point Convolutional Neural Networks

Authors: Pedro Hermosilla, Michael Schelling, Tobias Ritschel, Timo Ropinski

Comments: Accepted at ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[367] arXiv:2112.03803 [pdf, other]: Title: Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning

Authors: Manlin Zhang, Jinpeng Wang, Andy J. Ma

Comments: AAAI2022. v2: Add supplementary

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[368] arXiv:2112.03810 [pdf, other]: Title: Polarimetric Pose Prediction

Authors: Daoyi Gao, Yitong Li, Patrick Ruhkamp, Iuliia Skobleva, Magdalena Wysock, HyunJun Jung, Pengyuan Wang, Arturo Guridi, Benjamin Busam

Comments: Accepted at ECCV 2022; 25 pages (14 main paper + References + 7 Appendix)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[369] arXiv:2112.03814 [pdf, other]: Title: A Contrastive Distillation Approach for Incremental Semantic Segmentation in Aerial Images

Authors: Edoardo Arnaudo, Fabio Cermelli, Antonio Tavera, Claudio Rossi, Barbara Caputo

Comments: 12 pages, ICIAP 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[370] arXiv:2112.03842 [pdf, other]: Title: A Survey on Intrinsic Images: Delving Deep Into Lambert and Beyond

Authors: Elena Garces, Carlos Rodriguez-Pardo, Dan Casas, Jorge Lopez-Moreno

Comments: Accepted at International Journal of Computer Vision (to appear in 2022) this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[371] arXiv:2112.03857 [pdf, other]: Title: Grounded Language-Image Pre-training

Authors: Liunian Harold Li, Pengchuan Zhang, Haotian Zhang, Jianwei Yang, Chunyuan Li, Yiwu Zhong, Lijuan Wang, Lu Yuan, Lei Zhang, Jenq-Neng Hwang, Kai-Wei Chang, Jianfeng Gao

Comments: CVPR 2022; updated visualizations; fixed hyper-parameters in Appendix C.1

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[372] arXiv:2112.03860 [pdf, other]: Title: Differentiable Gaussianization Layers for Inverse Problems Regularized by Deep Generative Models

Authors: Dongzhuo Li

Comments: ICLR 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[373] arXiv:2112.03902 [pdf, other]: Title: MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection

Authors: Rui Dai, Srijan Das, Kumara Kahatapitiya, Michael S. Ryoo, Francois Bremond

Comments: Accepted in CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[374] arXiv:2112.03905 [pdf, other]: Title: ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints

Authors: Srijan Das, Michael S. Ryoo

Comments: 13 pages. Code and models will be updated soon

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[375] arXiv:2112.03906 [pdf, other]: Title: Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning

Authors: Srijan Das, Michael S. Ryoo

Comments: Accepted at MVA 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[376] arXiv:2112.03907 [pdf, other]: Title: Ref-NeRF: Structured View-Dependent Appearance for Neural Radiance Fields

Authors: Dor Verbin, Peter Hedman, Ben Mildenhall, Todd Zickler, Jonathan T. Barron, Pratul P. Srinivasan

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[377] arXiv:2112.03909 [pdf, other]: Title: Vehicle trajectory prediction works, but not everywhere

Authors: Mohammadhossein Bahari, Saeed Saadatnejad, Ahmad Rahimi, Mohammad Shaverdikondori, Amir-Hossein Shahidzadeh, Seyed-Mohsen Moosavi-Dezfooli, Alexandre Alahi

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[378] arXiv:2112.03917 [pdf, other]: Title: Scalable 3D Semantic Segmentation for Gun Detection in CT Scans

Authors: Marius Memmel, Christoph Reich, Nicolas Wagner, Faraz Saeedan

Comments: This work was part of the Project Lab Deep Learning in Computer Vision Winter Semester 2019/2020 at TU Darmstadt

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[379] arXiv:2112.03951 [pdf, other]: Title: Few-Shot Image Classification Along Sparse Graphs

Authors: Joseph F Comer, Philip L Jacobson, Heiko Hoffmann

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[380] arXiv:2112.04011 [pdf, other]: Title: Auxiliary Learning for Self-Supervised Video Representation via Similarity-based Knowledge Distillation

Authors: Amirhossein Dadashzadeh, Alan Whone, Majid Mirmehdi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[381] arXiv:2112.04016 [pdf, other]: Title: DeepFace-EMD: Re-ranking Using Patch-wise Earth Mover's Distance Improves Out-Of-Distribution Face Identification

Authors: Hai Phan, Anh Nguyen

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[382] arXiv:2112.04021 [pdf, other]: Title: A Robust Completed Local Binary Pattern (RCLBP) for Surface Defect Detection

Authors: Nana Kankam Gyimah, Abenezer Girma, Mahmoud Nabil Mahmoud, Shamila Nateghi, Abdollah Homaifar, Daniel Opoku

Comments: Accepted to IEEE SMC 2021 as a special invited session paper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[383] arXiv:2112.04033 [pdf, other]: Title: Image classifiers can not be made robust to small perturbations

Authors: Zheng Dai, David K. Gifford

Comments: 8 pages, 2 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[384] arXiv:2112.04038 [pdf, ps, other]: Title: Presentation Attack Detection Methods based on Gaze Tracking and Pupil Dynamic: A Comprehensive Survey

Authors: Jalil Nourmohammadi Khiarak

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[385] arXiv:2112.04042 [pdf, ps, other]: Title: Vision-Cloud Data Fusion for ADAS: A Lane Change Prediction Case Study

Authors: Yongkang Liu, Ziran Wang, Kyungtae Han, Zhenyu Shou, Prashant Tiwari, John H.L. Hansen

Comments: Published in IEEE Transactions on Intelligent Vehicles

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[386] arXiv:2112.04054 [pdf, other]: Title: GreenPCO: An Unsupervised Lightweight Point Cloud Odometry Method

Authors: Pranav Kadam, Min Zhang, Jiahao Gu, Shan Liu, C.-C. Jay Kuo

Comments: 10 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[387] arXiv:2112.04107 [pdf, other]: Title: Fully Context-Aware Image Inpainting with a Learned Semantic Pyramid

Authors: Wendong Zhang, Yunbo Wang, Bingbing Ni, Xiaokang Yang

Comments: Accepted by Pattern Recognition, 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[388] arXiv:2112.04108 [pdf, other]: Title: Fully Attentional Network for Semantic Segmentation

Authors: Qi Song, Jie Li, Chenghong Li, Hao Guo, Rui Huang

Comments: Accepted by AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[389] arXiv:2112.04120 [pdf, other]: Title: Feature Statistics Mixing Regularization for Generative Adversarial Networks

Authors: Junho Kim, Yunjey Choi, Youngjung Uh

Comments: Accepted to CVPR 2022. Our code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[390] arXiv:2112.04138 [pdf, other]: Title: Contrastive Instruction-Trajectory Learning for Vision-Language Navigation

Authors: Xiwen Liang, Fengda Zhu, Yi Zhu, Bingqian Lin, Bing Wang, Xiaodan Liang

Comments: Accepted by AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[391] arXiv:2112.04148 [pdf, other]: Title: Neural Points: Point Cloud Representation with Neural Fields for Arbitrary Upsampling

Authors: Wanquan Feng, Jin Li, Hongrui Cai, Xiaonan Luo, Juyong Zhang

Comments: Accepted to CVPR2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392] arXiv:2112.04150 [pdf, other]: Title: BA-Net: Bridge Attention for Deep Convolutional Neural Networks

Authors: Yue Zhao, Junzhou Chen, Zirui Zhang, Ronghui Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[393] arXiv:2112.04154 [pdf, other]: Title: SNEAK: Synonymous Sentences-Aware Adversarial Attack on Natural Language Video Localization

Authors: Wenbo Gou, Wen Shi, Jian Lou, Lijie Huang, Pan Zhou, Ruixuan Li

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[394] arXiv:2112.04159 [pdf, other]: Title: Garment4D: Garment Reconstruction from Point Cloud Sequences

Authors: Fangzhou Hong, Liang Pan, Zhongang Cai, Ziwei Liu

Comments: Accepted to NeurIPS 2021. Project page: this https URL. Code is available: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[395] arXiv:2112.04162 [pdf, other]: Title: Symmetry Perception by Deep Networks: Inadequacy of Feed-Forward Architectures and Improvements with Recurrent Connections

Authors: Shobhita Sundaram, Darius Sinha, Matthew Groth, Tomotake Sasaki, Xavier Boix

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[396] arXiv:2112.04163 [pdf, other]: Title: Assessing a Single Image in Reference-Guided Image Synthesis

Authors: Jiayi Guo, Chaoqun Du, Jiangshan Wang, Huijuan Huang, Pengfei Wan, Gao Huang

Comments: Accepted by AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[397] arXiv:2112.04165 [pdf, other]: Title: Shortest Paths in Graphs with Matrix-Valued Edges: Concepts, Algorithm and Application to 3D Multi-Shape Analysis

Authors: Viktoria Ehm, Daniel Cremers, Florian Bernard

Comments: published at 3DV

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Optimization and Control (math.OC)
[398] arXiv:2112.04174 [pdf, other]: Title: Boosting Contrastive Learning with Relation Knowledge Distillation

Authors: Kai Zheng, Yuanjiang Wang, Ye Yuan

Comments: Accepted by AAAI-2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399] arXiv:2112.04177 [pdf, other]: Title: VISOLO: Grid-Based Space-Time Aggregation for Efficient Online Video Instance Segmentation

Authors: Su Ho Han, Sukjun Hwang, Seoung Wug Oh, Yeonchool Park, Hyunwoo Kim, Min-Jung Kim, Seon Joo Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[400] arXiv:2112.04178 [pdf, other]: Title: Topology-aware Convolutional Neural Network for Efficient Skeleton-based Action Recognition

Authors: Kailin Xu, Fanfan Ye, Qiaoyong Zhong, Di Xie

Comments: Accepted by AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[401] arXiv:2112.04182 [pdf, other]: Title: Unimodal Face Classification with Multimodal Training

Authors: Wenbin Teng, Chongyang Bai

Comments: Accepted by IEEE International Conference On Automatic Face and Gesture Recognition 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402] arXiv:2112.04185 [pdf, other]: Title: Transformaly -- Two (Feature Spaces) Are Better Than One

Authors: Matan Jacob Cohen, Shai Avidan

Comments: CVPR Workshop, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[403] arXiv:2112.04189 [pdf, other]: Title: Transformer-Based Approach for Joint Handwriting and Named Entity Recognition in Historical documents

Authors: Ahmed Cheikh Rouhoua, Marwa Dhiaf, Yousri Kessentini, Sinda Ben Salem

Journal-ref: Pattern Recognition Letters, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[404] arXiv:2112.04203 [pdf, other]: Title: Adversarial Parametric Pose Prior

Authors: Andrey Davydov, Anastasia Remizova, Victor Constantin, Sina Honari, Mathieu Salzmann, Pascal Fua

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[405] arXiv:2112.04212 [pdf, other]: Title: Do Pedestrians Pay Attention? Eye Contact Detection in the Wild

Authors: Younes Belkada, Lorenzo Bertoni, Romain Caristan, Taylor Mordan, Alexandre Alahi

Comments: Project website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[406] arXiv:2112.04215 [pdf, other]: Title: Self-Supervised Models are Continual Learners

Authors: Enrico Fini, Victor G. Turrisi da Costa, Xavier Alameda-Pineda, Elisa Ricci, Karteek Alahari, Julien Mairal

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[407] arXiv:2112.04222 [pdf, other]: Title: Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs

Authors: Kaifeng Gao, Long Chen, Yulei Niu, Jian Shao, Jun Xiao

Comments: Accepted by CVPR 2022. Code is available at this https URL. We also won 1st place in the Video Relation Understanding (VRU) Grand Challenge at ACM Multimedia 2021 with a simplified version of our model. (The code for object tracklet generation is available at this https URL)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[408] arXiv:2112.04223 [pdf, other]: Title: Progressive Multi-stage Interactive Training in Mobile Network for Fine-grained Recognition

Authors: Zhenxin Wu, Qingliang Chen, Yifeng Liu, Yinqi Zhang, Chengkai Zhu, Yang Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[409] arXiv:2112.04228 [pdf, other]: Title: SimulSLT: End-to-End Simultaneous Sign Language Translation

Authors: Aoxiong Yin, Zhou Zhao, Jinglin Liu, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He

Comments: Accepted by ACM Multimedia 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[410] arXiv:2112.04255 [pdf, other]: Title: Feature matching for multi-epoch historical aerial images

Authors: Lulin Zhang, Ewelina Rupnik, Marc Pierrot-Deseilligny

Comments: 34 pages

Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing, 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[411] arXiv:2112.04278 [pdf, other]: Title: DMRVisNet: Deep Multi-head Regression Network for Pixel-wise Visibility Estimation Under Foggy Weather

Authors: Jing You, Shaocheng Jia, Xin Pei, Danya Yao

Comments: 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[412] arXiv:2112.04283 [pdf, other]: Title: Adverse Weather Image Translation with Asymmetric and Uncertainty-aware GAN

Authors: Jeong-gi Kwak, Youngsaeng Jin, Yuanming Li, Dongsik Yoon, Donghyeon Kim, Hanseok Ko

Comments: BMVC 2021. Code is available here: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[413] arXiv:2112.04294 [pdf, other]: Title: A Hierarchical Spatio-Temporal Graph Convolutional Neural Network for Anomaly Detection in Videos

Authors: Xianlin Zeng, Yalong Jiang, Wenrui Ding, Hongguang Li, Yafeng Hao, Zifeng Qiu

Comments: Accepted to IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[414] arXiv:2112.04298 [pdf, other]: Title: GCA-Net : Utilizing Gated Context Attention for Improving Image Forgery Localization and Detection

Authors: Sowmen Das, Md. Saiful Islam, Md. Ruhul Amin

Comments: Accepted for publication at the CVPR 2022 Media Forensics Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[415] arXiv:2112.04312 [pdf, other]: Title: Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering

Authors: Mingfei Chen, Jianfeng Zhang, Xiangyu Xu, Lijuan Liu, Yujun Cai, Jiashi Feng, Shuicheng Yan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[416] arXiv:2112.04323 [pdf, other]: Title: Contrastive Learning with Large Memory Bank and Negative Embedding Subtraction for Accurate Copy Detection

Authors: Shuhei Yokoo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[417] arXiv:2112.04345 [pdf, other]: Title: Burn After Reading: Online Adaptation for Cross-domain Streaming Data

Authors: Luyu Yang, Mingfei Gao, Zeyuan Chen, Ran Xu, Abhinav Shrivastava, Chetan Ramaiah

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[418] arXiv:2112.04367 [pdf, other]: Title: On visual self-supervision and its effect on model robustness

Authors: Michal Kucer, Diane Oyen, Garrett Kenyon

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[419] arXiv:2112.04401 [pdf, other]: Title: FPPN: Future Pseudo-LiDAR Frame Prediction for Autonomous Driving

Authors: Xudong Huang, Chunyu Lin, Haojie Liu, Lang Nie, Yao Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[420] arXiv:2112.04417 [pdf, other]: Title: What I Cannot Predict, I Do Not Understand: A Human-Centered Evaluation Framework for Explainability Methods

Authors: Julien Colin, Thomas Fel, Remi Cadene, Thomas Serre

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[421] arXiv:2112.04421 [pdf, other]: Title: SoK: Vehicle Orientation Representations for Deep Rotation Estimation

Authors: Huahong Tu, Siyuan Peng, Vladimir Leung, Richard Gao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[422] arXiv:2112.04432 [pdf, other]: Title: Audio-Visual Synchronisation in the wild

Authors: Honglie Chen, Weidi Xie, Triantafyllos Afouras, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman

Subjects: Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[423] arXiv:2112.04446 [pdf, other]: Title: Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval

Authors: Nina Shvetsova, Brian Chen, Andrew Rouditchenko, Samuel Thomas, Brian Kingsbury, Rogerio Feris, David Harwath, James Glass, Hilde Kuehne

Comments: CVPR2022. The final published version of the proceedings will be available on IEEE Xplore

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[424] arXiv:2112.04453 [pdf, other]: Title: MLP Architectures for Vision-and-Language Modeling: An Empirical Study

Authors: Yixin Nie, Linjie Li, Zhe Gan, Shuohang Wang, Chenguang Zhu, Michael Zeng, Zicheng Liu, Mohit Bansal, Lijuan Wang

Comments: 15 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[425] arXiv:2112.04477 [pdf, other]: Title: Tracking People by Predicting 3D Appearance, Location & Pose

Authors: Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Jitendra Malik

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[426] arXiv:2112.04478 [pdf, other]: Title: Prompting Visual-Language Models for Efficient Video Understanding

Authors: Chen Ju, Tengda Han, Kunhao Zheng, Ya Zhang, Weidi Xie

Comments: ECCV 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[427] arXiv:2112.04480 [pdf, other]: Title: Exploring Temporal Granularity in Self-Supervised Video Representation Learning

Authors: Rui Qian, Yeqing Li, Liangzhe Yuan, Boqing Gong, Ting Liu, Matthew Brown, Serge Belongie, Ming-Hsuan Yang, Hartwig Adam, Yin Cui

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[428] arXiv:2112.04481 [pdf, other]: Title: What's Behind the Couch? Directed Ray Distance Functions (DRDF) for 3D Scene Reconstruction

Authors: Nilesh Kulkarni, Justin Johnson, David F. Fouhey

Comments: Updated illustrations for method section. Project page: see this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[429] arXiv:2112.04482 [pdf, other]: Title: FLAVA: A Foundational Language And Vision Alignment Model

Authors: Amanpreet Singh, Ronghang Hu, Vedanuj Goswami, Guillaume Couairon, Wojciech Galuba, Marcus Rohrbach, Douwe Kiela

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[430] arXiv:2112.04497 [pdf, other]: Title: SIRfyN: Single Image Relighting from your Neighbors

Authors: D.A. Forsyth, Anand Bhattad, Pranav Asthana, Yuanyi Zhong, Yuxiong Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[431] arXiv:2112.04532 [pdf, other]: Title: Segment and Complete: Defending Object Detectors against Adversarial Patch Attacks with Robust Patch Detection

Authors: Jiang Liu, Alexander Levine, Chun Pong Lau, Rama Chellappa, Soheil Feizi

Comments: CVPR 2022 camera ready

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Image and Video Processing (eess.IV)
[432] arXiv:2112.04564 [pdf, other]: Title: CoSSL: Co-Learning of Representation and Classifier for Imbalanced Semi-Supervised Learning

Authors: Yue Fan, Dengxin Dai, Anna Kukleva, Bernt Schiele

Comments: Published at CVPR 2022 as a conference paper. Code at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[433] arXiv:2112.04585 [pdf, other]: Title: MASTAF: A Model-Agnostic Spatio-Temporal Attention Fusion Network for Few-shot Video Classification

Authors: Rex Liu, Huanle Zhang, Hamed Pirsiavash, Xin Liu

Comments: WACV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[434] arXiv:2112.04598 [pdf, other]: Title: InvGAN: Invertible GANs

Authors: Partha Ghosh, Dominik Zietlow, Michael J. Black, Larry S. Davis, Xiaochen Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[435] arXiv:2112.04603 [pdf, other]: Title: A Unified Architecture of Semantic Segmentation and Hierarchical Generative Adversarial Networks for Expression Manipulation

Authors: Rumeysa Bodur, Binod Bhattarai, Tae-Kyun Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[436] arXiv:2112.04607 [pdf, other]: Title: Constrained Mean Shift Using Distant Yet Related Neighbors for Representation Learning

Authors: KL Navaneet, Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Kossar Pourahmadi, Akshayvarun Subramanya, Hamed Pirsiavash

Comments: Code is available at this https URL. arXiv admin note: text overlap with arXiv:2110.10309

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[437] arXiv:2112.04608 [pdf, other]: Title: Enhancing Food Intake Tracking in Long-Term Care with Automated Food Imaging and Nutrient Intake Tracking (AFINI-T) Technology

Authors: Kaylen J. Pfisterer, Robert Amelard, Jennifer Boger, Audrey G. Chung, Heather H. Keller, Alexander Wong

Comments: Key words: Automatic segmentation, convolutional neural network, deep learning, food intake tracking, volume estimation, malnutrition prevention, long-term care, hospital

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[438] arXiv:2112.04610 [pdf, other]: Title: A Simple and efficient deep Scanpath Prediction

Authors: Mohamed Amine Kerkouri, Aladine Chetouani

Comments: Electronic Imaging Symposium 2022 (EI 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[439] arXiv:2112.04628 [pdf, other]: Title: Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection

Authors: Xianpeng Liu, Nan Xue, Tianfu Wu

Journal-ref: Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[440] arXiv:2112.04632 [pdf, other]: Title: Recurrent Glimpse-based Decoder for Detection with Transformer

Authors: Zhe Chen, Jing Zhang, Dacheng Tao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[441] arXiv:2112.04645 [pdf, other]: Title: BACON: Band-limited Coordinate Networks for Multiscale Scene Representation

Authors: David B. Lindell, Dave Van Veen, Jeong Joon Park, Gordon Wetzstein

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[442] arXiv:2112.04662 [pdf, other]: Title: Dual Cluster Contrastive learning for Object Re-Identification

Authors: Hantao Yao, Changsheng Xu

Comments: 12 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[443] arXiv:2112.04665 [pdf, other]: Title: Style Mixing and Patchwise Prototypical Matching for One-Shot Unsupervised Domain Adaptive Semantic Segmentation

Authors: Xinyi Wu, Zhenyao Wu, Yuhang Lu, Lili Ju, Song Wang

Comments: Accepted by AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[444] arXiv:2112.04674 [pdf, other]: Title: DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition

Authors: Yuxuan Liang, Pan Zhou, Roger Zimmermann, Shuicheng Yan

Comments: Accepted by ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[445] arXiv:2112.04680 [pdf, other]: Title: SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-Training for Spatial-Aware Visual Representations

Authors: Zhenyu Li, Zehui Chen, Ang Li, Liangji Fang, Qinhong Jiang, Xianming Liu, Junjun Jiang, Bolei Zhou, Hang Zhao

Comments: Accepted to 36th AAAI Conference on Artificial Intelligence (AAAI 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[446] arXiv:2112.04701 [pdf, other]: Title: Unsupervised Complementary-aware Multi-process Fusion for Visual Place Recognition

Authors: Stephen Hausler, Tobias Fischer, Michael Milford

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[447] arXiv:2112.04702 [pdf, other]: Title: Fast Point Transformer

Authors: Chunghyun Park, Yoonwoo Jeong, Minsu Cho, Jaesik Park

Comments: Accepted to CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[448] arXiv:2112.04709 [pdf, other]: Title: Implicit Feature Refinement for Instance Segmentation

Authors: Lufan Ma, Tiancai Wang, Bin Dong, Jiangpeng Yan, Xiu Li, Xiangyu Zhang

Comments: Published at ACM MM 2021. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[449] arXiv:2112.04710 [pdf, other]: Title: Auto-X3D: Ultra-Efficient Video Understanding via Finer-Grained Neural Architecture Search

Authors: Yifan Jiang, Xinyu Gong, Junru Wu, Humphrey Shi, Zhicheng Yan, Zhangyang Wang

Comments: Accepted by WACV'2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[450] arXiv:2112.04719 [pdf, other]: Title: Learning with Nested Scene Modeling and Cooperative Architecture Search for Low-Light Vision

Authors: Risheng Liu, Long Ma, Tengyu Ma, Xin Fan, Zhongxuan Luo

Comments: Submitted to IEEE TPAMI. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[451] arXiv:2112.04720 [pdf, other]: Title: Amicable Aid: Perturbing Images to Improve Classification Performance

Authors: Juyeop Kim, Jun-Ho Choi, Soobeom Jang, Jong-Seok Lee

Comments: ICASSP 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[452] arXiv:2112.04731 [pdf, other]: Title: Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning

Authors: Yujun Shi, Kuangqi Zhou, Jian Liang, Zihang Jiang, Jiashi Feng, Philip Torr, Song Bai, Vincent Y. F. Tan

Comments: CVPR 2022 Camera-Ready Version

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[453] arXiv:2112.04744 [pdf, other]: Title: Superpixel-Based Building Damage Detection from Post-earthquake Very High Resolution Imagery Using Deep Neural Networks

Authors: Jun Wang, Zhoujing Li, Yixuan Qiao, Qiming Qin, Peng Gao, Guotong Xie

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[454] arXiv:2112.04752 [pdf, other]: Title: Modelling Lips-State Detection Using CNN for Non-Verbal Communications

Authors: Abtahi Ishmam, Mahmudul Hasan, Md. Saif Hassan Onim, Koushik Roy, Md. Akiful Haque Akif, Hussain Nyeem

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[455] arXiv:2112.04761 [pdf, other]: Title: HBReID: Harder Batch for Re-identification

Authors: Wen Li, Furong Xu, Jianan Zhao, Ruobing Zheng, Cheng Zou, Meng Wang, Yuan Cheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[456] arXiv:2112.04764 [pdf, other]: Title: 3D-VField: Adversarial Augmentation of Point Clouds for Domain Generalization in 3D Object Detection

Authors: Alexander Lehner, Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Mohammad-Ali Nikouei Mahani, Nassir Navab, Benjamin Busam, Federico Tombari

Comments: CVPR 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[457] arXiv:2112.04771 [pdf, other]: Title: Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection

Authors: Jiaqi Tang, Zhaoyang Liu, Chen Qian, Wayne Wu, Limin Wang

Comments: CVPR 2022 camera-ready version

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[458] arXiv:2112.04827 [pdf, other]: Title: Explainability of the Implications of Supervised and Unsupervised Face Image Quality Estimations Through Activation Map Variation Analyses in Face Recognition Models

Authors: Biying Fu, Naser Damer

Comments: accepted at the IEEE Winter Conference on Applications of Computer Vision Workshops, WACV Workshops 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[459] arXiv:2112.04840 [pdf, other]: Title: Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-guided Feature Imitation

Authors: Gang Li, Xiang Li, Yujie Wang, Shanshan Zhang, Yichao Wu, Ding Liang

Comments: Accepted by AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[460] arXiv:2112.04846 [pdf, other]: Title: ScaleNet: A Shallow Architecture for Scale Estimation

Authors: Axel Barroso-Laguna, Yurun Tian, Krystian Mikolajczyk

Journal-ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[461] arXiv:2112.04888 [pdf, other]: Title: A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer

Authors: Weijia Wu, Yuanqiang Cai, Debing Zhang, Sibo Wang, Zhuang Li, Jiahong Li, Yejun Tang, Hong Zhou

Comments: 20 pages, 6 figures

Journal-ref: NeurIPS 2021 Track on Datasets and Benchmarks

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[462] arXiv:2112.04903 [pdf, other]: Title: PRA-Net: Point Relation-Aware Network for 3D Point Cloud Analysis

Authors: Silin Cheng, Xiwu Chen, Xinwei He, Zhe Liu, Xiang Bai

Comments: 13 pages

Journal-ref: IEEE Transactions on Image Processing, vol. 30, pp. 4436-4448, 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[463] arXiv:2112.04928 [pdf, other]: Title: Self-Supervised Image-to-Text and Text-to-Image Synthesis

Authors: Anindya Sundar Das, Sriparna Saha

Comments: ICONIP 2021 : The 28th International Conference on Neural Information Processing

Journal-ref: ICONIP 2021. Lecture Notes in Computer Science, vol 13111, pp 415-426. Springer, Cham

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[464] arXiv:2112.04934 [pdf, other]: Title: Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers

Authors: Zunlei Feng, Jiacong Hu, Sai Wu, Xiaotian Yu, Jie Song, Mingli Song

Comments: Accepted by AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[465] arXiv:2112.04937 [pdf, other]: Title: DVHN: A Deep Hashing Framework for Large-scale Vehicle Re-identification

Authors: Yongbiao Chen, Sheng Zhang, Fangxin Liu, Chenggang Wu, Kaicheng Guo, Zhengwei Qi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[466] arXiv:2112.04966 [pdf, other]: Title: CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation

Authors: Lu Qi, Jason Kuen, Zhe Lin, Jiuxiang Gu, Fengyun Rao, Dian Li, Weidong Guo, Zhen Wen, Ming-Hsuan Yang, Jiaya Jia

Comments: Appeared in ECCV2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[467] arXiv:2112.04974 [pdf, other]: Title: AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach

Authors: Xiao Song, Guorun Yang, Xinge Zhu, Hui Zhou, Yuexin Ma, Zhe Wang, Jianping Shi

Comments: To be published in International Journal of Computer Vision (IJCV)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[468] arXiv:2112.04981 [pdf, other]: Title: PE-former: Pose Estimation Transformer

Authors: Paschalis Panteleris, Antonis Argyros

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[469] arXiv:2112.05006 [pdf, other]: Title: Exploring Event-driven Dynamic Context for Accident Scene Segmentation

Authors: Jiaming Zhang, Kailun Yang, Rainer Stiefelhagen

Comments: Accepted to IEEE Transactions on Intelligent Transportation Systems (T-ITS), extended version of arXiv:2008.08974, dataset and code will be made publicly available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[470] arXiv:2112.05053 [pdf, ps, other]: Title: Illumination and Temperature-Aware Multispectral Networks for Edge-Computing-Enabled Pedestrian Detection

Authors: Yifan Zhuang, Ziyuan Pu, Jia Hu, Yinhai Wang

Comments: 13 pages, 12 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[471] arXiv:2112.05077 [pdf, other]: Title: Generating Useful Accident-Prone Driving Scenarios via a Learned Traffic Prior

Authors: Davis Rempe, Jonah Philion, Leonidas J. Guibas, Sanja Fidler, Or Litany

Comments: CVPR 2022 camera-ready

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[472] arXiv:2112.05080 [pdf, other]: Title: Locally Shifted Attention With Early Global Integration

Authors: Shelly Sheynin, Sagie Benaim, Adam Polyak, Lior Wolf

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[473] arXiv:2112.05112 [pdf, other]: Title: BLT: Bidirectional Layout Transformer for Controllable Layout Generation

Authors: Xiang Kong, Lu Jiang, Huiwen Chang, Han Zhang, Yuan Hao, Haifeng Gong, Irfan Essa

Comments: ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[474] arXiv:2112.05121 [pdf, other]: Title: Self-Supervised Keypoint Discovery in Behavioral Videos

Authors: Jennifer J. Sun, Serim Ryou, Roni Goldshmid, Brandon Weissbourd, John Dabiri, David J. Anderson, Ann Kennedy, Yisong Yue, Pietro Perona

Comments: CVPR 2022. Code: this https URL. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[475] arXiv:2112.05126 [pdf, other]: Title: IterMVS: Iterative Probability Estimation for Efficient Multi-View Stereo

Authors: Fangjinhua Wang, Silvano Galliani, Christoph Vogel, Marc Pollefeys

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[476] arXiv:2112.05130 [pdf, other]: Title: Multimodal Conditional Image Synthesis with Product-of-Experts GANs

Authors: Xun Huang, Arun Mallya, Ting-Chun Wang, Ming-Yu Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[477] arXiv:2112.05131 [pdf, other]: Title: Plenoxels: Radiance Fields without Neural Networks

Authors: Alex Yu, Sara Fridovich-Keil, Matthew Tancik, Qinhong Chen, Benjamin Recht, Angjoo Kanazawa

Comments: For video and code, please see this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[478] arXiv:2112.05132 [pdf, other]: Title: Spatio-temporal Relation Modeling for Few-shot Action Recognition

Authors: Anirudh Thatipelli, Sanath Narayan, Salman Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, Bernard Ghanem

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[479] arXiv:2112.05134 [pdf, other]: Title: A Shared Representation for Photorealistic Driving Simulators

Authors: Saeed Saadatnejad, Siyuan Li, Taylor Mordan, Alexandre Alahi

Comments: Accepted to IEEE Transactions on Intelligent Transportation Systems (T-ITS)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[480] arXiv:2112.05136 [pdf, other]: Title: PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning

Authors: Yining Hong, Li Yi, Joshua B. Tenenbaum, Antonio Torralba, Chuang Gan

Comments: NeurIPS 2021. Project page: this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[481] arXiv:2112.05138 [pdf, other]: Title: Searching Parameterized AP Loss for Object Detection

Authors: Chenxin Tao, Zizhang Li, Xizhou Zhu, Gao Huang, Yong Liu, Jifeng Dai

Comments: Accepted by NeurIPS 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[482] arXiv:2112.05139 [pdf, other]: Title: CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields

Authors: Can Wang, Menglei Chai, Mingming He, Dongdong Chen, Jing Liao

Comments: To Appear at CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[483] arXiv:2112.05140 [pdf, other]: Title: NeRF for Outdoor Scene Relighting

Authors: Viktor Rudnev, Mohamed Elgharib, William Smith, Lingjie Liu, Vladislav Golyanik, Christian Theobalt

Comments: 22 pages, 10 figures, 2 tables; ECCV 2022; project web page: this https URL

Journal-ref: European Conference on Computer Vision (ECCV) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[484] arXiv:2112.05141 [pdf, other]: Title: Exploring the Equivalence of Siamese Self-Supervised Learning via A Unified Gradient Framework

Authors: Chenxin Tao, Honghui Wang, Xizhou Zhu, Jiahua Dong, Shiji Song, Gao Huang, Jifeng Dai

Comments: CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[485] arXiv:2112.05142 [pdf, other]: Title: HairCLIP: Design Your Hair by Text and Reference Image

Authors: Tianyi Wei, Dongdong Chen, Wenbo Zhou, Jing Liao, Zhentao Tan, Lu Yuan, Weiming Zhang, Nenghai Yu

Comments: To Appear at CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[486] arXiv:2112.05143 [pdf, other]: Title: GAN-Supervised Dense Visual Alignment

Authors: William Peebles, Jun-Yan Zhu, Richard Zhang, Antonio Torralba, Alexei A. Efros, Eli Shechtman

Comments: An updated version of our CVPR 2022 paper (oral); v2 features additional references and minor text changes. Code available at this https URL. Project page and videos available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[487] arXiv:2112.05144 [pdf, ps, other]: Title: Edge-aware Guidance Fusion Network for RGB Thermal Scene Parsing

Authors: Wujie Zhou, Shaohua Dong, Caie Xu, Yaguan Qian

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[488] arXiv:2112.05181 [pdf, other]: Title: Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision

Authors: Liangzhe Yuan, Rui Qian, Yin Cui, Boqing Gong, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[489] arXiv:2112.05210 [pdf, other]: Title: 7th AI Driving Olympics: 1st Place Report for Panoptic Tracking

Authors: Rohit Mohan, Abhinav Valada

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[490] arXiv:2112.05213 [pdf, other]: Title: Progressive Seed Generation Auto-encoder for Unsupervised Point Cloud Learning

Authors: Juyoung Yang, Pyunghwan Ahn, Doyeon Kim, Haeil Lee, Junmo Kim

Comments: ICCV2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[491] arXiv:2112.05215 [pdf, other]: Title: Road Extraction from Overhead Images with Graph Neural Networks

Authors: Gaetan Bahl, Mehdi Bahri, Florent Lafarge

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[492] arXiv:2112.05219 [pdf, other]: Title: CLIP2StyleGAN: Unsupervised Extraction of StyleGAN Edit Directions

Authors: Rameen Abdal, Peihao Zhu, John Femiani, Niloy J. Mitra, Peter Wonka

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[493] arXiv:2112.05230 [pdf, other]: Title: Injecting Semantic Concepts into End-to-End Image Captioning

Authors: Zhiyuan Fang, Jianfeng Wang, Xiaowei Hu, Lin Liang, Zhe Gan, Lijuan Wang, Yezhou Yang, Zicheng Liu

Journal-ref: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[494] arXiv:2112.05236 [pdf, ps, other]: Title: KartalOl: Transfer learning using deep neural network for iris segmentation and localization: New dataset for iris segmentation

Authors: Jalil Nourmohammadi Khiarak, Samaneh Salehi Nasab, Farhang Jaryani, Seyed Naeim Moafinejad, Rana Pourmohamad, Yasin Amini, Morteza Noshad

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[495] arXiv:2112.05237 [pdf, ps, other]: Title: Transfer learning using deep neural networks for Ear Presentation Attack Detection: New Database for PAD

Authors: Jalil Nourmohammadi Khiarak

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[496] arXiv:2112.05253 [pdf, other]: Title: MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning

Authors: Constantin Eichenberg, Sidney Black, Samuel Weinbach, Letitia Parcalabescu, Anette Frank

Comments: 13 pages, 6 figures, 2 tables. Minor improvements. Accepted at EMNLP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[497] arXiv:2112.05267 [pdf, other]: Title: The Many Faces of Anger: A Multicultural Video Dataset of Negative Emotions in the Wild (MFA-Wild)

Authors: Roya Javadi, Angelica Lim

Comments: 8 pages, 13 figures, submitted to FG2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[498] arXiv:2112.05277 [pdf, other]: Title: Skeletal Graph Self-Attention: Embedding a Skeleton Inductive Bias into Sign Language Production

Authors: Ben Saunders, Necati Cihan Camgoz, Richard Bowden

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[499] arXiv:2112.05280 [pdf, other]: Title: Long-Range Thermal 3D Perception in Low Contrast Environments

Authors: Andrey Filippov, Olga Filippova

Comments: 13 pages, 16 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[500] arXiv:2112.05290 [pdf, other]: Title: Image-to-Image Translation-based Data Augmentation for Robust EV Charging Inlet Detection

Authors: Yeonjun Bang, Yeejin Lee, Byeongkeun Kang

Comments: 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[501] arXiv:2112.05291 [pdf, other]: Title: LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization

Authors: Zhiwei Chen, Changan Wang, Yabiao Wang, Guannan Jiang, Yunhang Shen, Ying Tai, Chengjie Wang, Wei Zhang, Liujuan Cao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[502] arXiv:2112.05295 [pdf, other]: Title: 3D Scene Understanding at Urban Intersection using Stereo Vision and Digital Map

Authors: Prarthana Bhattacharyya, Yanlei Gu, Jiali Bao, Xu Liu, Shunsuke Kamijo

Comments: 6 pages, 6 figures

Journal-ref: 2017 IEEE 85th Vehicular Technology Conference (VTC Spring)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[503] arXiv:2112.05298 [pdf, other]: Title: IFR-Explore: Learning Inter-object Functional Relationships in 3D Indoor Scenes

Authors: Qi Li, Kaichun Mo, Yanchao Yang, Hang Zhao, Leonidas Guibas

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[504] arXiv:2112.05300 [pdf, other]: Title: Representing 3D Shapes with Probabilistic Directed Distance Fields

Authors: Tristan Aumentado-Armstrong, Stavros Tsogkas, Sven Dickinson, Allan Jepson

Comments: 22 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[505] arXiv:2112.05301 [pdf, other]: Title: Self-Ensemling for 3D Point Cloud Domain Adaption

Authors: Qing Li, Xiaojiang Peng, Chuan Yan, Pan Gao, Qi Hao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[506] arXiv:2112.05324 [pdf, other]: Title: Attention-based Transformation from Latent Features to Point Clouds

Authors: Kaiyi Zhang, Ximing Yang, Yuan Wu, Cheng Jin

Comments: 9 pages, 7 figures, AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[507] arXiv:2112.05329 [pdf, other]: Title: FaceFormer: Speech-Driven 3D Facial Animation with Transformers

Authors: Yingruo Fan, Zhaojiang Lin, Jun Saito, Wenping Wang, Taku Komura

Comments: Accepted to CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[508] arXiv:2112.05335 [pdf, other]: Title: Uncertainty, Edge, and Reverse-Attention Guided Generative Adversarial Network for Automatic Building Detection in Remotely Sensed Images

Authors: Somrita Chattopadhyay, Avinash C. Kak

Comments: 23 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[509] arXiv:2112.05340 [pdf, other]: Title: Tradeoffs Between Contrastive and Supervised Learning: An Empirical Study

Authors: Ananya Karthik, Mike Wu, Noah Goodman, Alex Tamkin

Comments: NeurIPS 2021 Workshop: Self-Supervised Learning - Theory and Practice

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[510] arXiv:2112.05341 [pdf, other]: Title: Hyperdimensional Feature Fusion for Out-Of-Distribution Detection

Authors: Samuel Wilson, Tobias Fischer, Niko Sünderhauf, Feras Dayoub

Comments: Accepted to WACV2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[511] arXiv:2112.05351 [pdf, other]: Title: Exploring Pixel-level Self-supervision for Weakly Supervised Semantic Segmentation

Authors: Sung-Hoon Yoon, Hyeokjun Kweon, Jaeseok Jeong, Hyeonseong Kim, Shinjeong Kim, Kuk-Jin Yoon

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[512] arXiv:2112.05375 [pdf, other]: Title: Rethinking the Two-Stage Framework for Grounded Situation Recognition

Authors: Meng Wei, Long Chen, Wei Ji, Xiaoyu Yue, Tat-Seng Chua

Comments: Accepted by AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[513] arXiv:2112.05379 [pdf, other]: Title: Cross-Modal Transferable Adversarial Attacks from Images to Videos

Authors: Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[514] arXiv:2112.05381 [pdf, other]: Title: UNIST: Unpaired Neural Implicit Shape Translation Network

Authors: Qimin Chen, Johannes Merz, Aditya Sanghi, Hooman Shayani, Ali Mahdavi-Amiri, Hao Zhang

Comments: CVPR 2022. project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[515] arXiv:2112.05396 [pdf, other]: Title: Towards Full-to-Empty Room Generation with Structure-Aware Feature Encoding and Soft Semantic Region-Adaptive Normalization

Authors: Vasileios Gkitsas, Nikolaos Zioulis, Vladimiros Sterzentsenko, Alexandros Doumanoglou, Dimitrios Zarpalas

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[516] arXiv:2112.05404 [pdf, other]: Title: The Large Labelled Logo Dataset (L3D): A Multipurpose and Hand-Labelled Continuously Growing Dataset

Authors: Asier Gutiérrez-Fandiño, David Pérez-Fernández, Jordi Armengol-Estapé

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[517] arXiv:2112.05410 [pdf, other]: Title: Multimedia Datasets for Anomaly Detection: A Review

Authors: Pratibha Kumari, Anterpreet Kaur Bedi, Mukesh Saini

Comments: 17 pages, 11 figures, 8 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[518] arXiv:2112.05416 [pdf, other]: Title: Optimizing Edge Detection for Image Segmentation with Multicut Penalties

Authors: Steffen Jung, Sebastian Ziegler, Amirhossein Kardoost, Margret Keuper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[519] arXiv:2112.05425 [pdf, other]: Title: Couplformer: Rethinking Vision Transformer with Coupling Attention Map

Authors: Hai Lan, Xihao Wang, Xian Wei

Comments: 11 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[520] arXiv:2112.05456 [pdf, other]: Title: Monitoring and Adapting the Physical State of a Camera for Autonomous Vehicles

Authors: Maik Wischow, Guillermo Gallego, Ines Ernst, Anko Börner

Comments: 17 pages, 20 figures, this https URL

Journal-ref: IEEE Transactions on Intelligent Transportation Systems (2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[521] arXiv:2112.05485 [pdf, other]: Title: Visual Transformers with Primal Object Queries for Multi-Label Image Classification

Authors: Vacit Oguz Yazici, Joost van de Weijer, Longlong Yu

Comments: Accepted to ICPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[522] arXiv:2112.05488 [pdf, other]: Title: DronePose: The identification, segmentation, and orientation detection of drones via neural networks

Authors: Stirling Scholes, Alice Ruget, German Mora-Martin, Feng Zhu, Istvan Gyongy, Jonathan Leach

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[523] arXiv:2112.05496 [pdf, other]: Title: Graph-based Generative Face Anonymisation with Pose Preservation

Authors: Nicola Dall'Asen, Yiming Wang, Hao Tang, Luca Zanella, Elisa Ricci

Comments: 21st International Conference on Image analysis and Processing

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[524] arXiv:2112.05498 [pdf, other]: Title: Sparse Depth Completion with Semantic Mesh Deformation Optimization

Authors: Bing Zhou, Matias Aiskovich, Sinem Guven

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[525] arXiv:2112.05504 [pdf, other]: Title: BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering

Authors: Yuanbo Xiangli, Linning Xu, Xingang Pan, Nanxuan Zhao, Anyi Rao, Christian Theobalt, Bo Dai, Dahua Lin

Comments: Accepted to ECCV22; Previous version: CityNeRF: Building NeRF at City Scale; Project page can be found in this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[526] arXiv:2112.05533 [pdf, other]: Title: Error Diagnosis of Deep Monocular Depth Estimation Models

Authors: Jagpreet Chawla, Nikhil Thakurdesai, Anuj Godase, Md Reza, David Crandall, Soon-Heung Jung

Comments: Presented at IROS'21

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[527] arXiv:2112.05561 [pdf, other]: Title: Global Attention Mechanism: Retain Information to Enhance Channel-Spatial Interactions

Authors: Yichao Liu, Zongru Shao, Nico Hoffmann

Comments: 5 pages, 3 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[528] arXiv:2112.05576 [pdf, ps, other]: Title: GPU-accelerated image alignment for object detection in industrial applications

Authors: Trung-Son Le, Chyi-Yeu Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[529] arXiv:2112.05585 [pdf, other]: Title: Discrete neural representations for explainable anomaly detection

Authors: Stanislaw Szymanowicz, James Charles, Roberto Cipolla

Journal-ref: Winter Conference on Applications of Computer Vision 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[530] arXiv:2112.05587 [pdf, other]: Title: Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation

Authors: Tianyi Liu, Zuxuan Wu, Wenhan Xiong, Jingjing Chen, Yu-Gang Jiang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[531] arXiv:2112.05598 [pdf, other]: Title: PERF: Performant, Explicit Radiance Fields

Authors: Sverker Rasmuson, Erik Sintorn, Ulf Assarsson

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[532] arXiv:2112.05626 [pdf, other]: Title: Seq-Masks: Bridging the gap between appearance and gait modeling for video-based person re-identification

Authors: Zhigang Chang, Zhao Yang, Yongbiao Chen, Qin Zhou, Shibao Zheng

Comments: ICASSP2021 Submission

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[533] arXiv:2112.05637 [pdf, other]: Title: HeadNeRF: A Real-time NeRF-based Parametric Head Model

Authors: Yang Hong, Bo Peng, Haiyao Xiao, Ligang Liu, Juyong Zhang

Comments: Accepted by CVPR2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[534] arXiv:2112.05644 [pdf, other]: Title: Roominoes: Generating Novel 3D Floor Plans From Existing 3D Rooms

Authors: Kai Wang, Xianghao Xu, Leon Lei, Selena Ling, Natalie Lindsay, Angel X. Chang, Manolis Savva, Daniel Ritchie

Comments: Symposium on Geometry Processing (SGP) 2021

Journal-ref: Computer Graphics Forum, 40: 57-69 (2021)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[535] arXiv:2112.05646 [pdf, other]: Title: Mask-invariant Face Recognition through Template-level Knowledge Distillation

Authors: Marco Huber, Fadi Boutros, Florian Kirchbuchner, Naser Damer

Comments: Accepted at the 16th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[536] arXiv:2112.05667 [pdf, other]: Title: A Deep Learning Based Automated Hand Hygiene Training System

Authors: Mobina Shahbandeh, Fatemeh Ghaffarpour, Sina Vali, Mohammad Amin Haghpanah, Amin Mousavi Torkamani, Mehdi Tale Masouleh, Ahmad Kalhor

Comments: 6 pages, 13 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[537] arXiv:2112.05692 [pdf, other]: Title: VUT: Versatile UI Transformer for Multi-Modal Multi-Task User Interface Modeling

Authors: Yang Li, Gang Li, Xin Zhou, Mostafa Dehghani, Alexey Gritsenko

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[538] arXiv:2112.05727 [pdf, other]: Title: Neural Belief Propagation for Scene Graph Generation

Authors: Daqi Liu, Miroslaw Bober, Josef Kittler

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[539] arXiv:2112.05744 [pdf, other]: Title: More Control for Free! Image Synthesis with Semantic Diffusion Guidance

Authors: Xihui Liu, Dong Huk Park, Samaneh Azadi, Gong Zhang, Arman Chopikyan, Yuxiao Hu, Humphrey Shi, Anna Rohrbach, Trevor Darrell

Comments: WACV 2023. Project page this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[540] arXiv:2112.05749 [pdf, other]: Title: Label, Verify, Correct: A Simple Few Shot Object Detection Method

Authors: Prannay Kaul, Weidi Xie, Andrew Zisserman

Comments: CVPR 2022, project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[541] arXiv:2112.05786 [pdf, other]: Title: Guided Generative Models using Weak Supervision for Detecting Object Spatial Arrangement in Overhead Images

Authors: Weiwei Duan, Yao-Yi Chiang, Stefan Leyk, Johannes H. Uhl, Craig A. Knoblock

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[542] arXiv:2112.05808 [pdf, other]: Title: Benchmarking human visual search computational models in natural scenes: models comparison and reference datasets

Authors: F. Travi (1), G. Ruarte (1), G. Bujia (1), J. E. Kamienkowski (1,2) ((1) Laboratorio de Inteligencia Artificial Aplicada, Instituto de Ciencias de la Computación, Universidad de Buenos Aires - CONICET (2) Maestría de Explotación de Datos y Descubrimiento del Conocimiento, Universidad de Buenos Aires, Argentina)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[543] arXiv:2112.05814 [pdf, other]: Title: Deep ViT Features as Dense Visual Descriptors

Authors: Shir Amir, Yossi Gandelsman, Shai Bagon, Tali Dekel

Comments: Revised version - high res figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[544] arXiv:2112.05825 [pdf, other]: Title: Revisiting Consistency Regularization for Semi-Supervised Learning

Authors: Yue Fan, Anna Kukleva, Bernt Schiele

Comments: Published at GCPR2021 as a conference paper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[545] arXiv:2112.05827 [pdf, other]: Title: Quality-Aware Multimodal Biometric Recognition

Authors: Sobhan Soleymani, Ali Dabouei, Fariborz Taherkhani, Seyed Mehdi Iranmanesh, Jeremy Dawson, Nasser M. Nasrabadi

Comments: IEEE Transactions on Biometrics, Behavior, and Identity Science

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[546] arXiv:2112.05846 [pdf, other]: Title: Semantic Interaction in Augmented Reality Environments for Microsoft HoloLens

Authors: Peer Schüett, Max Schwarz, Sven Behnke

Comments: ECMR 2019, European Conference on Mobile Robots, HoloLens, 6 pages, 6 figures

Journal-ref: European Conference on Mobile Robots (ECMR), 2019

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[547] arXiv:2112.05847 [pdf, other]: Title: A Novel Gaussian Process Based Ground Segmentation Algorithm with Local-Smoothness Estimation

Authors: Pouria Mehrabi, Hamid D. Taghirad

Comments: arXiv admin note: substantial text overlap with arXiv:2111.10638

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[548] arXiv:2112.05851 [pdf, other]: Title: Short and Long Range Relation Based Spatio-Temporal Transformer for Micro-Expression Recognition

Authors: Liangfei Zhang, Xiaopeng Hong, Ognjen Arandjelovic, Guoying Zhao

Comments: 13 pages, 9 figures

Journal-ref: IEEE Transactions on Affective Computing, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[549] arXiv:2112.05861 [pdf, other]: Title: A Discriminative Channel Diversification Network for Image Classification

Authors: Krushi Patel, Guanghui Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[550] arXiv:2112.05871 [pdf, other]: Title: On Adversarial Robustness of Point Cloud Semantic Segmentation

Authors: Jiacen Xu, Zhe Zhou, Boyuan Feng, Yufei Ding, Zhou Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[551] arXiv:2112.05883 [pdf, other]: Title: Self-supervised Spatiotemporal Representation Learning by Exploiting Video Continuity

Authors: Hanwen Liang, Niamul Quader, Zhixiang Chi, Lizhe Chen, Peng Dai, Juwei Lu, Yang Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[552] arXiv:2112.05892 [pdf, other]: Title: COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality

Authors: Honglu Zhou, Asim Kadav, Aviv Shamsian, Shijie Geng, Farley Lai, Long Zhao, Ting Liu, Mubbasir Kapadia, Hans Peter Graf

Comments: ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[553] arXiv:2112.05907 [pdf, other]: Title: Smooth-Swap: A Simple Enhancement for Face-Swapping with Smoothness

Authors: Jiseob Kim, Jihoon Lee, Byoung-Tak Zhang

Comments: CVPR 2022 (Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[554] arXiv:2112.05957 [pdf, other]: Title: AvatarMe++: Facial Shape and BRDF Inference with Photorealistic Rendering-Aware GANs

Authors: Alexandros Lattas, Stylianos Moschoglou, Stylianos Ploumpis, Baris Gecer, Abhijeet Ghosh, Stefanos Zafeiriou

Comments: Project and Dataset page: ( this https URL ). 20 pages, including supplemental materials. Accepted for publishing at IEEE Transactions on Pattern Analysis and Machine Intelligence on 13 November 2021. Copyright 2021 IEEE. Personal use of this material is permitted

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[555] arXiv:2112.05958 [src]: Title: You Only Need End-to-End Training for Long-Tailed Recognition

Authors: Zhiwei Zhang

Comments: This is a draft

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[556] arXiv:2112.05975 [src]: Title: CPRAL: Collaborative Panoptic-Regional Active Learning for Semantic Segmentation

Authors: Yu Qiao, Jincheng Zhu, Chengjiang Long, Zeyao Zhang, Yuxin Wang, Zhenjun Du, Xin Yang

Comments: This is not the final version of our paper, and we will upload a final version later

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[557] arXiv:2112.05982 [pdf, ps, other]: Title: Overview of The MediaEval 2021 Predicting Media Memorability Task

Authors: Rukiye Savran Kiziltepe, Mihai Gabriel Constantin, Claire-Helene Demarty, Graham Healy, Camilo Fosco, Alba Garcia Seco de Herrera, Sebastian Halder, Bogdan Ionescu, Ana Matran-Fernandez, Alan F. Smeaton, Lorin Sweeney

Comments: 3 pages, to appear in Proceedings of MediaEval 2021, December 13-15 2021, Online

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[558] arXiv:2112.05993 [pdf, other]: Title: Object Counting: You Only Need to Look at One

Authors: Hui Lin, Xiaopeng Hong, Yabin Wang

Comments: Keywords: Crowd counting, one-shot object counting, Attention

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[559] arXiv:2112.05999 [pdf, other]: Title: Curvature-guided dynamic scale networks for Multi-view Stereo

Authors: Khang Truong Giang, Soohwan Song, Sungho Jo

Comments: Accepted to ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[560] arXiv:2112.06011 [pdf, other]: Title: Improving the Transferability of Adversarial Examples with Resized-Diverse-Inputs, Diversity-Ensemble and Region Fitting

Authors: Junhua Zou, Zhisong Pan, Junyang Qiu, Xin Liu, Ting Rui, Wei Li

Comments: Accepted to ECCV2020

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[561] arXiv:2112.06029 [pdf, other]: Title: On Automatic Data Augmentation for 3D Point Cloud Classification

Authors: Wanyue Zhang, Xun Xu, Fayao Liu, Le Zhang, Chuan-Sheng Foo

Comments: BMVC 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[562] arXiv:2112.06074 [pdf, other]: Title: Early Stopping for Deep Image Prior

Authors: Hengkang Wang, Taihui Li, Zhong Zhuang, Tiancong Chen, Hengyue Liang, Ju Sun

Comments: Published in TMLR (this https URL)

Journal-ref: Transactions on Machine Learning Research (TMLR), 2835-8856 (12/2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[563] arXiv:2112.06103 [pdf, other]: Title: Improving Vision Transformers for Incremental Learning

Authors: Pei Yu, Yinpeng Chen, Ying Jin, Zicheng Liu

Comments: Add experiments on CIFAR-100, comparison with DER

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[564] arXiv:2112.06104 [pdf, other]: Title: Synthetic Map Generation to Provide Unlimited Training Data for Historical Map Text Detection

Authors: Zekun Li, Runyu Guan, Qianmu Yu, Yao-Yi Chiang, Craig A. Knoblock

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[565] arXiv:2112.06106 [pdf, other]: Title: Controlled-rearing studies of newborn chicks and deep neural networks

Authors: Donsuk Lee, Pranav Gujarathi, Justin N. Wood

Comments: NeurIPS 2021 Workshop on Shared Visual Representations in Human & Machine Intelligence

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
[566] arXiv:2112.06113 [pdf, other]: Title: Learning from the Tangram to Solve Mini Visual Tasks

Authors: Yizhou Zhao, Liang Qiu, Pan Lu, Feng Shi, Tian Han, Song-Chun Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[567] arXiv:2112.06116 [pdf, other]: Title: Stereoscopic Universal Perturbations across Different Architectures and Datasets

Authors: Zachary Berger, Parth Agrawal, Tian Yu Liu, Stefano Soatto, Alex Wong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[568] arXiv:2112.06120 [pdf, other]: Title: Sidewalk Measurements from Satellite Images: Preliminary Findings

Authors: Maryam Hosseini, Iago B. Araujo, Hamed Yazdanpanah, Eric K. Tokuda, Fabio Miranda, Claudio T. Silva, Roberto M. Cesar Jr

Journal-ref: Spatial Data Science Symposium 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[569] arXiv:2112.06121 [pdf, other]: Title: Magnifying Networks for Images with Billions of Pixels

Authors: Neofytos Dimitriou, Ognjen Arandjelovic

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[570] arXiv:2112.06133 [pdf, other]: Title: MVLayoutNet: 3D layout reconstruction with multi-view panoramas

Authors: Zhihua Hu, Bo Duan, Yanfeng Zhang, Mingwei Sun, Jingwei Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[571] arXiv:2112.06147 [pdf, other]: Title: Self-Supervised Modality-Aware Multiple Granularity Pre-Training for RGB-Infrared Person Re-Identification

Authors: Lin Wan, Qianyan Jing, Zongyuan Sun, Chuang Zhang, Zhihang Li, Yehansen Chen

Comments: 13 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[572] arXiv:2112.06150 [pdf, other]: Title: Deep Translation Prior: Test-time Training for Photorealistic Style Transfer

Authors: Sunwoo Kim, Soohyun Kim, Seungryong Kim

Comments: Accepted to AAAI 2022. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[573] arXiv:2112.06161 [pdf, other]: Title: Semi-supervised Domain Adaptive Structure Learning

Authors: Can Qin, Lichen Wang, Qianqian Ma, Yu Yin, Huan Wang, Yun Fu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[574] arXiv:2112.06170 [pdf, other]: Title: Deep network for rolling shutter rectification

Authors: Praveen K, Lokesh Kumar T, A.N. Rajagopalan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[575] arXiv:2112.06171 [pdf, other]: Title: Pixel-wise Deep Image Stitching

Authors: Hyeokjun Kweon, Hyeonseong Kim, Yoonsu Kang, Youngho Yoon, Wooseong Jeong, Kuk-Jin Yoon

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[576] arXiv:2112.06174 [pdf, other]: Title: Implicit Transformer Network for Screen Content Image Continuous Super-Resolution

Authors: Jingyu Yang, Sheng Shen, Huanjing Yue, Kun Li

Comments: 24 pages with 3 figures, NeurIPS 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[577] arXiv:2112.06175 [pdf, other]: Title: Unsupervised Domain-Specific Deblurring using Scale-Specific Attention

Authors: Praveen Kandula, Rajagopalan. A. N

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[578] arXiv:2112.06179 [pdf, other]: Title: BIPS: Bi-modal Indoor Panorama Synthesis via Residual Depth-aided Adversarial Learning

Authors: Changgyoon Oh, Wonjune Cho, Daehee Park, Yujeong Chae, Lin Wang, Kuk-Jin Yoon

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[579] arXiv:2112.06180 [pdf, other]: Title: 360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation

Authors: Bolivar Solarte, Yueh-Cheng Liu, Chin-Hsuan Wu, Yi-Hsuan Tsai, Min Sun

Comments: IEEE RA-L 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[580] arXiv:2112.06183 [pdf, other]: Title: Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species

Authors: Changsheng Lu, Piotr Koniusz

Comments: Accepted by CVPR 2022; 8 pages for main paper, 6 pages for supplementary materials

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[581] arXiv:2112.06193 [pdf, other]: Title: GUNNEL: Guided Mixup Augmentation and Multi-View Fusion for Aquatic Animal Segmentation

Authors: Minh-Quan Le, Trung-Nghia Le, Tam V. Nguyen, Isao Echizen, Minh-Triet Tran

Comments: The code is available at this https URL . The dataset is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[582] arXiv:2112.06197 [pdf, other]: Title: Video as Conditional Graph Hierarchy for Multi-Granular Question Answering

Authors: Junbin Xiao, Angela Yao, Zhiyuan Liu, Yicong Li, Wei Ji, Tat-Seng Chua

Comments: AAAI'22 (Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[583] arXiv:2112.06238 [pdf, other]: Title: HerosNet: Hyperspectral Explicable Reconstruction and Optimal Sampling Deep Network for Snapshot Compressive Imaging

Authors: Xuanyu Zhang, Yongbing Zhang, Ruiqin Xiong, Qilin Sun, Jian Zhang

Comments: CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[584] arXiv:2112.06242 [pdf, other]: Title: Formulating Event-based Image Reconstruction as a Linear Inverse Problem with Deep Regularization using Optical Flow

Authors: Zelin Zhang, Anthony Yezzi, Guillermo Gallego

Comments: 22 pages, 26 figures, 5 tables, 6 animations when clicked on

Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 45, No. 7, pp. 8372-8389, July 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[585] arXiv:2112.06307 [pdf, other]: Title: Image-to-Height Domain Translation for Synthetic Aperture Sonar

Authors: Dylan Stewart, Shawn Johnson, Alina Zare

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[586] arXiv:2112.06320 [pdf, other]: Title: Anomaly Crossing: New Horizons for Video Anomaly Detection as Cross-domain Few-shot Learning

Authors: Guangyu Sun, Zhang Liu, Lianggong Wen, Jing Shi, Chenliang Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[587] arXiv:2112.06323 [pdf, other]: Title: Interpolated Joint Space Adversarial Training for Robust and Generalizable Defenses

Authors: Chun Pong Lau, Jiang Liu, Hossein Souri, Wei-An Lin, Soheil Feizi, Rama Chellappa

Comments: Under submission

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[588] arXiv:2112.06343 [pdf, other]: Title: Change Detection Meets Visual Question Answering

Authors: Zhenghang Yuan, Lichao Mou, Zhitong Xiong, Xiaoxiang Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[589] arXiv:2112.06375 [pdf, other]: Title: Embracing Single Stride 3D Object Detector with Sparse Transformer

Authors: Lue Fan, Ziqi Pang, Tianyuan Zhang, Yu-Xiong Wang, Hang Zhao, Feng Wang, Naiyan Wang, Zhaoxiang Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[590] arXiv:2112.06379 [pdf, other]: Title: 5th Place Solution for VSPW 2021 Challenge

Authors: Jiafan Zhuang, Yixin Zhang, Xinyu Hu, Junjie Li, Zilei Wang

Comments: Presented in ICCV'21 Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[591] arXiv:2112.06389 [pdf, other]: Title: Local and Global Point Cloud Reconstruction for 3D Hand Pose Estimation

Authors: Ziwei Yu, Linlin Yang, Shicheng Chen, Angela Yao

Comments: The British Machine Vision Conference (BMVC)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[592] arXiv:2112.06390 [pdf, other]: Title: PartGlot: Learning Shape Part Segmentation from Language Reference Games

Authors: Juil Koo, Ian Huang, Panos Achlioptas, Leonidas Guibas, Minhyuk Sung

Comments: CVPR 2022 (Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[593] arXiv:2112.06392 [pdf, other]: Title: The Overlooked Classifier in Human-Object Interaction Recognition

Authors: Ying Jin, Yinpeng Chen, Lijuan Wang, Jianfeng Wang, Pei Yu, Lin Liang, Jenq-Neng Hwang, Zicheng Liu

Comments: arXiv admin note: substantial text overlap with arXiv:2107.13083

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[594] arXiv:2112.06398 [pdf, other]: Title: Shaping Visual Representations with Attributes for Few-Shot Recognition

Authors: Haoxing Chen, Huaxiong Li, Yaohui Li, Chunlin Chen

Comments: Accepted by IEEE Signal Processing Letters

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[595] arXiv:2112.06401 [pdf, other]: Title: Deep Attentional Guided Image Filtering

Authors: Zhiwei Zhong, Xianming Liu, Junjun Jiang, Debin Zhao, Xiangyang Ji

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[596] arXiv:2112.06406 [pdf, other]: Title: Hybrid Atlas Building with Deep Registration Priors

Authors: Nian Wu, Jian Wang, Miaomiao Zhang, Guixu Zhang, Yaxin Peng, Chaomin Shen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[597] arXiv:2112.06428 [pdf, other]: Title: Holistic Interpretation of Public Scenes Using Computer Vision and Temporal Graphs to Identify Social Distancing Violations

Authors: Gihan Jayatilaka, Jameel Hassan, Suren Sritharan, Janith Bandara Senananayaka, Harshana Weligampola, Roshan Godaliyadda, Parakrama Ekanayake, Vijitha Herath, Janaka Ekanayake, Samath Dharmaratne

Comments: 23 pages, 19 figures. Gihan Jayatilaka, Jameel Hassan, and Suren Sritharan contributed equally to this work

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[598] arXiv:2112.06433 [pdf, other]: Title: Generate Point Clouds with Multiscale Details from Graph-Represented Structures

Authors: Ximing Yang, Zhengfu He, Cheng Jin

Comments: 16 pages, 6 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[599] arXiv:2112.06437 [pdf, other]: Title: Semi-Supervised Contrastive Learning for Remote Sensing: Identifying Ancient Urbanization in the South Central Andes

Authors: Jiachen Xu, Junlin Guo, James Zimmer-Dauphinee, Quan Liu, Yuxuan Shi, Zuhayr Asad, D. Mitchell Wilkes, Parker VanValkenburgh, Steven A. Wernke, Yuankai Huo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[600] arXiv:2112.06447 [pdf, other]: Title: SVIP: Sequence VerIfication for Procedures in Videos

Authors: Yicheng Qian, Weixin Luo, Dongze Lian, Xu Tang, Peilin Zhao, Shenghua Gao

Comments: Accepted by CVPR2022. For the included dataset, see this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[601] arXiv:2112.06451 [pdf, other]: Title: Semantically Contrastive Learning for Low-light Image Enhancement

Authors: Dong Liang, Ling Li, Mingqiang Wei, Shuo Yang, Liyan Zhang, Wenhan Yang, Yun Du, Huiyu Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[602] arXiv:2112.06454 [pdf, other]: Title: Split GCN: Effective Interactive Annotation for Segmentation of Disconnected Instance

Authors: Namgil Kim, Barom Kang, Yeonok Cho

Comments: 11 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[603] arXiv:2112.06455 [pdf, other]: Title: Self-Paced Deep Regression Forests with Consideration of Ranking Fairness

Authors: Lili Pan, Mingming Meng, Yazhou Ren, Yali Zheng, Zenglin Xu

Comments: The article is submitted to TNNLS, and is under review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[604] arXiv:2112.06456 [pdf, other]: Title: Real Time Action Recognition from Video Footage

Authors: Tasnim Sakib Apon, Mushfiqul Islam Chowdhury, MD Zubair Reza, Arpita Datta, Syeda Tanjina Hasan, MD. Golam Rabiul Alam

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[605] arXiv:2112.06467 [pdf, other]: Title: An Informative Tracking Benchmark

Authors: Xin Li, Qiao Liu, Wenjie Pei, Qiuhong Shen, Yaowei Wang, Huchuan Lu, Ming-Hsuan Yang

Comments: 10 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[606] arXiv:2112.06489 [pdf, other]: Title: Multi-Modal Mutual Information Maximization: A Novel Approach for Unsupervised Deep Cross-Modal Hashing

Authors: Tuan Hoang, Thanh-Toan Do, Tam V. Nguyen, Ngai-Man Cheung

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[607] arXiv:2112.06502 [pdf, other]: Title: DGL-GAN: Discriminator Guided Learning for GAN Compression

Authors: Yuesong Tian, Li Shen, Xiang Tian, Dacheng Tao, Zhifeng Li, Wei Liu, Yaowu Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[608] arXiv:2112.06522 [pdf, other]: Title: Anatomizing Bias in Facial Analysis

Authors: Richa Singh, Puspita Majumdar, Surbhi Mittal, Mayank Vatsa

Comments: Accepted in AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[609] arXiv:2112.06530 [pdf, ps, other]: Title: Centroid-UNet: Detecting Centroids in Aerial Images

Authors: N. Lakmal Deshapriya, Dan Tran, Sriram Reddy, Kavinda Gunasekara

Comments: Proceedings of the 42nd Asian Conference on Remote Sensing, 2021, Can Tho city, Vietnam

Journal-ref: ACRS 42nd (2021) 100

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[610] arXiv:2112.06533 [pdf, other]: Title: Makeup216: Logo Recognition with Adversarial Attention Representations

Authors: Junjun Hu, Yanhao Zhu, Bo Zhao, Jiexin Zheng, Chenxu Zhao, Xiangyu Zhu, Kangle Wu, Darun Tang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[611] arXiv:2112.06536 [pdf, other]: Title: SphereSR: 360° Image Super-Resolution with Arbitrary Projection via Continuous Spherical Image Representation

Authors: Youngho Yoon, Inchul Chung, Lin Wang, Kuk-Jin Yoon

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[612] arXiv:2112.06538 [pdf, other]: Title: Hybrid Graph Neural Networks for Few-Shot Learning

Authors: Tianyuan Yu, Sen He, Yi-Zhe Song, Tao Xiang

Comments: To appear in AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[613] arXiv:2112.06554 [pdf, ps, other]: Title: Ensemble CNN Networks for GBM Tumors Segmentation using Multi-parametric MRI

Authors: Ramy A. Zeineldin, Mohamed E. Karar, Franziska Mathis-Ullrich, Oliver Burgert

Comments: Accepted in BraTS 2021 (as part of the BrainLes workshop proceedings distributed by Springer LNCS)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[614] arXiv:2112.06558 [pdf, other]: Title: MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning

Authors: Wenqiao Zhang, Haochen Shi, Jiannan Guo, Shengyu Zhang, Qingpeng Cai, Juncheng Li, Sihui Luo, Yueting Zhuang

Journal-ref: AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[615] arXiv:2112.06569 [pdf, other]: Title: Triangle Attack: A Query-efficient Decision-based Adversarial Attack

Authors: Xiaosen Wang, Zeliang Zhang, Kangheng Tong, Dihong Gong, Kun He, Zhifeng Li, Wei Liu

Comments: Accepted by ECCV 2022, code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[616] arXiv:2112.06586 [pdf, other]: Title: Active learning with MaskAL reduces annotation effort for training Mask R-CNN

Authors: Pieter M. Blok, Gert Kootstra, Hakim Elchaoui Elghor, Boubacar Diallo, Frits K. van Evert, Eldert J. van Henten

Comments: 30 pages, 10 figures, 3 tables

Journal-ref: Computers and Electronics in Agriculture, 197 (2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[617] arXiv:2112.06592 [pdf, other]: Title: CR-FIQA: Face Image Quality Assessment by Learning Sample Relative Classifiability

Authors: Fadi Boutros, Meiling Fang, Marcel Klemt, Biying Fu, Naser Damer

Comments: Accepted at the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023 (CVPR2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[618] arXiv:2112.06596 [pdf, other]: Title: SAC-GAN: Structure-Aware Image Composition

Authors: Hang Zhou, Rui Ma, Ling-Xiao Zhang, Lin Gao, Ali Mahdavi-Amiri, Hao Zhang

Comments: Accepted to TVCG. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[619] arXiv:2112.06624 [pdf, other]: Title: Pedestrian Trajectory Prediction via Spatial Interaction Transformer Network

Authors: Tong Su, Yu Meng, Yan Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[620] arXiv:2112.06632 [pdf, other]: Title: Lifelong Unsupervised Domain Adaptive Person Re-identification with Coordinated Anti-forgetting and Adaptation

Authors: Zhipeng Huang, Zhizheng Zhang, Cuiling Lan, Wenjun Zeng, Peng Chu, Quanzeng You, Jiang Wang, Zicheng Liu, Zheng-jun Zha

Comments: Accepted by CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[621] arXiv:2112.06685 [pdf, other]: Title: Quaternion-Valued Convolutional Neural Network Applied for Acute Lymphoblastic Leukemia Diagnosis

Authors: Marco Aurélio Granero, Cristhian Xavier Hernández, Marcos Eduardo Valle

Journal-ref: A. Britto and K. Valdivia Delgado (Eds.): BRACIS 2021, LNAI 13074, pp. 280-293, 2021. Springer Nature Switzerland AG 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[622] arXiv:2112.06701 [pdf, other]: Title: Anchor Retouching via Model Interaction for Robust Object Detection in Aerial Images

Authors: Dong Liang, Qixiang Geng, Zongqi Wei, Dmitry A. Vorontsov, Ekaterina L. Kim, Mingqiang Wei, Huiyu Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[623] arXiv:2112.06705 [pdf, other]: Title: N-SfC: Robust and Fast Shape Estimation from Caustic Images

Authors: Marc Kassubeck, Moritz Kappel, Susana Castillo, Marcus Magnor

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[624] arXiv:2112.06714 [pdf, other]: Title: Learning Semantic-Aligned Feature Representation for Text-based Person Search

Authors: Shiping Li, Min Cao, Min Zhang

Comments: 5 pages, 3 figures, 3 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[625] arXiv:2112.06730 [pdf, other]: Title: VirtualCube: An Immersive 3D Video Communication System

Authors: Yizhong Zhang, Jiaolong Yang, Zhen Liu, Ruicheng Wang, Guojun Chen, Xin Tong, Baining Guo

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Multimedia (cs.MM)
[626] arXiv:2112.06741 [pdf, other]: Title: Long-tail Recognition via Compositional Knowledge Transfer

Authors: Sarah Parisot, Pedro M. Esperanca, Steven McDonagh, Tamas J. Madarasz, Yongxin Yang, Zhenguo Li

Comments: Accepted to CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[627] arXiv:2112.06745 [pdf, other]: Title: A Survey of Unsupervised Domain Adaptation for Visual Recognition

Authors: Youshan Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[628] arXiv:2112.06782 [pdf, other]: Title: GCNDepth: Self-supervised Monocular Depth Estimation based on Graph Convolutional Network

Authors: Armin Masoumian, Hatem A. Rashwan, Saddam Abdulwahab, Julian Cristiano, Domenec Puig

Comments: 10 pages, Submitted to IEEE transactions on intelligent transportation systems

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[629] arXiv:2112.06809 [pdf, other]: Title: Persistent Animal Identification Leveraging Non-Visual Markers

Authors: Michael P. J. Camilleri, Li Zhang, Rasneer S. Bains, Andrew Zisserman, Christopher K. I. Williams

Journal-ref: Machine Vision and Applications 34, 68 (2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Combinatorics (math.CO)
[630] arXiv:2112.06825 [pdf, other]: Title: VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks

Authors: Yi-Lin Sung, Jaemin Cho, Mohit Bansal

Comments: CVPR 2022 (15 pages; with new video-text and CLIP-ViL experiments)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[631] arXiv:2112.06853 [pdf, other]: Title: The whole and the parts: the MDL principle and the a-contrario framework

Authors: Rafael Grompone von Gioi, Ignacio Ramírez Paulino, Gregory Randall

Comments: Submitted to SIAM Journal on Imaging Sciences (SIIMS)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[632] arXiv:2112.06904 [pdf, other]: Title: HVH: Learning a Hybrid Neural Volumetric Representation for Dynamic Hair Performance Capture

Authors: Ziyan Wang, Giljoo Nam, Tuur Stuyck, Stephen Lombardi, Michael Zollhoefer, Jessica Hodgins, Christoph Lassner

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[633] arXiv:2112.06909 [pdf, other]: Title: Hallucinating Pose-Compatible Scenes

Authors: Tim Brooks, Alexei A. Efros

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[634] arXiv:2112.06910 [pdf, other]: Title: DenseGAP: Graph-Structured Dense Correspondence Learning with Anchor Points

Authors: Zhengfei Kuang, Jiaman Li, Mingming He, Tong Wang, Yajie Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[635] arXiv:2112.06978 [pdf, other]: Title: Exploring Latent Dimensions of Crowd-sourced Creativity

Authors: Umut Kocasari, Alperen Bag, Efehan Atici, Pinar Yanardag

Comments: 5th Workshop on Machine Learning for Creativity and Design (NeurIPS 2021), Sydney, Australia

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[636] arXiv:2112.06988 [pdf, other]: Title: Event-guided Deblurring of Unknown Exposure Time Videos

Authors: Taewoo Kim, Jeongmin Lee, Lin Wang, Kuk-Jin Yoon

Comments: Accepted in ECCV 2022 (Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[637] arXiv:2112.07015 [pdf, other]: Title: Multi-Expert Human Action Recognition with Hierarchical Super-Class Learning

Authors: Hojat Asgarian Dehkordi, Ali Soltani Nezhad, Hossein Kashiani, Shahriar Baradaran Shokouhi, Ahmad Ayatollahi

Comments: 47 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[638] arXiv:2112.07074 [pdf, other]: Title: Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text

Authors: Qing Li, Boqing Gong, Yin Cui, Dan Kondratyuk, Xianzhi Du, Ming-Hsuan Yang, Matthew Brown

Comments: preliminary work

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[639] arXiv:2112.07082 [pdf, ps, other]: Title: DeepDiffusion: Unsupervised Learning of Retrieval-adapted Representations via Diffusion-based Ranking on Latent Feature Manifold

Authors: Takahiko Furuya, Ryutarou Ohbuchi

Comments: Accepted to the IEEE Access journal

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[640] arXiv:2112.07088 [pdf, other]: Title: ElePose: Unsupervised 3D Human Pose Estimation by Predicting Camera Elevation and Learning Normalizing Flows on 2D Poses

Authors: Bastian Wandt, James J. Little, Helge Rhodin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[641] arXiv:2112.07106 [pdf, other]: Title: E-CRF: Embedded Conditional Random Field for Boundary-caused Class Weights Confusion in Semantic Segmentation

Authors: Jie Zhu, Huabin Huang, Banghuai Li, Leye Wang

Comments: Accepted by ICLR2023. Camera-ready Version

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[642] arXiv:2112.07111 [pdf, other]: Title: EMDS-6: Environmental Microorganism Image Dataset Sixth Version for Image Denoising, Segmentation, Feature Extraction, Classification and Detection Methods Evaluation

Authors: Peng Zhao, Chen Li, Md Mamunur Rahaman, Hao Xu, Pingli Ma, Hechen Yang, Hongzan Sun, Tao Jiang, Ning Xu, Marcin Grzegorzek

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[643] arXiv:2112.07116 [pdf, other]: Title: Joint 3D Object Detection and Tracking Using Spatio-Temporal Representation of Camera Image and LiDAR Point Clouds

Authors: Junho Koh, Jaekyum Kim, Jinhyuk Yoo, Yecheol Kim, Dongsuk Kum, Jun Won Choi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[644] arXiv:2112.07133 [pdf, other]: Title: CLIP-Lite: Information Efficient Visual Representation Learning with Language Supervision

Authors: Aman Shrivastava, Ramprasaath R. Selvaraju, Nikhil Naik, Vicente Ordonez

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[645] arXiv:2112.07146 [pdf, other]: Title: PP-HumanSeg: Connectivity-Aware Portrait Segmentation with a Large-Scale Teleconferencing Video Dataset

Authors: Lutao Chu, Yi Liu, Zewu Wu, Shiyu Tang, Guowei Chen, Yuying Hao, Juncai Peng, Zhiliang Yu, Zeyu Chen, Baohua Lai, Haoyi Xiong

Comments: Accepted by WACV workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[646] arXiv:2112.07159 [pdf, other]: Title: Birds Eye View Social Distancing Analysis System

Authors: Zhengye Yang, Mingfei Sun, Hongzhe Ye, Zihao Xiong, Gil Zussman, Zoran Kostic

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[647] arXiv:2112.07173 [pdf, other]: Title: On the use of Cortical Magnification and Saccades as Biological Proxies for Data Augmentation

Authors: Binxu Wang, David Mayo, Arturo Deza, Andrei Barbu, Colin Conwell

Comments: 14 pages, 6 figures, 2 tables. Published in NeurIPS 2021 Workshop, Shared Visual Representations in Human & Machine Intelligence (SVRHM). For code, see this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Neurons and Cognition (q-bio.NC)
[648] arXiv:2112.07175 [pdf, other]: Title: Co-training Transformer with Videos and Images Improves Action Recognition

Authors: Bowen Zhang, Jiahui Yu, Christopher Fifty, Wei Han, Andrew M. Dai, Ruoming Pang, Fei Sha

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[649] arXiv:2112.07200 [pdf, other]: Title: Weakly Supervised High-Fidelity Clothing Model Generation

Authors: Ruili Feng, Cheng Ma, Chengji Shen, Xin Gao, Zhenjiang Liu, Xiaobo Li, Kairi Ou, Zhengjun Zha

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[650] arXiv:2112.07219 [pdf, other]: Title: A real-time spatiotemporal AI model analyzes skill in open surgical videos

Authors: Emmett D. Goodman, Krishna K. Patel, Yilun Zhang, William Locke, Chris J. Kennedy, Rohan Mehrotra, Stephen Ren, Melody Y. Guan, Maren Downing, Hao Wei Chen, Jevin Z. Clark, Gabriel A. Brat, Serena Yeung

Comments: 22 pages, 4 main text figures, 7 extended data figures, 4 extended data tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[651] arXiv:2112.07224 [pdf, other]: Title: Exploring Category-correlated Feature for Few-shot Image Classification

Authors: Jing Xu, Xinglin Pan, Xu Luo, Wenjie Pei, Zenglin Xu

Comments: 10 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[652] arXiv:2112.07225 [pdf, other]: Title: Margin Calibration for Long-Tailed Visual Recognition

Authors: Yidong Wang, Bowen Zhang, Wenxin Hou, Zhen Wu, Jindong Wang, Takahiro Shinozaki

Comments: Accepted by Asian Conference on Machine Learning (ACML) 2022; 16 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[653] arXiv:2112.07241 [pdf, other]: Title: Static-Dynamic Co-Teaching for Class-Incremental 3D Object Detection

Authors: Na Zhao, Gim Hee Lee

Comments: Accepted at AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[654] arXiv:2112.07246 [pdf, other]: Title: Federated Learning for Face Recognition with Gradient Correction

Authors: Yifan Niu, Weihong Deng

Comments: accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[655] arXiv:2112.07270 [pdf, other]: Title: Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering

Authors: JianJian Cao, Xiameng Qin, Sanyuan Zhao, Jianbing Shen

Comments: pre-print, TNNLS, 12 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[656] arXiv:2112.07282 [pdf, other]: Title: SNF: Filter Pruning via Searching the Proper Number of Filters

Authors: Pengkun Liu, Yaru Yue, Yanjun Guo, Xingxiang Tao, Xiaoguang Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[657] arXiv:2112.07286 [pdf, ps, other]: Title: Levels of Autonomous Radiology

Authors: Suraj Ghuwalewala, Viraj Kulkarni, Richa Pant, Amit Kharat

Comments: 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[658] arXiv:2112.07289 [pdf, other]: Title: Smoothness and effective regularizations in learned embeddings for shape matching

Authors: Riccardo Marin, Souhaib Attaiki, Simone Melzi, Emanuele Rodolà, Maks Ovsjanikov

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[659] arXiv:2112.07315 [pdf, other]: Title: Kernel-aware Burst Blind Super-Resolution

Authors: Wenyi Lian, Shanglian Peng

Comments: Accepted by WACV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[660] arXiv:2112.07334 [pdf, other]: Title: OMAD: Object Model with Articulated Deformations for Pose Estimation and Retrieval

Authors: Han Xue, Liu Liu, Wenqiang Xu, Haoyuan Fu, Cewu Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[661] arXiv:2112.07338 [pdf, other]: Title: Temporal Transformer Networks with Self-Supervision for Action Recognition

Authors: Yongkang Zhang, Jun Li, Guoming Wu, Han Zhang, Zhiping Shi, Zhaoxun Liu, Zizhang Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[662] arXiv:2112.07374 [pdf, other]: Title: Geometry-Contrastive Transformer for Generalized 3D Pose Transfer

Authors: Haoyu Chen, Hao Tang, Zitong Yu, Nicu Sebe, Guoying Zhao

Comments: AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[663] arXiv:2112.07380 [pdf, other]: Title: TRACER: Extreme Attention Guided Salient Object Tracing Network

Authors: Min Seok Lee, Wooseok Shin, Sung Won Han

Comments: AAAI 2022, SA poster session accepted paper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[664] arXiv:2112.07383 [pdf, other]: Title: Improving Human-Object Interaction Detection via Phrase Learning and Label Composition

Authors: Zhimin Li, Cheng Zou, Yu Zhao, Boxun Li, Sheng Zhong

Comments: Accepted to AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[665] arXiv:2112.07395 [pdf, other]: Title: Handwritten text generation and strikethrough characters augmentation

Authors: Alex Shonenkov, Denis Karachev, Max Novopoltsev, Mark Potanin, Denis Dimitrov, Andrey Chertok

Comments: 16 pages, 15 figures. arXiv admin note: substantial text overlap with arXiv:2108.11667

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[666] arXiv:2112.07403 [pdf, ps, other]: Title: Stochastic Actor-Executor-Critic for Image-to-Image Translation

Authors: Ziwei Luo, Jing Hu, Xin Wang, Siwei Lyu, Bin Kong, Youbing Yin, Qi Song, Xi Wu

Journal-ref: IJCAI 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[667] arXiv:2112.07414 [pdf, other]: Title: Marine Bubble Flow Quantification Using Wide-Baseline Stereo Photogrammetry

Authors: Mengkun She, Tim Weiß, Yifan Song, Peter Urban, Jens Greinert, Kevin Köser

Comments: 56 pages, 26 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[668] arXiv:2112.07423 [pdf, other]: Title: Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking

Authors: Yidi Li, Hong Liu, Hao Tang

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[669] arXiv:2112.07431 [pdf, other]: Title: Uncertainty Estimation via Response Scaling for Pseudo-mask Noise Mitigation in Weakly-supervised Semantic Segmentation

Authors: Yi Li, Yiqun Duan, Zhanghui Kuang, Yimin Chen, Wayne Zhang, Xiaomeng Li

Comments: Accepted at AAAI 2022. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[670] arXiv:2112.07441 [pdf, other]: Title: An Interpretive Constrained Linear Model for ResNet and MgNet

Authors: Juncai He, Jinchao Xu, Lian Zhang, Jianqing Zhu

Comments: 29 pages, 2 figures and 11 tables. arXiv admin note: text overlap with arXiv:1911.10428

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[671] arXiv:2112.07471 [pdf, other]: Title: I M Avatar: Implicit Morphable Head Avatars from Videos

Authors: Yufeng Zheng, Victoria Fernández Abrevaya, Marcel C. Bühler, Xu Chen, Michael J. Black, Otmar Hilliges

Comments: Accepted at CVPR 2022 as an oral presentation. Project page this https URL ; Github page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[672] arXiv:2112.07513 [pdf, other]: Title: CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning

Authors: Jingyang Lin, Yingwei Pan, Rongfeng Lai, Xuehang Yang, Hongyang Chao, Ting Yao

Comments: ICME 2021 (Oral); Code is publicly available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[673] arXiv:2112.07515 [pdf, other]: Title: CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising

Authors: Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Hongyang Chao, Tao Mei

Comments: ACM Multimedia 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[674] arXiv:2112.07516 [pdf, other]: Title: Transferrable Contrastive Learning for Visual Domain Adaptation

Authors: Yang Chen, Yingwei Pan, Yu Wang, Ting Yao, Xinmei Tian, Tao Mei

Comments: ACM Multimedia 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[675] arXiv:2112.07517 [pdf, other]: Title: A Style and Semantic Memory Mechanism for Domain Generalization

Authors: Yang Chen, Yu Wang, Yingwei Pan, Ting Yao, Xinmei Tian, Tao Mei

Comments: ICCV 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[676] arXiv:2112.07528 [pdf, other]: Title: n-CPS: Generalising Cross Pseudo Supervision to n Networks for Semi-Supervised Semantic Segmentation

Authors: Dominik Filipiak, Piotr Tempczyk, Marek Cygan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[677] arXiv:2112.07558 [pdf, other]: Title: Multi-Modal Temporal Attention Models for Crop Mapping from Satellite Time Series

Authors: Vivien Sainte Fare Garnot, Loic Landrieu, Nesrine Chehata

Comments: Under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[678] arXiv:2112.07589 [pdf, other]: Title: Mitigating Channel-wise Noise for Single Image Super Resolution

Authors: Srimanta Mandal, Kuldeep Purohit, A. N. Rajagopalan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[679] arXiv:2112.07599 [pdf, other]: Title: Learning to Deblur and Rotate Motion-Blurred Faces

Authors: Givi Meishvili, Attila Szabó, Simon Jenni, Paolo Favaro

Comments: British Machine Vision Conference 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[680] arXiv:2112.07642 [pdf, other]: Title: EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices

Authors: Siwei Zhang, Qianli Ma, Yan Zhang, Zhiyin Qian, Taein Kwon, Marc Pollefeys, Federica Bogo, Siyu Tang

Comments: Camera ready version for ECCV 2022, appendix included

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[681] arXiv:2112.07658 [pdf, other]: Title: AdaViT: Adaptive Tokens for Efficient Vision Transformer

Authors: Hongxu Yin, Arash Vahdat, Jose Alvarez, Arun Mallya, Jan Kautz, Pavlo Molchanov

Comments: CVPR'22 oral acceptance

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[682] arXiv:2112.07661 [pdf, other]: Title: Approaches Toward Physical and General Video Anomaly Detection

Authors: Laura Kart, Niv Cohen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[683] arXiv:2112.07662 [pdf, other]: Title: Out-of-Distribution Detection Without Class Labels

Authors: Niv Cohen, Ron Abutbul, Yedid Hoshen

Comments: Accepted to ECCV L2ID Workshop (2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[684] arXiv:2112.07664 [pdf, other]: Title: Adaptive Affinity for Associations in Multi-Target Multi-Camera Tracking

Authors: Yunzhong Hou, Zhongdao Wang, Shengjin Wang, Liang Zheng

Comments: This paper appears in: IEEE Transactions on Image Processing

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[685] arXiv:2112.07668 [pdf, other]: Title: Dual-Key Multimodal Backdoors for Visual Question Answering

Authors: Matthew Walmer, Karan Sikka, Indranil Sur, Abhinav Shrivastava, Susmit Jha

Comments: Published as conference paper at CVPR 2022. 22 pages, 11 figures, 12 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[686] arXiv:2112.07719 [pdf, other]: Title: Decomposing the Deep: Finding Class Specific Filters in Deep CNNs

Authors: Akshay Badola, Cherian Roy, Vineet Padmanabhan, Rajendra Lal

Comments: 22 pages, 5 figures, 8 tables. GitHub repo: this https URL. Preprint submitted to Elsevier. This version contains visualization of filters and an ablation study w.r.t. influential features

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[687] arXiv:2112.07787 [pdf, other]: Title: Revisiting 3D Object Detection From an Egocentric Perspective

Authors: Boyang Deng, Charles R. Qi, Mahyar Najibi, Thomas Funkhouser, Yin Zhou, Dragomir Anguelov

Comments: Published in NeurIPS 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[688] arXiv:2112.07812 [pdf, other]: Title: Structure-Aware Image Segmentation with Homotopy Warping

Authors: Xiaoling Hu

Comments: 21 pages, 12 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
[689] arXiv:2112.07819 [pdf, other]: Title: Weed Recognition using Deep Learning Techniques on Class-imbalanced Imagery

Authors: A S M Mahmudul Hasan, Ferdous Sohel, Dean Diepeveen, Hamid Laga, Michael G.K. Jones

Comments: The paper is accepted by Crop and Pasture Science journal (this https URL)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[690] arXiv:2112.07820 [pdf, other]: Title: Value Retrieval with Arbitrary Queries for Form-like Documents

Authors: Mingfei Gao, Le Xue, Chetan Ramaiah, Chen Xing, Ran Xu, Caiming Xiong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[691] arXiv:2112.07835 [pdf, other]: Title: Mining Minority-class Examples With Uncertainty Estimates

Authors: Gursimran Singh, Lingyang Chu, Lanjun Wang, Jian Pei, Qi Tian, Yong Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[692] arXiv:2112.07878 [pdf, other]: Title: Gaze Estimation with Eye Region Segmentation and Self-Supervised Multistream Learning

Authors: Zunayed Mahmud, Paul Hungler, Ali Etemad

Comments: 5 pages, 1 figure, 3 tables, Accepted in AAAI-22 Workshop on Human-Centric Self-Supervised Learning

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[693] arXiv:2112.07879 [pdf, other]: Title: Does a Face Mask Protect my Privacy?: Deep Learning to Predict Protected Attributes from Masked Face Images

Authors: Sachith Seneviratne, Nuran Kasthuriarachchi, Sanka Rasnayaka, Danula Hettiachchi, Ridwan Shariffdeen

Comments: Accepted to AJCAI 2021 - 34th Australasian Joint Conference on Artificial Intelligence, Feb 2022, Sydney, Australia. this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[694] arXiv:2112.07895 [pdf, other]: Title: Robust Depth Completion with Uncertainty-Driven Loss Functions

Authors: Yufan Zhu, Weisheng Dong, Leida Li, Jinjian Wu, Xin Li, Guangming Shi

Comments: accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[695] arXiv:2112.07909 [pdf, other]: Title: Homography Decomposition Networks for Planar Object Tracking

Authors: Xinrui Zhan, Yueran Liu, Jianke Zhu, Yang Li

Comments: Accepted at AAAI 2022, preprint version

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[696] arXiv:2112.07910 [pdf, other]: Title: Decoupling Zero-Shot Semantic Segmentation

Authors: Jian Ding, Nan Xue, Gui-Song Xia, Dengxin Dai

Comments: Accepted by CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[697] arXiv:2112.07913 [pdf, other]: Title: A Comparative Analysis of Machine Learning Approaches for Automated Face Mask Detection During COVID-19

Authors: Junaed Younus Khan, Md Abdullah Al Alamin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[698] arXiv:2112.07917 [pdf, other]: Title: SPTS: Single-Point Text Spotting

Authors: Dezhi Peng, Xinyu Wang, Yuliang Liu, Jiaxin Zhang, Mingxin Huang, Songxuan Lai, Shenggao Zhu, Jing Li, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin

Comments: Accepted by ACM MM 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[699] arXiv:2112.07918 [pdf, ps, other]: Title: M-FasterSeg: An Efficient Semantic Segmentation Network Based on Neural Architecture Search

Authors: Junjun Wu, Huiyu Kuang, Qinghua Lu, Zeqin Lin, Qingwu Shi, Xilin Liu, Xiaoman Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[700] arXiv:2112.07921 [pdf, other]: Title: Temporal Shuffling for Defending Deep Action Recognition Models against Adversarial Attacks

Authors: Jaehui Hwang, Huan Zhang, Jun-Ho Choi, Cho-Jui Hsieh, Jong-Seok Lee

Comments: 12 pages, accepted to Neural Networks

Journal-ref: Neural Networks, vol. 169, pp. 388-397, Jan. 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[701] arXiv:2112.07928 [pdf, other]: Title: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

Authors: Xiaohua Chen, Yucan Zhou, Dayan Wu, Wanqian Zhang, Yu Zhou, Bo Li, Weiping Wang

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[702] arXiv:2112.07931 [pdf, ps, other]: Title: From Noise to Feature: Exploiting Intensity Distribution as a Novel Soft Biometric Trait for Finger Vein Recognition

Authors: Wenxiong Kang, Yuting Lu, Dejian Li, Wei Jia

Comments: 11 pages

Journal-ref: IEEE transactions on information forensics and security 14.4 (2018): 858-869

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[703] arXiv:2112.07945 [pdf, other]: Title: Efficient Geometry-aware 3D Generative Adversarial Networks

Authors: Eric R. Chan, Connor Z. Lin, Matthew A. Chan, Koki Nagano, Boxiao Pan, Shalini De Mello, Orazio Gallo, Leonidas Guibas, Jonathan Tremblay, Sameh Khamis, Tero Karras, Gordon Wetzstein

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[704] arXiv:2112.07948 [pdf, other]: Title: Transcoded Video Restoration by Temporal Spatial Auxiliary Network

Authors: Li Xu, Gang He, Jinjia Zhou, Jie Lei, Weiying Xie, Yunsong Li, Yu-Wing Tai

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[705] arXiv:2112.07954 [pdf, other]: Title: Object Pursuit: Building a Space of Objects via Discriminative Weight Generation

Authors: Chuanyu Pan, Yanchao Yang, Kaichun Mo, Yueqi Duan, Leonidas Guibas

Comments: 24 pages. This paper has been accepted by ICLR2022 (OpenReview: this https URL)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[706] arXiv:2112.07957 [pdf, other]: Title: FEAR: Fast, Efficient, Accurate and Robust Visual Tracker

Authors: Vasyl Borsuk, Roman Vei, Orest Kupyn, Tetiana Martyniuk, Igor Krashenyi, Jiři Matas

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[707] arXiv:2112.07962 [pdf, other]: Title: A learning-based approach to feature recognition of Engineering shapes

Authors: Lakshmi Priya Muraleedharan, Ramanathan Muthuganapathy

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[708] arXiv:2112.07963 [pdf, other]: Title: Towards General and Efficient Active Learning

Authors: Yichen Xie, Masayoshi Tomizuka, Wei Zhan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[709] arXiv:2112.07966 [pdf, other]: Title: Modality-Aware Triplet Hard Mining for Zero-shot Sketch-Based Image Retrieval

Authors: Zongheng Huang, YiFan Sun, Chuchu Han, Changxin Gao, Nong Sang

Comments: 13 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[710] arXiv:2112.07969 [pdf, ps, other]: Title: Predicting Media Memorability: Comparing Visual, Textual and Auditory Features

Authors: Lorin Sweeney, Graham Healy, Alan F. Smeaton

Comments: 3 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[711] arXiv:2112.07974 [pdf, other]: Title: Detail-aware Deep Clothing Animations Infused with Multi-source Attributes

Authors: Tianxing Li, Rui Shi, Takashi Kanai

Comments: 14 pages, 12 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[712] arXiv:2112.07984 [pdf, other]: Title: Temporal Action Proposal Generation with Background Constraint

Authors: Haosen Yang, Wenhao Wu, Lining Wang, Sheng Jin, Boyang Xia, Hongxun Yao, Hujie Huang

Comments: Accepted by AAAI2022. arXiv admin note: text overlap with arXiv:2105.12043

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[713] arXiv:2112.07999 [pdf, other]: Title: Self-Ensembling GAN for Cross-Domain Semantic Segmentation

Authors: Yonghao Xu, Fengxiang He, Bo Du, Dacheng Tao, Liangpei Zhang

Journal-ref: IEEE Trans. Multimedia, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[714] arXiv:2112.08001 [pdf, other]: Title: Autoencoder-based background reconstruction and foreground segmentation with background noise estimation

Authors: Bruno Sauvalle, Arnaud de La Fortelle

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[715] arXiv:2112.08006 [pdf, other]: Title: Consistent Depth Prediction under Various Illuminations using Dilated Cross Attention

Authors: Zitian Zhang, Chuhua Xian

Comments: 14 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[716] arXiv:2112.08018 [pdf, other]: Title: MissMarple : A Novel Socio-inspired Feature-transfer Learning Deep Network for Image Splicing Detection

Authors: Angelina L. Gokhale, Dhanya Pramod, Sudeep D. Thepade, Ravi Kulkarni

Comments: 27 pages, 6 figures and 15 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[717] arXiv:2112.08022 [pdf, other]: Title: Segmentation-Reconstruction-Guided Facial Image De-occlusion

Authors: Xiangnan Yin, Di Huang, Zehua Fu, Yunhong Wang, Liming Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[718] arXiv:2112.08037 [pdf, other]: Title: LookinGood^π: Real-time Person-independent Neural Re-rendering for High-quality Human Performance Capture

Authors: Xiqi Yang, Kewei Yang, Kang Chen, Weidong Zhang, Weiwei Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[719] arXiv:2112.08050 [pdf, other]: Title: Exploring the Asynchronous of the Frequency Spectra of GAN-generated Facial Images

Authors: Binh M. Le, Simon S. Woo

Comments: International Workshop on Safety and Security of Deep Learning IJCAI, 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[720] arXiv:2112.08070 [pdf, other]: Title: Depth Refinement for Improved Stereo Reconstruction

Authors: Amit Bracha, Noam Rotstein, David Bensaïd, Ron Slossberg, Ron Kimmel

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[721] arXiv:2112.08088 [pdf, other]: Title: Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions

Authors: Wenyu Liu, Gaofeng Ren, Runsheng Yu, Shi Guo, Jianke Zhu, Lei Zhang

Comments: AAAI 2022, Preprint version with Appendix

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[722] arXiv:2112.08117 [pdf, other]: Title: Vision Transformer Based Video Hashing Retrieval for Tracing the Source of Fake Videos

Authors: Pengfei Pei, Xianfeng Zhao, Yun Cao, Jinchuan Li, Xuyuan Lai

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[723] arXiv:2112.08122 [pdf, other]: Title: Self-Supervised Monocular Depth and Ego-Motion Estimation in Endoscopy: Appearance Flow to the Rescue

Authors: Shuwei Shao, Zhongcai Pei, Weihai Chen, Wentao Zhu, Xingming Wu, Dianmin Sun, Baochang Zhang

Comments: Accepted by Medical Image Analysis

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[724] arXiv:2112.08171 [pdf, other]: Title: Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution

Authors: Jingye Chen, Haiyang Yu, Jianqi Ma, Bin Li, Xiangyang Xue

Comments: Accepted to AAAI2022. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[725] arXiv:2112.08175 [pdf, other]: Title: A Factorization Approach for Motor Imagery Classification

Authors: Byeong-Hoo Lee, Jeong-Hyun Cho, Byung-Hee Kwon

Comments: 4 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[726] arXiv:2112.08177 [pdf, other]: Title: Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

Authors: Gwangbin Bae, Ignas Budvytis, Roberto Cipolla

Comments: CVPR 2022 (oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[727] arXiv:2112.08178 [pdf, ps, other]: Title: Interpretable Feature Learning Framework for Smoking Behavior Detection

Authors: Nakayiza Hellen, Ggaliwango Marvin

Comments: 15 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[728] arXiv:2112.08184 [pdf, other]: Title: Interactive Visualization and Representation Analysis Applied to Glacier Segmentation

Authors: Minxing Zheng (1), Xinran Miao (1), Kris Sankaran (1) ((1) Department of Statistics, University of Wisconsin - Madison)

Comments: 10 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[729] arXiv:2112.08189 [pdf, other]: Title: ST-MTL: Spatio-Temporal Multitask Learning Model to Predict Scanpath While Tracking Instruments in Robotic Surgery

Authors: Mobarakol Islam, Vibashan VS, Chwee Ming Lim, Hongliang Ren

Comments: 12 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[730] arXiv:2112.08198 [pdf, other]: Title: Single Image Automatic Radial Distortion Compensation Using Deep Convolutional Network

Authors: Igor Janos, Wanda Benesova

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[731] arXiv:2112.08219 [pdf, other]: Title: Quantitative analysis of visual representation of sign elements in COVID-19 context

Authors: María Jesús Cano-Martínez, Miguel Carrasco, Joaquín Sandoval, César González-Martín

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[732] arXiv:2112.08227 [pdf, other]: Title: An Experimental Study of the Impact of Pre-training on the Pruning of a Convolutional Neural Network

Authors: Nathan Hubens, Matei Mancas, Bernard Gosselin, Marius Preda, Titus Zaharia

Comments: 7 pages, published at APPIS 2020

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[733] arXiv:2112.08274 [pdf, other]: Title: Putting People in their Place: Monocular Regression of 3D People in Depth

Authors: Yu Sun, Wu Liu, Qian Bao, Yili Fu, Tao Mei, Michael J. Black

Comments: CVPR 2022; Code: this https URL; Dataset: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[734] arXiv:2112.08275 [pdf, other]: Title: SeqFormer: Sequential Transformer for Video Instance Segmentation

Authors: Junfeng Wu, Yi Jiang, Song Bai, Wenqing Zhang, Xiang Bai

Comments: ECCV 2022, Oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[735] arXiv:2112.08281 [pdf, other]: Title: Detecting Object States vs Detecting Objects: A New Dataset and a Quantitative Experimental Study

Authors: Filippos Gouidis, Theodore Patkos, Antonis Argyros, Dimitris Plexousakis

Comments: Submitted to the Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[736] arXiv:2112.08325 [pdf, other]: Title: ForgeryNet -- Face Forgery Analysis Challenge 2021: Methods and Results

Authors: Yinan He, Lu Sheng, Jing Shao, Ziwei Liu, Zhaofan Zou, Zhizhi Guo, Shan Jiang, Curitis Sun, Guosheng Zhang, Keyao Wang, Haixiao Yue, Zhibin Hong, Wanguo Wang, Zhenyu Li, Qi Wang, Zhenli Wang, Ronghao Xu, Mingwen Zhang, Zhiheng Wang, Zhenhang Huang, Tianming Zhang, Ningning Zhao

Comments: Technical report. Challenge website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[737] arXiv:2112.08345 [pdf, other]: Title: Reliable Multi-Object Tracking in the Presence of Unreliable Detections

Authors: Travis Mandel, Mark Jimenez, Emily Risley, Taishi Nammoto, Rebekka Williams, Max Panoff, Meynard Ballesteros, Bobbie Suarez

Comments: The full journal version of this article (published in Pattern Recognition, Vol. 135) can be found at this https URL. The article is open access. The source code and dataset can be found at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[738] arXiv:2112.08359 [pdf, other]: Title: 3D Question Answering

Authors: Shuquan Ye, Dongdong Chen, Songfang Han, Jing Liao

Comments: To Appear at IEEE Transactions on Visualization and Computer Graphics (TVCG) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[739] arXiv:2112.08447 [pdf, other]: Title: Positional Encoding Augmented GAN for the Assessment of Wind Flow for Pedestrian Comfort in Urban Areas

Authors: Henrik Hoeiness, Kristoffer Gjerde, Luca Oggiano, Knut Erik Teigen Giljarhus, Massimiliano Ruocco

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[740] arXiv:2112.08455 [pdf, other]: Title: Dense Video Captioning Using Unsupervised Semantic Information

Authors: Valter Estevam, Rayson Laroca, Helio Pedrini, David Menotti

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[741] arXiv:2112.08459 [pdf, other]: Title: Rethinking Nearest Neighbors for Visual Classification

Authors: Menglin Jia, Bor-Chun Chen, Zuxuan Wu, Claire Cardie, Serge Belongie, Ser-Nam Lim

Comments: Modified paragraph spacing

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[742] arXiv:2112.08493 [pdf, other]: Title: StyleMC: Multi-Channel Based Fast Text-Guided Image Generation and Manipulation

Authors: Umut Kocasari, Alara Dirik, Mert Tiftikci, Pinar Yanardag

Comments: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[743] arXiv:2112.08497 [pdf, other]: Title: Predicting Levels of Household Electricity Consumption in Low-Access Settings

Authors: Simone Fobi, Joel Mugyenyi, Nathaniel J. Williams, Vijay Modi, Jay Taneja

Comments: Accepted to be published in Proceedings of IEEE Winter Conference on Applications of Computer Vision (WACV) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[744] arXiv:2112.08539 [pdf, other]: Title: Implicit Neural Representations for Deconvolving SAS Images

Authors: Albert Reed, Thomas Blanford, Daniel C. Brown, Suren Jayasuriya

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[745] arXiv:2112.08553 [pdf, other]: Title: UMAD: Universal Model Adaptation under Domain and Category Shift

Authors: Jian Liang, Dapeng Hu, Jiashi Feng, Ran He

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[746] arXiv:2112.08587 [pdf, other]: Title: SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning

Authors: Zhecan Wang, Haoxuan You, Liunian Harold Li, Alireza Zareian, Suji Park, Yiqing Liang, Kai-Wei Chang, Shih-Fu Chang

Comments: AAAI 2022

Journal-ref: AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[747] arXiv:2112.08594 [pdf, other]: Title: Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal Misinformation

Authors: Giscard Biamby, Grace Luo, Trevor Darrell, Anna Rohrbach

Comments: 11 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[748] arXiv:2112.08598 [pdf, other]: Title: FIgLib & SmokeyNet: Dataset and Deep Learning Model for Real-Time Wildland Fire Smoke Detection

Authors: Anshuman Dewangan, Yash Pande, Hans-Werner Braun, Frank Vernon, Ismael Perez, Ilkay Altintas, Garrison W. Cottrell, Mai H. Nguyen

Journal-ref: Remote Sensing. 2022; 14(4):1007

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[749] arXiv:2112.08604 [pdf, ps, other]: Title: Use Image Clustering to Facilitate Technology Assisted Review

Authors: Haozhen Zhao, Fusheng Wei, Hilary Quatinetz, Han Qin, Adam Dabrowski

Comments: 2021 IEEE International Conference on Big Data (Big Data)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[750] arXiv:2112.08605 [src]: Title: Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Authors: Rui Liu, Yahong Han, Yaowei Wang, Qi Tian

Comments: for further study

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[751] arXiv:2112.08626 [pdf, other]: Title: Analysis and Evaluation of Kinect-based Action Recognition Algorithms

Authors: Lei Wang

Comments: Master's thesis, 34 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[752] arXiv:2112.08635 [pdf, other]: Title: Road-aware Monocular Structure from Motion and Homography Estimation

Authors: Wei Sui, Teng Chen, Jiaxin Zhang, Jiao Lu, Qian Zhang

Comments: 10 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[753] arXiv:2112.08643 [pdf, other]: Title: TransZero++: Cross Attribute-Guided Transformer for Zero-Shot Learning

Authors: Shiming Chen, Ziming Hong, Wenjin Hou, Guo-Sen Xie, Yibing Song, Jian Zhao, Xinge You, Shuicheng Yan, Ling Shao

Comments: This is an extension of the AAAI'22 paper (TransZero). Accepted to TPAMI. arXiv admin note: substantial text overlap with arXiv:2112.01683

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[754] arXiv:2112.08647 [pdf, other]: Title: QAHOI: Query-Based Anchors for Human-Object Interaction Detection

Authors: Junwen Chen, Keiji Yanai

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[755] arXiv:2112.08655 [pdf, other]: Title: Feature Distillation Interaction Weighting Network for Lightweight Image Super-Resolution

Authors: Guangwei Gao, Wenjie Li, Juncheng Li, Fei Wu, Huimin Lu, Yi Yu

Comments: 9 pages, 9 figures, 4 tables, AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[756] arXiv:2112.08684 [pdf, other]: Title: Mimic Embedding via Adaptive Aggregation: Learning Generalizable Person Re-identification

Authors: Boqiang Xu, Jian Liang, Lingxiao He, Zhenan Sun

Comments: ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[757] arXiv:2112.08691 [pdf, other]: Title: Towards Robust Neural Image Compression: Adversarial Attack and Model Finetuning

Authors: Tong Chen, Zhan Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[758] arXiv:2112.08692 [pdf, other]: Title: Lacuna Reconstruction: Self-supervised Pre-training for Low-Resource Historical Document Transcription

Authors: Nikolai Vogler, Jonathan Parkes Allen, Matthew Thomas Miller, Taylor Berg-Kirkpatrick

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[759] arXiv:2112.08739 [pdf, other]: Title: Forensic Analysis of Synthetically Generated Western Blot Images

Authors: Sara Mandelli, Davide Cozzolino, Edoardo D. Cannas, Joao P. Cardenuto, Daniel Moreira, Paolo Bestagini, Walter J. Scheirer, Anderson Rocha, Luisa Verdoliva, Stefano Tubaro, Edward J. Delp

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[760] arXiv:2112.08740 [pdf, other]: Title: Feature Erasing and Diffusion Network for Occluded Person Re-Identification

Authors: Zhikang Wang, Feng Zhu, Shixiang Tang, Rui Zhao, Lihuo He, Jiangning Song

Comments: 10 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[761] arXiv:2112.08743 [pdf, other]: Title: Radio-Assisted Human Detection

Authors: Chengrun Qiu, Dongheng Zhang, Yang Hu, Houqiang Li, Qibin Sun, Yan Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[762] arXiv:2112.08775 [pdf, other]: Title: DProST: Dynamic Projective Spatial Transformer Network for 6D Pose Estimation

Authors: Jaewoo Park, Nam Ik Cho

Comments: Accepted to ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[763] arXiv:2112.08782 [pdf, ps, other]: Title: Improved YOLOv5 network for real-time multi-scale traffic sign detection

Authors: Junfan Wang, Yi Chen, Mingyu Gao, Zhekang Dong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[764] arXiv:2112.08796 [pdf, other]: Title: Saliency Grafting: Innocuous Attribution-Guided Mixup with Calibrated Label Mixing

Authors: Joonhyung Park, June Yong Yang, Jinwoo Shin, Sung Ju Hwang, Eunho Yang

Comments: 12 pages; Accepted to AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[765] arXiv:2112.08810 [pdf, other]: Title: Pure Noise to the Rescue of Insufficient Data: Improving Imbalanced Classification by Training on Random Noise Images

Authors: Shiran Zada, Itay Benou, Michal Irani

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[766] arXiv:2112.08814 [pdf, other]: Title: An Unsupervised Way to Understand Artifact Generating Internal Units in Generative Neural Networks

Authors: Haedong Jeong, Jiyeon Han, Jaesik Choi

Comments: AAAI22 accepted paper

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[767] arXiv:2112.08816 [pdf, other]: Title: Deep Hash Distillation for Image Retrieval

Authors: Young Kyun Jang, Geonmo Gu, Byungsoo Ko, Isaac Kang, Nam Ik Cho

Comments: ECCV2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[768] arXiv:2112.08817 [pdf, other]: Title: Search for temporal cell segmentation robustness in phase-contrast microscopy videos

Authors: Estibaliz Gómez-de-Mariscal, Hasini Jayatilaka, Özgün Çiçek, Thomas Brox, Denis Wirtz, Arrate Muñoz-Barrutia

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP); Quantitative Methods (q-bio.QM)
[769] arXiv:2112.08835 [pdf, other]: Title: Self-supervised Enhancement of Latent Discovery in GANs

Authors: Silpa Vadakkeeveetil Sreelatha, Adarsh Kappiyath, S Sumitra

Comments: Accepted to the 36th AAAI Conference on Artificial Intelligence (AAAI 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[770] arXiv:2112.08841 [pdf, other]: Title: A CNN based method for Sub-pixel Urban Land Cover Classification using Landsat-5 TM and Resourcesat-1 LISS-IV Imagery

Authors: Krishna Kumar Perikamana, Krishnachandran Balakrishnan, Pratyush Tripathy

Comments: 29 pages, 14 figures (including appendix), 8 tables (including appendix)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[771] arXiv:2112.08867 [pdf, other]: Title: GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation

Authors: Yu Deng, Jiaolong Yang, Jianfeng Xiang, Xin Tong

Comments: CVPR2022 Oral. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[772] arXiv:2112.08879 [pdf, other]: Title: Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds

Authors: Ayush Jain, Nikolaos Gkanatsios, Ishita Mediratta, Katerina Fragkiadaki

Comments: First two authors contributed equally | ECCV 2022 Camera Ready

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[773] arXiv:2112.08902 [pdf, other]: Title: Toward Minimal Misalignment at Minimal Cost in One-Stage and Anchor-Free Object Detection

Authors: Shuaizheng Hao, Hongzhe Liu, Ningwei Wang, Cheng Xu

Comments: The paper is under consideration at Pattern Recognition Letters

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[774] arXiv:2112.08906 [pdf, other]: Title: On the Uncertain Single-View Depths in Colonoscopies

Authors: Javier Rodríguez-Puigvert, David Recasens, Javier Civera, Rubén Martínez-Cantín

Comments: 11 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[775] arXiv:2112.08913 [pdf, other]: Title: Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation

Authors: Yujia Zhang, Lai-Man Po, Xuyuan Xu, Mengyang Liu, Yexin Wang, Weifeng Ou, Yuzhi Zhao, Wing-Yin Yu

Comments: Accepted by AAAI 2022, Preprint version with Appendix

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[776] arXiv:2112.08930 [pdf, other]: Title: Intelli-Paint: Towards Developing Human-like Painting Agents

Authors: Jaskirat Singh, Cameron Smith, Jose Echevarria, Liang Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Machine Learning (stat.ML)
[777] arXiv:2112.08935 [pdf, other]: Title: MVSS-Net: Multi-View Multi-Scale Supervised Networks for Image Manipulation Detection

Authors: Chengbo Dong, Xinru Chen, Ruohan Hu, Juan Cao, Xirong Li

Comments: arXiv admin note: substantial text overlap with arXiv:2104.06832. Accepted by T-PAMI

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[778] arXiv:2112.08949 [pdf, other]: Title: Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation

Authors: Yi Zhou, Hui Zhang, Hana Lee, Shuyang Sun, Pingjun Li, Yangguang Zhu, ByungIn Yoo, Xiaojuan Qi, Jae-Joon Han

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[779] arXiv:2112.08950 [pdf, other]: Title: Stable Long-Term Recurrent Video Super-Resolution

Authors: Benjamin Naoto Chiche, Arnaud Woiselle, Joana Frontera-Pons, Jean-Luc Starck

Comments: 9 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[780] arXiv:2112.08996 [pdf, other]: Title: Activation Modulation and Recalibration Scheme for Weakly Supervised Semantic Segmentation

Authors: Jie Qin, Jie Wu, Xuefeng Xiao, Lujun Li, Xingang Wang

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[781] arXiv:2112.09043 [pdf, other]: Title: Neural Style Transfer and Unpaired Image-to-Image Translation to deal with the Domain Shift Problem on Spheroid Segmentation

Authors: Manuel García-Domínguez, César Domínguez, Jónathan Heras, Eloy Mata, Vico Pascual

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[782] arXiv:2112.09045 [pdf, other]: Title: The MVTec 3D-AD Dataset for Unsupervised 3D Anomaly Detection and Localization

Authors: Paul Bergmann, Xin Jin, David Sattlegger, Carsten Steger

Comments: Accepted for presentation at VISAPP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[783] arXiv:2112.09061 [pdf, other]: Title: Solving Inverse Problems with NerfGANs

Authors: Giannis Daras, Wen-Sheng Chu, Abhishek Kumar, Dmitry Lagun, Alexandros G. Dimakis

Comments: 16 pages, 18 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[784] arXiv:2112.09069 [pdf, other]: Title: Progressive Graph Convolution Network for EEG Emotion Recognition

Authors: Yijin Zhou, Fu Li, Yang Li, Youshuo Ji, Guangming Shi, Wenming Zheng, Lijian Zhang, Yuanfang Chen, Rui Cheng

Comments: 11 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Signal Processing (eess.SP)
[785] arXiv:2112.09081 [pdf, other]: Title: CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data

Authors: Qi Yan, Jianhao Zheng, Simon Reding, Shanci Li, Iordan Doytchinov

Comments: CVPR 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[786] arXiv:2112.09106 [pdf, other]: Title: RegionCLIP: Region-based Language-Image Pretraining

Authors: Yiwu Zhong, Jianwei Yang, Pengchuan Zhang, Chunyuan Li, Noel Codella, Liunian Harold Li, Luowei Zhou, Xiyang Dai, Lu Yuan, Yin Li, Jianfeng Gao

Comments: Technical report

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[787] arXiv:2112.09120 [pdf, other]: Title: Human Hands as Probes for Interactive Object Understanding

Authors: Mohit Goyal, Sahil Modi, Rishabh Goyal, Saurabh Gupta

Comments: To Appear at CVPR 2022. Project website at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[788] arXiv:2112.09126 [pdf, other]: Title: IS-COUNT: Large-scale Object Counting from Satellite Images with Covariate-based Importance Sampling

Authors: Chenlin Meng, Enci Liu, Willie Neiswanger, Jiaming Song, Marshall Burke, David Lobell, Stefano Ermon

Comments: AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[789] arXiv:2112.09127 [pdf, other]: Title: ICON: Implicit Clothed humans Obtained from Normals

Authors: Yuliang Xiu, Jinlong Yang, Dimitrios Tzionas, Michael J. Black

Comments: Project page: this https URL. Accepted by CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[790] arXiv:2112.09129 [pdf, other]: Title: Decoupling and Recoupling Spatiotemporal Representation for RGB-D-based Motion Recognition

Authors: Benjia Zhou, Pichao Wang, Jun Wan, Yanyan Liang, Fan Wang, Du Zhang, Zhen Lei, Hao Li, Rong Jin

Comments: Open-sourced; code and models are available: this https URL; transformer-based method

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[791] arXiv:2112.09130 [pdf, other]: Title: Ensembling Off-the-shelf Models for GAN Training

Authors: Nupur Kumari, Richard Zhang, Eli Shechtman, Jun-Yan Zhu

Comments: CVPR 2022 (Oral). GitHub: this https URL. Project webpage: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[792] arXiv:2112.09131 [pdf, other]: Title: HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images

Authors: Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[793] arXiv:2112.09133 [pdf, other]: Title: Masked Feature Prediction for Self-Supervised Visual Pre-Training

Authors: Chen Wei, Haoqi Fan, Saining Xie, Chao-Yuan Wu, Alan Yuille, Christoph Feichtenhofer

Comments: Technical report. arXiv v2: update AVA results (details in Appendix E)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[794] arXiv:2112.09151 [pdf, other]: Title: TAFIM: Targeted Adversarial Attacks against Facial Image Manipulations

Authors: Shivangi Aneja, Lev Markhasin, Matthias Niessner

Comments: (ECCV 2022 Paper) Video: this https URL. Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[795] arXiv:2112.09165 [pdf, other]: Title: ALEBk: Feasibility Study of Attention Level Estimation via Blink Detection applied to e-Learning

Authors: Roberto Daza, Daniel DeAlcala, Aythami Morales, Ruben Tolosana, Ruth Cobos, Julian Fierrez

Comments: Preprint of the paper presented to the Workshop on Artificial Intelligence for Education (AI4EDU) of AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[796] arXiv:2112.09172 [pdf, ps, other]: Title: An Audio-Visual Dataset and Deep Learning Frameworks for Crowded Scene Classification

Authors: Lam Pham, Dat Ngo, Phu X. Nguyen, Truong Hoang, Alexander Schindler

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[797] arXiv:2112.09190 [pdf, other]: Title: Monitoring crop phenology with street-level imagery using computer vision

Authors: Raphaël d'Andrimont, Momchil Yordanov, Laura Martinez-Sanchez, Marijn van der Velde

Comments: 18 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[798] arXiv:2112.09195 [pdf, other]: Title: Mitigating the Bias of Centered Objects in Common Datasets

Authors: Gergely Szabo, Andras Horvath

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[799] arXiv:2112.09201 [pdf, other]: Title: Semantic-Based Few-Shot Learning by Interactive Psychometric Testing

Authors: Lu Yin, Vlado Menkovski, Yulong Pei, Mykola Pechenizkiy

Comments: Accepted at the AAAI-22 Workshop on Interactive Machine Learning (IML@AAAI'22)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[800] arXiv:2112.09205 [pdf, other]: Title: AFDetV2: Rethinking the Necessity of the Second Stage for Object Detection from Point Clouds

Authors: Yihan Hu, Zhuangzhuang Ding, Runzhou Ge, Wenxin Shao, Li Huang, Kun Li, Qiang Liu

Comments: AAAI 2022; 1st Place Solution for the Real-time 3D Detection and the Most Efficient Model of the Waymo Open Dataset Challenges 2021 (this http URL)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[801] arXiv:2112.09214 [pdf, other]: Title: Sparse Coding with Multi-Layer Decoders using Variance Regularization

Authors: Katrina Evtimova, Yann LeCun

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[802] arXiv:2112.09219 [pdf, other]: Title: All You Need is RAW: Defending Against Adversarial Attacks with Camera Image Pipelines

Authors: Yuxuan Zhang, Bo Dong, Felix Heide

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[803] arXiv:2112.09220 [pdf, other]: Title: Sim2Real Docs: Domain Randomization for Documents in Natural Scenes using Ray-traced Rendering

Authors: Nikhil Maddikunta, Huijun Zhao, Sumit Keswani, Alfy Samuel, Fu-Ming Guo, Nishan Srishankar, Vishwa Pardeshi, Austin Huang

Comments: Accepted to NeurIPS 2021 Data Centric AI (DCAI) Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[804] arXiv:2112.09251 [pdf, other]: Title: The Wanderings of Odysseus in 3D Scenes

Authors: Yan Zhang, Siyu Tang

Comments: CVPR 2022 camera ready

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[805] arXiv:2112.09253 [pdf, other]: Title: Logically at Factify 2022: Multimodal Fact Verification

Authors: Jie Gao, Hella-Franziska Hoffmann, Stylianos Oikonomou, David Kiskovski, Anil Bandhakavi

Comments: Accepted in AAAI'22: First Workshop on Multimodal Fact-Checking and Hate Speech Detection, February 22 - March 1, 2022, Vancouver, BC, Canada

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[806] arXiv:2112.09260 [pdf, other]: Title: How to augment your ViTs? Consistency loss and StyleAug, a random style transfer augmentation

Authors: Akash Umakantha, Joao D. Semedo, S. Alireza Golestaneh, Wan-Yi S. Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[807] arXiv:2112.09262 [pdf, ps, other]: Title: Image Inpainting Using AutoEncoder and Guided Selection of Predicted Pixels

Authors: Mohammad H. Givkashi, Mahshid Hadipour, Arezoo PariZanganeh, Zahra Nabizadeh, Nader Karimi, Shadrokh Samavi

Comments: 5 pages, 2 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[808] arXiv:2112.09278 [pdf, other]: Title: All-photon Polarimetric Time-of-Flight Imaging

Authors: Seung-Hwan Baek, Felix Heide

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[809] arXiv:2112.09290 [pdf, other]: Title: PeopleSansPeople: A Synthetic Data Generator for Human-Centric Computer Vision

Authors: Salehe Erfanian Ebadi, You-Cyuan Jhang, Alex Zook, Saurav Dhakad, Adam Crespi, Pete Parisi, Steven Borkman, Jonathan Hogins, Sujoy Ganguly

Comments: PeopleSansPeople template Unity environment, benchmark binaries, and source code are available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Databases (cs.DB); Graphics (cs.GR); Machine Learning (cs.LG)
[810] arXiv:2112.09298 [pdf, other]: Title: Human-vehicle Cooperative Visual Perception for Autonomous Driving under Complex Road and Traffic Scenarios

Authors: Yiyue Zhao, Cailin Lei, Yu Shen, Yuchuan Du, Qijun Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[811] arXiv:2112.09300 [pdf, other]: Title: Towards End-to-End Image Compression and Analysis with Transformers

Authors: Yuanchao Bai, Xu Yang, Xianming Liu, Junjun Jiang, Yaowei Wang, Xiangyang Ji, Wen Gao

Comments: Accepted by AAAI 2022; Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[812] arXiv:2112.09318 [pdf, other]: Title: Procedural Kernel Networks

Authors: Bartlomiej Wronski

Comments: 11 pages, technical report

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[813] arXiv:2112.09326 [pdf, other]: Title: Cinderella's shoe won't fit Soundarya: An audit of facial processing tools on Indian faces

Authors: Gaurav Jain, Smriti Parsheera

Comments: 17 pages, 2 figures and 7 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[814] arXiv:2112.09329 [pdf, other]: Title: Point2Cyl: Reverse Engineering 3D Objects from Point Clouds to Extrusion Cylinders

Authors: Mikaela Angelina Uy, Yen-yu Chang, Minhyuk Sung, Purvi Goel, Joseph Lambourne, Tolga Birdal, Leonidas Guibas

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[815] arXiv:2112.09331 [pdf, other]: Title: Contrastive Vision-Language Pre-training with Limited Resources

Authors: Quan Cui, Boyan Zhou, Yu Guo, Weidong Yin, Hao Wu, Osamu Yoshie, Yubo Chen

Comments: Accepted to ECCV2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[816] arXiv:2112.09343 [pdf, other]: Title: Domain Adaptation on Point Clouds via Geometry-Aware Implicits

Authors: Yuefan Shen, Yanchao Yang, Mi Yan, He Wang, Youyi Zheng, Leonidas Guibas

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[817] arXiv:2112.09356 [pdf, other]: Title: UniMiSS: Universal Medical Self-Supervised Learning via Breaking Dimensionality Barrier

Authors: Yutong Xie, Jianpeng Zhang, Yong Xia, Qi Wu

Comments: Accepted by ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[818] arXiv:2112.09357 [pdf, other]: Title: Interpreting Audiograms with Multi-stage Neural Networks

Authors: Shufan Li, Congxi Lu, Linkai Li, Jirong Duan, Xinping Fu, Haoshuai Zhou

Comments: 12 pages, 12 figures. The code for this project is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[819] arXiv:2112.09367 [pdf, other]: Title: SuperStyleNet: Deep Image Synthesis with Superpixel Based Style Encoder

Authors: Jonghyun Kim, Gen Li, Cheolkon Jung, Joongkyu Kim

Comments: Accepted to BMVC 2021. Codes are available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[820] arXiv:2112.09379 [pdf, ps, other]: Title: Enhanced Frame and Event-Based Simulator and Event-Based Video Interpolation Network

Authors: Adam Radomski, Andreas Georgiou, Thomas Debrunner, Chenghan Li, Luca Longinotti, Minwon Seo, Moosung Kwak, Chang-Woo Shin, Paul K. J. Park, Hyunsurk Eric Ryu, Kynan Eng

Comments: 10 pages, 19 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[821] arXiv:2112.09385 [pdf, other]: Title: Full Transformer Framework for Robust Point Cloud Registration with Deep Information Interaction

Authors: Guangyan Chen, Meiling Wang, Yufeng Yue, Qingxiang Zhang, Li Yuan

Comments: 10 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[822] arXiv:2112.09413 [pdf, other]: Title: Self-attention based anchor proposal for skeleton-based action recognition

Authors: Ruijie Hou, Zhao Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[823] arXiv:2112.09414 [pdf, other]: Title: Disentangled representations: towards interpretation of sex determination from hip bone

Authors: Kaifeng Zou, Sylvain Faisan, Fabrice Heitz, Marie Epain, Pierre Croisille, Laurent Fanton, Sébastien Valette

Journal-ref: The Visual Computer (2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[824] arXiv:2112.09422 [pdf, other]: Title: A Review on Visual Privacy Preservation Techniques for Active and Assisted Living

Authors: Siddharth Ravi, Pau Climent-Pérez, Francisco Florez-Revuelta

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[825] arXiv:2112.09426 [pdf, other]: Title: SiamTrans: Zero-Shot Multi-Frame Image Restoration with Pre-Trained Siamese Transformers

Authors: Lin Liu, Shanxin Yuan, Jianzhuang Liu, Xin Guo, Youliang Yan, Qi Tian

Journal-ref: AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[826] arXiv:2112.09428 [src]: Title: Dynamics-aware Adversarial Attack of 3D Sparse Convolution Network

Authors: An Tao, Yueqi Duan, He Wang, Ziyi Wu, Pengliang Ji, Haowen Sun, Jie Zhou, Jiwen Lu

Comments: We have improved the quality of this work and updated a new version to address the limitations of the proposed method

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[827] arXiv:2112.09442 [pdf, ps, other]: Title: Adaptively Customizing Activation Functions for Various Layers

Authors: Haigen Hu, Aizhu Liu, Qiu Guan, Xiaoxin Li, Shengyong Chen, Qianwei Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[828] arXiv:2112.09445 [pdf, other]: Title: Data Efficient Language-supervised Zero-shot Recognition with Optimal Transport Distillation

Authors: Bichen Wu, Ruizhe Cheng, Peizhao Zhang, Tianren Gao, Peter Vajda, Joseph E. Gonzalez

Comments: 19 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[829] arXiv:2112.09448 [pdf, other]: Title: Distillation of Human-Object Interaction Contexts for Action Recognition

Authors: Muna Almushyti, Frederick W. Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[830] arXiv:2112.09459 [pdf, other]: Title: Weakly Supervised Semantic Segmentation via Alternative Self-Dual Teaching

Authors: Dingwen Zhang, Wenyuan Zeng, Guangyu Guo, Chaowei Fang, Lechao Cheng, Ming-Ming Cheng, Junwei Han

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[831] arXiv:2112.09490 [pdf, other]: Title: Visual Microfossil Identification via Deep Metric Learning

Authors: Tayfun Karaderi, Tilo Burghardt, Allison Y. Hsiang, Jacob Ramaer, Daniela N. Schmidt

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[832] arXiv:2112.09493 [pdf, other]: Title: Methods for segmenting cracks in 3d images of concrete: A comparison based on semi-synthetic images

Authors: Tin Barisin, Christian Jung, Franziska Müsebeck, Claudia Redenbach, Katja Schladitz

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[833] arXiv:2112.09515 [pdf, other]: Title: Symmetry-aware Neural Architecture for Embodied Visual Navigation

Authors: Shuang Liu, Takayuki Okatani

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[834] arXiv:2112.09532 [pdf, other]: Title: Pixel Distillation: A New Knowledge Distillation Scheme for Low-Resolution Image Recognition

Authors: Guangyu Guo, Longfei Han, Junwei Han, Dingwen Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[835] arXiv:2112.09546 [pdf, other]: Title: Complex Functional Maps : a Conformal Link Between Tangent Bundles

Authors: Nicolas Donati (LIX), Etienne Corman (LORIA, CNRS, PIXEL), Simone Melzi (Sapienza University of Rome), Maks Ovsjanikov (LIX)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Differential Geometry (math.DG)
[836] arXiv:2112.09568 [pdf, other]: Title: Nearest neighbor search with compact codes: A decoder perspective

Authors: Kenza Amara, Matthijs Douze, Alexandre Sablayrolles, Hervé Jégou

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[837] arXiv:2112.09569 [pdf, other]: Title: CPPE-5: Medical Personal Protective Equipment Dataset

Authors: Rishit Dagli, Ali Mustufa Shaikh

Comments: 18 pages, 6 tables, 6 figures. Code and models are available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[838] arXiv:2112.09581 [pdf, other]: Title: Watermarking Images in Self-Supervised Latent Spaces

Authors: Pierre Fernandez, Alexandre Sablayrolles, Teddy Furon, Hervé Jégou, Matthijs Douze

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[839] arXiv:2112.09583 [pdf, other]: Title: Align and Prompt: Video-and-Language Pre-training with Entity Prompts

Authors: Dongxu Li, Junnan Li, Hongdong Li, Juan Carlos Niebles, Steven C.H. Hoi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[840] arXiv:2112.09591 [pdf, other]: Title: Global explainability in aligned image modalities

Authors: Justin Engelmann, Amos Storkey, Miguel O. Bernabeu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[841] arXiv:2112.09598 [pdf, other]: Title: Towards Deep Learning-based 6D Bin Pose Estimation in 3D Scans

Authors: Lukáš Gajdošech, Viktor Kocur, Martin Stuchlík, Lukáš Hudec, Martin Madaras

Comments: Accepted at VISAPP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[842] arXiv:2112.09645 [pdf, other]: Title: Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation

Authors: Krishna Chaitanya, Ertunc Erdil, Neerav Karani, Ender Konukoglu

Comments: 13 pages, 4 figures, 7 tables. This article is under review at a Journal

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[843] arXiv:2112.09647 [pdf, other]: Title: Video-Based Reconstruction of the Trajectories Performed by Skiers

Authors: Matteo Dunnhofer, Alberto Zurini, Maurizio Dunnhofer, Christian Micheloni

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[844] arXiv:2112.09648 [pdf, other]: Title: Improving neural implicit surfaces geometry with patch warping

Authors: François Darmon, Bénédicte Bascle, Jean-Clément Devaux, Pascal Monasse, Mathieu Aubry

Comments: Accepted at CVPR2022. Project webpage: this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[845] arXiv:2112.09653 [pdf, other]: Title: Information-theoretic stochastic contrastive conditional GAN: InfoSCC-GAN

Authors: Vitaliy Kinakh, Mariia Drozdova, Guillaume Quétant, Tobias Golling, Slava Voloshynovskiy

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[846] arXiv:2112.09660 [pdf, other]: Title: AI-Assisted Verification of Biometric Data Collection

Authors: Ryan Lindsey

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[847] arXiv:2112.09664 [pdf, other]: Title: Towards More Effective PRM-based Crowd Counting via A Multi-resolution Fusion and Attention Network

Authors: Usman Sajid, Guanghui Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[848] arXiv:2112.09685 [pdf, other]: Title: Neuromorphic Camera Denoising using Graph Neural Network-driven Transformers

Authors: Yusra Alkendi, Rana Azzam, Abdulla Ayyad, Sajid Javed, Lakmal Seneviratne, Yahya Zweiri

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[849] arXiv:2112.09686 [pdf, other]: Title: Efficient Visual Tracking with Exemplar Transformers

Authors: Philippe Blatter, Menelaos Kanakis, Martin Danelljan, Luc Van Gool

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[850] arXiv:2112.09687 [pdf, other]: Title: Light Field Neural Rendering

Authors: Mohammed Suhail, Carlos Esteves, Leonid Sigal, Ameesh Makadia

Comments: Project page with code and videos at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[851] arXiv:2112.09690 [pdf, other]: Title: Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

Authors: Yinghao Xu, Fangyun Wei, Xiao Sun, Ceyuan Yang, Yujun Shen, Bo Dai, Bolei Zhou, Stephen Lin

Comments: CVPR 2022 camera-ready, Project webpage: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[852] arXiv:2112.09747 [pdf, other]: Title: A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation

Authors: Wuyang Chen, Xianzhi Du, Fan Yang, Lucas Beyer, Xiaohua Zhai, Tsung-Yi Lin, Huizhong Chen, Jing Li, Xiaodan Song, Zhangyang Wang, Denny Zhou

Comments: ECCV 2022 accepted

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[853] arXiv:2112.09775 [pdf, other]: Title: Adaptive Subsampling for ROI-based Visual Tracking: Algorithms and FPGA Implementation

Authors: Odrika Iqbal, Victor Isaac Torres Muro, Sameeksha Katoch, Andreas Spanias, Suren Jayasuriya

Subjects: Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR)
[854] arXiv:2112.09786 [pdf, other]: Title: Distill and De-bias: Mitigating Bias in Face Verification using Knowledge Distillation

Authors: Prithviraj Dhar, Joshua Gleason, Aniket Roy, Carlos D. Castillo, P. Jonathon Phillips, Rama Chellappa

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[855] arXiv:2112.09791 [pdf, other]: Title: Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks

Authors: Guangxing Han, Yicheng He, Shiyuan Huang, Jiawei Ma, Shih-Fu Chang

Comments: ICCV 2021. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[856] arXiv:2112.09809 [pdf, other]: Title: A Streaming Volumetric Image Generation Framework for Development and Evaluation of Out-of-Core Methods

Authors: Dominik Drees, Xiaoyi Jiang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[857] arXiv:2112.09828 [pdf, other]: Title: Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs

Authors: Shengyu Feng, Subarna Tripathi, Hesham Mostafa, Marcel Nassar, Somdeb Majumdar

Comments: WACV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[858] arXiv:2112.09833 [pdf, other]: Title: Face Deblurring Based on Separable Normalization and Adaptive Denormalization

Authors: Xian Zhang, Hao Zhang, Jiancheng Lv, Xiaojie Li

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[859] arXiv:2112.09839 [pdf, other]: Title: Calorie Aware Automatic Meal Kit Generation from an Image

Authors: Ahmad Babaeian Jelodar, Yu Sun

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[860] arXiv:2112.09844 [pdf, other]: Title: Enhanced Object Detection in Floor-plan through Super Resolution

Authors: Dev Khare, N S Kamal, Barathi Ganesh HB, V Sowmya, V V Sajith Variyar

Comments: 3rd International Conference on Machine Learning, Image Processing, Network Security and Data Sciences

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[861] arXiv:2112.09852 [pdf, other]: Title: LegoDNN: Block-grained Scaling of Deep Neural Networks for Mobile Vision

Authors: Rui Han, Qinglong Zhang, Chi Harold Liu, Guoren Wang, Jian Tang, Lydia Y. Chen

Comments: 13 pages, 15 figures

Journal-ref: In MobiCom'21, pages 406-419, 2021. ACM

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[862] arXiv:2112.09854 [pdf, other]: Title: Space Non-cooperative Object Active Tracking with Deep Reinforcement Learning

Authors: Dong Zhou, Guanghui Sun, Wenxiao Lei

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[863] arXiv:2112.09873 [pdf, ps, other]: Title: An effective coaxiality measurement for twist drill based on line structured light sensor

Authors: Ailing Cheng, Jiaojiao Ye, Fei Yang, Shufang Lu, Fei Gao

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. 13 pages, 22 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[864] arXiv:2112.09875 [pdf, other]: Title: Adversarial Memory Networks for Action Prediction

Authors: Zhiqiang Tao, Yue Bai, Handong Zhao, Sheng Li, Yu Kong, Yun Fu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[865] arXiv:2112.09898 [pdf, other]: Title: Does Explainable Machine Learning Uncover the Black Box in Vision Applications?

Authors: Manish Narwaria

Comments: Image and Vision Computing, Volume 118, 2022, 104353, ISSN 0262-8856, this https URL

Journal-ref: Image and Vision Computing, Volume 118, 2022, 104353, ISSN 0262-8856

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[866] arXiv:2112.09902 [pdf, other]: Title: 3D Instance Segmentation of MVS Buildings

Authors: Jiazhou Chen, Yanghui Xu, Shufang Lu, Ronghua Liang, Liangliang Nan

Comments: 14 figures, 12 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[867] arXiv:2112.09908 [pdf, other]: Title: Anomaly Discovery in Semantic Segmentation via Distillation Comparison Networks

Authors: Huan Zhou, Shi Gong, Yu Zhou, Zengqiang Zheng, Ronghua Liu, Xiang Bai

Comments: 9 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[868] arXiv:2112.09922 [pdf, other]: Title: Fast and Robust Registration of Partially Overlapping Point Clouds

Authors: Eduardo Arnold, Sajjad Mozaffari, Mehrdad Dianati

Comments: Accepted at IEEE Robotics and Automation Letters (RA-L). 8 pages, 6 figures, 3 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[869] arXiv:2112.09938 [pdf, other]: Title: DeepUME: Learning the Universal Manifold Embedding for Robust Point Cloud Registration

Authors: Natalie Lang, Joseph M. Francos

Comments: BMVC 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[870] arXiv:2112.09951 [pdf, ps, other]: Title: Rapid Face Mask Detection and Person Identification Model based on Deep Neural Networks

Authors: Abdullah Ahmad Khan (1), Mohd. Belal (2), GhufranUllah (3) ((1,2 and 3) Aligarh Muslim University)

Comments: 12 pages, 15 figures, International Conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[871] arXiv:2112.09965 [pdf, other]: Title: Pre-Training Transformers for Domain Adaptation

Authors: Burhan Ul Tayyab, Nicholas Chua

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[872] arXiv:2112.09976 [pdf, other]: Title: Tell me what you see: A zero-shot action recognition method based on natural language descriptions

Authors: Valter Estevam, Rayson Laroca, David Menotti, Helio Pedrini

Comments: Published at Multimedia Tools and Applications

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[873] arXiv:2112.10003 [pdf, other]: Title: Image Segmentation Using Text and Image Prompts

Authors: Timo Lüddecke, Alexander S. Ecker

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[874] arXiv:2112.10047 [pdf, other]: Title: Controlling the Quality of Distillation in Response-Based Network Compression

Authors: Vibhas Vats, David Crandall

Comments: AAAI22-Workshop: 1st International Workshop on Practical Deep Learning in the Wild

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[875] arXiv:2112.10057 [pdf, other]: Title: Precondition and Effect Reasoning for Action Recognition

Authors: Yoo Hongsang, Li Haopeng, Ke Qiuhong, Liu Liangchen, Zhang Rui

Comments: The paper is under consideration at Computer Vision and Image Understanding

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[876] arXiv:2112.10063 [pdf, other]: Title: Deep Graph-level Anomaly Detection by Glocal Knowledge Distillation

Authors: Rongrong Ma, Guansong Pang, Ling Chen, Anton van den Hengel

Comments: Accepted to WSDM 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[877] arXiv:2112.10066 [pdf, other]: Title: LocFormer: Enabling Transformers to Perform Temporal Moment Localization on Long Untrimmed Videos With a Feature Sampling Approach

Authors: Cristian Rodriguez-Opazo, Edison Marrese-Taylor, Basura Fernando, Hiroya Takamura, Qi Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[878] arXiv:2112.10082 [pdf, other]: Title: MoCaNet: Motion Retargeting in-the-wild via Canonicalization Networks

Authors: Wentao Zhu, Zhuoqian Yang, Ziang Di, Wayne Wu, Yizhou Wang, Chen Change Loy

Comments: Accepted by AAAI 2022. The first two authors contributed equally. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[879] arXiv:2112.10087 [pdf, other]: Title: Reasoning Structural Relation for Occlusion-Robust Facial Landmark Localization

Authors: Congcong Zhu, Xiaoqiang Li, Jide Li, Songmin Dai, Weiqin Tong

Comments: Accepted by Pattern Recognition

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[880] arXiv:2112.10089 [pdf, other]: Title: Camera-aware Style Separation and Contrastive Learning for Unsupervised Person Re-identification

Authors: Xue Li, Tengfei Liang, Yi Jin, Tao Wang, Yidong Li

Comments: 6 pages, 4 figures, 2 tables

Journal-ref: 2022 IEEE International Conference on Multimedia and Expo (ICME)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[881] arXiv:2112.10098 [pdf, other]: Title: Initiative Defense against Facial Manipulation

Authors: Qidong Huang, Jie Zhang, Wenbo Zhou, Weiming Zhang, Nenghai Yu

Comments: Accepted at AAAI 2021

Journal-ref: Proceedings of the AAAI Conference on Artificial Intelligence, 35(2), 1619-1627, 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[882] arXiv:2112.10101 [pdf, ps, other]: Title: ArcFace Knows the Gender, Too!

Authors: Majid Farzaneh

Comments: 9 pages, 4 images, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[883] arXiv:2112.10103 [pdf, other]: Title: SAGA: Stochastic Whole-Body Grasping with Contact

Authors: Yan Wu, Jiahao Wang, Yan Zhang, Siwei Zhang, Otmar Hilliges, Fisher Yu, Siyu Tang

Comments: Accepted by ECCV 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[884] arXiv:2112.10149 [pdf, other]: Title: Elastic-Link for Binarized Neural Network

Authors: Jie Hu, Ziheng Wu, Vince Tan, Zhilin Lu, Mengze Zeng, Enhua Wu

Comments: AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[885] arXiv:2112.10155 [pdf, other]: Title: Topology Preserving Local Road Network Estimation from Single Onboard Camera Image

Authors: Yigit Baran Can, Alexander Liniger, Danda Pani Paudel, Luc Van Gool

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[886] arXiv:2112.10167 [pdf, other]: Title: Improving Face-Based Age Estimation with Attention-Based Dynamic Patch Fusion

Authors: Haoyi Wang, Victor Sanchez, Chang-Tsun Li

Comments: IEEE Transactions on Image Processing (accepted for publication)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[887] arXiv:2112.10175 [pdf, other]: Title: On Efficient Transformer-Based Image Pre-training for Low-Level Vision

Authors: Wenbo Li, Xin Lu, Shengju Qian, Jiangbo Lu, Xiangyu Zhang, Jiaya Jia

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[888] arXiv:2112.10194 [pdf, other]: Title: UnweaveNet: Unweaving Activity Stories

Authors: Will Price, Carl Vondrick, Dima Damen

Comments: Accepted at IEEE/CVF Computer Vision and Pattern Recognition (CVPR) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[889] arXiv:2112.10196 [pdf, other]: Title: End-to-End Learning of Multi-category 3D Pose and Shape Estimation

Authors: Yigit Baran Can, Alexander Liniger, Danda Pani Paudel, Luc Van Gool

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[890] arXiv:2112.10203 [pdf, other]: Title: HVTR: Hybrid Volumetric-Textural Rendering for Human Avatars

Authors: Tao Hu, Tao Yu, Zerong Zheng, He Zhang, Yebin Liu, Matthias Zwicker

Comments: Accepted to 3DV 2022. See more results at this https URL. Demo: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[891] arXiv:2112.10258 [pdf, other]: Title: GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor

Authors: Jean-Baptiste Carluer, Laurent Chauvin, Jie Luo, William M. Wells III, Ines Machado, Rola Harmouche, Matthew Toews

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[892] arXiv:2112.10271 [pdf, other]: Title: Wiener Guided DIP for Unsupervised Blind Image Deconvolution

Authors: Gustav Bredell, Ertunc Erdil, Bruno Weber, Ender Konukoglu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[893] arXiv:2112.10275 [pdf, other]: Title: Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection

Authors: Renjie Li, Son Tran, Saurabh Garg, Katherine Lawler, Jane Alty, Quan Bai

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[894] arXiv:2112.10298 [pdf, other]: Title: Driver Drowsiness Detection Using Ensemble Convolutional Neural Networks on YawDD

Authors: Rais Mohammad Salman, Mahbubur Rashid, Rupal Roy, Md Manjurul Ahsan, Zahed Siddique

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[895] arXiv:2112.10305 [pdf, ps, other]: Title: Model-based gait recognition using graph network on very large population database

Authors: Zhihao Wang, Chaoying Tang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[896] arXiv:2112.10310 [pdf, other]: Title: Contrastive Attention Network with Dense Field Estimation for Face Completion

Authors: Xin Ma, Xiaoqiang Zhou, Huaibo Huang, Gengyun Jia, Zhenhua Chai, Xiaolin Wei

Comments: Accepted by Pattern Recognition 2021. arXiv admin note: substantial text overlap with arXiv:2010.15643

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[897] arXiv:2112.10324 [pdf, other]: Title: Product Re-identification System in Fully Automated Defect Detection

Authors: Chenggui Sun, Li Bin Song

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[898] arXiv:2112.10365 [pdf, other]: Title: DMS-GCN: Dynamic Mutiscale Spatiotemporal Graph Convolutional Networks for Human Motion Prediction

Authors: Zigeng Yan, Di-Hua Zhai, Yuanqing Xia

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[899] arXiv:2112.10390 [pdf, ps, other]: Title: Evaluation and Comparison of Deep Learning Methods for Pavement Crack Identification with Visual Images

Authors: Kai-Liang Lu

Comments: This work will be submitted for possible publication. It is a further study from 2012.14704v2

Journal-ref: Frontiers in Artificial Intelligence and Applications (CECNet2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[900] arXiv:2112.10415 [pdf, other]: Title: UFPMP-Det: Toward Accurate and Efficient Object Detection on Drone Imagery

Authors: Yecheng Huang, Jiaxin Chen, Di Huang

Comments: 8 pages, 6 figures, Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[901] arXiv:2112.10453 [pdf, other]: Title: Learning with Label Noise for Image Retrieval by Selecting Interactions

Authors: Sarah Ibrahimi, Arnaud Sors, Rafael Sampaio de Rezende, Stéphane Clinchant

Comments: Accepted at WACV 2022. 13 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[902] arXiv:2112.10457 [pdf, other]: Title: Image Animation with Keypoint Mask

Authors: Or Toledano, Yanir Marmor, Dov Gertz

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[903] arXiv:2112.10474 [pdf, other]: Title: Reciprocal Normalization for Domain Adaptation

Authors: Zhiyong Huang, Kekai Sheng, Ke Li, Jian Liang, Taiping Yao, Weiming Dong, Dengwen Zhou, Xing Sun

Comments: The best feature normalization module for domain adaptation

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[904] arXiv:2112.10481 [pdf, ps, other]: Title: A novel attention-based network for fast salient object detection

Authors: Bin Zhang, Yang Wu, Xiaojing Zhang, Ming Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[905] arXiv:2112.10482 [pdf, other]: Title: ScanQA: 3D Question Answering for Spatial Scene Understanding

Authors: Daichi Azuma, Taiki Miyanishi, Shuhei Kurita, Motoaki Kawanabe

Comments: CVPR2022. The first three authors contributed equally. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[906] arXiv:2112.10483 [pdf, other]: Title: Fusion and Orthogonal Projection for Improved Face-Voice Association

Authors: Muhammad Saad Saeed, Muhammad Haris Khan, Shah Nawaz, Muhammad Haroon Yousaf, Alessio Del Bue

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[907] arXiv:2112.10485 [pdf, other]: Title: Scale-Net: Learning to Reduce Scale Differences for Large-Scale Invariant Image Matching

Authors: Yujie Fu, Yihong Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[908] arXiv:2112.10531 [pdf, ps, other]: Title: Object Recognition as Classification via Visual Properties

Authors: Fausto Giunchiglia, Mayukh Bagchi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[909] arXiv:2112.10570 [pdf, other]: Title: Dynamic Hypergraph Convolutional Networks for Skeleton-Based Action Recognition

Authors: Jinfeng Wei, Yunxin Wang, Mengli Guo, Pei Lv, Xiaoshan Yang, Mingliang Xu

Comments: 12 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[910] arXiv:2112.10587 [pdf, other]: Title: Image-free multi-character recognition

Authors: Huayi Wang, Chunli Zhu, Liheng Bian

Comments: 17 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[911] arXiv:2112.10591 [pdf, other]: Title: Real-Time Optical Flow for Vehicular Perception with Low- and High-Resolution Event Cameras

Authors: Vincent Brebion, Julien Moreau, Franck Davoine

Comments: 13 pages, journal paper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[912] arXiv:2112.10600 [pdf, other]: Title: DeePaste -- Inpainting for Pasting

Authors: Levi Kassel, Michael Werman

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[913] arXiv:2112.10624 [pdf, other]: Title: Learning to integrate vision data into road network data

Authors: Oliver Stromann, Alireza Razavi, Michael Felsberg

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[914] arXiv:2112.10646 [pdf, other]: Title: Raw High-Definition Radar for Multi-Task Learning

Authors: Julien Rebut, Arthur Ouaknine, Waqas Malik, Patrick Pérez

Comments: 12 pages, 7 figures, 6 tables

Journal-ref: CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[915] arXiv:2112.10683 [pdf, other]: Title: SelFSR: Self-Conditioned Face Super-Resolution in the Wild via Flow Field Degradation Network

Authors: Xianfang Zeng, Jiangning Zhang, Liang Liu, Guangzhong Tian, Yong Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[916] arXiv:2112.10703 [pdf, other]: Title: Mega-NeRF: Scalable Construction of Large-Scale NeRFs for Virtual Fly-Throughs

Authors: Haithem Turki, Deva Ramanan, Mahadev Satyanarayanan

Comments: CVPR 2022. Project page: this https URL. GitHub: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[917] arXiv:2112.10716 [pdf, other]: Title: BAPose: Bottom-Up Pose Estimation with Disentangled Waterfall Representations

Authors: Bruno Artacho, Andreas Savakis

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[918] arXiv:2112.10727 [pdf, other]: Title: Learning Physics Properties of Fabrics and Garments with a Physics Similarity Neural Network

Authors: Li Duan, Lewis Boyd, Gerardo Aragon-Camarasa

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[919] arXiv:2112.10740 [pdf, other]: Title: Are Large-scale Datasets Necessary for Self-Supervised Pre-training?

Authors: Alaaeldin El-Nouby, Gautier Izacard, Hugo Touvron, Ivan Laptev, Hervé Jegou, Edouard Grave

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[920] arXiv:2112.10741 [pdf, other]: Title: GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

Authors: Alex Nichol, Prafulla Dhariwal, Aditya Ramesh, Pranav Shyam, Pamela Mishkin, Bob McGrew, Ilya Sutskever, Mark Chen

Comments: 20 pages, 18 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[921] arXiv:2112.10752 [pdf, other]: Title: High-Resolution Image Synthesis with Latent Diffusion Models

Authors: Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[922] arXiv:2112.10759 [pdf, other]: Title: 3D-aware Image Synthesis via Learning Structural and Textural Representations

Authors: Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou

Comments: CVPR 2022 camera-ready, Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[923] arXiv:2112.10762 [pdf, other]: Title: StyleSwin: Transformer-based GAN for High-resolution Image Generation

Authors: Bowen Zhang, Shuyang Gu, Bo Zhang, Jianmin Bao, Dong Chen, Fang Wen, Yong Wang, Baining Guo

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[924] arXiv:2112.10764 [pdf, other]: Title: Mask2Former for Video Instance Segmentation

Authors: Bowen Cheng, Anwesa Choudhuri, Ishan Misra, Alexander Kirillov, Rohit Girdhar, Alexander G. Schwing

Comments: Code and models: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[925] arXiv:2112.10809 [pdf, other]: Title: Lite Vision Transformer with Enhanced Self-Attention

Authors: Chenglin Yang, Yilin Wang, Jianming Zhang, He Zhang, Zijun Wei, Zhe Lin, Alan Yuille

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[926] arXiv:2112.10838 [pdf, other]: Title: One Sketch for All: One-Shot Personalized Sketch Segmentation

Authors: Anran Qi, Yulia Gryaditskaya, Tao Xiang, Yi-Zhe Song

Comments: IEEE Transactions on Image Processing, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[927] arXiv:2112.10844 [pdf, other]: Title: Encoding Hierarchical Information in Neural Networks helps in Subpopulation Shift

Authors: Amitangshu Mukherjee, Isha Garg, Kaushik Roy

Comments: 15 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[928] arXiv:2112.10871 [pdf, other]: Title: Translational Concept Embedding for Generalized Compositional Zero-shot Learning

Authors: He Huang, Wei Tang, Jiawei Zhang, Philip S. Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[929] arXiv:2112.10909 [pdf, ps, other]: Title: Spatiotemporal Motion Synchronization for Snowboard Big Air

Authors: Seiji Matsumura, Dan Mikami, Naoki Saijo, Makio Kashino

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[930] arXiv:2112.10936 [pdf, other]: Title: Watch Those Words: Video Falsification Detection Using Word-Conditioned Facial Motion

Authors: Shruti Agarwal, Liwen Hu, Evonne Ng, Trevor Darrell, Hao Li, Anna Rohrbach

Comments: Accepted in WACV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Multimedia (cs.MM)
[931] arXiv:2112.10941 [pdf, other]: Title: Structured Semantic Transfer for Multi-Label Recognition with Partial Labels

Authors: Tianshui Chen, Tao Pu, Hefeng Wu, Yuan Xie, Liang Lin

Comments: Accepted by AAAI'22

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[932] arXiv:2112.10945 [pdf, other]: Title: Pixel-Stega: Generative Image Steganography Based on Autoregressive Models

Authors: Siyu Zhang, Zhongliang Yang, Haoqin Tu, Jinshuai Yang, Yongfeng Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[933] arXiv:2112.10948 [pdf, ps, other]: Title: Task-Oriented Image Transmission for Scene Classification in Unmanned Aerial Systems

Authors: Xu Kang, Bin Song, Jie Guo, Zhijin Qin, F. Richard Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[934] arXiv:2112.10960 [pdf, other]: Title: Continuous-Time Video Generation via Learning Motion Dynamics with Neural ODE

Authors: Kangyeol Kim, Sunghyun Park, Junsoo Lee, Joonseok Lee, Sookyung Kim, Jaegul Choo, Edward Choi

Comments: 24 pages; Accepted to BMVC 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[935] arXiv:2112.10963 [pdf, other]: Title: DRPN: Making CNN Dynamically Handle Scale Variation

Authors: Jingchao Peng, Haitao Zhao, Zhengwei Hu, Kaijie Zhao, Zhongze Wang

Journal-ref: Digit. Signal Process, 133 (2023), pp. 103844

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[936] arXiv:2112.10969 [pdf, other]: Title: Generalizing Interactive Backpropagating Refinement for Dense Prediction

Authors: Fanqing Lin, Brian Price, Tony Martinez

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[937] arXiv:2112.10977 [pdf, other]: Title: ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action Localization

Authors: Zichen Yang, Jie Qin, Di Huang

Comments: Accepted to AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[938] arXiv:2112.10982 [pdf, other]: Title: Generalized Few-Shot Semantic Segmentation: All You Need is Fine-Tuning

Authors: Josh Myers-Dean, Yinan Zhao, Brian Price, Scott Cohen, Danna Gurari

Comments: Includes supplementary materials

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[939] arXiv:2112.10988 [pdf, other]: Title: Mapping industrial poultry operations at scale with deep learning and aerial imagery

Authors: Caleb Robinson, Ben Chugg, Brandon Anderson, Juan M. Lavista Ferres, Daniel E. Ho

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[940] arXiv:2112.10992 [pdf, other]: Title: Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition

Authors: Xiangbo Shu, Jiawen Yang, Rui Yan, Yan Song

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[941] arXiv:2112.11004 [pdf, other]: Title: Point spread function estimation for blind image deblurring problems based on framelet transform

Authors: Reza Parvaz

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Image and Video Processing (eess.IV); Optimization and Control (math.OC)
[942] arXiv:2112.11010 [pdf, other]: Title: MPViT: Multi-Path Vision Transformer for Dense Prediction

Authors: Youngwan Lee, Jonghee Kim, Jeff Willette, Sung Ju Hwang

Comments: technical report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[943] arXiv:2112.11014 [pdf, other]: Title: fMRI Neurofeedback Learning Patterns are Predictive of Personal and Clinical Traits

Authors: Rotem Leibovitz, Jhonathan Osin, Lior Wolf, Guy Gurevitch, Talma Hendler

Journal-ref: MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[944] arXiv:2112.11037 [pdf, other]: Title: SOIT: Segmenting Objects with Instance-Aware Transformers

Authors: Xiaodong Yu, Dahu Shi, Xing Wei, Ye Ren, Tingqun Ye, Wenming Tan

Comments: AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[945] arXiv:2112.11081 [pdf, other]: Title: RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality

Authors: Xiaohan Ding, Honghao Chen, Xiangyu Zhang, Jungong Han, Guiguang Ding

Comments: Accepted by CVPR-2022. This is the latest version

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[946] arXiv:2112.11085 [pdf, other]: Title: Can We Use Neural Regularization to Solve Depth Super-Resolution?

Authors: Milena Gazdieva, Oleg Voynov, Alexey Artemov, Youyi Zheng, Luiz Velho, Evgeny Burnaev

Comments: 9 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[947] arXiv:2112.11088 [pdf, other]: Title: EPNet++: Cascade Bi-directional Fusion for Multi-Modal 3D Object Detection

Authors: Zhe Liu, Tengteng Huang, Bingling Li, Xiwu Chen, Xi Wang, Xiang Bai

Comments: Accepted by TPAMI-2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[948] arXiv:2112.11121 [pdf, other]: Title: GlobalMatch: Registration of Forest Terrestrial Point Clouds by Global Matching of Relative Stem Positions

Authors: Xufei Wang, Zexin Yang, Xiaojun Cheng, Jantien Stoter, Wenbing Xu, Zhenlun Wu, Liangliang Nan

Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing. Vol. 197, 71-86, 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[949] arXiv:2112.11124 [pdf, other]: Title: Learning Human Motion Prediction via Stochastic Differential Equations

Authors: Kedi Lyu, Zhenguang Liu, Shuang Wu, Haipeng Chen, Xuhong Zhang, Yuyu Yin

Comments: 9 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[950] arXiv:2112.11133 [pdf, other]: Title: Cloud Sphere: A 3D Shape Representation via Progressive Deformation

Authors: Zongji Wang, Yunfei Liu, Feng Lu

Comments: This paper was first submitted to CVPR 2021 (paper id: 2255), and then submitted to CVM 2022 (id: 160)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[951] arXiv:2112.11153 [pdf, other]: Title: PONet: Robust 3D Human Pose Estimation via Learning Orientations Only

Authors: Jue Wang, Shaoli Huang, Xinchao Wang, Dacheng Tao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[952] arXiv:2112.11177 [pdf, other]: Title: Generalizable Cross-modality Medical Image Segmentation via Style Augmentation and Dual Normalization

Authors: Ziqi Zhou, Lei Qi, Xin Yang, Dong Ni, Yinghuan Shi

Comments: Accepted by CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[953] arXiv:2112.11224 [pdf, other]: Title: Attention-Based Sensor Fusion for Human Activity Recognition Using IMU Signals

Authors: Wenjin Tao, Haodong Chen, Md Moniruzzaman, Ming C. Leu, Zhaozheng Yi, Ruwen Qin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[954] arXiv:2112.11235 [pdf, other]: Title: Improving Robustness with Image Filtering

Authors: Matteo Terzi, Mattia Carletti, Gian Antonio Susto

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[955] arXiv:2112.11242 [pdf, other]: Title: Unsupervised deep learning techniques for powdery mildew recognition based on multispectral imaging

Authors: Alessandro Benfenati, Paola Causin, Roberto Oberti, Giovanni Stefanello

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[956] arXiv:2112.11243 [src]: Title: Projected Sliced Wasserstein Autoencoder-based Hyperspectral Images Anomaly Detection

Authors: Yurong Chen, Hui Zhang, Yaonan Wang, Q. M. Jonathan Wu, Yimin Yang

Comments: I need to revise this paper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[957] arXiv:2112.11244 [pdf, other]: Title: Hateful Memes Challenge: An Enhanced Multimodal Framework

Authors: Aijing Gao, Bingjun Wang, Jiaqi Yin, Yating Tian

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[958] arXiv:2112.11245 [pdf, other]: Title: Generating Photo-realistic Images from LiDAR Point Clouds with Generative Adversarial Networks

Authors: Nuriel Shalom Mor

Comments: 11 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[959] arXiv:2112.11246 [pdf, ps, other]: Title: Image quality enhancement of embedded holograms in holographic information hiding using deep neural networks

Authors: Tomoyoshi Shimobaba, Sota Oshima, Takashi Kakue, Tomoyoshi Ito

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[960] arXiv:2112.11258 [pdf, other]: Title: PointCaps: Raw Point Cloud Processing using Capsule Networks with Euclidean Distance Routing

Authors: Dishanika Denipitiyage, Vinoj Jayasundara, Ranga Rodrigo, Chamira U. S. Edussooriya

Comments: Accepted to be published in Journal of Visual Communication and Image Representation (Elsevier), 16 Pages, 4 Figures, 5 Tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[961] arXiv:2112.11271 [pdf, other]: Title: High-Fidelity Point Cloud Completion with Low-Resolution Recovery and Noise-Aware Upsampling

Authors: Ren-Wu Li, Bo Wang, Chun-Peng Li, Ling-Xiao Zhang, Lin Gao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[962] arXiv:2112.11290 [pdf, other]: Title: Review of Face Presentation Attack Detection Competitions

Authors: Zitong Yu, Jukka Komulainen, Xiaobai Li, Guoying Zhao

Comments: Handbook of Biometric Anti-Spoofing (3rd Ed.)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[963] arXiv:2112.11325 [pdf, other]: Title: iSegFormer: Interactive Segmentation via Transformers with Application to 3D Knee MR Images

Authors: Qin Liu, Zhenlin Xu, Yining Jiao, Marc Niethammer

Comments: MICCAI'22 camera-ready

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[964] arXiv:2112.11329 [pdf, other]: Title: Multispectral image fusion based on super pixel segmentation

Authors: Nati Ofir, Jean-Christophe Nebel

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[965] arXiv:2112.11335 [pdf, other]: Title: Deep Learning Based 3D Point Cloud Regression for Estimating Forest Biomass

Authors: Stefan Oehmcke, Lei Li, Katerina Trepekli, Jaime Revenga, Thomas Nord-Larsen, Fabian Gieseke, Christian Igel

Comments: 31 pages, 14 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[966] arXiv:2112.11340 [pdf, other]: Title: Transferable End-to-end Room Layout Estimation via Implicit Encoding

Authors: Hao Zhao, Rene Ranftl, Yurong Chen, Hongbin Zha

Comments: Project: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[967] arXiv:2112.11347 [pdf, other]: Title: Watch It Move: Unsupervised Discovery of 3D Joints for Re-Posing of Articulated Objects

Authors: Atsuhiro Noguchi, Umar Iqbal, Jonathan Tremblay, Tatsuya Harada, Orazio Gallo

Comments: CVPR2022, 16 pages, Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[968] arXiv:2112.11366 [pdf, other]: Title: Contrastive Object Detection Using Knowledge Graph Embeddings

Authors: Christopher Lang, Alexander Braun, Abhinav Valada

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[969] arXiv:2112.11377 [pdf, other]: Title: Shape from Polarization for Complex Scenes in the Wild

Authors: Chenyang Lei, Chenyang Qi, Jiaxin Xie, Na Fan, Vladlen Koltun, Qifeng Chen

Comments: Accepted to CVPR 2022; GitHub link: this https URL; Project website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[970] arXiv:2112.11384 [pdf, other]: Title: Sports Video: Fine-Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2021

Authors: Pierre-Etienne Martin (LaBRI, MPI-EVA, UB), Jordan Calandre (MIA), Boris Mansencal (LaBRI), Jenny Benois-Pineau (LaBRI), Renaud Péteri (MIA), Laurent Mascarilla (MIA), Julien Morlier (IMS)

Comments: MediaEval 2021, Dec 2021, Online, Germany

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[971] arXiv:2112.11406 [pdf, other]: Title: ADJUST: A Dictionary-Based Joint Reconstruction and Unmixing Method for Spectral Tomography

Authors: Mathé T. Zeegers, Ajinkya Kadu, Tristan van Leeuwen, Kees Joost Batenburg

Comments: This paper is under consideration at Inverse Problems with minor revisions. 33 pages, 24 figures

Journal-ref: Inverse Problems 38 12 (2022) 125002

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Engineering, Finance, and Science (cs.CE); Numerical Analysis (math.NA); Optimization and Control (math.OC)
[972] arXiv:2112.11427 [pdf, other]: Title: StyleSDF: High-Resolution 3D-Consistent Image and Geometry Generation

Authors: Roy Or-El, Xuan Luo, Mengyi Shan, Eli Shechtman, Jeong Joon Park, Ira Kemelmacher-Shlizerman

Comments: Camera-Ready version. Paper was accepted as oral to CVPR 2022. Added discussions and figures from the rebuttal to the supplementary material (sections C & F). Project Webpage: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[973] arXiv:2112.11435 [pdf, other]: Title: Learned Queries for Efficient Local Attention

Authors: Moab Arar, Ariel Shamir, Amit H. Bermano

Comments: CVPR 2022 - Oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[974] arXiv:2112.11454 [pdf, other]: Title: GOAL: Generating 4D Whole-Body Motion for Hand-Object Grasping

Authors: Omid Taheri, Vasileios Choutas, Michael J. Black, Dimitrios Tzionas

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[975] arXiv:2112.11542 [pdf, other]: Title: MIA-Former: Efficient and Robust Vision Transformers via Multi-grained Input-Adaptation

Authors: Zhongzhi Yu, Yonggan Fu, Sicheng Li, Chaojian Li, Yingyan Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[976] arXiv:2112.11543 [pdf, other]: Title: Real-time Street Human Motion Capture

Authors: Yanquan Chen, Fei Yang, Tianyu Lang, Guanfang Dong, Anup Basu

Comments: 7 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[977] arXiv:2112.11547 [pdf, other]: Title: Decompose the Sounds and Pixels, Recompose the Events

Authors: Varshanth R. Rao, Md Ibrahim Khalil, Haoda Li, Peng Dai, Juwei Lu

Comments: Accepted at AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[978] arXiv:2112.11554 [pdf, other]: Title: Distribution-aware Margin Calibration for Semantic Segmentation in Images

Authors: Litao Yu, Zhibin Li, Min Xu, Yongsheng Gao, Jiebo Luo, Jian Zhang

Comments: This paper has been accepted by International Journal of Computer Vision (IJCV), and published on 09 November 2021. arXiv admin note: text overlap with arXiv:2011.01462

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[979] arXiv:2112.11573 [pdf, other]: Title: Anomaly Clustering: Grouping Images into Coherent Clusters of Anomaly Types

Authors: Kihyuk Sohn, Jinsung Yoon, Chun-Liang Li, Chen-Yu Lee, Tomas Pfister

Comments: WACV2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[980] arXiv:2112.11593 [pdf, other]: Title: AdaptPose: Cross-Dataset Adaptation for 3D Human Pose Estimation by Learnable Motion Generation

Authors: Mohsen Gholami, Bastian Wandt, Helge Rhodin, Rabab Ward, Z. Jane Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[981] arXiv:2112.11610 [pdf, other]: Title: EyePAD++: A Distillation-based approach for joint Eye Authentication and Presentation Attack Detection using Periocular Images

Authors: Prithviraj Dhar, Amit Kumar, Kirsten Kaplan, Khushi Gupta, Rakesh Ranjan, Rama Chellappa

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[982] arXiv:2112.11623 [pdf, other]: Title: MOSAIC: Mobile Segmentation via decoding Aggregated Information and encoded Context

Authors: Weijun Wang, Andrew Howard

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[983] arXiv:2112.11629 [pdf, other]: Title: Convolutional neural network based on transfer learning for breast cancer screening

Authors: Hussin Ragb, Redha Ali, Elforjani Jera, Nagi Buaossa

Comments: 9 pages, 7 figures. arXiv admin note: text overlap with arXiv:2009.08831

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[984] arXiv:2112.11641 [pdf, other]: Title: JoJoGAN: One Shot Face Stylization

Authors: Min Jin Chong, David Forsyth

Comments: code at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[985] arXiv:2112.11643 [pdf, other]: Title: Exploring Credibility Scoring Metrics of Perception Systems for Autonomous Driving

Authors: Viren Khandal, Arth Vidyarthi

Comments: In 14th International Conference on COMmunication Systems & NETworkS (COMSNETS) Intelligent Transportation Systems 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[986] arXiv:2112.11648 [pdf, other]: Title: Out-of-distribution Detection with Boundary Aware Learning

Authors: Sen Pei, Xin Zhang, Bin Fan, Gaofeng Meng

Journal-ref: ECCV 2022 Poster

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[987] arXiv:2112.11679 [pdf, other]: Title: Ghost-dil-NetVLAD: A Lightweight Neural Network for Visual Place Recognition

Authors: Qingyuan Gong, Yu Liu, Liqiang Zhang, Renhe Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[988] arXiv:2112.11685 [pdf, other]: Title: Cost Aggregation Is All You Need for Few-Shot Segmentation

Authors: Sunghwan Hong, Seokju Cho, Jisu Nam, Seungryong Kim

Comments: The trained weights and codes are available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[989] arXiv:2112.11689 [pdf, other]: Title: Multi-Centroid Representation Network for Domain Adaptive Person Re-ID

Authors: Yuhang Wu, Tengteng Huang, Haotian Yao, Chi Zhang, Yuanjie Shao, Chuchu Han, Changxin Gao, Nong Sang

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[990] arXiv:2112.11691 [pdf, other]: Title: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation

Authors: Xu Yan, Zhihao Yuan, Yuhao Du, Yinghong Liao, Yao Guo, Zhen Li, Shuguang Cui

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[991] arXiv:2112.11699 [pdf, other]: Title: Few-Shot Object Detection: A Comprehensive Survey

Authors: Mona Köhler, Markus Eisenbach, Horst-Michael Gross

Comments: 27 pages, 13 figures, submitted to IEEE Transactions on Neural Networks and Learning Systems

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[992] arXiv:2112.11700 [pdf, other]: Title: Adaptive Contrast for Image Regression in Computer-Aided Disease Assessment

Authors: Weihang Dai, Xiaomeng Li, Wan Hang Keith Chiu, Michael D. Kuo, Kwang-Ting Cheng

Comments: Accepted in IEEE Transactions on Medical Imaging

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[993] arXiv:2112.11706 [pdf, other]: Title: Entropy Regularized Iterative Weighted Shrinkage-Thresholding Algorithm (ERIWSTA): An Application to CT Image Restoration

Authors: Bingxue Wu, Jiao Wei, Chen Li, Yudong Yao, Yueyang Teng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[994] arXiv:2112.11710 [pdf, ps, other]: Title: Fusion of medical imaging and electronic health records with attention and multi-head mechanisms

Authors: Cheng Jiang, Yihao Chen, Jianbo Chang, Ming Feng, Renzhi Wang, Jianhua Yao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[995] arXiv:2112.11713 [pdf, other]: Title: High-Accuracy RGB-D Face Recognition via Segmentation-Aware Face Depth Estimation and Mask-Guided Attention Network

Authors: Meng-Tzu Chiu, Hsun-Ying Cheng, Chien-Yi Wang, Shang-Hong Lai

Comments: IEEE International Conference on Automatic Face and Gesture Recognition (FG) 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[996] arXiv:2112.11716 [pdf, other]: Title: Comparing radiologists' gaze and saliency maps generated by interpretability methods for chest x-rays

Authors: Ricardo Bigolin Lanfredi, Ambuj Arora, Trafton Drew, Joyce D. Schroeder, Tolga Tasdizen

Comments: This paper was presented as an Extended Abstract at the Gaze Meets ML 2022 Workshop, a NeurIPS 2022 workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[997] arXiv:2112.11729 [pdf, other]: Title: Generalized Local Optimality for Video Steganalysis in Motion Vector Domain

Authors: Liming Zhai, Lina Wang, Yanzhen Ren, Yang Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[998] arXiv:2112.11749 [pdf, other]: Title: Class-aware Sounding Objects Localization via Audiovisual Correspondence

Authors: Di Hu, Yake Wei, Rui Qian, Weiyao Lin, Ruihua Song, Ji-Rong Wen

Comments: Accepted by TPAMI 2021. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[999] arXiv:2112.11779 [pdf, other]: Title: Exploring Inter-frequency Guidance of Image for Lightweight Gaussian Denoising

Authors: Zhuang Jia

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1000] arXiv:2112.11790 [pdf, other]: Title: BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-View

Authors: Junjie Huang, Guan Huang, Zheng Zhu, Yun Ye, Dalong Du

Comments: Multi-camera 3D Object Detection

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1001] arXiv:2112.11798 [pdf, other]: Title: YOLO-Z: Improving small object detection in YOLOv5 for autonomous vehicles

Authors: Aduen Benjumea, Izzeddin Teeti, Fabio Cuzzolin, Andrew Bradley

Comments: ICCV 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1002] arXiv:2112.11824 [pdf, ps, other]: Title: Binary Image Skeletonization Using 2-Stage U-Net

Authors: Mohamed A. Ghanem, Alaa A. Anani

Comments: Computer Vision Course Project [AUC, Spring 21]

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1003] arXiv:2112.11834 [pdf, other]: Title: Bottom-up approaches for multi-person pose estimation and its applications: A brief review

Authors: Milan Kresović, Thong Duy Nguyen

Comments: 13 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1004] arXiv:2112.11846 [pdf, other]: Title: A Discriminative Single-Shot Segmentation Network for Visual Object Tracking

Authors: Alan Lukežič, Jiří Matas, Matej Kristan

Comments: Extended version of the D3S tracker (CVPR2020). Accepted to IEEE TPAMI. arXiv admin note: substantial text overlap with arXiv:1911.08862

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1005] arXiv:2112.11853 [pdf, other]: Title: Geodesic squared exponential kernel for non-rigid shape registration

Authors: Florent Jousse (UCA, Qc, EPIONE), Xavier Pennec (UCA, EPIONE), Hervé Delingette (UCA, EPIONE), Matilde Gonzalez (Qc)

Comments: 2021 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021) Proceedings, Dec 2021, Jodhpur, India

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1006] arXiv:2112.11895 [pdf, other]: Title: Few-shot Font Generation with Weakly Supervised Localized Representations

Authors: Song Park, Sanghyuk Chun, Junbum Cha, Bado Lee, Hyunjung Shim

Comments: First two authors contributed equally. This is a journal extension of our AAAI 2021 paper arXiv:2009.11042; Code: this https URL and this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1007] arXiv:2112.11929 [pdf, other]: Title: Meta-Learning and Self-Supervised Pretraining for Real World Image Translation

Authors: Ileana Rugina, Rumen Dangovski, Mark Veillette, Pooya Khorrami, Brian Cheung, Olga Simek, Marin Soljačić

Comments: 10 pages, 8 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1008] arXiv:2112.11975 [pdf, other]: Title: Page Segmentation using Visual Adjacency Analysis

Authors: Mohammad Bajammal, Ali Mesbah

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1009] arXiv:2112.11992 [pdf, other]: Title: Automatic Estimation of Anthropometric Human Body Measurements

Authors: Dana Škorvánková, Adam Riečický, Martin Madaras

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1010] arXiv:2112.12001 [pdf, other]: Title: DA-FDFtNet: Dual Attention Fake Detection Fine-tuning Network to Detect Various AI-Generated Fake Images

Authors: Young Oh Bang, Simon S. Woo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1011] arXiv:2112.12002 [pdf, other]: Title: Looking Beyond Corners: Contrastive Learning of Visual Representations for Keypoint Detection and Description Extraction

Authors: Henrique Siqueira, Patrick Ruhkamp, Ibrahim Halfaoui, Markus Karmann, Onay Urfalioglu

Comments: Accepted at IEEE WCCI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1012] arXiv:2112.12004 [pdf, other]: Title: Barely-Supervised Learning: Semi-Supervised Learning with very few labeled images

Authors: Thomas Lucas, Philippe Weinzaepfel, Gregory Rogez

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1013] arXiv:2112.12027 [pdf, other]: Title: Learning and Crafting for the Wide Multiple Baseline Stereo

Authors: Dmytro Mishkin

Comments: After-defence version with additional fixes based on reviewer comments. 144 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1014] arXiv:2112.12053 [pdf, other]: Title: Multi-View Partial (MVP) Point Cloud Challenge 2021 on Completion and Registration: Methods and Results

Authors: Liang Pan, Tong Wu, Zhongang Cai, Ziwei Liu, Xumin Yu, Yongming Rao, Jiwen Lu, Jie Zhou, Mingye Xu, Xiaoyuan Luo, Kexue Fu, Peng Gao, Manning Wang, Yali Wang, Yu Qiao, Junsheng Zhou, Xin Wen, Peng Xiang, Yu-Shen Liu, Zhizhong Han, Yuanjie Yan, Junyi An, Lifa Zhu, Changwei Lin, Dongrui Liu, Xin Li, Francisco Gómez-Fernández, Qinlong Wang, Yang Yang

Comments: 15 pages, 13 figures, ICCV 2021 Workshop Technical Report, the codebase webpage: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1015] arXiv:2112.12060 [pdf, ps, other]: Title: Deep Models for Visual Sentiment Analysis of Disaster-related Multimedia Content

Authors: Khubaib Ahmad, Muhammad Asif Ayub, Kashif Ahmad, Ala Al-Fuqaha, Nasir Ahmad

Comments: 3 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1016] arXiv:2112.12070 [pdf, ps, other]: Title: A Single-Target License Plate Detection with Attention

Authors: Wenyun Li, Chi-Man Pun

Comments: IWAIT2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1017] arXiv:2112.12072 [pdf, other]: Title: Hierarchical Cross-Modality Semantic Correlation Learning Model for Multimodal Summarization

Authors: Litian Zhang, Xiaoming Zhang, Junshu Pan, Feiran Huang

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[1018] arXiv:2112.12073 [pdf, other]: Title: Two Stream Network for Stroke Detection in Table Tennis

Authors: Anam Zahra (MPI-EVA), Pierre-Etienne Martin (LaBRI, MPI-EVA, UB)

Comments: MediaEval 2021, Dec 2021, Online, Germany

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[1019] arXiv:2112.12074 [pdf, other]: Title: Spatio-Temporal CNN baseline method for the Sports Video Task of MediaEval 2021 benchmark

Authors: Pierre-Etienne Martin (LaBRI, MPI-EVA, UB)

Journal-ref: MediaEval 2021, Dec 2021, Online, Germany

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[1020] arXiv:2112.12082 [pdf, ps, other]: Title: A New Adaptive Noise Covariance Matrices Estimation and Filtering Method: Application to Multi-Object Tracking

Authors: Chao Jiang, Zhiling Wang, Shuhang Tan, Huawei Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1021] arXiv:2112.12084 [pdf, other]: Title: Input-Specific Robustness Certification for Randomized Smoothing

Authors: Ruoxin Chen, Jie Li, Junchi Yan, Ping Li, Bin Sheng

Comments: Accepted by AAAI22

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1022] arXiv:2112.12086 [pdf, other]: Title: Improved skin lesion recognition by a Self-Supervised Curricular Deep Learning approach

Authors: Kirill Sirotkin (1), Marcos Escudero-Viñolo (1), Pablo Carballeira (1), Juan Carlos SanMiguel (1) ((1) Universidad Autónoma de Madrid, Escuela Politécnica Superior, Spain)

Comments: 11 pages, 8 figures, submitted to the Journal of Biomedical and Health Informatics (Special Issue on Skin Image Analysis in the Age of Deep Learning)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1023] arXiv:2112.12089 [pdf, other]: Title: Reflash Dropout in Image Super-Resolution

Authors: Xiangtao Kong, Xina Liu, Jinjin Gu, Yu Qiao, Chao Dong

Comments: CVPR2022 paper + supplementary file

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1024] arXiv:2112.12130 [pdf, other]: Title: NICE-SLAM: Neural Implicit Scalable Encoding for SLAM

Authors: Zihan Zhu, Songyou Peng, Viktor Larsson, Weiwei Xu, Hujun Bao, Zhaopeng Cui, Martin R. Oswald, Marc Pollefeys

Comments: CVPR 2022, first two authors contributed equally. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1025] arXiv:2112.12133 [pdf, other]: Title: Can Deep Neural Networks be Converted to Ultra Low-Latency Spiking Neural Networks?

Authors: Gourav Datta, Peter A. Beerel

Comments: Accepted to DATE 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1026] arXiv:2112.12141 [pdf, other]: Title: Multi-modal 3D Human Pose Estimation with 2D Weak Supervision in Autonomous Driving

Authors: Jingxiao Zheng, Xinwei Shi, Alexander Gorban, Junhua Mao, Yang Song, Charles R. Qi, Ting Liu, Visesh Chari, Andre Cornman, Yin Zhou, Congcong Li, Dragomir Anguelov

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1027] arXiv:2112.12143 [pdf, other]: Title: Scaling Open-Vocabulary Image Segmentation with Image-Level Labels

Authors: Golnaz Ghiasi, Xiuye Gu, Yin Cui, Tsung-Yi Lin

Comments: Accepted at ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1028] arXiv:2112.12175 [pdf, other]: Title: Recur, Attend or Convolve? On Whether Temporal Modeling Matters for Cross-Domain Robustness in Action Recognition

Authors: Sofia Broomé, Ernest Pokropek, Boyu Li, Hedvig Kjellström

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1029] arXiv:2112.12180 [pdf, other]: Title: Multimodal Personality Recognition using Cross-Attention Transformer and Behaviour Encoding

Authors: Tanay Agrawal, Dhruv Agarwal, Michal Balazia, Neelabh Sinha, Francois Bremond

Comments: Preprint. Final paper accepted at the 17th International Conference on Computer Vision Theory and Applications (VISAPP), virtual, February, 2022. 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1030] arXiv:2112.12182 [pdf, other]: Title: Fine-grained Multi-Modal Self-Supervised Learning

Authors: Duo Wang, Salah Karout

Comments: Accepted at BMVC 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1031] arXiv:2112.12193 [pdf, other]: Title: Improved 2D Keypoint Detection in Out-of-Balance and Fall Situations -- combining input rotations and a kinematic model

Authors: Michael Zwölfer, Dieter Heinrich, Kurt Schindelwig, Bastian Wandt, Helge Rhodin, Joerg Spoerri, Werner Nachbauer

Comments: extended abstract, 4 pages, 3 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1032] arXiv:2112.12218 [pdf, other]: Title: Maximum Entropy on Erroneous Predictions (MEEP): Improving model calibration for medical image segmentation

Authors: Agostina Larrazabal, Cesar Martinez, Jose Dolz, Enzo Ferrante

Comments: Accepted for publication at MICCAI 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1033] arXiv:2112.12219 [pdf, other]: Title: SAMCNet for Spatial-configuration-based Classification: A Summary of Results

Authors: Majid Farhadloo, Carl Molnar, Gaoxiang Luo, Yan Li, Shashi Shekhar, Rachel L. Maus, Svetomir N. Markovic, Raymond Moore, Alexey Leontovich

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1034] arXiv:2112.12252 [pdf, other]: Title: Leveraging Synthetic Data in Object Detection on Unmanned Aerial Vehicles

Authors: Benjamin Kiefer, David Ott, Andreas Zell

Comments: The first two authors contributed equally. GitHub repository will be made public soon

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1035] arXiv:2112.12328 [pdf, other]: Title: Robust and Precise Facial Landmark Detection by Self-Calibrated Pose Attention Network

Authors: Jun Wan, Hui Xi, Jie Zhou, Zhihui Lai, Witold Pedrycz, Xu Wang, Hang Sun

Comments: Accepted by IEEE Transactions on Cybernetics, December 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1036] arXiv:2112.12329 [pdf, other]: Title: MVDG: A Unified Multi-view Framework for Domain Generalization

Authors: Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao

Comments: Accepted by ECCV2022. The code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1037] arXiv:2112.12345 [pdf, other]: Title: Revisiting Transformation Invariant Geometric Deep Learning: Are Initial Representations All You Need?

Authors: Ziwei Zhang, Xin Wang, Zeyang Zhang, Peng Cui, Wenwu Zhu

Comments: 11 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1038] arXiv:2112.12349 [pdf, other]: Title: Learning Hierarchical Attention for Weakly-supervised Chest X-Ray Abnormality Localization and Diagnosis

Authors: Xi Ouyang, Srikrishna Karanam, Ziyan Wu, Terrence Chen, Jiayu Huo, Xiang Sean Zhou, Qian Wang, Jie-Zhi Cheng

Journal-ref: IEEE Transactions on Medical Imaging 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1039] arXiv:2112.12355 [pdf, other]: Title: A Random Point Initialization Approach to Image Segmentation with Variational Level-sets

Authors: J.N. Mueller, J.N. Corcoran

Comments: 17 pages, 27 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1040] arXiv:2112.12359 [pdf, other]: Title: Dual Path Structural Contrastive Embeddings for Learning Novel Objects

Authors: Bingbin Li, Elvis Han Cui, Yanan Li, Donghui Wang, Weng Kee Wong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1041] arXiv:2112.12385 [pdf, other]: Title: DILF-EN framework for Class-Incremental Learning

Authors: Mohammed Asad Karim, Indu Joshi, Pratik Mazumder, Pravendra Singh

Comments: Under Review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1042] arXiv:2112.12390 [pdf, other]: Title: Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields

Authors: Guangming Yao, Hongzhi Wu, Yi Yuan, Lincheng Li, Kun Zhou, Xin Yu

Comments: 6 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1043] arXiv:2112.12402 [pdf, other]: Title: Iteratively Selecting an Easy Reference Frame Makes Unsupervised Video Object Segmentation Easier

Authors: Youngjo Lee, Hongje Seong, Euntai Kim

Comments: Accepted to AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1044] arXiv:2112.12409 [pdf, other]: Title: InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition

Authors: Andreea Glavan, Estefania Talavera

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1045] arXiv:2112.12455 [pdf, ps, other]: Title: Your Face Mirrors Your Deepest Beliefs-Predicting Personality and Morals through Facial Emotion Recognition

Authors: P. A. Gloor, A. Fronzetti Colladon, E. Altuntas, C. Cetinkaya, M. F. Kaiser, L. Ripperger, T. Schaefer

Journal-ref: Future Internet 14(1), 5 (2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[1046] arXiv:2112.12484 [pdf, other]: Title: Pose Adaptive Dual Mixup for Few-Shot Single-View 3D Reconstruction

Authors: Ta-Ying Cheng, Hsuan-Ru Yang, Niki Trigoni, Hwann-Tzong Chen, Tyng-Luh Liu

Comments: To appear in the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI), February 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1047] arXiv:2112.12494 [pdf, other]: Title: LaTr: Layout-Aware Transformer for Scene-Text VQA

Authors: Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1048] arXiv:2112.12496 [pdf, other]: Title: FedFR: Joint Optimization Federated Framework for Generic and Personalized Face Recognition

Authors: Chih-Ting Liu, Chien-Yi Wang, Shao-Yi Chien, Shang-Hong Lai

Comments: This paper was accepted by AAAI 2022 Conference on Artificial Intelligence and selected as an oral paper

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1049] arXiv:2112.12506 [pdf, other]: Title: Attentive Multi-View Deep Subspace Clustering Net

Authors: Run-kun Lu, Jian-wei Liu, Xin Zuo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1050] arXiv:2112.12535 [pdf, other]: Title: FourierMask: Instance Segmentation using Fourier Mapping in Implicit Neural Networks

Authors: Hamd ul Moqeet Riaz, Nuri Benbarka, Timon Hoefer, Andreas Zell

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1051] arXiv:2112.12573 [pdf, other]: Title: Boosting Generative Zero-Shot Learning by Synthesizing Diverse Features with Attribute Augmentation

Authors: Xiaojie Zhao, Yuming Shen, Shidong Wang, Haofeng Zhang

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1052] arXiv:2112.12577 [pdf, other]: Title: NVS-MonoDepth: Improving Monocular Depth Prediction with Novel View Synthesis

Authors: Zuria Bauer, Zuoyue Li, Sergio Orts-Escolano, Miguel Cazorla, Marc Pollefeys, Martin R. Oswald

Comments: 8 pages (main paper), 9 pages (supplementary material), 14 figures, 4 tables

Journal-ref: 2021 International Conference on 3D Vision (3DV)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1053] arXiv:2112.12579 [pdf, other]: Title: NeRD++: Improved 3D-mirror symmetry learning from a single image

Authors: Yancong Lin, Silvia-Laura Pintea, Jan van Gemert

Comments: BMVC 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1054] arXiv:2112.12606 [pdf, other]: Title: Towards Universal GAN Image Detection

Authors: Davide Cozzolino, Diego Gragnaniello, Giovanni Poggi, Luisa Verdoliva

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1055] arXiv:2112.12610 [pdf, other]: Title: PandaSet: Advanced Sensor Suite Dataset for Autonomous Driving

Authors: Pengchuan Xiao, Zhenlei Shao, Steven Hao, Zishuo Zhang, Xiaolin Chai, Judy Jiao, Zesong Li, Jian Wu, Kai Sun, Kun Jiang, Yunlong Wang, Diange Yang

Comments: This paper has been published at ITSC 2021; please check the PandaSet website for more information: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1056] arXiv:2112.12618 [pdf, other]: Title: Manifold Learning Benefits GANs

Authors: Yao Ni, Piotr Koniusz, Richard Hartley, Richard Nock

Comments: CVPR 2022, 32 pages full version

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1057] arXiv:2112.12625 [pdf, other]: Title: Comparison and Analysis of Image-to-Image Generative Adversarial Networks: A Survey

Authors: Sagar Saxena, Mohammad Nayeem Teli

Comments: 36 pages, 22 figures, Preprint; format changed, typos corrected

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1058] arXiv:2112.12668 [pdf, other]: Title: 3D Skeleton-based Few-shot Action Recognition with JEANIE is not so Naïve

Authors: Lei Wang, Jun Liu, Piotr Koniusz

Comments: Full 17 page version

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[1059] arXiv:2112.12702 [pdf, other]: Title: TagLab: A human-centric AI system for interactive semantic segmentation

Authors: Gaia Pavoni, Massimiliano Corsini, Federico Ponchio, Alessandro Muntoni, Paolo Cignoni

Comments: Accepted at Human Centered AI workshop at NeurIPS 2021, this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[1060] arXiv:2112.12703 [pdf, other]: Title: Digital Editions as Distant Supervision for Layout Analysis of Printed Books

Authors: Alejandro H. Toselli, Si Wu, David A. Smith

Comments: 15 pages, 2 figures. International Conference on Document Analysis and Recognition. Springer, Cham, 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1061] arXiv:2112.12748 [pdf, other]: Title: Assessing the Impact of Attention and Self-Attention Mechanisms on the Classification of Skin Lesions

Authors: Rafael Pedro, Arlindo L. Oliveira

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1062] arXiv:2112.12750 [pdf, other]: Title: SLIP: Self-supervision meets Language-Image Pre-training

Authors: Norman Mu, Alexander Kirillov, David Wagner, Saining Xie

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1063] arXiv:2112.12761 [pdf, other]: Title: BANMo: Building Animatable 3D Neural Models from Many Casual Videos

Authors: Gengshan Yang, Minh Vo, Natalia Neverova, Deva Ramanan, Andrea Vedaldi, Hanbyul Joo

Comments: CVPR 2022 camera-ready version (last update: May 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1064] arXiv:2112.12777 [pdf, other]: Title: Cross Modal Retrieval with Querybank Normalisation

Authors: Simion-Vlad Bogolin, Ioana Croitoru, Hailin Jin, Yang Liu, Samuel Albanie

Comments: Accepted at CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1065] arXiv:2112.12782 [pdf, other]: Title: SeMask: Semantically Masked Transformers for Semantic Segmentation

Authors: Jitesh Jain, Anukriti Singh, Nikita Orlov, Zilong Huang, Jiachen Li, Steven Walton, Humphrey Shi

Comments: Updated experiments with Mix-Transformer (MiT) on ADE20K and added an analysis section

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1066] arXiv:2112.12785 [pdf, other]: Title: NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning

Authors: Tony Ng, Hyo Jin Kim, Vincent Lee, Daniel DeTone, Tsun-Yi Yang, Tianwei Shen, Eddy Ilg, Vassileios Balntas, Krystian Mikolajczyk, Chris Sweeney

Comments: Accepted at CVPR 2022. Supplementary material included after references. 15 pages, 14 figures, 6 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1067] arXiv:2112.12786 [pdf, other]: Title: ELSA: Enhanced Local Self-Attention for Vision Transformer

Authors: Jingkai Zhou, Pichao Wang, Fan Wang, Qiong Liu, Hao Li, Rong Jin

Comments: Project at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[1068] arXiv:2112.12812 [pdf, other]: Title: MDN-VO: Estimating Visual Odometry with Confidence

Authors: Nimet Kaygusuz, Oscar Mendez, Richard Bowden

Journal-ref: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 3528-3533

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1069] arXiv:2112.12818 [pdf, other]: Title: Multi-Camera Sensor Fusion for Visual Odometry using Deep Uncertainty Estimation

Authors: Nimet Kaygusuz, Oscar Mendez, Richard Bowden

Journal-ref: 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), 2021, pp. 2944-2949

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1070] arXiv:2112.12833 [pdf, other]: Title: Dense Out-of-Distribution Detection by Robust Learning on Synthetic Negative Data

Authors: Matej Grcić, Petra Bevandić, Zoran Kalafatić, Siniša Šegvić

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1071] arXiv:2112.12843 [pdf, other]: Title: Impact of class imbalance on chest x-ray classifiers: towards better evaluation practices for discrimination and calibration performance

Authors: Candelaria Mosquera, Luciana Ferrer, Diego Milone, Daniel Luna, Enzo Ferrante

Comments: Conference on Health, Inference, and Learning (CHIL) 2022 - Invited non-archival presentation

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1072] arXiv:2112.12867 [pdf, other]: Title: HSPACE: Synthetic Parametric Humans Animated in Complex Environments

Authors: Eduard Gabriel Bazavan, Andrei Zanfir, Mihai Zanfir, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1073] arXiv:2112.12887 [pdf, other]: Title: A formal approach to good practices in Pseudo-Labeling for Unsupervised Domain Adaptive Re-Identification

Authors: Fabian Dubourvieux, Romaric Audigier, Angélique Loesch, Samia Ainouz, Stéphane Canu

Comments: This paper is a preprint under submission at CVIU for review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1074] arXiv:2112.12911 [pdf, other]: Title: Cluster-guided Image Synthesis with Unconditional Models

Authors: Markos Georgopoulos, James Oldfield, Grigorios G Chrysos, Yannis Panagakis

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1075] arXiv:2112.12916 [pdf, other]: Title: Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition

Authors: Yue He, Chen Chen, Jing Zhang, Juhua Liu, Fengxiang He, Chaoyue Wang, Bo Du

Comments: Accepted by AAAI-22

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1076] arXiv:2112.12917 [pdf, other]: Title: Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation

Authors: Zhiwei Liu, Xiangyu Zhu, Lu Yang, Xiang Yan, Ming Tang, Zhen Lei, Guibo Zhu, Xuetao Feng, Yan Wang, Jinqiao Wang

Comments: Accepted by ACM Multimedia 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1077] arXiv:2112.12925 [pdf, other]: Title: Not All Voxels Are Equal: Semantic Scene Completion from the Point-Voxel Perspective

Authors: Xiaokang Chen, Jiaxiang Tang, Jingbo Wang, Gang Zeng

Comments: Accepted to AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1078] arXiv:2112.12927 [pdf, other]: Title: Learning Aligned Cross-Modal Representation for Generalized Zero-Shot Classification

Authors: Zhiyu Fang, Xiaobin Zhu, Chun Yang, Zheng Han, Jingyan Qin, Xu-Cheng Yin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1079] arXiv:2112.12939 [pdf, other]: Title: Realtime Global Attention Network for Semantic Segmentation

Authors: Xi Mo, Xiangyu Chen

Comments: Ver1.0 for RA-L with ICRA presentation

Journal-ref: IEEE Robotics and Automation Letters 7(2022).1574-1580

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1080] arXiv:2112.12955 [pdf, ps, other]: Title: Deep ensembles in bioimage segmentation

Authors: Loris Nanni, Daniela Cuza, Alessandra Lumini, Andrea Loreggia, Sheryl Brahnam

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1081] arXiv:2112.12970 [pdf, other]: Title: SGTR: End-to-end Scene Graph Generation with Transformer

Authors: Rongjie Li, Songyang Zhang, Xuming He

Comments: Accepted by CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1082] arXiv:2112.12988 [pdf, other]: Title: iSeg3D: An Interactive 3D Shape Segmentation Tool

Authors: Sucheng Qian, Liu Liu, Wenqiang Xu, Cewu Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1083] arXiv:2112.12989 [pdf, other]: Title: Domain-Aware Continual Zero-Shot Learning

Authors: Kai Yi, Paul Janson, Wenxuan Zhang, Mohamed Elhoseiny

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1084] arXiv:2112.13002 [pdf, other]: Title: US-GAN: On the importance of Ultimate Skip Connection for Facial Expression Synthesis

Authors: Arbish Akram, Nazar Khan

Journal-ref: Multimed Tools Appl (2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1085] arXiv:2112.13003 [pdf, other]: Title: Continuous Spectral Reconstruction from RGB Images via Implicit Neural Representation

Authors: Ruikang Xu, Mingde Yao, Chang Chen, Lizhi Wang, Zhiwei Xiong

Comments: Accepted to ECCV Workshop 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1086] arXiv:2112.13018 [pdf, other]: Title: Benchmarking Pedestrian Odometry: The Brown Pedestrian Odometry Dataset (BPOD)

Authors: David Charatan, Hongyi Fan, Benjamin Kimia

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1087] arXiv:2112.13031 [pdf, other]: Title: Grounding Linguistic Commands to Navigable Regions

Authors: Nivedita Rufus, Kanishk Jain, Unni Krishnan R Nair, Vineet Gandhi, K Madhava Krishna

Journal-ref: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 8593-8600

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1088] arXiv:2112.13047 [pdf, other]: Title: Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation

Authors: Jiaxing Yan, Hong Zhao, Penghui Bu, YuSheng Jin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1089] arXiv:2112.13050 [pdf, other]: Title: Self-Gated Memory Recurrent Network for Efficient Scalable HDR Deghosting

Authors: K. Ram Prabhakar, Susmit Agrawal, R. Venkatesh Babu

Comments: 12 pages

Journal-ref: IEEE Transactions on Computational Imaging (Volume 7, 2021) 1228-1239

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1090] arXiv:2112.13060 [src]: Title: NIP: Neuron-level Inverse Perturbation Against Adversarial Attacks

Authors: Ruoxi Chen, Haibo Jin, Jinyin Chen, Haibin Zheng, Yue Yu, Shouling Ji

Comments: There are some problems in the figure so we need to withdraw this paper. We will upload the new version after revision

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1091] arXiv:2112.13076 [pdf, other]: Title: Virtuoso: Video-based Intelligence for real-time tuning on SOCs

Authors: Jayoung Lee, PengCheng Wang, Ran Xu, Venkat Dasari, Noah Weston, Yin Li, Saurabh Bagchi, Somali Chaterji

Comments: 28 pages, 15 figures, 4 tables, ACM-TODAES

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1092] arXiv:2112.13082 [pdf, other]: Title: Multi-Scale Feature Fusion: Learning Better Semantic Segmentation for Road Pothole Detection

Authors: Jiahe Fan, Mohammud J. Bocus, Brett Hosking, Rigen Wu, Yanan Liu, Sergey Vityazev, Rui Fan

Comments: 2021 IEEE International Conference on Autonomous Systems (ICAS)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1093] arXiv:2112.13085 [pdf, other]: Title: SimViT: Exploring a Simple Vision Transformer with sliding windows

Authors: Gang Li, Di Xu, Xing Cheng, Lingyu Si, Changwen Zheng

Comments: 7 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1094] arXiv:2112.13107 [pdf, other]: Title: Invertible Network for Unpaired Low-light Image Enhancement

Authors: Jize Zhang, Haolin Wang, Xiaohe Wu, Wangmeng Zuo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1095] arXiv:2112.13142 [pdf, other]: Title: Reconstructing Compact Building Models from Point Clouds Using Deep Implicit Fields

Authors: Zhaiyu Chen, Hugo Ledoux, Seyran Khademi, Liangliang Nan

Comments: Accepted for publication in ISPRS Journal of Photogrammetry and Remote Sensing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1096] arXiv:2112.13165 [pdf, other]: Title: Semantic Clustering based Deduction Learning for Image Recognition and Classification

Authors: Wenchi Ma, Xuemin Tu, Bo Luo, Guanghui Wang

Journal-ref: Pattern Recognition 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1097] arXiv:2112.13308 [pdf, other]: Title: Unsupervised Clustering Active Learning for Person Re-identification

Authors: Wenjing Gao, Minxian Li

Comments: This work was submitted to BMVC2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1098] arXiv:2112.13310 [pdf, other]: Title: Miti-DETR: Object Detection based on Transformers with Mitigatory Self-Attention Convergence

Authors: Wenchi Ma, Tianxiao Zhang, Guanghui Wang

Journal-ref: AAAI 2022 workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1099] arXiv:2112.13328 [pdf, other]: Title: Continuous Offline Handwriting Recognition using Deep Learning Models

Authors: Jorge Sueiras

Comments: 186 pages, 83 figures, thesis

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1100] arXiv:2112.13341 [pdf, other]: Title: AlertTrap: A study on object detection in remote insects trap monitoring system using on-the-edge deep learning platform

Authors: An D. Le, Duy A. Pham, Dong T. Pham, Hien B. Vo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1101] arXiv:2112.13465 [pdf, other]: Title: PreDisM: Pre-Disaster Modelling With CNN Ensembles for At-Risk Communities

Authors: Vishal Anand, Yuki Miura

Journal-ref: NeurIPS 2021 Workshop on Tackling Climate Change with Machine Learning

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[1102] arXiv:2112.13478 [pdf, other]: Title: Video Joint Modelling Based on Hierarchical Transformer for Co-summarization

Authors: Li Haopeng, Ke Qiuhong, Gong Mingming, Zhang Rui

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1103] arXiv:2112.13491 [pdf, other]: Title: A Compact Neural Network-based Algorithm for Robust Image Watermarking

Authors: Hong-Bo Xu, Rong Wang, Jia Wei, Shao-Ping Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1104] arXiv:2112.13492 [pdf, other]: Title: Vision Transformer for Small-Size Datasets

Authors: Seung Hoon Lee, Seunghyun Lee, Byung Cheol Song

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1105] arXiv:2112.13494 [pdf, ps, other]: Title: Estimating Parameters of the Tree Root in Heterogeneous Soil Environments via Mask-Guided Multi-Polarimetric Integration Neural Network

Authors: Hai-Han Sun, Yee Hui Lee, Qiqi Dai, Chongyi Li, Genevieve Ow, Mohamed Lokman Mohd Yusof, Abdulkadir C. Yucel

Comments: 14 pages, 12 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1106] arXiv:2112.13522 [pdf, other]: Title: Dual Contrastive Learning for General Face Forgery Detection

Authors: Ke Sun, Taiping Yao, Shen Chen, Shouhong Ding, Jilin L, Rongrong Ji

Comments: This paper was accepted by AAAI 2022 Conference on Artificial Intelligence

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1107] arXiv:2112.13528 [pdf, other]: Title: Learning Generative Vision Transformer with Energy-Based Latent Space for Saliency Prediction

Authors: Jing Zhang, Jianwen Xie, Nick Barnes, Ping Li

Comments: NeurIPS 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1108] arXiv:2112.13534 [pdf, other]: Title: Adversarial Attack for Asynchronous Event-based Data

Authors: Wooju Lee, Hyun Myung

Comments: 8 pages, 6 figures, Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1109] arXiv:2112.13538 [pdf, other]: Title: Meta-Learned Feature Critics for Domain Generalized Semantic Segmentation

Authors: Zu-Yun Shiau, Wei-Wei Lin, Ci-Siang Lin, Yu-Chiang Frank Wang

Comments: Accepted by ICIP 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1110] arXiv:2112.13539 [pdf, other]: Title: Few-Shot Classification in Unseen Domains by Episodic Meta-Learning Across Visual Domains

Authors: Yuan-Chia Cheng, Ci-Siang Lin, Fu-En Yang, Yu-Chiang Frank Wang

Comments: Accepted by ICIP 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1111] arXiv:2112.13540 [pdf, other]: Title: Image Edge Restoring Filter

Authors: Qian Liu, Yongpeng Li, Zhihang Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1112] arXiv:2112.13545 [pdf, other]: Title: ViR: the Vision Reservoir

Authors: Xian Wei, Bin Wang, Mingsong Chen, Ji Yuan, Hai Lan, Jiehuang Shi, Xuan Tang, Bo Jin, Guozhang Chen, Dongping Yang

Comments: 10 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1113] arXiv:2112.13547 [pdf, other]: Title: PRIME: A few primitives can boost robustness to common corruptions

Authors: Apostolos Modas, Rahul Rade, Guillermo Ortiz-Jiménez, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard

Comments: Code available at: this https URL

Journal-ref: European Conference on Computer Vision (ECCV) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1114] arXiv:2112.13548 [pdf, other]: Title: Responsive Listening Head Generation: A Benchmark Dataset and Baseline

Authors: Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei

Comments: Accepted by ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1115] arXiv:2112.13551 [pdf, other]: Title: Learning Robust and Lightweight Model through Separable Structured Transformations

Authors: Xian Wei, Yanhui Huang, Yangyu Xu, Mingsong Chen, Hai Lan, Yuanxiang Li, Zhongfeng Wang, Xuan Tang

Comments: 18 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1116] arXiv:2112.13565 [pdf, other]: Title: Hard Example Guided Hashing for Image Retrieval

Authors: Hai Su, Meiyin Han, Junle Liang, Jun Liang, Songsen Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1117] arXiv:2112.13583 [pdf, ps, other]: Title: Vegetation Stratum Occupancy Prediction from Airborne LiDAR 3D Point Clouds

Authors: Ekaterina Kalinicheva, Loic Landrieu, Clément Mallet, Nesrine Chehata

Journal-ref: SilviLaser 2021 Conference

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1118] arXiv:2112.13592 [pdf, other]: Title: Multimodal Image Synthesis and Editing: The Generative AI Era

Authors: Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Shijian Lu, Lingjie Liu, Adam Kortylewski, Christian Theobalt, Eric Xing

Comments: TPAMI 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1119] arXiv:2112.13608 [pdf, other]: Title: An Empirical Study of Adder Neural Networks for Object Detection

Authors: Xinghao Chen, Chang Xu, Minjing Dong, Chunjing Xu, Yunhe Wang

Journal-ref: NeurIPS 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1120] arXiv:2112.13635 [pdf, other]: Title: AdaptivePose: Human Parts as Adaptive Points

Authors: Yabo Xiao, Xiaojuan Wang, Dongdong Yu, Guoli Wang, Qian Zhang, Mingshu He

Comments: Accepted by AAAI 2022. Code will be released after the extension

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1121] arXiv:2112.13692 [pdf, other]: Title: Augmenting Convolutional networks with attention-based aggregation

Authors: Hugo Touvron, Matthieu Cord, Alaaeldin El-Nouby, Piotr Bojanowski, Armand Joulin, Gabriel Synnaeve, Hervé Jégou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1122] arXiv:2112.13697 [pdf, other]: Title: Weakly Supervised Visual-Auditory Fixation Prediction with Multigranularity Perception

Authors: Guotao Wang, Chenglizhao Chen, Deng-Ping Fan, Aimin Hao, Hong Qin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1123] arXiv:2112.13706 [pdf, other]: Title: Multi-Image Visual Question Answering

Authors: Harsh Raj, Janhavi Dadhania, Akhilesh Bhardwaj, Prabuchandran KJ

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1124] arXiv:2112.13707 [pdf, other]: Title: Visual Place Representation and Recognition from Depth Images

Authors: Farah Ibelaiden, Slimane Larabi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1125] arXiv:2112.13709 [pdf, other]: Title: Rethinking the Data Annotation Process for Multi-view 3D Pose Estimation with Active Learning and Self-Training

Authors: Qi Feng, Kun He, He Wen, Cem Keskin, Yuting Ye

Comments: IEEE WACV 2023 algorithms track. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1126] arXiv:2112.13715 [pdf, other]: Title: SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos

Authors: Ailing Zeng, Lei Yang, Xuan Ju, Jiefeng Li, Jianyi Wang, Qiang Xu

Comments: Accepted by ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1127] arXiv:2112.13727 [pdf, other]: Title: A Multi-channel Training Method Boost the Performance

Authors: Yingdong Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1128] arXiv:2112.13734 [pdf, ps, other]: Title: Multi-Domain Balanced Sampling Improves Out-of-Distribution Generalization of Chest X-ray Pathology Prediction Models

Authors: Enoch Tetteh, Joseph Viviano, Yoshua Bengio, David Krueger, Joseph Paul Cohen

Comments: MED-NEURIPS 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1129] arXiv:2112.13762 [pdf, other]: Title: MSeg: A Composite Dataset for Multi-domain Semantic Segmentation

Authors: John Lambert, Zhuang Liu, Ozan Sener, James Hays, Vladlen Koltun

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1130] arXiv:2112.13809 [pdf, other]: Title: Improving Deep Image Matting via Local Smoothness Assumption

Authors: Rui Wang, Jun Xie, Jiacheng Han, Dezhen Qi

Comments: 9 pages, accepted by IEEE ICME 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1131] arXiv:2112.13815 [pdf, other]: Title: Temporally Constrained Neural Networks (TCNN): A framework for semi-supervised video semantic segmentation

Authors: Deepak Alapatt, Pietro Mascagni, Armine Vardazaryan, Alain Garcia, Nariaki Okamoto, Didier Mutter, Jacques Marescaux, Guido Costamagna, Bernard Dallemagne, Nicolas Padoy

Comments: 10 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1132] arXiv:2112.13843 [pdf, other]: Title: BMPQ: Bit-Gradient Sensitivity Driven Mixed-Precision Quantization of DNNs from Scratch

Authors: Souvik Kundu, Shikai Wang, Qirui Sun, Peter A. Beerel, Massoud Pedram

Comments: 4 pages, 2 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1133] arXiv:2112.13845 [pdf, other]: Title: Raw Produce Quality Detection with Shifted Window Self-Attention

Authors: Oh Joon Kwon, Byungsoo Kim, Youngduck Choi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1134] arXiv:2112.13846 [pdf, ps, other]: Title: Algorithm for recognizing the contour of a honeycomb block

Authors: Maksim Viktorovich Kubrikov, Mikhail Vladimirovich Saramud, Ivan Alekseevich Paulin, Evgeniy Petrovich Talay

Comments: 11 pages, in Russian, 13 figures, ICMTMTE

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1135] arXiv:2112.13884 [pdf, other]: Title: A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision

Authors: Ajinkya Tejankar, Maziar Sanjabi, Bichen Wu, Saining Xie, Madian Khabsa, Hamed Pirsiavash, Hamed Firooz

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1136] arXiv:2112.13889 [pdf, other]: Title: Free-Viewpoint RGB-D Human Performance Capture and Rendering

Authors: Phong Nguyen-Ha, Nikolaos Sarafianos, Christoph Lassner, Janne Heikkila, Tony Tung

Comments: Accepted at ECCV 2022, Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1137] arXiv:2112.13890 [pdf, other]: Title: SPViT: Enabling Faster Vision Transformers via Soft Token Pruning

Authors: Zhenglun Kong, Peiyan Dong, Xiaolong Ma, Xin Meng, Mengshu Sun, Wei Niu, Xuan Shen, Geng Yuan, Bin Ren, Minghai Qin, Hao Tang, Yanzhi Wang

Comments: ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[1138] arXiv:2112.13891 [pdf, other]: Title: GPU-accelerated Faster Mean Shift with euclidean distance metrics

Authors: Le You, Han Jiang, Jinyong Hu, Chorng Chang, Lingxi Chen, Xintong Cui, Mengyang Zhao

Comments: 7 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:2007.14283

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1139] arXiv:2112.13906 [pdf, other]: Title: Does CLIP Benefit Visual Question Answering in the Medical Domain as Much as it Does in the General Domain?

Authors: Sedigheh Eslami, Gerard de Melo, Christoph Meinel

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[1140] arXiv:2112.13925 [pdf, other]: Title: Improving Depth Estimation using Location Information

Authors: Ahmed Zaitoon, Hossam El Din Abd El Munim, Hazem Abbas

Journal-ref: 2021 16th International Conference on Computer Engineering and Systems (ICCES)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[1141] arXiv:2112.13942 [pdf, other]: Title: PriFit: Learning to Fit Primitives Improves Few Shot Point Cloud Segmentation

Authors: Gopal Sharma, Bidya Dash, Aruni RoyChowdhury, Matheus Gadelha, Marios Loizou, Liangliang Cao, Rui Wang, Erik Learned-Miller, Subhransu Maji, Evangelos Kalogerakis

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1142] arXiv:2112.13953 [pdf, ps, other]: Title: Source Feature Compression for Object Classification in Vision-Based Underwater Robotics

Authors: Xueyuan Zhao, Mehdi Rahmati, Dario Pompili

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1143] arXiv:2112.13977 [pdf, other]: Title: Exploiting Fine-grained Face Forgery Clues via Progressive Enhancement Learning

Authors: Qiqi Gu, Shen Chen, Taiping Yao, Yang Chen, Shouhong Ding, Ran Yi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1144] arXiv:2112.13982 [pdf, other]: Title: Quaternion-based dynamic mode decomposition for background modeling in color videos

Authors: Juan Han, Kit Ian Kou, Jifei Miao

Comments: 16 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1145] arXiv:2112.13983 [pdf, other]: Title: Siamese Network with Interactive Transformer for Video Object Segmentation

Authors: Meng Lan, Jing Zhang, Fengxiang He, Lefei Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1146] arXiv:2112.13985 [pdf, other]: Title: LatteGAN: Visually Guided Language Attention for Multi-Turn Text-Conditioned Image Manipulation

Authors: Shoya Matsumori, Yuki Abe, Kosuke Shingyouchi, Komei Sugiura, Michita Imai

Journal-ref: IEEE Access, 9, 160521-160532 (2021)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1147] arXiv:2112.13986 [pdf, other]: Title: Deep-CNN based Robotic Multi-Class Under-Canopy Weed Control in Precision Farming

Authors: Yayun Du, Guofeng Zhang, Darren Tsang, M. Khalid Jawed

Comments: 8 pages, 7 figures, International Conference on Robotics and Automation (IEEE)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1148] arXiv:2112.13989 [pdf, other]: Title: Associative Adversarial Learning Based on Selective Attack

Authors: Runqi Wang, Xiaoyue Duan, Baochang Zhang, Song Xue, Wentao Zhu, David Doermann, Guodong Guo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1149] arXiv:2112.14000 [pdf, other]: Title: Pale Transformer: A General Vision Transformer Backbone with Pale-Shaped Attention

Authors: Sitong Wu, Tianyi Wu, Haoru Tan, Guodong Guo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1150] arXiv:2112.14015 [pdf, other]: Title: GuidedMix-Net: Semi-supervised Semantic Segmentation by Using Labeled Images as Reference

Authors: Peng Tu, Yawen Huang, Feng Zheng, Zhenyu He, Liujun Cao, Ling Shao

Comments: Accepted by AAAI'22. arXiv admin note: substantial text overlap with arXiv:2106.15064

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1151] arXiv:2112.14016 [pdf, other]: Title: Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking

Authors: Jin Gao, Yan Lu, Xiaojuan Qi, Yutong Kou, Bing Li, Liang Li, Shan Yu, Weiming Hu

Comments: Accepted by TPAMI. Extended version of the RLS-RTMDNet tracker (CVPR2020)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1152] arXiv:2112.14019 [pdf, other]: Title: Semi-supervised Salient Object Detection with Effective Confidence Estimation

Authors: Jiawei Liu, Jing Zhang, Nick Barnes

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1153] arXiv:2112.14023 [pdf, other]: Title: The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection

Authors: Zhikang Zou, Xiaoqing Ye, Liang Du, Xianhui Cheng, Xiao Tan, Li Zhang, Jianfeng Feng, Xiangyang Xue, Errui Ding

Comments: Accepted to ICCV 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1154] arXiv:2112.14025 [pdf, other]: Title: Delving into Probabilistic Uncertainty for Unsupervised Domain Adaptive Person Re-Identification

Authors: Jian Han, Ya-Li li, Shengjin Wang

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1155] arXiv:2112.14059 [pdf, other]: Title: DetarNet: Decoupling Translation and Rotation by Siamese Network for Point Cloud Registration

Authors: Zhi Chen, Fan Yang, Wenbing Tao

Comments: Accepted by AAAI-2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1156] arXiv:2112.14084 [pdf, other]: Title: Embodied Learning for Lifelong Visual Perception

Authors: David Nilsson, Aleksis Pirinen, Erik Gärtner, Cristian Sminchisescu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1157] arXiv:2112.14087 [pdf, other]: Title: APRIL: Finding the Achilles' Heel on Privacy for Vision Transformers

Authors: Jiahao Lu, Xi Sheryl Zhang, Tianli Zhao, Xiangyu He, Jian Cheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1158] arXiv:2112.14088 [pdf, other]: Title: Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation

Authors: Philipp Harzig, Moritz Einfalt, Rainer Lienhart

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1159] arXiv:2112.14100 [pdf, other]: Title: Extended Self-Critical Pipeline for Transforming Videos to Text (TRECVID-VTT Task 2021) -- Team: MMCUniAugsburg

Authors: Philipp Harzig, Moritz Einfalt, Katja Ludwig, Rainer Lienhart

Comments: TRECVID 2021 notebook paper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1160] arXiv:2112.14159 [pdf, other]: Title: Skin feature point tracking using deep feature encodings

Authors: Jose Ramon Chang, Torbjörn E.M. Nordling

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1161] arXiv:2112.14238 [pdf, other]: Title: AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition

Authors: Yulin Wang, Yang Yue, Yuanze Lin, Haojun Jiang, Zihang Lai, Victor Kulikov, Nikita Orlov, Humphrey Shi, Gao Huang

Comments: Accepted by CVPR-2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1162] arXiv:2112.14239 [pdf, other]: Title: TAGPerson: A Target-Aware Generation Pipeline for Person Re-identification

Authors: Kai Chen, Weihua Chen, Tao He, Rong Du, Fan Wang, Xiuyu Sun, Yuchen Guo, Guiguang Ding

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1163] arXiv:2112.14298 [pdf, ps, other]: Title: Multimodal perception for dexterous manipulation

Authors: Guanqun Cao, Shan Luo

Comments: 19 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1164] arXiv:2112.14316 [pdf, other]: Title: FRIDA -- Generative Feature Replay for Incremental Domain Adaptation

Authors: Sayan Rakshit, Anwesh Mohanty, Ruchika Chavhan, Biplab Banerjee, Gemma Roig, Subhasis Chaudhuri

Comments: Accepted at CVIU (7th January 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1165] arXiv:2112.14327 [pdf, other]: Title: Multi-Head Deep Metric Learning Using Global and Local Representations

Authors: Mohammad K. Ebrahimpour, Gang Qian, Allison Beach

Comments: To appear in WACV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1166] arXiv:2112.14331 [pdf, other]: Title: 360° Optical Flow using Tangent Images

Authors: Mingze Yuan, Christian Richardt

Comments: The 32nd British Machine Vision Conference (BMVC 2021)

Journal-ref: BMVC 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1167] arXiv:2112.14379 [pdf, other]: Title: Background-aware Classification Activation Map for Weakly Supervised Object Localization

Authors: Lei Zhu, Qi She, Qian Chen, Xiangxi Meng, Mufeng Geng, Lujia Jin, Zhe Jiang, Bin Qiu, Yunfei You, Yibao Zhang, Qiushi Ren, Yanye Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1168] arXiv:2112.14380 [pdf, other]: Title: Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification

Authors: Beier Zhu, Yulei Niu, Xian-Sheng Hua, Hanwang Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1169] arXiv:2112.14381 [pdf, other]: Title: COTReg:Coupled Optimal Transport based Point Cloud Registration

Authors: Guofeng Mei, Xiaoshui Huang, Litao Yu, Jian Zhang, Mohammed Bennamoun

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1170] arXiv:2112.14382 [pdf, other]: Title: Self-Supervised Robustifying Guidance for Monocular 3D Face Reconstruction

Authors: Hitika Tiwari, Min-Hung Chen, Yi-Min Tsai, Hsien-Kai Kuo, Hung-Jen Chen, Kevin Jou, K. S. Venkatesh, Yong-Sheng Chen

Comments: Accepted by The 33rd British Machine Vision Conference (BMVC) 2022. Evaluation code and datasets: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1171] arXiv:2112.14406 [pdf, other]: Title: Overcoming Mode Collapse with Adaptive Multi Adversarial Training

Authors: Karttikeya Mangalam, Rohin Garg

Comments: BMVC 2021 Poster

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1172] arXiv:2112.14420 [pdf, other]: Title: Invertible Image Dataset Protection

Authors: Kejiang Chen, Xianhan Zeng, Qichao Ying, Sheng Li, Zhenxing Qian, Xinpeng Zhang

Comments: Submitted to ICME 2022. Authors are from University of Science and Technology of China, Fudan University, China. A potential extended version of this work is under way

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1173] arXiv:2112.14440 [pdf, other]: Title: ACDNet: Adaptively Combined Dilated Convolution for Monocular Panorama Depth Estimation

Authors: Chuanqing Zhuang, Zhengda Lu, Yiqun Wang, Jun Xiao, Ying Wang

Comments: 13 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1174] arXiv:2112.14478 [pdf, other]: Title: Semantic Feature Extraction for Generalized Zero-shot Learning

Authors: Junhan Kim, Kyuhong Shim, Byonghyo Shim

Comments: Accepted at AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1175] arXiv:2112.14491 [pdf, other]: Title: Two-phase training mitigates class imbalance for camera trap image classification with CNNs

Authors: Farjad Malik, Simon Wouters, Ruben Cartuyvels, Erfan Ghadery, Marie-Francine Moens

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1176] arXiv:2112.14513 [pdf, other]: Title: Spatial Distribution Patterns of Clownfish in Recirculating Aquaculture Systems

Authors: Fahad Aljehani, Ibrahima N'Doye, Micaela S. Justo, John E. Majoris, Michael L. Berumen, Taous-Meriem Laleg-Kirati

Comments: 14 pages, 15 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[1177] arXiv:2112.14540 [src]: Title: Res2NetFuse: A Fusion Method for Infrared and Visible Images

Authors: Xu Song, Xiao-Jun Wu, Hui Li, Jun Sun, Vasile Palade

Comments: There are some errors that need to be corrected

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[1178] arXiv:2112.14651 [pdf, other]: Title: On the Instability of Relative Pose Estimation and RANSAC's Role

Authors: Hongyi Fan, Joe Kileel, Benjamin Kimia

Comments: 27 pages, 11 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
[1179] arXiv:2112.14656 [pdf, other]: Title: Gendered Differences in Face Recognition Accuracy Explained by Hairstyles, Makeup, and Facial Morphology

Authors: Vítor Albiero, Kai Zhang, Michael C. King, Kevin W. Bowyer

Comments: arXiv admin note: substantial text overlap with arXiv:2008.06989

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1180] arXiv:2112.14663 [pdf, other]: Title: MetaGraspNet_v0: A Large-Scale Benchmark Dataset for Vision-driven Robotic Grasping via Physics-based Metaverse Synthesis

Authors: Yuhao Chen, E. Zhixuan Zeng, Maximilian Gilles, Alexander Wong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1181] arXiv:2112.14683 [pdf, other]: Title: StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2

Authors: Ivan Skorokhodov, Sergey Tulyakov, Mohamed Elhoseiny

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1182] arXiv:2112.14757 [pdf, other]: Title: A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model

Authors: Mengde Xu, Zheng Zhang, Fangyun Wei, Yutong Lin, Yue Cao, Han Hu, Xiang Bai

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1183] arXiv:2112.14796 [pdf, ps, other]: Title: Deep Learning meets Liveness Detection: Recent Advancements and Challenges

Authors: Arian Sabaghi, Marzieh Oghbaie, Kooshan Hashemifard, Mohammad Akbari

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1184] arXiv:2112.14804 [pdf, other]: Title: Learning Spatially-Adaptive Squeeze-Excitation Networks for Image Synthesis and Image Recognition

Authors: Jianghao Shen, Tianfu Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1185] arXiv:2112.14894 [pdf, other]: Title: Feature Generation and Hypothesis Verification for Reliable Face Anti-Spoofing

Authors: Shice Liu, Shitao Lu, Hongyi Xu, Jing Yang, Shouhong Ding, Lizhuang Ma

Comments: Accepted by AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[1186] arXiv:2112.14931 [pdf, other]: Title: Dense Depth Estimation from Multiple 360-degree Images Using Virtual Depth

Authors: Seongyeop Yang, Kunhee Kim, Yeejin Lee

Comments: 16 pages, 11 figures, Applied Intelligence

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[1187] arXiv:2112.14934 [pdf, other]: Title: SFU-HW-Tracks-v1: Object Tracking Dataset on Raw Video Sequences

Authors: Takehiro Tanaka, Hyomin Choi, Ivan V. Bajić

Comments: 4 pages, 3 figures, submitted to Data in Brief

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1188] arXiv:2112.14968 [pdf, other]: Title: A Novel Generator with Auxiliary Branch for Improving GAN Performance

Authors: Seung Park, Yong-Goo Shin

Journal-ref: IEEE transactions on neural networks and learning systems 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1189] arXiv:2112.14971 [pdf, other]: Title: Contrastive Fine-grained Class Clustering via Generative Adversarial Networks

Authors: Yunji Kim, Jung-Woo Ha

Comments: ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1190] arXiv:2112.14976 [src]: Title: Contrastive Learning of Semantic and Visual Representations for Text Tracking

Authors: Zhuang Li, Weijia Wu, Mike Zheng Shou, Jiahong Li, Size Li, Zhongyuan Wang, Hong Zhou

Comments: Merge the paper with arXiv article 2207.08417. We will withdraw the two papers and create a new one

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1191] arXiv:2112.14983 [pdf, ps, other]: Title: Exploring the pattern of Emotion in children with ASD as an early biomarker through Recurring-Convolution Neural Network (R-CNN)

Authors: Abirami S P, Kousalya G, Karthick R

Comments: 18 pages, 8 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[1192] arXiv:2112.14985 [pdf, other]: Title: THE Benchmark: Transferable Representation Learning for Monocular Height Estimation

Authors: Zhitong Xiong, Wei Huang, Jingtao Hu, Xiao Xiang Zhu

Comments: 14 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1193] arXiv:2112.15012 [pdf, other]: Title: Investigating Pose Representations and Motion Contexts Modeling for 3D Motion Prediction

Authors: Zhenguang Liu, Shuang Wu, Shuyuan Jin, Shouling Ji, Qi Liu, Shijian Lu, Li Cheng

Comments: Accepted to IEEE TPAMI, 27 Dec. 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1194] arXiv:2112.15022 [pdf, other]: Title: Continually Learning Self-Supervised Representations with Projected Functional Regularization

Authors: Alex Gomez-Villa, Bartlomiej Twardowski, Lu Yu, Andrew D. Bagdanov, Joost van de Weijer

Comments: Accepted at Workshop on Continual Learning in Computer Vision (CVPR 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1195] arXiv:2112.15031 [pdf, other]: Title: Development of a face mask detection pipeline for mask-wearing monitoring in the era of the COVID-19 pandemic: A modular approach

Authors: Benjaphan Sommana, Ukrit Watchareeruetai, Ankush Ganguly, Samuel W.F. Earp, Taya Kitiyakara, Suparee Boonmanunt, Ratchainant Thammasudjarit

Comments: Accepted at the 19th International Joint Conference on Computer Science and Software Engineering (JCSSE 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1196] arXiv:2112.15075 [pdf, other]: Title: Pose Estimation of Specific Rigid Objects

Authors: Tomas Hodan

Comments: Tomas Hodan's PhD thesis defended on July 7, 2021. Supervisor: Prof. Jiri Matas. Reviewers: Prof. Vincent Lepetit, Prof. Markus Vincze, Dr. Slobodan Ilic. A recording of the defense: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
[1197] arXiv:2112.15085 [pdf, ps, other]: Title: Feature Extraction, Classification and Prediction for Hand Hygiene Gestures with KNN Algorithm

Authors: Rashmi Bakshi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1198] arXiv:2112.15091 [pdf, other]: Title: Leveraging in-domain supervision for unsupervised image-to-image translation tasks via multi-stream generators

Authors: Dvir Yerushalmi, Dov Danon, Amit H. Bermano

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1199] arXiv:2112.15093 [pdf, other]: Title: Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study

Authors: Haiyang Yu, Jingye Chen, Bin Li, Jianqi Ma, Mengnan Guan, Xixi Xu, Xiaocong Wang, Shaobo Qu, Xiangyang Xue

Comments: Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1200] arXiv:2112.15095 [pdf, other]: Title: A general technique for the estimation of farm animal body part weights from CT scans and its applications in a rabbit breeding program

Authors: Ádám Csóka, György Kovács, Virág Ács, Zsolt Matics, Zsolt Gerencsér, Zsolt Szendrő, István Nagy, Örs Petneházy, Imre Repa, Mariann Moizs, Tamás Donkó

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1201] arXiv:2112.15111 [pdf, other]: Title: Improving the Behaviour of Vision Transformers with Token-consistent Stochastic Layers

Authors: Nikola Popovic, Danda Pani Paudel, Thomas Probst, Luc Van Gool

Comments: This article is under consideration at the Computer Vision and Image Understanding journal

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1202] arXiv:2112.15139 [pdf, other]: Title: Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks

Authors: Runpei Dong, Zhanhong Tan, Mengdi Wu, Linfeng Zhang, Kaisheng Ma

Comments: Accepted at ICML 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1203] arXiv:2112.15188 [pdf, other]: Title: Towards Robustness of Neural Networks

Authors: Steven Basart

Comments: PhD Thesis

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1204] arXiv:2112.15202 [pdf, other]: Title: Visual and Object Geo-localization: A Comprehensive Survey

Authors: Daniel Wilson, Xiaohan Zhang, Waqas Sultani, Safwan Wshah

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1205] arXiv:2112.15283 [pdf, other]: Title: ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation

Authors: Han Zhang, Weichong Yin, Yewei Fang, Lanxin Li, Boqiang Duan, Zhihua Wu, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Comments: 15 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[1206] arXiv:2112.15324 [pdf, other]: Title: Deconfounded Visual Grounding

Authors: Jianqiang Huang, Yu Qin, Jiaxin Qi, Qianru Sun, Hanwang Zhang

Comments: AAAI 2022 Accepted

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[1207] arXiv:2112.15344 [pdf, other]: Title: P2P-Loc: Point to Point Tiny Person Localization

Authors: Xuehui Yu, Di Wu, Qixiang Ye, Jianbin Jiao, Zhenjun Han

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1208] arXiv:2112.15351 [pdf, other]: Title: Learning to Predict 3D Lane Shape and Camera Pose from a Single Image via Geometry Constraints

Authors: Ruijin Liu, Dapeng Chen, Tie Liu, Zhiliang Xiong, Zejian Yuan

Comments: 14 pages, 10 figures, accepted by AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1209] arXiv:2112.15355 [pdf, other]: Title: Sparse LiDAR Assisted Self-supervised Stereo Disparity Estimation

Authors: Xiaoming Zhao, Weihai Chen, Xingming Wu, Peter C. Y. Chen, Zhengguo Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1210] arXiv:2112.15358 [pdf, other]: Title: Conditional Generative Data-free Knowledge Distillation

Authors: Xinyi Yu, Ling Yan, Yang Yang, Libo Zhou, Linlin Ou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1211] arXiv:2112.15399 [pdf, other]: Title: InfoNeRF: Ray Entropy Minimization for Few-Shot Neural Volume Rendering

Authors: Mijeong Kim, Seonguk Seo, Bohyung Han

Comments: CVPR 2022, Website: this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
[1212] arXiv:2112.15439 [pdf, other]: Title: Facial-Sketch Synthesis: A New Challenge

Authors: Deng-Ping Fan, Ziling Huang, Peng Zheng, Hong Liu, Xuebin Qin, Luc Van Gool

Comments: Accepted to Machine Intelligence Research (MIR)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1213] arXiv:2112.15458 [pdf, other]: Title: Accurate and Real-time 3D Pedestrian Detection Using an Efficient Attentive Pillar Network

Authors: Duy-Tho Le, Hengcan Shi, Hamid Rezatofighi, Jianfei Cai

Comments: 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1214] arXiv:2112.15483 [pdf, other]: Title: Cloud Removal from Satellite Images

Authors: Rutvik Chauhan, Antarpuneet Singh, Sujoy Saha

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1215] arXiv:2112.15509 [pdf, other]: Title: Scene-Adaptive Attention Network for Crowd Counting

Authors: Xing Wei, Yuanrui Kang, Jihao Yang, Yunfeng Qiu, Dahu Shi, Wenming Tan, Yihong Gong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1216] arXiv:2112.15571 [pdf, other]: Title: PCACE: A Statistical Approach to Ranking Neurons for CNN Interpretability

Authors: Sílvia Casacuberta, Esra Suel, Seth Flaxman

Journal-ref: Responsible AI and DeepSpatial workshops at the 27th SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[1217] arXiv:2112.15589 [pdf, ps, other]: Title: 3-D Material Style Transfer for Reconstructing Unknown Appearance in Complex Natural Materials

Authors: Shashank Ranjan, Corey Toler-Franklin

Comments: 15 pages, 22 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1218] arXiv:2112.00007 (cross-list from cs.GR) [pdf, other]: Title: Sound-Guided Semantic Image Manipulation

Authors: Seung Hyun Lee, Wonseok Roh, Wonmin Byeon, Sang Ho Yoon, Chan Young Kim, Jinkyu Kim, Sangpil Kim

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1219] arXiv:2112.00133 (cross-list from cs.LG) [pdf, other]: Title: PokeBNN: A Binary Pursuit of Lightweight Accuracy

Authors: Yichi Zhang, Zhiru Zhang, Lukasz Lew

Comments: Accepted to CVPR 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1220] arXiv:2112.00171 (cross-list from cs.LG) [pdf, other]: Title: Improving Differentiable Architecture Search with a Generative Model

Authors: Ruisi Zhang, Youwei Liang, Sai Ashish Somayajula, Pengtao Xie

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1221] arXiv:2112.00190 (cross-list from cs.LG) [pdf, ps, other]: Title: Is the use of Deep Learning and Artificial Intelligence an appropriate means to locate debris in the ocean without harming aquatic wildlife?

Authors: Zoe Moorton, Zeyneb Kurt, Wai Lok Woo

Comments: reference list is added/updated; sorry for causing any inconveniences. 3681 words, 14 pages

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1222] arXiv:2112.00265 (cross-list from cs.LG) [pdf, other]: Title: Training BatchNorm Only in Neural Architecture Search and Beyond

Authors: Yichen Zhu, Jie Du, Yuqin Zhu, Yi Wang, Zhicai Ou, Feifei Feng, Jian Tang

Comments: 11 pages. Technical report

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1223] arXiv:2112.00305 (cross-list from cs.LG) [pdf, other]: Title: Forward Operator Estimation in Generative Models with Kernel Transfer Operators

Authors: Zhichun Huang, Rudrasis Chakraborty, Vikas Singh

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1224] arXiv:2112.00324 (cross-list from cs.LG) [pdf, ps, other]: Title: Optimizing for In-memory Deep Learning with Emerging Memory Technology

Authors: Zhehui Wang, Tao Luo, Rick Siow Mong Goh, Wei Zhang, Weng-Fai Wong

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET)
[1225] arXiv:2112.00378 (cross-list from cs.LG) [pdf, other]: Title: $\ell_\infty$-Robustness and Beyond: Unleashing Efficient Adversarial Training

Authors: Hadi M. Dolatabadi, Sarah Erfani, Christopher Leckie

Comments: Accepted to the 17th European Conference on Computer Vision (ECCV 2022)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1226] arXiv:2112.00584 (cross-list from cs.GR) [pdf, other]: Title: The Shape Part Slot Machine: Contact-based Reasoning for Generating 3D Shapes from Parts

Authors: Kai Wang, Paul Guerrero, Vladimir Kim, Siddhartha Chaudhuri, Minhyuk Sung, Daniel Ritchie

Comments: European Conference on Computer Vision (ECCV) 2022

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1227] arXiv:2112.00734 (cross-list from cs.LG) [pdf, other]: Title: Personalized Federated Learning with Adaptive Batchnorm for Healthcare

Authors: Wang Lu, Jindong Wang, Yiqiang Chen, Xin Qin, Renjun Xu, Dimitrios Dimitriadis, Tao Qin

Comments: Accepted by IEEE Transactions on Big Data; code: this https URL. arXiv admin note: substantial text overlap with arXiv:2106.01009

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1228] arXiv:2112.00739 (cross-list from cs.LG) [pdf, ps, other]: Title: Incomplete Multi-view Clustering via Cross-view Relation Transfer

Authors: Yiming Wang, Dongxia Chang, Zhiqiang Fu, Yao Zhao

Journal-ref: IEEE Transactions on Circuits and Systems for Video Technology, 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1229] arXiv:2112.01008 (cross-list from cs.LG) [pdf, other]: Title: Editing a classifier by rewriting its prediction rules

Authors: Shibani Santurkar, Dimitris Tsipras, Mahalaxmi Elango, David Bau, Antonio Torralba, Aleksander Madry

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1230] arXiv:2112.01010 (cross-list from cs.LG) [pdf, other]: Title: Differentiable Spatial Planning using Transformers

Authors: Devendra Singh Chaplot, Deepak Pathak, Jitendra Malik

Comments: Published at ICML 2021. See project webpage at this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1231] arXiv:2112.01283 (cross-list from cs.LG) [pdf, ps, other]: Title: Detecting Extratropical Cyclones of the Northern Hemisphere with Single Shot Detector

Authors: Minjing Shi, Pengfei He, Yuli Shi

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Atmospheric and Oceanic Physics (physics.ao-ph)
[1232] arXiv:2112.01406 (cross-list from cs.LG) [pdf, other]: Title: Active Learning for Domain Adaptation: An Energy-Based Approach

Authors: Binhui Xie, Longhui Yuan, Shuang Li, Chi Harold Liu, Xinjing Cheng, Guoren Wang

Comments: Camera ready for AAAI 2022. Code is available at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1233] arXiv:2112.01421 (cross-list from cs.LG) [pdf, other]: Title: Deep residential representations: Using unsupervised learning to unlock elevation data for geo-demographic prediction

Authors: Matthew Stevenson, Christophe Mues, Cristián Bravo

Comments: 29 pages, 13 figures. V2 - Published

Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing, 187, 378-392 (2022)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1234] arXiv:2112.01423 (cross-list from cs.LG) [pdf, other]: Title: Training Efficiency and Robustness in Deep Learning

Authors: Fartash Faghri

Comments: A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1235] arXiv:2112.01511 (cross-list from cs.RO) [pdf, other]: Title: The Surprising Effectiveness of Representation Learning for Visual Imitation

Authors: Jyothish Pari, Nur Muhammad Shafiullah, Sridhar Pandian Arunachalam, Lerrel Pinto

Comments: The first two authors contributed equally

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1236] arXiv:2112.01579 (cross-list from cs.GR) [pdf, other]: Title: Fast Neural Representations for Direct Volume Rendering

Authors: Sebastian Weiss, Philipp Hermüller, Rüdiger Westermann

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1237] arXiv:2112.01716 (cross-list from cs.LG) [pdf, other]: Title: Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research

Authors: Bernard Koch, Emily Denton, Alex Hanna, Jacob G. Foster

Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (stat.ML)
[1238] arXiv:2112.01790 (cross-list from cs.LG) [pdf, other]: Title: SSDL: Self-Supervised Dictionary Learning

Authors: Shuai Shao, Lei Xing, Wei Yu, Rui Xu, Yanjiang Wang, Baodi Liu

Comments: Accepted by 22nd IEEE International Conference on Multimedia and Expo (ICME) as an Oral

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1239] arXiv:2112.01806 (cross-list from cs.SD) [pdf, other]: Title: Music-to-Dance Generation with Optimal Transport

Authors: Shuang Wu, Shijian Lu, Li Cheng

Comments: IJCAI 2022

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1240] arXiv:2112.01832 (cross-list from cs.MM) [pdf, other]: Title: Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval

Authors: Fan Hu, Aozhu Chen, Ziyue Wang, Fangming Zhou, Jianfeng Dong, Xirong Li

Comments: Accepted by ECCV2022

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1241] arXiv:2112.01840 (cross-list from cs.RO) [pdf, other]: Title: Graph-Guided Deformation for Point Cloud Completion

Authors: Jieqi Shi, Lingyun Xu, Liang Heng, Shaojie Shen

Comments: RAL with IROS 2021

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1242] arXiv:2112.01849 (cross-list from cs.MM) [pdf, ps, other]: Title: Cross-modal Knowledge Distillation for Vision-to-Sensor Action Recognition

Authors: Jianyuan Ni, Raunak Sarbajna, Yang Liu, Anne H.H. Ngu, Yan Yan

Comments: 5 pages, 2 figures, submitted to ICASSP2022

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1243] arXiv:2112.01917 (cross-list from cs.LG) [pdf, other]: Title: A Structured Dictionary Perspective on Implicit Neural Representations

Authors: Gizem Yüce, Guillermo Ortiz-Jiménez, Beril Besbinar, Pascal Frossard

Comments: Accepted to IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (26 pages, 16 figures)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1244] arXiv:2112.02086 (cross-list from cs.LG) [pdf, other]: Title: Data-Free Neural Architecture Search via Recursive Label Calibration

Authors: Zechun Liu, Zhiqiang Shen, Yun Long, Eric Xing, Kwang-Ting Cheng, Chas Leichner

Comments: ECCV 2022

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1245] arXiv:2112.02094 (cross-list from cs.RO) [pdf, other]: Title: Coupling Vision and Proprioception for Navigation of Legged Robots

Authors: Zipeng Fu, Ashish Kumar, Ananye Agarwal, Haozhi Qi, Jitendra Malik, Deepak Pathak

Comments: CVPR 2022 final version. Website at this https URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1246] arXiv:2112.02488 (cross-list from cs.LG) [pdf, other]: Title: Exploring Complicated Search Spaces with Interleaving-Free Sampling

Authors: Yunjie Tian, Lingxi Xie, Jiemin Fang, Jianbin Jiao, Qixiang Ye, Qi Tian

Comments: 9 pages, 8 figures, 6 tables

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1247] arXiv:2112.02735 (cross-list from cs.RO) [pdf, other]: Title: A Dataset of Stationary, Fixed-wing Aircraft on a Collision Course for Vision-Based Sense and Avoid

Authors: Jasmin Martin, Jenna Riseley, Jason J. Ford

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1248] arXiv:2112.02849 (cross-list from cs.RO) [pdf, other]: Title: DemoGrasp: Few-Shot Learning for Robotic Grasping with Human Demonstration

Authors: Pengyuan Wang, Fabian Manhardt, Luca Minciullo, Lorenzo Garattoni, Sven Meier, Nassir Navab, Benjamin Busam

Comments: Accepted by IROS 2021

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1249] arXiv:2112.02880 (cross-list from cs.LG) [pdf, other]: Title: AdaSTE: An Adaptive Straight-Through Estimator to Train Binary Neural Networks

Authors: Huu Le, Rasmus Kjær Høier, Che-Tsung Lin, Christopher Zach

Comments: 18 pages

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1250] arXiv:2112.03028 (cross-list from cs.CV) [pdf, other]: Title: D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions

Authors: Sammy Christen, Muhammed Kocabas, Emre Aksan, Jemin Hwangbo, Jie Song, Otmar Hilliges

Comments: CVPR-2022 camera ready. Project page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[1251] arXiv:2112.03030 (cross-list from cs.RO) [pdf, other]: Title: Pose2Room: Understanding 3D Scenes from Human Activities

Authors: Yinyu Nie, Angela Dai, Xiaoguang Han, Matthias Nießner

Comments: Accepted by ECCV'2022; Project page: this https URL; Video: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1252] arXiv:2112.03052 (cross-list from cs.LG) [pdf, other]: Title: Scaling Up Influence Functions

Authors: Andrea Schioppa, Polina Zablotskaia, David Vilar, Artem Sokolov

Comments: Published at AAAI-22

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1253] arXiv:2112.03134 (cross-list from cs.LG) [pdf, other]: Title: Prototypical Model with Novel Information-theoretic Loss Function for Generalized Zero Shot Learning

Authors: Chunlin Ji, Hanchu Shen, Zhan Xiong, Feng Chen, Meiying Zhang, Huiwen Yang

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1254] arXiv:2112.03227 (cross-list from cs.RO) [pdf, other]: Title: CALVIN: A Benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Authors: Oier Mees, Lukas Hermann, Erick Rosete-Beas, Wolfram Burgard

Comments: Accepted for publication at IEEE Robotics and Automation Letters (RAL). Code, models and dataset available at this http URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1255] arXiv:2112.03257 (cross-list from cs.LG) [pdf, other]: Title: Functional Regularization for Reinforcement Learning via Learned Fourier Features

Authors: Alexander C. Li, Deepak Pathak

Comments: Accepted at NeurIPS 2021. Website at this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO)
[1256] arXiv:2112.03269 (cross-list from cs.HC) [pdf, other]: Title: DIY Graphics Tab: A Cost-Effective Alternative to Graphics Tablet for Educators

Authors: Mohammad Imrul Jubair, Arafat Ibne Yousuf, Tashfiq Ahmed, Hasanath Jamy, Foisal Reza, Mohsena Ashraf

Comments: Accepted in AAAI2022 workshop

Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
[1257] arXiv:2112.03321 (cross-list from cs.LG) [pdf, other]: Title: Noether Networks: Meta-Learning Useful Conserved Quantities

Authors: Ferran Alet, Dylan Doblar, Allan Zhou, Joshua Tenenbaum, Kenji Kawaguchi, Chelsea Finn

Comments: Accepted to NeurIPS '21. The first two authors contributed equally

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1258] arXiv:2112.03371 (cross-list from cs.LG) [pdf, other]: Title: Graphical Models with Attention for Context-Specific Independence and an Application to Perceptual Grouping

Authors: Guangyao Zhou, Wolfgang Lehrach, Antoine Dedieu, Miguel Lázaro-Gredilla, Dileep George

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1259] arXiv:2112.03379 (cross-list from cs.LG) [pdf, other]: Title: Deep Efficient Continuous Manifold Learning for Time Series Modeling

Authors: Seungwoo Jeong, Wonjun Ko, Ahmad Wisnu Mulyadi, Heung-Il Suk

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1260] arXiv:2112.03398 (cross-list from cs.LG) [pdf, other]: Title: Top-Down Deep Clustering with Multi-generator GANs

Authors: Daniel de Mello, Renato Assunção, Fabricio Murai

Comments: Accepted to AAAI 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1261] arXiv:2112.03406 (cross-list from cs.LG) [pdf, other]: Title: Equal Bits: Enforcing Equally Distributed Binary Network Weights

Authors: Yunqiang Li, Silvia L. Pintea, Jan C. van Gemert

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1262] arXiv:2112.03476 (cross-list from cs.CR) [pdf, other]: Title: Defending against Model Stealing via Verifying Embedded External Features

Authors: Yiming Li, Linghui Zhu, Xiaojun Jia, Yong Jiang, Shu-Tao Xia, Xiaochun Cao

Comments: This work was accepted by AAAI 2022. The first two authors contributed equally to this work. 11 pages

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1263] arXiv:2112.03502 (cross-list from cs.LG) [pdf, other]: Title: A Generic Approach for Enhancing GANs by Regularized Latent Optimization

Authors: Yufan Zhou, Chunyuan Li, Changyou Chen, Jinhui Xu

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1264] arXiv:2112.03676 (cross-list from cs.LG) [pdf, other]: Title: PLACE dropout: A Progressive Layer-wise and Channel-wise Dropout for Domain Generalization

Authors: Jintao Guo, Lei Qi, Yinghuan Shi, Yang Gao

Comments: Accepted by ACM TOMM 2023. The code is available at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1265] arXiv:2112.03678 (cross-list from cs.CR) [pdf, other]: Title: Does Proprietary Software Still Offer Protection of Intellectual Property in the Age of Machine Learning? -- A Case Study using Dual Energy CT Data

Authors: Andreas Maier, Seung Hee Yang, Farhad Maleki, Nikesh Muthukrishnan, Reza Forghani

Comments: 6 pages, 2 figures, 1 table, accepted at BVM 2022

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
[1266] arXiv:2112.03695 (cross-list from cs.CR) [pdf, other]: Title: Safe Distillation Box

Authors: Jingwen Ye, Yining Mao, Jie Song, Xinchao Wang, Cheng Jin, Mingli Song

Comments: Accepted by AAAI2022

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1267] arXiv:2112.03908 (cross-list from cs.RO) [pdf, other]: Title: Causal Imitative Model for Autonomous Driving

Authors: Mohammad Reza Samsami, Mohammadhossein Bahari, Saber Salehkaleybar, Alexandre Alahi

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1268] arXiv:2112.04014 (cross-list from cs.LG) [pdf, other]: Title: Unsupervised Representation Learning via Neural Activation Coding

Authors: Yookoon Park, Sangho Lee, Gunhee Kim, David M. Blei

Comments: Published in International Conference on Machine Learning (ICML), 2021

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1269] arXiv:2112.04350 (cross-list from cs.RO) [pdf, other]: Title: Transformer based trajectory prediction

Authors: Aleksey Postnikov, Aleksander Gamayunov, Gonzalo Ferrer

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1270] arXiv:2112.04468 (cross-list from cs.LG) [pdf, other]: Title: Revisiting Contrastive Learning through the Lens of Neighborhood Component Analysis: an Integrated Framework

Authors: Ching-Yun Ko, Jeet Mohapatra, Sijia Liu, Pin-Yu Chen, Luca Daniel, Lily Weng

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1271] arXiv:2112.04558 (cross-list from cs.CR) [pdf, ps, other]: Title: SoK: Anti-Facial Recognition Technology

Authors: Emily Wenger, Shawn Shan, Haitao Zheng, Ben Y. Zhao

Comments: Camera-ready version for Oakland S&P 2023

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1272] arXiv:2112.04684 (cross-list from cs.RO) [pdf, other]: Title: Trajectory-Constrained Deep Latent Visual Attention for Improved Local Planning in Presence of Heterogeneous Terrain

Authors: Stefan Wapnick, Travis Manderson, David Meger, Gregory Dudek

Comments: Published in International Conference on Intelligent Robots and Systems (IROS) 2021 proceedings. Project website: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1273] arXiv:2112.04758 (cross-list from cs.LG) [pdf, other]: Title: Does Redundancy in AI Perception Systems Help to Test for Super-Human Automated Driving Performance?

Authors: Hanno Gottschalk, Matthias Rottmann, Maida Saltagic

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1274] arXiv:2112.04766 (cross-list from cs.LG) [pdf, other]: Title: Adaptive Methods for Aggregated Domain Generalization

Authors: Xavier Thomas, Dhruv Mahajan, Alex Pentland, Abhimanyu Dubey

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1275] arXiv:2112.04895 (cross-list from cs.LG) [pdf, other]: Title: Latent Space Explanation by Intervention

Authors: Itai Gat, Guy Lorberbom, Idan Schwartz, Tamir Hazan

Comments: Accepted to AAAI22

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1276] arXiv:2112.04902 (cross-list from cs.LG) [pdf, other]: Title: Learning Personal Representations from fMRI by Predicting Neurofeedback Performance

Authors: Jhonathan Osin, Lior Wolf, Guy Gurevitch, Jackob Nimrod Keynan, Tom Fruchtman-Steinbok, Ayelet Or-Borichev, Shira Reznik Balter, Talma Hendler

Journal-ref: MICCAI 2020, https://link.springer.com/chapter/10.1007/978-3-030-59728-3_46

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1277] arXiv:2112.04910 (cross-list from cs.RO) [pdf, other]: Title: Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings

Authors: Mel Vecerik, Jackie Kay, Raia Hadsell, Lourdes Agapito, Jon Scholz

Comments: Supplementary material available at: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1278] arXiv:2112.05005 (cross-list from cs.LG) [pdf, other]: Title: Mutual Adversarial Training: Learning together is better than going alone

Authors: Jiang Liu, Chun Pong Lau, Hossein Souri, Soheil Feizi, Rama Chellappa

Comments: Under submission

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1279] arXiv:2112.05090 (cross-list from cs.LG) [pdf, other]: Title: Extending the WILDS Benchmark for Unsupervised Adaptation

Authors: Shiori Sagawa, Pang Wei Koh, Tony Lee, Irena Gao, Sang Michael Xie, Kendrick Shen, Ananya Kumar, Weihua Hu, Michihiro Yasunaga, Henrik Marklund, Sara Beery, Etienne David, Ian Stavness, Wei Guo, Jure Leskovec, Kate Saenko, Tatsunori Hashimoto, Sergey Levine, Chelsea Finn, Percy Liang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1280] arXiv:2112.05124 (cross-list from cs.RO) [pdf, other]: Title: Neural Descriptor Fields: SE(3)-Equivariant Object Representations for Manipulation

Authors: Anthony Simeonov, Yilun Du, Andrea Tagliasacchi, Joshua B. Tenenbaum, Alberto Rodriguez, Pulkit Agrawal, Vincent Sitzmann

Comments: Website: this https URL. First two authors contributed equally (order determined by coin flip), last two authors equal advising

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1281] arXiv:2112.05135 (cross-list from cs.LG) [pdf, other]: Title: PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures

Authors: Dan Hendrycks, Andy Zou, Mantas Mazeika, Leonard Tang, Bo Li, Dawn Song, Jacob Steinhardt

Comments: CVPR 2022. Code and models are available at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1282] arXiv:2112.05282 (cross-list from cs.LG) [pdf, other]: Title: RamBoAttack: A Robust Query Efficient Deep Neural Network Decision Exploit

Authors: Viet Quoc Vo, Ehsan Abbasnejad, Damith C. Ranasinghe

Comments: Published in Network and Distributed System Security (NDSS) Symposium 2022. Code is available at this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1283] arXiv:2112.05322 (cross-list from cs.AR) [pdf, ps, other]: Title: Dynamic hardware system for cascade SVM classification of melanoma

Authors: Shereen Afifi, Hamid GholamHosseini, Roopak Sinha

Comments: Journal paper, 9 pages, 4 figures, 4 tables

Journal-ref: Neural Computing & Applications 32 (2020) pp.1777-1788

Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1284] arXiv:2112.05419 (cross-list from cs.AI) [pdf, other]: Title: Predicting Physical World Destinations for Commands Given to Self-Driving Cars

Authors: Dusan Grujicic, Thierry Deruyttere, Marie-Francine Moens, Matthew Blaschko

Comments: Accepted at AAAI 2022. First two authors have contributed equally. Extended camera-ready version including the appendix and references to it in the main text

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[1285] arXiv:2112.05493 (cross-list from cs.LG) [pdf, other]: Title: Network Compression via Central Filter

Authors: Yuanzhi Duan, Xiaofang Hu, Yue Zhou, Qiang Liu, Shukai Duan

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1286] arXiv:2112.05534 (cross-list from cs.RO) [pdf, other]: Title: An Embarrassingly Pragmatic Introduction to Vision-based Autonomous Robots

Authors: Marcos V. Conde

Comments: CS Thesis. Lecture Notes in Computer Science

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1287] arXiv:2112.05634 (cross-list from cs.LG) [pdf, other]: Title: Preemptive Image Robustification for Protecting Users against Man-in-the-Middle Adversarial Attacks

Authors: Seungyong Moon, Gaon An, Hyun Oh Song

Comments: Accepted and to appear at AAAI 2022

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1288] arXiv:2112.05657 (cross-list from cs.AI) [pdf, ps, other]: Title: Artificial Intelligence -- Application in Life Sciences and Beyond. The Upper Rhine Artificial Intelligence Symposium UR-AI 2021

Authors: Karl-Herbert Schäfer (1), Franz Quint (2) ((1) Kaiserslautern University of Applied Sciences, (2) Karlsruhe University of Applied Sciences)

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[1289] arXiv:2112.05872 (cross-list from cs.LG) [pdf, other]: Title: SLOSH: Set LOcality Sensitive Hashing via Sliced-Wasserstein Embeddings

Authors: Yuzhe Lu, Xinran Liu, Andrea Soltoggio, Soheil Kolouri

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1290] arXiv:2112.06102 (cross-list from cs.NE) [pdf, other]: Title: NeuroHSMD: Neuromorphic Hybrid Spiking Motion Detector

Authors: Pedro Machado, Joao Filipe Ferreira, Andreas Oikonomou, T.M. McGinnity

Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1291] arXiv:2112.06132 (cross-list from cs.LG) [pdf, other]: Title: Periodic Residual Learning for Crowd Flow Forecasting

Authors: Chengxin Wang, Yuxuan Liang, Gary Tan

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1292] arXiv:2112.06511 (cross-list from cs.LG) [pdf, other]: Title: Ex-Model: Continual Learning from a Stream of Trained Models

Authors: Antonio Carta, Andrea Cossu, Vincenzo Lomonaco, Davide Bacciu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1293] arXiv:2112.06539 (cross-list from cs.RO) [pdf, other]: Title: MinkLoc3D-SI: 3D LiDAR place recognition with sparse convolutions, spherical coordinates, and intensity

Authors: Kamil Żywanowski, Adam Banaszczyk, Michał R. Nowicki, Jacek Komorowski

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1294] arXiv:2112.06658 (cross-list from cs.LG) [pdf, other]: Title: Learning to Learn Transferable Attack

Authors: Shuman Fang, Jie Li, Xianming Lin, Rongrong Ji

Comments: AAAI 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1295] arXiv:2112.06772 (cross-list from cs.AR) [pdf, other]: Title: hARMS: A Hardware Acceleration Architecture for Real-Time Event-Based Optical Flow

Authors: Daniel C. Stumpp, Himanshu Akolkar, Alan D. George, Ryad B. Benosman

Comments: 18 pages, 16 figures, 4 tables

Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV)
[1296] arXiv:2112.06888 (cross-list from cs.CL) [pdf, other]: Title: Improving and Diagnosing Knowledge-Based Visual Question Answering via Entity Enhanced Knowledge Injection

Authors: Diego Garcia-Olano, Yasumasa Onoe, Joydeep Ghosh

Journal-ref: Proceedings of the 1st International Workshop on Multimodal Understanding for the Web and Social Media, co-located with the Web Conference 2022 (WWW '22 Companion), April 25--29, 2022, Virtual Event, Lyon, France

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1297] arXiv:2112.07022 (cross-list from cs.GR) [pdf, other]: Title: Learning Body-Aware 3D Shape Generative Models

Authors: Bryce Blinn, Alexander Ding, R. Kenny Jones, Manolis Savva, Srinath Sridhar, Daniel Ritchie

Comments: 11 pages, 8 figures

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1298] arXiv:2112.07087 (cross-list from cs.NE) [pdf, other]: Title: Heuristic Hyperparameter Optimization for Convolutional Neural Networks using Genetic Algorithm

Authors: Meng Zhou

Comments: 8 pages, 3 figures

Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1299] arXiv:2112.07207 (cross-list from cs.IT) [pdf, other]: Title: Modeling Image Quantization Tradeoffs for Optimal Compression

Authors: Johnathan Chiu

Subjects: Information Theory (cs.IT); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1300] arXiv:2112.07214 (cross-list from cs.SD) [pdf, other]: Title: Noise Reduction and Driving Event Extraction Method for Performance Improvement on Driving Noise-based Surface Anomaly Detection

Authors: YeongHyeon Park, JoonSung Lee, Myung Jin Kim, Wonseok Park

Comments: 3 pages, 3 figures, 2 tables

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1301] arXiv:2112.07368 (cross-list from cs.LG) [pdf, other]: Title: Simple and Robust Loss Design for Multi-Label Learning with Missing Labels

Authors: Youcai Zhang, Yuhao Cheng, Xinyu Huang, Fei Wen, Rui Feng, Yaqian Li, Yandong Guo

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1302] arXiv:2112.07443 (cross-list from cs.CL) [pdf, other]: Title: Text Classification Models for Form Entity Linking

Authors: María Villota, César Domínguez, Jónathan Heras, Eloy Mata, Vico Pascual

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1303] arXiv:2112.07566 (cross-list from cs.CL) [pdf, other]: Title: VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena

Authors: Letitia Parcalabescu, Michele Cafagna, Lilitta Muradjan, Anette Frank, Iacer Calixto, Albert Gatt

Comments: Paper accepted for publication at ACL 2022 Main; 28 pages, 4 figures, 11 tables

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1304] arXiv:2112.07723 (cross-list from cs.RO) [pdf, other]: Title: Autonomous Navigation System from Simultaneous Localization and Mapping

Authors: Micheal Caracciolo, Owen Casciotti, Christopher Lloyd, Ernesto Sola-Thomas, Matthew Weaver, Kyle Bielby, Md Abdul Baset Sarker, Masudul H. Imtiaz

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1305] arXiv:2112.08060 (cross-list from cs.LG) [pdf, other]: Title: Leveraging Image-based Generative Adversarial Networks for Time Series Generation

Authors: Justin Hellermann, Stefan Lessmann

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1306] arXiv:2112.08132 (cross-list from cs.LG) [pdf, other]: Title: Improving Self-supervised Learning with Automated Unsupervised Outlier Arbitration

Authors: Yu Wang, Jingyang Lin, Jingjing Zou, Yingwei Pan, Ting Yao, Tao Mei

Comments: NeurIPS 2021; Code is publicly available at: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1307] arXiv:2112.08363 (cross-list from cs.LG) [pdf, other]: Title: Performance or Trust? Why Not Both. Deep AUC Maximization with Self-Supervised Learning for COVID-19 Chest X-ray Classifications

Authors: Siyuan He, Pengcheng Xi, Ashkan Ebadi, Stephane Tremblay, Alexander Wong

Comments: 3 pages

Journal-ref: Published at CVIS 2021: 7th Annual Conference on Vision and Intelligent Systems

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1308] arXiv:2112.08370 (cross-list from cs.LG) [pdf, other]: Title: Lifelong Generative Modelling Using Dynamic Expansion Graph Model

Authors: Fei Ye, Adrian G. Bors

Comments: Accepted in Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI 2022)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1309] arXiv:2112.08470 (cross-list from cs.CL) [pdf, other]: Title: Insta-VAX: A Multimodal Benchmark for Anti-Vaccine and Misinformation Posts Detection on Social Media

Authors: Mingyang Zhou, Mahasweta Chakraborti, Sijia Qian, Zhou Yu, Jingwen Zhang

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1310] arXiv:2112.08538 (cross-list from cs.LG) [pdf, other]: Title: Visualizing the Loss Landscape of Winning Lottery Tickets

Authors: Robert Bain

Comments: 7 pages, 7 figures, 1 algorithm/pseudocode

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1311] arXiv:2112.08654 (cross-list from cs.LG) [pdf, other]: Title: Learning to Prompt for Continual Learning

Authors: Zifeng Wang, Zizhao Zhang, Chen-Yu Lee, Han Zhang, Ruoxi Sun, Xiaoqi Ren, Guolong Su, Vincent Perot, Jennifer Dy, Tomas Pfister

Comments: Published at CVPR 2022 as a conference paper

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1312] arXiv:2112.08723 (cross-list from cs.CL) [pdf, other]: Title: Distilled Dual-Encoder Model for Vision-Language Understanding

Authors: Zekun Wang, Wenhui Wang, Haichao Zhu, Ming Liu, Bing Qin, Furu Wei

Comments: EMNLP 2022

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1313] arXiv:2112.08854 (cross-list from cs.RO) [pdf, other]: Title: Multi-Camera LiDAR Inertial Extension to the Newer College Dataset

Authors: Lintong Zhang, Marco Camurri, David Wisth, Maurice Fallon

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1314] arXiv:2112.08995 (cross-list from cs.SD) [pdf, other]: Title: Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer

Authors: Yanpeng Zhao, Jack Hessel, Youngjae Yu, Ximing Lu, Rowan Zellers, Yejin Choi

Comments: Accepted to NAACL 2022. Our code is available at this https URL

Subjects: Sound (cs.SD); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1315] arXiv:2112.09060 (cross-list from cs.SD) [pdf, other]: Title: Towards Robust Real-time Audio-Visual Speech Enhancement

Authors: Mandar Gogate, Kia Dashtipour, Amir Hussain

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1316] arXiv:2112.09153 (cross-list from cs.LG) [pdf, other]: Title: An Empirical Investigation of the Role of Pre-training in Lifelong Learning

Authors: Sanket Vaibhav Mehta, Darshan Patil, Sarath Chandar, Emma Strubell

Journal-ref: Journal of Machine Learning Research 24 (2023) 1-50

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1317] arXiv:2112.09567 (cross-list from cs.CG) [pdf, ps, other]: Title: LTB curves with Lipschitz turn are par-regular

Authors: Etienne Le Quentrec (AMU), Loïc Mazo (UNISTRA), Étienne Baudrier (UNISTRA), Mohamed Tajine (UNISTRA)

Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Discrete Mathematics (cs.DM)
[1318] arXiv:2112.09668 (cross-list from cs.LG) [pdf, other]: Title: Deep Learning for Spatiotemporal Modeling of Urbanization

Authors: Tang Li, Jing Gao, Xi Peng

Comments: Accepted by NeurIPS 2021 MLPH (Machine Learning in Public Health) Workshop; Best Paper Awarded by NeurIPS 2021 MLPH (Machine Learning in Public Health) Workshop

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1319] arXiv:2112.09693 (cross-list from cs.LG) [pdf, other]: Title: Generalisation effects of predictive uncertainty estimation in deep learning for digital pathology

Authors: Milda Pocevičiūtė, Gabriel Eilertsen, Sofia Jarkman, Claes Lundström

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1320] arXiv:2112.09726 (cross-list from cs.SD) [pdf, other]: Title: Soundify: Matching Sound Effects to Video

Authors: David Chuan-En Lin, Anastasis Germanidis, Cristóbal Valenzuela, Yining Shi, Nikolas Martelaro

Comments: Full paper in UIST 2023; Short paper in NeurIPS 2021 ML4CD Workshop; Online demo: this http URL

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1321] arXiv:2112.09741 (cross-list from cs.LG) [pdf, other]: Title: Neurashed: A Phenomenological Model for Imitating Deep Learning Training

Authors: Weijie J. Su

Comments: 8 pages

Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1322] arXiv:2112.09802 (cross-list from cs.LG) [pdf, other]: Title: Automated Domain Discovery from Multiple Sources to Improve Zero-Shot Generalization

Authors: Kowshik Thopalli, Sameeksha Katoch, Pavan Turaga, Jayaraman J. Thiagarajan

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1323] arXiv:2112.09808 (cross-list from math.NA) [pdf, other]: Title: Direct simple computation of middle surface between 3D point clouds and/or discrete surfaces by tracking sources in distance function calculation algorithms

Authors: Balazs Kosa, Karol Mikula

Subjects: Numerical Analysis (math.NA); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[1324] arXiv:2112.10017 (cross-list from cs.LG) [pdf, other]: Title: Continual Learning of a Mixed Sequence of Similar and Dissimilar Tasks

Authors: Zixuan Ke, Bing Liu, Xingchang Huang

Journal-ref: NeurIPS 2020

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1325] arXiv:2112.10065 (cross-list from cs.DC) [pdf, other]: Title: Efficient Strong Scaling Through Burst Parallel Training

Authors: Seo Jin Park, Joshua Fried, Sunghyun Kim, Mohammad Alizadeh, Adam Belay

Comments: MLSys'22

Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1326] arXiv:2112.10138 (cross-list from math.NA) [pdf, other]: Title: Anisotropic mesh adaptation for region-based segmentation accounting for image spatial information

Authors: Matteo Giacomini, Simona Perotto

Comments: 41 pages, 13 figures, 5 tables

Journal-ref: Computers & Mathematics with Applications, 121, 1-17 (2022)

Subjects: Numerical Analysis (math.NA); Computer Vision and Pattern Recognition (cs.CV)
[1327] arXiv:2112.10139 (cross-list from cs.LG) [pdf, other]: Title: Denoised Labels for Financial Time-Series Data via Self-Supervised Learning

Authors: Yanqing Ma, Carmine Ventre, Maria Polukarov

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Statistical Finance (q-fin.ST)
[1328] arXiv:2112.10143 (cross-list from cs.RO) [pdf, other]: Title: RoboAssembly: Learning Generalizable Furniture Assembly Policy in a Novel Multi-robot Contact-rich Simulation Environment

Authors: Mingxin Yu, Lin Shao, Zhehuan Chen, Tianhao Wu, Qingnan Fan, Kaichun Mo, Hao Dong

Comments: Submitted to IEEE International Conference on Robotics and Automation (ICRA) 2022

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[1329] arXiv:2112.10384 (cross-list from cs.LG) [pdf, other]: Title: Multimodal Adversarially Learned Inference with Factorized Discriminators

Authors: Wenxue Chen, Jianke Zhu

Comments: 9 pages, 6 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1330] arXiv:2112.10572 (cross-list from cs.LG) [pdf, other]: Title: General Greedy De-bias Learning

Authors: Xinzhe Han, Shuhui Wang, Chi Su, Qingming Huang, Qi Tian

Comments: This work has been accepted by IEEE T-PAMI. Copyright is transferred without notice, after which this version may no longer be accessible

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1331] arXiv:2112.10603 (cross-list from cs.MM) [pdf, other]: Title: A Multi-user Oriented Live Free-viewpoint Video Streaming System Based On View Interpolation

Authors: Jingchuan Hu, Shuai Guo, Kai Zhou, Yu Dong, Jun Xu, Li Song

Comments: 10 pages, 7 figures

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1332] arXiv:2112.10714 (cross-list from cs.LG) [pdf, other]: Title: Learning Spatio-Temporal Specifications for Dynamical Systems

Authors: Suhail Alsalehi, Erfan Aasi, Ron Weiss, Calin Belta

Comments: 12 pages, submitted to L4DC 2021

Journal-ref: PMLR 168:968-980, 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Systems and Control (eess.SY)
[1333] arXiv:2112.10728 (cross-list from cs.CL) [pdf, other]: Title: MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding

Authors: Revanth Gangi Reddy, Xilin Rui, Manling Li, Xudong Lin, Haoyang Wen, Jaemin Cho, Lifu Huang, Mohit Bansal, Avirup Sil, Shih-Fu Chang, Alexander Schwing, Heng Ji

Comments: Accepted at AAAI 2022

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1334] arXiv:2112.10961 (cross-list from cs.IT) [pdf, other]: Title: Nonlinear Transform Source-Channel Coding for Semantic Communications

Authors: Jincheng Dai, Sixian Wang, Kailin Tan, Zhongwei Si, Xiaoqi Qin, Kai Niu, Ping Zhang

Comments: published in IEEE JSAC

Subjects: Information Theory (cs.IT); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1335] arXiv:2112.10985 (cross-list from cs.LG) [pdf, other]: Title: Learned ISTA with Error-based Thresholding for Adaptive Sparse Coding

Authors: Ziang Li, Kailun Wu, Yiwen Guo, Changshui Zhang

Comments: Accepted in ICASSP2024

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1336] arXiv:2112.11018 (cross-list from cs.LG) [pdf, other]: Title: A Theoretical View of Linear Backpropagation and Its Convergence

Authors: Ziang Li, Yiwen Guo, Haodi Liu, Changshui Zhang

Comments: This paper is accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1337] arXiv:2112.11041 (cross-list from cs.LG) [pdf, other]: Title: Geometry-Aware Unsupervised Domain Adaptation

Authors: You-Wei Luo, Chuan-Xian Ren, Zi-Ying Chen

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1338] arXiv:2112.11312 (cross-list from cs.LG) [pdf, other]: Title: Implicit Neural Video Compression

Authors: Yunfan Zhang, Ties van Rozendaal, Johann Brehmer, Markus Nagel, Taco Cohen

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1339] arXiv:2112.11330 (cross-list from cs.LG) [pdf, ps, other]: Title: PrimSeq: a deep learning-based pipeline to quantitate rehabilitation training

Authors: Avinash Parnandi, Aakash Kaku, Anita Venkatesan, Natasha Pandit, Audre Wirtanen, Haresh Rajamohan, Kannan Venkataramanan, Dawn Nilsen, Carlos Fernandez-Granda, Heidi Schambra

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1340] arXiv:2112.11447 (cross-list from cs.AI) [pdf, other]: Title: Multi-Modality Distillation via Learning the teacher's modality-level Gram Matrix

Authors: Peng Liu

Comments: 10 pages

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1341] arXiv:2112.11450 (cross-list from cs.LG) [pdf, other]: Title: Max-Margin Contrastive Learning

Authors: Anshul Shah, Suvrit Sra, Rama Chellappa, Anoop Cherian

Comments: Accepted at AAAI 2022

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1342] arXiv:2112.11743 (cross-list from cs.LG) [pdf, other]: Title: Simple and Effective Balance of Contrastive Losses

Authors: Arnaud Sors, Rafael Sampaio de Rezende, Sarah Ibrahimi, Jean-Marc Andreoli

Comments: 15 pages, 10 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1343] arXiv:2112.11850 (cross-list from cs.CL) [pdf, ps, other]: Title: Multimodal Analysis of memes for sentiment extraction

Authors: Nayan Varma Alluri, Neeli Dheeraj Krishna

Comments: 5 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1344] arXiv:2112.12078 (cross-list from cs.LG) [pdf, ps, other]: Title: Deeper Learning with CoLU Activation

Authors: Advait Vagerwal

Comments: 7 pages, 4 figures, 4 tables

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1345] arXiv:2112.12272 (cross-list from cs.LG) [pdf, ps, other]: Title: Human Activity Recognition on wrist-worn accelerometers using self-supervised neural networks

Authors: Niranjan Sridhar, Lance Myers

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1346] arXiv:2112.12371 (cross-list from cs.LG) [pdf, other]: Title: DENSE: Data-Free One-Shot Federated Learning

Authors: Jie Zhang, Chen Chen, Bo Li, Lingjuan Lyu, Shuang Wu, Shouhong Ding, Chunhua Shen, Chao Wu

Comments: Accepted by NeurIPS 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1347] arXiv:2112.12431 (cross-list from cs.LG) [pdf, other]: Title: Adaptive Modeling Against Adversarial Attacks

Authors: Zhiwen Yan, Teck Khim Ng

Comments: 10 pages, 3 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1348] arXiv:2112.12510 (cross-list from cs.NE) [pdf, other]: Title: Neuroevolution deep learning architecture search for estimation of river surface elevation from photogrammetric Digital Surface Models

Authors: Radosław Szostak, Marcin Pietroń, Mirosław Zimnoch, Przemysław Wachniew, Paweł Ćwiąkała, Edyta Puniach

Comments: extended version of NeurIPS 2021 Workshop paper - ML4PhysicalSciences

Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1349] arXiv:2112.12533 (cross-list from cs.LG) [pdf, other]: Title: PyCIL: A Python Toolbox for Class-Incremental Learning

Authors: Da-Wei Zhou, Fu-Yun Wang, Han-Jia Ye, De-Chuan Zhan

Comments: Accepted to SCIENCE CHINA Information Sciences. Code is available at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1350] arXiv:2112.12596 (cross-list from cs.HC) [pdf, other]: Title: Explainable Medical Imaging AI Needs Human-Centered Design: Guidelines and Evidence from a Systematic Review

Authors: Haomin Chen, Catalina Gomez, Chien-Ming Huang, Mathias Unberath

Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1351] arXiv:2112.12612 (cross-list from cs.RO) [pdf, other]: Title: Towards Disturbance-Free Visual Mobile Manipulation

Authors: Tianwei Ni, Kiana Ehsani, Luca Weihs, Jordi Salvador

Comments: WACV 2023

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1352] arXiv:2112.12984 (cross-list from cs.RO) [pdf, ps, other]: Title: Doppler velocity-based algorithm for Clustering and Velocity Estimation of moving objects

Authors: Mian Guo, Kai Zhong, Xiaozhi Wang

Comments: 7 pages, 9 figures, 2 tables, 2 algorithms, CACRE2022

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1353] arXiv:2112.13064 (cross-list from cs.CR) [src]: Title: CatchBackdoor: Backdoor Testing by Critical Trojan Neural Path Identification via Differential Fuzzing

Authors: Haibo Jin, Ruoxi Chen, Jinyin Chen, Yao Cheng, Chong Fu, Ting Wang, Yue Yu, Zhaoyan Ming

Comments: There are some problems in the experiment so we need to withdraw this paper. We will upload the new version after revision

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1354] arXiv:2112.13121 (cross-list from cs.LG) [src]: Title: The Curse of Zero Task Diversity: On the Failure of Transfer Learning to Outperform MAML and their Empirical Equivalence

Authors: Brando Miranda, Yu-Xiong Wang, Sanmi Koyejo

Comments: An updated version with an updated correction is at arXiv:2208.01545, and its accompanying NeurIPS submission is at this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1355] arXiv:2112.13137 (cross-list from cs.LG) [pdf, other]: Title: Does MAML Only Work via Feature Re-use? A Data Centric Perspective

Authors: Brando Miranda, Yu-Xiong Wang, Sanmi Koyejo

Comments: 15 pages, 12 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1356] arXiv:2112.13149 (cross-list from cs.AR) [pdf, other]: Title: Fast and Scalable Computation of the Forward and Inverse Discrete Periodic Radon Transform

Authors: Cesar Carranza, Daniel Llamocca, Marios Pattichis

Comments: This paper has been published as follows: C. Carranza, D. Llamocca, and M. Pattichis. "Fast and scalable computation of the forward and inverse discrete periodic radon transform", IEEE Transactions on Image Processing, 25(1):119-133, Jan 2016

Journal-ref: IEEE Transactions on Image Processing, 25(1):119-133, Jan 2016

Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[1357] arXiv:2112.13150 (cross-list from cs.AR) [pdf, other]: Title: Fast 2D Convolutions and Cross-Correlations Using Scalable Architectures

Authors: Cesar Carranza, Daniel Llamocca, Marios Pattichis

Comments: The paper develops the fastest known methods for computing 2D convolutions in hardware

Journal-ref: IEEE Transactions on Image Processing 26.5 (2017): 2230-2245

Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[1358] arXiv:2112.13243 (cross-list from cs.NE) [pdf, other]: Title: Evolutionary Generation of Visual Motion Illusions

Authors: Lana Sinapayen, Eiji Watanabe

Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1359] arXiv:2112.13372 (cross-list from cs.CL) [pdf, ps, other]: Title: Delivery Issues Identification from Customer Feedback Data

Authors: Ankush Chopra, Mahima Arora, Shubham Pandey

Comments: Accepted to be part of MLDS 2022, and will be published in Lattice journal

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1360] arXiv:2112.13659 (cross-list from cs.RO) [pdf, ps, other]: Title: M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots

Authors: Jie Yin, Ang Li, Tao Li, Wenxian Yu, Danping Zou

Comments: accepted by IEEE RA-L

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1361] arXiv:2112.13910 (cross-list from cs.CL) [pdf, other]: Title: Visual Persuasion in COVID-19 Social Media Content: A Multi-Modal Characterization

Authors: Mesut Erhan Unal, Adriana Kovashka, Wen-Ting Chung, Yu-Ru Lin

Comments: 10 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1362] arXiv:2112.13939 (cross-list from cs.LG) [pdf, other]: Title: SPIDER: Searching Personalized Neural Architecture for Federated Learning

Authors: Erum Mushtaq, Chaoyang He, Jie Ding, Salman Avestimehr

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1363] arXiv:2112.13974 (cross-list from cs.LG) [pdf, other]: Title: A Moment in the Sun: Solar Nowcasting from Multispectral Satellite Data using Self-Supervised Learning

Authors: Akansha Singh Bansal, Trapit Bansal, David Irwin

Comments: 18 pages

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1364] arXiv:2112.14006 (cross-list from cs.NI) [pdf, other]: Title: Multi-Band Wi-Fi Sensing with Matched Feature Granularity

Authors: Jianyuan Yu, Pu (Perry) Wang, Toshiaki Koike-Akino, Ye Wang, Philip V. Orlik, R. Michael Buehrer

Comments: 12 pages, 14 figures

Subjects: Networking and Internet Architecture (cs.NI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Signal Processing (eess.SP)
[1365] arXiv:2112.14021 (cross-list from cs.SI) [pdf, other]: Title: Multilayer Graph Contrastive Clustering Network

Authors: Liang Liu, Zhao Kang, Ling Tian, Wenbo Xu, Xixu He

Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1366] arXiv:2112.14061 (cross-list from cs.LG) [pdf, other]: Title: Investigating Shifts in GAN Output-Distributions

Authors: Ricard Durall, Janis Keuper

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1367] arXiv:2112.14232 (cross-list from cs.LG) [pdf, other]: Title: Constrained Gradient Descent: A Powerful and Principled Evasion Attack Against Neural Networks

Authors: Weiran Lin, Keane Lucas, Lujo Bauer, Michael K. Reiter, Mahmood Sharif

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1368] arXiv:2112.14299 (cross-list from cs.LG) [pdf, other]: Title: DeepAdversaries: Examining the Robustness of Deep Learning Models for Galaxy Morphology Classification

Authors: Aleksandra Ćiprijanović, Diana Kafkes, Gregory Snyder, F. Javier Sánchez, Gabriel Nathan Perdue, Kevin Pedro, Brian Nord, Sandeep Madireddy, Stefan M. Wild

Comments: 20 pages, 6 figures, 5 tables; accepted in MLST

Subjects: Machine Learning (cs.LG); Astrophysics of Galaxies (astro-ph.GA); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1369] arXiv:2112.14337 (cross-list from cs.LG) [pdf, other]: Title: Closer Look at the Transferability of Adversarial Examples: How They Fool Different Models Differently

Authors: Futa Waseda, Sosuke Nishikawa, Trung-Nghia Le, Huy H. Nguyen, Isao Echizen

Comments: 25 pages, 13 figures, Accepted at the IEEE Winter Conference on Applications of Computer Vision (WACV) 2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1370] arXiv:2112.14437 (cross-list from cs.CR) [pdf, other]: Title: A Color Image Steganography Based on Frequency Sub-band Selection

Authors: Hai Su, Shan Yang, Shuqing Zhang, Songsen Yu

Comments: 19 pages, 17 figures

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1371] arXiv:2112.14754 (cross-list from cs.LG) [pdf, other]: Title: Disentanglement and Generalization Under Correlation Shifts

Authors: Christina M. Funke, Paul Vicol, Kuan-Chieh Wang, Matthias Kümmerer, Richard Zemel, Matthias Bethge

Comments: CoLLAs 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1372] arXiv:2112.14772 (cross-list from cs.LG) [pdf, other]: Title: Deep Graph Clustering via Dual Correlation Reduction

Authors: Yue Liu, Wenxuan Tu, Sihang Zhou, Xinwang Liu, Linxuan Song, Xihong Yang, En Zhu

Comments: 9 pages, 6 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1373] arXiv:2112.14889 (cross-list from cs.CR) [pdf, other]: Title: Few-shot Backdoor Defense Using Shapley Estimation

Authors: Jiyang Guan, Zhuozhuo Tu, Ran He, Dacheng Tao

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1374] arXiv:2112.14921 (cross-list from cs.IR) [pdf, other]: Title: Retrieving Black-box Optimal Images from External Databases

Authors: Ryoma Sato

Comments: WSDM 2022

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1375] arXiv:2112.15068 (cross-list from cs.LG) [pdf, ps, other]: Title: Digital Rock Typing DRT Algorithm Formulation with Optimal Supervised Semantic Segmentation

Authors: Omar Alfarisi, Djamel Ouzzane, Mohamed Sassi, Tiejun Zhang

Comments: 1- Acknowledgement section is updated. 2- References section is updated with one additional reference

Subjects: Machine Learning (cs.LG); Earth and Planetary Astrophysics (astro-ph.EP); Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[1376] arXiv:2112.15278 (cross-list from cs.LG) [pdf, other]: Title: Data-Free Knowledge Transfer: A Survey

Authors: Yuang Liu, Wei Zhang, Jun Wang, Jianyong Wang

Comments: 20 pages, 8 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1377] arXiv:2112.15317 (cross-list from cs.LG) [pdf, other]: Title: SplitBrain: Hybrid Data and Model Parallel Deep Learning

Authors: Farley Lai, Asim Kadav, Erik Kruus

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[1378] arXiv:2112.15320 (cross-list from cs.LG) [pdf, other]: Title: InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer

Authors: Chin-Tung Lin, Mu Yang

Comments: Rejected by ISMIR 2020

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD)
[1379] arXiv:2112.15329 (cross-list from cs.LG) [pdf, other]: Title: On Distinctive Properties of Universal Perturbations

Authors: Sung Min Park, Kuo-An Wei, Kai Xiao, Jerry Li, Aleksander Madry

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1380] arXiv:2112.15402 (cross-list from cs.LG) [pdf, other]: Title: Relational Experience Replay: Continual Learning by Adaptively Tuning Task-wise Relationship

Authors: Quanziang Wang, Renzhen Wang, Yuexiang Li, Dong Wei, Kai Ma, Yefeng Zheng, Deyu Meng

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1381] arXiv:2112.15411 (cross-list from cs.LG) [pdf, other]: Title: Disjoint Contrastive Regression Learning for Multi-Sourced Annotations

Authors: Xiaoqian Ruan, Gaoang Wang

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1382] arXiv:2112.15421 (cross-list from cs.LG) [pdf, other]: Title: Representation Learning via Consistent Assignment of Views to Clusters

Authors: Thalles Silva, Adín Ramírez Rivera

Comments: Pre-print. 37th ACM/SIGAPP Symposium on Applied Computing (SAC'22). Code at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1383] arXiv:2112.15541 (cross-list from cs.LG) [pdf, other]: Title: on the effectiveness of generative adversarial network on anomaly detection

Authors: Laya Rafiee Sevyeri, Thomas Fevens

Comments: This paper is an improved version of an existing paper published by the same authors in ICANN2020

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1384] arXiv:2112.15550 (cross-list from cs.LG) [pdf, other]: Title: Improving Baselines in the Wild

Authors: Kazuki Irie, Imanol Schlag, Róbert Csordás, Jürgen Schmidhuber

Comments: Presented at NeurIPS 2021 Workshop on Distribution Shifts, this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1385] arXiv:2112.15555 (cross-list from cs.LG) [pdf, other]: Title: An Unsupervised Domain Adaptation Model based on Dual-module Adversarial Training

Authors: Yiju Yang, Tianxiao Zhang, Guanyu Li, Taejoon Kim, Guanghui Wang

Comments: arXiv admin note: text overlap with arXiv:2108.00610

Journal-ref: Neurocomputing, Volume 475, 28 February 2022, Pages 102-111

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1386] arXiv:2112.00002 (cross-list from eess.IV) [pdf, other]: Title: Recovery of Continuous 3D Refractive Index Maps from Discrete Intensity-Only Measurements using Neural Fields

Authors: Renhao Liu, Yu Sun, Jiabei Zhu, Lei Tian, Ulugbek Kamilov

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1387] arXiv:2112.00729 (cross-list from eess.IV) [src]: Title: Total-Body Low-Dose CT Image Denoising using Prior Knowledge Transfer Technique with Contrastive Regularization Mechanism

Authors: Minghan Fu, Yanhua Duan, Zhaoping Cheng, Wenjian Qin, Ying Wang, Dong Liang, Zhanli Hu

Comments: Want to improve the methodology

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1388] arXiv:2112.00730 (cross-list from eess.IV) [pdf, ps, other]: Title: Highly accelerated MR parametric mapping by undersampling the k-space and reducing the contrast number simultaneously with deep learning

Authors: Yanjie Zhu, Haoxiang Li, Yuanyuan Liu, Muzi Guo, Guanxun Cheng, Gang Yang, Haifeng Wang, Dong Liang

Comments: 27 pages, 11 figures. Submitted to Magnetic Resonance in Medicine

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1389] arXiv:2112.00735 (cross-list from eess.IV) [pdf, other]: Title: Reference-guided Pseudo-Label Generation for Medical Semantic Segmentation

Authors: Constantin Seibold, Simon Reiß, Jens Kleesiek, Rainer Stiefelhagen

Comments: 36th AAAI Conference on Artificial Intelligence 2022

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1390] arXiv:2112.00794 (cross-list from eess.IV) [pdf, other]: Title: DFTS2: Simulating Deep Feature Transmission Over Packet Loss Channels

Authors: Ashiv Dhondea, Robert A. Cohen, Ivan V. Bajić

Comments: 6 pages, 4 figures, IEEE Conference on Visual Communications and Image Processing (VCIP) 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1391] arXiv:2112.00913 (cross-list from eess.IV) [pdf, other]: Title: CDLNet: Noise-Adaptive Convolutional Dictionary Learning Network for Blind Denoising and Demosaicing

Authors: Nikola Janjušević, Amirhossein Khalilian-Gourtani, Yao Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1392] arXiv:2112.01137 (cross-list from eess.IV) [pdf, other]: Title: Deep Learning-Based Carotid Artery Vessel Wall Segmentation in Black-Blood MRI Using Anatomical Priors

Authors: Dieuwertje Alblas, Christoph Brune, Jelmer M. Wolterink

Comments: SPIE Medical Imaging 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1393] arXiv:2112.01320 (cross-list from eess.IV) [pdf, other]: Title: Multi-task fusion for improving mammography screening data classification

Authors: Maria Wimmer, Gert Sluiter, David Major, Dimitrios Lenis, Astrid Berg, Theresa Neubauer, Katja Bühler

Comments: Accepted for publication in IEEE Transactions on Medical Imaging

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1394] arXiv:2112.01533 (cross-list from eess.IV) [pdf, other]: Title: Automatic tumour segmentation in H&E-stained whole-slide images of the pancreas

Authors: Pierpaolo Vendittelli, Esther M.M. Smeets, Geert Litjens

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1395] arXiv:2112.01534 (cross-list from eess.IV) [pdf, ps, other]: Title: Learning to automate cryo-electron microscopy data collection with Ptolemy

Authors: Paul T. Kim, Alex J. Noble, Anchi Cheng, Tristan Bepler

Comments: Main: 12 pages, 11 figures. Appendix: 2 pages, 1 figure

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1396] arXiv:2112.01587 (cross-list from eess.IV) [pdf, ps, other]: Title: Improving accuracy and uncertainty quantification of deep learning based quantitative MRI using Monte Carlo dropout

Authors: Mehmet Yigit Avci, Ziyu Li, Qiuyun Fan, Susie Huang, Berkin Bilgic, Qiyuan Tian

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1397] arXiv:2112.01629 (cross-list from eess.IV) [pdf, ps, other]: Title: Engineering AI Tools for Systematic and Scalable Quality Assessment in Magnetic Resonance Imaging

Authors: Yukai Zou, Ikbeom Jang

Comments: 6 pages, 2 figures, NeurIPS Data-Centric AI Workshop 2021 (Virtual)

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1398] arXiv:2112.01702 (cross-list from eess.IV) [pdf, ps, other]: Title: Localized Feature Aggregation Module for Semantic Segmentation

Authors: Ryouichi Furukawa, Kazuhiro Hotta

Comments: SMC 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1399] arXiv:2112.01767 (cross-list from eess.IV) [pdf, other]: Title: MT-TransUNet: Mediating Multi-Task Tokens in Transformers for Skin Lesion Segmentation and Classification

Authors: Jingye Chen, Jieneng Chen, Zongwei Zhou, Bin Li, Alan Yuille, Yongyi Lu

Comments: A technical report. Code will be released

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1400] arXiv:2112.01784 (cross-list from eess.IV) [pdf, other]: Title: Fully automatic integration of dental CBCT images and full-arch intraoral impressions with stitching error correction via individual tooth segmentation and identification

Authors: Tae Jun Jang, Hye Sun Yun, Chang Min Hyun, Jong-Eun Kim, Sang-Hwy Lee, Jin Keun Seo

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1401] arXiv:2112.01797 (cross-list from eess.IV) [pdf, other]: Title: Detection of Large Vessel Occlusions using Deep Learning by Deforming Vessel Tree Segmentations

Authors: Florian Thamm, Oliver Taubmann, Markus Jürgens, Hendrik Ditt, Andreas Maier

Comments: 7 pages. Accepted at BVM-Workshop 2022, Springer

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1402] arXiv:2112.01905 (cross-list from eess.IV) [pdf, other]: Title: Towards Super-Resolution CEST MRI for Visualization of Small Structures

Authors: Lukas Folle, Katharian Tkotz, Fasil Gadjimuradov, Lorenz Kapsner, Moritz Fabian, Sebastian Bickelhaupt, David Simon, Arnd Kleyer, Gerhard Krönke, Moritz Zaiß, Armin Nagel, Andreas Maier

Journal-ref: Proceedings, German Workshop on Medical Image Computing (2022) 210-215

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1403] arXiv:2112.02101 (cross-list from eess.IV) [pdf, other]: Title: View-Consistent Metal Segmentation in the Projection Domain for Metal Artifact Reduction in CBCT -- An Investigation of Potential Improvement

Authors: Tristan M. Gottschalk, Andreas Maier, Florian Kordon, Björn W. Kreher

Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1404] arXiv:2112.02102 (cross-list from eess.IV) [pdf, other]: Title: Echocardiography Segmentation with Enforced Temporal Consistency

Authors: Nathan Painchaud, Nicolas Duchateau, Olivier Bernard, Pierre-Marc Jodoin

Comments: 12 pages, accepted for publication in IEEE TMI

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1405] arXiv:2112.02164 (cross-list from eess.IV) [pdf, other]: Title: Bridging the gap between prostate radiology and pathology through machine learning

Authors: Indrani Bhattacharya, David S. Lim, Han Lin Aung, Xingchen Liu, Arun Seetharaman, Christian A. Kunder, Wei Shao, Simon J. C. Soerensen, Richard E. Fan, Pejman Ghanouni, Katherine J. To'o, James D. Brooks, Geoffrey A. Sonn, Mirabela Rusu

Comments: Indrani Bhattacharya and David S. Lim contributed equally as first authors. Geoffrey A. Sonn and Mirabela Rusu contributed equally as senior authors

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1406] arXiv:2112.02222 (cross-list from eess.IV) [pdf, other]: Title: Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides

Authors: Feng Xu, Chuang Zhu, Wenqi Tang, Ying Wang, Yu Zhang, Jie Li, Hongchuan Jiang, Zhongyue Shi, Jun Liu, Mulan Jin

Comments: Update Table 1 and corresponding descriptions

Journal-ref: Frontiers in Oncology, 11(2021), 4133

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1407] arXiv:2112.02478 (cross-list from eess.IV) [pdf, ps, other]: Title: Classification of COVID-19 on chest X-Ray images using Deep Learning model with Histogram Equalization and Lungs Segmentation

Authors: Aman Swaraj, Karan Verma

Comments: Manuscript: 6577 words; abstract: 238 words; 8 figures; 10 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1408] arXiv:2112.02508 (cross-list from eess.IV) [pdf, other]: Title: Uncertainty-Guided Mutual Consistency Learning for Semi-Supervised Medical Image Segmentation

Authors: Yichi Zhang, Rushi Jiao, Qingcheng Liao, Dongyang Li, Jicong Zhang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1409] arXiv:2112.02522 (cross-list from eess.IV) [pdf, other]: Title: Snapshot HDR Video Construction Using Coded Mask

Authors: Masheal Alghamdi, Qiang Fu, Ali Thabet, Wolfgang Heidrich

Comments: 13 pages, 7 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1410] arXiv:2112.02548 (cross-list from physics.flu-dyn) [pdf, other]: Title: Generative Modeling of Turbulence

Authors: Claudia Drygala, Benjamin Winhart, Francesca di Mare, Hanno Gottschalk

Subjects: Fluid Dynamics (physics.flu-dyn); Computer Vision and Pattern Recognition (cs.CV)
[1411] arXiv:2112.02608 (cross-list from eess.IV) [pdf, ps, other]: Title: Real-time Virtual Intraoperative CT for Image Guided Surgery

Authors: Yangming Li, Neeraja Konuthula, Ian M. Humphreys, Kris Moe, Blake Hannaford, Randall Bly

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[1412] arXiv:2112.02743 (cross-list from eess.IV) [pdf, other]: Title: Separated Contrastive Learning for Organ-at-Risk and Gross-Tumor-Volume Segmentation with Limited Annotation

Authors: Jiacheng Wang, Xiaomeng Li, Yiming Han, Jing Qin, Liansheng Wang, Zhou Qichao

Comments: Accepted in AAAI-22 (Oral)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1413] arXiv:2112.02858 (cross-list from eess.IV) [pdf, ps, other]: Title: A comparison study of CNN denoisers on PRNU extraction

Authors: Hui Zeng, Morteza Darvish Morshedi Hosseini, Kang Deng, Anjie Peng, Miroslav Goljan

Comments: 12 pages, 6 figures, 4 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1414] arXiv:2112.02896 (cross-list from eess.IV) [pdf, other]: Title: Tunable Image Quality Control of 3-D Ultrasound using Switchable CycleGAN

Authors: Jaeyoung Huh, Shujaat Khan, Sungjin Choi, Dongkuk Shin, Eun Sun Lee, Jong Chul Ye

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1415] arXiv:2112.03053 (cross-list from eess.IV) [pdf, other]: Title: Fast 3D registration with accurate optimisation and little learning for Learn2Reg 2021

Authors: Hanna Siebert, Lasse Hansen, Mattias P. Heinrich

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1416] arXiv:2112.03259 (cross-list from q-bio.QM) [pdf, ps, other]: Title: Novel Local Radiomic Bayesian Classifiers for Non-Invasive Prediction of MGMT Methylation Status in Glioblastoma

Authors: Mihir Rao

Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1417] arXiv:2112.03276 (cross-list from eess.IV) [pdf, other]: Title: Organ localisation using supervised and semi supervised approaches combining reinforcement learning with imitation learning

Authors: Sankaran Iyer, Alan Blair, Laughlin Dawes, Daniel Moses, Christopher White, Arcot Sowmya

Comments: 16 pages, 12 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1418] arXiv:2112.03277 (cross-list from eess.IV) [pdf, ps, other]: Title: Automatic quality control framework for more reliable integration of machine learning-based image segmentation into medical workflows

Authors: Elena Williams, Sebastian Niehaus, Janis Reinelt, Alberto Merola, Paul Glad Mihai, Kersten Villringer, Konstantin Thierbach, Evelyn Medawar, Daniel Lichterfeld, Ingo Roeder, Nico Scherf, Maria del C. Valdés Hernández

Comments: 19 pages

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1419] arXiv:2112.03380 (cross-list from eess.IV) [pdf, other]: Title: Dynamic imaging using Motion-Compensated SmooThness Regularization on Manifolds (MoCo-SToRM)

Authors: Qing Zou, Luis A. Torres, Sean B. Fain, Nara S. Higano, Alister J. Bates, Mathews Jacob

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1420] arXiv:2112.03455 (cross-list from eess.IV) [pdf, other]: Title: Hybrid guiding: A multi-resolution refinement approach for semantic segmentation of gigapixel histopathological images

Authors: André Pedersen, Erik Smistad, Tor V. Rise, Vibeke G. Dale, Henrik S. Pettersen, Tor-Arne S. Nordmo, David Bouget, Ingerid Reinertsen, Marit Valla

Comments: 12 pages, 3 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1421] arXiv:2112.03456 (cross-list from eess.IV) [pdf, other]: Title: RSBNet: One-Shot Neural Architecture Search for A Backbone Network in Remote Sensing Image Recognition

Authors: Cheng Peng, Yangyang Li, Ronghua Shang, Licheng Jiao

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1422] arXiv:2112.03536 (cross-list from eess.IV) [pdf, other]: Title: Learning Pixel-Adaptive Weights for Portrait Photo Retouching

Authors: Binglu Wang, Chengzhe Lu, Dawei Yan, Yongqiang Zhao

Comments: Technical report

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1423] arXiv:2112.03622 (cross-list from eess.IV) [pdf, other]: Title: Evaluating Generic Auto-ML Tools for Computational Pathology

Authors: Lars Ole Schwen, Daniela Schacherer, Christian Geißler, André Homeyer

Journal-ref: Informatics in Medicine Unlocked 29 (2022) 100853

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1424] arXiv:2112.03694 (cross-list from eess.IV) [pdf, other]: Title: Hard Sample Aware Noise Robust Learning for Histopathology Image Classification

Authors: Chuang Zhu, Wenkai Chen, Ting Peng, Ying Wang, Mulan Jin

Comments: 14 pages, 20 figures, IEEE Transactions on Medical Imaging

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[1425] arXiv:2112.03696 (cross-list from eess.IV) [pdf, other]: Title: Noise Distribution Adaptive Self-Supervised Image Denoising using Tweedie Distribution and Score Matching

Authors: Kwanyoung Kim, Taesung Kwon, Jong Chul Ye

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1426] arXiv:2112.03701 (cross-list from eess.IV) [pdf, other]: Title: Efficient joint noise removal and multi exposure fusion

Authors: A. Buades, J.L. Lisani, O. Martorell

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1427] arXiv:2112.03712 (cross-list from eess.IV) [pdf, other]: Title: Image Compressed Sensing Using Non-local Neural Network

Authors: Wenxue Cui, Shaohui Liu, Feng Jiang, Debin Zhao

Comments: 14 pages, 11 figures, 7 tables

Journal-ref: IEEE Transactions on Multimedia, 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1428] arXiv:2112.03888 (cross-list from eess.IV) [pdf, ps, other]: Title: Image Enhancement via Bilateral Learning

Authors: Saeedeh Rezaee, Nezam Mahdavi-Amiri

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1429] arXiv:2112.03911 (cross-list from eess.IV) [pdf, ps, other]: Title: Dyadic Sex Composition and Task Classification Using fNIRS Hyperscanning Data

Authors: Liam A. Kruse, Allan L. Reiss, Mykel J. Kochenderfer, Stephanie Balters

Comments: 20th IEEE International Conference on Machine Learning and Applications

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1430] arXiv:2112.03915 (cross-list from eess.IV) [pdf, other]: Title: Embedding Gradient-based Optimization in Image Registration Networks

Authors: Huaqi Qiu, Kerstin Hammernik, Chen Qin, Chen Chen, Daniel Rueckert

Comments: Accepted by International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1431] arXiv:2112.03916 (cross-list from eess.IV) [pdf, other]: Title: BT-Unet: A self-supervised learning framework for biomedical image segmentation using Barlow Twins with U-Net models

Authors: Narinder Singh Punn, Sonali Agarwal

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1432] arXiv:2112.03998 (cross-list from eess.IV) [pdf, ps, other]: Title: Nuclei Segmentation in Histopathology Images using Deep Learning with Local and Global Views

Authors: Mahdi Arab Loodaricheh, Nader Karimi, Shadrokh Samavi

Comments: 5 pages, 5 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1433] arXiv:2112.04121 (cross-list from eess.IV) [pdf, other]: Title: Reverse image filtering using total derivative approximation and accelerated gradient descent

Authors: Fernando J. Galetto, Guang Deng

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1434] arXiv:2112.04267 (cross-list from eess.IV) [pdf, other]: Title: Implicit Neural Representations for Image Compression

Authors: Yannick Strümpler, Janis Postels, Ren Yang, Luc van Gool, Federico Tombari

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1435] arXiv:2112.04386 (cross-list from eess.IV) [pdf, other]: Title: Which images to label for few-shot medical landmark detection?

Authors: Quan Quan, Qingsong Yao, Jun Li, S. Kevin Zhou

Journal-ref: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1436] arXiv:2112.04487 (cross-list from eess.IV) [pdf, other]: Title: Joint Global and Local Hierarchical Priors for Learned Image Compression

Authors: Jun-Hyuk Kim, Byeongho Heo, Jong-Seok Lee

Comments: CVPR 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1437] arXiv:2112.04488 (cross-list from eess.IV) [pdf, other]: Title: A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution

Authors: Karam Park, Jae Woong Soh, Nam Ik Cho

Comments: Accepted for publication as a regular paper in the IEEE Transactions on Multimedia

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1438] arXiv:2112.04489 (cross-list from eess.IV) [pdf, other]: Title: Learn2Reg: comprehensive multi-task medical image registration challenge, dataset and evaluation in the era of deep learning

Authors: Alessa Hering, Lasse Hansen, Tony C. W. Mok, Albert C. S. Chung, Hanna Siebert, Stephanie Häger, Annkristin Lange, Sven Kuckertz, Stefan Heldmann, Wei Shao, Sulaiman Vesal, Mirabela Rusu, Geoffrey Sonn, Théo Estienne, Maria Vakalopoulou, Luyi Han, Yunzhi Huang, Pew-Thian Yap, Mikael Brudfors, Yaël Balbastre, Samuel Joutard, Marc Modat, Gal Lifshitz, Dan Raviv, Jinxin Lv, Qiang Li, Vincent Jaouen, Dimitris Visvikis, Constance Fourcade, Mathieu Rubeaux, Wentao Pan, Zhe Xu, Bailiang Jian, Francesca De Benetti, Marek Wodzinski, Niklas Gunnarsson, Jens Sjölund, Daniel Grzech, Huaqi Qiu, Zeju Li, Alexander Thorley, Jinming Duan, Christoph Großbröhmer, Andrew Hoopes, Ingerid Reinertsen, Yiming Xiao, Bennett Landman, Yuankai Huo, Keelin Murphy, Nikolas Lessmann, Bram van Ginneken, et al. (2 additional authors not shown)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1439] arXiv:2112.04490 (cross-list from eess.IV) [pdf, other]: Title: A novel multi-view deep learning approach for BI-RADS and density assessment of mammograms

Authors: Huyen T. X. Nguyen, Sam B. Tran, Dung B. Nguyen, Hieu H. Pham, Ha Q. Nguyen

Comments: This paper has been accepted by the 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (2022 IEEE EMBC)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1440] arXiv:2112.04491 (cross-list from cs.CV) [pdf, other]: Title: Improving Image Restoration by Revisiting Global Information Aggregation

Authors: Xiaojie Chu, Liangyu Chen, Chengpeng Chen, Xin Lu

Comments: ECCV 2022; fix typo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1441] arXiv:2112.04493 (cross-list from eess.IV) [pdf, ps, other]: Title: Binary Change Guided Hyperspectral Multiclass Change Detection

Authors: Meiqi Hu, Chen Wu, Bo Du, Liangpei Zhang

Comments: 14 pages, 17 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1442] arXiv:2112.04495 (cross-list from eess.IV) [pdf, other]: Title: Dynamic multi feature-class Gaussian process models

Authors: Jean-Rassaire Fouefack, Bhushan Borotikar, Marcel Lüthi, Tania S. Douglas, Valérie Burdin, Tinashe E.M. Mutsvangwa

Comments: 16

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1443] arXiv:2112.04499 (cross-list from eess.IV) [pdf, other]: Title: Multiscale Softmax Cross Entropy for Fovea Localization on Color Fundus Photography

Authors: Yuli Wu, Peter Walter, Dorit Merhof

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1444] arXiv:2112.04653 (cross-list from eess.IV) [pdf, ps, other]: Title: Extending nn-UNet for brain tumor segmentation

Authors: Huan Minh Luu, Sung-Hong Park

Comments: 12 pages, 4 figures, BraTS competition paper

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1445] arXiv:2112.04721 (cross-list from eess.IV) [pdf, ps, other]: Title: One-dimensional Deep Low-rank and Sparse Network for Accelerated MRI

Authors: Zi Wang, Chen Qian, Di Guo, Hongwei Sun, Rushuai Li, Bo Zhao, Xiaobo Qu

Comments: 16 pages

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1446] arXiv:2112.04863 (cross-list from eess.IV) [pdf, other]: Title: 3D Medical Point Transformer: Introducing Convolution to Attention Networks for Medical Point Cloud Analysis

Authors: Jianhui Yu, Chaoyi Zhang, Heng Wang, Dingxin Zhang, Yang Song, Tiange Xiang, Dongnan Liu, Weidong Cai

Comments: Technical Report

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1447] arXiv:2112.04882 (cross-list from eess.IV) [pdf, other]: Title: Evaluating saliency methods on artificial data with different background types

Authors: Céline Budding, Fabian Eitel, Kerstin Ritter, Stefan Haufe

Comments: 6 pages, 2 figures. Presented at Medical Imaging meets NeurIPS 2021 (poster presentation)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1448] arXiv:2112.04894 (cross-list from eess.IV) [pdf, other]: Title: Semi-Supervised Medical Image Segmentation via Cross Teaching between CNN and Transformer

Authors: Xiangde Luo, Minhao Hu, Tao Song, Guotai Wang, Shaoting Zhang

Comments: Accepted to MIDL 2022; code in SSL4MIS: this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1449] arXiv:2112.04984 (cross-list from eess.IV) [pdf, other]: Title: Robust Weakly Supervised Learning for COVID-19 Recognition Using Multi-Center CT Images

Authors: Qinghao Ye, Yuan Gao, Weiping Ding, Zhangming Niu, Chengjia Wang, Yinghui Jiang, Minhao Wang, Evandro Fei Fang, Wade Menpes-Smith, Jun Xia, Guang Yang

Comments: 32 pages, 8 figures, Applied Soft Computing

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1450] arXiv:2112.04998 (cross-list from eess.IV) [pdf, other]: Title: Sparse-View CT Reconstruction using Recurrent Stacked Back Projection

Authors: Wenrui Li, Gregery T. Buzzard, Charles A. Bouman

Comments: 5 pages, 2021 Asilomar Conference on Signals, Systems, and Computers

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1451] arXiv:2112.05074 (cross-list from math.AG) [pdf, other]: Title: Critical configurations for two projective views, a new approach

Authors: Martin Bråtelund

Comments: 26 pages, 4 figures, this version corrects an error appearing in the first table in the published version

Journal-ref: Journal of Symbolic Computation 120 (2024)

Subjects: Algebraic Geometry (math.AG); Computer Vision and Pattern Recognition (cs.CV)
[1452] arXiv:2112.05146 (cross-list from eess.IV) [pdf, other]: Title: Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction

Authors: Hyungjin Chung, Byeongsu Sim, Jong Chul Ye

Comments: Accepted to CVPR 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1453] arXiv:2112.05147 (cross-list from eess.IV) [pdf, other]: Title: Learning Deep Context-Sensitive Decomposition for Low-Light Image Enhancement

Authors: Long Ma, Risheng Liu, Jiaao Zhang, Xin Fan, Zhongxuan Luo

Comments: Accepted by IEEE TNNLS. Code is available at this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1454] arXiv:2112.05149 (cross-list from eess.IV) [pdf, other]: Title: DiffuseMorph: Unsupervised Deformable Image Registration Using Diffusion Model

Authors: Boah Kim, Inhwa Han, Jong Chul Ye

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1455] arXiv:2112.05150 (cross-list from eess.IV) [pdf, other]: Title: Deep Recurrent Neural Network with Multi-scale Bi-directional Propagation for Video Deblurring

Authors: Chao Zhu, Hang Dong, Jinshan Pan, Boyang Liang, Yuhao Huang, Lean Fu, Fei Wang

Comments: Accepted by AAAI-2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1456] arXiv:2112.05151 (cross-list from eess.IV) [pdf, other]: Title: Annotation-efficient cancer detection with report-guided lesion annotation for deep learning-based prostate cancer detection in bpMRI

Authors: Joeran S. Bosma, Anindo Saha, Matin Hosseinzadeh, Ilse Slootweg, Maarten de Rooij, Henkjan Huisman

Journal-ref: Radiology: Artificial Intelligence (2023), e230031

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1457] arXiv:2112.05220 (cross-list from eess.IV) [pdf, other]: Title: Hidden Path Selection Network for Semantic Segmentation of Remote Sensing Images

Authors: Kunping Yang, Xin-Yi Tong, Gui-Song Xia, Weiming Shen, Liangpei Zhang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1458] arXiv:2112.05221 (cross-list from eess.IV) [pdf, other]: Title: MantissaCam: Learning Snapshot High-dynamic-range Imaging with Perceptually-based In-pixel Irradiance Encoding

Authors: Haley M. So, Julien N.P. Martel, Piotr Dudek, Gordon Wetzstein

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1459] arXiv:2112.05303 (cross-list from eess.IV) [pdf, other]: Title: Surrogate-based cross-correlation for particle image velocimetry

Authors: Yong Lee, Fuqiang Gu, Zeyu Gong

Comments: 13 pages, 11 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1460] arXiv:2112.05478 (cross-list from math.AG) [pdf, other]: Title: Critical configurations for three projective views

Authors: Martin Bråtelund

Comments: 40 pages, 9 figures. This is a companion paper to arXiv:2112.05074. Accepted manuscript published in Mathematica Scandinavica

Subjects: Algebraic Geometry (math.AG); Computer Vision and Pattern Recognition (cs.CV)
[1461] arXiv:2112.05505 (cross-list from eess.IV) [pdf, other]: Title: DeepRLS: A Recurrent Network Architecture with Least Squares Implicit Layers for Non-blind Image Deconvolution

Authors: Iaroslav Koshelev, Daniil Selikhanovych, Stamatios Lefkimmiatis

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1462] arXiv:2112.05748 (cross-list from eess.IV) [pdf, other]: Title: Deep Learning based Framework for Automatic Diagnosis of Glaucoma based on analysis of Focal Notching in the Optic Nerve Head

Authors: Sneha Dasgupta, Rishav Mukherjee, Kaushik Dutta, Anindya Sen

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1463] arXiv:2112.05752 (cross-list from eess.IV) [pdf, other]: Title: Specificity-Preserving Federated Learning for MR Image Reconstruction

Authors: Chun-Mei Feng, Yunlu Yan, Shanshan Wang, Yong Xu, Ling Shao, Huazhu Fu

Comments: 12 pages, 8 figures. Code: this https URL

Journal-ref: IEEE Transactions on Medical Imaging, 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1464] arXiv:2112.05754 (cross-list from eess.IV) [pdf, other]: Title: PyTorch Connectomics: A Scalable and Flexible Segmentation Framework for EM Connectomics

Authors: Zudi Lin, Donglai Wei, Jeff Lichtman, Hanspeter Pfister

Comments: Technical report

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1465] arXiv:2112.05755 (cross-list from eess.IV) [pdf, other]: Title: Information Prebuilt Recurrent Reconstruction Network for Video Super-Resolution

Authors: Shuyun Wang, Ming Yu, Cuihong Xue, Yingchun Guo, Gang Yan

Comments: 12 pages, 9 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1466] arXiv:2112.05756 (cross-list from eess.IV) [pdf, other]: Title: Enhancing Multi-Scale Implicit Learning in Image Super-Resolution with Integrated Positional Encoding

Authors: Ying-Tian Liu, Yuan-Chen Guo, Song-Hai Zhang

Comments: 10 pages, 5 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1467] arXiv:2112.05758 (cross-list from eess.IV) [pdf, other]: Title: Edge-Enhanced Dual Discriminator Generative Adversarial Network for Fast MRI with Parallel Imaging Using Multi-view Information

Authors: Jiahao Huang, Weiping Ding, Jun Lv, Jingwen Yang, Hao Dong, Javier Del Ser, Jun Xia, Tiaojuan Ren, Stephen Wong, Guang Yang

Comments: 33 pages, 13 figures, Applied Intelligence

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1468] arXiv:2112.05760 (cross-list from eess.IV) [pdf, other]: Title: Learning Representations with Contrastive Self-Supervised Learning for Histopathology Applications

Authors: Karin Stacke, Jonas Unger, Claes Lundström, Gabriel Eilertsen

Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) this https URL

Journal-ref: https://www.melba-journal.org/papers/2022:023.html

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[1469] arXiv:2112.05761 (cross-list from eess.IV) [pdf, other]: Title: Self-Supervised Transformers for fMRI representation

Authors: Itzik Malkiel, Gony Rosenman, Lior Wolf, Talma Hendler

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1470] arXiv:2112.05794 (cross-list from eess.IV) [pdf, other]: Title: A Label Correction Algorithm Using Prior Information for Automatic and Accurate Geospatial Object Recognition

Authors: Weiwei Duan, Yao-Yi Chiang, Stefan Leyk, Johannes H. Uhl, Craig A. Knoblock

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1471] arXiv:2112.05900 (cross-list from eess.IV) [pdf, ps, other]: Title: Automated assessment of disease severity of COVID-19 using artificial intelligence with synthetic chest CT

Authors: Mengqiu Liu, Ying Liu, Yidong Yang, Aiping Liu, Shana Li, Changbing Qu, Xiaohui Qiu, Yang Li, Weifu Lv, Peng Zhang, Jie Wen

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1472] arXiv:2112.06031 (cross-list from eess.IV) [pdf, other]: Title: Unsupervised Image to Image Translation for Multiple Retinal Pathology Synthesis in Optical Coherence Tomography Scans

Authors: Hemanth Pasupuleti, G. N. Girish

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1473] arXiv:2112.06149 (cross-list from eess.IV) [src]: Title: Two New Stenosis Detection Methods of Coronary Angiograms

Authors: Yaofang Liu, Xinyue Zhang, Wenlong Wan, Shaoyu Liu, Yingdi Liu, Hu Liu, Xueying Zeng, Qing Zhang

Comments: We submitted the paper due to an operational error. This paper is a modified version of the original paper Two New Stenoses Detection Methods of Coronary Angiograms (arXiv:2108.01516). And we will update the revised paper to the original paper later

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1474] arXiv:2112.06194 (cross-list from eess.IV) [pdf, ps, other]: Title: Improving Performance of Federated Learning based Medical Image Analysis in Non-IID Settings using Image Augmentation

Authors: Alper Emin Cetinkaya, Murat Akin, Seref Sagiroglu

Journal-ref: IEEE 14th International Conference on Information Security and Cryptology, 2021, pp. 69-74

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1475] arXiv:2112.06226 (cross-list from eess.IV) [pdf, other]: Title: Attention based Broadly Self-guided Network for Low light Image Enhancement

Authors: Zilong Chen, Yaling Liang, Minghui Du

Comments: 10 pages, 8 figures, 4 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1476] arXiv:2112.06334 (cross-list from eess.IV) [pdf, other]: Title: DPICT: Deep Progressive Image Compression Using Trit-Planes

Authors: Jae-Han Lee, Seungmin Jeon, Kwang Pyo Choi, Youngo Park, Chang-Su Kim

Comments: Accepted to CVPR 2022 (Oral presentation)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1477] arXiv:2112.06417 (cross-list from eess.IV) [pdf, other]: Title: LC-FDNet: Learned Lossless Image Compression with Frequency Decomposition Network

Authors: Hochang Rhee, Yeong Il Jang, Seyun Kim, Nam Ik Cho

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1478] arXiv:2112.06476 (cross-list from eess.IV) [pdf, other]: Title: gACSON software for automated segmentation and morphology analyses of myelinated axons in 3D electron microscopy

Authors: Andrea Behanova, Ali Abdollahzadeh, Ilya Belevich, Eija Jokitalo, Alejandra Sierra, Jussi Tohka

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1479] arXiv:2112.06693 (cross-list from eess.IV) [pdf, other]: Title: Hypernet-Ensemble Learning of Segmentation Probability for Medical Image Segmentation with Ambiguous Labels

Authors: Sungmin Hong, Anna K. Bonkhoff, Andrew Hoopes, Martin Bretzner, Markus D. Schirmer, Anne-Katrin Giese, Adrian V. Dalca, Polina Golland, Natalia S. Rost

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1480] arXiv:2112.06759 (cross-list from eess.IV) [pdf, ps, other]: Title: Hformer: Hybrid CNN-Transformer for Fringe Order Prediction in Phase Unwrapping of Fringe Projection

Authors: Xinjun Zhu, Zhiqiang Han, Mengkai Yuan, Qinghua Guo, Hongyi Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1481] arXiv:2112.06979 (cross-list from eess.IV) [pdf, other]: Title: The Brain Tumor Sequence Registration (BraTS-Reg) Challenge: Establishing Correspondence Between Pre-Operative and Follow-up MRI Scans of Diffuse Glioma Patients

Authors: Bhakti Baheti, Satrajit Chakrabarty, Hamed Akbari, Michel Bilello, Benedikt Wiestler, Julian Schwarting, Evan Calabrese, Jeffrey Rudie, Syed Abidi, Mina Mousa, Javier Villanueva-Meyer, Brandon K.K. Fields, Florian Kofler, Russell Takeshi Shinohara, Juan Eugenio Iglesias, Tony C. W. Mok, Albert C. S. Chung, Marek Wodzinski, Artur Jurgas, Niccolo Marini, Manfredo Atzori, Henning Muller, Christoph Grobroehmer, Hanna Siebert, Lasse Hansen, Mattias P. Heinrich, Luca Canalini, Jan Klein, Annika Gerken, Stefan Heldmann, Alessa Hering, Horst K. Hahn, Mingyuan Meng, Lei Bi, Dagan Feng, Jinman Kim, Ramy A. Zeineldin, Mohamed E. Karar, Franziska Mathis-Ullrich, Oliver Burgert, Javid Abderezaei, Aymeric Pionteck, Agamdeep Chopra, Mehmet Kurt, Kewei Yan, Yonghong Yan, Zhe Tang, Jianqiang Ma, Sahar Almahfouz Nasser, et al. (24 additional authors not shown)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1482] arXiv:2112.07102 (cross-list from eess.IV) [pdf, other]: Title: COVID-19 Pneumonia and Influenza Pneumonia Detection Using Convolutional Neural Networks

Authors: Julianna Antonchuk, Benjamin Prescott, Philip Melanchthon, Robin Singh

Comments: for associated Azure ML notebook code, see this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1483] arXiv:2112.07415 (cross-list from eess.IV) [pdf, ps, other]: Title: Stochastic Planner-Actor-Critic for Unsupervised Deformable Image Registration

Authors: Ziwei Luo, Jing Hu, Xin Wang, Shu Hu, Bin Kong, Youbing Yin, Qi Song, Xi Wu, Siwei Lyu

Comments: Accepted by AAAI 2022

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1484] arXiv:2112.07529 (cross-list from eess.IV) [pdf, ps, other]: Title: Improving COVID-19 CXR Detection with Synthetic Data Augmentation

Authors: Daniel Schaudt, Christopher Kloth, Christian Spaete, Andreas Hinteregger, Meinrad Beer, Reinhold von Schwerin

Comments: This paper has been accepted at the Upper-Rhine Artificial Intelligence Symposium 2021 arXiv:2112.05657

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1485] arXiv:2112.07555 (cross-list from eess.IV) [pdf, other]: Title: Classification of histopathology images using ConvNets to detect Lupus Nephritis

Authors: Akash Gupta, Anirudh Reddy, CV Jawahar, PK Vinod

Comments: Accepted in the 2021 Medical Imaging meets NeurIPS Workshop

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Tissues and Organs (q-bio.TO)
[1486] arXiv:2112.08232 (cross-list from eess.IV) [pdf, ps, other]: Title: RA V-Net: Deep learning network for automated liver segmentation

Authors: Zhiqi Lee, Sumin Qi, Chongchong Fan, Ziwei Xie

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1487] arXiv:2112.08644 (cross-list from eess.IV) [pdf, ps, other]: Title: A comparative study of paired versus unpaired deep learning methods for physically enhancing digital rock image resolution

Authors: Yufu Niu, Samuel J. Jackson, Naif Alqahtani, Peyman Mostaghimi, Ryan T. Armstrong

Comments: 26 pages, 11 figures, 4 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1488] arXiv:2112.08767 (cross-list from eess.IV) [pdf, other]: Title: Adaptation and Attention for Neural Video Coding

Authors: Nannan Zou, Honglei Zhang, Francesco Cricri, Ramin G. Youvalari, Hamed R. Tavakoli, Jani Lainema, Emre Aksu, Miska Hannuksela, Esa Rahtu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1489] arXiv:2112.08837 (cross-list from eess.IV) [pdf, ps, other]: Title: Improving Unsupervised Stain-To-Stain Translation using Self-Supervision and Meta-Learning

Authors: Nassim Bouteldja, Barbara Mara Klinkhammer, Tarek Schlaich, Peter Boor, Dorit Merhof

Comments: Accepted for Journal of Pathology Informatics (JPI), 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1490] arXiv:2112.08851 (cross-list from stat.ML) [pdf, other]: Title: Classification Under Ambiguity: When Is Average-K Better Than Top-K?

Authors: Titouan Lorieul, Alexis Joly, Dennis Shasha

Comments: 53 pages, 21 figures

Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1491] arXiv:2112.08968 (cross-list from eess.IV) [pdf, ps, other]: Title: Automated segmentation of 3-D body composition on computed tomography

Authors: Lucy Pu, Syed F. Ashraf, Naciye S Gezer, Iclal Ocak, Rajeev Dhupar

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1492] arXiv:2112.08974 (cross-list from eess.IV) [pdf, other]: Title: Quality monitoring of federated Covid-19 lesion segmentation

Authors: Camila Gonzalez, Christian Harder, Amin Ranem, Ricarda Fischbach, Isabel Kaltenborn, Armin Dadras, Andreas Bucher, Anirban Mukhopadhyay

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1493] arXiv:2112.09020 (cross-list from physics.data-an) [pdf, ps, other]: Title: Classification of diffraction patterns using a convolutional neural network in single particle imaging experiments performed at X-ray free-electron lasers

Authors: Dameli Assalauova, Alexandr Ignatenko, Fabian Isensee, Sergey Bobkov, Darya Trofimova, Ivan A. Vartanyants

Comments: Main text: 28 pages, 7 figures, Supporting Information: 12 pages, 6 figures

Subjects: Data Analysis, Statistics and Probability (physics.data-an); Computer Vision and Pattern Recognition (cs.CV); Biological Physics (physics.bio-ph)
[1494] arXiv:2112.09135 (cross-list from eess.IV) [pdf, other]: Title: ASC-Net: Unsupervised Medical Anomaly Segmentation Using an Adversarial-based Selective Cutting Network

Authors: Raunak Dey, Wenbo Sun, Haibo Xu, Yi Hong

Comments: Currently in submission to Medical Image Analysis Journal. Extension of DOI - 10.1007/978-3-030-87240-3_23 with more details, experiments, and in-depth analysis. arXiv admin note: substantial text overlap with arXiv:2103.03664

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1495] arXiv:2112.09177 (cross-list from eess.IV) [pdf, other]: Title: Coherence Learning using Keypoint-based Pooling Network for Accurately Assessing Radiographic Knee Osteoarthritis

Authors: Kang Zheng, Yirui Wang, Chen-I Hsieh, Le Lu, Jing Xiao, Chang-Fu Kuo, Shun Miao

Comments: extension of RSNA 2020 report "Consistent and Coherent Computer-Aided Knee Osteoarthritis Assessment from Plain Radiographs"

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1496] arXiv:2112.09216 (cross-list from eess.IV) [pdf, other]: Title: A Deep-Learning Framework for Improving COVID-19 CT Image Quality and Diagnostic Accuracy

Authors: Garvit Goel, Jingyuan Qi, Wu-chun Feng, Guohua Cao

Comments: 10 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1497] arXiv:2112.09254 (cross-list from eess.IV) [pdf, other]: Title: A Novel Image Denoising Algorithm Using Concepts of Quantum Many-Body Theory

Authors: Sayantan Dutta, Adrian Basarab, Bertrand Georgeot, Denis Kouamé

Comments: 24 pages, 14 figures; complements and expands arXiv:2108.13778

Journal-ref: Signal Processing, Volume 201, 2022, 108690

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1498] arXiv:2112.09362 (cross-list from quant-ph) [pdf, other]: Title: Colloquium: Advances in automation of quantum dot devices control

Authors: Justyna P. Zwolak, Jacob M. Taylor

Comments: 24 pages, 11 figures

Journal-ref: Rev. Mod. Phys. 95, 011006 (2023)

Subjects: Quantum Physics (quant-ph); Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1499] arXiv:2112.09496 (cross-list from eess.IV) [pdf, ps, other]: Title: Towards Launching AI Algorithms for Cellular Pathology into Clinical & Pharmaceutical Orbits

Authors: Amina Asif, Kashif Rajpoot, David Snead, Fayyaz Minhas, Nasir Rajpoot

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1500] arXiv:2112.09529 (cross-list from eess.IV) [pdf, other]: Title: End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression

Authors: M.Akın Yılmaz, A.Murat Tekalp

Comments: Accepted for publication in IEEE Transactions on Image Processing on 15 Dec. 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1501] arXiv:2112.09574 (cross-list from eess.IV) [pdf, ps, other]: Title: Super-resolution reconstruction of cytoskeleton image based on A-net deep learning network

Authors: Qian Chen, Haoxin Bai, Bingchen Che, Tianyun Zhao, Ce Zhang, Kaige Wang, Jintao Bai, Wei Zhao

Comments: The manuscript has 17 pages, 10 figures and 58 references

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1502] arXiv:2112.09654 (cross-list from eess.IV) [pdf, other]: Title: FastSurferVINN: Building Resolution-Independence into Deep Learning Segmentation Methods -- A Solution for HighRes Brain MRI

Authors: Leonie Henschel, David Kügler, Martin Reuter

Comments: accepted at NeuroImage

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1503] arXiv:2112.09694 (cross-list from eess.IV) [pdf, other]: Title: Interpretable and Interactive Deep Multiple Instance Learning for Dental Caries Classification in Bitewing X-rays

Authors: Benjamin Bergner, Csaba Rohrer, Aiham Taleb, Martha Duchrau, Guilherme De Leon, Jonas Almeida Rodrigues, Falk Schwendicke, Joachim Krois, Christoph Lippert

Comments: 19 pages, 10 figures, Full Paper, MIDL 2022

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1504] arXiv:2112.09760 (cross-list from eess.IV) [pdf, other]: Title: Learned Half-Quadratic Splitting Network for MR Image Reconstruction

Authors: Bingyu Xin, Timothy S. Phan, Leon Axel, Dimitris N. Metaxas

Comments: accepted for MIDL2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1505] arXiv:2112.09970 (cross-list from eess.IV) [pdf, ps, other]: Title: 3D Structural Analysis of the Optic Nerve Head to Robustly Discriminate Between Papilledema and Optic Disc Drusen

Authors: Michaël J.A. Girard, Satish K. Panda, Tin Aung Tun, Elisabeth A. Wibroe, Raymond P. Najjar, Aung Tin, Alexandre H. Thiéry, Steffen Hamann, Clare Fraser, Dan Milea

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1506] arXiv:2112.10001 (cross-list from eess.IV) [pdf, other]: Title: Cross-Domain Federated Learning in Medical Imaging

Authors: Vishwa S Parekh, Shuhao Lai, Vladimir Braverman, Jeff Leal, Steven Rowe, Jay J Pillai, Michael A Jacobs

Comments: Under Review for MIDL 2022

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1507] arXiv:2112.10024 (cross-list from eess.IV) [pdf, ps, other]: Title: Supervised laser-speckle image sampling of skin tissue to detect very early stage of diabetes by its effects on skin subcellular properties

Authors: Ahmet Orun, Luke Vella Critien, Jennifer Carter, Martin Stacey

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1508] arXiv:2112.10046 (cross-list from eess.IV) [pdf, other]: Title: A-ESRGAN: Training Real-World Blind Super-Resolution with Attention U-Net Discriminators

Authors: Zihao Wei, Yidong Huang, Yuang Chen, Chenhao Zheng, Jinnan Gao

Comments: 6 pages, 9 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1509] arXiv:2112.10071 (cross-list from eess.IV) [pdf, other]: Title: A New Image Codec Paradigm for Human and Machine Uses

Authors: Sien Chen, Jian Jin, Lili Meng, Weisi Lin, Zhuo Chen, Tsui-Shan Chang, Zhengguang Li, Huaxiang Zhang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1510] arXiv:2112.10074 (cross-list from eess.IV) [pdf, other]: Title: QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Scores and Benchmarking Results

Authors: Raghav Mehta, Angelos Filos, Ujjwal Baid, Chiharu Sako, Richard McKinley, Michael Rebsamen, Katrin Datwyler, Raphael Meier, Piotr Radojewski, Gowtham Krishnan Murugesan, Sahil Nalawade, Chandan Ganesh, Ben Wagner, Fang F. Yu, Baowei Fei, Ananth J. Madhuranthakam, Joseph A. Maldjian, Laura Daza, Catalina Gomez, Pablo Arbelaez, Chengliang Dai, Shuo Wang, Hadrien Reynaud, Yuan-han Mo, Elsa Angelini, Yike Guo, Wenjia Bai, Subhashis Banerjee, Lin-min Pei, Murat AK, Sarahi Rosas-Gonzalez, Ilyess Zemmoura, Clovis Tauber, Minh H. Vu, Tufve Nyholm, Tommy Lofstedt, Laura Mora Ballestar, Veronica Vilaplana, Hugh McHugh, Gonzalo Maso Talou, Alan Wang, Jay Patel, Ken Chang, Katharina Hoebel, Mishka Gidwani, Nishanth Arun, Sharut Gupta, Mehak Aggarwal, Praveer Singh, Elizabeth R. Gerstner, Jayashree Kalpathy-Cramer, et al. (41 additional authors not shown)

Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA): this https URL

Journal-ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1511] arXiv:2112.10184 (cross-list from eess.IV) [pdf, ps, other]: Title: A Deep Learning Based Workflow for Detection of Lung Nodules With Chest Radiograph

Authors: Yang Tai, Yu-Wen Fang (Same contribution), Fang-Yi Su, Jung-Hsien Chiang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1512] arXiv:2112.10307 (cross-list from eess.IV) [pdf, other]: Title: Skin lesion segmentation and classification using deep learning and handcrafted features

Authors: Redha Ali, Hussin K. Ragb

Comments: 7 pages, 3 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1513] arXiv:2112.10325 (cross-list from eess.IV) [pdf, other]: Title: Incremental Cross-view Mutual Distillation for Self-supervised Medical CT Synthesis

Authors: Chaowei Fang, Liang Wang, Dingwen Zhang, Jun Xu, Yixuan Yuan, Junwei Han

Comments: Accepted by CVPR2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1514] arXiv:2112.10368 (cross-list from eess.IV) [pdf, other]: Title: Deep Co-supervision and Attention Fusion Strategy for Automatic COVID-19 Lung Infection Segmentation on CT Images

Authors: Haigen Hu, Leizhao Shen, Qiu Guan, Xiaoxin Li, Qianwei Zhou, Su Ruan

Journal-ref: Pattern Recognition, 2022, 124:108452

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1515] arXiv:2112.10541 (cross-list from eess.IV) [pdf, ps, other]: Title: Implicit Neural Representation Learning for Hyperspectral Image Super-Resolution

Authors: Kaiwei Zhang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1516] arXiv:2112.10652 (cross-list from eess.IV) [pdf, other]: Title: HyperSegNAS: Bridging One-Shot Neural Architecture Search with 3D Medical Image Segmentation using HyperNet

Authors: Cheng Peng, Andriy Myronenko, Ali Hatamizadeh, Vish Nath, Md Mahfuzur Rahman Siddiquee, Yufan He, Daguang Xu, Rama Chellappa, Dong Yang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1517] arXiv:2112.10755 (cross-list from math.DS) [pdf, other]: Title: Discovering State Variables Hidden in Experimental Data

Authors: Boyuan Chen, Kuang Huang, Sunand Raghupathi, Ishaan Chandratreya, Qiang Du, Hod Lipson

Comments: Project website with code, data, and overview video is at: this https URL

Subjects: Dynamical Systems (math.DS); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY); Applied Physics (physics.app-ph)
[1518] arXiv:2112.10775 (cross-list from eess.IV) [pdf, other]: Title: HarmoFL: Harmonizing Local and Global Drifts in Federated Learning on Heterogeneous Medical Images

Authors: Meirui Jiang, Zirui Wang, Qi Dou

Comments: Accepted at AAAI 2022

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1519] arXiv:2112.11065 (cross-list from eess.IV) [pdf, other]: Title: Leveraging Image Complexity in Macro-Level Neural Network Design for Medical Image Segmentation

Authors: Tariq M. Khan, Syed S. Naqvi, Erik Meijering

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1520] arXiv:2112.11078 (cross-list from eess.IV) [pdf, other]: Title: RC-Net: A Convolutional Neural Network for Retinal Vessel Segmentation

Authors: Tariq M Khan, Antonio Robles-Kelly, Syed S. Naqvi

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1521] arXiv:2112.11381 (cross-list from eess.IV) [pdf, ps, other]: Title: A novel approach for the automated segmentation and volume quantification of cardiac fats on computed tomography

Authors: Érick Oliveira Rodrigues, FFC Morais, NAOS Morais, LS Conci, LV Neto, Aura Conci

Comments: Computer Methods and Programs in Biomedicine, 2016

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1522] arXiv:2112.11541 (cross-list from eess.IV) [pdf, other]: Title: Teacher-Student Architecture for Mixed Supervised Lung Tumor Segmentation

Authors: Vemund Fredriksen, Svein Ole M. Svele, André Pedersen, Thomas Langø, Gabriel Kiss, Frank Lindseth

Comments: 17 pages, 3 figures, 5 tables, submitted to journal

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1523] arXiv:2112.11833 (cross-list from eess.IV) [pdf, other]: Title: Deep learning for brain metastasis detection and segmentation in longitudinal MRI data

Authors: Yixing Huang, Christoph Bert, Philipp Sommer, Benjamin Frey, Udo Gaipl, Luitpold V. Distel, Thomas Weissmann, Michael Uder, Manuel A. Schmidt, Arnd Dörfler, Andreas Maier, Rainer Fietkau, Florian Putz

Comments: Implementation is available to public at this https URL

Journal-ref: Medical Physics 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1524] arXiv:2112.12021 (cross-list from eess.IV) [pdf, other]: Title: Community Detection in Medical Image Datasets: Using Wavelets and Spectral Methods

Authors: Roozbeh Yousefzadeh

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1525] arXiv:2112.12386 (cross-list from eess.IV) [pdf, other]: Title: KFWC: A Knowledge-Driven Deep Learning Model for Fine-grained Classification of Wet-AMD

Authors: Haihong E, Jiawen He, Tianyi Hu, Lifei Wang, Lifei Yuan, Ruru Zhang, Meina Song

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1526] arXiv:2112.12560 (cross-list from eess.IV) [pdf, other]: Title: On the relationship between calibrated predictors and unbiased volume estimation

Authors: Teodora Popordanoska, Jeroen Bertels, Dirk Vandermeulen, Frederik Maes, Matthew B. Blaschko

Comments: Published at MICCAI 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1527] arXiv:2112.12609 (cross-list from eess.IV) [pdf, ps, other]: Title: Predição da Idade Cerebral a partir de Imagens de Ressonância Magnética utilizando Redes Neurais Convolucionais

Authors: Victor H. R. Oliveira, Augusto Antunes, Alexandre S. Soares, Arthur D. Reys, Robson Z. Júnior, Saulo D. S. Pedro, Danilo Silva

Comments: 3 pages, 3 figures, in Portuguese, accepted at XVIII Congresso Brasileiro de Informática em Saúde (CBIS 2021)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1528] arXiv:2112.12660 (cross-list from eess.IV) [pdf, other]: Title: InDuDoNet+: A Deep Unfolding Dual Domain Network for Metal Artifact Reduction in CT Images

Authors: Hong Wang, Yuexiang Li, Haimiao Zhang, Deyu Meng, Yefeng Zheng

Journal-ref: Medical Image Analysis 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1529] arXiv:2112.12665 (cross-list from eess.IV) [pdf, other]: Title: Omni-Seg: A Single Dynamic Network for Multi-label Renal Pathology Image Segmentation using Partially Labeled Data

Authors: Ruining Deng, Quan Liu, Can Cui, Zuhayr Asad, Haichun Yang, Yuankai Huo

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1530] arXiv:2112.12744 (cross-list from eess.IV) [pdf, ps, other]: Title: AI-based Reconstruction for Fast MRI -- A Systematic Review and Meta-analysis

Authors: Yutong Chen, Carola-Bibiane Schönlieb, Pietro Liò, Tim Leiner, Pier Luigi Dragotti, Ge Wang, Daniel Rueckert, David Firmin, Guang Yang

Comments: 42 pages, 5 figures, Proceedings of the IEEE

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1531] arXiv:2112.12810 (cross-list from eess.IV) [pdf, ps, other]: Title: Self-Attention Generative Adversarial Network for Iterative Reconstruction of CT Images

Authors: Ruiwen Xing, Thomas Humphries, Dong Si

Comments: 16 pages, 8 figures, 5 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1532] arXiv:2112.12839 (cross-list from q-bio.QM) [pdf, ps, other]: Title: Faster Deep Ensemble Averaging for Quantification of DNA Damage from Comet Assay Images With Uncertainty Estimates

Authors: Srikanth Namuduri, Prateek Mehta, Lise Barbe, Stephanie Lam, Zohreh Faghihmonzavi, Steve Finkbeiner, Shekhar Bhansali

Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1533] arXiv:2112.13054 (cross-list from eess.IV) [pdf, other]: Title: Generalized Wasserstein Dice Loss, Test-time Augmentation, and Transformers for the BraTS 2021 challenge

Authors: Lucas Fidon, Suprosanna Shit, Ivan Ezhov, Johannes C. Paetzold, Sébastien Ourselin, Tom Vercauteren

Comments: BraTS 2021 challenge

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1534] arXiv:2112.13110 (cross-list from eess.SP) [pdf, other]: Title: Ultrasound Speckle Suppression and Denoising using MRI-derived Normalizing Flow Priors

Authors: Vincent van de Schaft, Ruud J.G. van Sloun

Comments: 10 pages, 8 figures

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[1535] arXiv:2112.13191 (cross-list from eess.IV) [pdf, other]: Title: DSRGAN: Detail Prior-Assisted Perceptual Single Image Super-Resolution via Generative Adversarial Networks

Authors: Ziyang Liu, Zhengguo Li, Xingming Wu, Zhong Liu, Weihai Chen

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1536] arXiv:2112.13194 (cross-list from eess.IV) [pdf, other]: Title: Network-Aware 5G Edge Computing for Object Detection: Augmenting Wearables to "See" More, Farther and Faster

Authors: Zhongzheng Yuan, Tommy Azzino, Yu Hao, Yixuan Lyu, Haoyang Pei, Alain Boldini, Marco Mezzavilla, Mahya Beheshti, Maurizio Porfiri, Todd Hudson, William Seiple, Yi Fang, Sundeep Rangan, Yao Wang, J.R. Rizzo

Comments: Published in: IEEE Access (Volume: 10)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1537] arXiv:2112.13227 (cross-list from eess.IV) [pdf, other]: Title: Pseudocylindrical Convolutions for Learned Omnidirectional Image Compression

Authors: Mu Li, Kede Ma, Jinxing Li, David Zhang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1538] arXiv:2112.13264 (cross-list from eess.IV) [pdf, ps, other]: Title: Artifact Reduction in Fundus Imaging using Cycle Consistent Adversarial Neural Networks

Authors: Sai Koushik S S, K.G. Srinivasa

Comments: 12 pages, 13 figures, draft paper

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1539] arXiv:2112.13309 (cross-list from eess.IV) [pdf, other]: Title: Learning Cross-Scale Weighted Prediction for Efficient Neural Video Compression

Authors: Zongyu Guo, Runsen Feng, Zhizheng Zhang, Xin Jin, Zhibo Chen

Comments: Preprint. Revised after peer review

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1540] arXiv:2112.13339 (cross-list from stat.ML) [pdf, other]: Title: Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives

Authors: Hideyuki Tachibana, Mocho Go, Muneyoshi Inahara, Yotaro Katayama, Yotaro Watanabe

Comments: Major update from 2112.13339v1. 47 pages, 24 figures

Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1541] arXiv:2112.13443 (cross-list from eess.IV) [pdf, other]: Title: Sinogram upsampling using Primal-Dual UNet for undersampled CT and radial MRI reconstruction

Authors: Philipp Ernst, Soumick Chatterjee, Georg Rose, Oliver Speck, Andreas Nürnberger

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1542] arXiv:2112.13513 (cross-list from eess.IV) [pdf, ps, other]: Title: MSHT: Multi-stage Hybrid Transformer for the ROSE Image Analysis of Pancreatic Cancer

Authors: Tianyi Zhang, Yunlu Feng, Yu Zhao, Guangda Fan, Aiming Yang, Shangqin Lyu, Peng Zhang, Fan Song, Chenbin Ma, Yangyang Sun, Youdan Feng, Guanglei Zhang

Comments: 12 pages, 10 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1543] arXiv:2112.13553 (cross-list from eess.IV) [pdf, ps, other]: Title: Classification of Histopathology Images of Lung Cancer Using Convolutional Neural Network (CNN)

Authors: Neha Baranwal, Preethi Doravari, Renu Kachhoria

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1544] arXiv:2112.13559 (cross-list from eess.IV) [pdf, other]: Title: DAM-AL: Dilated Attention Mechanism with Attention Loss for 3D Infant Brain Image Segmentation

Authors: Dinh-Hieu Hoang, Gia-Han Diep, Minh-Triet Tran, Ngan T.H Le

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1545] arXiv:2112.13595 (cross-list from eess.IV) [pdf, other]: Title: Depth estimation of endoscopy using sim-to-real transfer

Authors: Bong Hyuk Jeong, Hang Keun Kim, Young Don Son

Comments: 12 pages, 9 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1546] arXiv:2112.13626 (cross-list from eess.IV) [pdf, other]: Title: Generation of Synthetic Rat Brain MRI scans with a 3D Enhanced Alpha-GAN

Authors: André Ferreira (1), Ricardo Magalhães (2), Sébastien Mériaux (2), Victor Alves (1) ((1) Centro Algoritmi, University of Minho, Braga, Portugal, (2) Université Paris-Saclay, CEA, CNRS, BAOBAB, NeuroSpin, Gif-sur-Yvette, France)

Comments: 25 pages, 10 figures, 4 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1547] arXiv:2112.13637 (cross-list from eess.IV) [pdf, other]: Title: Self-normalized Classification of Parkinson's Disease DaTscan Images

Authors: Yuan Zhou, Hemant D. Tagare

Comments: To appear in IEEE BIBM 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1548] arXiv:2112.13686 (cross-list from eess.IV) [pdf, ps, other]: Title: Radiomic biomarker extracted from PI-RADS 3 patients support more efficient and robust prostate cancer diagnosis: a multi-center study

Authors: Longfei Li, Rui Yang, Xin Chen, Cheng Li, Hairong Zheng, Yusong Lin, Zaiyi Liu, Shanshan Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1549] arXiv:2112.13811 (cross-list from eess.IV) [pdf, other]: Title: Infant Brain Age Classification: 2D CNN Outperforms 3D CNN in Small Dataset

Authors: Mahdieh Shabanian, Markus Wenzel, John P. DeVincenzo

Comments: 8 pages, 5 figures, 3 tables. arXiv admin note: text overlap with arXiv:2010.03963

Journal-ref: SPIE 2022 Medical Imaging Conference

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1550] arXiv:2112.13850 (cross-list from econ.GN) [pdf, ps, other]: Title: Using maps to predict economic activity

Authors: Imryoung Jeong, Hyunjoo Yang

Comments: 24 pages including references and appendix, 9 figures, 1 table

Subjects: General Economics (econ.GN); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1551] arXiv:2112.13865 (cross-list from eess.IV) [pdf, other]: Title: Astronomical Image Colorization and upscaling with Generative Adversarial Networks

Authors: Shreyas Kalvankar, Hrushikesh Pandit, Pranav Parwate, Atharva Patil, Snehal Kamalapur

Comments: 14 pages, 10 figures, 7 tables

Subjects: Image and Video Processing (eess.IV); Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1552] arXiv:2112.13885 (cross-list from eess.IV) [pdf, other]: Title: MedShift: identifying shift data for medical dataset curation

Authors: Xiaoyuan Guo, Judy Wawira Gichoya, Hari Trivedi, Saptarshi Purkayastha, Imon Banerjee

Comments: 35 pages, 28 figures, 2 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1553] arXiv:2112.13893 (cross-list from eess.IV) [pdf, ps, other]: Title: Non-Reference Quality Monitoring of Digital Images using Gradient Statistics and Feedforward Neural Networks

Authors: Nisar Ahmed, Hafiz Muhammad Shahzad Asif, Hassan Khalid

Comments: Fifth International Conference on Aerospace Science & Engineering (ICASE 2017) (ICASE Proceedings, Page No. 300-305)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1554] arXiv:2112.14022 (cross-list from eess.IV) [pdf, other]: Title: Towards Low Light Enhancement with RAW Images

Authors: Haofeng Huang, Wenhan Yang, Yueyu Hu, Jiaying Liu, Ling-Yu Duan

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1555] arXiv:2112.14026 (cross-list from eess.IV) [pdf, ps, other]: Title: SECP-Net: SE-Connection Pyramid Network of Organ At Risk Segmentation for Nasopharyngeal Carcinoma

Authors: Zexi Huang (1), Lihua Guo (1), Xin Yang (2), Sijuan Huang (2) ((1) School of Electronic and Information Engineering, South China University of Technology, (2) Sun Yat-sen University Cancer Center)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1556] arXiv:2112.14320 (cross-list from eess.IV) [pdf, ps, other]: Title: Brain Tumor Classification by Cascaded Multiscale Multitask Learning Framework Based on Feature Aggregation

Authors: Zahra Sobhaninia, Nader Karimi, Pejman Khadivi, Shadrokh Samavi

Comments: 16 pages, 7 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1557] arXiv:2112.14340 (cross-list from eess.IV) [pdf, other]: Title: Super-Efficient Super Resolution for Fast Adversarial Defense at the Edge

Authors: Kartikeya Bhardwaj, Dibakar Gope, James Ward, Paul Whatmough, Danny Loh

Comments: This preprint is for personal use only. The official article will appear in proceedings of Design, Automation & Test in Europe (DATE), 2022, as part of the Special Initiative on Autonomous Systems Design (ASD)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1558] arXiv:2112.14555 (cross-list from eess.IV) [pdf, other]: Title: Onsite Non-Line-of-Sight Imaging via Online Calibrations

Authors: Zhengqing Pan, Ruiqian Li, Tian Gao, Zi Wang, Ping Liu, Siyuan Shen, Tao Wu, Jingyi Yu, Shiying Li

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1559] arXiv:2112.14608 (cross-list from eess.IV) [pdf, other]: Title: HPRN: Holistic Prior-embedded Relation Network for Spectral Super-Resolution

Authors: Chaoxiong Wu, Jiaojiao Li, Rui Song, Yunsong Li, Qian Du

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1560] arXiv:2112.14644 (cross-list from eess.IV) [pdf, ps, other]: Title: Implementation of Convolutional Neural Network Architecture on 3D Multiparametric Magnetic Resonance Imaging for Prostate Cancer Diagnosis

Authors: Ping-Chang Lin, Teodora Szasz, Hakizumwami B. Runesha

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1561] arXiv:2112.14768 (cross-list from eess.IV) [pdf, other]: Title: Video Reconstruction from a Single Motion Blurred Image using Learned Dynamic Phase Coding

Authors: Erez Yosef, Shay Elmalem, Raja Giryes

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1562] arXiv:2112.15009 (cross-list from eess.IV) [pdf, ps, other]: Title: Knowledge Matters: Radiology Report Generation with General and Specific Knowledge

Authors: Shuxin Yang, Xian Wu, Shen Ge, Shaohua Kevin Zhou, Li Xiao

Comments: Medical Image Analysis

Subjects: Image and Video Processing (eess.IV); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1563] arXiv:2112.15011 (cross-list from eess.IV) [pdf, other]: Title: Radiology Report Generation with a Learned Knowledge Base and Multi-modal Alignment

Authors: Shuxin Yang, Xian Wu, Shen Ge, S. Kevin Zhou, Li Xiao

Subjects: Image and Video Processing (eess.IV); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1564] arXiv:2112.15106 (cross-list from eess.IV) [pdf, other]: Title: Colour alignment for relative colour constancy via non-standard references

Authors: Yunfeng Zhao, Stuart Ferguson, Huiyu Zhou, Chris Elliott, Karen Rafferty

Comments: 14 pages, 8 figures, 2 tables, accepted by IEEE Transactions on Image Processing

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1565] arXiv:2112.15180 (cross-list from eess.IV) [pdf, other]: Title: A Resolution Enhancement Plug-in for Deformable Registration of Medical Images

Authors: Kaicong Sun, Sven Simon

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1566] arXiv:2112.15299 (cross-list from eess.IV) [pdf, other]: Title: CSformer: Bridging Convolution and Transformer for Compressive Sensing

Authors: Dongjie Ye, Zhangkai Ni, Hanli Wang, Jian Zhang, Shiqi Wang, Sam Kwong

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1567] arXiv:2112.15362 (cross-list from eess.IV) [pdf, other]: Title: Modeling Mask Uncertainty in Hyperspectral Image Reconstruction

Authors: Jiamian Wang, Yulun Zhang, Xin Yuan, Ziyi Meng, Zhiqiang Tao

Comments: ECCV 2022 Oral Paper

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1568] arXiv:2112.15367 (cross-list from eess.IV) [pdf, other]: Title: Weakly Supervised Change Detection Using Guided Anisotropic Diffusion

Authors: Rodrigo Caye Daudt, Bertrand Le Saux, Alexandre Boulch, Yann Gousseau

Comments: Machine Learning Journal 2021. arXiv admin note: substantial text overlap with arXiv:1904.08208

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1569] arXiv:2112.15386 (cross-list from eess.IV) [pdf, other]: Title: Efficient Single Image Super-Resolution Using Dual Path Connections with Multiple Scale Learning

Authors: Bin-Cheng Yang, Gangshan Wu

Comments: 21 pages, 9 figures, 5 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1570] arXiv:2112.15523 (cross-list from eess.IV) [pdf, ps, other]: Title: Transfer learning for cancer diagnosis in histopathological images

Authors: Sandhya Aneja, Nagender Aneja, Pg Emeroylariffion Abas, Abdul Ghani Naim

Journal-ref: IAES International Journal of Artificial Intelligence (IJ-AI), Vol. 11, No. 1, March 2022, pp. 129~136

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
