Computer Vision and Pattern Recognition

Authors and titles for cs.CV in Jan 2022

Total of 1140 entries

[1] arXiv:2201.00043 [pdf, other]: Title: Multi-Dimensional Model Compression of Vision Transformer

Authors: Zejiang Hou, Sun-Yuan Kung

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2] arXiv:2201.00059 [pdf, other]: Title: iCaps: Iterative Category-level Object Pose and Shape Estimation

Authors: Xinke Deng, Junyi Geng, Timothy Bretl, Yu Xiang, Dieter Fox

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[3] arXiv:2201.00080 [pdf, other]: Title: PatchTrack: Multiple Object Tracking Using Frame Patches

Authors: Xiaotong Chen, Seyed Mehdi Iranmanesh, Kuo-Chin Lien

Comments: 11 pages, 4 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[4] arXiv:2201.00095 [pdf, other]: Title: Computer Vision Based Parking Optimization System

Authors: Siddharth Chandrasekaran, Jeffrey Matthew Reginald, Wei Wang, Ting Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[5] arXiv:2201.00096 [pdf, other]: Title: SalyPath360: Saliency and Scanpath Prediction Framework for Omnidirectional Images

Authors: Mohamed Amine Kerkouri, Marouane Tliba, Aladine Chetouani, Mohamed Sayeh

Comments: Accepted at Electronic Imaging Symposium 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[6] arXiv:2201.00097 [pdf, other]: Title: Adversarial Attack via Dual-Stage Network Erosion

Authors: Yexin Duan, Junhua Zou, Xingyu Zhou, Wu Zhang, Jin Zhang, Zhisong Pan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[7] arXiv:2201.00103 [pdf, other]: Title: Robust Region Feature Synthesizer for Zero-Shot Object Detection

Authors: Peiliang Huang, Junwei Han, De Cheng, Dingwen Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[8] arXiv:2201.00107 [pdf, other]: Title: Quality-aware Part Models for Occluded Person Re-identification

Authors: Pengfei Wang, Changxing Ding, Zhiyin Shao, Zhibin Hong, Shengli Zhang, Dacheng Tao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[9] arXiv:2201.00112 [pdf, other]: Title: SurfGen: Adversarial 3D Shape Synthesis with Explicit Surface Discriminators

Authors: Andrew Luo, Tianqin Li, Wen-Hao Zhang, Tai Sing Lee

Comments: ICCV 2021. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[10] arXiv:2201.00132 [pdf, other]: Title: SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

Authors: Bao Hieu Tran, Thanh Le-Cong, Huu Manh Nguyen, Duc Anh Le, Thanh Hung Nguyen, Phi Le Nguyen

Comments: Accepted to ICMLA 2020

Journal-ref: 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[11] arXiv:2201.00177 [pdf, other]: Title: Adaptive Image Inpainting

Authors: Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[12] arXiv:2201.00220 [pdf, other]: Title: Turath-150K: Image Database of Arab Heritage

Authors: Dani Kiyasseh, Rasheed El-Bouri

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[13] arXiv:2201.00239 [pdf, other]: Title: SporeAgent: Reinforced Scene-level Plausibility for Object Pose Refinement

Authors: Dominik Bauer, Timothy Patten, Markus Vincze

Comments: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[14] arXiv:2201.00267 [pdf, other]: Title: On the Cross-dataset Generalization in License Plate Recognition

Authors: Rayson Laroca, Everton V. Cardoso, Diego R. Lucio, Valter Estevam, David Menotti

Comments: Accepted for presentation at the International Conference on Computer Vision Theory and Applications (VISAPP) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[15] arXiv:2201.00323 [pdf, other]: Title: V-LinkNet: Learning Contextual Inpainting Across Latent Space of Generative Adversarial Network

Authors: Jireh Jam, Connah Kendrick, Vincent Drouard, Kevin Walker, Moi Hoon Yap

Comments: 13 pages including references, 9 figures and 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[16] arXiv:2201.00346 [pdf, other]: Title: Detail-Preserving Transformer for Light Field Image Super-Resolution

Authors: Shunzhou Wang, Tianfei Zhou, Yao Lu, Huijun Di

Comments: AAAI2022, Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[17] arXiv:2201.00377 [pdf, other]: Title: Parkour Spot ID: Feature Matching in Satellite and Street view images using Deep Learning

Authors: João Morais, Kaushal Rathi, Bhuvaneshwar Mohan, Shantanu Rajesh

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[18] arXiv:2201.00392 [pdf, other]: Title: Fast and High-Quality Image Denoising via Malleable Convolutions

Authors: Yifan Jiang, Bartlomiej Wronski, Ben Mildenhall, Jonathan T. Barron, Zhangyang Wang, Tianfan Xue

Comments: Accepted by ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[19] arXiv:2201.00411 [pdf, other]: Title: The Introspective Agent: Interdependence of Strategy, Physiology, and Sensing for Embodied Agents

Authors: Sarah Pratt, Luca Weihs, Ali Farhadi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[20] arXiv:2201.00424 [pdf, other]: Title: Splicing ViT Features for Semantic Appearance Transfer

Authors: Narek Tumanyan, Omer Bar-Tal, Shai Bagon, Tali Dekel

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[21] arXiv:2201.00434 [pdf, other]: Title: TVNet: Temporal Voting Network for Action Localization

Authors: Hanyuan Wang, Dima Damen, Majid Mirmehdi, Toby Perrett

Comments: 9 pages, 7 figures, 11 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[22] arXiv:2201.00439 [pdf, other]: Title: Salient Object Detection by LTP Texture Characterization on Opposing Color Pairs under SLICO Superpixel Constraint

Authors: Didier Ndayikengurukiye, Max Mignotte

Journal-ref: J. Imaging 2022, 8(4), 110

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[23] arXiv:2201.00443 [pdf, other]: Title: Scene Graph Generation: A Comprehensive Survey

Authors: Guangming Zhu, Liang Zhang, Youliang Jiang, Yixuan Dang, Haoran Hou, Peiyi Shen, Mingtao Feng, Xia Zhao, Qiguang Miao, Syed Afaq Ali Shah, Mohammed Bennamoun

Comments: Submitted to TPAMI

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[24] arXiv:2201.00454 [pdf, other]: Title: Memory-Guided Semantic Learning Network for Temporal Sentence Grounding

Authors: Daizong Liu, Xiaoye Qu, Xing Di, Yu Cheng, Zichuan Xu, Pan Zhou

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[25] arXiv:2201.00457 [pdf, other]: Title: Exploring Motion and Appearance Information for Temporal Sentence Grounding

Authors: Daizong Liu, Xiaoye Qu, Pan Zhou, Yang Liu

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[26] arXiv:2201.00461 [pdf, other]: Title: Biometrics in the Time of Pandemic: 40% Masked Face Recognition Degradation can be Reduced to 2%

Authors: Leonardo Queiroz, Kenneth Lai, Svetlana Yanushkevich, Vlad Shmerko

Comments: 11 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[27] arXiv:2201.00462 [pdf, other]: Title: D-Former: A U-shaped Dilated Transformer for 3D Medical Image Segmentation

Authors: Yixuan Wu, Kuanlun Liao, Jintai Chen, Jinhong Wang, Danny Z. Chen, Honghao Gao, Jian Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[28] arXiv:2201.00467 [pdf, other]: Title: maskGRU: Tracking Small Objects in the Presence of Large Background Motions

Authors: Constantine J. Roros, Avinash C. Kak

Comments: 12 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[29] arXiv:2201.00471 [pdf, other]: Title: Revisiting Open World Object Detection

Authors: Xiaowei Zhao, Xianglong Liu, Yifan Shen, Yixuan Qiao, Yuqing Ma, Duorui Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[30] arXiv:2201.00475 [pdf, other]: Title: CaFT: Clustering and Filter on Tokens of Transformer for Weakly Supervised Object Localization

Authors: Ming Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[31] arXiv:2201.00487 [pdf, other]: Title: Language as Queries for Referring Video Object Segmentation

Authors: Jiannan Wu, Yi Jiang, Peize Sun, Zehuan Yuan, Ping Luo

Comments: 14 pages, accepted by CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[32] arXiv:2201.00504 [pdf, ps, other]: Title: R-Theta Local Neighborhood Pattern for Unconstrained Facial Image Recognition and Retrieval

Authors: Soumendu Chakraborty, Satish Kumar Singh, Pavan Chakraborty

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[33] arXiv:2201.00509 [pdf, ps, other]: Title: Local Gradient Hexa Pattern: A Descriptor for Face Recognition and Retrieval

Authors: Soumendu Chakraborty, Satish Kumar Singh, Pavan Chakraborty

Journal-ref: IEEE Transactions on Circuits and Systems for Video Technology, vol-28, no-1, pp. 171-180, (2018). ISSN/ISBN: 1051-8215

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[34] arXiv:2201.00518 [pdf, ps, other]: Title: Cascaded Asymmetric Local Pattern: A Novel Descriptor for Unconstrained Facial Image Recognition and Retrieval

Authors: Soumendu Chakraborty, Satish Kumar Singh, Pavan Chakraborty

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[35] arXiv:2201.00520 [pdf, other]: Title: Vision Transformer with Deformable Attention

Authors: Zhuofan Xia, Xuran Pan, Shiji Song, Li Erran Li, Gao Huang

Comments: Accepted by CVPR2022 (12 pages, 7 figures)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[36] arXiv:2201.00531 [pdf, other]: Title: Novelty-based Generalization Evaluation for Traffic Light Detection

Authors: Arvind Kumar Shekar, Laureen Lake, Liang Gou, Liu Ren

Comments: Accepted/Presented at ICMLA 2021

Journal-ref: 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[37] arXiv:2201.00572 [pdf, other]: Title: Enabling Verification of Deep Neural Networks in Perception Tasks Using Fuzzy Logic and Concept Embeddings

Authors: Gesina Schwalbe, Christian Wirth, Ute Schmid

Comments: 32 pages (including 14 pages supplemental material), 11 Figures, 8 Tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[38] arXiv:2201.00577 [pdf, other]: Title: Semantically Grounded Visual Embeddings for Zero-Shot Learning

Authors: Shah Nawaz, Jacopo Cavazza, Alessio Del Bue

Comments: Accepted at CVPRW

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[39] arXiv:2201.00625 [pdf, other]: Title: GAT-CADNet: Graph Attention Network for Panoptic Symbol Spotting in CAD Drawings

Authors: Zhaohua Zheng, Jianfang Li, Lingjie Zhu, Honghua Li, Frank Petzold, Ping Tan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[40] arXiv:2201.00672 [pdf, other]: Title: Compression-Resistant Backdoor Attack against Deep Neural Networks

Authors: Mingfu Xue, Xin Wang, Shichang Sun, Yushu Zhang, Jian Wang, Weiqiang Liu

Journal-ref: Applied Intelligence, 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41] arXiv:2201.00708 [pdf, other]: Title: Multiview point cloud registration with anisotropic and space-varying localization noise

Authors: Denis Fortun, Etienne Baudrier, Fabian Zwettler, Markus Sauer, Sylvain Faisan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[42] arXiv:2201.00714 [pdf, other]: Title: Multi-view Data Classification with a Label-driven Auto-weighted Strategy

Authors: Yuyuan Yu, Guoxu Zhou, Haonan Huang, Shengli Xie, Qibin Zhao

Comments: 11 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[43] arXiv:2201.00770 [pdf, other]: Title: FaceQgen: Semi-Supervised Deep Learning for Face Image Quality Assessment

Authors: Javier Hernandez-Ortega, Julian Fierrez, Ignacio Serna, Aythami Morales

Journal-ref: IEEE International Conference on Automatic Face and Gesture Recognition 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[44] arXiv:2201.00785 [pdf, other]: Title: Implicit Autoencoder for Point-Cloud Self-Supervised Representation Learning

Authors: Siming Yan, Zhenpei Yang, Haoxiang Li, Chen Song, Li Guan, Hao Kang, Gang Hua, Qixing Huang

Comments: Published in ICCV 2023. The code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[45] arXiv:2201.00791 [pdf, other]: Title: DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering

Authors: Shunyu Yao, RuiZhe Zhong, Yichao Yan, Guangtao Zhai, Xiaokang Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[46] arXiv:2201.00814 [pdf, other]: Title: Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space

Authors: Arnav Chavan, Zhiqiang Shen, Zhuang Liu, Zechun Liu, Kwang-Ting Cheng, Eric Xing

Comments: CVPR 2022. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[47] arXiv:2201.00848 [pdf, ps, other]: Title: Runway Extraction and Improved Mapping from Space Imagery

Authors: David A. Noever

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[48] arXiv:2201.00877 [pdf, other]: Title: Gaussian-Hermite Moment Invariants of General Multi-Channel Functions

Authors: Hanlin Mo, Hua Li, Guoying Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[49] arXiv:2201.00893 [pdf, other]: Title: Rice Diseases Detection and Classification Using Attention Based Neural Network and Bayesian Optimization

Authors: Yibin Wang, Haifeng Wang, Zhaohua Peng

Journal-ref: Expert Systems with Applications, 178, 114770. (2021)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50] arXiv:2201.00947 [pdf, other]: Title: HWRCNet: Handwritten Word Recognition in JPEG Compressed Domain using CNN-BiLSTM Network

Authors: Bulla Rajesh, Abhishek Kumar Gupta, Ayush Raj, Mohammed Javed, Shiv Ram Dubey

Comments: Accepted in International Conference on Data Analytics and Learning, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[51] arXiv:2201.00966 [pdf, ps, other]: Title: AI visualization in Nanoscale Microscopy

Authors: Rajagopal A (1), Nirmala V (2), Andrew J (3), Arun Muthuraj Vedamanickam. ((1) Indian Institute of Technology Madras, (2) Queen Marys College, (3) Karunya Institute of Technology and Sciences. India)

Comments: Best paper award at International Conference On Big Data, Machine Learning and Applications 2021. In Springer Proceedings 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[52] arXiv:2201.00969 [pdf, ps, other]: Title: Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety

Authors: Rajagopal A, Nirmala V, Arun Muthuraj Vedamanickam

Comments: In Springer Proceedings. International Conference On Big Data, Machine Learning and Applications 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[53] arXiv:2201.00975 [pdf, other]: Title: StyleM: Stylized Metrics for Image Captioning Built with Contrastive N-grams

Authors: Chengxi Li, Brent Harrison

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[54] arXiv:2201.00977 [pdf, other]: Title: Underwater Object Classification and Detection: first results and open challenges

Authors: Andre Jesus, Claudio Zito, Claudio Tortorici, Eloy Roura, Giulia De Masi

Journal-ref: In Proceedings of OCEANS 2022 Chennai, February 21-24, pp. 1-6

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[55] arXiv:2201.00978 [pdf, other]: Title: PyramidTNT: Improved Transformer-in-Transformer Baselines with Pyramid Architecture

Authors: Kai Han, Jianyuan Guo, Yehui Tang, Yunhe Wang

Comments: Tech Report. An extension of "Transformer in Transformer" (arXiv:2103.00112)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[56] arXiv:2201.00985 [pdf, other]: Title: Variational Stacked Local Attention Networks for Diverse Video Captioning

Authors: Tonmoay Deb, Akib Sadmanee, Kishor Kumar Bhaumik, Amin Ahsan Ali, M Ashraful Amin, A K M Mahbubur Rahman

Comments: To be published in Winter Conference on Applications of Computer Vision 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[57] arXiv:2201.01001 [pdf, other]: Title: Attention Mechanism Meets with Hybrid Dense Network for Hyperspectral Image Classification

Authors: Muhammad Ahmad, Adil Mehmood Khan, Manuel Mazzara, Salvatore Distefano, Swalpa Kumar Roy, Xin Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[58] arXiv:2201.01002 [pdf, ps, other]: Title: Multi-Representation Adaptation Network for Cross-domain Image Classification

Authors: Yongchun Zhu, Fuzhen Zhuang, Jindong Wang, Jingwu Chen, Zhiping Shi, Wenjuan Wu, Qing He

Comments: Neural Networks regular paper. Transfer Learning, Domain Adaptation

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[59] arXiv:2201.01008 [pdf, other]: Title: Learning to Generate Novel Classes for Deep Metric Learning

Authors: Kyungmoon Lee, Sungyeon Kim, Seunghoon Hong, Suha Kwak

Comments: Accepted to BMVC 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[60] arXiv:2201.01016 [pdf, other]: Title: Detailed Facial Geometry Recovery from Multi-View Images by Learning an Implicit Function

Authors: Yunze Xiao, Hao Zhu, Haotian Yang, Zhengyu Diao, Xiangju Lu, Xun Cao

Comments: AAAI 2022 Oral, updated to camera ready version

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[61] arXiv:2201.01029 [pdf, other]: Title: Weakly-supervised continual learning for class-incremental segmentation

Authors: Gaston Lenczner, Adrien Chan-Hon-Tong, Nicola Luminari, Bertrand Le Saux

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[62] arXiv:2201.01030 [pdf, other]: Title: A Robust Visual Sampling Model Inspired by Receptive Field

Authors: Liwen Hu, Lei Ma, Dawei Weng, Tiejun Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[63] arXiv:2201.01046 [pdf, other]: Title: Sound and Visual Representation Learning with Multiple Pretraining Tasks

Authors: Arun Balajee Vasudevan, Dengxin Dai, Luc Van Gool

Comments: 11 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[64] arXiv:2201.01047 [pdf, other]: Title: DIAL: Deep Interactive and Active Learning for Semantic Segmentation in Remote Sensing

Authors: Gaston Lenczner, Adrien Chan-Hon-Tong, Bertrand Le Saux, Nicola Luminari, Guy Le Besnerais

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[65] arXiv:2201.01073 [pdf, other]: Title: Towards Unsupervised Open World Semantic Segmentation

Authors: Svenja Uhlemeyer, Matthias Rottmann, Hanno Gottschalk

Comments: UAI 2022, published in PMLR, Proceedings of the Thirty-Eighth Conference on Uncertainty in Artificial Intelligence

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[66] arXiv:2201.01080 [pdf, other]: Title: Towards Understanding and Harnessing the Effect of Image Transformation in Adversarial Detection

Authors: Hui Liu, Bo Zhao, Yuefeng Peng, Weidong Li, Peng Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[67] arXiv:2201.01081 [pdf, ps, other]: Title: Identifying the exterior image of buildings on a 3D map and extracting elevation information using deep learning and digital image processing

Authors: Donghwa Shon, Byeongjoon Noh, Nahyang Byun

Comments: 16 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[68] arXiv:2201.01087 [pdf, other]: Title: Learning Quality-aware Representation for Multi-person Pose Regression

Authors: Yabo Xiao, Dongdong Yu, Xiaojuan Wang, Lei Jin, Guoli Wang, Qian Zhang

Comments: Accepted by AAAI2022; Slightly different compared with the camera-ready version

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[69] arXiv:2201.01090 [pdf, other]: Title: Short Range Correlation Transformer for Occluded Person Re-Identification

Authors: Yunbin Zhao, Songhao Zhu, Dongsheng Wang, Zhiwei Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[70] arXiv:2201.01102 [pdf, other]: Title: Towards Transferable Unrestricted Adversarial Examples with Minimum Changes

Authors: Fangcheng Liu, Chao Zhang, Hongyang Zhang

Comments: Accepted at SaTML 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[71] arXiv:2201.01115 [pdf, other]: Title: Data Augmentation for Depression Detection Using Skeleton-Based Gait Information

Authors: Jingjing Yang, Haifeng Lu, Chengming Li, Xiping Hu, Bin Hu

Comments: 10 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72] arXiv:2201.01191 [pdf, ps, other]: Title: Automated 3D reconstruction of LoD2 and LoD1 models for all 10 million buildings of the Netherlands

Authors: Ravi Peters, Balázs Dukai, Stelios Vitalis, Jordi van Liempt, Jantien Stoter

Comments: Submitted to Journal of Photogrammetric Engineering & Remote Sensing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[73] arXiv:2201.01275 [pdf, ps, other]: Title: Local Quadruple Pattern: A Novel Descriptor for Facial Image Recognition and Retrieval

Authors: Soumendu Chakraborty, Satish Kumar Singh, Pavan Chakraborty

Comments: arXiv admin note: substantial text overlap with arXiv:2201.00504, arXiv:2201.00511

Journal-ref: Computers & Electrical Engineering, vol-62, pp. 92-104, (2017). (Elsevier) ISSN/ISBN: 0045-7906

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[74] arXiv:2201.01276 [pdf, ps, other]: Title: Local Directional Gradient Pattern: A Local Descriptor for Face Recognition

Authors: Soumendu Chakraborty, Satish Kumar Singh, Pavan Chakraborty

Journal-ref: Multimedia Tools and Applications, vol-76, no-1, pp. 1201-1216, (2017). (Springer) ISSN/ISBN: 1573-7721

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[75] arXiv:2201.01283 [pdf, other]: Title: Self-supervised Learning from 100 Million Medical Images

Authors: Florin C. Ghesu, Bogdan Georgescu, Awais Mansoor, Youngjin Yoo, Dominik Neumann, Pragneshkumar Patel, R.S. Vishwanath, James M. Balter, Yue Cao, Sasa Grbic, Dorin Comaniciu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[76] arXiv:2201.01293 [pdf, other]: Title: A Transformer-Based Siamese Network for Change Detection

Authors: Wele Gedara Chaminda Bandara, Vishal M. Patel

Comments: Accepted to International Geoscience and Remote Sensing Symposium (IGARSS), 2022. 4 pages, 2 figures. Code & trained models are available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77] arXiv:2201.01294 [pdf, other]: Title: 3DVSR: 3D EPI Volume-based Approach for Angular and Spatial Light field Image Super-resolution

Authors: Trung-Hieu Tran, Jan Berberich, Sven Simon

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[78] arXiv:2201.01297 [pdf, other]: Title: Online Multi-Object Tracking with Unsupervised Re-Identification Learning and Occlusion Estimation

Authors: Qiankun Liu, Dongdong Chen, Qi Chu, Lu Yuan, Bin Liu, Lei Zhang, Nenghai Yu

Comments: To Appear at Neurocomputing 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[79] arXiv:2201.01391 [pdf, other]: Title: Self-Supervised Approach to Addressing Zero-Shot Learning Problem

Authors: Ademola Okerinde, Sam Hoggatt, Divya Vani Lakkireddy, Nolan Brubaker, William Hsu, Lior Shamir, Brian Spiesman

Journal-ref: The 4th International Conference on Computing and Data Science (CONF-CDS 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[80] arXiv:2201.01399 [pdf, other]: Title: Corrupting Data to Remove Deceptive Perturbation: Using Preprocessing Method to Improve System Robustness

Authors: Hieu Le, Hans Walker, Dung Tran, Peter Chin

Comments: CSCI 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[81] arXiv:2201.01408 [pdf, other]: Title: Fusing Convolutional Neural Network and Geometric Constraint for Image-based Indoor Localization

Authors: Jingwei Song, Mitesh Patel, Maani Ghaffari

Comments: Accepted by IEEE robotics and automation letters

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[82] arXiv:2201.01410 [pdf, other]: Title: Synthesizing Tensor Transformations for Visual Self-attention

Authors: Xian Wei, Xihao Wang, Hai Lan, JiaMing Lei, Yanhui Huang, Hui Yu, Jian Yang

Comments: 13 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[83] arXiv:2201.01415 [pdf, other]: Title: Problem-dependent attention and effort in neural networks with applications to image resolution and model selection

Authors: Chris Rohlfs

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[84] arXiv:2201.01416 [pdf, other]: Title: Latent Vector Expansion using Autoencoder for Anomaly Detection

Authors: UJu Gim, YeongHyeon Park

Comments: 3 pages, 2 figures, In Proceedings of the 34th Workshop on Image Processing and Image Understanding (IPIU 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[85] arXiv:2201.01427 [pdf, other]: Title: Attention-based Dual Supervised Decoder for RGBD Semantic Segmentation

Authors: Yang Zhang, Yang Yang, Chenyun Xiong, Guodong Sun, Yanwen Guo

Comments: 12 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[86] arXiv:2201.01486 [pdf, ps, other]: Title: Sign Language Recognition System using TensorFlow Object Detection API

Authors: Sharvani Srivastava, Amisha Gangwar, Richa Mishra, Sudhakar Singh

Comments: 14 pages, 5 figures, ANTIC 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[87] arXiv:2201.01494 [pdf, other]: Title: Improving Object Detection, Multi-object Tracking, and Re-Identification for Disaster Response Drones

Authors: Chongkeun Paik, Hyunwoo J. Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88] arXiv:2201.01501 [pdf, other]: Title: Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation

Authors: Rui Peng, Rongjie Wang, Zhenyu Wang, Yawen Lai, Ronggang Wang

Comments: CVPR 2022 Accepted

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[89] arXiv:2201.01503 [pdf, other]: Title: Towards Uniform Point Distribution in Feature-preserving Point Cloud Filtering

Authors: Shuaijun Chen, Jinxi Wang, Wei Pan, Shang Gao, Meili Wang, Xuequan Lu

Comments: This paper is accepted to CVM

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
[90] arXiv:2201.01565 [pdf, other]: Title: Culture-to-Culture Image Translation and User Evaluation

Authors: Giulia Zaino, Carmine Tommaso Recchiuto, Antonio Sgorbissa

Comments: 31 pages (bibliography excluded), 4 figures, 6 Tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[91] arXiv:2201.01592 [pdf, other]: Title: Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network with Graph Representation Learning

Authors: Xingqun Qi, Muyi Sun, Zijian Wang, Jiaming Liu, Qi Li, Fang Zhao, Shanghang Zhang, Caifeng Shan

Comments: Accepted to IEEE TNNLS

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[92] arXiv:2201.01603 [pdf, other]: Title: Deep Probabilistic Graph Matching

Authors: He Liu, Tao Wang, Yidong Li, Congyan Lang, Songhe Feng, Haibin Ling

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93] arXiv:2201.01609 [pdf, other]: Title: All You Need In Sign Language Production

Authors: Razieh Rastgoo, Kourosh Kiani, Sergio Escalera, Vassilis Athitsos, Mohammad Sabokrou

Comments: arXiv admin note: substantial text overlap with arXiv:2103.15910

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[94] arXiv:2201.01615 [pdf, other]: Title: Lawin Transformer: Improving Semantic Segmentation Transformer with Multi-Scale Representations via Large Window Attention

Authors: Haotian Yan, Chuang Zhang, Ming Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95] arXiv:2201.01636 [pdf, other]: Title: Tackling the Class Imbalance Problem of Deep Learning Based Head and Neck Organ Segmentation

Authors: Elias Tappeiner, Martin Welk, Rainer Schubert

Comments: 10 pages, 3 figures, 1 table, submitted to the International Journal of Computer Assisted Radiology and Surgery

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[96] arXiv:2201.01654 [pdf, other]: Title: TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets

Authors: Susie Xi Rao, Johannes Rausch, Peter Egger, Ce Zhang

Comments: accepted in the AAAI-22 Workshop on Scientific Document Understanding at the Thirty-Sixth AAAI Conference on Artificial Intelligence (SDU@AAAI-22)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[97] arXiv:2201.01661 [pdf, ps, other]: Title: Evaluation of Thermal Imaging on Embedded GPU Platforms for Application in Vehicular Assistance Systems

Authors: Muhammad Ali Farooq, Waseem Shariff, Peter Corcoran

Comments: 14 pages, 9 tables, and 27 figures

Journal-ref: Published in IEEE-TIV Journal in 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[98] arXiv:2201.01683 [pdf, other]: Title: Surface-Aligned Neural Radiance Fields for Controllable 3D Human Synthesis

Authors: Tianhan Xu, Yasuhiro Fujita, Eiichi Matsumoto

Comments: CVPR 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[99] arXiv:2201.01699 [pdf, ps, other]: Title: An Investigation of "Benford's" Law Divergence and Machine Learning Techniques for "Intra-Class" Separability of Fingerprint Images

Authors: Aamo Iorliam, Orgem Emmanuel, Yahaya I. Shehu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[100] arXiv:2201.01703 [pdf, other]: Title: Probing TryOnGAN

Authors: Saurabh Kumar, Nishant Sinha

Comments: 5 pages, to appear in the proceedings of the 9th ACM IKDD CODS and 27th COMAD (CODS-COMAD '22)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[101] arXiv:2201.01709 [pdf, other]: Title: The Effect of Model Compression on Fairness in Facial Expression Recognition

Authors: Samuil Stoychev, Hatice Gunes

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[102] arXiv:2201.01783 [pdf, ps, other]: Title: Automated Scoring of Graphical Open-Ended Responses Using Artificial Neural Networks

Authors: Matthias von Davier, Lillian Tyack, Lale Khorramdel

Comments: 23 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP)
[103] arXiv:2201.01823 [pdf, other]: Title: Learning Semantic Ambiguities for Zero-Shot Learning

Authors: Celina Hanouti, Hervé Le Borgne

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[104] arXiv:2201.01831 [pdf, other]: Title: POCO: Point Convolution for Surface Reconstruction

Authors: Alexandre Boulch, Renaud Marlet

Comments: Accepted at Conference on Computer Vision and Pattern Recognition (CVPR), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG); Machine Learning (cs.LG)
[105] arXiv:2201.01850 [pdf, other]: Title: On the Real-World Adversarial Robustness of Real-Time Semantic Segmentation Models for Autonomous Driving

Authors: Giulio Rossolini, Federico Nesti, Gianluca D'Amico, Saasha Nair, Alessandro Biondi, Giorgio Buttazzo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[106] arXiv:2201.01857 [pdf, other]: Title: Multi-Grid Redundant Bounding Box Annotation for Accurate Object Detection

Authors: Solomon Negussie Tesema, El-Bay Bourennane

Comments: To appear at "The 19th IEEE International Conference on Pervasive Intelligence and Computing (PICom 2021)", held 25-28 October 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[107] arXiv:2201.01858 [pdf, other]: Title: Towards realistic symmetry-based completion of previously unseen point clouds

Authors: Taras Rumezhak, Oles Dobosevych, Rostyslav Hryniv, Vladyslav Selotkin, Volodymyr Karpiv, Mykola Maksymenko

Journal-ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, October, 2021, 2542-2550

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[108] arXiv:2201.01883 [pdf, other]: Title: Memory-guided Image De-raining Using Time-Lapse Data

Authors: Jaehoon Cho, Seungryong Kim, Kwanghoon Sohn

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[109] arXiv:2201.01901 [pdf, other]: Title: Incremental Object Grounding Using Scene Graphs

Authors: John Seon Keun Yi, Yoonwoo Kim, Sonia Chernova

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[110] arXiv:2201.01928 [pdf, other]: Title: Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization

Authors: Hao Jiang, Calvin Murdock, Vamsi Krishna Ithapu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[111] arXiv:2201.01929 [pdf, other]: Title: Decompose to Adapt: Cross-domain Object Detection via Feature Disentanglement

Authors: Dongnan Liu, Chaoyi Zhang, Yang Song, Heng Huang, Chenyu Wang, Michael Barnett, Weidong Cai

Comments: Accepted to appear in IEEE Transactions on Multimedia; source code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[112] arXiv:2201.01953 [pdf, other]: Title: Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling

Authors: Yang Long, Gui-Song Xia, Liangpei Zhang, Gong Cheng, Deren Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113] arXiv:2201.01961 [pdf, other]: Title: Diversity-boosted Generalization-Specialization Balancing for Zero-shot Learning

Authors: Yun Li, Zhe Liu, Xiaojun Chang, Julian McAuley, Lina Yao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[114] arXiv:2201.01971 [pdf, other]: Title: Multi-Label Classification on Remote-Sensing Images

Authors: Aditya Kumar Singh, B. Uma Shankar

Comments: The report consists of 95 Pages, 45 Figures, 31 Tables, 85 References

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[115] arXiv:2201.01976 [pdf, other]: Title: SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object Detection

Authors: Chen Chen, Zhe Chen, Jing Zhang, Dacheng Tao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[116] arXiv:2201.01983 [pdf, other]: Title: Multi-Domain Joint Training for Person Re-Identification

Authors: Lu Yang, Lingqiao Liu, Yunlong Wang, Peng Wang, Yanning Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[117] arXiv:2201.01984 [pdf, other]: Title: Compact Bidirectional Transformer for Image Captioning

Authors: Yuanen Zhou, Zhenzhen Hu, Daqing Liu, Huixia Ben, Meng Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[118] arXiv:2201.02001 [pdf, other]: Title: TransVPR: Transformer-based place recognition with multi-level attention aggregation

Authors: Ruotong Wang, Yanqing Shen, Weiliang Zuo, Sanping Zhou, Nanning Zheng

Comments: CVPR 2022 oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[119] arXiv:2201.02010 [pdf, other]: Title: Self-Training Vision Language BERTs with a Unified Conditional Model

Authors: Xiaofeng Yang, Fengmao Lv, Fayao Liu, Guosheng Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[120] arXiv:2201.02011 [pdf, other]: Title: An unambiguous cloudiness index for nonwovens

Authors: Michael Godehardt, Ali Moghiseh, Christine Oetjen, Joachim Ohser, Katja Schladitz

Journal-ref: Journal of Mathematics in Industry, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[121] arXiv:2201.02017 [pdf, other]: Title: Enhancing Egocentric 3D Pose Estimation with Third Person Views

Authors: Ameya Dhamanaskar, Mariella Dimiccoli, Enric Corona, Albert Pumarola, Francesc Moreno-Noguer

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[122] arXiv:2201.02028 [pdf, other]: Title: A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

Authors: Maximilian Harl, Marvin Herchenbach, Sven Kruschel, Nico Hambauer, Patrick Zschech, Mathias Kraus

Comments: Preprint accepted for archival and presentation at the 17th International Conference on Wirtschaftsinformatik 2022. 14 pages, 5 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[123] arXiv:2201.02052 [pdf, other]: Title: A Unified Framework for Attention-Based Few-Shot Object Detection

Authors: Pierre Le Jeune, Anissa Mokraoui

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[124] arXiv:2201.02065 [pdf, other]: Title: ASL-Skeleton3D and ASL-Phono: Two Novel Datasets for the American Sign Language

Authors: Cleison Correia de Amorim, Cleber Zanchettin

Journal-ref: The paper is under consideration at Pattern Recognition Letters (2022) (under the manuscript number PRLETTERS-D-22-00140)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[125] arXiv:2201.02074 [pdf, other]: Title: EM-driven unsupervised learning for efficient motion segmentation

Authors: Etienne Meunier, Anaïs Badoual, Patrick Bouthemy

Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126] arXiv:2201.02093 [pdf, ps, other]: Title: Deep Learning Based Classification System For Recognizing Local Spinach

Authors: Mirajul Islam, Nushrat Jahan Ria, Jannatul Ferdous Ani, Abu Kaisar Mohammad Masum, Sheikh Abujar, Syed Akhter Hossain

Comments: 10 pages, 4 figures, supplemental materials. Accepted at the 2nd International Conference on Deep Learning, Artificial Intelligence and Robotics (ICDLAIR) 2020

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[127] arXiv:2201.02107 [pdf, other]: Title: HyperionSolarNet: Solar Panel Detection from Aerial Images

Authors: Poonam Parhar, Ryan Sawasaki, Alberto Todeschini, Colorado Reed, Hossein Vahabi, Nathan Nusaputra, Felipe Vergara

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128] arXiv:2201.02110 [pdf, other]: Title: Eye Know You Too: A DenseNet Architecture for End-to-end Eye Movement Biometrics

Authors: Dillon Lohr, Oleg V Komogortsev

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[129] arXiv:2201.02149 [pdf, other]: Title: Bio-inspired Min-Nets Improve the Performance and Robustness of Deep Networks

Authors: Philipp Grüning, Erhardt Barth

Journal-ref: Gruening, P., & Barth, E. (2021, October). Bio-inspired Min-Nets Improve the Performance and Robustness of Deep Networks. In SVRHM 2021 Workshop@ NeurIPS

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[130] arXiv:2201.02193 [pdf, other]: Title: Realistic Full-Body Anonymization with Surface-Guided GANs

Authors: Håkon Hukkelås, Morten Smebye, Rudolf Mester, Frank Lindseth

Comments: 8 pages, 7 figures, 6 tables. Source code and appendix available at: this https URL. Published at WACV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131] arXiv:2201.02233 [pdf, other]: Title: Consistent Style Transfer

Authors: Xuan Luo, Zhen Han, Lingkang Yang, Lingling Zhang

Comments: 10 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[132] arXiv:2201.02260 [pdf, other]: Title: CitySurfaces: City-Scale Semantic Segmentation of Sidewalk Materials

Authors: Maryam Hosseini, Fabio Miranda, Jianzhe Lin, Claudio Silva

Comments: Sustainable Cities and Society journal (accepted); Model: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[133] arXiv:2201.02263 [pdf, other]: Title: ITSA: An Information-Theoretic Approach to Automatic Shortcut Avoidance and Domain Generalization in Stereo Matching Networks

Authors: WeiQin Chuah, Ruwan Tennakoon, Reza Hoseinnezhad, Alireza Bab-Hadiashar, David Suter

Comments: 11 pages, 4 figures. Accepted by CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[134] arXiv:2201.02279 [pdf, other]: Title: De-rendering 3D Objects in the Wild

Authors: Felix Wimbauer, Shangzhe Wu, Christian Rupprecht

Journal-ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 18490-18499

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[135] arXiv:2201.02280 [pdf, other]: Title: Repurposing Existing Deep Networks for Caption and Aesthetic-Guided Image Cropping

Authors: Nora Horanyi, Kedi Xia, Kwang Moo Yi, Abhishake Kumar Bojja, Ales Leonardis, Hyung Jin Chang

Journal-ref: Pattern Recognition, 2022, 108485, ISSN 0031-3203

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[136] arXiv:2201.02302 [pdf, other]: Title: Extending One-Stage Detection with Open-World Proposals

Authors: Sachin Konan, Kevin J Liang, Li Yin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137] arXiv:2201.02304 [pdf, other]: Title: Budget-aware Few-shot Learning via Graph Convolutional Network

Authors: Shipeng Yan, Songyang Zhang, Xuming He

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[138] arXiv:2201.02365 [pdf, other]: Title: Motion Prediction via Joint Dependency Modeling in Phase Space

Authors: Pengxiang Su, Zhenguang Liu, Shuang Wu, Lei Zhu, Yifang Yin, Xuanjing Shen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[139] arXiv:2201.02366 [pdf, other]: Title: Uncertainty-Aware Cascaded Dilation Filtering for High-Efficiency Deraining

Authors: Qing Guo, Jingyang Sun, Felix Juefei-Xu, Lei Ma, Di Lin, Wei Feng, Song Wang

Comments: 14 pages, 10 figures, 10 tables. This is the extension of our conference version: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[140] arXiv:2201.02369 [pdf, other]: Title: Deep Generative Framework for Interactive 3D Terrain Authoring and Manipulation

Authors: Shanthika Naik, Aryamaan Jain, Avinash Sharma, KS Rajan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
[141] arXiv:2201.02396 [pdf, other]: Title: Detecting Human-to-Human-or-Object (H2O) Interactions with DIABOLO

Authors: Astrid Orcesi, Romaric Audigier, Fritz Poka Toukam, Bertrand Luvison

Comments: Accepted at the IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[142] arXiv:2201.02494 [pdf, other]: Title: Progressive Video Summarization via Multimodal Self-supervised Learning

Authors: Li Haopeng, Ke Qiuhong, Gong Mingming, Tom Drummond

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[143] arXiv:2201.02495 [pdf, other]: Title: Sign Language Video Retrieval with Free-Form Textual Queries

Authors: Amanda Duarte, Samuel Albanie, Xavier Giró-i-Nieto, Gül Varol

Comments: In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[144] arXiv:2201.02503 [pdf, ps, other]: Title: A Review of Deep Learning Techniques for Markerless Human Motion on Synthetic Datasets

Authors: Doan Duy Vo, Russell Butler

Comments: 11 pages, 5 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[145] arXiv:2201.02526 [pdf, other]: Title: Learning Target-aware Representation for Visual Tracking via Informative Interactions

Authors: Mingzhe Guo, Zhipeng Zhang, Heng Fan, Liping Jing, Yilin Lyu, Bing Li, Weiming Hu

Comments: 9 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[146] arXiv:2201.02533 [pdf, other]: Title: NeROIC: Neural Rendering of Objects from Online Image Collections

Authors: Zhengfei Kuang, Kyle Olszewski, Menglei Chai, Zeng Huang, Panos Achlioptas, Sergey Tulyakov

Comments: SIGGRAPH 2022 (Journal Track). Project page: this https URL. Code repository: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[147] arXiv:2201.02560 [pdf, other]: Title: A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items

Authors: Taimur Hassan, Samet Akcay, Mohammed Bennamoun, Salman Khan, Naoufel Werghi

Comments: IEEE Transactions on Systems, Man, and Cybernetics: Systems, Source code is available at this https URL

Journal-ref: IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148] arXiv:2201.02588 [pdf, other]: Title: FogAdapt: Self-Supervised Domain Adaptation for Semantic Segmentation of Foggy Images

Authors: Javed Iqbal, Rehan Hafiz, Mohsen Ali

Comments: Accepted at Elsevier Journal of Neurocomputing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[149] arXiv:2201.02593 [pdf, other]: Title: Equalized Focal Loss for Dense Long-Tailed Object Detection

Authors: Bo Li, Yongqiang Yao, Jingru Tan, Gang Zhang, Fengwei Yu, Jianwei Lu, Ye Luo

Comments: Accepted by the IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[150] arXiv:2201.02605 [pdf, other]: Title: Detecting Twenty-thousand Classes using Image-level Supervision

Authors: Xingyi Zhou, Rohit Girdhar, Armand Joulin, Philipp Krähenbühl, Ishan Misra

Comments: ECCV 2022 camera ready. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[151] arXiv:2201.02609 [pdf, other]: Title: Generalized Category Discovery

Authors: Sagar Vaze, Kai Han, Andrea Vedaldi, Andrew Zisserman

Comments: CVPR 22. Changes from pre-print highlighted in GitHub repo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[152] arXiv:2201.02639 [pdf, other]: Title: MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound

Authors: Rowan Zellers, Jiasen Lu, Ximing Lu, Youngjae Yu, Yanpeng Zhao, Mohammadreza Salehi, Aditya Kusupati, Jack Hessel, Ali Farhadi, Yejin Choi

Comments: CVPR 2022. Project page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[153] arXiv:2201.02698 [pdf, other]: Title: Development of Automatic Tree Counting Software from UAV Based Aerial Images With Machine Learning

Authors: Musa Ataş, Ayhan Talay

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[154] arXiv:2201.02714 [pdf, other]: Title: Pseudo-labelling and Meta Reweighting Learning for Image Aesthetic Quality Assessment

Authors: Xin Jin, Hao Lou, Huang Heng, Xiaodong Li, Shuai Cui, Xiaokun Zhang, Xiqiao Li

Comments: 10 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[155] arXiv:2201.02726 [pdf, ps, other]: Title: Real-time Rail Recognition Based on 3D Point Clouds

Authors: Xinyi Yu, Weiqi He, Xuecheng Qian, Yang Yang, Linlin Ou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[156] arXiv:2201.02767 [pdf, other]: Title: QuadTree Attention for Vision Transformers

Authors: Shitao Tang, Jiahui Zhang, Siyu Zhu, Ping Tan

Comments: ICLR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157] arXiv:2201.02772 [pdf, other]: Title: A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval

Authors: Zhixiong Zeng, Wenji Mao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[158] arXiv:2201.02779 [pdf, other]: Title: A Baseline Statistical Method For Robust User-Assisted Multiple Segmentation

Authors: Huseyin Afser

Comments: Submitted to IEEE Signal Processing Letters. A continuation of our work: H. Afşer, "Statistical Classification via Robust Hypothesis Testing: Non-Asymptotic and Simple Bounds," in IEEE Signal Processing Letters, vol. 28, pp. 2112-2116, 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Signal Processing (eess.SP)
[159] arXiv:2201.02784 [pdf, other]: Title: Relieving Long-tailed Instance Segmentation via Pairwise Class Balance

Authors: Yin-Yin He, Peizhen Zhang, Xiu-Shen Wei, Xiangyu Zhang, Jian Sun

Comments: Accepted to CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160] arXiv:2201.02798 [pdf, other]: Title: RARA: Zero-shot Sim2Real Visual Navigation with Following Foreground Cues

Authors: Klaas Kelchtermans, Tinne Tuytelaars

Comments: 7 pages, submitted to IROS, code: github.com/kkelchte/fgbg

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[161] arXiv:2201.02799 [pdf, other]: Title: Counteracting Dark Web Text-Based CAPTCHA with Generative Adversarial Learning for Proactive Cyber Threat Intelligence

Authors: Ning Zhang, Mohammadreza Ebrahimi, Weifeng Li, Hsinchun Chen

Comments: Accepted by ACM Transactions on Management Information Systems

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[162] arXiv:2201.02836 [pdf, ps, other]: Title: Self-aligned Spatial Feature Extraction Network for UAV Vehicle Re-identification

Authors: Aihuan Yao, Jiahao Qi, Ping Zhong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[163] arXiv:2201.02837 [pdf, other]: Title: Mushrooms Detection, Localization and 3D Pose Estimation using RGB-D Sensor for Robotic-picking Applications

Authors: Nathanael L. Baisa, Bashir Al-Diri

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[164] arXiv:2201.02848 [pdf, other]: Title: Learning Sample Importance for Cross-Scenario Video Temporal Grounding

Authors: Peijun Bao, Yadong Mu

Comments: 7 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[165] arXiv:2201.02849 [pdf, other]: Title: Spatio-Temporal Tuples Transformer for Skeleton-Based Action Recognition

Authors: Helei Qiu, Biao Hou, Bo Ren, Xiaohua Zhang

Comments: 14 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[166] arXiv:2201.02850 [pdf, other]: Title: Image-based Automatic Dial Meter Reading in Unconstrained Scenarios

Authors: Gabriel Salomon, Rayson Laroca, David Menotti

Journal-ref: Measurement, vol. 204, p. 112025, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[167] arXiv:2201.02853 [pdf, ps, other]: Title: Fake Hilsa Fish Detection Using Machine Vision

Authors: Mirajul Islam, Jannatul Ferdous Ani, Abdur Rahman, Zakia Zaman

Comments: 12 pages, 8 figures, International Joint Conference on Advances in Computational Intelligence (IJCACI 2020)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[168] arXiv:2201.02861 [pdf, other]: Title: Decoupling Makes Weakly Supervised Local Feature Better

Authors: Kunhong Li, Longguang Wang, Li Liu, Qing Ran, Kai Xu, Yulan Guo

Comments: CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[169] arXiv:2201.02885 [pdf, other]: Title: Agricultural Plant Cataloging and Establishment of a Data Framework from UAV-based Crop Images by Computer Vision

Authors: Maurice Günder, Facundo R. Ispizua Yamati, Jana Kierdorf, Ribana Roscher, Anne-Katrin Mahlein, Christian Bauckhage

Comments: Preprint submitted to GigaScience

Journal-ref: GigaScience, Volume 11, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[170] arXiv:2201.02946 [pdf, other]: Title: Resolving Camera Position for a Practical Application of Gaze Estimation on Edge Devices

Authors: Linh Van Ma, Tin Trung Tran, Moongu Jeon

Comments: 6 pages, 11 figures, conference paper

Journal-ref: ICAIIC 2022 (The 4th International Conference on Artificial Intelligence in Information and Communication February 21 (Mon.) ~ 24 (Thur.), 2022, Guam, USA & Virtual Conference)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[171] arXiv:2201.02963 [pdf, other]: Title: Box2Seg: Learning Semantics of 3D Point Clouds with Box-Level Supervision

Authors: Yan Liu, Qingyong Hu, Yinjie Lei, Kai Xu, Jonathan Li, Yulan Guo

Comments: 9 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[172] arXiv:2201.02980 [pdf, other]: Title: Invariance encoding in sliced-Wasserstein space for image classification with limited training data

Authors: Mohammad Shifat E Rabbi, Yan Zhuang, Shiying Li, Abu Hasnat Mohammad Rubaiyat, Xuwang Yin, Gustavo K. Rohde

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[173] arXiv:2201.02991 [pdf, other]: Title: A Survey on Face Recognition Systems

Authors: Jash Dalvi, Sanket Bafna, Devansh Bagaria, Shyamal Virnodkar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[174] arXiv:2201.03002 [pdf, other]: Title: MaskMTL: Attribute prediction in masked facial images with deep multitask learning

Authors: Prerana Mukherjee, Vinay Kaushik, Ronak Gupta, Ritika Jha, Daneshwari Kankanwadi, Brejesh Lall

Comments: In Proceedings of 9th International Conference on Pattern Recognition and Machine Intelligence (PReMI 2021), Kolkata, India

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175] arXiv:2201.03013 [pdf, ps, other]: Title: ThreshNet: An Efficient DenseNet Using Threshold Mechanism to Reduce Connections

Authors: Rui-Yang Ju, Ting-Yu Lin, Jia-Hao Jian, Jen-Shiun Chiang, Wei-Bin Yang

Comments: IEEE Access

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[176] arXiv:2201.03014 [pdf, other]: Title: Glance and Focus Networks for Dynamic Visual Recognition

Authors: Gao Huang, Yulin Wang, Kangchen Lv, Haojun Jiang, Wenhui Huang, Pengfei Qi, Shiji Song

Comments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI). Journal version of arXiv:2010.05300 (NeurIPS 2020). The first two authors contributed equally

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[177] arXiv:2201.03018 [pdf, other]: Title: Self-Supervised Feature Learning from Partial Point Clouds via Pose Disentanglement

Authors: Meng-Shiun Tsai, Pei-Ze Chiang, Yi-Hsuan Tsai, Wei-Chen Chiu

Comments: 10 pages, 4 figures and 6 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178] arXiv:2201.03043 [pdf, other]: Title: Semantics-driven Attentive Few-shot Learning over Clean and Noisy Samples

Authors: Orhun Buğra Baran, Ramazan Gökberk Cinbiş

Comments: 25 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[179] arXiv:2201.03045 [pdf, other]: Title: Applying Artificial Intelligence for Age Estimation in Digital Forensic Investigations

Authors: Thomas Grubl, Harjinder Singh Lallie

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[180] arXiv:2201.03080 [pdf, other]: Title: The State of Aerial Surveillance: A Survey

Authors: Kien Nguyen, Clinton Fookes, Sridha Sridharan, Yingli Tian, Feng Liu, Xiaoming Liu, Arun Ross

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[181] arXiv:2201.03101 [pdf, other]: Title: ImageSubject: A Large-scale Dataset for Subject Detection

Authors: Xin Miao, Jiayi Liu, Huayan Wang, Jun Fu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[182] arXiv:2201.03141 [pdf, other]: Title: Multi-Level Attention for Unsupervised Person Re-Identification

Authors: Yi Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183] arXiv:2201.03170 [pdf, other]: Title: Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials

Authors: Jinnavat Sanalohit, Tatpong Katanyukul

Comments: 19 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[184] arXiv:2201.03176 [pdf, other]: Title: Pedestrian Detection: Domain Generalization, CNNs, Transformers and Beyond

Authors: Irtiza Hasan, Shengcai Liao, Jinpeng Li, Saad Ullah Akram, Ling Shao

Comments: 13 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185] arXiv:2201.03178 [pdf, ps, other]: Title: Swin Transformer coupling CNNs Makes Strong Contextual Encoders for VHR Image Road Extraction

Authors: Tao Chen, Yiran Liu, Haoyu Jiang, Ruirui Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[186] arXiv:2201.03180 [pdf, other]: Title: Transfer Learning for Scene Text Recognition in Indian Languages

Authors: Sanjana Gunna, Rohit Saluja, C. V. Jawahar

Comments: 16 pages, 5 figures

Journal-ref: ICDAR 2021: Document Analysis and Recognition, ICDAR 2021 Workshops, pp 182-197

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[187] arXiv:2201.03185 [pdf, other]: Title: Towards Boosting the Accuracy of Non-Latin Scene Text Recognition

Authors: Sanjana Gunna, Rohit Saluja, C. V. Jawahar

Comments: 12 pages, 6 figures

Journal-ref: ICDAR 2021: Document Analysis and Recognition, ICDAR 2021 Workshops, pp 282-293

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[188] arXiv:2201.03194 [pdf, other]: Title: Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity Classification

Authors: Jingzhou Chen, Peng Wang, Jian Liu, Yuntao Qian

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[189] arXiv:2201.03212 [pdf, other]: Title: Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place Recognition

Authors: M. Usman Maqbool Bhutta, Yuxiang Sun, Darwin Lau, Ming Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[190] arXiv:2201.03243 [pdf, ps, other]: Title: Small Object Detection using Deep Learning

Authors: Aleena Ajaz, Ayesha Salar, Tauseef Jamal, Asif Ullah Khan

Comments: 21 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[191] arXiv:2201.03246 [pdf, other]: Title: Vision in adverse weather: Augmentation using CycleGANs with various object detectors for robust perception in autonomous racing

Authors: Izzeddin Teeti, Valentina Musat, Salman Khan, Alexander Rast, Fabio Cuzzolin, Andrew Bradley

Comments: ICML 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[192] arXiv:2201.03297 [pdf, other]: Title: GhostNets on Heterogeneous Devices via Cheap Operations

Authors: Kai Han, Yunhe Wang, Chang Xu, Jianyuan Guo, Chunjing Xu, Enhua Wu, Qi Tian

Comments: Accepted by IJCV 2022. Extension of GhostNet CVPR2020 paper (arXiv:1911.11907). arXiv admin note: substantial text overlap with arXiv:1911.11907

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[193] arXiv:2201.03299 [pdf, other]: Title: Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks

Authors: Claudio Filipi Gonçalves dos Santos, João Paulo Papa

Comments: 27 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[194] arXiv:2201.03323 [pdf, other]: Title: Gait Recognition Based on Deep Learning: A Survey

Authors: Claudio Filipi Gonçalves dos Santos, Diego de Souza Oliveira, Leandro A. Passos, Rafael Gonçalves Pires, Daniel Felipe Silva Santos, Lucas Pascotti Valem, Thierry P. Moreira, Marcos Cleison S. Santana, Mateus Roder, João Paulo Papa, Danilo Colombo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[195] arXiv:2201.03342 [pdf, other]: Title: COIN: Counterfactual Image Generation for VQA Interpretation

Authors: Zeyd Boukhers, Timo Hartmann, Jan Jürjens

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[196] arXiv:2201.03353 [pdf, other]: Title: GMFIM: A Generative Mask-guided Facial Image Manipulation Model for Privacy Preservation

Authors: Mohammad Hossein Khojaste, Nastaran Moradzadeh Farid, Ahmad Nickabadi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[197] arXiv:2201.03454 [pdf, other]: Title: 3D Face Morphing Attacks: Generation, Vulnerability and Detection

Authors: Jag Mohan Singh, Raghavendra Ramachandra

Comments: The paper is accepted at IEEE Transactions on Biometrics, Behavior and Identity Science

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[198] arXiv:2201.03545 [pdf, other]: Title: A ConvNet for the 2020s

Authors: Zhuang Liu, Hanzi Mao, Chao-Yuan Wu, Christoph Feichtenhofer, Trevor Darrell, Saining Xie

Comments: CVPR 2022; Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[199] arXiv:2201.03546 [pdf, other]: Title: Language-driven Semantic Segmentation

Authors: Boyi Li, Kilian Q. Weinberger, Serge Belongie, Vladlen Koltun, René Ranftl

Comments: ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[200] arXiv:2201.03556 [pdf, other]: Title: Reproducing BowNet: Learning Representations by Predicting Bags of Visual Words

Authors: Harry Nguyen, Stone Yun, Hisham Mohammad

Comments: This is a reproducibility project. The original work is by Gidaris et al., published in CVPR 2020. The PyTorch implementation is public on GitHub. v2 clarifies comments regarding communication with the original authors

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[201] arXiv:2201.03597 [pdf, other]: Title: Cross-Modality Sub-Image Retrieval using Contrastive Multimodal Image Representations

Authors: Eva Breznik, Elisabeth Wetzer, Joakim Lindblad, Nataša Sladoje

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[202] arXiv:2201.03639 [pdf, other]: Title: Multi-Query Video Retrieval

Authors: Zeyu Wang, Yu Wu, Karthik Narasimhan, Olga Russakovsky

Comments: ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[203] arXiv:2201.03674 [pdf, other]: Title: PrintsGAN: Synthetic Fingerprint Generator

Authors: Joshua J. Engelsma, Steven A. Grosz, Anil K. Jain

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[204] arXiv:2201.03686 [pdf, other]: Title: NFANet: A Novel Method for Weakly Supervised Water Extraction from High-Resolution Remote Sensing Imagery

Authors: Ming Lu, Leyuan Fang, Muxing Li, Bob Zhang, Yi Zhang, Pedram Ghamisi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[205] arXiv:2201.03746 [pdf, other]: Title: TSA-Net: Tube Self-Attention Network for Action Quality Assessment

Authors: Shunli Wang, Dingkang Yang, Peng Zhai, Chixiao Chen, Lihua Zhang

Comments: 9 pages, 7 figures, conference paper

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[206] arXiv:2201.03786 [pdf, other]: Title: Drone Object Detection Using RGB/IR Fusion

Authors: Lizhi Yang, Ruhang Ma, Avideh Zakhor

Comments: Accepted to Electronic Imaging Symposium, Computational Imaging XX Conference, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[207] arXiv:2201.03791 [pdf, other]: Title: Classification of Beer Bottles using Object Detection and Transfer Learning

Authors: Philipp Hohlfeld, Tobias Ostermeier, Dominik Brandl

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[208] arXiv:2201.03794 [pdf, other]: Title: Efficient Non-Local Contrastive Attention for Image Super-Resolution

Authors: Bin Xia, Yucheng Hang, Yapeng Tian, Wenming Yang, Qingmin Liao, Jie Zhou

Comments: Code is available at this https URL

Journal-ref: AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[209] arXiv:2201.03803 [pdf, other]: Title: Unsupervised Domain Adaptive Person Re-id with Local-enhance and Prototype Dictionary Learning

Authors: Haopeng Hou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[210] arXiv:2201.03808 [pdf, other]: Title: MobileFaceSwap: A Lightweight Framework for Video Face Swapping

Authors: Zhiliang Xu, Zhibin Hong, Changxing Ding, Zhen Zhu, Junyu Han, Jingtuo Liu, Errui Ding

Comments: AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[211] arXiv:2201.03859 [pdf, other]: Title: On Exploring Pose Estimation as an Auxiliary Learning Task for Visible-Infrared Person Re-identification

Authors: Yunqi Miao, Nianchang Huang, Xiao Ma, Qiang Zhang, Jungong Han

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[212] arXiv:2201.03891 [pdf, ps, other]: Title: A Saliency based Feature Fusion Model for EEG Emotion Estimation

Authors: Victor Delvigne, Antoine Facchini, Hazem Wannous, Thierry Dutoit, Laurence Ris, Jean-Philippe Vandeborre

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[213] arXiv:2201.03902 [pdf, other]: Title: Where Is My Mind (looking at)? Predicting Visual Attention from Brain Activity

Authors: Victor Delvigne, Noé Tits, Luca La Fisca, Nathan Hubens, Antoine Maiorca, Hazem Wannous, Thierry Dutoit, Jean-Philippe Vandeborre

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Signal Processing (eess.SP); Neurons and Cognition (q-bio.NC)
[214] arXiv:2201.03965 [pdf, other]: Title: On the Efficacy of Co-Attention Transformer Layers in Visual Question Answering

Authors: Ankur Sikarwar, Gabriel Kreiman

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[215] arXiv:2201.03993 [pdf, other]: Title: A Novel Home-Built Metrology to Analyze Oral Fluid Droplets and Quantify the Efficacy of Masks

Authors: Ava Tan Bhowmik

Comments: 9 pages, 12 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[216] arXiv:2201.04011 [pdf, other]: Title: Similarity-based Gray-box Adversarial Attack Against Deep Face Recognition

Authors: Hanrui Wang, Shuo Wang, Zhe Jin, Yandan Wang, Cunjian Chen, Massimo Tistarelli

Comments: Accepted at the IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[217] arXiv:2201.04019 [pdf, other]: Title: Pyramid Fusion Transformer for Semantic Segmentation

Authors: Zipeng Qin, Jianbo Liu, Xiaolin Zhang, Maoqing Tian, Aojun Zhou, Shuai Yi, Hongsheng Li

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[218] arXiv:2201.04021 [pdf, other]: Title: Optimization Planning for 3D ConvNets

Authors: Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Tao Mei

Comments: ICML 2021; Code is publicly available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[219] arXiv:2201.04022 [pdf, other]: Title: Condensing a Sequence to One Informative Frame for Video Recognition

Authors: Zhaofan Qiu, Ting Yao, Yan Shu, Chong-Wah Ngo, Tao Mei

Comments: ICCV 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[220] arXiv:2201.04023 [pdf, other]: Title: Boosting Video Representation Learning with Multi-Faceted Integration

Authors: Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Xiao-Ping Zhang, Dong Wu, Tao Mei

Comments: CVPR 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[221] arXiv:2201.04024 [pdf, other]: Title: Smart Director: An Event-Driven Directing System for Live Broadcasting

Authors: Yingwei Pan, Yue Chen, Qian Bao, Ning Zhang, Ting Yao, Jingen Liu, Tao Mei

Comments: ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[222] arXiv:2201.04026 [pdf, other]: Title: Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training

Authors: Yehao Li, Jiahao Fan, Yingwei Pan, Ting Yao, Weiyao Lin, Tao Mei

Comments: ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[223] arXiv:2201.04027 [pdf, other]: Title: Representing Videos as Discriminative Sub-graphs for Action Recognition

Authors: Dong Li, Zhaofan Qiu, Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei

Comments: CVPR 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[224] arXiv:2201.04029 [pdf, other]: Title: Motion-Focused Contrastive Learning of Video Representations

Authors: Rui Li, Yiheng Zhang, Zhaofan Qiu, Ting Yao, Dong Liu, Tao Mei

Comments: ICCV 2021 (Oral); Code is publicly available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[225] arXiv:2201.04039 [pdf, other]: Title: MobilePhys: Personalized Mobile Camera-Based Contactless Physiological Sensing

Authors: Xin Liu, Yuntao Wang, Sinan Xie, Xiaoyu Zhang, Zixian Ma, Daniel McDuff, Shwetak Patel

Comments: Published paper: this https URL

Journal-ref: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, Volume Issue 1, March 2022, Article No.: 24

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[226] arXiv:2201.04042 [pdf, other]: Title: Towards Lightweight Neural Animation: Exploration of Neural Network Pruning in Mixture of Experts-based Animation Models

Authors: Antoine Maiorca, Nathan Hubens, Sohaib Laraba, Thierry Dutoit

Comments: 8 pages, 4 figures, 2 tables, 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 1: GRAPP, ISBN 978-989-758-555-5, ISSN 2184-4321, pages 286-293

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[227] arXiv:2201.04063 [pdf, ps, other]: Title: Identification of chicken egg fertility using SVM classifier based on first-order statistical feature extraction

Authors: Shoffan Saifullah, Andiko Putro Suryotomo

Comments: 9 Pages, 5 Figures, 2 Tables

Journal-ref: ILKOM Jurnal Ilmiah, 13(3), (2021), 285-293

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[228] arXiv:2201.04114 [pdf, other]: Title: DM-VIO: Delayed Marginalization Visual-Inertial Odometry

Authors: Lukas von Stumberg, Daniel Cremers

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[229] arXiv:2201.04123 [pdf, other]: Title: gDNA: Towards Generative Detailed Neural Avatars

Authors: Xu Chen, Tianjian Jiang, Jie Song, Jinlong Yang, Michael J. Black, Andreas Geiger, Otmar Hilliges

Comments: Camera-ready for CVPR 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[230] arXiv:2201.04127 [pdf, other]: Title: HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video

Authors: Chung-Yi Weng, Brian Curless, Pratul P. Srinivasan, Jonathan T. Barron, Ira Kemelmacher-Shlizerman

Comments: CVPR 2022 (oral). Project page with videos: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[231] arXiv:2201.04212 [pdf, other]: Title: MDPose: Human Skeletal Motion Reconstruction Using WiFi Micro-Doppler Signatures

Authors: Chong Tang, Wenda Li, Shelly Vishwakarma, Fangzhan Shi, Simon Julier, Kevin Chetty

Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[232] arXiv:2201.04214 [pdf, other]: Title: Region-based Layout Analysis of Music Score Images

Authors: Francisco J. Castellanos, Carlos Garrido-Munoz, Antonio Ríos-Vila, Jorge Calvo-Zaragoza

Comments: Submitted to Expert Systems with Applications

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[233] arXiv:2201.04236 [pdf, other]: Title: Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidents

Authors: Ethan Weber, Dim P. Papadopoulos, Agata Lapedriza, Ferda Ofli, Muhammad Imran, Antonio Torralba

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[234] arXiv:2201.04279 [pdf, other]: Title: Dynamical Audio-Visual Navigation: Catching Unheard Moving Sound Sources in Unmapped 3D Environments

Authors: Abdelrahman Younes

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[235] arXiv:2201.04288 [pdf, other]: Title: Multiview Transformers for Video Recognition

Authors: Shen Yan, Xuehan Xiong, Anurag Arnab, Zhichao Lu, Mi Zhang, Chen Sun, Cordelia Schmid

Comments: CVPR 2022; arXiv v4: update results on Epic-Kitchens-100

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[236] arXiv:2201.04309 [pdf, other]: Title: Robust Contrastive Learning against Noisy Views

Authors: Ching-Yao Chuang, R Devon Hjelm, Xin Wang, Vibhav Vineet, Neel Joshi, Antonio Torralba, Stefanie Jegelka, Yale Song

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[237] arXiv:2201.04329 [pdf, other]: Title: Neural Residual Flow Fields for Efficient Video Representations

Authors: Daniel Rho, Junwoo Cho, Jong Hwan Ko, Eunbyung Park

Comments: Accepted for ACCV 2022, codes are available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[238] arXiv:2201.04341 [pdf, other]: Title: MDS-Net: A Multi-scale Depth Stratification Based Monocular 3D Object Detection Algorithm

Authors: Zhouzhen Xie, Yuying Song, Jingxuan Wu, Zecheng Li, Chunyi Song, Zhiwei Xu

Comments: 9 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[239] arXiv:2201.04358 [pdf, ps, other]: Title: Coarse-to-Fine Embedded PatchMatch and Multi-Scale Dynamic Aggregation for Reference-based Super-Resolution

Authors: Bin Xia, Yapeng Tian, Yucheng Hang, Wenming Yang, Qingmin Liao, Jie Zhou

Comments: Code is available at this https URL

Journal-ref: AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[240] arXiv:2201.04364 [pdf, other]: Title: SCSNet: An Efficient Paradigm for Learning Simultaneously Image Colorization and Super-Resolution

Authors: Jiangning Zhang, Chao Xu, Jian Li, Yue Han, Yabiao Wang, Ying Tai, Yong Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[241] arXiv:2201.04388 [pdf, other]: Title: OCSampler: Compressing Videos to One Clip with Single-step Sampling

Authors: Jintao Lin, Haodong Duan, Kai Chen, Dahua Lin, Limin Wang

Comments: Video Understanding, Efficient Action Recognition

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[242] arXiv:2201.04402 [pdf, other]: Title: MoViDNN: A Mobile Platform for Evaluating Video Quality Enhancement with Deep Neural Networks

Authors: Ekrem Çetinkaya, Minh Nguyen, Christian Timmerer

Comments: 8 pages, 3 figures

Journal-ref: MMM 2022: MultiMedia Modeling pp 465-472

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[243] arXiv:2201.04435 [pdf, other]: Title: Beyond the Visible: A Survey on Cross-spectral Face Recognition

Authors: David Anghelone, Cunjian Chen, Arun Ross, Antitza Dantcheva

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[244] arXiv:2201.04494 [pdf, other]: Title: SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point Clouds

Authors: Qingyong Hu, Bo Yang, Sheikh Khalid, Wen Xiao, Niki Trigoni, Andrew Markham

Comments: Accepted by IJCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[245] arXiv:2201.04532 [pdf, other]: Title: Structure and position-aware graph neural network for airway labeling

Authors: Weiyi Xie, Colin Jacobs, Jean-Paul Charbonnier, Bram van Ginneken

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[246] arXiv:2201.04620 [pdf, other]: Title: SparseDet: Improving Sparsely Annotated Object Detection with Pseudo-positive Mining

Authors: Saksham Suri, Sai Saketh Rambhatla, Rama Chellappa, Abhinav Shrivastava

Comments: Accepted at ICCV 2023. Project webpage: this https URL. The first two authors contributed equally

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[247] arXiv:2201.04623 [pdf, other]: Title: Virtual Elastic Objects

Authors: Hsiao-yu Chen, Edgar Tretschk, Tuur Stuyck, Petr Kadlecek, Ladislav Kavan, Etienne Vouga, Christoph Lassner

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[248] arXiv:2201.04676 [pdf, other]: Title: UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning

Authors: Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao

Comments: Published as a conference paper at ICLR 2022; 19 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[249] arXiv:2201.04684 [pdf, other]: Title: BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations

Authors: Daiqing Li, Huan Ling, Seung Wook Kim, Karsten Kreis, Adela Barriuso, Sanja Fidler, Antonio Torralba

Comments: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[250] arXiv:2201.04706 [pdf, other]: Title: Semantic Labeling of Human Action For Visually Impaired And Blind People Scene Interaction

Authors: Leyla Benhamida, Slimane Larabi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[251] arXiv:2201.04755 [pdf, ps, other]: Title: Spatial-Temporal Map Vehicle Trajectory Detection Using Dynamic Mode Decomposition and Res-UNet+ Neural Networks

Authors: Tianya T. Zhang, Peter J. Jin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[252] arXiv:2201.04756 [pdf, ps, other]: Title: Roadside Lidar Vehicle Detection and Tracking Using Range And Intensity Background Subtraction

Authors: Tianya Zhang, Peter J. Jin

Journal-ref: Journal of Advanced Transportation, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[253] arXiv:2201.04766 [pdf, other]: Title: Collision Detection: An Improved Deep Learning Approach Using SENet and ResNext

Authors: Aloukik Aditya, Liudu Zhou, Hrishika Vachhani, Dhivya Chandrasekaran, Vijay Mago

Comments: 8 pages, 5 figures, submitted to IEEE-SMC 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[254] arXiv:2201.04771 [pdf, other]: Title: Unlocking large-scale crop field delineation in smallholder farming systems with transfer learning and weak supervision

Authors: Sherrie Wang, Francois Waldner, David B. Lobell

Comments: Under submission

Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[255] arXiv:2201.04777 [pdf, other]: Title: A Survey on Masked Facial Detection Methods and Datasets for Fighting Against COVID-19

Authors: Bingshu Wang, Jiangbin Zheng, C.L. Philip Chen

Comments: 21 pages, 9 figures, 5 tables. IEEE Transactions on Artificial Intelligence, 2021, early access

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[256] arXiv:2201.04788 [pdf, other]: Title: Trusted Media Challenge Dataset and User Study

Authors: Weiling Chen, Sheng Lun Benjamin Chua, Stefan Winkler, See-Kiong Ng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[257] arXiv:2201.04796 [pdf, other]: Title: CFNet: Learning Correlation Functions for One-Stage Panoptic Segmentation

Authors: Yifeng Chen, Wenqing Chu, Fangfang Wang, Ying Tai, Ran Yi, Zhenye Gan, Liang Yao, Chengjie Wang, Xi Li

Comments: Tech report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[258] arXiv:2201.04797 [pdf, other]: Title: Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching

Authors: Yunpeng Shi, Shaohan Li, Tyler Maunu, Gilad Lerman

Comments: accepted to International Conference on 3D Vision (3DV) 2021, Oral Presentation

Journal-ref: Proceedings of the 2021 International Conference on 3D Vision (3DV), 2021, pp. 352-360

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[259] arXiv:2201.04806 [pdf, other]: Title: RealGait: Gait Recognition for Person Re-Identification

Authors: Shaoxiong Zhang, Yunhong Wang, Tianrui Chai, Annan Li, Anil K. Jain

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[260] arXiv:2201.04809 [pdf, other]: Title: Conditional Variational Autoencoder with Balanced Pre-training for Generative Adversarial Networks

Authors: Yuchong Yao, Xiaohui Wang, Yuanbang Ma, Han Fang, Jiaying Wei, Liyuan Chen, Ali Anaissi, Ali Braytee

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[261] arXiv:2201.04819 [pdf, other]: Title: Deep Rank-Consistent Pyramid Model for Enhanced Crowd Counting

Authors: Jiaqi Gao, Zhizhong Huang, Yiming Lei, Hongming Shan, James Z. Wang, Fei-Yue Wang, Junping Zhang

Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[262] arXiv:2201.04833 [pdf, other]: Title: SnapshotNet: Self-supervised Feature Learning for Point Cloud Data Segmentation Using Minimal Labeled Data

Authors: Xingye Li, Ling Zhang, Zhigang Zhu

Journal-ref: Computer Vision and Image Understanding, Volume 216, 2022, 103339, ISSN 1077-3142

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[263] arXiv:2201.04850 [pdf, other]: Title: Bridging Video-text Retrieval with Multiple Choice Questions

Authors: Yuying Ge, Yixiao Ge, Xihui Liu, Dian Li, Ying Shan, Xiaohu Qie, Ping Luo

Comments: Accepted by CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[264] arXiv:2201.04851 [pdf, other]: Title: MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning

Authors: Yuying Ge, Yibing Song, Ruimao Zhang, Ping Luo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[265] arXiv:2201.04866 [pdf, other]: Title: Weakly Supervised Scene Text Detection using Deep Reinforcement Learning

Authors: Emanuel Metzenthin, Christian Bartz, Christoph Meinel

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[266] arXiv:2201.04873 [pdf, other]: Title: VoLux-GAN: A Generative Model for 3D Face Synthesis with HDRI Relighting

Authors: Feitong Tan, Sean Fanello, Abhimitra Meka, Sergio Orts-Escolano, Danhang Tang, Rohit Pandey, Jonathan Taylor, Ping Tan, Yinda Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[267] arXiv:2201.04898 [pdf, other]: Title: Flexible Style Image Super-Resolution using Conditional Objective

Authors: Seung Ho Park, Young Su Moon, Nam Ik Cho

Comments: To be published in IEEE Access. Code and trained models will be available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[268] arXiv:2201.04906 [pdf, other]: Title: Hand-Object Interaction Reasoning

Authors: Jian Ma, Dima Damen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[269] arXiv:2201.04924 [pdf, other]: Title: Technical Report for ICCV 2021 Challenge SSLAD-Track3B: Transformers Are Better Continual Learners

Authors: Duo Li, Guimei Cao, Yunlu Xu, Zhanzhan Cheng, Yi Niu

Comments: Ranked 1st in ICCV 2021 SSLAD Track 3B

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[270] arXiv:2201.04945 [pdf, other]: Title: Learning Semantic Abstraction of Shape via 3D Region of Interest

Authors: Haiyue Fang, Xiaogang Wang, Zheyuan Cai, Yahao Shi, Xun Sun, Shilin Wu, Bin Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[271] arXiv:2201.05007 [pdf, ps, other]: Title: Multi-granularity Association Learning Framework for on-the-fly Fine-Grained Sketch-based Image Retrieval

Authors: Dawei Dai, Xiaoyu Tang, Shuyin Xia, Yingge Liu, Guoyin Wang, Zizhong Chen

Comments: 17 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[272] arXiv:2201.05020 [pdf, other]: Title: Automatic Sparse Connectivity Learning for Neural Networks

Authors: Zhimin Tang, Linkai Luo, Bike Xie, Yiyu Zhu, Rujie Zhao, Lvqing Bi, Chao Lu

Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[273] arXiv:2201.05022 [pdf, other]: Title: Self-semantic contour adaptation for cross modality brain tumor segmentation

Authors: Xiaofeng Liu, Fangxu Xing, Georges El Fakhri, Jonghye Woo

Comments: Accepted to ISBI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[274] arXiv:2201.05023 [pdf, other]: Title: Stereo Magnification with Multi-Layer Images

Authors: Taras Khakhulin, Denis Korzhenkov, Pavel Solovev, Gleb Sterkin, Timotei Ardelean, Victor Lempitsky

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[275] arXiv:2201.05047 [pdf, other]: Title: TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers

Authors: Qianyu Zhou, Xiangtai Li, Lu He, Yibo Yang, Guangliang Cheng, Yunhai Tong, Lizhuang Ma, Dacheng Tao

Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), extended version of arXiv:2105.10920

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[276] arXiv:2201.05057 [pdf, other]: Title: On Adversarial Robustness of Trajectory Prediction for Autonomous Vehicles

Authors: Qingzhao Zhang, Shengtuo Hu, Jiachen Sun, Qi Alfred Chen, Z. Morley Mao

Comments: 13 pages, 13 figures, accepted by CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[277] arXiv:2201.05078 [pdf, other]: Title: CLIP-Event: Connecting Text and Images with Event Structures

Authors: Manling Li, Ruochen Xu, Shuohang Wang, Luowei Zhou, Xudong Lin, Chenguang Zhu, Michael Zeng, Heng Ji, Shih-Fu Chang

Journal-ref: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[278] arXiv:2201.05119 [pdf, other]: Title: Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet?

Authors: Nenad Tomasev, Ioana Bica, Brian McWilliams, Lars Buesing, Razvan Pascanu, Charles Blundell, Jovana Mitrovic

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[279] arXiv:2201.05120 [pdf, other]: Title: SeamlessGAN: Self-Supervised Synthesis of Tileable Texture Maps

Authors: Carlos Rodriguez-Pardo, Elena Garces

Comments: 12 pages. To be published in IEEE Transactions on Visualization and Computer Graphics. Project website: this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[280] arXiv:2201.05121 [pdf, other]: Title: STEdge: Self-training Edge Detection with Multi-layer Teaching and Regularization

Authors: Yunfan Ye, Renjiao Yi, Zhiping Cai, Kai Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[281] arXiv:2201.05131 [pdf, other]: Title: SimReg: Regression as a Simple Yet Effective Tool for Self-supervised Knowledge Distillation

Authors: K L Navaneet, Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Hamed Pirsiavash

Comments: In BMVC 2021. Code available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[282] arXiv:2201.05151 [pdf, other]: Title: Beyond Simple Meta-Learning: Multi-Purpose Models for Multi-Domain, Active and Continual Few-Shot Learning

Authors: Peyman Bateni, Jarred Barber, Raghav Goyal, Vaden Masrani, Jan-Willem van de Meent, Leonid Sigal, Frank Wood

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[283] arXiv:2201.05275 [pdf, ps, other]: Title: Deep Learning-Based Ultra-Fast Stair Detection

Authors: Chen Wang, Zhongcai Pei, Shuang Qiu, Zhiyong Tang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[284] arXiv:2201.05277 [pdf, other]: Title: Boundary-aware Self-supervised Learning for Video Scene Segmentation

Authors: Jonghwan Mun, Minchul Shin, Gunsoo Han, Sangho Lee, Seongsu Ha, Joonseok Lee, Eun-Sol Kim

Comments: The code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[285] arXiv:2201.05290 [pdf, other]: Title: Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals

Authors: Lijun Yu, Yijun Qian, Wenhe Liu, Alexander G. Hauptmann

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[286] arXiv:2201.05297 [pdf, other]: Title: MMNet: Muscle motion-guided network for micro-expression recognition

Authors: Hanting Li, Mingzhe Sui, Zhaoqing Zhu, Feng Zhao

Comments: 8 pages, 4 figures

Journal-ref: Proc. 31st Int'l Joint Conf. Artificial Intelligence (IJCAI), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[287] arXiv:2201.05299 [pdf, other]: Title: A Thousand Words Are Worth More Than a Picture: Natural Language-Centric Outside-Knowledge Visual Question Answering

Authors: Feng Gao, Qing Ping, Govind Thattai, Aishwarya Reganti, Ying Nian Wu, Prem Natarajan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[288] arXiv:2201.05307 [pdf, other]: Title: Unsupervised Temporal Video Grounding with Deep Semantic Clustering

Authors: Daizong Liu, Xiaoye Qu, Yinzhen Wang, Xing Di, Kai Zou, Yu Cheng, Zichuan Xu, Pan Zhou

Comments: Accepted by AAAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[289] arXiv:2201.05314 [pdf, other]: Title: A Novel Skeleton-Based Human Activity Discovery Using Particle Swarm Optimization with Gaussian Mutation

Authors: Parham Hadikhani, Daphne Teck Ching Lai, Wee-Hong Ong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO)
[290] arXiv:2201.05346 [pdf, ps, other]: Title: Arbitrary Handwriting Image Style Transfer

Authors: Kai Yang, Xiaoman Liang, Huihuang Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[291] arXiv:2201.05386 [pdf, other]: Title: SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions

Authors: Ali Samadzadeh, Ahmad Nickabadi

Comments: 11 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[292] arXiv:2201.05479 [pdf, other]: Title: HardBoost: Boosting Zero-Shot Learning with Hard Classes

Authors: Bo Liu, Lihua Hu, Zhanyi Hu, Qiulei Dong

Comments: 15 pages, 8 figures, submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence on Sep. 16, 2021. This work is an extended version of our CVPR 2021 work, "Hardness sampling for self-training based transductive zero-shot learning" (arXiv:2106.00264)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[293] arXiv:2201.05489 [pdf, other]: Title: Emergence of Machine Language: Towards Symbolic Intelligence with Neural Networks

Authors: Yuqi Wang, Xu-Yao Zhang, Cheng-Lin Liu, Zhaoxiang Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[294] arXiv:2201.05514 [pdf, other]: Title: Determination of building flood risk maps from LiDAR mobile mapping data

Authors: Yu Feng, Qing Xiao, Claus Brenner, Aaron Peche, Juntao Yang, Udo Feuerhake, Monika Sester

Journal-ref: Computers, Environment and Urban Systems, Vol. 93, April 2022, 101759

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[295] arXiv:2201.05541 [pdf, other]: Title: ViT2Hash: Unsupervised Information-Preserving Hashing

Authors: Qinkang Gong, Liangdao Wang, Hanjiang Lai, Yan Pan, Jian Yin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[296] arXiv:2201.05545 [pdf, ps, other]: Title: Multimodal registration of FISH and nanoSIMS images using convolutional neural network models

Authors: Xiaojia He, Christof Meile, Suchendra M. Bhandarkar

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[297] arXiv:2201.05585 [pdf, other]: Title: Domain Adaptation in LiDAR Semantic Segmentation via Alternating Skip Connections and Hybrid Learning

Authors: Eduardo R. Corral-Soto, Mrigank Rochan, Yannis Y. He, Shubhra Aich, Yang Liu, Liu Bingbing

Comments: 1) Introduced Fig 1, 2) Simplified Fig. 2 diagram, 3) Fixed typos in losses, 4) Introduced Fig. 3, 5) Updated evaluation results, included evaluation on SemanticPOSS, 6) Introduced Table 3 - effects on covariance matrix and mean, 7) Updated Fig. 5, 8) Added more references. Improved writing in general, especially the motivation and description of each element and contribution from the method

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[298] arXiv:2201.05675 [pdf, other]: Title: Transformers in Action: Weakly Supervised Action Segmentation

Authors: John Ridley, Huseyin Coskun, David Joseph Tan, Nassir Navab, Federico Tombari

Comments: Under Review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[299] arXiv:2201.05706 [pdf, other]: Title: Perspective Transformation Layer

Authors: Nishan Khatri, Agnibh Dasgupta, Yucong Shen, Xin Zhong, Frank Y. Shih

Comments: This paper has been accepted for publication by the 2022 International Conference on Computational Science & Computational Intelligence (CSCI'22), Research Track on Signal & Image Processing, Computer Vision & Pattern Recognition

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[300] arXiv:2201.05718 [pdf, other]: Title: Parameter-free Online Test-time Adaptation

Authors: Malik Boudiaf, Romain Mueller, Ismail Ben Ayed, Luca Bertinetto

Comments: CVPR 2022 (oral). Code available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[301] arXiv:2201.05723 [pdf, other]: Title: Learning Temporally and Semantically Consistent Unpaired Video-to-video Translation Through Pseudo-Supervision From Synthetic Optical Flow

Authors: Kaihong Wang, Kumar Akash, Teruhisa Misu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[302] arXiv:2201.05729 [pdf, other]: Title: CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks

Authors: Zhecan Wang, Noel Codella, Yen-Chun Chen, Luowei Zhou, Jianwei Yang, Xiyang Dai, Bin Xiao, Haoxuan You, Shih-Fu Chang, Lu Yuan

Comments: This paper is greatly modified and updated to be re-submitted to another conference. The new paper is under the name "Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks", this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[303] arXiv:2201.05730 [pdf, other]: Title: Learning Hierarchical Graph Representation for Image Manipulation Detection

Authors: Wenyan Pan, Zhili Zhou, Miaogen Ling, Xin Geng, Q. M. Jonathan Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[304] arXiv:2201.05739 [pdf, other]: Title: Real-World Graph Convolution Networks (RW-GCNs) for Action Recognition in Smart Video Surveillance

Authors: Justin Sanchez, Christopher Neff, Hamed Tabkhi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[305] arXiv:2201.05761 [pdf, other]: Title: A Survey on RGB-D Datasets

Authors: Alexandre Lopes, Roberto Souza, Helio Pedrini

Comments: This paper was published at Computer Vision and Image Understanding. Access the final paper using the DOI: this https URL

Journal-ref: Computer Vision and Image Understanding 222 (2022) 103489

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[306] arXiv:2201.05772 [pdf, other]: Title: Asymmetric Hash Code Learning for Remote Sensing Image Retrieval

Authors: Weiwei Song, Zhi Gao, Renwei Dian, Pedram Ghamisi, Yongjun Zhang, Jón Atli Benediktsson

Comments: 14 pages, 12 figures, and 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[307] arXiv:2201.05775 [pdf, other]: Title: Explainability Tools Enabling Deep Learning in Future In-Situ Real-Time Planetary Explorations

Authors: Daniel Lundstrom, Alexander Huyen, Arya Mevada, Kyongsik Yun, Thomas Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[308] arXiv:2201.05776 [pdf, other]: Title: Uncertainty-Aware Multi-View Representation Learning

Authors: Yu Geng, Zongbo Han, Changqing Zhang, Qinghua Hu

Comments: AAAI 2021 published paper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[309] arXiv:2201.05778 [pdf, other]: Title: Semantic decoupled representation learning for remote sensing image change detection

Authors: Hao Chen, Yifan Zao, Liqin Liu, Song Chen, Zhenwei Shi

Comments: Submitted to IEEE for possible publication. 4 pages, 2 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[310] arXiv:2201.05781 [pdf, other]: Title: OneDConv: Generalized Convolution For Transform-Invariant Representation

Authors: Tong Zhang, Haohan Weng, Ke Yi, C. L. Philip Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[311] arXiv:2201.05816 [pdf, other]: Title: A Critical Analysis of Image-based Camera Pose Estimation Techniques

Authors: Meng Xu, Youchen Wang, Bin Xu, Jun Zhang, Jian Ren, Stefan Poslad, Pengfei Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[312] arXiv:2201.05820 [pdf, other]: Title: Offline-Online Associated Camera-Aware Proxies for Unsupervised Person Re-identification

Authors: Menglin Wang, Jiachen Li, Baisheng Lai, Xiaojin Gong, Xian-Sheng Hua

Comments: Accepted to TIP

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[313] arXiv:2201.05829 [pdf, other]: Title: Multi-View representation learning in Multi-Task Scene

Authors: Run-kun Lu, Jian-wei Liu, Si-ming Lian, Xin Zuo

Comments: 32 pages

Journal-ref: Neural Computing and Applications (2020)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[314] arXiv:2201.05834 [pdf, other]: Title: Tailor Versatile Multi-modal Learning for Multi-label Emotion Recognition

Authors: Yi Zhang, Mingyuan Chen, Jundong Shen, Chongjun Wang

Comments: To be published in AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[315] arXiv:2201.05858 [pdf, other]: Title: Smart Parking Space Detection under Hazy conditions using Convolutional Neural Networks: A Novel Approach

Authors: Gaurav Satyanath, Jajati Keshari Sahoo, Rajendra Kumar Roul

Comments: 20 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[316] arXiv:2201.05869 [src]: Title: Prototype Guided Network for Anomaly Segmentation

Authors: Yiqing Hao, Yi Jin, Gaoyun An

Comments: Needs editing, and the method needs improvement for better performance

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[317] arXiv:2201.05887 [pdf, other]: Title: Domain Adaptation via Bidirectional Cross-Attention Transformer

Authors: Xiyu Wang, Pengxin Guo, Yu Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[318] arXiv:2201.05914 [pdf, other]: Title: Towards Zero-shot Sign Language Recognition

Authors: Yunus Can Bilge, Ramazan Gokberk Cinbis, Nazli Ikizler-Cinbis

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319] arXiv:2201.05916 [pdf, other]: Title: Multi-level Second-order Few-shot Learning

Authors: Hongguang Zhang, Hongdong Li, Piotr Koniusz

Comments: IEEE Transactions on Multimedia

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[320] arXiv:2201.05951 [pdf, ps, other]: Title: Global Regular Network for Writer Identification

Authors: Shiyu Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[321] arXiv:2201.05958 [pdf, ps, other]: Title: Cross-Centroid Ripple Pattern for Facial Expression Recognition

Authors: Monu Verma, Prafulla Saxena, Santosh Kumar Vipparthi, Girdhari Singh

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[322] arXiv:2201.05972 [pdf, other]: Title: Sparse Cross-scale Attention Network for Efficient LiDAR Panoptic Segmentation

Authors: Shuangjie Xu, Rui Wan, Maosheng Ye, Xiaoyi Zou, Tongyi Cao

Comments: Accepted by the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[323] arXiv:2201.05986 [pdf, other]: Title: Audio-Driven Talking Face Video Generation with Dynamic Convolution Kernels

Authors: Zipeng Ye, Mengfei Xia, Ran Yi, Juyong Zhang, Yu-Kun Lai, Xuwei Huang, Guoxin Zhang, Yong-jin Liu

Comments: in IEEE Transactions on Multimedia

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[324] arXiv:2201.05989 [pdf, other]: Title: Instant Neural Graphics Primitives with a Multiresolution Hash Encoding

Authors: Thomas Müller, Alex Evans, Christoph Schied, Alexander Keller

Comments: To appear in ACM Transactions on Graphics (SIGGRAPH 2022). 15 pages, 13 figures, 3 tables

Journal-ref: ACM Trans. Graph. 41, 4, Article 102 (July 2022), 15 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[325] arXiv:2201.05991 [pdf, other]: Title: Video Transformers: A Survey

Authors: Javier Selva, Anders S. Johansen, Sergio Escalera, Kamal Nasrollahi, Thomas B. Moeslund, Albert Clapés

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[326] arXiv:2201.06030 [pdf, ps, other]: Title: Fully Convolutional Change Detection Framework with Generative Adversarial Network for Unsupervised, Weakly Supervised and Regional Supervised Change Detection

Authors: Chen Wu, Bo Du, Liangpei Zhang

Comments: 13 pages, 19 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[327] arXiv:2201.06037 [pdf, other]: Title: Pursuing 3D Scene Structures with Optical Satellite Images from Affine Reconstruction to Euclidean Reconstruction

Authors: Pinhe Wang, Limin Shi, Bao Chen, Zhanyi Hu, Qiulei Dong, Jianzhong Qiao

Comments: 11 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[328] arXiv:2201.06061 [pdf, other]: Title: PETS-SWINF: A regression method that considers images with metadata based Neural Network for pawpularity prediction on 2021 Kaggle Competition "PetFinder.my"

Authors: Yizheng Wang, Yinghua Liu

Comments: 8 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[329] arXiv:2201.06070 [pdf, other]: Title: ALA: Naturalness-aware Adversarial Lightness Attack

Authors: Yihao Huang, Liangru Sun, Qing Guo, Felix Juefei-Xu, Jiayi Zhu, Jincao Feng, Yang Liu, Geguang Pu

Comments: 9 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[330] arXiv:2201.06098 [pdf, other]: Title: An Edge Map based Ensemble Solution to Detect Water Level in Stream

Authors: Pratool Bharti, Priyanjani Chandra, Michael E. Papka, David Koop

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[331] arXiv:2201.06159 [pdf, other]: Title: YOLO -- You only look 10647 times

Authors: Christian Limberg, Andrew Melnik, Augustin Harter, Helge Ritter

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[332] arXiv:2201.06164 [pdf, other]: Title: Synthesis and Reconstruction of Fingerprints using Generative Adversarial Networks

Authors: Rafael Bouzaglo, Yosi Keller

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[333] arXiv:2201.06174 [pdf, other]: Title: A novel attention model for salient structure detection in seismic volumes

Authors: Muhammad Amir Shafiq, Zhiling Long, Haibin Di, Ghassan AlRegib

Comments: Published in Applied Computing and Intelligence, Nov. 2021

Journal-ref: Applied Computing and Intelligence, vol. 1, no. 1, pp. 31-45, Nov. 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[334] arXiv:2201.06176 [pdf, ps, other]: Title: A fast and accurate iris segmentation method using an LoG filter and its zero-crossings

Authors: Tariq M. Khan, Donald G. Bailey, Yinan Kong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[335] arXiv:2201.06192 [pdf, other]: Title: Fooling the Eyes of Autonomous Vehicles: Robust Physical Adversarial Examples Against Traffic Sign Recognition Systems

Authors: Wei Jia, Zhaojun Lu, Haichun Zhang, Zhenglin Liu, Jie Wang, Gang Qu

Comments: 17 pages, 15 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[336] arXiv:2201.06207 [pdf, other]: Title: Discourse Analysis for Evaluating Coherence in Video Paragraph Captions

Authors: Arjun R Akula, Song-Chun Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[337] arXiv:2201.06220 [pdf, ps, other]: Title: Face Detection in Extreme Conditions: A Machine-learning Approach

Authors: Sameer Aqib Hashmi

Comments: 6 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[338] arXiv:2201.06260 [pdf, other]: Title: Towards Realistic Visual Dubbing with Heterogeneous Sources

Authors: Tianyi Xie, Liucheng Liao, Cheng Bi, Benlai Tang, Xiang Yin, Jianfei Yang, Mingjie Wang, Jiali Yao, Yang Zhang, Zejun Ma

Comments: 9 pages (including references), 7 figures, Accepted in ACM Multimedia, 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[339] arXiv:2201.06289 [pdf, other]: Title: The CLEAR Benchmark: Continual LEArning on Real-World Imagery

Authors: Zhiqiu Lin, Jia Shi, Deepak Pathak, Deva Ramanan

Comments: Project site: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[340] arXiv:2201.06304 [pdf, other]: Title: Action Keypoint Network for Efficient Video Recognition

Authors: Xu Chen, Yahong Han, Xiaohan Wang, Yifan Sun, Yi Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[341] arXiv:2201.06311 [pdf, other]: Title: Graph Neural Networks for Cross-Camera Data Association

Authors: Elena Luna, Juan C. SanMiguel, José M. Martínez, Pablo Carballeira

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[342] arXiv:2201.06346 [pdf, other]: Title: Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?

Authors: Hwanil Choi, Wonjoon Chang, Jaesik Choi

Comments: Accepted at IJCAI-2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[343] arXiv:2201.06357 [pdf, other]: Title: Disentangled Latent Transformer for Interpretable Monocular Height Estimation

Authors: Zhitong Xiong, Sining Chen, Yilei Shi, Xiao Xiang Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[344] arXiv:2201.06374 [pdf, other]: Title: RestoreFormer: High-Quality Blind Face Restoration from Undegraded Key-Value Pairs

Authors: Zhouxia Wang, Jiawei Zhang, Runjian Chen, Wenping Wang, Ping Luo

Comments: Accepted by CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[345] arXiv:2201.06376 [pdf, other]: Title: UWC: Unit-wise Calibration Towards Rapid Network Compression

Authors: Chen Lin, Zheyang Li, Bo Peng, Haoji Hu, Wenming Tan, Ye Ren, Shiliang Pu

Comments: Accepted by BMVC 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[346] arXiv:2201.06390 [pdf, other]: Title: SwinUNet3D -- A Hierarchical Architecture for Deep Traffic Prediction using Shifted Window Transformers

Authors: Alabi Bojesomo, Hasan Al Marzouqi, Panos Liatsis

Comments: 7 pages, 1 figure

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[347] arXiv:2201.06415 [pdf, other]: Title: Improving Performance of Semantic Segmentation CycleGANs by Noise Injection into the Latent Segmentation Space

Authors: Jonas Löhdefink, Tim Fingscheidt

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[348] arXiv:2201.06427 [pdf, other]: Title: Masked Faces with Faced Masks

Authors: Jiayi Zhu, Qing Guo, Felix Juefei-Xu, Yihao Huang, Yang Liu, Geguang Pu

Comments: 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[349] arXiv:2201.06435 [pdf, other]: Title: FourierNet: Shape-Preserving Network for Henle's Fiber Layer Segmentation in Optical Coherence Tomography Images

Authors: Selahattin Cansiz, Cem Kesim, Sevval Nur Bektas, Zeynep Kulali, Murat Hasanreisoglu, Cigdem Gunduz-Demir

Journal-ref: IEEE Journal of Biomedical and Health Informatics, vol. 27, no. 2, pp. 1036-1047, Feb. 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[350] arXiv:2201.06459 [pdf, other]: Title: A Novel Framework to Jointly Compress and Index Remote Sensing Images for Efficient Content-Based Retrieval

Authors: Gencer Sumbul, Jun Xiang, Nimisha Thekke Madam, Begüm Demir

Comments: Accepted at IEEE International Geoscience and Remote Sensing Symposium (IGARSS) 2022. Our code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[351] arXiv:2201.06493 [pdf, other]: Title: AutoAlign: Pixel-Instance Feature Aggregation for Multi-Modal 3D Object Detection

Authors: Zehui Chen, Zhenyu Li, Shiquan Zhang, Liangji Fang, Qinghong Jiang, Feng Zhao, Bolei Zhou, Hang Zhao

Comments: Accepted to IJCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[352] arXiv:2201.06569 [pdf, other]: Title: Automatic Quantification and Visualization of Street Trees

Authors: Arpit Bahety, Rohit Saluja, Ravi Kiran Sarvadevabhatla, Anbumani Subramanian, C.V. Jawahar

Comments: Accepted at ICVGIP 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353] arXiv:2201.06570 [pdf, other]: Title: BDA-SketRet: Bi-Level Domain Adaptation for Zero-Shot SBIR

Authors: Ushasi Chaudhuri, Ruchika Chavan, Biplab Banerjee, Anjan Dutta, Zeynep Akata

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[354] arXiv:2201.06578 [pdf, other]: Title: Collapse by Conditioning: Training Class-conditional GANs with Limited Data

Authors: Mohamad Shahbazi, Martin Danelljan, Danda Pani Paudel, Luc Van Gool

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[355] arXiv:2201.06594 [pdf, other]: Title: Using Machine Learning to Detect Rotational Symmetries from Reflectional Symmetries in 2D Images

Authors: Koen Ponse, Anna V. Kononova, Maria Loleyt, Bas van Stein

Comments: 8 pages, 12 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[356] arXiv:2201.06629 [pdf, other]: Title: Validation of object detection in UAV-based images using synthetic data

Authors: Eung-Joo Lee, Damon M. Conover, Shuvra S. Bhattacharyya, Heesung Kwon, Jason Hill, Kenneth Evensen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[357] arXiv:2201.06644 [pdf, other]: Title: HydraFusion: Context-Aware Selective Sensor Fusion for Robust and Efficient Autonomous Vehicle Perception

Authors: Arnav Vaibhav Malawade, Trier Mortlock, Mohammad Abdullah Al Faruque

Comments: Accepted to be published in the 13th ACM/IEEE International Conference on Cyber-Physical Systems (ICCPS 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[358] arXiv:2201.06648 [pdf, other]: Title: OmniPrint: A Configurable Printed Character Synthesizer

Authors: Haozhe Sun, Wei-Wei Tu, Isabelle Guyon

Comments: Accepted at 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks. this https URL

Journal-ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[359] arXiv:2201.06686 [pdf, ps, other]: Title: Unpaired Referring Expression Grounding via Bidirectional Cross-Modal Matching

Authors: Hengcan Shi, Munawar Hayat, Jianfei Cai

Comments: 9 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[360] arXiv:2201.06696 [pdf, other]: Title: ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP Cues

Authors: Hengcan Shi, Munawar Hayat, Yicheng Wu, Jianfei Cai

Comments: 10 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[361] arXiv:2201.06734 [pdf, other]: Title: Cross-modal Contrastive Distillation for Instructional Activity Anticipation

Authors: Zhengyuan Yang, Jingen Liu, Jing Huang, Xiaodong He, Tao Mei, Chenliang Xu, Jiebo Luo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[362] arXiv:2201.06740 [pdf, other]: Title: Convolutional Cobweb: A Model of Incremental Learning from 2D Images

Authors: Christopher J. MacLellan, Harshil Thakur

Comments: 14 pages, 6 figures, Presented at Advances in Cognitive Systems 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[363] arXiv:2201.06750 [pdf, other]: Title: DDU-Net: Dual-Decoder-U-Net for Road Extraction Using High-Resolution Remote Sensing Images

Authors: Ying Wang, Yuexing Peng, Xinran Liu, Wei Li, George C. Alexandropoulos, Junchuan Yu, Daqing Ge, Wei Xiang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[364] arXiv:2201.06775 [pdf, other]: Title: Deformable One-Dimensional Object Detection for Routing and Manipulation

Authors: Azarakhsh Keipour, Maryam Bandari, Stefan Schaal

Comments: Accepted to IEEE Robotics and Automation Letters, January 2022. 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[365] arXiv:2201.06776 [pdf, other]: Title: Pruning-aware Sparse Regularization for Network Pruning

Authors: Nanfei Jiang, Xu Zhao, Chaoyang Zhao, Yongqi An, Ming Tang, Jinqiao Wang

Comments: MIR 2023

Journal-ref: Machine Intelligence Research, 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[366] arXiv:2201.06781 [pdf, other]: Title: When Facial Expression Recognition Meets Few-Shot Learning: A Joint and Alternate Learning Framework

Authors: Xinyi Zou, Yan Yan, Jing-Hao Xue, Si Chen, Hanzi Wang

Comments: 9 pages, 2 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[367] arXiv:2201.06794 [pdf, other]: Title: Resistance Training using Prior Bias: toward Unbiased Scene Graph Generation

Authors: Chao Chen, Yibing Zhan, Baosheng Yu, Liu Liu, Yong Luo, Bo Du

Comments: Accepted by AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[368] arXiv:2201.06799 [pdf, other]: Title: Pistol: Pupil Invisible Supportive Tool to extract Pupil, Iris, Eye Opening, Eye Movements, Pupil and Iris Gaze Vector, and 2D as well as 3D Gaze

Authors: Wolfgang Fuhl, Daniel Weber, Shahram Eivazi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[369] arXiv:2201.06823 [pdf, other]: Title: Adaptive Weighted Guided Image Filtering for Depth Enhancement in Shape-From-Focus

Authors: Yuwen Li, Zhengguo Li, Chaobing Zheng, Shiqian Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[370] arXiv:2201.06824 [pdf, ps, other]: Title: STURE: Spatial-Temporal Mutual Representation Learning for Robust Data Association in Online Multi-Object Tracking

Authors: Haidong Wang, Zhiyong Li, Yaping Li, Ke Nai, Ming Wen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[371] arXiv:2201.06825 [pdf, other]: Title: Deep Learning Based Framework for Iranian License Plate Detection and Recognition

Authors: Mojtaba Shahidi Zandi, Roozbeh Rajabi

Comments: 20 pages, journal

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[372] arXiv:2201.06845 [pdf, other]: Title: Taylor3DNet: Fast 3D Shape Inference With Landmark Points Based Taylor Series

Authors: Yuting Xiao, Jiale Xu, Shenghua Gao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[373] arXiv:2201.06857 [pdf, other]: Title: RePre: Improving Self-Supervised Vision Transformer with Reconstructive Pre-training

Authors: Luya Wang, Feng Liang, Yangguang Li, Honggang Zhang, Wanli Ouyang, Jing Shao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[374] arXiv:2201.06888 [pdf, other]: Title: Autoencoding Video Latents for Adversarial Video Generation

Authors: Sai Hemanth Kasaraneni

Comments: preprint

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[375] arXiv:2201.06889 [pdf, other]: Title: Boosting Robustness of Image Matting with Context Assembling and Strong Data Augmentation

Authors: Yutong Dai, Brian Price, He Zhang, Chunhua Shen

Comments: 19 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[376] arXiv:2201.06933 [pdf, other]: Title: Context-Aware Scene Prediction Network (CASPNet)

Authors: Maximilian Schäfer, Kun Zhao, Markus Bühren, Anton Kummert

Comments: 9 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[377] arXiv:2201.06945 [pdf, ps, other]: Title: It's All in the Head: Representation Knowledge Distillation through Classifier Sharing

Authors: Emanuel Ben-Baruch, Matan Karklinsky, Yossi Biton, Avi Ben-Cohen, Hussam Lawen, Nadav Zamir

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[378] arXiv:2201.06974 [pdf, other]: Title: Continual Coarse-to-Fine Domain Adaptation in Semantic Segmentation

Authors: Donald Shenaj, Francesco Barbato, Umberto Michieli, Pietro Zanuttigh

Comments: 24 pages, 9 figures, 6 tables, under submission

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[379] arXiv:2201.06978 [pdf, other]: Title: ASOCEM: Automatic Segmentation Of Contaminations in cryo-EM

Authors: Amitay Eldar, Ido Amos, Yoel Shkolnisky

Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[380] arXiv:2201.07021 [pdf, other]: Title: MuSCLe: A Multi-Strategy Contrastive Learning Framework for Weakly Supervised Semantic Segmentation

Authors: Kunhao Yuan, Gerald Schaefer, Yu-Kun Lai, Yifan Wang, Xiyao Liu, Lin Guan, Hui Fang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[381] arXiv:2201.07070 [pdf, other]: Title: Attention-based Proposals Refinement for 3D Object Detection

Authors: Minh-Quan Dao, Elwan Héry, Vincent Frémont

Comments: Accepted for IV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[382] arXiv:2201.07106 [pdf, other]: Title: Variational Inference for Quantifying Inter-observer Variability in Segmentation of Anatomical Structures

Authors: Xiaofeng Liu, Fangxu Xing, Thibault Marin, Georges El Fakhri, Jonghye Woo

Comments: SPIE Medical Imaging 2022 (Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[383] arXiv:2201.07120 [pdf, other]: Title: Contextual road lane and symbol generation for autonomous driving

Authors: Ajay Soni, Pratik Padamwar, Krishna Reddy Konda

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[384] arXiv:2201.07124 [pdf, other]: Title: Attentional Feature Refinement and Alignment Network for Aircraft Detection in SAR Imagery

Authors: Yan Zhao, Lingjun Zhao, Zhong Liu, Dewen Hu, Gangyao Kuang, Li Liu

Comments: A raw version, the same as the early-access version published in TGRS. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[385] arXiv:2201.07131 [pdf, other]: Title: Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection

Authors: Alexandros Haliassos, Rodrigo Mira, Stavros Petridis, Maja Pantic

Comments: CVPR 2022. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[386] arXiv:2201.07189 [pdf, other]: Title: MUSE-VAE: Multi-Scale VAE for Environment-Aware Long Term Trajectory Prediction

Authors: Mihee Lee, Samuel S. Sohn, Seonghyeon Moon, Sejong Yoon, Mubbasir Kapadia, Vladimir Pavlovic

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[387] arXiv:2201.07200 [pdf, other]: Title: Optimizing Active Learning for Low Annotation Budgets

Authors: Umang Aggarwal, Adrian Popescu, Céline Hudelot

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[388] arXiv:2201.07202 [pdf, other]: Title: GANmouflage: 3D Object Nondetection with Texture Fields

Authors: Rui Guo, Jasmine Collins, Oscar de Lima, Andrew Owens

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[389] arXiv:2201.07264 [pdf, other]: Title: Exploring Kervolutional Neural Networks

Authors: Nicolas Perez

Comments: 5 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[390] arXiv:2201.07309 [pdf, other]: Title: OSSID: Online Self-Supervised Instance Detection by (and for) Pose Estimation

Authors: Qiao Gu, Brian Okorn, David Held

Comments: 10 pages, 6 figures. RA-L and ICRA 2022

Journal-ref: IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 3022-3029, April 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[391] arXiv:2201.07366 [pdf, other]: Title: TriCoLo: Trimodal Contrastive Loss for Text to Shape Retrieval

Authors: Yue Ruan, Han-Hung Lee, Yiming Zhang, Ke Zhang, Angel X. Chang

Comments: Accepted by WACV 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392] arXiv:2201.07384 [pdf, other]: Title: Swin-Pose: Swin Transformer Based Human Pose Estimation

Authors: Zinan Xiong, Chenxi Wang, Ying Li, Yan Luo, Yu Cao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[393] arXiv:2201.07394 [pdf, other]: Title: KappaFace: Adaptive Additive Angular Margin Loss for Deep Face Recognition

Authors: Chingis Oinar, Binh M. Le, Simon S. Woo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[394] arXiv:2201.07412 [pdf, other]: Title: Poseur: Direct Human Pose Regression with Transformers

Authors: Weian Mao, Yongtao Ge, Chunhua Shen, Zhi Tian, Xinlong Wang, Zhibin Wang, Anton van den Hengel

Comments: Accepted to Proc. Eur. Conf. Comp. Vision (ECCV) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[395] arXiv:2201.07422 [pdf, other]: Title: Self-Supervised Deep Blind Video Super-Resolution

Authors: Haoran Bai, Jinshan Pan

Comments: Project website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[396] arXiv:2201.07425 [pdf, other]: Title: WebUAV-3M: A Benchmark for Unveiling the Power of Million-Scale Deep UAV Tracking

Authors: Chunhui Zhang, Guanjie Huang, Li Liu, Shan Huang, Yinan Yang, Xiang Wan, Shiming Ge, Dacheng Tao

Comments: 25 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[397] arXiv:2201.07428 [pdf, ps, other]: Title: Variable Augmented Network for Invertible MR Coil Compression

Authors: Xianghao Liao, Shanshan Wang, Lanlan Tu, Yuhao Wang, Dong Liang, Qiegen Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[398] arXiv:2201.07436 [pdf, other]: Title: Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth

Authors: Doyeon Kim, Woonghyun Ka, Pyungwhan Ahn, Donggyu Joo, Sehwan Chun, Junmo Kim

Comments: 11 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399] arXiv:2201.07451 [pdf, other]: Title: TransFuse: A Unified Transformer-based Image Fusion Framework using Self-supervised Learning

Authors: Linhao Qu, Shaolei Liu, Manning Wang, Shiman Li, Siqi Yin, Qin Qiao, Zhijian Song

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[400] arXiv:2201.07459 [pdf, other]: Title: PT4AL: Using Self-Supervised Pretext Tasks for Active Learning

Authors: John Seon Keun Yi, Minseok Seo, Jongchan Park, Dong-Geol Choi

Comments: Code is available at this https URL. Updated for ECCV 2022 submission

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[401] arXiv:2201.07486 [pdf, other]: Title: High-fidelity 3D Model Compression based on Key Spheres

Authors: Yuanzhan Li, Yuqi Liu, Yujie Lu, Siyu Zhang, Shen Cai, Yanting Zhang

Comments: Accepted in Data Compression Conference (DCC) 2022 as a full paper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402] arXiv:2201.07495 [pdf, other]: Title: Weakly Supervised Semantic Segmentation of Remote Sensing Images for Tree Species Classification Based on Explanation Methods

Authors: Steve Ahlswede, Nimisha Thekke-Madam, Christian Schulz, Birgit Kleinschmit, Begüm Demir

Comments: 4 pages, 1 figure, submitted to IEEE Geosciences and Remote Sensing Symposium (2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[403] arXiv:2201.07540 [pdf, ps, other]: Title: Virtual Coil Augmentation Technology for MR Coil Extrapolation via Deep Learning

Authors: Cailian Yang, Xianghao Liao, Yuhao Wang, Minghui Zhang, Qiegen Liu

Comments: arXiv admin note: text overlap with arXiv:2103.15061, arXiv:1907.03063, arXiv:1807.03039 by other authors

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[404] arXiv:2201.07572 [pdf, other]: Title: Superpixel Pre-Segmentation of HER2 Slides for Efficient Annotation

Authors: Mathias Öttl, Jana Mönius, Christian Marzahl, Matthias Rübner, Carol I. Geppert, Arndt Hartmann, Matthias W. Beckmann, Peter Fasching, Andreas Maier, Ramona Erber, Katharina Breininger

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[405] arXiv:2201.07583 [pdf, ps, other]: Title: DMF-Net: Dual-Branch Multi-Scale Feature Fusion Network for copy forgery identification of anti-counterfeiting QR code

Authors: Zhongyuan Guo, Hong Zheng, Changhui You, Tianyu Wang, Chang Liu

Comments: 17 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[406] arXiv:2201.07594 [pdf, ps, other]: Title: Real-time Recognition of Yoga Poses using computer Vision for Smart Health Care

Authors: Abhishek Sharma, Yash Shah, Yash Agrawal, Prateek Jain

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[407] arXiv:2201.07609 [pdf, other]: Title: A Confidence-based Iterative Solver of Depths and Surface Normals for Deep Multi-view Stereo

Authors: Wang Zhao, Shaohui Liu, Yi Wei, Hengkai Guo, Yong-Jin Liu

Comments: 17 pages, 13 figures, 7 tables. ICCV 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[408] arXiv:2201.07619 [pdf, other]: Title: CAST: Character labeling in Animation using Self-supervision by Tracking

Authors: Oron Nir, Gal Rapoport, Ariel Shamir

Comments: Published as a conference paper at EuroGraphics 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[409] arXiv:2201.07661 [pdf, other]: Title: Open Source Handwritten Text Recognition on Medieval Manuscripts using Mixed Models and Document-Specific Finetuning

Authors: Christian Reul, Stefan Tomasek, Florian Langhanki, Uwe Springmann

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[410] arXiv:2201.07665 [pdf, other]: Title: Semi-automatic 3D Object Keypoint Annotation and Detection for the Masses

Authors: Kenneth Blomqvist, Jen Jen Chung, Lionel Ott, Roland Siegwart

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[411] arXiv:2201.07676 [pdf, other]: Title: Neighborhood Spatial Aggregation MC Dropout for Efficient Uncertainty-aware Semantic Segmentation in Point Clouds

Authors: Chao Qi, Jianqin Yin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[412] arXiv:2201.07692 [pdf, other]: Title: GroupGazer: A Tool to Compute the Gaze per Participant in Groups with integrated Calibration to Map the Gaze Online to a Screen or Beamer Projection

Authors: Wolfgang Fuhl, Daniel Weber, Shahram Eivazi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[413] arXiv:2201.07703 [pdf, other]: Title: Q-ViT: Fully Differentiable Quantization for Vision Transformer

Authors: Zhexin Li, Tong Yang, Peisong Wang, Jian Cheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[414] arXiv:2201.07706 [pdf, ps, other]: Title: Object Detection in Autonomous Vehicles: Status and Open Challenges

Authors: Abhishek Balasubramaniam, Sudeep Pasricha

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[415] arXiv:2201.07734 [pdf, other]: Title: Towards holistic scene understanding: Semantic segmentation and beyond

Authors: Panagiotis Meletis

Comments: PhD Thesis, Eindhoven University of Technology, October 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[416] arXiv:2201.07756 [pdf, other]: Title: A pipeline for automated processing of Corona KH-4 (1962-1972) stereo imagery

Authors: Sajid Ghuffar, Tobias Bolch, Ewelina Rupnik, Atanu Bhattacharya

Comments: 24 Pages, 16 Figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[417] arXiv:2201.07781 [pdf, other]: Title: Towards a General Deep Feature Extractor for Facial Expression Recognition

Authors: Liam Schoneveld, Alice Othmani

Comments: Published in: 2021 IEEE International Conference on Image Processing (ICIP). arXiv admin note: text overlap with arXiv:2103.09154

Journal-ref: IEEE International Conference on Image Processing (ICIP), 2021, pp. 2339-2342

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[418] arXiv:2201.07788 [pdf, other]: Title: ConDor: Self-Supervised Canonicalization of 3D Pose for Partial Shapes

Authors: Rahul Sajnani, Adrien Poulenard, Jivitesh Jain, Radhika Dua, Leonidas J. Guibas, Srinath Sridhar

Comments: Accepted to CVPR 2022, New Orleans, Louisiana. For project page and code, see this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[419] arXiv:2201.07894 [pdf, other]: Title: Enhanced Performance of Pre-Trained Networks by Matched Augmentation Distributions

Authors: Touqeer Ahmad, Mohsen Jafarzadeh, Akshay Raj Dhamija, Ryan Rabinowitz, Steve Cruz, Chunchun Li, Terrance E. Boult

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[420] arXiv:2201.07906 [pdf, other]: Title: The Role of Facial Expressions and Emotion in ASL

Authors: Lee Kezar, Pei Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[421] arXiv:2201.07927 [pdf, other]: Title: Learning-by-Novel-View-Synthesis for Full-Face Appearance-Based 3D Gaze Estimation

Authors: Jiawei Qin, Takuru Shimoyama, Yusuke Sugano

Comments: Camera-ready version for CVPR 2022 Workshop (GAZE 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[422] arXiv:2201.07929 [pdf, other]: Title: Estimating Egocentric 3D Human Pose in the Wild with External Weak Supervision

Authors: Jian Wang, Lingjie Liu, Weipeng Xu, Kripasindhu Sarkar, Diogo Luvizon, Christian Theobalt

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[423] arXiv:2201.07931 [pdf, other]: Title: Experimental Large-Scale Jet Flames' Geometrical Features Extraction for Risk Management Using Infrared Images and Deep Learning Segmentation Methods

Authors: Carmina Pérez-Guerrero, Adriana Palacios, Gilberto Ochoa-Ruiz, Christian Mata, Joaquim Casal, Miguel Gonzalez-Mendoza, Luis Eduardo Falcón-Morales

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[424] arXiv:2201.07937 [pdf, other]: Title: GASCN: Graph Attention Shape Completion Network

Authors: Haojie Huang, Ziyi Yang, Robert Platt

Comments: International Conference on 3D Vision (3DV)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[425] arXiv:2201.07989 [pdf, other]: Title: Self-supervised Video Representation Learning with Cascade Positive Retrieval

Authors: Cheng-En Wu, Farley Lai, Yu Hen Hu, Asim Kadav

Comments: To appear in CVPR 2022 L3D-IVU Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[426] arXiv:2201.08001 [pdf, other]: Title: CELESTIAL: Classification Enabled via Labelless Embeddings with Self-supervised Telescope Image Analysis Learning

Authors: Suhas Kotha, Anirudh Koul, Siddha Ganju, Meher Kasam

Comments: COSPAR 2021 Cross-Disciplinary Workshop on Machine Learning for Space Sciences, Sydney, Australia

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[427] arXiv:2201.08002 [pdf, other]: Title: PRMI: A Dataset of Minirhizotron Images for Diverse Plant Root Study

Authors: Weihuang Xu, Guohao Yu, Yiming Cui, Romain Gloaguen, Alina Zare, Jason Bonnette, Joel Reyes-Cabrera, Ashish Rajurkar, Diane Rowland, Roser Matamala, Julie D. Jastrow, Thomas E. Juenger, Felix B. Fritschi

Comments: The 36th AAAI Conference on the AI for Agriculture and Food Systems (AIAFS) Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[428] arXiv:2201.08027 [pdf, ps, other]: Title: A Joint Morphological Profiles and Patch Tensor Change Detection for Hyperspectral Imagery

Authors: Zengfu Hou, Wei Li

Subjects: Computer Vision and Pattern Recognition (cs.CV); Methodology (stat.ME)
[429] arXiv:2201.08029 [pdf, other]: Title: Domain Generalization via Frequency-domain-based Feature Disentanglement and Interaction

Authors: Jingye Wang, Ruoyi Du, Dongliang Chang, Kongming Liang, Zhanyu Ma

Comments: The paper is accepted by ACM Multimedia 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430] arXiv:2201.08049 [pdf, other]: Title: Lightweight Salient Object Detection in Optical Remote Sensing Images via Feature Correlation

Authors: Gongyang Li, Zhi Liu, Zhen Bai, Weisi Lin, Haibin Ling

Comments: 11 pages, 6 figures, Accepted by IEEE Transactions on Geoscience and Remote Sensing 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[431] arXiv:2201.08050 [pdf, other]: Title: TerViT: An Efficient Ternary Vision Transformer

Authors: Sheng Xu, Yanjing Li, Teli Ma, Bohan Zeng, Baochang Zhang, Peng Gao, Jinhu Lv

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[432] arXiv:2201.08051 [pdf, other]: Title: Predicting Vegetation Stratum Occupancy from Airborne LiDAR Data with Deep Learning

Authors: Ekaterina Kalinicheva, Loic Landrieu, Clément Mallet, Nesrine Chehata

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[433] arXiv:2201.08071 [pdf, other]: Title: Temporal Sentence Grounding in Videos: A Survey and Future Directions

Authors: Hao Zhang, Aixin Sun, Wei Jing, Joey Tianyi Zhou

Comments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[434] arXiv:2201.08093 [pdf, other]: Title: AirPose: Multi-View Fusion Network for Aerial 3D Human Pose and Shape Estimation

Authors: Nitin Saini, Elia Bonetto, Eric Price, Aamir Ahmad, Michael J. Black

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[435] arXiv:2201.08098 [pdf, other]: Title: What can we learn from misclassified ImageNet images?

Authors: Shixian Wen, Amanda Sofie Rios, Kiran Lekkala, Laurent Itti

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[436] arXiv:2201.08122 [pdf, other]: Title: A Computational Model for Machine Thinking

Authors: Slimane Larabi

Comments: Internal report, RIIMA Laboratory

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[437] arXiv:2201.08125 [pdf, other]: Title: Deep Unsupervised Contrastive Hashing for Large-Scale Cross-Modal Text-Image Retrieval in Remote Sensing

Authors: Georgii Mikriukov, Mahdyar Ravanbakhsh, Begüm Demir

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[438] arXiv:2201.08131 [pdf, other]: Title: GeoFill: Reference-Based Image Inpainting with Better Geometric Understanding

Authors: Yunhan Zhao, Connelly Barnes, Yuqian Zhou, Eli Shechtman, Sohrab Amirghodsi, Charless Fowlkes

Comments: Accepted to WACV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[439] arXiv:2201.08141 [pdf, other]: Title: SPAMs: Structured Implicit Parametric Models

Authors: Pablo Palafox, Nikolaos Sarafianos, Tony Tung, Angela Dai

Comments: Project page: this https URL - Video: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[440] arXiv:2201.08157 [pdf, other]: Title: WPPNets and WPPFlows: The Power of Wasserstein Patch Priors for Superresolution

Authors: Fabian Altekrüger, Johannes Hertrich

Journal-ref: SIAM Journal on Imaging Sciences, vol. 16(3), pp. 1033-1067, 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[441] arXiv:2201.08158 [pdf, other]: Title: HDhuman: High-quality Human Novel-view Rendering from Sparse Views

Authors: Tiansong Zhou, Jing Huang, Tao Yu, Ruizhi Shao, Kun Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[442] arXiv:2201.08215 [pdf, other]: Title: CP-Net: Contour-Perturbed Reconstruction Network for Self-Supervised Point Cloud Learning

Authors: Mingye Xu, Yali Wang, Zhipeng Zhou, Hongbin Xu, Yu Qiao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[443] arXiv:2201.08217 [pdf, other]: Title: Watermarking Pre-trained Encoders in Contrastive Learning

Authors: Yutong Wu, Han Qiu, Tianwei Zhang, Jiwei L, Meikang Qiu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[444] arXiv:2201.08264 [pdf, other]: Title: End-to-end Generative Pretraining for Multimodal Video Captioning

Authors: Paul Hongsuck Seo, Arsha Nagrani, Anurag Arnab, Cordelia Schmid

Journal-ref: Proceedings of Conference on Computer Vision and Pattern Recognition (CVPR) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[445] arXiv:2201.08295 [pdf, other]: Title: DIVA-DAF: A Deep Learning Framework for Historical Document Image Analysis

Authors: Lars Vögtlin, Anna Scius-Bertrand, Paul Maergner, Andreas Fischer, Rolf Ingold

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[446] arXiv:2201.08361 [pdf, other]: Title: Stitch it in Time: GAN-Based Facial Editing of Real Videos

Authors: Rotem Tzaban, Ron Mokady, Rinon Gal, Amit H. Bermano, Daniel Cohen-Or

Comments: Project website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[447] arXiv:2201.08371 [pdf, other]: Title: Revisiting Weakly Supervised Pre-Training of Visual Perception Models

Authors: Mannat Singh, Laura Gustafson, Aaron Adcock, Vinicius de Freitas Reis, Bugra Gedik, Raj Prateek Kosaraju, Dhruv Mahajan, Ross Girshick, Piotr Dollár, Laurens van der Maaten

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[448] arXiv:2201.08377 [pdf, other]: Title: Omnivore: A Single Model for Many Visual Modalities

Authors: Rohit Girdhar, Mannat Singh, Nikhila Ravi, Laurens van der Maaten, Armand Joulin, Ishan Misra

Comments: Accepted at CVPR 2022 (Oral Presentation)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[449] arXiv:2201.08379 [pdf, other]: Title: Learning Pixel Trajectories with Multiscale Contrastive Random Walks

Authors: Zhangxing Bian, Allan Jabri, Alexei A. Efros, Andrew Owens

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[450] arXiv:2201.08383 [pdf, other]: Title: MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition

Authors: Chao-Yuan Wu, Yanghao Li, Karttikeya Mangalam, Haoqi Fan, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer

Comments: Technical report. arXiv v2: add link to code

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[451] arXiv:2201.08425 [pdf, other]: Title: FaceOcc: A Diverse, High-quality Face Occlusion Dataset for Human Face Extraction

Authors: Xiangnan Yin, Liming Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[452] arXiv:2201.08465 [pdf, other]: Title: An Empirical Investigation of Model-to-Model Distribution Shifts in Trained Convolutional Filters

Authors: Paul Gavrikov, Janis Keuper

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[453] arXiv:2201.08550 [pdf, other]: Title: What Can Machine Vision Do for Lymphatic Histopathology Image Analysis: A Comprehensive Review

Authors: Xiaoqi Li, Haoyuan Chen, Chen Li, Md Mamunur Rahaman, Xintong Li, Jian Wu, Xiaoyan Li, Hongzan Sun, Marcin Grzegorzek

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[454] arXiv:2201.08574 [pdf, other]: Title: Classroom Slide Narration System

Authors: Jobin K.V., Ajoy Mondal, C. V. Jawahar

Journal-ref: CVIP 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[455] arXiv:2201.08613 [pdf, other]: Title: Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization

Authors: Can Wang, Sheng Jin, Yingda Guan, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang

Comments: To appear at ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[456] arXiv:2201.08619 [pdf, other]: Title: Dangerous Cloaking: Natural Trigger based Backdoor Attacks on Object Detectors in the Physical World

Authors: Hua Ma, Yinshan Li, Yansong Gao, Alsharif Abuadbba, Zhi Zhang, Anmin Fu, Hyoungshick Kim, Said F. Al-Sarawi, Nepal Surya, Derek Abbott

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[457] arXiv:2201.08625 [pdf, other]: Title: VIPriors 2: Visual Inductive Priors for Data-Efficient Deep Learning Challenges

Authors: Attila Lengyel, Robert-Jan Bruintjes, Marcos Baptista Rios, Osman Semih Kayhan, Davide Zambrano, Nergis Tomen, Jan van Gemert

Comments: 11 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[458] arXiv:2201.08633 [pdf, other]: Title: Multi-view Monocular Depth and Uncertainty Prediction with Deep SfM in Dynamic Environments

Authors: Christian Homeyer, Oliver Lange, Christoph Schnörr

Comments: 20 pages, 5 figures, 3 tables, submitted to ICPRAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[459] arXiv:2201.08636 [pdf, ps, other]: Title: Conceptor Learning for Class Activation Mapping

Authors: Guangwu Qian, Zhen-Qun Yang, Xu-Lu Zhang, Yaowei Wang, Qing Li, Xiao-Yong Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[460] arXiv:2201.08657 [pdf, other]: Title: Enhancing Pseudo Label Quality for Semi-Supervised Domain-Generalized Medical Image Segmentation

Authors: Huifeng Yao, Xiaowei Hu, Xiaomeng Li

Comments: Accepted by AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[461] arXiv:2201.08663 [pdf, other]: Title: Fast Differentiable Matrix Square Root

Authors: Yue Song, Nicu Sebe, Wei Wang

Comments: Accepted by ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Mathematical Software (cs.MS); Numerical Analysis (math.NA)
[462] arXiv:2201.08669 [pdf, other]: Title: Dynamic Deep Convolutional Candlestick Learner

Authors: Jun-Hao Chen, Yun-Cheng Tsai

Comments: 11 pages, 9 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[463] arXiv:2201.08673 [pdf, other]: Title: Exploring Fusion Strategies for Accurate RGBT Visual Object Tracking

Authors: Zhangyong Tang (1), Tianyang Xu (1), Hui Li (1), Xiao-Jun Wu (1), Xuefeng Zhu (1), Josef Kittler (2) ((1) Jiangnan University, Wuxi, China, (2) University of Surrey, UK)

Comments: 13 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[464] arXiv:2201.08683 [pdf, other]: Title: A Comprehensive Study of Vision Transformers on Dense Prediction Tasks

Authors: Kishaan Jeeveswaran, Senthilkumar Kathiresan, Arnav Varma, Omar Magdy, Bahram Zonooz, Elahe Arani

Comments: 17th International Conference on Computer Vision Theory and Applications (VISAPP, 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[465] arXiv:2201.08746 [pdf, other]: Title: ERS: a novel comprehensive endoscopy image dataset for machine learning, compliant with the MST 3.0 specification

Authors: Jan Cychnerski, Tomasz Dziubich, Adam Brzeski

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[466] arXiv:2201.08763 [pdf, other]: Title: Object Detection in Aerial Images: What Improves the Accuracy?

Authors: Hashmat Shadab Malik, Ikboljon Sobirov, Abdelrahman Mohamed

Comments: 8 pages, 14 Figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[467] arXiv:2201.08779 [pdf, other]: Title: Contrastive and Selective Hidden Embeddings for Medical Image Segmentation

Authors: Zhuowei Li, Zihao Liu, Zhiqiang Hu, Qing Xia, Ruiqin Xiong, Shaoting Zhang, Dimitris Metaxas, Tingting Jiang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[468] arXiv:2201.08789 [pdf, other]: Title: AiTLAS: Artificial Intelligence Toolbox for Earth Observation

Authors: Ivica Dimitrovski, Ivan Kitanovski, Panče Panov, Nikola Simidjievski, Dragi Kocev

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[469] arXiv:2201.08812 [pdf, ps, other]: Title: DeepMix: Mobility-aware, Lightweight, and Hybrid 3D Object Detection for Headsets

Authors: Yongjie Guan, Xueyu Hou, Nan Wu, Bo Han, Tao Han

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[470] arXiv:2201.08813 [pdf, other]: Title: Active Predictive Coding Networks: A Neural Solution to the Problem of Learning Reference Frames and Part-Whole Hierarchies

Authors: Dimitrios C. Gklezakos, Rajesh P. N. Rao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[471] arXiv:2201.08815 [pdf, other]: Title: Learning from One and Only One Shot

Authors: Haizi Yu, Igor Mineyev, Lav R. Varshney, James A. Evans

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[472] arXiv:2201.08816 [pdf, other]: Title: Skyline variations allow estimating distance to trees on landscape photos using semantic segmentation

Authors: Laura Martinez-Sanchez, Daniele Borio, Raphaël d'Andrimont, Marijn van der Velde

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Applications (stat.AP)
[473] arXiv:2201.08831 [pdf, other]: Title: Reliable Detection of Doppelgängers based on Deep Face Representations

Authors: Christian Rathgeb, Daniel Fischer, Pawel Drozdowski, Christoph Busch

Comments: Accepted in IET Biometrics

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[474] arXiv:2201.08845 [pdf, other]: Title: Point-NeRF: Point-based Neural Radiance Fields

Authors: Qiangeng Xu, Zexiang Xu, Julien Philip, Sai Bi, Zhixin Shu, Kalyan Sunkavalli, Ulrich Neumann

Comments: Accepted to CVPR 2022 (Oral)

Journal-ref: In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 5438-5448) (2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[475] arXiv:2201.08887 [pdf, other]: Title: Image-to-Video Re-Identification via Mutual Discriminative Knowledge Transfer

Authors: Pichao Wang, Fan Wang, Hao Li

Comments: Accepted by ICASSP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[476] arXiv:2201.08893 [pdf, other]: Title: Signal Strength and Noise Drive Feature Preference in CNN Image Classifiers

Authors: Max Wolff, Stuart Wolff

Comments: Accepted at SVRHM 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[477] arXiv:2201.08901 [pdf, ps, other]: Title: An Ensemble Model for Face Liveness Detection

Authors: Shashank Shekhar, Avinash Patel, Mrinal Haloi, Asif Salim

Comments: Accepted and presented at MLDM 2022. To be published in Lattice journal

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[478] arXiv:2201.08938 [pdf, other]: Title: Adaptive DropBlock Enhanced Generative Adversarial Networks for Hyperspectral Image Classification

Authors: Junjie Wang, Feng Gao, Junyu Dong, Qian Du

Journal-ref: in IEEE Transactions on Geoscience and Remote Sensing, vol. 59, no. 6, pp. 5040-5053, June 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[479] arXiv:2201.08949 [pdf, other]: Title: Temporal Aggregation for Adaptive RGBT Tracking

Authors: Zhangyong Tang, Tianyang Xu, Xiao-Jun Wu

Comments: 12 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[480] arXiv:2201.08951 [pdf, other]: Title: Visual Representation Learning with Self-Supervised Attention for Low-Label High-data Regime

Authors: Prarthana Bhattacharyya, Chenge Li, Xiaonan Zhao, István Fehérvári, Jason Sun

Comments: Accepted to ICASSP-2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[481] arXiv:2201.08953 [pdf, other]: Title: FedMed-GAN: Federated Domain Translation on Unsupervised Cross-Modality Brain Image Synthesis

Authors: Jinbao Wang, Guoyang Xie, Yawen Huang, Jiayi Lyu, Yefeng Zheng, Feng Zheng, Yaochu Jin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[482] arXiv:2201.08954 [pdf, other]: Title: Change Detection from Synthetic Aperture Radar Images via Graph-Based Knowledge Supplement Network

Authors: Junjie Wang, Feng Gao, Junyu Dong, Shan Zhang, Qian Du

Comments: Accepted by IEEE JSTARS

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[483] arXiv:2201.08958 [pdf, other]: Title: Learning Efficient Representations for Enhanced Object Detection on Large-scene SAR Images

Authors: Siyan Li, Yue Xiao, Yuhang Zhang, Lei Chu, Robert C. Qiu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[484] arXiv:2201.08959 [pdf, other]: Title: Few-shot Object Counting with Similarity-Aware Feature Enhancement

Authors: Zhiyuan You, Kai Yang, Wenhan Luo, Xin Lu, Lei Cui, Xinyi Le

Comments: Accepted by WACV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[485] arXiv:2201.08962 [pdf, other]: Title: Collaborative Representation for SPD Matrices with Application to Image-Set Classification

Authors: Li Chu, Rui Wang, Xiao-Jun Wu

Comments: 9 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[486] arXiv:2201.08970 [pdf, other]: Title: Parallel Rectangle Flip Attack: A Query-based Black-box Attack against Object Detection

Authors: Siyuan Liang, Baoyuan Wu, Yanbo Fan, Xingxing Wei, Xiaochun Cao

Comments: 8 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[487] arXiv:2201.08977 [pdf, other]: Title: Semi-Supervised Adversarial Recognition of Refined Window Structures for Inverse Procedural Façade Modeling

Authors: Han Hu, Xinrong Liang, Yulin Ding, Qisen Shang, Bo Xu, Xuming Ge, Min Chen, Ruofei Zhong, Qing Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[488] arXiv:2201.08983 [pdf, other]: Title: BBA-net: A bi-branch attention network for crowd counting

Authors: Yi Hou, Chengyang Li, Fan Yang, Cong Ma, Liping Zhu, Yuan Li, Huizhu Jia, Xiaodong Xie

Journal-ref: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[489] arXiv:2201.08992 [pdf, other]: Title: Enhancing and Dissecting Crowd Counting By Synthetic Data

Authors: Yi Hou, Chengyang Li, Yuheng Lu, Liping Zhu, Yuan Li, Huizhu Jia, Xiaodong Xie

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[490] arXiv:2201.08996 [pdf, other]: Title: Linear Array Network for Low-light Image Enhancement

Authors: Keqi Wang, Ziteng Cui, Jieru Jia, Hao Xu, Ge Wu, Yin Zhuang, Lu Chen, Zhiguo Hu, Yuhua Qian

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[491] arXiv:2201.09023 [pdf, other]: Title: Content-aware Warping for View Synthesis

Authors: Mantang Guo, Junhui Hou, Jing Jin, Hui Liu, Huanqiang Zeng, Jiwen Lu

Comments: arXiv admin note: text overlap with arXiv:2108.07408

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[492] arXiv:2201.09041 [pdf, other]: Title: Inter-Semantic Domain Adversarial in Histopathological Images

Authors: Nicolas Dumas, Valentin Derangère, Laurent Arnould, Sylvain Ladoire, Louis-Oscar Morel, Nathan Vinçon

Comments: 8 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[493] arXiv:2201.09042 [pdf, other]: Title: Uncertainty-aware deep learning methods for robust diabetic retinopathy classification

Authors: Joel Jaskari, Jaakko Sahlsten, Theodoros Damoulas, Jeremias Knoblauch, Simo Särkkä, Leo Kärkkäinen, Kustaa Hietala, Kimmo Kaski

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[494] arXiv:2201.09048 [pdf, other]: Title: Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems

Authors: Xi Zheng, Rui Ma, Rui Gao, Qi Hao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[495] arXiv:2201.09049 [pdf, other]: Title: LTC-SUM: Lightweight Client-driven Personalized Video Summarization Framework Using 2D CNN

Authors: Ghulam Mujtaba, Adeel Malik, Eun-Seok Ryu

Comments: 14

Journal-ref: in IEEE Access, vol. 10, pp. 103041-103055, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[496] arXiv:2201.09061 [pdf, other]: Title: Explore the Expression: Facial Expression Generation using Auxiliary Classifier Generative Adversarial Network

Authors: J. Rafid Siddiqui

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[497] arXiv:2201.09077 [pdf, other]: Title: LTC-GIF: Attracting More Clicks on Feature-length Sports Videos

Authors: Ghulam Mujtaba, Jaehyuk Choi, Eun-Seok Ryu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[498] arXiv:2201.09079 [pdf, other]: Title: Implicit Bias of Projected Subgradient Method Gives Provable Robust Recovery of Subspaces of Unknown Codimension

Authors: Paris V. Giampouras, Benjamin D. Haeffele, René Vidal

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[499] arXiv:2201.09089 [pdf, ps, other]: Title: A Comprehensive Study on Occlusion Invariant Face Recognition under Face Mask Occlusion

Authors: Susith Hemathilaka, Achala Aponso

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[500] arXiv:2201.09109 [pdf, other]: Title: Robust Unpaired Single Image Super-Resolution of Faces

Authors: Saurabh Goswami, Rajagopalan A. N

Comments: 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[501] arXiv:2201.09120 [pdf, other]: Title: Investigating the Potential of Auxiliary-Classifier GANs for Image Classification in Low Data Regimes

Authors: Amil Dravid, Florian Schiffers, Yunan Wu, Oliver Cossairt, Aggelos K. Katsaggelos

Comments: 4 pages content, 1 page references, 3 figures, 2 tables, to appear in ICASSP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[502] arXiv:2201.09135 [pdf, other]: Title: MIDAS: Deep learning human action intention prediction from natural eye movement patterns

Authors: Paul Festor, Ali Shafti, Alex Harston, Michey Li, Pavel Orlov, A. Aldo Faisal

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[503] arXiv:2201.09139 [pdf, other]: Title: Dual-Flattening Transformers through Decomposed Row and Column Queries for Semantic Segmentation

Authors: Ying Wang, Chiuman Ho, Wenju Xu, Ziwei Xuan, Xudong Liu, Guo-Jun Qi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[504] arXiv:2201.09144 [pdf, other]: Title: Background Invariant Classification on Infrared Imagery by Data Efficient Training and Reducing Bias in CNNs

Authors: Maliha Arif, Calvin Yong, Abhijit Mahalanobis

Comments: Accepted in AAAI-22 Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[505] arXiv:2201.09152 [pdf, other]: Title: Generative Adversarial Network Applications in Creating a Meta-Universe

Authors: Soheyla Amirian, Thiab R. Taha, Khaled Rasheed, Hamid R. Arabnia

Comments: Computational Science and Computational Intelligence; 2021 International Conference on IEEE CPS (IEEE XPLORE, Scopus), IEEE, 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[506] arXiv:2201.09153 [pdf, other]: Title: An Integrated Approach for Video Captioning and Applications

Authors: Soheyla Amirian, Thiab R. Taha, Khaled Rasheed, Hamid R. Arabnia

Comments: The 2021 World Congress in Computer Science, Computer Engineering, and Applied Computing (CSCE'21), IEEE, 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[507] arXiv:2201.09156 [pdf, other]: Title: LSNet: Extremely Light-Weight Siamese Network For Change Detection in Remote Sensing Image

Authors: Biyuan Liu, Huaixin Chen, Zhixi Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[508] arXiv:2201.09167 [pdf, other]: Title: Mixed X-Ray Image Separation for Artworks with Concealed Designs

Authors: Wei Pu, Jun-Jie Huang, Barak Sober, Nathan Daly, Catherine Higgitt, Ingrid Daubechies, Pier Luigi Dragotti, Miguel Rodrigues

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[509] arXiv:2201.09168 [pdf, other]: Title: Reading-strategy Inspired Visual Representation Learning for Text-to-Video Retrieval

Authors: Jianfeng Dong, Yabing Wang, Xianke Chen, Xiaoye Qu, Xirong Li, Yuan He, Xun Wang

Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[510] arXiv:2201.09169 [pdf, other]: Title: Rich Action-semantic Consistent Knowledge for Early Action Prediction

Authors: Xiaoli Liu, Jianqin Yin, Di Guo, Huaping Liu

Comments: Accepted by IEEE TIP, 15 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[511] arXiv:2201.09193 [pdf, other]: Title: Learning to Minimize the Remainder in Supervised Learning

Authors: Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

Comments: Accepted to IEEE TMM

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[512] arXiv:2201.09201 [pdf, other]: Title: Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments

Authors: Ming Dai, Enhui Zheng, Zhenhua Feng, Jiedong Zhuang, Wankou Yang

Comments: 13 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[513] arXiv:2201.09205 [pdf, other]: Title: Deeply Explain CNN via Hierarchical Decomposition

Authors: Ming-Ming Cheng, Peng-Tao Jiang, Ling-Hao Han, Liang Wang, Philip Torr

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[514] arXiv:2201.09206 [pdf, other]: Title: A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization

Authors: Ming Dai, Jianhong Hu, Jiedong Zhuang, Enhui Zheng

Comments: 14 pages, 13 figures, IEEE Transactions on Circuits and Systems for Video Technology

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[515] arXiv:2201.09207 [src]: Title: Visual Object Tracking on Multi-modal RGB-D Videos: A Review

Authors: Xue-Feng Zhu, Tianyang Xu, Xiao-Jun Wu

Comments: I prefer not to present this paper due to its subpar quality

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[516] arXiv:2201.09208 [pdf, ps, other]: Title: Design of Sensor Fusion Driver Assistance System for Active Pedestrian Safety

Authors: I-Hsi Kao, Ya-Zhu Yian, Jian-An Su, Yi-Horng Lai, Jau-Woei Perng, Tung-Li Hsieh, Yi-Shueh Tsai, Min-Shiu Hsieh

Comments: The 14th International Conference on Automation Technology (Automation 2017), December 8-10, 2017, Kaohsiung, Taiwan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[517] arXiv:2201.09213 [pdf, ps, other]: Title: FN-Net: Remove the Outliers by Filtering the Noise

Authors: Kai Lv

Comments: 6 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[518] arXiv:2201.09246 [pdf, other]: Title: Face recognition via compact second order image gradient orientations

Authors: He-Feng Yin, Xiao-Jun Wu, Xiaoning Song

Comments: 26 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[519] arXiv:2201.09271 [pdf, other]: Title: Wavelet-Attention CNN for Image Classification

Authors: Zhao Xiangyu

Comments: 17 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[520] arXiv:2201.09286 [pdf, other]: Title: How to scale hyperparameters for quickshift image segmentation

Authors: Damien Garreau

Comments: 33 pages, 16 figures. Accepted to AISTATS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[521] arXiv:2201.09296 [pdf, other]: Title: A Survey for Deep RGBT Tracking

Authors: Zhangyong Tang (1), Tianyang Xu (1), Xiao-Jun Wu (1) ((1) Jiangnan University, China)

Comments: 7 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[522] arXiv:2201.09302 [pdf, ps, other]: Title: 1000x Faster Camera and Machine Vision with Ordinary Devices

Authors: Tiejun Huang, Yajing Zheng, Zhaofei Yu, Rui Chen, Yuan Li, Ruiqin Xiong, Lei Ma, Junwei Zhao, Siwei Dong, Lin Zhu, Jianing Li, Shanshan Jia, Yihua Fu, Boxin Shi, Si Wu, Yonghong Tian

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[523] arXiv:2201.09308 [pdf, other]: Title: Basket-based Softmax

Authors: Qiang Meng, Xinqian Gu, Xiaqing Xu, Feng Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[524] arXiv:2201.09318 [pdf, other]: Title: Sparse-view Cone Beam CT Reconstruction using Data-consistent Supervised and Adversarial Learning from Scarce Training Data

Authors: Anish Lahiri, Marc Klasky, Jeffrey A. Fessler, Saiprasad Ravishankar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[525] arXiv:2201.09352 [pdf, other]: Title: Out of Distribution Detection on ImageNet-O

Authors: Anugya Srivastava, Shriya Jain, Mugdha Thigle

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[526] arXiv:2201.09354 [pdf, other]: Title: Survey and Systematization of 3D Object Detection Models and Methods

Authors: Moritz Drobnitzky, Jonas Friederich, Bernhard Egger, Patrick Zschech

Comments: accepted at "The Visual Computer"

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[527] arXiv:2201.09355 [pdf, ps, other]: Title: Transformer-based SAR Image Despeckling

Authors: Malsha V. Perera, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

Comments: Submitted to International Geoscience and Remote Sensing Symposium (IGARSS), 2022. Our code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[528] arXiv:2201.09373 [pdf, other]: Title: Unsupervised Severely Deformed Mesh Reconstruction (DMR) from a Single-View Image

Authors: Jie Mei, Jingxi Yu, Suzanne Romain, Craig Rose, Kelsey Magrane, Graeme LeeSon, Jenq-Neng Hwang

Comments: Under Review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[529] arXiv:2201.09381 [pdf, other]: Title: vCLIMB: A Novel Video Class Incremental Learning Benchmark

Authors: Andrés Villa, Kumail Alhamoud, Juan León Alcázar, Fabian Caba Heilbron, Victor Escorcia, Bernard Ghanem

Comments: An updated version of our CVPR 2022 paper (oral); v2 adds minor text changes. The code of our benchmark can be found at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[530] arXiv:2201.09384 [pdf, ps, other]: Title: A Comprehensive Survey on Federated Learning: Concept and Applications

Authors: Dhurgham Hassan Mahlool, Mohammed Hamzah Abed

Journal-ref: Lecture Notes on Data Engineering and Communications Technologies 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[531] arXiv:2201.09388 [pdf, ps, other]: Title: A Survey on Patients Privacy Protection with Steganography and Visual Encryption

Authors: Hussein K. Alzubaidy, Dhiah Al-Shammary, Mohammed Hamzah Abed

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[532] arXiv:2201.09390 [pdf, other]: Title: AttentionHTR: Handwritten Text Recognition Based on Attention Encoder-Decoder Networks

Authors: Dmitrijs Kass, Ekta Vats

Comments: 15th IAPR International Workshop on Document Analysis Systems (DAS)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[533] arXiv:2201.09395 [pdf, ps, other]: Title: MISeval: a Metric Library for Medical Image Segmentation Evaluation

Authors: Dominik Müller, Dennis Hartmann, Philip Meyer, Florian Auer, Iñaki Soto-Rey, Frank Kramer

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[534] arXiv:2201.09396 [pdf, other]: Title: Dynamic Label Assignment for Object Detection by Combining Predicted IoUs and Anchor IoUs

Authors: Tianxiao Zhang, Bo Luo, Ajay Sharda, Guanghui Wang

Journal-ref: Journal of Imaging 2022, 8(7), 193

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[535] arXiv:2201.09405 [pdf, other]: Title: Improving Chest X-Ray Report Generation by Leveraging Warm Starting

Authors: Aaron Nicolson, Jason Dowling, Bevan Koopman

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[536] arXiv:2201.09407 [pdf, other]: Title: Cross-Domain Document Layout Analysis via Unsupervised Document Style Guide

Authors: Xingjiao Wu, Luwei Xiao, Xiangcheng Du, Yingbin Zheng, Xin Li, Tianlong Ma, Liang He

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[537] arXiv:2201.09421 [pdf, other]: Title: Mutual Attention-based Hybrid Dimensional Network for Multimodal Imaging Computer-aided Diagnosis

Authors: Yin Dai, Yifan Gao, Fayu Liu, Jun Fu

Comments: 11 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[538] arXiv:2201.09450 [pdf, other]: Title: UniFormer: Unifying Convolution and Self-attention for Visual Recognition

Authors: Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao

Comments: 18 pages, 10 figures, 23 tables. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[539] arXiv:2201.09548 [pdf, other]: Title: Consistent 3D Hand Reconstruction in Video via self-supervised Learning

Authors: Zhigang Tu, Zhisheng Huang, Yujin Chen, Di Kang, Linchao Bao, Bisheng Yang, Junsong Yuan

Comments: arXiv admin note: substantial text overlap with arXiv:2103.11703

Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence. 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[540] arXiv:2201.09563 [pdf, ps, other]: Title: Debiasing pipeline improves deep learning model generalization for X-ray based lung nodule detection

Authors: Michael Horry, Subrata Chakraborty, Biswajeet Pradhan, Manoranjan Paul, Jing Zhu, Hui Wen Loh, Prabal Datta Barua, U. Rajendra Acharya

Comments: 32 pages, 17 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[541] arXiv:2201.09574 [pdf, other]: Title: Multi-Scale Iterative Refinement Network for RGB-D Salient Object Detection

Authors: Ze-yu Liu, Jian-wei Liu, Xin Zuo, Ming-fei Hu

Comments: 40 pages

Journal-ref: Engineering Applications of Artificial Intelligence (2021)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[542] arXiv:2201.09575 [pdf, other]: Title: Importance of Textlines in Historical Document Classification

Authors: Martin Kišš, Jan Kohút, Karel Beneš, Michal Hradiš

Comments: 13 pages, 7 figures, 5 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[543] arXiv:2201.09594 [pdf, other]: Title: Describe me if you can! Characterized Instance-level Human Parsing

Authors: Angelique Loesch, Romaric Audigier

Comments: 5 pages

Journal-ref: Published in: 2021 IEEE International Conference on Image Processing (ICIP)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[544] arXiv:2201.09604 [pdf, other]: Title: End-to-end Person Search Sequentially Trained on Aggregated Dataset

Authors: Angelique Loesch, Jaonary Rabarisoa, Romaric Audigier

Comments: 5 pages

Journal-ref: Published in: 2019 IEEE International Conference on Image Processing (ICIP)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[545] arXiv:2201.09613 [pdf, other]: Title: SEN12MS-CR-TS: A Remote Sensing Data Set for Multi-modal Multi-temporal Cloud Removal

Authors: Patrick Ebel, Yajin Xu, Michael Schmitt, Xiaoxiang Zhu

Journal-ref: IEEE Transactions on Geoscience and Remote Sensing, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[546] arXiv:2201.09633 [pdf, other]: Title: Paired Image to Image Translation for Strikethrough Removal From Handwritten Words

Authors: Raphaela Heil, Ekta Vats, Anders Hast

Comments: accepted at DAS2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[547] arXiv:2201.09639 [pdf, other]: Title: Question Generation for Evaluating Cross-Dataset Shifts in Multi-modal Grounding

Authors: Arjun R. Akula

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[548] arXiv:2201.09689 [pdf, other]: Title: Which Style Makes Me Attractive? Interpretable Control Discovery and Counterfactual Explanation on StyleGAN

Authors: Bo Li, Qiulin Wang, Jiquan Pei, Yu Yang, Xiangyang Ji

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[549] arXiv:2201.09700 [pdf, ps, other]: Title: Feature transforms for image data augmentation

Authors: Loris Nanni, Michelangelo Paci, Sheryl Brahnam, Alessandra Lumini

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[550] arXiv:2201.09701 [pdf, other]: Title: Learning Semantics for Visual Place Recognition through Multi-Scale Attention

Authors: Valerio Paolicelli, Antonio Tavera, Carlo Masone, Gabriele Berton, Barbara Caputo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[551] arXiv:2201.09717 [pdf, other]: Title: Keeping Deep Lithography Simulators Updated: Global-Local Shape-Based Novelty Detection and Active Learning

Authors: Hao-Chiang Shao, Hsing-Lei Ping, Kuo-shiuan Chen, Weng-Tai Su, Chia-Wen Lin, Shao-Yun Fang, Pin-Yian Tsai, Yan-Hsiu Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[552] arXiv:2201.09724 [pdf, other]: Title: Hot-Refresh Model Upgrades with Regression-Alleviating Compatible Training in Image Retrieval

Authors: Binjie Zhang, Yixiao Ge, Yantao Shen, Yu Li, Chun Yuan, Xuyuan Xu, Yexin Wang, Ying Shan

Comments: Accepted to ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[553] arXiv:2201.09792 [pdf, other]: Title: Patches Are All You Need?

Authors: Asher Trockman, J. Zico Kolter

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[554] arXiv:2201.09799 [pdf, other]: Title: Neural Architecture Searching for Facial Attributes-based Depression Recognition

Authors: Mingzhe Chen, Xi Xiao, Bin Zhang, Xinyu Liu, Runiu Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[555] arXiv:2201.09822 [pdf, ps, other]: Title: Spectral-PQ: A Novel Spectral Sensitivity-Orientated Perceptual Compression Technique for RGB 4:4:4 Video Data

Authors: Lee Prangnell, Victor Sanchez

Comments: arXiv admin note: text overlap with arXiv:2005.07928

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[556] arXiv:2201.09846 [pdf, other]: Title: A Novel Mix-normalization Method for Generalizable Multi-source Person Re-identification

Authors: Lei Qi, Lei Wang, Yinghuan Shi, Xin Geng

Comments: Accepted by IEEE Transactions on Multimedia (TMM)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[557] arXiv:2201.09865 [pdf, other]: Title: RePaint: Inpainting using Denoising Diffusion Probabilistic Models

Authors: Andreas Lugmayr, Martin Danelljan, Andres Romero, Fisher Yu, Radu Timofte, Luc Van Gool

Comments: We missed out on other diffusion models that work on inpainting. We corrected that and apologize for this mistake

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[558] arXiv:2201.09933 [pdf, other]: Title: Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear Devices

Authors: Yingying Zhao, Yuhu Chang, Yutian Lu, Yujiang Wang, Mingzhi Dong, Qin Lv, Robert P. Dick, Fan Yang, Tun Lu, Ning Gu, Li Shang

Comments: The EMO-Film dataset is available at: this https URL

Journal-ref: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), Volume 6, Issue 1, Article 38. March 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[559] arXiv:2201.09935 [pdf, other]: Title: What is the cost of adding a constraint in linear least squares?

Authors: Ramakrishna Kakarala, Jun Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[560] arXiv:2201.09967 [pdf, other]: Title: Attacks and Defenses for Free-Riders in Multi-Discriminator GAN

Authors: Zilong Zhao, Jiyue Huang, Stefanie Roos, Lydia Y. Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[561] arXiv:2201.09968 [pdf, other]: Title: ImpliCity: City Modeling from Satellite Images with Deep Implicit Occupancy Fields

Authors: Corinne Stucker, Bingxin Ke, Yuanwen Yue, Shengyu Huang, Iro Armeni, Konrad Schindler

Comments: Accepted for publication in the International Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences (camera-ready version including keywords + supplementary material)

Journal-ref: ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci., V-2-2022, 193-201, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[562] arXiv:2201.09973 [pdf, ps, other]: Title: The Vehicle Trajectory Prediction Based on ResNet and EfficientNet Model

Authors: Ruyi Qu, Shukai Huang, Jiexuan Zhou, ChenXi Fan, ZhiYuan Yan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[563] arXiv:2201.10015 [pdf, ps, other]: Title: Automatic Recognition and Digital Documentation of Cultural Heritage Hemispherical Domes using Images

Authors: Reza Maalek, Shahrokh Maalek

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[564] arXiv:2201.10029 [pdf, other]: Title: PONI: Potential Functions for ObjectGoal Navigation with Interaction-free Learning

Authors: Santhosh Kumar Ramakrishnan, Devendra Singh Chaplot, Ziad Al-Halah, Jitendra Malik, Kristen Grauman

Comments: 8 pages + supplementary. Accepted in CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[565] arXiv:2201.10034 [pdf, other]: Title: Self-Supervised Point Cloud Registration with Deep Versatile Descriptors

Authors: Dongrui Liu, Chuanchuan Chen, Changqing Xu, Robert Qiu, Lei Chu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[566] arXiv:2201.10047 [src]: Title: Are Commercial Face Detection Models as Biased as Academic Models?

Authors: Samuel Dooley, George Z. Wei, Tom Goldstein, John P. Dickerson

Comments: This preprint and arXiv:2108.12508 were combined and a more rigorous analysis added to result in the NeurIPS Datasets & Benchmark 2022 paper arXiv:2211.15937

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[567] arXiv:2201.10060 [pdf, other]: Title: ViT-HGR: Vision Transformer-based Hand Gesture Recognition from High Density Surface EMG Signals

Authors: Mansooreh Montazerin, Soheil Zabihi, Elahe Rahimian, Arash Mohammadi, Farnoosh Naderkhani

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[568] arXiv:2201.10075 [pdf, other]: Title: Splatting-based Synthesis for Video Frame Interpolation

Authors: Simon Niklaus, Ping Hu, Jiawen Chen

Comments: WACV 2023, this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[569] arXiv:2201.10079 [pdf, ps, other]: Title: Real-time automatic polyp detection in colonoscopy using feature enhancement module and spatiotemporal similarity correlation unit

Authors: Jianwei Xu, Ran Zhao, Yizhou Yu, Qingwei Zhang, Xianzhang Bian, Jun Wang, Zhizheng Ge, Dahong Qian

Comments: This paper has been accepted by Biomedical Signal Processing and Control. Please cite the paper as Xu, J., Zhao, R., Yu, Y., Zhang, Q., Bian, X., Wang, J., Ge, Z., Qian, D., 2021. Real-time automatic polyp detection in colonoscopy using feature enhancement module and spatiotemporal similarity correlation unit. Biomedical Signal Processing and Control 66, 102503

Journal-ref: Biomedical Signal Processing and Control, vol. 66, p. 102503, Apr. 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[570] arXiv:2201.10084 [pdf, other]: Title: Revisiting L1 Loss in Super-Resolution: A Probabilistic View and Beyond

Authors: Xiangyu He, Jian Cheng

Comments: Technical report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[571] arXiv:2201.10102 [pdf, ps, other]: Title: A Classical Approach to Handcrafted Feature Extraction Techniques for Bangla Handwritten Digit Recognition

Authors: Md. Ferdous Wahid, Md. Fahim Shahriar, Md. Shohanur Islam Sobuj

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[572] arXiv:2201.10107 [pdf, other]: Title: ARPD: Anchor-free Rotation-aware People Detection using Topview Fisheye Camera

Authors: Quan Nguyen Minh, Bang Le Van, Can Nguyen, Anh Le, Viet Dung Nguyen

Comments: 2021 17th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[573] arXiv:2201.10110 [pdf, other]: Title: A Hybrid Quantum-Classical Algorithm for Robust Fitting

Authors: Anh-Dzung Doan, Michele Sasdelli, David Suter, Tat-Jun Chin

Comments: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[574] arXiv:2201.10138 [pdf, other]: Title: SURDS: Self-Supervised Attention-guided Reconstruction and Dual Triplet Loss for Writer Independent Offline Signature Verification

Authors: Soumitri Chattopadhyay, Siladittya Manna, Saumik Bhattacharya, Umapada Pal

Comments: Accepted at ICPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[575] arXiv:2201.10145 [pdf, other]: Title: Riemannian Local Mechanism for SPD Neural Networks

Authors: Ziheng Chen, Tianyang Xu, Xiao-Jun Wu, Rui Wang, Zhiwu Huang, Josef Kittler

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[576] arXiv:2201.10147 [pdf, other]: Title: TGFuse: An Infrared and Visible Image Fusion Approach Based on Transformer and Generative Adversarial Network

Authors: Dongyu Rao, Xiao-Jun Wu, Tianyang Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[577] arXiv:2201.10152 [pdf, other]: Title: Unsupervised Image Fusion Method based on Feature Mutual Mapping

Authors: Dongyu Rao, Xiao-Jun Wu, Tianyang Xu, Guoyang Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[578] arXiv:2201.10162 [pdf, other]: Title: Semantically Video Coding: Instill Static-Dynamic Clues into Structured Bitstream for AI Tasks

Authors: Xin Jin, Ruoyu Feng, Simeng Sun, Runsen Feng, Tianyu He, Zhibo Chen

Comments: 21 pages, 12 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[579] arXiv:2201.10168 [pdf, other]: Title: Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding in Videos

Authors: Sangmin Woo, Jinyoung Park, Inyong Koo, Sumin Lee, Minki Jeong, Changick Kim

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[580] arXiv:2201.10175 [pdf, other]: Title: RFMask: A Simple Baseline for Human Silhouette Segmentation with Radio Signals

Authors: Zhi Wu, Dongheng Zhang, Chunyang Xie, Cong Yu, Jinbo Chen, Yang Hu, Yan Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[581] arXiv:2201.10182 [pdf, other]: Title: Pre-Trained Language Transformers are Universal Image Classifiers

Authors: Rahul Goel, Modar Sulaiman, Kimia Noorbakhsh, Mahdi Sharifi, Rajesh Sharma, Pooyan Jamshidi, Kallol Roy

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[582] arXiv:2201.10184 [pdf, other]: Title: Estimating the Direction and Radius of Pipe from GPR Image by Ellipse Inversion Model

Authors: Xiren Zhou, Qiuju Chen, Shengfei Lyu, Huanhuan Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[583] arXiv:2201.10185 [pdf, other]: Title: Zero-Shot Sketch Based Image Retrieval using Graph Transformer

Authors: Sumrit Gupta, Ushasi Chaudhuri, Biplab Banerjee

Comments: Accepted at ICPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[584] arXiv:2201.10210 [pdf, ps, other]: Title: Universal Generative Modeling for Calibration-free Parallel MR Imaging

Authors: Wanqing Zhu, Bing Guan, Shanshan Wang, Minghui Zhang, Qiegen Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[585] arXiv:2201.10212 [pdf, other]: Title: Feature Diversity Learning with Sample Dropout for Unsupervised Domain Adaptive Person Re-identification

Authors: Chunren Tang, Dingyu Xue, Dongyue Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[586] arXiv:2201.10243 [pdf, other]: Title: BERTHA: Video Captioning Evaluation Via Transfer-Learned Human Assessment

Authors: Luis Lebron, Yvette Graham, Kevin McGuinness, Konstantinos Kouramas, Noel E. O'Connor

Comments: In press in Language Resources and Evaluation Conference (LREC) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[587] arXiv:2201.10252 [pdf, other]: Title: DocEnTr: An End-to-End Document Image Enhancement Transformer

Authors: Mohamed Ali Souibgui, Sanket Biswas, Sana Khamekhem Jemni, Yousri Kessentini, Alicia Fornés, Josep Lladós, Umapada Pal

Comments: submitted to ICPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[588] arXiv:2201.10271 [pdf, other]: Title: Convolutional Xformers for Vision

Authors: Pranav Jeevan, Amit Sethi

Comments: 9 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[589] arXiv:2201.10276 [pdf, other]: Title: City3D: Large-Scale Building Reconstruction from Airborne LiDAR Point Clouds

Authors: Jin Huang, Jantien Stoter, Ravi Peters, Liangliang Nan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[590] arXiv:2201.10326 [pdf, other]: Title: ShapeFormer: Transformer-based Shape Completion via Sparse Representation

Authors: Xingguang Yan, Liqiang Lin, Niloy J. Mitra, Dani Lischinski, Daniel Cohen-Or, Hui Huang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[591] arXiv:2201.10366 [pdf, other]: Title: ADAPT: An Open-Source sUAS Payload for Real-Time Disaster Prediction and Response with AI

Authors: Daniel Davila, Joseph VanPelt, Alexander Lynch, Adam Romlein, Peter Webley, Matthew S. Brown

Comments: To be published in Workshop on Practical Deep Learning in the Wild at AAAI Conference on Artificial Intelligence 2022, 9 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[592] arXiv:2201.10369 [pdf, ps, other]: Title: Winograd Convolution for Deep Neural Networks: Efficient Point Selection

Authors: Syed Asad Alam, Andrew Anderson, Barbara Barabasz, David Gregg

Comments: 19 pages, 3 figures, 9 tables and 32 equations

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[593] arXiv:2201.10389 [pdf, other]: Title: BLDNet: A Semi-supervised Change Detection Building Damage Framework using Graph Convolutional Networks and Urban Domain Knowledge

Authors: Ali Ismail, Mariette Awad

Comments: 16 pages, 15 figures, submitted to IEEE Transactions on Geoscience and Remote Sensing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[594] arXiv:2201.10394 [pdf, other]: Title: Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action Recognition

Authors: Kiyoon Kim, Shreyank N Gowda, Oisin Mac Aodha, Laura Sevilla-Lara

Comments: BMVC 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[595] arXiv:2201.10395 [pdf, other]: Title: Towards Cross-Disaster Building Damage Assessment with Graph Convolutional Networks

Authors: Ali Ismail, Mariette Awad

Comments: 5 pages, 3 figures, submitted to IEEE IGARSS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[596] arXiv:2201.10410 [pdf, other]: Title: Comparison of Evaluation Metrics for Landmark Detection in CMR Images

Authors: Sven Koehler, Lalith Sharan, Julian Kuhm, Arman Ghanaat, Jelizaveta Gordejeva, Nike K. Simon, Niko M. Grell, Florian André, Sandy Engelhardt

Comments: Accepted at Bildverarbeitung für die Medizin (BVM), Informatik aktuell. Springer Vieweg, Wiesbaden 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[597] arXiv:2201.10423 [pdf, other]: Title: Rayleigh EigenDirections (REDs): GAN latent space traversals for multidimensional features

Authors: Guha Balakrishnan, Raghudeep Gadde, Aleix Martinez, Pietro Perona

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[598] arXiv:2201.10431 [pdf, other]: Title: Main Product Detection with Graph Networks for Fashion

Authors: Vacit Oguz Yazici, Longlong Yu, Arnau Ramisa, Luis Herranz, Joost van de Weijer

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[599] arXiv:2201.10439 [pdf, other]: Title: Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video

Authors: Dmitriy Serdyuk, Otavio Braga, Olivier Siohan

Comments: 5 pages, 3 figures, published at Interspeech 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[600] arXiv:2201.10448 [pdf, other]: Title: How Low Can We Go? Pixel Annotation for Semantic Segmentation

Authors: Daniel Kigli, Ariel Shamir, Shai Avidan

Comments: Paper and Supplementary

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[601] arXiv:2201.10489 [pdf, other]: Title: Sphere2Vec: Multi-Scale Representation Learning over a Spherical Surface for Geospatial Predictions

Authors: Gengchen Mai, Yao Xuan, Wenyun Zuo, Krzysztof Janowicz, Ni Lao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[602] arXiv:2201.10520 [pdf, ps, other]: Title: Adaptive Activation-based Structured Pruning

Authors: Kaiqi Zhao, Animesh Jain, Ming Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[603] arXiv:2201.10521 [pdf, other]: Title: A Review of Deep Learning Based Image Super-resolution Techniques

Authors: Fangyuan Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[604] arXiv:2201.10522 [pdf, other]: Title: Blind Image Deblurring: a Review

Authors: Zhengrong Xue

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[605] arXiv:2201.10523 [pdf, ps, other]: Title: Interpretability in Convolutional Neural Networks for Building Damage Classification in Satellite Imagery

Authors: Thomas Y. Chen

Comments: 8 pages; presented as Spotlight Talk at NeurIPS - Tackling Climate Change with Machine Learning workshop 2020

Journal-ref: NeurIPS 2020 Workshop on Tackling Climate Change with Machine Learning

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Geophysics (physics.geo-ph)
[606] arXiv:2201.10526 [pdf, other]: Title: MonarchNet: Differentiating Monarch Butterflies from Butterflies Species with Similar Phenotypes

Authors: Thomas Y. Chen

Comments: 5 pages, 2 figures, Proceedings of NeurIPS 2020 - Learning Meaningful Representations of Life (LMRL) Workshop. The FASEB Journal

Journal-ref: CVPR 2021 Workshop on CV4Animals (Computer Vision for Animal Behavior Tracking and Modeling)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Populations and Evolution (q-bio.PE); Applications (stat.AP)
[607] arXiv:2201.10602 [pdf, other]: Title: Jacobian Computation for Cumulative B-Splines on SE(3) and Application to Continuous-Time Object Tracking

Authors: Javier Tirado, Javier Civera

Comments: Accepted at IEEE Robotics and Automation Letters

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[608] arXiv:2201.10647 [pdf, other]: Title: Unsupervised Domain Adaptation for Vestibular Schwannoma and Cochlea Segmentation via Semi-supervised Learning and Label Fusion

Authors: Han Liu, Yubo Fan, Can Cui, Dingjie Su, Andrew McNeil, Benoit M. Dawant

Comments: Accepted by MICCAI 2021 BrainLes Workshop. arXiv admin note: substantial text overlap with arXiv:2109.06274

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[609] arXiv:2201.10649 [pdf, other]: Title: Attentive Task Interaction Network for Multi-Task Learning

Authors: Dimitrios Sinodinos, Narges Armanfard

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[610] arXiv:2201.10650 [pdf, other]: Title: Beyond Visual Image: Automated Diagnosis of Pigmented Skin Lesions Combining Clinical Image Features with Patient Data

Authors: José G. M. Esgario, Renato A. Krohling

Comments: 33 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[611] arXiv:2201.10654 [pdf, ps, other]: Title: SA-VQA: Structured Alignment of Visual and Semantic Representations for Visual Question Answering

Authors: Peixi Xiong, Quanzeng You, Pei Yu, Zicheng Liu, Ying Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[612] arXiv:2201.10656 [pdf, ps, other]: Title: MGA-VQA: Multi-Granularity Alignment for Visual Question Answering

Authors: Peixi Xiong, Yilin Shen, Hongxia Jin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[613] arXiv:2201.10664 [pdf, other]: Title: Do Neural Networks for Segmentation Understand Insideness?

Authors: Kimberly Villalobos, Vilim Štih, Amineh Ahmadinejad, Shobhita Sundaram, Jamell Dozier, Andrew Francl, Frederico Azevedo, Tomotake Sasaki, Xavier Boix

Journal-ref: Neural Computation 33 (2021) 2511-2549

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[614] arXiv:2201.10665 [pdf, other]: Title: Writer Recognition Using Off-line Handwritten Single Block Characters

Authors: Adrian Leo Hagström, Rustam Stanikzai, Josef Bigun, Fernando Alonso-Fernandez

Comments: Accepted for publication at IEEE International Workshop on Biometrics and Forensics IWBF 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[615] arXiv:2201.10675 [pdf, ps, other]: Title: Virtual Adversarial Training for Semi-supervised Breast Mass Classification

Authors: Xuxin Chen, Ximin Wang, Ke Zhang, Kar-Ming Fung, Theresa C. Thai, Kathleen Moore, Robert S. Mannel, Hong Liu, Bin Zheng, Yuchen Qiu

Comments: To appear in the conference Biophotonics and Immune Responses of SPIE

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[616] arXiv:2201.10695 [pdf, other]: Title: Estimation of Spectral Biophysical Skin Properties from Captured RGB Albedo

Authors: Carlos Aliaga, Christophe Hery, Mengqi Xia

Comments: 11 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[617] arXiv:2201.10700 [pdf, other]: Title: Deep Image Deblurring: A Survey

Authors: Kaihao Zhang, Wenqi Ren, Wenhan Luo, Wei-Sheng Lai, Bjorn Stenger, Ming-Hsuan Yang, Hongdong Li

Comments: To appear in International Journal of Computer Vision (IJCV)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[618] arXiv:2201.10703 [pdf, other]: Title: Anomaly Detection via Reverse Distillation from One-Class Embedding

Authors: Hanqiu Deng, Xingyu Li

Comments: 10 pages, 7 figures

Journal-ref: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[619] arXiv:2201.10712 [pdf, other]: Title: Toward Data-Driven STAP Radar

Authors: Shyam Venkatasubramanian, Chayut Wongkamthong, Mohammadreza Soltani, Bosung Kang, Sandeep Gogineni, Ali Pezeshki, Muralidhar Rangaswamy, Vahid Tarokh

Comments: 5 pages, 4 figures. Submitted to 2022 IEEE Radar Conference (RadarConf)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[620] arXiv:2201.10725 [pdf, other]: Title: Image Generation with Self Pixel-wise Normalization

Authors: Yoon-Jae Yeo, Min-Cheol Sagong, Seung Park, Sung-Jea Ko, Yong-Goo Shin

Comments: 13 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[621] arXiv:2201.10728 [pdf, other]: Title: Training Vision Transformers with Only 2040 Images

Authors: Yun-Hao Cao, Hao Yu, Jianxin Wu

Comments: 11 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[622] arXiv:2201.10734 [pdf, other]: Title: CrossRectify: Leveraging Disagreement for Semi-supervised Object Detection

Authors: Chengcheng Ma, Xingjia Pan, Qixiang Ye, Fan Tang, Weiming Dong, Changsheng Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[623] arXiv:2201.10736 [pdf, other]: Title: A Joint Convolution Auto-encoder Network for Infrared and Visible Image Fusion

Authors: Zhancheng Zhang, Yuanhao Gao, Mengyu Xiong, Xiaoqing Luo, Xiao-Jun Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[624] arXiv:2201.10737 [pdf, other]: Title: Class-Aware Adversarial Transformers for Medical Image Segmentation

Authors: Chenyu You, Ruihan Zhao, Fenglin Liu, Siyuan Dong, Sandeep Chinchali, Ufuk Topcu, Lawrence Staib, James S. Duncan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[625] arXiv:2201.10739 [pdf, other]: Title: Infrared and visible image fusion based on Multi-State Contextual Hidden Markov Model

Authors: Xiaoqing Luo, Yuting Jiang, Anqi Wang, Zhancheng Zhang, Xiao-Jun Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[626] arXiv:2201.10753 [pdf, other]: Title: Interactive Image Inpainting Using Semantic Guidance

Authors: Wangbo Yu, Jinhao Du, Ruixin Liu, Yixuan Li, Yuesheng Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[627] arXiv:2201.10766 [pdf, other]: Title: A Comprehensive Study of Image Classification Model Sensitivity to Foregrounds, Backgrounds, and Visual Attributes

Authors: Mazda Moayeri, Phillip Pope, Yogesh Balaji, Soheil Feizi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[628] arXiv:2201.10781 [pdf, other]: Title: ASFD: Automatic and Scalable Face Detector

Authors: Jian Li, Bin Zhang, Yabiao Wang, Ying Tai, ZhenYu Zhang, Chengjie Wang, Jilin Li, Xiaoming Huang, Yili Xia

Comments: ACM MM2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[629] arXiv:2201.10788 [pdf, other]: Title: Self-supervised 3D Semantic Representation Learning for Vision-and-Language Navigation

Authors: Sinan Tan, Mengmeng Ge, Di Guo, Huaping Liu, Fuchun Sun

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[630] arXiv:2201.10801 [pdf, other]: Title: When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism

Authors: Guangting Wang, Yucheng Zhao, Chuanxin Tang, Chong Luo, Wenjun Zeng

Comments: accepted by AAAI-22

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[631] arXiv:2201.10830 [pdf, other]: Title: MonoDistill: Learning Spatial Features for Monocular 3D Object Detection

Authors: Zhiyu Chong, Xinzhu Ma, Hong Zhang, Yuxin Yue, Haojie Li, Zhihui Wang, Wanli Ouyang

Comments: Accepted by ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[632] arXiv:2201.10836 [pdf, other]: Title: PARS: Pseudo-Label Aware Robust Sample Selection for Learning with Noisy Labels

Authors: Arushi Goel, Yunlong Jiao, Jordan Massiah

Comments: 16 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[633] arXiv:2201.10848 [pdf, other]: Title: Comparison of Depth Estimation Setups from Stereo Endoscopy and Optical Tracking for Point Measurements

Authors: Lukas Burger, Lalith Sharan, Samantha Fischer, Julian Brand, Maximillian Hehl, Gabriele Romano, Matthias Karck, Raffaele De Simone, Ivo Wolf, Sandy Engelhardt

Comments: Accepted at Bildverarbeitung fuer die Medizin (BVM), Informatik aktuell. Springer Vieweg, Wiesbaden 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[634] arXiv:2201.10865 [pdf, ps, other]: Title: On the Issues of TrueDepth Sensor Data for Computer Vision Tasks Across Different iPad Generations

Authors: Steffen Urban, Thomas Lindemeier, David Dobbelstein, Matthias Haenel

Comments: 17 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[635] arXiv:2201.10873 [pdf, other]: Title: TransPPG: Two-stream Transformer for Remote Heart Rate Estimate

Authors: Jiaqi Kang, Su Yang, Weishan Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[636] arXiv:2201.10937 [pdf, other]: Title: Boosting 3D Adversarial Attacks with Attacking On Frequency

Authors: Binbin Liu, Jinlai Zhang, Lyujie Chen, Jihong Zhu

Comments: 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[637] arXiv:2201.10938 [pdf, other]: Title: Projective Urban Texturing

Authors: Yiangos Georgiou, Melinos Averkiou, Tom Kelly, Evangelos Kalogerakis

Journal-ref: International Conference on 3D Vision 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[638] arXiv:2201.10943 [pdf, other]: Title: Event-based Video Reconstruction via Potential-assisted Spiking Neural Network

Authors: Lin Zhu, Xiao Wang, Yi Chang, Jianing Li, Tiejun Huang, Yonghong Tian

Comments: Accepted by CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[639] arXiv:2201.10953 [pdf, other]: Title: Dual-Tasks Siamese Transformer Framework for Building Damage Assessment

Authors: Hongruixuan Chen, Edoardo Nemni, Sofia Vallecorsa, Xi Li, Chen Wu, Lars Bromley

Comments: IGARSS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[640] arXiv:2201.10963 [pdf, other]: Title: Learning to Compose Diversified Prompts for Image Emotion Classification

Authors: Sinuo Deng, Lifang Wu, Ge Shi, Lehao Xing, Meng Jian, Ye Xiang

Comments: 10 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[641] arXiv:2201.10972 [pdf, other]: Title: How Robust are Discriminatively Trained Zero-Shot Learning Models?

Authors: Mehmet Kerim Yucel, Ramazan Gokberk Cinbis, Pinar Duygulu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[642] arXiv:2201.10985 [pdf, other]: Title: Jalisco's multiclass land cover analysis and classification using a novel lightweight convnet with real-world multispectral and relief data

Authors: Alexander Quevedo, Abraham Sánchez, Raul Nancláres, Diana P. Montoya, Juan Pacho, Jorge Martínez, E. Ulises Moya-Sánchez

Comments: 12 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[643] arXiv:2201.10990 [pdf, other]: Title: Learning To Recognize Procedural Activities with Distant Supervision

Authors: Xudong Lin, Fabio Petroni, Gedas Bertasius, Marcus Rohrbach, Shih-Fu Chang, Lorenzo Torresani

Comments: CVPR 2022. Code will be released here this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[644] arXiv:2201.11006 [pdf, other]: Title: An Overview of Compressible and Learnable Image Transformation with Secret Key and Its Applications

Authors: Hitoshi Kiya, AprilPyone MaungMaung, Yuma Kinoshita, Shoko Imaizumi, Sayaka Shiota

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[645] arXiv:2201.11014 [pdf, other]: Title: Language-biased image classification: evaluation based on semantic representations

Authors: Yoann Lemesle, Masataka Sawayama, Guillermo Valle-Perez, Maxime Adolphe, Hélène Sauzéon, Pierre-Yves Oudeyer

Comments: Accepted at ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[646] arXiv:2201.11091 [pdf, ps, other]: Title: Momentum Capsule Networks

Authors: Josef Gugglberger, David Peer, Antonio Rodríguez-Sánchez

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[647] arXiv:2201.11092 [pdf, ps, other]: Title: Self-Attention Neural Bag-of-Features

Authors: Kateryna Chumachenko, Alexandros Iosifidis, Moncef Gabbouj

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[648] arXiv:2201.11095 [pdf, other]: Title: Self-attention fusion for audiovisual emotion recognition with incomplete data

Authors: Kateryna Chumachenko, Alexandros Iosifidis, Moncef Gabbouj

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[649] arXiv:2201.11097 [pdf, other]: Title: Adaptive Instance Distillation for Object Detection in Autonomous Driving

Authors: Qizhen Lan, Qing Tian

Comments: 6 pages, 3 figures

Journal-ref: 2022 26th International Conference on Pattern Recognition (ICPR), Montreal, QC, Canada, 2022, pp. 4559-4565

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[650] arXiv:2201.11103 [pdf, other]: Title: Auto-Compressing Subset Pruning for Semantic Image Segmentation

Authors: Konstantin Ditschuneit, Johannes S. Otterbach

Comments: 10 pages, 5 figures, 1 table, appendix

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[651] arXiv:2201.11114 [pdf, other]: Title: Natural Language Descriptions of Deep Visual Features

Authors: Evan Hernandez, Sarah Schwettmann, David Bau, Teona Bagashvili, Antonio Torralba, Jacob Andreas

Comments: To be published as a conference paper at ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[652] arXiv:2201.11187 [pdf, other]: Title: DIREG3D: DIrectly REGress 3D Hands from Multiple Cameras

Authors: Ashar Ali, Upal Mahbub, Gokce Dane, Gerhard Reitmayr

Journal-ref: ICCV 2021 Fifth Workshop on Computer Vision for AR/VR

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Robotics (cs.RO); Image and Video Processing (eess.IV)
[653] arXiv:2201.11192 [pdf, other]: Title: ReforesTree: A Dataset for Estimating Tropical Forest Carbon Stock with Deep Learning and Aerial Imagery

Authors: Gyri Reiersen, David Dao, Björn Lütjens, Konstantin Klemmer, Kenza Amara, Attila Steinegger, Ce Zhang, Xiaoxiang Zhu

Comments: Accepted paper for the AI for Social Impact Track at the AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[654] arXiv:2201.11197 [pdf, ps, other]: Title: Challenges and Opportunities for Machine Learning Classification of Behavior and Mental State from Images

Authors: Peter Washington, Cezmi Onur Mutlu, Aaron Kline, Kelley Paskov, Nate Tyler Stockham, Brianna Chrisman, Nick Deveau, Mourya Surhabi, Nick Haber, Dennis P. Wall

Comments: 30 pages, 1 figure, 1 table

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[655] arXiv:2201.11228 [pdf, other]: Title: Continuous Examination by Automatic Quiz Assessment Using Spiral Codes and Image Processing

Authors: Fernando Alonso-Fernandez, Josef Bigun

Comments: Accepted at 13th IEEE Global Engineering Education Conference, EDUCON, Tunis, Tunisia, 28-31 March 2022 (Educational Conference)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[656] arXiv:2201.11279 [pdf, other]: Title: Revisiting RCAN: Improved Training for Image Super-Resolution

Authors: Zudi Lin, Prateek Garg, Atmadeep Banerjee, Salma Abdel Magid, Deqing Sun, Yulun Zhang, Luc Van Gool, Donglai Wei, Hanspeter Pfister

Comments: 13 pages with 10 tables and 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[657] arXiv:2201.11284 [pdf, other]: Title: Interactive 3D Character Modeling from 2D Orthogonal Drawings with Annotations

Authors: Zhengyu Huang, Haoran Xie, Tsukasa Fukusato

Comments: 6 pages, 4 figures, accepted in Proceedings of International Workshop on Advanced Image Technology 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[658] arXiv:2201.11296 [pdf, ps, other]: Title: Efficient divide-and-conquer registration of UAV and ground LiDAR point clouds through canopy shape context

Authors: Jie Shao, Wei Yao, Peng Wan, Lei Luo, Jiaxin Lyu, Wuming Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[659] arXiv:2201.11307 [pdf, other]: Title: Dissecting the impact of different loss functions with gradient surgery

Authors: Hong Xuan, Robert Pless

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[660] arXiv:2201.11316 [pdf, other]: Title: Transformer Module Networks for Systematic Generalization in Visual Question Answering

Authors: Moyuru Yamada, Vanessa D'Amario, Kentaro Takemoto, Xavier Boix, Tomotake Sasaki

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[661] arXiv:2201.11319 [pdf, other]: Title: Dynamic Rectification Knowledge Distillation

Authors: Fahad Rahman Amik, Ahnaf Ismat Tasin, Silvia Ahmed, M. M. Lutfe Elahi, Nabeel Mohammed

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[662] arXiv:2201.11345 [pdf, other]: Title: Exploring Global Diversity and Local Context for Video Summarization

Authors: Yingchao Pan, Ouhan Huang, Qinghao Ye, Zhongjin Li, Wenjiang Wang, Guodun Li, Yuxing Chen

Comments: Accepted by IEEE Access

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[663] arXiv:2201.11351 [pdf, other]: Title: Effective Shortcut Technique for GAN

Authors: Seung Park, Cheol-Hwan Yoo, Yong-Goo Shin

Comments: arXiv admin note: text overlap with arXiv:2112.14968

Journal-ref: Applied Intelligence 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[664] arXiv:2201.11379 [pdf, other]: Title: Deep Confidence Guided Distance for 3D Partial Shape Registration

Authors: Dvir Ginzburg, Dan Raviv

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[665] arXiv:2201.11388 [pdf, other]: Title: Contrastive Embedding Distribution Refinement and Entropy-Aware Attention for 3D Point Cloud Classification

Authors: Feng Yang, Yichao Cao, Qifan Xue, Shuai Jin, Xuanpeng Li, Weigong Zhang

Comments: 15 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[666] arXiv:2201.11403 [pdf, other]: Title: Generalised Image Outpainting with U-Transformer

Authors: Penglei Gao, Xi Yang, Rui Zhang, John Y. Goulermas, Yujie Geng, Yuyao Yan, Kaizhu Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[667] arXiv:2201.11407 [pdf, other]: Title: Non-linear Motion Estimation for Video Frame Interpolation using Space-time Convolutions

Authors: Saikat Dutta, Arulkumar Subramaniam, Anurag Mittal

Comments: Accepted at CLIC workshop, CVPR 2022. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[668] arXiv:2201.11438 [pdf, other]: Title: DocSegTr: An Instance-Level End-to-End Document Image Segmentation Transformer

Authors: Sanket Biswas, Ayan Banerjee, Josep Lladós, Umapada Pal

Comments: Preprint

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[669] arXiv:2201.11440 [pdf, ps, other]: Title: An Analysis on Ensemble Learning optimized Medical Image Classification with Deep Convolutional Neural Networks

Authors: Dominik Müller, Iñaki Soto-Rey, Frank Kramer

Comments: Code: this https URL ; Supplementary Material: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[670] arXiv:2201.11450 [pdf, other]: Title: In Defense of Kalman Filtering for Polyp Tracking from Colonoscopy Videos

Authors: David Butler, Yuan Zhang, Tim Chen, Seon Ho Shin, Rajvinder Singh, Gustavo Carneiro

Comments: Paper accepted to the International Symposium on Biomedical Imaging (ISBI) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[671] arXiv:2201.11460 [pdf, other]: Title: RelTR: Relation Transformer for Scene Graph Generation

Authors: Yuren Cong, Michael Ying Yang, Bodo Rosenhahn

Comments: accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[672] arXiv:2201.11479 [pdf, other]: Title: Eye-focused Detection of Bell's Palsy in Videos

Authors: Sharik Ali Ansari, Koteswar Rao Jerripothula, Pragya Nagpal, Ankush Mittal

Comments: Published in the Proceedings of the 34th Canadian Conference on Artificial Intelligence. Please cite this paper in the following manner: S. A. Ansari, K. R. Jerripothula, P. Nagpal, and A. Mittal. "Eye-focused Detection of Bell's Palsy in Videos". In: Proceedings of the 34th Canadian Conference on Artificial Intelligence (June 8, 2021). doi: 10.21428/594757db.d2f8342b

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[673] arXiv:2201.11500 [pdf, other]: Title: Head and eye egocentric gesture recognition for human-robot interaction using eyewear cameras

Authors: Javier Marina-Miranda, V. Javier Traver

Comments: Copyright 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

Journal-ref: IEEE Robotics and Automation Letters, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Robotics (cs.RO)
[674] arXiv:2201.11506 [pdf, other]: Title: Anomaly Detection in Retinal Images using Multi-Scale Deep Feature Sparse Coding

Authors: Sourya Dipta Das, Saikat Dutta, Nisarg A. Shah, Dwarikanath Mahapatra, Zongyuan Ge

Comments: Accepted to ISBI 2022. © IEEE

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[675] arXiv:2201.11523 [pdf, other]: Title: ResiDualGAN: Resize-Residual DualGAN for Cross-Domain Remote Sensing Images Semantic Segmentation

Authors: Yang Zhao, Peng Guo, Zihao Sun, Xiuwan Chen, Han Gao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[676] arXiv:2201.11528 [pdf, other]: Title: Beyond ImageNet Attack: Towards Crafting Adversarial Examples for Black-box Domains

Authors: Qilong Zhang, Xiaodan Li, Yuefeng Chen, Jingkuan Song, Lianli Gao, Yuan He, Hui Xue

Comments: Accepted by ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[677] arXiv:2201.11547 [pdf, other]: Title: ASOC: Adaptive Self-aware Object Co-localization

Authors: Koteswar Rao Jerripothula, Prerana Mukherjee

Comments: Published in IEEE ICME 2021. Please cite this paper in the following manner: K. R. Jerripothula and P. Mukherjee, "ASOC: Adaptive Self-Aware Object Co-Localization," 2021 IEEE International Conference on Multimedia and Expo (ICME), 2021, pp. 1-6, doi: 10.1109/ICME51207.2021.9428191

Journal-ref: 2021 IEEE International Conference on Multimedia and Expo (ICME), 2021, pp. 1-6

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[678] arXiv:2201.11608 [pdf, ps, other]: Title: A Probabilistic Framework for Dynamic Object Recognition in 3D Environment With A Novel Continuous Ground Estimation Method

Authors: Pouria Mehrabi

Comments: Master's Thesis Submitted in Partial Fulfillment of The Requirements For The Degree of Master of Science in Electrical Engineering

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[679] arXiv:2201.11620 [pdf, ps, other]: Title: Domain generalization in deep learning-based mass detection in mammography: A large-scale multi-center study

Authors: Lidia Garrucho, Kaisar Kushibar, Socayna Jouide, Oliver Diaz, Laura Igual, Karim Lekadir

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[680] arXiv:2201.11632 [pdf, other]: Title: Deep Video Prior for Video Consistency and Propagation

Authors: Chenyang Lei, Yazhou Xing, Hao Ouyang, Qifeng Chen

Comments: Accepted by TPAMI in Dec 2021; extension of NeurIPS2020 Blind Video Temporal Consistency via Deep Video Prior. arXiv admin note: substantial text overlap with arXiv:2010.11838

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[681] arXiv:2201.11664 [pdf, other]: Title: Team Yao at Factify 2022: Utilizing Pre-trained Models and Co-attention Networks for Multi-Modal Fact Verification

Authors: Wei-Yao Wang, Wen-Chih Peng

Comments: Accepted by AAAI 2022 De-Factify Workshop: First Workshop on Multimodal Fact-Checking and Hate Speech Detection

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[682] arXiv:2201.11674 [pdf, other]: Title: Vision Checklist: Towards Testable Error Analysis of Image Models to Help System Designers Interrogate Model Capabilities

Authors: Xin Du, Benedicte Legastelois, Bhargavi Ganesh, Ajitha Rajan, Hana Chockler, Vaishak Belle, Stuart Anderson, Subramanian Ramamoorthy

Comments: 17 pages, 18 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[683] arXiv:2201.11697 [pdf, other]: Title: Constrained Structure Learning for Scene Graph Generation

Authors: Daqi Liu, Miroslaw Bober, Josef Kittler

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[684] arXiv:2201.11736 [pdf, other]: Title: Ranking Info Noise Contrastive Estimation: Boosting Contrastive Learning via Ranked Positives

Authors: David T. Hoffmann, Nadine Behrmann, Juergen Gall, Thomas Brox, Mehdi Noroozi

Comments: AAAI 2022 (Main Track)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[685] arXiv:2201.11760 [pdf, other]: Title: Unsupervised Denoising of Retinal OCT with Diffusion Probabilistic Model

Authors: Dewei Hu, Yuankai K. Tao, Ipek Oguz

Comments: SPIE medical imaging, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[686] arXiv:2201.11782 [pdf, other]: Title: An Empirical Analysis of Recurrent Learning Algorithms In Neural Lossy Image Compression Systems

Authors: Ankur Mali, Alexander Ororbia, Daniel Kifer, Lee Giles

Comments: Accepted at DCC 2021, 15 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[687] arXiv:2201.11794 [pdf, other]: Title: A Survey on Visual Transfer Learning using Knowledge Graphs

Authors: Sebastian Monka, Lavdim Halilaj, Achim Rettinger

Comments: Semantic Web Journal (SWJ)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[688] arXiv:2201.11808 [pdf, other]: Title: LAP: An Attention-Based Module for Concept Based Self-Interpretation and Knowledge Injection in Convolutional Neural Networks

Authors: Rassa Ghavami Modegh, Ahmad Salimi, Alireza Dizaji, Hamid R. Rabiee

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[689] arXiv:2201.11828 [pdf, other]: Title: Pressure Eye: In-bed Contact Pressure Estimation via Contact-less Imaging

Authors: Shuangjun Liu, Sarah Ostadabbas

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[690] arXiv:2201.11843 [pdf, other]: Title: Discriminative Supervised Subspace Learning for Cross-modal Retrieval

Authors: Haoming Zhang, Xiao-Jun Wu, Tianyang Xu, Donglin Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[691] arXiv:2201.11852 [pdf, other]: Title: Towards an Automatic Diagnosis of Peripheral and Central Palsy Using Machine Learning on Facial Features

Authors: C.V. Vletter, H.L. Burger, H. Alers, N. Sourlos, Z. Al-Ars

Comments: 9 pages, 10 tables, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[692] arXiv:2201.11871 [pdf, other]: Title: Infrastructure-Based Object Detection and Tracking for Cooperative Driving Automation: A Survey

Authors: Zhengwei Bai, Guoyuan Wu, Xuewei Qi, Yongkang Liu, Kentaro Oguchi, Matthew J. Barth

Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[693] arXiv:2201.11898 [pdf, other]: Title: Indicative Image Retrieval: Turning Blackbox Learning into Grey

Authors: Xulu Zhang (1), Zhenqun Yang (2), Hao Tian (1), Qing Li (3), Xiaoyong Wei (1 and 3) ((1) Sichuan University, (2) Chinese University of Hong Kong, (3) Hong Kong Polytechnic University)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[694] arXiv:2201.11937 [pdf, other]: Title: Stereo Matching with Cost Volume based Sparse Disparity Propagation

Authors: Wei Xue, Xiaojiang Peng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[695] arXiv:2201.11963 [pdf, other]: Title: Shuffle Augmentation of Features from Unlabeled Data for Unsupervised Domain Adaptation

Authors: Changwei Xu, Jianfei Yang, Haoran Tang, Han Zou, Cheng Lu, Tianshuo Zhang

Comments: 17 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[696] arXiv:2201.11975 [pdf, other]: Title: Generalized Visual Quality Assessment of GAN-Generated Face Images

Authors: Yu Tian, Zhangkai Ni, Baoliang Chen, Shiqi Wang, Hanli Wang, Sam Kwong

Comments: 12 pages, 8 figures, journal paper

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[697] arXiv:2201.11995 [pdf, other]: Title: Hybrid Contrastive Learning with Cluster Ensemble for Unsupervised Person Re-identification

Authors: He Sun, Mingkun Li, Chun-Guang Li

Comments: accepted by ACPR2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[698] arXiv:2201.12010 [pdf, other]: Title: Unfolding a blurred image

Authors: Kuldeep Purohit, Anshul Shah, A. N. Rajagopalan

Comments: arXiv admin note: substantial text overlap with arXiv:1804.02913

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[699] arXiv:2201.12047 [src]: Title: Exploring Object-Aware Attention Guided Frame Association for RGB-D SLAM

Authors: Ali Caglayan, Nevrez Imamoglu, Oguzhan Guclu, Ali Osman Serhatoglu, Weimin Wang, Ahmet Burak Can, Ryosuke Nakamura

Comments: This article has been removed by arXiv administrators because the submitter did not have the authority to grant the license at the time of submission

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[700] arXiv:2201.12051 [pdf, ps, other]: Title: Detection of fake faces in videos

Authors: M. Shamanth, Russel Mathias, Dr Vijayalakshmi MN

Comments: 5 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[701] arXiv:2201.12078 [pdf, other]: Title: You Only Cut Once: Boosting Data Augmentation with a Single Cut

Authors: Junlin Han, Pengfei Fang, Weihao Li, Jie Hong, Mohammad Ali Armin, Ian Reid, Lars Petersson, Hongdong Li

Comments: ICML 2022, Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[702] arXiv:2201.12083 [pdf, other]: Title: DynaMixer: A Vision MLP Architecture with Dynamic Mixing

Authors: Ziyu Wang, Wenhao Jiang, Yiming Zhu, Li Yuan, Yibing Song, Wei Liu

Comments: ICML 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[703] arXiv:2201.12084 [pdf, other]: Title: Psychophysical Evaluation of Human Performance in Detecting Digital Face Image Manipulations

Authors: Robert Nichols, Christian Rathgeb, Pawel Drozdowski, Christoph Busch

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[704] arXiv:2201.12086 [pdf, other]: Title: BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Authors: Junnan Li, Dongxu Li, Caiming Xiong, Steven Hoi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[705] arXiv:2201.12089 [pdf, other]: Title: Label uncertainty-guided multi-stream model for disease screening

Authors: Chi Liu, Zongyuan Ge, Mingguang He, Xiaotong Han

Comments: To appear in ISBI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[706] arXiv:2201.12094 [pdf, other]: Title: Leveraging Inlier Correspondences Proportion for Point Cloud Registration

Authors: Lifa Zhu, Haining Guan, Changwei Lin, Renmin Han

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[707] arXiv:2201.12099 [pdf, other]: Title: Detecting Owner-member Relationship with Graph Convolution Network in Fisheye Camera System

Authors: Zizhang Wu, Jason Wang, Tianhao Xu, Fan Wang

Comments: Accepted by Pattern Recognition. arXiv admin note: substantial text overlap with arXiv:2103.16099

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[708] arXiv:2201.12133 [pdf, other]: Title: O-ViT: Orthogonal Vision Transformer

Authors: Yanhong Fei, Yingjie Liu, Xian Wei, Mingsong Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[709] arXiv:2201.12170 [pdf, other]: Title: Unsupervised Single-shot Depth Estimation using Perceptual Reconstruction

Authors: Christoph Angermann, Matthias Schwab, Markus Haltmeier, Christian Laubichler, Steinbjörn Jónsson

Comments: arXiv admin note: text overlap with arXiv:2103.16938

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[710] arXiv:2201.12184 [pdf, other]: Title: A tomographic workflow to enable deep learning for X-ray based foreign object detection

Authors: Mathé T. Zeegers, Tristan van Leeuwen, Daniël M. Pelt, Sophia Bethany Coban, Robert van Liere, Kees Joost Batenburg

Comments: This paper is under consideration at Expert Systems with Applications. 22 pages, 15 figures

Journal-ref: Expert Systems with Applications 206 (2022) 117768

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[711] arXiv:2201.12212 [pdf, other]: Title: Möbius Convolutions for Spherical CNNs

Authors: Thomas W. Mitchel, Noam Aigerman, Vladimir G. Kim, Michael Kazhdan

Comments: SIGGRAPH 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Representation Theory (math.RT)
[712] arXiv:2201.12216 [pdf, other]: Title: Self-paced learning to improve text row detection in historical documents with missing labels

Authors: Mihaela Gaman, Lida Ghadamiyan, Radu Tudor Ionescu, Marius Popescu

Comments: Accepted at ECCV Workshop on Text in Everything (TiE 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[713] arXiv:2201.12265 [pdf, other]: Title: 3D-FlowNet: Event-based optical flow estimation with 3D representation

Authors: Haixin Sun, Minh-Quan Dao, Vincent Fremont

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[714] arXiv:2201.12269 [pdf, ps, other]: Title: HSADML: Hyper-Sphere Angular Deep Metric based Learning for Brain Tumor Classification

Authors: Aman Verma, Vibhav Prakash Singh

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[715] arXiv:2201.12285 [pdf, other]: Title: Benchmarking Conventional Vision Models on Neuromorphic Fall Detection and Action Recognition Dataset

Authors: Karthik Sivarama Krishnan, Koushik Sivarama Krishnan

Comments: 6 Pages, 2 Figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[716] arXiv:2201.12288 [pdf, other]: Title: VRT: A Video Restoration Transformer

Authors: Jingyun Liang, Jiezhang Cao, Yuchen Fan, Kai Zhang, Rakesh Ranjan, Yawei Li, Radu Timofte, Luc Van Gool

Comments: add results on VFI and STVSR; SOTA results (+up to 2.16dB) on video SR, video deblurring, video denoising, video frame interpolation and space-time video super-resolution. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[717] arXiv:2201.12329 [pdf, other]: Title: DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR

Authors: Shilong Liu, Feng Li, Hao Zhang, Xiao Yang, Xianbiao Qi, Hang Su, Jun Zhu, Lei Zhang

Comments: Accepted to ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[718] arXiv:2201.12346 [pdf, other]: Title: DiriNet: A network to estimate the spatial and spectral degradation functions

Authors: Ting Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[719] arXiv:2201.12384 [pdf, ps, other]: Title: Developing a Machine-Learning Algorithm to Diagnose Age-Related Macular Degeneration

Authors: Ananya Dua, Pham Hung Minh, Sajid Fahmid, Shikhar Gupta, Sophia Zheng, Vanessa Moyo, Yanran Elisa Xue

Comments: 7 pages, 7 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[720] arXiv:2201.12385 [pdf, other]: Title: A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise

Authors: Weimin Zhou, Miguel P. Eckstein

Comments: SPIE Medical Imaging 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[721] arXiv:2201.12386 [pdf, other]: Title: Few-shot Unsupervised Domain Adaptation for Multi-modal Cardiac Image Segmentation

Authors: Mingxuan Gu, Sulaiman Vesal, Ronak Kosti, Andreas Maier

Comments: Accepted to BVM2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[722] arXiv:2201.12425 [pdf, other]: Title: CoordX: Accelerating Implicit Neural Representation with a Split MLP Architecture

Authors: Ruofan Liang, Hongyi Sun, Nandita Vijaykumar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[723] arXiv:2201.12437 [pdf, other]: Title: Mobile Robot Manipulation using Pure Object Detection

Authors: Brent Griffin

Comments: WACV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[724] arXiv:2201.12467 [pdf, other]: Title: Improving Federated Learning Face Recognition via Privacy-Agnostic Clusters

Authors: Qiang Meng, Feng Zhou, Hainan Ren, Tianshu Feng, Guochao Liu, Yuanqing Lin

Comments: ICLR2022, Spotlight

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[725] arXiv:2201.12499 [pdf, other]: Title: Reconstruction of Power Lines from Point Clouds

Authors: Alexander Gribov, Khalid Duri

Comments: 15 pages, 8 figures, 1 table

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
[726] arXiv:2201.12506 [pdf, other]: Title: 2D+3D facial expression recognition via embedded tensor manifold regularization

Authors: Yunfang Fu, Qiuqi Ruan, Ziyan Luo, Gaoyun An, Yi Jin, Jun Wan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[727] arXiv:2201.12525 [pdf, other]: Title: Spherical Convolution empowered FoV Prediction in 360-degree Video Multicast with Limited FoV Feedback

Authors: Jie Li, Ling Han, Cong Zhang, Qiyue Li, Zhi Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[728] arXiv:2201.12527 [pdf, other]: Title: Scale-Invariant Adversarial Attack for Evaluating and Enhancing Adversarial Defenses

Authors: Mengting Xu, Tao Zhang, Zhongnian Li, Daoqiang Zhang

Comments: TDSC under review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[729] arXiv:2201.12528 [pdf, ps, other]: Title: SupWMA: Consistent and Efficient Tractography Parcellation of Superficial White Matter with Deep Learning

Authors: Tengfei Xue, Fan Zhang, Chaoyi Zhang, Yuqian Chen, Yang Song, Nikos Makris, Yogesh Rathi, Weidong Cai, Lauren J. O'Donnell

Comments: ISBI 2022 Oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[730] arXiv:2201.12533 [pdf, other]: Title: Light field Rectification based on relative pose estimation

Authors: Xiao Huo, Dongyang Jin, Saiping Zhang, Fuzheng Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[731] arXiv:2201.12543 [pdf, other]: Title: Fast Differentiable Matrix Square Root and Inverse Square Root

Authors: Yue Song, Nicu Sebe, Wei Wang

Comments: T-PAMI 2022. arXiv admin note: substantial text overlap with arXiv:2201.08663

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[732] arXiv:2201.12558 [pdf, other]: Title: The KFIoU Loss for Rotated Object Detection

Authors: Xue Yang, Yue Zhou, Gefan Zhang, Jirui Yang, Wentao Wang, Junchi Yan, Xiaopeng Zhang, Qi Tian

Comments: 18 pages, 6 figures, 8 tables, accepted by ICLR 2023, TensorFlow code: this https URL, PyTorch code: this https URL, Jittor code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[733] arXiv:2201.12559 [pdf, other]: Title: Rebalancing Batch Normalization for Exemplar-based Class-Incremental Learning

Authors: Sungmin Cha, Sungjun Cho, Dasol Hwang, Sunwon Hong, Moontae Lee, Taesup Moon

Comments: CVPR 2023 camera ready

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[734] arXiv:2201.12576 [pdf, other]: Title: Scale-arbitrary Invertible Image Downscaling

Authors: Jinbo Xing, Wenbo Hu, Tien-Tsin Wong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[735] arXiv:2201.12592 [pdf, other]: Title: Exact Decomposition of Joint Low Rankness and Local Smoothness Plus Sparse Matrices

Authors: Jiangjun Peng, Yao Wang, Hongying Zhang, Jianjun Wang, Deyu Meng

Comments: 15 pages, 14 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[736] arXiv:2201.12596 [pdf, other]: Title: MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage Learning

Authors: Zejun Li, Zhihao Fan, Huaixiao Tou, Jingjing Chen, Zhongyu Wei, Xuanjing Huang

Comments: Accepted by ACM MM22

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[737] arXiv:2201.12599 [pdf, other]: Title: Semantic-assisted image compression

Authors: Qizheng Sun (1), Caili Guo (1), Yang Yang (1), Jiujiu Chen (1), Xijun Xue (2) ((1) bupt.edu.cn, (2) chinatelecom.cn)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[738] arXiv:2201.12622 [pdf, ps, other]: Title: Hand Gesture Recognition of Dumb Person Using one Against All Neural Network

Authors: Muhammad Asim Khan, Lan Hong, Sajjad Ahmed

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[739] arXiv:2201.12625 [pdf, ps, other]: Title: ADC-Net: An Open-Source Deep Learning Network for Automated Dispersion Compensation in Optical Coherence Tomography

Authors: Shaiban Ahmed (1), David Le (1), Taeyoon Son (1), Tobiloba Adejumo (1), Xincheng Yao (1,2) ((1) Department of Biomedical Engineering, University of Illinois at Chicago, (2) Department of Ophthalmology, Visual Science, University of Illinois at Chicago)

Comments: 18 pages, 5 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Tissues and Organs (q-bio.TO)
[740] arXiv:2201.12626 [pdf, other]: Title: Assessing Cross-dataset Generalization of Pedestrian Crossing Predictors

Authors: Joseph Gesnouin, Steve Pechberti, Bogdan Stanciulescu, Fabien Moutarde

Comments: Submitted to the 33rd IEEE Intelligent Vehicles Symposium

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[741] arXiv:2201.12633 [pdf, other]: Title: Image Classification using Graph Neural Network and Multiscale Wavelet Superpixels

Authors: Varun Vasudevan, Maxime Bassenne, Md Tauhidul Islam, Lei Xing

Comments: 17 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[742] arXiv:2201.12646 [pdf, other]: Title: Self Semi Supervised Neural Architecture Search for Semantic Segmentation

Authors: Loïc Pauletto, Massih-Reza Amini, Nicolas Winckler

Comments: 21 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[743] arXiv:2201.12649 [pdf, ps, other]: Title: Transfer Learning for Estimation of Pendubot Angular Position Using Deep Neural Networks

Authors: Sina Khanagha

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Systems and Control (eess.SY)
[744] arXiv:2201.12693 [pdf, other]: Title: Extracting Built Environment Features for Planning Research with Computer Vision: A Review and Discussion of State-of-the-Art Approaches

Authors: Meiqing Li, Hao Sheng

Comments: CUPUM 2021 (The 17th International Conference on Computational Urban Planning and Urban Management)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[745] arXiv:2201.12705 [pdf, ps, other]: Title: A Robust Framework for Deep Learning Approaches to Facial Emotion Recognition and Evaluation

Authors: Nyle Siddiqui, Rushit Dave, Tyler Bauer, Thomas Reither, Dylan Black, Mitchell Hanson

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[746] arXiv:2201.12709 [pdf, other]: Title: Low-Rank Tensor Completion Based on Bivariate Equivalent Minimax-Concave Penalty

Authors: Hongbing Zhang, Xinyi Liu, Hongtao Fan, Yajing Li, Yinlin Ye

Comments: arXiv admin note: text overlap with arXiv:2109.12257

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[747] arXiv:2201.12712 [pdf, other]: Title: Win the Lottery Ticket via Fourier Analysis: Frequencies Guided Network Pruning

Authors: Yuzhang Shang, Bin Duan, Ziliang Zong, Liqiang Nie, Yan Yan

Comments: accepted to ICASSP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[748] arXiv:2201.12723 [pdf, other]: Title: A Frustratingly Simple Approach for End-to-End Image Captioning

Authors: Ziyang Luo, Yadong Xi, Rongsheng Zhang, Jing Ma

Comments: Work in progress

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[749] arXiv:2201.12725 [pdf, other]: Title: Generalized Global Ranking-Aware Neural Architecture Ranker for Efficient Image Classifier Search

Authors: Bicheng Guo, Tao Chen, Shibo He, Haoyu Liu, Lilin Xu, Peng Ye, Jiming Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[750] arXiv:2201.12728 [pdf, other]: Title: Video-based Facial Micro-Expression Analysis: A Survey of Datasets, Features and Algorithms

Authors: Xianye Ben, Yi Ren, Junping Zhang, Su-Jing Wang, Kidiyo Kpalma, Weixiao Meng, Yong-Jin Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[751] arXiv:2201.12733 [pdf, other]: Title: TPC: Transformation-Specific Smoothing for Point Cloud Models

Authors: Wenda Chu, Linyi Li, Bo Li

Comments: Accepted as a conference paper at ICML 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[752] arXiv:2201.12763 [pdf, other]: Title: RIM-Net: Recursive Implicit Fields for Unsupervised Learning of Hierarchical Shape Structures

Authors: Chengjie Niu, Manyi Li, Kai Xu, Hao Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[753] arXiv:2201.12765 [pdf, other]: Title: Improving Robustness by Enhancing Weak Subnets

Authors: Yong Guo, David Stutz, Bernt Schiele

Comments: To appear in ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[754] arXiv:2201.12769 [pdf, other]: Title: MVP-Net: Multiple View Pointwise Semantic Segmentation of Large-Scale Point Clouds

Authors: Chuanyu Luo, Xiaohan Li, Nuo Cheng, Han Li, Shengguang Lei, Pu Li

Journal-ref: 30. International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision (WSCG), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[755] arXiv:2201.12771 [pdf, other]: Title: Self-Supervised Moving Vehicle Detection from Audio-Visual Cues

Authors: Jannik Zürn, Wolfram Burgard

Comments: 8 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[756] arXiv:2201.12792 [pdf, other]: Title: SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video

Authors: Boyi Jiang, Yang Hong, Hujun Bao, Juyong Zhang

Comments: CVPR 2022, Oral. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[757] arXiv:2201.12805 [pdf, other]: Title: Automatic Segmentation of Left Ventricle in Cardiac Magnetic Resonance Images

Authors: Garvit Chhabra, J. H. Gagan, J. R. Harish Kumar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[758] arXiv:2201.12813 [pdf, other]: Title: Contrastive Learning from Demonstrations

Authors: André Correia, Luís A. Alexandre

Journal-ref: IEEE Robotic Computing, Naples, Italy, December 5-7, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[759] arXiv:2201.12826 [pdf, other]: Title: OptG: Optimizing Gradient-driven Criteria in Network Sparsity

Authors: Yuxin Zhang, Mingbao Lin, Mengzhao Chen, Fei Chao, Rongrong Ji

Comments: 11 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[760] arXiv:2201.12828 [pdf, other]: Title: Comprehensive Saliency Fusion for Object Co-segmentation

Authors: Harshit Singh Chhabra, Koteswar Rao Jerripothula

Comments: Published in IEEE ISM 2021. Please cite this paper in the following manner. H. S. Chhabra and K. Rao Jerripothula, "Comprehensive Saliency Fusion for Object Co-segmentation," 2021 IEEE International Symposium on Multimedia (ISM), 2021, pp. 107-110, doi: 10.1109/ISM52913.2021.00026

Journal-ref: 2021 IEEE International Symposium on Multimedia (ISM), 2021, pp. 107-110

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[761] arXiv:2201.12888 [pdf, other]: Title: A Dataset for Medical Instructional Video Classification and Question Answering

Authors: Deepak Gupta, Kush Attal, Dina Demner-Fushman

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[762] arXiv:2201.12903 [pdf, other]: Title: Aggregating Global Features into Local Vision Transformer

Authors: Krushi Patel, Andres M. Bur, Fengjun Li, Guanghui Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[763] arXiv:2201.12944 [pdf, other]: Title: Deep Learning Approaches on Image Captioning: A Review

Authors: Taraneh Ghandi, Hamidreza Pourreza, Hamidreza Mahyar

Comments: 41 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[764] arXiv:2201.12961 [pdf, other]: Title: Plug-In Inversion: Model-Agnostic Inversion for Vision with Data Augmentations

Authors: Amin Ghiasi, Hamid Kazemi, Steven Reich, Chen Zhu, Micah Goldblum, Tom Goldstein

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[765] arXiv:2201.13013 [pdf, other]: Title: A Simple And Effective Filtering Scheme For Improving Neural Fields

Authors: Yixin Zhuang

Comments: Accepted to Computational Visual Media

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[766] arXiv:2201.13027 [pdf, other]: Title: BOAT: Bilateral Local Attention Vision Transformer

Authors: Tan Yu, Gangming Zhao, Ping Li, Yizhou Yu

Comments: BMVC2022 oral

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[767] arXiv:2201.13063 [pdf, other]: Title: NeuralTailor: Reconstructing Sewing Pattern Structures from 3D Point Clouds of Garments

Authors: Maria Korosteleva, Sung-Hee Lee

Comments: Updated to the version accepted to SIGGRAPH 2022 (Journal Track)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[768] arXiv:2201.13065 [pdf, other]: Title: Rigidity Preserving Image Transformations and Equivariance in Perspective

Authors: Lucas Brynte, Georg Bökman, Axel Flinth, Fredrik Kahl

Comments: v2: Substantially revised version. Among other things, experiments with the PixLoc model added

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[769] arXiv:2201.13066 [pdf, ps, other]: Title: Single Object Tracking: A Survey of Methods, Datasets, and Evaluation Metrics

Authors: Zahra Soleimanitaleb, Mohammad Ali Keyvanrad

Comments: 15 pages. This paper is about object tracking and reviews methods for this task. It was first published at the ICCKE2019 conference and is extended in this new paper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[770] arXiv:2201.13078 [pdf, other]: Title: Lymphoma segmentation from 3D PET-CT images using a deep evidential network

Authors: Ling Huang, Su Ruan, Pierre Decazes, Thierry Denoeux

Comments: Preprint submitted to International Journal of Approximate Reasoning

Journal-ref: International Journal of Approximate Reasoning, Volume 149, 2022, Pages 39-60

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[771] arXiv:2201.13081 [pdf, other]: Title: Unsupervised Anomaly Detection in 3D Brain MRI using Deep Learning with Multi-Task Brain Age Prediction

Authors: Marcel Bengs, Finn Behrendt, Max-Heinrich Laves, Julia Krüger, Roland Opfer, Alexander Schlaefer

Comments: Accepted at SPIE Medical Imaging 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[772] arXiv:2201.13084 [pdf, other]: Title: Crowd-powered Face Manipulation Detection: Fusing Human Examiner Decisions

Authors: Christian Rathgeb, Robert Nichols, Mathias Ibsen, Pawel Drozdowski, Christoph Busch

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[773] arXiv:2201.13100 [pdf, other]: Title: Adversarial Masking for Self-Supervised Learning

Authors: Yuge Shi, N. Siddharth, Philip H.S. Torr, Adam R. Kosiorek

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[774] arXiv:2201.13164 [pdf, other]: Title: Imperceptible and Multi-channel Backdoor Attack against Deep Neural Networks

Authors: Mingfu Xue, Shifeng Ni, Yinghao Wu, Yushu Zhang, Jian Wang, Weiqiang Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[775] arXiv:2201.13178 [pdf, other]: Title: Few-Shot Backdoor Attacks on Visual Object Tracking

Authors: Yiming Li, Haoxiang Zhong, Xingjun Ma, Yong Jiang, Shu-Tao Xia

Comments: This work was accepted by ICLR 2022. The first two authors contributed equally to this work. In this version, we fix some typos and errors contained in the last one. 21 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[776] arXiv:2201.13182 [pdf, other]: Title: Learning Super-Features for Image Retrieval

Authors: Philippe Weinzaepfel, Thomas Lucas, Diane Larlus, Yannis Kalantidis

Comments: ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[777] arXiv:2201.13229 [pdf, other]: Title: Network-level Safety Metrics for Overall Traffic Safety Assessment: A Case Study

Authors: Xiwen Chen, Hao Wang, Abolfazl Razi, Brendan Russo, Jason Pacheco, John Roberts, Jeffrey Wishart, Larry Head, Alonso Granados Baca

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[778] arXiv:2201.13271 [pdf, other]: Title: StRegA: Unsupervised Anomaly Detection in Brain MRIs using a Compact Context-encoding Variational Autoencoder

Authors: Soumick Chatterjee, Alessandro Sciarra, Max Dünnwald, Pavan Tummala, Shubham Kumar Agrawal, Aishwarya Jauhari, Aman Kalra, Steffen Oeltze-Jafra, Oliver Speck, Andreas Nürnberger

Journal-ref: Computers in Biology and Medicine, 106093 (2022)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[779] arXiv:2201.13278 [pdf, other]: Title: Combining Local and Global Pose Estimation for Precise Tracking of Similar Objects

Authors: Niklas Gard, Anna Hilsmann, Peter Eisert

Comments: Accepted at VISAPP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[780] arXiv:2201.13279 [pdf, other]: Title: UQGAN: A Unified Model for Uncertainty Quantification of Deep Classifiers trained via Conditional GANs

Authors: Philipp Oberdiek, Gernot A. Fink, Matthias Rottmann

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[781] arXiv:2201.13291 [pdf, other]: Title: Metrics for saliency map evaluation of deep learning explanation methods

Authors: Tristan Gomez, Thomas Fréour, Harold Mouchère

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[782] arXiv:2201.13312 [pdf, other]: Title: On scale-invariant properties in natural images and their simulations

Authors: Maxim Koroteev, Kirill Aistov

Comments: 7 pages, 13 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[783] arXiv:2201.13322 [pdf, other]: Title: Learning to Hash Naturally Sorts

Authors: Jiaguo Yu, Yuming Shen, Menghan Wang, Haofeng Zhang, Philip H.S. Torr

Comments: IJCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[784] arXiv:2201.13338 [pdf, other]: Title: Modeling the Background for Incremental and Weakly-Supervised Semantic Segmentation

Authors: Fabio Cermelli, Massimiliano Mancini, Samuel Rota Buló, Elisa Ricci, Barbara Caputo

Comments: Accepted by T-PAMI (this https URL). arXiv admin note: substantial text overlap with arXiv:2002.00718

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[785] arXiv:2201.13392 [src]: Title: MHSnet: Multi-head and Spatial Attention Network with False-Positive Reduction for Pulmonary Nodules Detection

Authors: Juanyun Mai, Minghao Wang, Jiayin Zheng, Yanbo Shao, Zhaoqi Diao, Xinliang Fu, Yulong Chen, Jianyu Xiao, Jian You, Airu Yin, Yang Yang, Xiangcheng Qiu, Jinsheng Tao, Bo Wang, Hua Ji

Comments: We have to revise the experiment results and conclusions

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[786] arXiv:2201.13433 [pdf, other]: Title: Third Time's the Charm? Image and Video Editing with StyleGAN3

Authors: Yuval Alaluf, Or Patashnik, Zongze Wu, Asif Zamir, Eli Shechtman, Dani Lischinski, Daniel Cohen-Or

Comments: Project page available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[787] arXiv:2201.00063 (cross-list from eess.SY) [pdf, other]: Title: Croesus: Multi-Stage Processing and Transactions for Video-Analytics in Edge-Cloud Systems

Authors: Samaa Gazzaz, Vishal Chakraborty, Faisal Nawab

Comments: Published in ICDE2022

Subjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[788] arXiv:2201.00148 (cross-list from cs.LG) [pdf, other]: Title: Rethinking Feature Uncertainty in Stochastic Neural Networks for Adversarial Robustness

Authors: Hao Yang, Min Wang, Zhengfei Yu, Yun Zhou

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[789] arXiv:2201.00168 (cross-list from cs.LG) [pdf, other]: Title: Self-attention Multi-view Representation Learning with Diversity-promoting Complementarity

Authors: Jian-wei Liu, Xi-hao Ding, Run-kun Lu, Xionglin Luo

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[790] arXiv:2201.00171 (cross-list from cs.LG) [pdf, other]: Title: Multi-view Subspace Adaptive Learning via Autoencoder and Attention

Authors: Jian-wei Liu, Hao-jie Xie, Run-kun Lu, Xiong-lin Luo

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[791] arXiv:2201.00308 (cross-list from cs.LG) [pdf, other]: Title: DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents

Authors: Kushagra Pandey, Avideep Mukherjee, Piyush Rai, Abhishek Kumar

Comments: 12 pages main content. Camera-Ready version accepted at Transactions on Machine Learning Research

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[792] arXiv:2201.00511 (cross-list from cs.MM) [pdf, ps, other]: Title: Centre Symmetric Quadruple Pattern: A Novel Descriptor for Facial Image Recognition and Retrieval

Authors: Soumendu Chakraborty, Satish Kumar Singh, Pavan Chakraborty

Comments: arXiv admin note: text overlap with arXiv:2201.00504

Journal-ref: Pattern Recognition Letters, vol-115, pp.50-58, (2018). (Elsevier) ISSN/ISBN: 0167-8655

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[793] arXiv:2201.00596 (cross-list from cs.RO) [pdf, other]: Title: LiDAR Point--to--point Correspondences for Rigorous Registration of Kinematic Scanning in Dynamic Networks

Authors: Aurélien Brun, Davide Antonio Cucci, Jan Skaloud

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[794] arXiv:2201.00604 (cross-list from cs.LG) [pdf, other]: Title: An analysis of over-sampling labeled data in semi-supervised learning with FixMatch

Authors: Miquel Martí i Rabadán, Sebastian Bujwid, Alessandro Pieropan, Hossein Azizpour, Atsuto Maki

Comments: 10 pages, 3 figures. Published at NLDL 2022

Journal-ref: Vol. 3 (2022): Proceedings of the Northern Lights Deep Learning Workshop 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[795] arXiv:2201.00693 (cross-list from cs.IR) [pdf, other]: Title: Multimodal Entity Tagging with Multimodal Knowledge Base

Authors: Hao Peng, Hang Li, Lei Hou, Juanzi Li, Chao Qiao

Comments: 11 pages, 4 figures

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[796] arXiv:2201.00849 (cross-list from cs.LG) [pdf, other]: Title: Delving into Sample Loss Curve to Embrace Noisy and Imbalanced Data

Authors: Shenwang Jiang, Jianan Li, Ying Wang, Bo Huang, Zhang Zhang, Tingfa Xu

Comments: Accepted by AAAI-2022

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[797] arXiv:2201.01003 (cross-list from cs.LG) [pdf, other]: Title: Aligning Domain-specific Distribution and Classifier for Cross-domain Classification from Multiple Sources

Authors: Yongchun Zhu, Fuzhen Zhuang, Deqing Wang

Comments: AAAI 2019 long paper. Multi-source Domain Adaptation

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[798] arXiv:2201.01155 (cross-list from cs.LG) [pdf, other]: Title: DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification Training

Authors: Xianglin Yang, Yun Lin, Ruofan Liu, Zhenfeng He, Chao Wang, Jin Song Dong, Hong Mei

Comments: Accepted in AAAI'22

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Software Engineering (cs.SE)
[799] arXiv:2201.01222 (cross-list from cs.LG) [pdf, other]: Title: The cluster structure function

Authors: Andrew R. Cohen, Paul M.B. Vitányi

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[800] arXiv:2201.01230 (cross-list from cs.LG) [pdf, other]: Title: Robust Semi-supervised Federated Learning for Images Automatic Recognition in Internet of Drones

Authors: Zhe Zhang, Shiyao Ma, Zhaohui Yang, Zehui Xiong, Jiawen Kang, Yi Wu, Kejia Zhang, Dusit Niyato

Comments: arXiv admin note: text overlap with arXiv:2110.13388

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[801] arXiv:2201.01250 (cross-list from cs.LG) [pdf, other]: Title: Transfer Learning for Retinal Vascular Disease Detection: A Pilot Study with Diabetic Retinopathy and Retinopathy of Prematurity

Authors: Guan Wang, Yusuke Kikuchi, Jinglin Yi, Qiong Zou, Rui Zhou, Xin Guo

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[802] arXiv:2201.01353 (cross-list from cs.LG) [pdf, other]: Title: Linear Variational State-Space Filtering

Authors: Daniel Pfrommer, Nikolai Matni

Comments: 18 pages, 6 figures. Fixed proof in appendix. For associated code, see this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[803] arXiv:2201.01367 (cross-list from cs.RO) [pdf, other]: Title: DenseTact: Optical Tactile Sensor for Dense Shape Reconstruction

Authors: Won Kyung Do, Monroe Kennedy III

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[804] arXiv:2201.01466 (cross-list from cs.AI) [pdf, ps, other]: Title: Challenges of Artificial Intelligence -- From Machine Learning and Computer Vision to Emotional Intelligence

Authors: Matti Pietikäinen, Olli Silven

Comments: 234 pages. Published as an electronic publication at the University of Oulu, Finland, in December 2021, ISBN: 978-952-62-3199-0 link this http URL

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[805] arXiv:2201.01488 (cross-list from cs.LG) [pdf, other]: Title: Exemplar-free Class Incremental Learning via Discriminative and Comparable One-class Classifiers

Authors: Wenju Sun, Qingyong Li, Jing Zhang, Danyu Wang, Wen Wang, Yangli-ao Geng

Journal-ref: Pattern Recognition, 2023, 140: 109561

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[806] arXiv:2201.01490 (cross-list from cs.LG) [pdf, other]: Title: Debiased Learning from Naturally Imbalanced Pseudo-Labels

Authors: Xudong Wang, Zhirong Wu, Long Lian, Stella X. Yu

Comments: Accepted by CVPR 2022

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[807] arXiv:2201.01760 (cross-list from cs.RO) [pdf, other]: Title: Multi-Robot Collaborative Perception with Graph Neural Networks

Authors: Yang Zhou, Jiuhong Xiao, Yue Zhou, Giuseppe Loianno

Comments: 8 pages, 10 figures, 3 tables, Accepted at the IEEE Robotics Automation Letter (RAL) and the IEEE International Conference on Robotics and Automation (ICRA), 2022

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[808] arXiv:2201.01763 (cross-list from cs.SD) [pdf, other]: Title: Robust Self-Supervised Audio-Visual Speech Recognition

Authors: Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed

Comments: Interspeech 2022

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[809] arXiv:2201.01806 (cross-list from cs.LG) [pdf, other]: Title: Revisiting Deep Subspace Alignment for Unsupervised Domain Adaptation

Authors: Kowshik Thopalli, Jayaraman J Thiagarajan, Rushil Anirudh, Pavan K Turaga

Comments: arXiv admin note: text overlap with arXiv:1906.04338

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[810] arXiv:2201.01819 (cross-list from cs.LG) [pdf, other]: Title: Formal Analysis of Art: Proxy Learning of Visual Concepts from Style Through Language Models

Authors: Diana Kim, Ahmed Elgammal, Marian Mazzone

Comments: 23 pages. This paper is an extended version of a paper that will be published at the 36th AAAI Conference on Artificial Intelligence, to be held in Vancouver, BC, Canada, February 22 - March 1, 2022

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[811] arXiv:2201.01873 (cross-list from cs.GR) [pdf, other]: Title: NeuralMLS: Geometry-Aware Control Point Deformation

Authors: Meitar Shechter, Rana Hanocka, Gal Metzer, Raja Giryes, Daniel Cohen-Or

Comments: Eurographics 2022 Short Papers

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[812] arXiv:2201.01922 (cross-list from cs.LG) [pdf, other]: Title: Contrastive Neighborhood Alignment

Authors: Pengkai Zhu, Zhaowei Cai, Yuanjun Xiong, Zhuowen Tu, Luis Goncalves, Vijay Mahadevan, Stefano Soatto

Comments: 10 pages, 7 tables, 3 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[813] arXiv:2201.01978 (cross-list from cs.LG) [pdf, other]: Title: An Abstraction-Refinement Approach to Verifying Convolutional Neural Networks

Authors: Matan Ostrovsky, Clark Barrett, Guy Katz

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Logic in Computer Science (cs.LO)
[814] arXiv:2201.02057 (cross-list from cs.LG) [pdf, other]: Title: GLAN: A Graph-based Linear Assignment Network

Authors: He Liu, Tao Wang, Congyan Lang, Songhe Feng, Yi Jin, Yidong Li

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[815] arXiv:2201.02478 (cross-list from cs.LG) [pdf, other]: Title: Bayesian Neural Networks for Reversible Steganography

Authors: Ching-Chun Chang

Journal-ref: IEEE Access (2022), vol. 10, pp. 36327-36334

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[816] arXiv:2201.02610 (cross-list from cs.GR) [pdf, other]: Title: Embodied Hands: Modeling and Capturing Hands and Bodies Together

Authors: Javier Romero, Dimitrios Tzionas, Michael J. Black

Comments: SIGGRAPH ASIA 2017

Journal-ref: ACM Transactions on Graphics, Vol. 36, No. 6, Article 245. Publication date: November 2017

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[817] arXiv:2201.02620 (cross-list from cs.LG) [pdf, other]: Title: Compressing Models with Few Samples: Mimicking then Replacing

Authors: Huanyu Wang, Junjie Liu, Xin Ma, Yang Yong, Zhenhua Chai, Jianxin Wu

Comments: 12 pages, 3 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[818] arXiv:2201.02693 (cross-list from cs.LG) [pdf, other]: Title: BottleFit: Learning Compressed Representations in Deep Neural Networks for Effective and Efficient Split Computing

Authors: Yoshitomo Matsubara, Davide Callegaro, Sameer Singh, Marco Levorato, Francesco Restuccia

Comments: Accepted to IEEE WoWMoM 2022. Code and models are available at this https URL

Journal-ref: 2022 IEEE 23rd International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[819] arXiv:2201.02711 (cross-list from cs.LG) [pdf, other]: Title: Block Walsh-Hadamard Transform Based Binary Layers in Deep Neural Networks

Authors: Hongyi Pan, Diaa Badawi, Ahmet Enis Cetin

Comments: This paper has been accepted by ACM Transactions on Embedded Computing Systems

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[820] arXiv:2201.03102 (cross-list from cs.LG) [pdf, other]: Title: Preserving Domain Private Representation via Mutual Information Maximization

Authors: Jiahong Chen, Jing Wang, Weipeng Lin, Kuangen Zhang, Clarence W. de Silva

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[821] arXiv:2201.03215 (cross-list from cs.LG) [pdf, ps, other]: Title: Handwriting recognition and automatic scoring for descriptive answers in Japanese language tests

Authors: Hung Tuan Nguyen, Cuong Tuan Nguyen, Haruki Oka, Tsunenori Ishioka, Masaki Nakagawa

Comments: Keywords: handwritten Japanese answers, handwriting recognition, automatic scoring, ensemble recognition, deep neural networks; Reported in IEICE technical report, PRMU2021-32, pp.45-50 (2021.12) Published after peer review and Presented in ICFHR2022, Lecture Notes in Computer Science, vol. 13639, pp. 274-284 (2022.11)

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[822] arXiv:2201.03364 (cross-list from cs.RO) [pdf, other]: Title: High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM

Authors: Brian M. Hopkinson, Suchendra M. Bhandarkar

Comments: 6 pages plus references, 5 figures

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[823] arXiv:2201.03446 (cross-list from cs.GR) [pdf, ps, other]: Title: Two Methods for Iso-Surface Extraction from Volumetric Data and Their Comparison

Authors: Vaclav Skala, Alex Brusi

Journal-ref: Machine Graphics & Vision, No.1/2, Vol.9, pp.149-166, Poland Academy of Sciences, Poland, ISSN 1230-0535, 2000

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[824] arXiv:2201.03529 (cross-list from cs.LG) [pdf, other]: Title: Head2Toe: Utilizing Intermediate Representations for Better Transfer Learning

Authors: Utku Evci, Vincent Dumoulin, Hugo Larochelle, Michael C. Mozer

Comments: presented at ICML 2022 (Oral)

Journal-ref: ICML 2022, Proceedings of the 39th International Conference on Machine Learning

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[825] arXiv:2201.03668 (cross-list from cs.LG) [pdf, other]: Title: Towards Group Robustness in the presence of Partial Group Labels

Authors: Vishnu Suresh Lokhande, Kihyuk Sohn, Jinsung Yoon, Madeleine Udell, Chen-Yu Lee, Tomas Pfister

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[826] arXiv:2201.03942 (cross-list from cs.LG) [pdf, ps, other]: Title: Feature Extraction Framework based on Contrastive Learning with Adaptive Positive and Negative Samples

Authors: Hongjie Zhang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[827] arXiv:2201.03969 (cross-list from cs.LG) [pdf, other]: Title: Multimodal Representations Learning Based on Mutual Information Maximization and Minimization and Identity Embedding for Multimodal Sentiment Analysis

Authors: Jiahao Zheng, Sen Zhang, Xiaoping Wang, Zhigang Zeng

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[828] arXiv:2201.04014 (cross-list from cs.CR) [pdf, other]: Title: Captcha Attack: Turning Captchas Against Humanity

Authors: Mauro Conti, Luca Pajola, Pier Paolo Tricomi

Comments: Currently under submission

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[829] arXiv:2201.04100 (cross-list from cs.HC) [pdf, other]: Title: Learning to Denoise Raw Mobile UI Layouts for Improving Datasets at Scale

Authors: Gang Li, Gilles Baechler, Manuel Tragut, Yang Li

Comments: Accepted to ACM CHI 2022

Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[830] arXiv:2201.04122 (cross-list from cs.LG) [pdf, other]: Title: In Defense of the Unitary Scalarization for Deep Multi-Task Learning

Authors: Vitaly Kurin, Alessandro De Palma, Ilya Kostrikov, Shimon Whiteson, M. Pawan Kumar

Comments: NeurIPS 2022 camera-ready version, fixed training loss y axis scale

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[831] arXiv:2201.04182 (cross-list from cs.LG) [pdf, other]: Title: HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning

Authors: Andrey Zhmoginov, Mark Sandler, Max Vladymyrov

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[832] arXiv:2201.04194 (cross-list from cs.LG) [pdf, other]: Title: Neural Capacitance: A New Perspective of Neural Network Selection via Edge Dynamics

Authors: Chunheng Jiang, Tejaswini Pedapati, Pin-Yu Chen, Yizhou Sun, Jianxi Gao

Comments: 19 pages, 7 figures, neural architecture search, mean-field

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[833] arXiv:2201.04235 (cross-list from cs.DC) [pdf, other]: Title: SmartDet: Context-Aware Dynamic Control of Edge Task Offloading for Mobile Object Detection

Authors: Davide Callegaro, Francesco Restuccia, Marco Levorato

Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[834] arXiv:2201.04387 (cross-list from cs.RO) [pdf, other]: Title: Maximizing Self-supervision from Thermal Image for Effective Self-supervised Learning of Depth and Ego-motion

Authors: Ukcheol Shin, Kyunghyun Lee, Byeong-Uk Lee, In So Kweon

Comments: 8 pages, Accepted by IEEE Robotics and Automation Letters (RA-L) with IROS 2022 option

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[835] arXiv:2201.04439 (cross-list from cs.GR) [pdf, other]: Title: Real-Time Style Modelling of Human Locomotion via Feature-Wise Transformations and Local Motion Phases

Authors: Ian Mason, Sebastian Starke, Taku Komura

Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[836] arXiv:2201.04473 (cross-list from cs.RO) [pdf, other]: Title: Globally Optimal Multi-Scale Monocular Hand-Eye Calibration Using Dual Quaternions

Authors: Thomas Wodtko, Markus Horn, Michael Buchholz, Klaus Dietmayer

Journal-ref: 2021 International Conference on 3D Vision (3DV)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[837] arXiv:2201.04569 (cross-list from cs.CR) [pdf, other]: Title: Get your Foes Fooled: Proximal Gradient Split Learning for Defense against Model Inversion Attacks on IoMT data

Authors: Sunder Ali Khowaja, Ik Hyun Lee, Kapal Dev, Muhammad Aslam Jarwar, Nawab Muhammad Faseeh Qureshi

Comments: 10 pages, 5 figures, 2 tables

Journal-ref: IEEE Transactions on Network Science and Engineering, 2022

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[838] arXiv:2201.04733 (cross-list from cs.LG) [pdf, other]: Title: Adversarially Robust Classification by Conditional Generative Model Inversion

Authors: Mitra Alirezaei, Tolga Tasdizen

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[839] arXiv:2201.04813 (cross-list from cs.LG) [pdf, other]: Title: Recursive Least Squares for Training and Pruning Convolutional Neural Networks

Authors: Tianzong Yu, Chunyuan Zhang, Yuan Wang, Meng Ma, Qi Song

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[840] arXiv:2201.04990 (cross-list from cs.LG) [pdf, other]: Title: Toddler-Guidance Learning: Impacts of Critical Period on Multimodal AI Agents

Authors: Junseok Park, Kwanyoung Park, Hyunseok Oh, Ganghun Lee, Minsu Lee, Youngki Lee, Byoung-Tak Zhang

Comments: ICMI2021 Oral Presentation, 9 pages, 9 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[841] arXiv:2201.05026 (cross-list from cs.AI) [pdf, other]: Title: Fantastic Data and How to Query Them

Authors: Trung-Kien Tran, Anh Le-Tuan, Manh Nguyen-Duc, Jicheng Yuan, Danh Le-Phuoc

Journal-ref: NeurIPS Data-Centric AI Workshop 2021

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB)
[842] arXiv:2201.05071 (cross-list from cs.CR) [pdf, other]: Title: Evaluation of Neural Networks Defenses and Attacks using NDCG and Reciprocal Rank Metrics

Authors: Haya Brama, Lihi Dery, Tal Grinshpoun

Comments: 12 pages, 5 figures

Journal-ref: International Journal of Information Security 2022

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[843] arXiv:2201.05125 (cross-list from cs.LG) [pdf, other]: Title: GradMax: Growing Neural Networks using Gradient Information

Authors: Utku Evci, Bart van Merriënboer, Thomas Unterthiner, Max Vladymyrov, Fabian Pedregosa

Comments: ICLR 2022

Journal-ref: International Conference on Learning Representations, 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[844] arXiv:2201.05217 (cross-list from cs.LG) [pdf, other]: Title: Learning Enhancement of CNNs via Separation Index Maximizing at the First Convolutional Layer

Authors: Ali Karimi, Ahmad Kalhor

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[845] arXiv:2201.05279 (cross-list from cs.LG) [pdf, other]: Title: Manifoldron: Direct Space Partition via Manifold Discovery

Authors: Dayang Wang, Feng-Lei Fan, Bo-Jian Hou, Hao Zhang, Zhen Jia, Boce Zhou, Rongjie Lai, Hengyong Yu, Fei Wang

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[846] arXiv:2201.05610 (cross-list from cs.LG) [pdf, other]: Title: When less is more: Simplifying inputs aids neural network understanding

Authors: Robin Tibor Schirrmeister, Rosanne Liu, Sara Hooker, Tonio Ball

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[847] arXiv:2201.05809 (cross-list from cs.LG) [pdf, other]: Title: Weighting and Pruning based Ensemble Deep Random Vector Functional Link Network for Tabular Data Classification

Authors: Qiushi Shi, Ponnuthurai Nagaratnam Suganthan, Rakesh Katuwal

Comments: 8 tables, 8 figures, 31 pages

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[848] arXiv:2201.05938 (cross-list from cs.LG) [pdf, other]: Title: GradTail: Learning Long-Tailed Data Using Gradient-based Sample Weighting

Authors: Zhao Chen, Vincent Casser, Henrik Kretzschmar, Dragomir Anguelov

Comments: 15 pages (including Appendix), 8 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[849] arXiv:2201.05977 (cross-list from cs.RO) [pdf, other]: Title: Lightweight Object-level Topological Semantic Mapping and Long-term Global Localization based on Graph Matching

Authors: Fan Wang, Chaofan Zhang, Fulin Tang, Hongkui Jiang, Yihong Wu, Yong Liu

Comments: 9 pages, 12 figures, 23 references

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[850] arXiv:2201.05996 (cross-list from cs.CR) [pdf, ps, other]: Title: Hardware Implementation of Multimodal Biometric using Fingerprint and Iris

Authors: Tariq M Khan

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[851] arXiv:2201.06173 (cross-list from cs.LG) [pdf, other]: Title: SunCast: Solar Irradiance Nowcasting from Geosynchronous Satellite Data

Authors: Dhileeban Kumaresan, Richard Wang, Ernesto Martinez, Richard Cziva, Alberto Todeschini, Colorado J Reed, Hossein Vahabi

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[852] arXiv:2201.06268 (cross-list from cs.AI) [pdf, other]: Title: Continual Transformers: Redundancy-Free Attention for Online Inference

Authors: Lukas Hedegaard, Arian Bakhtiarnia, Alexandros Iosifidis

Comments: 16 pages, 6 figures, 7 tables

Journal-ref: International Conference on Learning Representations, 2023

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[853] arXiv:2201.06321 (cross-list from cs.LG) [pdf, other]: Title: Landscape of Neural Architecture Search across sensors: how much do they differ ?

Authors: Kalifou René Traoré, Andrés Camero, Xiao Xiang Zhu

Comments: This work is under review for a conference publication

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[854] arXiv:2201.06378 (cross-list from cs.AI) [pdf, other]: Title: Self-Supervised Anomaly Detection by Self-Distillation and Negative Sampling

Authors: Nima Rafiee, Rahil Gholamipoorfard, Nikolas Adaloglou, Simon Jaxy, Julius Ramakers, Markus Kollmann

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[855] arXiv:2201.06406 (cross-list from cs.AI) [pdf, ps, other]: Title: Deep Learning-based Quality Assessment of Clinical Protocol Adherence in Fetal Ultrasound Dating Scans

Authors: Sevim Cengiz, Mohammad Yaqub

Comments: 13 pages, 2 figures, 3 tables. Proceedings of Machine Learning Research, Under Review. Full Paper MIDL 2022 submission

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[856] arXiv:2201.06494 (cross-list from cs.AI) [pdf, other]: Title: AugLy: Data Augmentations for Robustness

Authors: Zoe Papakipos, Joanna Bitton

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[857] arXiv:2201.06505 (cross-list from cs.AI) [pdf, ps, other]: Title: Data Harmonisation for Information Fusion in Digital Healthcare: A State-of-the-Art Systematic Review, Meta-Analysis and Future Research Directions

Authors: Yang Nan, Javier Del Ser, Simon Walsh, Carola Schönlieb, Michael Roberts, Ian Selby, Kit Howard, John Owen, Jon Neville, Julien Guiot, Benoit Ernst, Ana Pastor, Angel Alberich-Bayarri, Marion I. Menzel, Sean Walsh, Wim Vos, Nina Flerin, Jean-Paul Charbonnier, Eva van Rikxoort, Avishek Chatterjee, Henry Woodruff, Philippe Lambin, Leonor Cerdá-Alberich, Luis Martí-Bonmatí, Francisco Herrera, Guang Yang

Comments: 54 pages, 14 figures, accepted by the Information Fusion journal

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[858] arXiv:2201.06599 (cross-list from cs.LG) [pdf, ps, other]: Title: Who supervises the supervisor? Model monitoring in production using deep feature embeddings with applications to workpiece inspection

Authors: Michael Banf, Gregor Steinhagen

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[859] arXiv:2201.06618 (cross-list from cs.LG) [pdf, other]: Title: VAQF: Fully Automatic Software-Hardware Co-Design Framework for Low-Bit Vision Transformer

Authors: Mengshu Sun, Haoyu Ma, Guoliang Kang, Yifan Jiang, Tianlong Chen, Xiaolong Ma, Zhangyang Wang, Yanzhi Wang

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[860] arXiv:2201.06640 (cross-list from cs.LG) [pdf, other]: Title: Towards Adversarial Evaluations for Inexact Machine Unlearning

Authors: Shashwat Goel, Ameya Prabhu, Amartya Sanyal, Ser-Nam Lim, Philip Torr, Ponnurangam Kumaraguru

Comments: Tech Report

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[861] arXiv:2201.07207 (cross-list from cs.LG) [pdf, other]: Title: Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents

Authors: Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch

Comments: Project website at this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[862] arXiv:2201.07383 (cross-list from cs.LG) [pdf, other]: Title: Online Deep Learning based on Auto-Encoder

Authors: Si-si Zhang, Jian-wei Liu, Xin Zuo, Run-kun Lu, Si-ming Lian

Comments: 30 pages

Journal-ref: Applied Intelligence (2021)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[863] arXiv:2201.07544 (cross-list from cs.LG) [pdf, other]: Title: Simpler is better: spectral regularization and up-sampling techniques for variational autoencoders

Authors: Sara Björk, Jonas Nordhaug Myhre, Thomas Haugland Johansen

Comments: Submitted to ICASSP 2022, 2022 IEEE International Conference on Acoustics, Speech and Signal Processing

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[864] arXiv:2201.07646 (cross-list from cs.LG) [pdf, other]: Title: A Survey on Training Challenges in Generative Adversarial Networks for Biomedical Image Analysis

Authors: Muhammad Muneeb Saad, Ruairi O'Reilly, Mubashir Husain Rehmani

Comments: Submitted to the AI Review Journal

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[865] arXiv:2201.07698 (cross-list from cs.HC) [pdf, other]: Title: Visualization and Analysis of Wearable Health Data From COVID-19 Patients

Authors: Susanne K. Suter, Georg R. Spinner, Bianca Hoelz, Sofia Rey, Sujeanthraa Thanabalasingam, Jens Eckstein, Sven Hirsch

Comments: 17 pages, 9 figures, conference

Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
[866] arXiv:2201.07779 (cross-list from cs.RO) [pdf, other]: Title: Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation

Authors: Rishabh Jangir, Nicklas Hansen, Sambaran Ghosal, Mohit Jain, Xiaolong Wang

Comments: Accepted in Robotics and Automation Letters Journal (RA-L 2022). Website at this https URL. 8 pages

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[867] arXiv:2201.07823 (cross-list from cs.MM) [pdf, ps, other]: Title: BLINC: Lightweight Bimodal Learning for Low-Complexity VVC Intra Coding

Authors: Farhad Pakdaman, Mohammad Ali Adelimanesh, Mahmoud Reza Hashemi

Journal-ref: Journal of Real-Time Image Processing (2022)

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[868] arXiv:2201.07863 (cross-list from cs.RO) [pdf, other]: Title: ROS georegistration: Aerial Multi-spectral Image Simulator for the Robot Operating System

Authors: Andrew R. Willis, Kevin Brink, Kathleen Dipple

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[869] arXiv:2201.07882 (cross-list from cs.RO) [pdf, ps, other]: Title: An Automated Robotic Arm: A Machine Learning Approach

Authors: Krishnaraj Rao N S, Avinash N J, Rama Moorthy H, Karthik K, Sudesh Rao, Santosh S

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[870] arXiv:2201.07899 (cross-list from cs.CL) [pdf, ps, other]: Title: ASL Video Corpora & Sign Bank: Resources Available through the American Sign Language Linguistic Research Project (ASLLRP)

Authors: Carol Neidle, Augustine Opoku, Dimitris Metaxas

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[871] arXiv:2201.07935 (cross-list from cs.LG) [pdf, ps, other]: Title: Towards deep observation: A systematic survey on artificial intelligence techniques to monitor fetus via Ultrasound Images

Authors: Mahmood Alzubaidi, Marco Agus, Khalid Alyafei, Khaled A Althelaya, Uzair Shah, Alaa Abd-Alrazaq, Mohammed Anbar, Michel Makhlouf, Mowafa Househ

Comments: 25 pages, 4 figures, submitted to Artificial Intelligence in Medicine

Journal-ref: iScience, Volume 25, Issue 8, 19 August 2022, 104713

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Image and Video Processing (eess.IV)
[872] arXiv:2201.08142 (cross-list from cs.RO) [pdf, other]: Title: Physically Embodied Deep Image Optimisation

Authors: Daniela Mihai, Jonathon Hare

Journal-ref: 5th Workshop on Machine Learning for Creativity and Design of the Neural Information Processing Systems (NeurIPS) 2021 Conference

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[873] arXiv:2201.08266 (cross-list from cs.GR) [src]: Title: A Real-Time Rendering Method for Light Field Display

Authors: Quanzhen Wan

Comments: We are reminded by our supervisors and peers that we have not taken many potential influential factors into consideration, which might lead to a rather different outcome. If the whole idea will be certified correctly in the future, we will resubmit our updated version at that time

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[874] arXiv:2201.08279 (cross-list from cs.CG) [pdf, other]: Title: Modeling and hexahedral meshing of cerebral arterial networks from centerlines

Authors: Méghane Decroocq, Carole Frindel, Pierre Rougé, Makoto Ohta, Guillaume Lavoué

Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[875] arXiv:2201.08429 (cross-list from cs.LG) [pdf, other]: Title: A Visual Analytics Approach to Building Logistic Regression Models and its Application to Health Records

Authors: Erasmo Artur, Rosane Minghim

Comments: 16 pages and 13 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[876] arXiv:2201.08676 (cross-list from cs.LG) [pdf, other]: Title: Distance-Ratio-Based Formulation for Metric Learning

Authors: Hyeongji Kim, Pekka Parviainen, Ketil Malde

Comments: 17 pages. Codes for our experiments are available in this https URL. Perhaps, we will write a new version with experiments using normalized embedding and common metric learning performance metrics

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[877] arXiv:2201.09130 (cross-list from cs.AI) [pdf, ps, other]: Title: Artificial Intelligence for Suicide Assessment using Audiovisual Cues: A Review

Authors: Sahraoui Dhelim, Liming Chen, Huansheng Ning, Chris Nugent

Comments: Manuscript submitted to Artificial Intelligence Reviews (2022)

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[878] arXiv:2201.09165 (cross-list from cs.MM) [pdf, other]: Title: A Pre-trained Audio-Visual Transformer for Emotion Recognition

Authors: Minh Tran, Mohammad Soleymani

Comments: Accepted by IEEE ICASSP 2022

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[879] arXiv:2201.09196 (cross-list from cs.LG) [pdf, other]: Title: Learning to Predict Gradients for Semi-Supervised Continual Learning

Authors: Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao

Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

Journal-ref: IEEE Transactions on Neural Networks and Learning Systems, 2024

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[880] arXiv:2201.09243 (cross-list from cs.CR) [pdf, other]: Title: Increasing the Cost of Model Extraction with Calibrated Proof of Work

Authors: Adam Dziedzic, Muhammad Ahmad Kaleem, Yu Shen Lu, Nicolas Papernot

Comments: Published as a conference paper at ICLR 2022 (Spotlight - 5% of submitted papers)

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[881] arXiv:2201.09367 (cross-list from cs.GR) [pdf, other]: Title: Sketch2PQ: Freeform Planar Quadrilateral Mesh Design via a Single Sketch

Authors: Zhi Deng, Yang Liu, Hao Pan, Wassim Jabi, Juyong Zhang, Bailin Deng

Comments: To appear in IEEE Transactions on Visualization and Computer Graphics

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[882] arXiv:2201.09463 (cross-list from cs.SE) [pdf, other]: Title: Cyber Mobility Mirror for Enabling Cooperative Driving Automation in Mixed Traffic: A Co-Simulation Platform

Authors: Zhengwei Bai, Guoyuan Wu, Xuewei Qi, Yongkang Liu, Kentaro Oguchi, Matthew J. Barth

Comments: Accepted by the IEEE Intelligent Transportation Systems Magazine

Journal-ref: IEEE Intelligent Transportation Systems Magazine 2022

Subjects: Software Engineering (cs.SE); Computer Vision and Pattern Recognition (cs.CV)
[883] arXiv:2201.09487 (cross-list from cs.CR) [pdf, ps, other]: Title: Forgery Attack Detection in Surveillance Video Streams Using Wi-Fi Channel State Information

Authors: Yong Huang, Xiang Li, Wei Wang, Tao Jiang, Qian Zhang

Comments: To appear in IEEE Transactions on Wireless Communications. arXiv admin note: text overlap with arXiv:2101.00848

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[884] arXiv:2201.09671 (cross-list from cs.LG) [pdf, other]: Title: Analyzing Multispectral Satellite Imagery of South American Wildfires Using Deep Learning

Authors: Christopher Sun

Comments: IEEE International Conference on Applied Artificial Intelligence (May 2022)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[885] arXiv:2201.09679 (cross-list from cs.LG) [pdf, other]: Title: A Review of Deep Transfer Learning and Recent Advancements

Authors: Mohammadreza Iman, Khaled Rasheed, Hamid R. Arabnia

Comments: 18 pages, 2 figures, 1 table

Journal-ref: Technologies 2023, 11, 40

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[886] arXiv:2201.09725 (cross-list from cs.LG) [pdf, ps, other]: Title: Machine Learning Algorithms for Prediction of Penetration Depth and Geometrical Analysis of Weld in Friction Stir Spot Welding Process

Authors: Akshansh Mishra, Raheem Al-Sabur, Ahmad K. Jassim

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[887] arXiv:2201.09765 (cross-list from cs.LG) [pdf, ps, other]: Title: Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning

Authors: Haichao Zhang, Wei Xu, Haonan Yu

Comments: Spotlight paper at the 10th International Conference on Learning Representations (ICLR 2022)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[888] arXiv:2201.09828 (cross-list from cs.LG) [pdf, other]: Title: MMLatch: Bottom-up Top-down Fusion for Multimodal Sentiment Analysis

Authors: Georgios Paraskevopoulos, Efthymios Georgiou, Alexandros Potamianos

Comments: Accepted, ICASSP 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[889] arXiv:2201.09884 (cross-list from cs.LG) [pdf, other]: Title: AutoMC: Automated Model Compression based on Domain Knowledge and Progressive search strategy

Authors: Chunnan Wang, Hongzhi Wang, Xiangyu Shi

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[890] arXiv:2201.10000 (cross-list from cs.LG) [pdf, other]: Title: Neural Manifold Clustering and Embedding

Authors: Zengyi Li, Yubei Chen, Yann LeCun, Friedrich T. Sommer

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[891] arXiv:2201.10266 (cross-list from cs.AI) [pdf, other]: Title: Combining Commonsense Reasoning and Knowledge Acquisition to Guide Deep Learning in Robotics

Authors: Mohan Sridharan, Tiago Mota

Comments: 37 pages, 17 figures, 5 tables

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Logic in Computer Science (cs.LO); Robotics (cs.RO)
[892] arXiv:2201.10353 (cross-list from cs.LG) [pdf, ps, other]: Title: A Multi-modal Fusion Framework Based on Multi-task Correlation Learning for Cancer Prognosis Prediction

Authors: Kaiwen Tan, Weixian Huang, Xiaofeng Liu, Jinlong Hu, Shoubin Dong

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[893] arXiv:2201.10444 (cross-list from cs.LG) [pdf, other]: Title: AggMatch: Aggregating Pseudo Labels for Semi-Supervised Learning

Authors: Jiwon Kim, Kwangrok Ryoo, Gyuseong Lee, Seokju Cho, Junyoung Seo, Daehwan Kim, Hansang Cho, Seungryong Kim

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[894] arXiv:2201.10859 (cross-list from cs.LG) [pdf, other]: Title: Visualizing the Diversity of Representations Learned by Bayesian Neural Networks

Authors: Dennis Grinwald, Kirill Bykov, Shinichi Nakajima, Marina M.-C. Höhne

Comments: 16 pages, 18 figures

Journal-ref: Published in Transactions on Machine Learning Research (11/2023)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[895] arXiv:2201.10890 (cross-list from cs.LG) [pdf, other]: Title: One Student Knows All Experts Know: From Sparse to Dense

Authors: Fuzhao Xue, Xiaoxin He, Xiaozhe Ren, Yuxuan Lou, Yang You

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[896] arXiv:2201.10899 (cross-list from cs.LG) [pdf, other]: Title: Speeding up Heterogeneous Federated Learning with Sequentially Trained Superclients

Authors: Riccardo Zaccone, Andrea Rizzardi, Debora Caldarola, Marco Ciccone, Barbara Caputo

Comments: Published at the 26th International Conference on Pattern Recognition (ICPR), 2022, pp. 3376-3382

Journal-ref: 26th International Conference on Pattern Recognition (ICPR), 2022, pp. 3376-3382

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[897] arXiv:2201.10947 (cross-list from cs.LG) [pdf, ps, other]: Title: Enabling Deep Learning on Edge Devices through Filter Pruning and Knowledge Transfer

Authors: Kaiqi Zhao, Yitao Chen, Ming Zhao

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[898] arXiv:2201.11259 (cross-list from cs.LG) [pdf, other]: Title: Controlling Directions Orthogonal to a Classifier

Authors: Yilun Xu, Hao He, Tianxiao Shen, Tommi Jaakkola

Comments: Accepted by ICLR 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[899] arXiv:2201.11511 (cross-list from cs.LG) [pdf, ps, other]: Title: Density-Aware Hyper-Graph Neural Networks for Graph-based Semi-supervised Node Classification

Authors: Jianpeng Liao, Qian Tao, Jun Yan

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[900] arXiv:2201.11613 (cross-list from cs.LG) [pdf, other]: Title: Domain-Invariant Representation Learning from EEG with Private Encoders

Authors: David Bethge, Philipp Hallgarten, Tobias Grosse-Puppendahl, Mohamed Kari, Ralf Mikut, Albrecht Schmidt, Ozan Özdenizci

Comments: 5 pages, 1 figure

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[901] arXiv:2201.11678 (cross-list from cs.LG) [pdf, other]: Title: Unsupervised Change Detection using DRE-CUSUM

Authors: Sudarshan Adiga, Ravi Tandon

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[902] arXiv:2201.11679 (cross-list from cs.LG) [pdf, other]: Title: DropNAS: Grouped Operation Dropout for Differentiable Architecture Search

Authors: Weijun Hong, Guilin Li, Weinan Zhang, Ruiming Tang, Yunhe Wang, Zhenguo Li, Yong Yu

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[903] arXiv:2201.11706 (cross-list from cs.LG) [pdf, other]: Title: A Systematic Study of Bias Amplification

Authors: Melissa Hall, Laurens van der Maaten, Laura Gustafson, Maxwell Jones, Aaron Adcock

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[904] arXiv:2201.11732 (cross-list from cs.CL) [pdf, other]: Title: IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages

Authors: Emanuele Bugliarello, Fangyu Liu, Jonas Pfeiffer, Siva Reddy, Desmond Elliott, Edoardo Maria Ponti, Ivan Vulić

Comments: ICML 2022

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[905] arXiv:2201.11812 (cross-list from cs.CR) [pdf, other]: Title: A Transfer Learning and Optimized CNN Based Intrusion Detection System for Internet of Vehicles

Authors: Li Yang, Abdallah Shami

Comments: Accepted and to appear in IEEE International Conference on Communications (ICC); code is available on GitHub at this https URL

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[906] arXiv:2201.11844 (cross-list from cs.CR) [pdf, ps, other]: Title: Speckle-based optical cryptosystem and its application for human face recognition via deep learning

Authors: Qi Zhao, Huanhao Li, Zhipeng Yu, Chi Man Woo, Tianting Zhong, Shengfu Cheng, Yuanjin Zheng, Honglin Liu, Jie Tian, Puxiang Lai

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
[907] arXiv:2201.11857 (cross-list from cs.LG) [pdf, other]: Title: Using Shape Metrics to Describe 2D Data Points

Authors: William Franz Lamberti

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[908] arXiv:2201.11944 (cross-list from cs.RO) [pdf, other]: Title: DICP: Doppler Iterative Closest Point Algorithm

Authors: Bruno Hexsel, Heethesh Vhavle, Yi Chen

Comments: Accepted at Robotics: Science and Systems (RSS) 2022

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[909] arXiv:2201.11999 (cross-list from cs.SD) [pdf, other]: Title: Dual Learning Music Composition and Dance Choreography

Authors: Shuang Wu, Zhenguang Li, Shijian Lu, Li Cheng

Comments: ACMMM 2021 (Oral)

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[910] arXiv:2201.12107 (cross-list from cs.AI) [pdf, ps, other]: Title: Feature Visualization within an Automated Design Assessment leveraging Explainable Artificial Intelligence Methods

Authors: Raoul Schönhof, Artem Werner, Jannes Elstner, Boldizsar Zopcsak, Ramez Awad, Marco Huber

Comments: CIRP Design 2021, 10.1016/j.procir.2021.05.075

Journal-ref: 2021, Procedia CIRP 100(7):331-336

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[911] arXiv:2201.12114 (cross-list from cs.LG) [pdf, other]: Title: Rethinking Attention-Model Explainability through Faithfulness Violation Test

Authors: Yibing Liu, Haoliang Li, Yangyang Guo, Chenqi Kong, Jing Li, Shiqi Wang

Comments: Accepted to ICML 2022

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[912] arXiv:2201.12123 (cross-list from cs.LG) [pdf, other]: Title: DELAUNAY: a dataset of abstract art for psychophysical and machine learning research

Authors: Camille Gontier, Jakob Jordan, Mihai A. Petrovici

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[913] arXiv:2201.12179 (cross-list from cs.LG) [pdf, other]: Title: Plug & Play Attacks: Towards Robust and Flexible Model Inversion Attacks

Authors: Lukas Struppek, Dominik Hintersdorf, Antonio De Almeida Correia, Antonia Adler, Kristian Kersting

Comments: Accepted by ICML 2022

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[914] arXiv:2201.12240 (cross-list from cs.LG) [pdf, other]: Title: Continuous Deep Equilibrium Models: Training Neural ODEs faster by integrating them to Infinity

Authors: Avik Pal, Alan Edelman, Christopher Rackauckas

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Dynamical Systems (math.DS)
[915] arXiv:2201.12296 (cross-list from cs.LG) [pdf, other]: Title: Benchmarking Robustness of 3D Point Cloud Recognition Against Common Corruptions

Authors: Jiachen Sun, Qingzhao Zhang, Bhavya Kailkhura, Zhiding Yu, Chaowei Xiao, Z. Morley Mao

Comments: Codebase and dataset are included in this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[916] arXiv:2201.12311 (cross-list from cs.LG) [pdf, ps, other]: Title: REET: Robustness Evaluation and Enhancement Toolbox for Computational Pathology

Authors: Alex Foote, Amina Asif, Nasir Rajpoot, Fayyaz Minhas

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[917] arXiv:2201.12351 (cross-list from cs.LG) [pdf, other]: Title: Low-rank features based double transformation matrices learning for image classification

Authors: Yu-Hong Cai, Xiao-Jun Wu, Zhe Chen

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[918] arXiv:2201.12382 (cross-list from cs.AI) [pdf, other]: Title: Deep Learning Methods for Abstract Visual Reasoning: A Survey on Raven's Progressive Matrices

Authors: Mikołaj Małkiński, Jacek Mańdziuk

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[919] arXiv:2201.12406 (cross-list from cs.LG) [pdf, other]: Title: Syfer: Neural Obfuscation for Private Data Release

Authors: Adam Yala, Victor Quach, Homa Esfahanizadeh, Rafael G. L. D'Oliveira, Ken R. Duffy, Muriel Médard, Tommi S. Jaakkola, Regina Barzilay

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[920] arXiv:2201.12577 (cross-list from cs.CR) [pdf, other]: Title: Volley Revolver: A Novel Matrix-Encoding Method for Privacy-Preserving Neural Networks (Inference)

Authors: John Chiang

Comments: The encoding method proposed in this work, $\texttt{Volley Revolver}$, is tailored for privacy-preserving neural networks. It may also be usable to assist private neural network training, in which case, for the backpropagation algorithm of the fully-connected layer, the first matrix $A$ is revolved while the second matrix $B$ is kept still

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[921] arXiv:2201.12604 (cross-list from cs.LG) [pdf, other]: Title: Learning Fast, Learning Slow: A General Continual Learning Method based on Complementary Learning System

Authors: Elahe Arani, Fahad Sarfraz, Bahram Zonooz

Comments: Published as a conference paper at ICLR 2022 (camera-ready version)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[922] arXiv:2201.12678 (cross-list from cs.LG) [pdf, ps, other]: Title: A Stochastic Bundle Method for Interpolating Networks

Authors: Alasdair Paren, Leonard Berrada, Rudra P. K. Poudel, M. Pawan Kumar

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[923] arXiv:2201.12680 (cross-list from cs.LG) [pdf, other]: Title: Understanding Deep Contrastive Learning via Coordinate-wise Optimization

Authors: Yuandong Tian

Comments: Add code links

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[924] arXiv:2201.12716 (cross-list from cs.RO) [pdf, other]: Title: You Only Demonstrate Once: Category-Level Manipulation from Single Visual Demonstration

Authors: Bowen Wen, Wenzhao Lian, Kostas Bekris, Stefan Schaal

Journal-ref: Robotics: Science and Systems (RSS) 2022

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[925] arXiv:2201.12803 (cross-list from cs.LG) [pdf, other]: Title: Generalizing similarity in noisy setups: the DIBS phenomenon

Authors: Nayara Fonseca, Veronica Guidetti

Comments: v3: version accepted at ECAI 2023 + Supplementary Material

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[926] arXiv:2201.12896 (cross-list from cs.LG) [pdf, other]: Title: Augmenting Novelty Search with a Surrogate Model to Engineer Meta-Diversity in Ensembles of Classifiers

Authors: Rui P. Cardoso, Emma Hart, David Burth Kurka, Jeremy V. Pitt

Comments: 16 pages, 4 figures, 3 tables, EvoStar 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[927] arXiv:2201.12904 (cross-list from cs.LG) [pdf, other]: Title: COIN++: Neural Compression Across Modalities

Authors: Emilien Dupont, Hrushikesh Loya, Milad Alizadeh, Adam Goliński, Yee Whye Teh, Arnaud Doucet

Comments: TMLR camera ready

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
[928] arXiv:2201.12910 (cross-list from cs.LG) [pdf, other]: Title: Sparse Centroid-Encoder: A Nonlinear Model for Feature Selection

Authors: Tomojit Ghosh, Michael Kirby

Comments: 13 pages, 56 figures, 5 tables. Used 12 data sets and 5 state-of-the-art models for comparison

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[929] arXiv:2201.12926 (cross-list from cs.CL) [pdf, other]: Title: Compositionality as Lexical Symmetry

Authors: Ekin Akyürek, Jacob Andreas

Comments: ACL 2023 final version

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[930] arXiv:2201.13168 (cross-list from cs.GR) [pdf, other]: Title: SPAGHETTI: Editing Implicit Shapes Through Part Aware Generation

Authors: Amir Hertz, Or Perel, Raja Giryes, Olga Sorkine-Hornung, Daniel Cohen-Or

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[931] arXiv:2201.13190 (cross-list from cs.GR) [pdf, other]: Title: Differentiable Neural Radiosity

Authors: Saeed Hadadan, Matthias Zwicker

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[932] arXiv:2201.13361 (cross-list from cs.LG) [pdf, other]: Title: Signing the Supermask: Keep, Hide, Invert

Authors: Nils Koster, Oliver Grothe, Achim Rettinger

Comments: ICLR 2022 camera ready

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[933] arXiv:2201.00084 (cross-list from eess.IV) [pdf, other]: Title: Performance Comparison of Deep Learning Architectures for Artifact Removal in Gastrointestinal Endoscopic Imaging

Authors: Taira Watanabe, Kensuke Tanioka, Satoru Hiwa, Tomoyuki Hiroyasu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[934] arXiv:2201.00100 (cross-list from eess.IV) [pdf, other]: Title: Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images

Authors: Xiaoqiang Wang, Lei Zhu, Siliang Tang, Huazhu Fu, Ping Li, Fei Wu, Yi Yang, Yueting Zhuang

Comments: Accepted by IEEE TIP

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[935] arXiv:2201.00155 (cross-list from eess.IV) [pdf, other]: Title: Adaptive Single Image Deblurring

Authors: Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan

Comments: arXiv admin note: substantial text overlap with arXiv:2004.05343

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[936] arXiv:2201.00163 (cross-list from eess.IV) [pdf, other]: Title: Development of Diabetic Foot Ulcer Datasets: An Overview

Authors: Moi Hoon Yap, Connah Kendrick, Neil D. Reeves, Manu Goyal, Joseph M. Pappachan, Bill Cassidy

Comments: Preprint (author copy) to be published in MICCAI DFUC2021 Proceedings

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[937] arXiv:2201.00169 (cross-list from eess.IV) [pdf, other]: Title: Dynamic Scene Video Deblurring using Non-Local Attention

Authors: Maitreya Suin, A. N. Rajagopalan

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[938] arXiv:2201.00187 (cross-list from eess.IV) [pdf, other]: Title: Image Restoration using Feature-guidance

Authors: Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[939] arXiv:2201.00227 (cross-list from eess.IV) [pdf, ps, other]: Title: Deep Learning Applications for Lung Cancer Diagnosis: A systematic review

Authors: Hesamoddin Hosseini, Reza Monsefi, Shabnam Shadroo

Comments: 32 pages, 14 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[940] arXiv:2201.00259 (cross-list from eess.IV) [pdf, other]: Title: Subspace modeling for fast and high-sensitivity X-ray chemical imaging

Authors: Jizhou Li, Bin Chen, Guibin Zan, Guannan Qian, Piero Pianetta, Yijin Liu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[941] arXiv:2201.00317 (cross-list from eess.IV) [pdf, other]: Title: Recurrent Feature Propagation and Edge Skip-Connections for Automatic Abdominal Organ Segmentation

Authors: Zefan Yang, Di Lin, Dong Ni, Yi Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[942] arXiv:2201.00337 (cross-list from eess.IV) [pdf, other]: Title: Riemannian Nearest-Regularized Subspace Classification for Polarimetric SAR images

Authors: Junfei Shi, Haiyan Jin

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[943] arXiv:2201.00404 (cross-list from q-bio.NC) [pdf, other]: Title: MHATC: Autism Spectrum Disorder identification utilizing multi-head attention encoder along with temporal consolidation modules

Authors: Ranjeet Ranjan Jha, Abhishek Bhardwaj, Devin Garg, Arnav Bhavsar, Aditya Nigam

Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[944] arXiv:2201.00414 (cross-list from eess.IV) [pdf, ps, other]: Title: FUSeg: The Foot Ulcer Segmentation Challenge

Authors: Chuanbo Wang, Amirreza Mahbod, Isabella Ellinger, Adrian Galdran, Sandeep Gopalakrishnan, Jeffrey Niezgoda, Zeyun Yu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[945] arXiv:2201.00429 (cross-list from eess.IV) [pdf, other]: Title: Image Denoising with Control over Deep Network Hallucination

Authors: Qiyuan Liang, Florian Cassayre, Haley Owsianko, Majed El Helou, Sabine Süsstrunk

Comments: Published in Electronic Imaging 2022, code available at this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[946] arXiv:2201.00458 (cross-list from eess.IV) [pdf, other]: Title: Lung-Originated Tumor Segmentation from Computed Tomography Scan (LOTUS) Benchmark

Authors: Parnian Afshar, Arash Mohammadi, Konstantinos N. Plataniotis, Keyvan Farahani, Justin Kirby, Anastasia Oikonomou, Amir Asif, Leonard Wee, Andre Dekker, Xin Wu, Mohammad Ariful Haque, Shahruk Hossain, Md. Kamrul Hasan, Uday Kamal, Winston Hsu, Jhih-Yuan Lin, M. Sohel Rahman, Nabil Ibtehaz, Sh. M. Amir Foisol, Kin-Man Lam, Zhong Guang, Runze Zhang, Sumohana S. Channappayya, Shashank Gupta, Chander Dev

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[947] arXiv:2201.00466 (cross-list from eess.IV) [pdf, other]: Title: RFormer: Transformer-based Generative Adversarial Network for Real Fundus Image Restoration on A New Clinical Benchmark

Authors: Zhuo Deng, Yuanhao Cai, Lu Chen, Zheng Gong, Qiqi Bao, Xue Yao, Dong Fang, Shaochong Zhang, Lan Ma

Comments: IEEE J-BHI 2022; The First Benchmark and First Transformer-based Method for Real Clinical Fundus Image Restoration

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[948] arXiv:2201.00636 (cross-list from eess.IV) [pdf, ps, other]: Title: Improving Feature Extraction from Histopathological Images Through A Fine-tuning ImageNet Model

Authors: Xingyu Li, Min Cen, Jinfeng Xu, Hong Zhang, Xu Steven Xu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[949] arXiv:2201.00767 (cross-list from eess.IV) [pdf, other]: Title: BDG-Net: Boundary Distribution Guided Network for Accurate Polyp Segmentation

Authors: Zihuan Qiu, Zhichuan Wang, Miaomiao Zhang, Ziyong Xu, Jie Fan, Linfeng Xu

Comments: Accepted by SPIE Medical Imaging 2022

Journal-ref: Proc. SPIE 12032, Medical Imaging 2022: Image Processing, 1203230 (4 April 2022)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[950] arXiv:2201.00820 (cross-list from eess.IV) [pdf, other]: Title: Low dosage 3D volume fluorescence microscopy imaging using compressive sensing

Authors: Varun Mannam, Jacob Brandt, Cody J. Smith, Scott Howard

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Data Analysis, Statistics and Probability (physics.data-an); Instrumentation and Detectors (physics.ins-det); Optics (physics.optics)
[951] arXiv:2201.00895 (cross-list from eess.IV) [pdf, other]: Title: A Gradient Mapping Guided Explainable Deep Neural Network for Extracapsular Extension Identification in 3D Head and Neck Cancer Computed Tomography Images

Authors: Yibin Wang, Abdur Rahman, W. Neil Duggar, P. Russell Roberts, Toms V. Thomas, Linkan Bian, Haifeng Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[952] arXiv:2201.00942 (cross-list from eess.IV) [pdf, other]: Title: External Attention Assisted Multi-Phase Splenic Vascular Injury Segmentation with Limited Data

Authors: Yuyin Zhou, David Dreizin, Yan Wang, Fengze Liu, Wei Shen, Alan L. Yuille

Comments: IEEE TMI

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[953] arXiv:2201.00957 (cross-list from eess.IV) [pdf, ps, other]: Title: Stain Normalized Breast Histopathology Image Recognition using Convolutional Neural Networks for Cancer Detection

Authors: Sruthi Krishna, Suganthi S.S, Shivsubramani Krishnamoorthy, Arnav Bhavsar

Comments: 26 pages, 11 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[954] arXiv:2201.01014 (cross-list from eess.IV) [pdf, other]: Title: Local Motion and Contrast Priors Driven Deep Network for Infrared Small Target Super-Resolution

Authors: Xinyi Ying, Yingqian Wang, Longguang Wang, Weidong Sheng, Li Liu, Zaiping Lin, Shilin Zhou

Journal-ref: JSTARS 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[955] arXiv:2201.01034 (cross-list from eess.IV) [pdf, other]: Title: Uncovering the Over-smoothing Challenge in Image Super-Resolution: Entropy-based Quantification and Contrastive Optimization

Authors: Tianshuo Xu, Lijiang Li, Peng Mi, Xiawu Zheng, Fei Chao, Rongrong Ji, Yonghong Tian, Qiang Shen

Comments: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[956] arXiv:2201.01173 (cross-list from eess.IV) [pdf, other]: Title: DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression

Authors: Yi Ma, Yongqi Zhai, Ronggang Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[957] arXiv:2201.01266 (cross-list from eess.IV) [pdf, other]: Title: Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images

Authors: Ali Hatamizadeh, Vishwesh Nath, Yucheng Tang, Dong Yang, Holger Roth, Daguang Xu

Comments: 13 pages, 3 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[958] arXiv:2201.01380 (cross-list from eess.IV) [pdf, other]: Title: Image Processing Methods for Coronal Hole Segmentation, Matching, and Map Classification

Authors: V. Jatla, M.S. Pattichis, C.N. Arge

Journal-ref: IEEE Transactions on Image Processing 29 (2019): 1641-1653

Subjects: Image and Video Processing (eess.IV); Solar and Stellar Astrophysics (astro-ph.SR); Computer Vision and Pattern Recognition (cs.CV)
[959] arXiv:2201.01426 (cross-list from eess.IV) [pdf, other]: Title: Advancing 3D Medical Image Analysis with Variable Dimension Transform based Supervised 3D Pre-training

Authors: Shu Zhang, Zihao Li, Hong-Yu Zhou, Jiechao Ma, Yizhou Yu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[960] arXiv:2201.01443 (cross-list from eess.IV) [pdf, other]: Title: Neural KEM: A Kernel Method with Deep Coefficient Prior for PET Image Reconstruction

Authors: Siqi Li, Kuang Gong, Ramsey D. Badawi, Edward J. Kim, Jinyi Qi, Guobao Wang

Comments: arXiv admin note: text overlap with arXiv:2110.01174

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[961] arXiv:2201.01449 (cross-list from eess.IV) [pdf, other]: Title: Deep Learning-Based Sparse Whole-Slide Image Analysis for the Diagnosis of Gastric Intestinal Metaplasia

Authors: Jon Braatz, Pranav Rajpurkar, Stephanie Zhang, Andrew Y. Ng, Jeanne Shen

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[962] arXiv:2201.01453 (cross-list from eess.IV) [pdf, other]: Title: Robust photon-efficient imaging using a pixel-wise residual shrinkage network

Authors: Gongxin Yao, Yiwei Chen, Yong Liu, Xiaomin Hu, Yu Pan

Journal-ref: Optics Express 30(11):18856-18873, 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[963] arXiv:2201.01458 (cross-list from eess.IV) [pdf, other]: Title: Cross-SRN: Structure-Preserving Super-Resolution Network with Cross Convolution

Authors: Yuqing Liu, Qi Jia, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[964] arXiv:2201.01492 (cross-list from eess.IV) [pdf, other]: Title: FAVER: Blind Quality Prediction of Variable Frame Rate Videos

Authors: Qi Zheng, Zhengzhong Tu, Pavan C. Madhusudana, Xiaoyang Zeng, Alan C. Bovik, Yibo Fan

Comments: 12 pages, 8 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[965] arXiv:2201.01586 (cross-list from eess.IV) [pdf, other]: Title: Learning True Rate-Distortion-Optimization for End-To-End Image Compression

Authors: Fabian Brand, Kristian Fischer, Alexander Kopte, André Kaup

Comments: Accepted to DCC as Poster

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[966] arXiv:2201.01778 (cross-list from quant-ph) [pdf, other]: Title: Quantum Capsule Networks

Authors: Zidu Liu, Pei-Xin Shen, Weikang Li, L.-M. Duan, Dong-Ling Deng

Comments: 7 pages (main text) + 8 pages (supplementary information), 8 figures

Journal-ref: Quantum Sci. Technol. 8 015016 (2022)

Subjects: Quantum Physics (quant-ph); Disordered Systems and Neural Networks (cond-mat.dis-nn); Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[967] arXiv:2201.01832 (cross-list from eess.IV) [pdf, ps, other]: Title: Multiple Sclerosis Lesions Segmentation using Attention-Based CNNs in FLAIR Images

Authors: Mehdi SadeghiBakhi, Hamidreza Pourreza, Hamidreza Mahyar

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[968] arXiv:2201.01838 (cross-list from eess.IV) [pdf, other]: Title: Lumbar Bone Mineral Density Estimation from Chest X-ray Images: Anatomy-aware Attentive Multi-ROI Modeling

Authors: Fakai Wang, Kang Zheng, Le Lu, Jing Xiao, Min Wu, Chang-Fu Kuo, Shun Miao

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[969] arXiv:2201.01893 (cross-list from eess.IV) [pdf, other]: Title: Flow-Guided Sparse Transformer for Video Deblurring

Authors: Jing Lin, Yuanhao Cai, Xiaowan Hu, Haoqian Wang, Youliang Yan, Xueyi Zou, Henghui Ding, Yulun Zhang, Radu Timofte, Luc Van Gool

Comments: ICML 2022; The First Transformer-based method for Video Deblurring

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[970] arXiv:2201.02184 (cross-list from eess.AS) [pdf, other]: Title: Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction

Authors: Bowen Shi, Wei-Ning Hsu, Kushal Lakhotia, Abdelrahman Mohamed

Comments: ICLR 2022

Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[971] arXiv:2201.02198 (cross-list from eess.IV) [pdf, other]: Title: 3D Intracranial Aneurysm Classification and Segmentation via Unsupervised Dual-branch Learning

Authors: Di Shao, Xuequan Lu, Xiao Liu

Comments: under review (corresponding: {xuequan.lu@deakin.edu.au})

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[972] arXiv:2201.02242 (cross-list from eess.IV) [pdf, other]: Title: A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration

Authors: Aline Sindel (1), Bettina Hohberger (2), Sebastian Fassihi Dehcordi (2), Christian Mardin (2), Robert Lämmer (2), Andreas Maier (1), Vincent Christlein (1) ((1) Pattern Recognition Lab, FAU Erlangen-Nürnberg, (2) Department of Ophthalmology, Universitätsklinikum Erlangen)

Comments: 6 pages, 4 figures, 1 table, accepted to BVM 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[973] arXiv:2201.02295 (cross-list from eess.IV) [pdf, ps, other]: Title: Persistent Homology for Breast Tumor Classification using Mammogram Scans

Authors: Aras Asaad, Dashti Ali, Taban Majeed, Rasber Rashid

Comments: 14 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Algebraic Topology (math.AT)
[974] arXiv:2201.02309 (cross-list from eess.IV) [pdf, other]: Title: A three-dimensional dual-domain deep network for high-pitch and sparse helical CT reconstruction

Authors: Wei Wang, Xiang-Gen Xia, Chuanjiang He, Zemin Ren, Jian Lu

Comments: 13 pages, 5 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[975] arXiv:2201.02314 (cross-list from eess.IV) [pdf, other]: Title: RestoreDet: Degradation Equivariant Representation for Object Detection in Low Resolution Images

Authors: Ziteng Cui, Yingying Zhu, Lin Gu, Guo-Jun Qi, Xiaoxiao Li, Peng Gao, Zenghui Zhang, Tatsuya Harada

Comments: 11 pages, 3 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[976] arXiv:2201.02350 (cross-list from eess.IV) [pdf, ps, other]: Title: Multiresolution Fully Convolutional Networks to detect Clouds and Snow through Optical Satellite Images

Authors: Debvrat Varshney, Claudio Persello, Prasun Kumar Gupta, Bhaskar Ramachandra Nikam

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[977] arXiv:2201.02356 (cross-list from eess.IV) [pdf, other]: Title: Cross-Modality Deep Feature Learning for Brain Tumor Segmentation

Authors: Dingwen Zhang, Guohai Huang, Qiang Zhang, Jungong Han, Junwei Han, Yizhou Yu

Comments: Published in Pattern Recognition, 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[978] arXiv:2201.02409 (cross-list from eess.IV) [pdf, other]: Title: Amplitude SAR Imagery Splicing Localization

Authors: Edoardo Daniele Cannas, Nicolò Bonettini, Sara Mandelli, Paolo Bestagini, Stefano Tubaro

Comments: The manuscript has been published in IEEE Access. Changes include the full citation to the IEEE published version

Journal-ref: in IEEE Access, vol. 10, pp. 33882-33899, 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[979] arXiv:2201.02420 (cross-list from eess.IV) [pdf, ps, other]: Title: Auto-Weighted Layer Representation Based View Synthesis Distortion Estimation for 3-D Video Coding

Authors: Jian Jin, Xingxing Zhang, Lili Meng, Weisi Lin, Jie Liang, Huaxiang Zhang, Yao Zhao

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[980] arXiv:2201.02428 (cross-list from eess.IV) [pdf, other]: Title: Effect of Prior-based Losses on Segmentation Performance: A Benchmark

Authors: Rosana El Jurdi, Caroline Petitjean, Veronika Cheplygina, Paul Honeine, Fahed Abdallah

Comments: To be submitted to SPIE: Journal of Medical Imaging

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[981] arXiv:2201.02445 (cross-list from eess.IV) [pdf, other]: Title: Negative Evidence Matters in Interpretable Histology Image Classification

Authors: Soufiane Belharbi, Marco Pedersoli, Ismail Ben Ayed, Luke McCaffrey, Eric Granger

Comments: 9 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[982] arXiv:2201.02475 (cross-list from eess.IV) [pdf, other]: Title: Deep Domain Adversarial Adaptation for Photon-efficient Imaging

Authors: Yiwei Chen, Gongxin Yao, Yong Liu, Hongye Su, Xiaomin Hu, Yu Pan

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[983] arXiv:2201.02574 (cross-list from eess.IV) [pdf, other]: Title: An Incremental Learning Approach to Automatically Recognize Pulmonary Diseases from the Multi-vendor Chest Radiographs

Authors: Mehreen Sirshar, Taimur Hassan, Muhammad Usman Akram, Shoab Ahmed Khan

Comments: Computers in Biology and Medicine

Journal-ref: Computers in Biology and Medicine, 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[984] arXiv:2201.02624 (cross-list from eess.IV) [pdf, other]: Title: Microdosing: Knowledge Distillation for GAN based Compression

Authors: Leonhard Helminger, Roberto Azevedo, Abdelaziz Djelouah, Markus Gross, Christopher Schroers

Comments: BMVC 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[985] arXiv:2201.02625 (cross-list from eess.IV) [pdf, other]: Title: FlexHDR: Modelling Alignment and Exposure Uncertainties for Flexible HDR Imaging

Authors: Sibi Catley-Chandar, Thomas Tanay, Lucas Vandroux, Aleš Leonardis, Gregory Slabaugh, Eduardo Pérez-Pellitero

Comments: Accepted to IEEE Transactions on Image Processing (TIP) 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[986] arXiv:2201.02627 (cross-list from eess.IV) [pdf, other]: Title: Learning with Less Labels in Digital Pathology via Scribble Supervision from Natural Images

Authors: Eu Wern Teh, Graham W. Taylor

Comments: To appear in IEEE International Symposium on Biomedical Imaging (ISBI) 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[987] arXiv:2201.02629 (cross-list from eess.IV) [pdf, other]: Title: United adversarial learning for liver tumor segmentation and detection of multi-modality non-contrast MRI

Authors: Jianfeng Zhao, Dengwang Li, Shuo Li

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[988] arXiv:2201.02656 (cross-list from eess.IV) [pdf, other]: Title: GPU-Net: Lightweight U-Net with more diverse features

Authors: Heng Yu, Di Fan, Weihu Song

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[989] arXiv:2201.02689 (cross-list from eess.IV) [pdf, ps, other]: Title: Video Coding for Machines: Partial transmission of SIFT features

Authors: Sławomir Maćkowiak, Marek Domański, Sławomir Różek, Dominik Cywiński, Jakub Szkiełda

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[990] arXiv:2201.02746 (cross-list from eess.IV) [pdf, ps, other]: Title: Expert Knowledge-guided Geometric Representation Learning for Magnetic Resonance Imaging-based Glioma Grading

Authors: Yeqi Wang, Longfei Li, Cheng Li, Yan Xi, Hairong Zheng, Yusong Lin, Shanshan Wang

Comments: 10 pages, 9 figures, 2 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[991] arXiv:2201.02771 (cross-list from eess.IV) [pdf, other]: Title: A Sneak Attack on Segmentation of Medical Images Using Deep Neural Network Classifiers

Authors: Shuyue Guan, Murray Loew

Comments: 8 pages, 10 figures. Accepted by IEEE AIPR 2021 (Oral)

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[992] arXiv:2201.02812 (cross-list from eess.IV) [pdf, other]: Title: Hyperspectral Image Denoising Using Non-convex Local Low-rank and Sparse Separation with Spatial-Spectral Total Variation Regularization

Authors: Chong Peng, Yang Liu, Yongyong Chen, Xinxin Wu, Andrew Cheng, Zhao Kang, Chenglizhao Chen, Qiang Cheng

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[993] arXiv:2201.02821 (cross-list from eess.IV) [pdf, ps, other]: Title: Classification of Hyperspectral Images by Using Spectral Data and Fully Connected Neural Network

Authors: Zumray Dokur, Tamer Olmez

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[994] arXiv:2201.02831 (cross-list from eess.IV) [pdf, other]: Title: CrossMoDA 2021 challenge: Benchmark of Cross-Modality Domain Adaptation techniques for Vestibular Schwannoma and Cochlea Segmentation

Authors: Reuben Dorent, Aaron Kujawa, Marina Ivory, Spyridon Bakas, Nicola Rieke, Samuel Joutard, Ben Glocker, Jorge Cardoso, Marc Modat, Kayhan Batmanghelich, Arseniy Belkov, Maria Baldeon Calisto, Jae Won Choi, Benoit M. Dawant, Hexin Dong, Sergio Escalera, Yubo Fan, Lasse Hansen, Mattias P. Heinrich, Smriti Joshi, Victoriya Kashtanova, Hyeon Gyu Kim, Satoshi Kondo, Christian N. Kruse, Susana K. Lai-Yuen, Hao Li, Han Liu, Buntheng Ly, Ipek Oguz, Hyungseob Shin, Boris Shirokikh, Zixian Su, Guotai Wang, Jianghao Wu, Yanwu Xu, Kai Yao, Li Zhang, Sebastien Ourselin, Jonathan Shapey, Tom Vercauteren

Comments: In Medical Image Analysis

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[995] arXiv:2201.02832 (cross-list from eess.IV) [pdf, other]: Title: SGUIE-Net: Semantic Attention Guided Underwater Image Enhancement with Multi-Scale Perception

Authors: Qi Qi, Kunqian Li, Haiyong Zheng, Xiang Gao, Guojia Hou, Kun Sun

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[996] arXiv:2201.02833 (cross-list from eess.IV) [pdf, other]: Title: Weighted Encoding Optimization for Dynamic Single-pixel Imaging and Sensing

Authors: Xinrui Zhan, Liheng Bian, Chunli Zhu, Jun Zhang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[997] arXiv:2201.02867 (cross-list from eess.IV) [pdf, other]: Title: Deep Generative Modeling for Volume Reconstruction in Cryo-Electron Microscopy

Authors: Claire Donnat, Axel Levy, Frederic Poitevin, Ellen Zhong, Nina Miolane

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[998] arXiv:2201.02876 (cross-list from eess.IV) [pdf, other]: Title: Defocus Deblur Microscopy via Head-to-Tail Cross-scale Fusion

Authors: Jiahe Wang, Boran Han

Comments: Published at ICIP 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[999] arXiv:2201.02973 (cross-list from eess.IV) [pdf, other]: Title: MAXIM: Multi-Axis MLP for Image Processing

Authors: Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan Bovik, Yinxiao Li

Comments: CVPR 2022 Oral; Code: this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1000] arXiv:2201.02979 (cross-list from eess.IV) [pdf, ps, other]: Title: Enhanced total variation minimization for stable image reconstruction

Authors: Congpei An, Hao-Ning Wu, Xiaoming Yuan

Comments: 29 pages, 8 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Numerical Analysis (math.NA)
[1001] arXiv:2201.03016 (cross-list from eess.IV) [pdf, ps, other]: Title: Learning from Synthetic InSAR with Vision Transformers: The case of volcanic unrest detection

Authors: Nikolaos Ioannis Bountos, Dimitrios Michail, Ioannis Papoutsis

Comments: This work has been submitted to the IEEE for possible publication

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1002] arXiv:2201.03050 (cross-list from eess.IV) [pdf, ps, other]: Title: Lung infection and normal region segmentation from CT volumes of COVID-19 cases

Authors: Masahiro Oda, Yuichiro Hayashi, Yoshito Otake, Masahiro Hashimoto, Toshiaki Akashi, Kensaku Mori

Comments: Accepted paper as a poster presentation at SPIE Medical Imaging 2021

Journal-ref: Proceedings of SPIE Medical Imaging 2021: Computer-Aided Diagnosis, Vol.11597, 115972X-1-6

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1003] arXiv:2201.03053 (cross-list from eess.IV) [pdf, other]: Title: COVID-19 Infection Segmentation from Chest CT Images Based on Scale Uncertainty

Authors: Masahiro Oda, Tong Zheng, Yuichiro Hayashi, Yoshito Otake, Masahiro Hashimoto, Toshiaki Akashi, Shigeki Aoki, Kensaku Mori

Comments: Accepted paper as an oral presentation at CLIP 2021, 10th MICCAI CLIP Workshop

Journal-ref: DCL 2021, PPML 2021, LL-COVID19 2021, CLIP 2021, Lecture Notes in Computer Science (LNCS) 12969, pp.88-97

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1004] arXiv:2201.03114 (cross-list from eess.SP) [pdf, other]: Title: Signal Reconstruction from Quantized Noisy Samples of the Discrete Fourier Transform

Authors: Mohak Goyal, Animesh Kumar

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1005] arXiv:2201.03131 (cross-list from astro-ph.GA) [pdf, other]: Title: Systematic biases when using deep neural networks for annotating large catalogs of astronomical images

Authors: Sanchari Dhar, Lior Shamir

Comments: A&C, accepted

Subjects: Astrophysics of Galaxies (astro-ph.GA); Cosmology and Nongalactic Astrophysics (astro-ph.CO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1006] arXiv:2201.03145 (cross-list from eess.IV) [pdf, other]: Title: Enhancing Low-Light Images in Real World via Cross-Image Disentanglement

Authors: Lanqing Guo, Renjie Wan, Wenhan Yang, Alex Kot, Bihan Wen

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1007] arXiv:2201.03186 (cross-list from eess.IV) [pdf, other]: Title: MyoPS: A Benchmark of Myocardial Pathology Segmentation Combining Three-Sequence Cardiac Magnetic Resonance Images

Authors: Lei Li, Fuping Wu, Sihan Wang, Xinzhe Luo, Carlos Martin-Isla, Shuwei Zhai, Jianpeng Zhang, Yanfei Liu, Zhen Zhang, Markus J. Ankenbrand, Haochuan Jiang, Xiaoran Zhang, Linhong Wang, Tewodros Weldebirhan Arega, Elif Altunok, Zhou Zhao, Feiyan Li, Jun Ma, Xiaoping Yang, Elodie Puybareau, Ilkay Oksuz, Stephanie Bricq, Weisheng Li, Kumaradevan Punithakumar, Sotirios A. Tsaftaris, Laura M. Schreiber, Mingjing Yang, Guocai Liu, Yong Xia, Guotai Wang, Sergio Escalera, Xiahai Zhuang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1008] arXiv:2201.03195 (cross-list from eess.IV) [pdf, other]: Title: End-to-end lossless compression of high precision depth maps guided by pseudo-residual

Authors: Yuyang Wu, Wei Gao

Comments: Data Compression Conference 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1009] arXiv:2201.03210 (cross-list from eess.IV) [pdf, other]: Title: Model-Based Image Signal Processors via Learnable Dictionaries

Authors: Marcos V. Conde, Steven McDonagh, Matteo Maggioni, Aleš Leonardis, Eduardo Pérez-Pellitero

Comments: AAAI 2022

Journal-ref: Vol. 36 No. 1: AAAI-22 Technical Tracks 1 (2022) 481-489

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1010] arXiv:2201.03230 (cross-list from eess.IV) [pdf, other]: Title: Swin Transformer for Fast MRI

Authors: Jiahao Huang, Yingying Fang, Yinzhe Wu, Huanjun Wu, Zhifan Gao, Yang Li, Javier Del Ser, Jun Xia, Guang Yang

Comments: 55 pages, 19 figures, submitted to Neurocomputing journal

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1011] arXiv:2201.03288 (cross-list from eess.IV) [pdf, other]: Title: A statistical shape model for radiation-free assessment and classification of craniosynostosis

Authors: Matthias Schaufelberger, Reinald Peter Kühle, Andreas Wachter, Frederic Weichel, Niclas Hagen, Friedemann Ringwald, Urs Eisenmann, Jürgen Hoffmann, Michael Engel, Christian Freudlsperger, Werner Nahm

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1012] arXiv:2201.03319 (cross-list from eess.IV) [pdf, ps, other]: Title: Comparison of Representation Learning Techniques for Tracking in time resolved 3D Ultrasound

Authors: Daniel Wulff, Jannis Hagenah, Floris Ernst

Comments: Presented at Medical Imaging with Deep Learning (MIDL) 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1013] arXiv:2201.03481 (cross-list from eess.IV) [pdf, other]: Title: Learning Population-level Shape Statistics and Anatomy Segmentation From Images: A Joint Deep Learning Model

Authors: Wenzheng Tao, Riddhish Bhalodia, Shireen Elhabian

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1014] arXiv:2201.03559 (cross-list from eess.IV) [pdf, other]: Title: Demonstrating The Risk of Imbalanced Datasets in Chest X-ray Image-based Diagnostics by Prototypical Relevance Propagation

Authors: Srishti Gautam, Marina M.-C. Höhne, Stine Hansen, Robert Jenssen, Michael Kampffmeyer

Comments: To appear in ISBI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1015] arXiv:2201.03560 (cross-list from eess.IV) [pdf, ps, other]: Title: Iterative training of robust k-space interpolation networks for improved image reconstruction with limited scan specific training samples

Authors: Peter Dawood, Felix Breuer, Paul R. Burd, István Homolya, Johannes Oberberger, Peter M. Jakob, Martin Blaimer

Comments: Submitted to Magnetic Resonance in Medicine

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1016] arXiv:2201.03644 (cross-list from eess.IV) [pdf, other]: Title: 3D Segmentation with Fully Trainable Gabor Kernels and Pearson's Correlation Coefficient

Authors: Ken C. L. Wong, Mehdi Moradi

Comments: This paper was accepted by the International Workshop on Machine Learning in Medical Imaging (MLMI 2022)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1017] arXiv:2201.03669 (cross-list from eess.IV) [pdf, other]: Title: Neuroplastic graph attention networks for nuclei segmentation in histopathology images

Authors: Yoav Alon, Huiyu Zhou

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1018] arXiv:2201.03715 (cross-list from eess.IV) [pdf, other]: Title: An analysis of reconstruction noise from undersampled 4D flow MRI

Authors: Lauren Partin, Daniele E. Schiavazzi, Carlos A. Sing Long

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph); Applications (stat.AP); Methodology (stat.ME)
[1019] arXiv:2201.03777 (cross-list from eess.IV) [pdf, other]: Title: Reciprocal Adversarial Learning for Brain Tumor Segmentation: A Solution to BraTS Challenge 2021 Segmentation Task

Authors: Himashi Peiris, Zhaolin Chen, Gary Egan, Mehrtash Harandi

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1020] arXiv:2201.03795 (cross-list from eess.IV) [pdf, other]: Title: COROLLA: An Efficient Multi-Modality Fusion Framework with Supervised Contrastive Learning for Glaucoma Grading

Authors: Zhiyuan Cai, Li Lin, Huaqing He, Xiaoying Tang

Comments: 5 pages, To be published in ISBI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1021] arXiv:2201.03992 (cross-list from eess.IV) [pdf, other]: Title: Image quality measurements and denoising using Fourier Ring Correlations

Authors: J. Kaczmar-Michalska, N.R. Hajizadeh, A.J. Rzepiela, S.F. Nørrelykke

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1022] arXiv:2201.04138 (cross-list from eess.IV) [pdf, other]: Title: Overview of the HECKTOR Challenge at MICCAI 2021: Automatic Head and Neck Tumor Segmentation and Outcome Prediction in PET/CT Images

Authors: Vincent Andrearczyk, Valentin Oreiller, Sarah Boughdad, Catherine Chez Le Rest, Hesham Elhalawani, Mario Jreige, John O. Prior, Martin Vallières, Dimitris Visvikis, Mathieu Hatt, Adrien Depeursinge

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1023] arXiv:2201.04229 (cross-list from q-bio.NC) [pdf, other]: Title: Brain Signals Analysis Based Deep Learning Methods: Recent advances in the study of non-invasive brain signals

Authors: Almabrok Essa, Hari Kotte

Comments: 18 pages

Subjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1024] arXiv:2201.04318 (cross-list from eess.IV) [pdf, other]: Title: Knee Cartilage Defect Assessment by Graph Representation and Surface Convolution

Authors: Zixu Zhuang, Liping Si, Sheng Wang, Kai Xuan, Xi Ouyang, Yiqiang Zhan, Zhong Xue, Lichi Zhang, Dinggang Shen, Weiwu Yao, Qian Wang

Comments: 10 pages, 4 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1025] arXiv:2201.04370 (cross-list from eess.IV) [pdf, ps, other]: Title: Predicting Alzheimer's Disease Using 3DMgNet

Authors: Yelu Gao, Huang Huang, Lian Zhang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1026] arXiv:2201.04397 (cross-list from eess.IV) [pdf, other]: Title: Towards Adversarially Robust Deep Image Denoising

Authors: Hanshu Yan, Jingfeng Zhang, Jiashi Feng, Masashi Sugiyama, Vincent Y. F. Tan

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1027] arXiv:2201.04416 (cross-list from eess.IV) [pdf, other]: Title: Optimizing Prediction of MGMT Promoter Methylation from MRI Scans using Adversarial Learning

Authors: Sauman Das

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1028] arXiv:2201.04485 (cross-list from eess.IV) [pdf, other]: Title: Depth Estimation from Single-shot Monocular Endoscope Image Using Image Domain Adaptation And Edge-Aware Depth Estimation

Authors: Masahiro Oda, Hayato Itoh, Kiyohito Tanaka, Hirotsugu Takabatake, Masaki Mori, Hiroshi Natori, Kensaku Mori

Comments: Accepted paper as an oral presentation at Joint MICCAI workshop 2021, AE-CAI/CARE/OR2.0

Journal-ref: Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization, 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1029] arXiv:2201.04584 (cross-list from eess.IV) [pdf, other]: Title: ECONet: Efficient Convolutional Online Likelihood Network for Scribble-based Interactive Segmentation

Authors: Muhammad Asad, Lucas Fidon, Tom Vercauteren

Comments: Accepted at MIDL 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1030] arXiv:2201.04631 (cross-list from eess.IV) [pdf, ps, other]: Title: Early Diagnosis of Parkinsons Disease by Analyzing Magnetic Resonance Imaging Brain Scans and Patient Characteristics

Authors: Sabrina Zhu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1031] arXiv:2201.04714 (cross-list from astro-ph.IM) [pdf, other]: Title: Partial-Attribution Instance Segmentation for Astronomical Source Detection and Deblending

Authors: Ryan Hausen, Brant Robertson

Comments: Accepted to the Fourth Workshop on Machine Learning and the Physical Sciences, NeurIPS 2021, 6 pages, 1 figure

Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Astrophysics of Galaxies (astro-ph.GA); Computer Vision and Pattern Recognition (cs.CV)
[1032] arXiv:2201.04769 (cross-list from eess.IV) [pdf, other]: Title: MAg: a simple learning-based patient-level aggregation method for detecting microsatellite instability from whole-slide images

Authors: Kaifeng Pang, Zuhayr Asad, Shilin Zhao, Yuankai Huo

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1033] arXiv:2201.04795 (cross-list from eess.IV) [pdf, ps, other]: Title: EMT-NET: Efficient multitask network for computer-aided diagnosis of breast cancer

Authors: Jiaqiao Shi, Aleksandar Vakanski, Min Xian, Jianrui Ding, Chunping Ning

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1034] arXiv:2201.04812 (cross-list from eess.IV) [pdf, other]: Title: Unsupervised Domain Adaptation for Cross-Modality Retinal Vessel Segmentation via Disentangling Representation Style Transfer and Collaborative Consistency Learning

Authors: Linkai Peng, Li Lin, Pujin Cheng, Ziqi Huang, Xiaoying Tang

Comments: To be published in ISBI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1035] arXiv:2201.04918 (cross-list from eess.IV) [pdf, ps, other]: Title: Realistic Endoscopic Image Generation Method Using Virtual-to-real Image-domain Translation

Authors: Masahiro Oda, Kiyohito Tanaka, Hirotsugu Takabatake, Masaki Mori, Hiroshi Natori, Kensaku Mori

Comments: Accepted paper as an oral presentation at the Joint MICCAI workshop MIAR | AE-CAI | CARE 2019

Journal-ref: Healthcare Technology Letters, Vol.6, No.6, pp.214-219, 2019

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1036] arXiv:2201.05145 (cross-list from astro-ph.IM) [pdf, other]: Title: Fully Adaptive Bayesian Algorithm for Data Analysis, FABADA

Authors: Pablo M Sanchez-Alarcon, Yago Ascasibar Sequeiros

Comments: 13 pages, 6 figures. Accepted for publication in RAS Techniques and Instruments

Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Astrophysics of Galaxies (astro-ph.GA); Solar and Stellar Astrophysics (astro-ph.SR); Computer Vision and Pattern Recognition (cs.CV); Data Analysis, Statistics and Probability (physics.data-an)
[1037] arXiv:2201.05233 (cross-list from physics.flu-dyn) [pdf, other]: Title: Density reconstruction from schlieren images through Bayesian nonparametric models

Authors: Bryn Noel Ubald (1), Pranay Seshadri (1 and 2), Andrew Duncan (1 and 2) ((1) The Alan Turing Institute, (2) Imperial College London)

Subjects: Fluid Dynamics (physics.flu-dyn); Computer Vision and Pattern Recognition (cs.CV)
[1038] arXiv:2201.05331 (cross-list from eess.IV) [pdf, ps, other]: Title: Semi-automated Virtual Unfolded View Generation Method of Stomach from CT Volumes

Authors: Masahiro Oda, Tomoaki Suito, Yuichiro Hayashi, Takayuki Kitasaka, Kazuhiro Furukawa, Ryoji Miyahara, Yoshiki Hirooka, Hidemi Goto, Gen Iinuma, Kazunari Misawa, Shigeru Nawano, Kensaku Mori

Comments: Accepted paper as a poster presentation at MICCAI 2013 (International Conference on Medical Image Computing and Computer-Assisted Intervention), Nagoya, Japan

Journal-ref: Published in Proceedings of MICCAI 2013, LNCS 8149, pp.332-339, 2013

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1039] arXiv:2201.05344 (cross-list from eess.IV) [pdf, other]: Title: AWSnet: An Auto-weighted Supervision Attention Network for Myocardial Scar and Edema Segmentation in Multi-sequence Cardiac Magnetic Resonance Images

Authors: Kai-Ni Wang, Xin Yang, Juzheng Miao, Lei Li, Jing Yao, Ping Zhou, Wufeng Xue, Guang-Quan Zhou, Xiahai Zhuang, Dong Ni

Comments: 19 pages, 10 figures, accepted by Medical Image Analysis

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1040] arXiv:2201.05373 (cross-list from eess.IV) [pdf, ps, other]: Title: A New Deep Hybrid Boosted and Ensemble Learning-based Brain Tumor Analysis using MRI

Authors: Mirza Mumtaz Zahoor, Shahzad Ahmad Qureshi, Saddam Hussain Khan, Asifullah Khan

Comments: 26 pages, 9 figures, 8 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1041] arXiv:2201.05650 (cross-list from eess.IV) [pdf, other]: Title: Disentanglement enables cross-domain Hippocampus Segmentation

Authors: John Kalkhof, Camila González, Anirban Mukhopadhyay

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1042] arXiv:2201.05768 (cross-list from eess.IV) [pdf, other]: Title: Spectral Compressive Imaging Reconstruction Using Convolution and Contextual Transformer

Authors: Lishun Wang, Zongliang Wu, Yong Zhong, Xin Yuan

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1043] arXiv:2201.05810 (cross-list from eess.IV) [pdf, other]: Title: Two-Stage is Enough: A Concise Deep Unfolding Reconstruction Network for Flexible Video Compressive Sensing

Authors: Siming Zheng, Xiaoyu Yang, Xin Yuan

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1044] arXiv:2201.05865 (cross-list from eess.IV) [pdf, ps, other]: Title: SDT-DCSCN for Simultaneous Super-Resolution and Deblurring of Text Images

Authors: Hala Neji, Mohamed Ben Halima, Javier Nogueras-Iso, Tarek M. Hamdani, Abdulrahman M. Qahtani, Omar Almutiry, Habib Dhahri, Adel M. Alimi

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1045] arXiv:2201.05905 (cross-list from eess.IV) [pdf, other]: Title: SS-3DCapsNet: Self-supervised 3D Capsule Networks for Medical Segmentation on Less Labeled Data

Authors: Minh Tran, Loi Ly, Binh-Son Hua, Ngan Le

Comments: Accepted to ISBI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1046] arXiv:2201.05920 (cross-list from eess.IV) [pdf, other]: Title: ViTBIS: Vision Transformer for Biomedical Image Segmentation

Authors: Abhinav Sagar

Comments: Published at Clinical Image-Based Procedures, Distributed and Collaborative Learning, Artificial Intelligence for Combating COVID-19 and Secure and Privacy-Preserving Machine Learning workshop at MICCAI 2021

Journal-ref: Springer, Cham 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1047] arXiv:2201.05963 (cross-list from eess.IV) [pdf, ps, other]: Title: A Residual Encoder-Decoder Network for Segmentation of Retinal Image-Based Exudates in Diabetic Retinopathy Screening

Authors: Malik A. Manan, Tariq M. Khan, Ahsan Saadat, Muhammad Arsalan, Syed S. Naqvi

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1048] arXiv:2201.06045 (cross-list from eess.IV) [pdf, other]: Title: CISRNet: Compressed Image Super-Resolution Network

Authors: Agus Gunawan, Sultan Rizky Hikmawan Madjid

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1049] arXiv:2201.06052 (cross-list from eess.IV) [pdf, other]: Title: Self-Supervision and Multi-Task Learning: Challenges in Fine-Grained COVID-19 Multi-Class Classification from Chest X-rays

Authors: Muhammad Ridzuan, Ameera Ali Bawazir, Ivo Gollini Navarette, Ibrahim Almakky, Mohammad Yaqub

Comments: Accepted to Conference on Medical Image Understanding and Analysis (MIUA) 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1050] arXiv:2201.06086 (cross-list from eess.IV) [pdf, other]: Title: Is it Possible to Predict MGMT Promoter Methylation from Brain Tumor MRI Scans using Deep Learning Models?

Authors: Numan Saeed, Shahad Hardan, Kudaibergen Abutalip, Mohammad Yaqub

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1051] arXiv:2201.06133 (cross-list from stat.ML) [pdf, other]: Title: On Maximum-a-Posteriori estimation with Plug & Play priors and stochastic gradient descent

Authors: Rémi Laumont, Valentin de Bortoli, Andrés Almansa, Julie Delon, Alain Durmus, Marcelo Pereyra

Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Optimization and Control (math.OC)
[1052] arXiv:2201.06143 (cross-list from eess.IV) [pdf, other]: Title: Robust Scatterer Number Density Segmentation of Ultrasound Images

Authors: Ali K. Z. Tehrani, Ivan M. Rosado-Mendez, Hassan Rivaz

Comments: Accepted in IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1053] arXiv:2201.06250 (cross-list from eess.IV) [pdf, ps, other]: Title: Improving Clinical Diagnosis Performance with Automated X-ray Scan Quality Enhancement Algorithms

Authors: Karthik K, Sowmya Kamath S

Comments: Accepted and presented at the International Conference on Advances in Systems, Control and Computing (AISCC-2020), Malaviya National Institute of Technology, Jaipur, India, February 27-28, 2020

Journal-ref: International Conference on Advances in Systems, Control and Computing (AISCC-2020) at Malaviya National Institute of Technology, Jaipur, India, February 27-28, 2020

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1054] arXiv:2201.06251 (cross-list from eess.IV) [pdf, other]: Title: Automatic Segmentation of Head and Neck Tumor: How Powerful Transformers Are?

Authors: Ikboljon Sobirov, Otabek Nazarov, Hussain Alasmawi, Mohammad Yaqub

Comments: 8 pages, 2 figures (3 more figures in Appendix), 2 tables; accepted to MIDL conference

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1055] arXiv:2201.06259 (cross-list from eess.IV) [pdf, other]: Title: Segmentation of the Carotid Lumen and Vessel Wall using Deep Learning and Location Priors

Authors: Florian Thamm, Felix Denzinger, Leonhard Rist, Celia Martin Vicario, Florian Kordon, Andreas Maier

Comments: Challenge Report - Preprint

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1056] arXiv:2201.06329 (cross-list from eess.IV) [pdf, other]: Title: H&E-adversarial network: a convolutional neural network to learn stain-invariant features through Hematoxylin & Eosin regression

Authors: Niccoló Marini, Manfredo Atzori, Sebastian Otálora, Stephane Marchand-Maillet, Henning Müller

Comments: Errata corrige Proceedings of the IEEE/CVF International Conference on Computer Vision 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1057] arXiv:2201.06358 (cross-list from eess.IV) [pdf, other]: Title: Few-shot image segmentation for cross-institution male pelvic organs using registration-assisted prototypical learning

Authors: Yiwen Li, Yunguan Fu, Qianye Yang, Zhe Min, Wen Yan, Henkjan Huisman, Dean Barratt, Victor Adrian Prisacariu, Yipeng Hu

Comments: To appear in the proceedings of the IEEE International Symposium on Biomedical Imaging (ISBI) 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1058] arXiv:2201.06383 (cross-list from eess.IV) [pdf, other]: Title: Dual Perceptual Loss for Single Image Super-Resolution Using ESRGAN

Authors: Jie Song, Huawei Yi, Wenqian Xu, Xiaohui Li, Bo Li, Yuanyuan Liu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1059] arXiv:2201.06574 (cross-list from eess.IV) [pdf, other]: Title: Neural Computed Tomography

Authors: Kunal Gupta, Brendan Colvert, Francisco Contijoch

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1060] arXiv:2201.06931 (cross-list from eess.IV) [pdf, other]: Title: Deep Equilibrium Models for Video Snapshot Compressive Imaging

Authors: Yaping Zhao, Siming Zheng, Xin Yuan

Comments: 9 pages, 7 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1061] arXiv:2201.07066 (cross-list from eess.IV) [pdf, other]: Title: Joint denoising and HDR for RAW video sequences

Authors: A. Buades, O. Martorell, M. Sánchez-Beeckman

Comments: arXiv admin note: text overlap with arXiv:1812.11207

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1062] arXiv:2201.07219 (cross-list from eess.IV) [pdf, other]: Title: Contrastive Pretraining for Echocardiography Segmentation with Limited Data

Authors: Mohamed Saeed, Rand Muhtaseb, Mohammad Yaqub

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1063] arXiv:2201.07227 (cross-list from eess.IV) [pdf, other]: Title: Explainable Ensemble Machine Learning for Breast Cancer Diagnosis based on Ultrasound Image Texture Features

Authors: Alireza Rezazadeh, Yasamin Jafarian, Ali Kord

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1064] arXiv:2201.07231 (cross-list from eess.IV) [pdf, other]: Title: AI-based Carcinoma Detection and Classification Using Histopathological Images: A Systematic Review

Authors: Swathi Prabhu, Keerthana Prasad, Antonio Robles-Kelly, Xuequan Lu

Comments: accepted to Computers in Biology and Medicine

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1065] arXiv:2201.07344 (cross-list from eess.IV) [pdf, other]: Title: Lung Swapping Autoencoder: Learning a Disentangled Structure-texture Representation of Chest Radiographs

Authors: Lei Zhou, Joseph Bae, Huidong Liu, Gagandeep Singh, Jeremy Green, Amit Gupta, Dimitris Samaras, Prateek Prasanna

Comments: Extended version of the MICCAI 2021 paper. The code is available online.

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1066] arXiv:2201.07357 (cross-list from eess.IV) [pdf, other]: Title: Weakly Supervised Contrastive Learning for Better Severity Scoring of Lung Ultrasound

Authors: Gautam Rajendrakumar Gare, Hai V. Tran, Bennett P deBoisblanc, Ricardo Luis Rodriguez, John Michael Galeotti

Comments: Under Review for MIDL 2022 conference

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1067] arXiv:2201.07368 (cross-list from eess.IV) [pdf, other]: Title: The Role of Pleura and Adipose in Lung Ultrasound AI

Authors: Gautam Rajendrakumar Gare, Wanwen Chen, Alex Ling Yu Hung, Edward Chen, Hai V. Tran, Tom Fox, Pete Lowery, Kevin Zamora, Bennett P deBoisblanc, Ricardo Luis Rodriguez, John Michael Galeotti

Comments: Published in MICCAI 2021 workshop on Lessons Learned from the development and application of medical imaging-based AI technologies for combating COVID-19 (LL-COVID19). The first two authors contributed equally to this work

Journal-ref: LL-COVID19 2021. Lecture Notes in Computer Science, vol 12969. Springer, Cham

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1068] arXiv:2201.07562 (cross-list from eess.IV) [pdf, other]: Title: Learned Cone-Beam CT Reconstruction Using Neural Ordinary Differential Equations

Authors: Mareike Thies, Fabian Wagner, Mingxuan Gu, Lukas Folle, Lina Felsner, Andreas Maier

Comments: 6 pages

Journal-ref: 7th International Conference on Image Formation in X-Ray Computed Tomography, Proc. Vol. 12304 (2022)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1069] arXiv:2201.07610 (cross-list from math.OC) [pdf, other]: Title: Nonlinear Unknown Input Observability and Unknown Input Reconstruction: The General Analytical Solution

Authors: Agostino Martinelli

Comments: Published in the journal Information Fusion

Journal-ref: Journal of Information Fusion, Volume 85, September 2022, Pages 23-51

Subjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV)
[1070] arXiv:2201.07890 (cross-list from eess.SP) [pdf, other]: Title: Convolutional Neural Networks for Spherical Signal Processing via Spherical Haar Tight Framelets

Authors: Jianfei Li, Han Feng, Xiaosheng Zhuang

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Functional Analysis (math.FA)
[1071] arXiv:2201.07891 (cross-list from eess.SP) [pdf, other]: Title: Homogenization of Existing Inertial-Based Datasets to Support Human Activity Recognition

Authors: Hamza Amrani, Daniela Micucci, Marco Mobilio, Paolo Napoletano

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1072] arXiv:2201.08385 (cross-list from eess.IV) [pdf, other]: Title: Improving Specificity in Mammography Using Cross-correlation between Wavelet and Fourier Transform

Authors: Liuhua Zhang

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1073] arXiv:2201.08388 (cross-list from eess.IV) [pdf, other]: Title: Steerable Pyramid Transform Enables Robust Left Ventricle Quantification

Authors: Xiangyang Zhu, Kede Ma, Wufeng Xue

Comments: 10 pages, 13 figures, journal paper

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1074] arXiv:2201.08418 (cross-list from eess.IV) [pdf, other]: Title: SoftDropConnect (SDC) -- Effective and Efficient Quantification of the Network Uncertainty in Deep MR Image Analysis

Authors: Qing Lyu, Christopher T. Whitlow, Ge Wang

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1075] arXiv:2201.08512 (cross-list from eess.SP) [pdf, other]: Title: Vertical Federated Edge Learning with Distributed Integrated Sensing and Communication

Authors: Peixi Liu, Guangxu Zhu, Wei Jiang, Wu Luo, Jie Xu, Shuguang Cui

Comments: 5 pages, 7 figures, accepted by IEEE Communications Letters

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Networking and Internet Architecture (cs.NI)
[1076] arXiv:2201.08582 (cross-list from eess.IV) [pdf, other]: Title: SegTransVAE: Hybrid CNN -- Transformer with Regularization for medical image segmentation

Authors: Quan-Dung Pham (1), Hai Nguyen-Truong (1, 2 and 3), Nam Nguyen Phuong (1), Khoa N. A. Nguyen (1, 2 and 3) ((1) VinBrain JSC., Vietnam, (2) University of Science, Ho Chi Minh City, Vietnam, (3) Vietnam National University, Ho Chi Minh City, Vietnam)

Journal-ref: 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1077] arXiv:2201.08706 (cross-list from eess.IV) [pdf, other]: Title: SparseAlign: A Super-Resolution Algorithm for Automatic Marker Localization and Deformation Estimation in Cryo-Electron Tomography

Authors: Poulami Somanya Ganguly, Felix Lucka, Holger Kohr, Erik Franken, Hermen Jan Hupkes, K Joost Batenburg

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA); Optimization and Control (math.OC); Quantitative Methods (q-bio.QM)
[1078] arXiv:2201.08741 (cross-list from eess.IV) [pdf, ps, other]: Title: Improving Across-Dataset Brain Tissue Segmentation Using Transformer

Authors: Vishwanatha M. Rao, Zihan Wan, Soroush Arabshahi, David J. Ma, Pin-Yu Lee, Ye Tian, Xuzhe Zhang, Andrew F. Laine, Jia Guo

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1079] arXiv:2201.08865 (cross-list from eess.IV) [pdf, other]: Title: On the in vivo recognition of kidney stones using machine learning

Authors: Francisco Lopez-Tiro, Vincent Estrade, Jacques Hubert, Daniel Flores-Araiza, Miguel Gonzalez-Mendoza, Gilberto Ochoa-Ruiz, Christian Daul

Comments: Paper submitted to IEEE Access

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1080] arXiv:2201.08935 (cross-list from eess.IV) [pdf, other]: Title: SAR Image Change Detection Based on Multiscale Capsule Network

Authors: Yunhao Gao, Feng Gao, Junyu Dong, Heng-Chao Li

Journal-ref: in IEEE Geoscience and Remote Sensing Letters, vol. 18, no. 3, pp. 484-488, March 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1081] arXiv:2201.08944 (cross-list from eess.IV) [pdf, other]: Title: DCNGAN: A Deformable Convolutional-Based GAN with QP Adaptation for Perceptual Quality Enhancement of Compressed Video

Authors: Saiping Zhang, Luis Herranz, Marta Mrak, Marc Gorriz Blanch, Shuai Wan, Fuzheng Yang

Comments: 5 pages, 4 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1082] arXiv:2201.08955 (cross-list from eess.IV) [pdf, other]: Title: Modality Bank: Learn multi-modality images across data centers without sharing medical data

Authors: Qi Chang, Hui Qu, Zhennan Yan, Yunhe Gao, Lohendran Baskaran, Dimitris Metaxas

Comments: arXiv admin note: substantial text overlap with arXiv:2012.08604

Journal-ref: 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 2022, pp. 4758-4763

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1083] arXiv:2201.08964 (cross-list from physics.optics) [pdf, ps, other]: Title: Diffractive all-optical computing for quantitative phase imaging

Authors: Deniz Mengu, Aydogan Ozcan

Comments: 23 Pages, 5 Figures

Journal-ref: Advanced Optical Materials (2022)

Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Applied Physics (physics.app-ph)
[1084] arXiv:2201.09163 (cross-list from eess.IV) [pdf, ps, other]: Title: Pulmonary Fissure Segmentation in CT Images Based on ODoS Filter and Shape Features

Authors: Yuanyuan Peng, Pengpeng Luan, Hongbin Tu, Xiong Li, Ping Zhou

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1085] arXiv:2201.09240 (cross-list from eess.IV) [pdf, other]: Title: Learning-Driven Lossy Image Compression; A Comprehensive Survey

Authors: Sonain Jamil, Md. Jalil Piran, Muhib Ur Rahman

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1086] arXiv:2201.09267 (cross-list from stat.ML) [pdf, other]: Title: Spectral, Probabilistic, and Deep Metric Learning: Tutorial and Survey

Authors: Benyamin Ghojogh, Ali Ghodsi, Fakhri Karray, Mark Crowley

Comments: To appear as a part of an upcoming textbook on dimensionality reduction and manifold learning

Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1087] arXiv:2201.09314 (cross-list from eess.IV) [pdf, ps, other]: Title: Perceptual cGAN for MRI Super-resolution

Authors: Sahar Almahfouz Nasser, Saqib Shamsi, Valay Bundele, Bhavesh Garg, Amit Sethi

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1088] arXiv:2201.09360 (cross-list from eess.IV) [pdf, other]: Title: POTHER: Patch-Voted Deep Learning-Based Chest X-ray Bias Analysis for COVID-19 Detection

Authors: Tomasz Szczepański, Arkadiusz Sitek, Tomasz Trzciński, Szymon Płotka

Comments: Accepted at International Conference on Computational Science (ICCS) 2022, London

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1089] arXiv:2201.09376 (cross-list from eess.IV) [pdf, other]: Title: ReconFormer: Accelerated MRI Reconstruction Using Recurrent Transformer

Authors: Pengfei Guo, Yiqun Mei, Jinyuan Zhou, Shanshan Jiang, Vishal M. Patel

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1090] arXiv:2201.09400 (cross-list from eess.IV) [pdf, other]: Title: Fast MRI Reconstruction: How Powerful Transformers Are?

Authors: Jiahao Huang, Yinzhe Wu, Huanjun Wu, Guang Yang

Comments: 5 pages, 5 figures, EMBC 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1091] arXiv:2201.09522 (cross-list from eess.SP) [pdf, other]: Title: Accelerated Intravascular Ultrasound Imaging using Deep Reinforcement Learning

Authors: Tristan S.W. Stevens, Nishith Chennakeshava, Frederik J. de Bruijn, Martin Pekař, Ruud J.G. van Sloun

Comments: 5 pages, 3 figures, conference

Journal-ref: ICASSP 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1092] arXiv:2201.09579 (cross-list from eess.IV) [pdf, other]: Title: AutoSeg -- Steering the Inductive Biases for Automatic Pathology Segmentation

Authors: Felix Meissen, Georgios Kaissis, Daniel Rueckert

Comments: 8 pages, 3 figures, part of the MICCAI MOOD Challenge 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1093] arXiv:2201.09693 (cross-list from eess.IV) [pdf, other]: Title: Shape-consistent Generative Adversarial Networks for multi-modal Medical segmentation maps

Authors: Leo Segre, Or Hirschorn, Dvir Ginzburg, Dan Raviv

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1094] arXiv:2201.09851 (cross-list from eess.IV) [pdf, other]: Title: Hyperspectral Image Super-resolution with Deep Priors and Degradation Model Inversion

Authors: Xiuheng Wang, Jie Chen, Cédric Richard

Comments: Proc. IEEE Int. Conf. on Acoust, Speech, Signal Process. (ICASSP), to be published. Manuscript submitted October 6th, 2021; revised January 8th, 2022; accepted January 22nd, 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1095] arXiv:2201.09867 (cross-list from eess.IV) [pdf, ps, other]: Title: Importance of Preprocessing in Histopathology Image Classification Using Deep Convolutional Neural Network

Authors: Nilgun Sengoz, Tuncay Yigit, Ozlem Ozmen, Ali Hakan Isik

Comments: 6 Pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1096] arXiv:2201.09873 (cross-list from eess.IV) [pdf, other]: Title: Transformers in Medical Imaging: A Survey

Authors: Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, Huazhu Fu

Comments: 41 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1097] arXiv:2201.09929 (cross-list from math.DG) [pdf, other]: Title: Euclidean and Affine Curve Reconstruction

Authors: Jose Agudelo, Brooke Dippold, Ian Klein, Alex Kokot, Eric Geiger, Irina Kogan

Comments: This paper is a result of an REU project conducted at the North Carolina State University in the Summer and Fall 2020. This version has several minor corrections

Journal-ref: Involve 17 (2024) 29-63

Subjects: Differential Geometry (math.DG); Computer Vision and Pattern Recognition (cs.CV)
[1098] arXiv:2201.09952 (cross-list from eess.IV) [pdf, ps, other]: Title: A Deep Learning Approach for the Detection of COVID-19 from Chest X-Ray Images using Convolutional Neural Networks

Authors: Aditya Saxena, Shamsheer Pal Singh

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1099] arXiv:2201.09972 (cross-list from eess.IV) [pdf, ps, other]: Title: COVID-19 Detection Using CT Image Based On YOLOv5 Network

Authors: Ruyi Qu, Yi Yang, Yuwei Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1100] arXiv:2201.10166 (cross-list from eess.IV) [pdf, other]: Title: Dense Pixel-Labeling for Reverse-Transfer and Diagnostic Learning on Lung Ultrasound for COVID-19 and Pneumonia Detection

Authors: Gautam Rajendrakumar Gare, Andrew Schoenling, Vipin Philip, Hai V Tran, Bennett P deBoisblanc, Ricardo Luis Rodriguez, John Michael Galeotti

Comments: Published in 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI) © 2021 IEEE

Journal-ref: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), 2021, pp. 1406-1410

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1101] arXiv:2201.10294 (cross-list from eess.IV) [pdf, other]: Title: S2MS: Self-Supervised Learning Driven Multi-Spectral CT Image Enhancement

Authors: Chaoyang Zhang, Shaojie Chang, Ti Bai, Xi Chen

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1102] arXiv:2201.10305 (cross-list from eess.IV) [pdf, other]: Title: Mutual information neural estimation for unsupervised multi-modal registration of brain images

Authors: Gerard Snaauw (1), Michele Sasdelli (1), Gabriel Maicas (1), Stephan Lau (1 and 2), Johan Verjans (1 and 2), Mark Jenkinson (1 and 2), Gustavo Carneiro (1) ((1) Australian Institute for Machine Learning (AIML), University of Adelaide, Adelaide, Australia, (2) South Australian Health and Medical Research Institute (SAHMRI), Adelaide, Australia)

Comments: 4 pages, 4 figures, 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), oral presentation

Journal-ref: 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 2022, pp. 3510-3513

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1103] arXiv:2201.10324 (cross-list from eess.IV) [pdf, other]: Title: Addressing the Intra-class Mode Collapse Problem using Adaptive Input Image Normalization in GAN-based X-ray Images

Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

Comments: Accepted to the IEEE EMBC22 Conference

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1104] arXiv:2201.10345 (cross-list from eess.IV) [pdf, other]: Title: Ultra Low-Parameter Denoising: Trainable Bilateral Filter Layers in Computed Tomography

Authors: Fabian Wagner, Mareike Thies, Mingxuan Gu, Yixing Huang, Sabrina Pechmann, Mayank Patwari, Stefan Ploner, Oliver Aust, Stefan Uderhardt, Georg Schett, Silke Christiansen, Andreas Maier

Journal-ref: Med.Phys. 49 (2022) 5107-5120

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1105] arXiv:2201.10360 (cross-list from eess.SP) [pdf, other]: Title: Resource-efficient Deep Neural Networks for Automotive Radar Interference Mitigation

Authors: Johanna Rock, Wolfgang Roth, Mate Toth, Paul Meissner, Franz Pernkopf

Comments: 15 pages; published in IEEE Journal of Selected Topics in Signal Processing, Special Issue on Recent Advances in Automotive Radar Signal Processing, Volume: 15, Issue: 4, June 2021. arXiv admin note: text overlap with arXiv:2011.12706

Journal-ref: IEEE Journal of Selected Topics in Signal Processing, vol. 15, no. 4, pp. 927-940, June 2021

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[1106] arXiv:2201.10424 (cross-list from eess.IV) [pdf, other]: Title: Improving segmentation of calcified and non-calcified plaques on CCTA-CPR scans via masking of the artery wall

Authors: Antonio Tejero-de-Pablos, Hiroaki Yamane, Yusuke Kurose, Junichi Iho, Youji Tokunaga, Makoto Horie, Keisuke Nishizawa, Yusaku Hayashi, Yasushi Koyama, Tatsuya Harada

Comments: Extended abstract (see SPIE for final published version)

Journal-ref: SPIE 12465, Medical Imaging 2023: Computer-Aided Diagnosis

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1107] arXiv:2201.10511 (cross-list from eess.IV) [pdf, other]: Title: Initial Investigations Towards Non-invasive Monitoring of Chronic Wound Healing Using Deep Learning and Ultrasound Imaging

Authors: Maja Schlereth (1,2), Daniel Stromer (2), Yash Mantri (3), Jason Tsujimoto (3), Katharina Breininger (1), Andreas Maier (2), Caesar Anderson (4), Pranav S. Garimella (5), Jesse V. Jokerst (6) ((1) Department Artificial Intelligence in Biomedical Engineering, FAU Erlangen-Nürnberg, Erlangen, (2) Pattern Recognition Lab, FAU Erlangen-Nürnberg, Erlangen, (3) Department of Bioengineering, University of California, San Diego, (4) Department of Emergency Medicine, San Diego, (5) Division of Nephrology and Hypertension, Department of Medicine, San Diego, (6) Department of Nanoengineering, University of California, San Diego)

Comments: 6 pages, 2 figures, accepted by BVM conference proceedings 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1108] arXiv:2201.10747 (cross-list from eess.IV) [pdf, other]: Title: Learning Multiple Probabilistic Degradation Generators for Unsupervised Real World Image Super Resolution

Authors: Sangyun Lee, Sewoong Ahn, Kwangjin Yoon

Comments: Accepted to ECCVW 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1109] arXiv:2201.10776 (cross-list from eess.IV) [pdf, other]: Title: DSFormer: A Dual-domain Self-supervised Transformer for Accelerated Multi-contrast MRI Reconstruction

Authors: Bo Zhou, Neel Dey, Jo Schlemper, Seyed Sadegh Mohseni Salehi, Chi Liu, James S. Duncan, Michal Sofka

Comments: Accepted at WACV 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1110] arXiv:2201.10849 (cross-list from eess.IV) [pdf, other]: Title: Predicting Knee Osteoarthritis Progression from Structural MRI using Deep Learning

Authors: Egor Panfilov, Simo Saarakkala, Miika T. Nieminen, Aleksei Tiulpin

Comments: © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1111] arXiv:2201.10885 (cross-list from eess.IV) [pdf, other]: Title: Hyperparameter Optimization for COVID-19 Chest X-Ray Classification

Authors: Ibraheem Hamdi, Muhammad Ridzuan, Mohammad Yaqub

Comments: 15 pages, 13 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[1112] arXiv:2201.10910 (cross-list from eess.IV) [pdf, other]: Title: A Bayesian Based Deep Unrolling Algorithm for Single-Photon Lidar Systems

Authors: Jakeoung Koo, Abderrahim Halimi, Stephen McLaughlin

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1113] arXiv:2201.10981 (cross-list from eess.IV) [pdf, other]: Title: Joint Liver and Hepatic Lesion Segmentation in MRI using a Hybrid CNN with Transformer Layers

Authors: Georg Hille, Shubham Agrawal, Pavan Tummala, Christian Wybranski, Maciej Pech, Alexey Surov, Sylvia Saalfeld

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1114] arXiv:2201.11000 (cross-list from eess.IV) [pdf, other]: Title: One shot PACS: Patient specific Anatomic Context and Shape prior aware recurrent registration-segmentation of longitudinal thoracic cone beam CTs

Authors: Jue Jiang, Harini Veeraraghavan

Comments: This manuscript is currently under minor revision at IEEE Transactions on Medical Imaging

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1115] arXiv:2201.11002 (cross-list from eess.IV) [pdf, other]: Title: A Multi-rater Comparative Study of Automatic Target Localization Methods for Epilepsy Deep Brain Stimulation Procedures

Authors: Han Liu, Kathryn L. Holloway, Dario J. Englot, Benoit M. Dawant

Comments: Accepted by SPIE Medical Imaging 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1116] arXiv:2201.11037 (cross-list from eess.IV) [pdf, other]: Title: RTNet: Relation Transformer Network for Diabetic Retinopathy Multi-lesion Segmentation

Authors: Shiqi Huang, Jianan Li, Yuze Xiao, Ning Shen, Tingfa Xu

Comments: IEEE Transactions on Medical Imaging

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1117] arXiv:2201.11246 (cross-list from eess.IV) [pdf, other]: Title: HistoKT: Cross Knowledge Transfer in Computational Pathology

Authors: Ryan Zhang, Jiadai Zhu, Stephen Yang, Mahdi S. Hosseini, Angelo Genovese, Lina Chen, Corwyn Rowsell, Savvas Damaskinos, Sonal Varma, Konstantinos N. Plataniotis

Comments: Accepted in ICASSP2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1118] arXiv:2201.11333 (cross-list from eess.IV) [pdf, ps, other]: Title: Few-shot Transfer Learning for Holographic Image Reconstruction using a Recurrent Neural Network

Authors: Luzhe Huang, Xilin Yang, Tairan Liu, Aydogan Ozcan

Comments: 10 Pages, 3 Figures

Journal-ref: APL Photonics (2022)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1119] arXiv:2201.11389 (cross-list from eess.IV) [pdf, other]: Title: Multi-Frame Quality Enhancement On Compressed Video Using Quantised Data of Deep Belief Networks

Authors: Dionne Takudzwa Chasi, Mkhuseli Ngxande

Comments: 7 pages, 11 figures and 3 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1120] arXiv:2201.11446 (cross-list from eess.IV) [pdf, other]: Title: Pan-tumor CAnine cuTaneous Cancer Histology (CATCH) dataset

Authors: Frauke Wilm, Marco Fragoso, Christian Marzahl, Jingna Qiu, Chloé Puget, Laura Diehl, Christof A. Bertram, Robert Klopfleisch, Andreas Maier, Katharina Breininger, Marc Aubreville

Comments: Submitted to Scientific Data. 15 pages, 9 figures, 6 tables

Journal-ref: Scientific Data vol. 9 (2022)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1121] arXiv:2201.11630 (cross-list from eess.IV) [pdf, other]: Title: Automatic Classification of Neuromuscular Diseases in Children Using Photoacoustic Imaging

Authors: Maja Schlereth, Daniel Stromer, Katharina Breininger, Alexandra Wagner, Lina Tan, Andreas Maier, Ferdinand Knieling

Comments: accepted by BVM conference proceedings 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1122] arXiv:2201.11700 (cross-list from eess.IV) [pdf, other]: Title: Matched Illumination

Authors: Yuteng Zhu, Graham D. Finlayson

Comments: 15 pages, 7 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1123] arXiv:2201.11737 (cross-list from eess.IV) [pdf, ps, other]: Title: PRNU Based Source Camera Identification for Webcam and Smartphone Videos

Authors: Fernando Martín-Rodríguez, Fernando Isasi-de-Vicente

Comments: 4 pages, 5 figures, 4 tables. arXiv admin note: substantial text overlap with arXiv:2107.01885

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1124] arXiv:2201.11793 (cross-list from eess.IV) [pdf, other]: Title: Denoising Diffusion Restoration Models

Authors: Bahjat Kawar, Michael Elad, Stefano Ermon, Jiaming Song

Comments: Project page: this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1125] arXiv:2201.11795 (cross-list from eess.IV) [pdf, other]: Title: Neural JPEG: End-to-End Image Compression Leveraging a Standard JPEG Encoder-Decoder

Authors: Ankur Mali, Alexander Ororbia, Daniel Kifer, Lee Giles

Comments: Accepted in DCC 2022, 11 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1126] arXiv:2201.11864 (cross-list from eess.IV) [pdf, other]: Title: Classification of White Blood Cell Leukemia with Low Number of Interpretable and Explainable Features

Authors: William Franz Lamberti

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1127] arXiv:2201.11866 (cross-list from eess.IV) [pdf, other]: Title: Calibrating Histopathology Image Classifiers using Label Smoothing

Authors: Jerry Wei, Lorenzo Torresani, Jason Wei, Saeed Hassanpour

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1128] arXiv:2201.11987 (cross-list from eess.IV) [pdf, ps, other]: Title: Computer-aided Recognition and Assessment of a Porous Bioelastomer on Ultrasound Images for Regenerative Medicine Applications

Authors: Dun Wang, Kaixuan Guo, Yanying Zhu, Jia Sun, Aliona Dreglea, Jiao Yu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1129] arXiv:2201.11996 (cross-list from eess.IV) [pdf, other]: Title: Deep Networks for Image and Video Super-Resolution

Authors: Kuldeep Purohit, Srimanta Mandal, A. N. Rajagopalan

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1130] arXiv:2201.11998 (cross-list from eess.IV) [pdf, other]: Title: Image Superresolution using Scale-Recurrent Dense Network

Authors: Kuldeep Purohit, Srimanta Mandal, A. N. Rajagopalan

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1131] arXiv:2201.12152 (cross-list from eess.IV) [pdf, other]: Title: Carotid artery wall segmentation in ultrasound image sequences using a deep convolutional neural network

Authors: Nolann Lainé, Guillaume Zahnd, Hervé Liebgott, Maciej Orkisz

Comments: 5 pages, 4 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1132] arXiv:2201.12260 (cross-list from eess.IV) [pdf, other]: Title: A Review on Deep-Learning Algorithms for Fetal Ultrasound-Image Analysis

Authors: Maria Chiara Fiorentino, Francesca Pia Villani, Mariachiara Di Cosmo, Emanuele Frontoni, Sara Moccia

Journal-ref: Medical Image Analysis 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1133] arXiv:2201.12389 (cross-list from eess.IV) [pdf, other]: Title: DoubleU-Net++: Architecture with Exploit Multiscale Features for Vertebrae Segmentation

Authors: Simindokht Jahangard, Mahdi Bonyani, Abbas Khosravi

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1134] arXiv:2201.12589 (cross-list from eess.IV) [pdf, other]: Title: FedMed-ATL: Misaligned Unpaired Brain Image Synthesis via Affine Transform Loss

Authors: Jinbao Wang, Guoyang Xie, Yawen Huang, Yefeng Zheng, Yaochu Jin, Feng Zheng

Comments: arXiv admin note: text overlap with arXiv:2201.08953

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1135] arXiv:2201.12773 (cross-list from eess.IV) [pdf, other]: Title: Practical Noise Simulation for RGB Images

Authors: Saeed Ranjbar Alvar, Ivan V. Bajić

Comments: Reference paper for the code

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1136] arXiv:2201.12785 (cross-list from eess.IV) [pdf, other]: Title: TransBTSV2: Towards Better and More Efficient Volumetric Segmentation of Medical Images

Authors: Jiangyun Li, Wenxuan Wang, Chen Chen, Tianxiang Zhang, Sen Zha, Jing Wang, Hong Yu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1137] arXiv:2201.13256 (cross-list from math.OC) [pdf, other]: Title: Proximal Denoiser for Convergent Plug-and-Play Optimization with Nonconvex Regularization

Authors: Samuel Hurault, Arthur Leclaire, Nicolas Papadakis

Comments: 21 pages. arXiv admin note: text overlap with arXiv:2110.03220

Subjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV)
[1138] arXiv:2201.13309 (cross-list from physics.data-an) [pdf, other]: Title: Accelerating Laue Depth Reconstruction Algorithm with CUDA

Authors: Ke Yue, Schwarz Nicholas, Tischler Jonathan Z

Comments: 2015 IEEE International Conference on Cluster Computing

Subjects: Data Analysis, Statistics and Probability (physics.data-an); Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
