Computer Vision and Pattern Recognition

Authors and titles for cs.CV in Dec 2021

[ total of 1570 entries: 1-1570 ]
[1]  arXiv:2112.00011 [pdf, other]
Title: Predicting Poverty Level from Satellite Imagery using Deep Neural Networks
Comments: 14 pages, 5 Figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[2]  arXiv:2112.00050 [pdf, other]
Title: Pattern-Aware Data Augmentation for LiDAR 3D Object Detection
Comments: Published paper in the IEEE Intelligent Transportation Systems Conference - ITSC 2021
Journal-ref: 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), 2021, pp. 2703-2710
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[3]  arXiv:2112.00054 [pdf, other]
Title: Task2Sim : Towards Effective Pre-training and Transfer from Synthetic Data
Comments: Accepted to CVPR'22
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[4]  arXiv:2112.00061 [pdf, other]
Title: Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources
Comments: CVPR'22
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[5]  arXiv:2112.00065 [pdf, other]
Title: Boosting EfficientNets Ensemble Performance via Pseudo-Labels and Synthetic Images by pix2pixHD for Infection and Ischaemia Classification in Diabetic Foot Ulcers
Comments: Accepted for Workshop Proceedings of the Diabetic Foot Ulcers Challenge (DFUC) as part of the 2021 24th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[6]  arXiv:2112.00113 [pdf, other]
Title: Beyond Flatland: Pre-training with a Strong 3D Inductive Bias
Comments: NeurIPS 2021 pre-registration workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[7]  arXiv:2112.00166 [pdf, ps, other]
Title: TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information
Comments: To Appear In European Conference on Computer Vision (ECCV) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[8]  arXiv:2112.00167 [pdf, other]
Title: Event-Based Fusion for Motion Deblurring with Cross-modal Attention
Comments: Accepted by ECCV 2022 as oral presentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[9]  arXiv:2112.00169 [pdf, other]
Title: 3D Photo Stylization: Learning to Generate Stylized Novel Views from a Single Image
Comments: Project page: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[10]  arXiv:2112.00180 [pdf, other]
Title: SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[11]  arXiv:2112.00185 [pdf, other]
Title: Light Field Implicit Representation for Flexible Resolution Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[12]  arXiv:2112.00202 [pdf, other]
Title: 3DVNet: Multi-View Depth Prediction and Volumetric Refinement
Comments: 10 pages, 6 figures, 3 tables. Accepted to 3DV 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[13]  arXiv:2112.00206 [pdf, other]
Title: Querying Labelled Data with Scenario Programs for Sim-to-Real Validation
Comments: pre-print
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Programming Languages (cs.PL); Robotics (cs.RO)
[14]  arXiv:2112.00207 [pdf, ps, other]
Title: Improved sparse PCA method for face and image recognition
Comments: 11 pages. arXiv admin note: substantial text overlap with arXiv:1904.08496
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[15]  arXiv:2112.00216 [pdf, other]
Title: PoseKernelLifter: Metric Lifting of 3D Human Pose using Sound
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[16]  arXiv:2112.00219 [pdf, other]
Title: Scalable Primitives for Generalized Sensor Fusion in Autonomous Vehicles
Comments: Presented in Machine Learning for Autonomous Driving Workshop at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia. 11 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[17]  arXiv:2112.00234 [pdf, other]
Title: MC-Blur: A Comprehensive Benchmark for Image Deblurring
Comments: To appear in IEEE TCSVT
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[18]  arXiv:2112.00236 [pdf, other]
Title: VoRTX: Volumetric 3D Reconstruction With Transformers for Voxelwise View Selection and Fusion
Comments: 3DV 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[19]  arXiv:2112.00246 [pdf, other]
Title: AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-shot Interactions
Comments: ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[20]  arXiv:2112.00250 [pdf, ps, other]
Title: Shallow Network Based on Depthwise Over-Parameterized Convolution for Hyperspectral Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[21]  arXiv:2112.00260 [pdf, other]
Title: Ranking Distance Calibration for Cross-Domain Few-Shot Learning
Comments: Accepted at CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[22]  arXiv:2112.00263 [pdf, other]
Title: GLocal: Global Graph Reasoning and Local Structure Transfer for Person Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[23]  arXiv:2112.00281 [pdf, other]
Title: FDA-GAN: Flow-based Dual Attention GAN for Human Pose Transfer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[24]  arXiv:2112.00289 [pdf, other]
Title: Point Cloud Segmentation Using Sparse Temporal Local Attention
Comments: 8 pages, 3 figures Published at the Australasian Conference on Robotics and Automation (ACRA) 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[25]  arXiv:2112.00290 [pdf, other]
Title: Unsupervised Statistical Learning for Die Analysis in Ancient Numismatics
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[26]  arXiv:2112.00295 [pdf, other]
Title: Multiple Fusion Adaptation: A Strong Framework for Unsupervised Semantic Segmentation Adaptation
Comments: 13 pages, 2 figures, submitted to BMVC2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[27]  arXiv:2112.00302 [pdf, other]
Title: Graph Convolutional Module for Temporal Action Localization in Videos
Comments: Accepted by T-PAMI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[28]  arXiv:2112.00317 [pdf, other]
Title: Unleashing the Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification
Comments: Technical report, code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[29]  arXiv:2112.00319 [pdf, other]
Title: Object-Aware Cropping for Self-Supervised Learning
Journal-ref: Transactions on Machine Learning Research 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[30]  arXiv:2112.00322 [pdf, other]
Title: FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[31]  arXiv:2112.00323 [pdf, other]
Title: Push Stricter to Decide Better: A Class-Conditional Feature Adaptive Framework for Improving Adversarial Robustness
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[32]  arXiv:2112.00336 [pdf, other]
Title: Multi-View Stereo with Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[33]  arXiv:2112.00337 [pdf, other]
Title: A Unified Benchmark for the Unknown Detection Capability of Deep Neural Networks
Comments: Published in ESWA (this https URL)
Journal-ref: Expert Systems with Applications (2023), Vol. 229, Part A, 120461
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[34]  arXiv:2112.00342 [pdf, other]
Title: Confidence Propagation Cluster: Unleash Full Potential of Object Detectors
Comments: Accepted by CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[35]  arXiv:2112.00343 [pdf, other]
Title: Camera Motion Agnostic 3D Human Pose Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[36]  arXiv:2112.00348 [pdf, other]
Title: Automatic travel pattern extraction from visa page stamps using CNN models
Comments: 15 pages, 13 figures, 4 tables, submitted for peer review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[37]  arXiv:2112.00374 [pdf, other]
Title: CLIPstyler: Image Style Transfer with a Single Text Condition
Comments: CVPR 2022 camera ready
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Image and Video Processing (eess.IV)
[38]  arXiv:2112.00380 [pdf, other]
Title: Deep Measurement Updates for Bayes Filters
Journal-ref: IEEE Robotics and Automation Letters, vol. 7, no. 1, pp. 414-421, Jan. 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[39]  arXiv:2112.00384 [pdf, other]
Title: Exploration into Translation-Equivariant Image Quantization
Comments: ICASSP 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[40]  arXiv:2112.00390 [pdf, other]
Title: SegDiff: Image Segmentation with Diffusion Probabilistic Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[41]  arXiv:2112.00396 [pdf, other]
Title: Dyadic Human Motion Prediction
Comments: added reference for section 2
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[42]  arXiv:2112.00410 [pdf, other]
Title: Rethink, Revisit, Revise: A Spiral Reinforced Self-Revised Network for Zero-Shot Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[43]  arXiv:2112.00412 [pdf, other]
Title: The Majority Can Help The Minority: Context-rich Minority Oversampling for Long-tailed Classification
Comments: Accepted by CVPR 2022, 14 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[44]  arXiv:2112.00428 [pdf, other]
Title: Adv-4-Adv: Thwarting Changing Adversarial Perturbations via Adversarial Domain Adaptation
Comments: 22 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[45]  arXiv:2112.00431 [pdf, other]
Title: MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions
Comments: 12 Pages, 6 Figures, 7 Tables
Journal-ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[46]  arXiv:2112.00432 [pdf, other]
Title: A benchmark with decomposed distribution shifts for 360 monocular depth estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[47]  arXiv:2112.00448 [pdf, other]
Title: On-Device Spatial Attention based Sequence Learning Approach for Scene Text Script Identification
Comments: Accepted for publication in CVIP 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[48]  arXiv:2112.00459 [pdf, other]
Title: Information Theoretic Representation Distillation
Comments: BMVC 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[49]  arXiv:2112.00463 [pdf, other]
Title: The Norm Must Go On: Dynamic Unsupervised Domain Adaptation by Normalization
Comments: Accepted to CVPR 2022 - Camera Ready Version - Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50]  arXiv:2112.00475 [pdf, other]
Title: Weakly-Supervised Video Object Grounding via Causal Intervention
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[51]  arXiv:2112.00484 [pdf, other]
Title: Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[52]  arXiv:2112.00485 [pdf, other]
Title: Learning Transformer Features for Image Quality Assessment
Authors: Chao Zeng, Sam Kwong
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[53]  arXiv:2112.00492 [pdf, other]
Title: Human-Object Interaction Detection via Weak Supervision
Comments: Accepted at BMVC'21
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54]  arXiv:2112.00496 [pdf, other]
Title: Revisiting the Transferability of Supervised Pretraining: an MLP Perspective
Comments: Accepted by CVPR 2022. [camera ready with supplement]
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[55]  arXiv:2112.00504 [pdf, other]
Title: Learning Oriented Remote Sensing Object Detection via Naive Geometric Computing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[56]  arXiv:2112.00510 [pdf, other]
Title: Trimap-guided Feature Mining and Fusion Network for Natural Image Matting
Comments: Accepted to Computer Vision and Image Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[57]  arXiv:2112.00527 [pdf, other]
Title: Subtask-dominated Transfer Learning for Long-tail Person Search
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[58]  arXiv:2112.00532 [pdf, other]
Title: FaceTuneGAN: Face Autoencoder for Convolutional Expression Transfer Using Neural Generative Adversarial Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[59]  arXiv:2112.00556 [pdf, other]
Title: Semi-Supervised Surface Anomaly Detection of Composite Wind Turbine Blades From Drone Imagery
Comments: In-proceedings at 2022 17th International Conference on Computer Vision Theory and Applications (VISAPP)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[60]  arXiv:2112.00557 [pdf, ps, other]
Title: 3D Reconstruction Using a Linear Laser Scanner and a Camera
Authors: Rui Wang
Comments: 8 pages, 16 figures, published in The 2nd International Conference on Artificial Intelligence and Computer Engineering (ICAICE2021)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[61]  arXiv:2112.00560 [pdf, other]
Title: Attribute Artifacts Removal for Geometry-based Point Cloud Compression
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[62]  arXiv:2112.00568 [pdf, other]
Title: Dual Spoof Disentanglement Generation for Face Anti-spoofing with Depth Uncertainty Learning
Comments: Accepted to TCSVT, arXiv version. The codes are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[63]  arXiv:2112.00580 [pdf, other]
Title: Background Activation Suppression for Weakly Supervised Object Localization
Comments: Accepted by CVPR 2022. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[64]  arXiv:2112.00582 [pdf, other]
Title: Transformer-based Network for RGB-D Saliency Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[65]  arXiv:2112.00585 [pdf, other]
Title: Neural Emotion Director: Speech-preserving semantic control of facial expressions in "in-the-wild" videos
Comments: CVPR 2022 (oral). Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[66]  arXiv:2112.00599 [pdf, other]
Title: An implementation of the "Guess who?" game using CLIP
Comments: Code available at this https URL
Journal-ref: Intelligent Data Engineering and Automated Learning (IDEAL 2021). Lecture Notes in Computer Science, vol 13113
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[67]  arXiv:2112.00627 [pdf, other]
Title: DeepSportLab: a Unified Framework for Ball Detection, Player Instance Segmentation and Pose Estimation in Team Sports Scenes
Comments: 13 pages, 5 figures, BMVC 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[68]  arXiv:2112.00639 [pdf, other]
Title: A Systematic Review of Robustness in Deep Learning for Computer Vision: Mind the gap?
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[69]  arXiv:2112.00656 [pdf, other]
Title: Object-aware Video-language Pre-training for Retrieval
Comments: CVPR2022; Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[70]  arXiv:2112.00665 [pdf, other]
Title: Iterative Saliency Enhancement using Superpixel Similarity
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[71]  arXiv:2112.00686 [pdf, other]
Title: CYBORG: Blending Human Saliency Into the Loss Improves Deep Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72]  arXiv:2112.00690 [pdf, other]
Title: MDFM: Multi-Decision Fusing Model for Few-Shot Learning
Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT). arXiv admin note: text overlap with arXiv:2109.07785
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[73]  arXiv:2112.00694 [pdf, other]
Title: Label-Free Model Evaluation with Semi-Structured Dataset Representations
Comments: 10 pages, 8 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74]  arXiv:2112.00698 [pdf, ps, other]
Title: CondenseNeXt: An Ultra-Efficient Deep Neural Network for Embedded Systems
Comments: 5 pages, 3 figures, published in an IEEE Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[75]  arXiv:2112.00718 [pdf, other]
Title: Improving GAN Equilibrium by Raising Spatial Awareness
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[76]  arXiv:2112.00719 [pdf, other]
Title: HyperInverter: Improving StyleGAN Inversion via Hypernetwork
Comments: Accepted to CVPR 2022; Project page is located at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77]  arXiv:2112.00724 [pdf, other]
Title: RegNeRF: Regularizing Neural Radiance Fields for View Synthesis from Sparse Inputs
Comments: Project page available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[78]  arXiv:2112.00725 [pdf, other]
Title: The Augmented Image Prior: Distilling 1000 Classes by Extrapolating from a Single Image
Comments: Accepted at ICLR'23. Webpage: this https URL, code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[79]  arXiv:2112.00726 [pdf, other]
Title: MonoScene: Monocular 3D Semantic Scene Completion
Comments: Accepted at CVPR 2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[80]  arXiv:2112.00775 [pdf, other]
Title: Routing with Self-Attention for Multimodal Capsule Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[81]  arXiv:2112.00793 [pdf, other]
Title: Using Deep Image Prior to Assist Variational Selective Segmentation Deep Learning Algorithms
Comments: Presented at SIPAIM 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[82]  arXiv:2112.00804 [pdf, other]
Title: PreViTS: Contrastive Pretraining with Video Tracking Supervision
Comments: To be presented at WACV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83]  arXiv:2112.00821 [pdf, other]
Title: FaSS-MVS -- Fast Multi-View Stereo with Surface-Aware Semi-Global Matching from UAV-borne Monocular Imagery
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[84]  arXiv:2112.00847 [pdf, other]
Title: CLAWS: Contrastive Learning with hard Attention and Weak Supervision
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[85]  arXiv:2112.00849 [pdf, ps, other]
Title: Interpretable Deep Learning-Based Forensic Iris Segmentation and Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[86]  arXiv:2112.00854 [pdf, other]
Title: GANORCON: Are Generative Models Useful for Few-shot Segmentation?
Comments: CVPR 2022 Camera Ready Version
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[87]  arXiv:2112.00879 [pdf, other]
Title: Generating Diverse 3D Reconstructions from a Single Occluded Face Image
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88]  arXiv:2112.00891 [pdf, other]
Title: Event Neural Networks
Comments: Accepted to ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89]  arXiv:2112.00933 [pdf, other]
Title: PartImageNet: A Large, High-Quality Dataset of Parts
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90]  arXiv:2112.00941 [pdf, other]
Title: Generalized Closed-form Formulae for Feature-based Subpixel Alignment in Patch-based Matching
Comments: 29 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[91]  arXiv:2112.00942 [pdf, other]
Title: On Salience-Sensitive Sign Classification in Autonomous Vehicle Path Planning: Experimental Explorations with a Novel Dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[92]  arXiv:2112.00948 [pdf, other]
Title: Visual-Semantic Transformer for Scene Text Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93]  arXiv:2112.00953 [pdf, other]
Title: Maximum Consensus by Weighted Influences of Monotone Boolean Functions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[94]  arXiv:2112.00954 [pdf, other]
Title: Temporally Resolution Decrement: Utilizing the Shape Consistency for Higher Computational Efficiency
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95]  arXiv:2112.00958 [pdf, other]
Title: Hierarchical Neural Implicit Pose Network for Animation and Motion Retargeting
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[96]  arXiv:2112.00965 [pdf, other]
Title: Vision Pair Learning: An Efficient Training Framework for Image Classification
Authors: Bei Tong, Xiaoyuan Yu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[97]  arXiv:2112.00967 [pdf, other]
Title: Relational Graph Learning for Grounded Video Description Generation
Comments: 10 pages, 5 figures, ACM MM 2020
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[98]  arXiv:2112.00969 [pdf, other]
Title: Object-Centric Unsupervised Image Captioning
Comments: ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[99]  arXiv:2112.00974 [pdf, other]
Title: Consensus Graph Representation Learning for Better Grounded Image Captioning
Comments: 9 pages, 5 figures, AAAI 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[100]  arXiv:2112.00995 [pdf, other]
Title: SwinTrack: A Simple and Strong Baseline for Transformer Tracking
Comments: 22 pages, 10 figures
Journal-ref: Advances in Neural Information Processing Systems, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[101]  arXiv:2112.01001 [pdf, other]
Title: SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency
Comments: Published at NeurIPS 2021. See project webpage at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[102]  arXiv:2112.01011 [pdf, other]
Title: Local Similarity Pattern and Cost Self-Reassembling for Deep Stereo Matching Networks
Comments: Accepted by AAAI-2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[103]  arXiv:2112.01019 [pdf, other]
Title: Unconstrained Face Sketch Synthesis via Perception-Adaptive Network and A New Benchmark
Comments: We proposed the first medium-scale benchmark for unconstrained face sketch synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[104]  arXiv:2112.01030 [pdf, other]
Title: TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework using Self-Supervised Multi-Task Learning
Comments: Accepted by the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[105]  arXiv:2112.01033 [pdf, other]
Title: TBN-ViT: Temporal Bilateral Network with Vision Transformer for Video Scene Parsing
Comments: The sixth place solution for ICCV2021 VSPW Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[106]  arXiv:2112.01034 [pdf, other]
Title: Leveraging Human Selective Attention for Medical Image Analysis with Limited Training Data
Comments: BMVC 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[107]  arXiv:2112.01036 [pdf, other]
Title: GANSeg: Learning to Segment by Unsupervised Hierarchical Image Generation
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[108]  arXiv:2112.01037 [pdf, other]
Title: Inferring Prototypes for Multi-Label Few-Shot Image Classification with Word Vector Guided Attention
Comments: Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[109]  arXiv:2112.01038 [pdf, other]
Title: Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips
Comments: BMVC 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[110]  arXiv:2112.01041 [pdf, other]
Title: N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras
Comments: Accepted to ICCV 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[111]  arXiv:2112.01050 [pdf, other]
Title: CloudWalker: Random walks for 3D point cloud shape analysis
Journal-ref: Computers & Graphics Volume 106, August 2022, Pages 110-118
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[112]  arXiv:2112.01059 [pdf, other]
Title: Stronger Baseline for Person Re-Identification
Comments: The third-place solution for ICCV2021 VIPriors Re-identification Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113]  arXiv:2112.01062 [pdf, other]
Title: Syntax Customized Video Captioning by Imitating Exemplar Sentences
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[114]  arXiv:2112.01063 [pdf, other]
Title: Automatic deforestation detectors based on frequentist statistics and their extensions for other spatial objects
Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP); Methodology (stat.ME)
[115]  arXiv:2112.01071 [pdf, other]
Title: Extract Free Dense Labels from CLIP
Comments: ECCV 2022 oral, project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[116]  arXiv:2112.01072 [pdf, other]
Title: The Second Place Solution for ICCV2021 VIPriors Instance Segmentation Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[117]  arXiv:2112.01073 [pdf, other]
Title: Controllable Video Captioning with an Exemplar Sentence
Journal-ref: Proceedings of the 28th ACM International Conference on Multimedia, 2020, pp. 1085-1093
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[118]  arXiv:2112.01085 [pdf, other]
Title: PTCT: Patches with 3D-Temporal Convolutional Transformer Network for Precipitation Nowcasting
Comments: 9 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[119]  arXiv:2112.01098 [pdf, other]
Title: Attention based Occlusion Removal for Hybrid Telepresence Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[120]  arXiv:2112.01121 [pdf, other]
Title: "Just Drive": Colour Bias Mitigation for Semantic Segmentation in the Context of Urban Driving
Comments: 2021 IEEE International Conference on Big Data (IEEE BigData 2021)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121]  arXiv:2112.01135 [pdf, other]
Title: Open-set 3D Object Detection
Comments: Received by 3DV 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[122]  arXiv:2112.01148 [pdf, other]
Title: FIBA: Frequency-Injection based Backdoor Attack in Medical Image Analysis
Comments: Accepted by CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[123]  arXiv:2112.01155 [pdf, other]
Title: Batch Normalization Tells You Which Filter is Important
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[124]  arXiv:2112.01161 [pdf, other]
Title: Video Frame Interpolation without Temporal Priors
Comments: Accepted by Neural Information Processing Systems (NeurIPS) 2020
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[125]  arXiv:2112.01176 [pdf, other]
Title: Overcoming the Domain Gap in Neural Action Representations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126]  arXiv:2112.01177 [pdf, other]
Title: MutualFormer: Multi-Modality Representation Learning via Cross-Diffusion Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2112.01194 [pdf, other]
Title: Video-Text Pre-training with Learned Regions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[128]  arXiv:2112.01197 [pdf, other]
Title: Sample Prior Guided Robust Model Learning to Suppress Noisy Labels
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[129]  arXiv:2112.01314 [pdf, other]
Title: SIDNet: Learning Shading-aware Illumination Descriptor for Image Harmonization
Comments: Accepted by IEEE TETCI 2023. Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[130]  arXiv:2112.01316 [pdf, other]
Title: Putting 3D Spatially Sparse Networks on a Diet
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131]  arXiv:2112.01330 [pdf, other]
Title: CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer
Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[132]  arXiv:2112.01335 [pdf, other]
Title: Semantic-Sparse Colorization Network for Deep Exemplar-based Colorization
Comments: Accepted by ECCV2022; 14 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133]  arXiv:2112.01348 [pdf, other]
Title: 3rd Place Solution for NeurIPS 2021 Shifts Challenge: Vehicle Motion Prediction
Journal-ref: Bayesian Deep Learning Workshop, NeurIPS 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[134]  arXiv:2112.01349 [pdf, other]
Title: MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment
Comments: accepted by ECCV2022
Journal-ref: European Conference on Computer Vision (2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[135]  arXiv:2112.01360 [pdf, other]
Title: Probabilistic Approach for Road-Users Detection
Comments: This work has been accepted for publication as a REGULAR PAPER in the Transactions on Intelligent Transportation Systems-ITS
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[136]  arXiv:2112.01390 [pdf, other]
Title: InsCLR: Improving Instance Retrieval with Self-Supervision
Comments: Accepted by AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137]  arXiv:2112.01398 [pdf, other]
Title: TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation
Comments: Accepted to ECCV 2022; TISE toolbox is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[138]  arXiv:2112.01402 [pdf, other]
Title: Iterative Contrast-Classify For Semi-supervised Temporal Action Segmentation
Comments: AAAI-2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[139]  arXiv:2112.01422 [pdf, other]
Title: 3D-Aware Semantic-Guided Generative Model for Human Synthesis
Comments: ECCV 2022. 29 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140]  arXiv:2112.01426 [pdf, other]
Title: SCNet: A Generalized Attention-based Model for Crack Fault Segmentation
Comments: Accepted at ICVGIP 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[141]  arXiv:2112.01454 [pdf, other]
Title: Altering Facial Expression Based on Textual Emotion
Comments: Accepted in VISAPP2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[142]  arXiv:2112.01455 [pdf, other]
Title: Zero-Shot Text-Guided Object Generation with Dream Fields
Comments: CVPR 2022. 13 pages. Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[143]  arXiv:2112.01473 [pdf, other]
Title: Neural Point Light Fields
Comments: 9 pages, replacement changed font of equations
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[144]  arXiv:2112.01479 [pdf, other]
Title: Learning Spatial-Temporal Graphs for Active Speaker Detection
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[145]  arXiv:2112.01502 [pdf, other]
Title: Dimensions of Motion: Monocular Prediction through Flow Subspaces
Comments: Project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[146]  arXiv:2112.01503 [pdf, ps, other]
Title: Machine Learning-Based Classification Algorithms for the Prediction of Coronary Heart Diseases
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[147]  arXiv:2112.01504 [pdf, other]
Title: Neural Weight Step Video Compression
Comments: Accepted to the pre-registration workshop at NeurIPS 2021
Journal-ref: NeurIPS 2021 workshop in pre-registration
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[148]  arXiv:2112.01513 [pdf, other]
Title: OW-DETR: Open-world Detection Transformer
Comments: 16 pages, CVPR 2022 accepted
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[149]  arXiv:2112.01514 [pdf, other]
Title: Self-supervised Video Transformer
Comments: Accepted to CVPR '22
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[150]  arXiv:2112.01515 [pdf, other]
Title: TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation
Comments: Accepted by ECCV 2022, Oral, open-sourced
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[151]  arXiv:2112.01517 [pdf, other]
Title: Efficient Neural Radiance Fields for Interactive Free-viewpoint Video
Comments: SIGGRAPH Asia 2022; Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[152]  arXiv:2112.01518 [pdf, other]
Title: DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
Comments: Accepted to CVPR2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[153]  arXiv:2112.01520 [pdf, other]
Title: Recognizing Scenes from Novel Viewpoints
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[154]  arXiv:2112.01521 [pdf, other]
Title: Object-aware Monocular Depth Prediction with Instance Convolutions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[155]  arXiv:2112.01522 [pdf, other]
Title: Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[156]  arXiv:2112.01523 [pdf, other]
Title: Learning Neural Light Fields with Ray-Space Embedding Networks
Comments: CVPR 2022 camera ready revision. Major changes include: 1. Additional comparison to NeX on Stanford, RealFF, Shiny datasets 2. Experiment on 360 degree lego bulldozer scene in the appendix, using Pluecker parameterization 3. Moving student-teacher results to the appendix 4. Clarity edits -- in particular, making it clear that our Stanford evaluation *does not* use subdivision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157]  arXiv:2112.01524 [pdf, other]
Title: GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras
Comments: CVPR 2022 (Oral). Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
[158]  arXiv:2112.01525 [pdf, other]
Title: Co-domain Symmetry for Complex-Valued Deep Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[159]  arXiv:2112.01526 [pdf, other]
Title: MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
Comments: CVPR 2022 Camera Ready
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160]  arXiv:2112.01527 [pdf, other]
Title: Masked-attention Mask Transformer for Universal Image Segmentation
Comments: CVPR 2022. Project page/code/models: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[161]  arXiv:2112.01528 [pdf, other]
Title: A Fast Knowledge Distillation Framework for Visual Recognition
Comments: Our project page: this http URL, code and models are available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[162]  arXiv:2112.01529 [pdf, other]
Title: BEVT: BERT Pretraining of Video Transformers
Comments: To Appear at CVPR 2022, code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[163]  arXiv:2112.01530 [pdf, other]
Title: StyleMesh: Style Transfer for Indoor 3D Scene Reconstructions
Comments: Accepted to CVPR2022; project page: this https URL; video: this https URL; code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164]  arXiv:2112.01551 [pdf, other]
Title: D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Comments: Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[165]  arXiv:2112.01554 [pdf, other]
Title: Neural Head Avatars from Monocular RGB Videos
Authors: Philip-William Grassal (1), Malte Prinzler (1), Titus Leistner (1), Carsten Rother (1), Matthias Nießner (2), Justus Thies (3) ((1) Heidelberg University, (2) Technical University of Munich, (3) Max Planck Institute for Intelligent Systems)
Comments: Camera-ready revision - Video: this https URL Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[166]  arXiv:2112.01573 [pdf, other]
Title: FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[167]  arXiv:2112.01601 [pdf, other]
Title: Is RobustBench/AutoAttack a suitable Benchmark for Adversarial Robustness?
Comments: AAAI-22 AdvML Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[168]  arXiv:2112.01609 [pdf, other]
Title: Probabilistic Tracking with Deep Factors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[169]  arXiv:2112.01641 [pdf, other]
Title: Hamiltonian latent operators for content and motion disentanglement in image sequences
Comments: Conference paper at NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[170]  arXiv:2112.01646 [pdf, other]
Title: Investigating the usefulness of Quantum Blur
Journal-ref: Proc. ISQCMC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET); Quantum Physics (quant-ph)
[171]  arXiv:2112.01651 [pdf, other]
Title: Multi-modal application: Image Memes Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[172]  arXiv:2112.01683 [pdf, other]
Title: TransZero: Attribute-guided Transformer for Zero-Shot Learning
Comments: Accepted to AAAI'22
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[173]  arXiv:2112.01686 [pdf, other]
Title: Make A Long Image Short: Adaptive Token Length for Vision Transformers
Comments: 10 pages, Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174]  arXiv:2112.01695 [pdf, other]
Title: Hybrid Instance-aware Temporal Fusion for Online Video Instance Segmentation
Comments: AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175]  arXiv:2112.01697 [pdf, other]
Title: LMR-CBT: Learning Modality-fused Representations with CB-Transformer for Multimodal Emotion Recognition from Unaligned Multimodal Sequences
Comments: 9 pages, Figure 2, Table 5
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[176]  arXiv:2112.01698 [pdf, other]
Title: Learning to Detect Every Thing in an Open World
Comments: Project page is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177]  arXiv:2112.01712 [pdf, other]
Title: Deep Depth from Focus with Differential Focus Volume
Comments: 17 pages; CVPR2022 accepted
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178]  arXiv:2112.01714 [pdf, other]
Title: Structure-Aware Multi-Hop Graph Convolution for Graph Neural Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[179]  arXiv:2112.01715 [pdf, other]
Title: Self-Supervised Material and Texture Representation Learning for Remote Sensing Tasks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[180]  arXiv:2112.01719 [pdf, other]
Title: Adaptive Poincaré Point to Set Distance for Few-Shot Classification
Comments: Accepted at AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[181]  arXiv:2112.01723 [pdf, other]
Title: Adversarial Attacks against a Satellite-borne Multispectral Cloud Detector
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[182]  arXiv:2112.01730 [pdf, other]
Title: How to Synthesize a Large-Scale and Trainable Micro-Expression Dataset?
Comments: European Conference on Computer Vision 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183]  arXiv:2112.01732 [pdf, other]
Title: MFNet: Multi-filter Directive Network for Weakly Supervised Salient Object Detection
Comments: accepted by ICCV-2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[184]  arXiv:2112.01736 [pdf, other]
Title: Gesture Recognition with a Skeleton-Based Keyframe Selection Module
Comments: 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185]  arXiv:2112.01740 [pdf, other]
Title: AirDet: Few-Shot Detection without Fine-tuning for Autonomous Exploration
Comments: 23 pages, 9 figures
Journal-ref: 2022 17th European Conference on Computer Vision (ECCV)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[186]  arXiv:2112.01741 [pdf, other]
Title: Frame Averaging for Equivariant Shape Space Learning
Comments: Accepted to CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[187]  arXiv:2112.01746 [pdf, other]
Title: MSP : Refine Boundary Segmentation via Multiscale Superpixel
Comments: under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[188]  arXiv:2112.01759 [pdf, other]
Title: NeRF-SR: High-Quality Neural Radiance Fields using Supersampling
Comments: Accepted to MM 2022. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[189]  arXiv:2112.01766 [pdf, other]
Title: Unsupervised Low-Light Image Enhancement via Histogram Equalization Prior
Comments: submitted to IEEE Transactions on Image Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[190]  arXiv:2112.01787 [pdf, other]
Title: Detect Faces Efficiently: A Survey and Evaluations
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[191]  arXiv:2112.01793 [pdf, other]
Title: A Systematic IoU-Related Method: Beyond Simplified Regression for Better Localization
Journal-ref: IEEE Transactions on Image Processing, Volume 30, pages 5032-5044, 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[192]  arXiv:2112.01799 [pdf, other]
Title: Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[193]  arXiv:2112.01800 [pdf, other]
Title: A Survey: Deep Learning for Hyperspectral Image Classification with Few Labeled Samples
Journal-ref: Neurocomputing, Volume 448, 2021, Pages 179-204
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[194]  arXiv:2112.01801 [pdf, other]
Title: Mesh Convolution with Continuous Filters for 3D Surface Parsing
Comments: Accepted to TNNLS
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[195]  arXiv:2112.01838 [pdf, other]
Title: Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer
Comments: Accepted to CVPR2022. 14 pages, 14 figures and 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[196]  arXiv:2112.01839 [src]
Title: Mind Your Clever Neighbours: Unsupervised Person Re-identification via Adaptive Clustering Relationship Modeling
Comments: The experimental results are not sufficient
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG); Multimedia (cs.MM)
[197]  arXiv:2112.01845 [pdf, other]
Title: Semantic Map Injected GAN Training for Image-to-Image Translation
Comments: Accepted in Fourth Workshop on Computer Vision Applications (WCVA) at ICVGIP 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[198]  arXiv:2112.01873 [pdf, other]
Title: Image-to-image Translation as a Unique Source of Knowledge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[199]  arXiv:2112.01882 [pdf, other]
Title: Incremental Learning in Semantic Segmentation from Image Labels
Comments: To appear in CVPR 22
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[200]  arXiv:2112.01900 [pdf, other]
Title: Novel Class Discovery in Semantic Segmentation
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[201]  arXiv:2112.01901 [pdf, other]
Title: The Box Size Confidence Bias Harms Your Object Detector
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[202]  arXiv:2112.01914 [pdf, other]
Title: SGM3D: Stereo Guided Monocular 3D Object Detection
Comments: 8 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[203]  arXiv:2112.01924 [pdf, other]
Title: TRNR: Task-Driven Image Rain and Noise Removal with a Few Images Based on Patch Analysis
Comments: 16 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[204]  arXiv:2112.01926 [pdf, other]
Title: Panoptic-aware Image-to-Image Translation
Comments: In 2023 IEEE winter conference on applications of computer vision (WACV)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[205]  arXiv:2112.01932 [pdf, other]
Title: Multi-Content Complementation Network for Salient Object Detection in Optical Remote Sensing Images
Comments: 12 pages, 7 figures, Accepted by IEEE Transactions on Geoscience and Remote Sensing 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[206]  arXiv:2112.01933 [pdf, other]
Title: Bio-inspired Polarization Event Camera
Subjects: Computer Vision and Pattern Recognition (cs.CV); Instrumentation and Detectors (physics.ins-det); Optics (physics.optics)
[207]  arXiv:2112.01948 [pdf, ps, other]
Title: Boosting Unsupervised Domain Adaptation with Soft Pseudo-label and Curriculum Learning
Comments: 28 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[208]  arXiv:2112.01970 [pdf, ps, other]
Title: Optimization of phase-only holograms calculated with scaled diffraction calculation through deep neural networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Optics (physics.optics)
[209]  arXiv:2112.01983 [pdf, other]
Title: CoNeRF: Controllable Neural Radiance Fields
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[210]  arXiv:2112.01988 [pdf, other]
Title: ROCA: Robust CAD Model Retrieval and Alignment from a Single Image
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[211]  arXiv:2112.02039 [pdf, other]
Title: Bridging the Gap: Point Clouds for Merging Neurons in Connectomics
Comments: 10 pages, 6 figures, MIDL 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[212]  arXiv:2112.02073 [pdf, other]
Title: Hierarchical Optimal Transport for Unsupervised Domain Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[213]  arXiv:2112.02082 [pdf, other]
Title: Geometry-aware Two-scale PIFu Representation for Human Reconstruction
Comments: Accepted by NeurIPS 2022. 20 pages, 20 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[214]  arXiv:2112.02091 [pdf, other]
Title: Class-agnostic Reconstruction of Dynamic Objects from Videos
Comments: NeurIPS 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[215]  arXiv:2112.02139 [pdf, other]
Title: Face Reconstruction with Variational Autoencoder and Face Masks
Comments: 12 pages, 7 figures, 18th Encontro Nacional de Inteligência Artificial e Computacional (ENIAC)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[216]  arXiv:2112.02205 [pdf, other]
Title: Behind the Curtain: Learning Occluded Shapes for 3D Object Detection
Journal-ref: AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[217]  arXiv:2112.02214 [pdf, other]
Title: Joint Audio-Text Model for Expressive Speech-Driven 3D Facial Animation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[218]  arXiv:2112.02219 [pdf, other]
Title: Transferring Unconditional to Conditional GANs with Hyper-Modulation
Comments: 19 pages, 20 figures, to be published in CVPRW 2022. Code at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[219]  arXiv:2112.02221 [pdf, other]
Title: Orientation Aware Weapons Detection In Visual Data : A Benchmark Dataset
Comments: Submitted to a journal
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[220]  arXiv:2112.02225 [pdf, other]
Title: HHF: Hashing-guided Hinge Function for Deep Hashing Retrieval
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[221]  arXiv:2112.02236 [pdf, other]
Title: SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing
Comments: Camera-ready for CVPR 2022. Project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[222]  arXiv:2112.02237 [pdf, other]
Title: A Triple-Double Convolutional Neural Network for Panchromatic Sharpening
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[223]  arXiv:2112.02238 [pdf, other]
Title: Sphere Face Model: A 3D Morphable Model with Hypersphere Manifold Latent Space
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[224]  arXiv:2112.02244 [pdf, other]
Title: LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[225]  arXiv:2112.02249 [src]
Title: Dual-Flow Transformation Network for Deformable Image Registration with Region Consistency Constraint
Comments: This paper has some errors in the experimental results, so we are withdrawing it. We will upload a revised version. This paper has not been published in any journal or conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[226]  arXiv:2112.02250 [pdf, other]
Title: Dense Extreme Inception Network for Edge Detection
Comments: Manuscript published by Pattern Recognition journal in 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[227]  arXiv:2112.02252 [pdf, other]
Title: Channel Exchanging Networks for Multimodal and Multitask Dense Image Prediction
Comments: Accepted by TPAMI 2022. Code is available at this https URL. arXiv admin note: text overlap with arXiv:2011.05005
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[228]  arXiv:2112.02259 [pdf, other]
Title: Construct Informative Triplet with Two-stage Hard-sample Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[229]  arXiv:2112.02270 [pdf, ps, other]
Title: Feature-based Recognition Framework for Super-resolution Images
Authors: Jing Hu, Meiqi Zhang, Rui Zhang (School of Artificial Intelligence and Automation, HUST)
Comments: 7 pages, 2 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[230]  arXiv:2112.02277 [pdf, other]
Title: BAANet: Learning Bi-directional Adaptive Attention Gates for Multispectral Pedestrian Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[231]  arXiv:2112.02279 [pdf, other]
Title: U2-Former: A Nested U-shaped Transformer for Image Restoration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[232]  arXiv:2112.02290 [pdf, other]
Title: Interactive Disentanglement: Learning Concepts by Interacting with their Prototype Representations
Comments: To be published in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[233]  arXiv:2112.02297 [pdf, other]
Title: Ablation study of self-supervised learning for image classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[234]  arXiv:2112.02300 [pdf, other]
Title: Unsupervised Domain Generalization by Learning a Bridge Across Domains
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[235]  arXiv:2112.02303 [pdf, other]
Title: An Annotated Video Dataset for Computing Video Memorability
Comments: 11 pages
Journal-ref: Data in Brief, Volume 39, 107671, (2021), ISSN 2352-3409
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[236]  arXiv:2112.02306 [pdf, other]
Title: Toward Practical Monocular Indoor Depth Estimation
Comments: Accepted to CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[237]  arXiv:2112.02308 [pdf, other]
Title: MoFaNeRF: Morphable Facial Neural Radiance Field
Comments: accepted to ECCV2022; code available at this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[238]  arXiv:2112.02338 [pdf, other]
Title: Generalized Binary Search Network for Highly-Efficient Multi-View Stereo
Comments: 16 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[239]  arXiv:2112.02340 [pdf, other]
Title: Scanpath Prediction on Information Visualisations
Comments: 11 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[240]  arXiv:2112.02353 [pdf, other]
Title: Label Hierarchy Transition: Delving into Class Hierarchies to Enhance Deep Classifiers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[241]  arXiv:2112.02355 [pdf, other]
Title: SITA: Single Image Test-time Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[242]  arXiv:2112.02359 [pdf, other]
Title: Unsupervised Adaptation of Semantic Segmentation Models without Source Data
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[243]  arXiv:2112.02363 [pdf, other]
Title: CAVER: Cross-Modal View-Mixed Transformer for Bi-Modal Salient Object Detection
Comments: Accepted by TIP-2023. Add more details and update the weight illustration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[244]  arXiv:2112.02373 [pdf, other]
Title: 3rd Place: A Global and Local Dual Retrieval Solution to Facebook AI Image Similarity Challenge
Comments: This is the 3rd place solution for Facebook Image Similarity Challenge and NIPS2021 Workshop. The current first draft version will be updated later
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[245]  arXiv:2112.02379 [pdf, other]
Title: LTT-GAN: Looking Through Turbulence by Inverting GANs
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[246]  arXiv:2112.02399 [pdf, other]
Title: VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[247]  arXiv:2112.02413 [pdf, other]
Title: PointCLIP: Point Cloud Understanding by CLIP
Comments: Open sourced, Code and Model Available
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[248]  arXiv:2112.02416 [pdf, other]
Title: Gated2Gated: Self-Supervised Depth Estimation from Gated Images
Comments: 11 pages, 6 Figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[249]  arXiv:2112.02447 [pdf, other]
Title: Next Day Wildfire Spread: A Machine Learning Data Set to Predict Wildfire Spreading from Remote-Sensing Data
Comments: submitted to IEEE Transactions on Geoscience and Remote Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[250]  arXiv:2112.02450 [pdf, other]
Title: Adaptive Feature Interpolation for Low-Shot Image Generation
Comments: ECCV'22. Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[251]  arXiv:2112.02459 [pdf, other]
Title: SSAGCN: Social Soft Attention Graph Convolution Network for Pedestrian Trajectory Prediction
Comments: 14 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[252]  arXiv:2112.02466 [pdf, ps, other]
Title: Pose-guided Feature Disentangling for Occluded Person Re-identification Based on Transformer
Comments: Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[253]  arXiv:2112.02469 [pdf, other]
Title: RADA: Robust Adversarial Data Augmentation for Camera Localization in Challenging Weather
Subjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[254]  arXiv:2112.02475 [pdf, other]
Title: Deblurring via Stochastic Refinement
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[255]  arXiv:2112.02487 [pdf, other]
Title: Face Trees for Expression Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[256]  arXiv:2112.02494 [pdf, other]
Title: Implicit Neural Deformation for Sparse-View Face Reconstruction
Comments: 10 pages, 6 figures, The 30th Pacific Conference on Computer Graphics and Applications, Pacific Graphics (PG) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[257]  arXiv:2112.02500 [pdf, other]
Title: MovieNet-PS: A Large-Scale Person Search Dataset in the Wild
Comments: ICASSP 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[258]  arXiv:2112.02507 [pdf, ps, other]
Title: Adaptive Channel Encoding Transformer for Point Cloud Analysis
Comments: ICANN2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[259]  arXiv:2112.02509 [pdf, ps, other]
Title: Adaptive Channel Encoding for Point Cloud Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[260]  arXiv:2112.02520 [pdf, other]
Title: Neural Photometry-guided Visual Attribute Transfer
Comments: 13 pages. To be published in Transactions on Visualizations and Computer Graphics. Project website: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[261]  arXiv:2112.02523 [pdf, other]
Title: STSM: Spatio-Temporal Shift Module for Efficient Action Recognition
Comments: 9 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[262]  arXiv:2112.02535 [pdf, other]
Title: End-to-End Segmentation via Patch-wise Polygons Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[263]  arXiv:2112.02571 [pdf, other]
Title: Learning Tracking Representations via Dual-Branch Fully Transformer Networks
Comments: ICCV21 Workshops
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[264]  arXiv:2112.02582 [pdf, other]
Title: PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation
Comments: Accepted by ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[265]  arXiv:2112.02597 [pdf, other]
Title: Constrained Adaptive Projection with Pretrained Features for Anomaly Detection
Comments: Accepted to IJCAI 2022 Main Track. This version includes 6 pages of main paper, 2 pages of Appendix
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[266]  arXiv:2112.02604 [pdf, other]
Title: PSI: A Pedestrian Behavior Dataset for Socially Intelligent Autonomous Car
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[267]  arXiv:2112.02624 [pdf, other]
Title: Dynamic Token Normalization Improves Vision Transformers
Comments: Published at ICLR'22; 18 pages, 12 Tables, 9 Figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[268]  arXiv:2112.02644 [pdf, other]
Title: Boosting Mobile CNN Inference through Semantic Memory
Comments: 13 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[269]  arXiv:2112.02666 [pdf, other]
Title: Learning Query Expansion over the Nearest Neighbor Graph
Comments: BMVC 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[270]  arXiv:2112.02713 [pdf, other]
Title: Joint Symmetry Detection and Shape Matching for Non-Rigid Point Cloud
Comments: Under Review. arXiv admin note: substantial text overlap with arXiv:2110.02994
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[271]  arXiv:2112.02719 [pdf, other]
Title: A Survey on Deep learning based Document Image Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[272]  arXiv:2112.02725 [pdf, other]
Title: A hybrid convolutional neural network/active contour approach to segmenting dead trees in aerial imagery
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[273]  arXiv:2112.02729 [pdf, other]
Title: Facial Emotion Characterization and Detection using Fourier Transform and Machine Learning
Comments: 8 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[274]  arXiv:2112.02747 [pdf, other]
Title: Making a Bird AI Expert Work for You and Me
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[275]  arXiv:2112.02749 [pdf, other]
Title: One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning
Comments: Accepted by AAAI 2022
Journal-ref: AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[276]  arXiv:2112.02753 [pdf, other]
Title: MobRecon: Mobile-Friendly Hand Mesh Reconstruction from Monocular Image
Journal-ref: CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[277]  arXiv:2112.02763 [pdf, other]
Title: MetaCloth: Learning Unseen Tasks of Dense Fashion Landmark Detection from a Few Samples
Comments: Accepted by IEEE Transactions on Image Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[278]  arXiv:2112.02772 [pdf, other]
Title: ActiveZero: Mixed Domain Learning for Active Stereovision with Zero Annotation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[279]  arXiv:2112.02779 [pdf, other]
Title: Revisiting LiDAR Registration and Reconstruction: A Range Image Perspective
Comments: 14 pages, 9 figures. This paper is under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[280]  arXiv:2112.02781 [pdf, other]
Title: Adjusting the Ground Truth Annotations for Connectivity-Based Learning to Delineate
Journal-ref: IEEE Transactions on Medical Imaging ( Volume: 41, Issue: 12, December 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[281]  arXiv:2112.02788 [pdf, other]
Title: Texture Reformer: Towards Fast and Universal Interactive Texture Transfer
Comments: Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[282]  arXiv:2112.02789 [pdf, other]
Title: HumanNeRF: Efficiently Generated Human Radiance Field from Sparse Inputs
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[283]  arXiv:2112.02805 [pdf, other]
Title: Forward Compatible Training for Large-Scale Embedding Retrieval Systems
Comments: 14 pages with appendix. In proceedings at the conference on Computer Vision and Pattern Recognition 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[284]  arXiv:2112.02814 [pdf, other]
Title: A Survey of Deep Learning for Low-Shot Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[285]  arXiv:2112.02815 [pdf, other]
Title: Make It Move: Controllable Image-to-Video Generation with Text Descriptions
Comments: Accepted by CVPR'2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[286]  arXiv:2112.02824 [pdf, ps, other]
Title: Letter-level Online Writer Identification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[287]  arXiv:2112.02825 [pdf, other]
Title: Clue Me In: Semi-Supervised FGVC with Out-of-Distribution Data
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[288]  arXiv:2112.02828 [pdf, other]
Title: PP-MSVSR: Multi-Stage Video Super-Resolution
Comments: 8 pages, 6 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[289]  arXiv:2112.02829 [pdf, other]
Title: SyntEO: Synthetic Data Set Generation for Earth Observation and Deep Learning -- Demonstrated for Offshore Wind Farm Detection
Comments: 29 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[290]  arXiv:2112.02834 [pdf, other]
Title: A Generalized Zero-Shot Quantization of Deep Convolutional Neural Networks via Learned Weights Statistics
Comments: Accepted by IEEE Transactions on Multimedia
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[291]  arXiv:2112.02838 [pdf, other]
Title: Visual Object Tracking with Discriminative Filters and Siamese Networks: A Survey and Outlook
Comments: Tracking Survey
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[292]  arXiv:2112.02841 [pdf, other]
Title: GETAM: Gradient-weighted Element-wise Transformer Attention Map for Weakly-supervised Semantic segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[293]  arXiv:2112.02851 [pdf, other]
Title: No-Reference Point Cloud Quality Assessment via Domain Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[294]  arXiv:2112.02853 [pdf, other]
Title: Reliable Propagation-Correction Modulation for Video Object Segmentation
Comments: 13 pages, 8 figures, AAAI 2022 Accepted
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[295]  arXiv:2112.02857 [pdf, other]
Title: PTTR: Relational 3D Point Cloud Object Tracking with Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[296]  arXiv:2112.02862 [pdf, other]
Title: SelectAugment: Hierarchical Deterministic Sample Selection for Data Augmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[297]  arXiv:2112.02869 [pdf, ps, other]
Title: Physics Driven Deep Retinex Fusion for Adaptive Infrared and Visible Image Fusion
Comments: 20 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[298]  arXiv:2112.02889 [pdf, other]
Title: Joint Learning of Localized Representations from Medical Images and Reports
Comments: Accepted at ECCV 2022
Journal-ref: Computer Vision - ECCV 2022, pp. 685-701
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[299]  arXiv:2112.02891 [pdf, other]
Title: Seeing Objects in dark with Continual Contrastive Learning
Authors: Ujjal Kr Dutta
Comments: Accepted in European Conference on Computer Vision (ECCV) 2022 Workshops: IWDSC
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[300]  arXiv:2112.02902 [pdf, other]
Title: Interpretable Image Classification with Differentiable Prototypes Assignment
Comments: Accepted to ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[301]  arXiv:2112.02906 [pdf, other]
Title: ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction
Comments: 11 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[302]  arXiv:2112.02910 [pdf, other]
Title: A Tale of Color Variants: Representation and Self-Supervised Learning in Fashion E-Commerce
Comments: In Annual Conference on Innovative Applications of Artificial Intelligence (IAAI)/ AAAI Conference on Artificial Intelligence (AAAI) 2022. arXiv admin note: substantial text overlap with arXiv:2104.08581
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[303]  arXiv:2112.02922 [pdf, other]
Title: Anomaly Detection in IR Images of PV Modules using Supervised Contrastive Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[304]  arXiv:2112.02953 [pdf, ps, other]
Title: The artificial synesthete: Image-melody translations with variational autoencoders
Comments: 7 pages, 4 figures, supplementary media can be downloaded at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[305]  arXiv:2112.02990 [pdf, other]
Title: 4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding
Comments: Accepted by ECCV 2022, Video: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[306]  arXiv:2112.02991 [pdf, other]
Title: Cross-Modality Attentive Feature Fusion for Object Detection in Multispectral Remote Sensing Imagery
Comments: 23 pages, 11 figures, under consideration at Pattern Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[307]  arXiv:2112.03020 [pdf, other]
Title: Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning
Comments: Accepted as a Regular Paper in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[308]  arXiv:2112.03044 [pdf, other]
Title: Fusion Detection via Distance-Decay IoU and weighted Dempster-Shafer Evidence Theory
Comments: 18 pages, 7 pages, under consideration at Journal of Aerospace Information Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[309]  arXiv:2112.03045 [pdf, other]
Title: 3D Hierarchical Refinement and Augmentation for Unsupervised Learning of Depth and Pose from Monocular Video
Comments: 10 pages, 7 figures, under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[310]  arXiv:2112.03051 [pdf, other]
Title: Controllable Animation of Fluid Elements in Still Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[311]  arXiv:2112.03109 [pdf, other]
Title: General Facial Representation Learning in a Visual-Linguistic Manner
Comments: CVPR2022 Oral; 16 pages, 6 figures, 14 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[312]  arXiv:2112.03111 [pdf, ps, other]
Title: Ethics and Creativity in Computer Vision
Comments: Neural Information Processing System 2021 workshop on Machine Learning for Creativity and Design
Journal-ref: NeurIPS 2021 workshop on Machine Learning for Creativity and Design
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[313]  arXiv:2112.03126 [pdf, other]
Title: Label-Efficient Semantic Segmentation with Diffusion Models
Comments: ICLR'2022; v3: camera ready
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[314]  arXiv:2112.03145 [pdf, other]
Title: Diffusion Models for Implicit Image Segmentation Ensembles
Comments: In this version, we updated the results section with more detailed evaluations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[315]  arXiv:2112.03162 [pdf, other]
Title: Embedding Arithmetic of Multimodal Queries for Image Retrieval
Comments: accepted at O-DRUM (CVPR workshop 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[316]  arXiv:2112.03163 [pdf, other]
Title: Encouraging Disentangled and Convex Representation with Controllable Interpolation Regularization
Comments: 17 pages, 19 figures (including appendix)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[317]  arXiv:2112.03184 [pdf, other]
Title: HIVE: Evaluating the Human Interpretability of Visual Explanations
Comments: ECCV 2022. Code and supplementary material are at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[318]  arXiv:2112.03185 [pdf, other]
Title: Semantic Segmentation In-the-Wild Without Seeing Any Segmentation Examples
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319]  arXiv:2112.03205 [pdf, other]
Title: Simultaneously Predicting Multiple Plant Traits from Multiple Sensors via Deformable CNN Regression
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[320]  arXiv:2112.03221 [pdf, other]
Title: Text2Mesh: Text-Driven Neural Stylization for Meshes
Comments: project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Graphics (cs.GR)
[321]  arXiv:2112.03223 [pdf, other]
Title: Context-Aware Transfer Attacks for Object Detection
Comments: accepted to AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[322]  arXiv:2112.03237 [pdf, other]
Title: From Coarse to Fine-grained Concept based Discrimination for Phrase Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[323]  arXiv:2112.03241 [pdf, other]
Title: Unsupervised Domain Adaptation for Semantic Image Segmentation: a Comprehensive Survey
Comments: 33 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[324]  arXiv:2112.03243 [pdf, other]
Title: Input-level Inductive Biases for 3D Reconstruction
Comments: CVPR 2022, including supplemental material
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[325]  arXiv:2112.03252 [pdf, other]
Title: CSG0: Continual Urban Scene Generation with Zero Forgetting
Comments: Published at the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022 Workshop on Continual Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[326]  arXiv:2112.03258 [pdf, other]
Title: DoodleFormer: Creative Sketch Drawing with Transformers
Comments: Accepted to ECCV-2022. Project webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[327]  arXiv:2112.03288 [pdf, other]
Title: Dense Depth Priors for Neural Radiance Fields from Sparse Input Views
Comments: CVPR 2022, project page: this https URL , video: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[328]  arXiv:2112.03325 [pdf, other]
Title: Self-Supervised Camera Self-Calibration from Video
Comments: The project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[329]  arXiv:2112.03328 [pdf, other]
Title: Learning Connectivity with Graph Convolutional Networks for Skeleton-based Action Recognition
Authors: Hichem Sahbi
Comments: arXiv admin note: text overlap with arXiv:2104.04255, arXiv:2104.05482
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[330]  arXiv:2112.03340 [pdf, other]
Title: Label Hallucination for Few-Shot Classification
Comments: Accepted by AAAI 2022. Code is available: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[331]  arXiv:2112.03415 [pdf, other]
Title: Producing augmentation-invariant embeddings from real-life imagery
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[332]  arXiv:2112.03423 [pdf, other]
Title: Hybrid SNN-ANN: Energy-Efficient Classification and Object Detection for Event-Based Vision
Comments: Accepted at DAGM German Conference on Pattern Recognition (GCPR 2021)
Journal-ref: Pattern Recognition. DAGM GCPR 2021. Lecture Notes in Computer Science, vol 13024. Springer, Cham., pp. 297-312
Subjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[333]  arXiv:2112.03424 [pdf, other]
Title: Learning to Solve Hard Minimal Problems
Comments: 24 pages total: 14 pages main paper and 10 pages supplementary
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[334]  arXiv:2112.03444 [pdf, other]
Title: GPU-Based Homotopy Continuation for Minimal Problems in Computer Vision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[335]  arXiv:2112.03451 [pdf, other]
Title: Deep Level Set for Box-supervised Instance Segmentation in Aerial Images
Comments: 10 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[336]  arXiv:2112.03471 [pdf, other]
Title: Voxelized 3D Feature Aggregation for Multiview Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[337]  arXiv:2112.03485 [pdf, other]
Title: VizExtract: Automatic Relation Extraction from Data Visualizations
Comments: 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[338]  arXiv:2112.03492 [pdf, other]
Title: Decision-based Black-box Attack Against Vision Transformers via Patch-wise Adversarial Removal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[339]  arXiv:2112.03494 [pdf, other]
Title: Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning
Comments: ECCV2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[340]  arXiv:2112.03517 [pdf, other]
Title: CG-NeRF: Conditional Generative Neural Radiance Fields
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[341]  arXiv:2112.03530 [pdf, other]
Title: A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion
Comments: Accepted to ICLR 2022. Code is released at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[342]  arXiv:2112.03549 [pdf, other]
Title: GaTector: A Unified Framework for Gaze Object Prediction
Comments: CVPR 2022, camera ready
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[343]  arXiv:2112.03552 [pdf, other]
Title: Bootstrapping ViTs: Towards Liberating Vision Transformers from Pre-training
Comments: Accepted as a conference paper by CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[344]  arXiv:2112.03553 [pdf, other]
Title: ADD: Frequency Attention and Multi-View based Knowledge Distillation to Detect Low-Quality Compressed Deepfake Images
Journal-ref: Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[345]  arXiv:2112.03562 [pdf, other]
Title: CMA-CLIP: Cross-Modality Attention CLIP for Image-Text Classification
Comments: 9 pages, 2 figures, 6 tables, 1 algorithm
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[346]  arXiv:2112.03568 [pdf, other]
Title: Unsupervised Learning of Compositional Scene Representations from Multiple Unspecified Viewpoints
Comments: AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[347]  arXiv:2112.03587 [pdf, other]
Title: TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning
Comments: This work has been published in IEEE Transactions on Image Processing. The code is publicly available at this https URL. arXiv admin note: substantial text overlap with arXiv:2101.00820
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[348]  arXiv:2112.03590 [pdf, other]
Title: Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition
Comments: Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[349]  arXiv:2112.03592 [pdf, other]
Title: Parallel Discrete Convolutions on Adaptive Particle Representations of Images
Comments: 18 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF); Image and Video Processing (eess.IV)
[350]  arXiv:2112.03596 [pdf, other]
Title: E$^2$(GO)MOTION: Motion Augmented Event Stream for Egocentric Action Recognition
Comments: To be presented at CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[351]  arXiv:2112.03603 [pdf, other]
Title: Handwritten Mathematical Expression Recognition via Attention Aggregation based Bi-directional Mutual Learning
Comments: 9 pages, 5 figures; accepted at AAAI 2022 (Oral)
Journal-ref: AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[352]  arXiv:2112.03612 [pdf, other]
Title: DCAN: Improving Temporal Action Detection via Dual Context Aggregation
Comments: AAAI 2022 camera ready version
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353]  arXiv:2112.03615 [pdf, other]
Title: Saliency Diversified Deep Ensemble for Robustness to Adversaries
Comments: Accepted to AAAI Workshop on Adversarial Machine Learning and Beyond 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[354]  arXiv:2112.03624 [pdf, other]
Title: Time-Equivariant Contrastive Video Representation Learning
Comments: ICCV 2021 (oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[355]  arXiv:2112.03631 [pdf, other]
Title: SSAT: A Symmetric Semantic-Aware Transformer Network for Makeup Transfer and Removal
Comments: Accepted to AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[356]  arXiv:2112.03632 [pdf, other]
Title: Generation of Non-Deterministic Synthetic Face Datasets Guided by Identity Priors
Journal-ref: https://www.ntnu.edu/nikt2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[357]  arXiv:2112.03641 [pdf, other]
Title: Gram-SLD: Automatic Self-labeling and Detection for Instance Objects
Comments: 37 pages with 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[358]  arXiv:2112.03649 [pdf, other]
Title: Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[359]  arXiv:2112.03650 [pdf, other]
Title: Activation to Saliency: Forming High-Quality Labels for Completely Unsupervised Salient Object Detection
Comments: 11 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[360]  arXiv:2112.03690 [pdf, other]
Title: Low-rank Tensor Decomposition for Compression of Convolutional Neural Networks Using Funnel Regularization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)
[361]  arXiv:2112.03728 [pdf, other]
Title: Flexible Networks for Learning Physical Dynamics of Deformable Objects
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[362]  arXiv:2112.03731 [pdf, other]
Title: SalFBNet: Learning Pseudo-Saliency Distribution via Feedback Convolutional Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[363]  arXiv:2112.03736 [pdf, other]
Title: Gaussian map predictions for 3D surface feature localisation and counting
Comments: BMVC 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[364]  arXiv:2112.03740 [pdf, other]
Title: Dilated convolution with learnable spacings
Comments: Published in The Eleventh International Conference on Learning Representations (ICLR) 2023. (this https URL)
Journal-ref: The Eleventh International Conference on Learning Representations ICLR 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[365]  arXiv:2112.03750 [pdf, other]
Title: Wild ToFu: Improving Range and Quality of Indirect Time-of-Flight Depth with RGB Fusion in Challenging Environments
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[366]  arXiv:2112.03777 [pdf, other]
Title: Variance-Aware Weight Initialization for Point Convolutional Neural Networks
Comments: Accepted at ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[367]  arXiv:2112.03803 [pdf, other]
Title: Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning
Comments: AAAI2022. v2: Add supplementary
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[368]  arXiv:2112.03810 [pdf, other]
Title: Polarimetric Pose Prediction
Comments: Accepted at ECCV 2022; 25 pages (14 main paper + References + 7 Appendix)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[369]  arXiv:2112.03814 [pdf, other]
Title: A Contrastive Distillation Approach for Incremental Semantic Segmentation in Aerial Images
Comments: 12 pages, ICIAP 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[370]  arXiv:2112.03842 [pdf, other]
Title: A Survey on Intrinsic Images: Delving Deep Into Lambert and Beyond
Comments: Accepted at International Journal of Computer Vision (to appear in 2022) this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[371]  arXiv:2112.03857 [pdf, other]
Title: Grounded Language-Image Pre-training
Comments: CVPR 2022; updated visualizations; fixed hyper-parameters in Appendix C.1
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[372]  arXiv:2112.03860 [pdf, other]
Title: Differentiable Gaussianization Layers for Inverse Problems Regularized by Deep Generative Models
Authors: Dongzhuo Li
Comments: ICLR 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[373]  arXiv:2112.03902 [pdf, other]
Title: MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection
Comments: Accepted in CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[374]  arXiv:2112.03905 [pdf, other]
Title: ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints
Comments: 13 pages. Code and models will be updated soon
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[375]  arXiv:2112.03906 [pdf, other]
Title: Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning
Comments: Accepted at MVA 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[376]  arXiv:2112.03907 [pdf, other]
Title: Ref-NeRF: Structured View-Dependent Appearance for Neural Radiance Fields
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[377]  arXiv:2112.03909 [pdf, other]
Title: Vehicle trajectory prediction works, but not everywhere
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[378]  arXiv:2112.03917 [pdf, other]
Title: Scalable 3D Semantic Segmentation for Gun Detection in CT Scans
Comments: This work was part of the Project Lab Deep Learning in Computer Vision Winter Semester 2019/2020 at TU Darmstadt
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[379]  arXiv:2112.03951 [pdf, other]
Title: Few-Shot Image Classification Along Sparse Graphs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[380]  arXiv:2112.04011 [pdf, other]
Title: Auxiliary Learning for Self-Supervised Video Representation via Similarity-based Knowledge Distillation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[381]  arXiv:2112.04016 [pdf, other]
Title: DeepFace-EMD: Re-ranking Using Patch-wise Earth Mover's Distance Improves Out-Of-Distribution Face Identification
Authors: Hai Phan, Anh Nguyen
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[382]  arXiv:2112.04021 [pdf, other]
Title: A Robust Completed Local Binary Pattern (RCLBP) for Surface Defect Detection
Comments: Accepted to IEEE SMC 2021 as a special invited session paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[383]  arXiv:2112.04033 [pdf, other]
Title: Image classifiers can not be made robust to small perturbations
Comments: 8 pages, 2 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[384]  arXiv:2112.04038 [pdf, ps, other]
Title: Presentation Attack Detection Methods based on Gaze Tracking and Pupil Dynamic: A Comprehensive Survey
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[385]  arXiv:2112.04042 [pdf, ps, other]
Title: Vision-Cloud Data Fusion for ADAS: A Lane Change Prediction Case Study
Comments: Published in IEEE Transactions on Intelligent Vehicles
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[386]  arXiv:2112.04054 [pdf, other]
Title: GreenPCO: An Unsupervised Lightweight Point Cloud Odometry Method
Comments: 10 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[387]  arXiv:2112.04107 [pdf, other]
Title: Fully Context-Aware Image Inpainting with a Learned Semantic Pyramid
Comments: Accepted by Pattern Recognition, 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[388]  arXiv:2112.04108 [pdf, other]
Title: Fully Attentional Network for Semantic Segmentation
Comments: Accepted by AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[389]  arXiv:2112.04120 [pdf, other]
Title: Feature Statistics Mixing Regularization for Generative Adversarial Networks
Comments: Accepted to CVPR 2022. Our code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[390]  arXiv:2112.04138 [pdf, other]
Title: Contrastive Instruction-Trajectory Learning for Vision-Language Navigation
Comments: Accepted by AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[391]  arXiv:2112.04148 [pdf, other]
Title: Neural Points: Point Cloud Representation with Neural Fields for Arbitrary Upsampling
Comments: Accepted to CVPR2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392]  arXiv:2112.04150 [pdf, other]
Title: BA-Net: Bridge Attention for Deep Convolutional Neural Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[393]  arXiv:2112.04154 [pdf, other]
Title: SNEAK: Synonymous Sentences-Aware Adversarial Attack on Natural Language Video Localization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[394]  arXiv:2112.04159 [pdf, other]
Title: Garment4D: Garment Reconstruction from Point Cloud Sequences
Comments: Accepted to NeurIPS 2021. Project Page: this https URL . Codes are available: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[395]  arXiv:2112.04162 [pdf, other]
Title: Symmetry Perception by Deep Networks: Inadequacy of Feed-Forward Architectures and Improvements with Recurrent Connections
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[396]  arXiv:2112.04163 [pdf, other]
Title: Assessing a Single Image in Reference-Guided Image Synthesis
Comments: Accepted by AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[397]  arXiv:2112.04165 [pdf, other]
Title: Shortest Paths in Graphs with Matrix-Valued Edges: Concepts, Algorithm and Application to 3D Multi-Shape Analysis
Comments: published at 3DV
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Optimization and Control (math.OC)
[398]  arXiv:2112.04174 [pdf, other]
Title: Boosting Contrastive Learning with Relation Knowledge Distillation
Comments: Accepted by AAAI-2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399]  arXiv:2112.04177 [pdf, other]
Title: VISOLO: Grid-Based Space-Time Aggregation for Efficient Online Video Instance Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[400]  arXiv:2112.04178 [pdf, other]
Title: Topology-aware Convolutional Neural Network for Efficient Skeleton-based Action Recognition
Comments: Accepted by AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[401]  arXiv:2112.04182 [pdf, other]
Title: Unimodal Face Classification with Multimodal Training
Comments: Accepted by IEEE International Conference On Automatic Face and Gesture Recognition 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402]  arXiv:2112.04185 [pdf, other]
Title: Transformaly -- Two (Feature Spaces) Are Better Than One
Comments: CVPR Workshop, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[403]  arXiv:2112.04189 [pdf, other]
Title: Transformer-Based Approach for Joint Handwriting and Named Entity Recognition in Historical documents
Journal-ref: Pattern Recognition Letters, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[404]  arXiv:2112.04203 [pdf, other]
Title: Adversarial Parametric Pose Prior
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[405]  arXiv:2112.04212 [pdf, other]
Title: Do Pedestrians Pay Attention? Eye Contact Detection in the Wild
Comments: Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[406]  arXiv:2112.04215 [pdf, other]
Title: Self-Supervised Models are Continual Learners
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[407]  arXiv:2112.04222 [pdf, other]
Title: Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs
Comments: Accepted by CVPR 2022. Code is available at this https URL. We also won 1st place in the Video Relation Understanding (VRU) Grand Challenge at ACM Multimedia 2021 with a simplified version of our model. (The code for object tracklet generation is available at this https URL.)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[408]  arXiv:2112.04223 [pdf, other]
Title: Progressive Multi-stage Interactive Training in Mobile Network for Fine-grained Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[409]  arXiv:2112.04228 [pdf, other]
Title: SimulSLT: End-to-End Simultaneous Sign Language Translation
Comments: Accepted by ACM Multimedia 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[410]  arXiv:2112.04255 [pdf, other]
Title: Feature matching for multi-epoch historical aerial images
Comments: 34 pages
Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing, 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[411]  arXiv:2112.04278 [pdf, other]
Title: DMRVisNet: Deep Multi-head Regression Network for Pixel-wise Visibility Estimation Under Foggy Weather
Comments: 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[412]  arXiv:2112.04283 [pdf, other]
Title: Adverse Weather Image Translation with Asymmetric and Uncertainty-aware GAN
Comments: BMVC 2021. Code is available here: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[413]  arXiv:2112.04294 [pdf, other]
Title: A Hierarchical Spatio-Temporal Graph Convolutional Neural Network for Anomaly Detection in Videos
Comments: Accepted to IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[414]  arXiv:2112.04298 [pdf, other]
Title: GCA-Net : Utilizing Gated Context Attention for Improving Image Forgery Localization and Detection
Comments: Accepted for publication at the CVPR 2022 Media Forensics Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[415]  arXiv:2112.04312 [pdf, other]
Title: Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[416]  arXiv:2112.04323 [pdf, other]
Title: Contrastive Learning with Large Memory Bank and Negative Embedding Subtraction for Accurate Copy Detection
Authors: Shuhei Yokoo
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[417]  arXiv:2112.04345 [pdf, other]
Title: Burn After Reading: Online Adaptation for Cross-domain Streaming Data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[418]  arXiv:2112.04367 [pdf, other]
Title: On visual self-supervision and its effect on model robustness
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[419]  arXiv:2112.04401 [pdf, other]
Title: FPPN: Future Pseudo-LiDAR Frame Prediction for Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[420]  arXiv:2112.04417 [pdf, other]
Title: What I Cannot Predict, I Do Not Understand: A Human-Centered Evaluation Framework for Explainability Methods
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[421]  arXiv:2112.04421 [pdf, other]
Title: SoK: Vehicle Orientation Representations for Deep Rotation Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[422]  arXiv:2112.04432 [pdf, other]
Title: Audio-Visual Synchronisation in the wild
Subjects: Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[423]  arXiv:2112.04446 [pdf, other]
Title: Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval
Comments: CVPR2022. The final published version of the proceedings will be available on IEEE Xplore
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[424]  arXiv:2112.04453 [pdf, other]
Title: MLP Architectures for Vision-and-Language Modeling: An Empirical Study
Comments: 15 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[425]  arXiv:2112.04477 [pdf, other]
Title: Tracking People by Predicting 3D Appearance, Location & Pose
Comments: Project Page : this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[426]  arXiv:2112.04478 [pdf, other]
Title: Prompting Visual-Language Models for Efficient Video Understanding
Comments: ECCV 2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[427]  arXiv:2112.04480 [pdf, other]
Title: Exploring Temporal Granularity in Self-Supervised Video Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[428]  arXiv:2112.04481 [pdf, other]
Title: What's Behind the Couch? Directed Ray Distance Functions (DRDF) for 3D Scene Reconstruction
Comments: Updated illustrations for method section. Project Page see this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[429]  arXiv:2112.04482 [pdf, other]
Title: FLAVA: A Foundational Language And Vision Alignment Model
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[430]  arXiv:2112.04497 [pdf, other]
Title: SIRfyN: Single Image Relighting from your Neighbors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[431]  arXiv:2112.04532 [pdf, other]
Title: Segment and Complete: Defending Object Detectors against Adversarial Patch Attacks with Robust Patch Detection
Comments: CVPR 2022 camera ready
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Image and Video Processing (eess.IV)
[432]  arXiv:2112.04564 [pdf, other]
Title: CoSSL: Co-Learning of Representation and Classifier for Imbalanced Semi-Supervised Learning
Comments: Published at CVPR 2022 as a conference paper. Code at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[433]  arXiv:2112.04585 [pdf, other]
Title: MASTAF: A Model-Agnostic Spatio-Temporal Attention Fusion Network for Few-shot Video Classification
Comments: WACV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[434]  arXiv:2112.04598 [pdf, other]
Title: InvGAN: Invertible GANs
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[435]  arXiv:2112.04603 [pdf, other]
Title: A Unified Architecture of Semantic Segmentation and Hierarchical Generative Adversarial Networks for Expression Manipulation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[436]  arXiv:2112.04607 [pdf, other]
Title: Constrained Mean Shift Using Distant Yet Related Neighbors for Representation Learning
Comments: Code is available at this https URL. arXiv admin note: text overlap with arXiv:2110.10309
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[437]  arXiv:2112.04608 [pdf, other]
Title: Enhancing Food Intake Tracking in Long-Term Care with Automated Food Imaging and Nutrient Intake Tracking (AFINI-T) Technology
Comments: Key words: Automatic segmentation, convolutional neural network, deep learning, food intake tracking, volume estimation, malnutrition prevention, long-term care, hospital
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[438]  arXiv:2112.04610 [pdf, other]
Title: A Simple and efficient deep Scanpath Prediction
Comments: Electronic Imaging Symposium 2022 (EI 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[439]  arXiv:2112.04628 [pdf, other]
Title: Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection
Journal-ref: Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[440]  arXiv:2112.04632 [pdf, other]
Title: Recurrent Glimpse-based Decoder for Detection with Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[441]  arXiv:2112.04645 [pdf, other]
Title: BACON: Band-limited Coordinate Networks for Multiscale Scene Representation
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[442]  arXiv:2112.04662 [pdf, other]
Title: Dual Cluster Contrastive learning for Object Re-Identification
Comments: 12 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[443]  arXiv:2112.04665 [pdf, other]
Title: Style Mixing and Patchwise Prototypical Matching for One-Shot Unsupervised Domain Adaptive Semantic Segmentation
Comments: Accepted by AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[444]  arXiv:2112.04674 [pdf, other]
Title: DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition
Comments: Accepted by ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[445]  arXiv:2112.04680 [pdf, other]
Title: SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-Training for Spatial-Aware Visual Representations
Comments: Accepted to 36th AAAI Conference on Artificial Intelligence (AAAI 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[446]  arXiv:2112.04701 [pdf, other]
Title: Unsupervised Complementary-aware Multi-process Fusion for Visual Place Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[447]  arXiv:2112.04702 [pdf, other]
Title: Fast Point Transformer
Comments: Accepted to CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[448]  arXiv:2112.04709 [pdf, other]
Title: Implicit Feature Refinement for Instance Segmentation
Comments: Published at ACM MM 2021. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[449]  arXiv:2112.04710 [pdf, other]
Title: Auto-X3D: Ultra-Efficient Video Understanding via Finer-Grained Neural Architecture Search
Comments: Accepted by WACV'2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[450]  arXiv:2112.04719 [pdf, other]
Title: Learning with Nested Scene Modeling and Cooperative Architecture Search for Low-Light Vision
Comments: Submitted to IEEE TPAMI. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[451]  arXiv:2112.04720 [pdf, other]
Title: Amicable Aid: Perturbing Images to Improve Classification Performance
Comments: ICASSP 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[452]  arXiv:2112.04731 [pdf, other]
Title: Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning
Comments: CVPR 2022 Camera-Ready Version
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[453]  arXiv:2112.04744 [pdf, other]
Title: Superpixel-Based Building Damage Detection from Post-earthquake Very High Resolution Imagery Using Deep Neural Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[454]  arXiv:2112.04752 [pdf, other]
Title: Modelling Lips-State Detection Using CNN for Non-Verbal Communications
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[455]  arXiv:2112.04761 [pdf, other]
Title: HBReID: Harder Batch for Re-identification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[456]  arXiv:2112.04764 [pdf, other]
Title: 3D-VField: Adversarial Augmentation of Point Clouds for Domain Generalization in 3D Object Detection
Comments: CVPR 2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[457]  arXiv:2112.04771 [pdf, other]
Title: Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
Comments: CVPR 2022 camera-ready version
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[458]  arXiv:2112.04827 [pdf, other]
Title: Explainability of the Implications of Supervised and Unsupervised Face Image Quality Estimations Through Activation Map Variation Analyses in Face Recognition Models
Comments: accepted at the IEEE Winter Conference on Applications of Computer Vision Workshops, WACV Workshops 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[459]  arXiv:2112.04840 [pdf, other]
Title: Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-guided Feature Imitation
Comments: Accepted by AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[460]  arXiv:2112.04846 [pdf, other]
Title: ScaleNet: A Shallow Architecture for Scale Estimation
Journal-ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[461]  arXiv:2112.04888 [pdf, other]
Title: A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer
Comments: 20 pages, 6 figures
Journal-ref: NeurIPS 2021 Track on Datasets and Benchmarks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[462]  arXiv:2112.04903 [pdf, other]
Title: PRA-Net: Point Relation-Aware Network for 3D Point Cloud Analysis
Comments: 13 pages
Journal-ref: IEEE Transactions on Image Processing, vol. 30, pp. 4436-4448, 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[463]  arXiv:2112.04928 [pdf, other]
Title: Self-Supervised Image-to-Text and Text-to-Image Synthesis
Comments: ICONIP 2021 : The 28th International Conference on Neural Information Processing
Journal-ref: ICONIP 2021. Lecture Notes in Computer Science, vol 13111, pp 415-426. Springer, Cham
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[464]  arXiv:2112.04934 [pdf, other]
Title: Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers
Comments: Accepted by AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[465]  arXiv:2112.04937 [pdf, other]
Title: DVHN: A Deep Hashing Framework for Large-scale Vehicle Re-identification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[466]  arXiv:2112.04966 [pdf, other]
Title: CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation
Comments: Appeared in ECCV2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[467]  arXiv:2112.04974 [pdf, other]
Title: AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach
Comments: To be published in International Journal of Computer Vision (IJCV)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[468]  arXiv:2112.04981 [pdf, other]
Title: PE-former: Pose Estimation Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[469]  arXiv:2112.05006 [pdf, other]
Title: Exploring Event-driven Dynamic Context for Accident Scene Segmentation
Comments: Accepted to IEEE Transactions on Intelligent Transportation Systems (T-ITS), extended version of arXiv:2008.08974, dataset and code will be made publicly available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[470]  arXiv:2112.05053 [pdf, ps, other]
Title: Illumination and Temperature-Aware Multispectral Networks for Edge-Computing-Enabled Pedestrian Detection
Comments: 13 pages, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[471]  arXiv:2112.05077 [pdf, other]
Title: Generating Useful Accident-Prone Driving Scenarios via a Learned Traffic Prior
Comments: CVPR 2022 camera-ready
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[472]  arXiv:2112.05080 [pdf, other]
Title: Locally Shifted Attention With Early Global Integration
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[473]  arXiv:2112.05112 [pdf, other]
Title: BLT: Bidirectional Layout Transformer for Controllable Layout Generation
Comments: ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[474]  arXiv:2112.05121 [pdf, other]
Title: Self-Supervised Keypoint Discovery in Behavioral Videos
Comments: CVPR 2022. Code: this https URL. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[475]  arXiv:2112.05126 [pdf, other]
Title: IterMVS: Iterative Probability Estimation for Efficient Multi-View Stereo
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[476]  arXiv:2112.05130 [pdf, other]
Title: Multimodal Conditional Image Synthesis with Product-of-Experts GANs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[477]  arXiv:2112.05131 [pdf, other]
Title: Plenoxels: Radiance Fields without Neural Networks
Comments: For video and code, please see this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[478]  arXiv:2112.05132 [pdf, other]
Title: Spatio-temporal Relation Modeling for Few-shot Action Recognition
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[479]  arXiv:2112.05134 [pdf, other]
Title: A Shared Representation for Photorealistic Driving Simulators
Comments: Accepted to IEEE Transactions on Intelligent Transportation Systems (T-ITS)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[480]  arXiv:2112.05136 [pdf, other]
Title: PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning
Comments: NeurIPS 2021. Project page: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[481]  arXiv:2112.05138 [pdf, other]
Title: Searching Parameterized AP Loss for Object Detection
Comments: Accepted by NeurIPS 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[482]  arXiv:2112.05139 [pdf, other]
Title: CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields
Comments: To Appear at CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[483]  arXiv:2112.05140 [pdf, other]
Title: NeRF for Outdoor Scene Relighting
Comments: 22 pages, 10 figures, 2 tables; ECCV 2022; project web page: this https URL
Journal-ref: European Conference on Computer Vision (ECCV) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[484]  arXiv:2112.05141 [pdf, other]
Title: Exploring the Equivalence of Siamese Self-Supervised Learning via A Unified Gradient Framework
Comments: CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[485]  arXiv:2112.05142 [pdf, other]
Title: HairCLIP: Design Your Hair by Text and Reference Image
Comments: To Appear at CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[486]  arXiv:2112.05143 [pdf, other]
Title: GAN-Supervised Dense Visual Alignment
Comments: An updated version of our CVPR 2022 paper (oral); v2 features additional references and minor text changes. Code available at this https URL . Project page and videos available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[487]  arXiv:2112.05144 [pdf, ps, other]
Title: Edge-aware Guidance Fusion Network for RGB Thermal Scene Parsing
Comments: Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[488]  arXiv:2112.05181 [pdf, other]
Title: Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[489]  arXiv:2112.05210 [pdf, other]
Title: 7th AI Driving Olympics: 1st Place Report for Panoptic Tracking
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[490]  arXiv:2112.05213 [pdf, other]
Title: Progressive Seed Generation Auto-encoder for Unsupervised Point Cloud Learning
Comments: ICCV2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[491]  arXiv:2112.05215 [pdf, other]
Title: Road Extraction from Overhead Images with Graph Neural Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[492]  arXiv:2112.05219 [pdf, other]
Title: CLIP2StyleGAN: Unsupervised Extraction of StyleGAN Edit Directions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[493]  arXiv:2112.05230 [pdf, other]
Title: Injecting Semantic Concepts into End-to-End Image Captioning
Journal-ref: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[494]  arXiv:2112.05236 [pdf, ps, other]
Title: KartalOl: Transfer learning using deep neural network for iris segmentation and localization: New dataset for iris segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[495]  arXiv:2112.05237 [pdf, ps, other]
Title: Transfer learning using deep neural networks for Ear Presentation Attack Detection: New Database for PAD
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[496]  arXiv:2112.05253 [pdf, other]
Title: MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning
Comments: 13 pages, 6 figures, 2 tables. Minor improvements. Accepted at EMNLP 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[497]  arXiv:2112.05267 [pdf, other]
Title: The Many Faces of Anger: A Multicultural Video Dataset of Negative Emotions in the Wild (MFA-Wild)
Comments: 8 pages, 13 figures, submitted to FG2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[498]  arXiv:2112.05277 [pdf, other]
Title: Skeletal Graph Self-Attention: Embedding a Skeleton Inductive Bias into Sign Language Production
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[499]  arXiv:2112.05280 [pdf, other]
Title: Long-Range Thermal 3D Perception in Low Contrast Environments
Comments: 13 pages, 16 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[500]  arXiv:2112.05290 [pdf, other]
Title: Image-to-Image Translation-based Data Augmentation for Robust EV Charging Inlet Detection
Comments: 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[501]  arXiv:2112.05291 [pdf, other]
Title: LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[502]  arXiv:2112.05295 [pdf, other]
Title: 3D Scene Understanding at Urban Intersection using Stereo Vision and Digital Map
Comments: 6 pages, 6 figures
Journal-ref: 2017 IEEE 85th Vehicular Technology Conference (VTC Spring)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[503]  arXiv:2112.05298 [pdf, other]
Title: IFR-Explore: Learning Inter-object Functional Relationships in 3D Indoor Scenes
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[504]  arXiv:2112.05300 [pdf, other]
Title: Representing 3D Shapes with Probabilistic Directed Distance Fields
Comments: 22 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[505]  arXiv:2112.05301 [pdf, other]
Title: Self-Ensemling for 3D Point Cloud Domain Adaption
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[506]  arXiv:2112.05324 [pdf, other]
Title: Attention-based Transformation from Latent Features to Point Clouds
Comments: 9 pages, 7 figures, AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[507]  arXiv:2112.05329 [pdf, other]
Title: FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Comments: Accepted to CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[508]  arXiv:2112.05335 [pdf, other]
Title: Uncertainty, Edge, and Reverse-Attention Guided Generative Adversarial Network for Automatic Building Detection in Remotely Sensed Images
Comments: 23 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[509]  arXiv:2112.05340 [pdf, other]
Title: Tradeoffs Between Contrastive and Supervised Learning: An Empirical Study
Comments: NeurIPS 2021 Workshop: Self-Supervised Learning - Theory and Practice
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[510]  arXiv:2112.05341 [pdf, other]
Title: Hyperdimensional Feature Fusion for Out-Of-Distribution Detection
Comments: Accepted to WACV2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[511]  arXiv:2112.05351 [pdf, other]
Title: Exploring Pixel-level Self-supervision for Weakly Supervised Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[512]  arXiv:2112.05375 [pdf, other]
Title: Rethinking the Two-Stage Framework for Grounded Situation Recognition
Comments: Accepted by AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[513]  arXiv:2112.05379 [pdf, other]
Title: Cross-Modal Transferable Adversarial Attacks from Images to Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[514]  arXiv:2112.05381 [pdf, other]
Title: UNIST: Unpaired Neural Implicit Shape Translation Network
Comments: CVPR 2022. project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[515]  arXiv:2112.05396 [pdf, other]
Title: Towards Full-to-Empty Room Generation with Structure-Aware Feature Encoding and Soft Semantic Region-Adaptive Normalization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[516]  arXiv:2112.05404 [pdf, other]
Title: The Large Labelled Logo Dataset (L3D): A Multipurpose and Hand-Labelled Continuously Growing Dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[517]  arXiv:2112.05410 [pdf, other]
Title: Multimedia Datasets for Anomaly Detection: A Review
Comments: 17 pages, 11 figures, 8 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[518]  arXiv:2112.05416 [pdf, other]
Title: Optimizing Edge Detection for Image Segmentation with Multicut Penalties
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[519]  arXiv:2112.05425 [pdf, other]
Title: Couplformer: Rethinking Vision Transformer with Coupling Attention Map
Comments: 11 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[520]  arXiv:2112.05456 [pdf, other]
Title: Monitoring and Adapting the Physical State of a Camera for Autonomous Vehicles
Comments: 17 pages, 20 figures, this https URL
Journal-ref: IEEE Transactions on Intelligent Transportation Systems (2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[521]  arXiv:2112.05485 [pdf, other]
Title: Visual Transformers with Primal Object Queries for Multi-Label Image Classification
Comments: Accepted to ICPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[522]  arXiv:2112.05488 [pdf, other]
Title: DronePose: The identification, segmentation, and orientation detection of drones via neural networks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[523]  arXiv:2112.05496 [pdf, other]
Title: Graph-based Generative Face Anonymisation with Pose Preservation
Comments: 21st International Conference on Image analysis and Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[524]  arXiv:2112.05498 [pdf, other]
Title: Sparse Depth Completion with Semantic Mesh Deformation Optimization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[525]  arXiv:2112.05504 [pdf, other]
Title: BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering
Comments: Accepted to ECCV22; Previous version: CityNeRF: Building NeRF at City Scale; Project page can be found in this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[526]  arXiv:2112.05533 [pdf, other]
Title: Error Diagnosis of Deep Monocular Depth Estimation Models
Comments: Presented at IROS'21
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[527]  arXiv:2112.05561 [pdf, other]
Title: Global Attention Mechanism: Retain Information to Enhance Channel-Spatial Interactions
Comments: 5 pages, 3 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[528]  arXiv:2112.05576 [pdf, ps, other]
Title: GPU-accelerated image alignment for object detection in industrial applications
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[529]  arXiv:2112.05585 [pdf, other]
Title: Discrete neural representations for explainable anomaly detection
Journal-ref: Winter Conference on Applications of Computer Vision 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[530]  arXiv:2112.05587 [pdf, other]
Title: Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[531]  arXiv:2112.05598 [pdf, other]
Title: PERF: Performant, Explicit Radiance Fields
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[532]  arXiv:2112.05626 [pdf, other]
Title: Seq-Masks: Bridging the gap between appearance and gait modeling for video-based person re-identification
Comments: ICASSP2021 Submission
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[533]  arXiv:2112.05637 [pdf, other]
Title: HeadNeRF: A Real-time NeRF-based Parametric Head Model
Comments: Accepted by CVPR2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[534]  arXiv:2112.05644 [pdf, other]
Title: Roominoes: Generating Novel 3D Floor Plans From Existing 3D Rooms
Comments: Symposium on Geometry Processing (SGP) 2021
Journal-ref: Computer Graphics Forum, 40: 57-69 (2021)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[535]  arXiv:2112.05646 [pdf, other]
Title: Mask-invariant Face Recognition through Template-level Knowledge Distillation
Comments: Accepted at the 16th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[536]  arXiv:2112.05667 [pdf, other]
Title: A Deep Learning Based Automated Hand Hygiene Training System
Comments: 6 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[537]  arXiv:2112.05692 [pdf, other]
Title: VUT: Versatile UI Transformer for Multi-Modal Multi-Task User Interface Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[538]  arXiv:2112.05727 [pdf, other]
Title: Neural Belief Propagation for Scene Graph Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[539]  arXiv:2112.05744 [pdf, other]
Title: More Control for Free! Image Synthesis with Semantic Diffusion Guidance
Comments: WACV 2023. Project page this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[540]  arXiv:2112.05749 [pdf, other]
Title: Label, Verify, Correct: A Simple Few Shot Object Detection Method
Comments: CVPR 2022, project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[541]  arXiv:2112.05786 [pdf, other]
Title: Guided Generative Models using Weak Supervision for Detecting Object Spatial Arrangement in Overhead Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[542]  arXiv:2112.05808 [pdf, other]
Title: Benchmarking human visual search computational models in natural scenes: models comparison and reference datasets
Authors: F. Travi (1), G. Ruarte (1), G. Bujia (1), J. E. Kamienkowski (1,2) ((1) Laboratorio de Inteligencia Artificial Aplicada, Instituto de Ciencias de la Computación, Universidad de Buenos Aires - CONICET (2) Maestría de Explotación de Datos y Descubrimiento del Conocimiento, Universidad de Buenos Aires, Argentina)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[543]  arXiv:2112.05814 [pdf, other]
Title: Deep ViT Features as Dense Visual Descriptors
Comments: Revised version - high res figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[544]  arXiv:2112.05825 [pdf, other]
Title: Revisiting Consistency Regularization for Semi-Supervised Learning
Comments: Published at GCPR2021 as a conference paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[545]  arXiv:2112.05827 [pdf, other]
Title: Quality-Aware Multimodal Biometric Recognition
Comments: IEEE Transactions on Biometrics, Behavior, and Identity Science
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[546]  arXiv:2112.05846 [pdf, other]
Title: Semantic Interaction in Augmented Reality Environments for Microsoft HoloLens
Comments: ECMR 2019, European Conference on Mobile Robots, HoloLens, 6 pages, 6 figures
Journal-ref: European Conference on Mobile Robots (ECMR), 2019
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[547]  arXiv:2112.05847 [pdf, other]
Title: A Novel Gaussian Process Based Ground Segmentation Algorithm with Local-Smoothness Estimation
Comments: arXiv admin note: substantial text overlap with arXiv:2111.10638
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[548]  arXiv:2112.05851 [pdf, other]
Title: Short and Long Range Relation Based Spatio-Temporal Transformer for Micro-Expression Recognition
Comments: 13 pages, 9 figures
Journal-ref: IEEE Transactions on Affective Computing, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[549]  arXiv:2112.05861 [pdf, other]
Title: A Discriminative Channel Diversification Network for Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[550]  arXiv:2112.05871 [pdf, other]
Title: On Adversarial Robustness of Point Cloud Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[551]  arXiv:2112.05883 [pdf, other]
Title: Self-supervised Spatiotemporal Representation Learning by Exploiting Video Continuity
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[552]  arXiv:2112.05892 [pdf, other]
Title: COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality
Comments: ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[553]  arXiv:2112.05907 [pdf, other]
Title: Smooth-Swap: A Simple Enhancement for Face-Swapping with Smoothness
Comments: CVPR 2022 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[554]  arXiv:2112.05957 [pdf, other]
Title: AvatarMe++: Facial Shape and BRDF Inference with Photorealistic Rendering-Aware GANs
Comments: Project and Dataset page: ( this https URL ). 20 pages, including supplemental materials. Accepted for publishing at IEEE Transactions on Pattern Analysis and Machine Intelligence on 13 November 2021. Copyright 2021 IEEE. Personal use of this material is permitted
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[555]  arXiv:2112.05958 [src]
Title: You Only Need End-to-End Training for Long-Tailed Recognition
Authors: Zhiwei Zhang
Comments: This is a draft
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[556]  arXiv:2112.05975 [src]
Title: CPRAL: Collaborative Panoptic-Regional Active Learning for Semantic Segmentation
Comments: This is not the final version of our paper, and we will upload a final version later
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[557]  arXiv:2112.05982 [pdf, ps, other]
Title: Overview of The MediaEval 2021 Predicting Media Memorability Task
Comments: 3 pages, to appear in Proceedings of MediaEval 2021, December 13-15 2021, Online
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[558]  arXiv:2112.05993 [pdf, other]
Title: Object Counting: You Only Need to Look at One
Comments: Keywords: Crowd counting, one-shot object counting, Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[559]  arXiv:2112.05999 [pdf, other]
Title: Curvature-guided dynamic scale networks for Multi-view Stereo
Comments: Accepted to ICLR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[560]  arXiv:2112.06011 [pdf, other]
Title: Improving the Transferability of Adversarial Examples with Resized-Diverse-Inputs, Diversity-Ensemble and Region Fitting
Comments: Accepted to ECCV2020
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[561]  arXiv:2112.06029 [pdf, other]
Title: On Automatic Data Augmentation for 3D Point Cloud Classification
Comments: BMVC 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[562]  arXiv:2112.06074 [pdf, other]
Title: Early Stopping for Deep Image Prior
Comments: Published in TMLR (this https URL)
Journal-ref: Transactions on Machine Learning Research (TMLR), 2835-8856 (12/2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[563]  arXiv:2112.06103 [pdf, other]
Title: Improving Vision Transformers for Incremental Learning
Comments: Add experiments on CIFAR-100, comparison with DER
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[564]  arXiv:2112.06104 [pdf, other]
Title: Synthetic Map Generation to Provide Unlimited Training Data for Historical Map Text Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[565]  arXiv:2112.06106 [pdf, other]
Title: Controlled-rearing studies of newborn chicks and deep neural networks
Comments: NeurIPS 2021 Workshop on Shared Visual Representations in Human & Machine Intelligence
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
[566]  arXiv:2112.06113 [pdf, other]
Title: Learning from the Tangram to Solve Mini Visual Tasks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[567]  arXiv:2112.06116 [pdf, other]
Title: Stereoscopic Universal Perturbations across Different Architectures and Datasets
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[568]  arXiv:2112.06120 [pdf, other]
Title: Sidewalk Measurements from Satellite Images: Preliminary Findings
Journal-ref: Spatial Data Science Symposium 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[569]  arXiv:2112.06121 [pdf, other]
Title: Magnifying Networks for Images with Billions of Pixels
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[570]  arXiv:2112.06133 [pdf, other]
Title: MVLayoutNet: 3D layout reconstruction with multi-view panoramas
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[571]  arXiv:2112.06147 [pdf, other]
Title: Self-Supervised Modality-Aware Multiple Granularity Pre-Training for RGB-Infrared Person Re-Identification
Comments: 13 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[572]  arXiv:2112.06150 [pdf, other]
Title: Deep Translation Prior: Test-time Training for Photorealistic Style Transfer
Comments: Accepted to AAAI 2022. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[573]  arXiv:2112.06161 [pdf, other]
Title: Semi-supervised Domain Adaptive Structure Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[574]  arXiv:2112.06170 [pdf, other]
Title: Deep network for rolling shutter rectification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[575]  arXiv:2112.06171 [pdf, other]
Title: Pixel-wise Deep Image Stitching
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[576]  arXiv:2112.06174 [pdf, other]
Title: Implicit Transformer Network for Screen Content Image Continuous Super-Resolution
Comments: 24 pages with 3 figures, NeurIPS 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[577]  arXiv:2112.06175 [pdf, other]
Title: Unsupervised Domain-Specific Deblurring using Scale-Specific Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[578]  arXiv:2112.06179 [pdf, other]
Title: BIPS: Bi-modal Indoor Panorama Synthesis via Residual Depth-aided Adversarial Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[579]  arXiv:2112.06180 [pdf, other]
Title: 360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation
Comments: IEEE RA-L 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[580]  arXiv:2112.06183 [pdf, other]
Title: Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species
Comments: Accepted by CVPR 2022; 8 pages for main paper, 6 pages for supplementary materials
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[581]  arXiv:2112.06193 [pdf, other]
Title: GUNNEL: Guided Mixup Augmentation and Multi-View Fusion for Aquatic Animal Segmentation
Comments: The code is available at this https URL . The dataset is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[582]  arXiv:2112.06197 [pdf, other]
Title: Video as Conditional Graph Hierarchy for Multi-Granular Question Answering
Comments: AAAI'22 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[583]  arXiv:2112.06238 [pdf, other]
Title: HerosNet: Hyperspectral Explicable Reconstruction and Optimal Sampling Deep Network for Snapshot Compressive Imaging
Comments: CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[584]  arXiv:2112.06242 [pdf, other]
Title: Formulating Event-based Image Reconstruction as a Linear Inverse Problem with Deep Regularization using Optical Flow
Comments: 22 pages, 26 figures, 5 tables, 6 animations when clicked on
Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 45, No. 7, pp. 8372-8389, July 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[585]  arXiv:2112.06307 [pdf, other]
Title: Image-to-Height Domain Translation for Synthetic Aperture Sonar
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[586]  arXiv:2112.06320 [pdf, other]
Title: Anomaly Crossing: New Horizons for Video Anomaly Detection as Cross-domain Few-shot Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[587]  arXiv:2112.06323 [pdf, other]
Title: Interpolated Joint Space Adversarial Training for Robust and Generalizable Defenses
Comments: Under submission
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[588]  arXiv:2112.06343 [pdf, other]
Title: Change Detection Meets Visual Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[589]  arXiv:2112.06375 [pdf, other]
Title: Embracing Single Stride 3D Object Detector with Sparse Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[590]  arXiv:2112.06379 [pdf, other]
Title: 5th Place Solution for VSPW 2021 Challenge
Comments: Presented in ICCV'21 Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[591]  arXiv:2112.06389 [pdf, other]
Title: Local and Global Point Cloud Reconstruction for 3D Hand Pose Estimation
Comments: The British Machine Vision Conference (BMVC)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[592]  arXiv:2112.06390 [pdf, other]
Title: PartGlot: Learning Shape Part Segmentation from Language Reference Games
Comments: CVPR 2022 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[593]  arXiv:2112.06392 [pdf, other]
Title: The Overlooked Classifier in Human-Object Interaction Recognition
Comments: arXiv admin note: substantial text overlap with arXiv:2107.13083
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[594]  arXiv:2112.06398 [pdf, other]
Title: Shaping Visual Representations with Attributes for Few-Shot Recognition
Comments: accepted by IEEE Signal Process. Lett
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[595]  arXiv:2112.06401 [pdf, other]
Title: Deep Attentional Guided Image Filtering
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[596]  arXiv:2112.06406 [pdf, other]
Title: Hybrid Atlas Building with Deep Registration Priors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[597]  arXiv:2112.06428 [pdf, other]
Title: Holistic Interpretation of Public Scenes Using Computer Vision and Temporal Graphs to Identify Social Distancing Violations
Comments: 23 pages, 19 figures. Gihan Jayatilaka, Jameel Hassan, and Suren Sritharan contributed equally to this work
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[598]  arXiv:2112.06433 [pdf, other]
Title: Generate Point Clouds with Multiscale Details from Graph-Represented Structures
Comments: 16 pages, 6 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[599]  arXiv:2112.06437 [pdf, other]
Title: Semi-Supervised Contrastive Learning for Remote Sensing: Identifying Ancient Urbanization in the South Central Andes
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[600]  arXiv:2112.06447 [pdf, other]
Title: SVIP: Sequence VerIfication for Procedures in Videos
Comments: Accepted by CVPR2022. For the included dataset, see this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[601]  arXiv:2112.06451 [pdf, other]
Title: Semantically Contrastive Learning for Low-light Image Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[602]  arXiv:2112.06454 [pdf, other]
Title: Split GCN: Effective Interactive Annotation for Segmentation of Disconnected Instance
Comments: 11 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[603]  arXiv:2112.06455 [pdf, other]
Title: Self-Paced Deep Regression Forests with Consideration of Ranking Fairness
Comments: The article is submitted to TNNLS, and is under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[604]  arXiv:2112.06456 [pdf, other]
Title: Real Time Action Recognition from Video Footage
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[605]  arXiv:2112.06467 [pdf, other]
Title: An Informative Tracking Benchmark
Comments: 10 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[606]  arXiv:2112.06489 [pdf, other]
Title: Multi-Modal Mutual Information Maximization: A Novel Approach for Unsupervised Deep Cross-Modal Hashing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[607]  arXiv:2112.06502 [pdf, other]
Title: DGL-GAN: Discriminator Guided Learning for GAN Compression
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[608]  arXiv:2112.06522 [pdf, other]
Title: Anatomizing Bias in Facial Analysis
Comments: Accepted in AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[609]  arXiv:2112.06530 [pdf, ps, other]
Title: Centroid-UNet: Detecting Centroids in Aerial Images
Comments: Proceedings of the 42nd Asian Conference on Remote Sensing, 2021, Can Tho city, Vietnam
Journal-ref: ACRS 42nd (2021) 100
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[610]  arXiv:2112.06533 [pdf, other]
Title: Makeup216: Logo Recognition with Adversarial Attention Representations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[611]  arXiv:2112.06536 [pdf, other]
Title: SphereSR: 360° Image Super-Resolution with Arbitrary Projection via Continuous Spherical Image Representation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[612]  arXiv:2112.06538 [pdf, other]
Title: Hybrid Graph Neural Networks for Few-Shot Learning
Comments: To appear in AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[613]  arXiv:2112.06554 [pdf, ps, other]
Title: Ensemble CNN Networks for GBM Tumors Segmentation using Multi-parametric MRI
Comments: Accepted in BraTS 2021 (as part of the BrainLes workshop proceedings distributed by Springer LNCS)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[614]  arXiv:2112.06558 [pdf, other]
Title: MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning
Journal-ref: AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[615]  arXiv:2112.06569 [pdf, other]
Title: Triangle Attack: A Query-efficient Decision-based Adversarial Attack
Comments: Accepted by ECCV 2022, code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[616]  arXiv:2112.06586 [pdf, other]
Title: Active learning with MaskAL reduces annotation effort for training Mask R-CNN
Comments: 30 pages, 10 figures, 3 tables
Journal-ref: Computers and Electronics in Agriculture, 197 (2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[617]  arXiv:2112.06592 [pdf, other]
Title: CR-FIQA: Face Image Quality Assessment by Learning Sample Relative Classifiability
Comments: Accepted at the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023 (CVPR2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[618]  arXiv:2112.06596 [pdf, other]
Title: SAC-GAN: Structure-Aware Image Composition
Comments: Accepted to TVCG. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[619]  arXiv:2112.06624 [pdf, other]
Title: Pedestrian Trajectory Prediction via Spatial Interaction Transformer Network
Authors: Tong Su, Yu Meng, Yan Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[620]  arXiv:2112.06632 [pdf, other]
Title: Lifelong Unsupervised Domain Adaptive Person Re-identification with Coordinated Anti-forgetting and Adaptation
Comments: Accepted by CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[621]  arXiv:2112.06685 [pdf, other]
Title: Quaternion-Valued Convolutional Neural Network Applied for Acute Lymphoblastic Leukemia Diagnosis
Journal-ref: A. Britto and K. Valdivia Delgado (Eds.): BRACIS 2021, LNAI 13074, pp. 280-293, 2021. Springer Nature Switzerland AG 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[622]  arXiv:2112.06701 [pdf, other]
Title: Anchor Retouching via Model Interaction for Robust Object Detection in Aerial Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[623]  arXiv:2112.06705 [pdf, other]
Title: N-SfC: Robust and Fast Shape Estimation from Caustic Images
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[624]  arXiv:2112.06714 [pdf, other]
Title: Learning Semantic-Aligned Feature Representation for Text-based Person Search
Comments: 5 pages, 3 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[625]  arXiv:2112.06730 [pdf, other]
Title: VirtualCube: An Immersive 3D Video Communication System
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Multimedia (cs.MM)
[626]  arXiv:2112.06741 [pdf, other]
Title: Long-tail Recognition via Compositional Knowledge Transfer
Comments: Accepted to CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[627]  arXiv:2112.06745 [pdf, other]
Title: A Survey of Unsupervised Domain Adaptation for Visual Recognition
Authors: Youshan Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[628]  arXiv:2112.06782 [pdf, other]
Title: GCNDepth: Self-supervised Monocular Depth Estimation based on Graph Convolutional Network
Comments: 10 pages, Submitted to IEEE transactions on intelligent transportation systems
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[629]  arXiv:2112.06809 [pdf, other]
Title: Persistent Animal Identification Leveraging Non-Visual Markers
Journal-ref: Machine Vision and Applications 34, 68 (2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Combinatorics (math.CO)
[630]  arXiv:2112.06825 [pdf, other]
Title: VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
Comments: CVPR 2022 (15 pages; with new video-text and CLIP-ViL experiments)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[631]  arXiv:2112.06853 [pdf, other]
Title: The whole and the parts: the MDL principle and the a-contrario framework
Comments: Submitted to SIAM Journal on Imaging Sciences (SIIMS)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[632]  arXiv:2112.06904 [pdf, other]
Title: HVH: Learning a Hybrid Neural Volumetric Representation for Dynamic Hair Performance Capture
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[633]  arXiv:2112.06909 [pdf, other]
Title: Hallucinating Pose-Compatible Scenes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[634]  arXiv:2112.06910 [pdf, other]
Title: DenseGAP: Graph-Structured Dense Correspondence Learning with Anchor Points
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[635]  arXiv:2112.06978 [pdf, other]
Title: Exploring Latent Dimensions of Crowd-sourced Creativity
Comments: 5th Workshop on Machine Learning for Creativity and Design (NeurIPS 2021), Sydney, Australia
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[636]  arXiv:2112.06988 [pdf, other]
Title: Event-guided Deblurring of Unknown Exposure Time Videos
Comments: Accepted in ECCV2022 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[637]  arXiv:2112.07015 [pdf, other]
Title: Multi-Expert Human Action Recognition with Hierarchical Super-Class Learning
Comments: 47 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[638]  arXiv:2112.07074 [pdf, other]
Title: Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text
Comments: preliminary work
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[639]  arXiv:2112.07082 [pdf, ps, other]
Title: DeepDiffusion: Unsupervised Learning of Retrieval-adapted Representations via Diffusion-based Ranking on Latent Feature Manifold
Comments: Accepted to the IEEE Access journal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[640]  arXiv:2112.07088 [pdf, other]
Title: ElePose: Unsupervised 3D Human Pose Estimation by Predicting Camera Elevation and Learning Normalizing Flows on 2D Poses
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[641]  arXiv:2112.07106 [pdf, other]
Title: E-CRF: Embedded Conditional Random Field for Boundary-caused Class Weights Confusion in Semantic Segmentation
Comments: Accepted by ICLR2023. Camera-ready Version
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[642]  arXiv:2112.07111 [pdf, other]
Title: EMDS-6: Environmental Microorganism Image Dataset Sixth Version for Image Denoising, Segmentation, Feature Extraction, Classification and Detection Methods Evaluation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[643]  arXiv:2112.07116 [pdf, other]
Title: Joint 3D Object Detection and Tracking Using Spatio-Temporal Representation of Camera Image and LiDAR Point Clouds
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[644]  arXiv:2112.07133 [pdf, other]
Title: CLIP-Lite: Information Efficient Visual Representation Learning with Language Supervision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[645]  arXiv:2112.07146 [pdf, other]
Title: PP-HumanSeg: Connectivity-Aware Portrait Segmentation with a Large-Scale Teleconferencing Video Dataset
Comments: Accepted by WACV workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[646]  arXiv:2112.07159 [pdf, other]
Title: Birds Eye View Social Distancing Analysis System
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[647]  arXiv:2112.07173 [pdf, other]
Title: On the use of Cortical Magnification and Saccades as Biological Proxies for Data Augmentation
Comments: 14 pages, 6 figures, 2 tables. Published in NeurIPS 2021 Workshop, Shared Visual Representations in Human & Machine Intelligence (SVRHM). For code, see this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Neurons and Cognition (q-bio.NC)
[648]  arXiv:2112.07175 [pdf, other]
Title: Co-training Transformer with Videos and Images Improves Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[649]  arXiv:2112.07200 [pdf, other]
Title: Weakly Supervised High-Fidelity Clothing Model Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[650]  arXiv:2112.07219 [pdf, other]
Title: A real-time spatiotemporal AI model analyzes skill in open surgical videos
Comments: 22 pages, 4 main text figures, 7 extended data figures, 4 extended data tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[651]  arXiv:2112.07224 [pdf, other]
Title: Exploring Category-correlated Feature for Few-shot Image Classification
Comments: 10 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[652]  arXiv:2112.07225 [pdf, other]
Title: Margin Calibration for Long-Tailed Visual Recognition
Comments: Accepted by Asian Conference on Machine Learning (ACML) 2022; 16 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[653]  arXiv:2112.07241 [pdf, other]
Title: Static-Dynamic Co-Teaching for Class-Incremental 3D Object Detection
Authors: Na Zhao, Gim Hee Lee
Comments: Accepted at AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[654]  arXiv:2112.07246 [pdf, other]
Title: Federated Learning for Face Recognition with Gradient Correction
Comments: accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[655]  arXiv:2112.07270 [pdf, other]
Title: Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering
Comments: pre-print, TNNLS, 12 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[656]  arXiv:2112.07282 [pdf, other]
Title: SNF: Filter Pruning via Searching the Proper Number of Filters
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[657]  arXiv:2112.07286 [pdf, ps, other]
Title: Levels of Autonomous Radiology
Comments: 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[658]  arXiv:2112.07289 [pdf, other]
Title: Smoothness and effective regularizations in learned embeddings for shape matching
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[659]  arXiv:2112.07315 [pdf, other]
Title: Kernel-aware Burst Blind Super-Resolution
Comments: Accepted by WACV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[660]  arXiv:2112.07334 [pdf, other]
Title: OMAD: Object Model with Articulated Deformations for Pose Estimation and Retrieval
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[661]  arXiv:2112.07338 [pdf, other]
Title: Temporal Transformer Networks with Self-Supervision for Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[662]  arXiv:2112.07374 [pdf, other]
Title: Geometry-Contrastive Transformer for Generalized 3D Pose Transfer
Comments: AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[663]  arXiv:2112.07380 [pdf, other]
Title: TRACER: Extreme Attention Guided Salient Object Tracing Network
Comments: AAAI 2022, SA poster session accepted paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[664]  arXiv:2112.07383 [pdf, other]
Title: Improving Human-Object Interaction Detection via Phrase Learning and Label Composition
Comments: Accepted to AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[665]  arXiv:2112.07395 [pdf, other]
Title: Handwritten text generation and strikethrough characters augmentation
Comments: 16 pages, 15 figures. arXiv admin note: substantial text overlap with arXiv:2108.11667
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[666]  arXiv:2112.07403 [pdf, ps, other]
Title: Stochastic Actor-Executor-Critic for Image-to-Image Translation
Journal-ref: IJCAI 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[667]  arXiv:2112.07414 [pdf, other]
Title: Marine Bubble Flow Quantification Using Wide-Baseline Stereo Photogrammetry
Comments: 56 pages, 26 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[668]  arXiv:2112.07423 [pdf, other]
Title: Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking
Comments: Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[669]  arXiv:2112.07431 [pdf, other]
Title: Uncertainty Estimation via Response Scaling for Pseudo-mask Noise Mitigation in Weakly-supervised Semantic Segmentation
Comments: Accepted at AAAI 2022. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[670]  arXiv:2112.07441 [pdf, other]
Title: An Interpretive Constrained Linear Model for ResNet and MgNet
Comments: 29 pages, 2 figures and 11 tables. arXiv admin note: text overlap with arXiv:1911.10428
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[671]  arXiv:2112.07471 [pdf, other]
Title: I M Avatar: Implicit Morphable Head Avatars from Videos
Comments: Accepted at CVPR 2022 as an oral presentation. Project page this https URL ; Github page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[672]  arXiv:2112.07513 [pdf, other]
Title: CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning
Comments: ICME 2021 (Oral); Code is publicly available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[673]  arXiv:2112.07515 [pdf, other]
Title: CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising
Comments: ACM Multimedia 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[674]  arXiv:2112.07516 [pdf, other]
Title: Transferrable Contrastive Learning for Visual Domain Adaptation
Comments: ACM Multimedia 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[675]  arXiv:2112.07517 [pdf, other]
Title: A Style and Semantic Memory Mechanism for Domain Generalization
Comments: ICCV 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[676]  arXiv:2112.07528 [pdf, other]
Title: n-CPS: Generalising Cross Pseudo Supervision to n Networks for Semi-Supervised Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[677]  arXiv:2112.07558 [pdf, other]
Title: Multi-Modal Temporal Attention Models for Crop Mapping from Satellite Time Series
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[678]  arXiv:2112.07589 [pdf, other]
Title: Mitigating Channel-wise Noise for Single Image Super Resolution
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[679]  arXiv:2112.07599 [pdf, other]
Title: Learning to Deblur and Rotate Motion-Blurred Faces
Comments: British Machine Vision Conference 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[680]  arXiv:2112.07642 [pdf, other]
Title: EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices
Comments: Camera ready version for ECCV 2022, appendix included
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[681]  arXiv:2112.07658 [pdf, other]
Title: AdaViT: Adaptive Tokens for Efficient Vision Transformer
Comments: CVPR'22 oral acceptance
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[682]  arXiv:2112.07661 [pdf, other]
Title: Approaches Toward Physical and General Video Anomaly Detection
Authors: Laura Kart, Niv Cohen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[683]  arXiv:2112.07662 [pdf, other]
Title: Out-of-Distribution Detection Without Class Labels
Comments: Accepted to ECCV L2ID Workshop (2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[684]  arXiv:2112.07664 [pdf, other]
Title: Adaptive Affinity for Associations in Multi-Target Multi-Camera Tracking
Comments: This paper appears in: IEEE Transactions on Image Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[685]  arXiv:2112.07668 [pdf, other]
Title: Dual-Key Multimodal Backdoors for Visual Question Answering
Comments: Published as conference paper at CVPR 2022. 22 pages, 11 figures, 12 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[686]  arXiv:2112.07719 [pdf, other]
Title: Decomposing the Deep: Finding Class Specific Filters in Deep CNNs
Comments: 22 pages, 5 figures, 8 tables. github repo: this https URL Preprint submitted to Elsevier. This version contains visualization of filters and ablation study w.r.t. influential features
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[687]  arXiv:2112.07787 [pdf, other]
Title: Revisiting 3D Object Detection From an Egocentric Perspective
Comments: Published in NeurIPS 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[688]  arXiv:2112.07812 [pdf, other]
Title: Structure-Aware Image Segmentation with Homotopy Warping
Authors: Xiaoling Hu
Comments: 21 pages, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
[689]  arXiv:2112.07819 [pdf, other]
Title: Weed Recognition using Deep Learning Techniques on Class-imbalanced Imagery
Comments: The paper is accepted by Crop and Pasture Science journal (this https URL)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[690]  arXiv:2112.07820 [pdf, other]
Title: Value Retrieval with Arbitrary Queries for Form-like Documents
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[691]  arXiv:2112.07835 [pdf, other]
Title: Mining Minority-class Examples With Uncertainty Estimates
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[692]  arXiv:2112.07878 [pdf, other]
Title: Gaze Estimation with Eye Region Segmentation and Self-Supervised Multistream Learning
Comments: 5 pages, 1 figure, 3 tables, Accepted in AAAI-22 Workshop on Human-Centric Self-Supervised Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[693]  arXiv:2112.07879 [pdf, other]
Title: Does a Face Mask Protect my Privacy?: Deep Learning to Predict Protected Attributes from Masked Face Images
Comments: Accepted to AJCAI 2021 - 34th Australasian Joint Conference on Artificial Intelligence, Feb 2022, Sydney, Australia. this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[694]  arXiv:2112.07895 [pdf, other]
Title: Robust Depth Completion with Uncertainty-Driven Loss Functions
Comments: accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[695]  arXiv:2112.07909 [pdf, other]
Title: Homography Decomposition Networks for Planar Object Tracking
Comments: Accepted at AAAI 2022, preprint version
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[696]  arXiv:2112.07910 [pdf, other]
Title: Decoupling Zero-Shot Semantic Segmentation
Comments: Accepted by CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[697]  arXiv:2112.07913 [pdf, other]
Title: A Comparative Analysis of Machine Learning Approaches for Automated Face Mask Detection During COVID-19
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[698]  arXiv:2112.07917 [pdf, other]
Title: SPTS: Single-Point Text Spotting
Comments: Accepted by ACM MM 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[699]  arXiv:2112.07918 [pdf, ps, other]
Title: M-FasterSeg: An Efficient Semantic Segmentation Network Based on Neural Architecture Search
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[700]  arXiv:2112.07921 [pdf, other]
Title: Temporal Shuffling for Defending Deep Action Recognition Models against Adversarial Attacks
Comments: 12 pages, accepted to Neural Networks
Journal-ref: Neural Networks, vol. 169, pp. 388-397, Jan. 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[701]  arXiv:2112.07928 [pdf, other]
Title: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification
Comments: Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[702]  arXiv:2112.07931 [pdf, ps, other]
Title: From Noise to Feature: Exploiting Intensity Distribution as a Novel Soft Biometric Trait for Finger Vein Recognition
Comments: 11 pages
Journal-ref: IEEE transactions on information forensics and security 14.4 (2018): 858-869
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[703]  arXiv:2112.07945 [pdf, other]
Title: Efficient Geometry-aware 3D Generative Adversarial Networks
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[704]  arXiv:2112.07948 [pdf, other]
Title: Transcoded Video Restoration by Temporal Spatial Auxiliary Network
Comments: Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[705]  arXiv:2112.07954 [pdf, other]
Title: Object Pursuit: Building a Space of Objects via Discriminative Weight Generation
Comments: 24 pages. This paper has been accepted by ICLR2022 (OpenReview: this https URL)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[706]  arXiv:2112.07957 [pdf, other]
Title: FEAR: Fast, Efficient, Accurate and Robust Visual Tracker
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[707]  arXiv:2112.07962 [pdf, other]
Title: A learning-based approach to feature recognition of Engineering shapes
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[708]  arXiv:2112.07963 [pdf, other]
Title: Towards General and Efficient Active Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[709]  arXiv:2112.07966 [pdf, other]
Title: Modality-Aware Triplet Hard Mining for Zero-shot Sketch-Based Image Retrieval
Comments: 13 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[710]  arXiv:2112.07969 [pdf, ps, other]
Title: Predicting Media Memorability: Comparing Visual, Textual and Auditory Features
Comments: 3 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[711]  arXiv:2112.07974 [pdf, other]
Title: Detail-aware Deep Clothing Animations Infused with Multi-source Attributes
Comments: 14 pages, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[712]  arXiv:2112.07984 [pdf, other]
Title: Temporal Action Proposal Generation with Background Constraint
Comments: Accepted by AAAI2022. arXiv admin note: text overlap with arXiv:2105.12043
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[713]  arXiv:2112.07999 [pdf, other]
Title: Self-Ensembling GAN for Cross-Domain Semantic Segmentation
Journal-ref: IEEE Trans. Multimedia, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[714]  arXiv:2112.08001 [pdf, other]
Title: Autoencoder-based background reconstruction and foreground segmentation with background noise estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[715]  arXiv:2112.08006 [pdf, other]
Title: Consistent Depth Prediction under Various Illuminations using Dilated Cross Attention
Comments: 14 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[716]  arXiv:2112.08018 [pdf, other]
Title: MissMarple : A Novel Socio-inspired Feature-transfer Learning Deep Network for Image Splicing Detection
Comments: 27 pages, 6 figures and 15 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[717]  arXiv:2112.08022 [pdf, other]
Title: Segmentation-Reconstruction-Guided Facial Image De-occlusion
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[718]  arXiv:2112.08037 [pdf, other]
Title: LookinGood^π: Real-time Person-independent Neural Re-rendering for High-quality Human Performance Capture
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[719]  arXiv:2112.08050 [pdf, other]
Title: Exploring the Asynchronous of the Frequency Spectra of GAN-generated Facial Images
Comments: International Workshop on Safety and Security of Deep Learning IJCAI, 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[720]  arXiv:2112.08070 [pdf, other]
Title: Depth Refinement for Improved Stereo Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[721]  arXiv:2112.08088 [pdf, other]
Title: Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions
Comments: AAAI 2022, Preprint version with Appendix
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[722]  arXiv:2112.08117 [pdf, other]
Title: Vision Transformer Based Video Hashing Retrieval for Tracing the Source of Fake Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[723]  arXiv:2112.08122 [pdf, other]
Title: Self-Supervised Monocular Depth and Ego-Motion Estimation in Endoscopy: Appearance Flow to the Rescue
Comments: Accepted by Medical Image Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[724]  arXiv:2112.08171 [pdf, other]
Title: Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution
Comments: Accepted to AAAI2022. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[725]  arXiv:2112.08175 [pdf, other]
Title: A Factorization Approach for Motor Imagery Classification
Comments: 4 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[726]  arXiv:2112.08177 [pdf, other]
Title: Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry
Comments: CVPR 2022 (oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[727]  arXiv:2112.08178 [pdf, ps, other]
Title: Interpretable Feature Learning Framework for Smoking Behavior Detection
Comments: 15 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[728]  arXiv:2112.08184 [pdf, other]
Title: Interactive Visualization and Representation Analysis Applied to Glacier Segmentation
Authors: Minxing Zheng (1), Xinran Miao (1), Kris Sankaran (1) ((1) Department of Statistics, University of Wisconsin - Madison)
Comments: 10 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[729]  arXiv:2112.08189 [pdf, other]
Title: ST-MTL: Spatio-Temporal Multitask Learning Model to Predict Scanpath While Tracking Instruments in Robotic Surgery
Comments: 12 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[730]  arXiv:2112.08198 [pdf, other]
Title: Single Image Automatic Radial Distortion Compensation Using Deep Convolutional Network
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[731]  arXiv:2112.08219 [pdf, other]
Title: Quantitative analysis of visual representation of sign elements in COVID-19 context
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[732]  arXiv:2112.08227 [pdf, other]
Title: An Experimental Study of the Impact of Pre-training on the Pruning of a Convolutional Neural Network
Comments: 7 pages, published at APPIS 2020
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[733]  arXiv:2112.08274 [pdf, other]
Title: Putting People in their Place: Monocular Regression of 3D People in Depth
Comments: CVPR 2022; Code this https URL ; Dataset this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[734]  arXiv:2112.08275 [pdf, other]
Title: SeqFormer: Sequential Transformer for Video Instance Segmentation
Comments: ECCV 2022, Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[735]  arXiv:2112.08281 [pdf, other]
Title: Detecting Object States vs Detecting Objects: A New Dataset and a Quantitative Experimental Study
Comments: Submitted to the Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISAPP)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[736]  arXiv:2112.08325 [pdf, other]
Title: ForgeryNet -- Face Forgery Analysis Challenge 2021: Methods and Results
Comments: Technical report. Challenge website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[737]  arXiv:2112.08345 [pdf, other]
Title: Reliable Multi-Object Tracking in the Presence of Unreliable Detections
Comments: The full journal version of this article (published in Pattern Recognition, Vol. 135) can be found at this https URL The article is open access. The source code and dataset can be found at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[738]  arXiv:2112.08359 [pdf, other]
Title: 3D Question Answering
Comments: To Appear at IEEE Transactions on Visualization and Computer Graphics (TVCG) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[739]  arXiv:2112.08447 [pdf, other]
Title: Positional Encoding Augmented GAN for the Assessment of Wind Flow for Pedestrian Comfort in Urban Areas
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[740]  arXiv:2112.08455 [pdf, other]
Title: Dense Video Captioning Using Unsupervised Semantic Information
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[741]  arXiv:2112.08459 [pdf, other]
Title: Rethinking Nearest Neighbors for Visual Classification
Comments: Modified paragraph spacing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[742]  arXiv:2112.08493 [pdf, other]
Title: StyleMC: Multi-Channel Based Fast Text-Guided Image Generation and Manipulation
Comments: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[743]  arXiv:2112.08497 [pdf, other]
Title: Predicting Levels of Household Electricity Consumption in Low-Access Settings
Comments: Accepted to be published in Proceedings of IEEE Winter Conference on Applications of Computer Vision (WACV) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[744]  arXiv:2112.08539 [pdf, other]
Title: Implicit Neural Representations for Deconvolving SAS Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[745]  arXiv:2112.08553 [pdf, other]
Title: UMAD: Universal Model Adaptation under Domain and Category Shift
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[746]  arXiv:2112.08587 [pdf, other]
Title: SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning
Comments: AAAI 2022
Journal-ref: AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[747]  arXiv:2112.08594 [pdf, other]
Title: Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal Misinformation
Comments: 11 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[748]  arXiv:2112.08598 [pdf, other]
Title: FIgLib & SmokeyNet: Dataset and Deep Learning Model for Real-Time Wildland Fire Smoke Detection
Journal-ref: Remote Sensing. 2022; 14(4):1007
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[749]  arXiv:2112.08604 [pdf, ps, other]
Title: Use Image Clustering to Facilitate Technology Assisted Review
Comments: 2021 IEEE International Conference on Big Data (Big Data)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[750]  arXiv:2112.08605 [src]
Title: Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection
Comments: for further study
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[751]  arXiv:2112.08626 [pdf, other]
Title: Analysis and Evaluation of Kinect-based Action Recognition Algorithms
Authors: Lei Wang
Comments: Master's thesis, 34 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[752]  arXiv:2112.08635 [pdf, other]
Title: Road-aware Monocular Structure from Motion and Homography Estimation
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[753]  arXiv:2112.08643 [pdf, other]
Title: TransZero++: Cross Attribute-Guided Transformer for Zero-Shot Learning
Comments: This is an extension of AAAI'22 paper (TransZero). Accepted to TPAMI. arXiv admin note: substantial text overlap with arXiv:2112.01683
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[754]  arXiv:2112.08647 [pdf, other]
Title: QAHOI: Query-Based Anchors for Human-Object Interaction Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[755]  arXiv:2112.08655 [pdf, other]
Title: Feature Distillation Interaction Weighting Network for Lightweight Image Super-Resolution
Comments: 9 pages, 9 figures, 4 tables, AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[756]  arXiv:2112.08684 [pdf, other]
Title: Mimic Embedding via Adaptive Aggregation: Learning Generalizable Person Re-identification
Comments: ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[757]  arXiv:2112.08691 [pdf, other]
Title: Towards Robust Neural Image Compression: Adversarial Attack and Model Finetuning
Authors: Tong Chen, Zhan Ma
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[758]  arXiv:2112.08692 [pdf, other]
Title: Lacuna Reconstruction: Self-supervised Pre-training for Low-Resource Historical Document Transcription
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[759]  arXiv:2112.08739 [pdf, other]
Title: Forensic Analysis of Synthetically Generated Western Blot Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[760]  arXiv:2112.08740 [pdf, other]
Title: Feature Erasing and Diffusion Network for Occluded Person Re-Identification
Comments: 10 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[761]  arXiv:2112.08743 [pdf, other]
Title: Radio-Assisted Human Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[762]  arXiv:2112.08775 [pdf, other]
Title: DProST: Dynamic Projective Spatial Transformer Network for 6D Pose Estimation
Comments: Accepted to ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[763]  arXiv:2112.08782 [pdf, ps, other]
Title: Improved YOLOv5 network for real-time multi-scale traffic sign detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[764]  arXiv:2112.08796 [pdf, other]
Title: Saliency Grafting: Innocuous Attribution-Guided Mixup with Calibrated Label Mixing
Comments: 12 pages; Accepted to AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[765]  arXiv:2112.08810 [pdf, other]
Title: Pure Noise to the Rescue of Insufficient Data: Improving Imbalanced Classification by Training on Random Noise Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[766]  arXiv:2112.08814 [pdf, other]
Title: An Unsupervised Way to Understand Artifact Generating Internal Units in Generative Neural Networks
Comments: AAAI22 accepted paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[767]  arXiv:2112.08816 [pdf, other]
Title: Deep Hash Distillation for Image Retrieval
Comments: ECCV2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[768]  arXiv:2112.08817 [pdf, other]
Title: Search for temporal cell segmentation robustness in phase-contrast microscopy videos
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP); Quantitative Methods (q-bio.QM)
[769]  arXiv:2112.08835 [pdf, other]
Title: Self-supervised Enhancement of Latent Discovery in GANs
Comments: Accepted to the 36th AAAI Conference on Artificial Intelligence (AAAI 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[770]  arXiv:2112.08841 [pdf, other]
Title: A CNN based method for Sub-pixel Urban Land Cover Classification using Landsat-5 TM and Resourcesat-1 LISS-IV Imagery
Comments: 29 pages, 14 figures (including appendix), 8 tables (including appendix)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[771]  arXiv:2112.08867 [pdf, other]
Title: GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation
Comments: CVPR2022 Oral. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[772]  arXiv:2112.08879 [pdf, other]
Title: Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds
Comments: First two authors contributed equally | ECCV 2022 Camera Ready
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[773]  arXiv:2112.08902 [pdf, other]
Title: Toward Minimal Misalignment at Minimal Cost in One-Stage and Anchor-Free Object Detection
Comments: The paper is under consideration at Pattern Recognition Letters
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[774]  arXiv:2112.08906 [pdf, other]
Title: On the Uncertain Single-View Depths in Colonoscopies
Comments: 11 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[775]  arXiv:2112.08913 [pdf, other]
Title: Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation
Comments: Accepted by AAAI 2022, Preprint version with Appendix
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[776]  arXiv:2112.08930 [pdf, other]
Title: Intelli-Paint: Towards Developing Human-like Painting Agents
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Machine Learning (stat.ML)
[777]  arXiv:2112.08935 [pdf, other]
Title: MVSS-Net: Multi-View Multi-Scale Supervised Networks for Image Manipulation Detection
Comments: arXiv admin note: substantial text overlap with arXiv:2104.06832. Accepted by T-PAMI
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[778]  arXiv:2112.08949 [pdf, other]
Title: Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[779]  arXiv:2112.08950 [pdf, other]
Title: Stable Long-Term Recurrent Video Super-Resolution
Comments: 9 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[780]  arXiv:2112.08996 [pdf, other]
Title: Activation Modulation and Recalibration Scheme for Weakly Supervised Semantic Segmentation
Comments: Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[781]  arXiv:2112.09043 [pdf, other]
Title: Neural Style Transfer and Unpaired Image-to-Image Translation to deal with the Domain Shift Problem on Spheroid Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[782]  arXiv:2112.09045 [pdf, other]
Title: The MVTec 3D-AD Dataset for Unsupervised 3D Anomaly Detection and Localization
Comments: Accepted for presentation at VISAPP 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[783]  arXiv:2112.09061 [pdf, other]
Title: Solving Inverse Problems with NerfGANs
Comments: 16 pages, 18 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[784]  arXiv:2112.09069 [pdf, other]
Title: Progressive Graph Convolution Network for EEG Emotion Recognition
Comments: 11 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Signal Processing (eess.SP)
[785]  arXiv:2112.09081 [pdf, other]
Title: CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data
Comments: CVPR 2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[786]  arXiv:2112.09106 [pdf, other]
Title: RegionCLIP: Region-based Language-Image Pretraining
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[787]  arXiv:2112.09120 [pdf, other]
Title: Human Hands as Probes for Interactive Object Understanding
Comments: To Appear at CVPR 2022. Project website at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[788]  arXiv:2112.09126 [pdf, other]
Title: IS-COUNT: Large-scale Object Counting from Satellite Images with Covariate-based Importance Sampling
Comments: AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[789]  arXiv:2112.09127 [pdf, other]
Title: ICON: Implicit Clothed humans Obtained from Normals
Comments: Project page: this https URL Accepted by CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[790]  arXiv:2112.09129 [pdf, other]
Title: Decoupling and Recoupling Spatiotemporal Representation for RGB-D-based Motion Recognition
Comments: open sourced; codes and models are available: this https URL; transformer-based method
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[791]  arXiv:2112.09130 [pdf, other]
Title: Ensembling Off-the-shelf Models for GAN Training
Comments: CVPR 2022 (Oral). GitHub: this https URL Project webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[792]  arXiv:2112.09131 [pdf, other]
Title: HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[793]  arXiv:2112.09133 [pdf, other]
Title: Masked Feature Prediction for Self-Supervised Visual Pre-Training
Comments: Technical report. arXiv v2: update AVA results (details in Appendix E)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[794]  arXiv:2112.09151 [pdf, other]
Title: TAFIM: Targeted Adversarial Attacks against Facial Image Manipulations
Comments: (ECCV 2022 Paper) Video: this https URL Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[795]  arXiv:2112.09165 [pdf, other]
Title: ALEBk: Feasibility Study of Attention Level Estimation via Blink Detection applied to e-Learning
Comments: Preprint of the paper presented to the Workshop on Artificial Intelligence for Education (AI4EDU) of AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[796]  arXiv:2112.09172 [pdf, ps, other]
Title: An Audio-Visual Dataset and Deep Learning Frameworks for Crowded Scene Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[797]  arXiv:2112.09190 [pdf, other]
Title: Monitoring crop phenology with street-level imagery using computer vision
Comments: 18 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[798]  arXiv:2112.09195 [pdf, other]
Title: Mitigating the Bias of Centered Objects in Common Datasets
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[799]  arXiv:2112.09201 [pdf, other]
Title: Semantic-Based Few-Shot Learning by Interactive Psychometric Testing
Comments: Accepted at the AAAI-22 Workshop on Interactive Machine Learning (IML@AAAI'22)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[800]  arXiv:2112.09205 [pdf, other]
Title: AFDetV2: Rethinking the Necessity of the Second Stage for Object Detection from Point Clouds
Comments: AAAI 2022; 1st Place Solution for the Real-time 3D Detection and the Most Efficient Model of the Waymo Open Dataset Challenges 2021 (this http URL)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[801]  arXiv:2112.09214 [pdf, other]
Title: Sparse Coding with Multi-Layer Decoders using Variance Regularization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[802]  arXiv:2112.09219 [pdf, other]
Title: All You Need is RAW: Defending Against Adversarial Attacks with Camera Image Pipelines
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[803]  arXiv:2112.09220 [pdf, other]
Title: Sim2Real Docs: Domain Randomization for Documents in Natural Scenes using Ray-traced Rendering
Comments: Accepted to NeurIPS 2021 Data Centric AI (DCAI) Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[804]  arXiv:2112.09251 [pdf, other]
Title: The Wanderings of Odysseus in 3D Scenes
Authors: Yan Zhang, Siyu Tang
Comments: CVPR 2022 camera ready
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[805]  arXiv:2112.09253 [pdf, other]
Title: Logically at Factify 2022: Multimodal Fact Verification
Comments: Accepted in AAAI'22: First Workshop on Multimodal Fact-Checking and Hate Speech Detection, February 22 - March 1, 2022, Vancouver, BC, Canada
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[806]  arXiv:2112.09260 [pdf, other]
Title: How to augment your ViTs? Consistency loss and StyleAug, a random style transfer augmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[807]  arXiv:2112.09262 [pdf, ps, other]
Title: Image Inpainting Using AutoEncoder and Guided Selection of Predicted Pixels
Comments: 5 pages, 2 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[808]  arXiv:2112.09278 [pdf, other]
Title: All-photon Polarimetric Time-of-Flight Imaging
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[809]  arXiv:2112.09290 [pdf, other]
Title: PeopleSansPeople: A Synthetic Data Generator for Human-Centric Computer Vision
Comments: PeopleSansPeople template Unity environment, benchmark binaries, and source code are available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Databases (cs.DB); Graphics (cs.GR); Machine Learning (cs.LG)
[810]  arXiv:2112.09298 [pdf, other]
Title: Human-vehicle Cooperative Visual Perception for Autonomous Driving under Complex Road and Traffic Scenarios
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[811]  arXiv:2112.09300 [pdf, other]
Title: Towards End-to-End Image Compression and Analysis with Transformers
Comments: Accepted by AAAI 2022; Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[812]  arXiv:2112.09318 [pdf, other]
Title: Procedural Kernel Networks
Comments: 11 pages, technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[813]  arXiv:2112.09326 [pdf, other]
Title: Cinderella's shoe won't fit Soundarya: An audit of facial processing tools on Indian faces
Comments: 17 pages, 2 figures and 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[814]  arXiv:2112.09329 [pdf, other]
Title: Point2Cyl: Reverse Engineering 3D Objects from Point Clouds to Extrusion Cylinders
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[815]  arXiv:2112.09331 [pdf, other]
Title: Contrastive Vision-Language Pre-training with Limited Resources
Comments: Accepted to ECCV2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[816]  arXiv:2112.09343 [pdf, other]
Title: Domain Adaptation on Point Clouds via Geometry-Aware Implicits
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[817]  arXiv:2112.09356 [pdf, other]
Title: UniMiSS: Universal Medical Self-Supervised Learning via Breaking Dimensionality Barrier
Comments: Accepted by ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[818]  arXiv:2112.09357 [pdf, other]
Title: Interpreting Audiograms with Multi-stage Neural Networks
Comments: 12 pages, 12 figures. The code for this project is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[819]  arXiv:2112.09367 [pdf, other]
Title: SuperStyleNet: Deep Image Synthesis with Superpixel Based Style Encoder
Comments: Accepted to BMVC 2021. Codes are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[820]  arXiv:2112.09379 [pdf, ps, other]
Title: Enhanced Frame and Event-Based Simulator and Event-Based Video Interpolation Network
Comments: 10 pages, 19 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[821]  arXiv:2112.09385 [pdf, other]
Title: Full Transformer Framework for Robust Point Cloud Registration with Deep Information Interaction
Comments: 10 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[822]  arXiv:2112.09413 [pdf, other]
Title: Self-attention based anchor proposal for skeleton-based action recognition
Authors: Ruijie Hou, Zhao Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[823]  arXiv:2112.09414 [pdf, other]
Title: Disentangled representations: towards interpretation of sex determination from hip bone
Journal-ref: The Visual Computer (2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[824]  arXiv:2112.09422 [pdf, other]
Title: A Review on Visual Privacy Preservation Techniques for Active and Assisted Living
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[825]  arXiv:2112.09426 [pdf, other]
Title: SiamTrans: Zero-Shot Multi-Frame Image Restoration with Pre-Trained Siamese Transformers
Journal-ref: AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[826]  arXiv:2112.09428 [src]
Title: Dynamics-aware Adversarial Attack of 3D Sparse Convolution Network
Comments: We have improved the quality of this work and updated a new version to address the limitations of the proposed method
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[827]  arXiv:2112.09442 [pdf, ps, other]
Title: Adaptively Customizing Activation Functions for Various Layers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[828]  arXiv:2112.09445 [pdf, other]
Title: Data Efficient Language-supervised Zero-shot Recognition with Optimal Transport Distillation
Comments: 19 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[829]  arXiv:2112.09448 [pdf, other]
Title: Distillation of Human-Object Interaction Contexts for Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[830]  arXiv:2112.09459 [pdf, other]
Title: Weakly Supervised Semantic Segmentation via Alternative Self-Dual Teaching
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[831]  arXiv:2112.09490 [pdf, other]
Title: Visual Microfossil Identification via Deep Metric Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[832]  arXiv:2112.09493 [pdf, other]
Title: Methods for segmenting cracks in 3d images of concrete: A comparison based on semi-synthetic images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[833]  arXiv:2112.09515 [pdf, other]
Title: Symmetry-aware Neural Architecture for Embodied Visual Navigation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[834]  arXiv:2112.09532 [pdf, other]
Title: Pixel Distillation: A New Knowledge Distillation Scheme for Low-Resolution Image Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[835]  arXiv:2112.09546 [pdf, other]
Title: Complex Functional Maps : a Conformal Link Between Tangent Bundles
Authors: Nicolas Donati (LIX), Etienne Corman (LORIA, CNRS, PIXEL), Simone Melzi (Sapienza University of Rome), Maks Ovsjanikov (LIX)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Differential Geometry (math.DG)
[836]  arXiv:2112.09568 [pdf, other]
Title: Nearest neighbor search with compact codes: A decoder perspective
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[837]  arXiv:2112.09569 [pdf, other]
Title: CPPE-5: Medical Personal Protective Equipment Dataset
Comments: 18 pages, 6 tables, 6 figures. Code and models are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[838]  arXiv:2112.09581 [pdf, other]
Title: Watermarking Images in Self-Supervised Latent Spaces
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[839]  arXiv:2112.09583 [pdf, other]
Title: Align and Prompt: Video-and-Language Pre-training with Entity Prompts
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[840]  arXiv:2112.09591 [pdf, other]
Title: Global explainability in aligned image modalities
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[841]  arXiv:2112.09598 [pdf, other]
Title: Towards Deep Learning-based 6D Bin Pose Estimation in 3D Scans
Comments: Accepted VISAPP 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[842]  arXiv:2112.09645 [pdf, other]
Title: Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation
Comments: 13 pages, 4 figures, 7 tables. This article is under review at a Journal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[843]  arXiv:2112.09647 [pdf, other]
Title: Video-Based Reconstruction of the Trajectories Performed by Skiers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[844]  arXiv:2112.09648 [pdf, other]
Title: Improving neural implicit surfaces geometry with patch warping
Comments: Accepted at CVPR2022. Project webpage: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[845]  arXiv:2112.09653 [pdf, other]
Title: Information-theoretic stochastic contrastive conditional GAN: InfoSCC-GAN
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[846]  arXiv:2112.09660 [pdf, other]
Title: AI-Assisted Verification of Biometric Data Collection
Authors: Ryan Lindsey
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[847]  arXiv:2112.09664 [pdf, other]
Title: Towards More Effective PRM-based Crowd Counting via A Multi-resolution Fusion and Attention Network
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[848]  arXiv:2112.09685 [pdf, other]
Title: Neuromorphic Camera Denoising using Graph Neural Network-driven Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[849]  arXiv:2112.09686 [pdf, other]
Title: Efficient Visual Tracking with Exemplar Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[850]  arXiv:2112.09687 [pdf, other]
Title: Light Field Neural Rendering
Comments: Project page with code and videos at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[851]  arXiv:2112.09690 [pdf, other]
Title: Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition
Comments: CVPR 2022 camera-ready, Project webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[852]  arXiv:2112.09747 [pdf, other]
Title: A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation
Comments: ECCV 2022 accepted
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[853]  arXiv:2112.09775 [pdf, other]
Title: Adaptive Subsampling for ROI-based Visual Tracking: Algorithms and FPGA Implementation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR)
[854]  arXiv:2112.09786 [pdf, other]
Title: Distill and De-bias: Mitigating Bias in Face Verification using Knowledge Distillation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[855]  arXiv:2112.09791 [pdf, other]
Title: Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks
Comments: ICCV 2021. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[856]  arXiv:2112.09809 [pdf, other]
Title: A Streaming Volumetric Image Generation Framework for Development and Evaluation of Out-of-Core Methods
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[857]  arXiv:2112.09828 [pdf, other]
Title: Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs
Comments: WACV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[858]  arXiv:2112.09833 [pdf, other]
Title: Face Deblurring Based on Separable Normalization and Adaptive Denormalization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[859]  arXiv:2112.09839 [pdf, other]
Title: Calorie Aware Automatic Meal Kit Generation from an Image
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[860]  arXiv:2112.09844 [pdf, other]
Title: Enhanced Object Detection in Floor-plan through Super Resolution
Comments: 3rd International Conference on Machine Learning, Image Processing, Network Security and Data Sciences
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[861]  arXiv:2112.09852 [pdf, other]
Title: LegoDNN: Block-grained Scaling of Deep Neural Networks for Mobile Vision
Comments: 13 pages, 15 figures
Journal-ref: In MobiCom'21, pages 406-419, 2021. ACM
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[862]  arXiv:2112.09854 [pdf, other]
Title: Space Non-cooperative Object Active Tracking with Deep Reinforcement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[863]  arXiv:2112.09873 [pdf, ps, other]
Title: An effective coaxiality measurement for twist drill based on line structured light sensor
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. 13 pages, 22 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[864]  arXiv:2112.09875 [pdf, other]
Title: Adversarial Memory Networks for Action Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[865]  arXiv:2112.09898 [pdf, other]
Title: Does Explainable Machine Learning Uncover the Black Box in Vision Applications?
Authors: Manish Narwaria
Comments: Image and Vision Computing, Volume 118, 2022, 104353, ISSN 0262-8856, this https URL
Journal-ref: Image and Vision Computing, Volume 118, 2022, 104353, ISSN 0262-8856
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[866]  arXiv:2112.09902 [pdf, other]
Title: 3D Instance Segmentation of MVS Buildings
Comments: 14 figures, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[867]  arXiv:2112.09908 [pdf, other]
Title: Anomaly Discovery in Semantic Segmentation via Distillation Comparison Networks
Comments: 9 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[868]  arXiv:2112.09922 [pdf, other]
Title: Fast and Robust Registration of Partially Overlapping Point Clouds
Comments: Accepted at IEEE Robotics and Automation Letters (RA-L). 8 pages, 6 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[869]  arXiv:2112.09938 [pdf, other]
Title: DeepUME: Learning the Universal Manifold Embedding for Robust Point Cloud Registration
Comments: BMVC 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[870]  arXiv:2112.09951 [pdf, ps, other]
Title: Rapid Face Mask Detection and Person Identification Model based on Deep Neural Networks
Authors: Abdullah Ahmad Khan (1), Mohd. Belal (2), GhufranUllah (3) ((1,2 and 3) Aligarh Muslim University)
Comments: 12 pages, 15 figures, International Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[871]  arXiv:2112.09965 [pdf, other]
Title: Pre-Training Transformers for Domain Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[872]  arXiv:2112.09976 [pdf, other]
Title: Tell me what you see: A zero-shot action recognition method based on natural language descriptions
Comments: Published at Multimedia Tools and Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[873]  arXiv:2112.10003 [pdf, other]
Title: Image Segmentation Using Text and Image Prompts
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[874]  arXiv:2112.10047 [pdf, other]
Title: Controlling the Quality of Distillation in Response-Based Network Compression
Comments: AAAI22-Workshop: 1st International Workshop on Practical Deep Learning in the Wild
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[875]  arXiv:2112.10057 [pdf, other]
Title: Precondition and Effect Reasoning for Action Recognition
Comments: The paper is under consideration at Computer Vision and Image Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[876]  arXiv:2112.10063 [pdf, other]
Title: Deep Graph-level Anomaly Detection by Glocal Knowledge Distillation
Comments: Accepted to WSDM 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[877]  arXiv:2112.10066 [pdf, other]
Title: LocFormer: Enabling Transformers to Perform Temporal Moment Localization on Long Untrimmed Videos With a Feature Sampling Approach
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[878]  arXiv:2112.10082 [pdf, other]
Title: MoCaNet: Motion Retargeting in-the-wild via Canonicalization Networks
Comments: Accepted by AAAI 2022. The first two authors contributed equally. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[879]  arXiv:2112.10087 [pdf, other]
Title: Reasoning Structural Relation for Occlusion-Robust Facial Landmark Localization
Comments: Accepted by Pattern Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[880]  arXiv:2112.10089 [pdf, other]
Title: Camera-aware Style Separation and Contrastive Learning for Unsupervised Person Re-identification
Comments: 6 pages, 4 figures, 2 tables
Journal-ref: 2022 IEEE International Conference on Multimedia and Expo (ICME)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[881]  arXiv:2112.10098 [pdf, other]
Title: Initiative Defense against Facial Manipulation
Comments: Accepted at AAAI 2021
Journal-ref: Proceedings of the AAAI Conference on Artificial Intelligence, 35(2), 1619-1627, 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[882]  arXiv:2112.10101 [pdf, ps, other]
Title: ArcFace Knows the Gender, Too!
Authors: Majid Farzaneh
Comments: 9 pages, 4 images, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[883]  arXiv:2112.10103 [pdf, other]
Title: SAGA: Stochastic Whole-Body Grasping with Contact
Comments: Accepted by ECCV 2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[884]  arXiv:2112.10149 [pdf, other]
Title: Elastic-Link for Binarized Neural Network
Comments: AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[885]  arXiv:2112.10155 [pdf, other]
Title: Topology Preserving Local Road Network Estimation from Single Onboard Camera Image
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[886]  arXiv:2112.10167 [pdf, other]
Title: Improving Face-Based Age Estimation with Attention-Based Dynamic Patch Fusion
Comments: IEEE Transactions on Image Processing (accepted for publication)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[887]  arXiv:2112.10175 [pdf, other]
Title: On Efficient Transformer-Based Image Pre-training for Low-Level Vision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[888]  arXiv:2112.10194 [pdf, other]
Title: UnweaveNet: Unweaving Activity Stories
Comments: Accepted at IEEE/CVF Computer Vision and Pattern Recognition (CVPR) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[889]  arXiv:2112.10196 [pdf, other]
Title: End-to-End Learning of Multi-category 3D Pose and Shape Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[890]  arXiv:2112.10203 [pdf, other]
Title: HVTR: Hybrid Volumetric-Textural Rendering for Human Avatars
Comments: Accepted to 3DV 2022. See more results at this https URL Demo: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[891]  arXiv:2112.10258 [pdf, other]
Title: GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[892]  arXiv:2112.10271 [pdf, other]
Title: Wiener Guided DIP for Unsupervised Blind Image Deconvolution
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[893]  arXiv:2112.10275 [pdf, other]
Title: Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[894]  arXiv:2112.10298 [pdf, other]
Title: Driver Drowsiness Detection Using Ensemble Convolutional Neural Networks on YawDD
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[895]  arXiv:2112.10305 [pdf, ps, other]
Title: Model-based gait recognition using graph network on very large population database
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[896]  arXiv:2112.10310 [pdf, other]
Title: Contrastive Attention Network with Dense Field Estimation for Face Completion
Comments: Accepted by Pattern Recognition 2021. arXiv admin note: substantial text overlap with arXiv:2010.15643
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[897]  arXiv:2112.10324 [pdf, other]
Title: Product Re-identification System in Fully Automated Defect Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[898]  arXiv:2112.10365 [pdf, other]
Title: DMS-GCN: Dynamic Mutiscale Spatiotemporal Graph Convolutional Networks for Human Motion Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[899]  arXiv:2112.10390 [pdf, ps, other]
Title: Evaluation and Comparison of Deep Learning Methods for Pavement Crack Identification with Visual Images
Authors: Kai-Liang Lu
Comments: This work will be submitted for possible publication. It is a further study from 2012.14704v2
Journal-ref: Frontiers in Artificial Intelligence and Applications (CECNet2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[900]  arXiv:2112.10415 [pdf, other]
Title: UFPMP-Det: Toward Accurate and Efficient Object Detection on Drone Imagery
Comments: 8 pages, 6 figures, Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[901]  arXiv:2112.10453 [pdf, other]
Title: Learning with Label Noise for Image Retrieval by Selecting Interactions
Comments: Accepted at WACV 2022. 13 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[902]  arXiv:2112.10457 [pdf, other]
Title: Image Animation with Keypoint Mask
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[903]  arXiv:2112.10474 [pdf, other]
Title: Reciprocal Normalization for Domain Adaptation
Comments: The best feature normalization module for domain adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[904]  arXiv:2112.10481 [pdf, ps, other]
Title: a novel attention-based network for fast salient object detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[905]  arXiv:2112.10482 [pdf, other]
Title: ScanQA: 3D Question Answering for Spatial Scene Understanding
Comments: CVPR2022. The first three authors contributed equally. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[906]  arXiv:2112.10483 [pdf, other]
Title: Fusion and Orthogonal Projection for Improved Face-Voice Association
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[907]  arXiv:2112.10485 [pdf, other]
Title: Scale-Net: Learning to Reduce Scale Differences for Large-Scale Invariant Image Matching
Authors: Yujie Fu, Yihong Wu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[908]  arXiv:2112.10531 [pdf, ps, other]
Title: Object Recognition as Classification via Visual Properties
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[909]  arXiv:2112.10570 [pdf, other]
Title: Dynamic Hypergraph Convolutional Networks for Skeleton-Based Action Recognition
Comments: 12 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[910]  arXiv:2112.10587 [pdf, other]
Title: Image-free multi-character recognition
Comments: 17 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[911]  arXiv:2112.10591 [pdf, other]
Title: Real-Time Optical Flow for Vehicular Perception with Low- and High-Resolution Event Cameras
Comments: 13 pages, journal paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[912]  arXiv:2112.10600 [pdf, other]
Title: DeePaste -- Inpainting for Pasting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[913]  arXiv:2112.10624 [pdf, other]
Title: Learning to integrate vision data into road network data
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[914]  arXiv:2112.10646 [pdf, other]
Title: Raw High-Definition Radar for Multi-Task Learning
Comments: 12 pages, 7 figures, 6 tables
Journal-ref: CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[915]  arXiv:2112.10683 [pdf, other]
Title: SelFSR: Self-Conditioned Face Super-Resolution in the Wild via Flow Field Degradation Network
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[916]  arXiv:2112.10703 [pdf, other]
Title: Mega-NeRF: Scalable Construction of Large-Scale NeRFs for Virtual Fly-Throughs
Comments: CVPR 2022 Project page: this https URL GitHub: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[917]  arXiv:2112.10716 [pdf, other]
Title: BAPose: Bottom-Up Pose Estimation with Disentangled Waterfall Representations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[918]  arXiv:2112.10727 [pdf, other]
Title: Learning Physics Properties of Fabrics and Garments with a Physics Similarity Neural Network
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[919]  arXiv:2112.10740 [pdf, other]
Title: Are Large-scale Datasets Necessary for Self-Supervised Pre-training?
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[920]  arXiv:2112.10741 [pdf, other]
Title: GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Comments: 20 pages, 18 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[921]  arXiv:2112.10752 [pdf, other]
Title: High-Resolution Image Synthesis with Latent Diffusion Models
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[922]  arXiv:2112.10759 [pdf, other]
Title: 3D-aware Image Synthesis via Learning Structural and Textural Representations
Comments: CVPR 2022 camera-ready, Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[923]  arXiv:2112.10762 [pdf, other]
Title: StyleSwin: Transformer-based GAN for High-resolution Image Generation
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[924]  arXiv:2112.10764 [pdf, other]
Title: Mask2Former for Video Instance Segmentation
Comments: Code and models: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[925]  arXiv:2112.10809 [pdf, other]
Title: Lite Vision Transformer with Enhanced Self-Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[926]  arXiv:2112.10838 [pdf, other]
Title: One Sketch for All: One-Shot Personalized Sketch Segmentation
Comments: IEEE Transactions on Image Processing, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[927]  arXiv:2112.10844 [pdf, other]
Title: Encoding Hierarchical Information in Neural Networks helps in Subpopulation Shift
Comments: 15 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[928]  arXiv:2112.10871 [pdf, other]
Title: Translational Concept Embedding for Generalized Compositional Zero-shot Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[929]  arXiv:2112.10909 [pdf, ps, other]
Title: Spatiotemporal Motion Synchronization for Snowboard Big Air
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[930]  arXiv:2112.10936 [pdf, other]
Title: Watch Those Words: Video Falsification Detection Using Word-Conditioned Facial Motion
Comments: Accepted in WACV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Multimedia (cs.MM)
[931]  arXiv:2112.10941 [pdf, other]
Title: Structured Semantic Transfer for Multi-Label Recognition with Partial Labels
Comments: Accepted by AAAI'22
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[932]  arXiv:2112.10945 [pdf, other]
Title: Pixel-Stega: Generative Image Steganography Based on Autoregressive Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[933]  arXiv:2112.10948 [pdf, ps, other]
Title: Task-Oriented Image Transmission for Scene Classification in Unmanned Aerial Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[934]  arXiv:2112.10960 [pdf, other]
Title: Continuous-Time Video Generation via Learning Motion Dynamics with Neural ODE
Comments: 24 pages; Accepted to BMVC 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[935]  arXiv:2112.10963 [pdf, other]
Title: DRPN: Making CNN Dynamically Handle Scale Variation
Journal-ref: Digit. Signal Process, 133 (2023), pp. 103844
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[936]  arXiv:2112.10969 [pdf, other]
Title: Generalizing Interactive Backpropagating Refinement for Dense Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[937]  arXiv:2112.10977 [pdf, other]
Title: ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action Localization
Comments: Accepted to AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[938]  arXiv:2112.10982 [pdf, other]
Title: Generalized Few-Shot Semantic Segmentation: All You Need is Fine-Tuning
Comments: Includes supplementary materials
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[939]  arXiv:2112.10988 [pdf, other]
Title: Mapping industrial poultry operations at scale with deep learning and aerial imagery
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[940]  arXiv:2112.10992 [pdf, other]
Title: Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[941]  arXiv:2112.11004 [pdf, other]
Title: Point spread function estimation for blind image deblurring problems based on framelet transform
Authors: Reza Parvaz
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Image and Video Processing (eess.IV); Optimization and Control (math.OC)
[942]  arXiv:2112.11010 [pdf, other]
Title: MPViT: Multi-Path Vision Transformer for Dense Prediction
Comments: technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[943]  arXiv:2112.11014 [pdf, other]
Title: fMRI Neurofeedback Learning Patterns are Predictive of Personal and Clinical Traits
Journal-ref: MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[944]  arXiv:2112.11037 [pdf, other]
Title: SOIT: Segmenting Objects with Instance-Aware Transformers
Comments: AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[945]  arXiv:2112.11081 [pdf, other]
Title: RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality
Comments: Accepted by CVPR-2022. This is the latest version
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[946]  arXiv:2112.11085 [pdf, other]
Title: Can We Use Neural Regularization to Solve Depth Super-Resolution?
Comments: 9 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[947]  arXiv:2112.11088 [pdf, other]
Title: EPNet++: Cascade Bi-directional Fusion for Multi-Modal 3D Object Detection
Comments: Accepted by TPAMI-2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[948]  arXiv:2112.11121 [pdf, other]
Title: GlobalMatch: Registration of Forest Terrestrial Point Clouds by Global Matching of Relative Stem Positions
Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing. Vol. 197, 71-86, 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[949]  arXiv:2112.11124 [pdf, other]
Title: Learning Human Motion Prediction via Stochastic Differential Equations
Comments: 9 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[950]  arXiv:2112.11133 [pdf, other]
Title: Cloud Sphere: A 3D Shape Representation via Progressive Deformation
Comments: This paper was submitted first in CVPR 2021 (paper id: 2255), and then was submitted in CVM 2022 (id: 160)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[951]  arXiv:2112.11153 [pdf, other]
Title: PONet: Robust 3D Human Pose Estimation via Learning Orientations Only
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[952]  arXiv:2112.11177 [pdf, other]
Title: Generalizable Cross-modality Medical Image Segmentation via Style Augmentation and Dual Normalization
Comments: Accepted by CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[953]  arXiv:2112.11224 [pdf, other]
Title: Attention-Based Sensor Fusion for Human Activity Recognition Using IMU Signals
Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[954]  arXiv:2112.11235 [pdf, other]
Title: Improving Robustness with Image Filtering
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[955]  arXiv:2112.11242 [pdf, other]
Title: Unsupervised deep learning techniques for powdery mildew recognition based on multispectral imaging
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[956]  arXiv:2112.11243 [src]
Title: Projected Sliced Wasserstein Autoencoder-based Hyperspectral Images Anomaly Detection
Comments: I need to revise this paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[957]  arXiv:2112.11244 [pdf, other]
Title: Hateful Memes Challenge: An Enhanced Multimodal Framework
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[958]  arXiv:2112.11245 [pdf, other]
Title: Generating Photo-realistic Images from LiDAR Point Clouds with Generative Adversarial Networks
Comments: 11 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[959]  arXiv:2112.11246 [pdf, ps, other]
Title: Image quality enhancement of embedded holograms in holographic information hiding using deep neural networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[960]  arXiv:2112.11258 [pdf, other]
Title: PointCaps: Raw Point Cloud Processing using Capsule Networks with Euclidean Distance Routing
Comments: Accepted to be published in Journal of Visual Communication and Image Representation (Elsevier), 16 Pages, 4 Figures, 5 Tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[961]  arXiv:2112.11271 [pdf, other]
Title: High-Fidelity Point Cloud Completion with Low-Resolution Recovery and Noise-Aware Upsampling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[962]  arXiv:2112.11290 [pdf, other]
Title: Review of Face Presentation Attack Detection Competitions
Comments: Handbook of Biometric Anti-Spoofing (3rd Ed.)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[963]  arXiv:2112.11325 [pdf, other]
Title: iSegFormer: Interactive Segmentation via Transformers with Application to 3D Knee MR Images
Comments: MICCAI'22 camera-ready
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[964]  arXiv:2112.11329 [pdf, other]
Title: Multispectral image fusion based on super pixel segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[965]  arXiv:2112.11335 [pdf, other]
Title: Deep Learning Based 3D Point Cloud Regression for Estimating Forest Biomass
Comments: 31 pages, 14 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[966]  arXiv:2112.11340 [pdf, other]
Title: Transferable End-to-end Room Layout Estimation via Implicit Encoding
Comments: Project: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[967]  arXiv:2112.11347 [pdf, other]
Title: Watch It Move: Unsupervised Discovery of 3D Joints for Re-Posing of Articulated Objects
Comments: CVPR2022, 16 pages, Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[968]  arXiv:2112.11366 [pdf, other]
Title: Contrastive Object Detection Using Knowledge Graph Embeddings
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[969]  arXiv:2112.11377 [pdf, other]
Title: Shape from Polarization for Complex Scenes in the Wild
Comments: Accepted to CVPR 2022; Github link: this https URL; Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[970]  arXiv:2112.11384 [pdf, other]
Title: Sports Video: Fine-Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2021
Authors: Pierre-Etienne Martin (LaBRI, MPI-EVA, UB), Jordan Calandre (MIA), Boris Mansencal (LaBRI), Jenny Benois-Pineau (LaBRI), Renaud Péteri (MIA), Laurent Mascarilla (MIA), Julien Morlier (IMS)
Comments: MediaEval 2021, Dec 2021, Online, Germany
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[971]  arXiv:2112.11406 [pdf, other]
Title: ADJUST: A Dictionary-Based Joint Reconstruction and Unmixing Method for Spectral Tomography
Comments: This paper is under consideration at Inverse Problems with minor revisions. 33 pages, 24 figures
Journal-ref: Inverse Problems 38 12 (2022) 125002
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Engineering, Finance, and Science (cs.CE); Numerical Analysis (math.NA); Optimization and Control (math.OC)
[972]  arXiv:2112.11427 [pdf, other]
Title: StyleSDF: High-Resolution 3D-Consistent Image and Geometry Generation
Comments: Camera-Ready version. Paper was accepted as oral to CVPR 2022. Added discussions and figures from the rebuttal to the supplementary material (sections C & F). Project Webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[973]  arXiv:2112.11435 [pdf, other]
Title: Learned Queries for Efficient Local Attention
Comments: CVPR 2022 - Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[974]  arXiv:2112.11454 [pdf, other]
Title: GOAL: Generating 4D Whole-Body Motion for Hand-Object Grasping
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[975]  arXiv:2112.11542 [pdf, other]
Title: MIA-Former: Efficient and Robust Vision Transformers via Multi-grained Input-Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[976]  arXiv:2112.11543 [pdf, other]
Title: Real-time Street Human Motion Capture
Comments: 7 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[977]  arXiv:2112.11547 [pdf, other]
Title: Decompose the Sounds and Pixels, Recompose the Events
Comments: Accepted at AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[978]  arXiv:2112.11554 [pdf, other]
Title: Distribution-aware Margin Calibration for Semantic Segmentation in Images
Comments: This paper has been accepted by International Journal of Computer Vision (IJCV), and published on 09 November 2021. arXiv admin note: text overlap with arXiv:2011.01462
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[979]  arXiv:2112.11573 [pdf, other]
Title: Anomaly Clustering: Grouping Images into Coherent Clusters of Anomaly Types
Comments: WACV2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[980]  arXiv:2112.11593 [pdf, other]
Title: AdaptPose: Cross-Dataset Adaptation for 3D Human Pose Estimation by Learnable Motion Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[981]  arXiv:2112.11610 [pdf, other]
Title: EyePAD++: A Distillation-based approach for joint Eye Authentication and Presentation Attack Detection using Periocular Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[982]  arXiv:2112.11623 [pdf, other]
Title: MOSAIC: Mobile Segmentation via decoding Aggregated Information and encoded Context
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[983]  arXiv:2112.11629 [pdf, other]
Title: Convolutional neural network based on transfer learning for breast cancer screening
Comments: 9 pages, 7 figures. arXiv admin note: text overlap with arXiv:2009.08831
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[984]  arXiv:2112.11641 [pdf, other]
Title: JoJoGAN: One Shot Face Stylization
Comments: code at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[985]  arXiv:2112.11643 [pdf, other]
Title: Exploring Credibility Scoring Metrics of Perception Systems for Autonomous Driving
Comments: In 14th International Conference on COMmunication Systems & NETworkS (COMSNETS) Intelligent Transportation Systems 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[986]  arXiv:2112.11648 [pdf, other]
Title: Out-of-distribution Detection with Boundary Aware Learning
Journal-ref: ECCV 2022 Poster
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[987]  arXiv:2112.11679 [pdf, other]
Title: Ghost-dil-NetVLAD: A Lightweight Neural Network for Visual Place Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[988]  arXiv:2112.11685 [pdf, other]
Title: Cost Aggregation Is All You Need for Few-Shot Segmentation
Comments: The trained weights and codes are available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[989]  arXiv:2112.11689 [pdf, other]
Title: Multi-Centroid Representation Network for Domain Adaptive Person Re-ID
Comments: Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[990]  arXiv:2112.11691 [pdf, other]
Title: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[991]  arXiv:2112.11699 [pdf, other]
Title: Few-Shot Object Detection: A Comprehensive Survey
Comments: 27 pages, 13 figures, submitted to IEEE Transactions on Neural Networks and Learning Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[992]  arXiv:2112.11700 [pdf, other]
Title: Adaptive Contrast for Image Regression in Computer-Aided Disease Assessment
Comments: Accepted in IEEE Transactions on Medical Imaging
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[993]  arXiv:2112.11706 [pdf, other]
Title: Entropy Regularized Iterative Weighted Shrinkage-Thresholding Algorithm (ERIWSTA): An Application to CT Image Restoration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[994]  arXiv:2112.11710 [pdf, ps, other]
Title: Fusion of medical imaging and electronic health records with attention and multi-head machanisms
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[995]  arXiv:2112.11713 [pdf, other]
Title: High-Accuracy RGB-D Face Recognition via Segmentation-Aware Face Depth Estimation and Mask-Guided Attention Network
Comments: IEEE International Conference on Automatic Face and Gesture Recognition (FG) 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[996]  arXiv:2112.11716 [pdf, other]
Title: Comparing radiologists' gaze and saliency maps generated by interpretability methods for chest x-rays
Comments: This paper was presented as an Extended Abstract at the Gaze Meets ML 2022 Workshop, a NeurIPS 2022 workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[997]  arXiv:2112.11729 [pdf, other]
Title: Generalized Local Optimality for Video Steganalysis in Motion Vector Domain
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[998]  arXiv:2112.11749 [pdf, other]
Title: Class-aware Sounding Objects Localization via Audiovisual Correspondence
Comments: accepted by TPAMI 2021. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[999]  arXiv:2112.11779 [pdf, other]
Title: Exploring Inter-frequency Guidance of Image for Lightweight Gaussian Denoising
Authors: Zhuang Jia
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1000]  arXiv:2112.11790 [pdf, other]
Title: BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-View
Comments: Multi-camera 3D Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1001]  arXiv:2112.11798 [pdf, other]
Title: YOLO-Z: Improving small object detection in YOLOv5 for autonomous vehicles
Comments: ICCV 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1002]  arXiv:2112.11824 [pdf, ps, other]
Title: Binary Image Skeletonization Using 2-Stage U-Net
Comments: Computer Vision Course Project [AUC, Spring 21]
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1003]  arXiv:2112.11834 [pdf, other]
Title: Bottom-up approaches for multi-person pose estimation and it's applications: A brief review
Comments: 13 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1004]  arXiv:2112.11846 [pdf, other]
Title: A Discriminative Single-Shot Segmentation Network for Visual Object Tracking
Comments: Extended version of the D3S tracker (CVPR2020). Accepted to IEEE TPAMI. arXiv admin note: substantial text overlap with arXiv:1911.08862
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1005]  arXiv:2112.11853 [pdf, other]
Title: Geodesic squared exponential kernel for non-rigid shape registration
Authors: Florent Jousse (UCA, Qc, EPIONE), Xavier Pennec (UCA, EPIONE), Hervé Delingette (UCA, EPIONE), Matilde Gonzalez (Qc)
Comments: 2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021) PROCEEDINGS, Dec 2021, JODHPUR, India
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1006]  arXiv:2112.11895 [pdf, other]
Title: Few-shot Font Generation with Weakly Supervised Localized Representations
Comments: First two authors contributed equally. This is a journal extension of our AAAI 2021 paper arXiv:2009.11042; Code: this https URL and this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1007]  arXiv:2112.11929 [pdf, other]
Title: Meta-Learning and Self-Supervised Pretraining for Real World Image Translation
Comments: 10 pages, 8 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1008]  arXiv:2112.11975 [pdf, other]
Title: Page Segmentation using Visual Adjacency Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1009]  arXiv:2112.11992 [pdf, other]
Title: Automatic Estimation of Anthropometric Human Body Measurements
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1010]  arXiv:2112.12001 [pdf, other]
Title: DA-FDFtNet: Dual Attention Fake Detection Fine-tuning Network to Detect Various AI-Generated Fake Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1011]  arXiv:2112.12002 [pdf, other]
Title: Looking Beyond Corners: Contrastive Learning of Visual Representations for Keypoint Detection and Description Extraction
Comments: Accepted at IEEE WCCI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1012]  arXiv:2112.12004 [pdf, other]
Title: Barely-Supervised Learning: Semi-Supervised Learning with very few labeled images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1013]  arXiv:2112.12027 [pdf, other]
Title: Learning and Crafting for the Wide Multiple Baseline Stereo
Authors: Dmytro Mishkin
Comments: After-defence version with additional fixes based on reviewer comments. 144 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1014]  arXiv:2112.12053 [pdf, other]
Title: Multi-View Partial (MVP) Point Cloud Challenge 2021 on Completion and Registration: Methods and Results
Comments: 15 pages, 13 figures, ICCV2021 Workshop Technical Report, the codebase webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1015]  arXiv:2112.12060 [pdf, ps, other]
Title: Deep Models for Visual Sentiment Analysis of Disaster-related Multimedia Content
Comments: 3 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1016]  arXiv:2112.12070 [pdf, ps, other]
Title: A Single-Target License Plate Detection with Attention
Comments: IWAIT2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1017]  arXiv:2112.12072 [pdf, other]
Title: Hierarchical Cross-Modality Semantic Correlation Learning Model for Multimodal Summarization
Comments: Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[1018]  arXiv:2112.12073 [pdf, other]
Title: Two Stream Network for Stroke Detection in Table Tennis
Authors: Anam Zahra (MPI-EVA), Pierre-Etienne Martin (LaBRI, MPI-EVA, UB)
Comments: MediaEval 2021, Dec 2021, Online, Germany
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[1019]  arXiv:2112.12074 [pdf, other]
Title: Spatio-Temporal CNN baseline method for the Sports Video Task of MediaEval 2021 benchmark
Authors: Pierre-Etienne Martin (LaBRI, MPI-EVA, UB)
Journal-ref: MediaEval 2021, Dec 2021, Online, Germany
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[1020]  arXiv:2112.12082 [pdf, ps, other]
Title: A New Adaptive Noise Covariance Matrices Estimation and Filtering Method: Application to Multi-Object Tracking
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1021]  arXiv:2112.12084 [pdf, other]
Title: Input-Specific Robustness Certification for Randomized Smoothing
Comments: Accepted by AAAI22
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1022]  arXiv:2112.12086 [pdf, other]
Title: Improved skin lesion recognition by a Self-Supervised Curricular Deep Learning approach
Authors: Kirill Sirotkin (1), Marcos Escudero-Viñolo (1), Pablo Carballeira (1), Juan Carlos SanMiguel (1) ((1) Universidad Autónoma de Madrid, Escuela Politécnica Superior, Spain)
Comments: 11 pages, 8 figures, submitted to the Journal of Biomedical and Health Informatics (Special Issue on Skin Image Analysis in the Age of Deep Learning)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1023]  arXiv:2112.12089 [pdf, other]
Title: Reflash Dropout in Image Super-Resolution
Comments: CVPR2022 paper + supplementary file
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1024]  arXiv:2112.12130 [pdf, other]
Title: NICE-SLAM: Neural Implicit Scalable Encoding for SLAM
Comments: CVPR 2022, first two authors contributed equally. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1025]  arXiv:2112.12133 [pdf, other]
Title: Can Deep Neural Networks be Converted to Ultra Low-Latency Spiking Neural Networks?
Comments: Accepted to DATE 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1026]  arXiv:2112.12141 [pdf, other]
Title: Multi-modal 3D Human Pose Estimation with 2D Weak Supervision in Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1027]  arXiv:2112.12143 [pdf, other]
Title: Scaling Open-Vocabulary Image Segmentation with Image-Level Labels
Comments: Accepted at ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1028]  arXiv:2112.12175 [pdf, other]
Title: Recur, Attend or Convolve? On Whether Temporal Modeling Matters for Cross-Domain Robustness in Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1029]  arXiv:2112.12180 [pdf, other]
Title: Multimodal Personality Recognition using Cross-Attention Transformer and Behaviour Encoding
Comments: Preprint. Final paper accepted at the 17th International Conference on Computer Vision Theory and Applications (VISAPP), virtual, February, 2022. 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1030]  arXiv:2112.12182 [pdf, other]
Title: Fine-grained Multi-Modal Self-Supervised Learning
Comments: Accepted at BMVC 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1031]  arXiv:2112.12193 [pdf, other]
Title: Improved 2D Keypoint Detection in Out-of-Balance and Fall Situations -- combining input rotations and a kinematic model
Comments: extended abstract, 4 pages, 3 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1032]  arXiv:2112.12218 [pdf, other]
Title: Maximum Entropy on Erroneous Predictions (MEEP): Improving model calibration for medical image segmentation
Comments: Accepted for publication at MICCAI 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1033]  arXiv:2112.12219 [pdf, other]
Title: SAMCNet for Spatial-configuration-based Classification: A Summary of Results
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1034]  arXiv:2112.12252 [pdf, other]
Title: Leveraging Synthetic Data in Object Detection on Unmanned Aerial Vehicles
Comments: The first two authors contributed equally. Github repository will be made public soon
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1035]  arXiv:2112.12328 [pdf, other]
Title: Robust and Precise Facial Landmark Detection by Self-Calibrated Pose Attention Network
Comments: Accepted by IEEE Transactions on Cybernetics, December 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1036]  arXiv:2112.12329 [pdf, other]
Title: MVDG: A Unified Multi-view Framework for Domain Generalization
Comments: Accepted by ECCV2022. The code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1037]  arXiv:2112.12345 [pdf, other]
Title: Revisiting Transformation Invariant Geometric Deep Learning: Are Initial Representations All You Need?
Comments: 11 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1038]  arXiv:2112.12349 [pdf, other]
Title: Learning Hierarchical Attention for Weakly-supervised Chest X-Ray Abnormality Localization and Diagnosis
Journal-ref: IEEE Transactions on Medical Imaging 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1039]  arXiv:2112.12355 [pdf, other]
Title: A Random Point Initialization Approach to Image Segmentation with Variational Level-sets
Comments: 17 pages, 27 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1040]  arXiv:2112.12359 [pdf, other]
Title: Dual Path Structural Contrastive Embeddings for Learning Novel Objects
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1041]  arXiv:2112.12385 [pdf, other]
Title: DILF-EN framework for Class-Incremental Learning
Comments: Under Review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1042]  arXiv:2112.12390 [pdf, other]
Title: Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields
Comments: 6 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1043]  arXiv:2112.12402 [pdf, other]
Title: Iteratively Selecting an Easy Reference Frame Makes Unsupervised Video Object Segmentation Easier
Comments: Accepted to AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1044]  arXiv:2112.12409 [pdf, other]
Title: InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1045]  arXiv:2112.12455 [pdf, ps, other]
Title: Your Face Mirrors Your Deepest Beliefs-Predicting Personality and Morals through Facial Emotion Recognition
Journal-ref: Future Internet 14(1), 5 (2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[1046]  arXiv:2112.12484 [pdf, other]
Title: Pose Adaptive Dual Mixup for Few-Shot Single-View 3D Reconstruction
Comments: To appear in the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI), February 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1047]  arXiv:2112.12494 [pdf, other]
Title: LaTr: Layout-Aware Transformer for Scene-Text VQA
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1048]  arXiv:2112.12496 [pdf, other]
Title: FedFR: Joint Optimization Federated Framework for Generic and Personalized Face Recognition
Comments: This paper was accepted by AAAI 2022 Conference on Artificial Intelligence and selected as an oral paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1049]  arXiv:2112.12506 [pdf, other]
Title: Attentive Multi-View Deep Subspace Clustering Net
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1050]  arXiv:2112.12535 [pdf, other]
Title: FourierMask: Instance Segmentation using Fourier Mapping in Implicit Neural Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1051]  arXiv:2112.12573 [pdf, other]
Title: Boosting Generative Zero-Shot Learning by Synthesizing Diverse Features with Attribute Augmentation
Comments: Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1052]  arXiv:2112.12577 [pdf, other]
Title: NVS-MonoDepth: Improving Monocular Depth Prediction with Novel View Synthesis
Comments: 8 pages (main paper), 9 pages (supplementary material), 14 figures, 4 tables
Journal-ref: 2021 International Conference on 3D Vision (3DV)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1053]  arXiv:2112.12579 [pdf, other]
Title: NeRD++: Improved 3D-mirror symmetry learning from a single image
Comments: BMVC 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1054]  arXiv:2112.12606 [pdf, other]
Title: Towards Universal GAN Image Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1055]  arXiv:2112.12610 [pdf, other]
Title: PandaSet: Advanced Sensor Suite Dataset for Autonomous Driving
Comments: This paper has been published on ITSC'2021, please check the website of the PandaSet for more information: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1056]  arXiv:2112.12618 [pdf, other]
Title: Manifold Learning Benefits GANs
Comments: CVPR 2022, 32 pages full version
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1057]  arXiv:2112.12625 [pdf, other]
Title: Comparison and Analysis of Image-to-Image Generative Adversarial Networks: A Survey
Comments: 36 pages, 22 figures, Preprint; format changed, typos corrected
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1058]  arXiv:2112.12668 [pdf, other]
Title: 3D Skeleton-based Few-shot Action Recognition with JEANIE is not so Naïve
Comments: Full 17 page version
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[1059]  arXiv:2112.12702 [pdf, other]
Title: TagLab: A human-centric AI system for interactive semantic segmentation
Comments: Accepted at Human Centered AI workshop at NeurIPS 2021, this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[1060]  arXiv:2112.12703 [pdf, other]
Title: Digital Editions as Distant Supervision for Layout Analysis of Printed Books
Comments: 15 pages, 2 figures. International Conference on Document Analysis and Recognition. Springer, Cham, 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1061]  arXiv:2112.12748 [pdf, other]
Title: Assessing the Impact of Attention and Self-Attention Mechanisms on the Classification of Skin Lesions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1062]  arXiv:2112.12750 [pdf, other]
Title: SLIP: Self-supervision meets Language-Image Pre-training
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1063]  arXiv:2112.12761 [pdf, other]
Title: BANMo: Building Animatable 3D Neural Models from Many Casual Videos
Comments: CVPR 2022 camera-ready version (last update: May 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1064]  arXiv:2112.12777 [pdf, other]
Title: Cross Modal Retrieval with Querybank Normalisation
Comments: Accepted at CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1065]  arXiv:2112.12782 [pdf, other]
Title: SeMask: Semantically Masked Transformers for Semantic Segmentation
Comments: Updated experiments with Mix-Transformer (MiT) on ADE20K and added an analysis section
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1066]  arXiv:2112.12785 [pdf, other]
Title: NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning
Comments: Accepted at CVPR 2022. Supplementary material included after references. 15 pages, 14 figures, 6 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1067]  arXiv:2112.12786 [pdf, other]
Title: ELSA: Enhanced Local Self-Attention for Vision Transformer
Comments: Project at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[1068]  arXiv:2112.12812 [pdf, other]
Title: MDN-VO: Estimating Visual Odometry with Confidence
Journal-ref: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 3528-3533
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1069]  arXiv:2112.12818 [pdf, other]
Title: Multi-Camera Sensor Fusion for Visual Odometry using Deep Uncertainty Estimation
Journal-ref: 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), 2021, pp. 2944-2949
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1070]  arXiv:2112.12833 [pdf, other]
Title: Dense Out-of-Distribution Detection by Robust Learning on Synthetic Negative Data
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1071]  arXiv:2112.12843 [pdf, other]
Title: Impact of class imbalance on chest x-ray classifiers: towards better evaluation practices for discrimination and calibration performance
Comments: Conference on Health, Inference, and Learning (CHIL) 2022 - Invited non-archival presentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1072]  arXiv:2112.12867 [pdf, other]
Title: HSPACE: Synthetic Parametric Humans Animated in Complex Environments
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1073]  arXiv:2112.12887 [pdf, other]
Title: A formal approach to good practices in Pseudo-Labeling for Unsupervised Domain Adaptive Re-Identification
Comments: This paper is a preprint under submission at CVIU for review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1074]  arXiv:2112.12911 [pdf, other]
Title: Cluster-guided Image Synthesis with Unconditional Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1075]  arXiv:2112.12916 [pdf, other]
Title: Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
Comments: Accepted by AAAI-22
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1076]  arXiv:2112.12917 [pdf, other]
Title: Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation
Comments: Accepted by ACM Multimedia 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1077]  arXiv:2112.12925 [pdf, other]
Title: Not All Voxels Are Equal: Semantic Scene Completion from the Point-Voxel Perspective
Comments: Accepted to AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1078]  arXiv:2112.12927 [pdf, other]
Title: Learning Aligned Cross-Modal Representation for Generalized Zero-Shot Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1079]  arXiv:2112.12939 [pdf, other]
Title: Realtime Global Attention Network for Semantic Segmentation
Authors: Xi Mo, Xiangyu Chen
Comments: Ver1.0 for RA-L with ICRA presentation
Journal-ref: IEEE Robotics and Automation Letters 7(2022).1574-1580
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1080]  arXiv:2112.12955 [pdf, ps, other]
Title: Deep ensembles in bioimage segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1081]  arXiv:2112.12970 [pdf, other]
Title: SGTR: End-to-end Scene Graph Generation with Transformer
Comments: Accepted by CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1082]  arXiv:2112.12988 [pdf, other]
Title: iSeg3D: An Interactive 3D Shape Segmentation Tool
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1083]  arXiv:2112.12989 [pdf, other]
Title: Domain-Aware Continual Zero-Shot Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1084]  arXiv:2112.13002 [pdf, other]
Title: US-GAN: On the importance of Ultimate Skip Connection for Facial Expression Synthesis
Journal-ref: Multimed Tools Appl (2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1085]  arXiv:2112.13003 [pdf, other]
Title: Continuous Spectral Reconstruction from RGB Images via Implicit Neural Representation
Comments: Accepted to ECCV Workshop 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1086]  arXiv:2112.13018 [pdf, other]
Title: Benchmarking Pedestrian Odometry: The Brown Pedestrian Odometry Dataset (BPOD)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1087]  arXiv:2112.13031 [pdf, other]
Title: Grounding Linguistic Commands to Navigable Regions
Journal-ref: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 8593-8600
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1088]  arXiv:2112.13047 [pdf, other]
Title: Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1089]  arXiv:2112.13050 [pdf, other]
Title: Self-Gated Memory Recurrent Network for Efficient Scalable HDR Deghosting
Comments: 12 pages
Journal-ref: IEEE Transactions on Computational Imaging (Volume 7, 2021) 1228-1239
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1090]  arXiv:2112.13060 [src]
Title: NIP: Neuron-level Inverse Perturbation Against Adversarial Attacks
Comments: There are some problems in the figure so we need to withdraw this paper. We will upload the new version after revision
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1091]  arXiv:2112.13076 [pdf, other]
Title: Virtuoso: Video-based Intelligence for real-time tuning on SOCs
Comments: 28 pages, 15 figures, 4 tables, ACM-TODAES
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1092]  arXiv:2112.13082 [pdf, other]
Title: Multi-Scale Feature Fusion: Learning Better Semantic Segmentation for Road Pothole Detection
Comments: 2021 IEEE International Conference on Autonomous Systems (ICAS)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1093]  arXiv:2112.13085 [pdf, other]
Title: SimViT: Exploring a Simple Vision Transformer with sliding windows
Comments: 7 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1094]  arXiv:2112.13107 [pdf, other]
Title: Invertible Network for Unpaired Low-light Image Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1095]  arXiv:2112.13142 [pdf, other]
Title: Reconstructing Compact Building Models from Point Clouds Using Deep Implicit Fields
Comments: Accepted for publication in ISPRS Journal of Photogrammetry and Remote Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1096]  arXiv:2112.13165 [pdf, other]
Title: Semantic Clustering based Deduction Learning for Image Recognition and Classification
Journal-ref: Pattern Recognition 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1097]  arXiv:2112.13308 [pdf, other]
Title: Unsupervised Clustering Active Learning for Person Re-identification
Comments: This work was submitted to BMVC2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1098]  arXiv:2112.13310 [pdf, other]
Title: Miti-DETR: Object Detection based on Transformers with Mitigatory Self-Attention Convergence
Journal-ref: AAAI 2022 workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1099]  arXiv:2112.13328 [pdf, other]
Title: Continuous Offline Handwriting Recognition using Deep Learning Models
Authors: Jorge Sueiras
Comments: 186 pages, 83 figures, thesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1100]  arXiv:2112.13341 [pdf, other]
Title: AlertTrap: A study on object detection in remote insects trap monitoring system using on-the-edge deep learning platform
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1101]  arXiv:2112.13465 [pdf, other]
Title: PreDisM: Pre-Disaster Modelling With CNN Ensembles for At-Risk Communities
Journal-ref: NeurIPS 2021 Workshop on Tackling Climate Change with Machine Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[1102]  arXiv:2112.13478 [pdf, other]
Title: Video Joint Modelling Based on Hierarchical Transformer for Co-summarization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1103]  arXiv:2112.13491 [pdf, other]
Title: A Compact Neural Network-based Algorithm for Robust Image Watermarking
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1104]  arXiv:2112.13492 [pdf, other]
Title: Vision Transformer for Small-Size Datasets
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1105]  arXiv:2112.13494 [pdf, ps, other]
Title: Estimating Parameters of the Tree Root in Heterogeneous Soil Environments via Mask-Guided Multi-Polarimetric Integration Neural Network
Comments: 14 pages, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1106]  arXiv:2112.13522 [pdf, other]
Title: Dual Contrastive Learning for General Face Forgery Detection
Comments: This paper was accepted by AAAI 2022 Conference on Artificial Intelligence
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1107]  arXiv:2112.13528 [pdf, other]
Title: Learning Generative Vision Transformer with Energy-Based Latent Space for Saliency Prediction
Comments: NeurIPS 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1108]  arXiv:2112.13534 [pdf, other]
Title: Adversarial Attack for Asynchronous Event-based Data
Authors: Wooju Lee, Hyun Myung
Comments: 8 pages, 6 figures, Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1109]  arXiv:2112.13538 [pdf, other]
Title: Meta-Learned Feature Critics for Domain Generalized Semantic Segmentation
Comments: Accepted by ICIP 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1110]  arXiv:2112.13539 [pdf, other]
Title: Few-Shot Classification in Unseen Domains by Episodic Meta-Learning Across Visual Domains
Comments: Accepted by ICIP 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1111]  arXiv:2112.13540 [pdf, other]
Title: Image Edge Restoring Filter
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1112]  arXiv:2112.13545 [pdf, other]
Title: ViR:the Vision Reservoir
Comments: 10 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1113]  arXiv:2112.13547 [pdf, other]
Title: PRIME: A few primitives can boost robustness to common corruptions
Comments: Code available at: this https URL
Journal-ref: European Conference on Computer Vision (ECCV) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1114]  arXiv:2112.13548 [pdf, other]
Title: Responsive Listening Head Generation: A Benchmark Dataset and Baseline
Comments: Accepted by ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1115]  arXiv:2112.13551 [pdf, other]
Title: Learning Robust and Lightweight Model through Separable Structured Transformations
Comments: 18 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1116]  arXiv:2112.13565 [pdf, other]
Title: Hard Example Guided Hashing for Image Retrieval
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1117]  arXiv:2112.13583 [pdf, ps, other]
Title: Vegetation Stratum Occupancy Prediction from Airborne LiDAR 3D Point Clouds
Journal-ref: SilviLaser 2021 Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1118]  arXiv:2112.13592 [pdf, other]
Title: Multimodal Image Synthesis and Editing: The Generative AI Era
Comments: TPAMI 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1119]  arXiv:2112.13608 [pdf, other]
Title: An Empirical Study of Adder Neural Networks for Object Detection
Journal-ref: NeurIPS 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1120]  arXiv:2112.13635 [pdf, other]
Title: AdaptivePose: Human Parts as Adaptive Points
Comments: Accepted by AAAI 2022. Code will be released after the extension
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1121]  arXiv:2112.13692 [pdf, other]
Title: Augmenting Convolutional networks with attention-based aggregation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1122]  arXiv:2112.13697 [pdf, other]
Title: Weakly Supervised Visual-Auditory Fixation Prediction with Multigranularity Perception
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1123]  arXiv:2112.13706 [pdf, other]
Title: Multi-Image Visual Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1124]  arXiv:2112.13707 [pdf, other]
Title: Visual Place Representation and Recognition from Depth Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1125]  arXiv:2112.13709 [pdf, other]
Title: Rethinking the Data Annotation Process for Multi-view 3D Pose Estimation with Active Learning and Self-Training
Comments: IEEE WACV 2023 algorithms track. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1126]  arXiv:2112.13715 [pdf, other]
Title: SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos
Comments: Accepted by ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1127]  arXiv:2112.13727 [pdf, other]
Title: A Multi-channel Training Method Boost the Performance
Authors: Yingdong Hu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1128]  arXiv:2112.13734 [pdf, ps, other]
Title: Multi-Domain Balanced Sampling Improves Out-of-Distribution Generalization of Chest X-ray Pathology Prediction Models
Comments: MED-NEURIPS 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1129]  arXiv:2112.13762 [pdf, other]
Title: MSeg: A Composite Dataset for Multi-domain Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1130]  arXiv:2112.13809 [pdf, other]
Title: Improving Deep Image Matting via Local Smoothness Assumption
Comments: 9 pages, accepted by IEEE ICME 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1131]  arXiv:2112.13815 [pdf, other]
Title: Temporally Constrained Neural Networks (TCNN): A framework for semi-supervised video semantic segmentation
Comments: 10 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1132]  arXiv:2112.13843 [pdf, other]
Title: BMPQ: Bit-Gradient Sensitivity Driven Mixed-Precision Quantization of DNNs from Scratch
Comments: 4 pages, 2 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1133]  arXiv:2112.13845 [pdf, other]
Title: Raw Produce Quality Detection with Shifted Window Self-Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1134]  arXiv:2112.13846 [pdf, ps, other]
Title: Algorithm for recognizing the contour of a honeycomb block
Comments: 11 pages, in Russian, 13 figures, ICMTMTE
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1135]  arXiv:2112.13884 [pdf, other]
Title: A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1136]  arXiv:2112.13889 [pdf, other]
Title: Free-Viewpoint RGB-D Human Performance Capture and Rendering
Comments: Accepted at ECCV 2022, Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1137]  arXiv:2112.13890 [pdf, other]
Title: SPViT: Enabling Faster Vision Transformers via Soft Token Pruning
Comments: ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[1138]  arXiv:2112.13891 [pdf, other]
Title: GPU-accelerated Faster Mean Shift with euclidean distance metrics
Comments: 7 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:2007.14283
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1139]  arXiv:2112.13906 [pdf, other]
Title: Does CLIP Benefit Visual Question Answering in the Medical Domain as Much as it Does in the General Domain?
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[1140]  arXiv:2112.13925 [pdf, other]
Title: Improving Depth Estimation using Location Information
Journal-ref: 2021 16th International Conference on Computer Engineering and Systems (ICCES)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[1141]  arXiv:2112.13942 [pdf, other]
Title: PriFit: Learning to Fit Primitives Improves Few Shot Point Cloud Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1142]  arXiv:2112.13953 [pdf, ps, other]
Title: Source Feature Compression for Object Classification in Vision-Based Underwater Robotics
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1143]  arXiv:2112.13977 [pdf, other]
Title: Exploiting Fine-grained Face Forgery Clues via Progressive Enhancement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1144]  arXiv:2112.13982 [pdf, other]
Title: Quaternion-based dynamic mode decomposition for background modeling in color videos
Comments: 16 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1145]  arXiv:2112.13983 [pdf, other]
Title: Siamese Network with Interactive Transformer for Video Object Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1146]  arXiv:2112.13985 [pdf, other]
Title: LatteGAN: Visually Guided Language Attention for Multi-Turn Text-Conditioned Image Manipulation
Journal-ref: IEEE Access, 9, 160521-160532 (2021)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1147]  arXiv:2112.13986 [pdf, other]
Title: Deep-CNN based Robotic Multi-Class Under-Canopy Weed Control in Precision Farming
Comments: 8 pages, 7 figures, International Conference on Robotics and Automation (IEEE)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1148]  arXiv:2112.13989 [pdf, other]
Title: Associative Adversarial Learning Based on Selective Attack
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1149]  arXiv:2112.14000 [pdf, other]
Title: Pale Transformer: A General Vision Transformer Backbone with Pale-Shaped Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1150]  arXiv:2112.14015 [pdf, other]
Title: GuidedMix-Net: Semi-supervised Semantic Segmentation by Using Labeled Images as Reference
Comments: Accepted by AAAI'22. arXiv admin note: substantial text overlap with arXiv:2106.15064
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1151]  arXiv:2112.14016 [pdf, other]
Title: Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking
Comments: Accepted by TPAMI. Extended version of the RLS-RTMDNet tracker (CVPR2020)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1152]  arXiv:2112.14019 [pdf, other]
Title: Semi-supervised Salient Object Detection with Effective Confidence Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1153]  arXiv:2112.14023 [pdf, other]
Title: The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection
Comments: Accepted to ICCV 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1154]  arXiv:2112.14025 [pdf, other]
Title: Delving into Probabilistic Uncertainty for Unsupervised Domain Adaptive Person Re-Identification
Comments: Accepted by AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1155]  arXiv:2112.14059 [pdf, other]
Title: DetarNet: Decoupling Translation and Rotation by Siamese Network for Point Cloud Registration
Comments: Accepted by AAAI-2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1156]  arXiv:2112.14084 [pdf, other]
Title: Embodied Learning for Lifelong Visual Perception
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1157]  arXiv:2112.14087 [pdf, other]
Title: APRIL: Finding the Achilles' Heel on Privacy for Vision Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1158]  arXiv:2112.14088 [pdf, other]
Title: Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1159]  arXiv:2112.14100 [pdf, other]
Title: Extended Self-Critical Pipeline for Transforming Videos to Text (TRECVID-VTT Task 2021) -- Team: MMCUniAugsburg
Comments: TRECVID 2021 notebook paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1160]  arXiv:2112.14159 [pdf, other]
Title: Skin feature point tracking using deep feature encodings
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1161]  arXiv:2112.14238 [pdf, other]
Title: AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition
Comments: Accepted by CVPR-2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1162]  arXiv:2112.14239 [pdf, other]
Title: TAGPerson: A Target-Aware Generation Pipeline for Person Re-identification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1163]  arXiv:2112.14298 [pdf, ps, other]
Title: Multimodal perception for dexterous manipulation
Authors: Guanqun Cao, Shan Luo
Comments: 19 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1164]  arXiv:2112.14316 [pdf, other]
Title: FRIDA -- Generative Feature Replay for Incremental Domain Adaptation
Comments: Accepted at CVIU (7th January 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1165]  arXiv:2112.14327 [pdf, other]
Title: Multi-Head Deep Metric Learning Using Global and Local Representations
Comments: To appear in WACV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1166]  arXiv:2112.14331 [pdf, other]
Title: 360° Optical Flow using Tangent Images
Comments: The 32nd British Machine Vision Conference (BMVC 2021)
Journal-ref: BMVC 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1167]  arXiv:2112.14379 [pdf, other]
Title: Background-aware Classification Activation Map for Weakly Supervised Object Localization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1168]  arXiv:2112.14380 [pdf, other]
Title: Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1169]  arXiv:2112.14381 [pdf, other]
Title: COTReg:Coupled Optimal Transport based Point Cloud Registration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1170]  arXiv:2112.14382 [pdf, other]
Title: Self-Supervised Robustifying Guidance for Monocular 3D Face Reconstruction
Comments: Accepted by The 33rd British Machine Vision Conference (BMVC) 2022. Evaluation code and datasets: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1171]  arXiv:2112.14406 [pdf, other]
Title: Overcoming Mode Collapse with Adaptive Multi Adversarial Training
Comments: BMVC 2021 Poster
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1172]  arXiv:2112.14420 [pdf, other]
Title: Invertible Image Dataset Protection
Comments: Submitted to ICME 2022. Authors are from University of Science and Technology of China, Fudan University, China. A potential extended version of this work is under way
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1173]  arXiv:2112.14440 [pdf, other]
Title: ACDNet: Adaptively Combined Dilated Convolution for Monocular Panorama Depth Estimation
Comments: 13 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1174]  arXiv:2112.14478 [pdf, other]
Title: Semantic Feature Extraction for Generalized Zero-shot Learning
Comments: Accepted at AAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1175]  arXiv:2112.14491 [pdf, other]
Title: Two-phase training mitigates class imbalance for camera trap image classification with CNNs
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1176]  arXiv:2112.14513 [pdf, other]
Title: Spatial Distribution Patterns of Clownfish in Recirculating Aquaculture Systems
Comments: 14 pages, 15 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[1177]  arXiv:2112.14540 [src]
Title: Res2NetFuse: A Fusion Method for Infrared and Visible Images
Comments: There are some errors that need to be corrected
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[1178]  arXiv:2112.14651 [pdf, other]
Title: On the Instability of Relative Pose Estimation and RANSAC's Role
Comments: 27 pages, 11 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
[1179]  arXiv:2112.14656 [pdf, other]
Title: Gendered Differences in Face Recognition Accuracy Explained by Hairstyles, Makeup, and Facial Morphology
Comments: arXiv admin note: substantial text overlap with arXiv:2008.06989
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1180]  arXiv:2112.14663 [pdf, other]
Title: MetaGraspNet_v0: A Large-Scale Benchmark Dataset for Vision-driven Robotic Grasping via Physics-based Metaverse Synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1181]  arXiv:2112.14683 [pdf, other]
Title: StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1182]  arXiv:2112.14757 [pdf, other]
Title: A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1183]  arXiv:2112.14796 [pdf, ps, other]
Title: Deep Learning meets Liveness Detection: Recent Advancements and Challenges
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1184]  arXiv:2112.14804 [pdf, other]
Title: Learning Spatially-Adaptive Squeeze-Excitation Networks for Image Synthesis and Image Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1185]  arXiv:2112.14894 [pdf, other]
Title: Feature Generation and Hypothesis Verification for Reliable Face Anti-Spoofing
Comments: Accepted by AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[1186]  arXiv:2112.14931 [pdf, other]
Title: Dense Depth Estimation from Multiple 360-degree Images Using Virtual Depth
Comments: 16 pages, 11 figures, Applied Intelligence
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[1187]  arXiv:2112.14934 [pdf, other]
Title: SFU-HW-Tracks-v1: Object Tracking Dataset on Raw Video Sequences
Comments: 4 pages, 3 figures, submitted to Data in Brief
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1188]  arXiv:2112.14968 [pdf, other]
Title: A Novel Generator with Auxiliary Branch for Improving GAN Performance
Journal-ref: IEEE transactions on neural networks and learning systems 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1189]  arXiv:2112.14971 [pdf, other]
Title: Contrastive Fine-grained Class Clustering via Generative Adversarial Networks
Comments: ICLR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1190]  arXiv:2112.14976 [src]
Title: Contrastive Learning of Semantic and Visual Representations for Text Tracking
Comments: This paper is being merged with arXiv article 2207.08417. We will withdraw the two papers and create a new one
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1191]  arXiv:2112.14983 [pdf, ps, other]
Title: Exploring the pattern of Emotion in children with ASD as an early biomarker through Recurring-Convolution Neural Network (R-CNN)
Comments: 18 pages, 8 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[1192]  arXiv:2112.14985 [pdf, other]
Title: THE Benchmark: Transferable Representation Learning for Monocular Height Estimation
Comments: 14 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1193]  arXiv:2112.15012 [pdf, other]
Title: Investigating Pose Representations and Motion Contexts Modeling for 3D Motion Prediction
Comments: Accepted to IEEE TPAMI, 27 Dec. 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1194]  arXiv:2112.15022 [pdf, other]
Title: Continually Learning Self-Supervised Representations with Projected Functional Regularization
Comments: Accepted at Workshop on Continual Learning in Computer Vision (CVPR 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1195]  arXiv:2112.15031 [pdf, other]
Title: Development of a face mask detection pipeline for mask-wearing monitoring in the era of the COVID-19 pandemic: A modular approach
Comments: Accepted at the 19th International Joint Conference on Computer Science and Software Engineering (JCSSE 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1196]  arXiv:2112.15075 [pdf, other]
Title: Pose Estimation of Specific Rigid Objects
Authors: Tomas Hodan
Comments: Tomas Hodan's PhD thesis defended on July 7, 2021. Supervisor: Prof. Jiri Matas. Reviewers: Prof. Vincent Lepetit, Prof. Markus Vincze, Dr. Slobodan Ilic. A recording of the defense: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
[1197]  arXiv:2112.15085 [pdf, ps, other]
Title: Feature Extraction, Classification and Prediction for Hand Hygiene Gestures with KNN Algorithm
Authors: Rashmi Bakshi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1198]  arXiv:2112.15091 [pdf, other]
Title: Leveraging in-domain supervision for unsupervised image-to-image translation tasks via multi-stream generators
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1199]  arXiv:2112.15093 [pdf, other]
Title: Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study
Comments: Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1200]  arXiv:2112.15095 [pdf, other]
Title: A general technique for the estimation of farm animal body part weights from CT scans and its applications in a rabbit breeding program
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1201]  arXiv:2112.15111 [pdf, other]
Title: Improving the Behaviour of Vision Transformers with Token-consistent Stochastic Layers
Comments: This article is under consideration at the Computer Vision and Image Understanding journal
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1202]  arXiv:2112.15139 [pdf, other]
Title: Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks
Comments: Accepted at ICML 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1203]  arXiv:2112.15188 [pdf, other]
Title: Towards Robustness of Neural Networks
Authors: Steven Basart
Comments: PhD Thesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1204]  arXiv:2112.15202 [pdf, other]
Title: Visual and Object Geo-localization: A Comprehensive Survey
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1205]  arXiv:2112.15283 [pdf, other]
Title: ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Comments: 15 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[1206]  arXiv:2112.15324 [pdf, other]
Title: Deconfounded Visual Grounding
Comments: AAAI 2022 Accepted
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[1207]  arXiv:2112.15344 [pdf, other]
Title: P2P-Loc: Point to Point Tiny Person Localization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1208]  arXiv:2112.15351 [pdf, other]
Title: Learning to Predict 3D Lane Shape and Camera Pose from a Single Image via Geometry Constraints
Comments: 14 pages, 10 figures, accepted by AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1209]  arXiv:2112.15355 [pdf, other]
Title: Sparse LiDAR Assisted Self-supervised Stereo Disparity Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1210]  arXiv:2112.15358 [pdf, other]
Title: Conditional Generative Data-free Knowledge Distillation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1211]  arXiv:2112.15399 [pdf, other]
Title: InfoNeRF: Ray Entropy Minimization for Few-Shot Neural Volume Rendering
Comments: CVPR 2022, Website: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
[1212]  arXiv:2112.15439 [pdf, other]
Title: Facial-Sketch Synthesis: A New Challenge
Comments: Accepted to Machine Intelligence Research (MIR)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1213]  arXiv:2112.15458 [pdf, other]
Title: Accurate and Real-time 3D Pedestrian Detection Using an Efficient Attentive Pillar Network
Comments: 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1214]  arXiv:2112.15483 [pdf, other]
Title: Cloud Removal from Satellite Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1215]  arXiv:2112.15509 [pdf, other]
Title: Scene-Adaptive Attention Network for Crowd Counting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1216]  arXiv:2112.15571 [pdf, other]
Title: PCACE: A Statistical Approach to Ranking Neurons for CNN Interpretability
Journal-ref: Responsible AI and DeepSpatial workshops at the 27th SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[1217]  arXiv:2112.15589 [pdf, ps, other]
Title: 3-D Material Style Transfer for Reconstructing Unknown Appearance in Complex Natural Materials
Comments: 15 pages, 22 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1218]  arXiv:2112.00007 (cross-list from cs.GR) [pdf, other]
Title: Sound-Guided Semantic Image Manipulation
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1219]  arXiv:2112.00133 (cross-list from cs.LG) [pdf, other]
Title: PokeBNN: A Binary Pursuit of Lightweight Accuracy
Comments: Accepted to CVPR 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1220]  arXiv:2112.00171 (cross-list from cs.LG) [pdf, other]
Title: Improving Differentiable Architecture Search with a Generative Model
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1221]  arXiv:2112.00190 (cross-list from cs.LG) [pdf, ps, other]
Title: Is the use of Deep Learning and Artificial Intelligence an appropriate means to locate debris in the ocean without harming aquatic wildlife?
Comments: reference list is added/updated; sorry for causing any inconveniences. 3681 words, 14 pages
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1222]  arXiv:2112.00265 (cross-list from cs.LG) [pdf, other]
Title: Training BatchNorm Only in Neural Architecture Search and Beyond
Comments: 11 pages Technical report
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1223]  arXiv:2112.00305 (cross-list from cs.LG) [pdf, other]
Title: Forward Operator Estimation in Generative Models with Kernel Transfer Operators
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1224]  arXiv:2112.00324 (cross-list from cs.LG) [pdf, ps, other]
Title: Optimizing for In-memory Deep Learning with Emerging Memory Technology
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET)
[1225]  arXiv:2112.00378 (cross-list from cs.LG) [pdf, other]
Title: $\ell_\infty$-Robustness and Beyond: Unleashing Efficient Adversarial Training
Comments: Accepted to the 17th European Conference on Computer Vision (ECCV 2022)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1226]  arXiv:2112.00584 (cross-list from cs.GR) [pdf, other]
Title: The Shape Part Slot Machine: Contact-based Reasoning for Generating 3D Shapes from Parts
Comments: European Conference on Computer Vision (ECCV) 2022
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1227]  arXiv:2112.00734 (cross-list from cs.LG) [pdf, other]
Title: Personalized Federated Learning with Adaptive Batchnorm for Healthcare
Comments: Accepted by IEEE Transactions on Big Data; code: this https URL. arXiv admin note: substantial text overlap with arXiv:2106.01009
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1228]  arXiv:2112.00739 (cross-list from cs.LG) [pdf, ps, other]
Title: Incomplete Multi-view Clustering via Cross-view Relation Transfer
Journal-ref: IEEE Transactions on Circuits and Systems for Video Technology, 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1229]  arXiv:2112.01008 (cross-list from cs.LG) [pdf, other]
Title: Editing a classifier by rewriting its prediction rules
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1230]  arXiv:2112.01010 (cross-list from cs.LG) [pdf, other]
Title: Differentiable Spatial Planning using Transformers
Comments: Published at ICML 2021. See project webpage at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1231]  arXiv:2112.01283 (cross-list from cs.LG) [pdf, ps, other]
Title: Detecting Extratropical Cyclones of the Northern Hemisphere with Single Shot Detector
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Atmospheric and Oceanic Physics (physics.ao-ph)
[1232]  arXiv:2112.01406 (cross-list from cs.LG) [pdf, other]
Title: Active Learning for Domain Adaptation: An Energy-Based Approach
Comments: Camera ready for AAAI 2022. Code is available at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1233]  arXiv:2112.01421 (cross-list from cs.LG) [pdf, other]
Title: Deep residential representations: Using unsupervised learning to unlock elevation data for geo-demographic prediction
Comments: 29 pages, 13 figures. V2 - Published
Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing, 187, 378-392 (2022)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1234]  arXiv:2112.01423 (cross-list from cs.LG) [pdf, other]
Title: Training Efficiency and Robustness in Deep Learning
Authors: Fartash Faghri
Comments: A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1235]  arXiv:2112.01511 (cross-list from cs.RO) [pdf, other]
Title: The Surprising Effectiveness of Representation Learning for Visual Imitation
Comments: The first two authors contributed equally
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1236]  arXiv:2112.01579 (cross-list from cs.GR) [pdf, other]
Title: Fast Neural Representations for Direct Volume Rendering
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1237]  arXiv:2112.01716 (cross-list from cs.LG) [pdf, other]
Title: Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research
Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (stat.ML)
[1238]  arXiv:2112.01790 (cross-list from cs.LG) [pdf, other]
Title: SSDL: Self-Supervised Dictionary Learning
Comments: Accepted by the 22nd IEEE International Conference on Multimedia and Expo (ICME) as an Oral
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1239]  arXiv:2112.01806 (cross-list from cs.SD) [pdf, other]
Title: Music-to-Dance Generation with Optimal Transport
Comments: IJCAI 2022
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1240]  arXiv:2112.01832 (cross-list from cs.MM) [pdf, other]
Title: Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval
Comments: Accepted by ECCV2022
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1241]  arXiv:2112.01840 (cross-list from cs.RO) [pdf, other]
Title: Graph-Guided Deformation for Point Cloud Completion
Comments: RAL with IROS 2021
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1242]  arXiv:2112.01849 (cross-list from cs.MM) [pdf, ps, other]
Title: Cross-modal Knowledge Distillation for Vision-to-Sensor Action Recognition
Comments: 5 pages, 2 figures, submitted to ICASSP2022
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1243]  arXiv:2112.01917 (cross-list from cs.LG) [pdf, other]
Title: A Structured Dictionary Perspective on Implicit Neural Representations
Comments: Accepted to IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (26 pages, 16 figures)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1244]  arXiv:2112.02086 (cross-list from cs.LG) [pdf, other]
Title: Data-Free Neural Architecture Search via Recursive Label Calibration
Comments: ECCV 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1245]  arXiv:2112.02094 (cross-list from cs.RO) [pdf, other]
Title: Coupling Vision and Proprioception for Navigation of Legged Robots
Comments: CVPR 2022 final version. Website at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1246]  arXiv:2112.02488 (cross-list from cs.LG) [pdf, other]
Title: Exploring Complicated Search Spaces with Interleaving-Free Sampling
Comments: 9 pages, 8 figures, 6 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1247]  arXiv:2112.02735 (cross-list from cs.RO) [pdf, other]
Title: A Dataset of Stationary, Fixed-wing Aircraft on a Collision Course for Vision-Based Sense and Avoid
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1248]  arXiv:2112.02849 (cross-list from cs.RO) [pdf, other]
Title: DemoGrasp: Few-Shot Learning for Robotic Grasping with Human Demonstration
Comments: Accepted by IROS 2021
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1249]  arXiv:2112.02880 (cross-list from cs.LG) [pdf, other]
Title: AdaSTE: An Adaptive Straight-Through Estimator to Train Binary Neural Networks
Comments: 18 pages
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1250]  arXiv:2112.03028 (cross-list from cs.CV) [pdf, other]
Title: D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions
Comments: CVPR-2022 camera ready. Project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[1251]  arXiv:2112.03030 (cross-list from cs.RO) [pdf, other]
Title: Pose2Room: Understanding 3D Scenes from Human Activities
Comments: Accepted by ECCV'2022; Project page: this https URL; Video: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1252]  arXiv:2112.03052 (cross-list from cs.LG) [pdf, other]
Title: Scaling Up Influence Functions
Comments: Published at AAAI-22
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1253]  arXiv:2112.03134 (cross-list from cs.LG) [pdf, other]
Title: Prototypical Model with Novel Information-theoretic Loss Function for Generalized Zero Shot Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1254]  arXiv:2112.03227 (cross-list from cs.RO) [pdf, other]
Title: CALVIN: A Benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
Comments: Accepted for publication at IEEE Robotics and Automation Letters (RAL). Code, models and dataset available at this http URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1255]  arXiv:2112.03257 (cross-list from cs.LG) [pdf, other]
Title: Functional Regularization for Reinforcement Learning via Learned Fourier Features
Comments: Accepted at NeurIPS 2021. Website at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO)
[1256]  arXiv:2112.03269 (cross-list from cs.HC) [pdf, other]
Title: DIY Graphics Tab: A Cost-Effective Alternative to Graphics Tablet for Educators
Comments: Accepted in AAAI2022 workshop
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
[1257]  arXiv:2112.03321 (cross-list from cs.LG) [pdf, other]
Title: Noether Networks: Meta-Learning Useful Conserved Quantities
Comments: Accepted to NeurIPS '21. The first two authors contributed equally
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1258]  arXiv:2112.03371 (cross-list from cs.LG) [pdf, other]
Title: Graphical Models with Attention for Context-Specific Independence and an Application to Perceptual Grouping
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1259]  arXiv:2112.03379 (cross-list from cs.LG) [pdf, other]
Title: Deep Efficient Continuous Manifold Learning for Time Series Modeling
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1260]  arXiv:2112.03398 (cross-list from cs.LG) [pdf, other]
Title: Top-Down Deep Clustering with Multi-generator GANs
Comments: Accepted to AAAI 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1261]  arXiv:2112.03406 (cross-list from cs.LG) [pdf, other]
Title: Equal Bits: Enforcing Equally Distributed Binary Network Weights
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1262]  arXiv:2112.03476 (cross-list from cs.CR) [pdf, other]
Title: Defending against Model Stealing via Verifying Embedded External Features
Comments: This work is accepted by the AAAI 2022. The first two authors contributed equally to this work. 11 pages
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1263]  arXiv:2112.03502 (cross-list from cs.LG) [pdf, other]
Title: A Generic Approach for Enhancing GANs by Regularized Latent Optimization
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1264]  arXiv:2112.03676 (cross-list from cs.LG) [pdf, other]
Title: PLACE dropout: A Progressive Layer-wise and Channel-wise Dropout for Domain Generalization
Comments: Accepted by ACM TOMM 2023. The code is available at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1265]  arXiv:2112.03678 (cross-list from cs.CR) [pdf, other]
Title: Does Proprietary Software Still Offer Protection of Intellectual Property in the Age of Machine Learning? -- A Case Study using Dual Energy CT Data
Comments: 6 pages, 2 figures, 1 table, accepted on BVM 2022
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
[1266]  arXiv:2112.03695 (cross-list from cs.CR) [pdf, other]
Title: Safe Distillation Box
Comments: Accepted by AAAI2022
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1267]  arXiv:2112.03908 (cross-list from cs.RO) [pdf, other]
Title: Causal Imitative Model for Autonomous Driving
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1268]  arXiv:2112.04014 (cross-list from cs.LG) [pdf, other]
Title: Unsupervised Representation Learning via Neural Activation Coding
Comments: Published in International Conference on Machine Learning (ICML), 2021
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1269]  arXiv:2112.04350 (cross-list from cs.RO) [pdf, other]
Title: Transformer based trajectory prediction
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1270]  arXiv:2112.04468 (cross-list from cs.LG) [pdf, other]
Title: Revisiting Contrastive Learning through the Lens of Neighborhood Component Analysis: an Integrated Framework
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1271]  arXiv:2112.04558 (cross-list from cs.CR) [pdf, ps, other]
Title: SoK: Anti-Facial Recognition Technology
Comments: Camera-ready version for Oakland S&P 2023
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1272]  arXiv:2112.04684 (cross-list from cs.RO) [pdf, other]
Title: Trajectory-Constrained Deep Latent Visual Attention for Improved Local Planning in Presence of Heterogeneous Terrain
Comments: Published in International Conference on Intelligent Robots and Systems (IROS) 2021 proceedings. Project website: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1273]  arXiv:2112.04758 (cross-list from cs.LG) [pdf, other]
Title: Does Redundancy in AI Perception Systems Help to Test for Super-Human Automated Driving Performance?
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1274]  arXiv:2112.04766 (cross-list from cs.LG) [pdf, other]
Title: Adaptive Methods for Aggregated Domain Generalization
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1275]  arXiv:2112.04895 (cross-list from cs.LG) [pdf, other]
Title: Latent Space Explanation by Intervention
Comments: Accepted to AAAI22
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1276]  arXiv:2112.04902 (cross-list from cs.LG) [pdf, other]
Title: Learning Personal Representations from fMRI by Predicting Neurofeedback Performance
Journal-ref: MICCAI 2020, https://link.springer.com/chapter/10.1007/978-3-030-59728-3_46
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1277]  arXiv:2112.04910 (cross-list from cs.RO) [pdf, other]
Title: Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings
Comments: Supplementary material available at: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1278]  arXiv:2112.05005 (cross-list from cs.LG) [pdf, other]
Title: Mutual Adversarial Training: Learning together is better than going alone
Comments: Under submission
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1279]  arXiv:2112.05090 (cross-list from cs.LG) [pdf, other]
Title: Extending the WILDS Benchmark for Unsupervised Adaptation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1280]  arXiv:2112.05124 (cross-list from cs.RO) [pdf, other]
Title: Neural Descriptor Fields: SE(3)-Equivariant Object Representations for Manipulation
Comments: Website: this https URL. First two authors contributed equally (order determined by coin flip), last two authors equal advising
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1281]  arXiv:2112.05135 (cross-list from cs.LG) [pdf, other]
Title: PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures
Comments: CVPR 2022. Code and models are available at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1282]  arXiv:2112.05282 (cross-list from cs.LG) [pdf, other]
Title: RamBoAttack: A Robust Query Efficient Deep Neural Network Decision Exploit
Comments: Published in Network and Distributed System Security (NDSS) Symposium 2022. Code is available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1283]  arXiv:2112.05322 (cross-list from cs.AR) [pdf, ps, other]
Title: Dynamic hardware system for cascade SVM classification of melanoma
Comments: Journal paper, 9 pages, 4 figures, 4 tables
Journal-ref: Neural Computing & Applications 32 (2020) pp.1777-1788
Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1284]  arXiv:2112.05419 (cross-list from cs.AI) [pdf, other]
Title: Predicting Physical World Destinations for Commands Given to Self-Driving Cars
Comments: Accepted at AAAI 2022. First two authors have contributed equally. Extended camera-ready version including the appendix and references to it in the main text
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[1285]  arXiv:2112.05493 (cross-list from cs.LG) [pdf, other]
Title: Network Compression via Central Filter
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1286]  arXiv:2112.05534 (cross-list from cs.RO) [pdf, other]
Title: An Embarrassingly Pragmatic Introduction to Vision-based Autonomous Robots
Authors: Marcos V. Conde
Comments: CS Thesis. Lecture Notes in Computer Science
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1287]  arXiv:2112.05634 (cross-list from cs.LG) [pdf, other]
Title: Preemptive Image Robustification for Protecting Users against Man-in-the-Middle Adversarial Attacks
Comments: Accepted and to appear at AAAI 2022
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1288]  arXiv:2112.05657 (cross-list from cs.AI) [pdf, ps, other]
Title: Artificial Intelligence -- Application in Life Sciences and Beyond. The Upper Rhine Artificial Intelligence Symposium UR-AI 2021
Authors: Karl-Herbert Schäfer (1), Franz Quint (2) ((1) Kaiserslautern University of Applied Sciences, (2) Karlsruhe University of Applied Sciences)
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[1289]  arXiv:2112.05872 (cross-list from cs.LG) [pdf, other]
Title: SLOSH: Set LOcality Sensitive Hashing via Sliced-Wasserstein Embeddings
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1290]  arXiv:2112.06102 (cross-list from cs.NE) [pdf, other]
Title: NeuroHSMD: Neuromorphic Hybrid Spiking Motion Detector
Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1291]  arXiv:2112.06132 (cross-list from cs.LG) [pdf, other]
Title: Periodic Residual Learning for Crowd Flow Forecasting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1292]  arXiv:2112.06511 (cross-list from cs.LG) [pdf, other]
Title: Ex-Model: Continual Learning from a Stream of Trained Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1293]  arXiv:2112.06539 (cross-list from cs.RO) [pdf, other]
Title: MinkLoc3D-SI: 3D LiDAR place recognition with sparse convolutions, spherical coordinates, and intensity
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1294]  arXiv:2112.06658 (cross-list from cs.LG) [pdf, other]
Title: Learning to Learn Transferable Attack
Comments: AAAI 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1295]  arXiv:2112.06772 (cross-list from cs.AR) [pdf, other]
Title: hARMS: A Hardware Acceleration Architecture for Real-Time Event-Based Optical Flow
Comments: 18 pages, 16 figures, 4 tables
Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV)
[1296]  arXiv:2112.06888 (cross-list from cs.CL) [pdf, other]
Title: Improving and Diagnosing Knowledge-Based Visual Question Answering via Entity Enhanced Knowledge Injection
Journal-ref: Proceedings of the 1st International Workshop on Multimodal Understanding for the Web and Social Media, co-located with the Web Conference 2022 (WWW '22 Companion), April 25--29, 2022, Virtual Event, Lyon, France
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1297]  arXiv:2112.07022 (cross-list from cs.GR) [pdf, other]
Title: Learning Body-Aware 3D Shape Generative Models
Comments: 11 pages, 8 figures
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1298]  arXiv:2112.07087 (cross-list from cs.NE) [pdf, other]
Title: Heuristic Hyperparameter Optimization for Convolutional Neural Networks using Genetic Algorithm
Authors: Meng Zhou
Comments: 8 pages, 3 figures
Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1299]  arXiv:2112.07207 (cross-list from cs.IT) [pdf, other]
Title: Modeling Image Quantization Tradeoffs for Optimal Compression
Authors: Johnathan Chiu
Subjects: Information Theory (cs.IT); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1300]  arXiv:2112.07214 (cross-list from cs.SD) [pdf, other]
Title: Noise Reduction and Driving Event Extraction Method for Performance Improvement on Driving Noise-based Surface Anomaly Detection
Comments: 3 pages, 3 figures, 2 tables
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1301]  arXiv:2112.07368 (cross-list from cs.LG) [pdf, other]
Title: Simple and Robust Loss Design for Multi-Label Learning with Missing Labels
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1302]  arXiv:2112.07443 (cross-list from cs.CL) [pdf, other]
Title: Text Classification Models for Form Entity Linking
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1303]  arXiv:2112.07566 (cross-list from cs.CL) [pdf, other]
Title: VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena
Comments: Paper accepted for publication at ACL 2022 Main; 28 pages, 4 figures, 11 tables
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1304]  arXiv:2112.07723 (cross-list from cs.RO) [pdf, other]
Title: Autonomous Navigation System from Simultaneous Localization and Mapping
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1305]  arXiv:2112.08060 (cross-list from cs.LG) [pdf, other]
Title: Leveraging Image-based Generative Adversarial Networks for Time Series Generation
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1306]  arXiv:2112.08132 (cross-list from cs.LG) [pdf, other]
Title: Improving Self-supervised Learning with Automated Unsupervised Outlier Arbitration
Comments: NeurIPS 2021; Code is publicly available at: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1307]  arXiv:2112.08363 (cross-list from cs.LG) [pdf, other]
Title: Performance or Trust? Why Not Both. Deep AUC Maximization with Self-Supervised Learning for COVID-19 Chest X-ray Classifications
Comments: 3 pages
Journal-ref: Published at CVIS 2021: 7th Annual Conference on Vision and Intelligent Systems
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1308]  arXiv:2112.08370 (cross-list from cs.LG) [pdf, other]
Title: Lifelong Generative Modelling Using Dynamic Expansion Graph Model
Comments: Accepted in Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI 2022)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1309]  arXiv:2112.08470 (cross-list from cs.CL) [pdf, other]
Title: Insta-VAX: A Multimodal Benchmark for Anti-Vaccine and Misinformation Posts Detection on Social Media
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1310]  arXiv:2112.08538 (cross-list from cs.LG) [pdf, other]
Title: Visualizing the Loss Landscape of Winning Lottery Tickets
Authors: Robert Bain
Comments: 7 pages, 7 figures, 1 algorithm/pseudocode
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1311]  arXiv:2112.08654 (cross-list from cs.LG) [pdf, other]
Title: Learning to Prompt for Continual Learning
Comments: Published at CVPR 2022 as a conference paper
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1312]  arXiv:2112.08723 (cross-list from cs.CL) [pdf, other]
Title: Distilled Dual-Encoder Model for Vision-Language Understanding
Comments: EMNLP 2022
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1313]  arXiv:2112.08854 (cross-list from cs.RO) [pdf, other]
Title: Multi-Camera LiDAR Inertial Extension to the Newer College Dataset
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1314]  arXiv:2112.08995 (cross-list from cs.SD) [pdf, other]
Title: Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer
Comments: Accepted to NAACL 2022. Our code is available at this https URL
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1315]  arXiv:2112.09060 (cross-list from cs.SD) [pdf, other]
Title: Towards Robust Real-time Audio-Visual Speech Enhancement
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1316]  arXiv:2112.09153 (cross-list from cs.LG) [pdf, other]
Title: An Empirical Investigation of the Role of Pre-training in Lifelong Learning
Journal-ref: Journal of Machine Learning Research 24 (2023) 1-50
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1317]  arXiv:2112.09567 (cross-list from cs.CG) [pdf, ps, other]
Title: LTB curves with Lipschitz turn are par-regular
Authors: Etienne Le Quentrec (AMU), Loïc Mazo (UNISTRA), Étienne Baudrier (UNISTRA), Mohamed Tajine (UNISTRA)
Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Discrete Mathematics (cs.DM)
[1318]  arXiv:2112.09668 (cross-list from cs.LG) [pdf, other]
Title: Deep Learning for Spatiotemporal Modeling of Urbanization
Authors: Tang Li, Jing Gao, Xi Peng
Comments: Accepted by NeurIPS 2021 MLPH (Machine Learning in Public Health) Workshop; Best Paper Award at the same workshop
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1319]  arXiv:2112.09693 (cross-list from cs.LG) [pdf, other]
Title: Generalisation effects of predictive uncertainty estimation in deep learning for digital pathology
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1320]  arXiv:2112.09726 (cross-list from cs.SD) [pdf, other]
Title: Soundify: Matching Sound Effects to Video
Comments: Full paper in UIST 2023; Short paper in NeurIPS 2021 ML4CD Workshop; Online demo: this http URL
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1321]  arXiv:2112.09741 (cross-list from cs.LG) [pdf, other]
Title: Neurashed: A Phenomenological Model for Imitating Deep Learning Training
Authors: Weijie J. Su
Comments: 8 pages
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1322]  arXiv:2112.09802 (cross-list from cs.LG) [pdf, other]
Title: Automated Domain Discovery from Multiple Sources to Improve Zero-Shot Generalization
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1323]  arXiv:2112.09808 (cross-list from math.NA) [pdf, other]
Title: Direct simple computation of middle surface between 3D point clouds and/or discrete surfaces by tracking sources in distance function calculation algorithms
Subjects: Numerical Analysis (math.NA); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[1324]  arXiv:2112.10017 (cross-list from cs.LG) [pdf, other]
Title: Continual Learning of a Mixed Sequence of Similar and Dissimilar Tasks
Journal-ref: NeurIPS 2020
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1325]  arXiv:2112.10065 (cross-list from cs.DC) [pdf, other]
Title: Efficient Strong Scaling Through Burst Parallel Training
Comments: MLSys'22
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1326]  arXiv:2112.10138 (cross-list from math.NA) [pdf, other]
Title: Anisotropic mesh adaptation for region-based segmentation accounting for image spatial information
Comments: 41 pages, 13 figures, 5 tables
Journal-ref: Computers & Mathematics with Applications, 121, 1-17 (2022)
Subjects: Numerical Analysis (math.NA); Computer Vision and Pattern Recognition (cs.CV)
[1327]  arXiv:2112.10139 (cross-list from cs.LG) [pdf, other]
Title: Denoised Labels for Financial Time-Series Data via Self-Supervised Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Statistical Finance (q-fin.ST)
[1328]  arXiv:2112.10143 (cross-list from cs.RO) [pdf, other]
Title: RoboAssembly: Learning Generalizable Furniture Assembly Policy in a Novel Multi-robot Contact-rich Simulation Environment
Comments: Submitted to IEEE International Conference on Robotics and Automation (ICRA) 2022
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[1329]  arXiv:2112.10384 (cross-list from cs.LG) [pdf, other]
Title: Multimodal Adversarially Learned Inference with Factorized Discriminators
Comments: 9 pages, 6 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1330]  arXiv:2112.10572 (cross-list from cs.LG) [pdf, other]
Title: General Greedy De-bias Learning
Comments: This work has been accepted by IEEE T-PAMI. Copyright is transferred without notice, after which this version may no longer be accessible
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1331]  arXiv:2112.10603 (cross-list from cs.MM) [pdf, other]
Title: A Multi-user Oriented Live Free-viewpoint Video Streaming System Based On View Interpolation
Comments: 10 pages, 7 figures
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1332]  arXiv:2112.10714 (cross-list from cs.LG) [pdf, other]
Title: Learning Spatio-Temporal Specifications for Dynamical Systems
Comments: 12 pages, submitted to L4DC 2021
Journal-ref: PMLR 168:968-980, 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Systems and Control (eess.SY)
[1333]  arXiv:2112.10728 (cross-list from cs.CL) [pdf, other]
Title: MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Comments: Accepted at AAAI 2022
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1334]  arXiv:2112.10961 (cross-list from cs.IT) [pdf, other]
Title: Nonlinear Transform Source-Channel Coding for Semantic Communications
Comments: published in IEEE JSAC
Subjects: Information Theory (cs.IT); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1335]  arXiv:2112.10985 (cross-list from cs.LG) [pdf, other]
Title: Learned ISTA with Error-based Thresholding for Adaptive Sparse Coding
Comments: Accepted at ICASSP 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1336]  arXiv:2112.11018 (cross-list from cs.LG) [pdf, other]
Title: A Theoretical View of Linear Backpropagation and Its Convergence
Comments: This paper is accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1337]  arXiv:2112.11041 (cross-list from cs.LG) [pdf, other]
Title: Geometry-Aware Unsupervised Domain Adaptation
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1338]  arXiv:2112.11312 (cross-list from cs.LG) [pdf, other]
Title: Implicit Neural Video Compression
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1339]  arXiv:2112.11330 (cross-list from cs.LG) [pdf, ps, other]
Title: PrimSeq: a deep learning-based pipeline to quantitate rehabilitation training
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1340]  arXiv:2112.11447 (cross-list from cs.AI) [pdf, other]
Title: Multi-Modality Distillation via Learning the teacher's modality-level Gram Matrix
Authors: Peng Liu
Comments: 10 pages
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1341]  arXiv:2112.11450 (cross-list from cs.LG) [pdf, other]
Title: Max-Margin Contrastive Learning
Comments: Accepted at AAAI 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1342]  arXiv:2112.11743 (cross-list from cs.LG) [pdf, other]
Title: Simple and Effective Balance of Contrastive Losses
Comments: 15 pages, 10 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1343]  arXiv:2112.11850 (cross-list from cs.CL) [pdf, ps, other]
Title: Multimodal Analysis of memes for sentiment extraction
Comments: 5 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1344]  arXiv:2112.12078 (cross-list from cs.LG) [pdf, ps, other]
Title: Deeper Learning with CoLU Activation
Authors: Advait Vagerwal
Comments: 7 pages, 4 figures, 4 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1345]  arXiv:2112.12272 (cross-list from cs.LG) [pdf, ps, other]
Title: Human Activity Recognition on wrist-worn accelerometers using self-supervised neural networks
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1346]  arXiv:2112.12371 (cross-list from cs.LG) [pdf, other]
Title: DENSE: Data-Free One-Shot Federated Learning
Comments: Accepted by NeurIPS 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1347]  arXiv:2112.12431 (cross-list from cs.LG) [pdf, other]
Title: Adaptive Modeling Against Adversarial Attacks
Comments: 10 pages, 3 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1348]  arXiv:2112.12510 (cross-list from cs.NE) [pdf, other]
Title: Neuroevolution deep learning architecture search for estimation of river surface elevation from photogrammetric Digital Surface Models
Comments: extended version of NeurIPS 2021 Workshop paper - ML4PhysicalSciences
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1349]  arXiv:2112.12533 (cross-list from cs.LG) [pdf, other]
Title: PyCIL: A Python Toolbox for Class-Incremental Learning
Comments: Accepted to SCIENCE CHINA Information Sciences. Code is available at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1350]  arXiv:2112.12596 (cross-list from cs.HC) [pdf, other]
Title: Explainable Medical Imaging AI Needs Human-Centered Design: Guidelines and Evidence from a Systematic Review
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1351]  arXiv:2112.12612 (cross-list from cs.RO) [pdf, other]
Title: Towards Disturbance-Free Visual Mobile Manipulation
Comments: WACV 2023
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1352]  arXiv:2112.12984 (cross-list from cs.RO) [pdf, ps, other]
Title: Doppler velocity-based algorithm for Clustering and Velocity Estimation of moving objects
Comments: 7 pages, 9 figures, 2 tables, 2 algorithms, CACRE2022
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1353]  arXiv:2112.13064 (cross-list from cs.CR) [src]
Title: CatchBackdoor: Backdoor Testing by Critical Trojan Neural Path Identification via Differential Fuzzing
Comments: There are some problems in the experiment so we need to withdraw this paper. We will upload the new version after revision
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1354]  arXiv:2112.13121 (cross-list from cs.LG) [src]
Title: The Curse of Zero Task Diversity: On the Failure of Transfer Learning to Outperform MAML and their Empirical Equivalence
Comments: An updated and corrected version is at arXiv:2208.01545, and its accompanying NeurIPS submission is at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1355]  arXiv:2112.13137 (cross-list from cs.LG) [pdf, other]
Title: Does MAML Only Work via Feature Re-use? A Data Centric Perspective
Comments: 15 pages, 12 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1356]  arXiv:2112.13149 (cross-list from cs.AR) [pdf, other]
Title: Fast and Scalable Computation of the Forward and Inverse Discrete Periodic Radon Transform
Comments: This paper has been published as follows: C. Carranza, D. Llamocca, and M. Pattichis. "Fast and scalable computation of the forward and inverse discrete periodic radon transform", IEEE Transactions on Image Processing, 25(1):119-133, Jan 2016
Journal-ref: IEEE Transactions on Image Processing, 25(1):119-133, Jan 2016
Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[1357]  arXiv:2112.13150 (cross-list from cs.AR) [pdf, other]
Title: Fast 2D Convolutions and Cross-Correlations Using Scalable Architectures
Comments: The paper develops the fastest known methods for computing 2D convolutions in hardware
Journal-ref: IEEE Transactions on Image Processing 26.5 (2017): 2230-2245
Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[1358]  arXiv:2112.13243 (cross-list from cs.NE) [pdf, other]
Title: Evolutionary Generation of Visual Motion Illusions
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1359]  arXiv:2112.13372 (cross-list from cs.CL) [pdf, ps, other]
Title: Delivery Issues Identification from Customer Feedback Data
Comments: Accepted to MLDS 2022; to be published in the Lattice journal
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1360]  arXiv:2112.13659 (cross-list from cs.RO) [pdf, ps, other]
Title: M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots
Comments: accepted by IEEE RA-L
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1361]  arXiv:2112.13910 (cross-list from cs.CL) [pdf, other]
Title: Visual Persuasion in COVID-19 Social Media Content: A Multi-Modal Characterization
Comments: 10 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1362]  arXiv:2112.13939 (cross-list from cs.LG) [pdf, other]
Title: SPIDER: Searching Personalized Neural Architecture for Federated Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1363]  arXiv:2112.13974 (cross-list from cs.LG) [pdf, other]
Title: A Moment in the Sun: Solar Nowcasting from Multispectral Satellite Data using Self-Supervised Learning
Comments: 18 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1364]  arXiv:2112.14006 (cross-list from cs.NI) [pdf, other]
Title: Multi-Band Wi-Fi Sensing with Matched Feature Granularity
Comments: 12 pages, 14 figures
Subjects: Networking and Internet Architecture (cs.NI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Signal Processing (eess.SP)
[1365]  arXiv:2112.14021 (cross-list from cs.SI) [pdf, other]
Title: Multilayer Graph Contrastive Clustering Network
Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1366]  arXiv:2112.14061 (cross-list from cs.LG) [pdf, other]
Title: Investigating Shifts in GAN Output-Distributions
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1367]  arXiv:2112.14232 (cross-list from cs.LG) [pdf, other]
Title: Constrained Gradient Descent: A Powerful and Principled Evasion Attack Against Neural Networks
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1368]  arXiv:2112.14299 (cross-list from cs.LG) [pdf, other]
Title: DeepAdversaries: Examining the Robustness of Deep Learning Models for Galaxy Morphology Classification
Comments: 20 pages, 6 figures, 5 tables; accepted in MLST
Subjects: Machine Learning (cs.LG); Astrophysics of Galaxies (astro-ph.GA); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1369]  arXiv:2112.14337 (cross-list from cs.LG) [pdf, other]
Title: Closer Look at the Transferability of Adversarial Examples: How They Fool Different Models Differently
Comments: 25 pages, 13 figures, Accepted at the IEEE Winter Conference on Applications of Computer Vision (WACV) 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1370]  arXiv:2112.14437 (cross-list from cs.CR) [pdf, other]
Title: A Color Image Steganography Based on Frequency Sub-band Selection
Comments: 19 pages, 17 figures
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1371]  arXiv:2112.14754 (cross-list from cs.LG) [pdf, other]
Title: Disentanglement and Generalization Under Correlation Shifts
Comments: CoLLAs 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1372]  arXiv:2112.14772 (cross-list from cs.LG) [pdf, other]
Title: Deep Graph Clustering via Dual Correlation Reduction
Comments: 9 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1373]  arXiv:2112.14889 (cross-list from cs.CR) [pdf, other]
Title: Few-shot Backdoor Defense Using Shapley Estimation
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1374]  arXiv:2112.14921 (cross-list from cs.IR) [pdf, other]
Title: Retrieving Black-box Optimal Images from External Databases
Authors: Ryoma Sato
Comments: WSDM 2022
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1375]  arXiv:2112.15068 (cross-list from cs.LG) [pdf, ps, other]
Title: Digital Rock Typing DRT Algorithm Formulation with Optimal Supervised Semantic Segmentation
Comments: 1. Acknowledgement section is updated. 2. References section is updated with one additional reference
Subjects: Machine Learning (cs.LG); Earth and Planetary Astrophysics (astro-ph.EP); Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[1376]  arXiv:2112.15278 (cross-list from cs.LG) [pdf, other]
Title: Data-Free Knowledge Transfer: A Survey
Comments: 20 pages, 8 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1377]  arXiv:2112.15317 (cross-list from cs.LG) [pdf, other]
Title: SplitBrain: Hybrid Data and Model Parallel Deep Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[1378]  arXiv:2112.15320 (cross-list from cs.LG) [pdf, other]
Title: InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer
Comments: Rejected by ISMIR 2020
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD)
[1379]  arXiv:2112.15329 (cross-list from cs.LG) [pdf, other]
Title: On Distinctive Properties of Universal Perturbations
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1380]  arXiv:2112.15402 (cross-list from cs.LG) [pdf, other]
Title: Relational Experience Replay: Continual Learning by Adaptively Tuning Task-wise Relationship
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1381]  arXiv:2112.15411 (cross-list from cs.LG) [pdf, other]
Title: Disjoint Contrastive Regression Learning for Multi-Sourced Annotations
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1382]  arXiv:2112.15421 (cross-list from cs.LG) [pdf, other]
Title: Representation Learning via Consistent Assignment of Views to Clusters
Comments: Pre-print. 37th ACM/SIGAPP Symposium on Applied Computing (SAC'22). Code at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1383]  arXiv:2112.15541 (cross-list from cs.LG) [pdf, other]
Title: On the effectiveness of generative adversarial network on anomaly detection
Comments: This paper is an improved version of an existing paper published by the same authors in ICANN 2020
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1384]  arXiv:2112.15550 (cross-list from cs.LG) [pdf, other]
Title: Improving Baselines in the Wild
Comments: Presented at NeurIPS 2021 Workshop on Distribution Shifts, this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1385]  arXiv:2112.15555 (cross-list from cs.LG) [pdf, other]
Title: An Unsupervised Domain Adaptation Model based on Dual-module Adversarial Training
Comments: arXiv admin note: text overlap with arXiv:2108.00610
Journal-ref: Neurocomputing, Volume 475, 28 February 2022, Pages 102-111
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1386]  arXiv:2112.00002 (cross-list from eess.IV) [pdf, other]
Title: Recovery of Continuous 3D Refractive Index Maps from Discrete Intensity-Only Measurements using Neural Fields
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1387]  arXiv:2112.00729 (cross-list from eess.IV) [src]
Title: Total-Body Low-Dose CT Image Denoising using Prior Knowledge Transfer Technique with Contrastive Regularization Mechanism
Comments: Want to improve the methodology
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1388]  arXiv:2112.00730 (cross-list from eess.IV) [pdf, ps, other]
Title: Highly accelerated MR parametric mapping by undersampling the k-space and reducing the contrast number simultaneously with deep learning
Comments: 27 pages, 11 figures. Submitted to Magnetic Resonance in Medicine
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1389]  arXiv:2112.00735 (cross-list from eess.IV) [pdf, other]
Title: Reference-guided Pseudo-Label Generation for Medical Semantic Segmentation
Comments: 36th AAAI Conference on Artificial Intelligence 2022
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1390]  arXiv:2112.00794 (cross-list from eess.IV) [pdf, other]
Title: DFTS2: Simulating Deep Feature Transmission Over Packet Loss Channels
Comments: 6 pages, 4 figures, IEEE Conference on Visual Communications and Image Processing (VCIP) 2021
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1391]  arXiv:2112.00913 (cross-list from eess.IV) [pdf, other]
Title: CDLNet: Noise-Adaptive Convolutional Dictionary Learning Network for Blind Denoising and Demosaicing
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1392]  arXiv:2112.01137 (cross-list from eess.IV) [pdf, other]
Title: Deep Learning-Based Carotid Artery Vessel Wall Segmentation in Black-Blood MRI Using Anatomical Priors
Comments: SPIE Medical Imaging 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1393]  arXiv:2112.01320 (cross-list from eess.IV) [pdf, other]
Title: Multi-task fusion for improving mammography screening data classification
Comments: Accepted for publication in IEEE Transactions on Medical Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1394]  arXiv:2112.01533 (cross-list from eess.IV) [pdf, other]
Title: Automatic tumour segmentation in H&E-stained whole-slide images of the pancreas
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1395]  arXiv:2112.01534 (cross-list from eess.IV) [pdf, ps, other]
Title: Learning to automate cryo-electron microscopy data collection with Ptolemy
Comments: Main: 12 pages, 11 figures. Appendix: 2 pages, 1 figure
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1396]  arXiv:2112.01587 (cross-list from eess.IV) [pdf, ps, other]
Title: Improving accuracy and uncertainty quantification of deep learning based quantitative MRI using Monte Carlo dropout
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1397]  arXiv:2112.01629 (cross-list from eess.IV) [pdf, ps, other]
Title: Engineering AI Tools for Systematic and Scalable Quality Assessment in Magnetic Resonance Imaging
Comments: 6 pages, 2 figures, NeurIPS Data-Centric AI Workshop 2021 (Virtual)
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1398]  arXiv:2112.01702 (cross-list from eess.IV) [pdf, ps, other]
Title: Localized Feature Aggregation Module for Semantic Segmentation
Comments: SMC 2021
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1399]  arXiv:2112.01767 (cross-list from eess.IV) [pdf, other]
Title: MT-TransUNet: Mediating Multi-Task Tokens in Transformers for Skin Lesion Segmentation and Classification
Comments: A technical report. Code will be released
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1400]  arXiv:2112.01784 (cross-list from eess.IV) [pdf, other]
Title: Fully automatic integration of dental CBCT images and full-arch intraoral impressions with stitching error correction via individual tooth segmentation and identification
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1401]  arXiv:2112.01797 (cross-list from eess.IV) [pdf, other]
Title: Detection of Large Vessel Occlusions using Deep Learning by Deforming Vessel Tree Segmentations
Comments: 7 pages. Accepted at BVM-Workshop 2022, Springer
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1402]  arXiv:2112.01905 (cross-list from eess.IV) [pdf, other]
Title: Towards Super-Resolution CEST MRI for Visualization of Small Structures
Journal-ref: Proceedings, German Workshop on Medical Image Computing (2022) 210-215
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1403]  arXiv:2112.02101 (cross-list from eess.IV) [pdf, other]
Title: View-Consistent Metal Segmentation in the Projection Domain for Metal Artifact Reduction in CBCT -- An Investigation of Potential Improvement
Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1404]  arXiv:2112.02102 (cross-list from eess.IV) [pdf, other]
Title: Echocardiography Segmentation with Enforced Temporal Consistency
Comments: 12 pages, accepted for publication in IEEE TMI
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1405]  arXiv:2112.02164 (cross-list from eess.IV) [pdf, other]
Title: Bridging the gap between prostate radiology and pathology through machine learning
Comments: Indrani Bhattacharya and David S. Lim contributed equally as first authors. Geoffrey A. Sonn and Mirabela Rusu contributed equally as senior authors
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1406]  arXiv:2112.02222 (cross-list from eess.IV) [pdf, other]
Title: Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides
Comments: Update Table 1 and corresponding descriptions
Journal-ref: Frontiers in Oncology, 11(2021), 4133
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1407]  arXiv:2112.02478 (cross-list from eess.IV) [pdf, ps, other]
Title: Classification of COVID-19 on chest X-Ray images using Deep Learning model with Histogram Equalization and Lungs Segmentation
Comments: Manuscript word count: 6577; abstract word count: 238; figures: 8; tables: 10
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1408]  arXiv:2112.02508 (cross-list from eess.IV) [pdf, other]
Title: Uncertainty-Guided Mutual Consistency Learning for Semi-Supervised Medical Image Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1409]  arXiv:2112.02522 (cross-list from eess.IV) [pdf, other]
Title: Snapshot HDR Video Construction Using Coded Mask
Comments: 13 pages, 7 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1410]  arXiv:2112.02548 (cross-list from physics.flu-dyn) [pdf, other]
Title: Generative Modeling of Turbulence
Subjects: Fluid Dynamics (physics.flu-dyn); Computer Vision and Pattern Recognition (cs.CV)
[1411]  arXiv:2112.02608 (cross-list from eess.IV) [pdf, ps, other]
Title: Real-time Virtual Intraoperative CT for Image Guided Surgery
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[1412]  arXiv:2112.02743 (cross-list from eess.IV) [pdf, other]
Title: Separated Contrastive Learning for Organ-at-Risk and Gross-Tumor-Volume Segmentation with Limited Annotation
Comments: Accepted in AAAI-22 (Oral)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1413]  arXiv:2112.02858 (cross-list from eess.IV) [pdf, ps, other]
Title: A comparison study of CNN denoisers on PRNU extraction
Comments: 12 pages, 6 figures, 4 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1414]  arXiv:2112.02896 (cross-list from eess.IV) [pdf, other]
Title: Tunable Image Quality Control of 3-D Ultrasound using Switchable CycleGAN
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1415]  arXiv:2112.03053 (cross-list from eess.IV) [pdf, other]
Title: Fast 3D registration with accurate optimisation and little learning for Learn2Reg 2021
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1416]  arXiv:2112.03259 (cross-list from q-bio.QM) [pdf, ps, other]
Title: Novel Local Radiomic Bayesian Classifiers for Non-Invasive Prediction of MGMT Methylation Status in Glioblastoma
Authors: Mihir Rao
Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1417]  arXiv:2112.03276 (cross-list from eess.IV) [pdf, other]
Title: Organ localisation using supervised and semi supervised approaches combining reinforcement learning with imitation learning
Comments: 16 pages, 12 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1418]  arXiv:2112.03277 (cross-list from eess.IV) [pdf, ps, other]
Title: Automatic quality control framework for more reliable integration of machine learning-based image segmentation into medical workflows
Comments: 19 pages
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1419]  arXiv:2112.03380 (cross-list from eess.IV) [pdf, other]
Title: Dynamic imaging using Motion-Compensated SmooThness Regularization on Manifolds (MoCo-SToRM)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1420]  arXiv:2112.03455 (cross-list from eess.IV) [pdf, other]
Title: Hybrid guiding: A multi-resolution refinement approach for semantic segmentation of gigapixel histopathological images
Comments: 12 pages, 3 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1421]  arXiv:2112.03456 (cross-list from eess.IV) [pdf, other]
Title: RSBNet: One-Shot Neural Architecture Search for A Backbone Network in Remote Sensing Image Recognition
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1422]  arXiv:2112.03536 (cross-list from eess.IV) [pdf, other]
Title: Learning Pixel-Adaptive Weights for Portrait Photo Retouching
Comments: Technical report
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1423]  arXiv:2112.03622 (cross-list from eess.IV) [pdf, other]
Title: Evaluating Generic Auto-ML Tools for Computational Pathology
Journal-ref: Informatics in Medicine Unlocked 29 (2022) 100853
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1424]  arXiv:2112.03694 (cross-list from eess.IV) [pdf, other]
Title: Hard Sample Aware Noise Robust Learning for Histopathology Image Classification
Comments: 14 pages, 20 figures, IEEE Transactions on Medical Imaging
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[1425]  arXiv:2112.03696 (cross-list from eess.IV) [pdf, other]
Title: Noise Distribution Adaptive Self-Supervised Image Denoising using Tweedie Distribution and Score Matching
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1426]  arXiv:2112.03701 (cross-list from eess.IV) [pdf, other]
Title: Efficient joint noise removal and multi exposure fusion
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1427]  arXiv:2112.03712 (cross-list from eess.IV) [pdf, other]
Title: Image Compressed Sensing Using Non-local Neural Network
Comments: 14 pages, 11 figures, 7 tables
Journal-ref: IEEE Transactions on Multimedia, 2021
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1428]  arXiv:2112.03888 (cross-list from eess.IV) [pdf, ps, other]
Title: Image Enhancement via Bilateral Learning
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1429]  arXiv:2112.03911 (cross-list from eess.IV) [pdf, ps, other]
Title: Dyadic Sex Composition and Task Classification Using fNIRS Hyperscanning Data
Comments: 20th IEEE International Conference on Machine Learning and Applications
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1430]  arXiv:2112.03915 (cross-list from eess.IV) [pdf, other]
Title: Embedding Gradient-based Optimization in Image Registration Networks
Comments: Accepted by International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1431]  arXiv:2112.03916 (cross-list from eess.IV) [pdf, other]
Title: BT-Unet: A self-supervised learning framework for biomedical image segmentation using Barlow Twins with U-Net models
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1432]  arXiv:2112.03998 (cross-list from eess.IV) [pdf, ps, other]
Title: Nuclei Segmentation in Histopathology Images using Deep Learning with Local and Global Views
Comments: 5 pages, 5 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1433]  arXiv:2112.04121 (cross-list from eess.IV) [pdf, other]
Title: Reverse image filtering using total derivative approximation and accelerated gradient descent
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1434]  arXiv:2112.04267 (cross-list from eess.IV) [pdf, other]
Title: Implicit Neural Representations for Image Compression
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1435]  arXiv:2112.04386 (cross-list from eess.IV) [pdf, other]
Title: Which images to label for few-shot medical landmark detection?
Journal-ref: Proceedings of the Conference on Computer Vision and Pattern Recognition, 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1436]  arXiv:2112.04487 (cross-list from eess.IV) [pdf, other]
Title: Joint Global and Local Hierarchical Priors for Learned Image Compression
Comments: CVPR 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1437]  arXiv:2112.04488 (cross-list from eess.IV) [pdf, other]
Title: A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution
Comments: Accepted for publication as a regular paper in the IEEE Transactions on Multimedia
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1438]  arXiv:2112.04489 (cross-list from eess.IV) [pdf, other]
[1439]  arXiv:2112.04490 (cross-list from eess.IV) [pdf, other]
Title: A novel multi-view deep learning approach for BI-RADS and density assessment of mammograms
Comments: This paper has been accepted by the 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (2022 IEEE EMBC)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1440]  arXiv:2112.04491 (cross-list from cs.CV) [pdf, other]
Title: Improving Image Restoration by Revisiting Global Information Aggregation
Comments: ECCV 2022; fix typo
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1441]  arXiv:2112.04493 (cross-list from eess.IV) [pdf, ps, other]
Title: Binary Change Guided Hyperspectral Multiclass Change Detection
Comments: 14 pages, 17 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1442]  arXiv:2112.04495 (cross-list from eess.IV) [pdf, other]
Title: Dynamic multi feature-class Gaussian process models
Comments: 16
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1443]  arXiv:2112.04499 (cross-list from eess.IV) [pdf, other]
Title: Multiscale Softmax Cross Entropy for Fovea Localization on Color Fundus Photography
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1444]  arXiv:2112.04653 (cross-list from eess.IV) [pdf, ps, other]
Title: Extending nn-UNet for brain tumor segmentation
Comments: 12 pages, 4 figures, BraTS competition paper
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1445]  arXiv:2112.04721 (cross-list from eess.IV) [pdf, ps, other]
Title: One-dimensional Deep Low-rank and Sparse Network for Accelerated MRI
Comments: 16 pages
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1446]  arXiv:2112.04863 (cross-list from eess.IV) [pdf, other]
Title: 3D Medical Point Transformer: Introducing Convolution to Attention Networks for Medical Point Cloud Analysis
Comments: Technical Report
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1447]  arXiv:2112.04882 (cross-list from eess.IV) [pdf, other]
Title: Evaluating saliency methods on artificial data with different background types
Comments: 6 pages, 2 figures. Presented at Medical Imaging meets NeurIPS 2021 (poster presentation)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1448]  arXiv:2112.04894 (cross-list from eess.IV) [pdf, other]
Title: Semi-Supervised Medical Image Segmentation via Cross Teaching between CNN and Transformer
Comments: accepted to MIDL 2022, code in SSL4MIS: this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1449]  arXiv:2112.04984 (cross-list from eess.IV) [pdf, other]
Title: Robust Weakly Supervised Learning for COVID-19 Recognition Using Multi-Center CT Images
Comments: 32 pages, 8 figures, Applied Soft Computing
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1450]  arXiv:2112.04998 (cross-list from eess.IV) [pdf, other]
Title: Sparse-View CT Reconstruction using Recurrent Stacked Back Projection
Comments: 5 pages, 2021 Asilomar Conference on Signals, Systems, and Computers
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1451]  arXiv:2112.05074 (cross-list from math.AG) [pdf, other]
Title: Critical configurations for two projective views, a new approach
Comments: 26 pages, 4 figures, this version corrects an error appearing in the first table in the published version
Journal-ref: Journal of Symbolic Computation 120 (2024)
Subjects: Algebraic Geometry (math.AG); Computer Vision and Pattern Recognition (cs.CV)
[1452]  arXiv:2112.05146 (cross-list from eess.IV) [pdf, other]
Title: Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction
Comments: Accepted to CVPR 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1453]  arXiv:2112.05147 (cross-list from eess.IV) [pdf, other]
Title: Learning Deep Context-Sensitive Decomposition for Low-Light Image Enhancement
Comments: Accepted by IEEE TNNLS. Code is available at this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1454]  arXiv:2112.05149 (cross-list from eess.IV) [pdf, other]
Title: DiffuseMorph: Unsupervised Deformable Image Registration Using Diffusion Model
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1455]  arXiv:2112.05150 (cross-list from eess.IV) [pdf, other]
Title: Deep Recurrent Neural Network with Multi-scale Bi-directional Propagation for Video Deblurring
Comments: Accepted by AAAI-2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1456]  arXiv:2112.05151 (cross-list from eess.IV) [pdf, other]
Title: Annotation-efficient cancer detection with report-guided lesion annotation for deep learning-based prostate cancer detection in bpMRI
Journal-ref: Radiology: Artificial Intelligence (2023), e230031
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1457]  arXiv:2112.05220 (cross-list from eess.IV) [pdf, other]
Title: Hidden Path Selection Network for Semantic Segmentation of Remote Sensing Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1458]  arXiv:2112.05221 (cross-list from eess.IV) [pdf, other]
Title: MantissaCam: Learning Snapshot High-dynamic-range Imaging with Perceptually-based In-pixel Irradiance Encoding
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1459]  arXiv:2112.05303 (cross-list from eess.IV) [pdf, other]
Title: Surrogate-based cross-correlation for particle image velocimetry
Comments: 13 pages, 11 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1460]  arXiv:2112.05478 (cross-list from math.AG) [pdf, other]
Title: Critical configurations for three projective views
Comments: 40 pages, 9 figures. This is a companion paper to arXiv:2112.05074. Accepted manuscript published in Mathematica Scandinavica
Subjects: Algebraic Geometry (math.AG); Computer Vision and Pattern Recognition (cs.CV)
[1461]  arXiv:2112.05505 (cross-list from eess.IV) [pdf, other]
Title: DeepRLS: A Recurrent Network Architecture with Least Squares Implicit Layers for Non-blind Image Deconvolution
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1462]  arXiv:2112.05748 (cross-list from eess.IV) [pdf, other]
Title: Deep Learning based Framework for Automatic Diagnosis of Glaucoma based on analysis of Focal Notching in the Optic Nerve Head
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1463]  arXiv:2112.05752 (cross-list from eess.IV) [pdf, other]
Title: Specificity-Preserving Federated Learning for MR Image Reconstruction
Comments: 12 pages, 8 figures. Code: this https URL
Journal-ref: IEEE Transactions on Medical Imaging, 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1464]  arXiv:2112.05754 (cross-list from eess.IV) [pdf, other]
Title: PyTorch Connectomics: A Scalable and Flexible Segmentation Framework for EM Connectomics
Comments: Technical report
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1465]  arXiv:2112.05755 (cross-list from eess.IV) [pdf, other]
Title: Information Prebuilt Recurrent Reconstruction Network for Video Super-Resolution
Comments: 12 pages, 9 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1466]  arXiv:2112.05756 (cross-list from eess.IV) [pdf, other]
Title: Enhancing Multi-Scale Implicit Learning in Image Super-Resolution with Integrated Positional Encoding
Comments: 10 pages, 5 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1467]  arXiv:2112.05758 (cross-list from eess.IV) [pdf, other]
Title: Edge-Enhanced Dual Discriminator Generative Adversarial Network for Fast MRI with Parallel Imaging Using Multi-view Information
Comments: 33 pages, 13 figures, Applied Intelligence
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1468]  arXiv:2112.05760 (cross-list from eess.IV) [pdf, other]
Title: Learning Representations with Contrastive Self-Supervised Learning for Histopathology Applications
Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) this https URL
Journal-ref: https://www.melba-journal.org/papers/2022:023.html
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[1469]  arXiv:2112.05761 (cross-list from eess.IV) [pdf, other]
Title: Self-Supervised Transformers for fMRI representation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1470]  arXiv:2112.05794 (cross-list from eess.IV) [pdf, other]
Title: A Label Correction Algorithm Using Prior Information for Automatic and Accurate Geospatial Object Recognition
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1471]  arXiv:2112.05900 (cross-list from eess.IV) [pdf, ps, other]
Title: Automated assessment of disease severity of COVID-19 using artificial intelligence with synthetic chest CT
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1472]  arXiv:2112.06031 (cross-list from eess.IV) [pdf, other]
Title: Unsupervised Image to Image Translation for Multiple Retinal Pathology Synthesis in Optical Coherence Tomography Scans
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1473]  arXiv:2112.06149 (cross-list from eess.IV) [src]
Title: Two New Stenosis Detection Methods of Coronary Angiograms
Comments: This paper was submitted due to an operational error. It is a modified version of the original paper Two New Stenoses Detection Methods of Coronary Angiograms (arXiv:2108.01516), and the revision will be posted later as an update to the original paper
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1474]  arXiv:2112.06194 (cross-list from eess.IV) [pdf, ps, other]
Title: Improving Performance of Federated Learning based Medical Image Analysis in Non-IID Settings using Image Augmentation
Journal-ref: IEEE 14th International Conference on Information Security and Cryptology, 2021, pp. 69-74
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1475]  arXiv:2112.06226 (cross-list from eess.IV) [pdf, other]
Title: Attention based Broadly Self-guided Network for Low light Image Enhancement
Comments: 10 pages, 8 figures, 4 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1476]  arXiv:2112.06334 (cross-list from eess.IV) [pdf, other]
Title: DPICT: Deep Progressive Image Compression Using Trit-Planes
Comments: Accepted to CVPR 2022 (Oral presentation)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1477]  arXiv:2112.06417 (cross-list from eess.IV) [pdf, other]
Title: LC-FDNet: Learned Lossless Image Compression with Frequency Decomposition Network
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1478]  arXiv:2112.06476 (cross-list from eess.IV) [pdf, other]
Title: gACSON software for automated segmentation and morphology analyses of myelinated axons in 3D electron microscopy
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1479]  arXiv:2112.06693 (cross-list from eess.IV) [pdf, other]
Title: Hypernet-Ensemble Learning of Segmentation Probability for Medical Image Segmentation with Ambiguous Labels
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1480]  arXiv:2112.06759 (cross-list from eess.IV) [pdf, ps, other]
Title: Hformer: Hybrid CNN-Transformer for Fringe Order Prediction in Phase Unwrapping of Fringe Projection
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1481]  arXiv:2112.06979 (cross-list from eess.IV) [pdf, other]
[1482]  arXiv:2112.07102 (cross-list from eess.IV) [pdf, other]
Title: COVID-19 Pneumonia and Influenza Pneumonia Detection Using Convolutional Neural Networks
Comments: for associated Azure ML notebook code, see this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1483]  arXiv:2112.07415 (cross-list from eess.IV) [pdf, ps, other]
Title: Stochastic Planner-Actor-Critic for Unsupervised Deformable Image Registration
Comments: Accepted by AAAI 2022
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1484]  arXiv:2112.07529 (cross-list from eess.IV) [pdf, ps, other]
Title: Improving COVID-19 CXR Detection with Synthetic Data Augmentation
Comments: This paper has been accepted at the Upper-Rhine Artificial Intelligence Symposium 2021 arXiv:2112.05657
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1485]  arXiv:2112.07555 (cross-list from eess.IV) [pdf, other]
Title: Classification of histopathology images using ConvNets to detect Lupus Nephritis
Comments: Accepted in the 2021 Medical Imaging meets NeurIPS Workshop
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Tissues and Organs (q-bio.TO)
[1486]  arXiv:2112.08232 (cross-list from eess.IV) [pdf, ps, other]
Title: RA V-Net: Deep learning network for automated liver segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1487]  arXiv:2112.08644 (cross-list from eess.IV) [pdf, ps, other]
Title: A comparative study of paired versus unpaired deep learning methods for physically enhancing digital rock image resolution
Comments: 26 pages, 11 figures, 4 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1488]  arXiv:2112.08767 (cross-list from eess.IV) [pdf, other]
Title: Adaptation and Attention for Neural Video Coding
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1489]  arXiv:2112.08837 (cross-list from eess.IV) [pdf, ps, other]
Title: Improving Unsupervised Stain-To-Stain Translation using Self-Supervision and Meta-Learning
Comments: Accepted for Journal of Pathology Informatics (JPI), 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1490]  arXiv:2112.08851 (cross-list from stat.ML) [pdf, other]
Title: Classification Under Ambiguity: When Is Average-K Better Than Top-K?
Comments: 53 pages, 21 figures
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1491]  arXiv:2112.08968 (cross-list from eess.IV) [pdf, ps, other]
Title: Automated segmentation of 3-D body composition on computed tomography
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1492]  arXiv:2112.08974 (cross-list from eess.IV) [pdf, other]
Title: Quality monitoring of federated Covid-19 lesion segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1493]  arXiv:2112.09020 (cross-list from physics.data-an) [pdf, ps, other]
Title: Classification of diffraction patterns using a convolutional neural network in single particle imaging experiments performed at X-ray free-electron lasers
Comments: Main text: 28 pages, 7 figures, Supporting Information: 12 pages, 6 figures
Subjects: Data Analysis, Statistics and Probability (physics.data-an); Computer Vision and Pattern Recognition (cs.CV); Biological Physics (physics.bio-ph)
[1494]  arXiv:2112.09135 (cross-list from eess.IV) [pdf, other]
Title: ASC-Net: Unsupervised Medical Anomaly Segmentation Using an Adversarial-based Selective Cutting Network
Comments: Currently in submission to Medical Image Analysis Journal. Extension of DOI - 10.1007/978-3-030-87240-3_23 with more details, experiments, and in-depth analysis. arXiv admin note: substantial text overlap with arXiv:2103.03664
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1495]  arXiv:2112.09177 (cross-list from eess.IV) [pdf, other]
Title: Coherence Learning using Keypoint-based Pooling Network for Accurately Assessing Radiographic Knee Osteoarthritis
Comments: extension of RSNA 2020 report "Consistent and Coherent Computer-Aided Knee Osteoarthritis Assessment from Plain Radiographs"
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1496]  arXiv:2112.09216 (cross-list from eess.IV) [pdf, other]
Title: A Deep-Learning Framework for Improving COVID-19 CT Image Quality and Diagnostic Accuracy
Comments: 10 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1497]  arXiv:2112.09254 (cross-list from eess.IV) [pdf, other]
Title: A Novel Image Denoising Algorithm Using Concepts of Quantum Many-Body Theory
Comments: 24 pages, 14 figures; complements and expands arXiv:2108.13778
Journal-ref: Signal Processing, Volume 201, 2022, 108690
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1498]  arXiv:2112.09362 (cross-list from quant-ph) [pdf, other]
Title: Colloquium: Advances in automation of quantum dot devices control
Comments: 24 pages, 11 figures
Journal-ref: Rev. Mod. Phys. 95, 011006 (2023)
Subjects: Quantum Physics (quant-ph); Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1499]  arXiv:2112.09496 (cross-list from eess.IV) [pdf, ps, other]
Title: Towards Launching AI Algorithms for Cellular Pathology into Clinical & Pharmaceutical Orbits
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1500]  arXiv:2112.09529 (cross-list from eess.IV) [pdf, other]
Title: End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression
Comments: Accepted for publication in IEEE Transactions on Image Processing on 15 Dec. 2021
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1501]  arXiv:2112.09574 (cross-list from eess.IV) [pdf, ps, other]
Title: Super-resolution reconstruction of cytoskeleton image based on A-net deep learning network
Comments: The manuscript has 17 pages, 10 figures and 58 references
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1502]  arXiv:2112.09654 (cross-list from eess.IV) [pdf, other]
Title: FastSurferVINN: Building Resolution-Independence into Deep Learning Segmentation Methods -- A Solution for HighRes Brain MRI
Comments: accepted at NeuroImage
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1503]  arXiv:2112.09694 (cross-list from eess.IV) [pdf, other]
Title: Interpretable and Interactive Deep Multiple Instance Learning for Dental Caries Classification in Bitewing X-rays
Comments: 19 pages, 10 figures, Full Paper, MIDL 2022
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1504]  arXiv:2112.09760 (cross-list from eess.IV) [pdf, other]
Title: Learned Half-Quadratic Splitting Network for MR Image Reconstruction
Comments: accepted for MIDL2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1505]  arXiv:2112.09970 (cross-list from eess.IV) [pdf, ps, other]
Title: 3D Structural Analysis of the Optic Nerve Head to Robustly Discriminate Between Papilledema and Optic Disc Drusen
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1506]  arXiv:2112.10001 (cross-list from eess.IV) [pdf, other]
Title: Cross-Domain Federated Learning in Medical Imaging
Comments: Under Review for MIDL 2022
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1507]  arXiv:2112.10024 (cross-list from eess.IV) [pdf, ps, other]
Title: Supervised laser-speckle image sampling of skin tissue to detect very early stage of diabetes by its effects on skin subcellular properties
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1508]  arXiv:2112.10046 (cross-list from eess.IV) [pdf, other]
Title: A-ESRGAN: Training Real-World Blind Super-Resolution with Attention U-Net Discriminators
Comments: 6 pages, 9 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1509]  arXiv:2112.10071 (cross-list from eess.IV) [pdf, other]
Title: A New Image Codec Paradigm for Human and Machine Uses
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1510]  arXiv:2112.10074 (cross-list from eess.IV) [pdf, other]
Title: QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Scores and Benchmarking Results
Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA): this https URL
Journal-ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1511]  arXiv:2112.10184 (cross-list from eess.IV) [pdf, ps, other]
Title: A Deep Learning Based Workflow for Detection of Lung Nodules With Chest Radiograph
Authors: Yang Tai, Yu-Wen Fang (Same contribution), Fang-Yi Su, Jung-Hsien Chiang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1512]  arXiv:2112.10307 (cross-list from eess.IV) [pdf, other]
Title: Skin lesion segmentation and classification using deep learning and handcrafted features
Comments: 7 pages, 3 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1513]  arXiv:2112.10325 (cross-list from eess.IV) [pdf, other]
Title: Incremental Cross-view Mutual Distillation for Self-supervised Medical CT Synthesis
Comments: Accepted by CVPR2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1514]  arXiv:2112.10368 (cross-list from eess.IV) [pdf, other]
Title: Deep Co-supervision and Attention Fusion Strategy for Automatic COVID-19 Lung Infection Segmentation on CT Images
Journal-ref: Pattern Recognition, 2022, 124:108452
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1515]  arXiv:2112.10541 (cross-list from eess.IV) [pdf, ps, other]
Title: Implicit Neural Representation Learning for Hyperspectral Image Super-Resolution
Authors: Kaiwei Zhang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1516]  arXiv:2112.10652 (cross-list from eess.IV) [pdf, other]
Title: HyperSegNAS: Bridging One-Shot Neural Architecture Search with 3D Medical Image Segmentation using HyperNet
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1517]  arXiv:2112.10755 (cross-list from math.DS) [pdf, other]
Title: Discovering State Variables Hidden in Experimental Data
Comments: Project website with code, data, and overview video is at: this https URL
Subjects: Dynamical Systems (math.DS); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY); Applied Physics (physics.app-ph)
[1518]  arXiv:2112.10775 (cross-list from eess.IV) [pdf, other]
Title: HarmoFL: Harmonizing Local and Global Drifts in Federated Learning on Heterogeneous Medical Images
Comments: Accepted at AAAI 2022
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1519]  arXiv:2112.11065 (cross-list from eess.IV) [pdf, other]
Title: Leveraging Image Complexity in Macro-Level Neural Network Design for Medical Image Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1520]  arXiv:2112.11078 (cross-list from eess.IV) [pdf, other]
Title: RC-Net: A Convolutional Neural Network for Retinal Vessel Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1521]  arXiv:2112.11381 (cross-list from eess.IV) [pdf, ps, other]
Title: A novel approach for the automated segmentation and volume quantification of cardiac fats on computed tomography
Comments: Computer methods and programs in biomedicine, 2016
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1522]  arXiv:2112.11541 (cross-list from eess.IV) [pdf, other]
Title: Teacher-Student Architecture for Mixed Supervised Lung Tumor Segmentation
Comments: 17 pages, 3 figures, 5 tables, submitted to journal
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1523]  arXiv:2112.11833 (cross-list from eess.IV) [pdf, other]
Title: Deep learning for brain metastasis detection and segmentation in longitudinal MRI data
Comments: Implementation is available to public at this https URL
Journal-ref: Medical Physics 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1524]  arXiv:2112.12021 (cross-list from eess.IV) [pdf, other]
Title: Community Detection in Medical Image Datasets: Using Wavelets and Spectral Methods
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1525]  arXiv:2112.12386 (cross-list from eess.IV) [pdf, other]
Title: KFWC: A Knowledge-Driven Deep Learning Model for Fine-grained Classification of Wet-AMD
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1526]  arXiv:2112.12560 (cross-list from eess.IV) [pdf, other]
Title: On the relationship between calibrated predictors and unbiased volume estimation
Comments: Published at MICCAI 2021
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1527]  arXiv:2112.12609 (cross-list from eess.IV) [pdf, ps, other]
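Title: Brain Age Prediction from Magnetic Resonance Images Using Convolutional Neural Networks (original title: Predição da Idade Cerebral a partir de Imagens de Ressonância Magnética utilizando Redes Neurais Convolucionais)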
Comments: 3 pages, 3 figures, in Portuguese, accepted at XVIII Congresso Brasileiro de Informática em Saúde (CBIS 2021)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1528]  arXiv:2112.12660 (cross-list from eess.IV) [pdf, other]
Title: InDuDoNet+: A Deep Unfolding Dual Domain Network for Metal Artifact Reduction in CT Images
Journal-ref: Medical Image Analysis 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1529]  arXiv:2112.12665 (cross-list from eess.IV) [pdf, other]
Title: Omni-Seg: A Single Dynamic Network for Multi-label Renal Pathology Image Segmentation using Partially Labeled Data
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1530]  arXiv:2112.12744 (cross-list from eess.IV) [pdf, ps, other]
Title: AI-based Reconstruction for Fast MRI -- A Systematic Review and Meta-analysis
Comments: 42 pages, 5 figures, Proceedings of the IEEE
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1531]  arXiv:2112.12810 (cross-list from eess.IV) [pdf, ps, other]
Title: Self-Attention Generative Adversarial Network for Iterative Reconstruction of CT Images
Comments: 16 pages, 8 figures, 5 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1532]  arXiv:2112.12839 (cross-list from q-bio.QM) [pdf, ps, other]
Title: Faster Deep Ensemble Averaging for Quantification of DNA Damage from Comet Assay Images With Uncertainty Estimates
Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1533]  arXiv:2112.13054 (cross-list from eess.IV) [pdf, other]
Title: Generalized Wasserstein Dice Loss, Test-time Augmentation, and Transformers for the BraTS 2021 challenge
Comments: BraTS 2021 challenge
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1534]  arXiv:2112.13110 (cross-list from eess.SP) [pdf, other]
Title: Ultrasound Speckle Suppression and Denoising using MRI-derived Normalizing Flow Priors
Comments: 10 pages, 8 figures
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[1535]  arXiv:2112.13191 (cross-list from eess.IV) [pdf, other]
Title: DSRGAN: Detail Prior-Assisted Perceptual Single Image Super-Resolution via Generative Adversarial Networks
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1536]  arXiv:2112.13194 (cross-list from eess.IV) [pdf, other]
Title: Network-Aware 5G Edge Computing for Object Detection: Augmenting Wearables to "See" More, Farther and Faster
Comments: Published in: IEEE Access (Volume: 10)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1537]  arXiv:2112.13227 (cross-list from eess.IV) [pdf, other]
Title: Pseudocylindrical Convolutions for Learned Omnidirectional Image Compression
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1538]  arXiv:2112.13264 (cross-list from eess.IV) [pdf, ps, other]
Title: Artifact Reduction in Fundus Imaging using Cycle Consistent Adversarial Neural Networks
Comments: 12 pages, 13 figures, draft paper
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1539]  arXiv:2112.13309 (cross-list from eess.IV) [pdf, other]
Title: Learning Cross-Scale Weighted Prediction for Efficient Neural Video Compression
Comments: Preprint. Revised after peer review
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1540]  arXiv:2112.13339 (cross-list from stat.ML) [pdf, other]
Title: Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives
Comments: Major update from 2112.13339v1. 47 pages, 24 figures
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1541]  arXiv:2112.13443 (cross-list from eess.IV) [pdf, other]
Title: Sinogram upsampling using Primal-Dual UNet for undersampled CT and radial MRI reconstruction
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1542]  arXiv:2112.13513 (cross-list from eess.IV) [pdf, ps, other]
Title: MSHT: Multi-stage Hybrid Transformer for the ROSE Image Analysis of Pancreatic Cancer
Comments: 12 pages, 10 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1543]  arXiv:2112.13553 (cross-list from eess.IV) [pdf, ps, other]
Title: Classification of Histopathology Images of Lung Cancer Using Convolutional Neural Network (CNN)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1544]  arXiv:2112.13559 (cross-list from eess.IV) [pdf, other]
Title: DAM-AL: Dilated Attention Mechanism with Attention Loss for 3D Infant Brain Image Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1545]  arXiv:2112.13595 (cross-list from eess.IV) [pdf, other]
Title: Depth estimation of endoscopy using sim-to-real transfer
Comments: 12 pages, 9 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1546]  arXiv:2112.13626 (cross-list from eess.IV) [pdf, other]
Title: Generation of Synthetic Rat Brain MRI scans with a 3D Enhanced Alpha-GAN
Authors: André Ferreira (1), Ricardo Magalhães (2), Sébastien Mériaux (2), Victor Alves (1) ((1) Centro Algoritmi, University of Minho, Braga, Portugal, (2) Université Paris-Saclay, CEA, CNRS, BAOBAB, NeuroSpin, Gif-sur-Yvette, France)
Comments: 25 pages, 10 figures, 4 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1547]  arXiv:2112.13637 (cross-list from eess.IV) [pdf, other]
Title: Self-normalized Classification of Parkinson's Disease DaTscan Images
Comments: To appear in IEEE BIBM 2021
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1548]  arXiv:2112.13686 (cross-list from eess.IV) [pdf, ps, other]
Title: Radiomic biomarker extracted from PI-RADS 3 patients support more efficient and robust prostate cancer diagnosis: a multi-center study
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1549]  arXiv:2112.13811 (cross-list from eess.IV) [pdf, other]
Title: Infant Brain Age Classification: 2D CNN Outperforms 3D CNN in Small Dataset
Comments: 8 pages, 5 figures, 3 tables. arXiv admin note: text overlap with arXiv:2010.03963
Journal-ref: SPIE 2022 Medical Imaging Conference
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1550]  arXiv:2112.13850 (cross-list from econ.GN) [pdf, ps, other]
Title: Using maps to predict economic activity
Comments: 24 pages including references and appendix, 9 figures, 1 table
Subjects: General Economics (econ.GN); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1551]  arXiv:2112.13865 (cross-list from eess.IV) [pdf, other]
Title: Astronomical Image Colorization and upscaling with Generative Adversarial Networks
Comments: 14 pages, 10 figures, 7 tables
Subjects: Image and Video Processing (eess.IV); Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1552]  arXiv:2112.13885 (cross-list from eess.IV) [pdf, other]
Title: MedShift: identifying shift data for medical dataset curation
Comments: 35 pages, 28 figures, 2 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1553]  arXiv:2112.13893 (cross-list from eess.IV) [pdf, ps, other]
Title: Non-Reference Quality Monitoring of Digital Images using Gradient Statistics and Feedforward Neural Networks
Comments: Fifth International Conference on Aerospace Science & Engineering (ICASE 2017) (ICASE Proceedings, Page No. 300-305)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1554]  arXiv:2112.14022 (cross-list from eess.IV) [pdf, other]
Title: Towards Low Light Enhancement with RAW Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1555]  arXiv:2112.14026 (cross-list from eess.IV) [pdf, ps, other]
Title: SECP-Net: SE-Connection Pyramid Network of Organ At Risk Segmentation for Nasopharyngeal Carcinoma
Authors: Zexi Huang (1), Lihua Guo (1), Xin Yang (2), Sijuan Huang (2) ((1) School of Electronic and Information Engineering, South China University of Technology, (2) Sun Yat-sen University Cancer Center)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1556]  arXiv:2112.14320 (cross-list from eess.IV) [pdf, ps, other]
Title: Brain Tumor Classification by Cascaded Multiscale Multitask Learning Framework Based on Feature Aggregation
Comments: 16 pages, 7 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1557]  arXiv:2112.14340 (cross-list from eess.IV) [pdf, other]
Title: Super-Efficient Super Resolution for Fast Adversarial Defense at the Edge
Comments: This preprint is for personal use only. The official article will appear in proceedings of Design, Automation & Test in Europe (DATE), 2022, as part of the Special Initiative on Autonomous Systems Design (ASD)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1558]  arXiv:2112.14555 (cross-list from eess.IV) [pdf, other]
Title: Onsite Non-Line-of-Sight Imaging via Online Calibrations
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1559]  arXiv:2112.14608 (cross-list from eess.IV) [pdf, other]
Title: HPRN: Holistic Prior-embedded Relation Network for Spectral Super-Resolution
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1560]  arXiv:2112.14644 (cross-list from eess.IV) [pdf, ps, other]
Title: Implementation of Convolutional Neural Network Architecture on 3D Multiparametric Magnetic Resonance Imaging for Prostate Cancer Diagnosis
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1561]  arXiv:2112.14768 (cross-list from eess.IV) [pdf, other]
Title: Video Reconstruction from a Single Motion Blurred Image using Learned Dynamic Phase Coding
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1562]  arXiv:2112.15009 (cross-list from eess.IV) [pdf, ps, other]
Title: Knowledge Matters: Radiology Report Generation with General and Specific Knowledge
Comments: Medical Image Analysis
Subjects: Image and Video Processing (eess.IV); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1563]  arXiv:2112.15011 (cross-list from eess.IV) [pdf, other]
Title: Radiology Report Generation with a Learned Knowledge Base and Multi-modal Alignment
Subjects: Image and Video Processing (eess.IV); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1564]  arXiv:2112.15106 (cross-list from eess.IV) [pdf, other]
Title: Colour alignment for relative colour constancy via non-standard references
Comments: 14 pages, 8 figures, 2 tables, accepted by IEEE Transactions on Image Processing
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1565]  arXiv:2112.15180 (cross-list from eess.IV) [pdf, other]
Title: A Resolution Enhancement Plug-in for Deformable Registration of Medical Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1566]  arXiv:2112.15299 (cross-list from eess.IV) [pdf, other]
Title: CSformer: Bridging Convolution and Transformer for Compressive Sensing
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1567]  arXiv:2112.15362 (cross-list from eess.IV) [pdf, other]
Title: Modeling Mask Uncertainty in Hyperspectral Image Reconstruction
Comments: ECCV 2022 Oral Paper
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1568]  arXiv:2112.15367 (cross-list from eess.IV) [pdf, other]
Title: Weakly Supervised Change Detection Using Guided Anisotropic Diffusion
Comments: Machine Learning Journal 2021. arXiv admin note: substantial text overlap with arXiv:1904.08208
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1569]  arXiv:2112.15386 (cross-list from eess.IV) [pdf, other]
Title: Efficient Single Image Super-Resolution Using Dual Path Connections with Multiple Scale Learning
Comments: 21 pages, 9 figures, 5 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1570]  arXiv:2112.15523 (cross-list from eess.IV) [pdf, ps, other]
Title: Transfer learning for cancer diagnosis in histopathological images
Journal-ref: IAES International Journal of Artificial Intelligence (IJ-AI), Vol. 11, No. 1, March 2022, pp. 129~136
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)