We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions

[ total of 454 entries: 1-50 | 51-100 | 101-150 | 151-200 | ... | 451-454 ]
[ showing 50 entries per page: fewer | more | all ]

Mon, 13 May 2024 (showing first 50 of 60 entries)

[1]  arXiv:2405.06636 [pdf, other]
Title: Federated Document Visual Question Answering: A Pilot Study
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[2]  arXiv:2405.06634 [pdf, other]
Title: Multimodal LLMs Struggle with Basic Visual Network Analysis: a VNA Benchmark
Comments: 11 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[3]  arXiv:2405.06600 [pdf, other]
Title: Multi-Object Tracking in the Dark
Comments: Accepted by CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[4]  arXiv:2405.06598 [pdf, other]
Title: A Lightweight Transformer for Remote Sensing Image Change Captioning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[5]  arXiv:2405.06593 [pdf, other]
Title: Non-Uniform Spatial Alignment Errors in sUAS Imagery From Wide-Area Disasters
Comments: 6 pages, 5 figures, 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[6]  arXiv:2405.06586 [pdf, other]
Title: Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[7]  arXiv:2405.06574 [pdf, other]
Title: Deep video representation learning: a survey
Comments: Multimedia Tools and Applications (2023) 1-31
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[8]  arXiv:2405.06547 [pdf, other]
Title: OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation
Authors: Jinwei Lin
Comments: 24 pages, 13 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[9]  arXiv:2405.06536 [pdf, other]
Title: Mesh Denoising Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[10]  arXiv:2405.06535 [pdf, other]
Title: Controllable Image Generation With Composed Parallel Token Prediction
Comments: 9 pages, 6 figures, non-anonymised pre-print for NeurIPS 2024 main conference. arXiv admin note: text overlap with arXiv:2402.04550, arXiv:2404.13788, arXiv:2403.06098, arXiv:2401.16025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[11]  arXiv:2405.06525 [pdf, other]
Title: Semantic and Spatial Adaptive Pixel-level Classifier for Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[12]  arXiv:2405.06502 [pdf, other]
Title: Multi-Target Unsupervised Domain Adaptation for Semantic Segmentation without External Data
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[13]  arXiv:2405.06468 [pdf, other]
Title: Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[14]  arXiv:2405.06467 [pdf, other]
Title: Attend, Distill, Detect: Attention-aware Entropy Distillation for Anomaly Detection
Comments: 15 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[15]  arXiv:2405.06408 [pdf, other]
Title: I3DGS: Improve 3D Gaussian Splatting from Multiple Dimensions
Authors: Jinwei Lin
Comments: 16 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[16]  arXiv:2405.06389 [pdf, other]
Title: Continual Novel Class Discovery via Feature Enhancement and Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[17]  arXiv:2405.06383 [pdf, other]
Title: How to Augment for Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models?
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[18]  arXiv:2405.06354 [pdf, other]
Title: KeepOriginalAugment: Single Image-based Better Information-Preserving Data Augmentation Approach
Comments: This paper has been accepted at 20th International Conference on Artificial Intelligence Applications and Innovations 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[19]  arXiv:2405.06345 [pdf, other]
Title: Evaluating Adversarial Robustness in the Spatial Frequency Domain
Comments: 14 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[20]  arXiv:2405.06342 [pdf, other]
Title: Compression-Realized Deep Structural Network for Video Quality Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[21]  arXiv:2405.06340 [pdf, other]
Title: Improving Transferable Targeted Adversarial Attack via Normalized Logit Calibration and Truncated Feature Mixing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[22]  arXiv:2405.06323 [pdf, other]
Title: Open Access Battle Damage Detection via Pixel-Wise T-Test on Sentinel-1 Imagery
Authors: Ollie Ballinger
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[23]  arXiv:2405.06319 [pdf, other]
Title: Decoding Emotions in Abstract Art: Cognitive Plausibility of CLIP in Recognizing Color-Emotion Associations
Comments: To appear in the Proceedings of the Annual Meeting of the Cognitive Science Society 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[24]  arXiv:2405.06288 [pdf, other]
Title: PCLMix: Weakly Supervised Medical Image Segmentation via Pixel-Level Contrastive Learning and Dynamic Mix Augmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[25]  arXiv:2405.06283 [pdf, other]
Title: Novel Class Discovery for Ultra-Fine-Grained Visual Categorization
Comments: 10 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[26]  arXiv:2405.06279 [pdf, other]
Title: Benchmarking Classical and Learning-Based Multibeam Point Cloud Registration
Comments: Accepted at ICRA 2024 (IEEE International Conference on Robotics and Automation 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[27]  arXiv:2405.06278 [pdf, other]
Title: Exploring the Interplay of Interpretability and Robustness in Deep Neural Networks: A Saliency-guided Approach
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[28]  arXiv:2405.06277 [pdf, other]
Title: Learning A Spiking Neural Network for Efficient Image Deraining
Comments: Accepted by IJCAI2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[29]  arXiv:2405.06264 [pdf, other]
Title: Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection
Comments: Accepted by AAAI-24
Journal-ref: Fan, Y.; Wei, X.; Gong, R.; Ma, Y.; Zhang, X.; Zhang, Q.; Liu, X. Selective Focus: Investigating Semantics Sensitivity in Post-Training Quantization for Lane Detection. AAAI 2024, 38, 11936-11943
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[30]  arXiv:2405.06260 [pdf, other]
Title: Precise Apple Detection and Localization in Orchards using YOLOv5 for Robotic Harvesting Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[31]  arXiv:2405.06246 [pdf, ps, other]
Title: Comparative Analysis of Advanced Feature Matching Algorithms in Challenging High Spatial Resolution Optical Satellite Stereo Scenarios
Comments: The manuscript is accepted as Oral Presentation in IEEE International Geoscience and Remote Sensing Symposium(IGARSS 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[32]  arXiv:2405.06241 [pdf, other]
Title: MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[33]  arXiv:2405.06228 [pdf, other]
Title: Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[34]  arXiv:2405.06227 [pdf, other]
Title: MaskMatch: Boosting Semi-Supervised Learning Through Mask Autoencoder-Driven Feature Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[35]  arXiv:2405.06217 [pdf, other]
Title: DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding
Comments: Accepted by ICME 2024 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[36]  arXiv:2405.06216 [pdf, other]
Title: Event-based Structure-from-Orbit
Authors: Ethan Elms (1), Yasir Latif (1), Tae Ha Park (2), Tat-Jun Chin (1) ((1) The University of Adelaide, (2) Stanford University)
Comments: This work will be published in the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[37]  arXiv:2405.06214 [pdf, other]
Title: Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[38]  arXiv:2405.06201 [pdf, other]
Title: PhysMLE: Generalizable and Priors-Inclusive Multi-task Remote Physiological Measurement
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[39]  arXiv:2405.06198 [pdf, ps, other]
Title: MAPL: Memory Augmentation and Pseudo-Labeling for Semi-Supervised Anomaly Detection
Authors: Junzhuo Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[40]  arXiv:2405.06196 [pdf, other]
Title: VLSM-Adapter: Finetuning Vision-Language Segmentation Efficiently with Lightweight Blocks
Comments: 12 pages, 5 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[41]  arXiv:2405.06191 [pdf, ps, other]
Title: ODC-SA Net: Orthogonal Direction Enhancement and Scale Aware Network for Polyp Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[42]  arXiv:2405.06185 [pdf, other]
Title: Zero-shot Degree of Ill-posedness Estimation for Active Small Object Change Detection
Comments: 7 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[43]  arXiv:2405.06181 [pdf, other]
Title: Residual-NeRF: Learning Residual NeRFs for Transparent Object Manipulation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[44]  arXiv:2405.06143 [pdf, other]
Title: Perceptual Crack Detection for Rendered 3D Textured Meshes
Comments: Accepted by IEEE QoMEX 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG); Multimedia (cs.MM)
[45]  arXiv:2405.06128 [pdf, other]
Title: Enhanced Multimodal Content Moderation of Children's Videos using Audiovisual Fusion
Comments: 8 pages, 3 figures, Accepted at The 37th International FLAIRS Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[46]  arXiv:2405.06116 [pdf, other]
Title: Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba
Comments: Extension Journal of TTPOINT and PEPNet
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[47]  arXiv:2405.06088 [pdf, other]
Title: A Mixture of Experts Approach to 3D Human Motion Prediction
Comments: 16 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[48]  arXiv:2405.06057 [pdf, other]
Title: UnSegGNet: Unsupervised Image Segmentation using Graph Neural Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[49]  arXiv:2405.06049 [pdf, other]
Title: BB-Patch: BlackBox Adversarial Patch-Attack using Zeroth-Order Optimization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[50]  arXiv:2405.05983 [pdf, ps, other]
Title: Real-Time Pill Identification for the Visually Impaired Using Deep Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[ total of 454 entries: 1-50 | 51-100 | 101-150 | 151-200 | ... | 451-454 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)