We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 311

[ total of 425 entries: 1-1000 | 312-425 ]
[ showing up to 1000 entries per page: fewer | more ]

Tue, 14 May 2024 (continued, showing last 54 of 142 entries)

[312]  arXiv:2405.06916 [pdf, other]
Title: High-order Neighborhoods Know More: HyperGraph Learning Meets Source-free Unsupervised Domain Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[313]  arXiv:2405.06914 [pdf, other]
Title: Non-confusing Generation of Customized Concepts in Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[314]  arXiv:2405.06911 [pdf, other]
Title: Replication Study and Benchmarking of Real-Time Object Detection Models
Comments: Authors are presented in alphabetical order, each having equal contribution to the work. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[315]  arXiv:2405.06903 [pdf, other]
Title: UniGarmentManip: A Unified Framework for Category-Level Garment Manipulation via Dense Visual Correspondence
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[316]  arXiv:2405.06893 [pdf, other]
Title: ADLDA: A Method to Reduce the Harm of Data Distribution Shift in Data Augmentation
Authors: Haonan Wang
Comments: 8 page 4 fig
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[317]  arXiv:2405.06887 [pdf, other]
Title: FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment
Comments: Accepted by CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[318]  arXiv:2405.06875 [pdf, other]
Title: LogicAL: Towards logical anomaly synthesis for unsupervised anomaly localization
Authors: Ying Zhao
Comments: Accepted to Visual Anomaly and Novelty Detection (VAND) 2.0 Workshop at CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319]  arXiv:2405.06872 [pdf, other]
Title: eCAR: edge-assisted Collaborative Augmented Reality Framework
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[320]  arXiv:2405.06865 [pdf, other]
Title: Disrupting Style Mimicry Attacks on Video Imagery
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[321]  arXiv:2405.06849 [pdf, other]
Title: GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs
Comments: Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[322]  arXiv:2405.06845 [pdf, other]
Title: CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras
Comments: Accepted to the 18th IEEE International Conference on Automatic Face and Gesture Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[323]  arXiv:2405.06841 [pdf, other]
Title: Bridging the Gap: Protocol Towards Fair and Consistent Affect Analysis
Comments: accepted at IEEE FG 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[324]  arXiv:2405.06828 [pdf, other]
Title: G-FARS: Gradient-Field-based Auto-Regressive Sampling for 3D Part Grouping
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[325]  arXiv:2405.06821 [pdf, other]
Title: Synchronized Object Detection for Autonomous Sorting, Mapping, and Quantification of Medical Materials
Comments: To be submitted
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[326]  arXiv:2405.06814 [pdf, other]
Title: Dual-Task Vision Transformer for Rapid and Accurate Intracerebral Hemorrhage Classification on CT Images
Comments: 9 pages, 4 figure3
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[327]  arXiv:2405.06782 [pdf, other]
Title: GraphRelate3D: Context-Dependent 3D Object Detection with Inter-Object Relationship Graphs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[328]  arXiv:2405.06778 [pdf, other]
Title: Shape Conditioned Human Motion Generation with Diffusion Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[329]  arXiv:2405.06765 [pdf, other]
Title: Common Corruptions for Enhancing and Evaluating Robustness in Air-to-Air Visual Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[330]  arXiv:2405.06749 [pdf, other]
Title: Ensuring UAV Safety: A Vision-only and Real-time Framework for Collision Avoidance Through Object Detection, Tracking, and Distance Estimation
Comments: accepted at ICUAS 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[331]  arXiv:2405.07991 (cross-list from cs.RO) [pdf, other]
Title: SPIN: Simultaneous Perception, Interaction and Navigation
Comments: In CVPR 2024. Website at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[332]  arXiv:2405.07990 (cross-list from cs.CL) [pdf, other]
Title: Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[333]  arXiv:2405.07987 (cross-list from cs.LG) [pdf, other]
Title: The Platonic Representation Hypothesis
Comments: Equal contributions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[334]  arXiv:2405.07930 (cross-list from cs.MM) [pdf, other]
Title: Improving Multimodal Learning with Multi-Loss Gradient Modulation
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[335]  arXiv:2405.07905 (cross-list from eess.IV) [pdf, other]
[336]  arXiv:2405.07869 (cross-list from eess.IV) [pdf, other]
Title: Enhancing Clinically Significant Prostate Cancer Prediction in T2-weighted Images through Transfer Learning from Breast Cancer
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[337]  arXiv:2405.07861 (cross-list from eess.IV) [pdf, other]
Title: Improving Breast Cancer Grade Prediction with Multiparametric MRI Created Using Optimized Synthetic Correlated Diffusion Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[338]  arXiv:2405.07854 (cross-list from eess.IV) [pdf, other]
Title: Using Multiparametric MRI with Optimized Synthetic Correlated Diffusion Imaging to Enhance Breast Cancer Pathologic Complete Response Prediction
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[339]  arXiv:2405.07842 (cross-list from astro-ph.IM) [pdf, other]
Title: Ground-based Image Deconvolution with Swin Transformer UNet
Comments: 11 pages, 14 figures
Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV)
[340]  arXiv:2405.07827 (cross-list from cs.MM) [pdf, other]
Title: Automatic Recognition of Food Ingestion Environment from the AIM-2 Wearable Sensor
Comments: Accepted at CVPRw 2024
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[341]  arXiv:2405.07813 (cross-list from cs.LG) [pdf, other]
Title: Localizing Task Information for Improved Model Merging and Compression
Comments: Accepted ICML 2024; The first two authors contributed equally to this work; Project website: this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[342]  arXiv:2405.07780 (cross-list from cs.LG) [pdf, other]
Title: Harnessing Hierarchical Label Distribution Variations in Test Agnostic Long-tail Recognition
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[343]  arXiv:2405.07762 (cross-list from eess.IV) [pdf, other]
Title: A method for supervoxel-wise association studies of age and other non-imaging variables from coronary computed tomography angiograms
Comments: 34 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[344]  arXiv:2405.07674 (cross-list from eess.IV) [pdf, other]
Title: CoVScreen: Pitfalls and recommendations for screening COVID-19 using Chest X-rays
Authors: Sonit Singh
Comments: 21 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[345]  arXiv:2405.07606 (cross-list from cs.HC) [pdf, other]
Title: AIris: An AI-powered Wearable Assistive Device for the Visually Impaired
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
[346]  arXiv:2405.07544 (cross-list from cs.RO) [pdf, other]
Title: Automatic Odometry-Less OpenDRIVE Generation From Sparse Point Clouds
Comments: 8 pages, 4 figures, 3 algorithms, 2 tables
Journal-ref: 2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[347]  arXiv:2405.07489 (cross-list from cs.LG) [pdf, other]
Title: Sparse Domain Transfer via Elastic Net Regularization
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[348]  arXiv:2405.07392 (cross-list from cs.RO) [pdf, other]
Title: NGD-SLAM: Towards Real-Time SLAM for Dynamic Environments without GPU
Authors: Yuhao Zhang
Comments: 12 pages, 5 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[349]  arXiv:2405.07338 (cross-list from eess.IV) [pdf, other]
Title: Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[350]  arXiv:2405.07309 (cross-list from cs.RO) [pdf, other]
Title: DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[351]  arXiv:2405.07283 (cross-list from cs.RO) [pdf, other]
Title: BeautyMap: Binary-Encoded Adaptable Ground Matrix for Dynamic Points Removal in Global Maps
Comments: The first two authors are co-first authors. 8 pages, accepted by RA-L
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[352]  arXiv:2405.07256 (cross-list from eess.IV) [pdf, other]
Title: Leveraging Fixed and Dynamic Pseudo-labels for Semi-supervised Medical Image Segmentation
Comments: Under Review
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[353]  arXiv:2405.07145 (cross-list from cs.CR) [pdf, other]
Title: Stable Signature is Unstable: Removing Image Watermark from Diffusion Models
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[354]  arXiv:2405.07041 (cross-list from cs.RO) [pdf, other]
Title: Multi-agent Traffic Prediction via Denoised Endpoint Distribution
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[355]  arXiv:2405.07033 (cross-list from cs.NI) [pdf, ps, other]
Title: A Performance Analysis Modeling Framework for Extended Reality Applications in Edge-Assisted Wireless Networks
Comments: 12 pages, 4 figures; To appear in Proceedings of IEEE International Conference on Distributed Computing Systems (ICDCS), 2024
Subjects: Networking and Internet Architecture (cs.NI); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Image and Video Processing (eess.IV)
[356]  arXiv:2405.07023 (cross-list from eess.IV) [pdf, other]
Title: Efficient Real-world Image Super-Resolution Via Adaptive Directional Gradient Convolution
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[357]  arXiv:2405.07001 (cross-list from cs.CL) [pdf, other]
Title: Evaluating Task-based Effectiveness of MLLMs on Charts
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[358]  arXiv:2405.06995 (cross-list from cs.SD) [pdf, other]
Title: Benchmarking Cross-Domain Audio-Visual Deception Detection
Comments: 10 pages
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[359]  arXiv:2405.06880 (cross-list from eess.IV) [pdf, other]
Title: EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image Segmentation
Comments: 14 pages, 5 figures, 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[360]  arXiv:2405.06859 (cross-list from cs.LG) [pdf, other]
Title: Reimplementation of Learning to Reweight Examples for Robust Deep Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[361]  arXiv:2405.06855 (cross-list from cs.LG) [pdf, other]
Title: Linear Explanations for Individual Neurons
Comments: Published in ICML 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[362]  arXiv:2405.06789 (cross-list from eess.IV) [pdf, other]
Title: Self-Consistent Recursive Diffusion Bridge for Medical Image Translation
Comments: 11 pages, 6 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[363]  arXiv:2405.06786 (cross-list from eess.IV) [pdf, other]
Title: SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[364]  arXiv:2405.06702 (cross-list from cs.CL) [pdf, other]
Title: Malayalam Sign Language Identification using Finetuned YOLOv8 and Computer Vision Techniques
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[365]  arXiv:2405.06646 (cross-list from cs.GR) [pdf, other]
Title: On-the-fly Learning to Transfer Motion Style with Diffusion Models: A Semantic Guidance Approach
Comments: 23 pages
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)

Mon, 13 May 2024

[366]  arXiv:2405.06636 [pdf, other]
Title: Federated Document Visual Question Answering: A Pilot Study
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[367]  arXiv:2405.06634 [pdf, other]
Title: Multimodal LLMs Struggle with Basic Visual Network Analysis: a VNA Benchmark
Comments: 11 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[368]  arXiv:2405.06600 [pdf, other]
Title: Multi-Object Tracking in the Dark
Comments: Accepted by CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[369]  arXiv:2405.06598 [pdf, other]
Title: A Lightweight Transformer for Remote Sensing Image Change Captioning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[370]  arXiv:2405.06593 [pdf, other]
Title: Non-Uniform Spatial Alignment Errors in sUAS Imagery From Wide-Area Disasters
Comments: 6 pages, 5 figures, 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[371]  arXiv:2405.06586 [pdf, other]
Title: Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[372]  arXiv:2405.06574 [pdf, other]
Title: Deep video representation learning: a survey
Comments: Multimedia Tools and Applications (2023) 1-31
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[373]  arXiv:2405.06547 [pdf, other]
Title: OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation
Authors: Jinwei Lin
Comments: 24 pages, 13 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[374]  arXiv:2405.06536 [pdf, other]
Title: Mesh Denoising Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[375]  arXiv:2405.06535 [pdf, other]
Title: Controllable Image Generation With Composed Parallel Token Prediction
Comments: 9 pages, 6 figures, non-anonymised pre-print for NeurIPS 2024 main conference. arXiv admin note: text overlap with arXiv:2402.04550, arXiv:2404.13788, arXiv:2403.06098, arXiv:2401.16025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[376]  arXiv:2405.06525 [pdf, other]
Title: Semantic and Spatial Adaptive Pixel-level Classifier for Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[377]  arXiv:2405.06502 [pdf, other]
Title: Multi-Target Unsupervised Domain Adaptation for Semantic Segmentation without External Data
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[378]  arXiv:2405.06468 [pdf, other]
Title: Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[379]  arXiv:2405.06467 [pdf, other]
Title: Attend, Distill, Detect: Attention-aware Entropy Distillation for Anomaly Detection
Comments: 15 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[380]  arXiv:2405.06408 [pdf, other]
Title: I3DGS: Improve 3D Gaussian Splatting from Multiple Dimensions
Authors: Jinwei Lin
Comments: 16 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[381]  arXiv:2405.06389 [pdf, other]
Title: Continual Novel Class Discovery via Feature Enhancement and Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[382]  arXiv:2405.06383 [pdf, other]
Title: How to Augment for Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models?
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[383]  arXiv:2405.06354 [pdf, other]
Title: KeepOriginalAugment: Single Image-based Better Information-Preserving Data Augmentation Approach
Comments: This paper has been accepted at 20th International Conference on Artificial Intelligence Applications and Innovations 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[384]  arXiv:2405.06345 [pdf, other]
Title: Evaluating Adversarial Robustness in the Spatial Frequency Domain
Comments: 14 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[385]  arXiv:2405.06342 [pdf, other]
Title: Compression-Realized Deep Structural Network for Video Quality Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[386]  arXiv:2405.06340 [pdf, other]
Title: Improving Transferable Targeted Adversarial Attack via Normalized Logit Calibration and Truncated Feature Mixing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[387]  arXiv:2405.06323 [pdf, other]
Title: Open Access Battle Damage Detection via Pixel-Wise T-Test on Sentinel-1 Imagery
Authors: Ollie Ballinger
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[388]  arXiv:2405.06319 [pdf, other]
Title: Decoding Emotions in Abstract Art: Cognitive Plausibility of CLIP in Recognizing Color-Emotion Associations
Comments: To appear in the Proceedings of the Annual Meeting of the Cognitive Science Society 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[389]  arXiv:2405.06288 [pdf, other]
Title: PCLMix: Weakly Supervised Medical Image Segmentation via Pixel-Level Contrastive Learning and Dynamic Mix Augmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[390]  arXiv:2405.06283 [pdf, other]
Title: Novel Class Discovery for Ultra-Fine-Grained Visual Categorization
Comments: 10 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[391]  arXiv:2405.06279 [pdf, other]
Title: Benchmarking Classical and Learning-Based Multibeam Point Cloud Registration
Comments: Accepted at ICRA 2024 (IEEE International Conference on Robotics and Automation 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[392]  arXiv:2405.06278 [pdf, other]
Title: Exploring the Interplay of Interpretability and Robustness in Deep Neural Networks: A Saliency-guided Approach
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[393]  arXiv:2405.06277 [pdf, other]
Title: Learning A Spiking Neural Network for Efficient Image Deraining
Comments: Accepted by IJCAI2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[394]  arXiv:2405.06264 [pdf, other]
Title: Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection
Comments: Accepted by AAAI-24
Journal-ref: AAAI 2024, 38, 11936-11943
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[395]  arXiv:2405.06260 [pdf, other]
Title: Precise Apple Detection and Localization in Orchards using YOLOv5 for Robotic Harvesting Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[396]  arXiv:2405.06246 [pdf, ps, other]
Title: Comparative Analysis of Advanced Feature Matching Algorithms in Challenging High Spatial Resolution Optical Satellite Stereo Scenarios
Comments: The manuscript is accepted as Oral Presentation in IEEE International Geoscience and Remote Sensing Symposium(IGARSS 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[397]  arXiv:2405.06241 [pdf, other]
Title: MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[398]  arXiv:2405.06228 [pdf, other]
Title: Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399]  arXiv:2405.06227 [pdf, other]
Title: MaskMatch: Boosting Semi-Supervised Learning Through Mask Autoencoder-Driven Feature Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[400]  arXiv:2405.06217 [pdf, other]
Title: DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding
Comments: Accepted by ICME 2024 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[401]  arXiv:2405.06216 [pdf, other]
Title: Event-based Structure-from-Orbit
Authors: Ethan Elms (1), Yasir Latif (1), Tae Ha Park (2), Tat-Jun Chin (1) ((1) The University of Adelaide, (2) Stanford University)
Comments: This work will be published in the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402]  arXiv:2405.06214 [pdf, other]
Title: Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[403]  arXiv:2405.06201 [pdf, other]
Title: PhysMLE: Generalizable and Priors-Inclusive Multi-task Remote Physiological Measurement
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[404]  arXiv:2405.06198 [pdf, ps, other]
Title: MAPL: Memory Augmentation and Pseudo-Labeling for Semi-Supervised Anomaly Detection
Authors: Junzhuo Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[405]  arXiv:2405.06196 [pdf, other]
Title: VLSM-Adapter: Finetuning Vision-Language Segmentation Efficiently with Lightweight Blocks
Comments: 12 pages, 5 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[406]  arXiv:2405.06191 [pdf, ps, other]
Title: ODC-SA Net: Orthogonal Direction Enhancement and Scale Aware Network for Polyp Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[407]  arXiv:2405.06185 [pdf, other]
Title: Zero-shot Degree of Ill-posedness Estimation for Active Small Object Change Detection
Comments: 7 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[408]  arXiv:2405.06181 [pdf, other]
Title: Residual-NeRF: Learning Residual NeRFs for Transparent Object Manipulation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[409]  arXiv:2405.06143 [pdf, other]
Title: Perceptual Crack Detection for Rendered 3D Textured Meshes
Comments: Accepted by IEEE QoMEX 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG); Multimedia (cs.MM)
[410]  arXiv:2405.06128 [pdf, other]
Title: Enhanced Multimodal Content Moderation of Children's Videos using Audiovisual Fusion
Comments: 8 pages, 3 figures, Accepted at The 37th International FLAIRS Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[411]  arXiv:2405.06116 [pdf, other]
Title: Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba
Comments: Extension Journal of TTPOINT and PEPNet
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[412]  arXiv:2405.06088 [pdf, other]
Title: A Mixture of Experts Approach to 3D Human Motion Prediction
Comments: 16 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[413]  arXiv:2405.06057 [pdf, other]
Title: UnSegGNet: Unsupervised Image Segmentation using Graph Neural Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[414]  arXiv:2405.06049 [pdf, other]
Title: BB-Patch: BlackBox Adversarial Patch-Attack using Zeroth-Order Optimization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[415]  arXiv:2405.05983 [pdf, ps, other]
Title: Real-Time Pill Identification for the Visually Impaired Using Deep Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[416]  arXiv:2405.06473 (cross-list from cs.RO) [pdf, other]
Title: Autonomous Driving with a Deep Dual-Model Solution for Steering and Braking Control
Comments: 6 pages, 2 figures, accepted for publication in Proceedings of International Conference on Smart and Sustainable Technologies (SpliTech 2024)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[417]  arXiv:2405.06463 (cross-list from eess.IV) [pdf, other]
Title: MRSegmentator: Robust Multi-Modality Segmentation of 40 Classes in MRI and CT Sequences
Comments: 13 pages, 6 figures; corrected co-author info
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[418]  arXiv:2405.06301 (cross-list from cs.LG) [pdf, ps, other]
Title: Learning from String Sequences
Comments: 10 pages, 1 figure, 4 tables, Technical Report
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[419]  arXiv:2405.06286 (cross-list from cs.RO) [pdf, ps, other]
Title: A Joint Approach Towards Data-Driven Virtual Testing for Automated Driving: The AVEAS Project
Comments: 6 pages, 5 figures, 2 tables
Journal-ref: Proceedings of the 7th International Symposium on Future Active Safety Technology toward zero traffic accidents (JSAE FAST-zero '23), 2023
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG); Systems and Control (eess.SY)
[420]  arXiv:2405.06284 (cross-list from eess.IV) [pdf, other]
Title: Modality-agnostic Domain Generalizable Medical Image Segmentation by Multi-Frequency in Multi-Scale Attention
Comments: Accepted in Computer Vision and Pattern Recognition (CVPR) 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[421]  arXiv:2405.06265 (cross-list from cs.RO) [pdf, other]
Title: Uncertainty-aware Semantic Mapping in Off-road Environments with Dempster-Shafer Theory of Evidence
Comments: Our project website can be found at this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[422]  arXiv:2405.06234 (cross-list from cs.LG) [pdf, other]
Title: TS3IM: Unveiling Structural Similarity in Time Series through Image Similarity Assessment Insights
Authors: Yuhan Liu, Ke Tu
Comments: 6 pages, 6 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[423]  arXiv:2405.06175 (cross-list from eess.IV) [pdf, other]
Title: Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[424]  arXiv:2405.06166 (cross-list from eess.IV) [pdf, other]
Title: MDNet: Multi-Decoder Network for Abdominal CT Organs Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[425]  arXiv:2405.06149 (cross-list from cs.AI) [pdf, other]
Title: DisBeaNet: A Deep Neural Network to augment Unmanned Surface Vessels for maritime situational awareness
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[ total of 425 entries: 1-1000 | 312-425 ]
[ showing up to 1000 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)