Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 307

[ total of 456 entries: 1-224 | 84-307 | 308-456 ]

Tue, 7 May 2024 (continued, showing last 87 of 159 entries)

[308]  arXiv:2405.02929 [pdf, other]
Title: Unified Dynamic Scanpath Predictors Outperform Individually Trained Neural Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[309]  arXiv:2405.02918 [pdf, other]
Title: MERIT: Multi-view Evidential learning for Reliable and Interpretable liver fibrosis sTaging
Comments: Submitted to Medical Image Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[310]  arXiv:2405.02917 [pdf, other]
Title: Overconfidence is Key: Verbalized Uncertainty Evaluation in Large Language and Vision-Language Models
Comments: 8 pages, with appendix. To appear in TrustNLP workshop @ NAACL 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[311]  arXiv:2405.02913 [pdf, ps, other]
Title: Fast TILs estimation in lung cancer WSIs based on semi-stochastic patch sampling
Comments: 18 pages, 7 figures, 6 appendix pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[312]  arXiv:2405.02911 [pdf, other]
Title: Multimodal Sense-Informed Prediction of 3D Human Motions
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[313]  arXiv:2405.02906 [pdf, other]
Title: SalFAU-Net: Saliency Fusion Attention U-Net for Salient Object Detection
Comments: 9 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[314]  arXiv:2405.02882 [pdf, other]
Title: A drone detector with modified backbone and multiple pyramid featuremaps enhancement structure (MDDPE)
Authors: Chenhao Wu
Comments: 20 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[315]  arXiv:2405.02880 [pdf, other]
Title: Blending Distributed NeRFs with Tri-stage Robust Pose Optimization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[316]  arXiv:2405.02859 [pdf, other]
Title: MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior
Comments: 14 pages, 10 figures, conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[317]  arXiv:2405.02844 [pdf, other]
Title: SMCD: High Realism Motion Style Transfer via Mamba-based Diffusion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[318]  arXiv:2405.02843 [pdf, other]
Title: Residual-Conditioned Optimal Transport: Towards Structure-preserving Unpaired and Paired Image Restoration
Comments: ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319]  arXiv:2405.02834 [pdf, other]
Title: Scene-Adaptive Person Search via Bilateral Modulations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[320]  arXiv:2405.02832 [pdf, other]
Title: Fast One-Stage Unsupervised Domain Adaptive Person Search
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[321]  arXiv:2405.02830 [pdf, other]
Title: You Only Need Half: Boosting Data Augmentation by Using Partial Content
Authors: Juntao Hu, Yuan Wu
Comments: Technical report, 16 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[322]  arXiv:2405.02824 [pdf, other]
Title: Adaptive Guidance Learning for Camouflaged Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[323]  arXiv:2405.02815 [pdf, other]
Title: Region-specific Risk Quantification for Interpretable Prognosis of COVID-19
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[324]  arXiv:2405.02811 [pdf, other]
Title: PVTransformer: Point-to-Voxel Transformer for Scalable 3D Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[325]  arXiv:2405.02797 [pdf, other]
Title: Adapting to Distribution Shift by Visual Domain Prompt Generation
Comments: ICLR2024, code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[326]  arXiv:2405.02793 [pdf, other]
Title: ImageInWords: Unlocking Hyper-Detailed Image Descriptions
Comments: Webpage (this https URL), GitHub (this https URL), HuggingFace (this https URL)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[327]  arXiv:2405.02792 [pdf, ps, other]
Title: Jointly Learning Spatial, Angular, and Temporal Information for Enhanced Lane Detection
Comments: 5 pages, 3 figures, accepted at IEEE Conference on Signal Processing and Communications Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[328]  arXiv:2405.02791 [pdf, other]
Title: Efficient Text-driven Motion Generation via Latent Consistency Training
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[329]  arXiv:2405.02787 [pdf, ps, other]
Title: Light Field Spatial Resolution Enhancement Framework
Comments: 5 pages, 6 figures, accepted in IEEE Conference on Signal Processing and Communications Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[330]  arXiv:2405.02785 [pdf, other]
Title: Fused attention mechanism-based ore sorting network
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[331]  arXiv:2405.02782 [pdf, ps, other]
Title: A self-supervised text-vision framework for automated brain abnormality detection
Comments: Under Review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[332]  arXiv:2405.02781 [pdf, other]
Title: Instantaneous Perception of Moving Objects in 3D
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[333]  arXiv:2405.02771 [pdf, other]
Title: MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning
Comments: Data and code are available on the project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[334]  arXiv:2405.02762 [pdf, other]
Title: TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes
Comments: 8 pages, submitted to IROS2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[335]  arXiv:2405.02751 [pdf, other]
Title: Deep Image Restoration For Image Anti-Forensics
Authors: Eren Tahir, Mert Bal
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[336]  arXiv:2405.02730 [pdf, other]
Title: U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers
Comments: 11 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[337]  arXiv:2405.02717 [pdf, other]
Title: AFter: Attention-based Fusion Router for RGBT Tracking
Comments: Peer review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[338]  arXiv:2405.02692 [pdf, ps, other]
Title: Diffeomorphic Transformer-based Abdomen MRI-CT Deformable Image Registration
Comments: 18 pages and 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[339]  arXiv:2405.02686 [pdf, other]
Title: Boosting 3D Neuron Segmentation with 2D Vision Transformer Pre-trained on Natural Images
Comments: 3 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[340]  arXiv:2405.02676 [pdf, other]
Title: Hand-Object Interaction Controller (HOIC): Deep Reinforcement Learning for Reconstructing Interactions with Physics
Comments: SIGGRAPH 2024 Conference Track
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[341]  arXiv:2405.02652 [pdf, other]
Title: Deep Pulse-Signal Magnification for remote Heart Rate Estimation in Compressed Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[342]  arXiv:2405.02608 [pdf, other]
Title: UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model
Comments: Accepted by CVPR 2024. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[343]  arXiv:2405.02595 [pdf, other]
Title: Vision-based 3D occupancy prediction in autonomous driving: a review and outlook
Comments: 20 pages, 20 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[344]  arXiv:2405.02591 [pdf, other]
Title: Better YOLO with Attention-Augmented Network and Enhanced Generalization Performance for Safety Helmet Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[345]  arXiv:2405.02586 [pdf, other]
Title: Generalizing CLIP to Unseen Domain via Text-Guided Diverse Novel Feature Synthesis
Comments: 24 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[346]  arXiv:2405.02581 [pdf, other]
Title: Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacements
Comments: Accepted at CVPR24 as Poster Highlight
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[347]  arXiv:2405.02571 [pdf, other]
Title: ViTALS: Vision Transformer for Action Localization in Surgical Nephrectomy
Comments: Nephrectomy surgery, Surgical Phase Recognition, Surgical Workflow Segmentation, 11 pages, 2 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[348]  arXiv:2405.02568 [pdf, other]
Title: ActiveNeuS: Active 3D Reconstruction using Neural Implicit Surface Uncertainty
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[349]  arXiv:2405.02564 [pdf, ps, other]
Title: Leveraging the Human Ventral Visual Stream to Improve Neural Network Robustness
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
[350]  arXiv:2405.02556 [pdf, other]
Title: Few-Shot Fruit Segmentation via Transfer Learning
Comments: To be published in the 2024 IEEE International Conference on Robotics and Automation (ICRA)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[351]  arXiv:2405.02538 [pdf, other]
Title: AdaFPP: Adapt-Focused Bi-Propagating Prototype Learning for Panoramic Activity Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[352]  arXiv:2405.02515 [pdf, other]
Title: SR4ZCT: Self-supervised Through-plane Resolution Enhancement for CT Images with Arbitrary Resolution and Overlap
Comments: MLMI2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353]  arXiv:2405.02512 [pdf, other]
Title: Spatio-Temporal SwinMAE: A Swin Transformer based Multiscale Representation Learner for Temporal Satellite Imagery
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[354]  arXiv:2405.02509 [pdf, other]
Title: Implicit Neural Representations for Robust Joint Sparse-View CT Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[355]  arXiv:2405.02508 [pdf, other]
Title: Rasterized Edge Gradients: Handling Discontinuities Differentiably
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[356]  arXiv:2405.02386 [pdf, other]
Title: Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids
Comments: SIGGRAPH 2024, Project page: this https URL, Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[357]  arXiv:2405.02363 [pdf, other]
Title: LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[358]  arXiv:2405.02334 [pdf, other]
Title: Rad4XCNN: a new agnostic method for post-hoc global explanation of CNN-derived features by means of radiomics
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[359]  arXiv:2405.02332 [pdf, other]
Title: Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models
Journal-ref: Generative Models for Computer Vision - CVPR 2024 Workshop, Jun 2024, Seattle, United States
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[360]  arXiv:2405.02317 [pdf, other]
Title: Long-term Human Participation Assessment In Collaborative Learning Environments Using Dynamic Scene Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[361]  arXiv:2405.02312 [pdf, ps, other]
Title: YOLOv5 vs. YOLOv8 in Marine Fisheries: Balancing Class Detection and Instance Count
Comments: 12 pages, 25 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[362]  arXiv:2405.02305 [pdf, ps, other]
Title: Inserting Faces inside Captions: Image Captioning with Attention Guided Merging
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Image and Video Processing (eess.IV)
[363]  arXiv:2405.02301 [pdf, other]
Title: TFCounter: Polishing Gems for Training-Free Object Counting
Comments: 14 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[364]  arXiv:2405.02297 [pdf, other]
Title: Employing Universal Voting Schemes for Improved Visual Place Recognition Performance
Comments: arXiv admin note: substantial text overlap with arXiv:2305.05705
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[365]  arXiv:2405.02296 [pdf, other]
Title: Möbius Transform for Mitigating Perspective Distortions in Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[366]  arXiv:2405.02295 [pdf, other]
Title: Neural Additive Image Model: Interpretation through Interpolation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[367]  arXiv:2405.02288 [pdf, other]
Title: Prospective Role of Foundation Models in Advancing Autonomous Vehicles
Comments: 36 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[368]  arXiv:2405.03649 (cross-list from cs.LG) [pdf, other]
Title: Learning Robust Classifiers with Self-Guided Spurious Correlation Mitigation
Comments: Accepted to IJCAI 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[369]  arXiv:2405.03501 (cross-list from cs.LG) [pdf, other]
Title: Boosting Single Positive Multi-label Classification with Generalized Robust Loss
Comments: 14 pages, 5 figures, 6 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[370]  arXiv:2405.03500 (cross-list from cs.MM) [pdf, other]
Title: A Rate-Distortion-Classification Approach for Lossy Image Compression
Authors: Yuefeng Zhang
Comments: 15 pages
Journal-ref: Digital Signal Processing Volume 141, September 2023, 104163
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[371]  arXiv:2405.03486 (cross-list from cs.CR) [pdf, other]
Title: UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Social and Information Networks (cs.SI)
[372]  arXiv:2405.03408 (cross-list from astro-ph.IM) [pdf, other]
Title: An Image Quality Evaluation and Masking Algorithm Based On Pre-trained Deep Neural Networks
Comments: Accepted by the AJ. The code can be downloaded from this https URL with DOI 10.12149/101415
Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Solar and Stellar Astrophysics (astro-ph.SR); Computer Vision and Pattern Recognition (cs.CV)
[373]  arXiv:2405.03376 (cross-list from cs.LG) [pdf, other]
Title: CRA5: Extreme Compression of ERA5 for Portable Global Climate and Weather Research via an Efficient Variational Transformer
Comments: Main text and supplementary, 22 pages, 13 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[374]  arXiv:2405.03355 (cross-list from cs.LG) [pdf, other]
Title: On the Theory of Cross-Modality Distillation with Contrastive Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[375]  arXiv:2405.03301 (cross-list from cs.LG) [pdf, other]
Title: Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification
Comments: International Joint Conference on Artificial Intelligence 2024 (to be published)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[376]  arXiv:2405.03164 (cross-list from cs.RO) [pdf, other]
Title: The Role of Predictive Uncertainty and Diversity in Embodied AI and Robot Learning
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[377]  arXiv:2405.03141 (cross-list from eess.IV) [pdf, other]
Title: Automatic Ultrasound Curve Angle Measurement via Affinity Clustering for Adolescent Idiopathic Scoliosis Evaluation
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[378]  arXiv:2405.03103 (cross-list from cs.LG) [pdf, other]
Title: Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs
Comments: Accepted to ICML 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[379]  arXiv:2405.03008 (cross-list from eess.IV) [pdf, other]
Title: DVMSR: Distillated Vision Mamba for Efficient Super-Resolution
Comments: 8 pages, 8 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[380]  arXiv:2405.02984 (cross-list from cs.CL) [pdf, other]
Title: E-TSL: A Continuous Educational Turkish Sign Language Dataset with Baseline Methods
Comments: 7 pages, 3 figures, 4 tables, submitted to IEEE conference
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[381]  arXiv:2405.02942 (cross-list from physics.optics) [pdf, other]
Title: Design, analysis, and manufacturing of a glass-plastic hybrid minimalist aspheric panoramic annular lens
Comments: Accepted to Optics & Laser Technology
Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[382]  arXiv:2405.02857 (cross-list from eess.IV) [pdf, other]
Title: I$^3$Net: Inter-Intra-slice Interpolation Network for Medical Slice Synthesis
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[383]  arXiv:2405.02852 (cross-list from eess.IV) [pdf, other]
Title: On Enhancing Brain Tumor Segmentation Across Diverse Populations with Convolutional Neural Networks
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[384]  arXiv:2405.02807 (cross-list from cs.LG) [pdf, ps, other]
Title: Kinematic analysis of structural mechanics based on convolutional neural network
Comments: 9 pages, 13 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[385]  arXiv:2405.02784 (cross-list from eess.IV) [pdf, other]
Title: MR-Transformer: Vision Transformer for Total Knee Replacement Prediction Using Magnetic Resonance Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[386]  arXiv:2405.02766 (cross-list from cs.LG) [pdf, other]
Title: Beyond Unimodal Learning: The Importance of Integrating Multiple Modalities for Lifelong Learning
Comments: Accepted at 3rd Conference on Lifelong Learning Agents (CoLLAs), 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[387]  arXiv:2405.02700 (cross-list from cs.LG) [pdf, other]
Title: Towards a Scalable Identification of Novel Modes in Generative Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[388]  arXiv:2405.02698 (cross-list from cs.LG) [pdf, ps, other]
Title: Stable Diffusion Dataset Generation for Downstream Classification Tasks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[389]  arXiv:2405.02678 (cross-list from cs.LG) [pdf, other]
Title: Position Paper: Quo Vadis, Unsupervised Time Series Anomaly Detection?
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[390]  arXiv:2405.02648 (cross-list from cs.LG) [pdf, other]
Title: A Conformal Prediction Score that is Robust to Label Noise
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[391]  arXiv:2405.02504 (cross-list from eess.IV) [pdf, other]
Title: Functional Imaging Constrained Diffusion for Brain PET Synthesis from Structural MRI
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[392]  arXiv:2405.02497 (cross-list from math.OC) [pdf, other]
Title: Prediction techniques for dynamic imaging with online primal-dual methods
Subjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV)
[393]  arXiv:2405.02383 (cross-list from stat.ML) [pdf, other]
Title: A Fresh Look at Sanity Checks for Saliency Maps
Comments: arXiv admin note: text overlap with arXiv:2401.06465
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[394]  arXiv:2405.02367 (cross-list from cs.LG) [pdf, other]
Title: Enhancing Social Media Post Popularity Prediction with Visual Content
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)

Mon, 6 May 2024

[395]  arXiv:2405.02280 [pdf, other]
Title: DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[396]  arXiv:2405.02266 [pdf, other]
Title: On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning?
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[397]  arXiv:2405.02246 [pdf, other]
Title: What matters when building vision-language models?
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[398]  arXiv:2405.02220 [pdf, other]
Title: Designed Dithering Sign Activation for Binary Neural Networks
Comments: 7 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[399]  arXiv:2405.02218 [pdf, other]
Title: Multispectral Fine-Grained Classification of Blackgrass in Wheat and Barley Crops
Comments: 19 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[400]  arXiv:2405.02191 [pdf, ps, other]
Title: Non-Destructive Peat Analysis using Hyperspectral Imaging and Machine Learning
Comments: 4 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[401]  arXiv:2405.02171 [pdf, other]
Title: Self-Supervised Learning for Real-World Super-Resolution from Dual and Multiple Zoomed Observations
Comments: Accepted by IEEE TPAMI in 2024. Extended version of ECCV 2022 paper "Self-Supervised Learning for Real-World Super-Resolution from Dual Zoomed Observations" (arXiv:2203.01325)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402]  arXiv:2405.02162 [pdf, other]
Title: Mapping the Unseen: Unified Promptable Panoptic Mapping with Dynamic Labeling using Foundation Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[403]  arXiv:2405.02155 [pdf, other]
Title: Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification
Authors: Siqi Yin, Lifan Jiang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[404]  arXiv:2405.02114 [pdf, other]
Title: Probablistic Restoration with Adaptive Noise Sampling for 3D Human Pose Estimation
Comments: ICME 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[405]  arXiv:2405.02077 [pdf, other]
Title: MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[406]  arXiv:2405.02068 [pdf, other]
Title: Advancing Pre-trained Teacher: Towards Robust Feature Discrepancy for Anomaly Detection
Comments: The paper is under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[407]  arXiv:2405.02066 [pdf, other]
Title: WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[408]  arXiv:2405.02061 [pdf, other]
Title: Towards general deep-learning-based tree instance segmentation models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[409]  arXiv:2405.02023 [pdf, other]
Title: IFNet: Deep Imaging and Focusing for Handheld SAR with Millimeter-wave Signals
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[410]  arXiv:2405.02008 [pdf, other]
Title: DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[411]  arXiv:2405.02005 [pdf, other]
Title: HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2
Comments: 8 pages, 9 figures, 2 tables. Will be published in the ISPRS The International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[412]  arXiv:2405.02004 [pdf, other]
Title: M${^2}$Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[413]  arXiv:2405.01992 [pdf, other]
Title: SFFNet: A Wavelet-Based Spatial and Frequency Domain Fusion Network for Remote Sensing Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[414]  arXiv:2405.01937 [pdf, other]
Title: An Attention Based Pipeline for Identifying Pre-Cancer Lesions in Head and Neck Clinical Images
Comments: 5 pages, 3 figures, accepted in ISBI 2024, update: corrected typos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[415]  arXiv:2405.01934 [pdf, other]
Title: Impact of Architectural Modifications on Deep Learning Adversarial Robustness
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[416]  arXiv:2405.01926 [pdf, other]
Title: Auto-Encoding Morph-Tokens for Multimodal LLM
Comments: Accepted by ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[417]  arXiv:2405.01920 [pdf, ps, other]
Title: Lightweight Change Detection in Heterogeneous Remote Sensing Images with Online All-Integer Pruning Training
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[418]  arXiv:2405.01885 [pdf, other]
Title: Enhancing Micro Gesture Recognition for Emotion Understanding via Context-aware Visual-Text Contrastive Learning
Comments: accepted by IEEE Signal Processing Letters
Journal-ref: IEEE Signal Processing Letters (2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[419]  arXiv:2405.01872 [pdf, other]
Title: Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[420]  arXiv:2405.01828 [pdf, other]
Title: FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[421]  arXiv:2405.01825 [pdf, other]
Title: Improving Concept Alignment in Vision-Language Concept Bottleneck Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[422]  arXiv:2405.01734 [pdf, other]
Title: Diabetic Retinopathy Detection Using Quantum Transfer Learning
Comments: 14 pages, 12 figures and 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[423]  arXiv:2405.01723 [pdf, other]
Title: Zero-Shot Monocular Motion Segmentation in the Wild by Combining Deep Learning with Geometric Motion Model Fusion
Comments: Accepted by the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[424]  arXiv:2405.01705 [pdf, other]
Title: Long Tail Image Generation Through Feature Space Augmentation and Iterated Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[425]  arXiv:2405.01701 [pdf, ps, other]
Title: Active Learning Enabled Low-cost Cell Image Segmentation Using Bounding Box Annotation
Authors: Yu Zhu, Qiang Yang, Li Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[426]  arXiv:2405.01699 [pdf, other]
Title: SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients
Comments: 7 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[427]  arXiv:2405.01691 [pdf, other]
Title: Language-Enhanced Latent Representations for Out-of-Distribution Detection in Autonomous Driving
Comments: Presented at the Robot Trust for Symbiotic Societies (RTSS) Workshop, co-located with ICRA 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[428]  arXiv:2405.01688 [pdf, other]
Title: Adapting Self-Supervised Learning for Computational Pathology
Comments: Presented at DCA in MI Workshop, CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[429]  arXiv:2405.01662 [pdf, other]
Title: Out-of-distribution detection based on subspace projection of high-dimensional features output by the last convolutional layer
Authors: Qiuyu Zhu, Yiwei He
Comments: 10 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430]  arXiv:2405.01656 [pdf, other]
Title: S4: Self-Supervised Sensing Across the Spectrum
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[431]  arXiv:2405.01654 [pdf, other]
Title: Key Patches Are All You Need: A Multiple Instance Learning Framework For Robust Medical Diagnosis
Comments: Accepted in DEF-AI-MIA Workshop@CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[432]  arXiv:2405.01646 [pdf, other]
Title: Explaining models relating objects and privacy
Comments: 7 pages, 3 figures, 1 table, supplementary material included as Appendix. Paper accepted at the 3rd XAI4CV Workshop at CVPR 2024. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[433]  arXiv:2405.01636 [pdf, other]
Title: Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey
Comments: 35 pages, 9 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[434]  arXiv:2405.01558 [pdf, other]
Title: Configurable Learned Holography
Comments: 14 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Optics (physics.optics)
[435]  arXiv:2405.02287 (cross-list from cs.CL) [pdf, other]
Title: Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[436]  arXiv:2405.02208 (cross-list from eess.IV) [pdf, other]
Title: Reference-Free Image Quality Metric for Degradation and Reconstruction Artifacts
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[437]  arXiv:2405.02179 (cross-list from cs.SD) [pdf, other]
Title: Training-Free Deepfake Voice Recognition by Leveraging Large-Scale Pre-Trained Models
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[438]  arXiv:2405.02109 (cross-list from eess.IV) [pdf, ps, other]
Title: Three-Dimensional Amyloid-Beta PET Synthesis from Structural MRI with Conditional Generative Adversarial Networks
Comments: Abstract Submitted and Presented at the 2024 International Society of Magnetic Resonance in Medicine. Singapore, Singapore, May 4-9. Abstract Number 2239
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[439]  arXiv:2405.01995 (cross-list from cs.LG) [pdf, other]
Title: Cooperation and Federation in Distributed Radar Point Cloud Processing
Journal-ref: 2023 IEEE 34th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[440]  arXiv:2405.01971 (cross-list from cs.RO) [pdf, other]
Title: A Sonar-based AUV Positioning System for Underwater Environments with Low Infrastructure Density
Comments: Accepted to the IEEE ICRA Workshop on Field Robotics 2024
Journal-ref: IEEE ICRA Workshop on Field Robotics 2024
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[441]  arXiv:2405.01963 (cross-list from cs.CR) [pdf, other]
Title: From Attack to Defense: Insights into Deep Learning Security Measures in Black-Box Settings
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[442]  arXiv:2405.01857 (cross-list from cs.NE) [pdf, other]
Title: TinySeg: Model Optimizing Framework for Image Segmentation on Tiny Embedded Systems
Comments: LCTES 2024
Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
[443]  arXiv:2405.01822 (cross-list from eess.IV) [pdf, other]
Title: Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[444]  arXiv:2405.01820 (cross-list from cs.CY) [pdf, ps, other]
Title: Real Risks of Fake Data: Synthetic Data, Diversity-Washing and Consent Circumvention
Journal-ref: FAccT '24, June 03--06, 2024, Rio de Janeiro, Brazil
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[445]  arXiv:2405.01776 (cross-list from cs.RO) [pdf, other]
Title: An Approach to Systematic Data Acquisition and Data-Driven Simulation for the Safety Testing of Automated Driving Functions
Comments: 8 pages, 5 figures
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[446]  arXiv:2405.01750 (cross-list from eess.IV) [pdf, other]
Title: PointCompress3D -- A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[447]  arXiv:2405.01726 (cross-list from eess.IV) [pdf, ps, other]
Title: SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[448]  arXiv:2405.01725 (cross-list from eess.IV) [pdf, other]
Title: Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[449]  arXiv:2405.01673 (cross-list from cs.RO) [pdf, other]
Title: ShadowNav: Autonomous Global Localization for Lunar Navigation in Darkness
Comments: 21 pages, 13 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[450]  arXiv:2405.01661 (cross-list from cs.LG) [pdf, other]
Title: When a Relation Tells More Than a Concept: Exploring and Evaluating Classifier Decisions with CoReX
Comments: preliminary version, submitted to Machine Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[451]  arXiv:2405.01658 (cross-list from eess.IV) [pdf, other]
Title: MMIST-ccRCC: A Real World Medical Dataset for the Development of Multi-Modal Systems
Comments: Accepted in DCA in MI Workshop@CVPR2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[452]  arXiv:2405.01644 (cross-list from eess.IV) [pdf, ps, other]
Title: A Classification-Based Adaptive Segmentation Pipeline: Feasibility Study Using Polycystic Liver Disease and Metastases from Colorectal Cancer CT Images
Comments: J Digit Imaging. Inform. med. (2024)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[453]  arXiv:2405.01607 (cross-list from cs.LG) [pdf, other]
Title: Wildfire Risk Prediction: A Review
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[454]  arXiv:2405.01600 (cross-list from eess.IV) [pdf, other]
Title: Deep Learning Descriptor Hybridization with Feature Reduction for Accurate Cervical Cancer Colposcopy Image Classification
Comments: 7 Pages double column, 5 figures, and 5 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[455]  arXiv:2405.01587 (cross-list from cs.CL) [pdf, ps, other]
Title: Improve Academic Query Resolution through BERT-based Question Extraction from Images
Journal-ref: 2024 IEEE International Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI) volume 2 (2024) 1-4
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[456]  arXiv:2405.01583 (cross-list from cs.CL) [pdf, other]
Title: MediFact at MEDIQA-M3G 2024: Medical Question Answering in Dermatology with Multimodal Learning
Authors: Nadia Saeed
Comments: 7 pages, 3 figures, Clinical NLP 2024 workshop proceedings in Shared Task
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)