We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 311

[ total of 533 entries: 1-311 | 312-533 ]
[ showing 311 entries per page: fewer | more | all ]

Tue, 30 Apr 2024 (continued, showing last 32 of 168 entries)

[312]  arXiv:2404.18343 (cross-list from cs.MM) [pdf, other]
Title: G-Refine: A General Quality Refiner for Text-to-Image Generation
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[313]  arXiv:2404.18246 (cross-list from cs.LG) [pdf, other]
Title: AdaFSNet: Time Series Classification Based on Convolutional Network with a Adaptive and Effective Kernel Size Configuration
Comments: Accepted by IJCNN 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[314]  arXiv:2404.18198 (cross-list from quant-ph) [pdf, other]
Title: Permutation-equivariant quantum convolutional neural networks
Comments: 13 pages, 10 figures
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[315]  arXiv:2404.18178 (cross-list from eess.IV) [pdf, other]
Title: Assessing Image Quality Using a Simple Generative Representation
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[316]  arXiv:2404.18161 (cross-list from cs.LG) [pdf, other]
Title: IMEX-Reg: Implicit-Explicit Regularization in the Function Space for Continual Learning
Comments: Published in Transactions on Machine Learning Research
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[317]  arXiv:2404.18096 (cross-list from eess.IV) [pdf, other]
Title: Snake with Shifted Window: Learning to Adapt Vessel Pattern for OCTA Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[318]  arXiv:2404.18083 (cross-list from cs.RO) [pdf, other]
Title: Online,Target-Free LiDAR-Camera Extrinsic Calibration via Cross-Modal Mask Matching
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[319]  arXiv:2404.18066 (cross-list from cs.NE) [pdf, other]
Title: Quantized Context Based LIF Neurons for Recurrent Spiking Neural Networks in 45nm
Comments: 7 Pages, 7 Figures, 2 Tables
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[320]  arXiv:2404.18058 (cross-list from eess.IV) [pdf, other]
Title: Joint Reference Frame Synthesis and Post Filter Enhancement for Versatile Video Coding
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[321]  arXiv:2404.18006 (cross-list from cs.RO) [pdf, other]
Title: FRAME: A Modular Framework for Autonomous Map-merging: Advancements in the Field
Comments: 28 pages, 24 figures. Submitted to Field Robotics
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[322]  arXiv:2404.17974 (cross-list from cs.RO) [pdf, other]
Title: HVOFusion: Incremental Mesh Reconstruction Using Hybrid Voxel Octree
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[323]  arXiv:2404.17931 (cross-list from cs.LG) [pdf, ps, other]
Title: Critical Review for One-class Classification: recent advances and the reality behind them
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[324]  arXiv:2404.17926 (cross-list from eess.IV) [pdf, other]
Title: Pre-training on High Definition X-ray Images: An Experimental Study
Comments: Technology Report
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[325]  arXiv:2404.17890 (cross-list from eess.IV) [pdf, other]
Title: DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction
Comments: 15 pages, 10 figures
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[326]  arXiv:2404.17878 (cross-list from eess.IV) [pdf, ps, other]
Title: Processing HSV Colored Medical Images and Adapting Color Thresholds for Computational Image Analysis: a Practical Introduction to an open-source tool
Authors: Lie Cai, Andre Pfob
Comments: An open-source tool that can adapt different color thresholds of HSV-colored medical images. The newly developed pre-processing Matlab function successfully works on multi-center, international shear wave elastography data (NCT 02638935). Step-by-step instructions with accompanying code lines were provided, easy to follow and reproduce
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[327]  arXiv:2404.17830 (cross-list from cs.LG) [pdf, other]
Title: Dynamic Against Dynamic: An Open-set Self-learning Framework
Comments: The first two authors contributed equally to this work. Accepted at IJCAI2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[328]  arXiv:2404.17805 (cross-list from cs.LG) [pdf, other]
Title: From Optimization to Generalization: Fair Federated Learning against Quality Shift via Inter-Client Sharpness Matching
Comments: This paper is accepted at IJCAI'24 (Main Track)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[329]  arXiv:2404.17773 (cross-list from cs.LG) [pdf, other]
Title: Compressing Latent Space via Least Volume
Authors: Qiuyi Chen, Mark Fuge
Comments: 24 pages, International Conference on Learning Representations 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[330]  arXiv:2404.17768 (cross-list from cs.LG) [pdf, other]
Title: Make the Most of Your Data: Changing the Training Data Distribution to Improve In-distribution Generalization Performance
Comments: 32 pages, 11 figures, 6 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[331]  arXiv:2404.17745 (cross-list from cs.RO) [pdf, ps, other]
Title: An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation
Comments: 22 Pages, 3 Tables, 9 Figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[332]  arXiv:2404.17742 (cross-list from eess.IV) [pdf, other]
Title: Segmentation Quality and Volumetric Accuracy in Medical Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[333]  arXiv:2404.17736 (cross-list from eess.SP) [pdf, other]
Title: Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[334]  arXiv:2404.17718 (cross-list from cs.RO) [pdf, other]
Title: Lessons from Deploying CropFollow++: Under-Canopy Agricultural Navigation with Keypoints
Comments: Accepted to the IEEE ICRA Workshop on Field Robotics 2024
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[335]  arXiv:2404.17704 (cross-list from eess.IV) [pdf, other]
Title: SPLICE -- Streamlining Digital Pathology Image Processing
Comments: Under review for publication
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[336]  arXiv:2404.17699 (cross-list from cs.LG) [pdf, other]
Title: Deep Learning for Melt Pool Depth Contour Prediction From Surface Thermal Images via Vision Transformers
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[337]  arXiv:2404.17697 (cross-list from cs.RO) [pdf, ps, other]
Title: Enhancing Track Management Systems with Vehicle-To-Vehicle Enabled Sensor Fusion
Comments: 6 pages, 5 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[338]  arXiv:2404.17670 (cross-list from eess.IV) [pdf, other]
Title: Federated Learning for Blind Image Super-Resolution
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
[339]  arXiv:2404.17651 (cross-list from cs.LG) [pdf, other]
Title: Hard ASH: Sparsity and the right optimizer make a continual learner
Authors: Santtu Keskinen
Comments: ICLR 2024 TinyPaper
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[340]  arXiv:2404.17621 (cross-list from eess.IV) [pdf, other]
Title: Attention-aware non-rigid image registration for accelerated MR imaging
Comments: 14 pages, 7 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[341]  arXiv:2404.17620 (cross-list from cs.LG) [pdf, other]
Title: Neural Modes: Self-supervised Learning of Nonlinear Modal Subspaces
Comments: Accepted to CVPR 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[342]  arXiv:2404.17617 (cross-list from cs.CR) [pdf, other]
Title: Beyond Traditional Threats: A Persistent Backdoor Attack on Federated Learning
Journal-ref: Proceedings of the AAAI Conference on Artificial Intelligence. 2024, 38(19): 21359-21367
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[343]  arXiv:2404.17587 (cross-list from cs.HC) [pdf, other]
Title: Uncovering the Metaverse within Everyday Environments: a Coarse-to-Fine Approach
Comments: This paper has been accepted by The 48th IEEE International Conference on Computers, Software, and Applications (COMPSAC 2024) for publication. It includes around 5600 words, 11 pages, 15 figures, and 1 table
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)

Mon, 29 Apr 2024

[344]  arXiv:2404.17571 [pdf, other]
Title: Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[345]  arXiv:2404.17569 [pdf, other]
Title: MaPa: Text-driven Photorealistic Material Painting for 3D Shapes
Comments: SIGGRAPH 2024. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[346]  arXiv:2404.17565 [pdf, other]
Title: ChangeBind: A Hybrid Change Encoder for Remote Sensing Change Detection
Comments: accepted at IGARSS 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[347]  arXiv:2404.17534 [pdf, other]
Title: Exploring the Distinctiveness and Fidelity of the Descriptions Generated by Large Vision-Language Models
Comments: 11 pages, 9 figures, 6 tables. For associated code, see this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[348]  arXiv:2404.17528 [pdf, other]
Title: Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields
Comments: Accepted by CVPR 2024. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[349]  arXiv:2404.17507 [pdf, other]
Title: HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts
Comments: 28pages, 4.5MB
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[350]  arXiv:2404.17503 [pdf, ps, other]
Title: Inhomogeneous illuminated image enhancement under extremely low visibility condition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
[351]  arXiv:2404.17498 [pdf, other]
Title: Learning text-to-video retrieval from image captioning
Comments: A short version of this work appeared at CVPR 2023 Workshops. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[352]  arXiv:2404.17488 [pdf, other]
Title: Low Cost Machine Vision for Insect Classification
Journal-ref: Arai, K. (eds) Intelligent Systems and Applications. IntelliSys 2023. Lecture Notes in Networks and Systems, vol 824. Springer
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[353]  arXiv:2404.17486 [pdf, other]
Title: TextGaze: Gaze-Controllable Face Generation with Natural Language
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[354]  arXiv:2404.17484 [pdf, other]
Title: Sparse Reconstruction of Optical Doppler Tomography Based on State Space Model
Comments: 19 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[355]  arXiv:2404.17433 [pdf, other]
Title: PromptCIR: Blind Compressed Image Restoration with Prompt Learning
Comments: Winner of NTIRE 2024 Blind Compressed Image Enhancement Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[356]  arXiv:2404.17427 [pdf, other]
Title: Cost-Sensitive Uncertainty-Based Failure Recognition for Object Detection
Comments: Accepted with an oral presentation at UAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[357]  arXiv:2404.17419 [pdf, other]
Title: Multi-view Image Prompted Multi-view Diffusion for Improved 3D Generation
Comments: 5 pages including references, 2 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[358]  arXiv:2404.17400 [pdf, other]
Title: Spatial-frequency Dual-Domain Feature Fusion Network for Low-Light Remote Sensing Image Enhancement
Comments: 14 page
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[359]  arXiv:2404.17381 [pdf, other]
Title: Frequency-Guided Multi-Level Human Action Anomaly Detection with Normalizing Flows
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[360]  arXiv:2404.17364 [pdf, other]
Title: MV-VTON: Multi-View Virtual Try-On with Diffusion Models
Comments: 15 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[361]  arXiv:2404.17360 [pdf, other]
Title: UniRGB-IR: A Unified Framework for Visible-Infrared Downstream Tasks via Adapter Tuning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[362]  arXiv:2404.17340 [pdf, other]
Title: Masked Two-channel Decoupling Framework for Incomplete Multi-view Weak Multi-label Learning
Comments: Accepted at NeurIPS 2023. Email: liucl1996@163.com
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[363]  arXiv:2404.17335 [pdf, other]
Title: A Novel Spike Transformer Network for Depth Estimation from Event Cameras via Cross-modality Knowledge Distillation
Comments: 16 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[364]  arXiv:2404.17324 [pdf, other]
Title: Dense Road Surface Grip Map Prediction from Multimodal Image Data
Comments: 17 pages, 7 figures (supplementary material 1 page, 1 figure). Submitted to 27th International Conference of Pattern Recognition (ICPR 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[365]  arXiv:2404.17310 [pdf, other]
Title: Image Copy-Move Forgery Detection via Deep PatchMatch and Pairwise Ranking Learning
Comments: 16 pages, 14figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[366]  arXiv:2404.17275 [pdf, other]
Title: Adversarial Reweighting with $α$-Power Maximization for Domain Adaptation
Comments: To appear in IJCV
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[367]  arXiv:2404.17273 [pdf, other]
Title: 3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting
Comments: Accepted Information Processing and Management (IP&M), 10 pages, 9 figures and 8 tables
Journal-ref: Information Processing & Management, Volume 61, Issue 4, July 2024, 103716
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[368]  arXiv:2404.17255 [pdf, other]
Title: SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[369]  arXiv:2404.17254 [pdf, other]
Title: Trinity Detector:text-assisted and attention mechanisms based spectral fusion for diffusion generation image detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[370]  arXiv:2404.17253 [pdf, other]
Title: Weakly Supervised Training for Hologram Verification in Identity Documents
Authors: Glen Pouliquen (1 and 2), Guillaume Chiron (1), Joseph Chazalon (2), Thierry Géraud (2), Ahmad Montaser Awal (1) ((1) IDnow AI & ML Center of Excellence, France, (2) EPITA Research Lab. (LRE), EPITA, France)
Comments: Accepted at the International Conference on Document Analysis and Recognition (ICDAR 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[371]  arXiv:2404.17251 [pdf, other]
Title: Camera Motion Estimation from RGB-D-Inertial Scene Flow
Comments: Accepted to CVPR2024 Workshop on Visual Odometry and Computer Vision Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[372]  arXiv:2404.17245 [pdf, other]
Title: Parameter Efficient Fine-tuning of Self-supervised ViTs without Catastrophic Forgetting
Comments: Accepted at eLVM Workshop, CVPR, 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[373]  arXiv:2404.17243 [pdf, other]
Title: Binarizing Documents by Leveraging both Space and Frequency
Comments: Accepted at ICDAR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[374]  arXiv:2404.17230 [pdf, other]
Title: ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion
Comments: 12 pages, submitted to ECCV2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[375]  arXiv:2404.17221 [pdf, other]
Title: SAGHOG: Self-Supervised Autoencoder for Generating HOG Features for Writer Retrieval
Comments: accepted for ICDAR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[376]  arXiv:2404.17205 [pdf, other]
Title: Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[377]  arXiv:2404.17202 [pdf, other]
Title: Self-supervised visual learning in the low-data regime: a comparative evaluation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[378]  arXiv:2404.17199 [pdf, other]
Title: Few-shot Calligraphy Style Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[379]  arXiv:2404.17186 [pdf, other]
Title: MCSDNet: Mesoscale Convective System Detection Network via Multi-scale Spatiotemporal Information
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[380]  arXiv:2404.17184 [pdf, other]
Title: Low-Rank Knowledge Decomposition for Medical Foundation Models
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[381]  arXiv:2404.17176 [pdf, other]
Title: MovieChat+: Question-aware Sparse Memory for Long Video Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[382]  arXiv:2404.17173 [pdf, other]
Title: Exploring Beyond Logits: Hierarchical Dynamic Labeling Based on Embeddings for Semi-Supervised Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[383]  arXiv:2404.17170 [pdf, other]
Title: S-IQA Image Quality Assessment With Compressive Sampling
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[384]  arXiv:2404.17159 [pdf, other]
Title: Phase-aggregated Dual-branch Network for Efficient Fingerprint Dense Registration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[385]  arXiv:2404.17152 [pdf, other]
Title: CSCO: Connectivity Search of Convolutional Operators
Comments: To appear on Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops (2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[386]  arXiv:2404.17149 [pdf, other]
Title: Pose-Specific 3D Fingerprint Unfolding
Journal-ref: 15th Chinese Conference on Biometric Recognition (CCBR), Shanghai, China, 2021, pp. 185-194
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[387]  arXiv:2404.17148 [pdf, other]
Title: Direct Regression of Distortion Field from a Single Fingerprint Image
Journal-ref: 2022 IEEE International Joint Conference on Biometrics (IJCB), Abu Dhabi, United Arab Emirates, 2022, pp. 1-8
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[388]  arXiv:2404.17147 [pdf, other]
Title: On the Federated Learning Framework for Cooperative Perception
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[389]  arXiv:2404.17118 [pdf, ps, other]
Title: Localization of Pallets on Shelves Using Horizontal Plane Projection of a 360-degree Image
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[390]  arXiv:2404.17105 [pdf, other]
Title: Synthesizing Iris Images using Generative Adversarial Networks: Survey and Comparative Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[391]  arXiv:2404.17100 [pdf, other]
Title: Open-Set Video-based Facial Expression Recognition with Human Expression-sensitive Prompting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392]  arXiv:2404.17092 [pdf, other]
Title: Defending Spiking Neural Networks against Adversarial Attacks through Image Purification
Authors: Weiran Chen, Qi Sun, Qi Xu
Comments: 8 pages, 5 figures, ECAI2024 under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[393]  arXiv:2404.17041 [pdf, other]
Title: Nuclei-Location Based Point Set Registration of Multi-Stained Whole Slide Images
Comments: 15 pages, 5 figures, Submitted to Medical Image Understanding and Analysis Conference 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[394]  arXiv:2404.17033 [pdf, other]
Title: Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segmentation
Comments: Accepted at MIDL 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[395]  arXiv:2404.17031 [pdf, other]
Title: Motor Focus: Ego-Motion Prediction with All-Pixel Matching
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[396]  arXiv:2404.17029 [pdf, other]
Title: Dr-SAM: An End-to-End Framework for Vascular Segmentation, Diameter Estimation, and Anomaly Detection on Angiography Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[397]  arXiv:2404.16994 [pdf, other]
Title: PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[398]  arXiv:2404.16972 [pdf, other]
Title: CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399]  arXiv:2404.16944 [pdf, other]
Title: Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban Intersection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[400]  arXiv:2404.16885 [pdf, ps, other]
Title: Adapting an Artificial Intelligence Sexually Transmitted Diseases Symptom Checker Tool for Mpox Detection: The HeHealth Experience
Comments: 15 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[401]  arXiv:2404.16882 [pdf, other]
Title: ThermoPore: Predicting Part Porosity Based on Thermal Images Using Deep Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[402]  arXiv:2404.16845 [pdf, other]
Title: HaLo-NeRF: Learning Geometry-Guided Semantics for Exploring Unconstrained Photo Collections
Comments: Eurographics 2024. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[403]  arXiv:2404.16844 [pdf, other]
Title: Sugarcane Health Monitoring With Satellite Spectroscopy and Machine Learning: A Review
Comments: 22 pages, 6 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[404]  arXiv:2404.16833 [pdf, other]
Title: Leaf-Based Plant Disease Detection and Explainable AI
Comments: To appear in a Journal/Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[405]  arXiv:2404.17521 (cross-list from cs.RO) [pdf, other]
Title: Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations
Comments: Project website and open-source code: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[406]  arXiv:2404.17426 (cross-list from eess.IV) [pdf, ps, other]
Title: One-Shot Image Restoration
Authors: Deborah Pereg
Comments: arXiv admin note: text overlap with arXiv:2209.14267
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[407]  arXiv:2404.17371 (cross-list from cs.LG) [pdf, other]
Title: Estimating the Robustness Radius for Randomized Smoothing with 100$\times$ Sample Efficiency
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[408]  arXiv:2404.17357 (cross-list from eess.IV) [pdf, other]
Title: Simultaneous Tri-Modal Medical Image Fusion and Super-Resolution using Conditional Diffusion Model
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[409]  arXiv:2404.17350 (cross-list from cs.LG) [pdf, other]
Title: On the Road to Clarity: Exploring Explainable AI for World Models in a Driver Assistance System
Comments: 8 pages, 6 figures, to be published in IEEE CAI 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[410]  arXiv:2404.17302 (cross-list from cs.RO) [pdf, other]
Title: Part-Guided 3D RL for Sim2Real Articulated Object Manipulation
Comments: 9 pages
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[411]  arXiv:2404.17252 (cross-list from cs.LG) [pdf, ps, other]
Title: Comparison of self-supervised in-domain and supervised out-domain transfer learning for bird species recognition
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[412]  arXiv:2404.17235 (cross-list from eess.IV) [pdf, other]
Title: Optimizing Universal Lesion Segmentation: State Space Model-Guided Hierarchical Networks with Feature Importance Adjustment
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[413]  arXiv:2404.17215 (cross-list from cs.RO) [pdf, other]
Title: SLAM for Indoor Mapping of Wide Area Construction Environments
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[414]  arXiv:2404.17212 (cross-list from cs.ET) [pdf, ps, other]
Title: Scrutinizing Data from Sky: An Examination of Its Veracity in Area Based Traffic Contexts
Authors: Yawar Ali (1), Krishnan K N (1), Debashis Ray Sarkar (1), K. Ramachandra Rao (1), Niladri Chatterjee (1), Ashish Bhaskar (2) ((1) Indian Institute of Technology Delhi, New Delhi, India (2) Queensland University of Technology, Brisbane, Australia)
Subjects: Emerging Technologies (cs.ET); Computer Vision and Pattern Recognition (cs.CV)
[415]  arXiv:2404.17151 (cross-list from cs.MM) [pdf, other]
Title: MorphText: Deep Morphology Regularized Arbitrary-shape Scene Text Detection
Comments: Accepted by Transaction on Multimedia
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[416]  arXiv:2404.17104 (cross-list from cs.HC) [pdf, other]
Title: Don't Look at the Camera: Achieving Perceived Eye Contact
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
[417]  arXiv:2404.17083 (cross-list from eess.IV) [pdf, other]
Title: Calculation of Femur Caput Collum Diaphyseal angle for X-Rays images using Semantic Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[418]  arXiv:2404.17064 (cross-list from eess.IV) [pdf, other]
Title: Detection of Peri-Pancreatic Edema using Deep Learning and Radiomics Techniques
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[419]  arXiv:2404.17063 (cross-list from cs.HC) [pdf, other]
Title: WheelPose: Data Synthesis Techniques to Improve Pose Estimation Performance on Wheelchair Users
Comments: Published for ACM CHI 2024. For source files, see this https URL
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
[420]  arXiv:2404.16917 (cross-list from cs.LG) [pdf, other]
Title: Grad Queue : A probabilistic framework to reinforce sparse gradients
Comments: 15 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[421]  arXiv:2404.16897 (cross-list from cs.LG) [pdf, other]
Title: Exploring Learngene via Stage-wise Weight Sharing for Initializing Variable-sized Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

Fri, 26 Apr 2024

[422]  arXiv:2404.16831 [pdf, other]
[423]  arXiv:2404.16829 [pdf, other]
Title: Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[424]  arXiv:2404.16828 [pdf, other]
Title: Made to Order: Discovering monotonic temporal changes via self-supervised video ordering
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[425]  arXiv:2404.16825 [pdf, other]
Title: ResVR: Joint Rescaling and Viewport Rendering of Omnidirectional Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[426]  arXiv:2404.16824 [pdf, other]
Title: V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[427]  arXiv:2404.16821 [pdf, other]
Title: How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[428]  arXiv:2404.16820 [pdf, other]
Title: Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
Comments: Data and code will be released at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[429]  arXiv:2404.16818 [pdf, other]
Title: Boosting Unsupervised Semantic Segmentation with Principal Mask Proposals
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430]  arXiv:2404.16814 [pdf, other]
Title: Meta-Transfer Derm-Diagnosis: Exploring Few-Shot Learning and Transfer Learning for Skin Disease Classification in Long-Tail Distribution
Comments: 17 pages, 5 figures, 6 tables, submitted to IEEE Journal of Biomedical and Health Informatics
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[431]  arXiv:2404.16804 [pdf, other]
Title: AAPL: Adding Attributes to Prompt Learning for Vision-Language Models
Comments: Accepted to CVPR 2024 Workshop on Prompting in Vision, Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[432]  arXiv:2404.16790 [pdf, other]
Title: SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[433]  arXiv:2404.16781 [pdf, other]
Title: Registration by Regression (RbR): a framework for interpretable and flexible atlas registration
Comments: 11 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[434]  arXiv:2404.16773 [pdf, other]
Title: ConKeD++ -- Improving descriptor learning for retinal image registration: A comprehensive study of contrastive losses
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[435]  arXiv:2404.16771 [pdf, other]
Title: ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[436]  arXiv:2404.16754 [pdf, other]
Title: RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[437]  arXiv:2404.16752 [pdf, other]
Title: TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[438]  arXiv:2404.16748 [pdf, other]
Title: TELA: Text to Layer-wise 3D Clothed Human Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[439]  arXiv:2404.16739 [pdf, ps, other]
Title: CBRW: A Novel Approach for Cancelable Biometric Template Generation based on
Authors: Nitin Kumar, Manisha
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[440]  arXiv:2404.16717 [pdf, other]
Title: Embracing Diversity: Interpretable Zero-shot classification beyond one vector per class
Comments: Accepted to FAccT 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[441]  arXiv:2404.16687 [pdf, other]
Title: NTIRE 2024 Quality Assessment of AI-Generated Content Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[442]  arXiv:2404.16685 [pdf, other]
Title: Multi-scale HSV Color Feature Embedding for High-fidelity NIR-to-RGB Spectrum Translation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[443]  arXiv:2404.16678 [pdf, other]
Title: Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[444]  arXiv:2404.16670 [pdf, other]
Title: EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning
Comments: Accepted by CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[445]  arXiv:2404.16666 [pdf, other]
Title: PhyRecon: Physically Plausible Neural Scene Reconstruction
Comments: project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[446]  arXiv:2404.16637 [pdf, other]
Title: Zero-Shot Distillation for Image Encoders: How to Make Effective Use of Synthetic Data
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[447]  arXiv:2404.16635 [pdf, other]
Title: TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning
Comments: 13 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[448]  arXiv:2404.16633 [pdf, other]
Title: Self-Balanced R-CNN for Instance Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[449]  arXiv:2404.16622 [pdf, other]
Title: DAVE -- A Detect-and-Verify Paradigm for Low-Shot Counting
Comments: Accepted to CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[450]  arXiv:2404.16617 [src]
Title: Denoising: from classical methods to deep CNNs
Comments: This document uses works by authors not yet presented to the community and may appear to be original
Subjects: Computer Vision and Pattern Recognition (cs.CV); History and Overview (math.HO)
[451]  arXiv:2404.16612 [pdf, other]
Title: MuseumMaker: Continual Style Customization without Catastrophic Forgetting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[452]  arXiv:2404.16609 [pdf, other]
Title: SFMViT: SlowFast Meet ViT in Chaotic World
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[453]  arXiv:2404.16581 [pdf, other]
Title: AudioScenic: Audio-Driven Video Scene Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[454]  arXiv:2404.16578 [pdf, other]
Title: Road Surface Friction Estimation for Winter Conditions Utilising General Visual Features
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[455]  arXiv:2404.16573 [pdf, other]
Title: Multi-Scale Representations by Varying Window Attention for Semantic Segmentation
Comments: ICLR2024 Poster
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[456]  arXiv:2404.16571 [pdf, other]
Title: MonoPCC: Photometric-invariant Cycle Constraint for Monocular Depth Estimation of Endoscopic Images
Comments: 9 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[457]  arXiv:2404.16561 [pdf, ps, other]
Title: Research on geometric figure classification algorithm based on Deep Learning
Comments: 6 pages,9 figures
Journal-ref: Scientific Journal of Intelligent Systems Research,Volume 4 Issue 6, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[458]  arXiv:2404.16558 [pdf, other]
Title: DeepKalPose: An Enhanced Deep-Learning Kalman Filter for Temporally Consistent Monocular Vehicle Pose Estimation
Comments: 4 pages, 3 Figures, published to IET Electronic Letters
Journal-ref: Electronics Letters (ISSN: 00135194), jaar: 2024, volume: 60, nummer: 8, startpagina: ?
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[459]  arXiv:2404.16557 [pdf, other]
Title: Energy-Latency Manipulation of Multi-modal Large Language Models via Verbose Samples
Comments: arXiv admin note: substantial text overlap with arXiv:2401.11170
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[460]  arXiv:2404.16556 [pdf, other]
Title: Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[461]  arXiv:2404.16552 [pdf, other]
Title: Efficient Solution of Point-Line Absolute Pose
Comments: CVPR 2024, 11 pages, 8 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[462]  arXiv:2404.16548 [pdf, other]
Title: Cross-Domain Spatial Matching for Camera and Radar Sensor Data Fusion in Autonomous Vehicle Perception System
Comments: 12 pages including highlights and graphical abstract, submitted to Expert Systems with Applications journal
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[463]  arXiv:2404.16538 [pdf, other]
Title: OpenDlign: Enhancing Open-World 3D Learning with Depth-Aligned Images
Comments: 12 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[464]  arXiv:2404.16536 [pdf, other]
Title: 3D Face Modeling via Weakly-supervised Disentanglement Network joint Identity-consistency Prior
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[465]  arXiv:2404.16507 [pdf, other]
Title: Semantic-aware Next-Best-View for Multi-DoFs Mobile System in Search-and-Acquisition based Visual Perception
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[466]  arXiv:2404.16501 [pdf, other]
Title: 360SFUDA++: Towards Source-free UDA for Panoramic Segmentation by Learning Reliable Category Prototypes
Comments: arXiv admin note: substantial text overlap with arXiv:2403.12505
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[467]  arXiv:2404.16493 [pdf, other]
Title: Commonsense Prototype for Outdoor Unsupervised 3D Object Detection
Comments: Accepted by CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[468]  arXiv:2404.16484 [pdf, other]
[469]  arXiv:2404.16474 [pdf, other]
Title: DiffSeg: A Segmentation Model for Skin Lesions Based on Diffusion Difference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[470]  arXiv:2404.16471 [pdf, other]
Title: COBRA -- COnfidence score Based on shape Regression Analysis for method-independent quality assessment of object pose estimation from single images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[471]  arXiv:2404.16456 [pdf, other]
Title: Correlation-Decoupled Knowledge Distillation for Multimodal Sentiment Analysis with Incomplete Modalities
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[472]  arXiv:2404.16452 [pdf, other]
Title: PAD: Patch-Agnostic Defense against Adversarial Patch Attacks
Comments: Accepted by CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[473]  arXiv:2404.16451 [pdf, other]
Title: Latent Modulated Function for Computational Optimal Continuous Image Representation
Authors: Zongyao He, Zhi Jin
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[474]  arXiv:2404.16432 [pdf, other]
Title: Point-JEPA: A Joint Embedding Predictive Architecture for Self-Supervised Learning on Point Cloud
Comments: 10 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[475]  arXiv:2404.16429 [pdf, other]
Title: Depth Supervised Neural Surface Reconstruction from Airborne Imagery
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[476]  arXiv:2404.16423 [pdf, other]
Title: Neural Assembler: Learning to Generate Fine-Grained Robotic Assembly Instructions from Multi-View Images
Authors: Hongyu Yan, Yadong Mu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[477]  arXiv:2404.16422 [pdf, other]
Title: Robust Fine-tuning for Pre-trained 3D Point Cloud Models
Comments: 9 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[478]  arXiv:2404.16421 [pdf, other]
Title: SynCellFactory: Generative Data Augmentation for Cell Tracking
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[479]  arXiv:2404.16416 [pdf, other]
Title: Learning Discriminative Spatio-temporal Representations for Semi-supervised Action Recognition
Comments: 10 pages, 6 figures, 6 tables, 56 conferences
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[480]  arXiv:2404.16409 [pdf, other]
Title: Cross-sensor super-resolution of irregularly sampled Sentinel-2 time series
Authors: Aimi Okabayashi (IRISA, OBELIX), Nicolas Audebert (CEDRIC - VERTIGO, CNAM, LaSTIG, IGN), Simon Donike (IPL), Charlotte Pelletier (OBELIX, IRISA)
Journal-ref: EARTHVISION 2024 IEEE/CVF CVPR Workshop. Large Scale Computer Vision for Remote Sensing Imagery, Jun 2024, Seattle, United States
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[481]  arXiv:2404.16398 [pdf, other]
Title: Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval
Comments: 20 pages, 8 sugures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[482]  arXiv:2404.16386 [pdf, other]
Title: Promoting CNNs with Cross-Architecture Knowledge Distillation for Efficient Monocular Depth Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[483]  arXiv:2404.16385 [pdf, other]
Title: Efficiency in Focus: LayerNorm as a Catalyst for Fine-tuning Medical Visual Language Pre-trained Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[484]  arXiv:2404.16380 [pdf, ps, other]
Title: Efficient Higher-order Convolution for Small Kernels in Deep Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[485]  arXiv:2404.16375 [pdf, other]
Title: List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
Comments: Preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[486]  arXiv:2404.16371 [pdf, other]
Title: Multimodal Information Interaction for Medical Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[487]  arXiv:2404.16359 [pdf, other]
Title: An Improved Graph Pooling Network for Skeleton-Based Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[488]  arXiv:2404.16348 [pdf, other]
Title: Dual Expert Distillation Network for Generalized Zero-Shot Learning
Comments: 9 pages, 4 figures; Accepted to IJCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[489]  arXiv:2404.16339 [pdf, other]
Title: Training-Free Unsupervised Prompt for Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[490]  arXiv:2404.16331 [pdf, other]
Title: IMWA: Iterative Model Weight Averaging Benefits Class-Imbalanced Learning Tasks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[491]  arXiv:2404.16325 [pdf, other]
Title: Semantic Segmentation Refiner for Ultrasound Applications with Zero-Shot Foundation Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[492]  arXiv:2404.16323 [pdf, other]
Title: DIG3D: Marrying Gaussian Splatting with Deformable Transformer for Single Image 3D Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[493]  arXiv:2404.16306 [pdf, other]
Title: TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[494]  arXiv:2404.16304 [pdf, other]
Title: BezierFormer: A Unified Architecture for 2D and 3D Lane Detection
Comments: ICME 2024, 11 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[495]  arXiv:2404.16302 [pdf, other]
Title: CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions
Comments: The dataset and source code will be made publicly available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Robotics (cs.RO); Image and Video Processing (eess.IV)
[496]  arXiv:2404.16301 [pdf, other]
Title: Style Adaptation for Domain-adaptive Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[497]  arXiv:2404.16296 [pdf, ps, other]
Title: Research on Splicing Image Detection Algorithms Based on Natural Image Statistical Characteristics
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[498]  arXiv:2404.16268 [pdf, other]
Title: Lacunarity Pooling Layers for Plant Image Classification using Texture Analysis
Comments: 9 pages, 7 figures, accepted at 2024 IEEE/CVF Computer Vision and Pattern Recognition Vision for Agriculture Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[499]  arXiv:2404.16266 [pdf, other]
Title: A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation
Comments: GECCO 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[500]  arXiv:2404.16223 [pdf, other]
Title: Deep RAW Image Super-Resolution. A NTIRE 2024 Challenge Survey
Comments: CVPR 2024 - NTIRE Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[501]  arXiv:2404.16222 [pdf, other]
Title: Step Differences in Instructional Video
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[502]  arXiv:2404.16221 [pdf, other]
Title: NeRF-XL: Scaling NeRFs with Multiple GPUs
Comments: Webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Graphics (cs.GR)
[503]  arXiv:2404.16216 [pdf, other]
Title: ActiveRIR: Active Audio-Visual Exploration for Acoustic Environment Modeling
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[504]  arXiv:2404.16205 [pdf, other]
Title: AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results
Comments: CVPR 2024 Workshop -- AI for Streaming (AIS) Video Quality Assessment Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[505]  arXiv:2404.16193 [pdf, other]
Title: Improving Multi-label Recognition using Class Co-Occurrence Probabilities
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[506]  arXiv:2404.16155 [pdf, other]
Title: Does SAM dream of EIG? Characterizing Interactive Segmenter Performance using Expected Information Gain
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (cs.LG)
[507]  arXiv:2404.16139 [pdf, other]
Title: A Survey on Intermediate Fusion Methods for Collaborative Perception Categorized by Real World Challenges
Comments: 8 pages, 6 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[508]  arXiv:2404.16136 [pdf, other]
Title: 3D Human Pose Estimation with Occlusions: Introducing BlendMimic3D Dataset and GCN Refinement
Comments: Accepted at 6th Workshop and Competition on Affective Behavior Analysis in-the-wild - CVPR 2024 Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[509]  arXiv:2404.16133 [pdf, ps, other]
Title: Quantitative Characterization of Retinal Features in Translated OCTA
Comments: The article has been revised and edited
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[510]  arXiv:2404.16123 [pdf, other]
Title: FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication
Comments: Conference paper at CVPR 2024. 6 pages, 8 figures. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[511]  arXiv:2404.16038 [pdf, other]
Title: A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming
Comments: 16 pages, 10 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[512]  arXiv:2404.16037 [pdf, other]
Title: VN-Net: Vision-Numerical Fusion Graph Convolutional Network for Sparse Spatio-Temporal Meteorological Forecasting
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph)
[513]  arXiv:2404.16823 (cross-list from cs.RO) [pdf, other]
Title: Learning Visuotactile Skills with Two Multifingered Hands
Comments: Code and Project Website: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[514]  arXiv:2404.16767 (cross-list from cs.LG) [pdf, other]
Title: REBEL: Reinforcement Learning via Regressing Relative Rewards
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[515]  arXiv:2404.16718 (cross-list from eess.IV) [pdf, other]
Title: Features Fusion for Dual-View Mammography Mass Detection
Comments: Accepted at ISBI 2024 (21st IEEE International Symposium on Biomedical Imaging)
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[516]  arXiv:2404.16708 (cross-list from eess.IV) [pdf, other]
Title: Multi-view Cardiac Image Segmentation via Trans-Dimensional Priors
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[517]  arXiv:2404.16529 (cross-list from cs.RO) [pdf, other]
Title: Vision-based robot manipulation of transparent liquid containers in a laboratory setting
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[518]  arXiv:2404.16510 (cross-list from cs.GR) [pdf, other]
Title: Interactive3D: Create What You Want by Interactive 3D Generation
Comments: project page: this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[519]  arXiv:2404.16482 (cross-list from q-bio.NC) [pdf, other]
Title: CoCoG: Controllable Visual Stimuli Generation based on Human Concept Representations
Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[520]  arXiv:2404.16397 (cross-list from eess.IV) [pdf, other]
Title: Deep Learning-based Prediction of Breast Cancer Tumor and Immune Phenotypes from Histopathology
Comments: Paper accepted at the First Workshop on Imageomics (Imageomics-AAAI-24) - Discovering Biological Knowledge from Images using AI (this https URL), held as part of the 38th Annual AAAI Conference on Artificial Intelligence (this https URL)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[521]  arXiv:2404.16346 (cross-list from eess.IV) [pdf, other]
Title: Light-weight Retinal Layer Segmentation with Global Reasoning
Comments: IEEE Transactions on Instrumentation & Measurement
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[522]  arXiv:2404.16336 (cross-list from cs.LG) [pdf, other]
Title: FedStyle: Style-Based Federated Learning Crowdsourcing Framework for Art Commissions
Comments: Accepted to ICME 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[523]  arXiv:2404.16307 (cross-list from cs.LG) [pdf, other]
Title: Boosting Model Resilience via Implicit Adversarial Data Augmentation
Comments: 9 pages, 6 figures, accepted by IJCAI 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[524]  arXiv:2404.16300 (cross-list from cs.LG) [pdf, other]
Title: Reinforcement Learning with Generative Models for Compact Support Sets
Comments: 4 pages, 2 figures. Code available at: this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[525]  arXiv:2404.16292 (cross-list from cs.GR) [pdf, other]
Title: One Noise to Rule Them All: Learning a Unified Model of Spatially-Varying Noise Patterns
Comments: In ACM Transactions on Graphics (Proceedings of SIGGRAPH) 2024, 21 pages
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[526]  arXiv:2404.16255 (cross-list from cs.CR) [pdf, other]
Title: Enhancing Privacy in Face Analytics Using Fully Homomorphic Encryption
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[527]  arXiv:2404.16212 (cross-list from cs.CR) [pdf, other]
Title: An Analysis of Recent Advances in Deepfake Image Detection in an Evolving Threat Landscape
Comments: Accepted to IEEE S&P 2024; 19 pages, 10 figures
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[528]  arXiv:2404.16192 (cross-list from cs.CL) [pdf, other]
Title: Fusion of Domain-Adapted Vision and Language Models for Medical Visual Question Answering
Comments: Clinical NLP @ NAACL 2024
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[529]  arXiv:2404.16174 (cross-list from cs.HC) [pdf, other]
Title: MiMICRI: Towards Domain-centered Counterfactual Explanations of Cardiovascular Image Classification Models
Comments: 14 pages, 6 figures, ACM FAccT 2024
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[530]  arXiv:2404.16112 (cross-list from cs.LG) [pdf, other]
Title: Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[531]  arXiv:2404.16080 (cross-list from eess.IV) [pdf, other]
Title: Enhancing Diagnosis through AI-driven Analysis of Reflectance Confocal Microscopy
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[532]  arXiv:2404.16049 (cross-list from physics.med-ph) [pdf, other]
Title: Exploring the limitations of blood pressure estimation using the photoplethysmography signal
Comments: 17 pages, 7 figures, 3 tables
Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[533]  arXiv:2404.15405 (cross-list from astro-ph.SR) [pdf, ps, other]
Title: Photometry of Saturated Stars with Machine Learning
Authors: Dominek Winecki (1) Christopher S. Kochanek (2) ((1) Dept. of Computer Science and Engineeering, The Ohio State University (2) Dept. of Astronomy, The Ohio State University)
Comments: submitted to ApJ
Subjects: Solar and Stellar Astrophysics (astro-ph.SR); Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV)
[ total of 533 entries: 1-311 | 312-533 ]
[ showing 311 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)