Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 160

[ total of 614 entries: 1-50 | 11-60 | 61-110 | 111-160 | 161-210 | 211-260 | 261-310 | 311-360 | ... | 611-614 ]

Fri, 24 May 2024 (continued, showing 50 of 242 entries)

[161]  arXiv:2405.13467 [pdf, other]
Title: AdaFedFR: Federated Face Recognition with Adaptive Inter-Class Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[162]  arXiv:2405.13459 [pdf, other]
Title: Adapting Multi-modal Large Language Model to Concept Drift in the Long-tailed Open World
Authors: Xiaoyu Yang, Jie Lu, En Yu
Comments: 26 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[163]  arXiv:2405.13451 [pdf, other]
Title: A Label Propagation Strategy for CutMix in Multi-Label Remote Sensing Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164]  arXiv:2405.13438 [pdf, ps, other]
Title: Dynamically enhanced static handwriting representation for Parkinson's disease detection
Journal-ref: Pattern Recognition Letters, vol. 128, pp. 204-210 (2019)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[165]  arXiv:2405.13397 [pdf, other]
Title: Multi Player Tracking in Ice Hockey with Homographic Projections
Comments: Accepted at the Conference on Robots and Vision (CRV), 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166]  arXiv:2405.13389 [pdf, other]
Title: HR-INR: Continuous Space-Time Video Super-Resolution via Event Camera
Comments: 30 pages, 20 figures, 8 tables. This work was submitted for review in the second half of 2023. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Robotics (cs.RO)
[167]  arXiv:2405.13388 [pdf, other]
Title: Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[168]  arXiv:2405.13382 [pdf, other]
Title: VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[169]  arXiv:2405.13376 [pdf, other]
Title: Markerless retro-identification complements re-identification of individual insect subjects in archived image data of biological experiments
Comments: Accepted to CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[170]  arXiv:2405.13374 [pdf, other]
Title: Collaboration of Teachers for Semi-supervised Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[171]  arXiv:2405.13360 [pdf, other]
Title: How to Trace Latent Generative Model Generated Images without Artificial Watermark?
Comments: ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[172]  arXiv:2405.13337 [pdf, other]
Title: Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[173]  arXiv:2405.13335 [pdf, other]
Title: Vision Transformer with Sparse Scan Prior
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174]  arXiv:2405.13285 [pdf, ps, other]
Title: Enhancing Active Learning for Sentinel 2 Imagery through Contrastive Learning and Uncertainty Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[175]  arXiv:2405.13278 [pdf, other]
Title: Single color virtual H&E staining with In-and-Out Net
Subjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[176]  arXiv:2405.13267 [pdf, other]
Title: FLARE up your data: Diffusion-based Augmentation Method in Astronomical Imaging
Comments: 15 pages main paper (including references), 3 pages supplementary material. Our code and SpaceNet dataset are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177]  arXiv:2405.13256 [pdf, ps, other]
Title: Traffic control using intelligent timing of traffic lights with reinforcement learning technique and real-time processing of surveillance camera images
Comments: 6th International Conference on Traffic Management and Safety, Tehran city, 12 pages in Persian
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[178]  arXiv:2405.13229 [pdf, other]
Title: Transfer Learning Approach for Railway Technical Map (RTM) Component Identification
Comments: 9 pages, 8 figures
Journal-ref: Lecture Notes in Networks and Systems: 465 (2022) 479-488
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Digital Libraries (cs.DL)
[179]  arXiv:2405.13218 [pdf, other]
Title: Computational Tradeoffs in Image Synthesis: Diffusion, Masked-Token, and Next-Token Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[180]  arXiv:2405.13206 [pdf, other]
Title: Identity-free Artificial Emotional Intelligence via Micro-Gesture Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[181]  arXiv:2405.13202 [pdf, other]
Title: Empowering Urban Traffic Management: Elevated 3D LiDAR for Data Collection and Advanced Object Detection Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[182]  arXiv:2405.13197 [pdf, other]
Title: Global-Local Detail Guided Transformer for Sea Ice Recognition in Optical Remote Sensing Images
Comments: 5 pages, 5 figures
Journal-ref: IEEE IGARSS 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183]  arXiv:2405.13195 [pdf, other]
Title: CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[184]  arXiv:2405.13194 [pdf, other]
Title: KPConvX: Modernizing Kernel Point Convolution with Kernel Attention
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185]  arXiv:2405.13152 [pdf, other]
Title: Enhancing Interaction Modeling with Agent Selection and Physical Methods for Trajectory Prediction
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[186]  arXiv:2405.13127 [pdf, other]
Title: Towards Retrieval-Augmented Architectures for Image Captioning
Comments: ACM Transactions on Multimedia Computing, Communications and Applications (2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[187]  arXiv:2405.13097 [pdf, other]
Title: NieR: Normal-Based Lighting Scene Rendering
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[188]  arXiv:2405.14802 (cross-list from eess.IV) [pdf, other]
Title: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[189]  arXiv:2405.14800 (cross-list from cs.CR) [pdf, other]
Title: Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Comments: 17 pages, 5 figures
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[190]  arXiv:2405.14791 (cross-list from cs.LG) [pdf, other]
Title: Recurrent Early Exits for Federated Learning with Heterogeneous Clients
Comments: Accepted at the 41st International Conference on Machine Learning (ICML 2024)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[191]  arXiv:2405.14768 (cross-list from cs.CL) [pdf, other]
Title: WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[192]  arXiv:2405.14731 (cross-list from cs.RO) [pdf, other]
Title: CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments
Comments: 8 pages, 8 figures, 4 tables. Accepted at IEEE Robotics and Automation Letters (RA-L) 2024
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[193]  arXiv:2405.14720 (cross-list from eess.IV) [pdf, other]
Title: Convolutional Neural Network Model Observers Discount Signal-like Anatomical Structures During Search in Virtual Digital Breast Tomosynthesis Phantoms
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[194]  arXiv:2405.14622 (cross-list from cs.LG) [pdf, other]
Title: Calibrated Self-Rewarding Vision Language Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[195]  arXiv:2405.14590 (cross-list from eess.IV) [pdf, other]
Title: MAMOC: MRI Motion Correction via Masked Autoencoding
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[196]  arXiv:2405.14522 (cross-list from cs.LG) [pdf, other]
Title: Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[197]  arXiv:2405.14477 (cross-list from cs.LG) [pdf, other]
Title: LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[198]  arXiv:2405.14453 (cross-list from eess.IV) [pdf, other]
Title: Domain-specific augmentations with resolution agnostic self-attention mechanism improves choroid segmentation in optical coherence tomography images
Comments: 13 pages, 2 figures, 8 tables (including supplementary material)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[199]  arXiv:2405.14327 (cross-list from eess.IV) [pdf, other]
Title: Autoregressive Image Diffusion: Generating Image Sequence and Application in MRI
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[200]  arXiv:2405.14313 (cross-list from cs.LG) [pdf, other]
Title: Smooth Pseudo-Labeling
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[201]  arXiv:2405.14304 (cross-list from cs.GR) [pdf, other]
Title: Exposure Diffusion: HDR Image Generation by Consistent LDR denoising
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[202]  arXiv:2405.14300 (cross-list from eess.IV) [pdf, other]
Title: Automatic diagnosis of cardiac magnetic resonance images based on semi-supervised learning
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[203]  arXiv:2405.14242 (cross-list from eess.IV) [pdf, ps, other]
Title: M2ANET: Mobile Malaria Attention Network for efficient classification of plasmodium parasites in blood cells
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[204]  arXiv:2405.14239 (cross-list from cs.LG) [pdf, other]
Title: Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations
Comments: 20 pages, 2 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[205]  arXiv:2405.14222 (cross-list from cs.LG) [pdf, other]
Title: RAQ-VAE: Rate-Adaptive Vector-Quantized Variational Autoencoder
Comments: Under review
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[206]  arXiv:2405.14221 (cross-list from eess.IV) [pdf, other]
Title: Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[207]  arXiv:2405.14205 (cross-list from cs.CL) [pdf, other]
Title: Agent Planning with World Knowledge Model
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[208]  arXiv:2405.14189 (cross-list from cs.CL) [pdf, other]
Title: Semantic-guided Prompt Organization for Universal Goal Hijacking against LLMs
Comments: 15 pages
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[209]  arXiv:2405.14147 (cross-list from cs.LG) [pdf, other]
Title: Minimum number of neurons in fully connected layers of a given neural network (the first approximation)
Authors: Oleg I. Berngardt
Comments: 21 pages, 2 figures, 1 table
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[210]  arXiv:2405.14129 (cross-list from cs.CL) [pdf, other]
Title: AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability
Comments: Code and models are available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)