Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 118

Fri, 24 May 2024 (continued, showing 50 of 242 entries)

[119]  arXiv:2405.13979 [pdf, ps, other]
Title: Optimizing Curvature Learning for Robust Hyperbolic Deep Learning in Computer Vision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[120]  arXiv:2405.13951 [pdf, other]
Title: Text Prompting for Multi-Concept Video Customization by Autoregressive Generation
Comments: Paper accepted to AI4CC Workshop at CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121]  arXiv:2405.13949 [pdf, other]
Title: PitVQA: Image-grounded Text Embedding LLM for Visual Question Answering in Pituitary Surgery
Comments: 10 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[122]  arXiv:2405.13943 [pdf, other]
Title: DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus
Authors: Yu Chen, Gim Hee Lee
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[123]  arXiv:2405.13911 [pdf, other]
Title: TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
Comments: 32 pages, 12 figures, 11 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[124]  arXiv:2405.13903 [pdf, other]
Title: ST-Gait++: Leveraging spatio-temporal convolutions for gait-based emotion recognition on videos
Comments: Accepted for publication in the LXCV Workshop @ CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[125]  arXiv:2405.13901 [pdf, other]
Title: DCT-Based Decorrelated Attention for Vision Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[126]  arXiv:2405.13896 [pdf, other]
Title: A General Framework for Jersey Number Recognition in Sports Video
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2405.13874 [pdf, other]
Title: Affine-based Deformable Attention and Selective Fusion for Semi-dense Matching
Comments: Accepted to CVPR2024 Image Matching Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128]  arXiv:2405.13870 [pdf, other]
Title: FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
Comments: CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[129]  arXiv:2405.13865 [pdf, other]
Title: ReVideo: Remake a Video with Motion and Content Control
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[130]  arXiv:2405.13864 [pdf, other]
Title: Just rotate it! Uncertainty estimation in closed-source models via multiple queries
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[131]  arXiv:2405.13860 [pdf, other]
Title: MAGIC: Map-Guided Few-Shot Audio-Visual Acoustics Modeling
Comments: 17 pages, 12 pages for main paper, 5 pages for supplementary
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[132]  arXiv:2405.13859 [pdf, other]
Title: QGait: Toward Accurate Quantization for Gait Recognition with Binarized Input
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133]  arXiv:2405.13824 [pdf, other]
Title: GMMFormer v2: An Uncertainty-aware Framework for Partially Relevant Video Retrieval
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[134]  arXiv:2405.13800 [pdf, other]
Title: Dense Connector for MLLMs
Comments: Technical report. 25 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[135]  arXiv:2405.13781 [pdf, other]
Title: Addressing the Elephant in the Room: Robust Animal Re-Identification with Unsupervised Part-Based Feature Alignment
Comments: Accepted to CVPR workshop CV4Animals 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[136]  arXiv:2405.13779 [pdf, other]
Title: Robust Disaster Assessment from Aerial Imagery Using Text-to-Image Synthetic Data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[137]  arXiv:2405.13777 [pdf, other]
Title: No Filter: Cultural and Socioeconomic Diversity in Contrastive Vision-Language Models
Comments: 15 pages, 5 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[138]  arXiv:2405.13762 [pdf, other]
Title: A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[139]  arXiv:2405.13758 [pdf, other]
Title: Counterfactual Gradients-based Quantification of Prediction Trust in Neural Networks
Comments: 2024 IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[140]  arXiv:2405.13748 [pdf, other]
Title: Monocular Gaussian SLAM with Language Extended Loop Closure
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[141]  arXiv:2405.13745 [pdf, other]
Title: NeurCross: A Self-Supervised Neural Approach for Representing Cross Fields in Quad Mesh Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[142]  arXiv:2405.13722 [pdf, other]
Title: InstaDrag: Lightning Fast and Accurate Drag-based Image Editing Emerging from Videos
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[143]  arXiv:2405.13694 [pdf, other]
Title: Gaussian Time Machine: A Real-Time Rendering Methodology for Time-Variant Appearances
Comments: 14 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[144]  arXiv:2405.13686 [pdf, other]
Title: Embedding Generalized Semantic Knowledge into Few-Shot Remote Sensing Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[145]  arXiv:2405.13685 [pdf, other]
Title: Prompt Mixing in Diffusion Models using the Black Scholes Algorithm
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[146]  arXiv:2405.13675 [pdf, other]
Title: Context and Geometry Aware Voxel Transformer for Semantic Scene Completion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[147]  arXiv:2405.13672 [pdf, other]
Title: Advancing Spiking Neural Networks towards Multiscale Spatiotemporal Interaction Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148]  arXiv:2405.13659 [pdf, other]
Title: EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views
Comments: 23 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[149]  arXiv:2405.13637 [pdf, other]
Title: Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[150]  arXiv:2405.13581 [pdf, other]
Title: Safety Alignment for Vision Language Models
Comments: 23 pages, 15 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[151]  arXiv:2405.13580 [pdf, other]
Title: AltChart: Enhancing VLM-based Chart Summarization Through Multi-Pretext Tasks
Comments: Accepted in ICDAR 2024. Project page is at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[152]  arXiv:2405.13571 [pdf, other]
Title: Cross-Modal Distillation in Industrial Anomaly Detection: Exploring Efficient Multi-Modal IAD
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[153]  arXiv:2405.13570 [pdf, other]
Title: MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing Image Generation
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[154]  arXiv:2405.13555 [pdf, ps, other]
Title: A Perspective Analysis of Handwritten Signature Technology
Journal-ref: ACM Computing Surveys (CSUR), vol.51, no 6, pp. 117:1-117:39 (2018)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[155]  arXiv:2405.13540 [pdf, other]
Title: Directly Denoising Diffusion Model
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[156]  arXiv:2405.13538 [pdf, other]
Title: Ultra-Fast Adaptive Track Detection Network
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157]  arXiv:2405.13532 [pdf, other]
Title: What Makes Good Few-shot Examples for Vision-Language Models?
Comments: 8 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[158]  arXiv:2405.13518 [pdf, other]
Title: PerSense: Personalized Instance Segmentation in Dense Images
Comments: Technical report of PerSense
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[159]  arXiv:2405.13482 [pdf, other]
Title: Continual Learning in Medical Imaging from Theory to Practice: A Survey and Practical Analysis
Comments: 15 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160]  arXiv:2405.13473 [pdf, other]
Title: Class-Conditional self-reward mechanism for improved Text-to-Image models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[161]  arXiv:2405.13467 [pdf, other]
Title: AdaFedFR: Federated Face Recognition with Adaptive Inter-Class Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[162]  arXiv:2405.13459 [pdf, other]
Title: Adapting Multi-modal Large Language Model to Concept Drift in the Long-tailed Open World
Authors: Xiaoyu Yang, Jie Lu, En Yu
Comments: 26 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[163]  arXiv:2405.13451 [pdf, other]
Title: A Label Propagation Strategy for CutMix in Multi-Label Remote Sensing Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164]  arXiv:2405.13438 [pdf, ps, other]
Title: Dynamically enhanced static handwriting representation for Parkinson's disease detection
Journal-ref: Pattern Recognition Letters, vol. 128, pp. 204-210 (2019)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[165]  arXiv:2405.13397 [pdf, other]
Title: Multi Player Tracking in Ice Hockey with Homographic Projections
Comments: Accepted at the Conference on Robots and Vision (CRV), 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166]  arXiv:2405.13389 [pdf, other]
Title: HR-INR: Continuous Space-Time Video Super-Resolution via Event Camera
Comments: 30 pages, 20 figures, 8 tables. This work was submitted for review in the second half of 2023. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Robotics (cs.RO)
[167]  arXiv:2405.13388 [pdf, other]
Title: Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[168]  arXiv:2405.13382 [pdf, other]
Title: VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
Subjects: Computer Vision and Pattern Recognition (cs.CV)