We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 25

[ total of 565 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | ... | 551-565 ]
[ showing 25 entries per page: fewer | more | all ]

Tue, 16 Apr 2024 (continued, showing 25 of 195 entries)

[26]  arXiv:2404.09857 [pdf, other]
Title: Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[27]  arXiv:2404.09846 [pdf, other]
Title: A Diffusion-based Data Generator for Training Object Recognition Models in Ultra-Range Distance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[28]  arXiv:2404.09842 [pdf, other]
Title: STMixer: A One-Stage Sparse Action Detector
Comments: Extended version of the paper arXiv:2303.15879 presented at CVPR 2023. Accepted by TPAMI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[29]  arXiv:2404.09833 [pdf, other]
Title: Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video
Comments: CVPR 2024. Project page (with code): this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[30]  arXiv:2404.09831 [pdf, other]
Title: Digging into contrastive learning for robust depth estimation with diffusion models
Comments: 8 pages,6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[31]  arXiv:2404.09826 [pdf, other]
Title: A Recipe for CAC: Mosaic-based Generalized Loss for Improved Class-Agnostic Counting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[32]  arXiv:2404.09819 [pdf, other]
Title: 3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow
Comments: 22 pages, 25 figures, to be published in CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[33]  arXiv:2404.09807 [pdf, other]
Title: A Universal Protocol to Benchmark Camera Calibration for Sports
Comments: 12 pages, 5 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[34]  arXiv:2404.09797 [pdf, other]
Title: TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[35]  arXiv:2404.09790 [pdf, other]
[36]  arXiv:2404.09778 [pdf, other]
Title: The Devil is in the Few Shots: Iterative Visual Knowledge Completion for Few-shot Learning
Comments: 26 pages, submitted to ECCV 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[37]  arXiv:2404.09768 [pdf, other]
Title: Contrastive Pretraining for Visual Concept Explanations of Socioeconomic Outcomes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[38]  arXiv:2404.09752 [pdf, other]
Title: Can We Break Free from Strong Data Augmentations in Self-Supervised Learning?
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[39]  arXiv:2404.09748 [pdf, other]
Title: LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[40]  arXiv:2404.09736 [pdf, other]
Title: FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features
Comments: Accepted to CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41]  arXiv:2404.09735 [pdf, other]
Title: Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement
Comments: CVPRW 2024, best LPIPS in the NTIRE low light enhancement challenge 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[42]  arXiv:2404.09732 [pdf, other]
Title: Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models
Comments: CVPRW 2024; Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[43]  arXiv:2404.09707 [pdf, other]
Title: Adaptive Patching for High-resolution Image Segmentation with Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[44]  arXiv:2404.09697 [pdf, other]
Title: HSIDMamba: Exploring Bidirectional State-Space Models for Hyperspectral Denoising
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[45]  arXiv:2404.09692 [pdf, other]
Title: XoFTR: Cross-modal Feature Matching Transformer
Comments: CVPR Image Matching Workshop, 2024. 12 pages, 7 figures, 5 tables. Codes and dataset are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[46]  arXiv:2404.09690 [pdf, other]
Title: Harnessing GPT-4V(ision) for Insurance: A Preliminary Exploration
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[47]  arXiv:2404.09654 [pdf, other]
Title: Do LLMs Understand Visual Anomalies? Uncovering LLM Capabilities in Zero-shot Anomaly Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[48]  arXiv:2404.09640 [pdf, other]
Title: CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning
Comments: Ongoing work; 10 pages, 2 Tables, 9 Figures; Repo is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[49]  arXiv:2404.09633 [pdf, other]
Title: In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50]  arXiv:2404.09632 [pdf, other]
Title: Bridging Vision and Language Spaces with Assignment Prediction
Comments: ICLR 2024 Camera-ready
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[ total of 565 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | ... | 551-565 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2404, contact, help  (Access key information)