Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 439

[ total of 739 entries: 1-50 | ... | 290-339 | 340-389 | 390-439 | 440-489 | 490-539 | 540-589 | 590-639 | ... | 690-739 ]

Wed, 29 May 2024 (continued, showing 50 of 152 entries)

[440]  arXiv:2405.17705 [pdf, other]
Title: DC-Gaussian: Improving 3D Gaussian Splatting for Reflective Dash Cam Videos
Comments: 9 pages, 7 figures; project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[441]  arXiv:2405.17704 [pdf, other]
Title: Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation
Comments: Accepted to Conference on Lifelong Learning Agents (CoLLAs) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[442]  arXiv:2405.17698 [pdf, other]
Title: BaboonLand Dataset: Tracking Primates in the Wild and Automating Behaviour Recognition from Drone Videos
Comments: Dataset will be published shortly
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[443]  arXiv:2405.17686 [pdf, other]
Title: Towards Causal Physical Error Discovery in Video Analytics Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[444]  arXiv:2405.17680 [pdf, other]
Title: Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent
Authors: Yi Xu, Yun Fu
Comments: Datasets, code, and model weights are available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[445]  arXiv:2405.17678 [pdf, other]
Title: TIMA: Text-Image Mutual Awareness for Balancing Zero-Shot Adversarial Robustness and Generalization Ability
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[446]  arXiv:2405.17677 [pdf, other]
Title: Understanding differences in applying DETR to natural and medical images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[447]  arXiv:2405.17673 [pdf, other]
Title: Fast Samplers for Inverse Problems in Iterative Refinement Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[448]  arXiv:2405.17661 [pdf, other]
Title: RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[449]  arXiv:2405.17660 [pdf, other]
Title: LoReTrack: Efficient and Accurate Low-Resolution Transformer Tracking
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[450]  arXiv:2405.17613 [pdf, other]
Title: A Framework for Multi-modal Learning: Jointly Modeling Inter- & Intra-Modality Dependencies
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[451]  arXiv:2405.17609 [pdf, other]
Title: GarmentCodeData: A Dataset of 3D Made-to-Measure Garments With Sewing Patterns
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[452]  arXiv:2405.17596 [pdf, other]
Title: GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
Comments: Our project page is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[453]  arXiv:2405.17568 [pdf, other]
Title: ExtremeMETA: High-speed Lightweight Image Segmentation Model by Remodeling Multi-channel Metamaterial Imagers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[454]  arXiv:2405.17532 [pdf, other]
Title: ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[455]  arXiv:2405.17531 [pdf, other]
Title: Evolutive Rendering Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[456]  arXiv:2405.17523 [pdf, other]
Title: Locally Testing Model Detections for Semantic Global Concepts
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[457]  arXiv:2405.17475 [pdf, other]
Title: How Culturally Aware are Vision-Language Models?
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[458]  arXiv:2405.17457 [pdf, other]
Title: Data-Free Federated Class Incremental Learning with Diffusion-Based Generative Memory
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[459]  arXiv:2405.17456 [pdf, other]
Title: Optimized Linear Measurements for Inverse Problems using Diffusion-Based Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[460]  arXiv:2405.17455 [pdf, other]
Title: WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph); Machine Learning (stat.ML)
[461]  arXiv:2405.17450 [pdf, other]
Title: The Power of Next-Frame Prediction for Learning Physical Laws
Comments: 7 Figures, 12 Pages, 1 Table
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[462]  arXiv:2405.17449 [pdf, ps, other]
Title: Image Based Character Recognition, Documentation System To Decode Inscription From Temple
Comments: This research paper is a part of capstone project submitted to VIT Chennai, VIT University
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[463]  arXiv:2405.17447 [pdf, other]
Title: How to train your ViT for OOD Detection
Comments: arXiv admin note: text overlap with arXiv:2306.00826
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[464]  arXiv:2405.17444 [pdf, other]
Title: Towards Gradient-based Time-Series Explanations through a SpatioTemporal Attention Network
Authors: Min Hun Lee
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[465]  arXiv:2405.18418 (cross-list from cs.LG) [pdf, other]
Title: Hierarchical World Models as Visual Whole-Body Humanoid Controllers
Comments: Code and videos at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[466]  arXiv:2405.18410 (cross-list from eess.IV) [pdf, other]
Title: Towards a Sampling Theory for Implicit Neural Representations
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[467]  arXiv:2405.18407 (cross-list from cs.LG) [pdf, other]
Title: Phased Consistency Model
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[468]  arXiv:2405.18376 (cross-list from cs.LG) [pdf, other]
Title: Empowering Source-Free Domain Adaptation with MLLM-driven Curriculum Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[469]  arXiv:2405.18358 (cross-list from cs.CL) [pdf, other]
Title: MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[470]  arXiv:2405.18356 (cross-list from eess.IV) [pdf, other]
Title: Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography
Comments: Accepted to Medical Image Analysis
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[471]  arXiv:2405.18334 (cross-list from cs.DB) [pdf, other]
Title: SketchQL Demonstration: Zero-shot Video Moment Querying with Sketches
Journal-ref: Published at the International Conference on Very Large Databases 2024
Subjects: Databases (cs.DB); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[472]  arXiv:2405.18327 (cross-list from q-bio.QM) [pdf, ps, other]
Title: Histopathology Based AI Model Predicts Anti-Angiogenic Therapy Response in Renal Cancer Clinical Trial
Comments: 19 pages, 4 Figures
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[473]  arXiv:2405.18267 (cross-list from eess.IV) [pdf, other]
Title: CT-based brain ventricle segmentation via diffusion Schrödinger Bridge without target domain ground truths
Comments: Early acceptance at MICCAI2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[474]  arXiv:2405.18236 (cross-list from cs.CR) [pdf, other]
Title: Position Paper: Think Globally, React Locally -- Bringing Real-time Reference-based Website Phishing Detection on macOS
Comments: 8 pages, 7 figures, 8 tables. Accepted to STAST'24, 14th International Workshop on Socio-Technical Aspects in Security, Affiliated with the 9th IEEE European Symposium on Security and Privacy, this https URL
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[475]  arXiv:2405.18213 (cross-list from cs.SD) [pdf, other]
Title: NeRAF: 3D Scene Infused Neural Radiance and Acoustic Fields
Comments: Project Page: this https URL
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[476]  arXiv:2405.18196 (cross-list from cs.RO) [pdf, other]
Title: Render and Diffuse: Aligning Image and Action Spaces for Diffusion-based Behaviour Cloning
Comments: Robotics: Science and Systems (RSS) 2024. Videos are available on our project webpage at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[477]  arXiv:2405.18193 (cross-list from cs.LG) [pdf, other]
Title: In-Context Symmetries: Self-Supervised Learning through Contextual World Models
Comments: 32 pages, 24 tables and 11 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[478]  arXiv:2405.18167 (cross-list from eess.IV) [pdf, other]
Title: Confidence-aware multi-modality learning for eye disease screening
Comments: 27 pages, 7 figures, 9 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[479]  arXiv:2405.18064 (cross-list from cs.AI) [pdf, ps, other]
Title: Automated Real-World Sustainability Data Generation from Images of Buildings
Comments: 6 pages
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[480]  arXiv:2405.18045 (cross-list from cs.LG) [pdf, other]
Title: Bridging Mini-Batch and Asymptotic Analysis in Contrastive Learning: From InfoNCE to Kernel-Based Losses
Comments: Accepted at ICML 2024. Code available at: this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[481]  arXiv:2405.17969 (cross-list from cs.CL) [pdf, other]
Title: Knowledge Circuits in Pretrained Transformers
Comments: Work in progress, 25 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[482]  arXiv:2405.17927 (cross-list from cs.AI) [pdf, other]
Title: The Evolution of Multimodal Model Architectures
Comments: 30 pages, 6 tables, 7 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[483]  arXiv:2405.17811 (cross-list from cs.GR) [pdf, other]
Title: Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh
Comments: Project page here: this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[484]  arXiv:2405.17769 (cross-list from cs.RO) [pdf, other]
Title: Microsaccade-inspired Event Camera for Robotics
Comments: Published in Science Robotics, June 2024 issue
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[485]  arXiv:2405.17756 (cross-list from eess.IV) [pdf, ps, other]
Title: Motion-Informed Deep Learning for Brain MR Image Reconstruction Framework
Comments: 22 pages, 7 figures, 4 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[486]  arXiv:2405.17706 (cross-list from cs.AI) [pdf, other]
Title: Video Enriched Retrieval Augmented Generation Using Aligned Video Captions
Authors: Kevin Dela Rosa
Comments: SIGIR 2024 Workshop on Multimodal Representation and Retrieval (MRR 2024)
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[487]  arXiv:2405.17663 (cross-list from cs.LG) [pdf, other]
Title: What's the Opposite of a Face? Finding Shared Decodable Concepts and their Negations in the Brain
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[488]  arXiv:2405.17659 (cross-list from eess.IV) [pdf, other]
Title: Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[489]  arXiv:2405.17537 (cross-list from cs.AI) [pdf, other]
Title: BIOSCAN-CLIP: Bridging Vision and Genomics for Biodiversity Monitoring at Scale
Comments: 16 pages with 9 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)