We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 370

[ total of 790 entries: 1-25 | ... | 296-320 | 321-345 | 346-370 | 371-395 | 396-420 | 421-445 | 446-470 | ... | 771-790 ]
[ showing 25 entries per page: fewer | more | all ]

Wed, 29 May 2024 (continued, showing 25 of 152 entries)

[371]  arXiv:2405.17455 [pdf, other]
Title: WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph); Machine Learning (stat.ML)
[372]  arXiv:2405.17450 [pdf, other]
Title: The Power of Next-Frame Prediction for Learning Physical Laws
Comments: 7 Figures, 12 Pages, 1 Table
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[373]  arXiv:2405.17449 [pdf, ps, other]
Title: Image Based Character Recognition, Documentation System To Decode Inscription From Temple
Comments: This research paper is a part of capstone project submitted to VIT Chennai, VIT University
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[374]  arXiv:2405.17447 [pdf, other]
Title: How to train your ViT for OOD Detection
Comments: arXiv admin note: text overlap with arXiv:2306.00826
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[375]  arXiv:2405.17444 [pdf, other]
Title: Towards Gradient-based Time-Series Explanations through a SpatioTemporal Attention Network
Authors: Min Hun Lee
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[376]  arXiv:2405.18418 (cross-list from cs.LG) [pdf, other]
Title: Hierarchical World Models as Visual Whole-Body Humanoid Controllers
Comments: Code and videos at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[377]  arXiv:2405.18410 (cross-list from eess.IV) [pdf, other]
Title: Towards a Sampling Theory for Implicit Neural Representations
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[378]  arXiv:2405.18407 (cross-list from cs.LG) [pdf, other]
Title: Phased Consistency Model
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[379]  arXiv:2405.18376 (cross-list from cs.LG) [pdf, other]
Title: Empowering Source-Free Domain Adaptation with MLLM-driven Curriculum Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[380]  arXiv:2405.18358 (cross-list from cs.CL) [pdf, other]
Title: MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[381]  arXiv:2405.18356 (cross-list from eess.IV) [pdf, other]
Title: Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography
Comments: Accepted to Medical Image Analysis
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[382]  arXiv:2405.18334 (cross-list from cs.DB) [pdf, other]
Title: SketchQL Demonstration: Zero-shot Video Moment Querying with Sketches
Journal-ref: Published on International Conference on Very Large Databases 2024
Subjects: Databases (cs.DB); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[383]  arXiv:2405.18327 (cross-list from q-bio.QM) [pdf, ps, other]
Title: Histopathology Based AI Model Predicts Anti-Angiogenic Therapy Response in Renal Cancer Clinical Trial
Comments: 19 pages, 4 Figures
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[384]  arXiv:2405.18267 (cross-list from eess.IV) [pdf, other]
Title: CT-based brain ventricle segmentation via diffusion Schrödinger Bridge without target domain ground truths
Comments: Early acceptance at MICCAI2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[385]  arXiv:2405.18236 (cross-list from cs.CR) [pdf, other]
Title: Position Paper: Think Globally, React Locally -- Bringing Real-time Reference-based Website Phishing Detection on macOS
Comments: 8 pages, 7 figures, 8 tables. Accepted to STAST'24, 14th International Workshop on Socio-Technical Aspects in Security, Affiliated with the 9th IEEE European Symposium on Security and Privacy, this https URL
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[386]  arXiv:2405.18213 (cross-list from cs.SD) [pdf, other]
Title: NeRAF: 3D Scene Infused Neural Radiance and Acoustic Fields
Comments: Project Page: this https URL
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[387]  arXiv:2405.18196 (cross-list from cs.RO) [pdf, other]
Title: Render and Diffuse: Aligning Image and Action Spaces for Diffusion-based Behaviour Cloning
Comments: Robotics: Science and Systems (RSS) 2024. Videos are available on our project webpage at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[388]  arXiv:2405.18193 (cross-list from cs.LG) [pdf, other]
Title: In-Context Symmetries: Self-Supervised Learning through Contextual World Models
Comments: 32 pages, 24 tables and 11 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[389]  arXiv:2405.18167 (cross-list from eess.IV) [pdf, other]
Title: Confidence-aware multi-modality learning for eye disease screening
Comments: 27 pages, 7 figures, 9 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[390]  arXiv:2405.18064 (cross-list from cs.AI) [pdf, ps, other]
Title: Automated Real-World Sustainability Data Generation from Images of Buildings
Comments: 6 pages
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[391]  arXiv:2405.18045 (cross-list from cs.LG) [pdf, other]
Title: Bridging Mini-Batch and Asymptotic Analysis in Contrastive Learning: From InfoNCE to Kernel-Based Losses
Comments: Accepted at ICML 2024. Code available at: this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[392]  arXiv:2405.17969 (cross-list from cs.CL) [pdf, other]
Title: Knowledge Circuits in Pretrained Transformers
Comments: Work in progress, 25 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[393]  arXiv:2405.17927 (cross-list from cs.AI) [pdf, other]
Title: The Evolution of Multimodal Model Architectures
Comments: 30 pages, 6 tables, 7 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[394]  arXiv:2405.17811 (cross-list from cs.GR) [pdf, other]
Title: Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh
Comments: Project page here: this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[395]  arXiv:2405.17769 (cross-list from cs.RO) [pdf, other]
Title: Microsaccade-inspired Event Camera for Robotics
Comments: Published on Science Robotics June 2024 issue
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[ total of 790 entries: 1-25 | ... | 296-320 | 321-345 | 346-370 | 371-395 | 396-420 | 421-445 | 446-470 | ... | 771-790 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help  (Access key information)