Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 366

[ total of 790 entries: 1-25 | ... | 292-316 | 317-341 | 342-366 | 367-391 | 392-416 | 417-441 | 442-466 | ... | 767-790 ]
[ showing 25 entries per page: fewer | more | all ]

Wed, 29 May 2024 (continued, showing 25 of 152 entries)

[367] arXiv:2405.17523 [pdf, other]: Title: Locally Testing Model Detections for Semantic Global Concepts

Authors: Franz Motzkus, Georgii Mikriukov, Christian Hellert, Ute Schmid

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[368] arXiv:2405.17475 [pdf, other]: Title: How Culturally Aware are Vision-Language Models?

Authors: Olena Burda-Lassen, Aman Chadha, Shashank Goswami, Vinija Jain

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[369] arXiv:2405.17457 [pdf, other]: Title: Data-Free Federated Class Incremental Learning with Diffusion-Based Generative Memory

Authors: Naibo Wang, Yuchen Deng, Wenjie Feng, Jianwei Yin, See-Kiong Ng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[370] arXiv:2405.17456 [pdf, other]: Title: Optimized Linear Measurements for Inverse Problems using Diffusion-Based Image Generation

Authors: Ling-Qi Zhang, Zahra Kadkhodaie, Eero P. Simoncelli, David H. Brainard

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[371] arXiv:2405.17455 [pdf, other]: Title: WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets

Authors: Adib Hasan, Mardavij Roozbehani, Munther Dahleh

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph); Machine Learning (stat.ML)
[372] arXiv:2405.17450 [pdf, other]: Title: The Power of Next-Frame Prediction for Learning Physical Laws

Authors: Thomas Winterbottom, G. Thomas Hudson, Daniel Kluvanec, Dean Slack, Jamie Sterling, Junjie Shentu, Chenghao Xiao, Zheming Zhou, Noura Al Moubayed

Comments: 7 Figures, 12 Pages, 1 Table

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[373] arXiv:2405.17449 [pdf, ps, other]: Title: Image Based Character Recognition, Documentation System To Decode Inscription From Temple

Authors: Velmathi G, Shangavelan M, Harish D, Krithikshun M S

Comments: This research paper is a part of capstone project submitted to VIT Chennai, VIT University

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[374] arXiv:2405.17447 [pdf, other]: Title: How to train your ViT for OOD Detection

Authors: Maximilian Mueller, Matthias Hein

Comments: arXiv admin note: text overlap with arXiv:2306.00826

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[375] arXiv:2405.17444 [pdf, other]: Title: Towards Gradient-based Time-Series Explanations through a SpatioTemporal Attention Network

Authors: Min Hun Lee

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[376] arXiv:2405.18418 (cross-list from cs.LG) [pdf, other]: Title: Hierarchical World Models as Visual Whole-Body Humanoid Controllers

Authors: Nicklas Hansen, Jyothir S V, Vlad Sobal, Yann LeCun, Xiaolong Wang, Hao Su

Comments: Code and videos at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[377] arXiv:2405.18410 (cross-list from eess.IV) [pdf, other]: Title: Towards a Sampling Theory for Implicit Neural Representations

Authors: Mahrokh Najaf, Gregory Ongie

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[378] arXiv:2405.18407 (cross-list from cs.LG) [pdf, other]: Title: Phased Consistency Model

Authors: Fu-Yun Wang, Zhaoyang Huang, Alexander William Bergman, Dazhong Shen, Peng Gao, Michael Lingelbach, Keqiang Sun, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li, Xiaogang Wang

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[379] arXiv:2405.18376 (cross-list from cs.LG) [pdf, other]: Title: Empowering Source-Free Domain Adaptation with MLLM-driven Curriculum Learning

Authors: Dongjie Chen, Kartik Patwari, Zhengfeng Lai, Sen-ching Cheung, Chen-Nee Chuah

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[380] arXiv:2405.18358 (cross-list from cs.CL) [pdf, other]: Title: MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning

Authors: Somnath Kumar, Yash Gadhia, Tanuja Ganu, Akshay Nambi

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[381] arXiv:2405.18356 (cross-list from eess.IV) [pdf, other]: Title: Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography

Authors: Jie Liu, Yixiao Zhang, Kang Wang, Mehmet Can Yavuz, Xiaoxi Chen, Yixuan Yuan, Haoliang Li, Yang Yang, Alan Yuille, Yucheng Tang, Zongwei Zhou

Comments: Accepted to Medical Image Analysis

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[382] arXiv:2405.18334 (cross-list from cs.DB) [pdf, other]: Title: SketchQL Demonstration: Zero-shot Video Moment Querying with Sketches

Authors: Renzhi Wu, Pramod Chunduri, Dristi J Shah, Ashmitha Julius Aravind, Ali Payani, Xu Chu, Joy Arulraj, Kexin Rong

Journal-ref: Published on International Conference on Very Large Databases 2024

Subjects: Databases (cs.DB); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[383] arXiv:2405.18327 (cross-list from q-bio.QM) [pdf, ps, other]: Title: Histopathology Based AI Model Predicts Anti-Angiogenic Therapy Response in Renal Cancer Clinical Trial

Authors: Jay Jasti, Hua Zhong, Vandana Panwar, Vipul Jarmale, Jeffrey Miyata, Deyssy Carrillo, Alana Christie, Dinesh Rakheja, Zora Modrusan, Edward Ernest Kadel III, Niha Beig, Mahrukh Huseni, James Brugarolas, Payal Kapur, Satwik Rajaram

Comments: 19 pages, 4 Figures

Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[384] arXiv:2405.18267 (cross-list from eess.IV) [pdf, other]: Title: CT-based brain ventricle segmentation via diffusion Schrödinger Bridge without target domain ground truths

Authors: Reihaneh Teimouri, Marta Kersten-Oertel, Yiming Xiao

Comments: Early acceptance at MICCAI2024

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[385] arXiv:2405.18236 (cross-list from cs.CR) [pdf, other]: Title: Position Paper: Think Globally, React Locally -- Bringing Real-time Reference-based Website Phishing Detection on macOS

Authors: Ivan Petrukha, Nataliia Stulova, Sergii Kryvoblotskyi

Comments: 8 pages, 7 figures, 8 tables. Accepted to STAST'24, 14th International Workshop on Socio-Technical Aspects in Security, Affiliated with the 9th IEEE European Symposium on Security and Privacy, this https URL

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[386] arXiv:2405.18213 (cross-list from cs.SD) [pdf, other]: Title: NeRAF: 3D Scene Infused Neural Radiance and Acoustic Fields

Authors: Amandine Brunetto, Sascha Hornauer, Fabien Moutarde

Comments: Project Page: this https URL

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[387] arXiv:2405.18196 (cross-list from cs.RO) [pdf, other]: Title: Render and Diffuse: Aligning Image and Action Spaces for Diffusion-based Behaviour Cloning

Authors: Vitalis Vosylius, Younggyo Seo, Jafar Uruç, Stephen James

Comments: Robotics: Science and Systems (RSS) 2024. Videos are available on our project webpage at this https URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[388] arXiv:2405.18193 (cross-list from cs.LG) [pdf, other]: Title: In-Context Symmetries: Self-Supervised Learning through Contextual World Models

Authors: Sharut Gupta, Chenyu Wang, Yifei Wang, Tommi Jaakkola, Stefanie Jegelka

Comments: 32 pages, 24 tables and 11 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[389] arXiv:2405.18167 (cross-list from eess.IV) [pdf, other]: Title: Confidence-aware multi-modality learning for eye disease screening

Authors: Ke Zou, Tian Lin, Zongbo Han, Meng Wang, Xuedong Yuan, Haoyu Chen, Changqing Zhang, Xiaojing Shen, Huazhu Fu

Comments: 27 pages, 7 figures, 9 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[390] arXiv:2405.18064 (cross-list from cs.AI) [pdf, ps, other]: Title: Automated Real-World Sustainability Data Generation from Images of Buildings

Authors: Peter J Bentley, Soo Ling Lim, Rajat Mathur, Sid Narang

Comments: 6 pages

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[391] arXiv:2405.18045 (cross-list from cs.LG) [pdf, other]: Title: Bridging Mini-Batch and Asymptotic Analysis in Contrastive Learning: From InfoNCE to Kernel-Based Losses

Authors: Panagiotis Koromilas, Giorgos Bouritsas, Theodoros Giannakopoulos, Mihalis Nicolaou, Yannis Panagakis

Comments: Accepted at ICML 2024. Code available at: this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)

[ total of 790 entries: 1-25 | ... | 292-316 | 317-341 | 342-366 | 367-391 | 392-416 | 417-441 | 442-466 | ... | 767-790 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 366

Wed, 29 May 2024 (continued, showing 25 of 152 entries)