Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 439

[ total of 739 entries: 1-50 | ... | 290-339 | 340-389 | 390-439 | 440-489 | 490-539 | 540-589 | 590-639 | ... | 690-739 ]
[ showing 50 entries per page: fewer | more | all ]

Wed, 29 May 2024 (continued, showing 50 of 152 entries)

[440] arXiv:2405.17705 [pdf, other]: Title: DC-Gaussian: Improving 3D Gaussian Splatting for Reflective Dash Cam Videos

Authors: Linhan Wang, Kai Cheng, Shuo Lei, Shengkun Wang, Wei Yin, Chenyang Lei, Xiaoxiao Long, Chang-Tien Lu

Comments: 9 pages,7 figures;project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[441] arXiv:2405.17704 [pdf, other]: Title: Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation

Authors: Amir El-Ghoussani, Julia Hornauer, Gustavo Carneiro, Vasileios Belagiannis

Comments: Accepted to Conference on Lifelong Learning Agents (CoLLAs) 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[442] arXiv:2405.17698 [pdf, other]: Title: BaboonLand Dataset: Tracking Primates in the Wild and Automating Behaviour Recognition from Drone Videos

Authors: Isla Duporge, Maksim Kholiavchenko, Roi Harel, Scott Wolf, Dan Rubenstein, Meg Crofoot, Tanya Berger-Wolf, Stephen Lee, Julie Barreau, Jenna Kline, Michelle Ramirez, Chuck Stewart

Comments: Dataset will be published shortly

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[443] arXiv:2405.17686 [pdf, other]: Title: Towards Causal Physical Error Discovery in Video Analytics Systems

Authors: Jinjin Zhao, Ted Shaowang, Stavos Sintos, Sanjay Krishnan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[444] arXiv:2405.17680 [pdf, other]: Title: Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent

Authors: Yi Xu, Yun Fu

Comments: Datasets, code, and model weights at available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[445] arXiv:2405.17678 [pdf, other]: Title: TIMA: Text-Image Mutual Awareness for Balancing Zero-Shot Adversarial Robustness and Generalization Ability

Authors: Fengji Ma, Li Liu, Hei Victor Cheng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[446] arXiv:2405.17677 [pdf, other]: Title: Understanding differences in applying DETR to natural and medical images

Authors: Yanqi Xu, Yiqiu Shen, Carlos Fernandez-Granda, Laura Heacock, Krzysztof J. Geras

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[447] arXiv:2405.17673 [pdf, other]: Title: Fast Samplers for Inverse Problems in Iterative Refinement Models

Authors: Kushagra Pandey, Ruihan Yang, Stephan Mandt

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[448] arXiv:2405.17661 [pdf, other]: Title: RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance

Authors: Jiaojiao Fan, Haotian Xue, Qinsheng Zhang, Yongxin Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[449] arXiv:2405.17660 [pdf, other]: Title: LoReTrack: Efficient and Accurate Low-Resolution Transformer Tracking

Authors: Shaohua Dong, Yunhe Feng, Qing Yang, Yuewei Lin, Heng Fan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[450] arXiv:2405.17613 [pdf, other]: Title: A Framework for Multi-modal Learning: Jointly Modeling Inter- & Intra-Modality Dependencies

Authors: Divyam Madaan, Taro Makino, Sumit Chopra, Kyunghyun Cho

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[451] arXiv:2405.17609 [pdf, other]: Title: GarmentCodeData: A Dataset of 3D Made-to-Measure Garments With Sewing Patterns

Authors: Maria Korosteleva, Timur Levent Kesdogan, Fabian Kemper, Stephan Wenninger, Jasmin Koller, Yuhan Zhang, Mario Botsch, Olga Sorkine-Hornung

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[452] arXiv:2405.17596 [pdf, other]: Title: GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane

Authors: Yansong Qu, Shaohui Dai, Xinyang Li, Jianghang Lin, Liujuan Cao, Shengchuan Zhang, Rongrong Ji

Comments: Our project page is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[453] arXiv:2405.17568 [pdf, other]: Title: ExtremeMETA: High-speed Lightweight Image Segmentation Model by Remodeling Multi-channel Metamaterial Imagers

Authors: Quan Liu, Brandon T. Swartz, Ivan Kravchenko, Jason G. Valentine, Yuankai Huo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[454] arXiv:2405.17532 [pdf, other]: Title: ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance

Authors: Jiannan Huang, Jun Hao Liew, Hanshu Yan, Yuyang Yin, Yao Zhao, Yunchao Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[455] arXiv:2405.17531 [pdf, other]: Title: Evolutive Rendering Models

Authors: Fangneng Zhan, Hanxue Liang, Yifan Wang, Michael Niemeyer, Michael Oechsle, Adam Kortylewski, Cengiz Oztireli, Gordon Wetzstein, Christian Theobalt

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[456] arXiv:2405.17523 [pdf, other]: Title: Locally Testing Model Detections for Semantic Global Concepts

Authors: Franz Motzkus, Georgii Mikriukov, Christian Hellert, Ute Schmid

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[457] arXiv:2405.17475 [pdf, other]: Title: How Culturally Aware are Vision-Language Models?

Authors: Olena Burda-Lassen, Aman Chadha, Shashank Goswami, Vinija Jain

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[458] arXiv:2405.17457 [pdf, other]: Title: Data-Free Federated Class Incremental Learning with Diffusion-Based Generative Memory

Authors: Naibo Wang, Yuchen Deng, Wenjie Feng, Jianwei Yin, See-Kiong Ng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[459] arXiv:2405.17456 [pdf, other]: Title: Optimized Linear Measurements for Inverse Problems using Diffusion-Based Image Generation

Authors: Ling-Qi Zhang, Zahra Kadkhodaie, Eero P. Simoncelli, David H. Brainard

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[460] arXiv:2405.17455 [pdf, other]: Title: WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets

Authors: Adib Hasan, Mardavij Roozbehani, Munther Dahleh

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph); Machine Learning (stat.ML)
[461] arXiv:2405.17450 [pdf, other]: Title: The Power of Next-Frame Prediction for Learning Physical Laws

Authors: Thomas Winterbottom, G. Thomas Hudson, Daniel Kluvanec, Dean Slack, Jamie Sterling, Junjie Shentu, Chenghao Xiao, Zheming Zhou, Noura Al Moubayed

Comments: 7 Figures, 12 Pages, 1 Table

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[462] arXiv:2405.17449 [pdf, ps, other]: Title: Image Based Character Recognition, Documentation System To Decode Inscription From Temple

Authors: Velmathi G, Shangavelan M, Harish D, Krithikshun M S

Comments: This research paper is a part of capstone project submitted to VIT Chennai, VIT University

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[463] arXiv:2405.17447 [pdf, other]: Title: How to train your ViT for OOD Detection

Authors: Maximilian Mueller, Matthias Hein

Comments: arXiv admin note: text overlap with arXiv:2306.00826

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[464] arXiv:2405.17444 [pdf, other]: Title: Towards Gradient-based Time-Series Explanations through a SpatioTemporal Attention Network

Authors: Min Hun Lee

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[465] arXiv:2405.18418 (cross-list from cs.LG) [pdf, other]: Title: Hierarchical World Models as Visual Whole-Body Humanoid Controllers

Authors: Nicklas Hansen, Jyothir S V, Vlad Sobal, Yann LeCun, Xiaolong Wang, Hao Su

Comments: Code and videos at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[466] arXiv:2405.18410 (cross-list from eess.IV) [pdf, other]: Title: Towards a Sampling Theory for Implicit Neural Representations

Authors: Mahrokh Najaf, Gregory Ongie

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[467] arXiv:2405.18407 (cross-list from cs.LG) [pdf, other]: Title: Phased Consistency Model

Authors: Fu-Yun Wang, Zhaoyang Huang, Alexander William Bergman, Dazhong Shen, Peng Gao, Michael Lingelbach, Keqiang Sun, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li, Xiaogang Wang

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[468] arXiv:2405.18376 (cross-list from cs.LG) [pdf, other]: Title: Empowering Source-Free Domain Adaptation with MLLM-driven Curriculum Learning

Authors: Dongjie Chen, Kartik Patwari, Zhengfeng Lai, Sen-ching Cheung, Chen-Nee Chuah

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[469] arXiv:2405.18358 (cross-list from cs.CL) [pdf, other]: Title: MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning

Authors: Somnath Kumar, Yash Gadhia, Tanuja Ganu, Akshay Nambi

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[470] arXiv:2405.18356 (cross-list from eess.IV) [pdf, other]: Title: Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography

Authors: Jie Liu, Yixiao Zhang, Kang Wang, Mehmet Can Yavuz, Xiaoxi Chen, Yixuan Yuan, Haoliang Li, Yang Yang, Alan Yuille, Yucheng Tang, Zongwei Zhou

Comments: Accepted to Medical Image Analysis

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[471] arXiv:2405.18334 (cross-list from cs.DB) [pdf, other]: Title: SketchQL Demonstration: Zero-shot Video Moment Querying with Sketches

Authors: Renzhi Wu, Pramod Chunduri, Dristi J Shah, Ashmitha Julius Aravind, Ali Payani, Xu Chu, Joy Arulraj, Kexin Rong

Journal-ref: Published on International Conference on Very Large Databases 2024

Subjects: Databases (cs.DB); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[472] arXiv:2405.18327 (cross-list from q-bio.QM) [pdf, ps, other]: Title: Histopathology Based AI Model Predicts Anti-Angiogenic Therapy Response in Renal Cancer Clinical Trial

Authors: Jay Jasti, Hua Zhong, Vandana Panwar, Vipul Jarmale, Jeffrey Miyata, Deyssy Carrillo, Alana Christie, Dinesh Rakheja, Zora Modrusan, Edward Ernest Kadel III, Niha Beig, Mahrukh Huseni, James Brugarolas, Payal Kapur, Satwik Rajaram

Comments: 19 pages, 4 Figures

Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[473] arXiv:2405.18267 (cross-list from eess.IV) [pdf, other]: Title: CT-based brain ventricle segmentation via diffusion Schrödinger Bridge without target domain ground truths

Authors: Reihaneh Teimouri, Marta Kersten-Oertel, Yiming Xiao

Comments: Early acceptance at MICCAI2024

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[474] arXiv:2405.18236 (cross-list from cs.CR) [pdf, other]: Title: Position Paper: Think Globally, React Locally -- Bringing Real-time Reference-based Website Phishing Detection on macOS

Authors: Ivan Petrukha, Nataliia Stulova, Sergii Kryvoblotskyi

Comments: 8 pages, 7 figures, 8 tables. Accepted to STAST'24, 14th International Workshop on Socio-Technical Aspects in Security, Affiliated with the 9th IEEE European Symposium on Security and Privacy, this https URL

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[475] arXiv:2405.18213 (cross-list from cs.SD) [pdf, other]: Title: NeRAF: 3D Scene Infused Neural Radiance and Acoustic Fields

Authors: Amandine Brunetto, Sascha Hornauer, Fabien Moutarde

Comments: Project Page: this https URL

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[476] arXiv:2405.18196 (cross-list from cs.RO) [pdf, other]: Title: Render and Diffuse: Aligning Image and Action Spaces for Diffusion-based Behaviour Cloning

Authors: Vitalis Vosylius, Younggyo Seo, Jafar Uruç, Stephen James

Comments: Robotics: Science and Systems (RSS) 2024. Videos are available on our project webpage at this https URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[477] arXiv:2405.18193 (cross-list from cs.LG) [pdf, other]: Title: In-Context Symmetries: Self-Supervised Learning through Contextual World Models

Authors: Sharut Gupta, Chenyu Wang, Yifei Wang, Tommi Jaakkola, Stefanie Jegelka

Comments: 32 pages, 24 tables and 11 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[478] arXiv:2405.18167 (cross-list from eess.IV) [pdf, other]: Title: Confidence-aware multi-modality learning for eye disease screening

Authors: Ke Zou, Tian Lin, Zongbo Han, Meng Wang, Xuedong Yuan, Haoyu Chen, Changqing Zhang, Xiaojing Shen, Huazhu Fu

Comments: 27 pages, 7 figures, 9 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[479] arXiv:2405.18064 (cross-list from cs.AI) [pdf, ps, other]: Title: Automated Real-World Sustainability Data Generation from Images of Buildings

Authors: Peter J Bentley, Soo Ling Lim, Rajat Mathur, Sid Narang

Comments: 6 pages

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[480] arXiv:2405.18045 (cross-list from cs.LG) [pdf, other]: Title: Bridging Mini-Batch and Asymptotic Analysis in Contrastive Learning: From InfoNCE to Kernel-Based Losses

Authors: Panagiotis Koromilas, Giorgos Bouritsas, Theodoros Giannakopoulos, Mihalis Nicolaou, Yannis Panagakis

Comments: Accepted at ICML 2024. Code available at: this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[481] arXiv:2405.17969 (cross-list from cs.CL) [pdf, other]: Title: Knowledge Circuits in Pretrained Transformers

Authors: Yunzhi Yao, Ningyu Zhang, Zekun Xi, Mengru Wang, Ziwen Xu, Shumin Deng, Huajun Chen

Comments: Work in progress, 25 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[482] arXiv:2405.17927 (cross-list from cs.AI) [pdf, other]: Title: The Evolution of Multimodal Model Architectures

Authors: Shakti N. Wadekar, Abhishek Chaurasia, Aman Chadha, Eugenio Culurciello

Comments: 30 pages, 6 tables, 7 figures

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[483] arXiv:2405.17811 (cross-list from cs.GR) [pdf, other]: Title: Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh

Authors: Xiangjun Gao, Xiaoyu Li, Yiyu Zhuang, Qi Zhang, Wenbo Hu, Chaopeng Zhang, Yao Yao, Ying Shan, Long Quan

Comments: Project page here: this https URL

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[484] arXiv:2405.17769 (cross-list from cs.RO) [pdf, other]: Title: Microsaccade-inspired Event Camera for Robotics

Authors: Botao He, Ze Wang, Yuan Zhou, Jingxi Chen, Chahat Deep Singh, Haojia Li, Yuman Gao, Shaojie Shen, Kaiwei Wang, Yanjun Cao, Chao Xu, Yiannis Aloimonos, Fei Gao, Cornelia Fermuller

Comments: Published on Science Robotics June 2024 issue

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[485] arXiv:2405.17756 (cross-list from eess.IV) [pdf, ps, other]: Title: Motion-Informed Deep Learning for Brain MR Image Reconstruction Framework

Authors: Zhifeng Chen, Kamlesh Pawar, Kh Tohidul Islam, Himashi Peiris, Gary Egan, Zhaolin Chen

Comments: 22 pages, 7 figures, 4 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[486] arXiv:2405.17706 (cross-list from cs.AI) [pdf, other]: Title: Video Enriched Retrieval Augmented Generation Using Aligned Video Captions

Authors: Kevin Dela Rosa

Comments: SIGIR 2024 Workshop on Multimodal Representation and Retrieval (MRR 2024)

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[487] arXiv:2405.17663 (cross-list from cs.LG) [pdf, other]: Title: What's the Opposite of a Face? Finding Shared Decodable Concepts and their Negations in the Brain

Authors: Cory Efird, Alex Murphy, Joel Zylberberg, Alona Fyshe

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[488] arXiv:2405.17659 (cross-list from eess.IV) [pdf, other]: Title: Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba

Authors: Jiahao Huang, Liutao Yang, Fanwen Wang, Yinzhe Wu, Yang Nan, Weiwen Wu, Chengyan Wang, Kuangyu Shi, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[489] arXiv:2405.17537 (cross-list from cs.AI) [pdf, other]: Title: BIOSCAN-CLIP: Bridging Vision and Genomics for Biodiversity Monitoring at Scale

Authors: ZeMing Gong, Austin T. Wang, Joakim Bruslund Haurum, Scott C. Lowe, Graham W. Taylor, Angel X. Chang

Comments: 16 pages with 9 figures

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)

[ total of 739 entries: 1-50 | ... | 290-339 | 340-389 | 390-439 | 440-489 | 490-539 | 540-589 | 590-639 | ... | 690-739 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 439

Wed, 29 May 2024 (continued, showing 50 of 152 entries)