Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 160

Total of 614 entries; showing entries 161-210 (50 per page).

Fri, 24 May 2024 (continued, showing 50 of 242 entries)

[161] arXiv:2405.13467 [pdf, other]: Title: AdaFedFR: Federated Face Recognition with Adaptive Inter-Class Representation Learning

Authors: Di Qiu, Xinyang Lin, Kaiye Wang, Xiangxiang Chu, Pengfei Yan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[162] arXiv:2405.13459 [pdf, other]: Title: Adapting Multi-modal Large Language Model to Concept Drift in the Long-tailed Open World

Authors: Xiaoyu Yang, Jie Lu, En Yu

Comments: 26 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[163] arXiv:2405.13451 [pdf, other]: Title: A Label Propagation Strategy for CutMix in Multi-Label Remote Sensing Image Classification

Authors: Tom Burgert, Tim Siebert, Kai Norman Clasen, Begüm Demir

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164] arXiv:2405.13438 [pdf, ps, other]: Title: Dynamically enhanced static handwriting representation for Parkinson's disease detection

Authors: Moises Diaz, Miguel Angel Ferrer, Donato Impedovo, Giuseppe Pirlo, Gennaro Vessio

Journal-ref: Pattern Recognition Letters, vol. 128, pp. 204-210 (2019)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[165] arXiv:2405.13397 [pdf, other]: Title: Multi Player Tracking in Ice Hockey with Homographic Projections

Authors: Harish Prakash, Jia Cheng Shang, Ken M. Nsiempba, Yuhao Chen, David A. Clausi, John S. Zelek

Comments: Accepted at the Conference on Robots and Vision (CRV), 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166] arXiv:2405.13389 [pdf, other]: Title: HR-INR: Continuous Space-Time Video Super-Resolution via Event Camera

Authors: Yunfan Lu, Zipeng Wang, Yusheng Wang, Hui Xiong

Comments: 30 pages, 20 figures, 8 tables. This work was submitted for review in the second half of 2023. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Robotics (cs.RO)
[167] arXiv:2405.13388 [pdf, other]: Title: Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation

Authors: Dingwen Zhang, Hao Li, Diqi He, Nian Liu, Lechao Cheng, Jingdong Wang, Junwei Han

Comments: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[168] arXiv:2405.13382 [pdf, other]: Title: VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

Authors: Yongxin Guo, Jingyu Liu, Mingda Li, Xiaoying Tang, Xi Chen, Bo Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[169] arXiv:2405.13376 [pdf, other]: Title: Markerless retro-identification complements re-identification of individual insect subjects in archived image data of biological experiments

Authors: Asaduz Zaman, Vanessa Kellermann, Alan Dorin

Comments: Accepted to CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[170] arXiv:2405.13374 [pdf, other]: Title: Collaboration of Teachers for Semi-supervised Object Detection

Authors: Liyu Chen, Huaao Tang, Yi Wen, Hanting Chen, Wei Li, Junchao Liu, Jie Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[171] arXiv:2405.13360 [pdf, other]: Title: How to Trace Latent Generative Model Generated Images without Artificial Watermark?

Authors: Zhenting Wang, Vikash Sehwag, Chen Chen, Lingjuan Lyu, Dimitris N. Metaxas, Shiqing Ma

Comments: ICML 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[172] arXiv:2405.13337 [pdf, other]: Title: Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer

Authors: Qihang Fan, Huaibo Huang, Mingrui Chen, Ran He

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[173] arXiv:2405.13335 [pdf, other]: Title: Vision Transformer with Sparse Scan Prior

Authors: Qihang Fan, Huaibo Huang, Mingrui Chen, Ran He

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174] arXiv:2405.13285 [pdf, ps, other]: Title: Enhancing Active Learning for Sentinel 2 Imagery through Contrastive Learning and Uncertainty Estimation

Authors: David Pogorzelski, Peter Arlinghaus

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[175] arXiv:2405.13278 [pdf, other]: Title: Single color virtual H&E staining with In-and-Out Net

Authors: Mengkun Chen, Yen-Tung Liu, Fadeel Sher Khan, Matthew C. Fox, Jason S. Reichenberg, Fabiana C.P.S. Lopes, Katherine R. Sebastian, Mia K. Markey, James W. Tunnell

Subjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[176] arXiv:2405.13267 [pdf, other]: Title: FLARE up your data: Diffusion-based Augmentation Method in Astronomical Imaging

Authors: Mohammed Talha Alam, Raza Imam, Mohsen Guizani, Fakhri Karray

Comments: 15 pages main paper (including references), 3 pages supplementary material. Our code and SpaceNet dataset are available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177] arXiv:2405.13256 [pdf, ps, other]: Title: Traffic control using intelligent timing of traffic lights with reinforcement learning technique and real-time processing of surveillance camera images

Authors: Mahdi Jamebozorg, Mohsen Hami, Sajjad Deh Deh Jani

Comments: 6th International Conference on Traffic Management and Safety, Tehran; 12 pages, in Persian

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[178] arXiv:2405.13229 [pdf, other]: Title: Transfer Learning Approach for Railway Technical Map (RTM) Component Identification

Authors: Obadage Rochana Rumalshan, Pramuka Weerasinghe, Mohamed Shaheer, Prabhath Gunathilake, Erunika Dayaratna

Comments: 9 pages, 8 figures

Journal-ref: Lecture Notes in Networks and Systems: 465 (2022) 479-488

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Digital Libraries (cs.DL)
[179] arXiv:2405.13218 [pdf, other]: Title: Computational Tradeoffs in Image Synthesis: Diffusion, Masked-Token, and Next-Token Prediction

Authors: Maciej Kilian, Varun Jampani, Luke Zettlemoyer

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[180] arXiv:2405.13206 [pdf, other]: Title: Identity-free Artificial Emotional Intelligence via Micro-Gesture Understanding

Authors: Rong Gao, Xin Liu, Bohao Xing, Zitong Yu, Bjorn W. Schuller, Heikki Kälviäinen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[181] arXiv:2405.13202 [pdf, other]: Title: Empowering Urban Traffic Management: Elevated 3D LiDAR for Data Collection and Advanced Object Detection Analysis

Authors: Nawfal Guefrachi, Hakim Ghazzai, Ahmad Alsharoa

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[182] arXiv:2405.13197 [pdf, other]: Title: Global-Local Detail Guided Transformer for Sea Ice Recognition in Optical Remote Sensing Images

Authors: Zhanchao Huang, Wenjun Hong, Hua Su

Comments: 5 pages, 5 figures

Journal-ref: IEEE IGARSS 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183] arXiv:2405.13195 [pdf, other]: Title: CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers

Authors: Andrew Marmon, Grant Schindler, José Lezama, Dan Kondratyuk, Bryan Seybold, Irfan Essa

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[184] arXiv:2405.13194 [pdf, other]: Title: KPConvX: Modernizing Kernel Point Convolution with Kernel Attention

Authors: Hugues Thomas, Yao-Hung Hubert Tsai, Timothy D. Barfoot, Jian Zhang

Comments: CVPR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185] arXiv:2405.13152 [pdf, other]: Title: Enhancing Interaction Modeling with Agent Selection and Physical Methods for Trajectory Prediction

Authors: Shiji Huang, Lei Ye, Min Chen, Wenhai Luo, Chenqi Xu, Deyuan Liang, Dihong Wang

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[186] arXiv:2405.13127 [pdf, other]: Title: Towards Retrieval-Augmented Architectures for Image Captioning

Authors: Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Alessandro Nicolosi, Rita Cucchiara

Comments: ACM Transactions on Multimedia Computing, Communications and Applications (2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[187] arXiv:2405.13097 [pdf, other]: Title: NieR: Normal-Based Lighting Scene Rendering

Authors: Hongsheng Wang, Yang Wang, Yalan Liu, Fayuan Hu, Shengyu Zhang, Fei Wu, Feng Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[188] arXiv:2405.14802 (cross-list from eess.IV) [pdf, other]: Title: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation

Authors: Hongxu Jiang, Muhammad Imran, Linhai Ma, Teng Zhang, Yuyin Zhou, Muxuan Liang, Kuang Gong, Wei Shao

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[189] arXiv:2405.14800 (cross-list from cs.CR) [pdf, other]: Title: Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy

Authors: Shengfang Zhai, Huanran Chen, Yinpeng Dong, Jiajun Li, Qingni Shen, Yansong Gao, Hang Su, Yang Liu

Comments: 17 pages, 5 figures

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[190] arXiv:2405.14791 (cross-list from cs.LG) [pdf, other]: Title: Recurrent Early Exits for Federated Learning with Heterogeneous Clients

Authors: Royson Lee, Javier Fernandez-Marques, Shell Xu Hu, Da Li, Stefanos Laskaridis, Łukasz Dudziak, Timothy Hospedales, Ferenc Huszár, Nicholas D. Lane

Comments: Accepted at the 41st International Conference on Machine Learning (ICML 2024)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[191] arXiv:2405.14768 (cross-list from cs.CL) [pdf, other]: Title: WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models

Authors: Peng Wang, Zexi Li, Ningyu Zhang, Ziwen Xu, Yunzhi Yao, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen

Comments: Work in progress

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[192] arXiv:2405.14731 (cross-list from cs.RO) [pdf, other]: Title: CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments

Authors: Yang Zhou, Long Quang, Carlos Nieto-Granda, Giuseppe Loianno

Comments: 8 pages, 8 figures, 4 tables. Accepted at IEEE Robotics and Automation Letters (RA-L), 2024

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[193] arXiv:2405.14720 (cross-list from eess.IV) [pdf, other]: Title: Convolutional Neural Network Model Observers Discount Signal-like Anatomical Structures During Search in Virtual Digital Breast Tomosynthesis Phantoms

Authors: Aditya Jonnalagadda, Bruno B. Barufaldi, Andrew D.A. Maidment, Susan P. Weinstein, Craig K. Abbey, Miguel P. Eckstein

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[194] arXiv:2405.14622 (cross-list from cs.LG) [pdf, other]: Title: Calibrated Self-Rewarding Vision Language Models

Authors: Yiyang Zhou, Zhiyuan Fan, Dongjie Cheng, Sihan Yang, Zhaorun Chen, Chenhang Cui, Xiyao Wang, Yun Li, Linjun Zhang, Huaxiu Yao

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[195] arXiv:2405.14590 (cross-list from eess.IV) [pdf, other]: Title: MAMOC: MRI Motion Correction via Masked Autoencoding

Authors: Lennart Alexander Van der Goten, Jingyu Guo, Kevin Smith

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[196] arXiv:2405.14522 (cross-list from cs.LG) [pdf, other]: Title: Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property

Authors: Yuya Yoshikawa, Masanari Kimura, Ryotaro Shimizu, Yuki Saito

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[197] arXiv:2405.14477 (cross-list from cs.LG) [pdf, other]: Title: LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models

Authors: Seyedmorteza Sadat, Jakob Buhmann, Derek Bradley, Otmar Hilliges, Romann M. Weber

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[198] arXiv:2405.14453 (cross-list from eess.IV) [pdf, other]: Title: Domain-specific augmentations with resolution agnostic self-attention mechanism improves choroid segmentation in optical coherence tomography images

Authors: Jamie Burke, Justin Engelmann, Charlene Hamid, Diana Moukaddem, Dan Pugh, Neeraj Dhaun, Amos Storkey, Niall Strang, Stuart King, Tom MacGillivray, Miguel O. Bernabeu, Ian J.C. MacCormick

Comments: 13 pages, 2 figures, 8 tables (including supplementary material)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[199] arXiv:2405.14327 (cross-list from eess.IV) [pdf, other]: Title: Autoregressive Image Diffusion: Generating Image Sequence and Application in MRI

Authors: Guanxiong Luo, Shoujin Huang, Martin Uecker

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[200] arXiv:2405.14313 (cross-list from cs.LG) [pdf, other]: Title: Smooth Pseudo-Labeling

Authors: Nikolaos Karaliolios, Hervé Le Borgne, Florian Chabot

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[201] arXiv:2405.14304 (cross-list from cs.GR) [pdf, other]: Title: Exposure Diffusion: HDR Image Generation by Consistent LDR denoising

Authors: Mojtaba Bemana, Thomas Leimkühler, Karol Myszkowski, Hans-Peter Seidel, Tobias Ritschel

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[202] arXiv:2405.14300 (cross-list from eess.IV) [pdf, other]: Title: Automatic diagnosis of cardiac magnetic resonance images based on semi-supervised learning

Authors: Hejun Huang, Zuguo Chen, Yi Huang, Guangqiang Luo, Chaoyang Chen, Youzhi Song

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[203] arXiv:2405.14242 (cross-list from eess.IV) [pdf, ps, other]: Title: M2ANET: Mobile Malaria Attention Network for efficient classification of plasmodium parasites in blood cells

Authors: Salam Ahmed Ali, Peshraw Salam Abdulqadir, Shan Ali Abdullah, Haruna Yunusa

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[204] arXiv:2405.14239 (cross-list from cs.LG) [pdf, other]: Title: Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations

Authors: Mohammed Baharoon, Jonathan Klein, Dominik L. Michels

Comments: 20 pages, 2 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[205] arXiv:2405.14222 (cross-list from cs.LG) [pdf, other]: Title: RAQ-VAE: Rate-Adaptive Vector-Quantized Variational Autoencoder

Authors: Jiwan Seo, Joonhyuk Kang

Comments: Under review

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[206] arXiv:2405.14221 (cross-list from eess.IV) [pdf, other]: Title: Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization

Authors: Zhibo Chen, Heming Sun, Li Zhang, Fan Zhang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[207] arXiv:2405.14205 (cross-list from cs.CL) [pdf, other]: Title: Agent Planning with World Knowledge Model

Authors: Shuofei Qiao, Runnan Fang, Ningyu Zhang, Yuqi Zhu, Xiang Chen, Shumin Deng, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen

Comments: Work in progress

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[208] arXiv:2405.14189 (cross-list from cs.CL) [pdf, other]: Title: Semantic-guided Prompt Organization for Universal Goal Hijacking against LLMs

Authors: Yihao Huang, Chong Wang, Xiaojun Jia, Qing Guo, Felix Juefei-Xu, Jian Zhang, Geguang Pu, Yang Liu

Comments: 15 pages

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[209] arXiv:2405.14147 (cross-list from cs.LG) [pdf, other]: Title: Minimum number of neurons in fully connected layers of a given neural network (the first approximation)

Authors: Oleg I. Berngardt

Comments: 21 pages, 2 figures, 1 table

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[210] arXiv:2405.14129 (cross-list from cs.CL) [pdf, other]: Title: AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability

Authors: Fei Zhao, Taotian Pang, Chunhui Li, Zhen Wu, Junjie Guo, Shangyu Xing, Xinyu Dai

Comments: Code and models are available at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
