Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 36

[ total of 679 entries: 1-100 | 37-136 | 137-236 | 237-336 | 337-436 | ... | 637-679 ]
[ showing 100 entries per page: fewer | more | all ]

Wed, 5 Jun 2024 (continued, showing last 66 of 102 entries)

[37] arXiv:2406.02223 [pdf, other]: Title: SMCL: Saliency Masked Contrastive Learning for Long-tailed Recognition

Authors: Sanglee Park, Seung-won Hwang, Jungmin So

Comments: accepted at ICASSP 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[38] arXiv:2406.02208 [pdf, other]: Title: Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts

Authors: Haodong Hong, Sen Wang, Zi Huang, Qi Wu, Jiajun Liu

Comments: IJCAI 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[39] arXiv:2406.02202 [pdf, other]: Title: Can CLIP help CLIP in learning 3D?

Authors: Cristian Sbrolli, Matteo Matteucci

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[40] arXiv:2406.02184 [pdf, other]: Title: GraVITON: Graph based garment warping with attention guided inversion for Virtual-tryon

Authors: Sanhita Pathak, Vinay Kaushik, Brejesh Lall

Comments: 18 pages, 7 Figures and 6 Tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41] arXiv:2406.02158 [pdf, other]: Title: Radar Spectra-Language Model for Automotive Scene Parsing

Authors: Mariia Pushkareva, Yuri Feldman, Csaba Domokos, Kilian Rambach, Dotan Di Castro

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[42] arXiv:2406.02153 [pdf, other]: Title: Analyzing the Feature Extractor Networks for Face Image Synthesis

Authors: Erdi Sarıtaş, Hazım Kemal Ekenel

Comments: Accepted at 18th International Conference on Automatic Face and Gesture Recognition (FG) on 1st SD-FGA Workshop 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[43] arXiv:2406.02147 [pdf, other]: Title: UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking

Authors: Lijun Zhou, Tao Tang, Pengkun Hao, Zihang He, Kalok Ho, Shuo Gu, Wenbo Hou, Zhihui Hao, Haiyang Sun, Kun Zhan, Peng Jia, Xianpeng Lang, Xiaodan Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[44] arXiv:2406.02142 [pdf, other]: Title: Analyzing the Effect of Combined Degradations on Face Recognition

Authors: Erdi Sarıtaş, Hazım Kemal Ekenel

Comments: Accepted at 18th International Conference on Automatic Face and Gesture Recognition (FG) on 2nd PrivAAL Workshop 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[45] arXiv:2406.02125 [pdf, other]: Title: Domain Game: Disentangle Anatomical Feature for Single Domain Generalized Segmentation

Authors: Hao Chen, Hongrun Zhang, U Wang Chan, Rui Yin, Xiaofei Wang, Chao Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[46] arXiv:2406.02074 [pdf, other]: Title: FaceCom: Towards High-fidelity 3D Facial Shape Completion via Optimization and Inpainting Guidance

Authors: Yinglong Li, Hongyu Wu, Xiaogang Wang, Qingzhao Qin, Yijiao Zhao, Yong Wang, Aimin Hao

Comments: accepted to CVPR2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[47] arXiv:2406.02058 [pdf, other]: Title: OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding

Authors: Yanmin Wu, Jiarui Meng, Haijie Li, Chenming Wu, Yahao Shi, Xinhua Cheng, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Jian Zhang

Comments: technical report, 15 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[48] arXiv:2406.02038 [pdf, other]: Title: Leveraging Predicate and Triplet Learning for Scene Graph Generation

Authors: Jiankai Li, Yunhong Wang, Xiefan Guo, Ruijie Yang, Weixin Li

Comments: CVPR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[49] arXiv:2406.02037 [pdf, ps, other]: Title: Multi-Scale Direction-Aware Network for Infrared Small Target Detection

Authors: Jinmiao Zhao, Zelin Shi, Chuang Yu, Yunpeng Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50] arXiv:2406.02021 [pdf, other]: Title: MetaMixer Is All You Need

Authors: Seokju Yun, Dongheon Lee, Youngmin Ro

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[51] arXiv:2406.01994 [pdf, other]: Title: 3D Imaging of Complex Specular Surfaces by Fusing Polarimetric and Deflectometric Information

Authors: Jiazhang Wang, Oliver Cossairt, Florian Willomitzer

Subjects: Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
[52] arXiv:2406.01987 [pdf, other]: Title: Dealing with All-stage Missing Modality: Towards A Universal Model with Robust Reconstruction and Personalization

Authors: Yunpeng Zhao, Cheng Chen, Qing You Pang, Quanzheng Li, Carol Tang, Beng-Ti Ang, Yueming Jin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[53] arXiv:2406.01970 [pdf, other]: Title: The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise

Authors: Yuanhao Ban, Ruochen Wang, Tianyi Zhou, Boqing Gong, Cho-Jui Hsieh, Minhao Cheng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[54] arXiv:2406.01956 [pdf, other]: Title: Enhance Image-to-Image Generation with LLaVA Prompt and Negative Prompt

Authors: Zhicheng Ding, Panfeng Li, Qikai Yang, Siyang Li

Comments: Accepted by 2024 5th International Conference on Information Science, Parallel and Distributed Systems

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[55] arXiv:2406.01954 [pdf, other]: Title: Plug-and-Play Diffusion Distillation

Authors: Yi-Ting Hsiao, Siavash Khodadadeh, Kevin Duarte, Wei-An Lin, Hui Qu, Mingi Kwon, Ratheesh Kalarot

Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[56] arXiv:2406.01938 [pdf, other]: Title: Nutrition Estimation for Dietary Management: A Transformer Approach with Depth Sensing

Authors: Zhengyi Kwan, Wei Zhang, Zhengkui Wang, Aik Beng Ng, Simon See

Comments: 10 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[57] arXiv:2406.01932 [pdf, other]: Title: Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning

Authors: Heather Doig, Oscar Pizarro, Jacquomo Monk, Stefan Williams

Comments: 7 pages, 5 figures. Submitted to the 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[58] arXiv:2406.01920 [pdf, other]: Title: CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models

Authors: Junho Kim, Hyunjun Kim, Yeonju Kim, Yong Man Ro

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[59] arXiv:2406.01917 [pdf, other]: Title: GOMAA-Geo: GOal Modality Agnostic Active Geo-localization

Authors: Anindya Sarkar, Srikumar Sastry, Aleksis Pirinen, Chongjie Zhang, Nathan Jacobs, Yevgeniy Vorobeychik

Comments: 23 pages, 17 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[60] arXiv:2406.01916 [pdf, other]: Title: FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping

Authors: Yuzhou Ji, He Zhu, Junshu Tang, Wuyi Liu, Zhizhong Zhang, Yuan Xie, Lizhuang Ma, Xin Tan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[61] arXiv:2406.01914 [pdf, other]: Title: HPE-CogVLM: New Head Pose Grounding Task Exploration on Vision Language Model

Authors: Yu Tian, Tianqi Shao, Tsukasa Demizu, Xuyang Wu, Hsin-Tai Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[62] arXiv:2406.01906 [pdf, other]: Title: ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization

Authors: Chen Mao, Jingqi Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[63] arXiv:2406.01900 [pdf, other]: Title: Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation

Authors: Yue Ma, Hongyu Liu, Hongfa Wang, Heng Pan, Yingqing He, Junkun Yuan, Ailing Zeng, Chengfei Cai, Heung-Yeung Shum, Wei Liu, Qifeng Chen

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[64] arXiv:2406.01894 [pdf, other]: Title: SVASTIN: Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks

Authors: Yi Pan, Jun-Jie Huang, Zihan Chen, Wentao Zhao, Ziyue Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[65] arXiv:2406.01884 [pdf, other]: Title: Rank-based No-reference Quality Assessment for Face Swapping

Authors: Xinghui Zhou, Wenbo Zhou, Tianyi Wei, Shen Chen, Taiping Yao, Shouhong Ding, Weiming Zhang, Nenghai Yu

Comments: 8 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[66] arXiv:2406.01869 [pdf, ps, other]: Title: Fruit Classification System with Deep Learning and Neural Architecture Search

Authors: Christine Dewi, Dhananjay Thiruvady, Nayyar Zaidi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[67] arXiv:2406.01867 [pdf, other]: Title: MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training

Authors: Kengo Uchida, Takashi Shibuya, Yuhta Takida, Naoki Murata, Shusuke Takahashi, Yuki Mitsufuji

Comments: 12 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[68] arXiv:2406.01843 [pdf, other]: Title: L-MAGIC: Language Model Assisted Generation of Images with Coherence

Authors: Zhipeng Cai, Matthias Mueller, Reiner Birkl, Diana Wofk, Shao-Yen Tseng, JunDa Cheng, Gabriela Ben-Melech Stan, Vasudev Lal, Michael Paulitsch

Comments: accepted to CVPR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[69] arXiv:2406.01837 [pdf, other]: Title: Boosting Vision-Language Models with Transduction

Authors: Maxime Zanella, Benoît Gérin, Ismail Ben Ayed

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[70] arXiv:2406.01820 [pdf, other]: Title: Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning

Authors: Leonardo Iurada, Marco Ciccone, Tatiana Tommasi

Comments: Accepted CVPR 2024 - this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[71] arXiv:2406.01815 [pdf, ps, other]: Title: Deep asymmetric mixture model for unsupervised cell segmentation

Authors: Yang Nan, Guang Yang

Comments: 5 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72] arXiv:2406.01797 [pdf, other]: Title: The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry

Authors: Paolo Cudrano, Xiaoyu Luo, Matteo Matteucci

Comments: Accepted to CoLLAs 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[73] arXiv:2406.01791 [pdf, other]: Title: Hybrid-Learning Video Moment Retrieval across Multi-Domain Labels

Authors: Weitong Cai, Jiabo Huang, Shaogang Gong

Comments: Accepted by BMVC2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74] arXiv:2406.01765 [pdf, other]: Title: Reproducibility Study on Adversarial Attacks Against Robust Transformer Trackers

Authors: Fatemeh Nourilenjan Nokabadi, Jean-François Lalonde, Christian Gagné

Comments: Published in Transactions on Machine Learning Research (05/2024): this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75] arXiv:2406.01764 [pdf, other]: Title: An approximation-based approach versus an AI one for the study of CT images of abdominal aorta aneurysms

Authors: Lucrezia Rinelli, Arianna Travaglini, Nicolò Vescera, Gianluca Vinti

Comments: 28 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[76] arXiv:2406.01662 [pdf, other]: Title: Few-Shot Classification of Interactive Activities of Daily Living (InteractADL)

Authors: Zane Durante, Robathan Harries, Edward Vendrow, Zelun Luo, Yuta Kyuragi, Kazuki Kozuka, Li Fei-Fei, Ehsan Adeli

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[77] arXiv:2406.01658 [pdf, other]: Title: Proxy Denoising for Source-Free Domain Adaptation

Authors: Song Tang, Wenxin Su, Mao Ye, Jianwei Zhang, Xiatian Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[78] arXiv:2406.01598 [pdf, ps, other]: Title: D2E-An Autonomous Decision-making Dataset involving Driver States and Human Evaluation

Authors: Zehong Ke, Yanbo Jiang, Yuning Wang, Hao Cheng, Jinhao Li, Jianqiang Wang

Comments: Submitted to ITSC 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Robotics (cs.RO)
[79] arXiv:2406.01597 [pdf, other]: Title: End-to-End Rate-Distortion Optimized 3D Gaussian Representation

Authors: Henan Wang, Hanxin Zhu, Tianyu He, Runsen Feng, Jiajun Deng, Jiang Bian, Zhibo Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[80] arXiv:2406.02537 (cross-list from cs.CL) [pdf, other]: Title: TopViewRS: Vision-Language Models as Top-View Spatial Reasoners

Authors: Chengzu Li, Caiqi Zhang, Han Zhou, Nigel Collier, Anna Korhonen, Ivan Vulić

Comments: 9 pages, 3 figures, 3 tables (21 pages, 4 figures, 15 tables including references and appendices)

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[81] arXiv:2406.02534 (cross-list from eess.IV) [pdf, other]: Title: Enhancing predictive imaging biomarker discovery through treatment effect analysis

Authors: Shuhan Xiao, Lukas Klein, Jens Petersen, Philipp Vollmuth, Paul F. Jaeger, Klaus H. Maier-Hein

Comments: 19 pages, 12 figures

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[82] arXiv:2406.02529 (cross-list from eess.IV) [pdf, other]: Title: ReLUs Are Sufficient for Learning Implicit Neural Representations

Authors: Joseph Shenouda, Yamin Zhou, Robert D. Nowak

Comments: Accepted to ICML 2024

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[83] arXiv:2406.02480 (cross-list from eess.IV) [pdf, other]: Title: Fairness Evolution in Continual Learning for Medical Imaging

Authors: Marina Ceccon, Davide Dalle Pezze, Alessandro Fabris, Gian Antonio Susto

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[84] arXiv:2406.02477 (cross-list from eess.IV) [pdf, other]: Title: Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion

Authors: Colin Hansen, Simas Glinskis, Ashwin Raju, Micha Kornreich, JinHyeong Park, Jayashri Pawar, Richard Herzog, Li Zhang, Benjamin Odry

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[85] arXiv:2406.02465 (cross-list from cs.LG) [pdf, other]: Title: An Empirical Study into Clustering of Unseen Datasets with Self-Supervised Encoders

Authors: Scott C. Lowe, Joakim Bruslund Haurum, Sageev Oore, Thomas B. Moeslund, Graham W. Taylor

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[86] arXiv:2406.02422 (cross-list from eess.IV) [pdf, other]: Title: IterMask2: Iterative Unsupervised Anomaly Segmentation via Spatial and Frequency Masking for Brain Lesions in MRI

Authors: Ziyun Liang, Xiaoqing Guo, J. Alison Noble, Konstantinos Kamnitsas

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[87] arXiv:2406.02395 (cross-list from cs.LG) [pdf, other]: Title: GrootVL: Tree Topology is All You Need in State Space Model

Authors: Yicheng Xiao, Lin Song, Shaoli Huang, Jiangshan Wang, Siyu Song, Yixiao Ge, Xiu Li, Ying Shan

Comments: The code is available at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[88] arXiv:2406.02349 (cross-list from cs.NE) [pdf, other]: Title: CADE: Cosine Annealing Differential Evolution for Spiking Neural Network

Authors: Runhua Jiang, Guodong Du, Shuyang Yu, Yifei Guo, Sim Kuan Goh, Ho-Kin Tang

Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[89] arXiv:2406.02343 (cross-list from cs.LG) [pdf, other]: Title: Cluster-Aware Similarity Diffusion for Instance Retrieval

Authors: Jifei Luo, Hantao Yao, Changsheng Xu

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[90] arXiv:2406.02077 (cross-list from eess.IV) [pdf, other]: Title: Multi-target stain normalization for histology slides

Authors: Desislav Ivanov, Carlo Alberto Barbano, Marco Grangetto

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[91] arXiv:2406.02064 (cross-list from cs.LG) [pdf, other]: Title: Advancing Generalized Transfer Attack with Initialization Derived Bilevel Optimization and Dynamic Sequence Truncation

Authors: Yaohua Liu, Jiaxin Gao, Xuan Liu, Xianghao Jiao, Xin Fan, Risheng Liu

Comments: Accepted by IJCAI 2024. 10 pages

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[92] arXiv:2406.02027 (cross-list from cs.LG) [pdf, other]: Title: Inference Attacks in Machine Learning as a Service: A Taxonomy, Review, and Promising Directions

Authors: Feng Wu, Lei Cui, Shaowen Yao, Shui Yu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[93] arXiv:2406.01996 (cross-list from cs.LG) [pdf, other]: Title: Bayesian Mesh Optimization for Graph Neural Networks to Enhance Engineering Performance Prediction

Authors: Jangseop Park, Namwoo Kang

Comments: 17 pages, 8 figures, 3 tables

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[94] arXiv:2406.01993 (cross-list from eess.IV) [pdf, ps, other]: Title: Choroidal Vessel Segmentation on Indocyanine Green Angiography Images via Human-in-the-Loop Labeling

Authors: Ruoyu Chen (1), Ziwei Zhao (1), Mayinuer Yusufu (4 and 5), Xianwen Shang (1), Danli Shi (1 and 2), Mingguang He (1, 2 and 3) ((1) School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China. (2) Research Centre for SHARP Vision, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China. (3) Centre for Eye and Vision Research (CEVR), 17W Hong Kong Science Park, Hong Kong SAR, China. (4) Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, East Melbourne, Australia. (5) Department of Surgery (Ophthalmology), The University of Melbourne, Melbourne, Australia)

Comments: 25 pages, 4 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[95] arXiv:2406.01975 (cross-list from cs.LG) [pdf, other]: Title: Can Dense Connectivity Benefit Outlier Detection? An Odyssey with NAS

Authors: Hao Fu, Tunhou Zhang, Hai Li, Yiran Chen

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[96] arXiv:2406.01961 (cross-list from cs.RO) [pdf, other]: Title: Exploring Real World Map Change Generalization of Prior-Informed HD Map Prediction Models

Authors: Samuel M. Bateman, Ning Xu, H. Charles Zhao, Yael Ben Shalom, Vince Gong, Greg Long, Will Maddern

Comments: Accepted to CVPR 2024, Workshop on Autonomous Driving

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[97] arXiv:2406.01829 (cross-list from cs.NE) [pdf, other]: Title: FacAID: A Transformer Model for Neuro-Symbolic Facade Reconstruction

Authors: Aleksander Płocharski, Jan Swidzinski, Joanna Porter-Sobieraj, Przemyslaw Musialski

Comments: 11 pages, 10 figures, preprint

Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[98] arXiv:2406.01733 (cross-list from cs.LG) [pdf, other]: Title: Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching

Authors: Xinyin Ma, Gongfan Fang, Michael Bi Mi, Xinchao Wang

Comments: Code is available at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[99] arXiv:2406.01708 (cross-list from cs.CR) [pdf, other]: Title: Model for Peanuts: Hijacking ML Models without Training Access is Possible

Authors: Mahmoud Ghorbel, Halima Bouzidi, Ioan Marius Bilasco, Ihsen Alouani

Comments: 17 pages, 14 figures, 7 tables

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[100] arXiv:2406.01613 (cross-list from q-bio.QM) [pdf, other]: Title: QuST: QuPath Extension for Integrative Whole Slide Image and Spatial Transcriptomics Analysis

Authors: Chao-Hui Huang

Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[101] arXiv:2406.01605 (cross-list from eess.IV) [pdf, other]: Title: An Enhanced Encoder-Decoder Network Architecture for Reducing Information Loss in Image Semantic Segmentation

Authors: Zijun Gao, Qi Wang, Taiyuan Mei, Xiaohan Cheng, Yun Zi, Haowei Yang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[102] arXiv:2406.01604 (cross-list from cs.IR) [pdf, other]: Title: An Empirical Study of Excitation and Aggregation Design Adaptions in CLIP4Clip for Video-Text Retrieval

Authors: Xiaolun Jing, Genke Yang, Jian Chu

Comments: 20 pages

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)

Tue, 4 Jun 2024 (showing first 34 of 228 entries)

[103] arXiv:2406.01595 [pdf, other]: Title: MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild

Authors: Zeren Jiang, Chen Guo, Manuel Kaufmann, Tianjian Jiang, Julien Valentin, Otmar Hilliges, Jie Song

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[104] arXiv:2406.01594 [pdf, other]: Title: DiffUHaul: A Training-Free Method for Object Dragging in Images

Authors: Omri Avrahami, Rinon Gal, Gal Chechik, Ohad Fried, Dani Lischinski, Arash Vahdat, Weili Nie

Comments: Project page is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[105] arXiv:2406.01593 [pdf, other]: Title: Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting

Authors: Shaojie Ma, Yawei Luo, Yi Yang

Comments: Project Page: see this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[106] arXiv:2406.01592 [pdf, other]: Title: Text-guided Controllable Mesh Refinement for Interactive 3D Modeling

Authors: Yun-Chun Chen, Selena Ling, Zhiqin Chen, Vladimir G. Kim, Matheus Gadelha, Alec Jacobson

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Graphics (cs.GR); Machine Learning (cs.LG)
[107] arXiv:2406.01591 [pdf, other]: Title: DeNVeR: Deformable Neural Vessel Representations for Unsupervised Video Vessel Segmentation

Authors: Chun-Hung Wu, Shih-Hong Chen, Chih-Yao Hu, Hsin-Yu Wu, Kai-Hsin Chen, Yu-You Chen, Chih-Hai Su, Chih-Kuo Lee, Yu-Lun Liu

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[108] arXiv:2406.01584 [pdf, other]: Title: SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model

Authors: An-Chieh Cheng, Hongxu Yin, Yang Fu, Qiushan Guo, Ruihan Yang, Jan Kautz, Xiaolong Wang, Sifei Liu

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[109] arXiv:2406.01583 [pdf, other]: Title: Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP

Authors: Sriram Balasubramanian, Samyadeep Basu, Soheil Feizi

Comments: 22 pages, 15 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[110] arXiv:2406.01579 [pdf, other]: Title: Tetrahedron Splatting for 3D Generation

Authors: Chun Gu, Zeyu Yang, Zijie Pan, Xiatian Zhu, Li Zhang

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[111] arXiv:2406.01561 [pdf, other]: Title: Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation

Authors: Mingyuan Zhou, Zhendong Wang, Huangjie Zheng, Hai Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[112] arXiv:2406.01559 [pdf, other]: Title: Prototypical Transformer as Unified Motion Learners

Authors: Cheng Han, Yawen Lu, Guohao Sun, James C. Liang, Zhiwen Cao, Qifan Wang, Qiang Guan, Sohail A. Dianat, Raghuveer M. Rao, Tong Geng, Zhiqiang Tao, Dongfang Liu

Comments: 21 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113] arXiv:2406.01555 [pdf, other]: Title: Towards Flexible Interactive Reflection Removal with Human Guidance

Authors: Xiao Chen, Xudong Jiang, Yunkang Tao, Zhen Lei, Qing Li, Chenyang Lei, Zhaoxiang Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[114] arXiv:2406.01551 [pdf, other]: Title: ELSA: Evaluating Localization of Social Activities in Urban Streets

Authors: Maryam Hosseini, Marco Cipriano, Sedigheh Eslami, Daniel Hodczak, Liu Liu, Andres Sevtsuk, Gerard de Melo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[115] arXiv:2406.01494 [pdf, other]: Title: Robust Classification by Coupling Data Mollification with Label Smoothing

Authors: Markus Heinonen, Ba-Hien Tran, Michael Kampffmeyer, Maurizio Filippone

Comments: Under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[116] arXiv:2406.01493 [pdf, other]: Title: Learning Temporally Consistent Video Depth from Video Diffusion Priors

Authors: Jiahao Shao, Yuanbo Yang, Hongyu Zhou, Youmin Zhang, Yujun Shen, Matteo Poggi, Yiyi Liao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[117] arXiv:2406.01489 [pdf, other]: Title: DA-HFNet: Progressive Fine-Grained Forgery Image Detection and Localization Based on Dual Attention

Authors: Yang Liu, Xiaofei Li, Jun Zhang, Shengze Hu, Jun Lei

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[118] arXiv:2406.01486 [pdf, other]: Title: Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric Videos

Authors: Luigi Seminara, Giovanni Maria Farinella, Antonino Furnari

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[119] arXiv:2406.01480 [pdf, other]: Title: Towards Automating the Retrospective Generation of BIM Models: A Unified Framework for 3D Semantic Reconstruction of the Built Environment

Authors: Ka Lung Cheung, Chi Chung Lee

Comments: CVPRW 2024, Oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[120] arXiv:2406.01476 [pdf, other]: Title: DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians with Video Diffusion Priors

Authors: Tianyu Huang, Yihan Zeng, Hui Li, Wangmeng Zuo, Rynson W. H. Lau

Comments: Technical report. Codes are released at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121] arXiv:2406.01460 [pdf, other]: Title: MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization

Authors: Yu Zhang, Qi Zhang, Zixuan Gong, Yiwei Shi, Yepeng Liu, Duoqian Miao, Yang Liu, Ke Liu, Kun Yi, Wei Fan, Liang Hu, Changwei Wang

Comments: ICML 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[122] arXiv:2406.01455 [pdf, other]: Title: Automatic Fused Multimodal Deep Learning for Plant Identification

Authors: Alfreds Lapkovskis, Natalia Nefedova, Ali Beikmohammadi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[123] arXiv:2406.01451 [pdf, other]: Title: SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation

Authors: Danni Yang, Jiayi Ji, Yiwei Ma, Tianyu Guo, Haowei Wang, Xiaoshuai Sun, Rongrong Ji

Comments: Accepted by ICML2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[124] arXiv:2406.01449 [pdf, other]: Title: SLANT: Spurious Logo ANalysis Toolkit

Authors: Maan Qraitem, Piotr Teterwak, Kate Saenko, Bryan A. Plummer

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[125] arXiv:2406.01432 [pdf, other]: Title: ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models

Authors: Thanh-Dat Truong, Xin Li, Bhiksha Raj, Jackson Cothren, Khoa Luu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126] arXiv:2406.01429 [pdf, other]: Title: EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding

Authors: Thanh-Dat Truong, Utsav Prabhu, Dongyi Wang, Bhiksha Raj, Susan Gauch, Jeyamkondan Subbiah, Khoa Luu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127] arXiv:2406.01425 [pdf, other]: Title: Sensitivity-Informed Augmentation for Robust Segmentation

Authors: Laura Zheng, Wenjie Wei, Tony Wu, Jacob Clements, Shreelekha Revankar, Andre Harrison, Yu Shen, Ming C. Lin

Comments: 10 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128] arXiv:2406.01402 [pdf, other]: Title: Mixture of Rationale: Multi-Modal Reasoning Mixture for Visual Question Answering

Authors: Tao Li, Linjun Shou, Xuejun Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[129] arXiv:2406.01395 [pdf, other]: Title: TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation

Authors: Antonio Santo, Juan J. Cabrera, David Valiente, Carlos Viegas, Arturo Gil

Comments: This work has been submitted to the IEEE Transactions on Intelligent Vehicles for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[130] arXiv:2406.01388 [pdf, other]: Title: AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

Authors: Junhao Cheng, Xi Lu, Hanhui Li, Khun Loun Zai, Baiqiao Yin, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131] arXiv:2406.01380 [pdf, other]: Title: Convolutional Unscented Kalman Filter for Multi-Object Tracking with Outliers

Authors: Shiqi Liu, Wenhan Cao, Chang Liu, Tianyi Zhang, Shengbo Eben Li

Comments: 11 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[132] arXiv:2406.01365 [pdf, other]: Title: From Feature Visualization to Visual Circuits: Effect of Adversarial Model Manipulation

Authors: Geraldin Nanfack, Michael Eickenberg, Eugene Belilovsky

Comments: Under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[133] arXiv:2406.01356 [pdf, other]: Title: MP-PolarMask: A Faster and Finer Instance Segmentation for Concave Images

Authors: Ke-Lei Wang, Pin-Hsuan Chou, Young-Ching Chou, Chia-Jen Liu, Cheng-Kuan Lin, Yu-Chee Tseng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[134] arXiv:2406.01355 [pdf, other]: Title: Differentially Private Fine-Tuning of Diffusion Models

Authors: Yu-Lin Tsai, Yizhe Li, Zekai Chen, Po-Yu Chen, Chia-Mu Yu, Xuebin Ren, Francois Buet-Golfouse

Comments: 16 pages, 5 figures, 11 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[135] arXiv:2406.01349 [pdf, other]: Title: Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation

Authors: Enhui Ma, Lijun Zhou, Tao Tang, Zhan Zhang, Dong Han, Junpeng Jiang, Kun Zhan, Peng Jia, Xianpeng Lang, Haiyang Sun, Di Lin, Kaicheng Yu

Comments: Project Page: this https URL, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[136] arXiv:2406.01337 [pdf, other]: Title: ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds

Authors: Ka Lung Cheung, Chi Chung Lee

Comments: CVPRW 2024 (Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV)


