Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 32

[ total of 729 entries: 1-100 | 33-132 | 133-232 | 233-332 | 333-432 | ... | 633-729 ]
[ showing 100 entries per page: fewer | more | all ]

Tue, 4 Jun 2024 (continued, showing 100 of 228 entries)

[33] arXiv:2406.01349 [pdf, other]: Title: Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation

Authors: Enhui Ma, Lijun Zhou, Tao Tang, Zhan Zhang, Dong Han, Junpeng Jiang, Kun Zhan, Peng Jia, Xianpeng Lang, Haiyang Sun, Di Lin, Kaicheng Yu

Comments: Project Page: this https URL, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[34] arXiv:2406.01337 [pdf, other]: Title: ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds

Authors: Ka Lung Cheung, Chi Chung Lee

Comments: CVPRW 2024 (Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[35] arXiv:2406.01334 [pdf, other]: Title: HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models

Authors: Mengcheng Li, Hongwen Zhang, Yuxiang Zhang, Ruizhi Shao, Tao Yu, Yebin Liu

Comments: accepted in CVPR2024, project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[36] arXiv:2406.01326 [pdf, other]: Title: TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy

Authors: Weichao Zhao, Hao Feng, Qi Liu, Jingqun Tang, Shu Wei, Binghong Wu, Lei Liao, Yongjie Ye, Hao Liu, Houqiang Li, Can Huang

Comments: 20 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[37] arXiv:2406.01316 [pdf, other]: Title: Enhancing Inertial Hand based HAR through Joint Representation of Language, Pose and Synthetic IMUs

Authors: Vitor Fortes Rey, Lala Shakti Swarup Ray, Xia Qingxin, Kaishun Wu, Paul Lukowicz

Comments: Review Copy

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[38] arXiv:2406.01315 [pdf, other]: Title: Scale-Free Image Keypoints Using Differentiable Persistent Homology

Authors: Giovanni Barbarani, Francesco Vaccarino, Gabriele Trivigno, Marco Guerra, Gabriele Berton, Carlo Masone

Comments: Accepted to ICML 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Algebraic Topology (math.AT)
[39] arXiv:2406.01314 [pdf, other]: Title: Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization

Authors: Firas Khader, Omar S. M. El Nahhas, Tianyu Han, Gustav Müller-Franzes, Sven Nebelung, Jakob Nikolas Kather, Daniel Truhn

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[40] arXiv:2406.01302 [pdf, ps, other]: Title: Pulmonary Embolism Mortality Prediction Using Multimodal Learning Based on Computed Tomography Angiography and Clinical Data

Authors: Zhusi Zhong, Helen Zhang, Fayez H. Fayad, Andrew C. Lancaster, John Sollee, Shreyas Kulkarni, Cheng Ting Lin, Jie Li, Xinbo Gao, Scott Collinsa, Sun H. Ahn, Harrison X. Bai, Zhicheng Jiao, Michael K. Atalay

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41] arXiv:2406.01300 [pdf, other]: Title: pOps: Photo-Inspired Diffusion Operators

Authors: Elad Richardson, Yuval Alaluf, Ali Mahdavi-Amiri, Daniel Cohen-Or

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[42] arXiv:2406.01294 [pdf, other]: Title: Capsule Enhanced Variational AutoEncoder for Underwater Image Reconstruction

Authors: Rita Pucci, Niki Martinel

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[43] arXiv:2406.01278 [pdf, other]: Title: fruit-SALAD: A Style Aligned Artwork Dataset to reveal similarity perception in image embeddings

Authors: Tillmann Ohm, Andres Karjus, Mikhail Tamm, Maximilian Schich

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computational Complexity (cs.CC); Machine Learning (cs.LG)
[44] arXiv:2406.01264 [pdf, other]: Title: FreeTumor: Advance Tumor Segmentation via Large-Scale Tumor Synthesis

Authors: Linshan Wu, Jiaxin Zhuang, Xuefeng Ni, Hao Chen

Comments: Preprint

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[45] arXiv:2406.01256 [pdf, other]: Title: Augmented Commonsense Knowledge for Remote Object Grounding

Authors: Bahram Mohammadi, Yicong Hong, Yuankai Qi, Qi Wu, Shirui Pan, Javen Qinfeng Shi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[46] arXiv:2406.01210 [pdf, other]: Title: GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer

Authors: Ding Jia, Jianyuan Guo, Kai Han, Han Wu, Chao Zhang, Chang Xu, Xinghao Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[47] arXiv:2406.01203 [pdf, other]: Title: Scaling Up Deep Clustering Methods Beyond ImageNet-1K

Authors: Nikolas Adaloglou, Felix Michels, Kaspar Senft, Diana Petrusheva, Markus Kollmann

Comments: Work in progress

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[48] arXiv:2406.01196 [pdf, other]: Title: 3D WholeBody Pose Estimation based on Semantic Graph Attention Network and Distance Information

Authors: Sihan Wen, Xiantan Zhu, Zhiming Tan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[49] arXiv:2406.01194 [pdf, other]: Title: AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation

Authors: Lorenzo Mur-Labadia, Ruben Martinez-Cantin, Josechu Guerrero, Giovanni Maria Farinella, Antonino Furnari

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50] arXiv:2406.01188 [pdf, other]: Title: UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation

Authors: Xiang Wang, Shiwei Zhang, Changxin Gao, Jiayu Wang, Xiaoqiang Zhou, Yingya Zhang, Luxin Yan, Nong Sang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[51] arXiv:2406.01170 [pdf, other]: Title: Zero-Shot Out-of-Distribution Detection with Outlier Label Exposure

Authors: Choubo Ding, Guansong Pang

Comments: Accepted by IJCNN2024, 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[52] arXiv:2406.01159 [pdf, other]: Title: Dimba: Transformer-Mamba Diffusion Models

Authors: Zhengcong Fei, Mingyuan Fan, Changqian Yu, Debang Li, Youqiang Zhang, Junshi Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[53] arXiv:2406.01154 [pdf, other]: Title: DeepUniUSTransformer: Towards A Universal UltraSound Model with Prompted Guidance

Authors: Zehui Lin, Zhuoneng Zhang, Xindi Hu, Zhifan Gao, Xin Yang, Yue Sun, Dong Ni, Tao Tan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54] arXiv:2406.01136 [pdf, other]: Title: Towards Practical Single-shot Motion Synthesis

Authors: Konstantinos Roditakis, Spyridon Thermos, Nikolaos Zioulis

Comments: CVPR 2024, AI for 3D Generation Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[55] arXiv:2406.01127 [pdf, other]: Title: Learning Adaptive Fusion Bank for Multi-modal Salient Object Detection

Authors: Kunpeng Wang, Zhengzheng Tu, Chenglong Li, Cheng Zhang, Bin Luo

Comments: Accepted by TCSVT 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[56] arXiv:2406.01125 [pdf, other]: Title: $Δ$-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers

Authors: Pengtao Chen, Mingzhu Shen, Peng Ye, Jianjian Cao, Chongjun Tu, Christos-Savvas Bouganis, Yiren Zhao, Tao Chen

Comments: 12 pages, 6 figures, 6 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[57] arXiv:2406.01112 [pdf, other]: Title: BACON: Bayesian Optimal Condensation Framework for Dataset Distillation

Authors: Zheng Zhou, Hongbo Zhao, Guangliang Cheng, Xiangtai Li, Shuchang Lyu, Wenquan Feng, Qi Zhao

Comments: 22 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[58] arXiv:2406.01079 [pdf, other]: Title: Object Aware Egocentric Online Action Detection

Authors: Joungbin An, Yunsu Park, Hyolim Kang, Seon Joo Kim

Comments: CVPR First Joint Egocentric Vision Workshop 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[59] arXiv:2406.01078 [pdf, other]: Title: CUT: A Controllable, Universal, and Training-Free Visual Anomaly Generation Framework

Authors: Han Sun, Yunkang Cao, Olga Fink

Comments: 9 pages excluding appendix

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[60] arXiv:2406.01076 [pdf, other]: Title: Estimating Canopy Height at Scale

Authors: Jan Pauls, Max Zimmer, Una M. Kelly, Martin Schwartz, Sassan Saatchi, Philippe Ciais, Sebastian Pokutta, Martin Brandt, Fabian Gieseke

Comments: ICML Camera-Ready, 17 pages, 14 figures, 7 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[61] arXiv:2406.01073 [pdf, other]: Title: Understanding the Cross-Domain Capabilities of Video-Based Few-Shot Action Recognition Models

Authors: Georgia Markham, Mehala Balamurali, Andrew J. Hill

Comments: Preprint. Under review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[62] arXiv:2406.01071 [pdf, other]: Title: Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline

Authors: Jan Lippemeier, Stefanie Hittmeyer, Oliver Niehörster, Markus Lange-Hegermann

Comments: 10 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[63] arXiv:2406.01069 [pdf, other]: Title: UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment

Authors: Hantao Zhou, Longxiang Tang, Rui Yang, Guanyi Qin, Yan Zhang, Runze Hu, Xiu Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[64] arXiv:2406.01063 [pdf, other]: Title: DANCE: Dual-View Distribution Alignment for Dataset Condensation

Authors: Hansong Zhang, Shikun Li, Fanzhao Lin, Weiping Wang, Zhenxing Qian, Shiming Ge

Comments: This work has been accepted by IJCAI-24

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[65] arXiv:2406.01062 [pdf, other]: Title: SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models

Authors: Qilong Zhangli, Jindong Jiang, Di Liu, Licheng Yu, Xiaoliang Dai, Ankit Ramchandani, Guan Pang, Dimitris N. Metaxas, Praveen Krishnan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[66] arXiv:2406.01059 [pdf, other]: Title: VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model

Authors: Jinze Yang, Haoran Wang, Zining Zhu, Chenglong Liu, Meng Wymond Wu, Zeke Xie, Zhong Ji, Jungong Han, Mingming Sun

Comments: 15 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[67] arXiv:2406.01056 [pdf, other]: Title: Virtual avatar generation models as world navigators

Authors: Sai Mandava

Comments: 16 pages, 15 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Robotics (cs.RO)
[68] arXiv:2406.01042 [pdf, other]: Title: Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting

Authors: Fang Li, Hao Zhang, Narendra Ahuja

Comments: GitHub Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[69] arXiv:2406.01040 [pdf, other]: Title: Synthetic Data Generation for 3D Myocardium Deformation Analysis

Authors: Shahar Zuler, Dan Raviv

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[70] arXiv:2406.01033 [pdf, ps, other]: Title: Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement

Authors: Yung-Hui Lin, Yu-Wen Chang, Huang-Chia Shih, Takahiro Ogawa

Comments: 10 pages, 6 figures, 5 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[71] arXiv:2406.01029 [pdf, other]: Title: CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship Modeling in Aerial Videos

Authors: Trong-Thuan Nguyen, Pha Nguyen, Xin Li, Jackson Cothren, Alper Yilmaz, Khoa Luu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72] arXiv:2406.01028 [pdf, other]: Title: LLEMamba: Low-Light Enhancement via Relighting-Guided Mamba with Deep Unfolding Network

Authors: Xuanqi Zhang, Haijin Zeng, Jinwang Pan, Qiangqiang Shen, Yongyong Chen

Comments: 9pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[73] arXiv:2406.01025 [pdf, ps, other]: Title: Khayyam Offline Persian Handwriting Dataset

Authors: Pourya Jafarzadeh, Padideh Choobdar, Vahid Mohammadi Safarzadeh

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74] arXiv:2406.01020 [pdf, other]: Title: CLIP-Guided Attribute Aware Pretraining for Generalizable Image Quality Assessment

Authors: Daekyu Kwon, Dongyoung Kim, Sehwan Ki, Younghyun Jo, Hyong-Euk Lee, Seon Joo Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75] arXiv:2406.01003 [pdf, other]: Title: Uni-ISP: Unifying the Learning of ISPs from Multiple Cameras

Authors: Lingen Li, Mingde Yao, Xingyu Meng, Muquan Yu, Tianfan Xue, Jinwei Gu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[76] arXiv:2406.00985 [pdf, other]: Title: MultiEdits: Simultaneous Multi-Aspect Editing with Text-to-Image Diffusion Models

Authors: Mingzhen Huang, Jialing Cai, Shan Jia, Vishnu Suresh Lokhande, Siwei Lyu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77] arXiv:2406.00977 [pdf, other]: Title: Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model

Authors: Kezhen Chen, Rahul Thapa, Rahul Chalamala, Ben Athiwaratkun, Shuaiwen Leon Song, James Zou

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[78] arXiv:2406.00971 [pdf, other]: Title: MiniGPT-Reverse-Designing: Predicting Image Adjustments Utilizing MiniGPT-4

Authors: Vahid Azizi, Fatemeh Koochaki

Comments: 8 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[79] arXiv:2406.00956 [pdf, other]: Title: Improving Segment Anything on the Fly: Auxiliary Online Learning and Adaptive Fusion for Medical Image Segmentation

Authors: Tianyu Huang, Tao Zhou, Weidi Xie, Shuo Wang, Qi Dou, Yizhe Zhang

Comments: Project Link: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[80] arXiv:2406.00955 [pdf, other]: Title: How Video Meetings Change Your Expression

Authors: Sumit Sarin, Utkarsh Mall, Purva Tendulkar, Carl Vondrick

Comments: Project webpage is available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[81] arXiv:2406.00947 [pdf, other]: Title: Cross-Dimensional Medical Self-Supervised Representation Learning Based on a Pseudo-3D Transformation

Authors: Fei Gao, Siwen Wang, Churan Wang, Fandong Zhang, Hong-Yu Zhou, Yizhou Wang, Gang Yu, Yizhou Yu

Comments: MICCAI 2024 accept

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[82] arXiv:2406.00934 [pdf, other]: Title: LanEvil: Benchmarking the Robustness of Lane Detection to Environmental Illusions

Authors: Tianyuan Zhang, Lu Wang, Hainan Li, Yisong Xiao, Siyuan Liang, Aishan Liu, Xianglong Liu, Dacheng Tao

Comments: Submitted to ACM MM 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83] arXiv:2406.00929 [pdf, other]: Title: Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry

Authors: Takayuki Kanai, Igor Vasiljevic, Vitor Guizilini, Kazuhiro Shintani

Comments: 8 pages. 5 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[84] arXiv:2406.00919 [pdf, other]: Title: Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling

Authors: Jinxing Zhou, Dan Guo, Yiran Zhong, Meng Wang

Comments: IJCV 2024 Accepted. arXiv admin note: substantial text overlap with arXiv:2303.02344

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[85] arXiv:2406.00917 [pdf, other]: Title: Alignment-Free RGBT Salient Object Detection: Semantics-guided Asymmetric Correlation Network and A Unified Benchmark

Authors: Kunpeng Wang, Danying Lin, Chenglong Li, Zhengzheng Tu, Bin Luo

Comments: Accepted by TMM 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[86] arXiv:2406.00908 [pdf, other]: Title: ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation

Authors: Shaoshu Yang, Yong Zhang, Xiaodong Cun, Ying Shan, Ran He

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[87] arXiv:2406.00907 [pdf, other]: Title: DDA: Dimensionality Driven Augmentation Search for Contrastive Learning in Laparoscopic Surgery

Authors: Yuning Zhou, Henry Badgery, Matthew Read, James Bailey, Catherine E. Davey

Comments: 29 pages, 16 figures; MIDL 2024 - Medical Imaging with Deep Learning

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[88] arXiv:2406.00891 [pdf, other]: Title: Global High Categorical Resolution Land Cover Mapping via Weak Supervision

Authors: Xin-Yi Tong, Runmin Dong, Xiao Xiang Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89] arXiv:2406.00885 [pdf, other]: Title: Visual place recognition for aerial imagery: A survey

Authors: Ivan Moskalenko, Anastasiia Kornilova, Gonzalo Ferrer

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[90] arXiv:2406.00872 [pdf, other]: Title: OLIVE: Object Level In-Context Visual Embeddings

Authors: Timothy Ossowski, Junjie Hu

Comments: ACL 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[91] arXiv:2406.00856 [pdf, other]: Title: DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection

Authors: Yewon Lim, Changyeon Lee, Aerin Kim, Oren Etzioni

Comments: 6 pages, 1 figure

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[92] arXiv:2406.00848 [pdf, ps, other]: Title: Eating Smart: Advancing Health Informatics with the Grounding DINO based Dietary Assistant App

Authors: Abdelilah Nossair, Hamza El Housni

Comments: The work presented in this paper was part of the proceedings for the First International Conference on Artificial Intelligence (ICATA 2024)

Journal-ref: Eating Smart: Advancing Health Informatics with the Grounding DINO-based Dietary Assistant App, International Journal of Scientific and Innovative Studies, June 2024, Volume 3, Number 3, Pages 26-34, Available online at IJSRIS

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93] arXiv:2406.00830 [pdf, other]: Title: Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection

Authors: Yang Cao, Yihan Zeng, Hang Xu, Dan Xu

Comments: Code Page: this https URL This paper has been submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) for possible publication

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[94] arXiv:2406.00828 [pdf, other]: Title: Stealing Image-to-Image Translation Models With a Single Query

Authors: Nurit Spingarn-Eliezer, Tomer Michaeli

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95] arXiv:2406.00808 [pdf, other]: Title: EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing

Authors: Hadrien Reynaud, Qingjie Meng, Mischa Dombrowski, Arijit Ghosh, Thomas Day, Alberto Gomez, Paul Leeson, Bernhard Kainz

Comments: Accepted at MICCAI 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[96] arXiv:2406.00798 [pdf, other]: Title: PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency

Authors: Yeonsung Jung, Heecheol Yun, Joonhyung Park, Jin-Hwa Kim, Eunho Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[97] arXiv:2406.00791 [pdf, other]: Title: Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor

Authors: Lei Liu, Zhihao Hu, Zhenghao Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[98] arXiv:2406.00783 [pdf, other]: Title: AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark

Authors: Li Lin, Santosh, Xin Wang, Shu Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[99] arXiv:2406.00777 [pdf, other]: Title: Diffusion Features to Bridge Domain Gap for Semantic Segmentation

Authors: Yuxiang Ji, Boyong He, Chenyuan Qu, Zhuoyue Tan, Chuan Qin, Liaoni Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[100] arXiv:2406.00772 [pdf, other]: Title: Unsupervised Contrastive Analysis for Salient Pattern Detection using Conditional Diffusion Models

Authors: Cristiano Patrício, Carlo Alberto Barbano, Attilio Fiandrotti, Riccardo Renzulli, Marco Grangetto, Luis F. Teixeira, João C. Neves

Comments: 18 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[101] arXiv:2406.00750 [pdf, other]: Title: Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models

Authors: Wenqiang Sun, Zhengyi Wang, Shuo Chen, Yikai Wang, Zilong Chen, Jun Zhu, Jun Zhang

Comments: project can be found in: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[102] arXiv:2406.00749 [pdf, other]: Title: CCF: Cross Correcting Framework for Pedestrian Trajectory Prediction

Authors: Pranav Singh Chib, Pravendra Singh

Comments: Under review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[103] arXiv:2406.00721 [pdf, other]: Title: Explore Internal and External Similarity for Single Image Deraining with Graph Neural Networks

Authors: Cong Wang, Wei Wang, Chengjin Yu, Jie Mu

Comments: IJCAI-24; Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[104] arXiv:2406.00714 [pdf, other]: Title: A Survey of Deep Learning Based Radar and Vision Fusion for 3D Object Detection in Autonomous Driving

Authors: Di Wu, Feng Yang, Benlian Xu, Pan Liao, Bo Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[105] arXiv:2406.00704 [pdf, other]: Title: An Optimized Toolbox for Advanced Image Processing with Tsetlin Machine Composites

Authors: Ylva Grønningsæter, Halvor S. Smørvik, Ole-Christoffer Granmo

Comments: 8 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[106] arXiv:2406.00699 [pdf, other]: Title: Towards General Robustness Verification of MaxPool-based Convolutional Neural Networks via Tightening Linear Approximation

Authors: Yuan Xiao, Shiqing Ma, Juan Zhai, Chunrong Fang, Jinyuan Jia, Zhenyu Chen

Comments: Accepted to CVPR2024. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[107] arXiv:2406.00696 [pdf, ps, other]: Title: Bilinear-Convolutional Neural Network Using a Matrix Similarity-based Joint Loss Function for Skin Disease Classification

Authors: Belal Ahmad, Mohd Usama, Tanvir Ahmad, Adnan Saeed, Shabnam Khatoon, Long Hu

Comments: 16 pages, 11 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[108] arXiv:2406.00687 [pdf, other]: Title: Lay-A-Scene: Personalized 3D Object Arrangement Using Text-to-Image Priors

Authors: Ohad Rahamim, Hilit Segev, Idan Achituve, Yuval Atzmon, Yoni Kasten, Gal Chechik

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[109] arXiv:2406.00685 [pdf, other]: Title: Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training

Authors: Jiacheng Zhang, Feng Liu, Dawei Zhou, Jingfeng Zhang, Tongliang Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[110] arXiv:2406.00684 [pdf, other]: Title: Deciphering Oracle Bone Language with Diffusion Models

Authors: Haisu Guan, Huanxin Yang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu

Comments: ACL2024 main conference long paper

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[111] arXiv:2406.00676 [pdf, other]: Title: W-Net: A Facial Feature-Guided Face Super-Resolution Network

Authors: Hao Liu, Yang Yang, Yunxia Liu

Comments: 15 pages,9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[112] arXiv:2406.00672 [pdf, other]: Title: Task-oriented Embedding Counts: Heuristic Clustering-driven Feature Fine-tuning for Whole Slide Image Classification

Authors: Xuenian Wang, Shanshan Shi, Renao Yan, Qiehe Sun, Lianghui Zhu, Tian Guan, Yonghong He

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113] arXiv:2406.00670 [pdf, other]: Title: Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Authors: Yunheng Li, ZhongYu Li, Quansheng Zeng, Qibin Hou, Ming-Ming Cheng

Comments: Accepted by ICML 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[114] arXiv:2406.00663 [pdf, other]: Title: SimSAM: Zero-shot Medical Image Segmentation via Simulated Interaction

Authors: Benjamin Towle, Xin Chen, Ke Zhou

Comments: Published at ISBI 2024. Awarded Top 12 Oral Presentation

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[115] arXiv:2406.00644 [pdf, other]: Title: Ultrasound Report Generation with Cross-Modality Feature Alignment via Unsupervised Guidance

Authors: Jun Li, Tongkun Su, Baoliang Zhao, Faqin Lv, Qiong Wang, Nassir Navab, Ying Hu, Zhongliang Jiang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[116] arXiv:2406.00639 [pdf, other]: Title: An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition

Authors: Haojun Xu, Yan Gao, Jie Li, Xinbo Gao

Comments: 12 pages, 8 figures init commit

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[117] arXiv:2406.00637 [pdf, other]: Title: Representing Animatable Avatar via Factorized Neural Fields

Authors: Chunjin Song, Zhijie Wu, Bastian Wandt, Leonid Sigal, Helge Rhodin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[118] arXiv:2406.00636 [pdf, other]: Title: T2LM: Long-Term 3D Human Motion Generation from Multiple Sentences

Authors: Taeryung Lee, Fabien Baradel, Thomas Lucas, Kyoung Mu Lee, Gregory Rogez

Comments: CVPR 2024 HuMoGen Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[119] arXiv:2406.00632 [pdf, other]: Title: Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion Prior

Authors: Yukai Shi, Yupei Lin, Pengxu Wei, Xiaoyu Xian, Tianshui Chen, Liang Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[120] arXiv:2406.00631 [pdf, other]: Title: MGI: Multimodal Contrastive pre-training of Genomic and Medical Imaging

Authors: Jiaying Zhou, Mingzhou Jiang, Junde Wu, Jiayuan Zhu, Ziyue Wang, Yueming Jin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121] arXiv:2406.00629 [pdf, other]: Title: Correlation Matching Transformation Transformers for UHD Image Restoration

Authors: Cong Wang, Jinshan Pan, Wei Wang, Gang Fu, Siyuan Liang, Mengzhu Wang, Xiao-Ming Wu, Jun Liu

Comments: AAAI-24; Source codes, datasets, visual results, and pre-trained models are: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[122] arXiv:2406.00625 [pdf, other]: Title: SAM-LAD: Segment Anything Model Meets Zero-Shot Logic Anomaly Detection

Authors: Yun Peng, Xiao Lin, Nachuan Ma, Jiayuan Du, Chuangwei Liu, Chengju Liu, Qijun Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[123] arXiv:2406.00622 [pdf, other]: Title: Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering

Authors: Xingrui Wang, Wufei Ma, Angtian Wang, Shuo Chen, Adam Kortylewski, Alan Yuille

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[124] arXiv:2406.00609 [pdf, other]: Title: SuperGaussian: Repurposing Video Models for 3D Super Resolution

Authors: Yuan Shen, Duygu Ceylan, Paul Guerrero, Zexiang Xu, Niloy J. Mitra, Shenlong Wang, Anna Frühstück

Comments: Check our project website for details: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[125] arXiv:2406.00600 [pdf, other]: Title: Kolmogorov-Arnold Network for Satellite Image Classification in Remote Sensing

Authors: Minjong Cheon

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Data Analysis, Statistics and Probability (physics.data-an)
[126] arXiv:2406.00598 [pdf, other]: Title: Efficient Neural Light Fields (ENeLF) for Mobile Devices

Authors: Austin Peng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127] arXiv:2406.00589 [pdf, other]: Title: Robust Visual Tracking via Iterative Gradient Descent and Threshold Selection

Authors: Zhuang Qi, Junlin Zhang, Xin Qi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[128] arXiv:2406.00587 [pdf, other]: Title: Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024

Authors: Biao Wu, Diankai Zhang, Si Gao, Chengjian Zheng, Shaoli Liu, Ning Wang

Comments: Champion Solution for CVPR 2024 PVUW VSS Track. arXiv admin note: text overlap with arXiv:2306.02894

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[129] arXiv:2406.00571 [pdf, other]: Title: An Image Segmentation Model with Transformed Total Variation

Authors: Elisha Dayag, Kevin Bui, Fredrick Park, Jack Xin

Comments: Accepted to EUSIPCO'24

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Numerical Analysis (math.NA)
[130] arXiv:2406.00545 [pdf, ps, other]: Title: Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation

Authors: Xinyue Chen, Miaojing Shi

Comments: ICME 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[131] arXiv:2406.00512 [pdf, ps, other]: Title: On the use of first and second derivative approximations for biometric online signature recognition

Authors: Marcos Faundez-Zanuy, Moises Diaz

Comments: Advances in Computational Intelligence. IWANN 2023. pp 461 to 472

Journal-ref: Lecture Notes in Computer Science, vol 14134, 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[132] arXiv:2406.00510 [pdf, other]: Title: Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection

Authors: Jiaming Li, Jiacheng Zhang, Jichang Li, Ge Li, Si Liu, Liang Lin, Guanbin Li

Comments: CVPR2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)

[ total of 729 entries: 1-100 | 33-132 | 133-232 | 233-332 | 333-432 | ... | 633-729 ]
[ showing 100 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 32

Tue, 4 Jun 2024 (continued, showing 100 of 228 entries)