Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 136

[ total of 593 entries: 1-104 | 33-136 | 137-240 | 241-344 | 345-448 | 449-552 | 553-593 ]
[ showing 104 entries per page: fewer | more | all ]

Thu, 25 Apr 2024 (continued, showing last 61 of 85 entries)

[137] arXiv:2404.15812 [pdf, other]: Title: Facilitating Advanced Sentinel-2 Analysis Through a Simplified Computation of Nadir BRDF Adjusted Reflectance

Authors: David Montero, Miguel D. Mahecha, César Aybar, Clemens Mosig, Sebastian Wieneke

Comments: Submitted to FOSS4G Europe 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Instrumentation and Methods for Astrophysics (astro-ph.IM)
[138] arXiv:2404.15802 [pdf, other]: Title: Raformer: Redundancy-Aware Transformer for Video Wire Inpainting

Authors: Zhong Ji, Yimu Su, Yan Zhang, Jiacheng Hou, Yanwei Pang, Jungong Han

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[139] arXiv:2404.15790 [pdf, other]: Title: Leveraging Large Language Models for Multimodal Search

Authors: Oriol Barbany, Michael Huang, Xinliang Zhu, Arnab Dhua

Comments: Published at CVPRW 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140] arXiv:2404.15789 [pdf, other]: Title: MotionMaster: Training-free Camera Motion Transfer For Video Generation

Authors: Teng Hu, Jiangning Zhang, Ran Yi, Yating Wang, Hongrui Huang, Jieyu Weng, Yabiao Wang, Lizhuang Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[141] arXiv:2404.15785 [pdf, other]: Title: Seeing Beyond Classes: Zero-Shot Grounded Situation Recognition via Language Explainer

Authors: Jiaming Lei, Lin Li, Chunping Wang, Jun Xiao, Long Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[142] arXiv:2404.15781 [pdf, other]: Title: Real-Time Compressed Sensing for Joint Hyperspectral Image Transmission and Restoration for CubeSat

Authors: Chih-Chung Hsu, Chih-Yu Jian, Eng-Shen Tu, Chia-Ming Lee, Guan-Lin Chen

Comments: Accepted by TGRS 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[143] arXiv:2404.15774 [pdf, other]: Title: Toward Physics-Aware Deep Learning Architectures for LiDAR Intensity Simulation

Authors: Vivek Anand, Bharat Lohani, Gaurav Pandey, Rakesh Mishra

Comments: 7 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[144] arXiv:2404.15771 [pdf, other]: Title: DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines

Authors: Xin Jiang, Hao Tang, Rui Yan, Jinhui Tang, Zechao Li

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[145] arXiv:2404.15770 [pdf, other]: Title: ChEX: Interactive Localization and Region Description in Chest X-rays

Authors: Philip Müller, Georgios Kaissis, Daniel Rueckert

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[146] arXiv:2404.15765 [pdf, other]: Title: 3D Face Morphing Attack Generation using Non-Rigid Registration

Authors: Jag Mohan Singh, Raghavendra Ramachandra

Comments: Accepted to 2024 18th International Conference on Automatic Face and Gesture Recognition (FG) as short paper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[147] arXiv:2404.15743 [pdf, other]: Title: SRAGAN: Saliency Regularized and Attended Generative Adversarial Network for Chinese Ink-wash Painting Generation

Authors: Xiang Gao, Yuqi Zhang

Comments: 25 pages, 14 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148] arXiv:2404.15736 [pdf, other]: Title: What Makes Multimodal In-Context Learning Work?

Authors: Folco Bertini Baldassini, Mustafa Shukor, Matthieu Cord, Laure Soulier, Benjamin Piwowarski

Comments: 20 pages, 16 figures. Accepted to CVPR 2024 Workshop on Prompting in Vision. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[149] arXiv:2404.15734 [pdf, other]: Title: Fine-grained Spatial-temporal MLP Architecture for Metro Origin-Destination Prediction

Authors: Yang Liu, Binglin Chen, Yongsen Zheng, Guanbin Li, Liang Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[150] arXiv:2404.15721 [pdf, other]: Title: SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision

Authors: Ankit Vani, Bac Nguyen, Samuel Lavoie, Ranjay Krishna, Aaron Courville

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[151] arXiv:2404.15719 [pdf, other]: Title: HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition

Authors: Jinfu Liu, Baiqiao Yin, Jiaying Lin, Jiajun Wen, Yue Li, Mengyuan Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[152] arXiv:2404.15714 [pdf, other]: Title: Ada-DF: An Adaptive Label Distribution Fusion Network For Facial Expression Recognition

Authors: Shu Liu, Yan Xu, Tongming Wan, Xiaoyan Kui

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[153] arXiv:2404.15709 [pdf, other]: Title: ViViDex: Learning Vision-based Dexterous Manipulation from Human Videos

Authors: Zerui Chen, Shizhe Chen, Cordelia Schmid, Ivan Laptev

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[154] arXiv:2404.15707 [pdf, other]: Title: ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images

Authors: Jinseo Jeong, Junseo Koo, Qimeng Zhang, Gunhee Kim

Comments: CVPR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[155] arXiv:2404.15700 [pdf, other]: Title: MAS-SAM: Segment Any Marine Animal with Aggregated Features

Authors: Tianyu Yan, Zifu Wan, Xinhao Deng, Pingping Zhang, Yang Liu, Huchuan Lu

Comments: Accepted by IJCAI2024. More modifications may be performed

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[156] arXiv:2404.15697 [pdf, other]: Title: DeepFeatureX Net: Deep Features eXtractors based Network for discriminating synthetic from real images

Authors: Orazio Pontorno (1), Luca Guarnera (1), Sebastiano Battiato (1) ((1) University of Catania)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[157] arXiv:2404.15683 [pdf, other]: Title: AnoFPDM: Anomaly Segmentation with Forward Process of Diffusion Models for Brain MRI

Authors: Yiming Che, Fazle Rafsani, Jay Shah, Md Mahfuzur Rahman Siddiquee, Teresa Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[158] arXiv:2404.15677 [pdf, other]: Title: CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models

Authors: Qinghe Wang, Baolu Li, Xiaomin Li, Bing Cao, Liqian Ma, Huchuan Lu, Xu Jia

Comments: Code will be released very soon: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[159] arXiv:2404.15672 [pdf, other]: Title: Representing Part-Whole Hierarchies in Foundation Models by Learning Localizability, Composability, and Decomposability from Anatomy via Self-Supervision

Authors: Mohammad Reza Hosseinzadeh Taher, Michael B. Gotway, Jianming Liang

Comments: Accepted at CVPR 2024 [main conference]

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160] arXiv:2404.15655 [pdf, other]: Title: Multi-Modal Proxy Learning Towards Personalized Visual Multiple Clustering

Authors: Jiawei Yao, Qi Qian, Juhua Hu

Comments: Accepted by CVPR 2024. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[161] arXiv:2404.15653 [pdf, other]: Title: CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

Authors: Sachin Mehta, Maxwell Horton, Fartash Faghri, Mohammad Hossein Sekhavat, Mahyar Najibi, Mehrdad Farajtabar, Oncel Tuzel, Mohammad Rastegari

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[162] arXiv:2404.15644 [pdf, other]: Title: Building-PCC: Building Point Cloud Completion Benchmarks

Authors: Weixiao Gao, Ravi Peters, Jantien Stoter

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[163] arXiv:2404.15638 [pdf, other]: Title: PriorNet: A Novel Lightweight Network with Multidimensional Interactive Attention for Efficient Image Dehazing

Authors: Yutong Chen, Zhang Wen, Chao Wang, Lei Gong, Zhongchao Yi

Comments: 8 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[164] arXiv:2404.15635 [pdf, other]: Title: A Real-time Evaluation Framework for Pedestrian's Potential Risk at Non-Signalized Intersections Based on Predicted Post-Encroachment Time

Authors: Tengfeng Lin, Zhixiong Jin, Seongjin Choi, Hwasoo Yeo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[165] arXiv:2404.15608 [pdf, other]: Title: Understanding and Improving CNNs with Complex Structure Tensor: A Biometrics Study

Authors: Kevin Hernandez-Diaz, Josef Bigun, Fernando Alonso-Fernandez

Comments: preprint manuscript

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166] arXiv:2404.15592 [pdf, other]: Title: ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction

Authors: Henry Peng Zou, Vinay Samuel, Yue Zhou, Weizhi Zhang, Liancheng Fang, Zihe Song, Philip S. Yu, Cornelia Caragea

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[167] arXiv:2404.15591 [pdf, other]: Title: Domain Adaptation for Learned Image Compression with Supervised Adapters

Authors: Alberto Presta, Gabriele Spadaro, Enzo Tartaglione, Attilio Fiandrotti, Marco Grangetto

Comments: 10 pages, published to Data compression conference 2024 (DCC2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[168] arXiv:2404.15580 [pdf, other]: Title: MiM: Mask in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis

Authors: Jiaxin Zhuang, Linshan Wu, Qiong Wang, Varut Vardhanabhuti, Lin Luo, Hao Chen

Comments: submitted to journal

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[169] arXiv:2404.15564 [pdf, other]: Title: Guided AbsoluteGrad: Magnitude of Gradients Matters to Explanation's Localization and Saliency

Authors: Jun Huang, Yan Liu

Comments: CAI2024 Camera-ready Submission

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[170] arXiv:2404.15552 [pdf, other]: Title: Cross-Temporal Spectrogram Autoencoder (CTSAE): Unsupervised Dimensionality Reduction for Clustering Gravitational Wave Glitches

Authors: Yi Li, Yunan Wu, Aggelos K. Katsaggelos

Subjects: Computer Vision and Pattern Recognition (cs.CV); Instrumentation and Methods for Astrophysics (astro-ph.IM); Machine Learning (cs.LG); General Relativity and Quantum Cosmology (gr-qc)
[171] arXiv:2404.15523 [pdf, other]: Title: Understanding Hyperbolic Metric Learning through Hard Negative Sampling

Authors: Yun Yue, Fangzhou Lin, Guanyi Mou, Ziming Zhang

Comments: published in Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2024. arXiv admin note: text overlap with arXiv:2203.10833 by other authors

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[172] arXiv:2404.15516 [pdf, other]: Title: Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval

Authors: Young Kyun Jang, Donghyun Kim, Zihang Meng, Dat Huynh, Ser-Nam Lim

Comments: 15 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[173] arXiv:2404.15506 [pdf, other]: Title: Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation

Authors: Mu Hu, Wei Yin, Chi Zhang, Zhipeng Cai, Xiaoxiao Long, Hao Chen, Kaixuan Wang, Gang Yu, Chunhua Shen, Shaojie Shen

Comments: Our project page is at this https URL arXiv admin note: substantial text overlap with arXiv:2307.10984

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174] arXiv:2404.15451 [pdf, other]: Title: CFPFormer: Feature-pyramid like Transformer Decoder for Segmentation and Detection

Authors: Hongyi Cai, Mohammad Mahdinur Rahman, Jingyu Wu, Yulun Deng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175] arXiv:2404.15449 [pdf, other]: Title: ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning

Authors: Weifeng Chen, Jiacheng Zhang, Jie Wu, Hefeng Wu, Xuefeng Xiao, Liang Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[176] arXiv:2404.15447 [pdf, other]: Title: GLoD: Composing Global Contexts and Local Details in Image Generation

Authors: Moyuru Yamada

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[177] arXiv:2404.15445 [pdf, other]: Title: Deep multi-prototype capsule networks

Authors: Saeid Abbassi, Kamaledin Ghiasi-Shirazi, Ahad Harati

Subjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[178] arXiv:2404.15436 [pdf, other]: Title: Iterative Cluster Harvesting for Wafer Map Defect Patterns

Authors: Alina Pleli, Simon Baeuerle, Michel Janus, Jonas Barth, Ralf Mikut, Hendrik P. A. Lensch

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[179] arXiv:2404.15406 [pdf, other]: Title: Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs

Authors: Davide Caffagni, Federico Cocchi, Nicholas Moratelli, Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara

Comments: CVPR 2024 Workshop on What is Next in Multimodal Foundation Models

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[180] arXiv:2404.15385 [pdf, ps, other]: Title: Sum of Group Error Differences: A Critical Examination of Bias Evaluation in Biometric Verification and a Dual-Metric Measure

Authors: Alaa Elobaid, Nathan Ramoly, Lara Younes, Symeon Papadopoulos, Eirini Ntoutsi, Ioannis Kompatsiaris

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[181] arXiv:2404.15383 [pdf, other]: Title: WANDR: Intention-guided Human Motion Generation

Authors: Markos Diomataris, Nikos Athanasiou, Omid Taheri, Xi Wang, Otmar Hilliges, Michael J. Black

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[182] arXiv:2404.15378 [pdf, other]: Title: Hierarchical Hybrid Sliced Wasserstein: A Scalable Metric for Heterogeneous Joint Distributions

Authors: Khai Nguyen, Nhat Ho

Comments: 24 pages, 11 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[183] arXiv:2404.15919 (cross-list from cs.LG) [pdf, other]: Title: An Element-Wise Weights Aggregation Method for Federated Learning

Authors: Yi Hu, Hanchi Ren, Chen Hu, Jingjing Deng, Xianghua Xie

Comments: 2023 IEEE International Conference on Data Mining Workshops (ICDMW)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[184] arXiv:2404.15918 (cross-list from eess.IV) [pdf, other]: Title: Perception and Localization of Macular Degeneration Applying Convolutional Neural Network, ResNet and Grad-CAM

Authors: Tahmim Hossain, Sagor Chandro Bakchy

Comments: 12 pages, 5 figures, 2 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[185] arXiv:2404.15847 (cross-list from physics.med-ph) [pdf, other]: Title: 3D Freehand Ultrasound using Visual Inertial and Deep Inertial Odometry for Measuring Patellar Tracking

Authors: Russell Buchanan, S. Jack Tu, Marco Camurri, Stephen J. Mellon, Maurice Fallon

Comments: Accepted to IEEE Medical Measurements & Applications (MeMeA) 2024

Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
[186] arXiv:2404.15786 (cross-list from eess.IV) [pdf, other]: Title: Rethinking Model Prototyping through the MedMNIST+ Dataset Collection

Authors: Sebastian Doerrich, Francesco Di Salvo, Julius Brockmann, Christian Ledig

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[187] arXiv:2404.15718 (cross-list from eess.IV) [pdf, other]: Title: Mitigating False Predictions In Unreasonable Body Regions

Authors: Constantin Ulrich, Catherine Knobloch, Julius C. Holzschuh, Tassilo Wald, Maximilian R. Rokuss, Maximilian Zenk, Maximilian Fischer, Michael Baumgartner, Fabian Isensee, Klaus H. Maier-Hein

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[188] arXiv:2404.15661 (cross-list from cs.GR) [pdf, other]: Title: CWF: Consolidating Weak Features in High-quality Mesh Simplification

Authors: Rui Xu, Longdu Liu, Ningna Wang, Shuangmin Chen, Shiqing Xin, Xiaohu Guo, Zichun Zhong, Taku Komura, Wenping Wang, Changhe Tu

Comments: 14 pages, 22 figures

Subjects: Graphics (cs.GR); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[189] arXiv:2404.15532 (cross-list from cs.HC) [pdf, other]: Title: BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis

Authors: Shuhang Lin, Wenyue Hua, Lingyao Li, Che-Jui Chang, Lizhou Fan, Jianchao Ji, Hang Hua, Mingyu Jin, Jiebo Luo, Yongfeng Zhang

Comments: 26 pages, 14 figures The data and code for this project are accessible at this https URL

Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[190] arXiv:2404.15394 (cross-list from eess.IV) [pdf, ps, other]: Title: On Generating Cancelable Biometric Template using Reverse of Boolean XOR

Authors: Manisha, Nitin Kumar

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[191] arXiv:2404.15367 (cross-list from eess.SP) [pdf, other]: Title: Leveraging Visibility Graphs for Enhanced Arrhythmia Classification with Graph Convolutional Networks

Authors: Rafael F. Oliveira, Gladston J. P. Moreira, Vander L. S. Freitas, Eduardo J. S. Luz

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[192] arXiv:2404.15364 (cross-list from eess.SP) [pdf, other]: Title: MP-DPD: Low-Complexity Mixed-Precision Neural Networks for Energy-Efficient Digital Predistortion of Wideband Power Amplifiers

Authors: Yizhuo Wu, Ang Li, Mohammadreza Beikmirza, Gagan Deep Singh, Qinyu Chen, Leo C. N. de Vreede, Morteza Alavi, Chang Gao

Comments: Accepted to IEEE Microwave and Wireless Technology Letters (MWTL)

Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[193] arXiv:2404.15346 (cross-list from eess.SP) [pdf, other]: Title: A Novel Micro-Doppler Coherence Loss for Deep Learning Radar Applications

Authors: Mikolaj Czerkawski, Christos Ilioudis, Carmine Clemente, Craig Michie, Ivan Andonovic, Christos Tachtatzis

Comments: Presented at 2021 18th European Radar Conference (EuRAD)

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[194] arXiv:2404.15318 (cross-list from q-bio.QM) [pdf, ps, other]: Title: VASARI-auto: equitable, efficient, and economical featurisation of glioma MRI

Authors: James K Ruffle, Samia Mohinta, Kelly Pegoretti Baruteau, Rebekah Rajiah, Faith Lee, Sebastian Brandner, Parashkev Nachev, Harpreet Hyare

Comments: 28 pages, 6 figures, 1 table

Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Tissues and Organs (q-bio.TO)
[195] arXiv:2404.15312 (cross-list from eess.SP) [pdf, other]: Title: Realtime Person Identification via Gait Analysis

Authors: Shanmuga Venkatachalam, Harideep Nair, Prabhu Vellaisamy, Yongqi Zhou, Ziad Youssfi, John Paul Shen

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[196] arXiv:2404.15287 (cross-list from eess.IV) [pdf, other]: Title: A Semi-automatic Cranial Implant Design Tool Based on Rigid ICP Template Alignment and Voxel Space Reconstruction

Authors: Michael Lackner, Behrus Puladi, Jens Kleesiek, Jan Egger, Jianning Li

Comments: 6 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[197] arXiv:2404.14956 (cross-list from eess.IV) [pdf, other]: Title: DAWN: Domain-Adaptive Weakly Supervised Nuclei Segmentation via Cross-Task Interactions

Authors: Ye Zhang, Yifeng Wang, Zijie Fang, Hao Bian, Linghan Cai, Ziyue Wang, Yongbing Zhang

Comments: 13 pages, 11 figures, 8 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)

Wed, 24 Apr 2024 (showing first 43 of 110 entries)

[198] arXiv:2404.15276 [pdf, other]: Title: SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation

Authors: Xiangyu Xu, Lijuan Liu, Shuicheng Yan

Comments: Published at TPAMI 2024

Journal-ref: https://www.computer.org/csdl/journal/tp/2024/05/10354384/1SP2qWh8Fq0

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[199] arXiv:2404.15275 [pdf, other]: Title: ID-Animator: Zero-Shot Identity-Preserving Human Video Generation

Authors: Xuanhua He, Quande Liu, Shengju Qian, Xin Wang, Tao Hu, Ke Cao, Keyu Yan, Man Zhou, Jie Zhang

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[200] arXiv:2404.15272 [pdf, other]: Title: CT-GLIP: 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body Scenarios

Authors: Jingyang Lin, Yingda Xia, Jianpeng Zhang, Ke Yan, Le Lu, Jiebo Luo, Ling Zhang

Comments: 12 pages, 5 figures, 3 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[201] arXiv:2404.15271 [pdf, other]: Title: Automatic Layout Planning for Visually-Rich Documents with Instruction-Following Models

Authors: Wanrong Zhu, Jennifer Healey, Ruiyi Zhang, William Yang Wang, Tong Sun

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[202] arXiv:2404.15267 [pdf, other]: Title: From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation

Authors: Zehuan Huang, Hongxing Fan, Lipeng Wang, Lu Sheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[203] arXiv:2404.15264 [pdf, other]: Title: TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

Authors: Jiahe Li, Jiawei Zhang, Xiao Bai, Jin Zheng, Xin Ning, Jun Zhou, Lin Gu

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[204] arXiv:2404.15263 [pdf, other]: Title: Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization

Authors: Lahav Lipson, Jia Deng

Comments: Accepted to CVPR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[205] arXiv:2404.15259 [pdf, other]: Title: FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent

Authors: Cameron Smith, David Charatan, Ayush Tewari, Vincent Sitzmann

Comments: Project website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[206] arXiv:2404.15254 [pdf, other]: Title: UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition

Authors: Bin Wang, Zhuangcheng Gu, Chao Xu, Bo Zhang, Botian Shi, Conghui He

Comments: 17 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[207] arXiv:2404.15252 [pdf, other]: Title: Source-free Domain Adaptation for Video Object Detection Under Adverse Image Conditions

Authors: Xingguang Zhang, Chih-Hsien Chou

Comments: accepted by the UG2+ workshop at CVPR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[208] arXiv:2404.15244 [pdf, other]: Title: Efficient Transformer Encoders for Mask2Former-style models

Authors: Manyi Yao, Abhishek Aich, Yumin Suh, Amit Roy-Chowdhury, Christian Shelton, Manmohan Chandraker

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[209] arXiv:2404.15234 [pdf, other]: Title: Massively Annotated Datasets for Assessment of Synthetic and Real Data in Face Recognition

Authors: Pedro C. Neto, Rafael M. Mamede, Carolina Albuquerque, Tiago Gonçalves, Ana F. Sequeira

Comments: Accepted at FG 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[210] arXiv:2404.15228 [pdf, other]: Title: Re-Thinking Inverse Graphics With Large Language Models

Authors: Peter Kulits, Haiwen Feng, Weiyang Liu, Victoria Abrevaya, Michael J. Black

Comments: 31 pages; project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[211] arXiv:2404.15224 [pdf, other]: Title: Deep Models for Multi-View 3D Object Recognition: A Review

Authors: Mona Alzahrani, Muhammad Usman, Salma Kammoun, Saeed Anwar, Tarek Helmy

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[212] arXiv:2404.15217 [pdf, other]: Title: Towards Large-Scale Training of Pathology Foundation Models

Authors: kaiko.ai, Nanne Aben, Edwin D. de Jong, Ioannis Gatopoulos, Nicolas Känzig, Mikhail Karasikov, Axel Lagré, Roman Moser, Joost van Doorn, Fei Tang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[213] arXiv:2404.15212 [pdf, other]: Title: Real-time Lane-wise Traffic Monitoring in Optimal ROIs

Authors: Mei Qiu, Wei Lin, Lauren Ann Christopher, Stanley Chien, Yaobin Chen, Shu Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[214] arXiv:2404.15174 [pdf, other]: Title: Fourier-enhanced Implicit Neural Fusion Network for Multispectral and Hyperspectral Image Fusion

Authors: Yu-Jie Liang, Zihan Cao, Liang-Jian Deng, Xiao Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[215] arXiv:2404.15163 [pdf, other]: Title: Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment

Authors: Tianwei Zhou, Songbai Tan, Wei Zhou, Yu Luo, Yuan-Gen Wang, Guanghui Yue

Comments: IEEE Transactions on Broadcasting (TBC)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[216] arXiv:2404.15161 [pdf, other]: Title: Combating Missing Modalities in Egocentric Videos at Test Time

Authors: Merey Ramazanova, Alejandro Pardo, Bernard Ghanem, Motasem Alfarra

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[217] arXiv:2404.15141 [pdf, other]: Title: CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method

Authors: Mingbao Lin, Zhihang Lin, Wengyi Zhan, Liujuan Cao, Rongrong Ji

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[218] arXiv:2404.15129 [pdf, ps, other]: Title: Gallbladder Cancer Detection in Ultrasound Images based on YOLO and Faster R-CNN

Authors: Sara Dadjouy, Hedieh Sajedi

Comments: Published in 2024 10th International Conference on Artificial Intelligence and Robotics (QICAR)

Journal-ref: 2024 10th International Conference on Artificial Intelligence and Robotics (QICAR) (pp. 227-231). IEEE

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[219] arXiv:2404.15127 [pdf, other]: Title: MedDr: Diagnosis-Guided Bootstrapping for Large-Scale Medical Vision-Language Learning

Authors: Sunan He, Yuxiang Nie, Zhixuan Chen, Zhiyuan Cai, Hongmei Wang, Shu Yang, Hao Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[220] arXiv:2404.15100 [pdf, other]: Title: Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation

Authors: Xun Wu, Shaohan Huang, Furu Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[221] arXiv:2404.15081 [pdf, other]: Title: Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models

Authors: Jingyao Xu, Yuetong Lu, Yandong Li, Siyang Lu, Dongdong Wang, Xiang Wei

Comments: Published at CVPR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[222] arXiv:2404.15041 [pdf, other]: Title: LEAF: Unveiling Two Sides of the Same Coin in Semi-supervised Facial Expression Recognition

Authors: Fan Zhang, Zhi-Qi Cheng, Jian Zhao, Xiaojiang Peng, Xuelong Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[223] arXiv:2404.15037 [pdf, other]: Title: DP-Net: Learning Discriminative Parts for image recognition

Authors: Ronan Sicre, Hanwei Zhang, Julien Dejasmin, Chiheb Daaloul, Stéphane Ayache, Thierry Artières

Comments: IEEE ICIP 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[224] arXiv:2404.15033 [pdf, other]: Title: IPAD: Industrial Process Anomaly Detection Dataset

Authors: Jinfan Liu, Yichao Yan, Junjie Li, Weiming Zhao, Pengzhi Chu, Xingdong Sheng, Yunhui Liu, Xiaokang Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[225] arXiv:2404.15028 [pdf, other]: Title: PRISM: A Promptable and Robust Interactive Segmentation Model with Visual Prompts

Authors: Hao Li, Han Liu, Dewei Hu, Jiacheng Wang, Ipek Oguz

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[226] arXiv:2404.15024 [pdf, other]: Title: A Learning Paradigm for Interpretable Gradients

Authors: Felipe Torres Figueroa, Hanwei Zhang, Ronan Sicre, Yannis Avrithis, Stephane Ayache

Comments: VISAPP 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[227] arXiv:2404.15022 [pdf, other]: Title: A review of deep learning-based information fusion techniques for multimodal medical image classification

Authors: Yihao Li, Mostafa El Habib Daho, Pierre-Henri Conze, Rachid Zeghlache, Hugo Le Boité, Ramin Tadayoni, Béatrice Cochener, Mathieu Lamard, Gwenolé Quellec

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[228] arXiv:2404.15014 [pdf, other]: Title: OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving

Authors: Guoqing Wang, Zhongdao Wang, Pin Tang, Jilai Zheng, Xiangxuan Ren, Bailan Feng, Chao Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[229] arXiv:2404.15010 [pdf, other]: Title: X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition

Authors: Shuofeng Sun, Yongming Rao, Jiwen Lu, Haibin Yan

Journal-ref: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[230] arXiv:2404.15009 [pdf, other]: Title: The Brain Tumor Segmentation in Pediatrics (BraTS-PEDs) Challenge: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)

Authors: Anahita Fathi Kazerooni, Nastaran Khalili, Deep Gandhi, Xinyang Liu, Zhifan Jiang, Syed Muhammed Anwar, Jake Albrecht, Maruf Adewole, Udunna Anazodo, Hannah Anderson, Sina Bagheri, Ujjwal Baid, Timothy Bergquist, Austin J. Borja, Evan Calabrese, Verena Chung, Gian-Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Ariana Familiar, Keyvan Farahani, Anurag Gottipati, Debanjan Haldar, Shuvanjan Haldar, Juan Eugenio Iglesias, Anastasia Janas, Elaine Johansen, Blaise V Jones, Neda Khalili, Florian Kofler, Dominic LaBella, Hollie Anne Lai, Koen Van Leemput, Hongwei Bran Li, Nazanin Maleki, Aaron S McAllister, Zeke Meier, Bjoern Menze, Ahmed W Moawad, Khanak K Nandolia, Julija Pavaine, Marie Piraud, Tina Poussaint, Sanjay P Prabhu, Zachary Reitman, Andres Rodriguez, Jeffrey D Rudie, Mariana Sanchez-Montano, et al. (27 additional authors not shown)

Comments: arXiv admin note: substantial text overlap with arXiv:2305.17033

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[231] arXiv:2404.15008 [pdf, other]: Title: External Prompt Features Enhanced Parameter-efficient Fine-tuning for Salient Object Detection

Authors: Wen Liang, Peipei Ran, Mengchao Bai, Xiao Liu, P. Bilha Githinji, Wei Zhao, Peiwu Qin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[232] arXiv:2404.14996 [pdf, other]: Title: CA-Stream: Attention-based pooling for interpretable image recognition

Authors: Felipe Torres, Hanwei Zhang, Ronan Sicre, Stéphane Ayache, Yannis Avrithis

Comments: CVPR XAI4CV workshop 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[233] arXiv:2404.14990 [pdf, other]: Title: Interpreting COVID Lateral Flow Tests' Results with Foundation Models

Authors: Stuti Pandey, Josh Myers-Dean, Jarek Reynolds, Danna Gurari

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[234] arXiv:2404.14985 [pdf, other]: Title: Other Tokens Matter: Exploring Global and Local Features of Vision Transformers for Object Re-Identification

Authors: Yingquan Wang, Pingping Zhang, Dong Wang, Huchuan Lu

Comments: Accepted by CVIU2024. More modifications may be performed

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[235] arXiv:2404.14979 [pdf, other]: Title: SGFormer: Spherical Geometry Transformer for 360 Depth Estimation

Authors: Junsong Zhang, Zisong Chen, Chunyu Lin, Lang Nie, Zhijie Shen, Junda Huang, Yao Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[236] arXiv:2404.14975 [pdf, other]: Title: CAGE: Circumplex Affect Guided Expression Inference

Authors: Niklas Wagner, Felix Mätzler, Samed R. Vossberg, Helen Schneider, Svetlana Pavlitska, J. Marius Zöllner

Comments: Accepted for publication at ABAW Workshop at CVPR2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[237] arXiv:2404.14967 [pdf, other]: Title: CoARF: Controllable 3D Artistic Style Transfer for Radiance Fields

Authors: Deheng Zhang, Clara Fernandez-Labrador, Christopher Schroers

Comments: International Conference on 3D Vision 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[238] arXiv:2404.14966 [pdf, other]: Title: Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model

Authors: Xu Han, Yuan Tang, Zhaoxuan Wang, Xianzhi Li

Comments: 10 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[239] arXiv:2404.14956 [pdf, other]: Title: DAWN: Domain-Adaptive Weakly Supervised Nuclei Segmentation via Cross-Task Interactions

Authors: Ye Zhang, Yifeng Wang, Zijie Fang, Hao Bian, Linghan Cai, Ziyue Wang, Yongbing Zhang

Comments: 13 pages, 11 figures, 8 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[240] arXiv:2404.14955 [pdf, other]: Title: Traditional to Transformers: A Survey on Current Trends and Future Prospects for Hyperspectral Image Classification

Authors: Muhammad Ahmad, Salvatore Distifano, Manuel Mazzara, Adil Mehmood Khan

Subjects: Computer Vision and Pattern Recognition (cs.CV)

[ total of 593 entries: 1-104 | 33-136 | 137-240 | 241-344 | 345-448 | 449-552 | 553-593 ]
[ showing 104 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2404, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 136

Thu, 25 Apr 2024 (continued, showing last 61 of 85 entries)

Wed, 24 Apr 2024 (showing first 43 of 110 entries)