Computer Vision and Pattern Recognition
Authors and titles for recent submissions
Thu, 28 Mar 2024
- [1] arXiv:2403.18820 [pdf, other]
-
Title: MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [2] arXiv:2403.18819 [pdf, other]
-
Title: Benchmarking Object Detectors with COCO: A New Path Forward
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [3] arXiv:2403.18818 [pdf, other]
-
Title: ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [4] arXiv:2403.18816 [pdf, other]
-
Title: Garment3DGen: 3D Garment Stylization and Texture Generation
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [5] arXiv:2403.18814 [pdf, other]
-
Title: Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Authors: Yanwei Li, Yuechen Zhang, Chengyao Wang, Zhisheng Zhong, Yixin Chen, Ruihang Chu, Shaoteng Liu, Jiaya Jia
Comments: Code and models are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [6] arXiv:2403.18811 [pdf, other]
-
Title: Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment
Authors: Li Siyao, Tianpei Gu, Zhitao Yang, Zhengyu Lin, Ziwei Liu, Henghui Ding, Lei Yang, Chen Change Loy
Comments: ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [7] arXiv:2403.18807 [pdf, other]
-
Title: ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
Comments: Accepted at IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [8] arXiv:2403.18795 [pdf, other]
-
Title: Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [9] arXiv:2403.18791 [pdf, other]
-
Title: Object Pose Estimation via the Aggregation of Diffusion Features
Comments: Accepted to CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [10] arXiv:2403.18784 [pdf, other]
-
Title: SplatFace: Gaussian Splat Face Reconstruction Leveraging an Optimizable Surface
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [11] arXiv:2403.18775 [pdf, other]
-
Title: ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object
Comments: Accepted at CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [12] arXiv:2403.18774 [pdf, other]
-
Title: RAW: A Robust and Agile Plug-and-Play Watermark Framework for AI-Generated Images with Provable Guarantees
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [13] arXiv:2403.18762 [pdf, other]
-
Title: ModaLink: Unifying Modalities for Efficient Image-to-PointCloud Place Recognition
Authors: Weidong Xie, Lun Luo, Nanfei Ye, Yi Ren, Shaoyi Du, Minhang Wang, Jintao Xu, Rui Ai, Weihao Gu, Xieyuanli Chen
Comments: 8 pages, 11 figures, conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [14] arXiv:2403.18756 [pdf, ps, other]
-
Title: Detection of subclinical atherosclerosis by image-based deep learning on chest x-ray
Authors: Guglielmo Gallone, Francesco Iodice, Alberto Presta, Davide Tore, Ovidio de Filippo, Michele Visciano, Carlo Alberto Barbano, Alessandro Serafini, Paola Gorrini, Alessandro Bruno, Walter Grosso Marra, James Hughes, Mario Iannaccone, Paolo Fonio, Attilio Fiandrotti, Alessandro Depaoli, Marco Grangetto, Gaetano Maria de Ferrari, Fabrizio D'Ascenzo
Comments: Submitted to European Heart Journal - Cardiovascular Imaging Added also the additional material 44 pages (30 main paper, 14 additional material), 14 figures (5 main manuscript, 9 additional material)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [15] arXiv:2403.18730 [pdf, other]
-
Title: Towards Image Ambient Lighting Normalization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [16] arXiv:2403.18715 [pdf, other]
-
Title: Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
- [17] arXiv:2403.18714 [pdf, other]
- [18] arXiv:2403.18711 [pdf, other]
-
Title: SAT-NGP : Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery
Comments: 5 pages, 3 figures, 1 table; Accepted to International Geoscience and Remote Sensing Symposium (IGARSS) 2024; Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [19] arXiv:2403.18708 [pdf, other]
-
Title: Dense Vision Transformer Compression with Few Samples
Comments: Accepted to CVPR 2024. Note: Jianxin Wu is a contributing author for the arXiv version of this paper but is not listed as an author in the CVPR version due to his role as Program Chair
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [20] arXiv:2403.18690 [pdf, other]
-
Title: Annolid: Annotate, Segment, and Track Anything You Need
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [21] arXiv:2403.18674 [pdf, other]
-
Title: Deep Learning for Robust and Explainable Models in Computer Vision
Authors: Mohammadreza Amirian
Comments: 150 pages, 37 figures, 12 tables
Journal-ref: OPARU is the OPen Access Repository of Ulm University and Ulm University of Applied Sciences, 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [22] arXiv:2403.18649 [pdf, other]
-
Title: Addressing Data Annotation Challenges in Multiple Sensors: A Solution for Scania Collected Datasets
Authors: Ajinkya Khoche, Aron Asefaw, Alejandro Gonzalez, Bogdan Timus, Sina Sharif Mansouri, Patric Jensfelt
Comments: Accepted to European Control Conference 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
- [23] arXiv:2403.18605 [pdf, other]
-
Title: FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing
Comments: Our project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [24] arXiv:2403.18600 [pdf, other]
-
Title: RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in Instructional Videos
Comments: 23 pages, 6 figures, 12 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [25] arXiv:2403.18593 [pdf, other]
-
Title: Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding
Comments: 20 pages, 8 figures, 6 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [26] arXiv:2403.18575 [pdf, other]
-
Title: HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [27] arXiv:2403.18565 [pdf, other]
-
Title: Artifact Reduction in 3D and 4D Cone-beam Computed Tomography Images with Deep Learning -- A Review
Comments: 16 pages, 4 figures, 1 Table, published in IEEE Access Journal
Journal-ref: IEEE Access, vol. 12, pp. 10281-10295, 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [28] arXiv:2403.18554 [pdf, other]
-
Title: CosalPure: Learning Concept from Group Images for Robust Co-Saliency Detection
Comments: 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [29] arXiv:2403.18551 [pdf, other]
-
Title: Attention Calibration for Disentangled Text-to-Image Personalization
Comments: Accepted to CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [30] arXiv:2403.18550 [pdf, other]
-
Title: OrCo: Towards Better Generalization via Orthogonality and Contrast for Few-Shot Class-Incremental Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [31] arXiv:2403.18548 [pdf, other]
-
Title: A Semi-supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness Constraint
Comments: This paper is accepted by CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [32] arXiv:2403.18525 [pdf, other]
-
Title: Language Plays a Pivotal Role in the Object-Attribute Compositional Generalization of CLIP
Comments: Oral accepted at OODCV 2023 (this http URL)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [33] arXiv:2403.18512 [pdf, other]
-
Title: ParCo: Part-Coordinating Text-to-Motion Synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [34] arXiv:2403.18495 [pdf, other]
-
Title: Direct mineral content prediction from drill core images via transfer learning
Authors: Romana Boiger, Sergey V. Churakov, Ignacio Ballester Llagaria, Georg Kosakowski, Raphael Wüst, Nikolaos I. Prasianakis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [35] arXiv:2403.18493 [pdf, other]
-
Title: VersaT2I: Improving Text-to-Image Models with Versatile Reward
Authors: Jianshu Guo, Wenhao Chai, Jie Deng, Hsiang-Wei Huang, Tian Ye, Yichen Xu, Jiawei Zhang, Jenq-Neng Hwang, Gaoang Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [36] arXiv:2403.18490 [pdf, other]
-
Title: I2CKD : Intra- and Inter-Class Knowledge Distillation for Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [37] arXiv:2403.18476 [pdf, other]
-
Title: Modeling uncertainty for Gaussian Splatting
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [38] arXiv:2403.18471 [pdf, other]
-
Title: DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [39] arXiv:2403.18469 [pdf, other]
-
Title: Density-guided Translator Boosts Synthetic-to-Real Unsupervised Domain Adaptive Segmentation of 3D Point Clouds
Comments: CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [40] arXiv:2403.18461 [pdf, other]
-
Title: DiffStyler: Diffusion-based Localized Image Style Transfer
Authors: Shaoxu Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [41] arXiv:2403.18454 [pdf, other]
-
Title: Scaling Vision-and-Language Navigation With Offline RL
Comments: Published in Transactions on Machine Learning Research (04/2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [42] arXiv:2403.18452 [pdf, other]
-
Title: SingularTrajectory: Universal Trajectory Predictor Using Diffusion Model
Comments: Accepted at CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [43] arXiv:2403.18443 [pdf, other]
-
Title: $\mathrm{F^2Depth}$: Self-supervised Indoor Monocular Depth Estimation via Optical Flow Consistency and Feature Map Synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [44] arXiv:2403.18442 [pdf, other]
-
Title: Backpropagation-free Network for 3D Test-time Adaptation
Authors: Yanshuo Wang, Ali Cheraghian, Zeeshan Hayder, Jie Hong, Sameera Ramasinghe, Shafin Rahman, David Ahmedt-Aristizabal, Xuesong Li, Lars Petersson, Mehrtash Harandi
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [45] arXiv:2403.18425 [pdf, other]
-
Title: U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [46] arXiv:2403.18417 [pdf, other]
-
Title: ECNet: Effective Controllable Text-to-Image Diffusion Models
Authors: Sicheng Li, Keqiang Sun, Zhixin Lai, Xiaoshi Wu, Feng Qiu, Haoran Xie, Kazunori Miyata, Hongsheng Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [47] arXiv:2403.18407 [pdf, other]
-
Title: A Channel-ensemble Approach: Unbiased and Low-variance Pseudo-labels is Critical for Semi-supervised Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [48] arXiv:2403.18406 [pdf, other]
-
Title: An Image Grid Can Be Worth a Video: Zero-shot Video Question Answering Using a VLM
Comments: Our code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [49] arXiv:2403.18397 [pdf, ps, other]
-
Title: Colour and Brush Stroke Pattern Recognition in Abstract Art using Modified Deep Convolutional Generative Adversarial Networks
Comments: 28 pages, 5 tables, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [50] arXiv:2403.18383 [pdf, other]
-
Title: Generative Multi-modal Models are Good Class-Incremental Learners
Comments: Accepted at CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [51] arXiv:2403.18373 [pdf, other]
-
Title: BAM: Box Abstraction Monitors for Real-time OoD Detection in Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [52] arXiv:2403.18370 [pdf, other]
-
Title: Ship in Sight: Diffusion Models for Ship-Image Super Resolution
Comments: Accepted at 2024 International Joint Conference on Neural Networks (IJCNN)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [53] arXiv:2403.18361 [pdf, other]
-
Title: ViTAR: Vision Transformer with Any Resolution
Authors: Qihang Fan, Quanzeng You, Xiaotian Han, Yongfei Liu, Yunzhe Tao, Huaibo Huang, Ran He, Hongxia Yang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [54] arXiv:2403.18360 [pdf, other]
-
Title: Learning CNN on ViT: A Hybrid Model to Explicitly Class-specific Boundaries for Domain Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [55] arXiv:2403.18356 [pdf, other]
-
Title: MonoHair: High-Fidelity Hair Modeling from a Monocular Video
Authors: Keyu Wu, Lingchen Yang, Zhiyi Kuang, Yao Feng, Xutao Han, Yuefan Shen, Hongbo Fu, Kun Zhou, Youyi Zheng
Comments: Accepted by IEEE CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [56] arXiv:2403.18351 [pdf, other]
-
Title: Generating Diverse Agricultural Data for Vision-Based Farming Applications
Authors: Mikolaj Cieslak, Umabharathi Govindarajan, Alejandro Garcia, Anuradha Chandrashekar, Torsten Hädrich, Aleksander Mendoza-Drosik, Dominik L. Michels, Sören Pirk, Chia-Chun Fu, Wojciech Pałubicki
Comments: 10 pages, 8 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
- [57] arXiv:2403.18342 [pdf, other]
-
Title: Learning Inclusion Matching for Animation Paint Bucket Colorization
Comments: accepted to CVPR 2024. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [58] arXiv:2403.18334 [pdf, other]
-
Title: DODA: Diffusion for Object-detection Domain Adaptation in Agriculture
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [59] arXiv:2403.18330 [pdf, other]
-
Title: Tracking-Assisted Object Detection with Event Cameras
Authors: Ting-Kang Yen, Igor Morawski, Shusil Dangi, Kai He, Chung-Yi Lin, Jia-Fong Yeh, Hung-Ting Su, Winston Hsu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [60] arXiv:2403.18328 [pdf, other]
-
Title: PIPNet3D: Interpretable Detection of Alzheimer in MRI Scans
Authors: Lisa Anita De Santi, Jörg Schlötterer, Michael Scheschenja, Joel Wessendorf, Meike Nauta, Vincenzo Positano, Christin Seifert
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [61] arXiv:2403.18318 [pdf, other]
-
Title: Uncertainty-Aware SAR ATR: Defending Against Adversarial Attacks via Bayesian Neural Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [62] arXiv:2403.18294 [pdf, other]
-
Title: Multi-scale Unified Network for Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [63] arXiv:2403.18293 [pdf, other]
-
Title: Efficient Test-Time Adaptation of Vision-Language Models
Comments: Accepted to CVPR 2024. The code has been released in this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [64] arXiv:2403.18291 [pdf, other]
-
Title: Towards Non-Exemplar Semi-Supervised Class-Incremental Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [65] arXiv:2403.18282 [pdf, other]
-
Title: SGDM: Static-Guided Dynamic Module Make Stronger Visual Models
Comments: 16 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [66] arXiv:2403.18281 [pdf, other]
-
Title: AIR-HLoc: Adaptive Image Retrieval for Efficient Visual Localisation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [67] arXiv:2403.18274 [pdf, other]
-
Title: DVLO: Deep Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional Structure Alignment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [68] arXiv:2403.18271 [pdf, other]
-
Title: Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [69] arXiv:2403.18270 [pdf, other]
-
Title: Image Deraining via Self-supervised Reinforcement Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [70] arXiv:2403.18260 [pdf, other]
-
Title: Toward Interactive Regional Understanding in Vision-Large Language Models
Comments: NAACL 2024 Main Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [71] arXiv:2403.18258 [pdf, other]
-
Title: Enhancing Generative Class Incremental Learning Performance with Model Forgetting Approach
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [72] arXiv:2403.18252 [pdf, other]
-
Title: Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
- [73] arXiv:2403.18241 [pdf, other]
-
Title: NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation
Authors: Ruikai Cui, Weizhe Liu, Weixuan Sun, Senbo Wang, Taizhang Shang, Yang Li, Xibin Song, Han Yan, Zhennan Wu, Shenzhou Chen, Hongdong Li, Pan Ji
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
- [74] arXiv:2403.18238 [pdf, other]
-
Title: TAFormer: A Unified Target-Aware Transformer for Video and Motion Joint Prediction in Aerial Scenes
Authors: Liangyu Xu, Wanxuan Lu, Hongfeng Yu, Yongqiang Mao, Hanbo Bi, Chenglong Liu, Xian Sun, Kun Fu
Comments: 17 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [75] arXiv:2403.18228 [pdf, other]
-
Title: Fourier or Wavelet bases as counterpart self-attention in spikformer for efficient visual classification
Comments: 18 pages, 2 figures. arXiv admin note: substantial text overlap with arXiv:2308.02557
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [76] arXiv:2403.18211 [pdf, other]
-
Title: NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [77] arXiv:2403.18208 [pdf, other]
-
Title: An Evolutionary Network Architecture Search Framework with Adaptive Multimodal Fusion for Hand Gesture Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
- [78] arXiv:2403.18207 [pdf, other]
-
Title: Road Obstacle Detection based on Unknown Objectness Scores
Comments: ICRA 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [79] arXiv:2403.18201 [pdf, other]
-
Title: Few-shot Online Anomaly Detection and Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [80] arXiv:2403.18193 [pdf, other]
-
Title: Middle Fusion and Multi-Stage, Multi-Form Prompts for Robust RGB-T Tracking
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [81] arXiv:2403.18187 [pdf, other]
-
Title: LayoutFlow: Flow Matching for Layout Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [82] arXiv:2403.18186 [pdf, other]
-
Title: Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [83] arXiv:2403.18180 [pdf, other]
-
Title: Multi-Layer Dense Attention Decoder for Polyp Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [84] arXiv:2403.18158 [pdf, other]
-
Title: The Effects of Short Video-Sharing Services on Video Copy Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [85] arXiv:2403.18118 [pdf, other]
-
Title: EgoLifter: Open-world 3D Segmentation for Egocentric Perception
Comments: Preprint. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [86] arXiv:2403.18117 [pdf, ps, other]
-
Title: TDIP: Tunable Deep Image Processing, a Real Time Melt Pool Monitoring Solution
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [87] arXiv:2403.18116 [pdf, other]
-
Title: QuakeSet: A Dataset and Low-Resource Models to Monitor Earthquakes through Sentinel-1
Comments: Accepted at ISCRAM 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [88] arXiv:2403.18114 [pdf, other]
-
Title: Segment Any Medical Model Extended
Authors: Yihao Liu, Jiaming Zhang, Andres Diaz-Pinto, Haowei Li, Alejandro Martin-Gomez, Amir Kheradmand, Mehran Armand
Comments: The content of the manuscript has been presented in SPIE Medical Imaging 2024, and had been accepted to appear in the proceedings of the conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [89] arXiv:2403.18104 [pdf, other]
-
Title: Mathematical Foundation and Corrections for Full Range Head Pose Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [90] arXiv:2403.18094 [pdf, other]
-
Title: A Personalized Video-Based Hand Taxonomy: Application for Individuals with Spinal Cord Injury
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [91] arXiv:2403.18092 [pdf, other]
-
Title: OCAI: Improving Optical Flow Estimation by Occlusion and Consistency Aware Interpolation
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [92] arXiv:2403.18080 [pdf, other]
-
Title: EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation
Authors: Chenhongyi Yang, Anastasia Tkach, Shreyas Hampali, Linguang Zhang, Elliot J. Crowley, Cem Keskin
Comments: Tech Report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [93] arXiv:2403.18074 [pdf, other]
-
Title: Every Shot Counts: Using Exemplars for Repetition Counting in Videos
Comments: Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [94] arXiv:2403.18067 [pdf, other]
-
Title: State of the art applications of deep learning within tracking and detecting marine debris: A survey
Comments: Review paper, 60 pages including references, 1 figure, 3 tables, 1 supplementary data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [95] arXiv:2403.18063 [pdf, other]
-
Title: Spectral Convolutional Transformer: Harmonizing Real vs. Complex Multi-View Spectral Operators for Vision Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
- [96] arXiv:2403.18040 [pdf, other]
-
Title: Global Point Cloud Registration Network for Large Transformations
Authors: Hanz Cuevas-Velasquez, Alejandro Galán-Cuenca, Antonio Javier Gallego, Marcelo Saval-Calvo, Robert B. Fisher
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [97] arXiv:2403.18038 [pdf, ps, other]
-
Title: TGGLinesPlus: A robust topological graph-guided computer vision algorithm for line detection from images
Comments: Our TGGLinesPlus Python implementation is open source. 27 pages, 8 figures and 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [98] arXiv:2403.18036 [pdf, other]
-
Title: Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance
Authors: Zan Wang, Yixin Chen, Baoxiong Jia, Puhao Li, Jinlu Zhang, Jingze Zhang, Tengyu Liu, Yixin Zhu, Wei Liang, Siyuan Huang
Comments: CVPR 2024; 16 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [99] arXiv:2403.18033 [pdf, other]
-
Title: SpectralWaste Dataset: Multimodal Data for Waste Sorting Automation
Authors: Sara Casao, Fernando Peña, Alberto Sabater, Rosa Castillón, Darío Suárez, Eduardo Montijano, Ana C. Murillo
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [100] arXiv:2403.17998 [pdf, other]
-
Title: Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval
Authors: Jiamian Wang, Guohao Sun, Pichao Wang, Dongfang Liu, Sohail Dianat, Majid Rabbani, Raghuveer Rao, Zhiqiang Tao
Comments: Accepted by CVPR 2024, code and model are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [101] arXiv:2403.17995 [pdf, other]
-
Title: Semi-Supervised Image Captioning Considering Wasserstein Graph Matching
Authors: Yang Yang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [102] arXiv:2403.17994 [pdf, other]
-
Title: Solution for Point Tracking Task of ICCV 1st Perception Test Challenge 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [103] arXiv:2403.18821 (cross-list from cs.SD) [pdf, other]
-
Title: Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark
Authors: Ziyang Chen, Israel D. Gebru, Christian Richardt, Anurag Kumar, William Laney, Andrew Owens, Alexander Richard
Comments: Accepted to CVPR 2024. Project site: this https URL
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
- [104] arXiv:2403.18734 (cross-list from eess.IV) [pdf, other]
-
Title: A vascular synthetic model for improved aneurysm segmentation and detection via Deep Neural Networks
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [105] arXiv:2403.18731 (cross-list from cs.AI) [pdf, other]
-
Title: Enhancing Manufacturing Quality Prediction Models through the Integration of Explainability Methods
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
- [106] arXiv:2403.18717 (cross-list from cs.LG) [pdf, other]
-
Title: Semi-Supervised Learning for Deep Causal Generative Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [107] arXiv:2403.18660 (cross-list from cs.GR) [pdf, other]
-
Title: InstructBrush: Learning Attention-based Instruction Optimization for Image Editing
Authors: Ruoyu Zhao, Qingnan Fan, Fei Kou, Shuai Qin, Hong Gu, Wei Wu, Pengcheng Xu, Mingrui Zhu, Nannan Wang, Xinbo Gao
Comments: Project Page: this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [108] arXiv:2403.18637 (cross-list from eess.IV) [pdf, other]
-
Title: Transformers-based architectures for stroke segmentation: A review
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [109] arXiv:2403.18589 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Users prefer Jpegli over same-sized libjpeg-turbo or MozJPEG
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [110] arXiv:2403.18587 (cross-list from cs.CR) [pdf, other]
-
Title: The Impact of Uniform Inputs on Activation Sparsity and Energy-Latency Attacks in Computer Vision
Comments: Accepted at the DLSP 2024
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [111] arXiv:2403.18546 (cross-list from cs.RO) [pdf, other]
-
Title: Efficient Heatmap-Guided 6-Dof Grasp Detection in Cluttered Scenes
Comments: Extensive results on GraspNet-1B dataset
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [112] arXiv:2403.18514 (cross-list from eess.IV) [pdf, other]
-
Title: CT-3DFlow : Leveraging 3D Normalizing Flows for Unsupervised Detection of Pathological Pulmonary CT scans
Authors: Aissam Djahnine, Alexandre Popoff, Emilien Jupin-Delevaux, Vincent Cottin, Olivier Nempont, Loic Boussel
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [113] arXiv:2403.18501 (cross-list from eess.IV) [pdf, other]
-
Title: HEMIT: H&E to Multiplex-immunohistochemistry Image Translation with Dual-Branch Pix2pix Generator
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [114] arXiv:2403.18468 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Deep Learning Segmentation and Classification of Red Blood Cells Using a Large Multi-Scanner Dataset
Comments: 15 pages, 12 figures, 8 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [115] arXiv:2403.18447 (cross-list from cs.CL) [pdf, other]
-
Title: Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction
Comments: Accepted at CVPR 2024
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [116] arXiv:2403.18388 (cross-list from cs.AI) [pdf, other]
-
Title: FTBC: Forward Temporal Bias Correction for Optimizing ANN-SNN Conversion
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [117] arXiv:2403.18347 (cross-list from astro-ph.SR) [pdf, other]
-
Title: A Quantum Fuzzy-based Approach for Real-Time Detection of Solar Coronal Holes
Comments: 14 pages, 5 figures, 3 tables
Subjects: Solar and Stellar Astrophysics (astro-ph.SR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [118] arXiv:2403.18346 (cross-list from cs.CL) [pdf, other]
-
Title: Quantifying and Mitigating Unimodal Biases in Multimodal Large Language Models: A Causal Perspective
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [119] arXiv:2403.18339 (cross-list from eess.IV) [pdf, other]
-
Title: H2ASeg: Hierarchical Adaptive Interaction and Weighting Network for Tumor Segmentation in PET/CT Images
Comments: 10 pages, 4 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [120] arXiv:2403.18321 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Implementation of the Principal Component Analysis onto High-Performance Computer Facilities for Hyperspectral Dimensionality Reduction: Results and ComparisonsAuthors: E. Martel, R. Lazcano, J. Lopez, D. Madroñal, R. Salvador, S. Lopez, E. Juarez, R. Guerra, C. Sanz, R. SarmientoComments: 30 pages, 10 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [121] arXiv:2403.18301 (cross-list from cs.LG) [pdf, other]
-
Title: Selective Mixup Fine-Tuning for Optimizing Non-Decomposable ObjectivesAuthors: Shrinivas Ramasubramanian, Harsh Rangwani, Sho Takemori, Kunal Samanta, Yuhei Umeda, Venkatesh Babu RadhakrishnanComments: ICLR 2024 SpotLightSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [122] arXiv:2403.18266 (cross-list from cs.LG) [pdf, other]
-
Title: Branch-Tuning: Balancing Stability and Plasticity for Continual Self-Supervised LearningSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [123] arXiv:2403.18233 (cross-list from eess.IV) [pdf, other]
-
Title: Benchmarking Image Transformers for Prostate Cancer Detection from Ultrasound DataAuthors: Mohamed Harmanani, Paul F. R. Wilson, Fahimeh Fooladgar, Amoon Jamzad, Mahdi Gilany, Minh Nguyen Nhat To, Brian Wodlinger, Purang Abolmaesumi, Parvin MousaviComments: early draft, 7 pages; Accepted to SPIE Medical Imaging 2024Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Tissues and Organs (q-bio.TO)
- [124] arXiv:2403.18198 (cross-list from eess.IV) [pdf, other]
-
Title: Generative Medical SegmentationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [125] arXiv:2403.18196 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Looking Beyond What You See: An Empirical Analysis on Subgroup Intersectional Fairness for Multi-label Chest X-ray Classification Using Social Determinants of Racial Health InequitiesComments: ICCV CVAMD 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [126] arXiv:2403.18178 (cross-list from cs.RO) [pdf, other]
-
Title: Online Embedding Multi-Scale CLIP Features into 3D MapsComments: 8 pages, 7 figuresSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [127] arXiv:2403.18151 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Automated Report Generation for Lung Cytological Images Using a CNN Vision Classifier and Multiple-Transformer Text Decoders: Preliminary StudyAuthors: Atsushi Teramoto, Ayano Michiba, Yuka Kiriyama, Tetsuya Tsukamoto, Kazuyoshi Imaizumi, Hiroshi FujitaComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [128] arXiv:2403.18144 (cross-list from cs.CR) [pdf, other]
-
Title: Leak and Learn: An Attacker's Cookbook to Train Using Leaked Data from Federated LearningComments: Accepted to CVPR 2024Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [129] arXiv:2403.18139 (cross-list from eess.IV) [pdf, other]
-
Title: Pseudo-MRI-Guided PET Image Reconstruction Method Based on a Diffusion Probabilistic ModelAuthors: Weijie Gan, Huidong Xie, Carl von Gall, Günther Platsch, Michael T. Jurkiewicz, Andrea Andrade, Udunna C. Anazodo, Ulugbek S. Kamilov, Hongyu An, Jorge CabelloSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [130] arXiv:2403.18134 (cross-list from eess.IV) [pdf, other]
-
Title: Integrative Graph-Transformer Framework for Histopathology Whole Slide Image Representation and ClassificationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [131] arXiv:2403.18132 (cross-list from cs.LG) [pdf, other]
-
Title: Recommendation of data-free class-incremental learning algorithms by simulating future dataSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [132] arXiv:2403.18103 (cross-list from cs.LG) [pdf, other]
-
Title: Tutorial on Diffusion Models for Imaging and VisionAuthors: Stanley H. ChanSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [133] arXiv:2403.18096 (cross-list from cs.RO) [pdf, other]
-
Title: Efficient Multi-Band Temporal Video Filter for Reducing Human-Robot InteractionAuthors: Lawrence O'GormanComments: 15 pages, 5 figures, 4 tablesSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [134] arXiv:2403.18035 (cross-list from cs.LG) [pdf, other]
-
Title: Bidirectional Consistency ModelsComments: 40 pages, 25 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [135] arXiv:2403.18028 (cross-list from cs.LG) [pdf, other]
-
Title: Predicting species occurrence patterns from partial observationsComments: Tackling Climate Change with Machine Learning workshop at ICLR 2024Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Populations and Evolution (q-bio.PE)
- [136] arXiv:2403.17958 (cross-list from cs.LG) [pdf, other]
-
Title: Deep Generative Domain Adaptation with Temporal Attention for Cross-User Activity RecognitionSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
Wed, 27 Mar 2024 (showing first 88 of 128 entries)
- [137] arXiv:2403.17937 [pdf, other]
-
Title: Efficient Video Object Segmentation via Modulated Cross-Attention MemoryAuthors: Abdelrahman Shaker, Syed Talal Wasim, Martin Danelljan, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz KhanSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [138] arXiv:2403.17936 [pdf, other]
-
Title: ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture SynthesisAuthors: Muhammad Hamza Mughal, Rishabh Dabral, Ikhsanul Habibie, Lucia Donatelli, Marc Habermann, Christian TheobaltComments: CVPR 2024. Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [139] arXiv:2403.17935 [pdf, other]
-
Title: OmniVid: A Generative Framework for Universal Video UnderstandingComments: Accepted by CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [140] arXiv:2403.17934 [pdf, other]
-
Title: AiOS: All-in-One-Stage Expressive Human Pose and Shape EstimationAuthors: Qingping Sun, Yanjun Wang, Ailing Zeng, Wanqi Yin, Chen Wei, Wenjia Wang, Haiyi Mei, Chi Sing Leung, Ziwei Liu, Lei Yang, Zhongang CaiComments: Homepage: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [141] arXiv:2403.17931 [pdf, other]
-
Title: Track Everything Everywhere Fast and RobustlyComments: project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [142] arXiv:2403.17929 [pdf, other]
-
Title: Towards Explaining Hypercomplex Neural NetworksComments: The paper has been accepted at IEEE WCCI 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [143] arXiv:2403.17926 [pdf, other]
-
Title: FastCAR: Fast Classification And Regression Multi-Task Learning via Task Consolidation for Modelling a Continuous Property Variable of Object ClassesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [144] arXiv:2403.17924 [pdf, other]
-
Title: AID: Attention Interpolation of Text-to-Image DiffusionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [145] arXiv:2403.17920 [pdf, other]
-
Title: TC4D: Trajectory-Conditioned Text-to-4D GenerationAuthors: Sherwin Bahmani, Xian Liu, Yifan Wang, Ivan Skorokhodov, Victor Rong, Ziwei Liu, Xihui Liu, Jeong Joon Park, Sergey Tulyakov, Gordon Wetzstein, Andrea Tagliasacchi, David B. LindellComments: Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [146] arXiv:2403.17915 [pdf, other]
-
Title: Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy VideosAuthors: Akshay Paruchuri, Samuel Ehrenstein, Shuxian Wang, Inbar Fried, Stephen M. Pizer, Marc Niethammer, Roni SenguptaComments: 26 pages, 7 tables, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [147] arXiv:2403.17909 [pdf, other]
-
Title: ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change DetectionComments: accepted at IEEE TGRSSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [148] arXiv:2403.17898 [pdf, other]
-
Title: Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D GaussiansComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [149] arXiv:2403.17893 [pdf, other]
-
Title: A Survey on 3D Egocentric Human Pose EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [150] arXiv:2403.17888 [pdf, other]
-
Title: 2D Gaussian Splatting for Geometrically Accurate Radiance FieldsComments: 12 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [151] arXiv:2403.17884 [pdf, other]
-
Title: Sen2Fire: A Challenging Benchmark Dataset for Wildfire Detection using Sentinel DataSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [152] arXiv:2403.17883 [pdf, other]
-
Title: Superior and Pragmatic Talking Face Generation with Teacher-Student FrameworkAuthors: Chao Liang, Jianwen Jiang, Tianyun Zhong, Gaojie Lin, Zhengkun Rong, Jiaqi Yang, Yongming ZhuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [153] arXiv:2403.17881 [pdf, other]
-
Title: Deepfake Generation and Detection: A Benchmark and SurveyAuthors: Gan Pei, Jiangning Zhang, Menghan Hu, Guangtao Zhai, Chengjie Wang, Zhenyu Zhang, Jian Yang, Chunhua Shen, Dacheng TaoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [154] arXiv:2403.17879 [pdf, other]
-
Title: Low-Latency Neural Stereo StreamingComments: Accepted by CVPR2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [155] arXiv:2403.17870 [pdf, other]
-
Title: Boosting Diffusion Models with Moving Average Sampling in Frequency DomainComments: CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [156] arXiv:2403.17869 [pdf, other]
-
Title: To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of 3D Transfer LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [157] arXiv:2403.17839 [pdf, other]
-
Title: ReMamber: Referring Image Segmentation with Mamba TwisterSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [158] arXiv:2403.17837 [pdf, other]
-
Title: GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image ReconstructionComments: Submitted to IEEESubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM); Image and Video Processing (eess.IV)
- [159] arXiv:2403.17834 [pdf, other]
-
Title: A foundation model utilizing chest CT volumes and radiology reports for supervised-level zero-shot detection of abnormalitiesAuthors: Ibrahim Ethem Hamamci, Sezgin Er, Furkan Almas, Ayse Gulnihan Simsek, Sevval Nil Esirgun, Irem Dogan, Muhammed Furkan Dasdelen, Bastian Wittmann, Enis Simsar, Mehmet Simsar, Emine Bensu Erdemir, Abdullah Alanbay, Anjany Sekuboyina, Berkan Lafci, Mehmet K. Ozdemir, Bjoern MenzeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [160] arXiv:2403.17830 [pdf, other]
-
Title: Assessment of Multimodal Large Language Models in Alignment with Human ValuesAuthors: Zhelun Shi, Zhipin Wang, Hongxing Fan, Zaibin Zhang, Lijun Li, Yongting Zhang, Zhenfei Yin, Lu Sheng, Yu Qiao, Jing ShaoComments: arXiv admin note: text overlap with arXiv:2311.02692Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [161] arXiv:2403.17827 [pdf, other]
-
Title: DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual DescriptionsAuthors: Sammy Christen, Shreyas Hampali, Fadime Sener, Edoardo Remelli, Tomas Hodan, Eric Sauser, Shugao Ma, Bugra TekinComments: Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
- [162] arXiv:2403.17823 [pdf, other]
-
Title: Efficient Image Pre-Training with Siamese Cropped Masked AutoencodersAuthors: Alexandre Eymaël, Renaud Vandeghen, Anthony Cioppa, Silvio Giancola, Bernard Ghanem, Marc Van DroogenbroeckComments: 19 pages, 6 figures, 3 tables, 1 page of supplementary materialSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [163] arXiv:2403.17822 [pdf, other]
-
Title: DN-Splatter: Depth and Normal Priors for Gaussian Splatting and MeshingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [164] arXiv:2403.17804 [pdf, other]
-
Title: Improving Text-to-Image Consistency via Automatic Prompt OptimizationAuthors: Oscar Mañas, Pietro Astolfi, Melissa Hall, Candace Ross, Jack Urbanek, Adina Williams, Aishwarya Agrawal, Adriana Romero-Soriano, Michal DrozdzalSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [165] arXiv:2403.17801 [pdf, other]
-
Title: Towards 3D Vision with Low-Cost Single-Photon CamerasAuthors: Fangzhou Mu, Carter Sifferman, Sacha Jungerman, Yiquan Li, Mark Han, Michael Gleicher, Mohit Gupta, Yin LiSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [166] arXiv:2403.17782 [pdf, other]
-
Title: GenesisTex: Adapting Image Denoising Diffusion to Texture SpaceComments: 12 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [167] arXiv:2403.17765 [pdf, other]
-
Title: MUTE-SLAM: Real-Time Neural SLAM with Multiple Tri-Plane Hash RepresentationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [168] arXiv:2403.17761 [pdf, other]
-
Title: Makeup Prior Models for 3D Facial Makeup Estimation and ApplicationsComments: CVPR2024. Project: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [169] arXiv:2403.17757 [pdf, other]
-
Title: Noise2Noise Denoising of CRISM Hyperspectral DataComments: 5 pages, 3 figures. Accepted as a conference paper at the ICLR 2024 ML4RS WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [170] arXiv:2403.17749 [pdf, other]
-
Title: Multi-Task Dense Prediction via Mixture of Low-Rank ExpertsComments: Accepted at CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [171] arXiv:2403.17727 [pdf, other]
-
Title: FastPerson: Enhancing Video Learning through Effective Video Summarization that Preserves Linguistic and Visual ContextsJournal-ref: AHs '24: Proceedings of the Augmented Humans International Conference 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
- [172] arXiv:2403.17725 [pdf, other]
-
Title: Deep Learning for Segmentation of Cracks in High-Resolution Images of Steel BridgesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [173] arXiv:2403.17712 [pdf, other]
-
Title: Invisible Gas Detection: An RGB-Thermal Cross Attention Network and A New BenchmarkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [174] arXiv:2403.17709 [pdf, other]
-
Title: Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship DetectionComments: CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [175] arXiv:2403.17708 [pdf, other]
-
Title: Panonut360: A Head and Eye Tracking Dataset for Panoramic VideoComments: 7 pages, ACM MMSys'24 acceptedSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
- [176] arXiv:2403.17702 [pdf, other]
-
Title: The Solution for the CVPR 2023 1st foundation model challenge-Track2Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [177] arXiv:2403.17695 [pdf, other]
-
Title: PlainMamba: Improving Non-Hierarchical Mamba in Visual RecognitionAuthors: Chenhongyi Yang, Zehui Chen, Miguel Espinosa, Linus Ericsson, Zhenyu Wang, Jiaming Liu, Elliot J. CrowleySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [178] arXiv:2403.17694 [pdf, other]
-
Title: AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait AnimationSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
- [179] arXiv:2403.17692 [pdf, other]
-
Title: Manifold-Guided Lyapunov Control with Diffusion ModelsComments: 14 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Differential Geometry (math.DG); Optimization and Control (math.OC); Computation (stat.CO)
- [180] arXiv:2403.17691 [pdf, other]
-
Title: Not All Similarities Are Created Equal: Leveraging Data-Driven Biases to Inform GenAI Copyright DisputesAuthors: Uri Hacohen, Adi Haviv, Shahar Sarfaty, Bruria Friedman, Niva Elkin-Koren, Roi Livni, Amit H BermanoComments: Presented at ACM CSLAW 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [181] arXiv:2403.17678 [pdf, other]
-
Title: Hierarchical Light Transformer Ensembles for Multimodal Trajectory ForecastingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [182] arXiv:2403.17664 [pdf, other]
-
Title: DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic PreservationAuthors: Qilin Wang, Jiangning Zhang, Chengming Xu, Weijian Cao, Ying Tai, Yue Han, Yanhao Ge, Hong Gu, Chengjie Wang, Yanwei FuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [183] arXiv:2403.17651 [pdf, other]
-
Title: Exploring Dynamic Transformer for Efficient Object TrackingAuthors: Jiawen Zhu, Xin Chen, Haiwen Diao, Shuai Li, Jun-Yan He, Chenyang Li, Bin Luo, Dong Wang, Huchuan LuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [184] arXiv:2403.17638 [pdf, other]
-
Title: Learning with Unreliability: Fast Few-shot Voxel Radiance Fields with Relative Geometric ConsistencyComments: CVPR 2024 final versionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [185] arXiv:2403.17633 [pdf, other]
-
Title: UADA3D: Unsupervised Adversarial Domain Adaptation for 3D Object Detection with Sparse LiDAR and Large Domain GapsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [186] arXiv:2403.17631 [pdf, other]
-
Title: AniArtAvatar: Animatable 3D Art Avatar from a Single ImageAuthors: Shaoxu LiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [187] arXiv:2403.17610 [pdf, other]
-
Title: MMVP: A Multimodal MoCap Dataset with Vision and Pressure SensorsAuthors: He Zhang, Shenghao Ren, Haolei Yuan, Jianhui Zhao, Fan Li, Shuangpeng Sun, Zhenghao Liang, Tao Yu, Qiu Shen, Xun CaoComments: CVPR2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [188] arXiv:2403.17608 [pdf, other]
-
Title: Fake or JPEG? Revealing Common Biases in Generated Image Detection DatasetsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [189] arXiv:2403.17589 [pdf, other]
-
Title: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language ModelsComments: CVPR2024; Codes are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
- [190] arXiv:2403.17550 [pdf, other]
-
Title: DeepMIF: Deep Monotonic Implicit Fields for Large-Scale LiDAR 3D MappingComments: 8 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [191] arXiv:2403.17541 [pdf, other]
-
Title: WordRobe: Text-Guided Generation of Textured 3D GarmentsSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [192] arXiv:2403.17537 [pdf, other]
-
Title: NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided SegmentationComments: To appear in CVPR2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [193] arXiv:2403.17530 [pdf, other]
-
Title: Boosting Few-Shot Learning with Disentangled Self-Supervised Learning and Meta-Learning for Medical Image ClassificationComments: 20 pages, 4 figures, 4 tables. Submitted to Elsevier on 25 March 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [194] arXiv:2403.17525 [pdf, other]
-
Title: Equipping Sketch Patches with Context-Aware Positional Encoding for Graphic Sketch RepresentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [195] arXiv:2403.17512 [pdf, other]
-
Title: Random-coupled Neural NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [196] arXiv:2403.17502 [pdf, other]
-
Title: SeNM-VAE: Semi-Supervised Noise Modeling with Hierarchical Variational AutoencoderSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [197] arXiv:2403.17496 [pdf, other]
-
Title: Dr.Hair: Reconstructing Scalp-Connected Hair Strands without Pre-training via Differentiable Rendering of Line SegmentsComments: CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [198] arXiv:2403.17477 [pdf, other]
-
Title: DiffGaze: A Diffusion Model for Continuous Gaze Sequence Generation on 360° ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [199] arXiv:2403.17465 [pdf, other]
-
Title: LaRE^2: Latent Reconstruction Error Based Method for Diffusion-Generated Image DetectionComments: CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [200] arXiv:2403.17423 [pdf, other]
-
Title: Test-time Adaptation Meets Image Enhancement: Improving Accuracy via Uncertainty-aware Logit SwitchingAuthors: Shohei Enomoto, Naoya Hasegawa, Kazuki Adachi, Taku Sasaki, Shin'ya Yamaguchi, Satoshi Suzuki, Takeharu EdaComments: Accepted to IJCNN2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [201] arXiv:2403.17422 [pdf, other]
-
Title: InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse DiffusionComments: Accepted to CVPR 2024, project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [202] arXiv:2403.17409 [pdf, other]
-
Title: Neural Clustering based Visual Representation LearningComments: CVPR 2024. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [203] arXiv:2403.17390 [pdf, other]
-
Title: SSF3D: Strict Semi-Supervised 3D Object Detection with Switching FilterAuthors: Songbur WongSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [204] arXiv:2403.17387 [pdf, other]
-
Title: Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object DetectionAuthors: Jiacheng Zhang, Jiaming Li, Xiangru Lin, Wei Zhang, Xiao Tan, Junyu Han, Errui Ding, Jingdong Wang, Guanbin LiComments: To appear in CVPR2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [205] arXiv:2403.17377 [pdf, other]
-
Title: Self-Rectifying Diffusion Sampling with Perturbed-Attention GuidanceAuthors: Donghoon Ahn, Hyoungwon Cho, Jaewon Min, Wooseok Jang, Jungwoo Kim, SeonHwa Kim, Hyun Hee Park, Kyong Hwan Jin, Seungryong KimComments: Project page is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [206] arXiv:2403.17373 [pdf, other]
-
Title: AIDE: An Automatic Data Engine for Object Detection in Autonomous DrivingAuthors: Mingfu Liang, Jong-Chyi Su, Samuel Schulter, Sparsh Garg, Shiyu Zhao, Ying Wu, Manmohan ChandrakerComments: Accepted by CVPR-2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [207] arXiv:2403.17369 [pdf, other]
-
Title: CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt TuningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [208] arXiv:2403.17360 [pdf, other]
-
Title: Activity-Biometrics: Person Identification from Daily ActivitiesComments: CVPR 2024 Main conferenceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [209] arXiv:2403.17346 [pdf, other]
-
Title: TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild VideosComments: The project website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [210] arXiv:2403.17343 [pdf, other]
-
Title: Language Models are Free Boosters for Biomedical Imaging TasksSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [211] arXiv:2403.17342 [pdf, other]
-
Title: The Solution for the ICCV 2023 1st Scientific Figure Captioning ChallengeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [212] arXiv:2403.17334 [pdf, other]
-
Title: OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd RepresentationComments: Accepted by CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [213] arXiv:2403.17330 [pdf, other]
-
Title: Staircase Localization for Autonomous Exploration in Urban EnvironmentsComments: 9 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [214] arXiv:2403.17301 [pdf, other]
-
Title: Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous DrivingComments: Accepted by CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [215] arXiv:2403.17237 [pdf, other]
-
Title: DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric DiffusionComments: Project webpage: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
- [216] arXiv:2403.17223 [pdf, ps, other]
-
Title: Co-Occurring of Object Detection and Identification towards unlabeled object discoveryComments: 6 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [217] arXiv:2403.17217 [pdf, other]
-
Title: DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face ReenactmentAuthors: Stella Bounareli, Christos Tzelepis, Vasileios Argyriou, Ioannis Patras, Georgios TzimiropoulosComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [218] arXiv:2403.17213 [pdf, other]
-
Title: AnimateMe: 4D Facial Expressions via Diffusion ModelsAuthors: Dimitrios Gerogiannis, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias, Alexandros Lattas, Stylianos Moschoglou, Stylianos Ploumpis, Stefanos ZafeiriouSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [219] arXiv:2403.17192 [pdf, ps, other]
-
Title: Strategies to Improve Real-World Applicability of Laparoscopic Anatomy Segmentation ModelsComments: 13 pages, 5 figures, 4 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [220] arXiv:2403.17188 [pdf, other]
-
Title: LOTUS: Evasive and Resilient Backdoor Attacks through Sub-PartitioningAuthors: Siyuan Cheng, Guanhong Tao, Yingqi Liu, Guangyu Shen, Shengwei An, Shiwei Feng, Xiangzhe Xu, Kaiyuan Zhang, Shiqing Ma, Xiangyu ZhangComments: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [221] arXiv:2403.17176 [pdf, other]
-
Title: Histogram Layers for Neural Engineered FeaturesComments: 11 pages, 7 figures, submitted for reviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [222] arXiv:2403.17175 [pdf, ps, other]
-
Title: Engagement Measurement Based on Facial Landmarks and Spatial-Temporal Graph Convolutional NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [223] arXiv:2403.17173 [pdf, other]
-
Title: Task2Box: Box Embeddings for Modeling Asymmetric Task RelationshipsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [224] arXiv:2403.17128 [pdf, other]
-
Title: Benchmarking Video Frame InterpolationComments: this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV)