Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 136

Thu, 25 Apr 2024 (continued, showing last 61 of 85 entries)

[137]  arXiv:2404.15812 [pdf, other]
Title: Facilitating Advanced Sentinel-2 Analysis Through a Simplified Computation of Nadir BRDF Adjusted Reflectance
Comments: Submitted to FOSS4G Europe 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Instrumentation and Methods for Astrophysics (astro-ph.IM)
[138]  arXiv:2404.15802 [pdf, other]
Title: Raformer: Redundancy-Aware Transformer for Video Wire Inpainting
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[139]  arXiv:2404.15790 [pdf, other]
Title: Leveraging Large Language Models for Multimodal Search
Comments: Published at CVPRW 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140]  arXiv:2404.15789 [pdf, other]
Title: MotionMaster: Training-free Camera Motion Transfer For Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[141]  arXiv:2404.15785 [pdf, other]
Title: Seeing Beyond Classes: Zero-Shot Grounded Situation Recognition via Language Explainer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[142]  arXiv:2404.15781 [pdf, other]
Title: Real-Time Compressed Sensing for Joint Hyperspectral Image Transmission and Restoration for CubeSat
Comments: Accepted by TGRS 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[143]  arXiv:2404.15774 [pdf, other]
Title: Toward Physics-Aware Deep Learning Architectures for LiDAR Intensity Simulation
Comments: 7 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[144]  arXiv:2404.15771 [pdf, other]
Title: DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[145]  arXiv:2404.15770 [pdf, other]
Title: ChEX: Interactive Localization and Region Description in Chest X-rays
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[146]  arXiv:2404.15765 [pdf, other]
Title: 3D Face Morphing Attack Generation using Non-Rigid Registration
Comments: Accepted to 2024 18th International Conference on Automatic Face and Gesture Recognition (FG) as short paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[147]  arXiv:2404.15743 [pdf, other]
Title: SRAGAN: Saliency Regularized and Attended Generative Adversarial Network for Chinese Ink-wash Painting Generation
Authors: Xiang Gao, Yuqi Zhang
Comments: 25 pages, 14 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148]  arXiv:2404.15736 [pdf, other]
Title: What Makes Multimodal In-Context Learning Work?
Comments: 20 pages, 16 figures. Accepted to CVPR 2024 Workshop on Prompting in Vision. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[149]  arXiv:2404.15734 [pdf, other]
Title: Fine-grained Spatial-temporal MLP Architecture for Metro Origin-Destination Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[150]  arXiv:2404.15721 [pdf, other]
Title: SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[151]  arXiv:2404.15719 [pdf, other]
Title: HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[152]  arXiv:2404.15714 [pdf, other]
Title: Ada-DF: An Adaptive Label Distribution Fusion Network For Facial Expression Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[153]  arXiv:2404.15709 [pdf, other]
Title: ViViDex: Learning Vision-based Dexterous Manipulation from Human Videos
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[154]  arXiv:2404.15707 [pdf, other]
Title: ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[155]  arXiv:2404.15700 [pdf, other]
Title: MAS-SAM: Segment Any Marine Animal with Aggregated Features
Comments: Accepted by IJCAI2024. More modifications may be performed
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[156]  arXiv:2404.15697 [pdf, other]
Title: DeepFeatureX Net: Deep Features eXtractors based Network for discriminating synthetic from real images
Authors: Orazio Pontorno (1), Luca Guarnera (1), Sebastiano Battiato (1) ((1) University of Catania)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[157]  arXiv:2404.15683 [pdf, other]
Title: AnoFPDM: Anomaly Segmentation with Forward Process of Diffusion Models for Brain MRI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[158]  arXiv:2404.15677 [pdf, other]
Title: CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
Comments: Code will be released very soon: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[159]  arXiv:2404.15672 [pdf, other]
Title: Representing Part-Whole Hierarchies in Foundation Models by Learning Localizability, Composability, and Decomposability from Anatomy via Self-Supervision
Comments: Accepted at CVPR 2024 [main conference]
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160]  arXiv:2404.15655 [pdf, other]
Title: Multi-Modal Proxy Learning Towards Personalized Visual Multiple Clustering
Comments: Accepted by CVPR 2024. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[161]  arXiv:2404.15653 [pdf, other]
Title: CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[162]  arXiv:2404.15644 [pdf, other]
Title: Building-PCC: Building Point Cloud Completion Benchmarks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[163]  arXiv:2404.15638 [pdf, other]
Title: PriorNet: A Novel Lightweight Network with Multidimensional Interactive Attention for Efficient Image Dehazing
Comments: 8 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[164]  arXiv:2404.15635 [pdf, other]
Title: A Real-time Evaluation Framework for Pedestrian's Potential Risk at Non-Signalized Intersections Based on Predicted Post-Encroachment Time
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[165]  arXiv:2404.15608 [pdf, other]
Title: Understanding and Improving CNNs with Complex Structure Tensor: A Biometrics Study
Comments: preprint manuscript
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166]  arXiv:2404.15592 [pdf, other]
Title: ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[167]  arXiv:2404.15591 [pdf, other]
Title: Domain Adaptation for Learned Image Compression with Supervised Adapters
Comments: 10 pages, published to Data compression conference 2024 (DCC2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[168]  arXiv:2404.15580 [pdf, other]
Title: MiM: Mask in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis
Comments: submitted to journal
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[169]  arXiv:2404.15564 [pdf, other]
Title: Guided AbsoluteGrad: Magnitude of Gradients Matters to Explanation's Localization and Saliency
Authors: Jun Huang, Yan Liu
Comments: CAI2024 Camera-ready Submission
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[170]  arXiv:2404.15552 [pdf, other]
Title: Cross-Temporal Spectrogram Autoencoder (CTSAE): Unsupervised Dimensionality Reduction for Clustering Gravitational Wave Glitches
Subjects: Computer Vision and Pattern Recognition (cs.CV); Instrumentation and Methods for Astrophysics (astro-ph.IM); Machine Learning (cs.LG); General Relativity and Quantum Cosmology (gr-qc)
[171]  arXiv:2404.15523 [pdf, other]
Title: Understanding Hyperbolic Metric Learning through Hard Negative Sampling
Comments: published in Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2024. arXiv admin note: text overlap with arXiv:2203.10833 by other authors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[172]  arXiv:2404.15516 [pdf, other]
Title: Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval
Comments: 15 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[173]  arXiv:2404.15506 [pdf, other]
Title: Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
Comments: Our project page is at this https URL. arXiv admin note: substantial text overlap with arXiv:2307.10984
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174]  arXiv:2404.15451 [pdf, other]
Title: CFPFormer: Feature-pyramid like Transformer Decoder for Segmentation and Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175]  arXiv:2404.15449 [pdf, other]
Title: ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[176]  arXiv:2404.15447 [pdf, other]
Title: GLoD: Composing Global Contexts and Local Details in Image Generation
Authors: Moyuru Yamada
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[177]  arXiv:2404.15445 [pdf, other]
Title: Deep multi-prototype capsule networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[178]  arXiv:2404.15436 [pdf, other]
Title: Iterative Cluster Harvesting for Wafer Map Defect Patterns
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[179]  arXiv:2404.15406 [pdf, other]
Title: Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs
Comments: CVPR 2024 Workshop on What is Next in Multimodal Foundation Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[180]  arXiv:2404.15385 [pdf, ps, other]
Title: Sum of Group Error Differences: A Critical Examination of Bias Evaluation in Biometric Verification and a Dual-Metric Measure
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[181]  arXiv:2404.15383 [pdf, other]
Title: WANDR: Intention-guided Human Motion Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[182]  arXiv:2404.15378 [pdf, other]
Title: Hierarchical Hybrid Sliced Wasserstein: A Scalable Metric for Heterogeneous Joint Distributions
Authors: Khai Nguyen, Nhat Ho
Comments: 24 pages, 11 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[183]  arXiv:2404.15919 (cross-list from cs.LG) [pdf, other]
Title: An Element-Wise Weights Aggregation Method for Federated Learning
Comments: 2023 IEEE International Conference on Data Mining Workshops (ICDMW)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[184]  arXiv:2404.15918 (cross-list from eess.IV) [pdf, other]
Title: Perception and Localization of Macular Degeneration Applying Convolutional Neural Network, ResNet and Grad-CAM
Comments: 12 pages, 5 figures, 2 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[185]  arXiv:2404.15847 (cross-list from physics.med-ph) [pdf, other]
Title: 3D Freehand Ultrasound using Visual Inertial and Deep Inertial Odometry for Measuring Patellar Tracking
Comments: Accepted to IEEE Medical Measurements & Applications (MeMeA) 2024
Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
[186]  arXiv:2404.15786 (cross-list from eess.IV) [pdf, other]
Title: Rethinking Model Prototyping through the MedMNIST+ Dataset Collection
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[187]  arXiv:2404.15718 (cross-list from eess.IV) [pdf, other]
Title: Mitigating False Predictions In Unreasonable Body Regions
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[188]  arXiv:2404.15661 (cross-list from cs.GR) [pdf, other]
Title: CWF: Consolidating Weak Features in High-quality Mesh Simplification
Comments: 14 pages, 22 figures
Subjects: Graphics (cs.GR); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[189]  arXiv:2404.15532 (cross-list from cs.HC) [pdf, other]
Title: BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis
Comments: 26 pages, 14 figures. The data and code for this project are accessible at this https URL
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[190]  arXiv:2404.15394 (cross-list from eess.IV) [pdf, ps, other]
Title: On Generating Cancelable Biometric Template using Reverse of Boolean XOR
Authors: Manisha, Nitin Kumar
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[191]  arXiv:2404.15367 (cross-list from eess.SP) [pdf, other]
Title: Leveraging Visibility Graphs for Enhanced Arrhythmia Classification with Graph Convolutional Networks
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[192]  arXiv:2404.15364 (cross-list from eess.SP) [pdf, other]
Title: MP-DPD: Low-Complexity Mixed-Precision Neural Networks for Energy-Efficient Digital Predistortion of Wideband Power Amplifiers
Comments: Accepted to IEEE Microwave and Wireless Technology Letters (MWTL)
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[193]  arXiv:2404.15346 (cross-list from eess.SP) [pdf, other]
Title: A Novel Micro-Doppler Coherence Loss for Deep Learning Radar Applications
Comments: Presented at 2021 18th European Radar Conference (EuRAD)
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[194]  arXiv:2404.15318 (cross-list from q-bio.QM) [pdf, ps, other]
Title: VASARI-auto: equitable, efficient, and economical featurisation of glioma MRI
Comments: 28 pages, 6 figures, 1 table
Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Tissues and Organs (q-bio.TO)
[195]  arXiv:2404.15312 (cross-list from eess.SP) [pdf, other]
Title: Realtime Person Identification via Gait Analysis
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[196]  arXiv:2404.15287 (cross-list from eess.IV) [pdf, other]
Title: A Semi-automatic Cranial Implant Design Tool Based on Rigid ICP Template Alignment and Voxel Space Reconstruction
Comments: 6 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[197]  arXiv:2404.14956 (cross-list from eess.IV) [pdf, other]
Title: DAWN: Domain-Adaptive Weakly Supervised Nuclei Segmentation via Cross-Task Interactions
Comments: 13 pages, 11 figures, 8 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)

Wed, 24 Apr 2024 (showing first 43 of 110 entries)

[198]  arXiv:2404.15276 [pdf, other]
Title: SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation
Comments: Published at TPAMI 2024
Journal-ref: https://www.computer.org/csdl/journal/tp/2024/05/10354384/1SP2qWh8Fq0
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[199]  arXiv:2404.15275 [pdf, other]
Title: ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[200]  arXiv:2404.15272 [pdf, other]
Title: CT-GLIP: 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body Scenarios
Comments: 12 pages, 5 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[201]  arXiv:2404.15271 [pdf, other]
Title: Automatic Layout Planning for Visually-Rich Documents with Instruction-Following Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[202]  arXiv:2404.15267 [pdf, other]
Title: From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[203]  arXiv:2404.15264 [pdf, other]
Title: TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[204]  arXiv:2404.15263 [pdf, other]
Title: Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization
Comments: Accepted to CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[205]  arXiv:2404.15259 [pdf, other]
Title: FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent
Comments: Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[206]  arXiv:2404.15254 [pdf, other]
Title: UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
Comments: 17 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[207]  arXiv:2404.15252 [pdf, other]
Title: Source-free Domain Adaptation for Video Object Detection Under Adverse Image Conditions
Comments: accepted by the UG2+ workshop at CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[208]  arXiv:2404.15244 [pdf, other]
Title: Efficient Transformer Encoders for Mask2Former-style models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[209]  arXiv:2404.15234 [pdf, other]
Title: Massively Annotated Datasets for Assessment of Synthetic and Real Data in Face Recognition
Comments: Accepted at FG 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[210]  arXiv:2404.15228 [pdf, other]
Title: Re-Thinking Inverse Graphics With Large Language Models
Comments: 31 pages; project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[211]  arXiv:2404.15224 [pdf, other]
Title: Deep Models for Multi-View 3D Object Recognition: A Review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[212]  arXiv:2404.15217 [pdf, other]
Title: Towards Large-Scale Training of Pathology Foundation Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[213]  arXiv:2404.15212 [pdf, other]
Title: Real-time Lane-wise Traffic Monitoring in Optimal ROIs
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[214]  arXiv:2404.15174 [pdf, other]
Title: Fourier-enhanced Implicit Neural Fusion Network for Multispectral and Hyperspectral Image Fusion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[215]  arXiv:2404.15163 [pdf, other]
Title: Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment
Comments: IEEE Transactions on Broadcasting (TBC)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[216]  arXiv:2404.15161 [pdf, other]
Title: Combating Missing Modalities in Egocentric Videos at Test Time
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[217]  arXiv:2404.15141 [pdf, other]
Title: CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[218]  arXiv:2404.15129 [pdf, ps, other]
Title: Gallbladder Cancer Detection in Ultrasound Images based on YOLO and Faster R-CNN
Comments: Published in 2024 10th International Conference on Artificial Intelligence and Robotics (QICAR)
Journal-ref: 2024 10th International Conference on Artificial Intelligence and Robotics (QICAR) (pp. 227-231). IEEE
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[219]  arXiv:2404.15127 [pdf, other]
Title: MedDr: Diagnosis-Guided Bootstrapping for Large-Scale Medical Vision-Language Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[220]  arXiv:2404.15100 [pdf, other]
Title: Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[221]  arXiv:2404.15081 [pdf, other]
Title: Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models
Comments: Published at CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[222]  arXiv:2404.15041 [pdf, other]
Title: LEAF: Unveiling Two Sides of the Same Coin in Semi-supervised Facial Expression Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[223]  arXiv:2404.15037 [pdf, other]
Title: DP-Net: Learning Discriminative Parts for image recognition
Comments: IEEE ICIP 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[224]  arXiv:2404.15033 [pdf, other]
Title: IPAD: Industrial Process Anomaly Detection Dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[225]  arXiv:2404.15028 [pdf, other]
Title: PRISM: A Promptable and Robust Interactive Segmentation Model with Visual Prompts
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[226]  arXiv:2404.15024 [pdf, other]
Title: A Learning Paradigm for Interpretable Gradients
Comments: VISAPP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[227]  arXiv:2404.15022 [pdf, other]
Title: A review of deep learning-based information fusion techniques for multimodal medical image classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[228]  arXiv:2404.15014 [pdf, other]
Title: OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[229]  arXiv:2404.15010 [pdf, other]
Title: X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition
Journal-ref: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[230]  arXiv:2404.15009 [pdf, other]
[231]  arXiv:2404.15008 [pdf, other]
Title: External Prompt Features Enhanced Parameter-efficient Fine-tuning for Salient Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[232]  arXiv:2404.14996 [pdf, other]
Title: CA-Stream: Attention-based pooling for interpretable image recognition
Comments: CVPR XAI4CV workshop 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[233]  arXiv:2404.14990 [pdf, other]
Title: Interpreting COVID Lateral Flow Tests' Results with Foundation Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[234]  arXiv:2404.14985 [pdf, other]
Title: Other Tokens Matter: Exploring Global and Local Features of Vision Transformers for Object Re-Identification
Comments: Accepted by CVIU2024. More modifications may be performed
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[235]  arXiv:2404.14979 [pdf, other]
Title: SGFormer: Spherical Geometry Transformer for 360 Depth Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[236]  arXiv:2404.14975 [pdf, other]
Title: CAGE: Circumplex Affect Guided Expression Inference
Comments: Accepted for publication at ABAW Workshop at CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[237]  arXiv:2404.14967 [pdf, other]
Title: CoARF: Controllable 3D Artistic Style Transfer for Radiance Fields
Comments: International Conference on 3D Vision 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[238]  arXiv:2404.14966 [pdf, other]
Title: Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model
Comments: 10 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[239]  arXiv:2404.14956 [pdf, other]
Title: DAWN: Domain-Adaptive Weakly Supervised Nuclei Segmentation via Cross-Task Interactions
Comments: 13 pages, 11 figures, 8 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[240]  arXiv:2404.14955 [pdf, other]
Title: Traditional to Transformers: A Survey on Current Trends and Future Prospects for Hyperspectral Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)