We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 112

[ total of 679 entries: 1-104 | 9-112 | 113-216 | 217-320 | 321-424 | 425-528 | ... | 633-679 ]
[ showing 104 entries per page: fewer | more | all ]

Tue, 4 Jun 2024 (continued, showing 104 of 228 entries)

[113]  arXiv:2406.01555 [pdf, other]
Title: Towards Flexible Interactive Reflection Removal with Human Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[114]  arXiv:2406.01551 [pdf, other]
Title: ELSA: Evaluating Localization of Social Activities in Urban Streets
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[115]  arXiv:2406.01494 [pdf, other]
Title: Robust Classification by Coupling Data Mollification with Label Smoothing
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[116]  arXiv:2406.01493 [pdf, other]
Title: Learning Temporally Consistent Video Depth from Video Diffusion Priors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[117]  arXiv:2406.01489 [pdf, other]
Title: DA-HFNet: Progressive Fine-Grained Forgery Image Detection and Localization Based on Dual Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[118]  arXiv:2406.01486 [pdf, other]
Title: Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[119]  arXiv:2406.01480 [pdf, other]
Title: Towards Automating the Retrospective Generation of BIM Models: A Unified Framework for 3D Semantic Reconstruction of the Built Environment
Comments: CVPRW 2024, Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[120]  arXiv:2406.01476 [pdf, other]
Title: DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians with Video Diffusion Priors
Comments: Technical report. Codes are released at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121]  arXiv:2406.01460 [pdf, other]
Title: MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization
Comments: ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[122]  arXiv:2406.01455 [pdf, other]
Title: Automatic Fused Multimodal Deep Learning for Plant Identification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[123]  arXiv:2406.01451 [pdf, other]
Title: SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation
Comments: Accepted by ICML2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[124]  arXiv:2406.01449 [pdf, other]
Title: SLANT: Spurious Logo ANalysis Toolkit
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[125]  arXiv:2406.01432 [pdf, other]
Title: ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126]  arXiv:2406.01429 [pdf, other]
Title: EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2406.01425 [pdf, other]
Title: Sensitivity-Informed Augmentation for Robust Segmentation
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128]  arXiv:2406.01402 [pdf, other]
Title: Mixture of Rationale: Multi-Modal Reasoning Mixture for Visual Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[129]  arXiv:2406.01395 [pdf, other]
Title: TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation
Comments: This work has been submitted to the IEEE Transactions on Intelligent Vehicles for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[130]  arXiv:2406.01388 [pdf, other]
Title: AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131]  arXiv:2406.01380 [pdf, other]
Title: Convolutional Unscented Kalman Filter for Multi-Object Tracking with Outliers
Comments: 11 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[132]  arXiv:2406.01365 [pdf, other]
Title: From Feature Visualization to Visual Circuits: Effect of Adversarial Model Manipulation
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[133]  arXiv:2406.01356 [pdf, other]
Title: MP-PolarMask: A Faster and Finer Instance Segmentation for Concave Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[134]  arXiv:2406.01355 [pdf, other]
Title: Differentially Private Fine-Tuning of Diffusion Models
Comments: 16 pages, 5 figures, 11 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[135]  arXiv:2406.01349 [pdf, other]
Title: Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Comments: Project Page: this https URL, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[136]  arXiv:2406.01337 [pdf, other]
Title: ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds
Comments: CVPRW 2024 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137]  arXiv:2406.01334 [pdf, other]
Title: HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models
Comments: accepted in CVPR2024, project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[138]  arXiv:2406.01326 [pdf, other]
Title: TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
Comments: 20 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[139]  arXiv:2406.01316 [pdf, other]
Title: Enhancing Inertial Hand based HAR through Joint Representation of Language, Pose and Synthetic IMUs
Comments: Review Copy
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[140]  arXiv:2406.01315 [pdf, other]
Title: Scale-Free Image Keypoints Using Differentiable Persistent Homology
Comments: Accepted to ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Algebraic Topology (math.AT)
[141]  arXiv:2406.01314 [pdf, other]
Title: Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[142]  arXiv:2406.01302 [pdf, ps, other]
Title: Pulmonary Embolism Mortality Prediction Using Multimodal Learning Based on Computed Tomography Angiography and Clinical Data
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[143]  arXiv:2406.01300 [pdf, other]
Title: pOps: Photo-Inspired Diffusion Operators
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[144]  arXiv:2406.01294 [pdf, other]
Title: Capsule Enhanced Variational AutoEncoder for Underwater Image Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[145]  arXiv:2406.01278 [pdf, other]
Title: fruit-SALAD: A Style Aligned Artwork Dataset to reveal similarity perception in image embeddings
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computational Complexity (cs.CC); Machine Learning (cs.LG)
[146]  arXiv:2406.01264 [pdf, other]
Title: FreeTumor: Advance Tumor Segmentation via Large-Scale Tumor Synthesis
Comments: Preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[147]  arXiv:2406.01256 [pdf, other]
Title: Augmented Commonsense Knowledge for Remote Object Grounding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[148]  arXiv:2406.01210 [pdf, other]
Title: GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer
Comments: Accepted by ICML 2024, code and models are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[149]  arXiv:2406.01203 [pdf, other]
Title: Scaling Up Deep Clustering Methods Beyond ImageNet-1K
Comments: Work in progress
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[150]  arXiv:2406.01196 [pdf, other]
Title: 3D WholeBody Pose Estimation based on Semantic Graph Attention Network and Distance Information
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[151]  arXiv:2406.01194 [pdf, other]
Title: AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[152]  arXiv:2406.01188 [pdf, other]
Title: UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[153]  arXiv:2406.01170 [pdf, other]
Title: Zero-Shot Out-of-Distribution Detection with Outlier Label Exposure
Comments: Accepted by IJCNN2024, 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[154]  arXiv:2406.01159 [pdf, other]
Title: Dimba: Transformer-Mamba Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[155]  arXiv:2406.01154 [pdf, other]
Title: DeepUniUSTransformer: Towards A Universal UltraSound Model with Prompted Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[156]  arXiv:2406.01136 [pdf, other]
Title: Towards Practical Single-shot Motion Synthesis
Comments: CVPR 2024, AI for 3D Generation Workshop, Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157]  arXiv:2406.01127 [pdf, other]
Title: Learning Adaptive Fusion Bank for Multi-modal Salient Object Detection
Comments: Accepted by TCSVT 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[158]  arXiv:2406.01125 [pdf, other]
Title: $Δ$-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers
Comments: 12 pages, 6 figures, 6 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[159]  arXiv:2406.01112 [pdf, other]
Title: BACON: Bayesian Optimal Condensation Framework for Dataset Distillation
Comments: 22 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160]  arXiv:2406.01079 [pdf, other]
Title: Object Aware Egocentric Online Action Detection
Comments: CVPR First Joint Egocentric Vision Workshop 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[161]  arXiv:2406.01078 [pdf, other]
Title: CUT: A Controllable, Universal, and Training-Free Visual Anomaly Generation Framework
Comments: 9 pages excluding appendix
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[162]  arXiv:2406.01076 [pdf, other]
Title: Estimating Canopy Height at Scale
Comments: ICML Camera-Ready, 17 pages, 14 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[163]  arXiv:2406.01073 [pdf, other]
Title: Understanding the Cross-Domain Capabilities of Video-Based Few-Shot Action Recognition Models
Comments: Preprint. Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164]  arXiv:2406.01071 [pdf, other]
Title: Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline
Comments: 10 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[165]  arXiv:2406.01069 [pdf, other]
Title: UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166]  arXiv:2406.01063 [pdf, other]
Title: DANCE: Dual-View Distribution Alignment for Dataset Condensation
Comments: This work has been accepted by IJCAI-24
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[167]  arXiv:2406.01062 [pdf, other]
Title: SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[168]  arXiv:2406.01059 [pdf, other]
Title: VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model
Comments: 15 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[169]  arXiv:2406.01056 [pdf, other]
Title: Virtual avatar generation models as world navigators
Authors: Sai Mandava
Comments: 16 pages, 15 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Robotics (cs.RO)
[170]  arXiv:2406.01042 [pdf, other]
Title: Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting
Comments: GitHub Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[171]  arXiv:2406.01040 [pdf, other]
Title: Synthetic Data Generation for 3D Myocardium Deformation Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[172]  arXiv:2406.01033 [pdf, ps, other]
Title: Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement
Comments: 10 pages, 6 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[173]  arXiv:2406.01029 [pdf, other]
Title: CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship Modeling in Aerial Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174]  arXiv:2406.01028 [pdf, other]
Title: LLEMamba: Low-Light Enhancement via Relighting-Guided Mamba with Deep Unfolding Network
Comments: 9pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175]  arXiv:2406.01025 [pdf, ps, other]
Title: Khayyam Offline Persian Handwriting Dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[176]  arXiv:2406.01020 [pdf, other]
Title: CLIP-Guided Attribute Aware Pretraining for Generalizable Image Quality Assessment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177]  arXiv:2406.01003 [pdf, other]
Title: Uni-ISP: Unifying the Learning of ISPs from Multiple Cameras
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178]  arXiv:2406.00985 [pdf, other]
Title: MultiEdits: Simultaneous Multi-Aspect Editing with Text-to-Image Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[179]  arXiv:2406.00977 [pdf, other]
Title: Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[180]  arXiv:2406.00971 [pdf, other]
Title: MiniGPT-Reverse-Designing: Predicting Image Adjustments Utilizing MiniGPT-4
Comments: 8 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[181]  arXiv:2406.00956 [pdf, other]
Title: Improving Segment Anything on the Fly: Auxiliary Online Learning and Adaptive Fusion for Medical Image Segmentation
Comments: Project Link: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[182]  arXiv:2406.00955 [pdf, other]
Title: How Video Meetings Change Your Expression
Comments: Project webpage is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183]  arXiv:2406.00947 [pdf, other]
Title: Cross-Dimensional Medical Self-Supervised Representation Learning Based on a Pseudo-3D Transformation
Comments: MICCAI 2024 accept
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[184]  arXiv:2406.00934 [pdf, other]
Title: LanEvil: Benchmarking the Robustness of Lane Detection to Environmental Illusions
Comments: Submitted to ACM MM 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185]  arXiv:2406.00929 [pdf, other]
Title: Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry
Comments: 8 pages. 5 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[186]  arXiv:2406.00919 [pdf, other]
Title: Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling
Comments: IJCV 2024 Accepted. arXiv admin note: substantial text overlap with arXiv:2303.02344
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[187]  arXiv:2406.00917 [pdf, other]
Title: Alignment-Free RGBT Salient Object Detection: Semantics-guided Asymmetric Correlation Network and A Unified Benchmark
Comments: Accepted by TMM 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[188]  arXiv:2406.00908 [pdf, other]
Title: ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[189]  arXiv:2406.00907 [pdf, other]
Title: DDA: Dimensionality Driven Augmentation Search for Contrastive Learning in Laparoscopic Surgery
Comments: 29 pages, 16 figures; MIDL 2024 - Medical Imaging with Deep Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[190]  arXiv:2406.00891 [pdf, other]
Title: Global High Categorical Resolution Land Cover Mapping via Weak Supervision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[191]  arXiv:2406.00885 [pdf, other]
Title: Visual place recognition for aerial imagery: A survey
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[192]  arXiv:2406.00872 [pdf, other]
Title: OLIVE: Object Level In-Context Visual Embeddings
Comments: ACL 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[193]  arXiv:2406.00856 [pdf, other]
Title: DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection
Comments: 6 pages, 1 figure
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[194]  arXiv:2406.00848 [pdf, ps, other]
Title: Eating Smart: Advancing Health Informatics with the Grounding DINO based Dietary Assistant App
Comments: The work presented in this paper was part of the proceedings for the First International Conference on Artificial Intelligence (ICATA 2024)
Journal-ref: Eating Smart: Advancing Health Informatics with the Grounding DINO-based Dietary Assistant App, International Journal of Scientific and Innovative Studies, June 2024, Volume 3, Number 3, Pages 26-34, Available online at IJSRIS
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[195]  arXiv:2406.00830 [pdf, other]
Title: Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection
Comments: Code Page: this https URL This paper has been submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) for possible publication
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[196]  arXiv:2406.00828 [pdf, other]
Title: Stealing Image-to-Image Translation Models With a Single Query
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[197]  arXiv:2406.00808 [pdf, other]
Title: EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing
Comments: Accepted at MICCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[198]  arXiv:2406.00798 [pdf, other]
Title: PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[199]  arXiv:2406.00791 [pdf, other]
Title: Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[200]  arXiv:2406.00783 [pdf, other]
Title: AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[201]  arXiv:2406.00777 [pdf, other]
Title: Diffusion Features to Bridge Domain Gap for Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[202]  arXiv:2406.00772 [pdf, other]
Title: Unsupervised Contrastive Analysis for Salient Pattern Detection using Conditional Diffusion Models
Comments: 18 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[203]  arXiv:2406.00750 [pdf, other]
Title: Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models
Comments: project can be found in: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[204]  arXiv:2406.00749 [pdf, other]
Title: CCF: Cross Correcting Framework for Pedestrian Trajectory Prediction
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[205]  arXiv:2406.00721 [pdf, other]
Title: Explore Internal and External Similarity for Single Image Deraining with Graph Neural Networks
Comments: IJCAI-24; Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[206]  arXiv:2406.00714 [pdf, other]
Title: A Survey of Deep Learning Based Radar and Vision Fusion for 3D Object Detection in Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[207]  arXiv:2406.00704 [pdf, other]
Title: An Optimized Toolbox for Advanced Image Processing with Tsetlin Machine Composites
Comments: 8 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[208]  arXiv:2406.00699 [pdf, other]
Title: Towards General Robustness Verification of MaxPool-based Convolutional Neural Networks via Tightening Linear Approximation
Comments: Accepted to CVPR2024. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[209]  arXiv:2406.00696 [pdf, ps, other]
Title: Bilinear-Convolutional Neural Network Using a Matrix Similarity-based Joint Loss Function for Skin Disease Classification
Comments: 16 pages, 11 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[210]  arXiv:2406.00687 [pdf, other]
Title: Lay-A-Scene: Personalized 3D Object Arrangement Using Text-to-Image Priors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[211]  arXiv:2406.00685 [pdf, other]
Title: Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[212]  arXiv:2406.00684 [pdf, other]
Title: Deciphering Oracle Bone Language with Diffusion Models
Comments: ACL2024 main conference long paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[213]  arXiv:2406.00676 [pdf, other]
Title: W-Net: A Facial Feature-Guided Face Super-Resolution Network
Comments: 15 pages,9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[214]  arXiv:2406.00672 [pdf, other]
Title: Task-oriented Embedding Counts: Heuristic Clustering-driven Feature Fine-tuning for Whole Slide Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[215]  arXiv:2406.00670 [pdf, other]
Title: Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
Comments: Accepted by ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[216]  arXiv:2406.00663 [pdf, other]
Title: SimSAM: Zero-shot Medical Image Segmentation via Simulated Interaction
Comments: Published at ISBI 2024. Awarded Top 12 Oral Presentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[ total of 679 entries: 1-104 | 9-112 | 113-216 | 217-320 | 321-424 | 425-528 | ... | 633-679 ]
[ showing 104 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help  (Access key information)