Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 32

[ total of 729 entries; showing entries 33-132 ]

Tue, 4 Jun 2024 (continued, showing 100 of 228 entries)

[33]  arXiv:2406.01349 [pdf, other]
Title: Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Comments: Project Page: this https URL, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[34]  arXiv:2406.01337 [pdf, other]
Title: ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds
Comments: CVPRW 2024 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[35]  arXiv:2406.01334 [pdf, other]
Title: HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models
Comments: accepted in CVPR2024, project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[36]  arXiv:2406.01326 [pdf, other]
Title: TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
Comments: 20 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[37]  arXiv:2406.01316 [pdf, other]
Title: Enhancing Inertial Hand based HAR through Joint Representation of Language, Pose and Synthetic IMUs
Comments: Review Copy
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[38]  arXiv:2406.01315 [pdf, other]
Title: Scale-Free Image Keypoints Using Differentiable Persistent Homology
Comments: Accepted to ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Algebraic Topology (math.AT)
[39]  arXiv:2406.01314 [pdf, other]
Title: Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[40]  arXiv:2406.01302 [pdf, ps, other]
Title: Pulmonary Embolism Mortality Prediction Using Multimodal Learning Based on Computed Tomography Angiography and Clinical Data
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41]  arXiv:2406.01300 [pdf, other]
Title: pOps: Photo-Inspired Diffusion Operators
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[42]  arXiv:2406.01294 [pdf, other]
Title: Capsule Enhanced Variational AutoEncoder for Underwater Image Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[43]  arXiv:2406.01278 [pdf, other]
Title: fruit-SALAD: A Style Aligned Artwork Dataset to reveal similarity perception in image embeddings
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computational Complexity (cs.CC); Machine Learning (cs.LG)
[44]  arXiv:2406.01264 [pdf, other]
Title: FreeTumor: Advance Tumor Segmentation via Large-Scale Tumor Synthesis
Comments: Preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[45]  arXiv:2406.01256 [pdf, other]
Title: Augmented Commonsense Knowledge for Remote Object Grounding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[46]  arXiv:2406.01210 [pdf, other]
Title: GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[47]  arXiv:2406.01203 [pdf, other]
Title: Scaling Up Deep Clustering Methods Beyond ImageNet-1K
Comments: Work in progress
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[48]  arXiv:2406.01196 [pdf, other]
Title: 3D WholeBody Pose Estimation based on Semantic Graph Attention Network and Distance Information
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[49]  arXiv:2406.01194 [pdf, other]
Title: AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50]  arXiv:2406.01188 [pdf, other]
Title: UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[51]  arXiv:2406.01170 [pdf, other]
Title: Zero-Shot Out-of-Distribution Detection with Outlier Label Exposure
Comments: Accepted by IJCNN2024, 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[52]  arXiv:2406.01159 [pdf, other]
Title: Dimba: Transformer-Mamba Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[53]  arXiv:2406.01154 [pdf, other]
Title: DeepUniUSTransformer: Towards A Universal UltraSound Model with Prompted Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54]  arXiv:2406.01136 [pdf, other]
Title: Towards Practical Single-shot Motion Synthesis
Comments: CVPR 2024, AI for 3D Generation Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[55]  arXiv:2406.01127 [pdf, other]
Title: Learning Adaptive Fusion Bank for Multi-modal Salient Object Detection
Comments: Accepted by TCSVT 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[56]  arXiv:2406.01125 [pdf, other]
Title: $Δ$-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers
Comments: 12 pages, 6 figures, 6 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[57]  arXiv:2406.01112 [pdf, other]
Title: BACON: Bayesian Optimal Condensation Framework for Dataset Distillation
Comments: 22 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[58]  arXiv:2406.01079 [pdf, other]
Title: Object Aware Egocentric Online Action Detection
Comments: CVPR First Joint Egocentric Vision Workshop 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[59]  arXiv:2406.01078 [pdf, other]
Title: CUT: A Controllable, Universal, and Training-Free Visual Anomaly Generation Framework
Comments: 9 pages excluding appendix
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[60]  arXiv:2406.01076 [pdf, other]
Title: Estimating Canopy Height at Scale
Comments: ICML Camera-Ready, 17 pages, 14 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[61]  arXiv:2406.01073 [pdf, other]
Title: Understanding the Cross-Domain Capabilities of Video-Based Few-Shot Action Recognition Models
Comments: Preprint. Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[62]  arXiv:2406.01071 [pdf, other]
Title: Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline
Comments: 10 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[63]  arXiv:2406.01069 [pdf, other]
Title: UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[64]  arXiv:2406.01063 [pdf, other]
Title: DANCE: Dual-View Distribution Alignment for Dataset Condensation
Comments: This work has been accepted by IJCAI-24
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[65]  arXiv:2406.01062 [pdf, other]
Title: SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[66]  arXiv:2406.01059 [pdf, other]
Title: VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model
Comments: 15 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[67]  arXiv:2406.01056 [pdf, other]
Title: Virtual avatar generation models as world navigators
Authors: Sai Mandava
Comments: 16 pages, 15 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Robotics (cs.RO)
[68]  arXiv:2406.01042 [pdf, other]
Title: Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting
Comments: GitHub Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[69]  arXiv:2406.01040 [pdf, other]
Title: Synthetic Data Generation for 3D Myocardium Deformation Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[70]  arXiv:2406.01033 [pdf, ps, other]
Title: Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement
Comments: 10 pages, 6 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[71]  arXiv:2406.01029 [pdf, other]
Title: CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship Modeling in Aerial Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72]  arXiv:2406.01028 [pdf, other]
Title: LLEMamba: Low-Light Enhancement via Relighting-Guided Mamba with Deep Unfolding Network
Comments: 9 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[73]  arXiv:2406.01025 [pdf, ps, other]
Title: Khayyam Offline Persian Handwriting Dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74]  arXiv:2406.01020 [pdf, other]
Title: CLIP-Guided Attribute Aware Pretraining for Generalizable Image Quality Assessment
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75]  arXiv:2406.01003 [pdf, other]
Title: Uni-ISP: Unifying the Learning of ISPs from Multiple Cameras
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[76]  arXiv:2406.00985 [pdf, other]
Title: MultiEdits: Simultaneous Multi-Aspect Editing with Text-to-Image Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77]  arXiv:2406.00977 [pdf, other]
Title: Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[78]  arXiv:2406.00971 [pdf, other]
Title: MiniGPT-Reverse-Designing: Predicting Image Adjustments Utilizing MiniGPT-4
Comments: 8 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[79]  arXiv:2406.00956 [pdf, other]
Title: Improving Segment Anything on the Fly: Auxiliary Online Learning and Adaptive Fusion for Medical Image Segmentation
Comments: Project Link: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[80]  arXiv:2406.00955 [pdf, other]
Title: How Video Meetings Change Your Expression
Comments: Project webpage is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[81]  arXiv:2406.00947 [pdf, other]
Title: Cross-Dimensional Medical Self-Supervised Representation Learning Based on a Pseudo-3D Transformation
Comments: MICCAI 2024 accept
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[82]  arXiv:2406.00934 [pdf, other]
Title: LanEvil: Benchmarking the Robustness of Lane Detection to Environmental Illusions
Comments: Submitted to ACM MM 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83]  arXiv:2406.00929 [pdf, other]
Title: Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry
Comments: 8 pages. 5 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[84]  arXiv:2406.00919 [pdf, other]
Title: Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-wise Pseudo Labeling
Comments: IJCV 2024 Accepted. arXiv admin note: substantial text overlap with arXiv:2303.02344
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[85]  arXiv:2406.00917 [pdf, other]
Title: Alignment-Free RGBT Salient Object Detection: Semantics-guided Asymmetric Correlation Network and A Unified Benchmark
Comments: Accepted by TMM 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[86]  arXiv:2406.00908 [pdf, other]
Title: ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[87]  arXiv:2406.00907 [pdf, other]
Title: DDA: Dimensionality Driven Augmentation Search for Contrastive Learning in Laparoscopic Surgery
Comments: 29 pages, 16 figures; MIDL 2024 - Medical Imaging with Deep Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[88]  arXiv:2406.00891 [pdf, other]
Title: Global High Categorical Resolution Land Cover Mapping via Weak Supervision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89]  arXiv:2406.00885 [pdf, other]
Title: Visual place recognition for aerial imagery: A survey
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[90]  arXiv:2406.00872 [pdf, other]
Title: OLIVE: Object Level In-Context Visual Embeddings
Comments: ACL 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[91]  arXiv:2406.00856 [pdf, other]
Title: DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection
Comments: 6 pages, 1 figure
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[92]  arXiv:2406.00848 [pdf, ps, other]
Title: Eating Smart: Advancing Health Informatics with the Grounding DINO based Dietary Assistant App
Comments: The work presented in this paper was part of the proceedings for the First International Conference on Artificial Intelligence (ICATA 2024)
Journal-ref: Eating Smart: Advancing Health Informatics with the Grounding DINO-based Dietary Assistant App, International Journal of Scientific and Innovative Studies, June 2024, Volume 3, Number 3, Pages 26-34, Available online at IJSRIS
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93]  arXiv:2406.00830 [pdf, other]
Title: Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection
Comments: Code Page: this https URL This paper has been submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) for possible publication
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[94]  arXiv:2406.00828 [pdf, other]
Title: Stealing Image-to-Image Translation Models With a Single Query
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95]  arXiv:2406.00808 [pdf, other]
Title: EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing
Comments: Accepted at MICCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[96]  arXiv:2406.00798 [pdf, other]
Title: PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[97]  arXiv:2406.00791 [pdf, other]
Title: Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[98]  arXiv:2406.00783 [pdf, other]
Title: AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[99]  arXiv:2406.00777 [pdf, other]
Title: Diffusion Features to Bridge Domain Gap for Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[100]  arXiv:2406.00772 [pdf, other]
Title: Unsupervised Contrastive Analysis for Salient Pattern Detection using Conditional Diffusion Models
Comments: 18 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[101]  arXiv:2406.00750 [pdf, other]
Title: Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models
Comments: project can be found in: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[102]  arXiv:2406.00749 [pdf, other]
Title: CCF: Cross Correcting Framework for Pedestrian Trajectory Prediction
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[103]  arXiv:2406.00721 [pdf, other]
Title: Explore Internal and External Similarity for Single Image Deraining with Graph Neural Networks
Comments: IJCAI-24; Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[104]  arXiv:2406.00714 [pdf, other]
Title: A Survey of Deep Learning Based Radar and Vision Fusion for 3D Object Detection in Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[105]  arXiv:2406.00704 [pdf, other]
Title: An Optimized Toolbox for Advanced Image Processing with Tsetlin Machine Composites
Comments: 8 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[106]  arXiv:2406.00699 [pdf, other]
Title: Towards General Robustness Verification of MaxPool-based Convolutional Neural Networks via Tightening Linear Approximation
Comments: Accepted to CVPR2024. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[107]  arXiv:2406.00696 [pdf, ps, other]
Title: Bilinear-Convolutional Neural Network Using a Matrix Similarity-based Joint Loss Function for Skin Disease Classification
Comments: 16 pages, 11 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[108]  arXiv:2406.00687 [pdf, other]
Title: Lay-A-Scene: Personalized 3D Object Arrangement Using Text-to-Image Priors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[109]  arXiv:2406.00685 [pdf, other]
Title: Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[110]  arXiv:2406.00684 [pdf, other]
Title: Deciphering Oracle Bone Language with Diffusion Models
Comments: ACL2024 main conference long paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[111]  arXiv:2406.00676 [pdf, other]
Title: W-Net: A Facial Feature-Guided Face Super-Resolution Network
Comments: 15 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[112]  arXiv:2406.00672 [pdf, other]
Title: Task-oriented Embedding Counts: Heuristic Clustering-driven Feature Fine-tuning for Whole Slide Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113]  arXiv:2406.00670 [pdf, other]
Title: Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
Comments: Accepted by ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[114]  arXiv:2406.00663 [pdf, other]
Title: SimSAM: Zero-shot Medical Image Segmentation via Simulated Interaction
Comments: Published at ISBI 2024. Awarded Top 12 Oral Presentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[115]  arXiv:2406.00644 [pdf, other]
Title: Ultrasound Report Generation with Cross-Modality Feature Alignment via Unsupervised Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[116]  arXiv:2406.00639 [pdf, other]
Title: An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition
Comments: 12 pages, 8 figures init commit
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[117]  arXiv:2406.00637 [pdf, other]
Title: Representing Animatable Avatar via Factorized Neural Fields
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[118]  arXiv:2406.00636 [pdf, other]
Title: T2LM: Long-Term 3D Human Motion Generation from Multiple Sentences
Comments: CVPR 2024 HuMoGen Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[119]  arXiv:2406.00632 [pdf, other]
Title: Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion Prior
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[120]  arXiv:2406.00631 [pdf, other]
Title: MGI: Multimodal Contrastive pre-training of Genomic and Medical Imaging
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121]  arXiv:2406.00629 [pdf, other]
Title: Correlation Matching Transformation Transformers for UHD Image Restoration
Comments: AAAI-24; Source codes, datasets, visual results, and pre-trained models are: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[122]  arXiv:2406.00625 [pdf, other]
Title: SAM-LAD: Segment Anything Model Meets Zero-Shot Logic Anomaly Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[123]  arXiv:2406.00622 [pdf, other]
Title: Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[124]  arXiv:2406.00609 [pdf, other]
Title: SuperGaussian: Repurposing Video Models for 3D Super Resolution
Comments: Check our project website for details: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[125]  arXiv:2406.00600 [pdf, other]
Title: Kolmogorov-Arnold Network for Satellite Image Classification in Remote Sensing
Authors: Minjong Cheon
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Data Analysis, Statistics and Probability (physics.data-an)
[126]  arXiv:2406.00598 [pdf, other]
Title: Efficient Neural Light Fields (ENeLF) for Mobile Devices
Authors: Austin Peng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2406.00589 [pdf, other]
Title: Robust Visual Tracking via Iterative Gradient Descent and Threshold Selection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[128]  arXiv:2406.00587 [pdf, other]
Title: Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024
Comments: Champion Solution for CVPR 2024 PVUW VSS Track. arXiv admin note: text overlap with arXiv:2306.02894
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[129]  arXiv:2406.00571 [pdf, other]
Title: An Image Segmentation Model with Transformed Total Variation
Comments: Accepted to EUSIPCO'24
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Numerical Analysis (math.NA)
[130]  arXiv:2406.00545 [pdf, ps, other]
Title: Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation
Comments: ICME 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[131]  arXiv:2406.00512 [pdf, ps, other]
Title: On the use of first and second derivative approximations for biometric online signature recognition
Comments: Advances in Computational Intelligence. IWANN 2023. pp 461 to 472
Journal-ref: Lecture Notes in Computer Science, vol 14134, 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[132]  arXiv:2406.00510 [pdf, other]
Title: Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection
Comments: CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)