We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 240

[ total of 456 entries: 1-104 | 33-136 | 137-240 | 241-344 | 345-448 | 449-456 ]
[ showing 104 entries per page: fewer | more | all ]

Tue, 7 May 2024 (continued, showing 104 of 159 entries)

[241]  arXiv:2405.03662 [pdf, other]
Title: Diffeomorphic Template Registration for Atmospheric Turbulence Mitigation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[242]  arXiv:2405.03660 [pdf, other]
Title: CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification
Comments: 18 Pages, 4 Figures and Accepted in ICDAR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[243]  arXiv:2405.03659 [pdf, other]
Title: A Construct-Optimize Approach to Sparse View Synthesis without Camera Pose
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[244]  arXiv:2405.03652 [pdf, ps, other]
Title: Field-of-View Extension for Diffusion MRI via Deep Generative Models
Comments: 20 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[245]  arXiv:2405.03650 [pdf, other]
Title: Generated Contents Enrichment
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[246]  arXiv:2405.03643 [pdf, other]
Title: Collecting Consistently High Quality Object Tracks with Minimal Human Involvement by Using Self-Supervised Learning to Detect Tracker Errors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[247]  arXiv:2405.03642 [pdf, other]
Title: Classification of Breast Cancer Histopathology Images using a Modified Supervised Contrastive Learning Method
Comments: 16 pages, 3 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[248]  arXiv:2405.03633 [pdf, other]
Title: Neural Graph Mapping for Dense SLAM with Efficient Loop Closure
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[249]  arXiv:2405.03613 [pdf, other]
Title: Dual Relation Mining Network for Zero-Shot Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[250]  arXiv:2405.03565 [pdf, other]
Title: Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification Reframing
Comments: Accepted to AAAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[251]  arXiv:2405.03546 [pdf, other]
Title: CCDM: Continuous Conditional Diffusion Models for Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[252]  arXiv:2405.03545 [pdf, other]
Title: Optimizing Hand Region Detection in MediaPipe Holistic Full-Body Pose Estimation to Improve Accuracy and Avoid Downstream Errors
Authors: Amit Moryossef
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[253]  arXiv:2405.03541 [pdf, other]
Title: RepVGG-GELAN: Enhanced GELAN with VGG-STYLE ConvNets for Brain Tumour Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[254]  arXiv:2405.03520 [pdf, other]
Title: Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Comments: This survey will be regularly updated at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[255]  arXiv:2405.03519 [pdf, other]
Title: Low-light Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[256]  arXiv:2405.03485 [pdf, other]
Title: LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model
Comments: 9 pages,7 figures, SIGGRAPH 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[257]  arXiv:2405.03462 [pdf, ps, other]
Title: A Lightweight Neural Architecture Search Model for Medical Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[258]  arXiv:2405.03458 [pdf, other]
Title: SSyncOA: Self-synchronizing Object-aligned Watermarking to Resist Cropping-paste Attacks
Comments: 7 pages, 5 figures (Have been accepted by ICME 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[259]  arXiv:2405.03436 [pdf, other]
Title: DBDH: A Dual-Branch Dual-Head Neural Network for Invisible Embedded Regions Localization
Comments: 7 pages, 6 figures (Have been accepted by IJCNN 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[260]  arXiv:2405.03420 [pdf, ps, other]
Title: Implantable Adaptive Cells: differentiable architecture search to improve the performance of any trained U-shaped network
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[261]  arXiv:2405.03417 [pdf, other]
Title: Gaussian Splatting: 3D Reconstruction and Novel View Synthesis, a Review
Comments: 24 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[262]  arXiv:2405.03388 [pdf, other]
Title: 3D LiDAR Mapping in Dynamic Environments Using a 4D Implicit Neural Representation
Comments: 10 pages, CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[263]  arXiv:2405.03381 [pdf, other]
Title: Statistical Edge Detection And UDF Learning For Shape Representation
Authors: Virgile Foy (IMT), Fabrice Gamboa (IMT), Reda Chhaibi (IMT)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Applications (stat.AP)
[264]  arXiv:2405.03373 [pdf, other]
Title: Knowledge-aware Text-Image Retrieval for Remote Sensing Images
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[265]  arXiv:2405.03352 [pdf, other]
Title: Salient Object Detection From Arbitrary Modalities
Comments: 15 Pages, 7 Figures, 8 Tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[266]  arXiv:2405.03351 [pdf, other]
Title: Modality Prompts for Arbitrary Modality Salient Object Detection
Comments: 13 pages, 7 Figures, 3 Tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[267]  arXiv:2405.03349 [pdf, other]
Title: Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[268]  arXiv:2405.03333 [pdf, other]
Title: Light-VQA+: A Video Quality Assessment Model for Exposure Correction with Vision-Language Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[269]  arXiv:2405.03328 [pdf, other]
Title: Enhancing Spatiotemporal Disease Progression Models via Latent Diffusion and Prior Knowledge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[270]  arXiv:2405.03318 [pdf, other]
Title: Enhancing DETRs Variants through Improved Content Query and Similar Query Aggregation
Comments: 11 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[271]  arXiv:2405.03314 [pdf, other]
Title: Deep Learning-based Point Cloud Registration for Augmented Reality-guided Surgery
Comments: 5 pages, 4 figures; accepted at IEEE ISBI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[272]  arXiv:2405.03311 [pdf, other]
Title: Federated Learning for Drowsiness Detection in Connected Vehicles
Comments: 14 pages, 8 figures, 1 table, EAI INTSYS 2023 conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[273]  arXiv:2405.03280 [pdf, other]
Title: Animate Your Thoughts: Decoupled Reconstruction of Dynamic Natural Vision from Slow Brain Activity
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[274]  arXiv:2405.03272 [pdf, other]
Title: WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[275]  arXiv:2405.03243 [pdf, other]
Title: Mind the Gap Between Synthetic and Real: Utilizing Transfer Learning to Probe the Boundaries of Stable Diffusion Generated Data
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[276]  arXiv:2405.03235 [pdf, ps, other]
Title: Cross-Modal Domain Adaptation in Brain Disease Diagnosis: Maximum Mean Discrepancy-based Convolutional Neural Networks
Authors: Xuran Zhu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[277]  arXiv:2405.03221 [pdf, other]
Title: Spatial and Surface Correspondence Field for Interaction Transfer
Comments: Accepted to SIGGRAPH 2024, project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[278]  arXiv:2405.03218 [pdf, other]
Title: Elevator, Escalator or Neither? Classifying Pedestrian Conveyor State Using Inertial Navigation System
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[279]  arXiv:2405.03202 [pdf, other]
Title: Hierarchical Space-Time Attention for Micro-Expression Recognition
Comments: 9 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[280]  arXiv:2405.03197 [pdf, other]
Title: StyleSeg V2: Towards Robust One-shot Segmentation of Brain Tissue via Optimization-free Registration Error Perception
Comments: 9 pages, 8 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[281]  arXiv:2405.03194 [pdf, other]
Title: CityLLaVA: Efficient Fine-Tuning for VLMs in City Scenario
Comments: Accepted by AICITY2024 Workshop Track2 at CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[282]  arXiv:2405.03193 [pdf, other]
Title: Exploring Frequencies via Feature Mixing and Meta-Learning for Improving Adversarial Transferability
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[283]  arXiv:2405.03190 [pdf, other]
Title: Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[284]  arXiv:2405.03177 [pdf, other]
Title: Transformer-based RGB-T Tracking with Channel and Spatial Feature Fusion
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[285]  arXiv:2405.03162 [pdf, other]
[286]  arXiv:2405.03159 [pdf, other]
Title: DeepMpMRI: Tensor-decomposition Regularized Learning for Fast and High-Fidelity Multi-Parametric Microstructural MR Imaging
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[287]  arXiv:2405.03150 [pdf, other]
Title: Video Diffusion Models: A Survey
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[288]  arXiv:2405.03144 [pdf, other]
Title: PTQ4SAM: Post-Training Quantization for Segment Anything
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[289]  arXiv:2405.03121 [pdf, other]
Title: AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding
Comments: 14 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[290]  arXiv:2405.03109 [pdf, other]
Title: Intra-task Mutual Attention based Vision Transformer for Few-Shot Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[291]  arXiv:2405.03104 [pdf, other]
Title: GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document Understanding
Comments: Accepted in ICDAR 2024 (Athens, Greece)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[292]  arXiv:2405.03099 [pdf, other]
Title: SketchGPT: Autoregressive Modeling for Sketch Generation and Recognition
Comments: Accepted in ICDAR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[293]  arXiv:2405.03091 [pdf, ps, other]
Title: Research on Image Recognition Technology Based on Multimodal Deep Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[294]  arXiv:2405.03055 [pdf, other]
Title: Multi-hop graph transformer network for 3D human pose estimation
Journal-ref: Journal of Visual Communication and Image Representation, 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[295]  arXiv:2405.03039 [pdf, ps, other]
Title: Performance Evaluation of Real-Time Object Detection for Electric Scooters
Comments: 10 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[296]  arXiv:2405.03025 [pdf, other]
Title: Matten: Video Generation with Mamba-Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[297]  arXiv:2405.03011 [pdf, ps, other]
Title: AC-MAMBASEG: An adaptive convolution and Mamba-based architecture for enhanced skin lesion segmentation
Comments: 15 pages, 7 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[298]  arXiv:2405.02982 [pdf, other]
Title: Paintings and Drawings Aesthetics Assessment with Rich Attributes for Various Artistic Categories
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[299]  arXiv:2405.02977 [pdf, other]
Title: SkelCap: Automated Generation of Descriptive Text from Skeleton Keypoint Sequences
Comments: 8 pages, 5 figures, 7 tables, submitted to IEEE conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[300]  arXiv:2405.02962 [pdf, other]
Title: VectorPainter: A Novel Approach to Stylized Vector Graphics Synthesis with Vectorized Strokes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[301]  arXiv:2405.02961 [pdf, other]
Title: JOSENet: A Joint Stream Embedding Network for Violence Detection in Surveillance Videos
Comments: Submitted to the International Journal of Computer Vision
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[302]  arXiv:2405.02958 [pdf, ps, other]
Title: Score-based Generative Priors Guided Model-driven Network for MRI Reconstruction
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[303]  arXiv:2405.02954 [pdf, other]
Title: Source-Free Domain Adaptation Guided by Vision and Vision-Language Pre-Training
Comments: Extension of ICCV paper arXiv:2212.07585, submitted to IJCV
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[304]  arXiv:2405.02951 [pdf, other]
Title: iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval
Comments: Extended version of the ICCV2023 paper arXiv:2303.15247
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[305]  arXiv:2405.02945 [pdf, other]
Title: Invertible Residual Rescaling Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[306]  arXiv:2405.02944 [pdf, other]
Title: Imaging Signal Recovery Using Neural Network Priors Under Uncertain Forward Model Parameters
Comments: Accepted by PBDL-CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[307]  arXiv:2405.02941 [pdf, other]
Title: Boundary-aware Decoupled Flow Networks for Realistic Extreme Rescaling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[308]  arXiv:2405.02929 [pdf, other]
Title: Unified Dynamic Scanpath Predictors Outperform Individually Trained Neural Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[309]  arXiv:2405.02918 [pdf, other]
Title: MERIT: Multi-view Evidential learning for Reliable and Interpretable liver fibrosis sTaging
Comments: Submitted to Medical Image Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[310]  arXiv:2405.02917 [pdf, other]
Title: Overconfidence is Key: Verbalized Uncertainty Evaluation in Large Language and Vision-Language Models
Comments: 8 pages, with appendix. To appear in TrustNLP workshop @ NAACL 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[311]  arXiv:2405.02913 [pdf, ps, other]
Title: Fast TILs estimation in lung cancer WSIs based on semi-stochastic patch sampling
Comments: 18 pages, 7 figures, 6 appendix pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[312]  arXiv:2405.02911 [pdf, other]
Title: Multimodal Sense-Informed Prediction of 3D Human Motions
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[313]  arXiv:2405.02906 [pdf, other]
Title: SalFAU-Net: Saliency Fusion Attention U-Net for Salient Object Detection
Comments: 9 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[314]  arXiv:2405.02882 [pdf, other]
Title: A drone detector with modified backbone and multiple pyramid featuremaps enhancement structure (MDDPE)
Authors: Chenhao Wu
Comments: 20 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[315]  arXiv:2405.02880 [pdf, other]
Title: Blending Distributed NeRFs with Tri-stage Robust Pose Optimization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[316]  arXiv:2405.02859 [pdf, other]
Title: MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior
Comments: 14 pages, 10 figures, conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[317]  arXiv:2405.02844 [pdf, other]
Title: SMCD: High Realism Motion Style Transfer via Mamba-based Diffusion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[318]  arXiv:2405.02843 [pdf, other]
Title: Residual-Conditioned Optimal Transport: Towards Structure-preserving Unpaired and Paired Image Restoration
Comments: ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319]  arXiv:2405.02834 [pdf, other]
Title: Scene-Adaptive Person Search via Bilateral Modulations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[320]  arXiv:2405.02832 [pdf, other]
Title: Fast One-Stage Unsupervised Domain Adaptive Person Search
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[321]  arXiv:2405.02830 [pdf, other]
Title: You Only Need Half: Boosting Data Augmentation by Using Partial Content
Authors: Juntao Hu, Yuan Wu
Comments: Technical report,16 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[322]  arXiv:2405.02824 [pdf, other]
Title: Adaptive Guidance Learning for Camouflaged Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[323]  arXiv:2405.02815 [pdf, other]
Title: Region-specific Risk Quantification for Interpretable Prognosis of COVID-19
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[324]  arXiv:2405.02811 [pdf, other]
Title: PVTransformer: Point-to-Voxel Transformer for Scalable 3D Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[325]  arXiv:2405.02797 [pdf, other]
Title: Adapting to Distribution Shift by Visual Domain Prompt Generation
Comments: ICLR2024, code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[326]  arXiv:2405.02793 [pdf, other]
Title: ImageInWords: Unlocking Hyper-Detailed Image Descriptions
Comments: Webpage (this https URL), GitHub (this https URL), HuggingFace (this https URL)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[327]  arXiv:2405.02792 [pdf, ps, other]
Title: Jointly Learning Spatial, Angular, and Temporal Information for Enhanced Lane Detection
Comments: 5 pages, 3 Figures , Accepted IEEE Conference on Signal Processing and Communications Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[328]  arXiv:2405.02791 [pdf, other]
Title: Efficient Text-driven Motion Generation via Latent Consistency Training
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[329]  arXiv:2405.02787 [pdf, ps, other]
Title: Light Field Spatial Resolution Enhancement Framework
Comments: 5 pages, 6 figures, accepted in IEEE Conference on Signal Processing and Communications Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[330]  arXiv:2405.02785 [pdf, other]
Title: Fused attention mechanism-based ore sorting network
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[331]  arXiv:2405.02782 [pdf, ps, other]
Title: A self-supervised text-vision framework for automated brain abnormality detection
Comments: Under Review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[332]  arXiv:2405.02781 [pdf, other]
Title: Instantaneous Perception of Moving Objects in 3D
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[333]  arXiv:2405.02771 [pdf, other]
Title: MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning
Comments: Data and code is available on the project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[334]  arXiv:2405.02762 [pdf, other]
Title: TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes
Comments: 8 pages, submitted to IROS2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[335]  arXiv:2405.02751 [pdf, other]
Title: Deep Image Restoration For Image Anti-Forensics
Authors: Eren Tahir, Mert Bal
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[336]  arXiv:2405.02730 [pdf, other]
Title: U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers
Comments: 11 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[337]  arXiv:2405.02717 [pdf, other]
Title: AFter: Attention-based Fusion Router for RGBT Tracking
Comments: Peer review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[338]  arXiv:2405.02692 [pdf, ps, other]
Title: Diffeomorphic Transformer-based Abdomen MRI-CT Deformable Image Registration
Comments: 18 pages and 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[339]  arXiv:2405.02686 [pdf, other]
Title: Boosting 3D Neuron Segmentation with 2D Vision Transformer Pre-trained on Natural Images
Comments: 3 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[340]  arXiv:2405.02676 [pdf, other]
Title: Hand-Object Interaction Controller (HOIC): Deep Reinforcement Learning for Reconstructing Interactions with Physics
Comments: SIGGRAPH 2024 Conference Track
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[341]  arXiv:2405.02652 [pdf, other]
Title: Deep Pulse-Signal Magnification for remote Heart Rate Estimation in Compressed Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[342]  arXiv:2405.02608 [pdf, other]
Title: UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model
Comments: Accepted by CVPR 2024. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[343]  arXiv:2405.02595 [pdf, other]
Title: Vision-based 3D occupancy prediction in autonomous driving: a review and outlook
Comments: 20 pages, 20 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[344]  arXiv:2405.02591 [pdf, other]
Title: Better YOLO with Attention-Augmented Network and Enhanced Generalization Performance for Safety Helmet Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 456 entries: 1-104 | 33-136 | 137-240 | 241-344 | 345-448 | 449-456 ]
[ showing 104 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)