Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 352

[ total of 456 entries: 1-104 | 41-144 | 145-248 | 249-352 | 353-456 ]
[ showing 104 entries per page: fewer | more | all ]

Tue, 7 May 2024 (continued, showing last 42 of 159 entries)

[353]  arXiv:2405.02512 [pdf, other]
Title: Spatio-Temporal SwinMAE: A Swin Transformer based Multiscale Representation Learner for Temporal Satellite Imagery
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[354]  arXiv:2405.02509 [pdf, other]
Title: Implicit Neural Representations for Robust Joint Sparse-View CT Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[355]  arXiv:2405.02508 [pdf, other]
Title: Rasterized Edge Gradients: Handling Discontinuities Differentiably
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[356]  arXiv:2405.02386 [pdf, other]
Title: Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids
Comments: SIGGRAPH 2024, Project page: this https URL, Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[357]  arXiv:2405.02363 [pdf, other]
Title: LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[358]  arXiv:2405.02334 [pdf, other]
Title: Rad4XCNN: a new agnostic method for post-hoc global explanation of CNN-derived features by means of radiomics
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[359]  arXiv:2405.02332 [pdf, other]
Title: Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models
Journal-ref: Generative Models for Computer Vision - CVPR 2024 Workshop, Jun 2024, Seattle, United States
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[360]  arXiv:2405.02317 [pdf, other]
Title: Long-term Human Participation Assessment In Collaborative Learning Environments Using Dynamic Scene Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[361]  arXiv:2405.02312 [pdf, ps, other]
Title: YOLOv5 vs. YOLOv8 in Marine Fisheries: Balancing Class Detection and Instance Count
Comments: 12 pages, 25 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[362]  arXiv:2405.02305 [pdf, ps, other]
Title: Inserting Faces inside Captions: Image Captioning with Attention Guided Merging
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Image and Video Processing (eess.IV)
[363]  arXiv:2405.02301 [pdf, other]
Title: TFCounter:Polishing Gems for Training-Free Object Counting
Comments: 14 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[364]  arXiv:2405.02297 [pdf, other]
Title: Employing Universal Voting Schemes for Improved Visual Place Recognition Performance
Comments: arXiv admin note: substantial text overlap with arXiv:2305.05705
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[365]  arXiv:2405.02296 [pdf, other]
Title: Möbius Transform for Mitigating Perspective Distortions in Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[366]  arXiv:2405.02295 [pdf, other]
Title: Neural Additive Image Model: Interpretation through Interpolation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[367]  arXiv:2405.02288 [pdf, other]
Title: Prospective Role of Foundation Models in Advancing Autonomous Vehicles
Comments: 36 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[368]  arXiv:2405.03649 (cross-list from cs.LG) [pdf, other]
Title: Learning Robust Classifiers with Self-Guided Spurious Correlation Mitigation
Comments: Accepted to IJCAI 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[369]  arXiv:2405.03501 (cross-list from cs.LG) [pdf, other]
Title: Boosting Single Positive Multi-label Classification with Generalized Robust Loss
Comments: 14 pages, 5 figures, 6 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[370]  arXiv:2405.03500 (cross-list from cs.MM) [pdf, other]
Title: A Rate-Distortion-Classification Approach for Lossy Image Compression
Authors: Yuefeng Zhang
Comments: 15 pages
Journal-ref: Digital Signal Processing Volume 141, September 2023, 104163
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[371]  arXiv:2405.03486 (cross-list from cs.CR) [pdf, other]
Title: UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Social and Information Networks (cs.SI)
[372]  arXiv:2405.03408 (cross-list from astro-ph.IM) [pdf, other]
Title: An Image Quality Evaluation and Masking Algorithm Based On Pre-trained Deep Neural Networks
Comments: Accepted by the AJ. The code could be downloaded from: this https URL with DOI of: 10.12149/101415
Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Solar and Stellar Astrophysics (astro-ph.SR); Computer Vision and Pattern Recognition (cs.CV)
[373]  arXiv:2405.03376 (cross-list from cs.LG) [pdf, other]
Title: CRA5: Extreme Compression of ERA5 for Portable Global Climate and Weather Research via an Efficient Variational Transformer
Comments: Main text and supplementary, 22 pages, 13 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[374]  arXiv:2405.03355 (cross-list from cs.LG) [pdf, other]
Title: On the Theory of Cross-Modality Distillation with Contrastive Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[375]  arXiv:2405.03301 (cross-list from cs.LG) [pdf, other]
Title: Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification
Comments: International Joint Conference on Artificial Intelligence 2024 (to be published)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[376]  arXiv:2405.03164 (cross-list from cs.RO) [pdf, other]
Title: The Role of Predictive Uncertainty and Diversity in Embodied AI and Robot Learning
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[377]  arXiv:2405.03141 (cross-list from eess.IV) [pdf, other]
Title: Automatic Ultrasound Curve Angle Measurement via Affinity Clustering for Adolescent Idiopathic Scoliosis Evaluation
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[378]  arXiv:2405.03103 (cross-list from cs.LG) [pdf, other]
Title: Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs
Comments: Accepted to ICML 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[379]  arXiv:2405.03008 (cross-list from eess.IV) [pdf, other]
Title: DVMSR: Distillated Vision Mamba for Efficient Super-Resolution
Comments: 8 pages, 8 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[380]  arXiv:2405.02984 (cross-list from cs.CL) [pdf, other]
Title: E-TSL: A Continuous Educational Turkish Sign Language Dataset with Baseline Methods
Comments: 7 pages, 3 figures, 4 tables, submitted to IEEE conference
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[381]  arXiv:2405.02942 (cross-list from physics.optics) [pdf, other]
Title: Design, analysis, and manufacturing of a glass-plastic hybrid minimalist aspheric panoramic annular lens
Comments: Accepted to Optics & Laser Technology
Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[382]  arXiv:2405.02857 (cross-list from eess.IV) [pdf, other]
Title: I$^3$Net: Inter-Intra-slice Interpolation Network for Medical Slice Synthesis
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[383]  arXiv:2405.02852 (cross-list from eess.IV) [pdf, other]
Title: On Enhancing Brain Tumor Segmentation Across Diverse Populations with Convolutional Neural Networks
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[384]  arXiv:2405.02807 (cross-list from cs.LG) [pdf, ps, other]
Title: Kinematic analysis of structural mechanics based on convolutional neural network
Comments: 9 pages, 13 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[385]  arXiv:2405.02784 (cross-list from eess.IV) [pdf, other]
Title: MR-Transformer: Vision Transformer for Total Knee Replacement Prediction Using Magnetic Resonance Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[386]  arXiv:2405.02766 (cross-list from cs.LG) [pdf, other]
Title: Beyond Unimodal Learning: The Importance of Integrating Multiple Modalities for Lifelong Learning
Comments: Accepted at 3rd Conference on Lifelong Learning Agents (CoLLAs), 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[387]  arXiv:2405.02700 (cross-list from cs.LG) [pdf, other]
Title: Towards a Scalable Identification of Novel Modes in Generative Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[388]  arXiv:2405.02698 (cross-list from cs.LG) [pdf, ps, other]
Title: Stable Diffusion Dataset Generation for Downstream Classification Tasks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[389]  arXiv:2405.02678 (cross-list from cs.LG) [pdf, other]
Title: Position Paper: Quo Vadis, Unsupervised Time Series Anomaly Detection?
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[390]  arXiv:2405.02648 (cross-list from cs.LG) [pdf, other]
Title: A Conformal Prediction Score that is Robust to Label Noise
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[391]  arXiv:2405.02504 (cross-list from eess.IV) [pdf, other]
Title: Functional Imaging Constrained Diffusion for Brain PET Synthesis from Structural MRI
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[392]  arXiv:2405.02497 (cross-list from math.OC) [pdf, other]
Title: Prediction techniques for dynamic imaging with online primal-dual methods
Subjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV)
[393]  arXiv:2405.02383 (cross-list from stat.ML) [pdf, other]
Title: A Fresh Look at Sanity Checks for Saliency Maps
Comments: arXiv admin note: text overlap with arXiv:2401.06465
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[394]  arXiv:2405.02367 (cross-list from cs.LG) [pdf, other]
Title: Enhancing Social Media Post Popularity Prediction with Visual Content
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)

Mon, 6 May 2024

[395]  arXiv:2405.02280 [pdf, other]
Title: DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[396]  arXiv:2405.02266 [pdf, other]
Title: On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning?
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[397]  arXiv:2405.02246 [pdf, other]
Title: What matters when building vision-language models?
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[398]  arXiv:2405.02220 [pdf, other]
Title: Designed Dithering Sign Activation for Binary Neural Networks
Comments: 7 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[399]  arXiv:2405.02218 [pdf, other]
Title: Multispectral Fine-Grained Classification of Blackgrass in Wheat and Barley Crops
Comments: 19 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[400]  arXiv:2405.02191 [pdf, ps, other]
Title: Non-Destructive Peat Analysis using Hyperspectral Imaging and Machine Learning
Comments: 4 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[401]  arXiv:2405.02171 [pdf, other]
Title: Self-Supervised Learning for Real-World Super-Resolution from Dual and Multiple Zoomed Observations
Comments: Accepted by IEEE TPAMI in 2024. Extended version of ECCV 2022 paper "Self-Supervised Learning for Real-World Super-Resolution from Dual Zoomed Observations" (arXiv:2203.01325)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402]  arXiv:2405.02162 [pdf, other]
Title: Mapping the Unseen: Unified Promptable Panoptic Mapping with Dynamic Labeling using Foundation Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[403]  arXiv:2405.02155 [pdf, other]
Title: Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification
Authors: Siqi Yin, Lifan Jiang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[404]  arXiv:2405.02114 [pdf, other]
Title: Probablistic Restoration with Adaptive Noise Sampling for 3D Human Pose Estimation
Comments: ICME 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[405]  arXiv:2405.02077 [pdf, other]
Title: MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[406]  arXiv:2405.02068 [pdf, other]
Title: Advancing Pre-trained Teacher: Towards Robust Feature Discrepancy for Anomaly Detection
Comments: The paper is under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[407]  arXiv:2405.02066 [pdf, other]
Title: WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[408]  arXiv:2405.02061 [pdf, other]
Title: Towards general deep-learning-based tree instance segmentation models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[409]  arXiv:2405.02023 [pdf, other]
Title: IFNet: Deep Imaging and Focusing for Handheld SAR with Millimeter-wave Signals
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[410]  arXiv:2405.02008 [pdf, other]
Title: DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[411]  arXiv:2405.02005 [pdf, other]
Title: HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2
Comments: 8 pages, 9 figures, 2 tables. Will be published in the ISPRS The International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[412]  arXiv:2405.02004 [pdf, other]
Title: M${^2}$Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[413]  arXiv:2405.01992 [pdf, other]
Title: SFFNet: A Wavelet-Based Spatial and Frequency Domain Fusion Network for Remote Sensing Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[414]  arXiv:2405.01937 [pdf, other]
Title: An Attention Based Pipeline for Identifying Pre-Cancer Lesions in Head and Neck Clinical Images
Comments: 5 pages, 3 figures, accepted in ISBI 2024, update: corrected typos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[415]  arXiv:2405.01934 [pdf, other]
Title: Impact of Architectural Modifications on Deep Learning Adversarial Robustness
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[416]  arXiv:2405.01926 [pdf, other]
Title: Auto-Encoding Morph-Tokens for Multimodal LLM
Comments: Accepted by ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[417]  arXiv:2405.01920 [pdf, ps, other]
Title: Lightweight Change Detection in Heterogeneous Remote Sensing Images with Online All-Integer Pruning Training
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[418]  arXiv:2405.01885 [pdf, other]
Title: Enhancing Micro Gesture Recognition for Emotion Understanding via Context-aware Visual-Text Contrastive Learning
Comments: accepted by IEEE Signal Processing Letters
Journal-ref: IEEE Signal Processing Letters (2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[419]  arXiv:2405.01872 [pdf, other]
Title: Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[420]  arXiv:2405.01828 [pdf, other]
Title: FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[421]  arXiv:2405.01825 [pdf, other]
Title: Improving Concept Alignment in Vision-Language Concept Bottleneck Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[422]  arXiv:2405.01734 [pdf, other]
Title: Diabetic Retinopathy Detection Using Quantum Transfer Learning
Comments: 14 pages, 12 figures and 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[423]  arXiv:2405.01723 [pdf, other]
Title: Zero-Shot Monocular Motion Segmentation in the Wild by Combining Deep Learning with Geometric Motion Model Fusion
Comments: Accepted by the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[424]  arXiv:2405.01705 [pdf, other]
Title: Long Tail Image Generation Through Feature Space Augmentation and Iterated Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[425]  arXiv:2405.01701 [pdf, ps, other]
Title: Active Learning Enabled Low-cost Cell Image Segmentation Using Bounding Box Annotation
Authors: Yu Zhu, Qiang Yang, Li Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[426]  arXiv:2405.01699 [pdf, other]
Title: SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients
Comments: 7 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[427]  arXiv:2405.01691 [pdf, other]
Title: Language-Enhanced Latent Representations for Out-of-Distribution Detection in Autonomous Driving
Comments: Presented at the Robot Trust for Symbiotic Societies (RTSS) Workshop, co-located with ICRA 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[428]  arXiv:2405.01688 [pdf, other]
Title: Adapting Self-Supervised Learning for Computational Pathology
Comments: Presented at DCA in MI Workshop, CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[429]  arXiv:2405.01662 [pdf, other]
Title: Out-of-distribution detection based on subspace projection of high-dimensional features output by the last convolutional layer
Authors: Qiuyu Zhu, Yiwei He
Comments: 10 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430]  arXiv:2405.01656 [pdf, other]
Title: S4: Self-Supervised Sensing Across the Spectrum
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[431]  arXiv:2405.01654 [pdf, other]
Title: Key Patches Are All You Need: A Multiple Instance Learning Framework For Robust Medical Diagnosis
Comments: Accepted in DEF-AI-MIA Workshop@CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[432]  arXiv:2405.01646 [pdf, other]
Title: Explaining models relating objects and privacy
Comments: 7 pages, 3 figures, 1 table, supplementary material included as Appendix. Paper accepted at the 3rd XAI4CV Workshop at CVPR 2024. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[433]  arXiv:2405.01636 [pdf, other]
Title: Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey
Comments: 35 pages, 9 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[434]  arXiv:2405.01558 [pdf, other]
Title: Configurable Learned Holography
Comments: 14 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Optics (physics.optics)
[435]  arXiv:2405.02287 (cross-list from cs.CL) [pdf, other]
Title: Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[436]  arXiv:2405.02208 (cross-list from eess.IV) [pdf, other]
Title: Reference-Free Image Quality Metric for Degradation and Reconstruction Artifacts
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[437]  arXiv:2405.02179 (cross-list from cs.SD) [pdf, other]
Title: Training-Free Deepfake Voice Recognition by Leveraging Large-Scale Pre-Trained Models
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[438]  arXiv:2405.02109 (cross-list from eess.IV) [pdf, ps, other]
Title: Three-Dimensional Amyloid-Beta PET Synthesis from Structural MRI with Conditional Generative Adversarial Networks
Comments: Abstract Submitted and Presented at the 2024 International Society of Magnetic Resonance in Medicine. Singapore, Singapore, May 4-9. Abstract Number 2239
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[439]  arXiv:2405.01995 (cross-list from cs.LG) [pdf, other]
Title: Cooperation and Federation in Distributed Radar Point Cloud Processing
Journal-ref: 2023 IEEE 34th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[440]  arXiv:2405.01971 (cross-list from cs.RO) [pdf, other]
Title: A Sonar-based AUV Positioning System for Underwater Environments with Low Infrastructure Density
Comments: Accepted to the IEEE ICRA Workshop on Field Robotics 2024
Journal-ref: IEEE ICRA Workshop on Field Robotics 2024
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[441]  arXiv:2405.01963 (cross-list from cs.CR) [pdf, other]
Title: From Attack to Defense: Insights into Deep Learning Security Measures in Black-Box Settings
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[442]  arXiv:2405.01857 (cross-list from cs.NE) [pdf, other]
Title: TinySeg: Model Optimizing Framework for Image Segmentation on Tiny Embedded Systems
Comments: LCTES 2024
Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
[443]  arXiv:2405.01822 (cross-list from eess.IV) [pdf, other]
Title: Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[444]  arXiv:2405.01820 (cross-list from cs.CY) [pdf, ps, other]
Title: Real Risks of Fake Data: Synthetic Data, Diversity-Washing and Consent Circumvention
Journal-ref: FAccT '24, June 03--06, 2024, Rio de Janeiro, Brazil
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[445]  arXiv:2405.01776 (cross-list from cs.RO) [pdf, other]
Title: An Approach to Systematic Data Acquisition and Data-Driven Simulation for the Safety Testing of Automated Driving Functions
Comments: 8 pages, 5 figures
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[446]  arXiv:2405.01750 (cross-list from eess.IV) [pdf, other]
Title: PointCompress3D -- A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[447]  arXiv:2405.01726 (cross-list from eess.IV) [pdf, ps, other]
Title: SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[448]  arXiv:2405.01725 (cross-list from eess.IV) [pdf, other]
Title: Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[449]  arXiv:2405.01673 (cross-list from cs.RO) [pdf, other]
Title: ShadowNav: Autonomous Global Localization for Lunar Navigation in Darkness
Comments: 21 pages, 13 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[450]  arXiv:2405.01661 (cross-list from cs.LG) [pdf, other]
Title: When a Relation Tells More Than a Concept: Exploring and Evaluating Classifier Decisions with CoReX
Comments: preliminary version, submitted to Machine Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[451]  arXiv:2405.01658 (cross-list from eess.IV) [pdf, other]
Title: MMIST-ccRCC: A Real World Medical Dataset for the Development of Multi-Modal Systems
Comments: Accepted in DCA in MI Workshop@CVPR2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[452]  arXiv:2405.01644 (cross-list from eess.IV) [pdf, ps, other]
Title: A Classification-Based Adaptive Segmentation Pipeline: Feasibility Study Using Polycystic Liver Disease and Metastases from Colorectal Cancer CT Images
Comments: J Digit Imaging. Inform. med. (2024)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[453]  arXiv:2405.01607 (cross-list from cs.LG) [pdf, other]
Title: Wildfire Risk Prediction: A Review
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[454]  arXiv:2405.01600 (cross-list from eess.IV) [pdf, other]
Title: Deep Learning Descriptor Hybridization with Feature Reduction for Accurate Cervical Cancer Colposcopy Image Classification
Comments: 7 Pages double column, 5 figures, and 5 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[455]  arXiv:2405.01587 (cross-list from cs.CL) [pdf, ps, other]
Title: Improve Academic Query Resolution through BERT-based Question Extraction from Images
Journal-ref: 2024 IEEE International Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI) volume 2 (2024) 1-4
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[456]  arXiv:2405.01583 (cross-list from cs.CL) [pdf, other]
Title: MediFact at MEDIQA-M3G 2024: Medical Question Answering in Dermatology with Multimodal Learning
Authors: Nadia Saeed
Comments: 7 pages, 3 figures, Clinical NLP 2024 workshop proceedings in Shared Task
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)