We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 81

[ total of 604 entries: 1-25 | 7-31 | 32-56 | 57-81 | 82-106 | 107-131 | 132-156 | 157-181 | ... | 582-604 ]
[ showing 25 entries per page: fewer | more | all ]

Fri, 19 Apr 2024 (continued, showing 25 of 109 entries)

[82]  arXiv:2404.11732 [pdf, other]
Title: Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach
Comments: Accepted at CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83]  arXiv:2404.11727 [pdf, ps, other]
Title: Deep Learning for Video-Based Assessment of Endotracheal Intubation Skills
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[84]  arXiv:2404.11669 [pdf, other]
Title: Factorized Motion Fields for Fast Sparse Input Dynamic View Synthesis
Comments: Accepted at SIGGRAPH 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[85]  arXiv:2404.11630 [pdf, other]
Title: SNP: Structured Neuron-level Pruning to Preserve Attention Scores
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[86]  arXiv:2404.12387 (cross-list from cs.CL) [pdf, other]
Title: Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[87]  arXiv:2404.12341 (cross-list from cs.LG) [pdf, other]
Title: Measuring Feature Dependency of Neural Networks by Collapsing Feature Dimensions in the Data Manifold
Comments: Accepted and will be pulished in International Symposium on Biomedical Imaging (ISBI) 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[88]  arXiv:2404.12339 (cross-list from cs.RO) [pdf, other]
Title: SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints
Comments: Accepted to ICRA 2024, project website: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[89]  arXiv:2404.12251 (cross-list from cs.LG) [pdf, other]
Title: Dynamic Modality and View Selection for Multimodal Emotion Recognition with Missing Modalities
Comments: 15 pages
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[90]  arXiv:2404.12163 (cross-list from eess.IV) [pdf, other]
Title: Unsupervised Microscopy Video Denoising
Comments: Accepted at CVPRW 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[91]  arXiv:2404.12130 (cross-list from cs.LG) [pdf, other]
Title: One-Shot Sequential Federated Learning for Non-IID Data by Enhancing Local Model Diversity
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[92]  arXiv:2404.12062 (cross-list from cs.SD) [pdf, other]
Title: MIDGET: Music Conditioned 3D Dance Generation
Comments: 12 pages, 6 figures Published in AI 2023: Advances in Artificial Intelligence
Journal-ref: In Australasian Joint Conference on Artificial Intelligence (pp. 277-288). Singapore: Springer Nature Singapore 2023
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Audio and Speech Processing (eess.AS)
[93]  arXiv:2404.11974 (cross-list from eess.IV) [pdf, other]
Title: Device (In)Dependence of Deep Learning-based Image Age Approximation
Comments: This work was accepted and presented in: 2022 ICPR-Workshop on Artificial Intelligence for Multimedia Forensics and Disinformation Detection. Montreal, Quebec, Canada. However, due to a technical issue on the publishing companies' side, the work does not appear in the workshop proceedings
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[94]  arXiv:2404.11962 (cross-list from cs.AI) [pdf, other]
Title: ©Plug-in Authorization for Human Content Copyright Protection in Text-to-Image Model
Comments: 20 pages, 6 figures
Subjects: Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[95]  arXiv:2404.11947 (cross-list from cs.LG) [pdf, other]
Title: VCC-INFUSE: Towards Accurate and Efficient Selection of Unlabeled Examples in Semi-supervised Learning
Comments: Accepted paper of IJCAI 2024. Shijie Fang and Qianhan Feng contributed equally to this paper
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[96]  arXiv:2404.11946 (cross-list from cs.RO) [pdf, other]
Title: S4TP: Social-Suitable and Safety-Sensitive Trajectory Planning for Autonomous Vehicles
Comments: 12 pages,4 figures, published to IEEE Transactions on Intelligent Vehicles
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[97]  arXiv:2404.11936 (cross-list from cs.LG) [pdf, other]
Title: LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights
Comments: 8 pages, accepted to CVPR24 First Workshop on Efficient and On-Device Generation (EDGE)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[98]  arXiv:2404.11929 (cross-list from eess.IV) [pdf, other]
Title: A Symmetric Regressor for MRI-Based Assessment of Striatal Dopamine Transporter Uptake in Parkinson's Disease
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[99]  arXiv:2404.11925 (cross-list from cs.LG) [pdf, other]
Title: EdgeFusion: On-Device Text-to-Image Generation
Comments: 4 pages, accepted to CVPR24 First Workshop on Efficient and On-Device Generation (EDGE)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[100]  arXiv:2404.11889 (cross-list from eess.IV) [pdf, other]
Title: Multi-view X-ray Image Synthesis with Multiple Domain Disentanglement from CT Scans
Comments: 13 pages, 10 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[101]  arXiv:2404.11843 (cross-list from eess.IV) [pdf, other]
Title: Computer-Aided Diagnosis of Thoracic Diseases in Chest X-rays using hybrid CNN-Transformer Architecture
Authors: Sonit Singh
Comments: 24 pages, 13 Figures, 13 Tables. arXiv admin note: text overlap with arXiv:1904.09925 by other authors
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[102]  arXiv:2404.11795 (cross-list from cs.LG) [pdf, other]
Title: Prompt-Driven Feature Diffusion for Open-World Semi-Supervised Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[103]  arXiv:2404.11776 (cross-list from cs.LG) [pdf, ps, other]
Title: 3D object quality prediction for Metal Jet Printer with Multimodal thermal encoder
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[104]  arXiv:2404.11769 (cross-list from cs.LG) [pdf, other]
Title: QGen: On the Ability to Generalize in Quantization Aware Training
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[105]  arXiv:2404.11741 (cross-list from physics.med-ph) [pdf, other]
Title: Diffusion Schrödinger Bridge Models for High-Quality MR-to-CT Synthesis for Head and Neck Proton Treatment Planning
Comments: International Conference on the use of Computers in Radiation therapy (ICCR)
Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
[106]  arXiv:2404.11735 (cross-list from cs.LG) [pdf, other]
Title: Learning with 3D rotations, a hitchhiker's guide to SO(3)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[ total of 604 entries: 1-25 | 7-31 | 32-56 | 57-81 | 82-106 | 107-131 | 132-156 | 157-181 | ... | 582-604 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2404, contact, help  (Access key information)