Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 123

[ total of 429 entries; showing entries 124-148, 25 per page ]

Tue, 21 May 2024 (continued, showing 25 of 142 entries)

[124]  arXiv:2405.11837 [pdf, other]
Title: Improving the Explain-Any-Concept by Introducing Nonlinearity to the Trainable Surrogate Model
Comments: This paper is accepted for publication at IEEE SIU conference, 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[125]  arXiv:2405.11823 [pdf, other]
Title: Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video Reconstruction
Comments: International Conference on Computational Photography (ICCP 2024), 11 pages and 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126]  arXiv:2405.11822 [pdf, other]
Title: FeTT: Continual Class Incremental Learning via Feature Transformation Tuning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2405.11814 [pdf, other]
Title: Climatic & Anthropogenic Hazards to the Nasca World Heritage: Application of Remote Sensing, AI, and Flood Modelling
Comments: accepted at IGARSS 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[128]  arXiv:2405.11809 [pdf, other]
Title: Distill-then-prune: An Efficient Compression Framework for Real-time Stereo Matching Network on Edge Devices
Comments: International Conference on Robotics and Automation (ICRA) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[129]  arXiv:2405.11794 [pdf, other]
Title: ViViD: Video Virtual Try-on using Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[130]  arXiv:2405.11793 [pdf, other]
Title: MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus Image-Text Expertise
Comments: Early accepted by the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131]  arXiv:2405.11770 [pdf, other]
Title: Learning Spatial Similarity Distribution for Few-shot Object Counting
Comments: Accepted to IJCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[132]  arXiv:2405.11765 [pdf, other]
Title: DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment
Comments: Manuscript submitted to IEEE Transactions on Image Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133]  arXiv:2405.11757 [pdf, other]
Title: DLAFormer: An End-to-End Transformer For Document Layout Analysis
Comments: ICDAR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[134]  arXiv:2405.11754 [pdf, other]
Title: Versatile Teacher: A Class-aware Teacher-student Framework for Cross-domain Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[135]  arXiv:2405.11732 [pdf, ps, other]
Title: Quality assurance of organs-at-risk delineation in radiotherapy
Comments: 14 pages, 5 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[136]  arXiv:2405.11690 [pdf, other]
Title: InterAct: Capture and Modelling of Realistic, Expressive and Interactive Activities between Two Persons in Daily Scenarios
Comments: The first two authors contributed equally to this work
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137]  arXiv:2405.11685 [pdf, other]
Title: ColorFoil: Investigating Color Blindness in Large Vision and Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[138]  arXiv:2405.11682 [pdf, other]
Title: FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention
Comments: Submitted to IEEE
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[139]  arXiv:2405.11677 [pdf, other]
Title: Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries
Comments: Early author version of paper. Refer to the full paper at this https URL
Journal-ref: IEEE Transactions on Image Processing (2024) (Volume: 33) Page(s): 2462 - 2476
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[140]  arXiv:2405.11675 [pdf, other]
Title: Deep Ensemble Art Style Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[141]  arXiv:2405.11655 [pdf, other]
Title: Track Anything Rapter(TAR)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[142]  arXiv:2405.11643 [pdf, other]
Title: Morphological Prototyping for Unsupervised Slide Representation Learning in Computational Pathology
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Applications (stat.AP)
[143]  arXiv:2405.11629 [pdf, other]
Title: Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[144]  arXiv:2405.11621 [pdf, ps, other]
Title: Computer Vision in the Food Industry: Accurate, Real-time, and Automatic Food Recognition with Pretrained MobileNetV2
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[145]  arXiv:2405.11618 [pdf, other]
Title: Transcriptomics-guided Slide Representation Learning in Computational Pathology
Comments: CVPR'24, Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[146]  arXiv:2405.11616 [pdf, other]
Title: Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[147]  arXiv:2405.11614 [pdf, other]
Title: Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[148]  arXiv:2405.11582 [pdf, other]
Title: SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization
Comments: Accepted to ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)