We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for cs.CV in Mar 2024

[ total of 3051 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 3051 ]
[ showing 25 entries per page: fewer | more ]
[1]  arXiv:2403.00068 [pdf, other]
Title: Artwork Explanation in Large-scale Vision Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[2]  arXiv:2403.00154 [pdf, other]
Title: LLMs in Political Science: Heralding a New Era of Visual Analysis
Authors: Yu Wang
Comments: 7 pages, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3]  arXiv:2403.00174 [pdf, other]
Title: A citizen science toolkit to collect human perceptions of urban environments using open street view images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[4]  arXiv:2403.00175 [pdf, other]
Title: FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything
Comments: 14 pages, 9 figures, 1 table
Journal-ref: Sensors 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[5]  arXiv:2403.00196 [pdf, other]
Title: Learning to Find Missing Video Frames with Synthetic Data Augmentation: A General Framework and Application in Generating Thermal Images Using RGB Cameras
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[6]  arXiv:2403.00206 [pdf, ps, other]
Title: MaskLRF: Self-supervised Pretraining via Masked Autoencoding of Local Reference Frames for Rotation-invariant 3D Point Set Analysis
Authors: Takahiko Furuya
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[7]  arXiv:2403.00209 [pdf, other]
Title: ChartReformer: Natural Language-Driven Chart Image Editing
Comments: Published in ICDAR 2024. Code and model are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[8]  arXiv:2403.00211 [pdf, other]
Title: Trustworthy Self-Attention: Enabling the Network to Focus Only on the Most Relevant References
Comments: Correct Figure 1
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[9]  arXiv:2403.00219 [pdf, other]
Title: Multi-modal Attribute Prompting for Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[10]  arXiv:2403.00231 [pdf, other]
Title: Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Comments: Project page: this https URL Fix typos
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[11]  arXiv:2403.00245 [pdf, other]
Title: YOLO-MED : Multi-Task Interaction Network for Biomedical Images
Comments: Accepted by ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[12]  arXiv:2403.00249 [pdf, other]
Title: Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
Comments: Accepted to LREC-COLING 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[13]  arXiv:2403.00250 [pdf, other]
Title: Rethinking Classifier Re-Training in Long-Tailed Recognition: A Simple Logits Retargeting Approach
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[14]  arXiv:2403.00257 [pdf, ps, other]
Title: Robust deep labeling of radiological emphysema subtypes using squeeze and excitation convolutional neural networks: The MESA Lung and SPIROMICS Studies
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[15]  arXiv:2403.00261 [pdf, other]
Title: Spatial Cascaded Clustering and Weighted Memory for Unsupervised Person Re-identification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[16]  arXiv:2403.00268 [pdf, other]
Title: Improving Acne Image Grading with Label Distribution Smoothing
Comments: Accepted to IEEE ISBI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[17]  arXiv:2403.00269 [pdf, other]
Title: Large Convolutional Model Tuning via Filter Subspace
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[18]  arXiv:2403.00272 [pdf, other]
Title: Dual Pose-invariant Embeddings: Learning Category and Object-specific Discriminative Representations for Recognition and Retrieval
Comments: Accepted by IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[19]  arXiv:2403.00274 [pdf, other]
Title: CustomListener: Text-guided Responsive Interaction for User-friendly Listening Head Generation
Comments: Accepted by CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[20]  arXiv:2403.00303 [pdf, other]
Title: ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
Comments: Accepted by CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[21]  arXiv:2403.00307 [pdf, other]
Title: Embedded Multi-label Feature Selection via Orthogonal Regression
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[22]  arXiv:2403.00325 [pdf, other]
Title: Small, Versatile and Mighty: A Range-View Perception Framework
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[23]  arXiv:2403.00326 [pdf, other]
Title: DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[24]  arXiv:2403.00327 [pdf, other]
Title: Task Indicating Transformer for Task-conditional Dense Predictions
Comments: Accepted by ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[25]  arXiv:2403.00352 [pdf, other]
Title: Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
Comments: Accepted to AAAI-2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[ total of 3051 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 3051 ]
[ showing 25 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2405, contact, help  (Access key information)