We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 275

[ total of 605 entries: 1-25 | ... | 201-225 | 226-250 | 251-275 | 276-300 | 301-325 | 326-350 | 351-375 | ... | 601-605 ]
[ showing 25 entries per page: fewer | more | all ]

Tue, 11 Jun 2024 (continued, showing 25 of 165 entries)

[276]  arXiv:2406.06201 [pdf, other]
Title: 2DP-2MRC: 2-Dimensional Pointer-based Machine Reading Comprehension Method for Multimodal Moment Retrieval
Comments: Accepted by INTERSPEECH 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[277]  arXiv:2406.06187 [pdf, other]
Title: An Effective-Efficient Approach for Dense Multi-Label Action Detection
Comments: 14 pages. arXiv admin note: substantial text overlap with arXiv:2308.05051
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[278]  arXiv:2406.06183 [pdf, other]
Title: Black carbon plumes from gas flaring in North Africa identified from multi-spectral imagery with deep learning
Comments: Published at the workshop Tackling Climate Change with Machine Learning at ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[279]  arXiv:2406.06165 [pdf, other]
Title: Generalized Nested Latent Variable Models for Lossy Coding applied to Wind Turbine Scenarios
Comments: Accepted to ICIP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG)
[280]  arXiv:2406.06163 [pdf, other]
Title: Extending Segment Anything Model into Auditory and Temporal Dimensions for Audio-Visual Segmentation
Comments: Accepted to ICIP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[281]  arXiv:2406.06136 [pdf, other]
Title: A Comparative Survey of Vision Transformers for Feature Extraction in Texture Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[282]  arXiv:2406.06134 [pdf, other]
Title: DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection
Comments: 10 pages (including supplementary), 3 figures, SynData4CV@CVPR 24 (Workshop)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[283]  arXiv:2406.06133 [pdf, other]
Title: ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models
Comments: 8 pages, 8 figures, CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[284]  arXiv:2406.06122 [pdf, ps, other]
Title: W-Net: One-Shot Arbitrary-Style Chinese Character Generation with Deep Neural Networks
Journal-ref: 2018, Neural Information Processing - 25th International Conference, ICONIP
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[285]  arXiv:2406.06089 [pdf, other]
Title: Texture Re-scalable Universal Adversarial Perturbation
Comments: 14 pages (accepted by TIFS2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[286]  arXiv:2406.06087 [pdf, other]
Title: GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
Comments: 28 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[287]  arXiv:2406.06079 [pdf, other]
Title: Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[288]  arXiv:2406.06072 [pdf, other]
Title: Adapting Pretrained ViTs with Convolution Injector for Visuo-Motor Control
Comments: accepted to ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[289]  arXiv:2406.06069 [pdf, other]
Title: PointABM:Integrating Bidirectional State Space Model with Multi-Head Self-Attention for Point Cloud Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[290]  arXiv:2406.06062 [pdf, other]
Title: ProcessPainter: Learn Painting Process from Sequence Data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[291]  arXiv:2406.06050 [pdf, other]
Title: Generalizable Human Gaussians from Single-View Image
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[292]  arXiv:2406.06048 [pdf, other]
Title: Robust Latent Representation Tuning for Image-text Classification
Authors: Hao Sun, Yu Song
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[293]  arXiv:2406.06045 [pdf, other]
Title: Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[294]  arXiv:2406.06044 [pdf, other]
Title: FRAG: Frequency Adapting Group for Diffusion Video Editing
Comments: 16 pages, 16 figures, ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[295]  arXiv:2406.06040 [pdf, other]
Title: Vript: A Video Is Worth Thousands of Words
Comments: submitted to NeurIPS Dataset & Benchmark track
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[296]  arXiv:2406.06039 [pdf, other]
Title: Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset
Comments: Accepted to ICML 2024, Code released at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[297]  arXiv:2406.06028 [pdf, other]
Title: ReCon1M:A Large-scale Benchmark Dataset for Relation Comprehension in Remote Sensing Imagery
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[298]  arXiv:2406.06004 [pdf, other]
Title: FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model
Comments: Accepted at ACL (Main) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[299]  arXiv:2406.05980 [pdf, other]
Title: Causality-inspired Latent Feature Augmentation for Single Domain Generalization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[300]  arXiv:2406.05967 [pdf, other]
[ total of 605 entries: 1-25 | ... | 201-225 | 226-250 | 251-275 | 276-300 | 301-325 | 326-350 | 351-375 | ... | 601-605 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help  (Access key information)