We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 343

[ total of 421 entries: 1-25 | ... | 269-293 | 294-318 | 319-343 | 344-368 | 369-393 | 394-418 | 419-421 ]
[ showing 25 entries per page: fewer | more | all ]

Fri, 10 May 2024 (continued, showing 25 of 86 entries)

[344]  arXiv:2405.05841 [pdf, other]
Title: Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition
Comments: Accepted to IJCAI2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[345]  arXiv:2405.05830 [pdf, ps, other]
Title: Mask-TS Net: Mask Temperature Scaling Uncertainty Calibration for Polyp Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[346]  arXiv:2405.05811 [pdf, other]
Title: Parallel Cross Strip Attention Network for Single Image Dehazing
Comments: 10 pages , 4 figures, CTISC'24
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[347]  arXiv:2405.05808 [pdf, other]
Title: Fast and Controllable Post-training Sparsity: Learning Optimal Sparsity Allocation with Global Constraint in Minutes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[348]  arXiv:2405.05806 [pdf, other]
Title: MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation
Comments: 34 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[349]  arXiv:2405.05803 [pdf, other]
Title: Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[350]  arXiv:2405.05791 [pdf, other]
Title: Sequential Amodal Segmentation via Cumulative Occlusion Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[351]  arXiv:2405.05769 [pdf, other]
Title: Exploring Text-Guided Single Image Editing for Remote Sensing Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[352]  arXiv:2405.05768 [pdf, other]
Title: FastScene: Text-Driven Fast 3D Indoor Scene Generation via Panoramic Gaussian Splatting
Comments: Accepted by IJCAI-2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353]  arXiv:2405.05766 [pdf, other]
Title: To Trust or Not to Trust: Towards a novel approach to measure trust for XAI systems
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[354]  arXiv:2405.05763 [pdf, ps, other]
Title: DP-MDM: Detail-Preserving MR Reconstruction via Multiple Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[355]  arXiv:2405.05760 [pdf, other]
Title: Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[356]  arXiv:2405.05755 [pdf, other]
Title: CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[357]  arXiv:2405.05749 [pdf, other]
Title: NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior
Comments: 11 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[358]  arXiv:2405.05745 [pdf, other]
Title: Efficient Pretraining Model based on Multi-Scale Local Visual Field Feature Reconstruction for PCB CT Image Element Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[359]  arXiv:2405.05742 [pdf, other]
Title: How Quality Affects Deep Neural Networks in Fine-Grained Image Classification
Comments: VISAPP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[360]  arXiv:2405.05714 [pdf, other]
Title: Estimating Noisy Class Posterior with Part-level Labels for Noisy Label Learning
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[361]  arXiv:2405.05707 [pdf, other]
Title: LatentColorization: Latent Diffusion-Based Speaker Video Colorization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[362]  arXiv:2405.05691 [pdf, other]
Title: StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[363]  arXiv:2405.05674 [pdf, ps, other]
Title: TransAnaNet: Transformer-based Anatomy Change Prediction Network for Head and Neck Cancer Patient Radiotherapy
Subjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[364]  arXiv:2405.05672 [pdf, other]
Title: Multi-Stream Keypoint Attention Network for Sign Language Recognition and Translation
Comments: 15 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[365]  arXiv:2405.05663 [pdf, other]
Title: RPBG: Towards Robust Neural Point-based Graphics in the Wild
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[366]  arXiv:2405.05647 [pdf, ps, other]
Title: Letter to the Editor: What are the legal and ethical considerations of submitting radiology reports to ChatGPT?
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[367]  arXiv:2405.05636 [pdf, other]
Title: SwapTalk: Audio-Driven Talking Face Generation with One-Shot Customization in Latent Space
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[368]  arXiv:2405.05615 [pdf, other]
Title: Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning
Comments: Accepted to ICML2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[ total of 421 entries: 1-25 | ... | 269-293 | 294-318 | 319-343 | 344-368 | 369-393 | 394-418 | 419-421 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)