Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 343

[ total of 421 entries: 1-25 | ... | 269-293 | 294-318 | 319-343 | 344-368 | 369-393 | 394-418 | 419-421 ]
[ showing 25 entries per page: fewer | more | all ]

Fri, 10 May 2024 (continued, showing 25 of 86 entries)

[344] arXiv:2405.05841 [pdf, other]: Title: Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition

Authors: Zuan Gao, Yuxin Wang, Yadong Qu, Boqiang Zhang, Zixiao Wang, Jianjun Xu, Hongtao Xie

Comments: Accepted to IJCAI2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[345] arXiv:2405.05830 [pdf, ps, other]: Title: Mask-TS Net: Mask Temperature Scaling Uncertainty Calibration for Polyp Segmentation

Authors: Yudian Zhang, Chenhao Xu, Kaiye Xu, Haijiang Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[346] arXiv:2405.05811 [pdf, other]: Title: Parallel Cross Strip Attention Network for Single Image Dehazing

Authors: Lihan Tong, Yun Liu, Tian Ye, Weijia Li, Liyuan Chen, Erkang Chen

Comments: 10 pages , 4 figures, CTISC'24

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[347] arXiv:2405.05808 [pdf, other]: Title: Fast and Controllable Post-training Sparsity: Learning Optimal Sparsity Allocation with Global Constraint in Minutes

Authors: Ruihao Gong, Yang Yong, Zining Wang, Jinyang Guo, Xiuying Wei, Yuqing Ma, Xianglong Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[348] arXiv:2405.05806 [pdf, other]: Title: MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation

Authors: Yuxiang Wei, Zhilong Ji, Jinfeng Bai, Hongzhi Zhang, Lei Zhang, Wangmeng Zuo

Comments: 34 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[349] arXiv:2405.05803 [pdf, other]: Title: Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference

Authors: Zhihang Lin, Mingbao Lin, Luxi Lin, Rongrong Ji

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[350] arXiv:2405.05791 [pdf, other]: Title: Sequential Amodal Segmentation via Cumulative Occlusion Learning

Authors: Jiayang Ao, Qiuhong Ke, Krista A. Ehinger

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[351] arXiv:2405.05769 [pdf, other]: Title: Exploring Text-Guided Single Image Editing for Remote Sensing Images

Authors: Fangzhou Han, Lingyu Si, Hongwei Dong, Lamei Zhang, Hao Chen, Bo Du

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[352] arXiv:2405.05768 [pdf, other]: Title: FastScene: Text-Driven Fast 3D Indoor Scene Generation via Panoramic Gaussian Splatting

Authors: Yikun Ma, Dandan Zhan, Zhi Jin

Comments: Accepted by IJCAI-2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353] arXiv:2405.05766 [pdf, other]: Title: To Trust or Not to Trust: Towards a novel approach to measure trust for XAI systems

Authors: Miquel Miró-Nicolau, Gabriel Moyà-Alcover, Antoni Jaume-i-Capó, Manuel González-Hidalgo, Maria Gemma Sempere Campello, Juan Antonio Palmer Sancho

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[354] arXiv:2405.05763 [pdf, ps, other]: Title: DP-MDM: Detail-Preserving MR Reconstruction via Multiple Diffusion Models

Authors: Mengxiao Geng, Jiahao Zhu, Xiaolin Zhu, Qiqing Liu, Dong Liang, Qiegen Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[355] arXiv:2405.05760 [pdf, other]: Title: Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media

Authors: Zhizhen Zhang, Ning Wang, Haojie Li, Zhihui Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[356] arXiv:2405.05755 [pdf, other]: Title: CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks

Authors: Nick Nikzad, Yongsheng Gao, Jun Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[357] arXiv:2405.05749 [pdf, other]: Title: NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior

Authors: Gihoon Kim, Kwanggyoon Seo, Sihun Cha, Junyong Noh

Comments: 11 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[358] arXiv:2405.05745 [pdf, other]: Title: Efficient Pretraining Model based on Multi-Scale Local Visual Field Feature Reconstruction for PCB CT Image Element Segmentation

Authors: Chen Chen, Kai Qiao, Jie Yang, Jian Chen, Bin Yan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[359] arXiv:2405.05742 [pdf, other]: Title: How Quality Affects Deep Neural Networks in Fine-Grained Image Classification

Authors: Joseph Smith, Zheming Zuo, Jonathan Stonehouse, Boguslaw Obara

Comments: VISAPP 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[360] arXiv:2405.05714 [pdf, other]: Title: Estimating Noisy Class Posterior with Part-level Labels for Noisy Label Learning

Authors: Rui Zhao, Bin Shi, Jianfei Ruan, Tianze Pan, Bo Dong

Comments: CVPR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[361] arXiv:2405.05707 [pdf, other]: Title: LatentColorization: Latent Diffusion-Based Speaker Video Colorization

Authors: Rory Ward, Dan Bigioi, Shubhajit Basak, John G. Breslin, Peter Corcoran

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[362] arXiv:2405.05691 [pdf, other]: Title: StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework

Authors: Yiheng Huang, Hui Yang, Chuanchen Luo, Yuxi Wang, Shibiao Xu, Zhaoxiang Zhang, Man Zhang, Junran Peng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[363] arXiv:2405.05674 [pdf, ps, other]: Title: TransAnaNet: Transformer-based Anatomy Change Prediction Network for Head and Neck Cancer Patient Radiotherapy

Authors: Meixu Chen, Kai Wang, Michael Dohopolski, Howard Morgan, Jing Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[364] arXiv:2405.05672 [pdf, other]: Title: Multi-Stream Keypoint Attention Network for Sign Language Recognition and Translation

Authors: Mo Guan, Yan Wang, Guangkun Ma, Jiarui Liu, Mingzu Sun

Comments: 15 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[365] arXiv:2405.05663 [pdf, other]: Title: RPBG: Towards Robust Neural Point-based Graphics in the Wild

Authors: Qingtian Zhu, Zizhuang Wei, Zhongtian Zheng, Yifan Zhan, Zhuyu Yao, Jiawang Zhang, Kejian Wu, Yinqiang Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[366] arXiv:2405.05647 [pdf, ps, other]: Title: Letter to the Editor: What are the legal and ethical considerations of submitting radiology reports to ChatGPT?

Authors: Siddharth Agarwal, David Wood, Robin Carpenter, Yiran Wei, Marc Modat, Thomas C Booth

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[367] arXiv:2405.05636 [pdf, other]: Title: SwapTalk: Audio-Driven Talking Face Generation with One-Shot Customization in Latent Space

Authors: Zeren Zhang, Haibo Qin, Jiayu Huang, Yixin Li, Hui Lin, Yitao Duan, Jinwen Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[368] arXiv:2405.05615 [pdf, other]: Title: Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning

Authors: Shibo Jie, Yehui Tang, Ning Ding, Zhi-Hong Deng, Kai Han, Yunhe Wang

Comments: Accepted to ICML2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)

[ total of 421 entries: 1-25 | ... | 269-293 | 294-318 | 319-343 | 344-368 | 369-393 | 394-418 | 419-421 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 343

Fri, 10 May 2024 (continued, showing 25 of 86 entries)