Computer Vision and Pattern Recognition
Authors and titles for recent submissions
[ total of 420 entries: 1-420 ][ showing up to 553 entries per page: fewer | more ]
Tue, 21 May 2024
- [1] arXiv:2405.12221 [pdf, other]
-
Title: Images that Sound: Composing Images and Sounds on a Single CanvasComments: Project site: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [2] arXiv:2405.12218 [pdf, other]
-
Title: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View StereoAuthors: Tianqi Liu, Guangcong Wang, Shoukang Hu, Liao Shen, Xinyi Ye, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei LiuComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [3] arXiv:2405.12217 [pdf, other]
-
Title: Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context LearningAuthors: Guanglin Zhou, Zhongyi Han, Shiming Chen, Biwei Huang, Liming Zhu, Salman Khan, Xin Gao, Lina YaoComments: 17 pages, 7 figures, 7 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [4] arXiv:2405.12211 [pdf, other]
-
Title: Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal SlicesAuthors: Nathaniel Cohen, Vladimir Kulikov, Matan Kleiner, Inbar Huberman-Spiegelglas, Tomer MichaeliComments: ICML 2024. Code and examples are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [5] arXiv:2405.12202 [pdf, other]
-
Title: Hierarchical Neural Operator Transformer with Learnable Frequency-aware Loss Prior for Arbitrary-scale Super-resolutionComments: 20 pages, 14 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [6] arXiv:2405.12200 [pdf, other]
-
Title: Multi-View Attentive Contextualization for Multi-View 3D Object DetectionComments: Accepted by CVPR2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [7] arXiv:2405.12175 [pdf, other]
-
Title: Enhancing Explainable AI: A Hybrid Approach Combining GradCAM and LRP for CNN InterpretabilitySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [8] arXiv:2405.12150 [pdf, other]
-
Title: Bangladeshi Native Vehicle Detection in WildAuthors: Bipin Saha, Md. Johirul Islam, Shaikh Khaled Mostaque, Aditya Bhowmik, Tapodhir Karmakar Taton, Md. Nakib Hayat Chowdhury, Mamun Bin Ibne ReazComments: 13 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [9] arXiv:2405.12139 [pdf, other]
-
Title: DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLMComments: Accepted by CVPR Workshop 2024, Oral PresentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [10] arXiv:2405.12126 [pdf, other]
-
Title: Alzheimer's Magnetic Resonance Imaging Classification Using Deep and Meta-Learning ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Machine Learning (cs.LG); Multimedia (cs.MM)
- [11] arXiv:2405.12114 [pdf, other]
-
Title: A New Cross-Space Total Variation Regularization Model for Color Image Restoration with Quaternion Blur OperatorComments: 15pages,10figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
- [12] arXiv:2405.12110 [pdf, other]
-
Title: CoR-GS: Sparse-View 3D Gaussian Splatting via Co-RegularizationComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [13] arXiv:2405.12107 [pdf, other]
-
Title: Imp: Highly Capable Large Multimodal Models for Mobile DevicesAuthors: Zhenwei Shao, Zhou Yu, Jun Yu, Xuecheng Ouyang, Lihao Zheng, Zhenbiao Gai, Mingyang Wang, Jiajun DingComments: 19 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [14] arXiv:2405.12105 [pdf, other]
-
Title: Sheet Music Transformer ++: End-to-End Full-Page Optical Music Recognition for Pianoform Sheet MusicSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [15] arXiv:2405.12070 [pdf, other]
-
Title: AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot MovementsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [16] arXiv:2405.12069 [pdf, other]
-
Title: Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture WarpingComments: Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [17] arXiv:2405.12057 [pdf, other]
-
Title: NPLMV-PS: Neural Point-Light Multi-View Photometric StereoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [18] arXiv:2405.12018 [pdf, other]
-
Title: Continuous Sign Language Recognition with Adapted Conformer via Unsupervised PretrainingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [19] arXiv:2405.12006 [pdf, other]
-
Title: Depth Reconstruction with Neural Signed Distance Fields in Structured Light SystemsComments: 10 pages, 8 figures, accepted by 3DV 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [20] arXiv:2405.12003 [pdf, other]
-
Title: Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image ClassificationComments: 19 pages, 16 figures,Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [21] arXiv:2405.11993 [pdf, other]
-
Title: GGAvatar: Geometric Adjustment of Gaussian Head AvatarComments: 9 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [22] arXiv:2405.11985 [pdf, other]
-
Title: MTVQA: Benchmarking Multilingual Text-Centric Visual Question AnsweringAuthors: Jingqun Tang, Qi Liu, Yongjie Ye, Jinghui Lu, Shu Wei, Chunhui Lin, Wanqing Li, Mohamad Fitri Faiz Bin Mahmood, Hao Feng, Zhen Zhao, Yanjie Wang, Yuliang Liu, Hao Liu, Xiang Bai, Can HuangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [23] arXiv:2405.11978 [pdf, other]
-
Title: SM-DTW: Stability Modulated Dynamic Time Warping for signature verificationJournal-ref: Pattern Recognition Letters, Volume: 121, Pages 113-122 (2019)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [24] arXiv:2405.11977 [pdf, other]
-
Title: GuidedRec: Guiding Ill-Posed Unsupervised Volumetric RecoveryAuthors: Alexandre Cafaro, Amaury Leroy, Guillaume Beldjoudi, Pauline Maury, Charlotte Robert, Eric Deutsch, Vincent Grégoire, Vincent Lepetit, Nikos ParagiosSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [25] arXiv:2405.11976 [pdf, other]
-
Title: Position-Guided Prompt Learning for Anomaly Detection in Chest X-RaysComments: MICCAI 2024 Early AcceptSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [26] arXiv:2405.11971 [pdf, other]
-
Title: Data Augmentation for Text-based Person Retrieval Using Large Language ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [27] arXiv:2405.11936 [pdf, other]
-
Title: UAV-VisLoc: A Large-scale Dataset for UAV Visual LocalizationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [28] arXiv:2405.11921 [pdf, other]
-
Title: MirrorGaussian: Reflecting 3D Gaussians for Reconstructing Mirror ReflectionsAuthors: Jiayue Liu, Xiao Tang, Freeman Cheng, Roy Yang, Zhihao Li, Jianzhuang Liu, Yi Huang, Jiaqi Lin, Shiyong Liu, Xiaofei Wu, Songcen Xu, Chun YuanSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [29] arXiv:2405.11914 [pdf, other]
-
Title: PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB ImagesComments: 10 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [30] arXiv:2405.11913 [pdf, other]
-
Title: Diff-BGM: A Diffusion Model for Video Background Music GenerationComments: Accepted by CVPR 2024(Poster)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [31] arXiv:2405.11905 [pdf, other]
-
Title: CSTA: CNN-based Spatiotemporal Attention for Video SummarizationComments: Accepted at CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [32] arXiv:2405.11903 [pdf, ps, other]
-
Title: A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentationAuthors: Sushmita Sarker, Prithul Sarker, Gunner Stone, Ryan Gorman, Alireza Tavakkoli, George Bebis, Javad SattarvandComments: Published in Springer Nature (Machine Vision and Applications)Journal-ref: Machine Vision and Applications 35, 67 (2024)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [33] arXiv:2405.11894 [pdf, other]
-
Title: Refining Coded Image in Human Vision Layer Using CNN-Based Post-ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [34] arXiv:2405.11867 [pdf, other]
-
Title: Depth Prompting for Sensor-Agnostic Depth EstimationComments: Accepted at CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [35] arXiv:2405.11862 [pdf, other]
-
Title: SEMv3: A Fast and Robust Approach to Table Separation Line DetectionComments: 9 pages, 6 figures, 5 tables. Accepted by IJCAI2024 main trackSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [36] arXiv:2405.11852 [pdf, other]
-
Title: Evolving Storytelling: Benchmarks and Methods for New Character Customization with Diffusion ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [37] arXiv:2405.11850 [pdf, other]
- [38] arXiv:2405.11846 [pdf, other]
-
Title: EPPS: Advanced Polyp Segmentation via Edge Information Injection and Selective Feature DecouplingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [39] arXiv:2405.11837 [pdf, other]
-
Title: Improving the Explain-Any-Concept by Introducing Nonlinearity to the Trainable Surrogate ModelComments: This paper is accepted for publication at IEEE SIU conference, 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [40] arXiv:2405.11823 [pdf, other]
-
Title: Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video ReconstructionComments: International Conference of Computational Photography (ICCP 2024), 11 pages and 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [41] arXiv:2405.11822 [pdf, other]
-
Title: FeTT: Continual Class Incremental Learning via Feature Transformation TuningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [42] arXiv:2405.11814 [pdf, other]
-
Title: Climatic & Anthropogenic Hazards to the Nasca World Heritage: Application of Remote Sensing, AI, and Flood ModellingComments: accepted at IGARSS 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
- [43] arXiv:2405.11809 [pdf, other]
-
Title: Distill-then-prune: An Efficient Compression Framework for Real-time Stereo Matching Network on Edge DevicesComments: International Conference on Robotics and Automation (ICRA) 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [44] arXiv:2405.11794 [pdf, other]
-
Title: ViViD: Video Virtual Try-on using Diffusion ModelsAuthors: Zixun Fang, Wei Zhai, Aimin Su, Hongliang Song, Kai Zhu, Mao Wang, Yu Chen, Zhiheng Liu, Yang Cao, Zheng-Jun ZhaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [45] arXiv:2405.11793 [pdf, other]
-
Title: MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus Image-Text ExpertiseComments: Early Accepted by The International Conference on Medical Image Computing and Computer Assisted Intervention(MICCAI)2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [46] arXiv:2405.11770 [pdf, other]
-
Title: Learning Spatial Similarity Distribution for Few-shot Object CountingComments: Accepted to IJCAI2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [47] arXiv:2405.11765 [pdf, other]
-
Title: DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical AlignmentComments: Manuscript submitted to IEEE Transactions on Image ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [48] arXiv:2405.11757 [pdf, other]
-
Title: DLAFormer: An End-to-End Transformer For Document Layout AnalysisComments: ICDAR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [49] arXiv:2405.11754 [pdf, other]
-
Title: Versatile Teacher: A Class-aware Teacher-student Framework for Cross-domain AdaptationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [50] arXiv:2405.11732 [pdf, ps, other]
-
Title: Quality assurance of organs-at-risk delineation in radiotherapyAuthors: Yihao Zhao, Cuiyun Yuan, Ying Liang, Yang Li, Chunxia Li, Man Zhao, Jun Hu, Wei Liu, Chenbin LiuComments: 14 pages,5 figures, 3 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [51] arXiv:2405.11690 [pdf, other]
-
Title: InterAct: Capture and Modelling of Realistic, Expressive and Interactive Activities between Two Persons in Daily ScenariosComments: The first two authors contributed equally to this workSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [52] arXiv:2405.11685 [pdf, other]
-
Title: ColorFoil: Investigating Color Blindness in Large Vision and Language ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [53] arXiv:2405.11682 [pdf, other]
-
Title: FADet: A Multi-sensor 3D Object Detection Network based on Local Featured AttentionComments: Submitted to IEEESubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [54] arXiv:2405.11677 [pdf, other]
-
Title: Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging GeometriesAuthors: Christiaan G.A. Viviers, Lena Filatova, Maurice Termeer, Peter H.N. de With, Fons van der SommenComments: Early author version of paper. Refer to the full paper at this https URLJournal-ref: IEEE Transactions on Image Processing (2024) (Volume: 33) Page(s): 2462 - 2476Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [55] arXiv:2405.11675 [pdf, other]
-
Title: Deep Ensemble Art Style RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [56] arXiv:2405.11655 [pdf, other]
-
Title: Track Anything Rapter(TAR)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [57] arXiv:2405.11643 [pdf, other]
-
Title: Morphological Prototyping for Unsupervised Slide Representation Learning in Computational PathologyAuthors: Andrew H. Song, Richard J. Chen, Tong Ding, Drew F.K. Williamson, Guillaume Jaume, Faisal MahmoodComments: CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Applications (stat.AP)
- [58] arXiv:2405.11629 [pdf, other]
-
Title: Searching Realistic-Looking Adversarial Objects For Autonomous Driving SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [59] arXiv:2405.11621 [pdf, ps, other]
-
Title: Computer Vision in the Food Industry: Accurate, Real-time, and Automatic Food Recognition with Pretrained MobileNetV2Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [60] arXiv:2405.11618 [pdf, other]
-
Title: Transcriptomics-guided Slide Representation Learning in Computational PathologyAuthors: Guillaume Jaume, Lukas Oldenburg, Anurag Vaidya, Richard J. Chen, Drew F.K. Williamson, Thomas Peeters, Andrew H. Song, Faisal MahmoodComments: CVPR'24, OralSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [61] arXiv:2405.11616 [pdf, other]
-
Title: Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise AttentionAuthors: Peng Li, Yuan Liu, Xiaoxiao Long, Feihu Zhang, Cheng Lin, Mengfei Li, Xingqun Qi, Shanghang Zhang, Wenhan Luo, Ping Tan, Wenping Wang, Qifeng Liu, Yike GuoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [62] arXiv:2405.11614 [pdf, other]
-
Title: Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge DistillationSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [63] arXiv:2405.11582 [pdf, other]
-
Title: SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch NormalizationComments: Accepted to ICML 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [64] arXiv:2405.11574 [pdf, other]
-
Title: Reproducibility Study of CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image ClassificationComments: Reproducibility studySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [65] arXiv:2405.11564 [pdf, other]
-
Title: CRF360D: Monocular 360 Depth Estimation via Spherical Fully-Connected CRFsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [66] arXiv:2405.11551 [pdf, other]
-
Title: An Invisible Backdoor Attack Based On Semantic FeatureAuthors: Yangming ChenSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [67] arXiv:2405.11536 [pdf, other]
-
Title: RobMOT: Robust 3D Multi-Object Tracking by Observational Noise and State Estimation Drift Mitigation on LiDAR PointCloudSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [68] arXiv:2405.11526 [pdf, other]
-
Title: Register assisted aggregation for Visual Place RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [69] arXiv:2405.11523 [pdf, other]
-
Title: Diffusion-Based Hierarchical Image SteganographyComments: arXiv admin note: text overlap with arXiv:2305.16936Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [70] arXiv:2405.11511 [pdf, other]
-
Title: Online Action Representation using Change Detection and Symbolic ProgrammingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [71] arXiv:2405.11501 [pdf, other]
-
Title: DogFLW: Dog Facial Landmarks in the Wild DatasetAuthors: George Martvel, Greta Abele, Annika Bremhorst, Chiara Canori, Nareed Farhat, Giulia Pedretti, Ilan Shimshoni, Anna ZamanskySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [72] arXiv:2405.11498 [pdf, other]
-
Title: The Effectiveness of Edge Detection Evaluation Metrics for Automated Coastline DetectionJournal-ref: 2023 Photonics & Electromagnetics Research Symposium (PIERS)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [73] arXiv:2405.11496 [pdf, other]
-
Title: DEMO: A Statistical Perspective for Efficient Image-Text MatchingSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
- [74] arXiv:2405.11494 [pdf, other]
-
Title: Automated Coastline Extraction Using Edge Detection AlgorithmsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [75] arXiv:2405.11493 [pdf, other]
-
Title: Point Cloud Compression with Implicit Neural Representations: A Unified FrameworkComments: 6 Pages, 6 Figures, submitted to IEEE ICCCSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Signal Processing (eess.SP)
- [76] arXiv:2405.11491 [pdf, other]
-
Title: BOSC: A Backdoor-based Framework for Open Set Synthetic Image AttributionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [77] arXiv:2405.11487 [pdf, other]
-
Title: "Previously on ..." From Recaps to Story SummarizationComments: CVPR 2024; Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [78] arXiv:2405.11483 [pdf, other]
-
Title: MICap: A Unified Model for Identity-aware Movie DescriptionsComments: CVPR 2024, Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [79] arXiv:2405.11481 [pdf, other]
-
Title: Physics-aware Hand-object Interaction DenoisingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [80] arXiv:2405.11478 [pdf, other]
-
Title: Unsupervised Image Prior via Prompt Learning and CLIP Semantic Guidance for Low-Light Image EnhancementComments: Accepted to CVPR 2024 Workshop NTIRE: New Trends in Image Restoration and Enhancement workshop and ChallengesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [81] arXiv:2405.11476 [pdf, other]
-
Title: NubbleDrop: A Simple Way to Improve Matching Strategy for Prompted One-Shot SegmentationComments: Under ReviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [82] arXiv:2405.11473 [pdf, other]
-
Title: FIFO-Diffusion: Generating Infinite Videos from Text without TrainingComments: Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [83] arXiv:2405.11468 [pdf, other]
-
Title: Emphasizing Crucial Features for Efficient Image RestorationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [84] arXiv:2405.11467 [pdf, other]
-
Title: AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data AugmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [85] arXiv:2405.11448 [pdf, other]
-
Title: Cross-Domain Knowledge Distillation for Low-Resolution Human Pose EstimationComments: 11 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [86] arXiv:2405.11442 [pdf, other]
-
Title: Unifying 3D Vision-Language Understanding via Promptable QueriesAuthors: Ziyu Zhu, Zhuofan Zhang, Xiaojian Ma, Xuesong Niu, Yixin Chen, Baoxiong Jia, Zhidong Deng, Siyuan Huang, Qing LiComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [87] arXiv:2405.11437 [pdf, other]
-
Title: The First Swahili Language Scene Text Detection and Recognition DatasetComments: Accepted to ICDAR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [88] arXiv:2405.11351 [pdf, other]
-
Title: PlantTracing: Tracing Arabidopsis Thaliana Apex with CenterTrackComments: 4 pages, 13 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [89] arXiv:2405.11345 [pdf, ps, other]
-
Title: City-Scale Multi-Camera Vehicle Tracking System with Improved Self-Supervised Camera Link ModelSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [90] arXiv:2405.11338 [pdf, ps, other]
-
Title: EyeFound: A Multimodal Generalist Foundation Model for Ophthalmic ImagingAuthors: Danli Shi, Weiyi Zhang, Xiaolan Chen, Yexin Liu, Jianchen Yang, Siyu Huang, Yih Chung Tham, Yingfeng Zheng, Mingguang HeComments: 21 pages, 2 figures, 4 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [91] arXiv:2405.11337 [pdf, other]
-
Title: A Unified Approach Towards Active Learning and Out-of-Distribution DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [92] arXiv:2405.11336 [pdf, other]
-
Title: UPAM: Unified Prompt Attack in Text-to-Image Generation Models Against Both Textual Filters and Visual CheckersComments: Accepted by ICML2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [93] arXiv:2405.11315 [pdf, other]
-
Title: MediCLIP: Adapting CLIP for Few-shot Medical Image Anomaly DetectionComments: 12 pages, 3 figures, 5 tables, early accepted at MICCAI 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [94] arXiv:2405.11293 [pdf, other]
-
Title: InfRS: Incremental Few-Shot Object Detection in Remote Sensing ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [95] arXiv:2405.11286 [pdf, other]
-
Title: Motion Avatar: Generate Human and Animal Avatars with Arbitrary MotionAuthors: Zeyu Zhang, Yiran Wang, Biao Wu, Shuo Chen, Zhiyuan Zhang, Shiya Huang, Wenbo Zhang, Meng Fang, Ling Chen, Yang ZhaoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [96] arXiv:2405.11276 [pdf, other]
-
Title: Visible and Clear: Finding Tiny Objects in Difference MapSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [97] arXiv:2405.11270 [pdf, other]
-
Title: HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Textures from VideosSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [98] arXiv:2405.11252 [pdf, other]
-
Title: Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score MatchingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [99] arXiv:2405.11240 [pdf, other]
-
Title: Testing the Performance of Face Recognition for People with Down SyndromeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [100] arXiv:2405.11236 [pdf, other]
-
Title: TriLoRA: Integrating SVD for Advanced Style Personalization in Text-to-Image GenerationComments: Accepted by AI for Content Creation (AI4CC) workshop at CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [101] arXiv:2405.11205 [pdf, other]
-
Title: Fuse & Calibrate: A bi-directional Vision-Language Guided Framework for Referring Image SegmentationComments: 12 pages, 4 figures ICIC2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [102] arXiv:2405.11190 [pdf, other]
-
Title: ReasonPix2Pix: Instruction Reasoning Dataset for Advanced Image EditingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [103] arXiv:2405.11180 [pdf, other]
-
Title: GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [104] arXiv:2405.11165 [pdf, other]
-
Title: Automated Multi-level Preference for MLLMsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [105] arXiv:2405.11158 [pdf, other]
-
Title: Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using Visual Foundation ModelsComments: The paper is published at ICRA 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [106] arXiv:2405.11154 [pdf, other]
-
Title: Revisiting the Robust Generalization of Adversarial Prompt TuningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [107] arXiv:2405.11151 [pdf, other]
-
Title: Multi-scale Information Sharing and Selection Network with Boundary Attention for Polyp SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [108] arXiv:2405.11145 [pdf, other]
-
Title: Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless PredictionsAuthors: Junzhang Liu, Zhecan Wang, Hammad Ayyubi, Haoxuan You, Chris Thomas, Rui Sun, Shih-Fu Chang, Kai-Wei ChangSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [109] arXiv:2405.11129 [pdf, other]
-
Title: MotionGS : Compact Gaussian Splatting SLAM by Motion FilterSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [110] arXiv:2405.11126 [pdf, other]
-
Title: Flexible Motion In-betweening with Diffusion ModelsComments: SIGGRAPH 2024. For project page and code, see this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [111] arXiv:2405.11112 [pdf, other]
-
Title: Enhancing Understanding Through Wildlife Re-IdentificationAuthors: J. BuitenhuisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [112] arXiv:2405.11067 [pdf, other]
-
Title: Bayesian Learning-driven Prototypical Contrastive Loss for Class-Incremental LearningAuthors: Nisha L. Raichur, Lucas Heublein, Tobias Feigl, Alexander Rügamer, Christopher Mutschler, Felix OttSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [113] arXiv:2405.11021 [pdf, other]
-
Title: Photorealistic 3D Urban Scene Reconstruction and Point Cloud Extraction using Google Earth Imagery and Gaussian SplattingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [114] arXiv:2405.10954 [pdf, ps, other]
-
Title: Multimodal CLIP Inference for Meta-Few-Shot Image ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [115] arXiv:2405.10952 [pdf, other]
-
Title: VICAN: Very Efficient Calibration Algorithm for Large Camera NetworksComments: To appear at the IEEE International Conference on Robotics and Automation (ICRA), 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [116] arXiv:2405.10951 [pdf, other]
-
Title: Block Selective Reprogramming for On-device Training of Vision TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [117] arXiv:2405.10949 [pdf, other]
-
Title: Global License Plate DatasetAuthors: Siddharth AgrawalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [118] arXiv:2405.10948 [pdf, other]
-
Title: Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic SurgeryAuthors: Guankun Wang, Long Bai, Wan Jun Nah, Jie Wang, Zhaoxi Zhang, Zhen Chen, Jinlin Wu, Mobarakol Islam, Hongbin Liu, Hongliang RenSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO); Image and Video Processing (eess.IV)
- [119] arXiv:2405.10947 [pdf, other]
-
Title: Depth-aware Panoptic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [120] arXiv:2405.10946 [pdf, other]
-
Title: Application of Tensorized Neural Networks for Cloud ClassificationAuthors: Alifu Xiafukaiti, Devanshu Garg, Aruto Hosaka, Koichi Yanagisawa, Yuichiro Minato, Tsuyoshi YoshidaComments: 11 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Atmospheric and Oceanic Physics (physics.ao-ph)
- [121] arXiv:2405.12171 (cross-list from cs.SE) [pdf, other]
-
Title: State of the Practice for Medical Imaging SoftwareComments: 73 pages, 14 figures, 12 tablesSubjects: Software Engineering (cs.SE); Computer Vision and Pattern Recognition (cs.CV)
- [122] arXiv:2405.11880 (cross-list from cs.LG) [pdf, other]
-
Title: Quantifying In-Context Reasoning Effects and Memorization Effects in LLMsSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [123] arXiv:2405.11829 (cross-list from cs.LG) [pdf, other]
-
Title: Adversarially Diversified Rehearsal Memory (ADRM): Mitigating Memory Overfitting Challenge in Continual LearningSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [124] arXiv:2405.11708 (cross-list from cs.LG) [pdf, other]
-
Title: Adaptive Batch Normalization Networks for Adversarial RobustnessComments: Accepted at IEEE International Conference on Advanced Video and Signal-based Surveillance (AVSS) 2024Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [125] arXiv:2405.11659 (cross-list from cs.RO) [pdf, other]
-
Title: Auto-Platoon : Freight by exampleSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [126] arXiv:2405.11640 (cross-list from cs.AI) [pdf, other]
-
Title: Inquire, Interact, and Integrate: A Proactive Agent Collaborative Framework for Zero-Shot Multimodal Medical ReasoningSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [127] arXiv:2405.11598 (cross-list from eess.IV) [pdf, other]
-
Title: AI-Assisted Diagnosis for Covid-19 CXR Screening: From Data Collection to Clinical ValidationAuthors: Carlo Alberto Barbano, Riccardo Renzulli, Marco Grosso, Domenico Basile, Marco Busso, Marco GrangettoComments: Accepted at 21st IEEE International Symposium on Biomedical Imaging (ISBI)Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [128] arXiv:2405.11533 (cross-list from cs.LG) [pdf, other]
-
Title: Hierarchical Selective ClassificationSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [129] arXiv:2405.11492 (cross-list from cs.RO) [pdf, other]
-
Title: Enhancing Vehicle Aerodynamics with Deep Reinforcement Learning in Voxelised ModelsSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [130] arXiv:2405.11386 (cross-list from eess.IV) [pdf, other]
-
Title: Liver Fat Quantification Network with Body ShapeSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [131] arXiv:2405.11326 (cross-list from cs.LG) [pdf, other]
-
Title: On the Trajectory Regularity of ODE-based Diffusion SamplingComments: ICML 2024, 30 pagesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [132] arXiv:2405.11320 (cross-list from cs.LG) [pdf, other]
-
Title: Sampling Strategies for Mitigating Bias in Face Synthesis MethodsComments: Accepted to the BIAS 2023 ECML-PKDD WorkshopSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [133] arXiv:2405.11301 (cross-list from cs.CL) [pdf, other]
-
Title: Enhancing Fine-Grained Image Classifications via Cascaded Vision Language ModelsAuthors: Canshi WeiSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [134] arXiv:2405.11298 (cross-list from cs.RO) [pdf, other]
-
Title: Visual Episodic Memory-based ExplorationComments: FLAIRS 2023, 7 pages, 11 figuresJournal-ref: The International FLAIRS Conference Proceedings. Vol. 36. 2023Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [135] arXiv:2405.11295 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Medical Image Analysis for Detection, Treatment and Planning of Disease using Artificial Intelligence ApproachesComments: 10 pages, 3 figuresJournal-ref: International Journal of Microsystems and IoT, Vol. 1, Issue 5, pp.278- 287, 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
- [136] arXiv:2405.11289 (cross-list from eess.IV) [pdf, other]
-
Title: Diffusion Model Driven Test-Time Image Adaptation for Robust Skin Lesion ClassificationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [137] arXiv:2405.11273 (cross-list from cs.AI) [pdf, other]
-
Title: Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of ExpertsAuthors: Yunxin Li, Shenyuan Jiang, Baotian Hu, Longyue Wang, Wanqi Zhong, Wenhan Luo, Lin Ma, Min ZhangComments: 22 pages, 13 figures. Project Website: this https URL Working in progressSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [138] arXiv:2405.11176 (cross-list from cs.RO) [pdf, other]
-
Title: Outlier-Robust Long-Term Robotic Mapping Leveraging Ground SegmentationAuthors: Hyungtae LimComments: 2 pages, 4 figuresSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [139] arXiv:2405.11133 (cross-list from eess.IV) [pdf, ps, other]
-
Title: XCAT-2.0: A Comprehensive Library of Personalized Digital Twins Derived from CT ScansAuthors: Lavsen Dahal, Mobina Ghojoghnejad, Dhrubajyoti Ghosh, Yubraj Bhandari, David Kim, Fong Chi Ho, Fakrul Islam Tushar, Ehsan Abadi, Ehsan Samei, Joseph Lo, Paul SegarsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [140] arXiv:2405.11064 (cross-list from eess.SP) [pdf, other]
-
Title: TVCondNet: A Conditional Denoising Neural Network for NMR SpectroscopyAuthors: Zihao Zou, Shirin Shoushtari, Jiaming Liu, Jialiang Zhang, Patrick Judge, Emilia Santana, Alison Lim, Marcus Foston, Ulugbek S. KamilovSubjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
- [141] arXiv:2405.11029 (cross-list from cs.LG) [pdf, other]
-
Title: Generative Artificial Intelligence: A Systematic Review and ApplicationsSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [142] arXiv:2405.10950 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Classification of colorectal primer carcinoma from normal colon with mid-infrared spectraComments: 15 pages, 5 figures, 4 tables, Conferentia Chemometrica 2023 special edition, for the original digital location, see this https URL , digital biblio info: (2024) e3542Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Tissues and Organs (q-bio.TO)
Mon, 20 May 2024
- [143] arXiv:2405.10934 [pdf, other]
-
Title: Reconstruction of Manipulated Garment with Guided Deformation PriorSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [144] arXiv:2405.10913 [pdf, other]
-
Title: Blackbox Adaptation for Medical Image SegmentationComments: Accepted early at MICCAI 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [145] arXiv:2405.10885 [pdf, other]
- [146] arXiv:2405.10879 [pdf, other]
-
Title: One registration is worth two segmentationsComments: Early Accepted by MICCAI2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [147] arXiv:2405.10871 [pdf, other]
-
Title: BraTS-Path Challenge: Assessing Heterogeneous Histopathologic Brain Tumor Sub-regionsAuthors: Spyridon Bakas, Siddhesh P. Thakur, Shahriar Faghani, Mana Moassefi, Ujjwal Baid, Verena Chung, Sarthak Pati, Shubham Innani, Bhakti Baheti, Jake Albrecht, Alexandros Karargyris, Hasan Kassem, MacLean P. Nasrallah, Jared T. Ahrendsen, Valeria Barresi, Maria A. Gubbiotti, Giselle Y. López, Calixto-Hope G. Lucas, Michael L. Miller, Lee A. D. Cooper, Jason T. Huse, William R. BellSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [148] arXiv:2405.10868 [pdf, other]
-
Title: Air Signing and Privacy-Preserving Signature Verification for Digital DocumentsSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [149] arXiv:2405.10864 [pdf, other]
-
Title: Improving face generation quality and prompt following with synthetic captionsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [150] arXiv:2405.10842 [pdf, ps, other]
-
Title: Automated Radiology Report Generation: A Review of Recent AdvancesComments: 24 pages, 8 figures, 6 tables. Submitted to IEEE Reviews in Biomedical EngineeringSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [151] arXiv:2405.10832 [pdf, other]
-
Title: Open-Vocabulary Spatio-Temporal Action DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [152] arXiv:2405.10802 [pdf, other]
-
Title: Reduced storage direct tensor ring decomposition for convolutional neural networks compressionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [153] arXiv:2405.10748 [pdf, other]
-
Title: Deep Data Consistency: a Fast and Robust Diffusion Model-based Solver for Inverse ProblemsComments: Codes: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [154] arXiv:2405.10739 [pdf, other]
-
Title: Efficient Multimodal Large Language Models: A SurveyAuthors: Yizhang Jin, Jian Li, Yexin Liu, Tianjun Gu, Kai Wu, Zhengkai Jiang, Muyang He, Bo Zhao, Xin Tan, Zhenye Gan, Yabiao Wang, Chengjie Wang, Lizhuang MaSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [155] arXiv:2405.10736 [pdf, other]
-
Title: StackOverflowVQA: Stack Overflow Visual Question Answering DatasetSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [156] arXiv:2405.10718 [pdf, other]
-
Title: SignLLM: Sign Languages Production Large Language ModelsComments: 33 pages, website at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [157] arXiv:2405.10707 [pdf, ps, other]
-
Title: HARIS: Human-Like Attention for Reference Image SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [158] arXiv:2405.10696 [pdf, other]
-
Title: Autonomous AI-enabled Industrial Sorting Pipeline for Advanced Textile RecyclingAuthors: Yannis Spyridis, Vasileios Argyriou, Antonios Sarigiannidis, Panagiotis Radoglou, Panagiotis SarigiannidisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [159] arXiv:2405.10690 [pdf, other]
-
Title: CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video ParsingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [160] arXiv:2405.10674 [pdf, other]
-
Title: From Sora What We Can See: A Survey of Text-to-Video GenerationAuthors: Rui Sun, Yumin Zhang, Tejal Shah, Jiahao Sun, Shuoying Zhang, Wenqi Li, Haoran Duan, Bo Wei, Rajiv RanjanComments: A comprehensive list of text-to-video generation studies in this survey is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [161] arXiv:2405.10612 [pdf, other]
-
Title: Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [162] arXiv:2405.10610 [pdf, other]
-
Title: Driving Referring Video Object Segmentation with Vision-Language Pre-trained ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [163] arXiv:2405.10598 [pdf, other]
-
Title: Learning Object-Centric Representation via Reverse Hierarchy GuidanceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [164] arXiv:2405.10591 [pdf, other]
-
Title: GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-SupervisionAuthors: Xin Tan, Wenbin Wu, Zhiwei Zhang, Chaojie Fan, Yong Peng, Zhizhong Zhang, Yuan Xie, Lizhuang MaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [165] arXiv:2405.10589 [pdf, other]
-
Title: Improving Point-based Crowd Counting and Localization Based on Auxiliary Point GuidanceSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
- [166] arXiv:2405.10577 [pdf, other]
-
Title: DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [167] arXiv:2405.10575 [pdf, other]
-
Title: Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence TheorySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [168] arXiv:2405.10567 [pdf, other]
-
Title: Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation TrackComments: ICRA 2024 RoboDrive Challenge Robust Map Segmentation Track 3rd Place Technical Report. arXiv admin note: text overlap with arXiv:2205.09743 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [169] arXiv:2405.10557 [pdf, other]
-
Title: Resolving Symmetry Ambiguity in Correspondence-based Methods for Instance-level Object Pose EstimationAuthors: Yongliang Lin, Yongzhi Su, Sandeep Inuganti, Yan Di, Naeem Ajilforoushan, Hanqing Yang, Yu Zhang, Jason RambachComments: 8 pages,10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [170] arXiv:2405.10554 [pdf, other]
-
Title: NeRO: Neural Road Surface ReconstructionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [171] arXiv:2405.10530 [pdf, other]
-
Title: CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic SegmentationComments: 5 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [172] arXiv:2405.10529 [pdf, other]
-
Title: Safeguarding Vision-Language Models Against Patched Visual Prompt InjectorsComments: 15 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [173] arXiv:2405.10518 [pdf, ps, other]
-
Title: Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [174] arXiv:2405.10508 [pdf, other]
-
Title: ART3D: 3D Gaussian Splatting for Text-Guided Artistic Scenes GenerationComments: Accepted at CVPR 2024 Workshop on AI3DGSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [175] arXiv:2405.10504 [pdf, ps, other]
-
Title: Multi-scale Semantic Prior Features Guided Deep Neural Network for Urban Street-view ImageSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [176] arXiv:2405.10489 [pdf, other]
-
Title: MixCut:A Data Augmentation Method for Facial Expression RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [177] arXiv:2405.10456 [pdf, other]
-
Title: Region-level labels in ice charts can produce pixel-level segmentation for Sea Ice typesComments: Published at ICLR 2024 Machine Learning for Remote Sensing (ML4RS) WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [178] arXiv:2405.10444 [pdf, other]
-
Title: A Novel Bounding Box Regression Method for Single Object TrackingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [179] arXiv:2405.10439 [pdf, other]
-
Title: Beyond Traditional Single Object Tracking: A SurveySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [180] arXiv:2405.10423 [pdf, other]
-
Title: Diversity-Aware Sign Language Production through a Pose Encoding Variational AutoencoderSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [181] arXiv:2405.10398 [pdf, other]
-
Title: Drone-type-Set: Drone types detection benchmark for drone detection and trackingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [182] arXiv:2405.10370 [pdf, other]
-
Title: Grounded 3D-LLM with Referent TokensAuthors: Yilun Chen, Shuai Yang, Haifeng Huang, Tai Wang, Ruiyuan Lyu, Runsen Xu, Dahua Lin, Jiangmiao PangComments: PreprintSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [183] arXiv:2405.10357 [pdf, other]
-
Title: RGB Guided ToF Imaging System: A Survey of Deep Learning-based MethodsComments: To appear on International Journal of Computer Vision (IJCV)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [184] arXiv:2405.10347 [pdf, other]
-
Title: Networking Systems for Video Anomaly Detection: A Tutorial and SurveyAuthors: Jing Liu, Yang Liu, Jieyu Lin, Jielin Li, Peng Sun, Bo Hu, Liang Song, Azzedine Boukerche, Victor C.M. LeungComments: Submitted to ACM Computing Surveys, under review,for more information and supplementary material, please see this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
- [185] arXiv:2405.10939 (cross-list from cs.LG) [pdf, other]
-
Title: DINO as a von Mises-Fisher mixture modelComments: Accepted to ICLR 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [186] arXiv:2405.10870 (cross-list from eess.IV) [pdf, other]
-
Title: Multicenter Privacy-Preserving Model Training for Deep Learning Brain Metastases AutosegmentationAuthors: Yixing Huang, Zahra Khodabakhshi, Ahmed Gomaa, Manuel Schmidt, Rainer Fietkau, Matthias Guckenberger, Nicolaus Andratschke, Christoph Bert, Stephanie Tanadini-Lang, Florian PutzComments: Submission to the Green Journal (Major Revision)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [187] arXiv:2405.10833 (cross-list from eess.IV) [pdf, other]
-
Title: Automatic segmentation of Organs at Risk in Head and Neck cancer patients from CT and MRI scansAuthors: Sébastien Quetin, Andrew Heschl, Mauricio Murillo, Murali Rohit, Shirin A. Enger, Farhad MalekiSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [188] arXiv:2405.10803 (cross-list from eess.IV) [pdf, other]
-
Title: A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells Detection with Morphological Attributes for ExplainabilityComments: Early AcceptSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [189] arXiv:2405.10754 (cross-list from math.OC) [pdf, other]
-
Title: Stable Phase Retrieval with Mirror DescentSubjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
- [190] arXiv:2405.10723 (cross-list from eess.IV) [pdf, other]
-
Title: Eddeep: Fast eddy-current distortion correction for diffusion MRI with deep learningAuthors: Antoine Legouhy, Ross Callaghan, Whitney Stee, Philippe Peigneux, Hojjat Azadbakht, Hui ZhangComments: submitted to MICCAI 2024Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [191] arXiv:2405.10705 (cross-list from eess.IV) [pdf, other]
-
Title: 3D Vessel Reconstruction from Sparse-View Dynamic DSA Images via Vessel Probability Guided Attenuation LearningAuthors: Zhentao Liu, Huangxuan Zhao, Wenhui Qin, Zhenghong Zhou, Xinggang Wang, Wenping Wang, Xiaochun Lai, Chuansheng Zheng, Dinggang Shen, Zhiming CuiComments: 12 pages, 13 figures, 5 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [192] arXiv:2405.10702 (cross-list from cs.CL) [pdf, ps, other]
-
Title: Empowering Prior to Court Legal Analysis: A Transparent and Accessible Dataset for Defensive Statement Classification and InterpretationSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [193] arXiv:2405.10691 (cross-list from eess.IV) [pdf, other]
-
Title: LoCI-DiffCom: Longitudinal Consistency-Informed Diffusion Model for 3D Infant Brain Image CompletionAuthors: Zihao Zhu, Tianli Tao, Yitian Tao, Haowen Deng, Xinyi Cai, Gaofeng Wu, Kaidong Wang, Haifeng Tang, Lixuan Zhu, Zhuoyang Gu, Jiawei Huang, Dinggang Shen, Han ZhangSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [194] arXiv:2405.10561 (cross-list from eess.IV) [pdf, other]
-
Title: Infrared Image Super-Resolution via Lightweight Information Split NetworkAuthors: Shijie Liu, Kang Yan, Feiwei Qin, Changmiao Wang, Ruiquan Ge, Kai Zhang, Jie Huang, Yong Peng, Jin CaoSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [195] arXiv:2405.10550 (cross-list from eess.IV) [pdf, other]
-
Title: LighTDiff: Surgical Endoscopic Image Low-Light Enhancement with T-DiffusionAuthors: Tong Chen, Qingcheng Lyu, Long Bai, Erjian Guo, Huxin Gao, Xiaoxiao Yang, Hongliang Ren, Luping ZhouSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [196] arXiv:2405.10531 (cross-list from cs.LG) [pdf, other]
-
Title: Nonparametric Teaching of Implicit Neural RepresentationsComments: ICML 2024 (24 pages, 13 figures)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [197] arXiv:2405.10497 (cross-list from cs.MM) [pdf, other]
-
Title: SMP Challenge: An Overview and Analysis of Social Media Prediction ChallengeAuthors: Bo Wu, Peiye Liu, Wen-Huang Cheng, Bei Liu, Zhaoyang Zeng, Jia Wang, Qiushi Huang, Jiebo LuoComments: ACM Multimedia. arXiv admin note: text overlap with arXiv:1910.01795Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Social and Information Networks (cs.SI)
Fri, 17 May 2024
- [198] arXiv:2405.10320 [pdf, other]
-
Title: Toon3D: Seeing Cartoons from a New PerspectiveAuthors: Ethan Weber, Riley Peterlinz, Rohan Mathur, Frederik Warburg, Alexei A. Efros, Angjoo KanazawaComments: Please see our project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [199] arXiv:2405.10317 [pdf, other]
-
Title: Text-to-Vector Generation with Neural Path RepresentationComments: Accepted by SIGGRAPH 2024. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [200] arXiv:2405.10316 [pdf, other]
-
Title: Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion ModelComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [201] arXiv:2405.10314 [pdf, other]
-
Title: CAT3D: Create Anything in 3D with Multi-View Diffusion ModelsAuthors: Ruiqi Gao, Aleksander Holynski, Philipp Henzler, Arthur Brussee, Ricardo Martin-Brualla, Pratul Srinivasan, Jonathan T. Barron, Ben PooleComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [202] arXiv:2405.10305 [pdf, other]
-
Title: 4D Panoptic Scene Graph GenerationAuthors: Jingkang Yang, Jun Cen, Wenxuan Peng, Shuai Liu, Fangzhou Hong, Xiangtai Li, Kaiyang Zhou, Qifeng Chen, Ziwei LiuComments: Accepted as NeurIPS 2023. Code: this https URL Previous Series: PSG this https URL and PVSG this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [203] arXiv:2405.10300 [pdf, other]
-
Title: Grounding DINO 1.5: Advance the "Edge" of Open-Set Object DetectionAuthors: Tianhe Ren, Qing Jiang, Shilong Liu, Zhaoyang Zeng, Wenlong Liu, Han Gao, Hongjie Huang, Zhengyu Ma, Xiaoke Jiang, Yihao Chen, Yuda Xiong, Hao Zhang, Feng Li, Peijun Tang, Kent Yu, Lei ZhangComments: Technical reportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [204] arXiv:2405.10286 [pdf, other]
-
Title: FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language modelsComments: Accepted at CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [205] arXiv:2405.10272 [pdf, other]
-
Title: Faces that Speak: Jointly Synthesising Talking Face and Speech from TextAuthors: Youngjoon Jang, Ji-Hoon Kim, Junseok Ahn, Doyeop Kwak, Hong-Sun Yang, Yoon-Cheol Ju, Il-Hwan Kim, Byeong-Yeol Kim, Joon Son ChungComments: CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
- [206] arXiv:2405.10266 [pdf, other]
-
Title: A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language SupervisionAuthors: Charles Raude, K R Prajwal, Liliane Momeni, Hannah Bull, Samuel Albanie, Andrew Zisserman, Gül VarolSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [207] arXiv:2405.10256 [pdf, other]
-
Title: Biasing & Debiasing based Approach Towards Fair Knowledge Transfer for Equitable Skin AnalysisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [208] arXiv:2405.10255 [pdf, other]
-
Title: When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language ModelsAuthors: Xianzheng Ma, Yash Bhalgat, Brandon Smart, Shuai Chen, Xinghui Li, Jian Ding, Jindong Gu, Dave Zhenyu Chen, Songyou Peng, Jia-Wang Bian, Philip H Torr, Marc Pollefeys, Matthias Nießner, Ian D Reid, Angel X. Chang, Iro Laina, Victor Adrian PrisacariuSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [209] arXiv:2405.10244 [pdf, ps, other]
-
Title: Towards Task-Compatible Compressible RepresentationsComments: To be published in ICME Workshops 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [210] arXiv:2405.10185 [pdf, other]
-
Title: DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataComments: Accepted to CVPR 2024, codes are available at \href{this https URL}{this https URL}Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [211] arXiv:2405.10175 [pdf, other]
-
Title: Filling Missing Values Matters for Range Image-Based Point Cloud SegmentationComments: This paper has been submitted to a journalSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [212] arXiv:2405.10160 [pdf, other]
-
Title: PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation LearningComments: 15 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [213] arXiv:2405.10148 [pdf, other]
-
Title: SpecDETR: A Transformer-based Hyperspectral Point Object Detection NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [214] arXiv:2405.10140 [pdf, other]
-
Title: Libra: Building Decoupled Vision System on Large Language ModelsComments: ICML2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [215] arXiv:2405.10132 [pdf, other]
-
Title: Cooperative Visual-LiDAR Extrinsic Calibration Technology for Intersection Vehicle-Infrastructure: A reviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [216] arXiv:2405.10122 [pdf, other]
-
Title: Generating Coherent Sequences of Visual Illustrations for Real-World Manual TasksAuthors: João Bordalo, Vasco Ramos, Rodrigo Valério, Diogo Glória-Silva, Yonatan Bitton, Michal Yarom, Idan Szpektor, Joao MagalhaesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [217] arXiv:2405.10082 [pdf, other]
-
Title: An Integrated Framework for Multi-Granular Explanation of Video SummarizationComments: Under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [218] arXiv:2405.10075 [pdf, other]
-
Title: HecVL: Hierarchical Video-Language Pretraining for Zero-shot Surgical Phase RecognitionComments: Accepted by MICCAI2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [219] arXiv:2405.10053 [pdf, other]
-
Title: SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object DetectionComments: Accepted as a conference paper (highlight) at CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [220] arXiv:2405.10046 [pdf, other]
-
Title: A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long DistanceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [221] arXiv:2405.10041 [pdf, other]
-
Title: Revealing Hierarchical Structure of Leaf Venations in Plant Science via Label-Efficient Segmentation: Dataset and MethodAuthors: Weizhen Liu, Ao Li, Ze Wu, Yue Li, Baobin Ge, Guangyu Lan, Shilin Chen, Minghe Li, Yunfei Liu, Xiaohui Yuan, Nanqing DongComments: Accepted by IJCAI2024, Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [222] arXiv:2405.10037 [pdf, other]
-
Title: Bilateral Event Mining and Complementary for Event Stream Super-ResolutionAuthors: Zhilin Huang, Quanmin Liang, Yijie Yu, Chujun Qin, Xiawu Zheng, Kai Huang, Zikun Zhou, Wenming YangComments: Accepted to CVPR2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [223] arXiv:2405.10030 [pdf, other]
-
Title: RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image DehazingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [224] arXiv:2405.10014 [pdf, other]
-
Title: Frequency-Domain Refinement with Multiscale Diffusion for Super ResolutionSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [225] arXiv:2405.10008 [pdf, other]
-
Title: Solving the enigma: Deriving optimal explanations of deep networksAuthors: Michail Mamalakis, Antonios Mamalakis, Ingrid Agartz, Lynn Egeland Mørch-Johnsen, Graham Murray, John Suckling, Pietro LioComments: keywords: XAI, neuroscience, brain, 3D, 2D, computer vision, classificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [226] arXiv:2405.09996 [pdf, other]
-
Title: Driving-Video Dehazing with Non-Aligned Regularization for Safety AssistanceComments: Accepted by CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [227] arXiv:2405.09985 [pdf, other]
-
Title: VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce MarketingComments: project page: this https URL;Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [228] arXiv:2405.09981 [pdf, other]
-
Title: Adversarial Robustness for Visual Grounding of Multimodal Large Language ModelsComments: ICLR 2024 Workshop on Reliable and Responsible Foundation ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [229] arXiv:2405.09976 [pdf, other]
-
Title: Language-Oriented Semantic Latent Representation for Image TransmissionAuthors: Giordano Cicchetti, Eleonora Grassucci, Jihong Park, Jinho Choi, Sergio Barbarossa, Danilo ComminielloComments: Under review at IEEE International Workshop on Machine Learning for Signal Processing (MLSP) 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [230] arXiv:2405.09964 [pdf, other]
-
Title: KPNDepth: Depth Estimation of Lane Images under Complex Rainy EnvironmentAuthors: Zhengxu ShiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [231] arXiv:2405.09955 [pdf, other]
-
Title: Dual-band feature selection for maturity classification of specialty crops by hyperspectral imagingComments: Preprint: Paper submitted to the special issue of "Computers and Electronics in Agriculture"Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [232] arXiv:2405.09942 [pdf, other]
-
Title: FPDIoU Loss: A Loss Function for Efficient Bounding Box Regression of Rotated Object DetectionComments: arXiv admin note: text overlap with arXiv:2307.07662, text overlap with arXiv:1902.09630 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [233] arXiv:2405.09934 [pdf, other]
-
Title: Detecting Domain Shift in Multiple Instance Learning for Digital Pathology Using Fréchet Domain DistanceSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [234] arXiv:2405.09933 [pdf, other]
-
Title: MiniMaxAD: A Lightweight Autoencoder for Feature-Rich Anomaly DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [235] arXiv:2405.09931 [pdf, other]
-
Title: Learning from Observer Gaze:Zero-Shot Attention Prediction Oriented by Human-Object Interaction RecognitionComments: Accepted by CVPR2024. Project HomePage: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [236] arXiv:2405.09924 [pdf, other]
-
Title: Infrared Adversarial Car StickersComments: Accepted by CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [237] arXiv:2405.09923 [pdf, other]
-
Title: NTIRE 2024 Restore Any Image Model (RAIM) in the Wild ChallengeAuthors: Jie Liang, Radu Timofte, Qiaosi Yi, Shuaizheng Liu, Lingchen Sun, Rongyuan Wu, Xindong Zhang, Hui Zeng, Lei ZhangSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [238] arXiv:2405.09922 [pdf, other]
-
Title: Cross-sensor self-supervised training and alignment for remote sensingAuthors: Valerio Marsocci (CEDRIC - VERTIGO, CNAM), Nicolas Audebert (CEDRIC - VERTIGO, CNAM, LaSTIG, IGN)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [239] arXiv:2405.09902 [pdf, other]
-
Title: Unveiling the Potential: Harnessing Deep Metric Learning to Circumvent Video Streaming EncryptionComments: Published in the WI-IAT 2023 proceedingsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
- [240] arXiv:2405.09883 [pdf, other]
-
Title: RoScenes: A Large-scale Multi-view 3D Dataset for Roadside PerceptionAuthors: Xiaosu Zhu, Hualian Sheng, Sijia Cai, Bing Deng, Shaopeng Yang, Qiao Liang, Ken Chen, Lianli Gao, Jingkuan Song, Jieping YeComments: Technical report. 32 pages, 21 figures, 13 tables. this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [241] arXiv:2405.09882 [pdf, other]
-
Title: DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy ProtectionComments: 16 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [242] arXiv:2405.09880 [pdf, other]
-
Title: Deep Learning-Based Quasi-Conformal Surface Registration for Partial 3D Faces Applied to Facial RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [243] arXiv:2405.09879 [pdf, other]
-
Title: Generative Unlearning for Any IdentityComments: 15 pages, 17 figures, 10 tables, CVPR 2024 PosterSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [244] arXiv:2405.09874 [pdf, other]
-
Title: Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent DiffusionAuthors: Xinyang Li, Zhangyu Lai, Linning Xu, Jianfei Guo, Liujuan Cao, Shengchuan Zhang, Bo Dai, Rongrong JiComments: Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [245] arXiv:2405.09873 [pdf, other]
-
Title: IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation ModelComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [246] arXiv:2405.09863 [pdf, other]
-
Title: Box-Free Model Watermarks Are Prone to Black-Box Removal AttacksSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [247] arXiv:2405.09858 [pdf, other]
-
Title: Towards Realistic Incremental Scenario in Class Incremental Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [248] arXiv:2405.09828 [pdf, other]
-
Title: PillarNeXt: Improving the 3D detector by introducing Voxel2Pillar feature encoding and extracting multi-scale featuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [249] arXiv:2405.09827 [pdf, other]
-
Title: Parallel Backpropagation for Shared-Feature VisualizationAuthors: Alexander Lappe, Anna Bognár, Ghazaleh Ghamkhari Nejad, Albert Mukovskiy, Lucas Martini, Martin A. Giese, Rufin VogelsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [250] arXiv:2405.09806 [pdf, other]
-
Title: MediSyn: Text-Guided Diffusion Models for Broad Medical 2D and 3D Image SynthesisSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [251] arXiv:2405.09789 [pdf, other]
-
Title: LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image InterpretationComments: Accepted by IJCAI'2024. The code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [252] arXiv:2405.09782 [pdf, other]
-
Title: Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object DetectionAuthors: Feiran Li, Qianqian Xu, Shilong Bao, Zhiyong Yang, Runmin Cong, Xiaochun Cao, Qingming HuangComments: This paper has been accepted by ICML2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [253] arXiv:2405.09777 [pdf, other]
-
Title: Rethinking Barely-Supervised Segmentation from an Unsupervised Domain Adaptation PerspectiveSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [254] arXiv:2405.09755 [pdf, other]
-
Title: Collision Avoidance Metric for 3D Camera EvaluationSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [255] arXiv:2405.09717 [pdf, other]
-
Title: From NeRFs to Gaussian Splats, and BackSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [256] arXiv:2405.09713 [pdf, other]
-
Title: SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World KnowledgeAuthors: Andong Wang, Bo Wu, Sunli Chen, Zhenfang Chen, Haotian Guan, Wei-Ning Lee, Li Erran Li, Chuang GanComments: CVPRSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [257] arXiv:2405.09707 [pdf, other]
-
Title: Point2SSM++: Self-Supervised Learning of Anatomical Shape Models from Point CloudsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [258] arXiv:2405.09697 [pdf, other]
-
Title: Weakly Supervised Bayesian Shape Modeling from Unsegmented Medical ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [259] arXiv:2405.09682 [pdf, other]
-
Title: Synth-to-Real Unsupervised Domain Adaptation for Instance SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [260] arXiv:2405.09588 [pdf, ps, other]
-
Title: Training Deep Learning Models with Hybrid Datasets for Robust Automatic Target Detection on real SAR imagesAuthors: Benjamin Camus, Théo Voillemin, Corentin Le Barbu, Jean-Christophe Louvigné (DGA.MI), Carole Belloni (DGA.MI), Emmanuel Vallée (DGA.MI)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Signal Processing (eess.SP)
- [261] arXiv:2405.09582 [pdf, other]
-
Title: AD-Aligning: Emulating Human-like Generalization for Cognitive Domain Adaptation in Deep LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [262] arXiv:2405.09550 [pdf, other]
-
Title: Mask-based Invisible Backdoor Attacks on Object DetectionAuthors: Shin Jeong JinComments: 7 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
- [263] arXiv:2405.10292 (cross-list from cs.AI) [pdf, other]
-
Title: Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement LearningAuthors: Yuexiang Zhai, Hao Bai, Zipeng Lin, Jiayi Pan, Shengbang Tong, Yifei Zhou, Alane Suhr, Saining Xie, Yann LeCun, Yi Ma, Sergey LevineSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [264] arXiv:2405.10262 (cross-list from cs.LG) [pdf, other]
-
Title: Two-Phase Dynamics of Interactions Explains the Starting Point of a DNN Learning Over-Fitted FeaturesSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [265] arXiv:2405.10254 (cross-list from eess.IV) [pdf, other]
-
Title: PRISM: A Multi-Modal Generative Foundation Model for Slide-Level HistopathologyAuthors: George Shaikovski, Adam Casson, Kristen Severson, Eric Zimmermann, Yi Kan Wang, Jeremy D. Kunz, Juan A. Retamero, Gerard Oakley, David Klimstra, Christopher Kanan, Matthew Hanna, Michal Zelechowski, Julian Viret, Neil Tenenholtz, James Hall, Nicolo Fusi, Razik Yousfi, Peter Hamilton, William A. Moye, Eugene Vorontsov, Siqi Liu, Thomas J. FuchsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [266] arXiv:2405.10246 (cross-list from eess.IV) [pdf, other]
-
Title: A Foundation Model for Brain Lesion Segmentation with Mixture of Modality ExpertsAuthors: Xinru Zhang, Ni Ou, Berke Doga Basaran, Marco Visentin, Mengyun Qiao, Renyang Gu, Cheng Ouyang, Yaou Liu, Paul M. Matthew, Chuyang Ye, Wenjia BaiComments: The work has been early accepted by MICCAI 2024Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [267] arXiv:2405.10068 (cross-list from eess.IV) [pdf, other]
-
Title: MrRegNet: Multi-resolution Mask Guided Convolutional Neural Network for Medical Image Registration with Large DeformationsComments: Accepted for publication at IEEE International Symposium on Biomedical Imaging (ISBI) 2024Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [268] arXiv:2405.10020 (cross-list from cs.RO) [pdf, other]
-
Title: Natural Language Can Help Bridge the Sim2Real GapComments: To appear in RSS 2024Subjects: Robotics (cs.RO); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [269] arXiv:2405.10004 (cross-list from eess.IV) [pdf, other]
-
Title: ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image DatasetAuthors: Johannes Rückert, Louise Bloch, Raphael Brüngel, Ahmad Idrissi-Yaghir, Henning Schäfer, Cynthia S. Schmidt, Sven Koitka, Obioma Pelka, Asma Ben Abacha, Alba G. Seco de Herrera, Henning Müller, Peter A. Horn, Felix Nensa, Christoph M. FriedrichComments: Major revision Scientific DataSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [270] arXiv:2405.09990 (cross-list from eess.IV) [pdf, other]
-
Title: Histopathology Foundation Models Enable Accurate Ovarian Cancer Subtype ClassificationSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [271] arXiv:2405.09959 (cross-list from eess.IV) [pdf, other]
-
Title: Patient-Specific Real-Time Segmentation in Trackerless Brain UltrasoundAuthors: Reuben Dorent, Erickson Torio, Nazim Haouchine, Colin Galvin, Sarah Frisken, Alexandra Golby, Tina Kapur, William WellsComments: Early accept at MICCAI 2024 - code available at: this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [272] arXiv:2405.09864 (cross-list from astro-ph.IM) [pdf, other]
-
Title: Solar multi-object multi-frame blind deconvolution with a spatially variant convolution neural emulatorAuthors: A. Asensio Ramos (IAC+ULL)Comments: 15 pages, 14 figures, accepted for publication in A&ASubjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV)
- [273] arXiv:2405.09851 (cross-list from eess.IV) [pdf, other]
-
Title: Region of Interest Detection in Melanocytic Skin Tumor Whole Slide Images -- Nevus & MelanomaAuthors: Yi Cui, Yao Li, Jayson R. Miedema, Sharon N. Edmiston, Sherif Farag, J.S. Marron, Nancy E. ThomasComments: 5 figures, NeurIPS 2022 WorkshopSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [274] arXiv:2405.09820 (cross-list from cs.LG) [pdf, other]
-
Title: Densely Distilling Cumulative Knowledge for Continual LearningComments: 12 pages; Continual Leanrning; Class-incremental Learning; Knowledge Distillation; ForgettingSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [275] arXiv:2405.09814 (cross-list from cs.GR) [pdf, other]
-
Title: Semantic Gesticulator: Semantics-Aware Co-Speech Gesture SynthesisComments: SIGGRAPH 2024 (Journal Track); Project page: this https URLSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [276] arXiv:2405.09798 (cross-list from cs.LG) [pdf, other]
-
Title: Many-Shot In-Context Learning in Multimodal Foundation ModelsAuthors: Yixing Jiang, Jeremy Irvin, Ji Hun Wang, Muhammad Ahmed Chaudhry, Jonathan H. Chen, Andrew Y. NgSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [277] arXiv:2405.09787 (cross-list from eess.IV) [pdf, other]
-
Title: Analysis of the BraTS 2023 Intracranial Meningioma Segmentation ChallengeAuthors: Dominic LaBella, Ujjwal Baid, Omaditya Khanna, Shan McBurney-Lin, Ryan McLean, Pierre Nedelec, Arif Rashid, Nourel Hoda Tahon, Talissa Altes, Radhika Bhalerao, Yaseen Dhemesh, Devon Godfrey, Fathi Hilal, Scott Floyd, Anastasia Janas, Anahita Fathi Kazerooni, John Kirkpatrick, Collin Kent, Florian Kofler, Kevin Leu, Nazanin Maleki, Bjoern Menze, Maxence Pajot, Zachary J. Reitman, Jeffrey D. Rudie, Rachit Saluja, Yury Velichko, Chunhao Wang, Pranav Warman, Maruf Adewole, Jake Albrecht, Udunna Anazodo, Syed Muhammad Anwar, Timothy Bergquist, Sully Francis Chen, Verena Chung, Gian-Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Nastaran Khalili, Juan Eugenio Iglesias, Zhifan Jiang, Elaine Johanson, Koen Van Leemput, Hongwei Bran Li, Marius George Linguraru, Xinyang Liu, Aria Mahtabfar, Zeke Meier, et al. (71 additional authors not shown)Comments: 16 pages, 11 tables, 10 figures, MICCAISubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [278] arXiv:2405.09716 (cross-list from eess.IV) [pdf, other]
-
Title: Illumination Histogram Consistency Metric for Quantitative Assessment of Video SequencesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [279] arXiv:2405.09711 (cross-list from cs.AI) [pdf, other]
-
Title: STAR: A Benchmark for Situated Reasoning in Real-World VideosComments: NeurIPSSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [280] arXiv:2405.09695 (cross-list from cs.HC) [pdf, other]
-
Title: Enhancing Saliency Prediction in Monitoring Tasks: The Role of Visual HighlightsSubjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
- [281] arXiv:2405.09601 (cross-list from physics.med-ph) [pdf, ps, other]
-
Title: Fully Automated OCT-based Tissue Screening SystemAuthors: Shaohua Pi, Razieh Ganjee, Lingyun Wang, Riley K. Arbuckle, Chengcheng Zhao, Jose A Sahel, Bingjie Wang, Yuanyuan ChenSubjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
- [282] arXiv:2405.09600 (cross-list from cs.LG) [pdf, other]
-
Title: Aggregate Representation Measure for Predictive Model ReusabilitySubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [283] arXiv:2405.09594 (cross-list from eess.IV) [pdf, other]
-
Title: Learning Generalized Medical Image Representations through Image-Graph Contrastive PretrainingComments: Accepted into Machine Learning for Health (ML4H) 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [284] arXiv:2405.09589 (cross-list from cs.LG) [pdf, other]
-
Title: Unveiling Hallucination in Text, Image, Video, and Audio Foundation Models: A Comprehensive SurveySubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [285] arXiv:2405.09586 (cross-list from eess.IV) [pdf, other]
-
Title: Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report GenerationSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [286] arXiv:2405.09558 (cross-list from eess.SP) [pdf, other]
-
Title: An EM Body Model for Device-Free Localization with Multiple Antenna Receivers: A First StudyJournal-ref: 2023 IEEE-APS Topical Conference on Antennas and Propagation in Wireless Communications (APWC)Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [287] arXiv:2405.09552 (cross-list from eess.IV) [pdf, other]
-
Title: ODFormer: Semantic Fundus Image Segmentation Using Transformer for Optic Nerve Head DetectionAuthors: Jiayi Wang, Yi-An Mao, Xiaoyu Ma, Sicen Guo, Yuting Shao, Xiao Lv, Wenting Han, Mark Christopher, Linda M. Zangwill, Yanlong Bi, Rui FanSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Thu, 16 May 2024
- [288] arXiv:2405.09546 [pdf, other]
-
Title: BEHAVIOR Vision Suite: Customizable Dataset Generation via SimulationAuthors: Yunhao Ge, Yihe Tang, Jiashu Xu, Cem Gokmen, Chengshu Li, Wensi Ai, Benjamin Jose Martinez, Arman Aydin, Mona Anvari, Ayush K Chakravarthy, Hong-Xing Yu, Josiah Wong, Sanjana Srivastava, Sharon Lee, Shengxin Zha, Laurent Itti, Yunzhu Li, Roberto Martín-Martín, Miao Liu, Pengchuan Zhang, Ruohan Zhang, Li Fei-Fei, Jiajun WuComments: CVPR 2024 (Highlight). Project website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [289] arXiv:2405.09544 [pdf, other]
-
Title: Classifying geospatial objects from multiview aerial imagery using semantic meshesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [290] arXiv:2405.09487 [pdf, other]
-
Title: Color Space Learning for Cross-Color Person Re-IdentificationComments: Accepted by ICME 2024 (Oral)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [291] arXiv:2405.09463 [pdf, other]
-
Title: Gaze-DETR: Using Expert Gaze to Reduce False Positives in Vulvovaginal Candidiasis ScreeningAuthors: Yan Kong, Sheng Wang, Jiangdong Cai, Zihao Zhao, Zhenrong Shen, Yonghao Li, Manman Fei, Qian WangComments: MICCAI-2024 early accept. Our code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [292] arXiv:2405.09459 [pdf, other]
-
Title: Fourier Boundary Features Network with Wider Catchers for Glass SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [293] arXiv:2405.09431 [pdf, other]
-
Title: A Survey On Text-to-3D Contents Generation In The WildAuthors: Chenhan JiangComments: 11 pages, 10 figures, 4 tables. arXiv admin note: text overlap with arXiv:2401.17807 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [294] arXiv:2405.09426 [pdf, other]
-
Title: Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated ImagesComments: 10 pages, 3 figures. Submitted to IEEE Transactions on Human-Machine SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [295] arXiv:2405.09409 [pdf, ps, other]
-
Title: Real-World Federated Learning in Radiology: Hurdles to overcome and Benefits to gainAuthors: Markus R. Bujotzek, Ünal Akünal, Stefan Denner, Peter Neher, Maximilian Zenk, Eric Frodl, Astha Jaiswal, Moon Kim, Nicolai R. Krekiehn, Manuel Nickel, Richard Ruppel, Marcus Both, Felix Döllinger, Marcel Opitz, Thorsten Persigehl, Jens Kleesiek, Tobias Penzkofer, Klaus Maier-Hein, Rickmer Braren, Andreas BucherSubjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
- [296] arXiv:2405.09404 [pdf, other]
-
Title: Time-Equivariant Contrastive Learning for Degenerative Disease Progression in Retinal OCTAuthors: Taha Emre, Arunava Chakravarty, Dmitrii Lachinov, Antoine Rivail, Ursula Schmidt-Erfurth, Hrvoje BogunovićComments: Accepted at MICCAI 2024 (early accept, top 11%)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [297] arXiv:2405.09403 [pdf, other]
-
Title: Identity Overlap Between Face Recognition Train/Test Data: Causing Optimistic Bias in Accuracy MeasurementSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [298] arXiv:2405.09365 [pdf, other]
-
Title: SARATR-X: A Foundation Model for Synthetic Aperture Radar Images Target RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [299] arXiv:2405.09355 [pdf, other]
-
Title: Vision-Based Neurosurgical Guidance: Unsupervised Localization and Camera-Pose PredictionAuthors: Gary Sarwin, Alessandro Carretta, Victor Staartjes, Matteo Zoli, Diego Mazzatenta, Luca Regli, Carlo Serra, Ender KonukogluComments: Early Accept at MICCAI 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [300] arXiv:2405.09342 [pdf, other]
-
Title: Progressive Depth Decoupling and Modulating for Flexible Depth CompletionComments: The article is accepted by IEEE Transactions on Instrumentation & MeasurementSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [301] arXiv:2405.09334 [pdf, other]
-
Title: Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark StudyComments: 23 pages, 9 Figures, 13 TablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
- [302] arXiv:2405.09333 [pdf, other]
-
Title: Application of Gated Recurrent Units for CT Trajectory OptimizationComments: 4 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [303] arXiv:2405.09321 [pdf, other]
-
Title: ReconBoost: Boosting Can Achieve Modality ReconcilementComments: This paper has been accepted by ICML2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
- [304] arXiv:2405.09291 [pdf, other]
-
Title: Sensitivity Decouple Learning for Image Compression Artifacts ReductionComments: Accepted by Transactions on Image ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
- [305] arXiv:2405.09288 [pdf, other]
-
Title: DeCoDEx: Confounder Detector Guidance for Improved Diffusion-based Counterfactual ExplanationsComments: Accepted to Medical Imaging with Deep Learning (MIDL) 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [306] arXiv:2405.09266 [pdf, other]
-
Title: Dance Any Beat: Blending Beats with Visuals in Dance Video GenerationComments: 11 pages, 6 figures, demo page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [307] arXiv:2405.09247 [pdf, other]
-
Title: Graph Neural Network based Handwritten Trajectories RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [308] arXiv:2405.09215 [pdf, other]
-
Title: Xmodel-VLM: A Simple Baseline for Multimodal Vision Language ModelSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [309] arXiv:2405.09194 [pdf, ps, other]
-
Title: Flexible image analysis for law enforcement agencies with deep neural networks to determine: where, who and whatAuthors: Henri Bouma, Bart Joosten, Maarten C Kruithof, Maaike H T de Boer, Alexandru Ginsca (LIST (CEA)), Benjamin Labbe (LIST (CEA)), Quoc T Vuong (LIST (CEA))Journal-ref: SPIE - Counterterrorism, Crime Fighting, Forensics, and Surveillance Technologies II, 2018, pp.27Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [310] arXiv:2405.09152 [pdf, other]
-
Title: Scalable Image Coding for Humans and Machines Using Feature Fusion NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [311] arXiv:2405.09150 [pdf, other]
-
Title: Curriculum Dataset DistillationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [312] arXiv:2405.09148 [pdf, ps, other]
-
Title: A Hierarchically Feature Reconstructed Autoencoder for Unsupervised Anomaly DetectionComments: 12 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [313] arXiv:2405.09138 [pdf, other]
-
Title: OpenGait: A Comprehensive Benchmark Study for Gait Recognition towards Better PracticalityAuthors: Chao Fan, Saihui Hou, Junhao Liang, Chuanfu Shen, Jingzhe Ma, Dongyang Jin, Yongzhen Huang, Shiqi YuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [314] arXiv:2405.09131 [pdf, other]
-
Title: RobustMVS: Single Domain Generalized Deep Multi-view StereoComments: Accepted to TCSVT. Code will be released at: this https URL Benchmark will be released at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [315] arXiv:2405.09125 [pdf, other]
-
Title: HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text RecognitionComments: 12 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [316] arXiv:2405.09114 [pdf, other]
-
Title: SOEDiff: Efficient Distillation for Small Object EditingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [317] arXiv:2405.09083 [pdf, other]
-
Title: RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image DehazingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [318] arXiv:2405.09059 [pdf, other]
-
Title: Task-adaptive Q-FaceComments: Ever submitted to ECCV2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [319] arXiv:2405.09056 [pdf, other]
-
Title: CTS: A Consistency-Based Medical Image Segmentation ModelSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [320] arXiv:2405.09054 [pdf, other]
-
Title: Dim Small Target Detection and Tracking: A Novel Method Based on Temporal Energy Selective Scaling and Trajectory AssociationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [321] arXiv:2405.09050 [pdf, other]
-
Title: 3D Shape Augmentation with Content-Aware Shape ResizingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [322] arXiv:2405.09045 [pdf, other]
-
Title: AMSNet: Netlist Dataset for AMS CircuitsAuthors: Zhuofu Tao, Yichen Shi, Yiru Huo, Rui Ye, Zonghang Li, Li Huang, Chen Wu, Na Bai, Zhiping Yu, Ting-Jung Lin, Lei HeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [323] arXiv:2405.09041 [pdf, other]
-
Title: Learning from Partial Label Proportions for Whole Slide Image SegmentationAuthors: Shinnosuke Matsuo, Daiki Suehiro, Seiichi Uchida, Hiroaki Ito, Kazuhiro Terada, Akihiko Yoshizawa, Ryoma BiseComments: Accepted at MICCAI2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [324] arXiv:2405.09032 [pdf, other]
-
Title: ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expression RecognitionComments: Accept by ICDAR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [325] arXiv:2405.09024 [pdf, other]
-
Title: Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy LabelsAuthors: Guozhang Liu, Ting Liu, Mengke Yuan, Tao Pang, Guangxing Yang, Hao Fu, Tao Wang, Tongkui LiaoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [326] arXiv:2405.09006 [pdf, other]
-
Title: Spatial Semantic Recurrent Mining for Referring Image SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [327] arXiv:2405.08996 [pdf, other]
-
Title: Learning Correspondence for Deformable ObjectsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [328] arXiv:2405.08992 [pdf, other]
-
Title: Contextual Emotion Recognition using Large Vision Language ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [329] arXiv:2405.08991 [pdf, other]
-
Title: Theoretical Analysis for Expectation-Maximization-Based Multi-Model 3D RegistrationComments: arXiv admin note: substantial text overlap with arXiv:2402.10865Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [330] arXiv:2405.08961 [pdf, other]
-
Title: Bird's-Eye View to Street-View: A SurveySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [331] arXiv:2405.08932 [pdf, other]
-
Title: Self-supervised vision-langage alignment of deep learning representations for bone X-rays analysisSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [332] arXiv:2405.08911 [pdf, other]
-
Title: CLIP with Quality Captions: A Strong Pretraining for Vision TasksSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [333] arXiv:2405.08909 [pdf, other]
-
Title: ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and AssociationComments: 14 pages, 3 figures, accepted by CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [334] arXiv:2405.08890 [pdf, other]
-
Title: Language-Guided Self-Supervised Video Summarization Using Text Semantic Matching Considering the Diversity of the VideoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [335] arXiv:2405.09539 (cross-list from eess.IV) [pdf, ps, other]
-
Title: MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal CancerComments: Early accepted to MICCAI 2024 (6/6/5)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [336] arXiv:2405.09530 (cross-list from cs.CY) [pdf, other]
-
Title: A community palm modelAuthors: Nicholas Clinton, Andreas Vollrath, Remi D'annunzio, Desheng Liu, Henry B. Glick, Adrià Descals, Alicia Sullivan, Oliver Guinan, Jacob Abramowitz, Fred Stolle, Chris Goodman, Tanya Birch, David Quinn, Olga Danylo, Tijs Lips, Daniel Coelho, Enikoe Bihari, Bryce Cronkite-Ratcliff, Ate Poortinga, Atena Haghighattalab, Evan Notman, Michael DeWitt, Aaron Yonas, Gennadii Donchyts, Devaja Shah, David Saah, Karis Tenneson, Nguyen Hanh Quyen, Megha Verma, Andrew WilcoxComments: v0Subjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [337] arXiv:2405.09472 (cross-list from eess.IV) [pdf, other]
-
Title: Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality AssessmentComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [338] arXiv:2405.09353 (cross-list from eess.IV) [pdf, other]
-
Title: Large coordinate kernel attention network for lightweight image super-resolutionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [339] arXiv:2405.09298 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Deep Blur Multi-Model (DeepBlurMM) -- a strategy to mitigate the impact of image blur on deep learning model performance in histopathology image analysisSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [340] arXiv:2405.09286 (cross-list from cs.MM) [pdf, other]
-
Title: MVBIND: Self-Supervised Music Recommendation For Videos Via Embedding Space BindingSubjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
- [341] arXiv:2405.09077 (cross-list from eess.IV) [pdf, other]
-
Title: Compressive Feature Selection for Remote Visual Multi-Task InferenceComments: 6 pages, 8 figures, IEEE ICME Workshop on Coding for MachinesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [342] arXiv:2405.09049 (cross-list from cs.LG) [pdf, other]
-
Title: Perception Without Vision for Trajectory Prediction: Ego Vehicle Dynamics as Scene Representation for Efficient Active Learning in Autonomous DrivingSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [343] arXiv:2405.08981 (cross-list from cs.HC) [pdf, other]
-
Title: Impact of Design Decisions in Scanpath ModelingComments: 16 pagesSubjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [344] arXiv:2405.08920 (cross-list from cs.LG) [pdf, other]
-
Title: Neural Collapse Meets Differential Privacy: Curious Behaviors of NoisyGD with Near-perfect Representation LearningComments: To appear in ICML 2024Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Wed, 15 May 2024
- [345] arXiv:2405.08816 [pdf, other]
-
Title: The RoboDrive Challenge: Drive Anytime Anywhere in Any ConditionAuthors: Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Yaru Niu, Wei Tsang Ooi, Benoit R. Cottereau, Lai Xing Ng, Yuexin Ma, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Weichao Qiu, Wei Zhang, Xu Cao, Hao Lu, Ying-Cong Chen, Caixin Kang, Xinning Zhou, Chengyang Ying, Wentao Shang, Xingxing Wei, Yinpeng Dong, Bo Yang, Shengyin Jiang, Zeliang Ma, Dengyi Ji, Haiwen Li, Xingliang Huang, Yu Tian, Genghua Kou, Fan Jia, Yingfei Liu, Tiancai Wang, Ying Li, Xiaoshuai Hao, Yifan Yang, Hui Zhang, Mengchuan Wei, Yi Zhou, Haimei Zhao, Jing Zhang, Jinke Li, Xiao He, Xiaoqiang Cheng, Bingyang Zhang, Lirong Zhao, Dianlei Ding, Fangsheng Liu, Yixiang Yan, Hongming Wang, Nanfei Ye, Lun Luo, Yubo Tian, Yiwei Zuo, Zhe Cao, Yi Ren, Yunfan Li, Wenjie Liu, Xun Wu, Yifan Mao, Ming Li, Jian Liu, Jiayang Liu, Zihan Qin, Cunxi Chu, et al. (25 additional authors not shown)Comments: ICRA 2024; 31 pages, 24 figures, 5 tables; Code at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [346] arXiv:2405.08815 [pdf, other]
-
Title: Efficient Vision-Language Pre-training by Cluster MaskingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [347] arXiv:2405.08813 [pdf, other]
-
Title: CinePile: A Long Video Question Answering Dataset and BenchmarkAuthors: Ruchit Rawal, Khalid Saifullah, Ronen Basri, David Jacobs, Gowthami Somepalli, Tom GoldsteinComments: Project page with all the artifacts - this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
- [348] arXiv:2405.08807 [pdf, other]
-
Title: SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure InterpretationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [349] arXiv:2405.08794 [pdf, other]
-
Title: Ambiguous Annotations: When is a Pedestrian not a Pedestrian?Comments: Paper accepted at the CVPR 2024 Vision and Language for Autonomous Driving and Robotics WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [350] arXiv:2405.08786 [pdf, other]
-
Title: Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS ScoringAuthors: Tiantian Zhang, Manxi Lin, Hongda Guo, Xiaofan Zhang, Ka Fung Peter Chiu, Aasa Feragen, Qi DouSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [351] arXiv:2405.08780 [pdf, ps, other]
-
Title: Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modelingAuthors: Gregory Holste, Mingquan Lin, Ruiwen Zhou, Fei Wang, Lei Liu, Qi Yan, Sarah H. Van Tassel, Kyle Kovacs, Emily Y. Chew, Zhiyong Lu, Zhangyang Wang, Yifan PengSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [352] arXiv:2405.08776 [pdf, ps, other]
-
Title: FolkTalent: Enhancing Classification and Tagging of Indian Folk PaintingsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [353] arXiv:2405.08768 [pdf, other]
-
Title: EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone TrainingComments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). Journal version of arXiv:2211.09703 (ICCV 2023). Code is available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [354] arXiv:2405.08765 [pdf, other]
-
Title: Image to Pseudo-Episode: Boosting Few-Shot Segmentation by Unlabeled DataSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [355] arXiv:2405.08748 [pdf, other]
-
Title: Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese UnderstandingAuthors: Zhimin Li, Jianwei Zhang, Qin Lin, Jiangfeng Xiong, Yanxin Long, Xinchi Deng, Yingfang Zhang, Xingchao Liu, Minbin Huang, Zedong Xiao, Dayou Chen, Jiajun He, Jiahao Li, Wenyue Li, Chen Zhang, Rongwei Quan, Jianxiang Lu, Jiabin Huang, Xiaoyan Yuan, Xiaoxiao Zheng, Yixuan Li, Jihong Zhang, Chao Zhang, Meng Chen, Jie Liu, Zheng Fang, Weiyan Wang, Jinbao Xue, Yangyu Tao, Jianchen Zhu, Kai Liu, Sihuan Lin, Yifu Sun, Yun Li, Dongdong Wang, Mingtao Chen, Zhichao Hu, Xiao Xiao, Yan Chen, Yuhong Liu, Wei Liu, Di Wang, Yong Yang, Jie Jiang, Qinglin LuComments: Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [356] arXiv:2405.08720 [pdf, other]
-
Title: The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling PerspectiveComments: To appear at CVPR 2024 Workshop on AI for Content Creation (AI4CC)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [357] arXiv:2405.08717 [pdf, other]
-
Title: How Much You Ate? Food Portion Estimation on SpoonsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [358] arXiv:2405.08715 [pdf, other]
-
Title: DeVOS: Flow-Guided Deformable Transformer for Video Object SegmentationAuthors: Volodymyr Fedynyak, Yaroslav Romanus, Bohdan Hlovatskyi, Bohdan Sydor, Oles Dobosevych, Igor Babin, Roman RiazantsevSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [359] arXiv:2405.08695 [pdf, other]
-
Title: The impact of Compositionality in Zero-shot Multi-label action recognition for Object-based tasksSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [360] arXiv:2405.08681 [pdf, other]
-
Title: Achieving Fairness Through Channel Pruning for Dermatological Disease DiagnosisComments: 13 pages, 3 figures, early accepted by International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [361] arXiv:2405.08668 [pdf, other]
-
Title: Promoting AI Equity in Science: Generalized Domain Prompt Learning for Accessible VLM ResearchSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP)
- [362] arXiv:2405.08609 [pdf, other]
-
Title: Dynamic NeRF: A ReviewAuthors: Jinwei LinComments: 25 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [363] arXiv:2405.08593 [pdf, other]
-
Title: Open-Vocabulary Object Detection via Neighboring Region Attention AlignmentSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [364] arXiv:2405.08589 [pdf, other]
-
Title: Variable Substitution and Bilinear Programming for Aligning Partially Overlapping Point SetsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [365] arXiv:2405.08587 [pdf, other]
-
Title: EchoTracker: Advancing Myocardial Point Tracking in EchocardiographyAuthors: Md Abulkalam Azad, Artem Chernyshov, John Nyberg, Ingrid Tveten, Lasse Lovstakken, Håvard Dalen, Bjørnar Grenne, Andreas ØstvikComments: Submitted version that got provisionally (early) accepted (top 11%) to MICCAI2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [366] arXiv:2405.08586 [pdf, other]
-
Title: Cross-Domain Feature Augmentation for Domain GeneralizationComments: Accepted to the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024); Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [367] arXiv:2405.08578 [pdf, ps, other]
-
Title: Local-peak scale-invariant feature transform for fast and random image stitchingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [368] arXiv:2405.08555 [pdf, other]
-
Title: Dual-Branch Network for Portrait Image Quality AssessmentAuthors: Wei Sun, Weixia Zhang, Yanwei Jiang, Haoning Wu, Zicheng Zhang, Jun Jia, Yingjie Zhou, Zhongpeng Ji, Xiongkuo Min, Weisi Lin, Guangtao ZhaiSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [369] arXiv:2405.08547 [pdf, other]
-
Title: Exploring Graph-based Knowledge: Multi-Level Feature Distillation via Channels Relational GraphSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [370] arXiv:2405.08533 [pdf, other]
-
Title: Dynamic Feature Learning and Matching for Class-Incremental LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [371] arXiv:2405.08493 [pdf, ps, other]
-
Title: Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental StudySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [372] arXiv:2405.08487 [pdf, other]
-
Title: Semantic Contextualization of Face Forgery: A New Definition, Dataset, and Detection MethodSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [373] arXiv:2405.08483 [pdf, other]
-
Title: RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D ImagesComments: Accepted by CVPR Workshop DLGC, 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [374] arXiv:2405.08463 [pdf, other]
-
Title: A Timely Survey on Vision Transformer for Deepfake DetectionAuthors: Zhikan Wang, Zhongyao Cheng, Jiajie Xiong, Xun Xu, Tianrui Li, Bharadwaj Veeravalli, Xulei YangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [375] arXiv:2405.08458 [pdf, other]
-
Title: Rethinking Prior Information Generation with CLIP for Few-Shot SegmentationComments: Accepted by CVPR 2024; The camera-ready versionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [376] arXiv:2405.08434 [pdf, other]
-
Title: TP3M: Transformer-based Pseudo 3D Image Matching with ReferenceComments: Accepted by ICRA 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [377] arXiv:2405.08429 [pdf, other]
-
Title: TEDNet: Twin Encoder Decoder Neural Network for 2D Camera and LiDAR Road DetectionAuthors: Martín Bayón-Gutiérrez, María Teresa García-Ordás, Héctor Alaiz Moretón, Jose Aveleira-Mata, Sergio Rubio Martín, José Alberto Benítez-AndradesComments: Source code: this https URLJournal-ref: M Bay\'on-Guti\'errez, MT Garc\'ia-Ord\'as, H Alaiz Moret\'on, J Aveleira-Mata, S Rubio-Mart\'in, JA Ben\'itez-Andrades. TEDNet: Twin Encoder Decoder Neural Network for 2D Camera and LiDAR Road Detection. Logic Journal of the IGPL. 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [378] arXiv:2405.08419 [pdf, other]
-
Title: WaterMamba: Visual State Space Model for Underwater Image EnhancementComments: arXiv admin note: substantial text overlap with arXiv:2403.06098Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [379] arXiv:2405.08344 [pdf, other]
-
Title: No Time to Waste: Squeeze Time into Channel for Mobile Video UnderstandingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [380] arXiv:2405.08337 [pdf, ps, other]
-
Title: Perivascular space Identification Nnunet for Generalised Usage (PINGU)Authors: Benjamin Sinclair, Lucy Vivash, Jasmine Moses, Miranda Lynch, William Pham, Karina Dorfman, Cassandra Marotta, Shaun Koh, Jacob Bunyamin, Ella Rowsthorn, Alex Jarema, Himashi Peiris, Zhaolin Chen, Sandy R Shultz, David K Wright, Dexiao Kong, Sharon L. Naismith, Terence J. OBrien, Meng LawSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [381] arXiv:2405.08329 [pdf, other]
-
Title: Cross-Dataset Generalization For Retinal Lesions SegmentationComments: 6 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [382] arXiv:2405.08322 [pdf, other]
-
Title: StraightPCF: Straight Point Cloud FilteringComments: This paper has been accepted to the IEEE/CVF CVPR Conference, 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [383] arXiv:2405.08300 [pdf, other]
-
Title: Vector-Symbolic Architecture for Event-Based Optical FlowSubjects: Computer Vision and Pattern Recognition (cs.CV); Symbolic Computation (cs.SC)
- [384] arXiv:2405.08272 [pdf, other]
-
Title: VS-Assistant: Versatile Surgery Assistant on the Demand of SurgeonsAuthors: Zhen Chen, Xingjian Luo, Jinlin Wu, Danny T.M. Chan, Zhen Lei, Jinqiao Wang, Sebastien Ourselin, Hongbin LiuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [385] arXiv:2405.08270 [pdf, other]
-
Title: Towards Clinician-Preferred Segmentation: Leveraging Human-in-the-Loop for Test Time Adaptation in Medical Image SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [386] arXiv:2405.08263 [pdf, other]
-
Title: Palette-based Color Transfer between ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [387] arXiv:2405.08251 [pdf, other]
-
Title: Multimodal Collaboration Networks for Geospatial Vehicle Detection in Dense, Occluded, and Large-Scale EventsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [388] arXiv:2405.08246 [pdf, other]
-
Title: Compositional Text-to-Image Generation with Dense Blob RepresentationsComments: ICML 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [389] arXiv:2405.08245 [pdf, ps, other]
-
Title: Progressive enhancement and restoration for mural images under low-light and defected conditions based on multi-receptive field strategySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [390] arXiv:2405.08210 [pdf, other]
-
Title: Infinite Texture: Text-guided High Resolution Diffusion Texture SynthesisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [391] arXiv:2405.08204 [pdf, other]
-
Title: A Semantic and Motion-Aware Spatiotemporal Transformer Network for Action DetectionComments: IEEE Transactions on Pattern Analysis and Machine Intelligence (2024)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [392] arXiv:2405.08197 [pdf, other]
-
Title: IHC Matters: Incorporating IHC analysis to H&E Whole Slide Image Analysis for Improved Cancer Grading via Two-stage Multimodal Bilinear Pooling FusionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [393] arXiv:2405.08114 [pdf, other]
-
Title: RATLIP: Generative Adversarial CLIP Text-to-Image Synthesis Based on Recurrent Affine TransformationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [394] arXiv:2405.08055 [pdf, other]
-
Title: DiffTF++: 3D-aware Diffusion Transformer for Large-Vocabulary 3D GenerationComments: arXiv admin note: substantial text overlap with arXiv:2309.07920Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [395] arXiv:2405.08766 (cross-list from cs.LG) [pdf, other]
-
Title: Energy-based Hopfield Boosting for Out-of-Distribution DetectionSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [396] arXiv:2405.08745 (cross-list from eess.IV) [pdf, other]
-
Title: Enhancing Blind Video Quality Assessment with Rich Quality-aware FeaturesAuthors: Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao ZhaiSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [397] arXiv:2405.08733 (cross-list from cs.GR) [pdf, other]
-
Title: A Simple Approach to Differentiable Rendering of SDFsSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [398] arXiv:2405.08672 (cross-list from eess.IV) [pdf, other]
-
Title: EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic CameraComments: early accepted by MICCAI 2024Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [399] arXiv:2405.08658 (cross-list from eess.IV) [pdf, other]
-
Title: Beyond the Black Box: Do More Complex Models Provide Superior XAI Explanations?Comments: 15 pages, 9 figures, 5 tablesSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [400] arXiv:2405.08657 (cross-list from eess.IV) [pdf, other]
-
Title: Self-supervised learning improves robustness of deep learning lung tumor segmentation to CT imaging differencesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [401] arXiv:2405.08654 (cross-list from cs.LG) [pdf, other]
-
Title: Can we Defend Against the Unknown? An Empirical Study About Threshold Selection for Neural Network MonitoringComments: 13 pages, 5 figures, 6 tables. To appear in the proceedings of the 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [402] arXiv:2405.08621 (cross-list from eess.IV) [pdf, other]
-
Title: RMT-BVQA: Recurrent Memory Transformer-based Blind Video Quality Assessment for Enhanced Video ContentComments: 8pages, 2figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [403] arXiv:2405.08576 (cross-list from cs.RO) [pdf, other]
-
Title: Hearing Touch: Audio-Visual Pretraining for Contact-Rich ManipulationComments: Accepted to ICRA 2024Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [404] arXiv:2405.08556 (cross-list from eess.IV) [pdf, other]
-
Title: Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentationAuthors: Rezkellah Noureddine Khiati, Pierre-Yves Brillet, Aurélien Justet, Radu Ispa, Catalin FetitaComments: 14 pages, 7 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [405] arXiv:2405.08431 (cross-list from eess.IV) [pdf, other]
-
Title: Similarity Metrics for MR Image-To-Image TranslationComments: 29 pages, 6 figures, appendix with 5 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [406] arXiv:2405.08423 (cross-list from eess.IV) [pdf, other]
-
Title: NAFRSSR: a Lightweight Recursive Network for Efficient Stereo Image Super-ResolutionAuthors: Yihong Chen, Zhen Fan, Shuai Dong, Zhiwei Chen, Wenjie Li, Minghui Qin, Min Zeng, Xubing Lu, Guofu Zhou, Xingsen Gao, Jun-Ming LiuSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [407] arXiv:2405.08363 (cross-list from cs.CR) [pdf, other]
-
Title: UnMarker: A Universal Attack on Defensive WatermarkingSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [408] arXiv:2405.08340 (cross-list from cs.CR) [pdf, other]
-
Title: Achieving Resolution-Agnostic DNN-based Image Watermarking:A Novel Perspective of Implicit Neural RepresentationSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [409] arXiv:2405.08297 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Distance-Restricted Explanations: Theoretical Underpinnings & Efficient ImplementationAuthors: Yacine Izza, Xuanxiang Huang, Antonio Morgado, Jordi Planes, Alexey Ignatiev, Joao Marques-SilvaSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
- [410] arXiv:2405.08282 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Automatic Segmentation of the Kidneys and Cystic Renal Lesions on Non-Contrast CT Using a Convolutional Neural NetworkAuthors: Lucas Aronson (1), Ruben Ngnitewe Massaa (1), Syed Jamal Safdar Gardezi (1), Andrew L. Wentland (1,2,3) ((1) Department of Radiology, University of Wisconsin School of Medicine & Public Health, Madison, WI, USA, (2) Department of Medical Physics, University of Wisconsin School of Medicine & Public Health, Madison, WI, USA, (3) Department of Biomedical Engineering, University of Wisconsin School of Medicine & Public Health, Madison, WI, USA)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [411] arXiv:2405.08275 (cross-list from math.OC) [pdf, other]
-
Title: Power of $\ell_1$-Norm Regularized Kaczmarz Algorithms for High-Order Tensor RecoveryComments: arXiv admin note: text overlap with arXiv:2311.00783Subjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
- [412] arXiv:2405.08209 (cross-list from cs.CY) [pdf, other]
-
Title: Who's in and who's out? A case study of multimodal CLIP-filtering in DataCompComments: Content warning: This paper discusses societal stereotypes and sexually-explicit material that may be disturbing, distressing, and/or offensive to the readerSubjects: Computers and Society (cs.CY); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [413] arXiv:2405.08169 (cross-list from eess.IV) [pdf, other]
-
Title: Rethinking Histology Slide Digitization Workflows for Low-Resource SettingsComments: MICCAI 2024 Early Accept. First four authors contributed equallySubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [414] arXiv:2405.08119 (cross-list from eess.SY) [pdf, other]
-
Title: GPS-IMU Sensor Fusion for Reliable Autonomous Vehicle Position EstimationAuthors: Simegnew Yihunie AlabaComments: 6 pages, 4 figures, and conferenceSubjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [415] arXiv:2405.08054 (cross-list from cs.GR) [pdf, other]
-
Title: Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided ConditioningAuthors: Wenqi Dong, Bangbang Yang, Lin Ma, Xiao Liu, Liyuan Cui, Hujun Bao, Yuewen Ma, Zhaopeng CuiComments: Project webpage: this https URLSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [416] arXiv:2405.08049 (cross-list from eess.IV) [pdf, other]
-
Title: Optimizing Synthetic Correlated Diffusion Imaging for Breast Cancer Tumour DelineationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [417] arXiv:2405.08042 (cross-list from cs.HC) [pdf, other]
-
Title: LLAniMAtion: LLAMA Driven Gesture AnimationSubjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [418] arXiv:2405.08038 (cross-list from cs.LG) [pdf, other]
-
Title: Feature Expansion and enhanced Compression for Class Incremental LearningAuthors: Quentin Ferdinand (ENSTA Bretagne, Lab-STICC\_MATRIX), Gilles Le Chenadec (ENSTA Bretagne, Lab-STICC\_MATRIX), Benoit Clement (CROSSING, ENSTA Bretagne, Lab-STICC\_MATRIX), Panagiotis Papadakis (Lab-STICC\_RAMBO, IMT Atlantique - INFO), Quentin OliveauSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [419] arXiv:2405.08020 (cross-list from cs.LG) [pdf, other]
-
Title: ReActXGB: A Hybrid Binary Convolutional Neural Network Architecture for Improved Performance and Computational EfficiencyComments: Accepted to ICCE-TW 2024Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [420] arXiv:2405.07994 (cross-list from eess.IV) [pdf, ps, other]
-
Title: BubbleID: A Deep Learning Framework for Bubble Interface Dynamics AnalysisComments: 16 pages, 4 figuresSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[ showing up to 553 entries per page: fewer | more ]
Disable MathJax (What is MathJax?)
Links to: arXiv, form interface, find, cs, new, 2405, contact, help (Access key information)