Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 36

[ total of 679 entries: 1-100 | 37-136 | 137-236 | 237-336 | 337-436 | ... | 637-679 ]

Wed, 5 Jun 2024 (continued, showing last 66 of 102 entries)

[37]  arXiv:2406.02223 [pdf, other]
Title: SMCL: Saliency Masked Contrastive Learning for Long-tailed Recognition
Comments: accepted at ICASSP 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[38]  arXiv:2406.02208 [pdf, other]
Title: Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
Comments: IJCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[39]  arXiv:2406.02202 [pdf, other]
Title: Can CLIP help CLIP in learning 3D?
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[40]  arXiv:2406.02184 [pdf, other]
Title: GraVITON: Graph based garment warping with attention guided inversion for Virtual-tryon
Comments: 18 pages, 7 Figures and 6 Tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41]  arXiv:2406.02158 [pdf, other]
Title: Radar Spectra-Language Model for Automotive Scene Parsing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[42]  arXiv:2406.02153 [pdf, other]
Title: Analyzing the Feature Extractor Networks for Face Image Synthesis
Comments: Accepted at 18th International Conference on Automatic Face and Gesture Recognition (FG) on 1st SD-FGA Workshop 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[43]  arXiv:2406.02147 [pdf, other]
Title: UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[44]  arXiv:2406.02142 [pdf, other]
Title: Analyzing the Effect of Combined Degradations on Face Recognition
Comments: Accepted at 18th International Conference on Automatic Face and Gesture Recognition (FG) on 2nd PrivAAL Workshop 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[45]  arXiv:2406.02125 [pdf, other]
Title: Domain Game: Disentangle Anatomical Feature for Single Domain Generalized Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[46]  arXiv:2406.02074 [pdf, other]
Title: FaceCom: Towards High-fidelity 3D Facial Shape Completion via Optimization and Inpainting Guidance
Comments: accepted to CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[47]  arXiv:2406.02058 [pdf, other]
Title: OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding
Comments: technical report, 15 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[48]  arXiv:2406.02038 [pdf, other]
Title: Leveraging Predicate and Triplet Learning for Scene Graph Generation
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[49]  arXiv:2406.02037 [pdf, ps, other]
Title: Multi-Scale Direction-Aware Network for Infrared Small Target Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50]  arXiv:2406.02021 [pdf, other]
Title: MetaMixer Is All You Need
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[51]  arXiv:2406.01994 [pdf, other]
Title: 3D Imaging of Complex Specular Surfaces by Fusing Polarimetric and Deflectometric Information
Subjects: Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
[52]  arXiv:2406.01987 [pdf, other]
Title: Dealing with All-stage Missing Modality: Towards A Universal Model with Robust Reconstruction and Personalization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[53]  arXiv:2406.01970 [pdf, other]
Title: The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[54]  arXiv:2406.01956 [pdf, other]
Title: Enhance Image-to-Image Generation with LLaVA Prompt and Negative Prompt
Comments: Accepted by 2024 5th International Conference on Information Science, Parallel and Distributed Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[55]  arXiv:2406.01954 [pdf, other]
Title: Plug-and-Play Diffusion Distillation
Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[56]  arXiv:2406.01938 [pdf, other]
Title: Nutrition Estimation for Dietary Management: A Transformer Approach with Depth Sensing
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[57]  arXiv:2406.01932 [pdf, other]
Title: Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning
Comments: 7 pages, 5 figures. Submitted to the 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[58]  arXiv:2406.01920 [pdf, other]
Title: CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[59]  arXiv:2406.01917 [pdf, other]
Title: GOMAA-Geo: GOal Modality Agnostic Active Geo-localization
Comments: 23 pages, 17 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[60]  arXiv:2406.01916 [pdf, other]
Title: FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[61]  arXiv:2406.01914 [pdf, other]
Title: HPE-CogVLM: New Head Pose Grounding Task Exploration on Vision Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[62]  arXiv:2406.01906 [pdf, other]
Title: ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization
Authors: Chen Mao, Jingqi Hu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[63]  arXiv:2406.01900 [pdf, other]
Title: Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[64]  arXiv:2406.01894 [pdf, other]
Title: SVASTIN: Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[65]  arXiv:2406.01884 [pdf, other]
Title: Rank-based No-reference Quality Assessment for Face Swapping
Comments: 8 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[66]  arXiv:2406.01869 [pdf, ps, other]
Title: Fruit Classification System with Deep Learning and Neural Architecture Search
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[67]  arXiv:2406.01867 [pdf, other]
Title: MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Comments: 12 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[68]  arXiv:2406.01843 [pdf, other]
Title: L-MAGIC: Language Model Assisted Generation of Images with Coherence
Comments: accepted to CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[69]  arXiv:2406.01837 [pdf, other]
Title: Boosting Vision-Language Models with Transduction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[70]  arXiv:2406.01820 [pdf, other]
Title: Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning
Comments: Accepted CVPR 2024 - this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[71]  arXiv:2406.01815 [pdf, ps, other]
Title: Deep asymmetric mixture model for unsupervised cell segmentation
Authors: Yang Nan, Guang Yang
Comments: 5 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72]  arXiv:2406.01797 [pdf, other]
Title: The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry
Comments: Accepted to CoLLAs 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[73]  arXiv:2406.01791 [pdf, other]
Title: Hybrid-Learning Video Moment Retrieval across Multi-Domain Labels
Comments: Accepted by BMVC2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74]  arXiv:2406.01765 [pdf, other]
Title: Reproducibility Study on Adversarial Attacks Against Robust Transformer Trackers
Comments: Published in Transactions on Machine Learning Research (05/2024): this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75]  arXiv:2406.01764 [pdf, other]
Title: An approximation-based approach versus an AI one for the study of CT images of abdominal aorta aneurysms
Comments: 28 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[76]  arXiv:2406.01662 [pdf, other]
Title: Few-Shot Classification of Interactive Activities of Daily Living (InteractADL)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[77]  arXiv:2406.01658 [pdf, other]
Title: Proxy Denoising for Source-Free Domain Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[78]  arXiv:2406.01598 [pdf, ps, other]
Title: D2E-An Autonomous Decision-making Dataset involving Driver States and Human Evaluation
Comments: Submitted to ITSC 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Robotics (cs.RO)
[79]  arXiv:2406.01597 [pdf, other]
Title: End-to-End Rate-Distortion Optimized 3D Gaussian Representation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[80]  arXiv:2406.02537 (cross-list from cs.CL) [pdf, other]
Title: TopViewRS: Vision-Language Models as Top-View Spatial Reasoners
Comments: 9 pages, 3 figures, 3 tables (21 pages, 4 figures, 15 tables including references and appendices)
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[81]  arXiv:2406.02534 (cross-list from eess.IV) [pdf, other]
Title: Enhancing predictive imaging biomarker discovery through treatment effect analysis
Comments: 19 pages, 12 figures
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[82]  arXiv:2406.02529 (cross-list from eess.IV) [pdf, other]
Title: ReLUs Are Sufficient for Learning Implicit Neural Representations
Comments: Accepted to ICML 2024
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[83]  arXiv:2406.02480 (cross-list from eess.IV) [pdf, other]
Title: Fairness Evolution in Continual Learning for Medical Imaging
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[84]  arXiv:2406.02477 (cross-list from eess.IV) [pdf, other]
Title: Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[85]  arXiv:2406.02465 (cross-list from cs.LG) [pdf, other]
Title: An Empirical Study into Clustering of Unseen Datasets with Self-Supervised Encoders
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[86]  arXiv:2406.02422 (cross-list from eess.IV) [pdf, other]
Title: IterMask2: Iterative Unsupervised Anomaly Segmentation via Spatial and Frequency Masking for Brain Lesions in MRI
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[87]  arXiv:2406.02395 (cross-list from cs.LG) [pdf, other]
Title: GrootVL: Tree Topology is All You Need in State Space Model
Comments: The code is available at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[88]  arXiv:2406.02349 (cross-list from cs.NE) [pdf, other]
Title: CADE: Cosine Annealing Differential Evolution for Spiking Neural Network
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[89]  arXiv:2406.02343 (cross-list from cs.LG) [pdf, other]
Title: Cluster-Aware Similarity Diffusion for Instance Retrieval
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[90]  arXiv:2406.02077 (cross-list from eess.IV) [pdf, other]
Title: Multi-target stain normalization for histology slides
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[91]  arXiv:2406.02064 (cross-list from cs.LG) [pdf, other]
Title: Advancing Generalized Transfer Attack with Initialization Derived Bilevel Optimization and Dynamic Sequence Truncation
Comments: Accepted by IJCAI 2024. 10 pages
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[92]  arXiv:2406.02027 (cross-list from cs.LG) [pdf, other]
Title: Inference Attacks in Machine Learning as a Service: A Taxonomy, Review, and Promising Directions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[93]  arXiv:2406.01996 (cross-list from cs.LG) [pdf, other]
Title: Bayesian Mesh Optimization for Graph Neural Networks to Enhance Engineering Performance Prediction
Comments: 17 pages, 8 figures, 3 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[94]  arXiv:2406.01993 (cross-list from eess.IV) [pdf, ps, other]
Title: Choroidal Vessel Segmentation on Indocyanine Green Angiography Images via Human-in-the-Loop Labeling
Authors: Ruoyu Chen (1), Ziwei Zhao (1), Mayinuer Yusufu (4 and 5), Xianwen Shang (1), Danli Shi (1 and 2), Mingguang He (1, 2 and 3) ((1) School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China. (2) Research Centre for SHARP Vision, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China. (3) Centre for Eye and Vision Research (CEVR), 17W Hong Kong Science Park, Hong Kong SAR, China. (4) Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, East Melbourne, Australia. (5) Department of Surgery (Ophthalmology), The University of Melbourne, Melbourne, Australia)
Comments: 25 pages, 4 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[95]  arXiv:2406.01975 (cross-list from cs.LG) [pdf, other]
Title: Can Dense Connectivity Benefit Outlier Detection? An Odyssey with NAS
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[96]  arXiv:2406.01961 (cross-list from cs.RO) [pdf, other]
Title: Exploring Real World Map Change Generalization of Prior-Informed HD Map Prediction Models
Comments: Accepted to CVPR 2024, Workshop on Autonomous Driving
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[97]  arXiv:2406.01829 (cross-list from cs.NE) [pdf, other]
Title: FacAID: A Transformer Model for Neuro-Symbolic Facade Reconstruction
Comments: 11 pages, 10 figures, preprint
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[98]  arXiv:2406.01733 (cross-list from cs.LG) [pdf, other]
Title: Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
Comments: Code is available at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[99]  arXiv:2406.01708 (cross-list from cs.CR) [pdf, other]
Title: Model for Peanuts: Hijacking ML Models without Training Access is Possible
Comments: 17 pages, 14 figures, 7 tables
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[100]  arXiv:2406.01613 (cross-list from q-bio.QM) [pdf, other]
Title: QuST: QuPath Extension for Integrative Whole Slide Image and Spatial Transcriptomics Analysis
Authors: Chao-Hui Huang
Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[101]  arXiv:2406.01605 (cross-list from eess.IV) [pdf, other]
Title: An Enhanced Encoder-Decoder Network Architecture for Reducing Information Loss in Image Semantic Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[102]  arXiv:2406.01604 (cross-list from cs.IR) [pdf, other]
Title: An Empirical Study of Excitation and Aggregation Design Adaptions in CLIP4Clip for Video-Text Retrieval
Comments: 20 pages
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)

Tue, 4 Jun 2024 (showing first 34 of 228 entries)

[103]  arXiv:2406.01595 [pdf, other]
Title: MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[104]  arXiv:2406.01594 [pdf, other]
Title: DiffUHaul: A Training-Free Method for Object Dragging in Images
Comments: Project page is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[105]  arXiv:2406.01593 [pdf, other]
Title: Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting
Comments: Project Page: see this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[106]  arXiv:2406.01592 [pdf, other]
Title: Text-guided Controllable Mesh Refinement for Interactive 3D Modeling
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Graphics (cs.GR); Machine Learning (cs.LG)
[107]  arXiv:2406.01591 [pdf, other]
Title: DeNVeR: Deformable Neural Vessel Representations for Unsupervised Video Vessel Segmentation
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[108]  arXiv:2406.01584 [pdf, other]
Title: SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[109]  arXiv:2406.01583 [pdf, other]
Title: Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP
Comments: 22 pages, 15 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[110]  arXiv:2406.01579 [pdf, other]
Title: Tetrahedron Splatting for 3D Generation
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[111]  arXiv:2406.01561 [pdf, other]
Title: Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[112]  arXiv:2406.01559 [pdf, other]
Title: Prototypical Transformer as Unified Motion Learners
Comments: 21 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113]  arXiv:2406.01555 [pdf, other]
Title: Towards Flexible Interactive Reflection Removal with Human Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[114]  arXiv:2406.01551 [pdf, other]
Title: ELSA: Evaluating Localization of Social Activities in Urban Streets
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[115]  arXiv:2406.01494 [pdf, other]
Title: Robust Classification by Coupling Data Mollification with Label Smoothing
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[116]  arXiv:2406.01493 [pdf, other]
Title: Learning Temporally Consistent Video Depth from Video Diffusion Priors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[117]  arXiv:2406.01489 [pdf, other]
Title: DA-HFNet: Progressive Fine-Grained Forgery Image Detection and Localization Based on Dual Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[118]  arXiv:2406.01486 [pdf, other]
Title: Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[119]  arXiv:2406.01480 [pdf, other]
Title: Towards Automating the Retrospective Generation of BIM Models: A Unified Framework for 3D Semantic Reconstruction of the Built Environment
Comments: CVPRW 2024, Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[120]  arXiv:2406.01476 [pdf, other]
Title: DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians with Video Diffusion Priors
Comments: Technical report. Codes are released at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121]  arXiv:2406.01460 [pdf, other]
Title: MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization
Comments: ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[122]  arXiv:2406.01455 [pdf, other]
Title: Automatic Fused Multimodal Deep Learning for Plant Identification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[123]  arXiv:2406.01451 [pdf, other]
Title: SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation
Comments: Accepted by ICML2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[124]  arXiv:2406.01449 [pdf, other]
Title: SLANT: Spurious Logo ANalysis Toolkit
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[125]  arXiv:2406.01432 [pdf, other]
Title: ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126]  arXiv:2406.01429 [pdf, other]
Title: EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2406.01425 [pdf, other]
Title: Sensitivity-Informed Augmentation for Robust Segmentation
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128]  arXiv:2406.01402 [pdf, other]
Title: Mixture of Rationale: Multi-Modal Reasoning Mixture for Visual Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[129]  arXiv:2406.01395 [pdf, other]
Title: TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation
Comments: This work has been submitted to the IEEE Transactions on Intelligent Vehicles for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[130]  arXiv:2406.01388 [pdf, other]
Title: AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131]  arXiv:2406.01380 [pdf, other]
Title: Convolutional Unscented Kalman Filter for Multi-Object Tracking with Outliers
Comments: 11 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[132]  arXiv:2406.01365 [pdf, other]
Title: From Feature Visualization to Visual Circuits: Effect of Adversarial Model Manipulation
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[133]  arXiv:2406.01356 [pdf, other]
Title: MP-PolarMask: A Faster and Finer Instance Segmentation for Concave Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[134]  arXiv:2406.01355 [pdf, other]
Title: Differentially Private Fine-Tuning of Diffusion Models
Comments: 16 pages, 5 figures, 11 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[135]  arXiv:2406.01349 [pdf, other]
Title: Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Comments: Project Page: this https URL, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[136]  arXiv:2406.01337 [pdf, other]
Title: ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds
Comments: CVPRW 2024 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV)