
Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 136

[ total of 456 entries: 1-100 | 37-136 | 137-236 | 237-336 | 337-436 | 437-456 ]

Thu, 9 May 2024 (continued, showing last 26 of 76 entries)

[137]  arXiv:2405.04717 [pdf, other]
Title: Remote Diffusion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[138]  arXiv:2405.04682 [pdf, other]
Title: TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation
Comments: 23 pages, 12 figures, 8 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[139]  arXiv:2405.04675 [pdf, other]
Title: TexControl: Sketch-Based Two-Stage Fashion Image Generation Using Diffusion Model
Comments: 5 pages, 8 figures, accepted in NICOGRAPH International 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[140]  arXiv:2405.04662 [pdf, other]
Title: Radar Fields: Frequency-Space Neural Scene Representations for FMCW Radar
Comments: 8 pages, 6 figures, to be published in SIGGRAPH 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[141]  arXiv:2405.04650 [pdf, other]
Title: A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[142]  arXiv:2405.04634 [pdf, other]
Title: FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes
Comments: 15 pages | 9 figures | 8 tables | Dataset is available at this https URL | Trained model is available at this https URL | Deep learning code repository is on GitHub at this https URL | Data engineering code repository is on GitHub at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[143]  arXiv:2405.04605 [pdf, ps, other]
Title: AI in Lung Health: Benchmarking Detection and Diagnostic Models Across Multiple CT Scan Datasets
Comments: 16 pages, 2 tables, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[144]  arXiv:2405.04589 [pdf, other]
Title: A Novel Wide-Area Multiobject Detection System with High-Probability Region Searching
Comments: Accepted by ICRA 2024
Journal-ref: 2024 IEEE International Conference on Robotics and Automation (ICRA)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[145]  arXiv:2405.04549 [pdf, other]
Title: ClothPPO: A Proximal Policy Optimization Enhancing Framework for Robotic Cloth Manipulation with Observation-Aligned Action Spaces
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[146]  arXiv:2405.04538 [pdf, other]
Title: DiffFinger: Advancing Synthetic Fingerprint Generation through Denoising Diffusion Probabilistic Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[147]  arXiv:2405.04537 [pdf, other]
Title: An intuitive multi-frequency feature representation for SO(3)-equivariant networks
Comments: ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[148]  arXiv:2405.04536 [pdf, other]
Title: When Training-Free NAS Meets Vision Transformer: A Neural Tangent Kernel Perspective
Authors: Qiqi Zhou, Yichen Zhu
Comments: ICASSP2024 oral
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[149]  arXiv:2405.04535 [pdf, other]
Title: Image Classification for CSSVD Detection in Cacao Plants
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[150]  arXiv:2405.05170 (cross-list from cs.MM) [pdf, other]
Title: Picking watermarks from noise (PWFN): an improved robust watermarking model against intensive distortions
Comments: Accepted by ICME2024
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[151]  arXiv:2405.05160 (cross-list from cs.LG) [pdf, other]
Title: Selective Classification Under Distribution Shifts
Comments: Total 25 pages (14 pages for main body); preprint for journal submission
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[152]  arXiv:2405.05095 (cross-list from math.NA) [pdf, other]
Title: Approximation properties relative to continuous scale space for hybrid discretizations of Gaussian derivative operators
Authors: Tony Lindeberg
Comments: 13 pages, 11 figures. arXiv admin note: text overlap with arXiv:2311.11317
Subjects: Numerical Analysis (math.NA); Computer Vision and Pattern Recognition (cs.CV)
[153]  arXiv:2405.05007 (cross-list from eess.IV) [pdf, other]
Title: HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation
Authors: Jiashu Xu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[154]  arXiv:2405.04966 (cross-list from cs.IT) [pdf, other]
Title: Communication-Efficient Collaborative Perception via Information Filling with Codebook
Comments: 10 pages, Accepted by CVPR 2024
Subjects: Information Theory (cs.IT); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[155]  arXiv:2405.04902 (cross-list from eess.IV) [pdf, other]
Title: HAGAN: Hybrid Augmented Generative Adversarial Network for Medical Image Synthesis
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[156]  arXiv:2405.04890 (cross-list from cs.RO) [pdf, other]
Title: GISR: Geometric Initialization and Silhouette-based Refinement for Single-View Robot Pose and Configuration Estimation
Comments: Submitted to IEEE Robotics and Automation Letters (RA-L)
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[157]  arXiv:2405.04867 (cross-list from eess.IV) [pdf, other]
[158]  arXiv:2405.04812 (cross-list from cs.RO) [pdf, other]
Title: General Place Recognition Survey: Towards Real-World Autonomy
Comments: 20 pages, 12 figures, under review
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[159]  arXiv:2405.04778 (cross-list from eess.IV) [pdf, other]
Title: Teacher-Student Network for Real-World Face Super-Resolution with Progressive Embedding of Edge Information
Comments: Accepted by ICIP 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[160]  arXiv:2405.04610 (cross-list from eess.IV) [pdf, other]
Title: Exploring Explainable AI Techniques for Improved Interpretability in Lung and Colon Cancer Classification
Comments: Accepted in 4th International Conference on Computing and Communication Networks (ICCCNet-2024)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[161]  arXiv:2405.04595 (cross-list from eess.IV) [pdf, ps, other]
Title: An Advanced Features Extraction Module for Remote Sensing Image Super-Resolution
Comments: Preprint of paper from The 21st International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology or ECTI-CON 2024, Khon Kaen, Thailand
Journal-ref: ECTI-CON 2024, Khon Kaen Thailand
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[162]  arXiv:2405.04507 (cross-list from stat.AP) [pdf, other]
Title: New allometric models for the USA create a step-change in forest carbon estimation, modeling, and mapping
Authors: Lucas K. Johnson (1), Michael J. Mahoney (1), Grant Domke (2), Colin M. Beier (1) ((1) State University of New York College of Environmental Science and Forestry, (2) USDA Forest Service)
Comments: Manuscript: 16 pages, 7 figures; Supplements: 3 pages, 2 figures; Submitted to: Remote Sensing of Environment
Subjects: Applications (stat.AP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)

Wed, 8 May 2024

[163]  arXiv:2405.04534 [pdf, other]
Title: Tactile-Augmented Radiance Fields
Comments: CVPR 2024, Project page: this https URL, Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164]  arXiv:2405.04533 [pdf, other]
Title: ChatHuman: Language-driven 3D Human Understanding with Retrieval-Augmented Tool Reasoning
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[165]  arXiv:2405.04496 [pdf, other]
Title: Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[166]  arXiv:2405.04489 [pdf, other]
Title: S3Former: Self-supervised High-resolution Transformer for Solar PV Profiling
Comments: Preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[167]  arXiv:2405.04457 [pdf, other]
Title: Towards Geographic Inclusion in the Evaluation of Text-to-Image Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC)
[168]  arXiv:2405.04442 [pdf, other]
Title: AugmenTory: A Fast and Flexible Polygon Augmentation Library
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[169]  arXiv:2405.04416 [pdf, other]
Title: DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid
Comments: Originally submitted to Siggraph Asia 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[170]  arXiv:2405.04408 [pdf, other]
Title: DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Comments: Accepted by CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[171]  arXiv:2405.04404 [pdf, other]
Title: Vision Mamba: A Comprehensive Survey and Taxonomy
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[172]  arXiv:2405.04403 [pdf, other]
Title: Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[173]  arXiv:2405.04390 [pdf, other]
Title: DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving
Comments: Accepted by CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[174]  arXiv:2405.04377 [pdf, other]
Title: Choose What You Need: Disentangled Representation Learning for Scene Text Recognition, Removal and Editing
Comments: Accepted to CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175]  arXiv:2405.04370 [pdf, other]
Title: Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[176]  arXiv:2405.04356 [pdf, other]
Title: Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation
Comments: Accepted by CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177]  arXiv:2405.04345 [pdf, other]
Title: Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications
Comments: 8 pages, 8 figures, accepted for publication in The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences (ISPRS Archives) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[178]  arXiv:2405.04327 [pdf, other]
Title: Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation
Comments: CVPR2024 NTIRE Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[179]  arXiv:2405.04312 [pdf, other]
Title: Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[180]  arXiv:2405.04311 [pdf, ps, other]
Title: Cross-IQA: Unsupervised Learning for Image Quality Assessment
Authors: Zhen Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[181]  arXiv:2405.04309 [pdf, other]
Title: Non-rigid Structure-from-Motion: Temporally-smooth Procrustean Alignment and Spatially-variant Deformation Modeling
Comments: Accepted by CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[182]  arXiv:2405.04305 [pdf, other]
Title: A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[183]  arXiv:2405.04299 [pdf, other]
Title: ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[184]  arXiv:2405.04251 [pdf, other]
Title: A General Model for Detecting Learner Engagement: Implementation and Evaluation
Comments: 13 pages, 2 Postscript figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[185]  arXiv:2405.04233 [pdf, other]
Title: Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models
Comments: Project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[186]  arXiv:2405.04211 [pdf, other]
Title: Breast Histopathology Image Retrieval by Attention-based Adversarially Regularized Variational Graph Autoencoder with Contrastive Learning-Based Feature Extraction
Comments: 31 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[187]  arXiv:2405.04189 [pdf, ps, other]
Title: Artificial Intelligence-powered fossil shark tooth identification: Unleashing the potential of Convolutional Neural Networks
Comments: 40 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[188]  arXiv:2405.04175 [pdf, other]
Title: Topicwise Separable Sentence Retrieval for Medical Report Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[189]  arXiv:2405.04167 [pdf, other]
Title: Bridging the Synthetic-to-Authentic Gap: Distortion-Guided Unsupervised Domain Adaptation for Blind Image Quality Assessment
Comments: Accepted by CVPR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[190]  arXiv:2405.04164 [pdf, other]
Title: Sign2GPT: Leveraging Large Language Models for Gloss-Free Sign Language Translation
Comments: Accepted at ICLR2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[191]  arXiv:2405.04133 [pdf, other]
Title: Exposing AI-generated Videos: A Benchmark Dataset and a Local-and-Global Temporal Defect Based Detection Method
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[192]  arXiv:2405.04121 [pdf, other]
Title: ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation
Comments: 9 pages, 6 figures, ICME 2024 oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[193]  arXiv:2405.04103 [pdf, other]
Title: COM3D: Leveraging Cross-View Correspondence and Cross-Modal Mining for 3D Retrieval
Comments: Accepted by ICME 2024 oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[194]  arXiv:2405.04100 [pdf, other]
Title: ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios
Comments: Accepted by ICRA 2024 as Oral Presentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[195]  arXiv:2405.04097 [pdf, other]
Title: Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG); Multimedia (cs.MM)
[196]  arXiv:2405.04093 [pdf, other]
Title: DCNN: Dual Cross-current Neural Networks Realized Using An Interactive Deep Learning Discriminator for Fine-grained Objects
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[197]  arXiv:2405.04044 [pdf, other]
Title: DMOFC: Discrimination Metric-Optimized Feature Compression
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[198]  arXiv:2405.04042 [pdf, other]
Title: Space-time Reinforcement Network for Video Object Segmentation
Comments: Accepted by ICME 2024. 6 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[199]  arXiv:2405.04009 [pdf, other]
Title: Structured Click Control in Transformer-based Interactive Segmentation
Comments: 10 pages, 6 figures, submitted to NeurIPS 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[200]  arXiv:2405.04007 [pdf, other]
Title: SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing
Comments: Technical Report; Dataset released in this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[201]  arXiv:2405.03995 [pdf, other]
Title: Deep Event-based Object Detection in Autonomous Driving: A Survey
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[202]  arXiv:2405.03981 [pdf, other]
Title: Predicting Lung Disease Severity via Image-Based AQI Analysis using Deep Learning Techniques
Comments: 11 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[203]  arXiv:2405.03978 [pdf, other]
Title: VMambaCC: A Visual State Space Model for Crowd Counting
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[204]  arXiv:2405.03971 [pdf, other]
Title: Unified End-to-End V2X Cooperative Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[205]  arXiv:2405.03959 [pdf, other]
Title: Joint Estimation of Identity Verification and Relative Pose for Partial Fingerprints
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[206]  arXiv:2405.03958 [pdf, other]
Title: Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[207]  arXiv:2405.03955 [pdf, ps, other]
Title: IPFed: Identity protected federated learning for user authentication
Journal-ref: 2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[208]  arXiv:2405.03945 [pdf, other]
Title: Role of Sensing and Computer Vision in 6G Wireless Communications
Subjects: Computer Vision and Pattern Recognition (cs.CV); Networking and Internet Architecture (cs.NI)
[209]  arXiv:2405.03894 [pdf, other]
Title: MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View
Comments: CVPRW: Generative Models for Computer Vision
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[210]  arXiv:2405.03884 [pdf, other]
Title: BadFusion: 2D-Oriented Backdoor Attacks against 3D Object Detection
Comments: Accepted at IJCAI 2024 Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[211]  arXiv:2405.03882 [pdf, other]
Title: Trio-ViT: Post-Training Quantization and Acceleration for Softmax-Free Efficient Vision Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[212]  arXiv:2405.03852 [pdf, other]
Title: VSA4VQA: Scaling a Vector Symbolic Architecture to Visual Question Answering on Natural Images
Comments: To be published in the Proceedings of the Annual Meeting of the Cognitive Science Society (CogSci'24)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[213]  arXiv:2405.03846 [pdf, other]
Title: Enhancing Apparent Personality Trait Analysis with Cross-Modal Embeddings
Comments: 14 pages, 4 figures
Journal-ref: Annales Universitatis Scientiarum Budapestinensis de Rolando Eötvös Nominatae. Sectio Computatorica, MaCS Special Issue, 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[214]  arXiv:2405.03803 [pdf, other]
Title: MoDiPO: text-to-motion alignment via AI-feedback-driven Direct Preference Optimization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[215]  arXiv:2405.03770 [pdf, other]
Title: Foundation Models for Video Understanding: A Survey
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[216]  arXiv:2405.03722 [pdf, other]
Title: Class-relevant Patch Embedding Selection for Few-Shot Image Classification
Comments: arXiv admin note: text overlap with arXiv:2405.03109
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[217]  arXiv:2405.03715 [pdf, other]
Title: Iterative Filter Pruning for Concatenation-based CNN Architectures
Comments: Accepted for publication at IJCNN 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[218]  arXiv:2405.03702 [pdf, other]
Title: Leafy Spurge Dataset: Real-world Weed Classification Within Aerial Drone Imagery
Comments: Official Dataset Technical Report. Used in DA-Fusion (arXiv:2302.07944)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[219]  arXiv:2405.04459 (cross-list from cs.AI) [pdf, other]
Title: A Significantly Better Class of Activation Functions Than ReLU Like Activation Functions
Comments: 14 pages
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[220]  arXiv:2405.04392 (cross-list from cs.RO) [pdf, other]
Title: BILTS: A novel bi-invariant local trajectory-shape descriptor for rigid-body motion
Comments: This work has been submitted as a regular research paper for consideration in the IEEE Transactions on Robotics. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Robotics (cs.RO); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[221]  arXiv:2405.04378 (cross-list from cs.RO) [pdf, other]
Title: Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[222]  arXiv:2405.04295 (cross-list from eess.IV) [pdf, other]
Title: Semi-Supervised Disease Classification based on Limited Medical Image Data
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[223]  arXiv:2405.04288 (cross-list from eess.IV) [pdf, other]
Title: BetterNet: An Efficient CNN Architecture with Residual Learning and Attention for Precision Polyp Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[224]  arXiv:2405.04274 (cross-list from eess.IV) [pdf, other]
Title: Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[225]  arXiv:2405.04191 (cross-list from cs.LG) [pdf, other]
Title: Effective and Robust Adversarial Training against Data and Label Corruptions
Comments: 12 pages, 8 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[226]  arXiv:2405.04169 (cross-list from eess.IV) [pdf, other]
Title: D-TrAttUnet: Toward Hybrid CNN-Transformer Architecture for Generic and Subtle Segmentation in Medical Images
Comments: arXiv admin note: text overlap with arXiv:2303.15576
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[227]  arXiv:2405.04071 (cross-list from cs.RO) [pdf, other]
Title: IMU-Aided Event-based Stereo Visual Odometry
Comments: 10 pages, 7 figures, ICRA
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[228]  arXiv:2405.04041 (cross-list from cs.AI) [pdf, other]
Title: Feature Map Convergence Evaluation for Functional Module
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[229]  arXiv:2405.04023 (cross-list from eess.IV) [pdf, other]
Title: Lumbar Spine Tumor Segmentation and Localization in T2 MRI Images Using AI
Comments: 9 pages, 12 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[230]  arXiv:2405.03905 (cross-list from cs.AR) [pdf, other]
Title: A 65nm 36nJ/Decision Bio-inspired Temporal-Sparsity-Aware Digital Keyword Spotting IC with 0.6V Near-Threshold SRAM
Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[231]  arXiv:2405.03827 (cross-list from cs.RO) [pdf, other]
Title: Direct learning of home vector direction for insect-inspired robot navigation
Comments: Published at ICRA 2024, project webpage at this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[232]  arXiv:2405.03762 (cross-list from eess.IV) [pdf, other]
Title: Deep learning classifier of locally advanced rectal cancer treatment response from endoscopy images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[233]  arXiv:2405.03732 (cross-list from eess.IV) [pdf, ps, other]
Title: Accelerated MR Cholangiopancreatography with Deep Learning-based Reconstruction
Comments: 20 pages, 6 figures, 2 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[234]  arXiv:2405.03730 (cross-list from cs.LG) [pdf, other]
Title: Tilt your Head: Activating the Hidden Spatial-Invariance of Classifiers
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[235]  arXiv:2405.03713 (cross-list from eess.IV) [pdf, other]
Title: Improve Cross-Modality Segmentation by Treating MRI Images as Inverted CT Scans
Comments: 3 pages, 2 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)

Tue, 7 May 2024 (showing first 1 of 159 entries)

[236]  arXiv:2405.03690 [pdf, other]
Title: How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV)