We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for cs.CV in Jun 2022

[ total of 1594 entries: 1-1591 | 1592-1594 ]
[ showing 1591 entries per page: fewer | more | all ]
[1]  arXiv:2206.00048 [pdf, other]
Title: PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs
Comments: Accepted at ICLR 2023. Code available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2]  arXiv:2206.00069 [pdf, other]
Title: Comparing feature fusion strategies for Deep Learning-based kidney stone identification
Comments: 4 pages, 3 figures, XXVIII\`eme Colloque Francophone de Traitement du Signal et des Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3]  arXiv:2206.00092 [pdf, other]
Title: FHIST: A Benchmark for Few-shot Classification of Histological Images
Comments: Code available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[4]  arXiv:2206.00100 [pdf, other]
Title: VALHALLA: Visual Hallucination for Machine Translation
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[5]  arXiv:2206.00123 [pdf, other]
Title: Glo-In-One: Holistic Glomerular Detection, Segmentation, and Lesion Characterization with Large-scale Web Image Mining
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[6]  arXiv:2206.00148 [pdf, other]
Title: Hands-Up: Leveraging Synthetic Data for Hands-On-Wheel Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[7]  arXiv:2206.00162 [pdf, other]
Title: PAGER: Progressive Attribute-Guided Extendable Robust Image Generation
Comments: 19 pages, 12 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[8]  arXiv:2206.00171 [pdf, other]
Title: Learning Sequential Contexts using Transformer for 3D Hand Pose Estimation
Comments: Accepted to ICPR'22
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[9]  arXiv:2206.00181 [pdf, other]
Title: Labeling Where Adapting Fails: Cross-Domain Semantic Segmentation with Point Supervision via Active Selection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[10]  arXiv:2206.00182 [pdf, other]
Title: Differentiable Soft-Masked Attention
Comments: arXiv admin note: text overlap with arXiv:2112.09131
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[11]  arXiv:2206.00205 [pdf, other]
Title: CAFA: Class-Aware Feature Alignment for Test-Time Adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[12]  arXiv:2206.00214 [pdf, other]
Title: LiDAR-MIMO: Efficient Uncertainty Estimation for LiDAR-based 3D Object Detection
Comments: 8 pages, 4 figures and 5 tables. Accepted in IEEE IV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[13]  arXiv:2206.00222 [pdf, other]
Title: Cross-domain Detection Transformer based on Spatial-aware and Semantic-aware Token Alignment
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[14]  arXiv:2206.00227 [pdf, other]
Title: Rethinking the Augmentation Module in Contrastive Learning: Learning Hierarchical Augmentation Invariance with Expanded Views
Comments: Accepted to CVPR 2022
Journal-ref: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[15]  arXiv:2206.00244 [pdf, other]
Title: Fair Comparison between Efficient Attentions
Comments: 4 pages abstract
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[16]  arXiv:2206.00252 [pdf, other]
Title: Interpretable Deep Learning Classifier by Detection of Prototypical Parts on Kidney Stones Images
Comments: Extended abstract accepted at LatinX in Computer Vision Research Workshop, at CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[17]  arXiv:2206.00272 [pdf, other]
Title: Vision GNN: An Image is Worth Graph of Nodes
Comments: NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[18]  arXiv:2206.00274 [pdf, other]
Title: Point-Teaching: Weakly Semi-Supervised Object Detection with Point Annotations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[19]  arXiv:2206.00280 [pdf, other]
Title: Automatic Bounding Box Annotation with Small Training Data Sets for Industrial Manufacturing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[20]  arXiv:2206.00282 [pdf, other]
Title: Needle In A Haystack, Fast: Benchmarking Image Perceptual Similarity Metrics At Scale
Comments: 26 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
[21]  arXiv:2206.00291 [pdf, other]
Title: Efficient Multi-Purpose Cross-Attention Based Image Alignment Block for Edge Devices
Comments: Accepted into Embedded Vision Workshop 2022 of CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[22]  arXiv:2206.00309 [pdf, other]
Title: Label-Efficient Online Continual Object Detection in Streaming Video
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[23]  arXiv:2206.00311 [pdf, other]
Title: MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[24]  arXiv:2206.00343 [pdf, other]
Title: Towards view-invariant vehicle speed detection from driving simulator images
Comments: 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[25]  arXiv:2206.00344 [pdf, other]
Title: Self-Supervised Learning as a Means To Reduce the Need for Labeled Data in Medical Image Analysis
Comments: Accepted by 30th European Signal Processing Conference, EUSIPCO 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[26]  arXiv:2206.00359 [pdf, other]
Title: DeepCluE: Enhanced Image Clustering via Multi-layer Ensembles in Deep Neural Networks
Comments: To appear in IEEE Transactions on Emerging Topics in Computational Intelligence
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[27]  arXiv:2206.00364 [pdf, other]
Title: Elucidating the Design Space of Diffusion-Based Generative Models
Comments: NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[28]  arXiv:2206.00384 [pdf, other]
Title: Generalized Supervised Contrastive Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[29]  arXiv:2206.00386 [pdf, other]
Title: DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[30]  arXiv:2206.00415 [pdf, other]
Title: Learning Invariant Visual Representations for Compositional Zero-Shot Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[31]  arXiv:2206.00447 [pdf, other]
Title: CD$^2$: Fine-grained 3D Mesh Reconstruction With Twice Chamfer Distance
Comments: Just accepted by TOMM
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[32]  arXiv:2206.00468 [pdf, other]
Title: PanopticDepth: A Unified Framework for Depth-aware Panoptic Segmentation
Comments: CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[33]  arXiv:2206.00481 [pdf, other]
Title: Where are my Neighbors? Exploiting Patches Relations in Self-Supervised Vision Transformer
Comments: Accepted to BMVC 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[34]  arXiv:2206.00489 [pdf, other]
Title: Attack-Agnostic Adversarial Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[35]  arXiv:2206.00491 [pdf, other]
Title: Semantic Room Wireframe Detection from a Single View
Comments: Accepted for ICPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[36]  arXiv:2206.00506 [pdf, other]
Title: Proximally Sensitive Error for Anomaly Detection and Feature Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[37]  arXiv:2206.00515 [pdf, other]
Title: Landslide4Sense: Reference Benchmark Data and Deep Learning Models for Landslide Detection
Journal-ref: IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-17, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[38]  arXiv:2206.00527 [pdf, other]
Title: Amodal Cityscapes: A New Dataset, its Generation, and an Amodal Semantic Segmentation Challenge Baseline
Comments: This paper is accepted at IEEE Intelligent Vehicles Symposium 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[39]  arXiv:2206.00535 [pdf, other]
Title: Deepfake Caricatures: Amplifying attention to artifacts increases deepfake detection by humans and machines
Comments: 9 pages, 5 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Social and Information Networks (cs.SI)
[40]  arXiv:2206.00580 [pdf, other]
Title: Dog nose print matching with dual global descriptor based on Contrastive Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41]  arXiv:2206.00608 [pdf, other]
Title: On the Choice of Data for Efficient Training and Validation of End-to-End Driving Models
Comments: Accepted at CVPR VDU Workshop 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[42]  arXiv:2206.00614 [pdf, other]
Title: Dual-stream spatiotemporal networks with feature sharing for monitoring animals in the home cage
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[43]  arXiv:2206.00629 [pdf, other]
Title: CLIP4IDC: CLIP for Image Difference Captioning
Comments: Accepted to AACL-IJCNLP 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[44]  arXiv:2206.00630 [pdf, other]
Title: Unifying Voxel-based Representation with Transformer for 3D Object Detection
Comments: Accepted to NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[45]  arXiv:2206.00645 [pdf, other]
Title: Floorplan Restoration by Structure Hallucinating Transformer Cascades
Comments: Published at BMVC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[46]  arXiv:2206.00665 [pdf, other]
Title: MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface Reconstruction
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[47]  arXiv:2206.00718 [pdf, other]
Title: Context-Driven Detection of Invertebrate Species in Deep-Sea Video
Journal-ref: International Journal of Computer Vision 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[48]  arXiv:2206.00735 [pdf, other]
Title: Cascaded Video Generation for Videos In-the-Wild
Comments: Accepted to the 26th International Conference on Pattern Recognition (ICPR 2022). arXiv admin note: substantial text overlap with arXiv:2106.02719
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[49]  arXiv:2206.00746 [pdf, other]
Title: Residual Multiplicative Filter Networks for Multiscale Reconstruction
Comments: NeurIPS 2022, Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[50]  arXiv:2206.00771 [pdf, other]
Title: Dynamic Linear Transformer for 3D Biomedical Image Segmentation
Comments: 8 Pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[51]  arXiv:2206.00790 [pdf, other]
Title: Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction
Comments: Add code
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[52]  arXiv:2206.00798 [pdf, other]
Title: Multi-scale frequency separation network for image deblurring
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[53]  arXiv:2206.00800 [pdf, other]
Title: CcHarmony: Color-checker based Image Harmonization Dataset
Authors: Haoxu Huang, Li Niu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54]  arXiv:2206.00806 [pdf, other]
Title: XBound-Former: Toward Cross-scale Boundary Modeling in Transformers
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[55]  arXiv:2206.00812 [pdf, other]
Title: Modeling sRGB Camera Noise with Normalizing Flows
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[56]  arXiv:2206.00859 [pdf, other]
Title: Disentangled Generation Network for Enlarged License Plate Recognition and A Unified Dataset
Comments: Submission to CVIU
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[57]  arXiv:2206.00878 [pdf, other]
Title: EfficientNeRF: Efficient Neural Radiance Fields
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[58]  arXiv:2206.00893 [pdf, other]
Title: Leveraging Systematic Knowledge of 2D Transformations
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[59]  arXiv:2206.00897 [pdf, other]
Title: xView3-SAR: Detecting Dark Fishing Activity Using Synthetic Aperture Radar Imagery
Comments: Accepted to NeurIPS 2022. 10 pages (25 with references and supplement)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[60]  arXiv:2206.00902 [pdf, other]
Title: MISSU: 3D Medical Image Segmentation via Self-distilling TransUNet
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[61]  arXiv:2206.00923 [pdf, other]
Title: Modeling Image Composition for Complex Scene Generation
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[62]  arXiv:2206.00924 [pdf, other]
Title: FACM: Intermediate Layer Still Retain Effective Features against Adversarial Examples
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[63]  arXiv:2206.00930 [pdf, other]
Title: Predicting Physical Object Properties from Video
Comments: accepted for International Joint Conference on Neural Networks (IJCNN) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[64]  arXiv:2206.00947 [pdf, other]
Title: A Bhattacharyya Coefficient-Based Framework for Noise Model-Aware Random Walker Image Segmentation
Comments: Dominik Drees and Florian Eilers contributed equally to this work
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[65]  arXiv:2206.00960 [pdf, other]
Title: SparseDet: Towards End-to-End 3D Object Detection
Journal-ref: Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, pp. 781- 792. Feb. 6-8, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[66]  arXiv:2206.00971 [pdf, other]
Title: CVM-Cervix: A Hybrid Cervical Pap-Smear Image Classification Framework Using CNN, Visual Transformer and Multilayer Perceptron
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[67]  arXiv:2206.00997 [pdf, other]
Title: Is Mapping Necessary for Realistic PointGoal Navigation?
Comments: Corrected typos in the Abstract
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[68]  arXiv:2206.01009 [pdf, other]
Title: Unified Recurrence Modeling for Video Action Anticipation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[69]  arXiv:2206.01010 [pdf, other]
Title: Long-tailed Recognition by Learning from Latent Categories
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[70]  arXiv:2206.01014 [pdf, other]
Title: Suggestive Annotation of Brain MR Images with Gradient-guided Sampling
Comments: Manuscript accepted by MedIA
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[71]  arXiv:2206.01017 [pdf, other]
Title: Structured Two-stream Attention Network for Video Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72]  arXiv:2206.01034 [pdf, other]
Title: Adversarial Laser Spot: Robust and Covert Physical-World Attack to DNNs
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[73]  arXiv:2206.01038 [pdf, other]
Title: A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications
Comments: 26 pages. The toolbox is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[74]  arXiv:2206.01061 [pdf, other]
Title: FV-UPatches: Enhancing Universality in Finger Vein Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[75]  arXiv:2206.01062 [pdf, other]
Title: DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
Comments: 9 pages, 6 figures, 5 tables. Accepted paper at SIGKDD 2022 conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[76]  arXiv:2206.01102 [pdf, other]
Title: A temporal chrominance trigger for clean-label backdoor attack against anti-spoof rebroadcast detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[77]  arXiv:2206.01125 [pdf, other]
Title: Prefix Conditioning Unifies Language and Label Supervision
Comments: CVPR2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[78]  arXiv:2206.01127 [pdf, other]
Title: VL-BEiT: Generative Vision-Language Pretraining
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[79]  arXiv:2206.01136 [pdf, other]
Title: Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[80]  arXiv:2206.01153 [pdf, other]
Title: Multi-View Active Fine-Grained Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[81]  arXiv:2206.01160 [pdf, other]
Title: DE-Net: Dynamic Text-guided Image Editing Adversarial Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[82]  arXiv:2206.01161 [pdf, other]
Title: Optimizing Relevance Maps of Vision Transformers Improves Robustness
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83]  arXiv:2206.01191 [pdf, other]
Title: EfficientFormer: Vision Transformers at MobileNet Speed
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[84]  arXiv:2206.01198 [pdf, other]
Title: Pruning-as-Search: Efficient Neural Architecture Search via Channel Pruning and Structural Reparameterization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[85]  arXiv:2206.01201 [pdf, other]
Title: REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Comments: Accepted by NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[86]  arXiv:2206.01202 [pdf, other]
Title: Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[87]  arXiv:2206.01203 [pdf, other]
Title: Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation Using Bounding Boxes
Comments: Project page: this https URL
Journal-ref: European Conference on Computer Vision (ECCV), 2022, Oral Presentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88]  arXiv:2206.01204 [pdf, other]
Title: Siamese Image Modeling for Self-Supervised Vision Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89]  arXiv:2206.01232 [pdf, other]
Title: What Are Expected Queries in End-to-End Object Detection?
Comments: The source code is publicly available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90]  arXiv:2206.01244 [pdf, other]
Title: Real-Time Portrait Stylization on the Edge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[91]  arXiv:2206.01256 [pdf, other]
Title: PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
Comments: Adding 3D lane detection results on OpenLane Dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[92]  arXiv:2206.01290 [pdf, other]
Title: Points2NeRF: Generating Neural Radiance Fields from 3D point cloud
Comments: arXiv admin note: text overlap with arXiv:2003.08934 by other authors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93]  arXiv:2206.01297 [pdf, other]
Title: Lossless Compression of Point Cloud Sequences Using Sequence Optimized CNN Models
Comments: 9 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[94]  arXiv:2206.01309 [pdf, other]
Title: H-EMD: A Hierarchical Earth Mover's Distance Method for Instance Segmentation
Comments: Accepted at IEEE Transactions On Medical Imaging (TMI)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95]  arXiv:2206.01319 [pdf, other]
Title: Learning Unbiased Transferability for Domain Adaptation by Uncertainty Modeling
Comments: This paper has been accepted by ECCV2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[96]  arXiv:2206.01326 [pdf, other]
Title: Improving Fairness in Large-Scale Object Recognition by CrowdSourced Demographic Information
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[97]  arXiv:2206.01327 [pdf, other]
Title: RELAY: Robotic EyeLink AnalYsis of the EyeLink 1000 using an Artificial Eye
Comments: 12 Pages, 17 Figures, 2 Tables. Git Repository: this https URL Appendix Repository: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[98]  arXiv:2206.01334 [pdf, other]
Title: Long Scale Error Control in Low Light Image and Video Enhancement Using Equivariance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[99]  arXiv:2206.01365 [pdf, other]
Title: Adversarial Attacks on Human Vision
Comments: 21 pages, 8 figures, 1 table
Journal-ref: Extended version of IEEE MultiMedia, vol. 23, no. 1, pp. 82-91, Jan.-Mar. 2016
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[100]  arXiv:2206.01369 [pdf, other]
Title: Incremental Learning Meets Transfer Learning: Application to Multi-site Prostate MRI Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[101]  arXiv:2206.01370 [pdf, other]
Title: Towards Improving the Generation Quality of Autoregressive Slot VAEs
Comments: Published in Neural Computation. 38 pages, 18 figures. Code and videos available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[102]  arXiv:2206.01381 [pdf, other]
Title: CF-YOLO: Cross Fusion YOLO for Object Detection in Adverse Weather with a High-quality Real Snow Dataset
Comments: 10pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[103]  arXiv:2206.01384 [pdf, ps, other]
Title: End-to-End 3D Hand Pose Estimation from Stereo Cameras
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[104]  arXiv:2206.01408 [pdf, other]
Title: MetaLR: Meta-tuning of Learning Rates for Transfer Learning in Medical Imaging
Comments: MICCAI 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[105]  arXiv:2206.01417 [pdf, other]
Title: Learning an Adaptation Function to Assess Image Visual Similarities
Authors: Olivier Risser-Maroix (LIPADE), Amine Marzouki (LIPADE), Hala Djeghim (LIPADE), Camille Kurtz (LIPADE), Nicolas Lomenie (LIPADE)
Journal-ref: ORASIS 2021, Centre National de la Recherche Scientifique [CNRS], Sep 2021, Saint Ferr{\'e}ol, France
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[106]  arXiv:2206.01429 [pdf, other]
Title: Learning rich optical embeddings for privacy-preserving lensless image classification
Comments: 29 pages, 23 figures, under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[107]  arXiv:2206.01441 [pdf, other]
Title: Exploring Transformers for Behavioural Biometrics: A Case Study in Gait Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[108]  arXiv:2206.01466 [pdf, other]
Title: Recognition of Unseen Bird Species by Learning from Field Guides
Comments: Accepted to WACV2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[109]  arXiv:2206.01467 [pdf, other]
Title: The Importance of Image Interpretation: Patterns of Semantic Misclassification in Real-World Adversarial Images
Comments: International Conference on Multimedia Modeling (MMM) 2023. Resources are publicly available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[110]  arXiv:2206.01473 [pdf, other]
Title: Distributional loss for convolutional neural network regression and application to GNSS multi-path estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[111]  arXiv:2206.01498 [pdf, ps, other]
Title: YOLOv5s-GTB: light-weighted and improved YOLOv5s for bridge crack detection
Authors: Xiao Ruiqiang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[112]  arXiv:2206.01524 [pdf, other]
Title: Anomaly detection in surveillance videos using transformer based attention model
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113]  arXiv:2206.01627 [pdf, other]
Title: Pruning for Feature-Preserving Circuits in CNNs
Comments: Under Review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[114]  arXiv:2206.01646 [pdf, other]
Title: Integrating Prior Knowledge in Contrastive Learning with Kernel
Comments: ICML 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[115]  arXiv:2206.01651 [pdf, other]
Title: D'ARTAGNAN: Counterfactual Video Generation
Comments: Accepted for MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[116]  arXiv:2206.01653 [pdf, other]
[117]  arXiv:2206.01658 [pdf, ps, other]
Title: Identification via Retinal Vessels Combining LBP and HOG
Authors: Ali Noori
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[118]  arXiv:2206.01661 [pdf, other]
Title: Style-Content Disentanglement in Language-Image Pretraining Representations for Zero-Shot Sketch-to-Image Synthesis
Authors: Jan Zuiderveld
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[119]  arXiv:2206.01670 [pdf, other]
Title: Egocentric Video-Language Pretraining
Comments: Accepted by NeurIPS 2022. Double champions at Ego4D and EPIC-Kitchens, CVPR 2022 challenges. 23 pages, 13 figures, 12 tables. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[120]  arXiv:2206.01705 [pdf, other]
Title: Gradient Obfuscation Checklist Test Gives a False Sense of Security
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121]  arXiv:2206.01714 [pdf, other]
Title: Compositional Visual Generation with Composable Diffusion Models
Comments: ECCV 2022. First three authors contributed equally. Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[122]  arXiv:2206.01718 [pdf, other]
Title: A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[123]  arXiv:2206.01720 [pdf, other]
Title: Revisiting the "Video" in Video-Language Understanding
Comments: CVPR 2022 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[124]  arXiv:2206.01724 [pdf, other]
Title: SNAKE: Shape-aware Neural 3D Keypoint Field
Comments: Accepted by NeurIPS 2022. Codes are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[125]  arXiv:2206.01733 [pdf, other]
Title: Adversarial RAW: Image-Scaling Attack Against Imaging Pipeline
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[126]  arXiv:2206.01734 [pdf, ps, other]
Title: Using UAS Imagery and Computer Vision to Support Site-Specific Weed Control in Corn
Comments: 16 Figures, 3 Tables,. arXiv admin note: substantial text overlap with arXiv:2204.12417
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[127]  arXiv:2206.01772 [pdf, other]
Title: Radar Guided Dynamic Visual Attention for Resource-Efficient RGB Object Detection
Comments: Accepted in International Joint Conference on Neural Networks (IJCNN) 2022
Journal-ref: 2022 International Joint Conference on Neural Networks (IJCNN)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[128]  arXiv:2206.01777 [pdf, other]
Title: Real-Time Super-Resolution for Real-World Images on Mobile Devices
Comments: arXiv admin note: text overlap with arXiv:2004.13674
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[129]  arXiv:2206.01794 [pdf, other]
Title: Additive MIL: Intrinsically Interpretable Multiple Instance Learning for Pathology
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[130]  arXiv:2206.01813 [pdf, other]
Title: Learning sRGB-to-Raw-RGB De-rendering with Content-Aware Metadata
Comments: CVPR 2022 (GitHub: this https URL)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[131]  arXiv:2206.01821 [pdf, other]
Title: EAANet: Efficient Attention Augmented Convolutional Networks
Comments: 8 pages, 4 figures. Not published
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[132]  arXiv:2206.01831 [pdf, other]
Title: Spatial Feature Mapping for 6DoF Object Pose Estimation
Comments: Pattern Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133]  arXiv:2206.01841 [pdf, other]
Title: Coffee Roast Intelligence
Comments: 6 pages, 13 figures, 3 tables, this work was presented at the CSC498 COMPUTER SCIENCE CAPSTONE PROJECT I and CSC499 COMPUTER SCIENCE CAPSTONE PROJECT II courses
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[134]  arXiv:2206.01843 [pdf, other]
Title: Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[135]  arXiv:2206.01863 [pdf, other]
Title: Recursive Deformable Image Registration Network with Mutual Attention
Comments: arXiv admin note: text overlap with arXiv:2203.04290
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[136]  arXiv:2206.01867 [pdf, other]
Title: SPGNet: Spatial Projection Guided 3D Human Pose Estimation in Low Dimensional Space
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137]  arXiv:2206.01881 [pdf, other]
Title: Face Recognition Accuracy Across Demographics: Shining a Light Into the Problem
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[138]  arXiv:2206.01884 [pdf, ps, other]
Title: A Superimposed Divide-and-Conquer Image Recognition Method for SEM Images of Nanoparticles on The Surface of Monocrystalline silicon with High Aggregation Degree
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[139]  arXiv:2206.01908 [pdf, other]
Title: Video-based Human-Object Interaction Detection from Tubelet Tokens
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140]  arXiv:2206.01910 [pdf, other]
Title: The Spike Gating Flow: A Hierarchical Structure Based Spiking Neural Network for Online Gesture Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[141]  arXiv:2206.01916 [pdf, other]
Title: Nerfels: Renderable Neural Codes for Improved Camera Pose Estimation
Comments: Published at CVPRW with supplementary material
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[142]  arXiv:2206.01923 [pdf, other]
Title: From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[143]  arXiv:2206.01942 [pdf, other]
Title: Occlusion-Resistant Instance Segmentation of Piglets in Farrowing Pens Using Center Clustering Network
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[144]  arXiv:2206.01961 [pdf, other]
Title: C$^3$Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in Colonoscopy
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[145]  arXiv:2206.01986 [pdf, other]
Title: Delving into the Openness of CLIP
Comments: Accepted by Findings of ACL 2023 (Long Paper). Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[146]  arXiv:2206.01988 [pdf, other]
Title: Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation
Comments: CVPR 2022 (Poster)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[147]  arXiv:2206.01992 [pdf, other]
Title: CAINNFlow: Convolutional block Attention modules and Invertible Neural Networks Flow for anomaly detection and localization tasks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[148]  arXiv:2206.01999 [pdf, other]
Title: MSR: Making Self-supervised learning Robust to Aggressive Augmentations
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[149]  arXiv:2206.02002 [pdf, other]
Title: CVNets: High Performance Library for Computer Vision
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[150]  arXiv:2206.02015 [pdf, other]
Title: APES: Articulated Part Extraction from Sprite Sheets
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[151]  arXiv:2206.02027 [pdf, other]
Title: Implicit Neural Representation for Mesh-Free Inverse Obstacle Scattering
Comments: 6 pages, 8 figures, to be published in 2022 Asilomar Conference on Signals, Systems, and Computers
Journal-ref: 2022 Asilomar Conference on Signals, Systems, and Computers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[152]  arXiv:2206.02029 [pdf, other]
Title: Guided Deep Metric Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[153]  arXiv:2206.02050 [pdf, other]
Title: Learning Speaker-specific Lip-to-Speech Generation
Comments: Accepted at ICPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[154]  arXiv:2206.02066 [pdf, other]
Title: PIDNet: A Real-time Semantic Segmentation Network Inspired by PID Controllers
Comments: 11 pages, 9 figures; This paper will be published by CVPR2023 soon, please refer to the official version then
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[155]  arXiv:2206.02070 [pdf, other]
Title: Priors in Deep Image Restoration and Enhancement: A Survey
Comments: Preprint. Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[156]  arXiv:2206.02082 [pdf, other]
Title: Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Comments: To appear in CVPR 2023; The code will be released at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157]  arXiv:2206.02086 [pdf, other]
Title: Towards the Creation of a Nutrition and Food Group Based Image Database
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[158]  arXiv:2206.02087 [pdf, other]
Title: Accurate Scoliosis Vertebral Landmark Localization on X-ray Images via Shape-constrained Multi-stage Cascaded CNNs
Comments: 9 pages, submitted to IEEE Journal of Biomedical and Health Informatics
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[159]  arXiv:2206.02099 [pdf, other]
Title: Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation
Comments: CVPR 2022; Our model ranks 1st on Waymo and SemanticKITTI (single-scan) challenges, and ranks 3rd on SemanticKITTI (multi-scan) challenge; Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160]  arXiv:2206.02104 [pdf, other]
Title: ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[161]  arXiv:2206.02110 [pdf, other]
Title: Computer Vision-based Characterization of Large-scale Jet Flames using a Synthetic Infrared Image Generation Approach
Comments: Pre-print submitted to Engineering Science and Technology, an International Journal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[162]  arXiv:2206.02116 [pdf, other]
Title: Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in Videos
Comments: Accepted to CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[163]  arXiv:2206.02118 [pdf, other]
Title: ShapePU: A New PU Learning Framework Regularized by Global Consistency for Scribble Supervised Cardiac Segmentation
Comments: 11 pages,4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164]  arXiv:2206.02120 [pdf, other]
Title: MPANet: Multi-Patch Attention For Infrared Small Target object Detection
Comments: 4 pages 3 figures
Journal-ref: 2022IGARSS
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[165]  arXiv:2206.02136 [pdf, other]
Title: LDRNet: Enabling Real-time Document Localization on Mobile Devices
Comments: ECML-PKDD 2022 this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
[166]  arXiv:2206.02146 [pdf, other]
Title: Recurrent Video Restoration Transformer with Guided Deformable Attention
Comments: Accepted by NeurIPS 2022. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[167]  arXiv:2206.02153 [pdf, other]
Title: HPGNN: Using Hierarchical Graph Neural Networks for Outdoor Point Cloud Processing
Comments: Accepted for ICPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[168]  arXiv:2206.02158 [pdf, other]
Title: Vanilla Feature Distillation for Improving the Accuracy-Robustness Trade-Off in Adversarial Training
Comments: 12 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[169]  arXiv:2206.02163 [pdf, other]
Title: MotionCNN: A Strong Baseline for Motion Prediction in Autonomous Driving
Comments: CVPR Workshop on Autonomous Driving 2021. Waymo Motion Prediction Challenge 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[170]  arXiv:2206.02180 [pdf, other]
Title: Semi-Supervised Learning for Mars Imagery Classification and Segmentation
Comments: Accepted by ACM Trans. on Multimedia Computing Communications and Applications (TOMM)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[171]  arXiv:2206.02187 [pdf, other]
Title: M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation
Comments: Accepted for publication in the 5th Multimodal Learning and Applications (MULA) Workshop at CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[172]  arXiv:2206.02194 [pdf, other]
Title: FOF: Learning Fourier Occupancy Field for Monocular Real-time Human Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[173]  arXiv:2206.02200 [pdf, other]
Title: GridShift: A Faster Mode-seeking Algorithm for Image Segmentation and Object Tracking
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[174]  arXiv:2206.02203 [pdf, ps, other]
Title: 3D Convolutional with Attention for Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175]  arXiv:2206.02220 [pdf, other]
Title: U(1) Symmetry-breaking Observed in Generic CNN Bottleneck Layers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[176]  arXiv:2206.02234 [pdf, other]
Title: Two Decades of Bengali Handwritten Digit Recognition: A Survey
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. 38 pages, 23 figures, 12 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177]  arXiv:2206.02257 [pdf, other]
Title: Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178]  arXiv:2206.02260 [pdf, other]
Title: SealID: Saimaa ringed seal re-identification dataset
Comments: 15 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Populations and Evolution (q-bio.PE)
[179]  arXiv:2206.02261 [pdf, other]
Title: Towards Individual Grevy's Zebra Identification via Deep 3D Fitting and Metric Learning
Comments: 4 pages, 5 figures, 1 table; typos corrected, references updated
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[180]  arXiv:2206.02270 [pdf, other]
Title: Estimating building energy efficiency from street view imagery, aerial imagery, and land surface temperature data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[181]  arXiv:2206.02281 [pdf, other]
Title: E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[182]  arXiv:2206.02288 [pdf, other]
Title: ACT: Semi-supervised Domain-adaptive Medical Image Segmentation with Asymmetric Co-training
Comments: MICCAI 2022 (early accept)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183]  arXiv:2206.02295 [pdf, other]
Title: HIFI-Net: A Novel Network for Enhancement to Underwater Images
Comments: 7 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[184]  arXiv:2206.02307 [pdf, other]
Title: Bootstrapping Semi-supervised Medical Image Segmentation with Anatomical-aware Contrastive Distillation
Comments: Accepted at Information Processing in Medical Imaging (IPMI 2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[185]  arXiv:2206.02325 [pdf, other]
Title: Evaluation-oriented Knowledge Distillation for Deep Face Recognition
Comments: CVPR2022 Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[186]  arXiv:2206.02327 [pdf, other]
Title: JigsawHSI: a network for Hyperspectral Image classification
Comments: 7 pages, 7 figures, not peer reviewed
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[187]  arXiv:2206.02331 [pdf, ps, other]
Title: MASNet:Improve Performance of Siamese Networks with Mutual-attention for Remote Sensing Change Detection Tasks
Comments: XXIV ISPRS Congress
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[188]  arXiv:2206.02338 [pdf, other]
Title: OrdinalCLIP: Learning Rank Prompts for Language-Guided Ordinal Regression
Comments: Accepted by NeurIPS2022. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[189]  arXiv:2206.02342 [pdf, other]
Title: WHU-Stereo: A Challenging Benchmark for Stereo Matching of High-Resolution Satellite Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[190]  arXiv:2206.02343 [pdf, other]
Title: Contrastive Graph Multimodal Model for Text Classification in Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[191]  arXiv:2206.02345 [pdf, other]
Title: Anomaly Detection with Test Time Augmentation and Consistency Evaluation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[192]  arXiv:2206.02349 [pdf, other]
Title: Invariant Grounding for Video Question Answering
Comments: CVPR2022 Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[193]  arXiv:2206.02355 [pdf, other]
Title: Relation Matters: Foreground-aware Graph-based Relational Reasoning for Domain Adaptive Object Detection
Comments: Accepted by IEEE T-PAMI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[194]  arXiv:2206.02366 [pdf, other]
Title: Scan2Part: Fine-grained and Hierarchical Part-level Understanding of Real-World 3D Scans
Comments: In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[195]  arXiv:2206.02373 [pdf, other]
Title: Sports Re-ID: Improving Re-Identification Of Players In Broadcast Videos Of Team Sports
Authors: Bharath Comandur
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[196]  arXiv:2206.02374 [pdf, other]
Title: CorticalFlow: A Diffeomorphic Mesh Deformation Module for Cortical Surface Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[197]  arXiv:2206.02377 [pdf, other]
Title: BInGo: Bayesian Intrinsic Groupwise Registration via Explicit Hierarchical Disentanglement
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[198]  arXiv:2206.02392 [pdf, ps, other]
Title: Semi-Supervised Segmentation of Mitochondria from Electron Microscopy Images Using Spatial Continuity
Comments: 4 pages of main text, 5 pages of supplementary material and 1 page of references
Journal-ref: 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI). IEEE, 2022: 1-5
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[199]  arXiv:2206.02405 [pdf, other]
Title: Image Protection for Robust Cropping Localization and Recovery
Comments: Accepted by IEEE ICME 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[200]  arXiv:2206.02424 [pdf, ps, other]
Title: Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehicles
Comments: 18 pages, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[201]  arXiv:2206.02452 [pdf, other]
Title: Universal Photometric Stereo Network using Global Lighting Contexts
Authors: Satoshi Ikehata
Comments: Accepted to CVPR2022. Code and Dataset at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[202]  arXiv:2206.02454 [pdf, other]
Title: What do CNNs Learn in the First Layer and Why? A Linear Systems Perspective
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[203]  arXiv:2206.02498 [pdf, other]
Title: NORPPA: NOvel Ringed seal re-identification by Pelage Pattern Aggregation
Comments: 22 pages, 13 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[204]  arXiv:2206.02502 [pdf, other]
Title: BehavePassDB: Public Database for Mobile Behavioral Biometrics and Benchmark Evaluation
Comments: 11 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[205]  arXiv:2206.02531 [pdf, other]
Title: 3D-Augmented Contrastive Knowledge Distillation for Image-based Object Pose Estimation
Comments: Accepted for presentation at International Conference on Multimedia Retrieval (ICMR '22)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[206]  arXiv:2206.02539 [pdf, other]
Title: Robustness Evaluation and Adversarial Training of an Instance Segmentation Model
Comments: 15 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[207]  arXiv:2206.02544 [pdf, other]
Title: RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene Generation
Comments: Accepted at the IEEE Winter Conference on Applications of Computer Vision, WACV 2022
Journal-ref: 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022, pp. 2723-2732
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[208]  arXiv:2206.02547 [pdf, ps, other]
Title: Towards retrieving dispersion profiles using quantum-mimic Optical Coherence Tomography and Machine Learnin
Comments: 11 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
[209]  arXiv:2206.02559 [pdf, other]
Title: Conversation Group Detection With Spatio-Temporal Context
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[210]  arXiv:2206.02564 [pdf, other]
Title: Machine Learning for Detection of 3D Features using sparse X-ray data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Data Analysis, Statistics and Probability (physics.data-an)
[211]  arXiv:2206.02573 [pdf, other]
Title: Team VI-I2R Technical Report on EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[212]  arXiv:2206.02598 [pdf, other]
Title: [Reproducibility Report] Explainable Deep One-Class Classification
Comments: Submitted to the ML Reproducibility Challenge 2021 Fall
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[213]  arXiv:2206.02609 [pdf, other]
Title: Real-World Image Super-Resolution by Exclusionary Dual-Learning
Comments: IEEE TMM 2022; Considering large volume of RealSR datasets, a multi-dataset sampling scheme is developed
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[214]  arXiv:2206.02619 [pdf, other]
Title: VPIT: Real-time Embedded Single Object 3D Tracking Using Voxel Pseudo Images
Comments: 10 pages, 5 figures, 4 tables. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[215]  arXiv:2206.02622 [pdf, other]
Title: Hardware-accelerated Mars Sample Localization via deep transfer learning from photorealistic simulations
Comments: Preprint version only. Final version at IEEE Xplore. Accepted for IEEE Robotics and Automation Letters
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[216]  arXiv:2206.02647 [pdf, other]
Title: Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning
Comments: Accepted to CVPR 2022 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[217]  arXiv:2206.02664 [pdf, other]
Title: Learning with Capsules: A Survey
Comments: 29 pages, 43 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[218]  arXiv:2206.02680 [pdf, other]
Title: Separable Self-attention for Mobile Vision Transformers
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[219]  arXiv:2206.02714 [pdf, other]
Title: FuSS: Fusing Superpixels for Improved Segmentation Consistency
Comments: submitted to IEEEACCESS. 19 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[220]  arXiv:2206.02715 [pdf, other]
Title: Day-to-Night Image Synthesis for Training Nighttime Neural ISPs
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[221]  arXiv:2206.02717 [pdf, other]
Title: Scene Aware Person Image Generation through Global Contextual Conditioning
Comments: Accepted in The International Conference on Pattern Recognition (ICPR) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[222]  arXiv:2206.02721 [pdf, other]
Title: Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clustering
Authors: Yongyi Su, Xun Xu, Kui Jia
Comments: NeurIPS 2022 accepted paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[223]  arXiv:2206.02735 [pdf, other]
Title: People Tracking in Panoramic Video for Guiding Robots
Comments: Accepted to 17th International Conference on Intelligent Autonomous Systems (IAS-17)
Journal-ref: Proceedings of the 17th International Conference on Intelligent Autonomous Systems (IAS 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[224]  arXiv:2206.02749 [pdf, other]
Title: CORE: Consistent Representation Learning for Face Forgery Detection
Comments: Accepted by CVPRW 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[225]  arXiv:2206.02761 [pdf, other]
Title: Dual Decomposition of Convex Optimization Layers for Consistent Attention in Medical Images
Comments: 12 pages, 5 figures. In proceedings of the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022. Copyright 2022 by the author(s)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[226]  arXiv:2206.02770 [pdf, other]
Title: Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[227]  arXiv:2206.02776 [pdf, other]
Title: Volumetric Disentanglement for 3D Scene Manipulation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[228]  arXiv:2206.02777 [pdf, other]
Title: Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[229]  arXiv:2206.02779 [pdf, other]
Title: Blended Latent Diffusion
Comments: Accepted to SIGGRAPH 2023. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[230]  arXiv:2206.02780 [pdf, other]
Title: GenSDF: Two-Stage Learning of Generalizable Signed Distance Functions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[231]  arXiv:2206.02846 [pdf, other]
Title: A Deeper Dive Into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic Information
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[232]  arXiv:2206.02850 [pdf, other]
Title: GLF-CR: SAR-Enhanced Cloud Removal with Global-Local Fusion
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[233]  arXiv:2206.02876 [pdf, other]
Title: SpikiLi: A Spiking Simulation of LiDAR based Real-time Object Detection for Autonomous Driving
Comments: Accepted at Workshop on Event Sensing and Neuromorphic Engineering - 8th International Conference on Event-based Control, Communication, and Signal Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[234]  arXiv:2206.02903 [pdf, other]
Title: Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps
Comments: CVPR 2022 Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[235]  arXiv:2206.02912 [pdf, ps, other]
Title: Learning Image Representations for Content Based Image Retrieval of Radiotherapy Treatment Plans
Subjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[236]  arXiv:2206.02967 [pdf, other]
Title: Masked Unsupervised Self-training for Label-free Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[237]  arXiv:2206.02977 [pdf, other]
Title: DETR++: Taming Your Multi-Scale Detection Transformer
Comments: T4V: Transformers for Vision workshop @ CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[238]  arXiv:2206.02985 [pdf, other]
Title: Structured Context Transformer for Generic Event Boundary Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[239]  arXiv:2206.02997 [pdf, ps, other]
Title: TadML: A fast temporal action detection with Mechanics-MLP
Comments: 8 pages,3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[240]  arXiv:2206.03001 [pdf, other]
Title: PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Comments: arXiv admin note: text overlap with arXiv:2109.03144
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[241]  arXiv:2206.03010 [pdf, other]
Title: MS-RNN: A Flexible Multi-Scale Framework for Spatiotemporal Predictive Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[242]  arXiv:2206.03012 [pdf, other]
Title: TriBYOL: Triplet BYOL for Self-Supervised Representation Learning
Comments: Published as a conference paper at ICASSP 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[243]  arXiv:2206.03014 [pdf, other]
Title: The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation
Comments: Accepted by CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[244]  arXiv:2206.03017 [pdf, other]
Title: Development of Automatic Endotracheal Tube and Carina Detection on Portable Supine Chest Radiographs using Artificial Intelligence
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[245]  arXiv:2206.03033 [pdf, other]
Title: Deep Learning Techniques for Visual Counting
Authors: Luca Ciampi
Comments: Version with high-quality images can be found at this https URL arXiv admin note: text overlap with arXiv:1802.03601, arXiv:1707.01202, arXiv:1809.02165, arXiv:1901.06026, arXiv:1808.01244 by other authors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[246]  arXiv:2206.03048 [pdf, other]
Title: Layered Depth Refinement with Mask Guidance
Comments: Accepted to CVPR 2022 (camera-ready version)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[247]  arXiv:2206.03061 [pdf, other]
Title: Spatial Parsing and Dynamic Temporal Pooling networks for Human-Object Interaction detection
Comments: Accepted by IJCNN2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[248]  arXiv:2206.03062 [pdf, other]
Title: Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map
Comments: 9 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[249]  arXiv:2206.03064 [pdf, other]
Title: A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector
Comments: Accepted By WACV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[250]  arXiv:2206.03086 [pdf, other]
Title: Online Deep Clustering with Video Track Consistency
Comments: Accepted at ICPR2022 as oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[251]  arXiv:2206.03087 [pdf, other]
Title: Critical Regularizations for Neural Surface Reconstruction in the Wild
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[252]  arXiv:2206.03105 [pdf, other]
Title: Dual Swin-Transformer based Mutual Interactive Network for RGB-D Salient Object Detection
Authors: Chao Zeng, Sam Kwong
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[253]  arXiv:2206.03111 [pdf, other]
Title: Medical Image Registration via Neural Fields
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[254]  arXiv:2206.03113 [pdf, other]
Title: Wavelet Prior Attention Learning in Axial Inpainting Network
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[255]  arXiv:2206.03149 [pdf, other]
Title: Self-Training of Handwritten Word Recognition for Synthetic-to-Real Adaptation
Comments: Accepted for publication in International Conference on Pattern Recognition (ICPR) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[256]  arXiv:2206.03164 [pdf, other]
Title: Utility of Equivariant Message Passing in Cortical Mesh Segmentation
Comments: 13 pages, 3 figures, accepted for MIUA 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[257]  arXiv:2206.03196 [pdf, other]
Title: Improving Image Captioning with Control Signal of Sentence Quality
Authors: Zhangzi Zhu, Hong Qu
Comments: Accepted by ICASSP2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[258]  arXiv:2206.03207 [pdf, other]
Title: Omnivision forecasting: combining satellite observations with sky images for improved intra-hour solar energy predictions
Comments: Submitted to Renewable Energy
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[259]  arXiv:2206.03210 [pdf, other]
Title: Deep Neural Patchworks: Coping with Large Segmentation Tasks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[260]  arXiv:2206.03287 [pdf, other]
Title: NeMF: Neural Motion Fields for Kinematic Animation
Comments: Accepted to NeurIPS 2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[261]  arXiv:2206.03361 [pdf, other]
Title: Hierarchical Similarity Learning for Aliasing Suppression Image Super-Resolution
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[262]  arXiv:2206.03367 [pdf, other]
Title: Localizing Semantic Patches for Accelerating Image Classification
Comments: Accepted by ICME-2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[263]  arXiv:2206.03368 [pdf, other]
Title: IL-MCAM: An interactive learning and multi-channel attention mechanism-based weakly supervised colorectal histopathology image classification approach
Journal-ref: Computers in Biology and Medicine, Volume 143, April 2022, 105265
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[264]  arXiv:2206.03373 [pdf, other]
Title: Garment Avatars: Realistic Cloth Driving using Pattern Registration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[265]  arXiv:2206.03410 [pdf, other]
Title: Fast and Robust Non-Rigid Registration Using Accelerated Majorization-Minimization
Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[266]  arXiv:2206.03428 [pdf, other]
Title: Revealing Single Frame Bias for Video-and-Language Learning
Comments: 19 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[267]  arXiv:2206.03429 [pdf, other]
Title: Generating Long Videos of Dynamic Scenes
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[268]  arXiv:2206.03431 [pdf, other]
Title: Self-supervised Domain Adaptation in Crowd Counting
Comments: Accepted at ICIP 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[269]  arXiv:2206.03452 [pdf, other]
Title: Can CNNs Be More Robust Than Transformers?
Comments: ICLR2023. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[270]  arXiv:2206.03461 [pdf, other]
Title: Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM)
[271]  arXiv:2206.03480 [pdf, other]
Title: SHRED: 3D Shape Region Decomposition with Learned Local Operations
Comments: SIGGRAPH ASIA 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[272]  arXiv:2206.03484 [pdf, other]
Title: Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
Comments: CVPR camera ready
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[273]  arXiv:2206.03544 [pdf, other]
Title: A Penny for Your (visual) Thoughts: Self-Supervised Reconstruction of Natural Movies from Brain Activity
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[274]  arXiv:2206.03591 [pdf, other]
Title: ObPose: Leveraging Pose for Object-Centric Scene Inference and Generation in 3D
Comments: 14 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[275]  arXiv:2206.03600 [pdf, other]
Title: OneRing: A Simple Method for Source-free Open-partial Domain Adaptation
Comments: Updated. It only focuses on source-free open-partial domain adaptation, to avoid any potential misunderstanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[276]  arXiv:2206.03612 [pdf, other]
Title: Predictive Modeling of Charge Levels for Battery Electric Vehicles using CNN EfficientNet and IGTD Algorithm
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Signal Processing (eess.SP)
[277]  arXiv:2206.03657 [pdf, other]
Title: Delving into the Pre-training Paradigm of Monocular 3D Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[278]  arXiv:2206.03661 [pdf, other]
Title: One Hyper-Initializer for All Network Architectures in Medical Image Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[279]  arXiv:2206.03666 [pdf, other]
Title: Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and Tracking
Journal-ref: ICRA2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[280]  arXiv:2206.03673 [pdf, other]
Title: Unsupervised Learning of 3D Scene Flow from Monocular Camera
Comments: ICRA2021
Journal-ref: 2021 IEEE International Conference on Robotics and Automation (ICRA)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[281]  arXiv:2206.03678 [pdf, other]
Title: UHD Image Deblurring via Multi-scale Cubic-Mixer
Comments: 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[282]  arXiv:2206.03680 [pdf, other]
Title: Improving Evaluation of Debiasing in Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[283]  arXiv:2206.03687 [pdf, other]
Title: A Unified Model for Multi-class Anomaly Detection
Comments: Accepted by NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[284]  arXiv:2206.03691 [pdf, other]
Title: Robust Deep Ensemble Method for Real-world Image Denoising
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[285]  arXiv:2206.03697 [pdf, other]
Title: Blind Face Restoration: Benchmark Datasets and a Baseline Model
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[286]  arXiv:2206.03698 [pdf, other]
Title: What do we learn? Debunking the Myth of Unsupervised Outlier Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[287]  arXiv:2206.03727 [pdf, other]
Title: Wavelet Regularization Benefits Adversarial Training
Comments: Preprint version
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[288]  arXiv:2206.03740 [pdf, other]
Title: Large Loss Matters in Weakly Supervised Multi-Label Classification
Comments: CVPR 2022. First two authors contributed equally
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[289]  arXiv:2206.03753 [pdf, other]
Title: Task Agnostic Restoration of Natural Video Dynamics
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[290]  arXiv:2206.03775 [pdf, other]
Title: PixSelect: Less but Reliable Pixels for Accurate and Efficient Localization
Journal-ref: IEEE International Conference on Robotics and Automation (ICRA), May 23-27, 2022. Philadelphia, PA, USA, p 4156-4162
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[291]  arXiv:2206.03778 [pdf, other]
Title: Learning Digital Terrain Models from Point Clouds: ALS2DTM Dataset and Rasterization-based GAN
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[292]  arXiv:2206.03789 [pdf, other]
Title: Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Comments: Accepted by CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[293]  arXiv:2206.03799 [pdf, other]
Title: Dyna-DM: Dynamic Object-aware Self-supervised Monocular Depth Maps
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[294]  arXiv:2206.03820 [pdf, ps, other]
Title: SUPER-IVIM-DC: Intra-voxel incoherent motion based Fetal lung maturity assessment from limited DWI data using supervised learning coupled with data-consistency
Comments: Accepted to the International Conference on Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, to be held during Sept 18-22 in Singapore
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
[295]  arXiv:2206.03858 [pdf, other]
Title: Rotation-Equivariant Conditional Spherical Neural Fields for Learning a Natural Illumination Prior
Comments: NeurIPS 2022 - Project Website: jadgardner.github.io/RENI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[296]  arXiv:2206.03860 [pdf, other]
Title: Orthonormal Convolutions for the Rotation Based Iterative Gaussianization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[297]  arXiv:2206.03862 [pdf, other]
Title: Perceptual Quality Assessment for Fine-Grained Compressed Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[298]  arXiv:2206.03876 [pdf, other]
Title: Progressive GANomaly: Anomaly detection with progressively growing GANs
Comments: SPIE Medical Imaging 2022: Image Processing conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[299]  arXiv:2206.03888 [pdf, other]
Title: ConFUDA: Contrastive Fewshot Unsupervised Domain Adaptation for Medical Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[300]  arXiv:2206.03891 [pdf, other]
Title: PrivHAR: Recognizing Human Actions From Privacy-preserving Lens
Comments: Oral paper presented at European Conference on Computer Vision (ECCV) 2022, in Tel Aviv, Israel
Journal-ref: Computer Vision--ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23--27, 2022, Proceedings, Part IV
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[301]  arXiv:2206.03928 [pdf, other]
Title: Direct Triangulation with Spherical Projection for Omnidirectional Cameras
Authors: Ciarán Eising
Comments: 8 pages, 4 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[302]  arXiv:2206.03939 [pdf, other]
Title: Depth-Adapted CNNs for RGB-D Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[303]  arXiv:2206.03943 [pdf, other]
Title: Robust Environment Perception for Automated Driving: A Unified Learning Pipeline for Visual-Infrared Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[304]  arXiv:2206.03970 [pdf, other]
Title: Narrowing the Coordinate-frame Gap in Behavior Prediction Models: Distillation for Efficient and Accurate Scene-centric Motion Forecasting
Comments: Accepted at ICRA 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[305]  arXiv:2206.04003 [pdf, other]
Title: Patch-based Object-centric Transformers for Efficient Video Generation
Comments: Project Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[306]  arXiv:2206.04028 [pdf, other]
Title: CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving
Comments: Pre-trained backbones and fine-tuned downstream models are now available: this https URL Code will be released
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[307]  arXiv:2206.04029 [pdf, other]
Title: Accelerating Score-based Generative Models for High-Resolution Image Synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[308]  arXiv:2206.04040 [pdf, other]
Title: MobileOne: An Improved One millisecond Mobile Backbone
Comments: Accepted at CVPR 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[309]  arXiv:2206.04042 [pdf, other]
Title: Learning Ego 3D Representation as Ray Tracing
Comments: ECCV 2022. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[310]  arXiv:2206.04046 [pdf, other]
Title: Sparse Mixture-of-Experts are Domain Generalizable Learners
Comments: ICLR 2023 (accepted as Oral presentation)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[311]  arXiv:2206.04124 [pdf, other]
Title: DRHDR: A Dual branch Residual Network for Multi-Bracket High Dynamic Range Imaging
Comments: Accepted by CVPRW 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[312]  arXiv:2206.04125 [pdf, other]
Title: Towards Self-supervised and Weight-preserving Neural Architecture Search
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[313]  arXiv:2206.04158 [pdf, other]
Title: Texture Extraction Methods Based Ensembling Framework for Improved Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[314]  arXiv:2206.04170 [pdf, other]
Title: CASS: Cross Architectural Self-Supervision for Medical Image Analysis
Comments: (27 pages, 14 figures), Accepted at NeurIPS 2022 Workshop: Self-Supervised Learning - Theory and Practice
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[315]  arXiv:2206.04176 [pdf, other]
Title: VN-Transformer: Rotation-Equivariant Attention for Vector Neurons
Comments: Published in Transactions on Machine Learning Research (TMLR), 2023; Previous version appeared in Workshop on Machine Learning for Autonomous Driving, Conference on Neural Information Processing Systems (NeurIPS), 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[316]  arXiv:2206.04197 [pdf, other]
Title: SCAMPS: Synthetics for Camera Measurement of Physiological Signals
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[317]  arXiv:2206.04231 [pdf, other]
Title: JNMR: Joint Non-linear Motion Regression for Video Frame Interpolation
Comments: Accepted by IEEE Transactions on Image Processing (TIP)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[318]  arXiv:2206.04242 [pdf, other]
Title: OOD Augmentation May Be at Odds with Open-Set Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319]  arXiv:2206.04246 [pdf, other]
Title: SwinCheX: Multi-label classification on chest X-ray images with transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[320]  arXiv:2206.04271 [pdf, other]
Title: DeepVerge: Classification of Roadside Verge Biodiversity and Conservation Potential
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[321]  arXiv:2206.04281 [pdf, other]
Title: Local Spatiotemporal Representation Learning for Longitudinally-consistent Neuroimage Analysis
Comments: Accepted at NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[322]  arXiv:2206.04295 [pdf, other]
Title: Reconstruct Face from Features Using GAN Generator as a Distribution Constraint
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[323]  arXiv:2206.04325 [pdf, other]
Title: CFA: Coupled-hypersphere-based Feature Adaptation for Target-Oriented Anomaly Localization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[324]  arXiv:2206.04349 [pdf, other]
Title: Deep radiomic signature with immune cell markers predicts the survival of glioma patients
Journal-ref: Neurocomputing, Volume 469, 16 January 2022, Pages 366-375
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM); Methodology (stat.ME)
[325]  arXiv:2206.04365 [pdf, other]
Title: CARLA-GeAR: a Dataset Generator for a Systematic Evaluation of Adversarial Robustness of Vision Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[326]  arXiv:2206.04374 [pdf, other]
Title: Uncovering bias in the PlantVillage dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[327]  arXiv:2206.04381 [pdf, other]
Title: STIP: A SpatioTemporal Information-Preserving and Perception-Augmented Model for High-Resolution Video Prediction
Comments: This journal paper is extended from our previous work accepted in CVPR2022 and has been submitted to IEEE Transactions on Multimedia
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[328]  arXiv:2206.04382 [pdf, other]
Title: CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes
Comments: Accepted at ECCV 2022. [Project page] this https URL [Code] this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[329]  arXiv:2206.04399 [pdf, ps, other]
Title: Depression Recognition using Remote Photoplethysmography from Facial Videos
Comments: 10 pages, 5 figures, 8 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
[330]  arXiv:2206.04401 [pdf, other]
Title: Cross-modal Local Shortest Path and Global Enhancement for Visible-Thermal Person Re-Identification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[331]  arXiv:2206.04403 [pdf, other]
Title: VITA: Video Instance Segmentation via Object Token Association
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[332]  arXiv:2206.04406 [pdf, other]
Title: Unsupervised Learning of the Total Variation Flow
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[333]  arXiv:2206.04425 [pdf, other]
Title: Multiple Instance Learning for Digital Pathology: A Review on the State-of-the-Art, Limitations & Future Potential
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[334]  arXiv:2206.04449 [pdf, other]
Title: Segmentation Enhanced Lameness Detection in Dairy Cows from RGB and Depth Video
Comments: Accepted at the CV4Animals workshop in CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[335]  arXiv:2206.04452 [pdf, other]
Title: Draft-and-Revise: Effective Image Generation with Contextual RQ-Transformer
Comments: 20 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[336]  arXiv:2206.04453 [pdf, other]
Title: The Missing Link: Finding label relations across datasets
Comments: ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[337]  arXiv:2206.04479 [pdf, ps, other]
Title: BSM loss: A superior way in modeling aleatory uncertainty of fine_grained classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[338]  arXiv:2206.04503 [pdf, other]
Title: cycle text2face: cycle text-to-face gan via transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[339]  arXiv:2206.04511 [pdf, other]
Title: Efficient Human Pose Estimation via 3D Event Point Cloud
Comments: Accepted to 3DV 2022. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[340]  arXiv:2206.04531 [pdf, other]
Title: ECLAD: Extracting Concepts with Local Aggregated Descriptors
Comments: 34 pages, under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[341]  arXiv:2206.04557 [pdf, other]
Title: SparseFormer: Attention-based Depth Completion Network
Comments: Accepted at CV4ARVR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[342]  arXiv:2206.04558 [pdf, other]
Title: BFS-Net: Weakly Supervised Cell Instance Segmentation from Bright-Field Microscopy Z-Stacks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[343]  arXiv:2206.04575 [pdf, other]
Title: Transformer based Urdu Handwritten Text Optical Character Reader
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[344]  arXiv:2206.04584 [pdf, other]
Title: Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer
Comments: Tech report. Work in progress
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[345]  arXiv:2206.04590 [pdf, other]
Title: GASP: Gated Attention For Saliency Prediction
Comments: International Joint Conference on Artificial Intelligence (IJCAI-21)
Journal-ref: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (2021) 584-591
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[346]  arXiv:2206.04636 [pdf, other]
Title: Spatial Entropy as an Inductive Bias for Vision Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[347]  arXiv:2206.04655 [pdf, other]
Title: Towards Layer-wise Image Vectorization
Comments: Accepted as Oral Presentation at CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[348]  arXiv:2206.04656 [pdf, other]
Title: Simple Cues Lead to a Strong Multi-Object Tracker
Comments: Accepted to CVPR2023!
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[349]  arXiv:2206.04662 [pdf, other]
Title: DiSparse: Disentangled Sparsification for Multitask Model Compression
Comments: Accepted at CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[350]  arXiv:2206.04664 [pdf, other]
Title: On Data Scaling in Masked Image Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[351]  arXiv:2206.04665 [pdf, other]
Title: AGConv: Adaptive Graph Convolution on 3D Point Clouds
Comments: arXiv admin note: substantial text overlap with arXiv:2108.08035
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[352]  arXiv:2206.04667 [pdf, other]
Title: Extreme Masking for Learning Instance and Distributed Visual Representations
Comments: Accepted in TMLR
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353]  arXiv:2206.04668 [pdf, other]
Title: GateHUB: Gated History Unit with Background Suppression for Online Action Detection
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[354]  arXiv:2206.04669 [pdf, other]
Title: Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[355]  arXiv:2206.04670 [pdf, other]
Title: PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies
Comments: Accepted by NeurIPS'22. Code and models are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[356]  arXiv:2206.04671 [pdf, other]
Title: Open Challenges in Deep Stereo: the Booster Dataset
Comments: CVPR 2022, New Orleans. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[357]  arXiv:2206.04673 [pdf, other]
Title: Neural Prompt Search
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[358]  arXiv:2206.04674 [pdf, other]
Title: Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs
Comments: Code shall be released at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[359]  arXiv:2206.04783 [pdf, other]
Title: ReFace: Real-time Adversarial Attacks on Face Recognition Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[360]  arXiv:2206.04785 [pdf, other]
Title: Building Spatio-temporal Transformers for Egocentric 3D Pose Estimation
Comments: 4 pages, Extended abstract, Joint International Workshop on Egocentric Perception, Interaction and Computing (EPIC) and Ego4D, IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[361]  arXiv:2206.04790 [pdf, other]
Title: Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition
Comments: Accepted to ECCV-2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[362]  arXiv:2206.04797 [pdf, other]
Title: Memory-efficient model-based deep learning with convergence and robustness guarantees
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[363]  arXiv:2206.04831 [pdf, other]
Title: R4D: Utilizing Reference Objects for Long-Range Distance Estimation
Comments: ICLR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[364]  arXiv:2206.04846 [pdf, other]
Title: Masked Autoencoders are Robust Data Augmentors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[365]  arXiv:2206.04854 [pdf, other]
Title: Heterogeneous Face Recognition via Face Synthesis with Identity-Attribute Disentanglement
Comments: Accepted for publication in IEEE Transactions on Information Forensics and Security (TIFS)
Journal-ref: IEEE Transactions on Information Forensics and Security, vol. 17, pp. 1344-1358, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[366]  arXiv:2206.04863 [pdf, other]
Title: Symbolic image detection using scene and knowledge graphs
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[367]  arXiv:2206.04867 [pdf, other]
Title: The Gender Gap in Face Recognition Accuracy Is a Hairy Problem
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[368]  arXiv:2206.04874 [pdf, ps, other]
Title: The 1st Data Science for Pavements Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[369]  arXiv:2206.04879 [pdf, other]
Title: Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion
Comments: IEEE Transactions on Image Processing 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[370]  arXiv:2206.04901 [pdf, other]
Title: NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors
Comments: Hao-Kang Liu and I-Chao Shen contributed equally to the paper. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[371]  arXiv:2206.04906 [pdf, other]
Title: Out of Sight, Out of Mind: A Source-View-Wise Feature Aggregation for Multi-View Image-Based Rendering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[372]  arXiv:2206.04916 [pdf, other]
Title: PatchComplete: Learning Multi-Resolution Patch Priors for 3D Shape Completion on Unseen Categories
Comments: Video link: this https URL ; Project page: this https URL ; Accepted to NeurIPS'22
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[373]  arXiv:2206.04927 [pdf, other]
Title: Ego2HandsPose: A Dataset for Egocentric Two-hand 3D Global Pose Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[374]  arXiv:2206.04942 [pdf, other]
Title: Neural Template: Topology-aware Reconstruction and Disentangled Generation of 3D Meshes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[375]  arXiv:2206.04949 [pdf, other]
Title: Deep Multi-View Semi-Supervised Clustering with Sample Pairwise Constraints
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[376]  arXiv:2206.04958 [pdf, other]
Title: Self-Supervised Deep Subspace Clustering with Entropy-norm
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[377]  arXiv:2206.04975 [pdf, other]
Title: NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression Recognition
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[378]  arXiv:2206.04979 [pdf, ps, other]
Title: Convolutional layers are equivariant to discrete shifts but not continuous translations
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[379]  arXiv:2206.04981 [pdf, other]
Title: Positional Label for Self-Supervised Vision Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[380]  arXiv:2206.05028 [pdf, other]
Title: Spatial Cross-Attention Improves Self-Supervised Visual Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[381]  arXiv:2206.05039 [pdf, other]
Title: Image Generation with Multimodal Priors using Denoising Diffusion Probabilistic Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[382]  arXiv:2206.05099 [pdf, other]
Title: SimVP: Simpler yet Better Video Prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[383]  arXiv:2206.05102 [pdf, other]
Title: Saccade Mechanisms for Image Classification, Object Detection and Tracking
Comments: 4 Pages, 6 figures, will be presented at CVPR2022-NeuroVision workshop as a Lightning talk
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[384]  arXiv:2206.05127 [pdf, other]
Title: Globally-Optimal Contrast Maximisation for Event Cameras
Comments: arXiv admin note: substantial text overlap with arXiv:2203.03914
Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[385]  arXiv:2206.05128 [pdf, ps, other]
Title: Real-time Hyper-Dimensional Reconfiguration at the Edge using Hardware Accelerators
Comments: 9 pages, 15 figures. Will be presented in Embedded Vision Workshop at CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR)
[386]  arXiv:2206.05149 [pdf, other]
Title: Referring Image Matting
Comments: Accepted to CVPR2023. The dataset, code and models are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[387]  arXiv:2206.05158 [pdf, other]
Title: MEAT: Maneuver Extraction from Agent Trajectories
Comments: Accepted at IEEE Intelligent Vehicles Symposium (IV) 2022 2nd Workshop on Autonomy@Scale
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[388]  arXiv:2206.05159 [pdf, ps, other]
Title: An Image Processing Pipeline for Camera Trap Time-Lapse Recordings
Comments: 5 pages, 2 figures, presented at the CV4Animals workshop of CVIP2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[389]  arXiv:2206.05184 [pdf, other]
Title: SERE: Exploring Feature Self-relation for Self-supervised Transformer
Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
Journal-ref: 10.1109/TPAMI.2023.3309979
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[390]  arXiv:2206.05194 [pdf, other]
Title: Learning the Space of Deep Models
Comments: Accepted at ICPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[391]  arXiv:2206.05225 [pdf, other]
Title: ClamNet: Using contrastive learning with variable depth Unets for medical image segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392]  arXiv:2206.05252 [pdf, other]
Title: Lost in Transmission: On the Impact of Networking Corruptions on Video Machine Learning Models
Comments: 12 pages, 12 figures (with supplemental: 34 pages)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[393]  arXiv:2206.05253 [pdf, other]
Title: Rethinking Spatial Invariance of Convolutional Networks for Object Counting
Comments: Accepted to CVPR 2022, Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP)
[394]  arXiv:2206.05257 [pdf, other]
Title: Explaining Image Classifiers Using Contrastive Counterfactuals in Generative Latent Spaces
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[395]  arXiv:2206.05259 [pdf, other]
Title: Is Self-Supervised Learning More Robust Than Supervised Learning?
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[396]  arXiv:2206.05260 [pdf, other]
Title: Balanced Product of Calibrated Experts for Long-Tailed Recognition
Comments: Accepted at CVPR 2023, 19 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[397]  arXiv:2206.05275 [pdf, other]
Title: Spatial-temporal Concept based Explanation of 3D ConvNets
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[398]  arXiv:2206.05281 [pdf, other]
Title: Less Is More: Linear Layers on CLIP Features as Powerful VizWiz Model
Comments: VizWiz Grand Challenge: Describing Images and Videos Taken by Blind People (CVPR Workshop 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[399]  arXiv:2206.05282 [pdf, other]
Title: Learning to Estimate Shapley Values with Vision Transformers
Comments: ICLR 2023 camera-ready
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[400]  arXiv:2206.05291 [pdf, other]
Title: ProActive: Self-Attentive Temporal Point Process Flows for Activity Sequences
Comments: KDD 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[401]  arXiv:2206.05309 [pdf, ps, other]
Title: EigenFairing: 3D Model Fairing using Image Coherence
Comments: British Machine Vision Conference, BMVC 2004, Kingston, UK, September 7-9, 2004
Journal-ref: Proceedings of the British Machine Conference, pages 1-10, BMVA Press, September 2004
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[402]  arXiv:2206.05319 [pdf, other]
Title: Object Instance Identification in Dynamic Environments
Comments: Joint 1st Ego4D and 10th EPIC Workshop (EPIC@CVPR2022) Extended Abstract
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[403]  arXiv:2206.05375 [pdf, other]
Title: Generalizable Neural Radiance Fields for Novel View Synthesis with Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[404]  arXiv:2206.05377 [pdf, other]
Title: Fast building segmentation from satellite imagery and few local labels
Comments: Accepted at EarthVision 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[405]  arXiv:2206.05379 [pdf, other]
Title: A Benchmark for Compositional Visual Reasoning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[406]  arXiv:2206.05390 [pdf, other]
Title: Transformer-based Self-Supervised Fish Segmentation in Underwater Videos
Comments: 11 pages, 6 figures. Submitted to the journal, International Journal of Intelligent Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[407]  arXiv:2206.05394 [pdf, other]
Title: Applications of Deep Learning in Fish Habitat Monitoring: A Tutorial and Survey
Comments: 26 pages, 7 figures. Submitted to the journal, Expert Systems With Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[408]  arXiv:2206.05398 [pdf, other]
Title: E2PN: Efficient SE(3)-Equivariant Point Network
Comments: CVPR 2023, 16 pages. See this https URL for code
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[409]  arXiv:2206.05420 [pdf, other]
Title: VAC2: Visual Analysis of Combined Causality in Event Sequences
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[410]  arXiv:2206.05422 [pdf, other]
Title: Access Control of Semantic Segmentation Models Using Encrypted Feature Maps
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[411]  arXiv:2206.05424 [pdf, other]
Title: Precise Affordance Annotation for Egocentric Action Video Datasets
Comments: Technical report for CVPR 2022 EPIC-Ego4D Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[412]  arXiv:2206.05431 [pdf, other]
Title: Learned reconstruction methods with convergence guarantees
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[413]  arXiv:2206.05432 [pdf, ps, other]
Title: Luminance-Guided Chrominance Image Enhancement for HEVC Intra Coding
Comments: ISCAS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[414]  arXiv:2206.05488 [pdf, ps, other]
Title: Kaggle Kinship Recognition Challenge: Introduction of Convolution-Free Model to boost conventional
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[415]  arXiv:2206.05496 [pdf, other]
Title: An Evaluation of OCR on Egocentric Data
Comments: Extended Abstract, EPIC workshop at CVPR 22
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[416]  arXiv:2206.05498 [pdf, other]
Title: A Review of Causality for Learning Algorithms in Medical Image Analysis
Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) this https URL". ; Paper ID: 2022:028
Journal-ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); General Literature (cs.GL)
[417]  arXiv:2206.05514 [pdf, other]
Title: Toward Real-world Single Image Deraining: A New Benchmark and Beyond
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[418]  arXiv:2206.05520 [pdf, other]
Title: A Two-stage Method for Non-extreme Value Salt-and-Pepper Noise Removal
Comments: UESTC course project
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[419]  arXiv:2206.05539 [pdf, other]
Title: A Simplified Un-Supervised Learning Based Approach for Ink Mismatch Detection in Handwritten Hyper-Spectral Document Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[420]  arXiv:2206.05542 [pdf, other]
Title: Surround-View Cameras based Holistic Visual Perception for Automated Driving
Authors: Varun Ravi Kumar
Comments: Doctoral thesis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[421]  arXiv:2206.05617 [pdf, other]
Title: Federated Learning with Research Prototypes for Multi-Center MRI-based Detection of Prostate Cancer with Diverse Histopathology
Comments: under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Tissues and Organs (q-bio.TO)
[422]  arXiv:2206.05619 [pdf, other]
Title: Deep Learning Models for Automated Classification of Dog Emotional States from Facial Expressions
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[423]  arXiv:2206.05641 [pdf, ps, other]
Title: An Unsupervised Deep-Learning Method for Bone Age Assessment
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[424]  arXiv:2206.05648 [pdf, other]
Title: Indirect-Instant Attention Optimization for Crowd Counting in Dense Scenes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[425]  arXiv:2206.05651 [pdf, other]
Title: STD-NET: Search of Image Steganalytic Deep-learning Architecture via Hierarchical Tensor Decomposition
Comments: Submitted to IEEE T-DSC
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[426]  arXiv:2206.05683 [pdf, other]
Title: APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking
Comments: Neurips 2022 dataset and benchmark track
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[427]  arXiv:2206.05707 [pdf, other]
Title: DPCN++: Differentiable Phase Correlation Network for Versatile Pose Registration
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[428]  arXiv:2206.05708 [pdf, other]
Title: Narrowing the Gap: Improved Detector Training with Noisy Location Annotations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[429]  arXiv:2206.05712 [pdf, other]
Title: Graph-based Spatial Transformer with Memory Replay for Multi-future Pedestrian Trajectory Prediction
Comments: This paper has been accepted by CVPR 2022. Reference: Li, L., Pagnucco, M. and Song, Y., 2022. Graph-Based Spatial Transformer With Memory Replay for Multi-Future Pedestrian Trajectory Prediction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 2231-2241)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430]  arXiv:2206.05717 [pdf, other]
Title: Crowd Localization from Gaussian Mixture Scoped Knowledge and Scoped Teacher
Comments: Accepted by IEEE TIP
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[431]  arXiv:2206.05730 [pdf, other]
Title: Object Occlusion of Adding New Categories in Objection Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[432]  arXiv:2206.05737 [pdf, other]
Title: SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse Views
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[433]  arXiv:2206.05741 [pdf, other]
Title: Bootstrapping Multi-view Representations for Fake News Detection
Comments: Authors are from Fudan University, China. Under Review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[434]  arXiv:2206.05763 [pdf, other]
Title: SeATrans: Learning Segmentation-Assisted diagnosis model via Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[435]  arXiv:2206.05765 [pdf, other]
Title: A Semantic Consistency Feature Alignment Object Detection Model Based on Mixed-Class Distribution Metrics
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[436]  arXiv:2206.05810 [pdf, other]
Title: Analysis of Branch Specialization and its Application in Image Decomposition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[437]  arXiv:2206.05833 [pdf, other]
Title: COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition
Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[438]  arXiv:2206.05836 [pdf, other]
Title: GLIPv2: Unifying Localization and Vision-Language Understanding
Comments: NeurIPS 2022; updated with reviewers' comments addressed; Code is released at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[439]  arXiv:2206.05837 [pdf, other]
Title: NeuralODF: Learning Omnidirectional Distance Fields for 3D Shape Representation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[440]  arXiv:2206.05842 [pdf, ps, other]
Title: Efficiency Comparison of AI classification algorithms for Image Detection and Recognition in Real-time
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[441]  arXiv:2206.05844 [pdf, other]
Title: FisheyeEX: Polar Outpainting for Extending the FoV of Fisheye Lens
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[442]  arXiv:2206.05846 [pdf, other]
Title: InBiaseD: Inductive Bias Distillation to Improve Generalization and Robustness through Shape-awareness
Comments: Accepted at 1st Conference on Lifelong Learning Agents (CoLLAs 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[443]  arXiv:2206.05853 [pdf, other]
Title: Modeling Generalized Specialist Approach To Train Quality Resilient Snapshot Ensemble
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[444]  arXiv:2206.05866 [pdf, other]
Title: TC-SfM: Robust Track-Community-Based Structure-from-Motion
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[445]  arXiv:2206.05896 [pdf, other]
Title: Improve Ranking Correlation of Super-net through Training Scheme from One-shot NAS to Few-shot NAS
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[446]  arXiv:2206.05897 [pdf, other]
Title: $\texttt{GradICON}$: Approximate Diffeomorphisms via Gradient Inverse Consistency
Comments: 29 pages, 16 figures, CVPR 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[447]  arXiv:2206.05898 [pdf, other]
Title: Pixel to Binary Embedding Towards Robustness for CNNs
Comments: Accepted to ICPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[448]  arXiv:2206.05903 [pdf, other]
Title: Geometrically Guided Integrated Gradients
Comments: 19 pages, 23 figures, funding sources added
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[449]  arXiv:2206.05912 [pdf, other]
Title: INDIGO: Intrinsic Multimodality for Domain Generalization
Comments: Under Submission
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[450]  arXiv:2206.05927 [pdf, other]
Title: LinK3D: Linear Keypoints Representation for 3D LiDAR Point Cloud
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[451]  arXiv:2206.05962 [pdf, other]
Title: PRO-TIP: Phantom for RObust automatic ultrasound calibration by TIP detection
Comments: This preprint was submitted to MICCAI 2022. The Version of Record of this contribution will be published in Springer LNCS
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[452]  arXiv:2206.05963 [pdf, ps, other]
Title: ATDN vSLAM: An all-through Deep Learning-Based Solution for Visual Simultaneous Localization and Mapping
Comments: Published in Periodica Polytechnica Electrical Engineering 11 pages
Journal-ref: Periodica Polytechnica Electrical Engineering and Computer Science, 66(3), pp. 236-247, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[453]  arXiv:2206.05967 [pdf, other]
Title: GoToNet: Fast Monocular Scene Exposure and Exploration
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[454]  arXiv:2206.05970 [pdf, other]
Title: Hypernetwork-Based Adaptive Image Restoration
Comments: 5 pages, 5 Figures, ICASSP 2023
Journal-ref: ICASSP 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[455]  arXiv:2206.05981 [pdf, other]
Title: Efficient Human-in-the-loop System for Guiding DNNs Attention
Comments: 13 pages, 11 figures, proceeding of ACM IUI 2023, video this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[456]  arXiv:2206.05982 [pdf, other]
Title: Learning Fashion Compatibility from In-the-wild Images
Comments: Accepted to ICPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[457]  arXiv:2206.06014 [pdf, other]
Title: Exploring and Exploiting Hubness Priors for High-Quality GAN Latent Sampling
Comments: Accepted at ICML 2022. Our code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[458]  arXiv:2206.06023 [pdf, other]
Title: Virtual embeddings and self-consistency for self-supervised learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[459]  arXiv:2206.06067 [pdf, other]
Title: Better Teacher Better Student: Dynamic Prior Knowledge for Knowledge Distillation
Comments: ICLR'23 accepted
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[460]  arXiv:2206.06079 [pdf, other]
Title: OHM: GPU Based Occupancy Map Generation
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[461]  arXiv:2206.06100 [pdf, other]
Title: AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields
Authors: Takuhiro Kaneko
Comments: Accepted to CVPR 2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[462]  arXiv:2206.06103 [pdf, other]
Title: Learning Feature Disentanglement and Dynamic Fusion for Recaptured Image Forensic
Comments: Accepted by CVPR2022 workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[463]  arXiv:2206.06119 [pdf, other]
Title: Satellite-based high-resolution maps of cocoa planted area for Côte d'Ivoire and Ghana
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[464]  arXiv:2206.06120 [pdf, ps, other]
Title: Brain tumour segmentation with incomplete imaging data
Comments: 26 pages, 8 figures, 4 supplementary tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Tissues and Organs (q-bio.TO)
[465]  arXiv:2206.06122 [pdf, other]
Title: Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning
Comments: Accepted to NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[466]  arXiv:2206.06168 [pdf, other]
Title: 2nd Place Solution for ICCV 2021 VIPriors Image Classification Challenge: An Attract-and-Repulse Learning Approach
Comments: 2nd Place Solution for ICCV 2021 VIPriors Image Classification Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[467]  arXiv:2206.06177 [pdf, other]
Title: Transductive CLIP with Class-Conditional Contrastive Learning
Comments: Published in IEEE ICASSP 2022
Journal-ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[468]  arXiv:2206.06214 [pdf, other]
Title: Real-World Light Field Image Super-Resolution via Degradation Modulation
Comments: 15 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[469]  arXiv:2206.06219 [pdf, other]
Title: Making Sense of Dependence: Efficient Black-box Explanations Using Dependence Measure
Comments: Accepted to NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML); Other Statistics (stat.OT)
[470]  arXiv:2206.06252 [pdf, other]
Title: Transformer Lesion Tracker
Comments: Accepted MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[471]  arXiv:2206.06258 [pdf, other]
Title: Featurized Query R-CNN
Comments: Tech Report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[472]  arXiv:2206.06289 [pdf, other]
Title: Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation
Comments: Accepted by ICLR 2022 Workshop on Generalizable Policy Learning in Physical World. Top-performing systems for both no interaction and no restriction tracks in SAPIEN ManiSkill Challenge 2021. The source code and model are publicly available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Robotics (cs.RO)
[473]  arXiv:2206.06291 [pdf, other]
Title: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection
Comments: CVPR 2022; Code is publicly available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[474]  arXiv:2206.06292 [pdf, other]
Title: MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing
Comments: CVPR 2022; Code is publicly available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[475]  arXiv:2206.06293 [pdf, other]
Title: Learning Domain Adaptive Object Detection with Probabilistic Teacher
Comments: To appear in ICML 2022. Code is coming soon: this https URL
Journal-ref: International Conference on Machine Learning (ICML), 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[476]  arXiv:2206.06323 [pdf, other]
Title: Visual Transformer for Object Detection
Authors: Michael Yang
Comments: In preparation for short paper of conferences. I am using the name Michael Yang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[477]  arXiv:2206.06340 [pdf, other]
Title: SNeS: Learning Probably Symmetric Neural Surfaces from Incomplete Data
Comments: First two authors contributed equally
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[478]  arXiv:2206.06346 [pdf, ps, other]
Title: Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
Comments: Tech report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[479]  arXiv:2206.06359 [pdf, other]
Title: EnergyMatch: Energy-based Pseudo-Labeling for Semi-Supervised Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[480]  arXiv:2206.06360 [pdf, other]
Title: ARF: Artistic Radiance Fields
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[481]  arXiv:2206.06363 [pdf, other]
Title: Discovering Object Masks with Transformers for Unsupervised Semantic Segmentation
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[482]  arXiv:2206.06404 [pdf, other]
Title: Compositional Mixture Representations for Vision and Text
Comments: Workshop on Learning with Limited Labelled Data for Image and Video Understanding (L3D-IVU), CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[483]  arXiv:2206.06420 [pdf, other]
Title: GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation
Comments: Open Sourced
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[484]  arXiv:2206.06427 [pdf, other]
Title: A Multi-purpose Realistic Haze Benchmark with Quantifiable Haze Levels and Ground Truth
Comments: This paper has been ACCEPTED for publication as a REGULAR paper in the IEEE Transactions on Image Processing (TIP)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[485]  arXiv:2206.06430 [pdf, ps, other]
Title: A Training Method For VideoPose3D With Ideology of Action Recognition
Authors: Hao Bai
Comments: Published by IEEE, on conference CONF-SPML
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[486]  arXiv:2206.06435 [pdf, ps, other]
Title: ICP Algorithm: Theory, Practice And Its SLAM-oriented Taxonomy
Authors: Hao Bai
Comments: Accepted by CONF-CDS'22
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[487]  arXiv:2206.06461 [pdf, other]
Title: Self-Supervised Representation Learning With MUlti-Segmental Informational Coding (MUSIC)
Authors: Chuang Niu, Ge Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[488]  arXiv:2206.06466 [pdf, other]
Title: Revisiting the Shape-Bias of Deep Learning for Dermoscopic Skin Lesion Classification
Comments: Submitted preprint accepted for MIUA 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[489]  arXiv:2206.06481 [pdf, other]
Title: RigNeRF: Fully Controllable Neural 3D Portraits
Comments: The project page can be found here: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[490]  arXiv:2206.06484 [pdf, other]
Title: On Image Segmentation With Noisy Labels: Characterization and Volume Properties of the Optimal Solutions to Accuracy and Dice
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[491]  arXiv:2206.06487 [pdf, other]
Title: The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation
Comments: Accepted by ICLR 2023 (top-5%). The first three authors contribute equally. Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[492]  arXiv:2206.06488 [pdf, other]
Title: Multimodal Learning with Transformers: A Survey
Comments: This paper is accepted by IEEE TPAMI
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[493]  arXiv:2206.06490 [pdf, other]
Title: Learning Task-Independent Game State Representations from Unlabeled Images
Comments: Conference on Games (CoG) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[494]  arXiv:2206.06506 [pdf, other]
Title: Spiking Neural Networks for Frame-based and Event-based Single Object Localization
Comments: 21 pages, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[495]  arXiv:2206.06510 [pdf, other]
Title: Generalizable Method for Face Anti-Spoofing with Semi-Supervised Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[496]  arXiv:2206.06518 [pdf, other]
Title: Estimating Pose from Pressure Data for Smart Beds with Deep Image-based Pose Estimators
Comments: The version of record of this article, first published in Applied Intelligence, is available online at Publisher's website this https URL arXiv admin note: substantial text overlap with arXiv:1908.08919
Journal-ref: Applied Intelligence (2021): 1-15
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[497]  arXiv:2206.06533 [pdf, other]
Title: 3D scene reconstruction from monocular spherical video with motion parallax
Authors: Kenji Tanaka
Comments: 13 pages, 18 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[498]  arXiv:2206.06544 [pdf, ps, other]
Title: A Survey of Automated Data Augmentation Algorithms for Deep Learning-based Image Classification Tasks
Comments: 68 pages, 9 figures. Submitted to Knowledge and Information Systems (KAIS)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[499]  arXiv:2206.06607 [pdf, other]
Title: Plug-and-Play Pseudo Label Correction Network for Unsupervised Person Re-identification
Comments: 19 pages,9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[500]  arXiv:2206.06608 [pdf, other]
Title: Label Matching Semi-Supervised Object Detection
Comments: To appear in CVPR 2022. Code is coming soon: this https URL
Journal-ref: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[501]  arXiv:2206.06619 [pdf, other]
Title: TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer
Comments: arXiv admin note: text overlap with arXiv:2104.08541
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[502]  arXiv:2206.06620 [pdf, other]
Title: Slimmable Domain Adaptation
Comments: To appear in CVPR 2022. Code is coming soon: this https URL
Journal-ref: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[503]  arXiv:2206.06637 [pdf, other]
Title: RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks
Comments: Accepted by TPAMI. This paper is a journal extension of our CVPR 2021 paper (arXiv:2101.00910)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[504]  arXiv:2206.06640 [pdf, other]
Title: Confidence Score for Source-Free Unsupervised Domain Adaptation
Comments: ICML 2022 camera ready
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[505]  arXiv:2206.06665 [pdf, other]
Title: Online Easy Example Mining for Weakly-supervised Gland Segmentation from Histology Images
Comments: MICCAI 2022 Accepeted
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[506]  arXiv:2206.06694 [pdf, other]
Title: ISLES 2022: A multi-center magnetic resonance imaging stroke lesion segmentation dataset
Comments: 12 pages, 2 figures
Journal-ref: Scientific data 9.1 (2022): 762
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[507]  arXiv:2206.06712 [pdf, other]
Title: Visual Radial Basis Q-Network
Comments: This paper has been accepted for publication at the 3rd International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2022. \c{opyright}Springer Nature 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[508]  arXiv:2206.06714 [pdf, other]
Title: Interpretable Gait Recognition by Granger Causality
Comments: Preprint. Full paper accepted at the IEEE/IAPR International Conference on Pattern Recognition (ICPR), Montreal, Canada, August 2022. 7 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[509]  arXiv:2206.06715 [pdf, other]
Title: Semi-signed prioritized neural fitting for surface reconstruction from unoriented point clouds
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[510]  arXiv:2206.06731 [pdf, ps, other]
Title: Learning Dense Features for Point Cloud Registration Using a Graph Attention Network
Comments: 15 pages, 3 figures
Journal-ref: Applied Sciences 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[511]  arXiv:2206.06741 [pdf, other]
Title: Recurrent Transformer Variational Autoencoders for Multi-Action Motion Synthesis
Comments: accepted at Transformers for Vision workshop at CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[512]  arXiv:2206.06743 [pdf, other]
Title: Weakly-Supervised Crack Detection
Comments: Submitted to IEEE Transactions on Intelligent Transportation Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[513]  arXiv:2206.06761 [pdf, other]
Title: Exploring Adversarial Attacks and Defenses in Vision Transformers trained with DINO
Comments: ICML 2022 Workshop paper accepted at AdvML Frontiers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[514]  arXiv:2206.06801 [pdf, other]
Title: Peripheral Vision Transformer
Comments: Accepted to NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[515]  arXiv:2206.06803 [pdf, other]
Title: Asymmetric Dual-Decoder U-Net for Joint Rain and Haze Removal
Comments: 12 pages, 35 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[516]  arXiv:2206.06829 [pdf, other]
Title: Efficient Decoder-free Object Detection with Transformers
Comments: Update metadata, 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[517]  arXiv:2206.06922 [pdf, other]
Title: Object Scene Representation Transformer
Comments: Accepted at NeurIPS '22. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[518]  arXiv:2206.06923 [pdf, ps, other]
Title: A Multi-task Framework for Infrared Small Target Detection and Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[519]  arXiv:2206.06930 [pdf, other]
Title: Comprehending and Ordering Semantics for Image Captioning
Comments: CVPR 2022; Code is publicly available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[520]  arXiv:2206.06931 [pdf, other]
Title: Stand-Alone Inter-Frame Attention in Video Models
Comments: CVPR 2022; Code is publicly available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[521]  arXiv:2206.06948 [pdf, other]
Title: Monitoring Urban Forests from Auto-Generated Segmentation Maps
Comments: accepted for presentation and publication at IGARSS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[522]  arXiv:2206.06959 [pdf, other]
Title: AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled Data
Comments: CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[523]  arXiv:2206.07011 [pdf, other]
Title: Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention
Comments: 11 pages, 5 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[524]  arXiv:2206.07018 [pdf, other]
Title: Turning a Curse into a Blessing: Enabling In-Distribution-Data-Free Backdoor Removal via Stabilized Model Inversion
Comments: Because of an equation and author informational error, this paper has been withdrawn by the submitter
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[525]  arXiv:2206.07028 [pdf, other]
Title: Learning 3D Object Shape and Layout without 3D Supervision
Comments: CVPR 2022, project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[526]  arXiv:2206.07036 [pdf, other]
Title: Accurate 3D Body Shape Regression using Metric and Semantic Attributes
Comments: First two authors contributed equally
Journal-ref: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[527]  arXiv:2206.07038 [pdf, other]
Title: AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos
Comments: NeurIPS 2022. Codes and models are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[528]  arXiv:2206.07045 [pdf, other]
Title: ReCo: Retrieve and Co-segment for Zero-shot Transfer
Comments: Tech report. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[529]  arXiv:2206.07047 [pdf, other]
Title: RGB-Multispectral Matching: Dataset, Learning Methodology, Evaluation
Comments: CVPR 2022, New Orleans. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[530]  arXiv:2206.07117 [pdf, other]
Title: TriHorn-Net: A Model for Accurate Depth-Based 3D Hand Pose Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[531]  arXiv:2206.07125 [pdf, other]
Title: Self-Supervised Pretraining for Differentially Private Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[532]  arXiv:2206.07160 [pdf, other]
Title: LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[533]  arXiv:2206.07162 [pdf, other]
Title: Category-Agnostic 6D Pose Estimation with Conditional Neural Processes
Comments: Accepted at CVPR2022 workshop: Women in Computer Vision (WiCV)
Journal-ref: CVPR2022 workshop: Women in Computer Vision (WiCV)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[534]  arXiv:2206.07163 [pdf, other]
Title: DeepRecon: Joint 2D Cardiac Segmentation and 3D Volume Reconstruction via A Structure-Specific Generative Method
Comments: MICCAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[535]  arXiv:2206.07171 [pdf, other]
Title: Segmentation in large-scale cellular electron microscopy with deep learning: A literature survey
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[536]  arXiv:2206.07198 [pdf, other]
Title: Surgical Phase Recognition in Laparoscopic Cholecystectomy
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[537]  arXiv:2206.07207 [pdf, other]
Title: Beyond Grounding: Extracting Fine-Grained Event Hierarchies Across Modalities
Comments: AAAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[538]  arXiv:2206.07240 [pdf, other]
Title: Test-Time Adaptation for Visual Document Understanding
Comments: Accepted at TMLR 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[539]  arXiv:2206.07255 [pdf, other]
Title: GRAM-HD: 3D-Consistent Image Generation at High Resolution with Generative Radiance Manifolds
Comments: ICCV2023 camera ready version (more results and method comparisons). Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[540]  arXiv:2206.07259 [pdf, other]
Title: Self-Supervised Learning of Image Scale and Orientation
Comments: Presented in BMVC 2021, code is available on this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[541]  arXiv:2206.07267 [pdf, other]
Title: Rethinking Generalization in Few-Shot Classification
Comments: Accepted at NeurIPS 2022. Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[542]  arXiv:2206.07272 [pdf, ps, other]
Title: Machine vision for vial positioning detection toward the safe automation of material synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[543]  arXiv:2206.07282 [pdf, other]
Title: Human Eyes Inspired Recurrent Neural Networks are More Robust Against Adversarial Noises
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[544]  arXiv:2206.07298 [pdf, other]
Title: S$^2$-FPN: Scale-ware Strip Attention Guided Feature Pyramid Network for Real-time Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[545]  arXiv:2206.07307 [pdf, other]
Title: VCT: A Video Compression Transformer
Comments: NeurIPS'22 Camera Ready Version. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[546]  arXiv:2206.07326 [pdf, other]
Title: Recent Advances in Scene Image Representation and Classification
Comments: This paper is under review in Multimedia Tools and Applications (Springer) journal. This article may be deleted or updated based on the policies of the journal
Journal-ref: Multimedia Tools and Applications, 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[547]  arXiv:2206.07344 [pdf, other]
Title: Automatic Detection of Rice Disease in Images of Various Leaf Sizes
Comments: 28 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[548]  arXiv:2206.07348 [pdf, ps, other]
Title: Unsupervised multi-branch Capsule for Hyperspectral and LiDAR classification
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[549]  arXiv:2206.07349 [pdf, other]
Title: XMorpher: Full Transformer for Deformable Medical Image Registration via Cross Attention
Comments: accepted by MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[550]  arXiv:2206.07352 [pdf, ps, other]
Title: Robust SAR ATR on MSTAR with Deep Learning Models trained on Full Synthetic MOCEM data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Image and Video Processing (eess.IV)
[551]  arXiv:2206.07372 [pdf, other]
Title: MonoGround: Detecting Monocular 3D Objects from the Ground
Authors: Zequn Qin, Xi Li
Comments: CVPR22
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[552]  arXiv:2206.07389 [pdf, other]
Title: Ultra Fast Deep Lane Detection with Hybrid Anchor Driven Ordinal Classification
Comments: TPAMI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[553]  arXiv:2206.07394 [pdf, other]
Title: Efficient Adaptive Ensembling for Image Classification
Journal-ref: Expert Systems (2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[554]  arXiv:2206.07423 [pdf, other]
Title: Zero-shot object goal visual navigation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[555]  arXiv:2206.07431 [pdf, other]
Title: Physically-admissible polarimetric data augmentation for road-scene analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[556]  arXiv:2206.07434 [pdf, other]
Title: Self-Supervised Implicit Attention: Guided Attention by The Model Itself
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[557]  arXiv:2206.07435 [pdf, other]
Title: Forecasting of depth and ego-motion with transformers and self-supervision
Comments: Accepted in ICPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[558]  arXiv:2206.07458 [pdf, other]
Title: VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection
Comments: Accepted by ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[559]  arXiv:2206.07459 [pdf, other]
Title: READ: Aggregating Reconstruction Error into Out-of-distribution Detection
Comments: Accepted to AAAI 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[560]  arXiv:2206.07460 [pdf, other]
Title: Coarse-to-fine Deep Video Coding with Hyperprior-guided Mode Prediction
Comments: CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[561]  arXiv:2206.07468 [pdf, ps, other]
Title: PolyU-BPCoMa: A Dataset and Benchmark Towards Mobile Colorized Mapping Using a Backpack Multisensorial System
Comments: 11 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[562]  arXiv:2206.07510 [pdf, other]
Title: Deep Multi-Task Networks For Occluded Pedestrian Pose Estimation
Comments: 4 pages, 5 tables, 2 figures
Journal-ref: Proceedings of the 2022 Irish Machine Vision and Image Processing Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[563]  arXiv:2206.07557 [pdf, other]
Title: How to Reduce Change Detection to Semantic Segmentation
Comments: Accepted by Pattern Recognition. Code is at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[564]  arXiv:2206.07565 [pdf, other]
Title: A Meta-Analysis of Distributionally-Robust Models
Comments: To be presented at ICML Workshop on Principles of Distribution Shift 2022. Copyright 2022 by the author(s)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[565]  arXiv:2206.07578 [src]
Title: E2V-SDE: From Asynchronous Events to Fast and Continuous Video Reconstruction via Neural Stochastic Differential Equations
Comments: arXiv admin note: This submission has been withdrawn by arXiv administrators due to inappropriate text overlap with external sources. Additional information at this https URL
Journal-ref: The IEEE / CVF Computer Vision and Pattern Recognition Conference 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[566]  arXiv:2206.07580 [pdf, other]
Title: Evaluating object detector ensembles for improving the robustness of artifact detection in endoscopic video streams
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[567]  arXiv:2206.07634 [pdf, other]
Title: Real3D-Aug: Point Cloud Augmentation by Placing Real Objects with Occlusion Handling for 3D Detection and Segmentation
Comments: Submitted on 15th June 2022 to IEEE RA-L journal
Journal-ref: Computer Vision Winter Workshop 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[568]  arXiv:2206.07643 [pdf, other]
Title: Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Comments: NeurIPS 2022. Project Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[569]  arXiv:2206.07662 [pdf, other]
Title: SP-ViT: Learning 2D Spatial Priors for Vision Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[570]  arXiv:2206.07669 [pdf, other]
Title: A Unified Sequence Interface for Vision Tasks
Comments: The first three authors contributed equally
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[571]  arXiv:2206.07684 [pdf, other]
Title: AVATAR: Unconstrained Audiovisual Speech Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[572]  arXiv:2206.07687 [pdf, other]
Title: Structured Sparsity Learning for Efficient Video Super-Resolution
Comments: Accepted by CVPR2023, code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[573]  arXiv:2206.07689 [pdf, other]
Title: Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022
Comments: Ego4D CVPR22 Object State Localization challenge. arXiv admin note: substantial text overlap with arXiv:2206.06346
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[574]  arXiv:2206.07690 [pdf, other]
Title: ELUDE: Generating interpretable explanations via a decomposition into labelled and unlabelled features
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[575]  arXiv:2206.07692 [pdf, other]
Title: A Simple Data Mixing Prior for Improving Self-Supervised Learning
Comments: CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[576]  arXiv:2206.07695 [pdf, other]
Title: VoxGRAF: Fast 3D-Aware Image Synthesis with Sparse Voxel Grids
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[577]  arXiv:2206.07696 [pdf, other]
Title: Diffusion Models for Video Prediction and Infilling
Comments: Published in TMLR (11/2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[578]  arXiv:2206.07698 [pdf, other]
Title: Neural Deformable Voxel Grid for Fast Optimization of Dynamic View Synthesis
Comments: Technical Report: 29 pages; project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[579]  arXiv:2206.07699 [pdf, other]
Title: Write and Paint: Generative Vision-Language Models are Unified Modal Learners
Comments: ICLR 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[580]  arXiv:2206.07700 [pdf, other]
Title: Masked Siamese ConvNets
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[581]  arXiv:2206.07704 [pdf, other]
Title: Waymo Open Dataset: Panoramic Video Panoptic Segmentation
Comments: Our dataset can be found at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[582]  arXiv:2206.07705 [pdf, other]
Title: LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D Detection
Comments: Find the primary metrics for the 2022 Waymo Open Dataset 3D Camera-Only Detection Challenge at this https URL . Find the code at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[583]  arXiv:2206.07706 [pdf, other]
Title: Masked Frequency Modeling for Self-Supervised Visual Pre-Training
Comments: ICLR 2023. Project page: this https URL Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[584]  arXiv:2206.07707 [pdf, other]
Title: Variable Bitrate Neural Fields
Comments: SIGGRAPH 2022. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[585]  arXiv:2206.07710 [pdf, other]
Title: PlanarRecon: Real-time 3D Plane Detection and Reconstruction from Posed Monocular Videos
Comments: CVPR 2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[586]  arXiv:2206.07764 [pdf, other]
Title: SAVi++: Towards End-to-End Object-Centric Learning from Real-World Videos
Comments: Project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[587]  arXiv:2206.07771 [pdf, other]
Title: Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
Comments: ICLR 2023. Project at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[588]  arXiv:2206.07802 [pdf, other]
Title: Improving generalization by mimicking the human visual diet
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[589]  arXiv:2206.07835 [pdf, other]
Title: Disentangling visual and written concepts in CLIP
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[590]  arXiv:2206.07846 [pdf, ps, other]
Title: Action Spotting using Dense Detection Anchors Revisited: Submission to the SoccerNet Challenge 2022
Comments: v2: a few more experiments, more detailed method description
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[591]  arXiv:2206.07850 [pdf, other]
Title: HF-NeuS: Improved Surface Reconstruction Using High-Frequency Details
Comments: To appear in NeurIPS 2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[592]  arXiv:2206.07893 [pdf, other]
Title: PeQuENet: Perceptual Quality Enhancement of Compressed Video with Adaptation- and Attention-based Network
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[593]  arXiv:2206.07897 [pdf, other]
Title: NCAGC: A Neighborhood Contrast Framework for Attributed Graph Clustering
Journal-ref: Neurocomputing, 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[594]  arXiv:2206.07932 [pdf, other]
Title: Lifelong Wandering: A realistic few-shot online continual learning setting
Comments: CVPR 2022 Workshop on Continual Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[595]  arXiv:2206.07934 [pdf, other]
Title: BANet: Motion Forecasting with Boundary Aware Network
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[596]  arXiv:2206.07953 [pdf, other]
Title: Analysis and Extensions of Adversarial Training for Video Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[597]  arXiv:2206.07959 [pdf, other]
Title: Simple-BEV: What Really Matters for Multi-Sensor BEV Perception?
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[598]  arXiv:2206.07967 [pdf, other]
Title: DreamNet: A Deep Riemannian Network based on SPD Manifold Learning for Visual Classification
Comments: 9 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[599]  arXiv:2206.07981 [pdf, other]
Title: Multi-scale Cooperative Multimodal Transformers for Multimodal Sentiment Analysis in Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[600]  arXiv:2206.07986 [pdf, other]
Title: Image Captioning based on Feature Refinement and Reflective Decoding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[601]  arXiv:2206.07990 [pdf, other]
Title: Patch-level Representation Learning for Self-supervised Vision Transformers
Comments: Accepted to CVPR 2022 (Oral). Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[602]  arXiv:2206.07994 [pdf, other]
Title: Joint Class-Affinity Loss Correction for Robust Medical Image Segmentation with Noisy Labels
Comments: Accepted to MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[603]  arXiv:2206.08009 [pdf, other]
Title: Balancing Discriminability and Transferability for Source-Free Domain Adaptation
Comments: ICML 2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[604]  arXiv:2206.08016 [pdf, other]
Title: Backbones-Review: Feature Extraction Networks for Deep Learning and Deep Reinforcement Learning Approaches
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[605]  arXiv:2206.08026 [pdf, other]
Title: DeepFormableTag: End-to-end Generation and Recognition of Deformable Fiducial Markers
Journal-ref: ACM Transactions on Graphics 40, 4, Article 67 (August 2021)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[606]  arXiv:2206.08083 [pdf, other]
Title: CARLANE: A Lane Detection Benchmark for Unsupervised Domain Adaptation from Simulation to multiple Real-World Domains
Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks, 22 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[607]  arXiv:2206.08084 [pdf, other]
Title: An Improved Normed-Deformable Convolution for Crowd Counting
Journal-ref: IEEE Signal Processing Letters 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[608]  arXiv:2206.08105 [pdf, other]
Title: A Simple Baseline for Adversarial Domain Adaptation-based Unsupervised Flood Forecasting
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[609]  arXiv:2206.08126 [pdf, other]
Title: Channel Importance Matters in Few-Shot Image Classification
Comments: Accepted to ICML 2022; code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[610]  arXiv:2206.08129 [pdf, other]
Title: Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline
Comments: Accepted at NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[611]  arXiv:2206.08150 [pdf, other]
Title: Self-Adaptive Label Augmentation for Semi-supervised Few-shot Classification
Comments: 9 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[612]  arXiv:2206.08155 [pdf, other]
Title: Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Comments: NeurIPS 2022 Camera-Ready; Project Webpage: this https URL; 25 pages; 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[613]  arXiv:2206.08158 [pdf, other]
Title: Volumetric Supervised Contrastive Learning for Seismic Semantic Segmentation
Journal-ref: The International Meeting for Applied Geoscience & Energy (IMAGE) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[614]  arXiv:2206.08171 [pdf, other]
Title: K-Radar: 4D Radar Object Detection for Autonomous Driving in Various Weather Conditions
Comments: Accepted at NeurIPS 2022 Datasets and Benchmarks Track
Journal-ref: Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS Datasets and Benchmarks 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[615]  arXiv:2206.08172 [pdf, other]
Title: RefCrowd: Grounding the Target in Crowd with Referring Expressions
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[616]  arXiv:2206.08176 [pdf, other]
Title: Level 2 Autonomous Driving on a Single Device: Diving into the Devils of Openpilot
Comments: Tech report. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[617]  arXiv:2206.08182 [pdf, other]
Title: Nucleus Segmentation and Analysis in Breast Cancer with the MIScnn Framework
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[618]  arXiv:2206.08186 [pdf, other]
Title: Asymptotic Soft Cluster Pruning for Deep Neural Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[619]  arXiv:2206.08194 [pdf, other]
Title: Online Segmentation of LiDAR Sequences: Dataset and Algorithm
Comments: Code and data are available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[620]  arXiv:2206.08206 [pdf, other]
Title: Selective Multi-Scale Learning for Object Detection
Comments: Accepted by ICANN2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[621]  arXiv:2206.08219 [pdf, other]
Title: HaGRID - HAnd Gesture Recognition Image Dataset
Comments: 12 pages, 5 figures, open-source dataset for computer vision
Journal-ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024) 4572-4581
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[622]  arXiv:2206.08222 [pdf, other]
Title: Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking Consistency
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[623]  arXiv:2206.08224 [pdf, other]
Title: Multi scale Feature Extraction and Fusion for Online Knowledge Distillation
Comments: 12 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[624]  arXiv:2206.08227 [pdf, other]
Title: Delving into the Scale Variance Problem in Object Detection
Comments: Accepted by ICTAI2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[625]  arXiv:2206.08229 [pdf, other]
Title: Open-Set Recognition with Gradient-Based Representations
Comments: Published at IEEE International Conference on Image Processing (ICIP) 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[626]  arXiv:2206.08236 [pdf, other]
Title: Simple and Efficient Architectures for Semantic Segmentation
Comments: To be presented at Efficient Deep Learning for Computer Vision Workshop at CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[627]  arXiv:2206.08275 [pdf, other]
Title: Rank the triplets: A ranking-based multiple instance learning framework for detecting HPV infection in head and neck cancers using routine H&E images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[628]  arXiv:2206.08304 [pdf, other]
Title: Adversarial Patch Attacks and Defences in Vision-Based Tasks: A Survey
Comments: A. Sharma and Y. Bian share equal contribution
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[629]  arXiv:2206.08339 [pdf, other]
Title: iBoot: Image-bootstrapped Self-Supervised Video Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[630]  arXiv:2206.08343 [pdf, other]
Title: Realistic One-shot Mesh-based Head Avatars
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[631]  arXiv:2206.08345 [pdf, ps, other]
Title: Real-World Single Image Super-Resolution Under Rainy Condition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[632]  arXiv:2206.08347 [pdf, other]
Title: Beyond Supervised vs. Unsupervised: Representative Benchmarking and Analysis of Image Representation Learning
Comments: CVPR 2022, project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[633]  arXiv:2206.08355 [pdf, other]
Title: FWD: Real-time Novel View Synthesis with Forward Warping and Depth
Comments: CVPR 2022. Project website this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[634]  arXiv:2206.08356 [pdf, other]
Title: OmniMAE: Single Model Masked Pretraining on Images and Videos
Comments: CVPR 2023. Code/models: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[635]  arXiv:2206.08357 [pdf, other]
Title: Spatially-Adaptive Multilayer Selection for GAN Inversion and Editing
Comments: CVPR 2022. Github: this https URL Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[636]  arXiv:2206.08358 [pdf, other]
Title: MixGen: A New Multi-Modal Data Augmentation
Comments: First three authors contributed equally. Code are available at this https URL Oral presentation at WACV 2023 Pretraining Large Vision and Multimodal Models Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[637]  arXiv:2206.08361 [pdf, other]
Title: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[638]  arXiv:2206.08362 [pdf, other]
Title: Unified Fourier-based Kernel and Nonlinearity Design for Equivariant Networks on Homogeneous Spaces
Comments: Accepted at ICML2022 Thirty-ninth International Conference on Machine Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[639]  arXiv:2206.08365 [pdf, other]
Title: Virtual Correspondence: Humans as a Cue for Extreme-View Geometry
Comments: CVPR 2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[640]  arXiv:2206.08367 [pdf, other]
Title: SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain Adaptation
Comments: Published at IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[641]  arXiv:2206.08368 [pdf, other]
Title: Unbiased 4D: Monocular 4D Reconstruction with a Neural Deformation Model
Comments: 26 pages, 17 figures, 8 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[642]  arXiv:2206.08405 [pdf, ps, other]
Title: Going Deeper than Tracking: a Survey of Computer-Vision Based Recognition of Animal Pain and Affective States
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[643]  arXiv:2206.08423 [pdf, other]
Title: IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering in Indoor Scenes
Comments: CVPR 22 camera ready version with supplementary
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[644]  arXiv:2206.08427 [pdf, other]
Title: SATBench: Benchmarking the speed-accuracy tradeoff in object recognition by humans and dynamic neural networks
Comments: 19 pages, 12 figures. Under Review at NeurIPS Datasets and Benchmarks Track 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[645]  arXiv:2206.08428 [pdf, other]
Title: EyeNeRF: A Hybrid Representation for Photorealistic Synthesis, Animation and Relighting of Human Eyes
Authors: Gengyan Li (1 and 2), Abhimitra Meka (1), Franziska Müller (1), Marcel C. Bühler (2), Otmar Hilliges (2), Thabo Beeler (1) ((1) Google Inc., (2) ETH Zürich)
Comments: 16 pages, 16 figures, 1 table, to be published in ACM Transactions on Graphics (TOG) (Volume: 41, Issue: 4), 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[646]  arXiv:2206.08429 [pdf, other]
Title: Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[647]  arXiv:2206.08460 [pdf, other]
Title: TUSK: Task-Agnostic Unsupervised Keypoints
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[648]  arXiv:2206.08462 [pdf, other]
Title: Recursive Neural Programs: Variational Learning of Image Grammars and Part-Whole Hierarchies
Comments: 9 pages, 6 figures. fixed LaTeX typo for algorithm reference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[649]  arXiv:2206.08477 [pdf, other]
Title: Backdoor Attacks on Vision Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[650]  arXiv:2206.08488 [pdf, other]
Title: Controllable Image Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[651]  arXiv:2206.08500 [pdf, other]
Title: What do navigation agents learn about their environment?
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[652]  arXiv:2206.08509 [pdf, other]
Title: Neural Architecture Adaptation for Object Detection by Searching Channel Dimensions and Mapping Pre-trained Parameters
Comments: Accepted to ICPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[653]  arXiv:2206.08524 [pdf, other]
Title: CDNet: Contrastive Disentangled Network for Fine-Grained Image Categorization of Ocular B-Scan Ultrasound
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[654]  arXiv:2206.08537 [pdf, ps, other]
Title: Large-Margin Representation Learning for Texture Classification
Comments: 7 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[655]  arXiv:2206.08547 [pdf, other]
Title: Texture Generation Using A Graph Generative Adversarial Network And Differentiable Rendering
Comments: The final publication is available at Springer via this http URL
Journal-ref: Springer.13836.(2023)388-401
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[656]  arXiv:2206.08549 [pdf, other]
Title: Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[657]  arXiv:2206.08566 [pdf, other]
Title: Active Data Discovery: Mining Unknown Data using Submodular Information Measures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[658]  arXiv:2206.08567 [pdf, other]
Title: Rectify ViT Shortcut Learning by Visual Saliency
Comments: NeurIPS2022 Under Review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[659]  arXiv:2206.08568 [pdf, other]
Title: Multi-Contextual Predictions with Vision Transformer for Video Anomaly Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[660]  arXiv:2206.08572 [pdf, other]
Title: Enhanced Bi-directional Motion Estimation for Video Frame Interpolation
Comments: Accepted by WACV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[661]  arXiv:2206.08585 [pdf, other]
Title: HairFIT: Pose-Invariant Hairstyle Transfer via Flow-based Hair Alignment and Semantic-Region-Aware Inpainting
Comments: BMVC 2021 Oral Presentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[662]  arXiv:2206.08605 [pdf, ps, other]
Title: On Efficient Real-Time Semantic Segmentation: A Survey
Comments: 19 pages, 13 figures, 4 tables This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[663]  arXiv:2206.08610 [pdf, other]
Title: Masked Autoencoders for Generic Event Boundary Detection CVPR'2022 Kinetics-GEBD Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[664]  arXiv:2206.08614 [pdf, other]
Title: Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic Assessment
Comments: Accepted to NeurIPS Track on Datasets and Benchmarks 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[665]  arXiv:2206.08632 [pdf, other]
Title: Learning Using Privileged Information for Zero-Shot Action Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[666]  arXiv:2206.08638 [pdf, ps, other]
Title: Minimum Noticeable Difference based Adversarial Privacy Preserving Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[667]  arXiv:2206.08640 [pdf, other]
Title: Uncertainty-aware Evaluation of Time-Series Classification for Online Handwriting Recognition with Domain Shift
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[668]  arXiv:2206.08641 [pdf, other]
Title: Diverse Multiple Trajectory Prediction Using a Two-stage Prediction Network Trained with Lane Loss
Comments: RA-L accepted
Journal-ref: IEEE Robotics and Automation Letters (2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[669]  arXiv:2206.08645 [pdf, other]
Title: Local Slot Attention for Vision-and-Language Navigation
Comments: ICMR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[670]  arXiv:2206.08655 [pdf, other]
Title: Learning Implicit Feature Alignment Function for Semantic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[671]  arXiv:2206.08657 [pdf, other]
Title: BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning
Comments: Accepted by AAAI 2023, Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[672]  arXiv:2206.08683 [pdf, other]
Title: AggNet: Learning to Aggregate Faces for Group Membership Verification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[673]  arXiv:2206.08701 [pdf, ps, other]
Title: Towards Real-Time Visual Tracking with Graded Color-names Features
Comments: 12 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[674]  arXiv:2206.08712 [pdf, other]
Title: An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping Functions
Comments: Accepted to RAL2022, code at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[675]  arXiv:2206.08748 [pdf, ps, other]
Title: ReViSe: Remote Vital Signs Measurement Using Smartphone Camera
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[676]  arXiv:2206.08749 [pdf, other]
Title: From a few Accurate 2D Correspondences to 3D Point Clouds
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[677]  arXiv:2206.08751 [pdf, other]
Title: Perceptual Quality Assessment of Virtual Reality Videos in the Wild
Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[678]  arXiv:2206.08778 [pdf, other]
Title: CTooth: A Fully Annotated 3D Dataset and Benchmark for Tooth Volume Segmentation on Cone Beam Computed Tomography Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[679]  arXiv:2206.08789 [pdf, ps, other]
Title: Reconstructing vehicles from orthographic drawings using deep neural networks
Authors: Robin Klippert
Comments: 9 Pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[680]  arXiv:2206.08791 [pdf, other]
Title: DU-Net based Unsupervised Contrastive Learning for Cancer Segmentation in Histology Images
Comments: arXiv admin note: text overlap with arXiv:2002.05709 by other authors
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[681]  arXiv:2206.08792 [pdf, other]
Title: FD-CAM: Improving Faithfulness and Discriminability of Visual Explanation for CNNs
Comments: Accepted by ICPR 2022 and also accepted by CVPR 2022 Explainable Artificial Intelligence for Computer Vision (XAI4CV) Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[682]  arXiv:2206.08794 [pdf, other]
Title: The Importance of Background Information for Out of Distribution Generalization
Comments: 6 pages, 2 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[683]  arXiv:2206.08801 [pdf, other]
Title: Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training
Comments: Accepted in CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[684]  arXiv:2206.08833 [pdf, ps, other]
Title: A Comparative Study of Confidence Calibration in Deep Learning: From Computer Vision to Medical Imaging
Comments: 17 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[685]  arXiv:2206.08861 [pdf, other]
Title: DGMIL: Distribution Guided Multiple Instance Learning for Whole Slide Image Classification
Comments: accepted by MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[686]  arXiv:2206.08880 [pdf, other]
Title: Improving Generalization of Metric Learning via Listwise Self-distillation
Comments: 11 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[687]  arXiv:2206.08883 [pdf, other]
Title: CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
Comments: ICML 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[688]  arXiv:2206.08898 [pdf, other]
Title: SimA: Simple Softmax-free Attention for Vision Transformers
Comments: Code is available here: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[689]  arXiv:2206.08903 [pdf, other]
Title: Colonoscopy 3D Video Dataset with Paired Depth from 2D-3D Registration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[690]  arXiv:2206.08916 [pdf, other]
Title: Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[691]  arXiv:2206.08919 [pdf, other]
Title: VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[692]  arXiv:2206.08920 [pdf, other]
Title: VectorMapNet: End-to-end Vectorized HD Map Learning
Comments: Accepted by ICML 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[693]  arXiv:2206.08927 [pdf, other]
Title: Cross-task Attention Mechanism for Dense Multi-task Learning
Comments: 10 figures, 6 tables, 23 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[694]  arXiv:2206.08929 [pdf, other]
Title: TAVA: Template-free Animatable Volumetric Actors
Comments: Code: this https URL; Project Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[695]  arXiv:2206.08948 [pdf, other]
Title: CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Comments: CVPR 2022 Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[696]  arXiv:2206.08954 [pdf, other]
Title: Bag of Image Patch Embedding Behind the Success of Self-Supervised Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[697]  arXiv:2206.08970 [pdf, other]
Title: MultiEarth 2022 -- The Champion Solution for the Matrix Completion Challenge via Multimodal Regression and Generation
Comments: CVPR 2022, MultiEarth 2022, Matrix Completion Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[698]  arXiv:2206.08977 [pdf, ps, other]
Title: BN-HTRd: A Benchmark Dataset for Document Level Offline Bangla Handwritten Text Recognition (HTR) and Line Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[699]  arXiv:2206.08990 [pdf, other]
Title: Shadows Shed Light on 3D Objects
Comments: 19 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[700]  arXiv:2206.09027 [pdf, other]
Title: Landscape Learning for Neural Network Inversion
Comments: 15 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[701]  arXiv:2206.09038 [pdf, other]
Title: Validation of Vector Data using Oblique Images
Comments: In Proceedings of 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM GIS'08)
Journal-ref: Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM GIS '08), pp. 1-10. 2008
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[702]  arXiv:2206.09055 [src]
Title: Augmented Imagefication: A Data-driven Fault Detection Method for Aircraft Air Data Sensors
Comments: a crucial design defect to acquire flying data by simulation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[703]  arXiv:2206.09061 [pdf, other]
Title: Design of Supervision-Scalable Learning Systems: Methodology and Performance Benchmarking
Comments: 16 pages, 12 figures, 4 tables, under consideration at Pattern Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[704]  arXiv:2206.09068 [pdf, other]
Title: Attention-based Dynamic Subspace Learners for Medical Image Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[705]  arXiv:2206.09071 [pdf, other]
Title: Analysis & Computational Complexity Reduction of Monocular and Stereo Depth Estimation Techniques
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[706]  arXiv:2206.09082 [pdf, other]
Title: Context-aware Proposal Network for Temporal Action Detection
Comments: First place winning solution for temporal action detection task in CVPR-2022 AcitivityNet Challenge. arXiv admin note: substantial text overlap with arXiv:2106.11812
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[707]  arXiv:2206.09089 [pdf, ps, other]
Title: A Dynamic Data Driven Approach for Explainable Scene Understanding
Comments: Unpublished draft of book chapter
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[708]  arXiv:2206.09106 [pdf, other]
Title: Embodied Scene-aware Human Pose Estimation
Comments: NeurIPS 2022. Project website: this https URL Zhengyi Luo and Shun Iwase contributed equally
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[709]  arXiv:2206.09111 [pdf, other]
Title: VReBERT: A Simple and Flexible Transformer for Visual Relationship Detection
Comments: Published at International Conference on Pattern Recognition (ICPR) 2022, Montreal Quebec
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[710]  arXiv:2206.09114 [pdf, other]
Title: Bear the Query in Mind: Visual Grounding with Query-conditioned Convolution
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[711]  arXiv:2206.09132 [pdf, other]
Title: Replacing Labeled Real-image Datasets with Auto-generated Contours
Comments: Accepted to CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[712]  arXiv:2206.09148 [pdf, other]
Title: Deep Compatible Learning for Partially-Supervised Medical Image Segmentation
Comments: 16 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[713]  arXiv:2206.09178 [pdf, other]
Title: REVECA -- Rich Encoder-decoder framework for Video Event CAptioner
Comments: The IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR). LOng-form VidEo Understanding (LOVEU) workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[714]  arXiv:2206.09191 [pdf, other]
Title: Gender Artifacts in Visual Datasets
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[715]  arXiv:2206.09202 [pdf, other]
Title: Camera Adaptation for Fundus-Image-Based CVD Risk Estimation
Comments: This preprint has not undergone peer review (when applicable) or any post-submission improvements or corrections. The Version of Record of this contribution will be added soon
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[716]  arXiv:2206.09221 [pdf, ps, other]
Title: 3D Face Parsing via Surface Parameterization and 2D Semantic Segmentation Network
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[717]  arXiv:2206.09242 [pdf, other]
Title: GaLeNet: Multimodal Learning for Disaster Prediction, Management and Relief
Comments: Accepted to CVPR 2022 Workshop on Multimodal Learning for Earth and Environment
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[718]  arXiv:2206.09243 [pdf, other]
Title: Structured Light with Redundancy Codes
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[719]  arXiv:2206.09244 [pdf, other]
Title: GAN2X: Non-Lambertian Inverse Rendering of Image GANs
Comments: Accepted to 3DV 2022. The video demo is available at the project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[720]  arXiv:2206.09256 [pdf, other]
Title: Multistream Gaze Estimation with Anatomical Eye Region Isolation by Synthetic to Real Transfer Learning
Comments: 15 pages, 7 figures, 14 tables. This work has been accepted to the IEEE Transactions on Artificial Intelligence $\copyright$ 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses
Journal-ref: IEEE Transactions on Artificial Intelligence, 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[721]  arXiv:2206.09265 [pdf, ps, other]
Title: SAViR-T: Spatially Attentive Visual Reasoning with Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[722]  arXiv:2206.09293 [pdf, other]
Title: Rethinking Bayesian Deep Learning Methods for Semi-Supervised Volumetric Medical Image Segmentation
Comments: To appear at CVPR 2022, and the supplementary material can be found at the official site. The source codes are at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[723]  arXiv:2206.09325 [pdf, other]
Title: EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm
Subjects: Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET)
[724]  arXiv:2206.09358 [pdf, other]
Title: What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[725]  arXiv:2206.09362 [src]
Title: Towards Generalizable Person Re-identification with a Bi-stream Generative Model
Comments: There is a mistake of equation 1
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[726]  arXiv:2206.09365 [pdf, other]
Title: Semi-supervised Change Detection of Small Water Bodies Using RGB and Multispectral Images in Peruvian Rainforests
Comments: 8 pages, 5 figures. Accepted to Proceedings of IEEE WHISPERS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[727]  arXiv:2206.09372 [pdf, other]
Title: mvHOTA: A multi-view higher order tracking accuracy metric to measure spatial and temporal associations in multi-point detection
Comments: 16 pages, 9 figures
Journal-ref: Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization (2022) 1-9
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[728]  arXiv:2206.09410 [pdf, other]
Title: Low-Mid Adversarial Perturbation against Unauthorized Face Recognition System
Comments: published in Information Sciences
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[729]  arXiv:2206.09414 [pdf, other]
Title: Terrain Classification using Transfer Learning on Hyperspectral Images: A Comparative study
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[730]  arXiv:2206.09420 [pdf, other]
Title: Agricultural Plantation Classification using Transfer Learning Approach based on CNN
Authors: Uphar Singh, Tushar Musale, Ranjana Vyas, O.P.Vyas (Indian Institute of Information Technology, Allahabad, India)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[731]  arXiv:2206.09474 [pdf, other]
Title: 3D Object Detection for Autonomous Driving: A Comprehensive Survey
Comments: Accepted to International Journal of Computer Vision (IJCV). Project page is at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[732]  arXiv:2206.09479 [pdf, other]
Title: StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis
Comments: 32 pages, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, 2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[733]  arXiv:2206.09485 [pdf, other]
Title: Video frame interpolation for high dynamic range sequences captured with dual-exposure sensors
Comments: 13 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[734]  arXiv:2206.09500 [pdf, other]
Title: Unbiased Teacher v2: Semi-supervised Object Detection for Anchor-free and Anchor-based Detectors
Comments: Project Page is at this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[735]  arXiv:2206.09504 [pdf, other]
Title: A Parallel Implementation of Computing Mean Average Precision
Authors: Beinan Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[736]  arXiv:2206.09509 [pdf, ps, other]
Title: Hybrid Facial Expression Recognition (FER2013) Model for Real-Time Emotion Classification and Prediction
Comments: 8 Pages, 8 Figures, 5 Tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Robotics (cs.RO)
[737]  arXiv:2206.09541 [pdf, other]
Title: DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[738]  arXiv:2206.09548 [pdf, other]
Title: Variational Distillation for Multi-View Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[739]  arXiv:2206.09552 [pdf, other]
Title: Dynamic Message Propagation Network for RGB-D Salient Object Detection
Comments: 12 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[740]  arXiv:2206.09553 [pdf, other]
Title: Capturing and Inferring Dense Full-Body Human-Scene Contact
Comments: CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[741]  arXiv:2206.09554 [pdf, other]
Title: Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic Segmentation
Comments: TMM2022, 11 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[742]  arXiv:2206.09564 [pdf, other]
Title: A Novel Long-term Iterative Mining Scheme for Video Salient Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[743]  arXiv:2206.09575 [pdf, other]
Title: C-SENN: Contrastive Self-Explaining Neural Network
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[744]  arXiv:2206.09581 [pdf, ps, other]
Title: Explicit and implicit models in infrared and visible image fusion
Authors: Zixuan Wang, Bin Sun
Comments: 8 pages, 5 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[745]  arXiv:2206.09585 [pdf, other]
Title: 5th Place Solution for YouTube-VOS Challenge 2022: Video Object Segmentation
Comments: 5th Place Solution for Video Object Segmentation in the 4th Large-scale Video Object Segmentation Challenge, CVPR 2022 Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[746]  arXiv:2206.09592 [pdf, other]
Title: DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection
Comments: v3(same as v2) version, update structure (add foreground generation, stable diffusion), add more experiments
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[747]  arXiv:2206.09596 [pdf, other]
Title: Efficient and Flexible Sublabel-Accurate Energy Minimization
Comments: To be published at ICPR 2022, Copyright 2022 IEEE
Subjects: Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
[748]  arXiv:2206.09597 [pdf, other]
Title: Winning the CVPR'2022 AQTC Challenge: A Two-stage Function-centric Approach
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[749]  arXiv:2206.09604 [pdf, other]
Title: Distortion-Aware Network Pruning and Feature Reuse for Real-time Video Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[750]  arXiv:2206.09664 [pdf, other]
Title: What Can be Seen is What You Get: Structure Aware Point Cloud Augmentation
Comments: Published in IEEE IV 2022
Journal-ref: 33rd IEEE Intelligent Vehicles Symposium, Aachen, Germany, June 5th - June 9th 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[751]  arXiv:2206.09667 [pdf, other]
Title: MSANet: Multi-Similarity and Attention Guidance for Boosting Few-Shot Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[752]  arXiv:2206.09683 [pdf, other]
Title: Distribution Regularized Self-Supervised Learning for Domain Adaptation of Semantic Segmentation
Comments: Accepted for publication at Image and Vision Computing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[753]  arXiv:2206.09731 [pdf, other]
Title: Semantic Labeling of High Resolution Images Using EfficientUNets and Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[754]  arXiv:2206.09736 [pdf, other]
Title: Geo-NI: Geometry-aware Neural Interpolation for Light Field Rendering
Comments: 13 pages, 8 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[755]  arXiv:2206.09742 [pdf, ps, other]
Title: Developing a Free and Open-source Automated Building Exterior Crack Inspection Software for Construction and Facility Managers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[756]  arXiv:2206.09753 [pdf, other]
Title: Visualizing and Understanding Contrastive Learning
Comments: Accepted to IEEE Transactions on Image Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[757]  arXiv:2206.09756 [pdf, other]
Title: Time Gated Convolutional Neural Networks for Crop Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[758]  arXiv:2206.09769 [pdf, other]
Title: Test-time image-to-image translation ensembling improves out-of-distribution generalization in histopathology
Comments: Accepted at MICCAI2022 Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[759]  arXiv:2206.09770 [pdf, other]
Title: Real-time Full-stack Traffic Scene Perception for Autonomous Driving with Roadside Cameras
Comments: This paper is accepted and presented in ICRA 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[760]  arXiv:2206.09796 [pdf, other]
Title: Knowledge Distillation for Oriented Object Detection on Aerial Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[761]  arXiv:2206.09806 [pdf, other]
Title: Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval
Comments: 10 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[762]  arXiv:2206.09842 [pdf, other]
Title: Practical Deepfake Detection: Vulnerabilities in Global Contexts
Comments: 6 pages, 6 figures, presented as a workshop paper at Responsible AI @ ICLR 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[763]  arXiv:2206.09843 [pdf, other]
Title: Contextual Squeeze-and-Excitation for Efficient Few-Shot Image Classification
Comments: Advances in Neural Information Processing Systems (NeurIPS 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[764]  arXiv:2206.09852 [pdf, other]
Title: M&M Mix: A Multimodal Multiview Transformer Ensemble
Comments: Technical report for Epic-Kitchens challenge 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[765]  arXiv:2206.09853 [pdf, other]
Title: DisCoVQA: Temporal Distortion-Content Transformers for Video Quality Assessment
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[766]  arXiv:2206.09885 [pdf, other]
Title: KOLOMVERSE: KRISO open large-scale image dataset for object detection in the maritime universe
Comments: 13 Pages, 12 figures, submitted to NeurIPS 2022 Datasets and Benchmarks Track (Under Review)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[767]  arXiv:2206.09900 [pdf, other]
Title: Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy Autoencoders
Comments: Accepted by TIV
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[768]  arXiv:2206.09907 [pdf, other]
Title: ORFD: A Dataset and Benchmark for Off-Road Freespace Detection
Comments: Accepted by ICRA2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[769]  arXiv:2206.09959 [pdf, other]
Title: Global Context Vision Transformers
Comments: Accepted to ICML 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[770]  arXiv:2206.10033 [pdf, other]
Title: Test Time Transform Prediction for Open Set Histopathological Image Recognition
Comments: Accepted to MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[771]  arXiv:2206.10041 [pdf, other]
Title: MPA: MultiPath++ Based Architecture for Motion Prediction
Authors: Stepan Konev
Comments: CVPR 2022, Workshop on Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[772]  arXiv:2206.10059 [pdf, other]
Title: Bypass Network for Semantics Driven Image Paragraph Captioning
Comments: Under consideration at Computer Vision and Image Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[773]  arXiv:2206.10066 [pdf, other]
Title: RendNet: Unified 2D/3D Recognizer With Latent Space Rendering
Comments: CVPR 2022 Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[774]  arXiv:2206.10075 [pdf, other]
Title: Counting Varying Density Crowds Through Density Guided Adaptive Selection CNN and Transformer Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[775]  arXiv:2206.10080 [pdf, other]
Title: One-stage Action Detection Transformer
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[776]  arXiv:2206.10082 [pdf, other]
Title: Optimally Controllable Perceptual Lossy Compression
Comments: ICML 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[777]  arXiv:2206.10090 [pdf, other]
Title: KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences
Journal-ref: Transaction on Circuits and Systems for Video Technology,2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[778]  arXiv:2206.10092 [pdf, other]
Title: BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection
Comments: Accepted by AAAI2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[779]  arXiv:2206.10095 [pdf, other]
Title: Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[780]  arXiv:2206.10096 [pdf, ps, other]
Title: Transformers Improve Breast Cancer Diagnosis from Unregistered Multi-View Mammograms
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[781]  arXiv:2206.10098 [pdf, other]
Title: Reconstruct from BEV: A 3D Lane Detection Approach based on Geometry Structure Prior
Comments: Proceedings of the CVPR 2022 Workshop of Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[782]  arXiv:2206.10107 [pdf, other]
Title: Sensitivity of Average Precision to Bounding Box Perturbations
Authors: Ali Borji
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[783]  arXiv:2206.10118 [pdf, other]
Title: HOPE: Hierarchical Spatial-temporal Network for Occupancy Flow Prediction
Comments: 1st Ranking Solution for the Occupancy and Flow Prediction of the Waymo Open Dataset Challenges 2022 (this http URL)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[784]  arXiv:2206.10129 [pdf, other]
Title: Automatic Concept Extraction for Concept Bottleneck-based Video Classification
Comments: 10 pages, Appendix: 2 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[785]  arXiv:2206.10131 [pdf, other]
Title: An Integrated Representation & Compression Scheme Based on Convolutional Autoencoders with 4D DCT Perceptual Encoding for High Dynamic Range Light Fields
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[786]  arXiv:2206.10137 [pdf, other]
Title: Few-Max: Few-Shot Domain Adaptation for Unsupervised Contrastive Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[787]  arXiv:2206.10145 [pdf, other]
Title: Deep Learning Eliminates Massive Dust Storms from Images of Tianwen-1
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[788]  arXiv:2206.10146 [pdf, other]
Title: KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[789]  arXiv:2206.10155 [pdf, other]
Title: Review Neural Networks about Image Transformation Based on IGC Learning Framework with Annotated Information
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[790]  arXiv:2206.10157 [pdf, other]
Title: Probing Visual-Audio Representation for Video Highlight Detection via Hard-Pairs Guided Contrastive Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[791]  arXiv:2206.10177 [pdf, other]
Title: TCJA-SNN: Temporal-Channel Joint Attention for Spiking Neural Networks
Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[792]  arXiv:2206.10186 [pdf, other]
Title: Improving Localization for Semi-Supervised Object Detection
Journal-ref: International Conference on Image Analysis and Processing. Springer, Cham, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[793]  arXiv:2206.10192 [pdf, other]
Title: LDD: A Dataset for Grape Diseases Object Detection and Instance Segmentation
Journal-ref: International Conference on Image Analysis and Processing. Springer, Cham, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[794]  arXiv:2206.10207 [pdf, other]
Title: SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders
Comments: Accepted by NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[795]  arXiv:2206.10213 [pdf, other]
Title: Rethinking Unsupervised Neural Superpixel Segmentation
Comments: ICIP 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[796]  arXiv:2206.10225 [pdf, other]
Title: Broken News: Making Newspapers Accessible to Print-Impaired
Journal-ref: Extended Abstract at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[797]  arXiv:2206.10241 [pdf, other]
Title: Deep Active Latent Surfaces for Medical Geometries
Comments: 14 pages, 9 figures, submitted for review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[798]  arXiv:2206.10253 [pdf, other]
Title: Document Navigability: A Need for Print-Impaired
Comments: Published at Accessibility, Vision, and Autonomy Meet, CVPR 2022 Workshop
Journal-ref: Extended Abstract for Poster Session at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[799]  arXiv:2206.10254 [pdf, other]
Title: Towards Optimizing OCR for Accessibility
Journal-ref: Extended Abstract for Poster Session at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[800]  arXiv:2206.10263 [pdf, other]
Title: Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping
Comments: submitted to IROS 2015 (rejected)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[801]  arXiv:2206.10324 [pdf, other]
Title: Online progressive instance-balanced sampling for weakly supervised object detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[802]  arXiv:2206.10329 [pdf, other]
Title: SVG Vector Font Generation for Chinese Characters with Transformer
Comments: Accepted to ICIP 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[803]  arXiv:2206.10360 [pdf, other]
Title: Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal Loss
Comments: 5 pages, 3 figures; Accepted to ICIP2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[804]  arXiv:2206.10375 [pdf, other]
Title: MEStereo-Du2CNN: A Novel Dual Channel CNN for Learning Robust Depth Estimates from Multi-exposure Stereo Images for HDR 3D Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[805]  arXiv:2206.10411 [pdf, other]
Title: Audio-video fusion strategies for active speaker detection in meetings
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[806]  arXiv:2206.10436 [pdf, other]
Title: Transformer-Based Multi-modal Proposal and Re-Rank for Wikipedia Image-Caption Matching
Comments: Accepted for publication at the Wiki-M3L workshop, co-located with ICLR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[807]  arXiv:2206.10457 [pdf, other]
Title: Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh Recovery
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[808]  arXiv:2206.10465 [pdf, other]
Title: An Overview of Privacy-enhancing Technologies in Biometric Recognition
Comments: 12 pages, 2 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[809]  arXiv:2206.10491 [pdf, other]
Title: Bi-Calibration Networks for Weakly-Supervised Video Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[810]  arXiv:2206.10520 [pdf, other]
Title: SFace: Privacy-friendly and Accurate Face Recognition using Synthetic Data
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[811]  arXiv:2206.10526 [pdf, other]
Title: QuantFace: Towards Lightweight Face Recognition by Synthetic Data Low-bit Quantization
Comments: Accepted ICPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[812]  arXiv:2206.10531 [pdf, other]
Title: Neural Transformers for Intraductal Papillary Mucosal Neoplasms (IPMN) Classification in MRI images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[813]  arXiv:2206.10535 [pdf, other]
Title: EpiGRAF: Rethinking training of 3D GANs
Comments: NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[814]  arXiv:2206.10536 [pdf, other]
Title: HealNet -- Self-Supervised Acute Wound Heal-Stage Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[815]  arXiv:2206.10552 [pdf, other]
Title: Vicinity Vision Transformer
Comments: code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[816]  arXiv:2206.10555 [pdf, other]
Title: LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs
Comments: In CVPR 2023. Code is at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[817]  arXiv:2206.10562 [pdf, other]
Title: Semantics-Depth-Symbiosis: Deeply Coupled Semi-Supervised Learning of Semantics and Depth
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[818]  arXiv:2206.10571 [pdf, other]
Title: Toward Unpaired Multi-modal Medical Image Segmentation via Learning Structured Semantic Consistency
Comments: MIDL23
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[819]  arXiv:2206.10573 [pdf, ps, other]
Title: H&E-based Computational Biomarker Enables Universal EGFR Screening for Lung Adenocarcinoma
Subjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[820]  arXiv:2206.10587 [pdf, ps, other]
Title: Guiding Visual Attention in Deep Convolutional Neural Networks Based on Human Eye Movements
Comments: 28 pages, 6 figures, 3 supplementary figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[821]  arXiv:2206.10589 [pdf, other]
Title: EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications
Comments: Accepted at ECCVW 2022 (Oral, CADL: Computational Aspects of Deep Learning)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[822]  arXiv:2206.10590 [pdf, other]
Title: Temporally Consistent Semantic Video Editing
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[823]  arXiv:2206.10665 [pdf, other]
Title: BOSS: A Benchmark for Human Belief Prediction in Object-context Scenarios
Comments: 9 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[824]  arXiv:2206.10673 [pdf, ps, other]
Title: Natural Backdoor Datasets
Comments: 18 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[825]  arXiv:2206.10690 [pdf, other]
Title: Learning Continuous Rotation Canonicalization with Radial Beam Sampling
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[826]  arXiv:2206.10692 [pdf, other]
Title: Multi-level Domain Adaptation for Lane Detection
Comments: Proceedings of the CVPR 2022 Workshop of Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[827]  arXiv:2206.10698 [pdf, other]
Title: TiCo: Transformation Invariance and Covariance Contrast for Self-Supervised Visual Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[828]  arXiv:2206.10711 [pdf, other]
Title: Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive Learning
Comments: Accepted to IEEE Transactions on Intelligent Transportation Systems (T-ITS). Extended version of arXiv:2103.00868. The project is at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[829]  arXiv:2206.10737 [pdf, other]
Title: Deep Metric Color Embeddings for Splicing Localization in Severely Degraded Images
Comments: 14 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[830]  arXiv:2206.10779 [pdf, other]
Title: Not Just Streaks: Towards Ground Truth for Single Image Deraining
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[831]  arXiv:2206.10789 [pdf, other]
Title: Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Comments: Preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[832]  arXiv:2206.10809 [pdf, other]
Title: SSMI: How to Make Objects of Interest Disappear without Accessing Object Detectors?
Comments: 6 pages, 2 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[833]  arXiv:2206.10821 [pdf, other]
Title: Coupling Visual Semantics of Artificial Neural Networks and Human Brain Function via Synchronized Activations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[834]  arXiv:2206.10830 [pdf, other]
Title: A Feature Memory Rearrangement Network for Visual Inspection of Textured Surface Defects Toward Edge Intelligent Manufacturing
Comments: Revision to IEEE transactions on automation science and engineering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[835]  arXiv:2206.10831 [pdf, other]
Title: MultiEarth 2022 Deforestation Challenge -- ForestGump
Comments: CVPR 2022, MultiEarth 2022, Deforestation Estimation Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[836]  arXiv:2206.10845 [pdf, other]
Title: Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance Segmentation
Comments: The solution of 1st Place in AVA Accessibility Vision and Autonomy Challenge on CVPR 2022 workshop. Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[837]  arXiv:2206.10861 [pdf, other]
Title: UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022
Comments: 5 pages, 3 figures; technical report for AVA Challenge (see this https URL) at the International Challenge on Activity Recognition (ActivityNet), CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[838]  arXiv:2206.10869 [pdf, other]
Title: NVIDIA-UNIBZ Submission for EPIC-KITCHENS-100 Action Anticipation Challenge 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[839]  arXiv:2206.10878 [pdf, other]
Title: Feature Re-calibration based Multiple Instance Learning for Whole Slide Image Classification
Comments: MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[840]  arXiv:2206.10879 [pdf, other]
Title: Symmetric Network with Spatial Relationship Modeling for Natural Language-based Vehicle Retrieval
Comments: 8 pages, 3 figures, publised to CVPRW
Journal-ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022: 3226-3233
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[841]  arXiv:2206.10885 [pdf, other]
Title: KiloNeuS: A Versatile Neural Implicit Surface Representation for Real-Time Rendering
Comments: 9 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[842]  arXiv:2206.10886 [pdf, other]
Title: Optical Flow Regularization of Implicit Neural Representations for Video Frame Interpolation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[843]  arXiv:2206.10892 [pdf, other]
Title: I^2R-Net: Intra- and Inter-Human Relation Network for Multi-Person Pose Estimation
Comments: Accepected by IJCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[844]  arXiv:2206.10902 [pdf, other]
Title: S2TNet: Spatio-Temporal Transformer Networks for Trajectory Prediction in Autonomous Driving
Comments: Accepted by ACML2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[845]  arXiv:2206.10903 [pdf, ps, other]
Title: UniUD-FBK-UB-UniBZ Submission to the EPIC-Kitchens-100 Multi-Instance Retrieval Challenge 2022
Comments: Ranked joint 1st place in the Multi-Instance Action Retrieval Challenge organized at EPIC@CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[846]  arXiv:2206.10910 [pdf, other]
Title: SpA-Former: Transformer image shadow detection and removal via spatial attention
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[847]  arXiv:2206.10915 [pdf, other]
Title: Understanding the effect of sparsity on neural networks robustness
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[848]  arXiv:2206.10965 [pdf, other]
Title: Polar Parametrization for Vision-based Surround-View 3D Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[849]  arXiv:2206.10969 [pdf, other]
Title: Single Morphing Attack Detection using Siamese Network and Few-shot Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[850]  arXiv:2206.10988 [pdf, other]
Title: AdvSmo: Black-box Adversarial Attack by Smoothing Linear Structure of Texture
Comments: 6 pages,3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[851]  arXiv:2206.10989 [pdf, other]
Title: Identity Documents Authentication based on Forgery Detection of Guilloche Pattern
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[852]  arXiv:2206.10996 [pdf, other]
Title: ProtoCLIP: Prototypical Contrastive Language Image Pretraining
Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[853]  arXiv:2206.11011 [pdf, other]
Title: Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[854]  arXiv:2206.11053 [pdf, other]
Title: Surgical-VQA: Visual Question Answering in Surgical Scenes using Transformer
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO); Image and Video Processing (eess.IV)
[855]  arXiv:2206.11080 [pdf, other]
Title: Motion Gait: Gait Recognition via Motion Excitation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[856]  arXiv:2206.11095 [pdf, other]
Title: A High Resolution Multi-exposure Stereoscopic Image & Video Database of Natural Scenes
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[857]  arXiv:2206.11115 [pdf, other]
Title: ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition Canvas
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[858]  arXiv:2206.11134 [pdf, other]
Title: Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[859]  arXiv:2206.11180 [pdf, other]
Title: Optimal transport meets noisy label robust loss and MixUp regularization for domain adaptation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[860]  arXiv:2206.11203 [pdf, other]
Title: Facke: a Survey on Generative Models for Face Swapping
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[861]  arXiv:2206.11212 [pdf, other]
Title: VisFIS: Visual Feature Importance Supervision with Right-for-the-Right-Reason Objectives
Comments: NeurIPS 2022 (first two authors contributed equally)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[862]  arXiv:2206.11215 [pdf, other]
Title: Certifiable 3D Object Pose Estimation: Foundations, Learning Models, and Self-Training
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[863]  arXiv:2206.11250 [pdf, other]
Title: Depth-aware Glass Surface Detection with Cross-modal Context Mining
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[864]  arXiv:2206.11253 [pdf, other]
Title: Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Comments: Accepted by NeurIPS 2022. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[865]  arXiv:2206.11352 [pdf, ps, other]
Title: Doubly Reparameterized Importance Weighted Structure Learning for Scene Graph Generation
Comments: arXiv admin note: substantial text overlap with arXiv:2205.07017
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[866]  arXiv:2206.11358 [pdf, other]
Title: Monocular Spherical Depth Estimation with Explicitly Connected Weak Layout Cues
Comments: Project page at this https URL
Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing, Volume 183, January 2022, Pages 269-285
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[867]  arXiv:2206.11404 [pdf, other]
Title: The ArtBench Dataset: Benchmarking Generative Models with Artworks
Comments: The first two authors contributed equally to this work. The code and data are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[868]  arXiv:2206.11428 [pdf, other]
Title: LidarMultiNet: Unifying LiDAR Semantic Segmentation, 3D Object Detection, and Panoptic Segmentation in a Single Multi-task Network
Comments: Official 1st Place Solution for the Waymo Open Dataset Challenges 2022 - 3D Semantic Segmentation. Official leaderboard: this https URL CVPR 2022 Workshop on Autonomous Driving: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[869]  arXiv:2206.11443 [pdf, other]
Title: Image-based Stability Quantification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[870]  arXiv:2206.11459 [pdf, other]
Title: Explore Spatio-temporal Aggregation for Insubstantial Object Detection: Benchmark Dataset and Baseline
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[871]  arXiv:2206.11462 [pdf, ps, other]
Title: ICME 2022 Few-shot LOGO detection top 9 solution
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[872]  arXiv:2206.11473 [pdf, other]
Title: Complementary datasets to COCO for object detection
Authors: Ali Borji
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[873]  arXiv:2206.11474 [pdf, other]
Title: Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation
Comments: 24 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[874]  arXiv:2206.11476 [pdf, other]
Title: Dynamic Scene Deblurring Based on Continuous Cross-Layer Attention Transmission
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[875]  arXiv:2206.11493 [pdf, other]
Title: Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization
Comments: Accepted by CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[876]  arXiv:2206.11499 [pdf, other]
Title: Parallel Structure from Motion for UAV Images via Weighted Connected Dominating Set
Comments: 14 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[877]  arXiv:2206.11502 [pdf, ps, other]
Title: A Review of Published Machine Learning Natural Language Processing Applications for Protocolling Radiology Imaging
Authors: Nihal Raju (5), Michael Woodburn (1 and 5), Stefan Kachel (2 and 3), Jack O'Shaughnessy (5), Laurence Sorace (5), Natalie Yang (2), Ruth P Lim (2 and 4) ((1) Harvard University, Extension School, Cambridge, MA, USA, (2) Department of Radiology, The University of Melbourne, Parkville, (3) Department of Radiology, Columbia University in the City of New York, (4) Department of Surgery, Austin, The University of Melbourne, (5) Austin Hospital, Austin Health, Melbourne, Australia)
Comments: 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[878]  arXiv:2206.11520 [pdf, other]
Title: ICOS Protein Expression Segmentation: Can Transformer Networks Give Better Results?
Comments: Accepted MIUA conference (Abstract short paper)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[879]  arXiv:2206.11541 [pdf, other]
Title: A Neuromorphic Vision-Based Measurement for Robust Relative Localization in Future Space Exploration Missions
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[880]  arXiv:2206.11589 [pdf, other]
Title: Learning Towards the Largest Margins
Comments: ICLR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[881]  arXiv:2206.11610 [pdf, other]
Title: 1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)
Comments: Winner of the 2nd RxR-Habitat Competition @ CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[882]  arXiv:2206.11629 [pdf, other]
Title: Global Sensing and Measurements Reuse for Image Compressed Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[883]  arXiv:2206.11653 [pdf, other]
Title: Learning To Generate Scene Graph from Head to Tail
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[884]  arXiv:2206.11657 [pdf, other]
Title: Warped Convolutional Networks: Bridge Homography to sl(3) algebra by Group Convolution
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[885]  arXiv:2206.11678 [pdf, other]
Title: BlazePose GHUM Holistic: Real-time 3D Human Landmarks and Pose Estimation
Comments: 4 pages, 4 figures; CVPR Workshop on Computer Vision for Augmented and Virtual Reality, New Orleans, LA, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[886]  arXiv:2206.11695 [pdf, other]
Title: NTIRE 2022 Challenge on Perceptual Image Quality Assessment
Comments: This report has been published in CVPR 2022 NTIRE workshop. arXiv admin note: text overlap with arXiv:2105.03072
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[887]  arXiv:2206.11723 [pdf, other]
Title: Self-Supervised Training with Autoencoders for Visual Anomaly Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[888]  arXiv:2206.11736 [pdf, other]
Title: NovelCraft: A Dataset for Novelty Detection and Discovery in Open Worlds
Comments: Published in Transactions on Machine Learning Research (03/2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[889]  arXiv:2206.11739 [pdf, other]
Title: Evidence fusion with contextual discounting for multi-modality medical image segmentation
Comments: MICCAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[890]  arXiv:2206.11752 [pdf, other]
Title: CLAMP: Prompt-based Contrastive Learning for Connecting Language and Animal Pose
Comments: CVPR2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[891]  arXiv:2206.11759 [pdf, other]
Title: What makes you, you? Analyzing Recognition by Swapping Face Parts
Comments: Accepted for publication at 26TH International Conference on Pattern Recognition (ICPR), 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[892]  arXiv:2206.11768 [pdf, other]
Title: FitGAN: Fit- and Shape-Realistic Generative Adversarial Networks for Fashion
Comments: 26th International Conference on Pattern Recognition (ICPR) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[893]  arXiv:2206.11804 [pdf, other]
Title: Rethinking Surgical Instrument Segmentation: A Background Image Can Be All You Need
Comments: 10 pages, MICCAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[894]  arXiv:2206.11808 [pdf, other]
Title: Unseen Object 6D Pose Estimation: A Benchmark and Baselines
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[895]  arXiv:2206.11825 [pdf, other]
Title: YOLOSA: Object detection based on 2D local feature superimposed self-attention
Comments: This paper is under consideration at Pattern Recognition Letters
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[896]  arXiv:2206.11826 [pdf, other]
Title: Toward Clinically Assisted Colorectal Polyp Recognition via Structured Cross-modal Representation Consistency
Comments: Early Accepted by MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[897]  arXiv:2206.11892 [pdf, other]
Title: DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Change Detection
Comments: Code available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[898]  arXiv:2206.11894 [pdf, other]
Title: MaskViT: Masked Visual Pre-Training for Video Prediction
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[899]  arXiv:2206.11895 [pdf, other]
Title: Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space
Comments: NeurIPS 2022. Our code is at this https URL Our project page is at this https URL v3, v4 for minor updates on figures and visualizations
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[900]  arXiv:2206.11896 [pdf, other]
Title: EventNeRF: Neural Radiance Fields from a Single Colour Event Camera
Comments: 19 pages, 21 figures, 3 tables; CVPR 2023
Journal-ref: Computer Vision and Pattern Recognition (CVPR) 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[901]  arXiv:2206.11920 [pdf, other]
Title: Agriculture-Vision Challenge 2022 -- The Runner-Up Solution for Agricultural Pattern Recognition via Transformer-based Models
Comments: CVPR 2022, Agriculture-Vision Challenge, Remote Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[902]  arXiv:2206.11927 [pdf, other]
Title: Towards Galaxy Foundation Models with Hybrid Contrastive Learning
Comments: Accepted at the ICML 2022 Workshop on Machine Learning for Astrophysics. Data: www.github.com/mwalmsley/pytorch-galaxy-datasets. Please reach out to share your labelled data - all contributions will be credited in future work
Subjects: Computer Vision and Pattern Recognition (cs.CV); Astrophysics of Galaxies (astro-ph.GA)
[903]  arXiv:2206.11952 [pdf, other]
Title: UNeRF: Time and Memory Conscious U-Shaped Network for Training Neural Radiance Fields
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[904]  arXiv:2206.12035 [pdf, other]
Title: The Second Place Solution for The 4th Large-scale Video Object Segmentation Challenge--Track 3: Referring Video Object Segmentation
Comments: 4 pages, 2 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[905]  arXiv:2206.12043 [pdf, other]
Title: Protecting President Zelenskyy against Deep Fakes
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[906]  arXiv:2206.12046 [pdf, other]
Title: Bilateral Network with Channel Splitting Network and Transformer for Thermal Image Super-Resolution
Comments: The second place solution for CVPR2022 PBVS-TISR challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[907]  arXiv:2206.12055 [pdf, other]
Title: SDF-StyleGAN: Implicit SDF-Based StyleGAN for 3D Shape Generation
Comments: Accepted to Computer Graphics Forum (SGP), 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[908]  arXiv:2206.12063 [src]
Title: Mutual Information-guided Knowledge Transfer for Novel Class Discovery
Comments: The derivation of Mutual Information in the manuscript is wrong
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[909]  arXiv:2206.12071 [pdf, other]
Title: Contrastive Learning of Features between Images and LiDAR
Comments: accepted in CASE2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[910]  arXiv:2206.12073 [pdf, other]
Title: MaskRange: A Mask-classification Model for Range-view based LiDAR Segmentation
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[911]  arXiv:2206.12099 [pdf, ps, other]
Title: A novel approach for glaucoma classification by wavelet neural networks using graph-based, statisitcal features of qualitatively improved images
Comments: 25 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[912]  arXiv:2206.12117 [pdf, other]
Title: Self Supervised Learning for Few Shot Hyperspectral Image Classification
Comments: Accepted in IGARSS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[913]  arXiv:2206.12123 [pdf, ps, other]
Title: Some theoretical results on discrete contour trees
Authors: Yuqing Song
Comments: 5 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
[914]  arXiv:2206.12126 [pdf, other]
Title: Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning
Comments: Accepted by CVPR 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[915]  arXiv:2206.12128 [pdf, other]
Title: Excavating RoI Attention for Underwater Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[916]  arXiv:2206.12216 [pdf, other]
Title: Optimized Views Photogrammetry: Precision Analysis and A Large-scale Case Study in Qingdao
Comments: 16 pages, 24 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[917]  arXiv:2206.12351 [pdf, other]
Title: Megapixel Image Generation with Step-Unrolled Denoising Autoencoders
Comments: 17 pages + 9 appendix pages. 20 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[918]  arXiv:2206.12356 [pdf, other]
Title: HM3D-ABO: A Photo-realistic Dataset for Object-centric Multi-view 3D Reconstruction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[919]  arXiv:2206.12370 [pdf, other]
Title: Mixed Sample Augmentation for Online Distillation
Comments: 5 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[920]  arXiv:2206.12372 [pdf, other]
Title: QReg: On Regularization Effects of Quantization
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[921]  arXiv:2206.12381 [pdf, other]
Title: Defending Backdoor Attacks on Vision Transformer via Patch Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[922]  arXiv:2206.12396 [pdf, other]
Title: Text-Driven Stylization of Video Objects
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[923]  arXiv:2206.12403 [pdf, other]
Title: ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings
Comments: code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[924]  arXiv:2206.12455 [pdf, other]
Title: Ev-NeRF: Event Based Neural Radiance Field
Comments: Accepted to WACV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[925]  arXiv:2206.12458 [pdf, other]
Title: Bag of Tricks for Long-Tail Visual Recognition of Animal Species in Camera-Trap Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[926]  arXiv:2206.12464 [pdf, other]
Title: Motion Estimation for Large Displacements and Deformations
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[927]  arXiv:2206.12480 [pdf, other]
Title: Attention-Guided Autoencoder for Automated Progression Prediction of Subjective Cognitive Decline with Structural MRI
Comments: 10 pages, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[928]  arXiv:2206.12498 [pdf, other]
Title: Optimal and Robust Category-level Perception: Object Pose and Shape Estimation from 2D and 3D Semantic Keypoints
Comments: arXiv admin note: text overlap with arXiv:2104.08383
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[929]  arXiv:2206.12505 [pdf, other]
Title: Stain Based Contrastive Co-training for Histopathological Image Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[930]  arXiv:2206.12533 [pdf, other]
Title: From Shallow to Deep: Compositional Reasoning over Graphs for Visual Question Answering
Authors: Zihao Zhu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[931]  arXiv:2206.12534 [pdf, other]
Title: SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos
Comments: CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[932]  arXiv:2206.12558 [pdf, other]
Title: FastBVP-Net: a lightweight pulse extraction network for measuring heart rhythm via facial videos
Comments: 9 pages, 2figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[933]  arXiv:2206.12571 [pdf, other]
Title: CV 3315 Is All You Need : Semantic Segmentation Competition
Authors: Akide Liu, Zihan Wang
Comments: arXiv admin note: text overlap with arXiv:2105.15203 by other authors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[934]  arXiv:2206.12590 [pdf, other]
Title: RSTAM: An Effective Black-Box Impersonation Attack on Face Recognition using a Mobile and Compact Printer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[935]  arXiv:2206.12592 [pdf, other]
Title: Asymmetric Transfer Hashing with Adaptive Bipartite Graph Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[936]  arXiv:2206.12596 [pdf, ps, other]
Title: Non-iterative Coarse-to-fine Registration based on Single-pass Deep Cumulative Learning
Comments: Accepted at International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2022)
Journal-ref: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 88-97, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[937]  arXiv:2206.12612 [pdf, other]
Title: Learn to Predict How Humans Manipulate Large-sized Objects from Interactive Motions
Journal-ref: IEEE Robotics and Automation Letters ( Volume: 7, Issue: 2, April 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[938]  arXiv:2206.12614 [pdf, other]
Title: BokehMe: When Neural Rendering Meets Classical Rendering
Comments: Accepted by CVPR 2022 (Oral); Project: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[939]  arXiv:2206.12622 [pdf, other]
Title: SAT: Self-adaptive training for fashion compatibility prediction
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[940]  arXiv:2206.12623 [pdf, other]
Title: Inverted Semantic-Index for Image Retrieval
Authors: Ying Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB)
[941]  arXiv:2206.12634 [pdf, other]
Title: SC-Transformer++: Structured Context Transformer for Generic Event Boundary Detection
Comments: winner method at LOVEU@CVPR'22 Generic Event Boundary Detection Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[942]  arXiv:2206.12648 [pdf, other]
Title: BIMS-PU: Bi-Directional and Multi-Scale Point Cloud Upsampling
Comments: Accepted to RA-L 2022. in IEEE Robotics and Automation Letters
Journal-ref: in IEEE Robotics and Automation Letters, vol. 7, no. 3, pp. 7447-7454, July 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[943]  arXiv:2206.12650 [pdf, ps, other]
Title: Machine Learning-based Biological Ageing Estimation Technologies: A Survey
Comments: in Recent Advances in AI-enabled Automated Medical Diagnosis this https URL
Journal-ref: Recent Advances in AI-enabled Automated Medical Diagnosis, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[944]  arXiv:2206.12651 [pdf, ps, other]
Title: Review on Social Behavior Analysis of Laboratory Animals: From Methodologies to Applications
Comments: this https URL
Journal-ref: Recent Advances in AI-enabled Automated Medical Diagnosis, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[945]  arXiv:2206.12653 [pdf, ps, other]
Title: Diagnostic Communication and Visual System based on Vehicle UDS Protocol
Authors: Hong Zhang, Ding Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[946]  arXiv:2206.12657 [pdf, other]
Title: Enhanced Deep Animation Video Interpolation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[947]  arXiv:2206.12675 [pdf, other]
Title: Learning to Infer 3D Shape Programs with Differentiable Renderer
Authors: Yichao Liang
Comments: Technical report written in 2020; 10 pages, 5 figures. arXiv admin note: substantial text overlap with arXiv:1901.02875 by other authors
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[948]  arXiv:2206.12681 [pdf, other]
Title: UltraMNIST Classification: A Benchmark to Train CNNs for Very Large Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[949]  arXiv:2206.12685 [pdf, ps, other]
Title: Defense against adversarial attacks on deep convolutional neural networks through nonlocal denoising
Journal-ref: IAES International Journal of Artificial Intelligence, Vol. 11, No. 3, September 2022, pp. 961~968, ISSN: 2252-8938
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[950]  arXiv:2206.12694 [pdf, other]
Title: RandStainNA: Learning Stain-Agnostic Features from Histology Slides by Bridging Stain Augmentation and Normalization
Comments: 12 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[951]  arXiv:2206.12704 [pdf, other]
Title: Anatomy-Guided Weakly-Supervised Abnormality Localization in Chest X-rays
Comments: Accepted by MICCAI 20222
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[952]  arXiv:2206.12714 [pdf, other]
Title: Defending Multimodal Fusion Models against Single-Source Adversaries
Comments: CVPR 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[953]  arXiv:2206.12725 [pdf, other]
Title: Empirical Evaluation of Physical Adversarial Patch Attacks Against Overhead Object Detection Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[954]  arXiv:2206.12738 [pdf, other]
Title: Self-Supervised 3D Monocular Object Detection by Recycling Bounding Boxes
Comments: Published at ICCVW-SSLAD 2021. arXiv admin note: substantial text overlap with arXiv:2104.10786
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[955]  arXiv:2206.12740 [pdf, other]
Title: Multi Visual Modality Fall Detection Dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[956]  arXiv:2206.12745 [pdf, ps, other]
Title: Sequential image recovery using joint hierarchical Bayesian learning
Comments: 24 pages, 15 figures
Journal-ref: J Sci Comput 96, 4 (2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
[957]  arXiv:2206.12755 [pdf, other]
Title: Training Your Sparse Neural Network Better with Any Mask
Comments: Accepted by ICML 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[958]  arXiv:2206.12772 [pdf, other]
Title: Exploiting Transformation Invariance and Equivariance for Self-supervised Sound Localisation
Comments: Camera-ready Version for ACMMM 2022, Project page is this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[959]  arXiv:2206.12788 [pdf, other]
Title: Representative Teacher Keys for Knowledge Distillation Model Compression Based on Attention Mechanism for Image Classification
Comments: eight pages, six figures, three tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[960]  arXiv:2206.12794 [pdf, other]
Title: CTMQ: Cyclic Training of Convolutional Neural Networks with Multiple Quantization Steps
Comments: submitted to NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[961]  arXiv:2206.12798 [pdf, other]
Title: Multiple Instance Learning with Mixed Supervision in Gleason Grading
Comments: Accepted by MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[962]  arXiv:2206.12837 [pdf, other]
Title: Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer
Comments: Ailin and Zhewei contributed equally to this work. ACM MM22 workshop paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[963]  arXiv:2206.12845 [pdf, other]
Title: RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval
Comments: Preprint, under review in TCSVT Journal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[964]  arXiv:2206.12849 [pdf, other]
Title: Semantic Role Aware Correlation Transformer for Text to Video Retrieval
Comments: Camera-ready for ICIP 2021
Journal-ref: IEEE International Conference on Image Processing (ICIP), 2021, pp. 1334-1338
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[965]  arXiv:2206.12869 [pdf, other]
Title: Image Aesthetics Assessment Using Graph Attention Network
Comments: International Conference on Pattern Recognition (ICPR), 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[966]  arXiv:2206.12885 [pdf, ps, other]
Title: FingerGAN: A Constrained Fingerprint Generation Scheme for Latent Fingerprint Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[967]  arXiv:2206.12912 [pdf, other]
Title: Woodscape Fisheye Object Detection for Autonomous Driving -- CVPR 2022 OmniCV Workshop Challenge
Comments: Workshop on Omnidirectional Computer Vision (OmniCV) at Conference on Computer Vision and Pattern Recognition (CVPR) 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[968]  arXiv:2206.12914 [pdf, other]
Title: Video Anomaly Detection via Prediction Network with Enhanced Spatio-Temporal Memory Exchange
Comments: Accepted at ICASSP 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[969]  arXiv:2206.12921 [pdf, other]
Title: Non-Parametric Style Transfer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[970]  arXiv:2206.12923 [pdf, other]
Title: Video Activity Localisation with Uncertainties in Temporal Boundary
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[971]  arXiv:2206.12925 [pdf, other]
Title: Vision Transformer for Contrastive Clustering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[972]  arXiv:2206.12930 [pdf, other]
Title: SVBR-NET: A Non-Blind Spatially Varying Defocus Blur Removal Network
Comments: Accepted to ICIP2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[973]  arXiv:2206.12943 [pdf, other]
Title: Multi-view Feature Augmentation with Adaptive Class Activation Mapping
Comments: An arxiv version of the paper published in Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI-21). See this https URL
Journal-ref: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence. Main Track. 2021. Pages 678-684
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[974]  arXiv:2206.12946 [pdf, other]
Title: AFT-VO: Asynchronous Fusion Transformers for Multi-View Visual Odometry Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[975]  arXiv:2206.12952 [pdf, other]
Title: Nonwatertight Mesh Reconstruction
Authors: Partha Ghosh
Comments: arXiv admin note: text overlap with arXiv:2106.03452 by other authors
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[976]  arXiv:2206.12958 [pdf, ps, other]
Title: Szloca: towards a framework for full 3D tracking through a single camera in context of interactive arts
Authors: Sahaj Garg
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[977]  arXiv:2206.12959 [pdf, other]
Title: Probabilistic PolarGMM: Unsupervised Cluster Learning of Very Noisy Projection Images of Unknown Pose
Comments: 13 pages, including appendices
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[978]  arXiv:2206.12963 [pdf, other]
Title: Self-Healing Robust Neural Networks via Closed-Loop Control
Comments: 48 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[979]  arXiv:2206.12972 [pdf, other]
Title: VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
Comments: accepted by The 29th IEEE International Conference on Image Processing (IEEE ICIP) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[980]  arXiv:2206.12994 [pdf, other]
Title: Automatic Generation of Product-Image Sequence in E-commerce
Comments: Accepted by KDD 2022 ADS
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[981]  arXiv:2206.13028 [pdf, other]
Title: Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
Comments: 10 pages, 4 figures, accepted by AAAI 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[982]  arXiv:2206.13042 [pdf, other]
Title: A Strategy Optimized Pix2pix Approach for SAR-to-Optical Image Translation Task
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[983]  arXiv:2206.13076 [pdf, other]
Title: SearchMorph:Multi-scale Correlation Iterative Network for Deformable Registration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[984]  arXiv:2206.13078 [pdf, other]
Title: Video2StyleGAN: Encoding Video in Latent Space for Manipulation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[985]  arXiv:2206.13079 [pdf, other]
Title: Dynamic Bank Learning for Semi-supervised Federated Image Diagnosis with Class Imbalance
Comments: Early accepted by 25th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI'22)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[986]  arXiv:2206.13082 [pdf, ps, other]
Title: PST: Plant segmentation transformer for 3D point clouds of rapeseed plants at the podding stage
Comments: 46 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[987]  arXiv:2206.13115 [pdf, other]
Title: Lesion-Aware Contrastive Representation Learning for Histopathology Whole Slide Images Analysis
Comments: accepted for MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[988]  arXiv:2206.13117 [src]
Title: SARNet: Semantic Augmented Registration of Large-Scale Urban Point Clouds
Comments: Author information changes
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[989]  arXiv:2206.13142 [pdf, other]
Title: Representing motion as a sequence of latent primitives, a flexible approach for human motion modelling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[990]  arXiv:2206.13155 [pdf, other]
Title: Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[991]  arXiv:2206.13156 [pdf, other]
Title: Kernel Attention Transformer (KAT) for Histopathology Whole Slide Image Classification
Comments: accepted for MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[992]  arXiv:2206.13188 [pdf, other]
Title: Self-supervised Learning in Remote Sensing: A Review
Comments: Accepted by IEEE Geoscience and Remote Sensing Magazine. 32 pages, 22 content pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[993]  arXiv:2206.13199 [pdf, other]
Title: MGNet: Monocular Geometric Scene Understanding for Autonomous Driving
Journal-ref: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 15784-15795
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[994]  arXiv:2206.13263 [pdf, other]
Title: Learning with Weak Annotations for Robust Maritime Obstacle Detection
Comments: Published in MDPI Sensors, 23 pages, 8 figures
Journal-ref: Sensors 2022, 22, 9139
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[995]  arXiv:2206.13282 [pdf, other]
Title: Monocular Depth Decomposition of Semi-Transparent Volume Renderings
Comments: accepted at IEEE TVCG 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[996]  arXiv:2206.13294 [pdf, other]
Title: LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation
Journal-ref: CoRL 2022 https://openreview.net/forum?id=abd_D-iVjk0
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[997]  arXiv:2206.13296 [pdf, other]
Title: Consistency-preserving Visual Question Answering in Medical Imaging
Comments: Appears in Medical Image Computing and Computer Assisted Interventions (MICCAI), 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[998]  arXiv:2206.13304 [pdf, other]
Title: PARTICUL: Part Identification with Confidence measure using Unsupervised Learning
Authors: Romain Xu-Darme (LSL, MRIM ), Georges Quénot (MRIM ), Zakaria Chihani (LSL), Marie-Christine Rousset (SLIDE )
Comments: Accepted at XAIE: 2nd Workshop on Explainable and Ethical AI -- ICPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[999]  arXiv:2206.13317 [pdf, other]
Title: Automatic identification of segmentation errors for radiotherapy using geometric learning
Comments: Accepted in 25th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2022). This preprint has not undergone peer review or any post-submission improvements or corrections
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1000]  arXiv:2206.13318 [pdf, other]
Title: Key-frame Guided Network for Thyroid Nodule Recognition using Ultrasound Videos
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1001]  arXiv:2206.13329 [pdf, other]
Title: Prior-Guided One-shot Neural Architecture Search
Comments: Official 3st Place Solution for the Second workshop Neural Architecture Search Second lightweight NAS Challenge 2022 - Track1 Supernet Track. Official leaderboard: this https URL CVPR 2022 Workshop: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1002]  arXiv:2206.13342 [pdf, other]
Title: Open Set Classification of Untranscribed Handwritten Documents
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[1003]  arXiv:2206.13346 [pdf, other]
Title: Distributional Gaussian Processes Layers for Out-of-Distribution Detection
Comments: Published in Journal of Machine Learning for Biomedical Imaging: Special Issue: Information Processing in Medical Imaging (IPMI) 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1004]  arXiv:2206.13356 [pdf, other]
Title: iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition
Comments: This is a technical report from the Chinese University of Hong Kong
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1005]  arXiv:2206.13381 [pdf, other]
Title: TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform Mask
Comments: This paper has been accepted by IEEE Transactions on Multimedia
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1006]  arXiv:2206.13383 [pdf, ps, other]
Title: Mushroom image recognition and distance generation based on attention-mechanism model and genetic information
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1007]  arXiv:2206.13386 [pdf, other]
Title: Uncovering variability in human driving behavior through automatic extraction of similar traffic scenes from large naturalistic datasets
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1008]  arXiv:2206.13388 [pdf, ps, other]
Title: Rotated Digit Recognition by Variational Autoencoders with Fixed Output Distributions
Authors: David Yevick
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1009]  arXiv:2206.13389 [pdf, other]
Title: UI Layers Merger: Merging UI layers via Visual Learning and Boundary Prior
Comments: 15 pages, accepted to Frontiers of Information Technology & Electronic Engineering. This is a preprint version, the copyright belongs to the Springer Nature journals
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1010]  arXiv:2206.13390 [pdf, other]
Title: A Comprehensive Survey on Video Saliency Detection with Auditory Information: the Audio-visual Consistency Perceptual is the Key!
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1011]  arXiv:2206.13391 [pdf, other]
Title: Deep reinforced active learning for multi-class image classification
Comments: 10 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1012]  arXiv:2206.13392 [pdf, ps, other]
Title: Remote Sensing Image Classification using Transfer Learning and Attention Based Deep Neural Network
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1013]  arXiv:2206.13395 [pdf, other]
Title: Gait Cycle Reconstruction and Human Identification from Occluded Sequences
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1014]  arXiv:2206.13396 [pdf, other]
Title: A Simple Approach for Visual Rearrangement: 3D Mapping and Semantic Search
Comments: Winner of the Rearrangement Challenge at CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[1015]  arXiv:2206.13397 [pdf, other]
Title: Generative Modelling With Inverse Heat Dissipation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1016]  arXiv:2206.13398 [pdf, other]
Title: An Efficient Industrial Federated Learning Framework for AIoT: A Face Recognition Application
Comments: FL-IJCAL'22 Accepted Paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1017]  arXiv:2206.13413 [pdf, other]
Title: RES: A Robust Framework for Guiding Visual Explanation
Comments: Published in KDD 2022
Journal-ref: In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), August 14-18, 2022, Washington, DC, USA
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1018]  arXiv:2206.13434 [pdf, other]
Title: ContraReg: Contrastive Learning of Multi-modality Unsupervised Deformable Image Registration
Comments: Accepted by MICCAI 2022. 13 pages, 6 figures, and 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1019]  arXiv:2206.13454 [pdf, other]
Title: Optimizing Video Prediction via Video Frame Interpolation
Comments: Accepted by the CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1020]  arXiv:2206.13462 [pdf, other]
Title: Learn Fast, Segment Well: Fast Object Segmentation Learning on the iCub Robot
Comments: \copyright 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1021]  arXiv:2206.13500 [pdf, other]
Title: Neural Neural Textures Make Sim2Real Consistent
Comments: 9 pages, 10 figures (without references or appendix); 16 pages, 16 figures (with appendix)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
[1022]  arXiv:2206.13502 [pdf, other]
Title: Programmatic Concept Learning for Human Motion Description and Synthesis
Comments: CVPR 2022. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1023]  arXiv:2206.13559 [pdf, other]
Title: ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning
Comments: Accepted in NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1024]  arXiv:2206.13577 [pdf, other]
Title: A View Independent Classification Framework for Yoga Postures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1025]  arXiv:2206.13597 [pdf, other]
Title: NeuRIS: Neural Reconstruction of Indoor Scenes Using Normal Priors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1026]  arXiv:2206.13608 [pdf, other]
Title: Reducing Annotation Need in Self-Explanatory Models for Lung Nodule Diagnosis
Comments: 10 pages, 4 figures, 2 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1027]  arXiv:2206.13626 [pdf, other]
Title: Patch Selection for Melanoma Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1028]  arXiv:2206.13628 [pdf, other]
Title: Multi-scale Network with Attentional Multi-resolution Fusion for Point Cloud Semantic Segmentation
Authors: Yuyan Li, Ye Duan
Comments: ICPR 2022, poster
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1029]  arXiv:2206.13644 [pdf, other]
Title: Feature Refinement to Improve High Resolution Image Inpainting
Comments: 5 pages, 5 figures, Published in CVPR Workshop on Computer Vision for Augmented and Virtual Reality, New Orleans, LA, 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1030]  arXiv:2206.13673 [pdf, other]
Title: How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying Pixels
Comments: 8 pages
Journal-ref: IEEE Robotics and Automation Letters 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[1031]  arXiv:2206.13677 [pdf, other]
Title: Towards Global-Scale Crowd+AI Techniques to Map and Assess Sidewalks for People with Disabilities
Comments: CVPR 2022 AVA (Accessibility, Vision, and Autonomy Meet) Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[1032]  arXiv:2206.13718 [pdf, other]
Title: The Third Place Solution for CVPR2022 AVA Accessibility Vision and Autonomy Challenge
Comments: The third place solution for CVPR2022 AVA Accessibility Vision and Autonomy Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1033]  arXiv:2206.13728 [pdf, ps, other]
Title: Boosting R-CNN: Reweighting R-CNN Samples by RPN's Error for Underwater Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1034]  arXiv:2206.13732 [pdf, other]
Title: A Comprehensive Survey on Deep Gait Recognition: Algorithms, Datasets and Challenges
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1035]  arXiv:2206.13737 [pdf, other]
Title: Adversarial Consistency for Single Domain Generalization in Medical Image Segmentation
Comments: MICCAI2022 accpted
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1036]  arXiv:2206.13785 [pdf, other]
Title: 3D Multi-Object Tracking with Differentiable Pose Estimation
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1037]  arXiv:2206.13803 [pdf, other]
Title: FedIIC: Towards Robust Federated Learning for Class-Imbalanced Medical Image Classification
Comments: This paper has been accepted by MICCAI 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1038]  arXiv:2206.13829 [pdf, other]
Title: Cross-Forgery Analysis of Vision Transformers and CNNs for Deepfake Image Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1039]  arXiv:2206.13850 [pdf, other]
Title: When the Sun Goes Down: Repairing Photometric Losses for All-Day Depth Estimation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1040]  arXiv:2206.13858 [pdf, other]
Title: Accurate and Real-time Pseudo Lidar Detection: Is Stereo Neural Network Really Necessary?
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1041]  arXiv:2206.13887 [pdf, other]
Title: Generating near-infrared facial expression datasets with dimensional affect labels
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1042]  arXiv:2206.13951 [pdf, other]
Title: Robustifying Vision Transformer without Retraining from Scratch by Test-Time Class-Conditional Feature Alignment
Comments: Accepted to IJCAI-ECAI2022. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1043]  arXiv:2206.13962 [src]
Title: Multi-Prior Learning via Neural Architecture Search for Blind Face Restoration
Comments: We found some problems with the article and need to withdrawal it
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1044]  arXiv:2206.13963 [pdf, other]
Title: Primitive Graph Learning for Unified Vector Mapping
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1045]  arXiv:2206.13964 [pdf, other]
Title: Learning Gait Representation from Massive Unlabelled Walking Videos: A Benchmark
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1046]  arXiv:2206.13996 [pdf, other]
Title: Detecting tiny objects in aerial images: A normalized Wasserstein distance and a new benchmark
Comments: Accepted by ISPRS Journal of Photogrammetry and Remote Sensing
Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing (2022) 190:79-93
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1047]  arXiv:2206.14009 [pdf, other]
Title: Show Me Your Face, And I'll Tell You How You Speak
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
[1048]  arXiv:2206.14011 [pdf, ps, other]
Title: Taxonomy and evolution predicting using deep learning in images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1049]  arXiv:2206.14020 [pdf, other]
Title: Rethinking Adversarial Examples for Location Privacy Protection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1050]  arXiv:2206.14116 [pdf, other]
Title: SSL-Lanes: Self-Supervised Learning for Motion Forecasting in Autonomous Driving
Comments: Accepted to CoRL-2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[1051]  arXiv:2206.14164 [pdf, ps, other]
Title: Visualizing and Alleviating the Effect of Radial Distortion on Camera Calibration Using Principal Lines
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1052]  arXiv:2206.14180 [pdf, other]
Title: High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions
Comments: Accepted to ECCV 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1053]  arXiv:2206.14195 [pdf, other]
Title: Pedestrian 3D Bounding Box Prediction
Comments: Accepted and published in hEART2022 (the 10th Symposium of the European Association for Research in Transportation): this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1054]  arXiv:2206.14245 [pdf, other]
Title: SImProv: Scalable Image Provenance Framework for Robust Content Attribution
Comments: Under consideration at Computer Vision and Image Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1055]  arXiv:2206.14263 [pdf, other]
Title: ZoDIAC: Zoneout Dropout Injection Attention Calculation
Comments: This work has been submitted to SN-AIRE journal and is currently under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1056]  arXiv:2206.14302 [pdf, ps, other]
Title: Reinforcement Learning in Medical Image Analysis: Concepts, Applications, Challenges, and Future Directions
Comments: 30 pages, 13 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1057]  arXiv:2206.14314 [pdf, other]
Title: Generative Neural Articulated Radiance Fields
Comments: Project website: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1058]  arXiv:2206.14344 [pdf, other]
Title: A New Adjacency Matrix Configuration in GCN-based Models for Skeleton-based Action Recognition
Comments: 19 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1059]  arXiv:2206.14350 [pdf, ps, other]
Title: Convolutional Neural Network Based Partial Face Detection
Comments: Accepted in 7th International Conference for Convergence in Technology (I2CT), 2022, 6 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1060]  arXiv:2206.14355 [pdf, other]
Title: EBMs vs. CL: Exploring Self-Supervised Visual Pretraining for Visual Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[1061]  arXiv:2206.14381 [pdf, other]
Title: Exploiting Semantic Role Contextualized Video Features for Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022
Comments: Ranked joint 3rd place in the Multi-Instance Retrieval Challenge at EPIC@CVPR2022. (v2: ref error is corrected)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[1062]  arXiv:2206.14409 [pdf, ps, other]
Title: BATFormer: Towards Boundary-Aware Lightweight Transformer for Efficient Medical Image Segmentation
Comments: Accepted by IEEE Journal of Biomedical and Health Informatics The source code is publicly available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1063]  arXiv:2206.14413 [pdf, other]
Title: The Lighter The Better: Rethinking Transformers in Medical Image Segmentation Through Adaptive Pruning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1064]  arXiv:2206.14437 [pdf, other]
Title: MaNi: Maximizing Mutual Information for Nuclei Cross-Domain Unsupervised Segmentation
Comments: Accepted at MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1065]  arXiv:2206.14451 [pdf, other]
Title: SRCN3D: Sparse R-CNN 3D for Compact Convolutional Multi-View 3D Object Detection and Tracking
Comments: Accepted to Vision-centric Autonomous Driving(VCAD) Workshop at CVPR2023, For more details refer to this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1066]  arXiv:2206.14467 [pdf, other]
Title: Single-domain Generalization in Medical Image Segmentation via Test-time Adaptation from Shape Dictionary
Comments: Accepted to AAAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1067]  arXiv:2206.14475 [pdf, other]
Title: Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1068]  arXiv:2206.14538 [pdf, other]
Title: vMFNet: Compositionality Meets Domain-generalised Segmentation
Comments: Accepted by MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1069]  arXiv:2206.14554 [pdf, other]
Title: Uncertainty-aware Panoptic Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1070]  arXiv:2206.14555 [pdf, other]
Title: Technical Report for CVPR 2022 LOVEU AQTC Challenge
Comments: 4 pages, 3 figures, technical report for track3 of CVPR 2022 LOVEU challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1071]  arXiv:2206.14651 [pdf, other]
Title: BoT-SORT: Robust Associations Multi-Pedestrian Tracking
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1072]  arXiv:2206.14702 [pdf, other]
Title: Interventional Contrastive Learning with Meta Semantic Regularizer
Comments: Accepted by ICML 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1073]  arXiv:2206.14718 [pdf, other]
Title: LViT: Language meets Vision Transformer in Medical Image Segmentation
Comments: Accepted by IEEE Transactions on Medical Imaging (TMI)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1074]  arXiv:2206.14735 [pdf, other]
Title: GO-Surf: Neural Feature Grid Optimization for Fast, High-Fidelity RGB-D Surface Reconstruction
Comments: 3DV2022 (Oral), first two authors contributed equally. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1075]  arXiv:2206.14797 [pdf, other]
Title: 3D-Aware Video Generation
Comments: TMLR 2023; Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1076]  arXiv:2206.14841 [pdf, other]
Title: Causality for Inherently Explainable Transformers: CAT-XPLAIN
Comments: Accepted for spotlight presentation at the Explainable Artificial Intelligence for Computer Vision Workshop at CVPR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1077]  arXiv:2206.14892 [pdf, other]
Title: Semantic Unfolding of StyleGAN Latent Space
Comments: Accepted at ICIP22
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1078]  arXiv:2206.14923 [pdf, other]
Title: On Non-Random Missing Labels in Semi-Supervised Learning
Journal-ref: ICLR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1079]  arXiv:2206.14938 [pdf, other]
Title: Regularization of NeRFs using differential geometry
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1080]  arXiv:2206.14971 [pdf, other]
Title: Boosting 3D Object Detection by Simulating Multimodality on Point Clouds
Comments: Published in CVPR 2022 as Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1081]  arXiv:2206.14973 [pdf, other]
Title: Benchmarking the Robustness of Deep Neural Networks to Common Corruptions in Digital Pathology
Comments: MICAAI2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1082]  arXiv:2206.14989 [pdf, other]
Title: A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1083]  arXiv:2206.14996 [pdf, other]
Title: Cross-domain Federated Object Detection
Comments: ICME 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1084]  arXiv:2206.15002 [pdf, other]
Title: Spatial Transformer Network with Transfer Learning for Small-scale Fine-grained Skeleton-based Tai Chi Action Recognition
Comments: 6 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1085]  arXiv:2206.15015 [pdf, other]
Title: Exploring Temporally Dynamic Data Augmentation for Video Recognition
Comments: Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1086]  arXiv:2206.15031 [pdf, other]
Title: Timestamp-Supervised Action Segmentation with Graph Convolutional Networks
Comments: Accepted to IROS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1087]  arXiv:2206.15083 [pdf, other]
Title: UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical Mask Calibration
Comments: Accepted to CVPR2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1088]  arXiv:2206.15085 [pdf, other]
Title: Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1089]  arXiv:2206.15109 [pdf, ps, other]
Title: MKIoU Loss: Towards Accurate Oriented Object Detection in Aerial Images
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1090]  arXiv:2206.15128 [pdf, other]
Title: Detecting and Recovering Adversarial Examples from Extracting Non-robust and Highly Predictive Adversarial Perturbations
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1091]  arXiv:2206.15138 [pdf, other]
Title: DFGC 2022: The Second DeepFake Game Competition
Comments: Accepted by IJCB 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1092]  arXiv:2206.15154 [pdf, other]
Title: BoxGraph: Semantic Place Recognition and Pose Estimation from 3D LiDAR
Comments: Accepted for publication at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1093]  arXiv:2206.15157 [pdf, other]
Title: HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection
Authors: Tim Broedermann (1), Christos Sakaridis (1), Dengxin Dai (2), Luc Van Gool (1 and 3) ((1) ETH Zurich, (2) MPI for Informatics, (3) KU Leuven)
Comments: IEEE International Conference on Intelligent Transportation Systems (ITSC) 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1094]  arXiv:2206.15186 [pdf, other]
Title: Out-of-Distribution Detection for Long-tailed and Fine-grained Skin Lesion Images
Comments: Accepted to MICCAI 2022 (top 13% paper; early accept)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1095]  arXiv:2206.15189 [pdf, other]
Title: Multi-Granularity Regularized Re-Balancing for Class Incremental Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1096]  arXiv:2206.15248 [pdf, other]
Title: CTrGAN: Cycle Transformers GAN for Gait Transfer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1097]  arXiv:2206.15255 [pdf, other]
Title: Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery
Comments: 11 pages, 4 figures, conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1098]  arXiv:2206.15258 [pdf, other]
Title: Neural Surface Reconstruction of Dynamic Scenes with Monocular RGB-D Camera
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1099]  arXiv:2206.15268 [pdf, other]
Title: Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach
Comments: arXiv admin note: text overlap with arXiv:2112.04771
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1100]  arXiv:2206.15275 [pdf, other]
Title: Multiclass-SGCN: Sparse Graph-based Trajectory Prediction with Agent Class Embedding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1101]  arXiv:2206.15282 [pdf, other]
Title: TINC: Temporally Informed Non-Contrastive Learning for Disease Progression Modeling in Retinal OCT Volumes
Comments: Accepted at MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1102]  arXiv:2206.15296 [pdf, other]
Title: Self-SuperFlow: Self-supervised Scene Flow Prediction in Stereo Sequences
Comments: Accepted at ICIP 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1103]  arXiv:2206.15328 [pdf, other]
Title: Neural Annotation Refinement: Development of a New 3D Dataset for Adrenal Gland Analysis
Comments: MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1104]  arXiv:2206.15349 [pdf, other]
Title: Revisiting Competitive Coding Approach for Palmprint Recognition: A Linear Discriminant Analysis Perspective
Comments: 12 pages, 14 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1105]  arXiv:2206.15351 [pdf, ps, other]
Title: Deep Learning to See: Towards New Foundations of Computer Vision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1106]  arXiv:2206.15353 [pdf, other]
Title: Learning Underrepresented Classes from Decentralized Partially Labeled Medical Images
Comments: Accepted by MICCAI 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1107]  arXiv:2206.15369 [pdf, other]
Title: No Reason for No Supervision: Improved Generalization in Supervised Models
Comments: Accepted to ICLR 2023 (spotlight)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1108]  arXiv:2206.15398 [pdf, other]
Title: PolarFormer: Multi-camera 3D Object Detection with Polar Transformer
Comments: Accepted to AAAI2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1109]  arXiv:2206.15415 [pdf, other]
Title: MEAD: A Multi-Armed Approach for Evaluation of Adversarial Examples Detectors
Comments: This paper has been accepted to appear in the Proceedings of the 2022 European Conference on Machine Learning and Data Mining (ECML-PKDD), 19th to the 23rd of September, Grenoble, France
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1110]  arXiv:2206.15436 [pdf, other]
Title: Category-Level 6D Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and A New Dataset
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1111]  arXiv:2206.15462 [pdf, other]
Title: Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations
Comments: CVPR 2023. Fix ReferIt results. Code: this https URL Project Webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[1112]  arXiv:2206.15472 [pdf, other]
Title: On-Device Training Under 256KB Memory
Comments: NeurIPS 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1113]  arXiv:2206.00169 (cross-list from cs.LG) [pdf, other]
Title: Discovering the Hidden Vocabulary of DALLE-2
Comments: 6 pages, 4 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1114]  arXiv:2206.00266 (cross-list from cs.RO) [pdf, other]
Title: PaGO-LOAM: Robust Ground-Optimized LiDAR Odometry
Comments: 7 pages, 5 figures, conference
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1115]  arXiv:2206.00380 (cross-list from cs.LG) [pdf, other]
Title: Strongly Augmented Contrastive Clustering
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1116]  arXiv:2206.00393 (cross-list from cs.SD) [pdf, other]
Title: Towards Generalisable Audio Representations for Audio-Visual Navigation
Comments: CVPR 2022 Embodied AI Workshop
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO); Audio and Speech Processing (eess.AS)
[1117]  arXiv:2206.00432 (cross-list from cs.RO) [pdf, ps, other]
Title: Evaluating Gaussian Grasp Maps for Generative Grasping Models
Comments: 9 pages, 6 figures, to be published in IJCNN 2022
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1118]  arXiv:2206.00471 (cross-list from cs.LG) [pdf, other]
Title: Augmentation Component Analysis: Modeling Similarity via the Augmentation Overlaps
Comments: Accept to ICLR 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1119]  arXiv:2206.00606 (cross-list from cs.LG) [pdf, other]
Title: Topological Deep Learning: Going Beyond Graph Data
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Social and Information Networks (cs.SI); Algebraic Topology (math.AT); Machine Learning (stat.ML)
[1120]  arXiv:2206.00621 (cross-list from cs.CL) [pdf, other]
Title: Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training
Comments: ACL 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1121]  arXiv:2206.00719 (cross-list from cs.LG) [pdf, other]
Title: Dataset Distillation using Neural Feature Regression
Comments: NeurIPS 2022 camera-ready version
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1122]  arXiv:2206.00785 (cross-list from cs.DL) [pdf, other]
Title: Delivering Document Conversion as a Cloud Service with High Throughput and Responsiveness
Authors: Christoph Auer (1), Michele Dolfi (1), André Carvalho (2), Cesar Berrospi Ramis (1), Peter W. J. Staar (1) ((1) IBM Research, (2) SoftINSA Lda.)
Comments: 11 pages, 7 figures, to be published in IEEE CLOUD 2022
Subjects: Digital Libraries (cs.DL); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[1123]  arXiv:2206.00809 (cross-list from cs.MM) [pdf, other]
Title: Distilling Knowledge from Object Classification to Aesthetics Assessment
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1124]  arXiv:2206.00843 (cross-list from cs.LG) [pdf, other]
Title: DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks
Comments: Accepted at ICML 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1125]  arXiv:2206.00845 (cross-list from cs.LG) [pdf, other]
Title: Hyperspherical Consistency Regularization
Comments: Accepted by CVPR 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1126]  arXiv:2206.00913 (cross-list from cs.LG) [pdf, other]
Title: Improving the Robustness and Generalization of Deep Neural Network with Confidence Threshold Reduction
Comments: Under review
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1127]  arXiv:2206.00941 (cross-list from cs.LG) [pdf, other]
Title: Improving Diffusion Models for Inverse Problems using Manifold Constraints
Comments: NeurIPS 2022 camera-ready; 29 pages, 16 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1128]  arXiv:2206.00944 (cross-list from cs.LG) [pdf, other]
Title: Feature Space Particle Inference for Neural Network Ensembles
Comments: ICML2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1129]  arXiv:2206.00991 (cross-list from cs.RO) [pdf, ps, other]
Title: StopNet: Scalable Trajectory and Occupancy Prediction for Urban Autonomous Driving
Journal-ref: IEEE International Conference on Robotics and Automation 2022
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1130]  arXiv:2206.01002 (cross-list from cs.LG) [pdf, other]
Title: Introducing One Sided Margin Loss for Solving Classification Problems in Deep Networks
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1131]  arXiv:2206.01094 (cross-list from cs.MM) [pdf, ps, other]
Title: A DTCWT-SVD Based Video Watermarking resistant to frame rate conversion
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1132]  arXiv:2206.01178 (cross-list from cs.LG) [pdf, other]
Title: Discretization Invariant Networks for Learning Maps between Neural Fields
Comments: Published in Transactions on Machine Learning Research 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1133]  arXiv:2206.01197 (cross-list from cs.LG) [pdf, other]
Title: Hard Negative Sampling Strategies for Contrastive Representation Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1134]  arXiv:2206.01251 (cross-list from cs.LG) [pdf, other]
Title: Using Representation Expressiveness and Learnability to Evaluate Self-Supervised Learning Methods
Journal-ref: TMLR 2023 -- Transactions of Machine Learning Research, 11/2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1135]  arXiv:2206.01366 (cross-list from cs.LG) [pdf, other]
Title: Supernet Training for Federated Image Classification under System Heterogeneity
Comments: Oral paper on ICML 22 Workshop: "Dynamic Neural Networks"; Under review
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1136]  arXiv:2206.01382 (cross-list from cs.DS) [pdf, ps, other]
Title: Falconn++: A Locality-sensitive Filtering Approach for Approximate Nearest Neighbor Search
Authors: Ninh Pham, Tao Liu
Comments: To appear in NeurIPS 2022
Subjects: Data Structures and Algorithms (cs.DS); Computer Vision and Pattern Recognition (cs.CV)
[1137]  arXiv:2206.01612 (cross-list from cs.LG) [pdf, other]
Title: OmniXAI: A Library for Explainable AI
Comments: Github repo: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1138]  arXiv:2206.01634 (cross-list from cs.LG) [pdf, other]
Title: Reinforcement Learning with Neural Radiance Fields
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1139]  arXiv:2206.01690 (cross-list from cs.LG) [pdf, other]
Title: Dynamic Kernel Selection for Improved Generalization and Memory Efficiency in Meta-learning
Comments: Published at CVPR 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1140]  arXiv:2206.01829 (cross-list from cs.LG) [pdf, other]
Title: Drawing out of Distribution with Neuro-Symbolic Generative Models
Comments: Preprint. Under review. 25 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Symbolic Computation (cs.SC)
[1141]  arXiv:2206.01898 (cross-list from cs.LG) [pdf, other]
Title: Saliency Attack: Towards Imperceptible Black-box Adversarial Attack
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1142]  arXiv:2206.02102 (cross-list from cs.LG) [pdf, other]
Title: AUTM Flow: Atomic Unrestricted Time Machine for Monotonic Normalizing Flows
Comments: 20 pages, 3 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
[1143]  arXiv:2206.02131 (cross-list from cs.LG) [pdf, other]
Title: Federated Adversarial Training with Transformers
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1144]  arXiv:2206.02183 (cross-list from cs.LG) [pdf, other]
Title: Functional Ensemble Distillation
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1145]  arXiv:2206.02284 (cross-list from cs.SD) [pdf, other]
Title: Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator
Comments: MICCAI 2022 (early accept, Oral Presentation ~3%)
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1146]  arXiv:2206.02286 (cross-list from cs.LG) [pdf, other]
Title: AugLoss: A Robust Augmentation-based Fine Tuning Methodology
Comments: 10 pages, 6 figures, 6 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1147]  arXiv:2206.02353 (cross-list from cs.LG) [pdf, other]
Title: Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal Data
Comments: 36 pages, 5 figures, 9 tables, Survey paper
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1148]  arXiv:2206.02409 (cross-list from cs.AI) [pdf, other]
Title: Is More Data All You Need? A Causal Exploration
Comments: 10 pages
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1149]  arXiv:2206.02574 (cross-list from cs.LG) [pdf, other]
Title: On the duality between contrastive and non-contrastive self-supervised learning
Authors: Quentin Garrido (FAIR, LIGM), Yubei Chen (FAIR), Adrien Bardes (FAIR, WILLOW), Laurent Najman (LIGM), Yann Lecun (FAIR, CIMS)
Comments: The Eleventh International Conference on Learning Representations, 2023, Kigali, Rwanda
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1150]  arXiv:2206.02659 (cross-list from cs.LG) [pdf, other]
Title: Robust Fine-Tuning of Deep Neural Networks with Hessian-based Generalization Guarantees
Comments: 38 pages. Appeared in ICML 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Statistics Theory (math.ST); Machine Learning (stat.ML)
[1151]  arXiv:2206.02671 (cross-list from cs.SD) [pdf, ps, other]
Title: Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Audio-Visual Hearing Aids
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1152]  arXiv:2206.02792 (cross-list from cs.LG) [pdf, other]
Title: FIFA: Making Fairness More Generalizable in Classifiers Trained on Imbalanced Data
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (stat.ML)
[1153]  arXiv:2206.02840 (cross-list from cs.RO) [pdf, other]
Title: Spatial Acoustic Projection for 3D Imaging Sonar Reconstruction
Comments: Preprint
Journal-ref: IEEE International Conference on Robotics and Automation (ICRA) 2022
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1154]  arXiv:2206.02881 (cross-list from cs.RO) [pdf, other]
Title: Mesh-based Dynamics with Occlusion Reasoning for Cloth Manipulation
Comments: RSS 2022, $\href{this https URL}{\text{project website}}$
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1155]  arXiv:2206.02916 (cross-list from cs.LG) [pdf, other]
Title: Remember the Past: Distilling Datasets into Addressable Memories for Neural Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1156]  arXiv:2206.02958 (cross-list from cs.LG) [pdf, other]
Title: Saliency Cards: A Framework to Characterize and Compare Saliency Methods
Comments: Published at FAccT 2023, 19 pages, 8 figures, 2 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1157]  arXiv:2206.03083 (cross-list from cs.RO) [pdf, other]
Title: Pushing the Limits of Learning-based Traversability Analysis for Autonomous Driving on CPU
Comments: Accepted to 17th International Conference on Intelligent Autonomous Systems (IAS-17)
Journal-ref: Proceedings of the 17th International Conference on Intelligent Autonomous Systems (IAS 2022)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1158]  arXiv:2206.03271 (cross-list from cs.LG) [pdf, other]
Title: On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1159]  arXiv:2206.03354 (cross-list from cs.CL) [pdf, other]
Title: cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation
Comments: Accepted at ICPR 2022; 9 pages
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1160]  arXiv:2206.03380 (cross-list from cs.GR) [pdf, other]
Title: Shape, Light, and Material Decomposition from Images using Monte Carlo Rendering and Denoising
Comments: Project website: this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1161]  arXiv:2206.03382 (cross-list from cs.DC) [pdf, other]
Title: Tutel: Adaptive Mixture-of-Experts at Scale
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1162]  arXiv:2206.03398 (cross-list from cs.LG) [pdf, other]
Title: Towards a General Purpose CNN for Long Range Dependencies in $N$D
Comments: First two authors contributed equally to this work
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1163]  arXiv:2206.03430 (cross-list from cs.RO) [pdf, other]
Title: Robot Self-Calibration Using Actuated 3D Sensors
Authors: Arne Peters
Comments: 15 pages, 9 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1164]  arXiv:2206.03491 (cross-list from cs.AI) [pdf, other]
Title: EiX-GNN : Concept-level eigencentrality explainer for graph neural networks
Authors: Adrien Raison (XLIM-ASALI), Pascal Bourdon (XLIM-ASALI), David Helbert (XLIM-ASALI)
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1165]  arXiv:2206.03583 (cross-list from cs.CR) [pdf, other]
Title: Contributor-Aware Defenses Against Adversarial Backdoor Attacks
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1166]  arXiv:2206.03584 (cross-list from cs.CR) [pdf, ps, other]
Title: White-box Membership Attack Against Machine Learning Based Retinopathy Classification
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1167]  arXiv:2206.03596 (cross-list from cs.LG) [pdf, other]
Title: Neural Network Compression via Effective Filter Analysis and Hierarchical Pruning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1168]  arXiv:2206.03739 (cross-list from cs.AI) [pdf, other]
Title: Disentangled Ontology Embedding for Zero-shot Learning
Comments: Accepted by KDD'22
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1169]  arXiv:2206.03826 (cross-list from cs.LG) [pdf, other]
Title: Towards Understanding Why Mask-Reconstruction Pretraining Helps in Downstream Tasks
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[1170]  arXiv:2206.04006 (cross-list from cs.SD) [pdf, other]
Title: Few-Shot Audio-Visual Learning of Environment Acoustics
Comments: Accepted to NeurIPS 2022
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1171]  arXiv:2206.04016 (cross-list from cs.NE) [pdf, other]
Title: SYNERgy between SYNaptic consolidation and Experience Replay for general continual learning
Comments: Accepted at 1st Conference on Lifelong Learning Agents (CoLLAs 2022)
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1172]  arXiv:2206.04129 (cross-list from cs.RO) [pdf, other]
Title: Receding Moving Object Segmentation in 3D LiDAR Data Using Sparse 4D Convolutions
Comments: Accepted for RA-L
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1173]  arXiv:2206.04310 (cross-list from cs.LG) [pdf, other]
Title: GSmooth: Certified Robustness against Semantic Transformations via Generalized Randomized Smoothing
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1174]  arXiv:2206.04318 (cross-list from cs.MM) [pdf, other]
Title: Blind Surveillance Image Quality Assessment via Deep Neural Network Combined with the Visual Saliency
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1175]  arXiv:2206.04363 (cross-list from cs.MM) [pdf, other]
Title: Deep Neural Network for Blind Visual Quality Assessment of 4K Content
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1176]  arXiv:2206.04459 (cross-list from cs.LG) [pdf, other]
Title: SDQ: Stochastic Differentiable Quantization with Mixed Precision
Comments: ICML 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1177]  arXiv:2206.04523 (cross-list from cs.CL) [pdf, other]
Title: Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
[1178]  arXiv:2206.04530 (cross-list from cs.LG) [pdf, other]
Title: DORA: Exploring Outlier Representations in Deep Neural Networks
Comments: 24 pages, 18 figures
Journal-ref: Published in Transactions on Machine Learning Research (06/2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1179]  arXiv:2206.04625 (cross-list from cs.LG) [pdf, other]
Title: AttX: Attentive Cross-Connections for Fusion of Wearable Signals in Emotion Recognition
Comments: 13 pages, 8 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1180]  arXiv:2206.04676 (cross-list from cs.LG) [pdf, other]
Title: Extending Momentum Contrast with Cross Similarity Consistency Regularization
Comments: IEEE Transactions on Circuits and Systems for Video Technology
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1181]  arXiv:2206.04677 (cross-list from cs.CR) [pdf, other]
Title: On the Permanence of Backdoors in Evolving Models
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1182]  arXiv:2206.04679 (cross-list from cs.LG) [pdf, other]
Title: POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples
Comments: Accepted at NeurIPS 2021 (First two authors contribute equally)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1183]  arXiv:2206.04756 (cross-list from cs.LG) [pdf, other]
Title: An Empirical Study on Disentanglement of Negative-free Contrastive Learning
Comments: Accepted to NeurIPS 2022; 10 pages main text + 15 pages appendix
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1184]  arXiv:2206.04776 (cross-list from cs.LG) [pdf, other]
Title: What should AI see? Using the Public's Opinion to Determine the Perception of an AI
Comments: 26 pages, 12 figures
Journal-ref: AI and Ethics (2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[1185]  arXiv:2206.04779 (cross-list from cs.LG) [pdf, other]
Title: Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
Comments: Published at TMLR, 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1186]  arXiv:2206.04881 (cross-list from cs.CR) [pdf, other]
Title: Enhancing Clean Label Backdoor Attack with Two-phase Specific Triggers
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1187]  arXiv:2206.04888 (cross-list from cs.MM) [pdf, other]
Title: AntPivot: Livestream Highlight Detection via Hierarchical Attention Mechanism
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1188]  arXiv:2206.05008 (cross-list from cs.GR) [pdf, other]
Title: Subjective Quality Assessment for Images Generated by Computer Graphics
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1189]  arXiv:2206.05093 (cross-list from cs.LG) [pdf, other]
Title: Federated Momentum Contrastive Clustering
Comments: Originally submitted March 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1190]  arXiv:2206.05263 (cross-list from cs.LG) [pdf, other]
Title: Causal Balancing for Domain Generalization
Comments: Published at ICLR 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1191]  arXiv:2206.05266 (cross-list from cs.LG) [pdf, other]
Title: Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Comments: NeurIPS 2022. Code for ELo-SACv3 is at this https URL and code for ELo-Rainbow is at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1192]  arXiv:2206.05323 (cross-list from cs.LG) [pdf, other]
Title: Memory Classifiers: Two-stage Classification for Robustness in Machine Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1193]  arXiv:2206.05344 (cross-list from cs.GR) [pdf, other]
Title: Differentiable Rendering of Neural SDFs through Reparameterization
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1194]  arXiv:2206.05365 (cross-list from cs.LG) [pdf, ps, other]
Title: Object Detection, Recognition, Deep Learning, and the Universal Law of Generalization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1195]  arXiv:2206.05400 (cross-list from cs.RO) [pdf, ps, other]
Title: High-Definition Map Generation Technologies For Autonomous Driving
Comments: 25 pages, 17 figures, submitted to a journal
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1196]  arXiv:2206.05555 (cross-list from cs.CL) [pdf, other]
Title: A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-training
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1197]  arXiv:2206.05625 (cross-list from cs.AI) [pdf, ps, other]
Title: Exploring the Intersection between Neural Architecture Search and Continual Learning
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1198]  arXiv:2206.05649 (cross-list from cs.GR) [pdf, other]
Title: TileGen: Tileable, Controllable Material Generation and Capture
Comments: 18 pages, 19 figures
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1199]  arXiv:2206.05687 (cross-list from cs.HC) [pdf, other]
Title: DRNet: Decomposition and Reconstruction Network for Remote Physiological Measurement
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1200]  arXiv:2206.05751 (cross-list from cs.LG) [pdf, other]
Title: Consistent Attack: Universal Adversarial Perturbation on Embodied Vision Navigation
Journal-ref: Pattern Recognition Letters (PRL), 2023
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1201]  arXiv:2206.05859 (cross-list from cs.LG) [pdf, ps, other]
Title: A Directed-Evolution Method for Sparsification and Compression of Neural Networks with Application to Object Identification and Segmentation and considerations of optimal quantization using small number of bits
Comments: 12 pages total, 5 figures, 2 appendices
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1202]  arXiv:2206.05893 (cross-list from cs.LG) [pdf, other]
Title: Deploying Convolutional Networks on Untrusted Platforms Using 2D Holographic Reduced Representations
Comments: To appear in the Proceedings of the 39 th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1203]  arXiv:2206.05930 (cross-list from cs.LG) [pdf, other]
Title: Faster Optimization-Based Meta-Learning Adaptation Phase
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1204]  arXiv:2206.06173 (cross-list from eess.SY) [pdf, other]
Title: LiVeR: Lightweight Vehicle Detection and Classification in Real-Time
Subjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV)
[1205]  arXiv:2206.06273 (cross-list from cs.CG) [pdf, other]
Title: Learning Joint Surface Atlases
Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[1206]  arXiv:2206.06489 (cross-list from cs.AI) [pdf, other]
Title: BEHAVIOR in Habitat 2.0: Simulator-Independent Logical Task Description for Benchmarking Embodied AI Agents
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1207]  arXiv:2206.06522 (cross-list from cs.CL) [pdf, other]
Title: LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning
Comments: NeurIPS 2022 (our code is available at: this https URL)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1208]  arXiv:2206.06553 (cross-list from cs.RO) [pdf, other]
Title: Safe Output Feedback Motion Planning from Images via Learned Perception Modules and Contraction Theory
Comments: Workshop on the Algorithmic Foundations of Robotics (WAFR) XV, 2022, College Park, MD, USA
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[1209]  arXiv:2206.06577 (cross-list from cs.GR) [pdf, other]
Title: Physics Informed Neural Fields for Smoke Reconstruction with Sparse Data
Comments: accepted to ACM Transactions On Graphics (SIGGRAPH 2022), further info:\url{this https URL}
Journal-ref: ACM Trans. Graph.41, 4 (2022), 119:1-119:14
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1210]  arXiv:2206.06662 (cross-list from cs.LG) [pdf, other]
Title: Learning Best Combination for Efficient N:M Sparsity
Comments: Accepted by 36th Conference on Neural Information Processing Systems (NeurIPS 2022)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1211]  arXiv:2206.06737 (cross-list from cs.LG) [pdf, other]
Title: Adversarial Vulnerability of Randomized Ensembles
Comments: Published as a conference paper in ICML 2022
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1212]  arXiv:2206.06854 (cross-list from cs.AI) [pdf, other]
Title: On the explainable properties of 1-Lipschitz Neural Networks: An Optimal Transport Perspective
Authors: Mathieu Serrurier (IRIT-ADRIA, UT), Franck Mamalet (UT), Thomas Fel (UT), Louis Béthune (UT3, UT, IRIT-ADRIA), Thibaut Boissin (UT)
Journal-ref: Conference on Neural Information Processing Systems (NeurIPS), Neural Information Processing Systems Foundation, Dec 2023, New Orleans (Louisiana), United States
Subjects: Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1213]  arXiv:2206.06994 (cross-list from cs.AI) [pdf, other]
Title: ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Comments: ProcTHOR website: this https URL
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1214]  arXiv:2206.07081 (cross-list from cs.LG) [pdf, ps, other]
Title: Applications of Generative Adversarial Networks in Neuroimaging and Clinical Neuroscience
Journal-ref: NeuroImage 269:119898 (2023)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1215]  arXiv:2206.07136 (cross-list from cs.LG) [pdf, other]
Title: Automatic Clipping: Differentially Private Deep Learning Made Easier and Stronger
Comments: accepted to NeurIPS 2023
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1216]  arXiv:2206.07137 (cross-list from cs.LG) [pdf, other]
Title: Prioritized Training on Points that are Learnable, Worth Learning, and Not Yet Learnt
Comments: ICML 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1217]  arXiv:2206.07148 (cross-list from cs.MM) [pdf, other]
Title: It's Time for Artistic Correspondence in Music and Video
Comments: CVPR 2022
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1218]  arXiv:2206.07155 (cross-list from cs.LG) [pdf, other]
Title: Self-Supervision on Images and Text Reduces Reliance on Visual Shortcut Features
Comments: 4 pages, 2 figures, spotlight talk at SCIS workshop, ICML 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1219]  arXiv:2206.07173 (cross-list from cs.CY) [pdf, other]
Title: Measuring Representational Harms in Image Captioning
Comments: ACM Conference on Fairness, Accountability, and Transparency (FAccT) 2022
Subjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV)
[1220]  arXiv:2206.07179 (cross-list from cs.LG) [pdf, other]
Title: Proximal Splitting Adversarial Attacks for Semantic Segmentation
Comments: CVPR 2023. Code available at: this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1221]  arXiv:2206.07260 (cross-list from cs.LG) [pdf, other]
Title: On Enforcing Better Conditioned Meta-Learning for Rapid Few-Shot Adaptation
Comments: Accepted at NeurIPS 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1222]  arXiv:2206.07290 (cross-list from cs.LG) [pdf, other]
Title: Differentiable Top-k Classification Learning
Comments: Published at ICML 2022, Code @ this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1223]  arXiv:2206.07387 (cross-list from cs.LG) [pdf, other]
Title: The Manifold Hypothesis for Gradient-Based Explanations
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1224]  arXiv:2206.07538 (cross-list from cs.RO) [pdf, other]
Title: Body Gesture Recognition to Control a Social Robot
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[1225]  arXiv:2206.07736 (cross-list from cs.LG) [pdf, other]
Title: Improving Diversity with Adversarially Learned Transformations for Domain Generalization
Comments: WACV 2023. Code: this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1226]  arXiv:2206.07741 (cross-list from cs.LG) [pdf, other]
Title: Edge Inference with Fully Differentiable Quantized Mixed Precision Neural Networks
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1227]  arXiv:2206.07758 (cross-list from cs.LG) [pdf, other]
Title: Reconstructing Training Data from Trained Neural Networks
Comments: Fixed a typo in the acknowledgements
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[1228]  arXiv:2206.07795 (cross-list from cs.LG) [pdf, other]
Title: On Calibrated Model Uncertainty in Deep Learning
Comments: The European Conference on Machine Learning (ECML PKDD 2020). arXiv admin note: text overlap with arXiv:2103.11214
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1229]  arXiv:2206.07898 (cross-list from cs.AI) [pdf, other]
Title: Multimodal Dialogue State Tracking
Comments: Accepted at NAACL 2022 (Oral)
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1230]  arXiv:2206.08010 (cross-list from cs.GR) [pdf, other]
Title: MoDi: Unconditional Motion Synthesis from Diverse Data
Comments: Video: this https URL, Project page: this https URL, Code: this https URL
Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1231]  arXiv:2206.08076 (cross-list from cs.HC) [pdf, other]
Title: Learning Effect of Lay People in Gesture-Based Locomotion in Virtual Reality
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
[1232]  arXiv:2206.08077 (cross-list from cs.RO) [pdf, other]
Title: Neural Scene Representation for Locomotion on Structured Terrain
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1233]  arXiv:2206.08138 (cross-list from cs.LG) [pdf, other]
Title: Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification
Comments: version 2 is the correct version, including supplementary material at the end
Journal-ref: NeurIPS 2021 Competition and Demonstration Track, Dec 2021, On-line, United States
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1234]  arXiv:2206.08213 (cross-list from cs.LG) [pdf, other]
Title: A Closer Look at Smoothness in Domain Adversarial Training
Comments: ICML 2022. Code: this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1235]  arXiv:2206.08242 (cross-list from cs.LG) [pdf, other]
Title: Catastrophic overfitting can be induced with discriminative non-robust features
Comments: Published in Transactions on Machine Learning Research (TMLR)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1236]  arXiv:2206.08255 (cross-list from cs.LG) [pdf, other]
Title: Gradient-Based Adversarial and Out-of-Distribution Detection
Comments: International Conference on Machine Learning (ICML) Workshop on New Frontiers in Adversarial Machine Learning, July 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1237]  arXiv:2206.08312 (cross-list from cs.SD) [pdf, other]
Title: SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning
Comments: Camera-ready version. Website: this https URL Project page: this https URL
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1238]  arXiv:2206.08316 (cross-list from cs.LG) [pdf, other]
Title: Boosting the Adversarial Transferability of Surrogate Models with Dark Knowledge
Comments: Accepted at 2023 International Conference on Tools with Artificial Intelligence (ICTAI)
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1239]  arXiv:2206.08422 (cross-list from cs.GR) [pdf, ps, other]
Title: Real-time motion amplification on mobile devices
Authors: Henning U. Voss
Comments: Supplemental data at this https URL Changes to v1: Inclusion of offline video processing
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1240]  arXiv:2206.08476 (cross-list from cs.LG) [pdf, other]
Title: Zero-Shot AutoML with Pretrained Models
Journal-ref: International Conference on Machine Learning 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1241]  arXiv:2206.08497 (cross-list from cs.GR) [pdf, other]
Title: Unsupervised Kinematic Motion Detection for Part-segmented 3D Shape Collections
Comments: SIGGRAPH 2022
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1242]  arXiv:2206.08517 (cross-list from cs.RO) [pdf, other]
Title: ECTLO: Effective Continuous-time Odometry Using Range Image for LiDAR with Small FoV
Authors: Xin Zheng, Jianke Zhu
Comments: 8 pages, 5 figures. Accepted for publication in the Proceedings of the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1243]  arXiv:2206.08522 (cross-list from cs.RO) [pdf, other]
Title: VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation
Subjects: Robotics (cs.RO); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1244]  arXiv:2206.08653 (cross-list from cs.LG) [pdf, other]
Title: All Mistakes Are Not Equal: Comprehensive Hierarchy Aware Multi-label Predictions (CHAMP)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1245]  arXiv:2206.08684 (cross-list from cs.LG) [pdf, other]
Title: Sparse Double Descent: Where Network Pruning Aggravates Overfitting
Comments: ICML 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1246]  arXiv:2206.08704 (cross-list from cs.LG) [pdf, other]
Title: Maximum Class Separation as Inductive Bias in One Matrix
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1247]  arXiv:2206.08802 (cross-list from cs.LG) [pdf, other]
Title: Open-Sampling: Exploring Out-of-Distribution data for Re-balancing Long-tailed datasets
Comments: Accepted by ICML 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1248]  arXiv:2206.08826 (cross-list from cs.LG) [pdf, other]
Title: Multimodal Attention-based Deep Learning for Alzheimer's Disease Diagnosis
Comments: 11 pages, 5 figures
Journal-ref: Journal of the American Medical Informatics Association, 2022; ocac168
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1249]  arXiv:2206.08842 (cross-list from cs.MM) [pdf, other]
Title: Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Information Retrieval (cs.IR)
[1250]  arXiv:2206.08853 (cross-list from cs.LG) [pdf, other]
Title: MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Comments: Outstanding Paper Award at NeurIPS 2022. Project website: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1251]  arXiv:2206.08869 (cross-list from cs.LG) [pdf, other]
Title: Fast Lossless Neural Compression with Integer-Only Discrete Flows
Comments: Accepted as a conference paper at International Conference on Machine Learning (ICML) 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[1252]  arXiv:2206.08882 (cross-list from cs.MA) [pdf, other]
Title: Edge-Aided Sensor Data Sharing in Vehicular Communication Networks
Comments: Accepted for IEEE 95th Vehicular Technology Conference (VTC2022-Spring)
Subjects: Multiagent Systems (cs.MA); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1253]  arXiv:2206.08890 (cross-list from cs.LG) [pdf, other]
Title: Disentangling Model Multiplicity in Deep Learning
Comments: 13 pages, 6 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1254]  arXiv:2206.08965 (cross-list from cs.AI) [pdf, other]
Title: KitBit: A New AI Model for Solving Intelligence Tests and Numerical Series
Comments: 11 pages
Journal-ref: Corsino, V., Gilperez, J. M., & Herrera, L. (2023). "KitBit: A New AI Model for Solving Intelligence Tests and Numerical Series." IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(11), 13893-13903
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1255]  arXiv:2206.09012 (cross-list from cs.LG) [pdf, other]
Title: Diffusion models as plug-and-play priors
Comments: NeurIPS 2022; code: this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1256]  arXiv:2206.09034 (cross-list from cs.LG) [pdf, other]
Title: Towards Better Selective Classification
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1257]  arXiv:2206.09059 (cross-list from cs.CL) [pdf, other]
Title: CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks
Comments: Accepted to NeurIPS 2022 Datasets and Benchmarks track
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1258]  arXiv:2206.09203 (cross-list from cs.AI) [pdf, other]
Title: Interactive Visual Reasoning under Uncertainty
Comments: Accepted at NeurIPS 2023 (Datasets and Benchmarks)
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1259]  arXiv:2206.09272 (cross-list from cs.CR) [pdf, other]
Title: DECK: Model Hardening for Defending Pervasive Backdoors
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1260]  arXiv:2206.09286 (cross-list from cs.GR) [pdf, other]
Title: From Universal Humanoid Control to Automatic Physically Valid Character Creation
Comments: Project page: this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1261]  arXiv:2206.09359 (cross-list from cs.LG) [pdf, other]
Title: Productive Reproducible Workflows for DNNs: A Case Study for Industrial Defect Detection
Comments: 7 pages, 5 figures, AccML 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF); Software Engineering (cs.SE)
[1262]  arXiv:2206.09378 (cross-list from cs.CL) [pdf, ps, other]
Title: A Self-Guided Framework for Radiology Report Generation
Comments: 11 pages, 3 figures, accepted by Medical Image Computing and Computer Assisted Intervention 2022(MICCAI 2022)
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1263]  arXiv:2206.09386 (cross-list from cs.LG) [pdf, other]
Title: Scalable Neural Data Server: A Data Recommender for Transfer Learning
Comments: Neurips 2021
Journal-ref: Advances in Neural Information Processing Systems, Volume 34, pages 8984-8997, year 2021
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1264]  arXiv:2206.09391 (cross-list from cs.LG) [pdf, other]
Title: Towards Adversarial Attack on Vision-Language Pre-training Models
Comments: Accepted by ACM MM2022. Code is available in GitHub
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1265]  arXiv:2206.09449 (cross-list from cs.NE) [pdf, other]
Title: SNN2ANN: A Fast and Memory-Efficient Training Framework for Spiking Neural Networks
Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1266]  arXiv:2206.09570 (cross-list from cs.HC) [pdf, other]
Title: Guardian Angel: A Novel Walking Aid for the Visually Impaired
Comments: 2 pages, 1 figure
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
[1267]  arXiv:2206.09616 (cross-list from cs.LG) [pdf, other]
Title: Revisiting lp-constrained Softmax Loss: A Comprehensive Study
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1268]  arXiv:2206.09628 (cross-list from cs.LG) [pdf, other]
Title: Diversified Adversarial Attacks based on Conjugate Gradient Method
Comments: Proceedings of the 39th International Conference on Machine Learning (ICML 2022)
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1269]  arXiv:2206.09699 (cross-list from cs.CG) [pdf, other]
Title: FoR$^2$M: Recognition and Repair of Foldings in Mesh Surfaces. Application to 3D Object Degradation
Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1270]  arXiv:2206.09811 (cross-list from cs.LG) [pdf, other]
Title: Shapley-NAS: Discovering Operation Contribution for Neural Architecture Search
Comments: Accepted to CVPR2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1271]  arXiv:2206.09868 (cross-list from cs.LG) [pdf, other]
Title: Understanding Robust Learning through the Lens of Representation Similarities
Comments: 35 pages, 29 figures; Accepted to Neurips 2022
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1272]  arXiv:2206.09880 (cross-list from cs.LG) [pdf, ps, other]
Title: Breaking Down Out-of-Distribution Detection: Many Methods Based on OOD Training Data Estimate a Combination of the Same Core Quantities
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1273]  arXiv:2206.09946 (cross-list from cs.CY) [pdf, ps, other]
Title: Short Video Uprising: How #BlackLivesMatter Content on TikTok Challenges the Protest Paradigm
Comments: Workshop Proceedings of the 16th International AAAI Conference on Web and Social Media
Subjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV)
[1274]  arXiv:2206.10011 (cross-list from cs.LG) [pdf, other]
Title: When Does Re-initialization Work?
Comments: Published in PMLR Volume 187; spotlight presentation at I Can't Believe It's Not Better Workshop at NeurIPS 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1275]  arXiv:2206.10244 (cross-list from cs.RO) [pdf, other]
Title: Experimental Evaluation of Pose Initialization Methods for Relative Navigation Between Non-Cooperative Satellites
Comments: To be presented at the 2022 IEEE INTERNATIONAL WORKSHOP ON Metrology for AeroSpace
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1276]  arXiv:2206.10249 (cross-list from cs.HC) [pdf, other]
Title: Incorporating Voice Instructions in Model-Based Reinforcement Learning for Self-Driving Cars
Comments: NeurIPS 2021 Workshop on Machine Learning for Autonomous Driving
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1277]  arXiv:2206.10255 (cross-list from eess.SY) [pdf, other]
Title: GNN-PMB: A Simple but Effective Online 3D Multi-Object Tracker without Bells and Whistles
Comments: accepted by IEEE Transactions on Intelligent Vehicles
Subjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV)
[1278]  arXiv:2206.10274 (cross-list from cs.RO) [pdf, other]
Title: Attention-driven Active Vision for Efficient Reconstruction of Plants and Targeted Plant Parts
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1279]  arXiv:2206.10326 (cross-list from cs.HC) [pdf, other]
Title: The Metaverse Data Deluge: What Can We Do About It?
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[1280]  arXiv:2206.10352 (cross-list from cs.HC) [pdf, other]
Title: Psychologically-Inspired, Unsupervised Inference of Perceptual Groups of GUI Widgets from GUI Images
Comments: 12 Pages, accepted to ESEC/FSE '2022
Journal-ref: In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2022)
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Software Engineering (cs.SE)
[1281]  arXiv:2206.10365 (cross-list from cs.LG) [pdf, other]
Title: A Flexible Diffusion Model
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1282]  arXiv:2206.10421 (cross-list from cs.SD) [pdf, other]
Title: Rethinking Audio-visual Synchronization for Active Speaker Detection
Comments: Accepted by IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2022)
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1283]  arXiv:2206.10480 (cross-list from cs.LG) [pdf, other]
Title: Learning to Estimate and Refine Fluid Motion with Physical Dynamics
Comments: published at ICML 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
[1284]  arXiv:2206.10620 (cross-list from cs.LG) [pdf, other]
Title: CoCoPIE XGen: A Full-Stack AI-Oriented Optimizing Framework
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Programming Languages (cs.PL)
[1285]  arXiv:2206.10670 (cross-list from cs.RO) [pdf, other]
Title: SCIM: Simultaneous Clustering, Inference, and Mapping for Open-World Semantic Scene Understanding
Comments: accepted at ISRR 2022
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1286]  arXiv:2206.10797 (cross-list from cs.LG) [pdf, other]
Title: Imitation Learning for Generalizable Self-driving Policy with Sim-to-real Transfer
Comments: Accepted by ICLR 2022 Workshop on Generalizable Policy Learning in Physical World. Source code is available at: this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1287]  arXiv:2206.10816 (cross-list from cs.LG) [pdf, other]
Title: Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming
Comments: 28 pages, 13 figures, ICML2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1288]  arXiv:2206.10843 (cross-list from cs.LG) [pdf, other]
Title: Learning Debiased Classifier with Biased Committee
Comments: Conference on Neural Information Processing Systems (NeurIPS), New Orleans, 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1289]  arXiv:2206.10935 (cross-list from cs.LG) [pdf, other]
Title: A Study on the Evaluation of Generative Models
Comments: 13 pages
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1290]  arXiv:2206.11073 (cross-list from cs.NE) [pdf, other]
Title: A Unified and Biologically-Plausible Relational Graph Representation of Vision Transformers
Comments: 11 pages,7 figures, submitted to 36th Conference on Neural Information Processing Systems (NeurIPS 2022)
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1291]  arXiv:2206.11141 (cross-list from cs.RO) [pdf, other]
Title: Hybrid Physical Metric For 6-DoF Grasp Pose Detection
Comments: 7 pages, 7 figures, accepted by ICRA 2022
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1292]  arXiv:2206.11229 (cross-list from cs.IR) [pdf, other]
Title: Business Document Information Extraction: Towards Practical Benchmarks
Comments: Accepted to CLEF 2022
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1293]  arXiv:2206.11251 (cross-list from cs.LG) [pdf, other]
Title: Behavior Transformers: Cloning $k$ modes with one stone
Comments: Code and data available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1294]  arXiv:2206.11260 (cross-list from cs.SD) [pdf, other]
Title: Few-shot Long-Tailed Bird Audio Recognition
Comments: LifeCLEF2022 (best paper award)
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1295]  arXiv:2206.11376 (cross-list from cs.RO) [pdf, other]
Title: Real-Time Online Skeleton Extraction and Gesture Recognition on Pepper
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1296]  arXiv:2206.11461 (cross-list from cs.GR) [pdf, other]
Title: Towards Better User Studies in Computer Graphics and Vision
Comments: 18 pages of text, 6 pages of references, 3 figures, 1 table
Journal-ref: Foundations and Trends in Computer Graphics and Vision (2023). Vol. 15: No. 3, pp 201-252
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[1297]  arXiv:2206.11481 (cross-list from cs.CG) [pdf, ps, other]
Title: A Novel Algorithm for Exact Concave Hull Extraction
Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[1298]  arXiv:2206.11488 (cross-list from cs.LG) [pdf, other]
Title: On the Importance and Applicability of Pre-Training for Federated Learning
Comments: Accepted to ICLR 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1299]  arXiv:2206.11602 (cross-list from cs.LG) [pdf, other]
Title: Prototype-Anchored Learning for Learning with Imperfect Annotations
Comments: ICML 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1300]  arXiv:2206.11623 (cross-list from cs.RO) [pdf, other]
Title: Waypoint Generation in Row-based Crops with Deep Learning and Contrastive Clustering
Comments: Accepted at ECML PKDD 2022
Journal-ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022. Lecture Notes in Computer Science(), vol 13718, Springer
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1301]  arXiv:2206.11849 (cross-list from cs.LG) [pdf, other]
Title: Sample Condensation in Online Continual Learning
Comments: Accepted as a conference paper at 2022 International Joint Conference on Neural Networks (IJCNN 2022). Part of 2022 IEEE World Congress on Computational Intelligence (IEEE WCCI 2022)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1302]  arXiv:2206.12139 (cross-list from cs.NI) [pdf, other]
Title: HARU: Haptic Augmented Reality-Assisted User-Centric Industrial Network Planning
Subjects: Networking and Internet Architecture (cs.NI); Computer Vision and Pattern Recognition (cs.CV)
[1303]  arXiv:2206.12145 (cross-list from cs.RO) [pdf, other]
Title: Efficient and Robust Training of Dense Object Nets for Multi-Object Robot Manipulation
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1304]  arXiv:2206.12251 (cross-list from cs.CR) [pdf, other]
Title: Adversarial Zoom Lens: A Novel Physical-World Attack to DNNs
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1305]  arXiv:2206.12292 (cross-list from cs.LG) [pdf, other]
Title: InfoAT: Improving Adversarial Training Using the Information Bottleneck Principle
Comments: Published in: IEEE Transactions on Neural Networks and Learning Systems ( Early Access )
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1306]  arXiv:2206.12322 (cross-list from cs.LG) [pdf, other]
Title: How to train accurate BNNs for embedded systems?
Journal-ref: Embedded Machine Learning for Cyber-Physical, IoT, and Edge Computing (2023)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1307]  arXiv:2206.12484 (cross-list from cs.LG) [pdf, other]
Title: An Intensity and Phase Stacked Analysis of Phase-OTDR System using Deep Transfer Learning and Recurrent Neural Networks
Comments: 15 pages, 9 figures. Title updated
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1308]  arXiv:2206.12649 (cross-list from cs.CL) [pdf, other]
Title: Sentiment Analysis with R: Natural Language Processing for Semi-Automated Assessments of Qualitative Data
Comments: 14 pages, 6 figures
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[1309]  arXiv:2206.12705 (cross-list from cs.LG) [pdf, other]
Title: p-Meta: Towards On-device Deep Model Adaptation
Comments: Published in SIGKDD 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1310]  arXiv:2206.12753 (cross-list from cs.DB) [pdf, other]
Title: Spatiotemporal Data Mining: A Survey
Subjects: Databases (cs.DB); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[1311]  arXiv:2206.12941 (cross-list from cs.RO) [pdf, ps, other]
Title: Object Detection and Tracking with Autonomous UAV
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1312]  arXiv:2206.13043 (cross-list from cs.LG) [pdf, other]
Title: Automated Systems For Diagnosis of Dysgraphia in Children: A Survey and Novel Framework
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1313]  arXiv:2206.13387 (cross-list from cs.AI) [pdf, other]
Title: ScePT: Scene-consistent, Policy-based Trajectory Predictions for Planning
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[1314]  arXiv:2206.13399 (cross-list from cs.LG) [pdf, other]
Title: Transfer Learning via Test-Time Neural Networks Aggregation
Comments: 8 pages
Journal-ref: Proceedings of the 17th international joint conference on computer vision, imaging and computer graphics theory and applications, VISIGRAPP 2022, volume 5: VISAPP, online streaming, february 6-8, 2022, 2022, pp. 642-649
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1315]  arXiv:2206.13406 (cross-list from cs.RO) [pdf, other]
Title: Explicitly incorporating spatial information to recurrent networks for agriculture
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1316]  arXiv:2206.13491 (cross-list from cs.LG) [pdf, other]
Title: Effective training-time stacking for ensembling of deep neural networks
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1317]  arXiv:2206.13497 (cross-list from cs.LG) [pdf, other]
Title: Robustness Implies Generalization via Data-Dependent Generalization Bounds
Comments: Accepted by ICML 2022, and selected for ICML long presentation (top 2% of submissions)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Probability (math.PR); Machine Learning (stat.ML)
[1318]  arXiv:2206.13498 (cross-list from cs.LG) [pdf, other]
Title: Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous Behavior
Comments: Fixed backdoor localization results, made changes to abstract and introduction
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1319]  arXiv:2206.13499 (cross-list from cs.LG) [pdf, other]
Title: Prompting Decision Transformer for Few-Shot Policy Generalization
Comments: ICML 2022. Project page: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1320]  arXiv:2206.13630 (cross-list from cs.AI) [pdf, ps, other]
Title: Toward an ImageNet Library of Functions for Global Optimization Benchmarking
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1321]  arXiv:2206.13687 (cross-list from cs.LG) [pdf, other]
Title: POEM: Out-of-Distribution Detection with Posterior Sampling
Comments: ICML 2022 (Long Talk); First two authors contributed equally
Journal-ref: Thirty-ninth International Conference on Machine Learning (2022)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1322]  arXiv:2206.13883 (cross-list from cs.RO) [pdf, other]
Title: Improving Worst Case Visual Localization Coverage via Place-specific Sub-selection in Multi-camera Systems
Comments: 8 pages, 5 figures, To be published in RA-L 2022
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1323]  arXiv:2206.13932 (cross-list from cs.LG) [pdf, other]
Title: Discrete Morse Sandwich: Fast Computation of Persistence Diagrams for Scalar Data -- An Algorithm and A Benchmark
Subjects: Machine Learning (cs.LG); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
[1324]  arXiv:2206.13968 (cross-list from cs.LG) [pdf, other]
Title: Information Entropy Initialized Concrete Autoencoder for Optimal Sensor Placement and Reconstruction of Geophysical Fields
Comments: 18 pages, 6 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Atmospheric and Oceanic Physics (physics.ao-ph)
[1325]  arXiv:2206.13991 (cross-list from cs.LG) [pdf, other]
Title: Increasing Confidence in Adversarial Robustness Evaluations
Comments: Oral at CVPR 2022 Workshop (Art of Robustness). Project website this https URL
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1326]  arXiv:2206.14056 (cross-list from cs.LG) [pdf, ps, other]
Title: Deep Neural Networks pruning via the Structured Perspective Regularization
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
[1327]  arXiv:2206.14085 (cross-list from cs.LG) [pdf, other]
Title: Continual Learning with Transformers for Image Classification
Comments: Appeared in CVPR CLVision workshop. arXiv admin note: substantial text overlap with arXiv:2203.04640
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1328]  arXiv:2206.14098 (cross-list from cs.LG) [pdf, other]
Title: RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network
Comments: Presented at MLSys 2023. Code available from Cerebras Systems: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1329]  arXiv:2206.14137 (cross-list from cs.NE) [pdf, ps, other]
Title: aSTDP: A More Biologically Plausible Learning
Authors: Shiyuan Li
Comments: 17 pages, 6 figures. arXiv admin note: text overlap with arXiv:1912.00009
Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
[1330]  arXiv:2206.14244 (cross-list from cs.RO) [pdf, other]
Title: Masked World Models for Visual Control
Comments: Project website: this https URL Accepted to CoRL 2022
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1331]  arXiv:2206.14256 (cross-list from cs.LG) [pdf, other]
Title: GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning
Authors: Doğay Kamar (1), Nazım Kemal Üre (1 and 2), Gözde Ünal (1 and 2) ((1) Faculty of Computer and Informatics, Istanbul Technical University (2) Artificial Intelligence and Data Science Research Center, Istanbul Technical University)
Journal-ref: International Conference on Agents and Artificial Intelligence - ICAART, Volume 2, 264-272 (2022)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1332]  arXiv:2206.14372 (cross-list from cs.RO) [pdf, other]
Title: Formalizing and Evaluating Requirements of Perception Systems for Automated Vehicles using Spatio-Temporal Perception Logic
Comments: 32 pages, 11 figures, 6 tables, 4 algorithms, 2 appendixes
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Formal Languages and Automata Theory (cs.FL)
[1333]  arXiv:2206.14486 (cross-list from cs.LG) [pdf, other]
Title: Beyond neural scaling laws: beating power law scaling via data pruning
Comments: Outstanding Paper Award @ NeurIPS 2022. Added github link to metric scores
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1334]  arXiv:2206.14502 (cross-list from cs.LG) [pdf, other]
Title: RegMixup: Mixup as a Regularizer Can Surprisingly Improve Accuracy and Out Distribution Robustness
Comments: 22 pages, 18 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1335]  arXiv:2206.14528 (cross-list from cs.RO) [pdf, other]
Title: Procrustes Analysis with Deformations: A Closed-Form Solution by Eigenvalue Decomposition
Comments: Published on International journal of computer vision (IJCV) 2022
Journal-ref: International Journal of Computer Vision 130, no. 2 (2022): 567-593
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1336]  arXiv:2206.14541 (cross-list from cs.LG) [pdf, other]
Title: Why patient data cannot be easily forgotten?
Comments: Ruolin Su and Xiao Liu contributed equally. Accepted by MICCAI 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1337]  arXiv:2206.14579 (cross-list from cs.CL) [pdf, other]
Title: Competence-based Multimodal Curriculum Learning for Medical Report Generation
Comments: Accepted by ACL 2021 (Oral)
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1338]  arXiv:2206.14581 (cross-list from cs.ET) [pdf, other]
Title: On-device Synaptic Memory Consolidation using Fowler-Nordheim Quantum-tunneling
Subjects: Emerging Technologies (cs.ET); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1339]  arXiv:2206.14617 (cross-list from cs.GR) [pdf, other]
Title: Perspective (In)consistency of Paint by Text
Authors: Hany Farid
Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[1340]  arXiv:2206.14658 (cross-list from cs.LG) [pdf, other]
Title: Cut Inner Layers: A Structured Pruning Strategy for Efficient U-Net GANs
Comments: ICML Workshop on Hardware Aware Efficient Training, 2022
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1341]  arXiv:2206.14687 (cross-list from cs.LG) [pdf, other]
Title: Multi-scale Physical Representations for Approximating PDE Solutions with Graph Neural Operators
Comments: ICLR 2022 Workshop on Geometrical and Topological Representation Learning
Journal-ref: ICLR 2022 Workshop on Geometrical and Topological Representation Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1342]  arXiv:2206.14709 (cross-list from cs.LG) [pdf, other]
Title: An extensible Benchmarking Graph-Mesh dataset for studying Steady-State Incompressible Navier-Stokes Equations
Comments: ICLR 2022 Workshop on Geometrical and Topological Representation Learning
Journal-ref: ICLR 2022 Workshop on Geometrical and Topological Representation Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
[1343]  arXiv:2206.14854 (cross-list from cs.RO) [pdf, other]
Title: Neural Motion Fields: Encoding Grasp Trajectories as Implicit Value Functions
Comments: RSS 2022 Workshop on Implicit Representations for Robotic Manipulation
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1344]  arXiv:2206.14868 (cross-list from cs.LG) [pdf, other]
Title: Teach me how to Interpolate a Myriad of Embeddings
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1345]  arXiv:2206.15007 (cross-list from cs.CL) [pdf, other]
Title: GSCLIP : A Framework for Explaining Distribution Shifts in Natural Language
Comments: Accepted by ICML 2022 DataPerf
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1346]  arXiv:2206.15170 (cross-list from cs.AI) [pdf, other]
Title: LiDAR-as-Camera for End-to-End Driving
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1347]  arXiv:2206.15316 (cross-list from cs.LG) [pdf, other]
Title: Anomaly Detection in Echocardiograms with Dynamic Variational Trajectory Models
Journal-ref: Proceedings of the 7th Machine Learning for Healthcare Conference, PMLR 182:425-458, 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computation (stat.CO); Machine Learning (stat.ML)
[1348]  arXiv:2206.15469 (cross-list from cs.RO) [pdf, other]
Title: Watch and Match: Supercharging Imitation with Regularized Optimal Transport
Comments: Code and robot videos are available on this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1349]  arXiv:2206.15470 (cross-list from cs.GR) [pdf, other]
Title: Dressing Avatars: Deep Photorealistic Appearance for Physically Simulated Clothing
Comments: SIGGRAPH Asia 2022 (ACM ToG) camera ready. The supplementary video can be found on this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1350]  arXiv:2206.00002 (cross-list from eess.IV) [pdf, other]
Title: Calibrated Bagging Deep Learning for Image Semantic Segmentation: A Case Study on COVID-19 Chest X-ray Image
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1351]  arXiv:2206.00041 (cross-list from eess.IV) [pdf, ps, other]
Title: Characterization of 3D Printers and X-Ray Computerized Tomography
Comments: Total 13 Pages, 11 Figures, 5 Tables, 10 References
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1352]  arXiv:2206.00105 (cross-list from eess.IV) [pdf, other]
Title: Deep learning pipeline for image classification on mobile phones
Comments: 20 pages
Journal-ref: 9th International Conference on Artificial Intelligence and Applications (AIAPP 2022)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1353]  arXiv:2206.00305 (cross-list from eess.IV) [pdf, ps, other]
Title: Supervised Denoising of Diffusion-Weighted Magnetic Resonance Images Using a Convolutional Neural Network and Transfer Learning
Comments: Preprint submitted to NeuroImage
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1354]  arXiv:2206.00338 (cross-list from eess.IV) [pdf, other]
Title: CellCentroidFormer: Combining Self-attention and Convolution for Cell Detection
Comments: Accepted at MIUA 2022; Added experiments with CircleNets and extended figure captions
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1355]  arXiv:2206.00356 (cross-list from eess.IV) [pdf, other]
Title: A Survey on Deep Learning for Skin Lesion Segmentation
Comments: Published in Medical Image Analysis (2023); 55 pages, 10 figures; Mirikharaji and Abhishek: Joint first authors; Celebi and Hamarneh: Joint senior authors
Journal-ref: Medical Image Analysis (2023): 102863
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1356]  arXiv:2206.00389 (cross-list from eess.IV) [pdf, other]
Title: A comparative study between vision transformers and CNNs in digital pathology
Comments: 8 pages, 2 figures, accepted for workshop T4Vision (CVPR 2022)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1357]  arXiv:2206.00455 (cross-list from q-bio.QM) [pdf, ps, other]
Title: A robust and lightweight deep attention multiple instance learning algorithm for predicting genetic alterations
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Genomics (q-bio.GN)
[1358]  arXiv:2206.00536 (cross-list from eess.IV) [pdf, other]
Title: Impact of loss function in Deep Learning methods for accurate retinal vessel segmentation
Comments: Paper submitted to MICAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1359]  arXiv:2206.00566 (cross-list from eess.IV) [pdf, ps, other]
Title: The Fully Convolutional Transformer for Medical Image Segmentation
Journal-ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023, pp. 3660-3669
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1360]  arXiv:2206.00831 (cross-list from eess.IV) [pdf, other]
Title: Dynamic Cardiac MRI Reconstruction Using Combined Tensor Nuclear Norm and Casorati Matrix Nuclear Norm Regularizations
Authors: Yinghao Zhang, Yue Hu
Comments: 4 pages, 3 figures, 1 table, accepted in IEEE ISBI 2022
Journal-ref: [C]//2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI). IEEE, 2022: 1-4
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1361]  arXiv:2206.00850 (cross-list from eess.IV) [pdf, other]
Title: Dynamic MRI using Learned Transform-based Tensor Low-Rank Network (LT$^2$LR-Net)
Comments: 4 pages, 2 figures, 1 tabel, accepted by IEEE ISBI 2023 Conference
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1362]  arXiv:2206.01088 (cross-list from eess.IV) [pdf, other]
Title: Machine Learning-based Lung and Colon Cancer Detection using Deep Feature Extraction and Ensemble Learning
Comments: Accepted for publication in the Special Issue of Expert Systems with Applications (IF:6.954, Cite:12.70) How to Cite: Md. Alamin Talukder, Md. Manowarul Islam, Md Ashraf Uddin, Arnisha Akhter, Khondokar Fida Hasan, Mohammad Ali Moni. "Machine Learning-based Lung and Colon Cancer Detection using Deep Feature Extraction and Ensemble Learning", Expert Systems with Applications. 2022 Jun 1
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1363]  arXiv:2206.01096 (cross-list from eess.IV) [pdf, ps, other]
Title: A Dual-fusion Semantic Segmentation Framework With GAN For SAR Images
Comments: 4 pages,4 figures, 2022 IEEE International Geoscience and Remote Sensing Symposium
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1364]  arXiv:2206.01103 (cross-list from eess.IV) [pdf, other]
Title: Noise2NoiseFlow: Realistic Camera Noise Modeling without Clean Images
Comments: CVPR 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1365]  arXiv:2206.01118 (cross-list from eess.IV) [pdf, ps, other]
Title: Comparing Conventional and Deep Feature Models for Classifying Fundus Photography of Hemorrhages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1366]  arXiv:2206.01344 (cross-list from eess.IV) [pdf, ps, other]
Title: Detecting Pulmonary Embolism from Computed Tomography Using Convolutional Neural Network
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1367]  arXiv:2206.01397 (cross-list from physics.optics) [pdf, other]
Title: Dynamic Structured Illumination Microscopy with a Neural Space-time Model
Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[1368]  arXiv:2206.01430 (cross-list from eess.IV) [pdf, other]
Title: LenslessPiCam: A Hardware and Software Platform for Lensless Computational Imaging with a Raspberry Pi
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1369]  arXiv:2206.01644 (cross-list from quant-ph) [pdf, ps, other]
Title: Mirror modular cloning and fast quantum associative retrieval
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1370]  arXiv:2206.01728 (cross-list from eess.IV) [pdf, ps, other]
Title: A review of machine learning approaches, challenges and prospects for computational tumor pathology
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[1371]  arXiv:2206.01731 (cross-list from eess.IV) [pdf, other]
Title: Empirical Study of Quality Image Assessment for Synthesis of Fetal Head Ultrasound Imaging with DCGANs
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1372]  arXiv:2206.01735 (cross-list from eess.IV) [pdf, other]
Title: Examining the behaviour of state-of-the-art convolutional neural networks for brain tumor detection with and without transfer learning
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1373]  arXiv:2206.01736 (cross-list from eess.IV) [pdf, other]
Title: Adaptive Adversarial Training to Improve Adversarial Robustness of DNNs for Medical Image Segmentation and Detection
Comments: 17 pages
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1374]  arXiv:2206.01737 (cross-list from eess.IV) [pdf, other]
Title: MaxStyle: Adversarial Style Composition for Robust Medical Image Segmentation
Comments: Early accepted by MICCAI 2022 (Camera-ready version)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1375]  arXiv:2206.01738 (cross-list from eess.IV) [pdf, other]
Title: RIDDLE: Lidar Data Compression with Range Image Deep Delta Encoding
Comments: 14 pages, 10 figures; CVPR 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1376]  arXiv:2206.01739 (cross-list from eess.IV) [pdf, ps, other]
Title: Mutual- and Self- Prototype Alignment for Semi-supervised Medical Image Segmentation
Comments: 11 pages, 3 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1377]  arXiv:2206.01740 (cross-list from eess.IV) [pdf, other]
Title: Denoising Fast X-Ray Fluorescence Raster Scans of Paintings
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1378]  arXiv:2206.01741 (cross-list from eess.IV) [pdf, other]
Title: Patcher: Patch Transformers with Mixture of Experts for Precise Medical Image Segmentation
Comments: MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1379]  arXiv:2206.01742 (cross-list from eess.IV) [pdf, other]
Title: Learning Probabilistic Topological Representations Using Discrete Morse Theory
Comments: 16 pages, 11 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1380]  arXiv:2206.01743 (cross-list from eess.IV) [pdf, other]
Title: Orthogonal Transform based Generative Adversarial Network for Image Dehazing
Comments: 12 pages, 14 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1381]  arXiv:2206.01745 (cross-list from eess.IV) [pdf, ps, other]
Title: Detection of Fibrosis in Cine Magnetic Resonance Images Using Artificial Intelligence Techniques
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1382]  arXiv:2206.01746 (cross-list from eess.IV) [pdf, ps, other]
Title: Automatic Quantification of Volumes and Biventricular Function in Cardiac Resonance. Validation of a New Artificial Intelligence Approach
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1383]  arXiv:2206.01774 (cross-list from eess.IV) [pdf, other]
Title: Monkeypox Image Data collection
Comments: This is the attempt of creating monkeypox image dataset collected from various sources and it will continue to update by collectiong samples from journals and other public access domains
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1384]  arXiv:2206.01793 (cross-list from eess.IV) [pdf, ps, other]
Title: R2U++: A Multiscale Recurrent Residual U-Net with Dense Skip Connections for Medical Image Segmentation
Comments: Paper accepted in Neural Computing and Applications (2022). Please cite the final version available from Springer website this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1385]  arXiv:2206.01826 (cross-list from stat.ME) [pdf, other]
Title: The Gamma Generalized Normal Distribution: A Descriptor of SAR Imagery
Comments: 21 pages, 6 figures, 6 tables
Journal-ref: Journal of Computational and Applied Mathematics, vol. 347, pages 257-272, February 2019
Subjects: Methodology (stat.ME); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Statistics Theory (math.ST); Data Analysis, Statistics and Probability (physics.data-an)
[1386]  arXiv:2206.01856 (cross-list from eess.IV) [pdf, other]
Title: Poisson2Sparse: Self-Supervised Poisson Denoising From a Single Image
Comments: Accepted to MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1387]  arXiv:2206.01862 (cross-list from eess.IV) [pdf, other]
Title: Image Data collection and implementation of deep learning-based model in detecting Monkeypox disease using modified VGG16
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1388]  arXiv:2206.01897 (cross-list from eess.IV) [pdf, other]
Title: Modeling of Textures to Predict Immune Cell Status and Survival of Brain Tumour Patients
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM); Methodology (stat.ME)
[1389]  arXiv:2206.01903 (cross-list from eess.IV) [pdf, other]
Title: Deep Radiomic Analysis for Predicting Coronavirus Disease 2019 in Computerized Tomography and X-ray Images
Journal-ref: IEEE Trans Neural Netw Learn Syst. 2022 Jan;33(1):3-11
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1390]  arXiv:2206.02061 (cross-list from eess.SP) [pdf, other]
Title: Low Power Neuromorphic EMG Gesture Classification
Comments: 3 Pages, 5 figures, 1 table
Subjects: Signal Processing (eess.SP); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Neural and Evolutionary Computing (cs.NE)
[1391]  arXiv:2206.02225 (cross-list from eess.IV) [pdf, other]
Title: Physically Inspired Constraint for Unsupervised Regularized Ultrasound Elastography
Comments: Accepted in MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1392]  arXiv:2206.02278 (cross-list from eess.IV) [pdf, other]
Title: Autoregressive Model for Multi-Pass SAR Change Detection Based on Image Stacks
Comments: 9 pages, 10 figures
Journal-ref: Proceedings Volume 10789, Image and Signal Processing for Remote Sensing XXIV; 1078916 (2018)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP); Methodology (stat.ME)
[1393]  arXiv:2206.02358 (cross-list from eess.SP) [pdf, other]
Title: Implementation of a Modified U-Net for Medical Image Segmentation on Edge Devices
Comments: Preprint of paper accepted in IEEE Transactions on Circuits and Systems II: Express Brief
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[1394]  arXiv:2206.02425 (cross-list from eess.IV) [pdf, other]
Title: mmFormer: Multimodal Medical Transformer for Incomplete Multimodal Learning of Brain Tumor Segmentation
Comments: Accepted to MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1395]  arXiv:2206.02510 (cross-list from physics.optics) [pdf, other]
Title: Single pixel imaging at high pixel resolutions
Comments: Paper accepted to Optics Express on 23/05/2022
Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1396]  arXiv:2206.02558 (cross-list from q-bio.NC) [pdf, other]
Title: Binding Dancers Into Attractors
Subjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1397]  arXiv:2206.02748 (cross-list from eess.IV) [pdf, other]
Title: Compound Multi-branch Feature Fusion for Real Image Restoration
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1398]  arXiv:2206.02797 (cross-list from eess.AS) [pdf, ps, other]
Title: FedNST: Federated Noisy Student Training for Automatic Speech Recognition
Comments: Accepted at Interspeech 2022
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[1399]  arXiv:2206.02837 (cross-list from eess.IV) [pdf, other]
Title: EVAC+: Multi-scale V-net with Deep Feature CRF Layers for Brain Extraction
Comments: Replaced with advancements in the model and results
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1400]  arXiv:2206.02838 (cross-list from eess.IV) [pdf, other]
Title: Invertible Sharpening Network for MRI Reconstruction Enhancement
Comments: Accepted by MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1401]  arXiv:2206.02959 (cross-list from eess.IV) [pdf, other]
Title: HMRNet: High and Multi-Resolution Network with Bidirectional Feature Calibration for Brain Structure Segmentation in Radiotherapy
Comments: 11 pages, 6 figures, Accepted by IEEE JBHI
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1402]  arXiv:2206.03003 (cross-list from eess.IV) [pdf, other]
Title: Transformer-based Personalized Attention Mechanism for Medical Images with Clinical Records
Journal-ref: Takagi, Yusuke, et al. "Transformer-based personalized attention mechanism for medical images with clinical records." Journal of Pathology Informatics (2023): 100185
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1403]  arXiv:2206.03009 (cross-list from eess.IV) [pdf, other]
Title: Self-Knowledge Distillation based Self-Supervised Learning for Covid-19 Detection from Chest X-Ray Images
Comments: Published as a conference paper at ICASSP 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1404]  arXiv:2206.03043 (cross-list from eess.IV) [pdf, other]
Title: COVIDx CT-3: A Large-scale, Multinational, Open-Source Benchmark Dataset for Computer-aided COVID-19 Screening from Chest CT Images
Comments: 6 pages, MED-NeurIPS 2022 workshop
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1405]  arXiv:2206.03049 (cross-list from eess.IV) [pdf, other]
Title: Siamese Encoder-based Spatial-Temporal Mixer for Growth Trend Prediction of Lung Nodules on CT Scans
Comments: MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1406]  arXiv:2206.03066 (cross-list from quant-ph) [pdf, other]
Title: Recent Advances for Quantum Neural Networks in Generative Learning
Comments: The first two authors contributed equally to this work
Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1407]  arXiv:2206.03247 (cross-list from eess.IV) [pdf, other]
Title: Towards better Interpretable and Generalizable AD detection using Collective Artificial Intelligence
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1408]  arXiv:2206.03336 (cross-list from eess.IV) [pdf, other]
Title: Parotid Gland MRI Segmentation Based on Swin-Unet and Multimodal Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1409]  arXiv:2206.03359 (cross-list from eess.IV) [pdf, other]
Title: An efficient semi-supervised quality control system trained using physics-based MRI-artefact generators and adversarial training
Authors: Daniele Ravi (for the Alzheimer's Disease Neuroimaging Initiative), Frederik Barkhof, Daniel C. Alexander, Lemuel Puglisi, Geoffrey JM Parker, Arman Eshaghi
Journal-ref: Medical Image Analysis 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1410]  arXiv:2206.03413 (cross-list from physics.med-ph) [pdf, ps, other]
Title: Deep Learning based Direct Segmentation Assisted by Deformable Image Registration for Cone-Beam CT based Auto-Segmentation for Adaptive Radiotherapy
Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
[1411]  arXiv:2206.03603 (cross-list from eess.IV) [pdf, ps, other]
Title: A new method incorporating deep learning with shape priors for left ventricular segmentation in myocardial perfusion SPECT images
Comments: 21 pages, 14 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1412]  arXiv:2206.03671 (cross-list from eess.IV) [pdf, other]
Title: COVIDx CXR-3: A Large-Scale, Open-Source Benchmark Dataset of Chest X-ray Images for Computer-Aided COVID-19 Diagnostics
Comments: 5 pages, MED-NeurIPS 2022 workshop
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1413]  arXiv:2206.03709 (cross-list from eess.IV) [pdf, other]
Title: Hypernetwork-based Personalized Federated Learning for Multi-Institutional CT Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1414]  arXiv:2206.03803 (cross-list from eess.IV) [pdf, other]
Title: Dual Windows Are Significant: Learning from Mediastinal Window and Focusing on Lung Window
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1415]  arXiv:2206.03830 (cross-list from eess.IV) [pdf, other]
Title: Generative Myocardial Motion Tracking via Latent Space Exploration with Biomechanics-informed Prior
Comments: Under review
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1416]  arXiv:2206.03900 (cross-list from eess.IV) [pdf, other]
Title: Unsupervised Deformable Image Registration with Absent Correspondences in Pre-operative and Post-Recurrence Brain Tumor MRI Scans
Comments: Accepted by MICCAI2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1417]  arXiv:2206.03935 (cross-list from eess.IV) [pdf, other]
Title: Dual-Distribution Discrepancy for Anomaly Detection in Chest X-Rays
Comments: Early Accepted to MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1418]  arXiv:2206.03955 (cross-list from stat.ML) [pdf, other]
Title: Out-of-Distribution Detection with Class Ratio Estimation
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1419]  arXiv:2206.04056 (cross-list from eess.IV) [pdf, ps, other]
Title: An Improved Deep Convolutional Neural Network by Using Hybrid Optimization Algorithms to Detect and Classify Brain Tumor Using Augmented MRI Images
Comments: Multimed Tools Appl (2022)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1420]  arXiv:2206.04145 (cross-list from eess.IV) [pdf, other]
Title: Deep Estimation of Speckle Statistics Parametric Images
Comments: Accepted in EMBC 2022
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1421]  arXiv:2206.04238 (cross-list from eess.IV) [pdf, other]
Title: Cardiac Adipose Tissue Segmentation via Image-Level Annotations
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1422]  arXiv:2206.04272 (cross-list from cond-mat.mes-hall) [pdf, ps, other]
Title: STEM image analysis based on deep learning: identification of vacancy defects and polymorphs of ${MoS_2}$
Comments: 24 pages, 5 figures
Journal-ref: Nano Letters, 2022
Subjects: Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Materials Science (cond-mat.mtrl-sci); Computer Vision and Pattern Recognition (cs.CV)
[1423]  arXiv:2206.04289 (cross-list from eess.IV) [pdf, other]
Title: A No-Reference Deep Learning Quality Assessment Method for Super-resolution Images Based on Frequency Maps
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1424]  arXiv:2206.04328 (cross-list from eess.IV) [pdf, other]
Title: Novel projection schemes for graph-based Light Field coding
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1425]  arXiv:2206.04336 (cross-list from eess.IV) [pdf, other]
Title: Joint Modeling of Image and Label Statistics for Enhancing Model Generalizability of Medical Image Segmentation
Comments: MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1426]  arXiv:2206.04341 (cross-list from eess.IV) [pdf, other]
Title: How Asynchronous Events Encode Video
Comments: 6 pages, 4 figures
Journal-ref: 2021 55th Asilomar Conference on Signals, Systems, and Computers
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1427]  arXiv:2206.04431 (cross-list from eess.IV) [pdf, other]
Title: Cross-boosting of WNNM Image Denoising method by Directional Wavelet Packets
Comments: 30 pages, 28 figures. arXiv admin note: substantial text overlap with arXiv:2008.11595. text overlap with arXiv:2001.04899
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1428]  arXiv:2206.04514 (cross-list from eess.IV) [pdf, ps, other]
Title: SAR Despeckling using a Denoising Diffusion Probabilistic Model
Comments: Our code is available at this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1429]  arXiv:2206.04548 (cross-list from eess.IV) [pdf, other]
Title: Classification of COVID-19 in Chest X-ray Images Using Fusion of Deep Features and LightGBM
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1430]  arXiv:2206.04647 (cross-list from eess.IV) [pdf, other]
Title: VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution
Comments: Accepted to CVPR 2022. Project page: this http URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1431]  arXiv:2206.04681 (cross-list from eess.IV) [pdf, other]
Title: Gaussian Fourier Pyramid for Local Laplacian Filter
Journal-ref: IEEE Signal Processing Letters (SPL), vol. 29, pp. 11-15, 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1432]  arXiv:2206.04682 (cross-list from eess.IV) [pdf, other]
Title: RT-DNAS: Real-time Constrained Differentiable Neural Architecture Search for 3D Cardiac Cine MRI Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1433]  arXiv:2206.04684 (cross-list from eess.IV) [pdf, other]
Title: Structure-consistent Restoration Network for Cataract Fundus Image Enhancement
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1434]  arXiv:2206.04689 (cross-list from eess.IV) [pdf, ps, other]
Title: AI-based Clinical Assessment of Optic Nerve Head Robustness Superseding Biomechanical Testing
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1435]  arXiv:2206.04732 (cross-list from eess.IV) [pdf, other]
Title: AI-MIA: COVID-19 Detection & Severity Analysis through Medical Imaging
Comments: arXiv admin note: substantial text overlap with arXiv:2106.07524
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1436]  arXiv:2206.04877 (cross-list from eess.IV) [pdf, other]
Title: Efficient Per-Shot Convex Hull Prediction By Recurrent Learning
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1437]  arXiv:2206.05047 (cross-list from eess.IV) [pdf, other]
Title: A GPU-Accelerated Light-field Super-resolution Framework Based on Mixed Noise Model and Weighted Regularization
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
[1438]  arXiv:2206.05049 (cross-list from eess.IV) [pdf, other]
Title: Denoising Generalized Expectation-Consistent Approximation for MR Image Recovery
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[1439]  arXiv:2206.05054 (cross-list from eess.IV) [pdf, other]
Title: A No-reference Quality Assessment Metric for Point Cloud Based on Captured Video Sequences
Comments: Accepted to IEEE 24th International Workshop on Multimedia Signal Processing, 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1440]  arXiv:2206.05092 (cross-list from eess.IV) [pdf, other]
Title: Learning self-calibrated optic disc and cup segmentation from multi-rater annotations
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1441]  arXiv:2206.05148 (cross-list from eess.IV) [pdf, other]
Title: Weakly-supervised segmentation using inherently-explainable classification models and their application to brain tumour classification
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1442]  arXiv:2206.05236 (cross-list from physics.optics) [pdf, ps, other]
Title: Optical Diffraction Tomography based on 3D Physics-Inspired Neural Network (PINN)
Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV)
[1443]  arXiv:2206.05277 (cross-list from eess.IV) [pdf, other]
Title: Superresolution and Segmentation of OCT scans using Multi-Stage adversarial Guided Attention Training
Comments: 5 pages,conference
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1444]  arXiv:2206.05278 (cross-list from eess.IV) [pdf, other]
Title: Dual-Branch Squeeze-Fusion-Excitation Module for Cross-Modality Registration of Cardiac SPECT and CT
Comments: 10 pages, 4 figures, accepted at MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1445]  arXiv:2206.05279 (cross-list from eess.IV) [pdf, other]
Title: PILC: Practical Image Lossless Compression with an End-to-end GPU Oriented Neural Framework
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[1446]  arXiv:2206.05283 (cross-list from eess.IV) [pdf, other]
Title: Poissonian Blurred Image Deconvolution by Framelet based Local Minimal Prior
Authors: Reza Parvaz
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[1447]  arXiv:2206.05284 (cross-list from eess.IV) [pdf, other]
Title: Decoupling Predictions in Distributed Learning for Multi-Center Left Atrial MRI Segmentation
Comments: Accepted by MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1448]  arXiv:2206.05288 (cross-list from eess.IV) [pdf, other]
Title: From Labels to Priors in Capsule Endoscopy: A Prior Guided Approach for Improving Generalization with Few Labels
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1449]  arXiv:2206.05289 (cross-list from eess.IV) [pdf, other]
Title: Localized adversarial artifacts for compressed sensing MRI
Comments: 14 pages, 7 figures
Journal-ref: SIAM Journal on Imaging Sciences, 16(4):SC14-SC26, 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1450]  arXiv:2206.05472 (cross-list from eess.IV) [pdf, other]
Title: Differentiable Projection from Optical Coherence Tomography B-Scan without Retinal Layer Segmentation Supervision
Comments: ISBI2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1451]  arXiv:2206.05516 (cross-list from eess.IV) [pdf, other]
Title: Deep Learning-Based MR Image Re-parameterization
Comments: A. Narang, A. Raj, M. Pop and M. Ebrahimi, "Deep Learning-Based MR Image Re-parameterization," 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE), Las Vegas, NV, USA, 2023, pp. 536-541, doi: 10.1109/CSCE60160.2023.00094
Journal-ref: 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1452]  arXiv:2206.05575 (cross-list from eess.IV) [pdf, ps, other]
Title: MammoFL: Mammographic Breast Density Estimation using Federated Learning
Comments: Deep learning, federated learning, mammography, breast density, risk assessment
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[1453]  arXiv:2206.05615 (cross-list from eess.IV) [pdf, other]
Title: Machine learning approaches for COVID-19 detection from chest X-ray imaging: A Systematic Review
Authors: Harold Brayan Arteaga-Arteaga (1), Melissa delaPava (1), Alejandro Mora-Rubio (1), Mario Alejandro Bravo-Ortíz (1), Jesus Alejandro Alzate-Grisales (1), Daniel Arias-Garzón (1), Luis Humberto López-Murillo (2), Felipe Buitrago-Carmona (3), Juan Pablo Villa-Pulgarín (1), Esteban Mercado-Ruiz (1), Simon Orozco-Arias (3 and 4), M. Hassaballah (5), Maria de la Iglesia-Vaya (6), Oscar Cardona-Morales (1), Reinel Tabares-Soto (1) ((1) Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Colombia, (2) Department of Chemical Engineering, Universidad Nacional de Colombia, Manizales, Colombia, (3) Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Colombia, (4) Department of Systems and informatics, Universidad de Caldas, Manizales, Colombia, (5) Faculty of Computers and Information, South Valley University, Qena, Egypt, (6) Unidad Mixta de Imagen Biomédica FISABIO-CIPF, Fundación para el Fomento de la Investigación Sanitario y Biomédica de la Comunidad Valenciana, Valencia, Spain)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1454]  arXiv:2206.05618 (cross-list from physics.med-ph) [pdf, other]
Title: Synthetic PET via Domain Translation of 3D MRI
Comments: under review
Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
[1455]  arXiv:2206.05647 (cross-list from eess.IV) [pdf, other]
Title: A Fast Alternating Minimization Algorithm for Coded Aperture Snapshot Spectral Imaging Based on Sparsity and Deep Image Priors
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
[1456]  arXiv:2206.05650 (cross-list from eess.IV) [pdf, other]
Title: Preprocessing Enhanced Image Compression for Machine Vision
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1457]  arXiv:2206.05695 (cross-list from eess.IV) [pdf, ps, other]
Title: PD-DWI: Predicting response to neoadjuvant chemotherapy in invasive breast cancer with Physiologically-Decomposed Diffusion-Weighted MRI machine-learning model
Comments: Accepted to Medical Image Computing and Computer Assisted Intervention - MICCAI 2022 to be held during Sept 18-22 in Singapore
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1458]  arXiv:2206.05782 (cross-list from eess.IV) [pdf, other]
Title: DSCA: A Dual-Stream Network with Cross-Attention on Whole-Slide Image Pyramids for Cancer Prognosis
Comments: 12 pages, 6 figures, 7 tables
Journal-ref: Expert Systems with Applications, 120280 (2023)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1459]  arXiv:2206.05935 (cross-list from eess.IV) [pdf, other]
Title: Fluorescence angiography classification in colorectal surgery -- A preliminary report
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1460]  arXiv:2206.06065 (cross-list from eess.IV) [pdf, ps, other]
Title: Deep ensemble learning for segmenting tuberculosis-consistent manifestations in chest radiographs
Comments: 13 pages, 6 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1461]  arXiv:2206.06070 (cross-list from eess.IV) [pdf, other]
Title: Annular Computational Imaging: Capture Clear Panoramic Images through Simple Lens
Comments: Accepted to IEEE Transactions on Computational Imaging (TCI). Code and datasets are publicly available at this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
[1462]  arXiv:2206.06127 (cross-list from eess.IV) [pdf, other]
Title: SyntheX: Scaling Up Learning-based X-ray Image Analysis Through In Silico Experiments
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1463]  arXiv:2206.06235 (cross-list from eess.IV) [pdf, ps, other]
Title: Prostate Cancer Malignancy Detection and localization from mpMRI using auto-Deep Learning: One Step Closer to Clinical Utilization
Comments: arXiv admin note: text overlap with arXiv:1903.12331
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1464]  arXiv:2206.06253 (cross-list from eess.IV) [pdf, ps, other]
Title: RPLHR-CT Dataset and Transformer Baseline for Volumetric Super-Resolution from CT Scans
Comments: Accepted MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1465]  arXiv:2206.06264 (cross-list from eess.IV) [pdf, other]
Title: Automatic Polyp Segmentation with Multiple Kernel Dilated Convolution Network
Journal-ref: Published CBMS 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1466]  arXiv:2206.06267 (cross-list from eess.IV) [pdf, other]
Title: MMMNA-Net for Overall Survival Time Prediction of Brain Tumor Patients
Comments: Accepted EMBC 2022
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1467]  arXiv:2206.06341 (cross-list from eess.IV) [pdf, other]
Title: Unsupervised inter-frame motion correction for whole-body dynamic PET using convolutional long short-term memory in a convolutional neural network
Comments: Preprint submitted to Medical Image Analysis
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP); Applications (stat.AP)
[1468]  arXiv:2206.06445 (cross-list from eess.IV) [pdf, other]
Title: Fitting Segmentation Networks on Varying Image Resolutions using Splatting
Comments: Accepted for MIUA 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1469]  arXiv:2206.06448 (cross-list from eess.IV) [pdf, ps, other]
Title: Assessing Privacy Leakage in Synthetic 3-D PET Imaging using Transversal GAN
Comments: arXiv admin note: text overlap with arXiv:2111.01866
Subjects: Image and Video Processing (eess.IV); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1470]  arXiv:2206.06541 (cross-list from eess.IV) [pdf, other]
Title: Pixel-by-pixel Mean Opinion Score (pMOS) for No-Reference Image Quality Assessment
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1471]  arXiv:2206.06575 (cross-list from eess.IV) [pdf, other]
Title: Med-DANet: Dynamic Architecture Network for Efficient Medical Volumetric Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1472]  arXiv:2206.06598 (cross-list from eess.IV) [pdf, other]
Title: CorticalFlow$^{++}$: Boosting Cortical Surface Reconstruction Accuracy, Regularity, and Interoperability
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1473]  arXiv:2206.06623 (cross-list from eess.IV) [pdf, other]
Title: ULTRA: Uncertainty-aware Label Distribution Learning for Breast Tumor Cellularity Assessment
Comments: Paper accepted by MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1474]  arXiv:2206.06654 (cross-list from eess.IV) [pdf, other]
Title: The Kidneys Are Not All Normal: Investigating the Speckle Distributions of Transplanted Kidneys
Comments: 25 pages, 2 figures, 3 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[1475]  arXiv:2206.06657 (cross-list from eess.IV) [pdf, other]
Title: The Open Kidney Ultrasound Data Set
Comments: 21 pages, 1 figure, 5 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1476]  arXiv:2206.06663 (cross-list from q-bio.QM) [pdf, ps, other]
Title: Quantitative Imaging Principles Improves Medical Image Learning
Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1477]  arXiv:2206.06701 (cross-list from eess.IV) [pdf, other]
Title: CNN-based Classification Framework for Lung Tissues with Auxiliary Information
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1478]  arXiv:2206.06725 (cross-list from eess.IV) [pdf, other]
Title: Automated SSIM Regression for Detection and Quantification of Motion Artefacts in Brain MR Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1479]  arXiv:2206.06730 (cross-list from eess.IV) [pdf, other]
Title: Automated Precision Localization of Peripherally Inserted Central Catheter Tip through Model-Agnostic Multi-Stage Networks
Comments: Subin Park and Yoon Ki Cha have contributed equally to this work as the co-first author. Kyung-Su Kim (kskim.doc@gmail.com) and Myung Jin Chung (mj1.chung@samsung.com) have contributed equally to this work as the co-corresponding author
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1480]  arXiv:2206.06813 (cross-list from eess.IV) [pdf, other]
Title: Learning towards Synchronous Network Memorizability and Generalizability for Continual Segmentation across Multiple Sites
Comments: Early accepted in MICCAI2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1481]  arXiv:2206.06862 (cross-list from q-bio.QM) [pdf, other]
Title: Evaluating histopathology transfer learning with ChampKit
Comments: Submitted to NeurIPS 2022 Track on Datasets and Benchmarks. Source code available at this https URL
Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1482]  arXiv:2206.06947 (cross-list from eess.IV) [pdf, other]
Title: K-Space Transformer for Undersampled MRI Reconstruction
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1483]  arXiv:2206.07122 (cross-list from stat.ML) [pdf, other]
Title: Loss Functions for Classification using Structured Entropy
Authors: Brian Lucena
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (cs.LG)
[1484]  arXiv:2206.07156 (cross-list from eess.IV) [pdf, other]
Title: Federated Multi-organ Segmentation with Inconsistent Labels
Comments: v1: 10 pages, 5 figures; v2: 14 pages, 5 figures, accepted by IEEE Transactions on Medical Imaging (TMI), published version available at this https URL, source code available at this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1485]  arXiv:2206.07219 (cross-list from eess.IV) [pdf, ps, other]
Title: A Projection-Based K-space Transformer Network for Undersampled Radial MRI Reconstruction with Limited Training Subjects
Comments: Accepted at MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1486]  arXiv:2206.07280 (cross-list from eess.IV) [pdf, ps, other]
Title: ERNAS: An Evolutionary Neural Architecture Search for Magnetic Resonance Image Reconstructions
Comments: 11 pages, 9 figures, and 4 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1487]  arXiv:2206.07281 (cross-list from physics.optics) [pdf, ps, other]
Title: Super-resolution image display using diffractive decoders
Comments: 26 Pages, 9 Figures
Journal-ref: Science Advances (2022)
Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Applied Physics (physics.app-ph)
[1488]  arXiv:2206.07364 (cross-list from eess.IV) [pdf, other]
Title: Seeking Common Ground While Reserving Differences: Multiple Anatomy Collaborative Framework for Undersampled MRI Reconstruction
Comments: submitted to an IEEE journal
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1489]  arXiv:2206.07388 (cross-list from physics.geo-ph) [pdf, ps, other]
Title: Subsurface Depths Structure Maps Reconstruction with Generative Adversarial Networks
Authors: Dmitry Ivlev
Comments: 12 pages, 12 figures, 1 table
Subjects: Geophysics (physics.geo-ph); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1490]  arXiv:2206.07417 (cross-list from eess.IV) [pdf, other]
Title: Interpretable differential diagnosis for Alzheimer's disease and Frontotemporal dementia
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1491]  arXiv:2206.07422 (cross-list from eess.IV) [pdf, other]
Title: Deep Neural Network Pruning for Nuclei Instance Segmentation in Hematoxylin & Eosin-Stained Histological Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1492]  arXiv:2206.07481 (cross-list from eess.SP) [pdf, ps, other]
Title: A Survey of Detection Methods for Die Attachment and Wire Bonding Defects in Integrated Circuit Manufacturing
Comments: 13 pages, 9 figures, 8 tables
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1493]  arXiv:2206.07494 (cross-list from cond-mat.stat-mech) [pdf, other]
Title: Counting Phases and Faces Using Bayesian Thermodynamic Integration
Comments: 20 pages, 9 figures, plus appendix with additional figures
Subjects: Statistical Mechanics (cond-mat.stat-mech); Disordered Systems and Neural Networks (cond-mat.dis-nn); Computer Vision and Pattern Recognition (cs.CV); Data Analysis, Statistics and Probability (physics.data-an)
[1494]  arXiv:2206.07542 (cross-list from q-bio.NC) [pdf, other]
Title: A Deep Generative Model of Neonatal Cortical Surface Development
Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1495]  arXiv:2206.07595 (cross-list from eess.IV) [pdf, ps, other]
Title: BIO-CXRNET: A Robust Multimodal Stacking Machine Learning Technique for Mortality Risk Prediction of COVID-19 Patients using Chest X-Ray Images and Clinical Data
Comments: 25 pages, 8 Tables, 10 Figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1496]  arXiv:2206.07599 (cross-list from eess.IV) [pdf, other]
Title: How GNNs Facilitate CNNs in Mining Geometric Information from Large-Scale Medical Images
Comments: 21 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1497]  arXiv:2206.07664 (cross-list from eess.IV) [pdf, other]
Title: CRISP - Reliable Uncertainty Estimation for Medical Image Segmentation
Comments: 9 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1498]  arXiv:2206.08019 (cross-list from eess.IV) [pdf, other]
Title: Multi-View Imputation and Cross-Attention Network Based on Incomplete Longitudinal and Multimodal Data for Conversion Prediction of Mild Cognitive Impairment
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1499]  arXiv:2206.08023 (cross-list from eess.IV) [pdf, other]
Title: AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1500]  arXiv:2206.08078 (cross-list from eess.IV) [pdf, other]
Title: U-PET: MRI-based Dementia Detection with Joint Generation of Synthetic FDG-PET Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1501]  arXiv:2206.08272 (cross-list from eess.IV) [pdf, other]
Title: Longitudinal detection of new MS lesions using Deep Learning
Comments: preprint
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1502]  arXiv:2206.08277 (cross-list from astro-ph.EP) [pdf, other]
Title: A machine-generated catalogue of Charon's craters and implications for the Kuiper belt
Authors: Mohamad Ali-Dib
Comments: 16 pages, 2 figures, accepted for publication in Icarus
Subjects: Earth and Planetary Astrophysics (astro-ph.EP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1503]  arXiv:2206.08298 (cross-list from eess.IV) [pdf, other]
Title: Video Capsule Endoscopy Classification using Focal Modulation Guided Convolutional Neural Network
Journal-ref: CBMS 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1504]  arXiv:2206.08308 (cross-list from eess.IV) [pdf, ps, other]
Title: Deepfake histological images for enhancing digital pathology
Subjects: Image and Video Processing (eess.IV); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1505]  arXiv:2206.08398 (cross-list from eess.IV) [pdf, other]
Title: Learning Generic Lung Ultrasound Biomarkers for Decoupling Feature Extraction from Downstream Tasks
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1506]  arXiv:2206.08439 (cross-list from eess.IV) [pdf, other]
Title: OpenSRH: optimizing brain tumor surgery using intraoperative stimulated Raman histology
Comments: Neural Information Processing Systems (NeurIPS) 2022 Datasets and Benchmarks Track
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1507]  arXiv:2206.08481 (cross-list from eess.IV) [pdf, other]
Title: Orientation-guided Graph Convolutional Network for Bone Surface Segmentation
Comments: Accepted at MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1508]  arXiv:2206.08543 (cross-list from eess.IV) [pdf, ps, other]
Title: Multi-Classification of Brain Tumor Images Using Transfer Learning Based Deep Neural Network
Comments: 7 pages, 4 figures, 2 tables, International Virtual Conference on ARTIFICIAL INTELLIGENCE FOR SMART COMMUNITY, Malaysia
Journal-ref: Conference proceedings \c{opyright} 2023 International Conference on Artificial Intelligence for Smart Community
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1509]  arXiv:2206.08557 (cross-list from eess.IV) [pdf, other]
Title: COVID-19 Detection using Transfer Learning with Convolutional Neural Network
Comments: 4 pages, 4 figures, 2nd International Conference on Robotics, Electrical and Signal Processing Techniques (ICREST), DHAKA, Bangladesh
Journal-ref: 2nd International Conference on Robotics, Electrical and Signal Processing Techniques (ICREST), DHAKA, Bangladesh, 2021, pp. 429-432
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1510]  arXiv:2206.08612 (cross-list from eess.IV) [pdf, other]
Title: OADAT: Experimental and Synthetic Clinical Optoacoustic Data for Standardized Image Processing
Comments: Accepted to TMLR. 32 pages, 24 figures, 9 tables
Journal-ref: Transactions on Machine Learning Research (2023) 2835-8856
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1511]  arXiv:2206.08671 (cross-list from stat.ML) [pdf, other]
Title: FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification
Journal-ref: The Eleventh International Conference on Learning Representations (ICLR 2023)
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1512]  arXiv:2206.08787 (cross-list from eess.IV) [pdf, other]
Title: Leveraging Uncertainty in Deep Learning for Pancreatic Adenocarcinoma Grading
Comments: 26th UK Conference on Medical Image Understanding and Analysis; 27 - 29 July 2022; University of Cambridge, UK. arXiv admin note: text overlap with arXiv:2003.10769
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1513]  arXiv:2206.08885 (cross-list from eess.IV) [pdf, other]
Title: Incorporating intratumoral heterogeneity into weakly-supervised deep learning models via variance pooling
Comments: MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Methodology (stat.ME)
[1514]  arXiv:2206.08936 (cross-list from eess.IV) [pdf, other]
Title: Simultaneous Bone and Shadow Segmentation Network using Task Correspondence Consistency
Comments: Accepted at MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1515]  arXiv:2206.08984 (cross-list from eess.IV) [pdf, other]
Title: Multi-scale Super-resolution Magnetic Resonance Spectroscopic Imaging with Adjustable Sharpness
Comments: Accepted by MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1516]  arXiv:2206.08985 (cross-list from eess.IV) [pdf, other]
Title: TransResU-Net: Transformer based ResU-Net for Real-Time Colonoscopy Polyp Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1517]  arXiv:2206.08994 (cross-list from stat.ML) [pdf, other]
Title: Robust Group Synchronization via Quadratic Programming
Comments: Accepted to ICML 2022
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[1518]  arXiv:2206.09065 (cross-list from eess.IV) [pdf, ps, other]
Title: Free-form Lesion Synthesis Using a Partial Convolution Generative Adversarial Network for Enhanced Deep Learning Liver Tumor Segmentation
Comments: The paper is under review by JACMP-Journal of Applied Medical Physics
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1519]  arXiv:2206.09128 (cross-list from eess.IV) [pdf, other]
Title: A Combined PCA-MLP Network for Early Breast Cancer Detection
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1520]  arXiv:2206.09146 (cross-list from eess.IV) [pdf, other]
Title: A Perceptually Optimized and Self-Calibrated Tone Mapping Operator
Comments: 15 pages,17 figures
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1521]  arXiv:2206.09193 (cross-list from eess.IV) [pdf, ps, other]
Title: Multi-Modality Image Super-Resolution using Generative Adversarial Networks
Comments: to be published in the Proceedings of 16th International Conference on Computer Graphics, Visualization, Computer Vision and Image Processing (CGVCVIP), Lisbon, Portugal, July 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1522]  arXiv:2206.09210 (cross-list from eess.IV) [pdf, other]
Title: Multi-Modality Image Inpainting using Generative Adversarial Networks
Comments: to be published in the Proceedings of 26th Int'l Conf on Image Processing, Computer Vision, & Pattern Recognition (IPCV), July 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1523]  arXiv:2206.09309 (cross-list from eess.IV) [pdf, other]
Title: TBraTS: Trusted Brain Tumor Segmentation
Comments: 11 pages, 4 figures, Accepted by MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1524]  arXiv:2206.09611 (cross-list from eess.IV) [pdf, other]
Title: SJ-HD^2R: Selective Joint High Dynamic Range and Denoising Imaging for Dynamic Scenes
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1525]  arXiv:2206.09867 (cross-list from eess.SP) [pdf, other]
Title: WiFi-based Spatiotemporal Human Action Perception
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[1526]  arXiv:2206.10152 (cross-list from physics.optics) [pdf, ps, other]
Title: Diffractive Interconnects: All-Optical Permutation Operation Using Diffractive Networks
Comments: 22 Pages, 6 Figures
Journal-ref: Nanophotonics (2022)
Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1527]  arXiv:2206.10183 (cross-list from eess.IV) [pdf, ps, other]
Title: covEcho Resource constrained lung ultrasound image analysis tool for faster triaging and active learning
Comments: Submitted to Elsevier CMPBUP on Dec 1, 2021
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1528]  arXiv:2206.10286 (cross-list from eess.IV) [pdf, other]
Title: Position-prior Clustering-based Self-attention Module for Knee Cartilage Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1529]  arXiv:2206.10294 (cross-list from eess.IV) [pdf, other]
Title: Using the Polar Transform for Efficient Deep Learning-Based Aorta Segmentation in CTA Images
Comments: Accepted to 64th International Symposium ELMAR-2022, Zadar, Croatia
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1530]  arXiv:2206.10357 (cross-list from eess.IV) [pdf, other]
Title: Confidence-Guided Unsupervised Domain Adaptation for Cerebellum Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1531]  arXiv:2206.10455 (cross-list from eess.IV) [src]
Title: Automated Coronary Calcium Scoring using U-Net Models through Semi-supervised Learning on Non-Gated CT Scans
Authors: Sanskriti Singh
Comments: There is no correlation between gated and non-gated CT scans causing the points used in the training and results to be flawed. It was inaccurately assumed that there was a correlation between the scans
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1532]  arXiv:2206.10543 (cross-list from eess.IV) [pdf, other]
Title: Faster Diffusion Cardiac MRI with Deep Learning-based breath hold reduction
Comments: 15 pages, 1 figures, 2 tables. To be published in MIUA22
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1533]  arXiv:2206.10750 (cross-list from eess.SP) [pdf, other]
Title: Floor Map Reconstruction Through Radio Sensing and Learning By a Large Intelligent Surface
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[1534]  arXiv:2206.10802 (cross-list from eess.IV) [pdf, other]
Title: SVoRT: Iterative Transformer for Slice-to-Volume Registration in Fetal Brain MRI
Comments: Accepted by MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1535]  arXiv:2206.10810 (cross-list from eess.IV) [pdf, other]
Title: A Simple Baseline for Video Restoration with Grouped Spatial-temporal Shift
Comments: Accepted to CVPR2023
Journal-ref: 2023 Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1536]  arXiv:2206.10911 (cross-list from eess.IV) [pdf, other]
Title: Influence of uncertainty estimation techniques on false-positive reduction in liver lesion detection
Comments: Accepted for publication in the Journal of Machine Learning for Biomedical Imaging (MELBA)
Journal-ref: https://www.melba-journal.org/papers/2022:030.html
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1537]  arXiv:2206.10912 (cross-list from eess.IV) [pdf, ps, other]
Title: AI-based software for lung nodule detection in chest X-rays -- Time for a second reader approach?
Comments: This paper is in submission process to the European Radiology journal
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1538]  arXiv:2206.11048 (cross-list from eess.IV) [pdf, other]
Title: Automated GI tract segmentation using deep learning
Authors: Manhar Sharma
Comments: 8 pages, 9 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1539]  arXiv:2206.11127 (cross-list from eess.IV) [pdf, ps, other]
Title: CNN-based fully automatic wrist cartilage volume quantification in MR Image
Comments: 17 pages, 6 Figures, 6 Tables, 1 Suplementary
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1540]  arXiv:2206.11458 (cross-list from eess.IV) [pdf, other]
Title: Weighted Concordance Index Loss-based Multimodal Survival Modeling for Radiation Encephalopathy Assessment in Nasopharyngeal Carcinoma Radiotherapy
Comments: 11 pages, 3 figures, MICCAI2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1541]  arXiv:2206.11501 (cross-list from eess.IV) [pdf, other]
Title: A novel adversarial learning strategy for medical image classification
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1542]  arXiv:2206.11599 (cross-list from eess.IV) [pdf, other]
Title: Universal Learned Image Compression With Low Computational Cost
Comments: 5 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1543]  arXiv:2206.11669 (cross-list from physics.ao-ph) [pdf, other]
Title: Short-range forecasts of global precipitation using deep learning-augmented numerical weather prediction
Comments: Accepted at Tackling Climate Change with Machine Learning: workshop at NeurIPS 2022
Subjects: Atmospheric and Oceanic Physics (physics.ao-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1544]  arXiv:2206.11943 (cross-list from eess.IV) [pdf, other]
Title: TIAger: Tumor-Infiltrating Lymphocyte Scoring in Breast Cancer for the TiGER Challenge
Comments: TiGER Challenge entry
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1545]  arXiv:2206.12112 (cross-list from eess.IV) [pdf, other]
Title: Dissecting U-net for Seismic Application: An In-Depth Study on Deep Learning Multiple Removal
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1546]  arXiv:2206.12136 (cross-list from eess.IV) [pdf, other]
Title: Feature Representation Learning for Robust Retinal Disease Detection from Optical Coherence Tomography Images
Comments: Accepted to MICCAI2022 Ophthalmic Medical Image Analysis (OMIA) Workshop
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1547]  arXiv:2206.12300 (cross-list from eess.IV) [pdf, ps, other]
Title: Automatic extraction of coronary arteries using deep learning in invasive coronary angiograms
Comments: 22 pages,5 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1548]  arXiv:2206.12344 (cross-list from eess.IV) [pdf, other]
Title: Segmentation-free PVC for Cardiac SPECT using a Densely-connected Multi-dimensional Dynamic Network
Comments: 12 pages, 11 figures. Accepted for publication at IEEE Transactions on Medical Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1549]  arXiv:2206.12407 (cross-list from eess.IV) [pdf, ps, other]
Title: Independent evaluation of state-of-the-art deep networks for mammography
Comments: 17 pages, 8 figures, 4 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1550]  arXiv:2206.12417 (cross-list from eess.IV) [pdf, other]
Title: Deep embedded clustering algorithm for clustering PACS repositories
Journal-ref: Proceedings of the 2021 IEEE 34th International Symposium on Computer-Based Medical Systems (CBMS)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1551]  arXiv:2206.12512 (cross-list from eess.IV) [pdf, other]
Title: Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings
Comments: Accepted at MedIA (Medical Image Analysis)
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1552]  arXiv:2206.12809 (cross-list from eess.SP) [pdf, other]
Title: Role and Integration of Image Processing Systems in Maritime Target Tracking
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[1553]  arXiv:2206.12815 (cross-list from eess.IV) [pdf, other]
Title: Breast Cancer Classification using Deep Learned Features Boosted with Handcrafted Features
Journal-ref: Biomedical Signal Processing and Control 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1554]  arXiv:2206.12980 (cross-list from eess.IV) [pdf, ps, other]
Title: Detecting Schizophrenia with 3D Structural Brain MRI Using Deep Learning
Comments: 13 pages, 6 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1555]  arXiv:2206.13086 (cross-list from stat.ML) [pdf, other]
Title: RankSEG: A Consistent Ranking-based Framework for Segmentation
Authors: Ben Dai, Chunlin Li
Comments: 50 pages
Journal-ref: Journal of Machine Learning Research, 24(224), 1-50 (2023)
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Statistics Theory (math.ST)
[1556]  arXiv:2206.13123 (cross-list from eess.IV) [pdf, other]
Title: Unsupervised Domain Adaptation Using Feature Disentanglement And GCNs For Medical Image Classification
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1557]  arXiv:2206.13173 (cross-list from eess.IV) [pdf, ps, other]
Title: Context-Aware Transformers For Spinal Cancer Detection and Radiological Grading
Comments: Pre-print of paper accepted to MICCAI 2022. 15 pages, 7 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1558]  arXiv:2206.13295 (cross-list from eess.IV) [pdf, other]
Title: Diffusion Deformable Model for 4D Temporal Medical Image Generation
Comments: Accepted for MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1559]  arXiv:2206.13385 (cross-list from eess.IV) [pdf, other]
Title: 3D unsupervised anomaly detection and localization through virtual multi-view projection and reconstruction: Clinical validation on low-dose chest computed tomography
Comments: Kyung-Su Kim and Seong Je Oh have contributed equally to this work as the co-first author. Kyung-Su Kim (kskim.doc@gmail.com) and Myung Jin Chung (mj1.chung@samsung.com) have contributed equally to this work as the co-corresponding author
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1560]  arXiv:2206.13393 (cross-list from eess.IV) [pdf, other]
Title: Cross-Modal Transformer GAN: A Brain Structure-Function Deep Fusing Framework for Alzheimer's Disease
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[1561]  arXiv:2206.13394 (cross-list from eess.IV) [pdf, other]
Title: CS$^2$: A Controllable and Simultaneous Synthesizer of Images and Annotations with Minimal Human Intervention
Comments: 11 figures, Accepted by MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1562]  arXiv:2206.13419 (cross-list from eess.IV) [pdf, other]
Title: DeStripe: A Self2Self Spatio-Spectral Graph Neural Network with Unfolded Hessian for Stripe Artifact Removal in Light-sheet Microscopy
Comments: Accepted by 25th International Conference on Medical Image Computing and Computer Assisted Intervention
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1563]  arXiv:2206.13455 (cross-list from eess.IV) [pdf, other]
Title: IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments
Comments: Accepted for publication in the Journal of Intelligent & Robotic Systems
Journal-ref: J Intell Robot Syst 106, 53 (2022)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1564]  arXiv:2206.13468 (cross-list from math.AG) [pdf, ps, other]
Title: An Atlas for the Pinhole Camera
Comments: 47 pages with references and appendices, final version
Journal-ref: JFoCM, 2022
Subjects: Algebraic Geometry (math.AG); Computer Vision and Pattern Recognition (cs.CV); Commutative Algebra (math.AC)
[1565]  arXiv:2206.13504 (cross-list from eess.IV) [pdf, other]
Title: AI-based computer-aided diagnostic system of chest digital tomography synthesis: Demonstrating comparative advantage with X-ray-based AI systems
Comments: Kyung-Su Kim, Ju Hwan Lee, and Seong Je Oh have contributed equally to this work as the co-first author. Kyung-Su Kim (kskim.doc@gmail.com) and Myung Jin Chung (mj1.chung@samsung.com) have contributed equally to this work as the co-corresponding author
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1566]  arXiv:2206.13505 (cross-list from eess.IV) [pdf, other]
Title: Deep Learning-Based Defect Classification and Detection in SEM Images
Journal-ref: In Metrology, Inspection, and Process Control XXXVI, SPIE (2022)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1567]  arXiv:2206.13506 (cross-list from eess.IV) [pdf, other]
Title: Tensor Recovery Based on A Novel Non-convex Function Minimax Logarithmic Concave Penalty Function
Comments: arXiv admin note: substantial text overlap with arXiv:2201.12709, arXiv:2109.12257
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1568]  arXiv:2206.13613 (cross-list from eess.IV) [pdf, other]
Title: Flexible-Rate Learned Hierarchical Bi-Directional Video Compression With Motion Refinement and Frame-Level Bit Allocation
Comments: Accepted for publication in IEEE International Conference on Image Processing (ICIP 2022)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1569]  arXiv:2206.13632 (cross-list from eess.IV) [pdf, other]
Title: Omni-Seg: A Scale-aware Dynamic Network for Renal Pathological Image Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1570]  arXiv:2206.13740 (cross-list from eess.IV) [pdf, other]
Title: GAN-based Super-Resolution and Segmentation of Retinal Layers in Optical coherence tomography Scans
Comments: 5 pages,7 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1571]  arXiv:2206.13872 (cross-list from stat.ML) [pdf, other]
Title: When are Post-hoc Conceptual Explanations Identifiable?
Comments: v5: UAI2023 camera-ready including supplementary material. The first two authors contributed equally
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1572]  arXiv:2206.13903 (cross-list from eess.IV) [pdf, other]
Title: AS-IntroVAE: Adversarial Similarity Distance Makes Robust IntroVAE
Comments: ACML conference paper
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1573]  arXiv:2206.14305 (cross-list from eess.IV) [pdf, ps, other]
Title: Multistep Automated Data Labelling Procedure (MADLaP) for Thyroid Nodules on Ultrasound: An Artificial Intelligence Approach for Automating Image Annotation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[1574]  arXiv:2206.14678 (cross-list from eess.IV) [pdf, other]
Title: BiometryNet: Landmark-based Fetal Biometry Estimation from Standard Ultrasound Planes
Comments: 13 pages, 6 figures, Accepted to MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1575]  arXiv:2206.14713 (cross-list from eess.IV) [pdf, other]
Title: CONVIQT: Contrastive Video Quality Estimator
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1576]  arXiv:2206.14746 (cross-list from eess.IV) [pdf, other]
Title: Placenta Segmentation in Ultrasound Imaging: Addressing Sources of Uncertainty and Limited Field-of-View
Comments: 21 pages (18 + appendix), 13 figures (9 + appendix)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1577]  arXiv:2206.14820 (cross-list from astro-ph.CO) [pdf, other]
Title: Strong Lensing Source Reconstruction Using Continuous Neural Fields
Comments: 9+2 pages, 3+2 figures, Spotlight at the Machine Learning for Astrophysics Workshop at ICML 2022; v2, references added
Subjects: Cosmology and Nongalactic Astrophysics (astro-ph.CO); Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1578]  arXiv:2206.14847 (cross-list from eess.IV) [pdf, other]
Title: Deep Reinforcement Learning for Small Bowel Path Tracking using Different Types of Annotations
Comments: Accepted to MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1579]  arXiv:2206.14861 (cross-list from eess.IV) [pdf, other]
Title: Two-Stage COVID19 Classification Using BERT Features
Comments: arXiv admin note: text overlap with arXiv:2106.14403
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1580]  arXiv:2206.14903 (cross-list from eess.IV) [pdf, other]
Title: CIRDataset: A large-scale Dataset for Clinically-Interpretable lung nodule Radiomics and malignancy prediction
Comments: MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1581]  arXiv:2206.14919 (cross-list from eess.IV) [pdf, other]
Title: Identifying and Combating Bias in Segmentation Networks by leveraging multiple resolutions
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1582]  arXiv:2206.14951 (cross-list from eess.IV) [pdf, other]
Title: CLTS-GAN: Color-Lighting-Texture-Specular Reflection Augmentation for Colonoscopy
Comments: MICCAI 2022. **First two authors contributed equally
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1583]  arXiv:2206.15069 (cross-list from eess.IV) [pdf, other]
Title: PVT-COV19D: Pyramid Vision Transformer for COVID-19 Diagnosis
Comments: 8 pages,1 figure
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1584]  arXiv:2206.15073 (cross-list from eess.IV) [pdf, other]
Title: COVID Detection and Severity Prediction with 3D-ConvNeXt and Custom Pretrainings
Comments: 17 pages, 3 figures, informations about challenge submission
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1585]  arXiv:2206.15134 (cross-list from eess.IV) [pdf, other]
Title: InsMix: Towards Realistic Generative Data Augmentation for Nuclei Instance Segmentation
Comments: Accepted by MICCAI 2022 (early accepted)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1586]  arXiv:2206.15179 (cross-list from eess.IV) [src]
Title: A Medical Image Fusion Method based on MDLatLRRv2
Comments: There are some errors that need to be corrected
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1587]  arXiv:2206.15182 (cross-list from eess.IV) [pdf, other]
Title: The (de)biasing effect of GAN-based augmentation methods on skin lesion images
Comments: Accepted to MICCAI2022
Journal-ref: In: Wang, L., Dou, Q., Fletcher, P.T., Speidel, S., Li, S. (eds) Medical Image Computing and Computer Assisted Intervention - MICCAI 2022. MICCAI 2022. Lecture Notes in Computer Science, vol 13438. Springer, Cham
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1588]  arXiv:2206.15217 (cross-list from eess.IV) [pdf, other]
Title: Implicit U-Net for volumetric medical image segmentation
Comments: 11 pages, 4 figures, Accepted MIUA 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1589]  arXiv:2206.15254 (cross-list from eess.IV) [pdf, other]
Title: Localizing the Recurrent Laryngeal Nerve via Ultrasound with a Bayesian Shape Framework
Comments: Early Accepted by MICCAI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1590]  arXiv:2206.15274 (cross-list from eess.IV) [pdf, other]
Title: Augment like there's no tomorrow: Consistently performing neural networks for medical imaging
Comments: Code for the paper is available from this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1591]  arXiv:2206.15431 (cross-list from eess.IV) [pdf, other]
Title: Ensemble CNN models for Covid-19 Recognition and Severity Perdition From 3D CT-scan
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[ total of 1594 entries: 1-1591 | 1592-1594 ]
[ showing 1591 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2404, contact, help  (Access key information)