Computer Vision and Pattern Recognition
Authors and titles for cs.CV in Jun 2022
[ total of 1593 entries: 1-1593 ][ showing 1593 entries per page: fewer | more ]
- [1] arXiv:2206.00048 [pdf, other]
-
Title: PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANsComments: Code available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [2] arXiv:2206.00069 [pdf, other]
-
Title: Comparing feature fusion strategies for Deep Learning-based kidney stone identificationAuthors: Elias Villalvazo-Avila, Francisco Lopez-Tiro, Daniel Flores-Araiza, Gilberto Ochoa-Ruiz, Jonathan El-Beze, Jacques Hubert, Christian DaulComments: 4 pages, 3 figures, XXVIII\`eme Colloque Francophone de Traitement du Signal et des ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [3] arXiv:2206.00092 [pdf, other]
-
Title: FHIST: A Benchmark for Few-shot Classification of Histological ImagesAuthors: Fereshteh Shakeri, Malik Boudiaf, Sina Mohammadi, Ivaxi Sheth, Mohammad Havaei, Ismail Ben Ayed, Samira Ebrahimi KahouComments: Code available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [4] arXiv:2206.00100 [pdf, other]
-
Title: VALHALLA: Visual Hallucination for Machine TranslationComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [5] arXiv:2206.00123 [pdf, other]
-
Title: Glo-In-One: Holistic Glomerular Detection, Segmentation, and Lesion Characterization with Large-scale Web Image MiningAuthors: Tianyuan Yao, Yuzhe Lu, Jun Long, Aadarsh Jha, Zheyu Zhu, Zuhayr Asad, Haichun Yang, Agnes B. Fogo, Yuankai HuoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [6] arXiv:2206.00148 [pdf, other]
-
Title: Hands-Up: Leveraging Synthetic Data for Hands-On-Wheel DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [7] arXiv:2206.00162 [pdf, other]
-
Title: PAGER: Progressive Attribute-Guided Extendable Robust Image GenerationComments: 19 pages, 12 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [8] arXiv:2206.00171 [pdf, other]
-
Title: Learning Sequential Contexts using Transformer for 3D Hand Pose EstimationComments: Accepted to ICPR'22Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [9] arXiv:2206.00181 [pdf, other]
-
Title: Labeling Where Adapting Fails: Cross-Domain Semantic Segmentation with Point Supervision via Active SelectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [10] arXiv:2206.00182 [pdf, other]
-
Title: Differentiable Soft-Masked AttentionComments: arXiv admin note: text overlap with arXiv:2112.09131Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [11] arXiv:2206.00205 [pdf, other]
-
Title: CAFA: Class-Aware Feature Alignment for Test-Time AdaptationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [12] arXiv:2206.00214 [pdf, other]
-
Title: LiDAR-MIMO: Efficient Uncertainty Estimation for LiDAR-based 3D Object DetectionComments: 8 pages, 4 figures and 5 tables. Accepted in IEEE IV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [13] arXiv:2206.00222 [pdf, other]
-
Title: Cross-domain Detection Transformer based on Spatial-aware and Semantic-aware Token AlignmentComments: Technical reportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [14] arXiv:2206.00227 [pdf, other]
-
Title: Rethinking the Augmentation Module in Contrastive Learning: Learning Hierarchical Augmentation Invariance with Expanded ViewsComments: Accepted to CVPR 2022Journal-ref: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [15] arXiv:2206.00244 [pdf, other]
-
Title: Fair Comparison between Efficient AttentionsComments: 4 pages abstractSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [16] arXiv:2206.00252 [pdf, other]
-
Title: Interpretable Deep Learning Classifier by Detection of Prototypical Parts on Kidney Stones ImagesAuthors: Daniel Flores-Araiza, Francisco Lopez-Tiro, Elias Villalvazo-Avila, Jonathan El-Beze, Jacques Hubert, Gilberto Ochoa-Ruiz, Christian DaulComments: Extended abstract accepted at LatinX in Computer Vision Research Workshop, at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [17] arXiv:2206.00272 [pdf, other]
-
Title: Vision GNN: An Image is Worth Graph of NodesComments: NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [18] arXiv:2206.00274 [pdf, other]
-
Title: Point-Teaching: Weakly Semi-Supervised Object Detection with Point AnnotationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [19] arXiv:2206.00280 [pdf, other]
-
Title: Automatic Bounding Box Annotation with Small Training Data Sets for Industrial ManufacturingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [20] arXiv:2206.00282 [pdf, other]
-
Title: Needle In A Haystack, Fast: Benchmarking Image Perceptual Similarity Metrics At ScaleComments: 26 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
- [21] arXiv:2206.00291 [pdf, other]
-
Title: Efficient Multi-Purpose Cross-Attention Based Image Alignment Block for Edge DevicesComments: Accepted into Embedded Vision Workshop 2022 of CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [22] arXiv:2206.00309 [pdf, other]
-
Title: Label-Efficient Online Continual Object Detection in Streaming VideoComments: PreprintSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [23] arXiv:2206.00311 [pdf, other]
-
Title: MaskOCR: Text Recognition with Masked Encoder-Decoder PretrainingAuthors: Pengyuan Lyu, Chengquan Zhang, Shanshan Liu, Meina Qiao, Yangliu Xu, Liang Wu, Kun Yao, Junyu Han, Errui Ding, Jingdong WangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [24] arXiv:2206.00343 [pdf, other]
-
Title: Towards view-invariant vehicle speed detection from driving simulator imagesComments: 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [25] arXiv:2206.00344 [pdf, other]
-
Title: Self-Supervised Learning as a Means To Reduce the Need for Labeled Data in Medical Image AnalysisComments: Accepted by 30th European Signal Processing Conference, EUSIPCO 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [26] arXiv:2206.00359 [pdf, other]
-
Title: DeepCluE: Enhanced Image Clustering via Multi-layer Ensembles in Deep Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [27] arXiv:2206.00364 [pdf, other]
-
Title: Elucidating the Design Space of Diffusion-Based Generative ModelsComments: NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
- [28] arXiv:2206.00384 [pdf, other]
-
Title: A Generalized Supervised Contrastive Learning FrameworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [29] arXiv:2206.00386 [pdf, other]
-
Title: DiVAE: Photorealistic Images Synthesis with Denoising Diffusion DecoderSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [30] arXiv:2206.00415 [pdf, other]
-
Title: Learning Invariant Visual Representations for Compositional Zero-Shot LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [31] arXiv:2206.00447 [pdf, other]
-
Title: CD$^2$: Fine-grained 3D Mesh Reconstruction with Twice Chamfer DistanceComments: under major review in TOMMSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [32] arXiv:2206.00468 [pdf, other]
-
Title: PanopticDepth: A Unified Framework for Depth-aware Panoptic SegmentationComments: CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [33] arXiv:2206.00481 [pdf, other]
-
Title: Where are my Neighbors? Exploiting Patches Relations in Self-Supervised Vision TransformerComments: Accepted to BMVC 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [34] arXiv:2206.00489 [pdf, other]
-
Title: Attack-Agnostic Adversarial DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [35] arXiv:2206.00491 [pdf, other]
-
Title: Semantic Room Wireframe Detection from a Single ViewComments: Accepted for ICPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [36] arXiv:2206.00506 [pdf, other]
-
Title: Proximally Sensitive Error for Anomaly Detection and Feature LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [37] arXiv:2206.00515 [pdf, other]
-
Title: Landslide4Sense: Reference Benchmark Data and Deep Learning Models for Landslide DetectionJournal-ref: IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-17, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [38] arXiv:2206.00527 [pdf, other]
-
Title: Amodal Cityscapes: A New Dataset, its Generation, and an Amodal Semantic Segmentation Challenge BaselineComments: This paper is accepted at IEEE Intelligent Vehicles Symposium 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [39] arXiv:2206.00535 [pdf, other]
-
Title: Deepfake Caricatures: Amplifying attention to artifacts increases deepfake detection by humans and machinesComments: 9 pages, 5 figures, 4 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Social and Information Networks (cs.SI)
- [40] arXiv:2206.00580 [pdf, other]
-
Title: Dog nose print matching with dual global descriptor based on Contrastive LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [41] arXiv:2206.00608 [pdf, other]
-
Title: On the Choice of Data for Efficient Training and Validation of End-to-End Driving ModelsAuthors: Marvin Klingner, Konstantin Müller, Mona Mirzaie, Jasmin Breitenstein, Jan-Aike Termöhlen, Tim FingscheidtComments: Accepted at CVPR VDU Workshop 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [42] arXiv:2206.00614 [pdf, other]
-
Title: Dual-stream spatiotemporal networks with feature sharing for monitoring animals in the home cageSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [43] arXiv:2206.00629 [pdf, other]
-
Title: CLIP4IDC: CLIP for Image Difference CaptioningComments: Accepted to AACL-IJCNLP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [44] arXiv:2206.00630 [pdf, other]
-
Title: Unifying Voxel-based Representation with Transformer for 3D Object DetectionComments: Accepted to NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [45] arXiv:2206.00645 [pdf, other]
-
Title: Extreme Floorplan Reconstruction by Structure-Hallucinating Transformer CascadesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [46] arXiv:2206.00665 [pdf, other]
-
Title: MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface ReconstructionComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [47] arXiv:2206.00718 [pdf, other]
-
Title: Context-Driven Detection of Invertebrate Species in Deep-Sea VideoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [48] arXiv:2206.00735 [pdf, other]
-
Title: Cascaded Video Generation for Videos In-the-WildComments: Accepted to the 26th International Conference on Pattern Recognition (ICPR 2022). arXiv admin note: substantial text overlap with arXiv:2106.02719Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [49] arXiv:2206.00746 [pdf, other]
-
Title: Residual Multiplicative Filter Networks for Multiscale ReconstructionComments: NeurIPS 2022, Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [50] arXiv:2206.00771 [pdf, other]
-
Title: Dynamic Linear Transformer for 3D Biomedical Image SegmentationComments: 8 PagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [51] arXiv:2206.00790 [pdf, other]
-
Title: Efficient Self-supervised Vision Pretraining with Local Masked ReconstructionComments: Add codeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [52] arXiv:2206.00798 [pdf, other]
-
Title: Multi-scale frequency separation network for image deblurringComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [53] arXiv:2206.00800 [pdf, other]
-
Title: CcHarmony: Color-checker based Image Harmonization DatasetSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [54] arXiv:2206.00806 [pdf, other]
-
Title: XBound-Former: Toward Cross-scale Boundary Modeling in TransformersAuthors: Jiacheng Wang, Fei Chen, Yuxi Ma, Liansheng Wang, Zhaodong Fei, Jianwei Shuai, Xiangdong Tang, Qichao Zhou, Jing QinComments: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [55] arXiv:2206.00812 [pdf, other]
-
Title: Modeling sRGB Camera Noise with Normalizing FlowsComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [56] arXiv:2206.00859 [pdf, other]
-
Title: Disentangled Generation Network for Enlarged License Plate Recognition and A Unified DatasetComments: Submission to TIPSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [57] arXiv:2206.00878 [pdf, other]
-
Title: EfficientNeRF: Efficient Neural Radiance FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [58] arXiv:2206.00893 [pdf, other]
-
Title: Leveraging Systematic Knowledge of 2D TransformationsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [59] arXiv:2206.00897 [pdf, other]
-
Title: xView3-SAR: Detecting Dark Fishing Activity Using Synthetic Aperture Radar ImageryAuthors: Fernando Paolo, Tsu-ting Tim Lin, Ritwik Gupta, Bryce Goodman, Nirav Patel, Daniel Kuster, David Kroodsma, Jared DunnmonComments: Accepted to NeurIPS 2022. 10 pages (25 with references and supplement)Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [60] arXiv:2206.00902 [pdf, other]
-
Title: MISSU: 3D Medical Image Segmentation via Self-distilling TransUNetSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [61] arXiv:2206.00923 [pdf, other]
-
Title: Modeling Image Composition for Complex Scene GenerationComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [62] arXiv:2206.00924 [pdf, other]
-
Title: FACM: Correct the Output of Deep Neural Network with Middle Layers Features against Adversarial SamplesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [63] arXiv:2206.00930 [pdf, other]
-
Title: Predicting Physical Object Properties from VideoComments: accepted for International Joint Conference on Neural Networks (IJCNN) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [64] arXiv:2206.00947 [pdf, other]
-
Title: A Bhattacharyya Coefficient-Based Framework for Noise Model-Aware Random Walker Image SegmentationComments: Dominik Drees and Florian Eilers contributed equally to this workSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [65] arXiv:2206.00960 [pdf, other]
-
Title: SparseDet: Towards End-to-End 3D Object DetectionJournal-ref: Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, pp. 781- 792. Feb. 6-8, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [66] arXiv:2206.00971 [pdf, other]
-
Title: CVM-Cervix: A Hybrid Cervical Pap-Smear Image Classification Framework Using CNN, Visual Transformer and Multilayer PerceptronAuthors: Wanli Liu, Chen Li, Ning Xu, Tao Jiang, Md Mamunur Rahaman, Hongzan Sun, Xiangchen Wu, Weiming Hu, Haoyuan Chen, Changhao Sun, Yudong Yao, Marcin GrzegorzekSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [67] arXiv:2206.00997 [pdf, other]
-
Title: Is Mapping Necessary for Realistic PointGoal Navigation?Authors: Ruslan Partsey, Erik Wijmans, Naoki Yokoyama, Oles Dobosevych, Dhruv Batra, Oleksandr MaksymetsComments: Corrected typos in the AbstractSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [68] arXiv:2206.01009 [pdf, other]
-
Title: Unified Recurrence Modeling for Video Action AnticipationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [69] arXiv:2206.01010 [pdf, other]
-
Title: Long-tailed Recognition by Learning from Latent CategoriesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [70] arXiv:2206.01014 [pdf, other]
-
Title: Suggestive Annotation of Brain MR Images with Gradient-guided SamplingComments: Manuscript accepted by MedIASubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [71] arXiv:2206.01017 [pdf, other]
-
Title: Structured Two-stream Attention Network for Video Question AnsweringSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [72] arXiv:2206.01034 [pdf]
-
Title: Adversarial Laser Spot: Robust and Covert Physical Adversarial Attack to DNNsAuthors: Chengyin HuSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [73] arXiv:2206.01038 [pdf, other]
-
Title: A Survey on Video Action Recognition in Sports: Datasets, Methods and ApplicationsAuthors: Fei Wu, Qingzhong Wang, Jian Bian, Haoyi Xiong, Ning Ding, Feixiang Lu, Jun Cheng, Dejing DouComments: 26 pages. The toolbox is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [74] arXiv:2206.01061 [pdf, other]
-
Title: FV-UPatches: Enhancing Universality in Finger Vein RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [75] arXiv:2206.01062 [pdf, other]
-
Title: DocLayNet: A Large Human-Annotated Dataset for Document-Layout AnalysisComments: 9 pages, 6 figures, 5 tables. Accepted paper at SIGKDD 2022 conferenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [76] arXiv:2206.01102 [pdf, other]
-
Title: A temporal chrominance trigger for clean-label backdoor attack against anti-spoof rebroadcast detectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
- [77] arXiv:2206.01125 [pdf, other]
-
Title: Prefix Conditioning Unifies Language and Label SupervisionAuthors: Kuniaki Saito, Kihyuk Sohn, Xiang Zhang, Chun-Liang Li, Chen-Yu Lee, Kate Saenko, Tomas PfisterSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [78] arXiv:2206.01127 [pdf, other]
-
Title: VL-BEiT: Generative Vision-Language PretrainingSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [79] arXiv:2206.01136 [pdf, other]
-
Title: Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectivesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [80] arXiv:2206.01153 [pdf, other]
-
Title: Multi-View Active Fine-Grained RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [81] arXiv:2206.01160 [pdf, other]
-
Title: DE-Net: Dynamic Text-guided Image Editing Adversarial NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [82] arXiv:2206.01161 [pdf, other]
-
Title: Optimizing Relevance Maps of Vision Transformers Improves RobustnessSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [83] arXiv:2206.01191 [pdf, other]
-
Title: EfficientFormer: Vision Transformers at MobileNet SpeedAuthors: Yanyu Li, Geng Yuan, Yang Wen, Ju Hu, Georgios Evangelidis, Sergey Tulyakov, Yanzhi Wang, Jian RenSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [84] arXiv:2206.01198 [pdf, other]
- [85] arXiv:2206.01201 [pdf, other]
-
Title: REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question AnsweringComments: Accepted by NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [86] arXiv:2206.01202 [pdf, other]
-
Title: Unveiling The Mask of Position-Information Pattern Through the Mist of Image FeaturesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [87] arXiv:2206.01203 [pdf, other]
-
Title: Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation Using Bounding BoxesComments: Project page: this https URLJournal-ref: European Conference on Computer Vision (ECCV), 2022, Oral PresentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [88] arXiv:2206.01204 [pdf, other]
-
Title: Siamese Image Modeling for Self-Supervised Vision Representation LearningAuthors: Chenxin Tao, Xizhou Zhu, Weijie Su, Gao Huang, Bin Li, Jie Zhou, Yu Qiao, Xiaogang Wang, Jifeng DaiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [89] arXiv:2206.01232 [pdf, other]
-
Title: What Are Expected Queries in End-to-End Object Detection?Comments: The source code is publicly available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [90] arXiv:2206.01244 [pdf, other]
-
Title: Real-Time Portrait Stylization on the EdgeSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [91] arXiv:2206.01256 [pdf, other]
-
Title: PETRv2: A Unified Framework for 3D Perception from Multi-Camera ImagesAuthors: Yingfei Liu, Junjie Yan, Fan Jia, Shuailin Li, Aqi Gao, Tiancai Wang, Xiangyu Zhang, Jian SunComments: Adding 3D lane detection results on OpenLane DatasetSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [92] arXiv:2206.01290 [pdf, other]
-
Title: Points2NeRF: Generating Neural Radiance Fields from 3D point cloudComments: arXiv admin note: text overlap with arXiv:2003.08934 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [93] arXiv:2206.01297 [pdf, other]
-
Title: Lossless Compression of Point Cloud Sequences Using Sequence Optimized CNN ModelsComments: 9 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [94] arXiv:2206.01309 [pdf, other]
-
Title: H-EMD: A Hierarchical Earth Mover's Distance Method for Instance SegmentationAuthors: Peixian Liang, Yizhe Zhang, Yifan Ding, Jianxu Chen, Chinedu S. Madukoma, Tim Weninger, Joshua D. Shrout, Danny Z. ChenComments: Accepted at IEEE Transactions On Medical Imaging (TMI)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [95] arXiv:2206.01319 [pdf, other]
-
Title: Learning Unbiased Transferability for Domain Adaptation by Uncertainty ModelingComments: This paper has been accepted by ECCV2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [96] arXiv:2206.01326 [pdf, other]
-
Title: Improving Fairness in Large-Scale Object Recognition by CrowdSourced Demographic InformationAuthors: Zu Kim, André Araujo, Bingyi Cao, Cam Askew, Jack Sim, Mike Green, N'Mah Fodiatu Yilla, Tobias WeyandSubjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
- [97] arXiv:2206.01327 [pdf, other]
-
Title: RELAY: Robotic EyeLink AnalYsis of the EyeLink 1000 using an Artificial EyeComments: 12 Pages, 17 Figures, 2 Tables. Git Repository: this https URL Appendix Repository: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [98] arXiv:2206.01334 [pdf, other]
-
Title: Long Scale Error Control in Low Light Image and Video Enhancement Using EquivarianceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [99] arXiv:2206.01365 [pdf, other]
-
Title: Adversarial Attacks on Human VisionComments: 21 pages, 8 figures, 1 tableJournal-ref: Extended version of IEEE MultiMedia, vol. 23, no. 1, pp. 82-91, Jan.-Mar. 2016Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [100] arXiv:2206.01369 [pdf, other]
-
Title: Incremental Learning Meets Transfer Learning: Application to Multi-site Prostate MRI SegmentationAuthors: Chenyu You, Jinlin Xiang, Kun Su, Xiaoran Zhang, Siyuan Dong, John Onofrey, Lawrence Staib, James S. DuncanSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [101] arXiv:2206.01370 [pdf, other]
-
Title: Slot Order Matters for Compositional Scene UnderstandingComments: 30 pages, 17 figures. Code and videos available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [102] arXiv:2206.01381 [pdf, other]
-
Title: CF-YOLO: Cross Fusion YOLO for Object Detection in Adverse Weather with a High-quality Real Snow DatasetAuthors: Qiqi Ding, Peng Li, Xuefeng Yan, Ding Shi, Luming Liang, Weiming Wang, Haoran Xie, Jonathan Li, Mingqiang WeiComments: 10pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [103] arXiv:2206.01384 [pdf]
-
Title: End-to-End 3D Hand Pose Estimation from Stereo CamerasSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [104] arXiv:2206.01408 [pdf, other]
-
Title: MetaLR: Layer-wise Learning Rate based on Meta-Learning for Adaptively Fine-tuning Medical Pre-trained ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [105] arXiv:2206.01417 [pdf, other]
-
Title: Learning an Adaptation Function to Assess Image Visual SimilaritiesAuthors: Olivier Risser-Maroix (LIPADE), Amine Marzouki (LIPADE), Hala Djeghim (LIPADE), Camille Kurtz (LIPADE), Nicolas Lomenie (LIPADE)Journal-ref: ORASIS 2021, Centre National de la Recherche Scientifique [CNRS], Sep 2021, Saint Ferr{\'e}ol, FranceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [106] arXiv:2206.01429 [pdf, other]
-
Title: Learning rich optical embeddings for privacy-preserving lensless image classificationComments: 29 pages, 23 figures, under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [107] arXiv:2206.01441 [pdf, other]
-
Title: Exploring Transformers for Behavioural Biometrics: A Case Study in Gait RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [108] arXiv:2206.01466 [pdf, other]
-
Title: Zero-Shot Bird Species Recognition by Learning from Field GuidesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [109] arXiv:2206.01467 [pdf, other]
-
Title: The Importance of Image Interpretation: Patterns of Semantic Misclassification in Real-World Adversarial ImagesComments: International Conference on Multimedia Modeling (MMM) 2023. Resources are publicly available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [110] arXiv:2206.01473 [pdf, other]
-
Title: Distributional loss for convolutional neural network regression and application to GNSS multi-path estimationSubjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [111] arXiv:2206.01498 [pdf]
-
Title: YOLOv5s-GTB: light-weighted and improved YOLOv5s for bridge crack detectionAuthors: Xiao RuiqiangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [112] arXiv:2206.01524 [pdf, other]
-
Title: Anomaly detection in surveillance videos using transformer based attention modelSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [113] arXiv:2206.01627 [pdf, other]
-
Title: Pruning for Interpretable, Feature-Preserving Circuits in CNNsComments: Under ReviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [114] arXiv:2206.01646 [pdf, other]
-
Title: Rethinking Positive Sampling for Contrastive Learning with KernelSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [115] arXiv:2206.01651 [pdf, other]
-
Title: D'ARTAGNAN: Counterfactual Video GenerationAuthors: Hadrien Reynaud, Athanasios Vlontzos, Mischa Dombrowski, Ciarán Lee, Arian Beqiri, Paul Leeson, Bernhard KainzComments: Accepted for MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [116] arXiv:2206.01653 [pdf, other]
-
Title: Metrics reloaded: Pitfalls and recommendations for image analysis validationAuthors: Lena Maier-Hein, Annika Reinke, Patrick Godau, Minu D. Tizabi, Evangelia Christodoulou, Ben Glocker, Fabian Isensee, Jens Kleesiek, Michal Kozubek, Mauricio Reyes, Michael A. Riegler, Manuel Wiesenfarth, Michael Baumgartner, Matthias Eisenmann, Doreen Heckmann-Nötzel, A. Emre Kavur, Tim Rädsch, Laura Acion, Michela Antonelli, Tal Arbel, Spyridon Bakas, Peter Bankhead, Arriel Benis, M. Jorge Cardoso, Veronika Cheplygina, Beth Cimini, Gary S. Collins, Keyvan Farahani, Luciana Ferrer, Adrian Galdran, Bram van Ginneken, Robert Haase, Daniel A. Hashimoto, Michael M. Hoffman, Merel Huisman, Pierre Jannin, Charles E. Kahn, Dagmar Kainmueller, Bernhard Kainz, Alexandros Karargyris, Alan Karthikesalingam, Hannes Kenngott, Florian Kofler, Annette Kopp-Schneider, Anna Kreshuk, Tahsin Kurc, et al. (27 additional authors not shown)Comments: Shared first authors: Lena Maier-Hein, Annika ReinkeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [117] arXiv:2206.01658 [pdf]
-
Title: Identification via Retinal Vessels Combining LBP and HOGAuthors: Ali NooriSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [118] arXiv:2206.01661 [pdf, other]
-
Title: Style-Content Disentanglement in Language-Image Pretraining Representations for Zero-Shot Sketch-to-Image SynthesisAuthors: Jan ZuiderveldSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [119] arXiv:2206.01670 [pdf, other]
-
Title: Egocentric Video-Language PretrainingAuthors: Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rongcheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng ShouComments: Accepted by NeurIPS 2022. Double champions at Ego4D and EPIC-Kitchens, CVPR 2022 challenges. 23 pages, 13 figures, 12 tables. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [120] arXiv:2206.01705 [pdf, other]
-
Title: Gradient Obfuscation Checklist Test Gives a False Sense of SecuritySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [121] arXiv:2206.01714 [pdf, other]
-
Title: Compositional Visual Generation with Composable Diffusion ModelsComments: ECCV 2022. First three authors contributed equally. Project website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [122] arXiv:2206.01718 [pdf, other]
-
Title: A-OKVQA: A Benchmark for Visual Question Answering using World KnowledgeSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [123] arXiv:2206.01720 [pdf, other]
-
Title: Revisiting the "Video" in Video-Language UnderstandingAuthors: Shyamal Buch, Cristóbal Eyzaguirre, Adrien Gaidon, Jiajun Wu, Li Fei-Fei, Juan Carlos NieblesComments: CVPR 2022 (Oral)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [124] arXiv:2206.01724 [pdf, other]
-
Title: SNAKE: Shape-aware Neural 3D Keypoint FieldAuthors: Chengliang Zhong, Peixing You, Xiaoxue Chen, Hao Zhao, Fuchun Sun, Guyue Zhou, Xiaodong Mu, Chuang Gan, Wenbing HuangComments: Accepted by NeurIPS 2022. Codes are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [125] arXiv:2206.01733 [pdf, other]
-
Title: Adversarial RAW: Image-Scaling Attack Against Imaging PipelineSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [126] arXiv:2206.01734 [pdf]
-
Title: Using UAS Imagery and Computer Vision to Support Site-Specific Weed Control in CornComments: 16 Figures, 3 Tables,. arXiv admin note: substantial text overlap with arXiv:2204.12417Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
- [127] arXiv:2206.01772 [pdf, other]
-
Title: Radar Guided Dynamic Visual Attention for Resource-Efficient RGB Object DetectionComments: Accepted in International Joint Conference on Neural Networks (IJCNN) 2022Journal-ref: 2022 International Joint Conference on Neural Networks (IJCNN)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [128] arXiv:2206.01777 [pdf, other]
-
Title: Real-Time Super-Resolution for Real-World Images on Mobile DevicesComments: arXiv admin note: text overlap with arXiv:2004.13674Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [129] arXiv:2206.01794 [pdf, other]
-
Title: Additive MIL: Intrinsically Interpretable Multiple Instance Learning for PathologyAuthors: Syed Ashar Javed, Dinkar Juyal, Harshith Padigela, Amaro Taylor-Weiner, Limin Yu, Aaditya PrakashSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [130] arXiv:2206.01813 [pdf, other]
-
Title: Learning sRGB-to-Raw-RGB De-rendering with Content-Aware MetadataComments: CVPR 2022 (GitHub: this https URL)Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [131] arXiv:2206.01821 [pdf, other]
-
Title: EAANet: Efficient Attention Augmented Convolutional NetworksComments: 8 pages, 4 figures. Not publishedSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [132] arXiv:2206.01831 [pdf, other]
-
Title: Spatial Feature Mapping for 6DoF Object Pose EstimationComments: Pattern RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [133] arXiv:2206.01841 [pdf, other]
-
Title: Coffee Roast IntelligenceComments: 6 pages, 13 figures, 3 tables, this work was presented at the CSC498 COMPUTER SCIENCE CAPSTONE PROJECT I and CSC499 COMPUTER SCIENCE CAPSTONE PROJECT II coursesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [134] arXiv:2206.01843 [pdf, other]
-
Title: Visual Clues: Bridging Vision and Language Foundations for Image Paragraph CaptioningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [135] arXiv:2206.01863 [pdf, other]
-
Title: Recursive Deformable Image Registration Network with Mutual AttentionAuthors: Jian-Qing Zheng, Ziyang Wang, Baoru Huang, Ngee Han Lim, Tonia Vincent, Bartlomiej W. PapiezComments: arXiv admin note: text overlap with arXiv:2203.04290Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [136] arXiv:2206.01867 [pdf, other]
-
Title: SPGNet: Spatial Projection Guided 3D Human Pose Estimation in Low Dimensional SpaceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [137] arXiv:2206.01881 [pdf, other]
-
Title: Face Recognition Accuracy Across Demographics: Shining a Light Into the ProblemSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [138] arXiv:2206.01884 [pdf]
-
Title: A Superimposed Divide-and-Conquer Image Recognition Method for SEM Images of Nanoparticles on The Surface of Monocrystalline silicon with High Aggregation DegreeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [139] arXiv:2206.01908 [pdf, other]
-
Title: Video-based Human-Object Interaction Detection from Tubelet TokensSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [140] arXiv:2206.01910 [pdf, other]
-
Title: The Spike Gating Flow: A Hierarchical Structure Based Spiking Neural Network for Online Gesture RecognitionAuthors: Zihao Zhao, Yanhong Wang, Qiaosha Zou, Tie Xu, Fangbo Tao, Jiansong Zhang, Xiaoan Wang, C.-J. Richard Shi, Junwen Luo, Yuan XieSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [141] arXiv:2206.01916 [pdf, other]
-
Title: Nerfels: Renderable Neural Codes for Improved Camera Pose EstimationAuthors: Gil Avraham, Julian Straub, Tianwei Shen, Tsun-Yi Yang, Hugo Germain, Chris Sweeney, Vasileios Balntas, David Novotny, Daniel DeTone, Richard NewcombeComments: Published at CVPRW with supplementary materialSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [142] arXiv:2206.01923 [pdf, other]
-
Title: From Pixels to Objects: Cubic Visual Attention for Visual Question AnsweringSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [143] arXiv:2206.01942 [pdf, other]
-
Title: Occlusion-Resistant Instance Segmentation of Piglets in Farrowing Pens Using Center Clustering NetworkAuthors: Endai Huang, Axiu Mao, Yongjian Wu, Haiming Gan, Maria Camila Ceballos, Thomas D. Parsons, Junhui Hou, Kai LiuComments: Accepted in CV4Animals Workshop of CVPR 2022 (IJCV journal track)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [144] arXiv:2206.01961 [pdf, other]
-
Title: C$^3$Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in ColonoscopySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [145] arXiv:2206.01986 [pdf, other]
-
Title: Delving into the Openness of CLIPComments: 22 pages, 12 figures. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [146] arXiv:2206.01988 [pdf, other]
-
Title: Cross-modal Clinical Graph Transformer for Ophthalmic Report GenerationComments: CVPR 2022 (Poster)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [147] arXiv:2206.01992 [pdf, other]
-
Title: CAINNFlow: Convolutional block Attention modules and Invertible Neural Networks Flow for anomaly detection and localization tasksAuthors: Ruiqing Yan, Fan Zhang, Mengyuan Huang, Wu Liu, Dongyu Hu, Jinfeng Li, Qiang Liu, Jinrong Jiang, Qianjin Guo, Linghan ZhengSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [148] arXiv:2206.01999 [pdf, other]
-
Title: MSR: Making Self-supervised learning Robust to Aggressive AugmentationsAuthors: Yingbin Bai, Erkun Yang, Zhaoqing Wang, Yuxuan Du, Bo Han, Cheng Deng, Dadong Wang, Tongliang LiuSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [149] arXiv:2206.02002 [pdf, other]
-
Title: CVNets: High Performance Library for Computer VisionComments: Technical reportSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [150] arXiv:2206.02015 [pdf, other]
-
Title: APES: Articulated Part Extraction from Sprite SheetsAuthors: Zhan Xu, Matthew Fisher, Yang Zhou, Deepali Aneja, Rushikesh Dudhat, Li Yi, Evangelos KalogerakisSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [151] arXiv:2206.02027 [pdf, other]
-
Title: Implicit Neural Representation for Mesh-Free Inverse Obstacle ScatteringComments: 6 pages, 8 figures, to be published in 2022 Asilomar Conference on Signals, Systems, and ComputersSubjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [152] arXiv:2206.02029 [pdf, other]
-
Title: Guided Deep Metric LearningAuthors: Jorge Gonzalez-Zapata, Ivan Reyes-Amezcua, Daniel Flores-Araiza, Mauricio Mendez-Ruiz, Gilberto Ochoa-Ruiz, Andres Mendez-VazquezSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [153] arXiv:2206.02050 [pdf, other]
-
Title: Learning Speaker-specific Lip-to-Speech GenerationComments: Accepted at ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [154] arXiv:2206.02066 [pdf, other]
-
Title: PIDNet: A Real-time Semantic Segmentation Network Inspired from PID ControllerComments: 11 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [155] arXiv:2206.02070 [pdf, other]
- [156] arXiv:2206.02082 [pdf, other]
-
Title: Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language RetrievalAuthors: Xudong Lin, Simran Tiwari, Shiyuan Huang, Manling Li, Mike Zheng Shou, Heng Ji, Shih-Fu ChangComments: Work in progressSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [157] arXiv:2206.02086 [pdf, other]
-
Title: Towards the Creation of a Nutrition and Food Group Based Image DatabaseAuthors: Zeman Shao, Jiangpeng He, Ya-Yuan Yu, Luotao Lin, Alexandra Cowan, Heather Eicher-Miller, Fengqing ZhuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [158] arXiv:2206.02087 [pdf, other]
-
Title: Accurate Scoliosis Vertebral Landmark Localization on X-ray Images via Shape-constrained Multi-stage Cascaded CNNsComments: 9 pages, submitted to IEEE Journal of Biomedical and Health InformaticsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [159] arXiv:2206.02099 [pdf, other]
-
Title: Point-to-Voxel Knowledge Distillation for LiDAR Semantic SegmentationComments: CVPR 2022; Our model ranks 1st on Waymo and SemanticKITTI (single-scan) challenges, and ranks 3rd on SemanticKITTI (multi-scan) challenge; Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [160] arXiv:2206.02104 [pdf, other]
-
Title: ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentencesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [161] arXiv:2206.02110 [pdf, other]
-
Title: Computer Vision-based Characterization of Large-scale Jet Flames using a Synthetic Infrared Image Generation ApproachAuthors: Carmina Pérez-Guerrero, Jorge Francisco Ciprián-Sánchez, Adriana Palacios, Gilberto Ochoa-Ruiz, Miguel Gonzalez-Mendoza, Vahid Foroughi, Elsa Pastor, Gerardo Rodriguez-HernandezComments: Pre-print submitted to Engineering Science and Technology, an International JournalSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [162] arXiv:2206.02116 [pdf, other]
-
Title: Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in VideosComments: Accepted to CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [163] arXiv:2206.02118 [pdf, other]
-
Title: ShapePU: A New PU Learning Framework Regularized by Global Consistency for Scribble Supervised Cardiac SegmentationComments: 11 pages,4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [164] arXiv:2206.02120 [pdf, other]
-
Title: MPANet: Multi-Patch Attention For Infrared Small Target object DetectionComments: 4 pages 3 figuresJournal-ref: 2022IGARSSSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [165] arXiv:2206.02136 [pdf, other]
-
Title: LDRNet: Enabling Real-time Document Localization on Mobile DevicesComments: In the proceedings of ECML-PKDD 2022 Workshop on IoT, Edge, and Mobile for Embedded Machine Learning (ITEM)Subjects: Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
- [166] arXiv:2206.02146 [pdf, other]
-
Title: Recurrent Video Restoration Transformer with Guided Deformable AttentionAuthors: Jingyun Liang, Yuchen Fan, Xiaoyu Xiang, Rakesh Ranjan, Eddy Ilg, Simon Green, Jiezhang Cao, Kai Zhang, Radu Timofte, Luc Van GoolComments: Accepted by NeurIPS 2022. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [167] arXiv:2206.02153 [pdf, other]
-
Title: HPGNN: Using Hierarchical Graph Neural Networks for Outdoor Point Cloud ProcessingAuthors: Arulmolivarman Thieshanthan, Amashi Niwarthana, Pamuditha Somarathne, Tharindu Wickremasinghe, Ranga RodrigoComments: Accepted for ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [168] arXiv:2206.02158 [pdf, other]
-
Title: Vanilla Feature Distillation for Improving the Accuracy-Robustness Trade-Off in Adversarial TrainingComments: 12 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [169] arXiv:2206.02163 [pdf, other]
-
Title: MotionCNN: A Strong Baseline for Motion Prediction in Autonomous DrivingComments: CVPR Workshop on Autonomous Driving 2021. Waymo Motion Prediction Challenge 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [170] arXiv:2206.02180 [pdf, other]
-
Title: Semi-Supervised Learning for Mars Imagery Classification and SegmentationComments: Accepted by ACM Trans. on Multimedia Computing Communications and Applications (TOMM)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [171] arXiv:2206.02187 [pdf, other]
-
Title: M2FNet: Multi-modal Fusion Network for Emotion Recognition in ConversationAuthors: Vishal Chudasama, Purbayan Kar, Ashish Gudmalwar, Nirmesh Shah, Pankaj Wasnik, Naoyuki OnoeComments: Accepted for publication in the 5th Multimodal Learning and Applications (MULA) Workshop at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [172] arXiv:2206.02194 [pdf, other]
-
Title: FOF: Learning Fourier Occupancy Field for Monocular Real-time Human ReconstructionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [173] arXiv:2206.02200 [pdf, other]
-
Title: GridShift: A Faster Mode-seeking Algorithm for Image Segmentation and Object TrackingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [174] arXiv:2206.02203 [pdf]
-
Title: 3D Convolutional with Attention for Action RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [175] arXiv:2206.02220 [pdf, other]
-
Title: U(1) Symmetry-breaking Observed in Generic CNN Bottleneck LayersSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [176] arXiv:2206.02234 [pdf, other]
-
Title: Two Decades of Bengali Handwritten Digit Recognition: A SurveyAuthors: A.B.M. Ashikur Rahman, Md. Bakhtiar Hasan, Sabbir Ahmed, Tasnim Ahmed, Md. Hamjajul Ashmafee, Mohammad Ridwan Kabir, Md. Hasanul KabirComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. 38 pages, 23 figures, 12 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [177] arXiv:2206.02257 [pdf, other]
-
Title: Efficient Annotation and Learning for 3D Hand Pose Estimation: A SurveySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [178] arXiv:2206.02260 [pdf, other]
-
Title: SealID: Saimaa ringed seal re-identification datasetAuthors: Ekaterina Nepovinnykh, Tuomas Eerola, Vincent Biard, Piia Mutka, Marja Niemi, Heikki Kälviäinen, Mervi KunnasrantaComments: 15 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Populations and Evolution (q-bio.PE)
- [179] arXiv:2206.02261 [pdf, other]
-
Title: Towards Individual Grevy's Zebra Identification via Deep 3D Fitting and Metric LearningComments: 4 pages, 5 figures, 1 table; typos corrected, references updatedSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [180] arXiv:2206.02270 [pdf, other]
-
Title: Estimating building energy efficiency from street view imagery, aerial imagery, and land surface temperature dataAuthors: Kevin Mayer, Lukas Haas, Tianyuan Huang, Juan Bernabé-Moreno, Ram Rajagopal, Martin FischerSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [181] arXiv:2206.02281 [pdf, other]
-
Title: E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial VehiclesAuthors: Zhenyu Hu, Zhenyu Wu, Pengcheng Pi, Yunhe Xue, Jiayi Shen, Jianchao Tan, Xiangru Lian, Zhangyang Wang, Ji LiuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [182] arXiv:2206.02288 [pdf, other]
-
Title: ACT: Semi-supervised Domain-adaptive Medical Image Segmentation with Asymmetric Co-trainingAuthors: Xiaofeng Liu, Fangxu Xing, Nadya Shusharina, Ruth Lim, C-C Jay Kuo, Georges El Fakhri, Jonghye WooComments: MICCAI 2022 (early accept)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [183] arXiv:2206.02295 [pdf, other]
-
Title: HIFI-Net: A Novel Network for Enhancement to Underwater ImagesComments: 7 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [184] arXiv:2206.02307 [pdf, other]
-
Title: Bootstrapping Semi-supervised Medical Image Segmentation with Anatomical-aware Contrastive DistillationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [185] arXiv:2206.02325 [pdf, other]
-
Title: Evaluation-oriented Knowledge Distillation for Deep Face RecognitionComments: CVPR2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [186] arXiv:2206.02327 [pdf, other]
-
Title: JigsawHSI: a network for Hyperspectral Image classificationComments: 7 pages, 7 figures, not peer reviewedSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [187] arXiv:2206.02331 [pdf]
-
Title: MASNet:Improve Performance of Siamese Networks with Mutual-attention for Remote Sensing Change Detection TasksComments: XXIV ISPRS CongressSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [188] arXiv:2206.02338 [pdf, other]
-
Title: OrdinalCLIP: Learning Rank Prompts for Language-Guided Ordinal RegressionComments: Accepted by NeurIPS2022. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [189] arXiv:2206.02342 [pdf, other]
-
Title: WHU-Stereo: A Challenging Benchmark for Stereo Matching of High-Resolution Satellite ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [190] arXiv:2206.02343 [pdf, other]
-
Title: Contrastive Graph Multimodal Model for Text Classification in VideosSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [191] arXiv:2206.02345 [pdf, other]
-
Title: Anomaly Detection with Test Time Augmentation and Consistency EvaluationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [192] arXiv:2206.02349 [pdf, other]
-
Title: Invariant Grounding for Video Question AnsweringComments: CVPR2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [193] arXiv:2206.02355 [pdf, other]
-
Title: Relation Matters: Foreground-aware Graph-based Relational Reasoning for Domain Adaptive Object DetectionAuthors: Chaoqi Chen, Jiongcheng Li, Hong-Yu Zhou, Xiaoguang Han, Yue Huang, Xinghao Ding, Yizhou YuComments: Accepted by IEEE T-PAMISubjects: Computer Vision and Pattern Recognition (cs.CV)
- [194] arXiv:2206.02366 [pdf, other]
-
Title: Scan2Part: Fine-grained and Hierarchical Part-level Understanding of Real-World 3D ScansAuthors: Alexandr Notchenko, Vladislav Ishimtsev, Alexey Artemov, Vadim Selyutin, Emil Bogomolov, Evgeny BurnaevComments: In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and ApplicationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [195] arXiv:2206.02373 [pdf, other]
-
Title: Sports Re-ID: Improving Re-Identification Of Players In Broadcast Videos Of Team SportsAuthors: Bharath ComandurSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [196] arXiv:2206.02374 [pdf, other]
-
Title: CorticalFlow: A Diffeomorphic Mesh Deformation Module for Cortical Surface ReconstructionAuthors: Léo Lebrat, Rodrigo Santa Cruz, Frédéric de Gournay, Darren Fu, Pierrick Bourgeat, Jurgen Fripp, Clinton Fookes, Olivier SalvadoSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [197] arXiv:2206.02377 [pdf, other]
-
Title: BInGo: Bayesian Intrinsic Groupwise Registration via Explicit Hierarchical DisentanglementSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [198] arXiv:2206.02392 [pdf]
-
Title: Semi-Supervised Segmentation of Mitochondria from Electron Microscopy Images Using Spatial ContinuityComments: 4 pages of main text, 5 pages of supplementary material and 1 page of referencesJournal-ref: 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI). IEEE, 2022: 1-5Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [199] arXiv:2206.02405 [src]
-
Title: Robust Image Protection Countering Cropping ManipulationComments: Redo some of the experiments to re-evaluate the role of KD-JPEG in robustnessSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [200] arXiv:2206.02424 [pdf]
-
Title: Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehiclesComments: 18 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [201] arXiv:2206.02452 [pdf, other]
-
Title: Universal Photometric Stereo Network using Global Lighting ContextsAuthors: Satoshi IkehataComments: Accepted to CVPR2022. Code and Dataset at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [202] arXiv:2206.02454 [pdf, other]
-
Title: Why do CNNs Learn Consistent Representations in their First Layer Independent of Labels and Architecture?Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [203] arXiv:2206.02498 [pdf, other]
-
Title: NORPPA: NOvel Ringed seal re-identification by Pelage Pattern AggregationComments: 22 pages, 13 figures, 5 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [204] arXiv:2206.02502 [pdf, other]
-
Title: BehavePassDB: Public Database for Mobile Behavioral Biometrics and Benchmark EvaluationComments: 11 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [205] arXiv:2206.02531 [pdf, other]
-
Title: 3D-Augmented Contrastive Knowledge Distillation for Image-based Object Pose EstimationComments: Accepted for presentation at International Conference on Multimedia Retrieval (ICMR '22)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [206] arXiv:2206.02539 [pdf, other]
-
Title: Robustness Evaluation and Adversarial Training of an Instance Segmentation ModelComments: 15 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [207] arXiv:2206.02544 [pdf, other]
-
Title: RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene GenerationComments: Accepted at the IEEE Winter Conference on Applications of Computer Vision, WACV 2022Journal-ref: 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022, pp. 2723-2732Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [208] arXiv:2206.02547 [pdf]
-
Title: Towards retrieving dispersion profiles using quantum-mimic Optical Coherence Tomography and Machine LearninComments: 11 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
- [209] arXiv:2206.02559 [pdf, other]
-
Title: Conversation Group Detection With Spatio-Temporal ContextSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [210] arXiv:2206.02564 [pdf, other]
-
Title: Machine Learning for Detection of 3D Features using sparse X-ray dataAuthors: Bradley T. Wolfe, Michael J. Falato, Xinhua Zhang, Nga T. T. Nguyen-Fotiadis, J.P. Sauppe, P. M. Kozlowski, P. A. Keiter, R. E. Reinovsky, S. A. Batha, Zhehui WangSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Data Analysis, Statistics and Probability (physics.data-an)
- [211] arXiv:2206.02573 [pdf, other]
- [212] arXiv:2206.02598 [pdf, other]
-
Title: [Reproducibility Report] Explainable Deep One-Class ClassificationComments: Submitted to the ML Reproducibility Challenge 2021 FallSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [213] arXiv:2206.02609 [pdf, other]
-
Title: Real-World Image Super-Resolution by Exclusionary Dual-LearningComments: IEEE TMM 2022; Considering large volume of RealSR datasets, a multi-dataset sampling scheme is developedSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [214] arXiv:2206.02619 [pdf, other]
-
Title: VPIT: Real-time Embedded Single Object 3D Tracking Using Voxel Pseudo ImagesAuthors: Illia Oleksiienko, Paraskevi Nousi, Nikolaos Passalis, Anastasios Tefas, Alexandros IosifidisComments: 10 pages, 5 figures, 4 tables. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [215] arXiv:2206.02622 [pdf, other]
-
Title: Hardware-accelerated Mars Sample Localization via deep transfer learning from photorealistic simulationsAuthors: Raúl Castilla-Arquillo, Carlos Jesús Pérez-del-Pulgar, Gonzalo Jesús Paz-Delgado, Levin GerdesComments: Preprint version only. Final version at IEEE Xplore. Accepted for IEEE Robotics and Automation LettersSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [216] arXiv:2206.02647 [pdf, other]
-
Title: Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised LearningAuthors: Richard J. Chen, Chengkuan Chen, Yicong Li, Tiffany Y. Chen, Andrew D. Trister, Rahul G. Krishnan, Faisal MahmoodComments: Accepted to CVPR 2022 (Oral)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [217] arXiv:2206.02664 [pdf, other]
-
Title: Learning with Capsules: A SurveyComments: 29 pages, 43 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [218] arXiv:2206.02680 [pdf, other]
-
Title: Separable Self-attention for Mobile Vision TransformersComments: Technical reportSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [219] arXiv:2206.02714 [pdf, other]
-
Title: FuSS: Fusing Superpixels for Improved Segmentation ConsistencyComments: submitted to IEEEACCESS. 19 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [220] arXiv:2206.02715 [pdf, other]
-
Title: Day-to-Night Image Synthesis for Training Nighttime Neural ISPsAuthors: Abhijith Punnappurath, Abdullah Abuolaim, Abdelrahman Abdelhamed, Alex Levinshtein, Michael S. BrownSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [221] arXiv:2206.02717 [pdf, other]
-
Title: Scene Aware Person Image Generation through Global Contextual ConditioningComments: Accepted in The International Conference on Pattern Recognition (ICPR) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [222] arXiv:2206.02721 [pdf, other]
- [223] arXiv:2206.02735 [pdf, other]
-
Title: People Tracking in Panoramic Video for Guiding RobotsComments: Accepted to 17th International Conference on Intelligent Autonomous Systems (IAS-17)Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [224] arXiv:2206.02749 [pdf, other]
-
Title: CORE: Consistent Representation Learning for Face Forgery DetectionComments: Accepted by CVPRW 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [225] arXiv:2206.02761 [pdf, other]
-
Title: Dual Decomposition of Convex Optimization Layers for Consistent Attention in Medical ImagesComments: 12 pages, 5 figures. In proceedings of the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022. Copyright 2022 by the author(s)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [226] arXiv:2206.02770 [pdf, other]
-
Title: Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of ExpertsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [227] arXiv:2206.02776 [pdf, other]
-
Title: Volumetric Disentanglement for 3D Scene ManipulationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [228] arXiv:2206.02777 [pdf, other]
-
Title: Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [229] arXiv:2206.02779 [pdf, other]
-
Title: Blended Latent DiffusionComments: Project page is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [230] arXiv:2206.02780 [pdf, other]
-
Title: GenSDF: Two-Stage Learning of Generalizable Signed Distance FunctionsSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [231] arXiv:2206.02846 [pdf, other]
-
Title: A Deeper Dive Into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic InformationAuthors: Matthew Kowal, Mennatullah Siam, Md Amirul Islam, Neil D. B. Bruce, Richard P. Wildes, Konstantinos G. DerpanisComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [232] arXiv:2206.02850 [pdf, other]
-
Title: GLF-CR: SAR-Enhanced Cloud Removal with Global-Local FusionSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [233] arXiv:2206.02876 [pdf, other]
-
Title: SpikiLi: A Spiking Simulation of LiDAR based Real-time Object Detection for Autonomous DrivingAuthors: Sambit Mohapatra, Thomas Mesquida, Mona Hodaei, Senthil Yogamani, Heinrich Gotzig, Patrick MaderComments: Accepted at Workshop on Event Sensing and Neuromorphic Engineering - 8th International Conference on Event-based Control, Communication, and Signal ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [234] arXiv:2206.02903 [pdf, other]
-
Title: Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph MapsComments: CVPR 2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [235] arXiv:2206.02912 [pdf]
-
Title: Learning Image Representations for Content Based Image Retrieval of Radiotherapy Treatment PlansAuthors: Charles Huang, Varun Vasudevan, Oscar Pastor-Serrano, Md Tauhidul Islam, Yusuke Nomura, Piotr Dubrowski, Jen-Yeu Wang, Joseph B. Schulz, Yong Yang, Lei XingSubjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [236] arXiv:2206.02967 [pdf, other]
-
Title: Masked Unsupervised Self-training for Zero-shot Image ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [237] arXiv:2206.02977 [pdf, other]
-
Title: DETR++: Taming Your Multi-Scale Detection TransformerComments: T4V: Transformers for Vision workshop @ CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [238] arXiv:2206.02985 [pdf, other]
-
Title: Structured Context Transformer for Generic Event Boundary DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [239] arXiv:2206.02997 [pdf, other]
-
Title: TadML: A fast temporal action detection with Mechanics-MLPComments: 8 pages,3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [240] arXiv:2206.03001 [pdf, other]
-
Title: PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR SystemAuthors: Chenxia Li, Weiwei Liu, Ruoyu Guo, Xiaoting Yin, Kaitao Jiang, Yongkun Du, Yuning Du, Lingfeng Zhu, Baohua Lai, Xiaoguang Hu, Dianhai Yu, Yanjun MaComments: arXiv admin note: text overlap with arXiv:2109.03144Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [241] arXiv:2206.03010 [pdf, other]
-
Title: MS-RNN: A Flexible Multi-Scale Framework for Spatiotemporal Predictive LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [242] arXiv:2206.03012 [pdf, other]
-
Title: TriBYOL: Triplet BYOL for Self-Supervised Representation LearningComments: Published as a conference paper at ICASSP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [243] arXiv:2206.03014 [pdf, other]
-
Title: The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph GenerationComments: Accepted by CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [244] arXiv:2206.03017 [pdf, other]
-
Title: Development of Automatic Endotracheal Tube and Carina Detection on Portable Supine Chest Radiographs using Artificial IntelligenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [245] arXiv:2206.03033 [pdf, other]
-
Title: Deep Learning Techniques for Visual CountingAuthors: Luca CiampiComments: Version with high-quality images can be found at this https URL arXiv admin note: text overlap with arXiv:1802.03601, arXiv:1707.01202, arXiv:1809.02165, arXiv:1901.06026, arXiv:1808.01244 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [246] arXiv:2206.03048 [pdf, other]
-
Title: Layered Depth Refinement with Mask GuidanceComments: Accepted to CVPR 2022 (camera-ready version)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [247] arXiv:2206.03061 [pdf, other]
-
Title: Spatial Parsing and Dynamic Temporal Pooling networks for Human-Object Interaction detectionComments: Accepted by IJCNN2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [248] arXiv:2206.03062 [pdf, other]
-
Title: Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud MapComments: 7 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [249] arXiv:2206.03064 [pdf, other]
-
Title: A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action DetectorComments: Accepted By WACV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [250] arXiv:2206.03086 [pdf, other]
-
Title: Online Deep Clustering with Video Track ConsistencyComments: Accepted at ICPR2022 as oralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [251] arXiv:2206.03087 [pdf, other]
-
Title: Critical Regularizations for Neural Surface Reconstruction in the WildComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [252] arXiv:2206.03105 [pdf, other]
- [253] arXiv:2206.03111 [pdf, other]
-
Title: Medical Image Registration via Neural FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [254] arXiv:2206.03113 [pdf, other]
-
Title: Wavelet Prior Attention Learning in Axial Inpainting NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [255] arXiv:2206.03149 [pdf, other]
-
Title: Self-Training of Handwritten Word Recognition for Synthetic-to-Real AdaptationComments: Accepted for publication in International Conference on Pattern Recognition (ICPR) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [256] arXiv:2206.03164 [pdf, other]
-
Title: Utility of Equivariant Message Passing in Cortical Mesh SegmentationComments: 13 pages, 3 figures, accepted for MIUA 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [257] arXiv:2206.03196 [pdf, other]
-
Title: Improving Image Captioning with Control Signal of Sentence QualitySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [258] arXiv:2206.03207 [pdf, other]
-
Title: Omnivision forecasting: combining satellite observations with sky images for improved intra-hour solar energy predictionsComments: Submitted to Renewable EnergySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [259] arXiv:2206.03210 [pdf, other]
-
Title: Deep Neural Patchworks: Coping with Large Segmentation TasksSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [260] arXiv:2206.03287 [pdf, other]
-
Title: NeMF: Neural Motion Fields for Kinematic AnimationComments: Accepted to NeurIPS 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [261] arXiv:2206.03361 [pdf, other]
-
Title: Hierarchical Similarity Learning for Aliasing Suppression Image Super-ResolutionComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [262] arXiv:2206.03367 [pdf, other]
-
Title: Localizing Semantic Patches for Accelerating Image ClassificationComments: Accepted by ICME-2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [263] arXiv:2206.03368 [pdf, other]
-
Title: IL-MCAM: An interactive learning and multi-channel attention mechanism-based weakly supervised colorectal histopathology image classification approachAuthors: Haoyuan Chen, Chen Li, Xiaoyan Li, Md Mamunur Rahaman, Weiming Hu, Yixin Li, Wanli Liu, Changhao Sun, Hongzan Sun, Xinyu Huang, Marcin GrzegorzekJournal-ref: Computers in Biology and Medicine, Volume 143, April 2022, 105265Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [264] arXiv:2206.03373 [pdf, other]
-
Title: Garment Avatars: Realistic Cloth Driving using Pattern RegistrationAuthors: Oshri Halimi, Fabian Prada, Tuur Stuyck, Donglai Xiang, Timur Bagautdinov, He Wen, Ron Kimmel, Takaaki Shiratori, Chenglei Wu, Yaser SheikhSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [265] arXiv:2206.03410 [pdf, other]
-
Title: Fast and Robust Non-Rigid Registration Using Accelerated Majorization-MinimizationSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [266] arXiv:2206.03428 [pdf, other]
-
Title: Revealing Single Frame Bias for Video-and-Language LearningComments: 19 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [267] arXiv:2206.03429 [pdf, other]
-
Title: Generating Long Videos of Dynamic ScenesAuthors: Tim Brooks, Janne Hellsten, Miika Aittala, Ting-Chun Wang, Timo Aila, Jaakko Lehtinen, Ming-Yu Liu, Alexei A. Efros, Tero KarrasSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [268] arXiv:2206.03431 [pdf, other]
-
Title: Self-supervised Domain Adaptation in Crowd CountingComments: Accepted at ICIP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [269] arXiv:2206.03452 [pdf, other]
-
Title: Can CNNs Be More Robust Than Transformers?Comments: tech report; code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [270] arXiv:2206.03461 [pdf, other]
-
Title: Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion ModelsAuthors: Walter H. L. Pinaya, Mark S. Graham, Robert Gray, Pedro F Da Costa, Petru-Daniel Tudosiu, Paul Wright, Yee H. Mah, Andrew D. MacKinnon, James T. Teo, Rolf Jager, David Werring, Geraint Rees, Parashkev Nachev, Sebastien Ourselin, M. Jorge CardosoSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM)
- [271] arXiv:2206.03480 [pdf, other]
-
Title: SHRED: 3D Shape Region Decomposition with Learned Local OperationsComments: SIGGRAPH ASIA 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [272] arXiv:2206.03484 [pdf, other]
-
Title: Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language EmbeddingAuthors: Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang JiangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [273] arXiv:2206.03544 [pdf, other]
-
Title: A Penny for Your (visual) Thoughts: Self-Supervised Reconstruction of Natural Movies from Brain ActivitySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [274] arXiv:2206.03591 [pdf, other]
-
Title: ObPose: Leveraging Pose for Object-Centric Scene Inference in 3DComments: 19 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [275] arXiv:2206.03600 [pdf, other]
-
Title: OneRing: A Simple Method for Source-free Open-partial Domain AdaptationComments: Updated. It only focuses on source-free open-partial domain adaptation, to avoid any potential misunderstandingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [276] arXiv:2206.03612 [pdf, other]
-
Title: Predictive Modeling of Charge Levels for Battery Electric Vehicles using CNN EfficientNet and IGTD AlgorithmSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Signal Processing (eess.SP)
- [277] arXiv:2206.03657 [pdf, other]
-
Title: Delving into the Pre-training Paradigm of Monocular 3D Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [278] arXiv:2206.03661 [pdf, other]
-
Title: One Hyper-Initializer for All Network Architectures in Medical Image AnalysisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [279] arXiv:2206.03666 [pdf, other]
-
Title: Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and TrackingAuthors: Longlong Jing, Ruichi Yu, Henrik Kretzschmar, Kang Li, Charles R. Qi, Hang Zhao, Alper Ayvaci, Xu Chen, Dillon Cower, Yingwei Li, Yurong You, Han Deng, Congcong Li, Dragomir AnguelovJournal-ref: ICRA2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [280] arXiv:2206.03673 [pdf, other]
-
Title: Unsupervised Learning of 3D Scene Flow from Monocular CameraComments: ICRA2021Journal-ref: 2021 IEEE International Conference on Robotics and Automation (ICRA)Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [281] arXiv:2206.03678 [pdf, other]
-
Title: UHD Image Deblurring via Multi-scale Cubic-MixerComments: 8 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [282] arXiv:2206.03680 [pdf, other]
-
Title: DebiasBench: Benchmark for Fair Comparison of Debiasing in Image ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [283] arXiv:2206.03687 [pdf, other]
-
Title: A Unified Model for Multi-class Anomaly DetectionComments: Accepted by NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [284] arXiv:2206.03691 [pdf, other]
-
Title: Robust Deep Ensemble Method for Real-world Image DenoisingSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [285] arXiv:2206.03697 [pdf, other]
-
Title: Blind Face Restoration: Benchmark Datasets and a Baseline ModelSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [286] arXiv:2206.03698 [pdf, other]
-
Title: What do we learn? Debunking the Myth of Unsupervised Outlier DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [287] arXiv:2206.03727 [pdf, other]
-
Title: Wavelet Regularization Benefits Adversarial TrainingComments: Preprint versionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [288] arXiv:2206.03740 [pdf, other]
-
Title: Large Loss Matters in Weakly Supervised Multi-Label ClassificationComments: CVPR 2022. First two authors contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [289] arXiv:2206.03753 [pdf, other]
-
Title: Learning Task Agnostic Temporal Consistency CorrectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [290] arXiv:2206.03775 [pdf, other]
-
Title: PixSelect: Less but Reliable Pixels for Accurate and Efficient LocalizationAuthors: Mohammad AltillawiJournal-ref: IEEE International Conference on Robotics and Automation (ICRA), May 23-27, 2022. Philadelphia, PA, USA, p 4156-4162Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [291] arXiv:2206.03778 [pdf, other]
-
Title: Learning Digital Terrain Models from Point Clouds: ALS2DTM Dataset and Rasterization-based GANSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [292] arXiv:2206.03789 [pdf, other]
-
Title: Language-Bridged Spatial-Temporal Interaction for Referring Video Object SegmentationComments: Accepted by CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [293] arXiv:2206.03799 [pdf, other]
-
Title: Dyna-DM: Dynamic Object-aware Self-supervised Monocular Depth MapsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [294] arXiv:2206.03820 [pdf, ps, other]
-
Title: SUPER-IVIM-DC: Intra-voxel incoherent motion based Fetal lung maturity assessment from limited DWI data using supervised learning coupled with data-consistencyAuthors: Noam Korngut, Elad Rotman, Onur Afacan, Sila Kurugol, Yael Zaffrani-Reznikov, Shira Nemirovsky-Rotman, Simon Warfield, Moti FreimanComments: Accepted to the International Conference on Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, to be held during Sept 18-22 in SingaporeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
- [295] arXiv:2206.03858 [pdf, other]
-
Title: Rotation-Equivariant Conditional Spherical Neural Fields for Learning a Natural Illumination PriorComments: NeurIPS 2022 - Project Website: jadgardner.github.io/RENISubjects: Computer Vision and Pattern Recognition (cs.CV)
- [296] arXiv:2206.03860 [pdf, other]
-
Title: Orthonormal Convolutions for the Rotation Based Iterative GaussianizationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [297] arXiv:2206.03862 [pdf, other]
-
Title: Perceptual Quality Assessment for Fine-Grained Compressed ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [298] arXiv:2206.03876 [pdf, other]
-
Title: Progressive GANomaly: Anomaly detection with progressively growing GANsComments: SPIE Medical Imaging 2022: Image Processing conferenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [299] arXiv:2206.03888 [pdf, other]
-
Title: ConFUDA: Contrastive Fewshot Unsupervised Domain Adaptation for Medical Image SegmentationAuthors: Mingxuan Gu, Sulaiman Vesal, Mareike Thies, Zhaoya Pan, Fabian Wagner, Mirabela Rusu, Andreas Maier, Ronak KostiSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [300] arXiv:2206.03891 [pdf, other]
-
Title: PrivHAR: Recognizing Human Actions From Privacy-preserving LensAuthors: Carlos Hinojosa, Miguel Marquez, Henry Arguello, Ehsan Adeli, Li Fei-Fei, Juan Carlos NieblesSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [301] arXiv:2206.03928 [pdf, other]
-
Title: Direct Triangulation with Spherical Projection for Omnidirectional CamerasAuthors: Ciarán EisingComments: 8 pages, 4 figures, 3 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [302] arXiv:2206.03939 [pdf, other]
-
Title: Depth-Adapted CNNs for RGB-D Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [303] arXiv:2206.03943 [pdf, other]
-
Title: Robust Environment Perception for Automated Driving: A Unified Learning Pipeline for Visual-Infrared Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
- [304] arXiv:2206.03970 [pdf, other]
-
Title: Narrowing the Coordinate-frame Gap in Behavior Prediction Models: Distillation for Efficient and Accurate Scene-centric Motion ForecastingComments: Accepted at ICRA 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [305] arXiv:2206.04003 [pdf, other]
-
Title: Patch-based Object-centric Transformers for Efficient Video GenerationComments: Project Website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [306] arXiv:2206.04028 [pdf, other]
-
Title: CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous DrivingComments: Pre-trained backbones and fine-tuned downstream models are now available: this https URL Code will be releasedSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [307] arXiv:2206.04029 [pdf, other]
-
Title: Accelerating Score-based Generative Models for High-Resolution Image SynthesisSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [308] arXiv:2206.04040 [pdf, other]
-
Title: An Improved One millisecond Mobile BackboneSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [309] arXiv:2206.04042 [pdf, other]
-
Title: Learning Ego 3D Representation as Ray TracingComments: ECCV 2022. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [310] arXiv:2206.04046 [pdf, other]
-
Title: Sparse Mixture-of-Experts are Domain Generalizable LearnersComments: remake preprint versionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [311] arXiv:2206.04124 [pdf, other]
-
Title: DRHDR: A Dual branch Residual Network for Multi-Bracket High Dynamic Range ImagingComments: Accepted by CVPRW 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [312] arXiv:2206.04125 [pdf, other]
-
Title: Towards Self-supervised and Weight-preserving Neural Architecture SearchAuthors: Zhuowei Li, Yibo Gao, Zhenzhou Zha, Zhiqiang HU, Qing Xia, Shaoting Zhang, Dimitris N. MetaxasSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [313] arXiv:2206.04158 [pdf, other]
-
Title: Texture Extraction Methods Based Ensembling Framework for Improved ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [314] arXiv:2206.04170 [pdf, other]
-
Title: CASS: Cross Architectural Self-Supervision for Medical Image AnalysisComments: (27 pages, 14 figures), Accepted at NeurIPS 2022 Workshop: Self-Supervised Learning - Theory and PracticeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [315] arXiv:2206.04176 [pdf, other]
-
Title: VN-Transformer: Rotation-Equivariant Attention for Vector NeuronsComments: Published in Transactions on Machine Learning Research (TMLR), 2023; Previous version appeared in Workshop on Machine Learning for Autonomous Driving, Conference on Neural Information Processing Systems (NeurIPS), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [316] arXiv:2206.04197 [pdf, other]
-
Title: SCAMPS: Synthetics for Camera Measurement of Physiological SignalsAuthors: Daniel McDuff, Miah Wander, Xin Liu, Brian L. Hill, Javier Hernandez, Jonathan Lester, Tadas BaltrusaitisSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [317] arXiv:2206.04231 [pdf, other]
-
Title: JNMR: Joint Non-linear Motion Regression for Video Frame InterpolationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [318] arXiv:2206.04242 [pdf, other]
-
Title: OOD Augmentation May Be at Odds with Open-Set RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [319] arXiv:2206.04246 [pdf, other]
-
Title: SwinCheX: Multi-label classification on chest X-ray images with transformersSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [320] arXiv:2206.04271 [pdf, other]
-
Title: DeepVerge: Classification of Roadside Verge Biodiversity and Conservation PotentialSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [321] arXiv:2206.04281 [pdf, other]
-
Title: Local Spatiotemporal Representation Learning for Longitudinally-consistent Neuroimage AnalysisSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [322] arXiv:2206.04295 [pdf, other]
-
Title: Reconstruct Face from Features Using GAN Generator as a Distribution ConstraintSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [323] arXiv:2206.04325 [pdf, other]
-
Title: CFA: Coupled-hypersphere-based Feature Adaptation for Target-Oriented Anomaly LocalizationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [324] arXiv:2206.04349 [pdf, other]
-
Title: Deep radiomic signature with immune cell markers predicts the survival of glioma patientsAuthors: Ahmad Chaddad, Paul Daniel Mingli Zhang, Saima Rathore, Paul Sargos, Christian Desrosiers, Tamim NiaziJournal-ref: Neurocomputing, Volume 469, 16 January 2022, Pages 366-375Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM); Methodology (stat.ME)
- [325] arXiv:2206.04365 [pdf, other]
-
Title: CARLA-GeAR: a Dataset Generator for a Systematic Evaluation of Adversarial Robustness of Vision ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [326] arXiv:2206.04374 [pdf, other]
-
Title: Uncovering bias in the PlantVillage datasetAuthors: Mehmet Alican NoyanSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [327] arXiv:2206.04381 [pdf, other]
-
Title: STIP: A SpatioTemporal Information-Preserving and Perception-Augmented Model for High-Resolution Video PredictionComments: This journal paper is extended from our previous work accepted in CVPR2022 and has been submitted to IEEE Transactions on MultimediaSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [328] arXiv:2206.04382 [pdf, other]
-
Title: CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human MeshesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
- [329] arXiv:2206.04399 [pdf]
-
Title: Depression Recognition using Remote Photoplethysmography from Facial VideosComments: 10 pages, 5 figures, 8 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
- [330] arXiv:2206.04401 [pdf, other]
-
Title: Cross-modal Local Shortest Path and Global Enhancement for Visible-Thermal Person Re-IdentificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [331] arXiv:2206.04403 [pdf, other]
-
Title: VITA: Video Instance Segmentation via Object Token AssociationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [332] arXiv:2206.04406 [pdf, other]
-
Title: Unsupervised Learning of the Total Variation FlowSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [333] arXiv:2206.04425 [pdf, other]
-
Title: Multiple Instance Learning for Digital Pathology: A Review on the State-of-the-Art, Limitations & Future PotentialSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [334] arXiv:2206.04449 [pdf, other]
-
Title: Segmentation Enhanced Lameness Detection in Dairy Cows from RGB and Depth VideoComments: Accepted at the CV4Animals workshop in CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [335] arXiv:2206.04452 [pdf, other]
-
Title: Draft-and-Revise: Effective Image Generation with Contextual RQ-TransformerComments: 20 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [336] arXiv:2206.04453 [pdf, other]
-
Title: The Missing Link: Finding label relations across datasetsComments: ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [337] arXiv:2206.04479 [pdf]
-
Title: BSM loss: A superior way in modeling aleatory uncertainty of fine_grained classificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [338] arXiv:2206.04503 [pdf, other]
-
Title: cycle text2face: cycle text-to-face gan via transformersSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [339] arXiv:2206.04511 [pdf, other]
-
Title: Efficient Human Pose Estimation via 3D Event Point CloudComments: Accepted to 3DV 2022. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [340] arXiv:2206.04531 [pdf, other]
-
Title: ECLAD: Extracting Concepts with Local Aggregated DescriptorsComments: 46 pages, under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [341] arXiv:2206.04557 [pdf, other]
-
Title: SparseFormer: Attention-based Depth Completion NetworkComments: Accepted at CV4ARVR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [342] arXiv:2206.04558 [pdf, other]
-
Title: BFS-Net: Weakly Supervised Cell Instance Segmentation from Bright-Field Microscopy Z-StacksSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [343] arXiv:2206.04575 [pdf, other]
-
Title: Transformer based Urdu Handwritten Text Optical Character ReaderSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [344] arXiv:2206.04584 [pdf, other]
-
Title: Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel TransformerComments: Tech report. Work in progressSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [345] arXiv:2206.04590 [pdf, other]
-
Title: GASP: Gated Attention For Saliency PredictionComments: International Joint Conference on Artificial Intelligence (IJCAI-21)Journal-ref: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (2021) 584-591Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [346] arXiv:2206.04636 [pdf, other]
-
Title: Spatial Entropy as an Inductive Bias for Vision TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [347] arXiv:2206.04655 [pdf, other]
-
Title: Towards Layer-wise Image VectorizationAuthors: Xu Ma, Yuqian Zhou, Xingqian Xu, Bin Sun, Valerii Filev, Nikita Orlov, Yun Fu, Humphrey ShiComments: Accepted as Oral Presentation at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [348] arXiv:2206.04656 [pdf, other]
-
Title: Simple Cues Lead to a Strong Multi-Object TrackerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [349] arXiv:2206.04662 [pdf, other]
-
Title: DiSparse: Disentangled Sparsification for Multitask Model CompressionComments: Accepted at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [350] arXiv:2206.04664 [pdf, other]
-
Title: On Data Scaling in Masked Image ModelingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [351] arXiv:2206.04665 [pdf, other]
-
Title: AGConv: Adaptive Graph Convolution on 3D Point CloudsAuthors: Mingqiang Wei, Zeyong Wei, Haoran Zhou, Fei Hu, Huajian Si, Zhilei Chen, Zhe Zhu, Jingbo Qiu, Xuefeng Yan, Yanwen Guo, Jun Wang, Jing QinComments: arXiv admin note: substantial text overlap with arXiv:2108.08035Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [352] arXiv:2206.04667 [pdf, other]
-
Title: Extreme Masking for Learning Instance and Distributed Visual RepresentationsComments: Technical ReportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [353] arXiv:2206.04668 [pdf, other]
-
Title: GateHUB: Gated History Unit with Background Suppression for Online Action DetectionComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [354] arXiv:2206.04669 [pdf, other]
-
Title: Beyond RGB: Scene-Property Synthesis with Neural Radiance FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [355] arXiv:2206.04670 [pdf, other]
-
Title: PointNeXt: Revisiting PointNet++ with Improved Training and Scaling StrategiesAuthors: Guocheng Qian, Yuchen Li, Houwen Peng, Jinjie Mai, Hasan Abed Al Kader Hammoud, Mohamed Elhoseiny, Bernard GhanemComments: Accepted by NeurIPS'22. Code and models are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [356] arXiv:2206.04671 [pdf, other]
-
Title: Open Challenges in Deep Stereo: the Booster DatasetAuthors: Pierluigi Zama Ramirez, Fabio Tosi, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di StefanoComments: CVPR 2022, New Orleans. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [357] arXiv:2206.04673 [pdf, other]
-
Title: Neural Prompt SearchComments: Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [358] arXiv:2206.04674 [pdf, other]
-
Title: Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEsComments: Code shall be released at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [359] arXiv:2206.04783 [pdf, other]
-
Title: ReFace: Real-time Adversarial Attacks on Face Recognition SystemsAuthors: Shehzeen Hussain, Todd Huster, Chris Mesterharm, Paarth Neekhara, Kevin An, Malhar Jere, Harshvardhan Sikka, Farinaz KoushanfarSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [360] arXiv:2206.04785 [pdf, other]
-
Title: Building Spatio-temporal Transformers for Egocentric 3D Pose EstimationComments: 4 pages, Extended abstract, Joint International Workshop on Egocentric Perception, Interaction and Computing (EPIC) and Ego4D, IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [361] arXiv:2206.04790 [pdf, other]
-
Title: Learn2Augment: Learning to Composite Videos for Data Augmentation in Action RecognitionComments: Accepted to ECCV-2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [362] arXiv:2206.04797 [pdf, other]
-
Title: Memory-efficient model-based deep learning with convergence and robustness guaranteesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [363] arXiv:2206.04831 [pdf, other]
-
Title: R4D: Utilizing Reference Objects for Long-Range Distance EstimationComments: ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [364] arXiv:2206.04846 [pdf, other]
-
Title: Masked Autoencoders are Robust Data AugmentorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [365] arXiv:2206.04854 [pdf, other]
-
Title: Heterogeneous Face Recognition via Face Synthesis with Identity-Attribute DisentanglementComments: Accepted for publication in IEEE Transactions on Information Forensics and Security (TIFS)Journal-ref: IEEE Transactions on Information Forensics and Security, vol. 17, pp. 1344-1358, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [366] arXiv:2206.04863 [pdf, other]
-
Title: Symbolic image detection using scene and knowledge graphsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [367] arXiv:2206.04867 [pdf, other]
-
Title: The Gender Gap in Face Recognition Accuracy Is a Hairy ProblemSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [368] arXiv:2206.04874 [pdf]
-
Title: The 1st Data Science for Pavements ChallengeAuthors: Ashkan Behzadian, Tanner Wambui Muturi, Tianjie Zhang, Hongmin Kim, Amanda Mullins, Yang Lu, Neema Jasika Owor, Yaw Adu-Gyamfi, William Buttlar, Majidifard Hamed, Armstrong Aboah, David Mensching, Spragg Robert, Matthew Corrigan, Jack Youtchef, Dave EshanSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [369] arXiv:2206.04879 [pdf, other]
-
Title: Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label DiffusionComments: IEEE Transactions on Image Processing 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [370] arXiv:2206.04901 [pdf, other]
-
Title: NeRF-In: Free-Form NeRF Inpainting with RGB-D PriorsComments: Hao-Kang Liu and I-Chao Shen contributed equally to the paper. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [371] arXiv:2206.04906 [pdf, other]
-
Title: Out of Sight, Out of Mind: A Source-View-Wise Feature Aggregation for Multi-View Image-Based RenderingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [372] arXiv:2206.04916 [pdf, other]
-
Title: PatchComplete: Learning Multi-Resolution Patch Priors for 3D Shape Completion on Unseen CategoriesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [373] arXiv:2206.04927 [pdf, other]
-
Title: Ego2HandsPose: A Dataset for Egocentric Two-hand 3D Global Pose EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [374] arXiv:2206.04942 [pdf, other]
-
Title: Neural Template: Topology-aware Reconstruction and Disentangled Generation of 3D MeshesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [375] arXiv:2206.04949 [pdf, other]
-
Title: Deep Multi-view Semi-supervised Clustering with Sample Pairwise ConstraintsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [376] arXiv:2206.04958 [pdf, other]
-
Title: Self-Supervised Deep Subspace Clustering with Entropy-normSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [377] arXiv:2206.04975 [pdf, other]
-
Title: NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression RecognitionComments: 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [378] arXiv:2206.04979 [pdf, ps, other]
-
Title: Convolutional Layers are Equivariant to Discrete Shifts But Not Continuous TranslationsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [379] arXiv:2206.04981 [pdf, other]
-
Title: Positional Label for Self-Supervised Vision TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [380] arXiv:2206.05028 [pdf, other]
-
Title: Spatial Cross-Attention Improves Self-Supervised Visual Representation LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [381] arXiv:2206.05039 [pdf, other]
-
Title: Image Generation with Multimodal Priors using Denoising Diffusion Probabilistic ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [382] arXiv:2206.05099 [pdf, other]
-
Title: SimVP: Simpler yet Better Video PredictionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [383] arXiv:2206.05102 [pdf, other]
-
Title: Saccade Mechanisms for Image Classification, Object Detection and TrackingComments: 4 Pages, 6 figures, will be presented at CVPR2022-NeuroVision workshop as a Lightning talkSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [384] arXiv:2206.05127 [pdf, other]
-
Title: Globally-Optimal Contrast Maximisation for Event CamerasComments: arXiv admin note: substantial text overlap with arXiv:2203.03914Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [385] arXiv:2206.05128 [pdf]
-
Title: Real-time Hyper-Dimensional Reconfiguration at the Edge using Hardware AcceleratorsAuthors: Indhumathi Kandaswamy, Saurabh Farkya, Zachary Daniels, Gooitzen van der Wal, Aswin Raghavan, Yuzheng Zhang, Jun Hu, Michael Lomnitz, Michael Isnardi, David Zhang, Michael PiacentinoComments: 9 pages, 15 figures. Will be presented in Embedded Vision Workshop at CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR)
- [386] arXiv:2206.05149 [pdf, other]
-
Title: Referring Image MattingComments: The dataset, code and models are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [387] arXiv:2206.05158 [pdf, other]
-
Title: MEAT: Maneuver Extraction from Agent TrajectoriesComments: Accepted at IEEE Intelligent Vehicles Symposium (IV) 2022 2nd Workshop on Autonomy@ScaleSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [388] arXiv:2206.05159 [pdf]
-
Title: An Image Processing Pipeline for Camera Trap Time-Lapse RecordingsComments: 5 pages, 2 figures, presented at the CV4Animals workshop of CVIP2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [389] arXiv:2206.05184 [pdf, other]
-
Title: Exploring Feature Self-relation for Self-supervised TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [390] arXiv:2206.05194 [pdf, other]
-
Title: Learning the Space of Deep ModelsComments: Accepted at ICPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [391] arXiv:2206.05225 [pdf, other]
-
Title: ClamNet: Using contrastive learning with variable depth Unets for medical image segmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [392] arXiv:2206.05252 [pdf, other]
-
Title: Lost in Transmission: On the Impact of Networking Corruptions on Video Machine Learning ModelsComments: 12 pages, 12 figures (with supplemental: 34 pages)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [393] arXiv:2206.05253 [pdf, other]
-
Title: Rethinking Spatial Invariance of Convolutional Networks for Object CountingComments: Accepted to CVPR 2022, Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP)
- [394] arXiv:2206.05257 [pdf, other]
-
Title: Explaining Image Classifiers Using Contrastive Counterfactuals in Generative Latent SpacesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [395] arXiv:2206.05259 [pdf, other]
-
Title: Is Self-Supervised Learning More Robust Than Supervised Learning?Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [396] arXiv:2206.05260 [pdf, other]
-
Title: Balanced Product of Calibrated Experts for Long-Tailed RecognitionComments: 19 pages, under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [397] arXiv:2206.05275 [pdf, other]
-
Title: Spatial-temporal Concept based Explanation of 3D ConvNetsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [398] arXiv:2206.05281 [pdf, other]
-
Title: Less Is More: Linear Layers on CLIP Features as Powerful VizWiz ModelComments: VizWiz Grand Challenge: Describing Images and Videos Taken by Blind People (CVPR Workshop 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [399] arXiv:2206.05282 [pdf, other]
-
Title: Learning to Estimate Shapley Values with Vision TransformersComments: Updated versionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [400] arXiv:2206.05291 [pdf, other]
-
Title: ProActive: Self-Attentive Temporal Point Process Flows for Activity SequencesComments: KDD 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [401] arXiv:2206.05309 [pdf, ps, other]
-
Title: EigenFairing: 3D Model Fairing using Image CoherenceComments: British Machine Vision Conference, BMVC 2004, Kingston, UK, September 7-9, 2004Journal-ref: Proceedings of the British Machine Conference, pages 1-10, BMVA Press, September 2004Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [402] arXiv:2206.05319 [pdf, other]
-
Title: Object Instance Identification in Dynamic EnvironmentsComments: Joint 1st Ego4D and 10th EPIC Workshop (EPIC@CVPR2022) Extended AbstractSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [403] arXiv:2206.05375 [pdf, other]
-
Title: Generalizable Neural Radiance Fields for Novel View Synthesis with TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [404] arXiv:2206.05377 [pdf, other]
-
Title: Fast building segmentation from satellite imagery and few local labelsAuthors: Caleb Robinson, Anthony Ortiz, Hogeun Park, Nancy Lozano Gracia, Jon Kher Kaw, Tina Sederholm, Rahul Dodhia, Juan M. Lavista FerresComments: Accepted at EarthVision 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [405] arXiv:2206.05379 [pdf, other]
-
Title: A Benchmark for Compositional Visual ReasoningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [406] arXiv:2206.05390 [pdf, other]
-
Title: Transformer-based Self-Supervised Fish Segmentation in Underwater VideosComments: 11 pages, 6 figures. Submitted to the journal, International Journal of Intelligent SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [407] arXiv:2206.05394 [pdf, other]
-
Title: Applications of Deep Learning in Fish Habitat Monitoring: A Tutorial and SurveyComments: 26 pages, 7 figures. Submitted to the journal, Expert Systems With ApplicationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [408] arXiv:2206.05398 [pdf, other]
-
Title: E2PN: Efficient SE(3)-Equivariant Point NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [409] arXiv:2206.05420 [pdf, other]
-
Title: VAC2: Visual Analysis of Combined Causality in Event SequencesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [410] arXiv:2206.05422 [pdf, other]
-
Title: Access Control of Semantic Segmentation Models Using Encrypted Feature MapsSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [411] arXiv:2206.05424 [pdf, other]
-
Title: Precise Affordance Annotation for Egocentric Action Video DatasetsComments: Technical report for CVPR 2022 EPIC-Ego4D WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [412] arXiv:2206.05431 [pdf, other]
-
Title: Learned reconstruction methods with convergence guaranteesAuthors: Subhadip Mukherjee, Andreas Hauptmann, Ozan Öktem, Marcelo Pereyra, Carola-Bibiane SchönliebSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [413] arXiv:2206.05432 [pdf, ps, other]
-
Title: Luminance-Guided Chrominance Image Enhancement for HEVC Intra CodingComments: ISCAS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [414] arXiv:2206.05488 [pdf]
-
Title: Kaggle Kinship Recognition Challenge: Introduction of Convolution-Free Model to boost conventionalSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [415] arXiv:2206.05496 [pdf, other]
-
Title: An Evaluation of OCR on Egocentric DataComments: Extended Abstract, EPIC workshop at CVPR 22Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [416] arXiv:2206.05498 [pdf, other]
-
Title: A Review of Causality for Learning Algorithms in Medical Image AnalysisComments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) this https URL". ; Paper ID: 2022:028Journal-ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); General Literature (cs.GL)
- [417] arXiv:2206.05514 [pdf, other]
-
Title: Toward Real-world Single Image Deraining: A New Benchmark and BeyondSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [418] arXiv:2206.05520 [pdf, other]
-
Title: A Two-stage Method for Non-extreme Value Salt-and-Pepper Noise RemovalComments: UESTC course projectSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [419] arXiv:2206.05539 [pdf, other]
-
Title: A Simplified Un-Supervised Learning Based Approach for Ink Mismatch Detection in Handwritten Hyper-Spectral Document ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [420] arXiv:2206.05542 [pdf, other]
-
Title: Surround-View Cameras based Holistic Visual Perception for Automated DrivingAuthors: Varun Ravi KumarComments: Doctoral thesisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [421] arXiv:2206.05617 [pdf, other]
-
Title: Federated Learning with Research Prototypes for Multi-Center MRI-based Detection of Prostate Cancer with Diverse HistopathologyAuthors: Abhejit Rajagopal, Ekaterina Redekop, Anil Kemisetti, Rushi Kulkarni, Steven Raman, Kirti Magudia, Corey W. Arnold, Peder E. Z. LarsonComments: under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Tissues and Organs (q-bio.TO)
- [422] arXiv:2206.05619 [pdf, other]
-
Title: Deep Learning Models for Automated Classification of Dog Emotional States from Facial ExpressionsAuthors: Tali Boneh-Shitrit, Shir Amir, Annika Bremhorst, Daniel S. Mills, Stefanie Riemer, Dror Fried, Anna ZamanskySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [423] arXiv:2206.05641 [pdf, ps, other]
-
Title: An Unsupervised Deep-Learning Method for Bone Age AssessmentSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [424] arXiv:2206.05648 [pdf, other]
-
Title: Indirect-Instant Attention Optimization for Crowd Counting in Dense ScenesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [425] arXiv:2206.05651 [pdf, other]
-
Title: STD-NET: Search of Image Steganalytic Deep-learning Architecture via Hierarchical Tensor DecompositionComments: Submitted to IEEE T-DSCSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [426] arXiv:2206.05683 [pdf, other]
-
Title: APT-36K: A Large-scale Benchmark for Animal Pose Estimation and TrackingComments: Neurips 2022 dataset and benchmark trackSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [427] arXiv:2206.05707 [pdf, other]
-
Title: DPCN++: Differentiable Phase Correlation Network for Versatile Pose RegistrationAuthors: Zexi Chen, Yiyi Liao, Haozhe Du, Haodong Zhang, Xuecheng Xu, Haojian Lu, Rong Xiong, Yue WangSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [428] arXiv:2206.05708 [pdf, other]
-
Title: Narrowing the Gap: Improved Detector Training with Noisy Location AnnotationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [429] arXiv:2206.05712 [pdf, other]
-
Title: Graph-based Spatial Transformer with Memory Replay for Multi-future Pedestrian Trajectory PredictionComments: This paper has been accepted by CVPR 2022. Reference: Li, L., Pagnucco, M. and Song, Y., 2022. Graph-Based Spatial Transformer With Memory Replay for Multi-Future Pedestrian Trajectory Prediction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 2231-2241)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [430] arXiv:2206.05717 [pdf, other]
-
Title: Crowd Localization from Gaussian Mixture Scoped Knowledge and Scoped TeacherSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [431] arXiv:2206.05730 [pdf, other]
-
Title: Object Occlusion of Adding New Categories in Objection DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [432] arXiv:2206.05737 [pdf, other]
-
Title: SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse ViewsComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [433] arXiv:2206.05741 [pdf, other]
-
Title: Bootstrapping Multi-view Representations for Fake News DetectionComments: Authors are from Fudan University, China. Under ReviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [434] arXiv:2206.05763 [pdf, other]
-
Title: SeATrans: Learning Segmentation-Assisted diagnosis model via TransformerAuthors: Junde Wu, Huihui Fang, Fangxin Shang, Dalu Yang, Zhaowei Wang, Jing Gao, Yehui Yang, Yanwu XuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [435] arXiv:2206.05765 [pdf, other]
-
Title: A Semantic Consistency Feature Alignment Object Detection Model Based on Mixed-Class Distribution MetricsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [436] arXiv:2206.05810 [pdf, other]
-
Title: Analysis of Branch Specialization and its Application in Image DecompositionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [437] arXiv:2206.05833 [pdf, other]
-
Title: COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion RecognitionAuthors: Mani Kumar Tellamekala, Shahin Amiriparian, Björn W. Schuller, Elisabeth André, Timo Giesbrecht, Michel ValstarSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
- [438] arXiv:2206.05836 [pdf, other]
-
Title: GLIPv2: Unifying Localization and Vision-Language UnderstandingAuthors: Haotian Zhang, Pengchuan Zhang, Xiaowei Hu, Yen-Chun Chen, Liunian Harold Li, Xiyang Dai, Lijuan Wang, Lu Yuan, Jenq-Neng Hwang, Jianfeng GaoComments: NeurIPS 2022; updated with reviewers' comments addressed; Code is released at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
- [439] arXiv:2206.05837 [pdf, other]
-
Title: NeuralODF: Learning Omnidirectional Distance Fields for 3D Shape RepresentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [440] arXiv:2206.05842 [pdf]
-
Title: Efficiency Comparison of AI classification algorithms for Image Detection and Recognition in Real-timeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [441] arXiv:2206.05844 [pdf, other]
-
Title: FisheyeEX: Polar Outpainting for Extending the FoV of Fisheye LensSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [442] arXiv:2206.05846 [pdf, other]
-
Title: InBiaseD: Inductive Bias Distillation to Improve Generalization and Robustness through Shape-awarenessComments: Accepted at 1st Conference on Lifelong Learning Agents (CoLLAs 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [443] arXiv:2206.05853 [pdf, other]
-
Title: Modeling Generalized Specialist Approach To Train Quality Resilient Snapshot EnsembleSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [444] arXiv:2206.05866 [pdf, other]
-
Title: TC-SfM: Robust Track-Community-Based Structure-from-MotionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [445] arXiv:2206.05896 [pdf, other]
-
Title: Improve Ranking Correlation of Super-net through Training Scheme from One-shot NAS to Few-shot NASSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [446] arXiv:2206.05897 [pdf, other]
-
Title: $\texttt{GradICON}$: Approximate Diffeomorphisms via Gradient Inverse ConsistencyAuthors: Lin Tian, Hastings Greer, François-Xavier Vialard, Roland Kwitt, Raúl San José Estépar, Richard Jarrett Rushmore, Nikolaos Makris, Sylvain Bouix, Marc NiethammerComments: 29 pages, 16 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [447] arXiv:2206.05898 [pdf, other]
-
Title: Pixel to Binary Embedding Towards Robustness for CNNsComments: Accepted to ICPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [448] arXiv:2206.05903 [pdf, other]
-
Title: Geometrically Guided Integrated GradientsComments: 19 pages, 23 figures, funding sources addedSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [449] arXiv:2206.05912 [pdf, other]
-
Title: INDIGO: Intrinsic Multimodality for Domain GeneralizationAuthors: Puneet Mangla, Shivam Chandhok, Milan Aggarwal, Vineeth N Balasubramanian, Balaji KrishnamurthyComments: Under SubmissionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [450] arXiv:2206.05927 [pdf, other]
-
Title: LinK3D: Linear Keypoints Representation for 3D LiDAR Point CloudSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [451] arXiv:2206.05962 [pdf, other]
-
Title: PRO-TIP: Phantom for RObust automatic ultrasound calibration by TIP detectionAuthors: Matteo Ronchetti, Julia Rackerseder, Maria Tirindelli, Mehrdad Salehi, Nassir Navab, Wolfgang Wein, Oliver ZettinigComments: This preprint was submitted to MICCAI 2022. The Version of Record of this contribution will be published in Springer LNCSSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [452] arXiv:2206.05963 [pdf]
-
Title: ATDN vSLAM: An all-through Deep Learning-Based Solution for Visual Simultaneous Localization and MappingComments: Published in Periodica Polytechnica Electrical Engineering 11 pagesJournal-ref: Periodica Polytechnica Electrical Engineering and Computer Science, 66(3), pp. 236-247, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [453] arXiv:2206.05967 [pdf, other]
-
Title: GoToNet: Fast Monocular Scene Exposure and ExplorationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [454] arXiv:2206.05970 [pdf, other]
-
Title: HyperRes: Efficient Hypernetwork-Based Continuous Image RestorationSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [455] arXiv:2206.05981 [pdf, other]
-
Title: Efficient Human-in-the-loop System for Guiding DNNs AttentionComments: 13 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [456] arXiv:2206.05982 [pdf, other]
-
Title: Learning Fashion Compatibility from In-the-wild ImagesComments: Accepted to ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [457] arXiv:2206.06014 [pdf, other]
-
Title: Exploring and Exploiting Hubness Priors for High-Quality GAN Latent SamplingComments: Accepted at ICML 2022. Our code is available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [458] arXiv:2206.06023 [pdf, other]
-
Title: Virtual embeddings and self-consistency for self-supervised learningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [459] arXiv:2206.06067 [pdf, other]
-
Title: Better Teacher Better Student: Dynamic Prior Knowledge for Knowledge DistillationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [460] arXiv:2206.06079 [pdf, other]
-
Title: OHM: GPU Based Occupancy Map GenerationComments: Under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [461] arXiv:2206.06100 [pdf, other]
-
Title: AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance FieldsAuthors: Takuhiro KanekoComments: Accepted to CVPR 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [462] arXiv:2206.06103 [pdf, other]
-
Title: Learning Feature Disentanglement and Dynamic Fusion for Recaptured Image ForensicComments: Accepted by CVPR2022 workshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [463] arXiv:2206.06119 [pdf, other]
-
Title: Satellite-based high-resolution maps of cocoa for Côte d'Ivoire and GhanaAuthors: Nikolai Kalischek, Nico Lang, Cécile Renier, Rodrigo Caye Daudt, Thomas Addoah, William Thompson, Wilma J. Blaser-Hart, Rachael Garrett, Konrad Schindler, Jan D. WegnerSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [464] arXiv:2206.06120 [pdf]
-
Title: Brain tumour segmentation with incomplete imaging dataComments: 25 pages, 7 figures, 4 supplementary tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Tissues and Organs (q-bio.TO)
- [465] arXiv:2206.06122 [pdf, other]
-
Title: Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuningAuthors: Yanpeng Sun, Qiang Chen, Xiangyu He, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jian Cheng, Zechao Li, Jingdong WangComments: Accepted to NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [466] arXiv:2206.06168 [pdf, other]
-
Title: 2nd Place Solution for ICCV 2021 VIPriors Image Classification Challenge: An Attract-and-Repulse Learning ApproachComments: 2nd Place Solution for ICCV 2021 VIPriors Image Classification ChallengeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [467] arXiv:2206.06177 [pdf, other]
-
Title: Transductive CLIP with Class-Conditional Contrastive LearningComments: Published in IEEE ICASSP 2022Journal-ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [468] arXiv:2206.06214 [pdf, other]
-
Title: Learning a Degradation-Adaptive Network for Light Field Image Super-ResolutionComments: 13 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [469] arXiv:2206.06219 [pdf, other]
-
Title: Making Sense of Dependence: Efficient Black-box Explanations Using Dependence MeasureComments: Accepted to NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML); Other Statistics (stat.OT)
- [470] arXiv:2206.06252 [pdf, other]
-
Title: Transformer Lesion TrackerComments: Accepted MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [471] arXiv:2206.06258 [pdf, other]
-
Title: Featurized Query R-CNNComments: Tech ReportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [472] arXiv:2206.06289 [pdf, other]
-
Title: Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object ManipulationComments: Accepted by ICLR 2022 Workshop on Generalizable Policy Learning in Physical World. Top-performing systems for both no interaction and no restriction tracks in SAPIEN ManiSkill Challenge 2021. The source code and model are publicly available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Robotics (cs.RO)
- [473] arXiv:2206.06291 [pdf, other]
-
Title: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction DetectionComments: CVPR 2022; Code is publicly available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [474] arXiv:2206.06292 [pdf, other]
-
Title: MLP-3D: A MLP-like 3D Architecture with Grouped Time MixingComments: CVPR 2022; Code is publicly available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [475] arXiv:2206.06293 [pdf, other]
-
Title: Learning Domain Adaptive Object Detection with Probabilistic TeacherAuthors: Meilin Chen, Weijie Chen, Shicai Yang, Jie Song, Xinchao Wang, Lei Zhang, Yunfeng Yan, Donglian Qi, Yueting Zhuang, Di Xie, Shiliang PuComments: To appear in ICML 2022. Code is coming soon: this https URLJournal-ref: International Conference on Machine Learning (ICML), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [476] arXiv:2206.06323 [pdf, other]
-
Title: Visual Transformer for Object DetectionAuthors: Michael YangComments: In preparation for short paper of conferences. I am using the name Michael YangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [477] arXiv:2206.06340 [pdf, other]
-
Title: SNeS: Learning Probably Symmetric Neural Surfaces from Incomplete DataComments: First two authors contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [478] arXiv:2206.06346 [pdf]
-
Title: Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object TokensAuthors: Elad Ben-Avraham, Roei Herzig, Karttikeya Mangalam, Amir Bar, Anna Rohrbach, Leonid Karlinsky, Trevor Darrell, Amir GlobersonComments: Tech reportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [479] arXiv:2206.06359 [pdf, other]
-
Title: EnergyMatch: Energy-based Pseudo-Labeling for Semi-Supervised LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [480] arXiv:2206.06360 [pdf, other]
-
Title: ARF: Artistic Radiance FieldsComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [481] arXiv:2206.06363 [pdf, other]
-
Title: Discovering Object Masks with Transformers for Unsupervised Semantic SegmentationComments: Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [482] arXiv:2206.06404 [pdf, other]
-
Title: Compositional Mixture Representations for Vision and TextComments: Workshop on Learning with Limited Labelled Data for Image and Video Understanding (L3D-IVU), CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [483] arXiv:2206.06420 [pdf, other]
-
Title: GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [484] arXiv:2206.06427 [pdf, other]
-
Title: A Multi-purpose Real Haze Benchmark with Quantifiable Haze Levels and Ground TruthAuthors: Priya Narayanan, Xin Hu, Zhenyu Wu, Matthew D Thielke, John G Rogers, Andre V Harrison, John A D'Agostino, James D Brown, Long P Quang, James R Uplinger, Heesung Kwon, Zhangyang WangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [485] arXiv:2206.06430 [pdf]
-
Title: A Training Method For VideoPose3D With Ideology of Action RecognitionAuthors: Hao BaiComments: Published by IEEE, on conference CONF-SPMLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [486] arXiv:2206.06435 [pdf]
-
Title: ICP Algorithm: Theory, Practice And Its SLAM-oriented TaxonomyAuthors: Hao BaiComments: Accepted by CONF-CDS'22Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [487] arXiv:2206.06461 [pdf, other]
-
Title: Self-Supervised Representation Learning With MUlti-Segmental Informational Coding (MUSIC)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [488] arXiv:2206.06466 [pdf, other]
-
Title: Revisiting the Shape-Bias of Deep Learning for Dermoscopic Skin Lesion ClassificationAuthors: Adriano Lucieri, Fabian Schmeisser, Christoph Peter Balada, Shoaib Ahmed Siddiqui, Andreas Dengel, Sheraz AhmedComments: Submitted preprint accepted for MIUA 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [489] arXiv:2206.06481 [pdf, other]
-
Title: RigNeRF: Fully Controllable Neural 3D PortraitsComments: The project page can be found here: this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [490] arXiv:2206.06484 [pdf, other]
-
Title: On Image Segmentation With Noisy Labels: Characterization and Volume Properties of the Optimal Solutions to Accuracy and DiceSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [491] arXiv:2206.06487 [pdf, other]
-
Title: The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge DistillationComments: The first three authors contribute equallySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [492] arXiv:2206.06488 [pdf, other]
-
Title: Multimodal Learning with Transformers: A SurveySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [493] arXiv:2206.06490 [pdf, other]
-
Title: Learning Task-Independent Game State Representations from Unlabeled ImagesComments: Conference on Games (CoG) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [494] arXiv:2206.06506 [pdf, other]
-
Title: Spiking Neural Networks for Frame-based and Event-based Single Object LocalizationComments: 21 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [495] arXiv:2206.06510 [pdf, other]
-
Title: Generalizable Method for Face Anti-Spoofing with Semi-Supervised LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [496] arXiv:2206.06518 [pdf, other]
-
Title: Estimating Pose from Pressure Data for Smart Beds with Deep Image-based Pose EstimatorsComments: The version of record of this article, first published in Applied Intelligence, is available online at Publisher's website this https URL arXiv admin note: substantial text overlap with arXiv:1908.08919Journal-ref: Applied Intelligence (2021): 1-15Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [497] arXiv:2206.06533 [pdf, other]
-
Title: 3D scene reconstruction from monocular spherical video with motion parallaxAuthors: Kenji TanakaComments: 13 pages, 18 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
- [498] arXiv:2206.06544 [pdf, ps, other]
-
Title: A Survey of Automated Data Augmentation Algorithms for Deep Learning-based Image Classification TasksComments: 68 pages, 9 figures. Submitted to Knowledge and Information Systems (KAIS)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [499] arXiv:2206.06607 [pdf, other]
-
Title: Plug-and-Play Pseudo Label Correction Network for Unsupervised Person Re-identificationComments: 19 pages,9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [500] arXiv:2206.06608 [pdf, other]
-
Title: Label Matching Semi-Supervised Object DetectionAuthors: Binbin Chen, Weijie Chen, Shicai Yang, Yunyi Xuan, Jie Song, Di Xie, Shiliang Pu, Mingli Song, Yueting ZhuangComments: To appear in CVPR 2022. Code is coming soon: this https URLJournal-ref: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [501] arXiv:2206.06619 [pdf, other]
-
Title: TransVG++: End-to-End Visual Grounding with Language Conditioned Vision TransformerAuthors: Jiajun Deng, Zhengyuan Yang, Daqing Liu, Tianlang Chen, Wengang Zhou, Yanyong Zhang, Houqiang Li, Wanli OuyangComments: arXiv admin note: text overlap with arXiv:2104.08541Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [502] arXiv:2206.06620 [pdf, other]
-
Title: Slimmable Domain AdaptationAuthors: Rang Meng, Weijie Chen, Shicai Yang, Jie Song, Luojun Lin, Di Xie, Shiliang Pu, Xinchao Wang, Mingli Song, Yueting ZhuangComments: To appear in CVPR 2022. Code is coming soon: this https URLJournal-ref: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [503] arXiv:2206.06637 [pdf, other]
-
Title: RF-Next: Efficient Receptive Field Search for Convolutional Neural NetworksComments: Accepted by TPAMI. This paper is a journal extension of our CVPR 2021 paper (arXiv:2101.00910)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [504] arXiv:2206.06640 [pdf, other]
-
Title: Confidence Score for Source-Free Unsupervised Domain AdaptationComments: ICML 2022 camera readySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [505] arXiv:2206.06665 [pdf, other]
-
Title: Online Easy Example Mining for Weakly-supervised Gland Segmentation from Histology ImagesComments: MICCAI 2022 AccepetedSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [506] arXiv:2206.06694 [pdf, other]
-
Title: ISLES 2022: A multi-center magnetic resonance imaging stroke lesion segmentation datasetAuthors: Moritz Roman Hernandez Petzsche, Ezequiel de la Rosa, Uta Hanning, Roland Wiest, Waldo Enrique Valenzuela Pinilla, Mauricio Reyes, Maria Ines Meyer, Sook-Lei Liew, Florian Kofler, Ivan Ezhov, David Robben, Alexander Hutton, Tassilo Friedrich, Teresa Zarth, Johannes Bürkle, The Anh Baran, Bjoern Menze, Gabriel Broocks, Lukas Meyer, Claus Zimmer, Tobias Boeckh-Behrens, Maria Berndt, Benno Ikenberg, Benedikt Wiestler, Jan S. KirschkeComments: 12 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [507] arXiv:2206.06712 [pdf, other]
-
Title: Visual Radial Basis Q-NetworkComments: This paper has been accepted for publication at the 3rd International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2022. \c{opyright}Springer Nature 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [508] arXiv:2206.06714 [pdf, other]
-
Title: Interpretable Gait Recognition by Granger CausalityComments: Preprint. Full paper accepted at the IEEE/IAPR International Conference on Pattern Recognition (ICPR), Montreal, Canada, August 2022. 7 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [509] arXiv:2206.06715 [pdf, other]
-
Title: Semi-signed prioritized neural fitting for surface reconstruction from unoriented point cloudsAuthors: Runsong Zhu, Di Kang, Ka-Hei Hui, Yue Qian, Xuefei Zhe, Zhen Dong, Linchao Bao, Pheng-Ann Heng, Chi-Wing FuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [510] arXiv:2206.06731 [pdf]
-
Title: Learning Dense Features for Point Cloud Registration Using a Graph Attention NetworkComments: 15 pages, 3 figuresJournal-ref: Applied Sciences 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [511] arXiv:2206.06741 [pdf, other]
-
Title: Recurrent Transformer Variational Autoencoders for Multi-Action Motion SynthesisComments: accepted at Transformers for Vision workshop at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [512] arXiv:2206.06743 [pdf, other]
-
Title: Weakly-Supervised Crack DetectionComments: Submitted to IEEE Transactions on Intelligent Transportation SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [513] arXiv:2206.06761 [pdf, other]
-
Title: Exploring Adversarial Attacks and Defenses in Vision Transformers trained with DINOComments: ICML 2022 Workshop paper accepted at AdvML FrontiersSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [514] arXiv:2206.06801 [pdf, other]
-
Title: Peripheral Vision TransformerComments: Accepted to NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [515] arXiv:2206.06803 [pdf, other]
-
Title: Asymmetric Dual-Decoder U-Net for Joint Rain and Haze RemovalComments: 12 pages, 35 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [516] arXiv:2206.06829 [pdf, other]
-
Title: Efficient Decoder-free Object Detection with TransformersAuthors: Peixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua ShenComments: Update metadata, 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [517] arXiv:2206.06922 [pdf, other]
-
Title: Object Scene Representation TransformerAuthors: Mehdi S. M. Sajjadi, Daniel Duckworth, Aravindh Mahendran, Sjoerd van Steenkiste, Filip Pavetić, Mario Lučić, Leonidas J. Guibas, Klaus Greff, Thomas KipfComments: Accepted at NeurIPS '22. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [518] arXiv:2206.06923 [pdf]
-
Title: A Multi-task Framework for Infrared Small Target Detection and SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [519] arXiv:2206.06930 [pdf, other]
-
Title: Comprehending and Ordering Semantics for Image CaptioningComments: CVPR 2022; Code is publicly available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
- [520] arXiv:2206.06931 [pdf, other]
-
Title: Stand-Alone Inter-Frame Attention in Video ModelsComments: CVPR 2022; Code is publicly available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [521] arXiv:2206.06948 [pdf, other]
-
Title: Monitoring Urban Forests from Auto-Generated Segmentation MapsComments: accepted for presentation and publication at IGARSS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
- [522] arXiv:2206.06959 [pdf, other]
-
Title: AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled DataComments: CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [523] arXiv:2206.07011 [pdf, other]
-
Title: Consistent Video Instance Segmentation with Inter-Frame Recurrent AttentionComments: 11 pages, 5 figures, 4 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [524] arXiv:2206.07018 [src]
-
Title: Turning a Curse Into a Blessing: Enabling Clean-Data-Free Defenses by Model InversionComments: Because of an equation and author informational error, this paper has been withdrawn by the submitterSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [525] arXiv:2206.07028 [pdf, other]
-
Title: Learning 3D Object Shape and Layout without 3D SupervisionComments: CVPR 2022, project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [526] arXiv:2206.07036 [pdf, other]
-
Title: Accurate 3D Body Shape Regression using Metric and Semantic AttributesAuthors: Vasileios Choutas, Lea Muller, Chun-Hao P. Huang, Siyu Tang, Dimitrios Tzionas, Michael J. BlackComments: First two authors contributed equallyJournal-ref: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [527] arXiv:2206.07038 [pdf, other]
-
Title: AnimeSR: Learning Real-World Super-Resolution Models for Animation VideosComments: NeurIPS 2022. Codes and models are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [528] arXiv:2206.07045 [pdf, other]
-
Title: ReCo: Retrieve and Co-segment for Zero-shot TransferComments: Tech report. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [529] arXiv:2206.07047 [pdf, other]
-
Title: RGB-Multispectral Matching: Dataset, Learning Methodology, EvaluationAuthors: Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di StefanoComments: CVPR 2022, New Orleans. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [530] arXiv:2206.07117 [pdf, other]
-
Title: TriHorn-Net: A Model for Accurate Depth-Based 3D Hand Pose EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [531] arXiv:2206.07125 [pdf, other]
-
Title: Self-Supervised Pretraining for Differentially Private LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [532] arXiv:2206.07160 [pdf, other]
-
Title: LAVENDER: Unifying Video-Language Understanding as Masked Language ModelingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [533] arXiv:2206.07162 [pdf, other]
-
Title: Category-Agnostic 6D Pose Estimation with Conditional Neural ProcessesComments: Accepted at CVPR2022 workshop: Women in Computer Vision (WiCV)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [534] arXiv:2206.07163 [pdf, other]
-
Title: DeepRecon: Joint 2D Cardiac Segmentation and 3D Volume Reconstruction via A Structure-Specific Generative MethodAuthors: Qi Chang, Zhennan Yan, Mu Zhou, Di Liu, Khalid Sawalha, Meng Ye, Qilong Zhangli, Mikael Kanski, Subhi Al Aref, Leon Axel, Dimitris MetaxasComments: MICCAI2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [535] arXiv:2206.07171 [pdf, other]
-
Title: Automated image analysis in large-scale cellular electron microscopy: A literature surveySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [536] arXiv:2206.07198 [pdf, other]
-
Title: Surgical Phase Recognition in Laparoscopic CholecystectomyAuthors: Yunfan Li, Vinayak Shenoy, Prateek Prasanna, I.V. Ramakrishnan, Haibin Ling, Himanshu GuptaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [537] arXiv:2206.07207 [pdf, other]
-
Title: Multimodal Event Graphs: Towards Event Centric Understanding of Multimodal WorldAuthors: Hammad A. Ayyubi, Christopher Thomas, Lovish Chum, Rahul Lokesh, Yulei Niu, Xudong Lin, Long Chen, Jaywon Koo, Sounak Ray, Shih-Fu ChangSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [538] arXiv:2206.07240 [pdf, other]
-
Title: Test-Time Adaptation for Visual Document UnderstandingSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [539] arXiv:2206.07255 [pdf, other]
-
Title: GRAM-HD: 3D-Consistent Image Generation at High Resolution with Generative Radiance ManifoldsComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [540] arXiv:2206.07259 [pdf, other]
-
Title: Self-Supervised Learning of Image Scale and OrientationComments: Presented in BMVC 2021, code is available on this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [541] arXiv:2206.07267 [pdf, other]
-
Title: Rethinking Generalization in Few-Shot ClassificationComments: Accepted at NeurIPS 2022. Code available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [542] arXiv:2206.07272 [pdf]
-
Title: Machine vision for vial positioning detection toward the safe automation of material synthesisAuthors: Leslie Ching Ow Tiong, Hyuk Jun Yoo, Na Yeon Kim, Kwan-Young Lee, Sang Soo Han, Donghun KimSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [543] arXiv:2206.07282 [pdf, other]
-
Title: Human Eyes Inspired Recurrent Neural Networks are More Robust Against Adversarial NoisesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [544] arXiv:2206.07298 [pdf, other]
-
Title: S$^2$-FPN: Scale-ware Strip Attention Guided Feature Pyramid Network for Real-time Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [545] arXiv:2206.07307 [pdf, other]
-
Title: VCT: A Video Compression TransformerAuthors: Fabian Mentzer, George Toderici, David Minnen, Sung-Jin Hwang, Sergi Caelles, Mario Lucic, Eirikur AgustssonComments: NeurIPS'22 Camera Ready Version. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [546] arXiv:2206.07326 [pdf, other]
-
Title: Recent Advances in Scene Image Representation and ClassificationComments: This paper is under review in Multimedia Tools and Applications (Springer) journal. This article may be deleted or updated based on the policies of the journalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [547] arXiv:2206.07344 [pdf, other]
-
Title: Automatic Detection of Rice Disease in Images of Various Leaf SizesAuthors: Kantip Kiratiratanapruk, Pitchayagan Temniranrat, Wasin Sinthupinyo, Sanparith Marukatat, Sujin PatarapuwadolComments: 28 pages, 13 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [548] arXiv:2206.07348 [pdf]
-
Title: Unsupervised multi-branch Capsule for Hyperspectral and LiDAR classificationComments: 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [549] arXiv:2206.07349 [pdf, other]
-
Title: XMorpher: Full Transformer for Deformable Medical Image Registration via Cross AttentionAuthors: Jiacheng Shi, Yuting He, Youyong Kong, Jean-Louis Coatrieux, Huazhong Shu, Guanyu Yang, Shuo LiComments: accepted by MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [550] arXiv:2206.07352 [pdf]
-
Title: Robust SAR ATR on MSTAR with Deep Learning Models trained on Full Synthetic MOCEM dataSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Image and Video Processing (eess.IV)
- [551] arXiv:2206.07372 [pdf, other]
- [552] arXiv:2206.07389 [pdf, other]
-
Title: Ultra Fast Deep Lane Detection with Hybrid Anchor Driven Ordinal ClassificationComments: TPAMI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [553] arXiv:2206.07394 [pdf, other]
-
Title: Efficient Adaptive Ensembling for Image ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [554] arXiv:2206.07423 [pdf, other]
-
Title: Zero-shot object goal visual navigationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [555] arXiv:2206.07431 [pdf, other]
-
Title: Physically-admissible polarimetric data augmentation for road-scene analysisAuthors: Cyprien Ruffino, Rachel Blin, Samia Ainouz, Gilles Gasso, Romain Hérault, Fabrice Meriaudeau, Stéphane CanuSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [556] arXiv:2206.07434 [pdf, other]
-
Title: Self-Supervised Implicit Attention: Guided Attention by The Model ItselfSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [557] arXiv:2206.07435 [pdf, other]
-
Title: Forecasting of depth and ego-motion with transformers and self-supervisionComments: Accepted in ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [558] arXiv:2206.07458 [pdf, other]
-
Title: VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature SelectionComments: Accepted by ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [559] arXiv:2206.07459 [pdf, other]
-
Title: READ: Aggregating Reconstruction Error into Out-of-distribution DetectionComments: Accepted to AAAI 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [560] arXiv:2206.07460 [pdf, other]
- [561] arXiv:2206.07468 [pdf]
-
Title: PolyU-BPCoMa: A Dataset and Benchmark Towards Mobile Colorized Mapping Using a Backpack Multisensorial SystemComments: 11 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [562] arXiv:2206.07510 [pdf, other]
-
Title: Deep Multi-Task Networks For Occluded Pedestrian Pose EstimationAuthors: Arindam Das, Sudip Das, Ganesh Sistu, Jonathan Horgan, Ujjwal Bhattacharya, Edward Jones, Martin Glavin, Ciarán EisingComments: 4 pages, 5 tables, 2 figuresJournal-ref: Proceedings of the 2022 Irish Machine Vision and Image Processing ConferenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [563] arXiv:2206.07557 [pdf, other]
-
Title: How to Reduce Change Detection to Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [564] arXiv:2206.07565 [pdf, other]
-
Title: A Meta-Analysis of Distributionally-Robust ModelsComments: To be presented at ICML Workshop on Principles of Distribution Shift 2022. Copyright 2022 by the author(s)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [565] arXiv:2206.07578 [src]
-
Title: E2V-SDE: From Asynchronous Events to Fast and Continuous Video Reconstruction via Neural Stochastic Differential EquationsComments: arXiv admin note: This submission has been withdrawn by arXiv administrators due to inappropriate text overlap with external sources. Additional information at this https URLJournal-ref: The IEEE / CVF Computer Vision and Pattern Recognition Conference 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [566] arXiv:2206.07580 [pdf, other]
-
Title: Evaluating object detector ensembles for improving the robustness of artifact detection in endoscopic video streamsAuthors: Pedro Esteban Chavarrias-Solano, Carlos Axel Garcia-Vega, Francisco Javier Lopez-Tiro, Gilberto Ochoa-Ruiz, Thomas Bazin, Dominique Lamarque, Christian DaulSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [567] arXiv:2206.07634 [pdf, other]
-
Title: Real3D-Aug: Point Cloud Augmentation by Placing Real Objects with Occlusion Handling for 3D Detection and SegmentationComments: Submitted on 15th June 2022 to IEEE RA-L journalSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [568] arXiv:2206.07643 [pdf, other]
-
Title: Coarse-to-Fine Vision-Language Pre-training with Fusion in the BackboneAuthors: Zi-Yi Dou, Aishwarya Kamath, Zhe Gan, Pengchuan Zhang, Jianfeng Wang, Linjie Li, Zicheng Liu, Ce Liu, Yann LeCun, Nanyun Peng, Jianfeng Gao, Lijuan WangComments: NeurIPS 2022. Project Website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [569] arXiv:2206.07662 [pdf, other]
-
Title: SP-ViT: Learning 2D Spatial Priors for Vision TransformersAuthors: Yuxuan Zhou, Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, Lei Zhang, Margret Keuper, Xiansheng HuaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [570] arXiv:2206.07669 [pdf, other]
-
Title: A Unified Sequence Interface for Vision TasksComments: The first three authors contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [571] arXiv:2206.07684 [pdf, other]
-
Title: AVATAR: Unconstrained Audiovisual Speech RecognitionAuthors: Valentin Gabeur, Paul Hongsuck Seo, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia SchmidSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [572] arXiv:2206.07687 [pdf, other]
-
Title: Structured Sparsity Learning for Efficient Video Super-ResolutionSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [573] arXiv:2206.07689 [pdf, other]
-
Title: Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022Authors: Elad Ben-Avraham, Roei Herzig, Karttikeya Mangalam, Amir Bar, Anna Rohrbach, Leonid Karlinsky, Trevor Darrell, Amir GlobersonComments: Ego4D CVPR22 Object State Localization challenge. arXiv admin note: substantial text overlap with arXiv:2206.06346Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [574] arXiv:2206.07690 [pdf, other]
-
Title: ELUDE: Generating interpretable explanations via a decomposition into labelled and unlabelled featuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [575] arXiv:2206.07692 [pdf, other]
-
Title: A Simple Data Mixing Prior for Improving Self-Supervised LearningComments: CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [576] arXiv:2206.07695 [pdf, other]
-
Title: VoxGRAF: Fast 3D-Aware Image Synthesis with Sparse Voxel GridsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [577] arXiv:2206.07696 [pdf, other]
-
Title: Diffusion Models for Video Prediction and InfillingComments: Published in TMLR (11/2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [578] arXiv:2206.07698 [pdf, other]
-
Title: Neural Deformable Voxel Grid for Fast Optimization of Dynamic View SynthesisComments: Technical Report: 29 pages; project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [579] arXiv:2206.07699 [pdf, other]
-
Title: Prefix Language Models are Unified Modal LearnersComments: 22 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [580] arXiv:2206.07700 [pdf, other]
-
Title: Masked Siamese ConvNetsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [581] arXiv:2206.07704 [pdf, other]
-
Title: Waymo Open Dataset: Panoramic Video Panoptic SegmentationAuthors: Jieru Mei, Alex Zihao Zhu, Xinchen Yan, Hang Yan, Siyuan Qiao, Yukun Zhu, Liang-Chieh Chen, Henrik Kretzschmar, Dragomir AnguelovComments: Our dataset can be found at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [582] arXiv:2206.07705 [pdf, other]
-
Title: LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D DetectionComments: Find the primary metrics for the 2022 Waymo Open Dataset 3D Camera-Only Detection Challenge at this https URL . Find the code at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [583] arXiv:2206.07706 [pdf, other]
-
Title: Masked Frequency Modeling for Self-Supervised Visual Pre-TrainingComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [584] arXiv:2206.07707 [pdf, other]
-
Title: Variable Bitrate Neural FieldsAuthors: Towaki Takikawa, Alex Evans, Jonathan Tremblay, Thomas Müller, Morgan McGuire, Alec Jacobson, Sanja FidlerComments: SIGGRAPH 2022. Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
- [585] arXiv:2206.07710 [pdf, other]
-
Title: PlanarRecon: Real-time 3D Plane Detection and Reconstruction from Posed Monocular VideosComments: CVPR 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [586] arXiv:2206.07764 [pdf, other]
-
Title: SAVi++: Towards End-to-End Object-Centric Learning from Real-World VideosAuthors: Gamaleldin F. Elsayed, Aravindh Mahendran, Sjoerd van Steenkiste, Klaus Greff, Michael C. Mozer, Thomas KipfComments: Project page at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [587] arXiv:2206.07771 [pdf, other]
-
Title: Discrete Contrastive Diffusion for Cross-Modal and Conditional GenerationComments: Project at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [588] arXiv:2206.07802 [pdf, other]
-
Title: What makes domain generalization hard?Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
- [589] arXiv:2206.07835 [pdf, other]
-
Title: Disentangling visual and written concepts in CLIPSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [590] arXiv:2206.07846 [pdf, ps, other]
-
Title: Action Spotting using Dense Detection Anchors Revisited: Submission to the SoccerNet Challenge 2022Comments: v2: a few more experiments, more detailed method descriptionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [591] arXiv:2206.07850 [pdf, other]
-
Title: HF-NeuS: Improved Surface Reconstruction Using High-Frequency DetailsComments: To appear in NeurIPS 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [592] arXiv:2206.07893 [pdf, other]
-
Title: PeQuENet: Perceptual Quality Enhancement of Compressed Video with Adaptation- and Attention-based NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
- [593] arXiv:2206.07897 [pdf, other]
-
Title: NCAGC: A Neighborhood Contrast Framework for Attributed Graph ClusteringSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [594] arXiv:2206.07932 [pdf, other]
-
Title: Lifelong Wandering: A realistic few-shot online continual learning settingComments: CVPR 2022 Workshop on Continual LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [595] arXiv:2206.07934 [pdf, other]
-
Title: BANet: Motion Forecasting with Boundary Aware NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [596] arXiv:2206.07953 [pdf, other]
-
Title: Analysis and Extensions of Adversarial Training for Video ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [597] arXiv:2206.07959 [pdf, other]
-
Title: Simple-BEV: What Really Matters for Multi-Sensor BEV Perception?Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [598] arXiv:2206.07967 [pdf, other]
-
Title: DreamNet: A Deep Riemannian Network based on SPD Manifold Learning for Visual ClassificationComments: 9 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [599] arXiv:2206.07981 [pdf, other]
-
Title: Multi-scale Cooperative Multimodal Transformers for Multimodal Sentiment Analysis in VideosSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [600] arXiv:2206.07986 [pdf, other]
-
Title: Image Captioning based on Feature Refinement and Reflective DecodingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [601] arXiv:2206.07990 [pdf, other]
-
Title: Patch-level Representation Learning for Self-supervised Vision TransformersComments: Accepted to CVPR 2022 (Oral). Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [602] arXiv:2206.07994 [pdf, other]
-
Title: Joint Class-Affinity Loss Correction for Robust Medical Image Segmentation with Noisy LabelsComments: Accepted to MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [603] arXiv:2206.08009 [pdf, other]
-
Title: Balancing Discriminability and Transferability for Source-Free Domain AdaptationAuthors: Jogendra Nath Kundu, Akshay Kulkarni, Suvaansh Bhambri, Deepesh Mehta, Shreyas Kulkarni, Varun Jampani, R. Venkatesh BabuComments: ICML 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [604] arXiv:2206.08016 [pdf, other]
-
Title: Backbones-Review: Feature Extraction Networks for Deep Learning and Deep Reinforcement Learning ApproachesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [605] arXiv:2206.08026 [pdf, other]
-
Title: DeepFormableTag: End-to-end Generation and Recognition of Deformable Fiducial MarkersJournal-ref: ACM Transactions on Graphics 40, 4, Article 67 (August 2021)Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [606] arXiv:2206.08083 [pdf, other]
-
Title: CARLANE: A Lane Detection Benchmark for Unsupervised Domain Adaptation from Simulation to multiple Real-World DomainsComments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks, 22 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [607] arXiv:2206.08084 [pdf, other]
-
Title: An Improved Normed-Deformable Convolution for Crowd CountingJournal-ref: IEEE Signal Processing Letters 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [608] arXiv:2206.08105 [pdf, other]
-
Title: A Simple Baseline for Adversarial Domain Adaptation-based Unsupervised Flood ForecastingComments: Technical reportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [609] arXiv:2206.08126 [pdf, other]
-
Title: Channel Importance Matters in Few-Shot Image ClassificationComments: Accepted to ICML 2022; code available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [610] arXiv:2206.08129 [pdf, other]
-
Title: Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong BaselineComments: Accepted at NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [611] arXiv:2206.08150 [pdf, other]
-
Title: Self-Adaptive Label Augmentation for Semi-supervised Few-shot ClassificationComments: 9 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [612] arXiv:2206.08155 [pdf, other]
-
Title: Zero-Shot Video Question Answering via Frozen Bidirectional Language ModelsComments: NeurIPS 2022 Camera-Ready; Project Webpage: this https URL; 25 pages; 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [613] arXiv:2206.08158 [pdf, other]
-
Title: Volumetric Supervised Contrastive Learning for Seismic Semantic SegmentationJournal-ref: The International Meeting for Applied Geoscience & Energy (IMAGE) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
- [614] arXiv:2206.08171 [pdf, other]
-
Title: K-Radar: 4D Radar Object Detection for Autonomous Driving in Various Weather ConditionsComments: Accepted at NeurIPS 2022 Datasets and Benchmarks TrackJournal-ref: Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS Datasets and Benchmarks 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [615] arXiv:2206.08172 [pdf, other]
-
Title: RefCrowd: Grounding the Target in Crowd with Referring ExpressionsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [616] arXiv:2206.08176 [pdf, other]
-
Title: Level 2 Autonomous Driving on a Single Device: Diving into the Devils of OpenpilotAuthors: Li Chen, Tutian Tang, Zhitian Cai, Yang Li, Penghao Wu, Hongyang Li, Jianping Shi, Junchi Yan, Yu QiaoComments: Tech report. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [617] arXiv:2206.08182 [src]
-
Title: Nucleus Segmentation and Analysis in Breast Cancer with the MIScnn FrameworkComments: Error in Table 3.4 (moved row)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [618] arXiv:2206.08186 [pdf, other]
-
Title: Asymptotic Soft Cluster Pruning for Deep Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [619] arXiv:2206.08194 [pdf, other]
-
Title: Online Segmentation of LiDAR Sequences: Dataset and AlgorithmComments: Code and data are available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [620] arXiv:2206.08206 [pdf, other]
-
Title: Selective Multi-Scale Learning for Object DetectionComments: Accepted by ICANN2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [621] arXiv:2206.08219 [pdf, other]
-
Title: HaGRID - HAnd Gesture Recognition Image DatasetComments: 11 pages, 9 figures, open-source dataset for computer visionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [622] arXiv:2206.08222 [pdf, other]
-
Title: Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking ConsistencySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [623] arXiv:2206.08224 [pdf, other]
-
Title: Multi scale Feature Extraction and Fusion for Online Knowledge DistillationComments: 12 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [624] arXiv:2206.08227 [pdf, other]
-
Title: Delving into the Scale Variance Problem in Object DetectionComments: Accepted by ICTAI2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [625] arXiv:2206.08229 [pdf, other]
-
Title: Open-Set Recognition with Gradient-Based RepresentationsComments: Published at IEEE International Conference on Image Processing (ICIP) 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [626] arXiv:2206.08236 [pdf, other]
-
Title: Simple and Efficient Architectures for Semantic SegmentationAuthors: Dushyant Mehta, Andrii Skliar, Haitam Ben Yahia, Shubhankar Borse, Fatih Porikli, Amirhossein Habibian, Tijmen BlankevoortComments: To be presented at Efficient Deep Learning for Computer Vision Workshop at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [627] arXiv:2206.08275 [pdf, other]
-
Title: Rank the triplets: A ranking-based multiple instance learning framework for detecting HPV infection in head and neck cancers using routine H&E imagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [628] arXiv:2206.08304 [pdf, other]
-
Title: Adversarial Patch Attacks and Defences in Vision-Based Tasks: A SurveyComments: A. Sharma and Y. Bian share equal contributionSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [629] arXiv:2206.08339 [pdf, other]
-
Title: iBoot: Image-bootstrapped Self-Supervised Video Representation LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [630] arXiv:2206.08343 [pdf, other]
-
Title: Realistic One-shot Mesh-based Head AvatarsSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [631] arXiv:2206.08345 [pdf]
-
Title: Real-World Single Image Super-Resolution Under Rainy ConditionAuthors: Mohammad Shahab UddinSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [632] arXiv:2206.08347 [pdf, other]
-
Title: Beyond Supervised vs. Unsupervised: Representative Benchmarking and Analysis of Image Representation LearningComments: CVPR 2022, project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [633] arXiv:2206.08355 [pdf, other]
-
Title: FWD: Real-time Novel View Synthesis with Forward Warping and DepthComments: CVPR 2022. Project website this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [634] arXiv:2206.08356 [pdf, other]
-
Title: OmniMAE: Single Model Masked Pretraining on Images and VideosAuthors: Rohit Girdhar, Alaaeldin El-Nouby, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan MisraSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [635] arXiv:2206.08357 [pdf, other]
-
Title: Spatially-Adaptive Multilayer Selection for GAN Inversion and EditingSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [636] arXiv:2206.08358 [pdf, other]
-
Title: MixGen: A New Multi-Modal Data AugmentationComments: First three authors contributed equally. Code are available at this https URL Oral presentation at WACV 2023 Pretraining Large Vision and Multimodal Models WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [637] arXiv:2206.08361 [pdf, other]
-
Title: Controllable 3D Face Synthesis with Conditional Generative Occupancy FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [638] arXiv:2206.08362 [pdf, other]
-
Title: Unified Fourier-based Kernel and Nonlinearity Design for Equivariant Networks on Homogeneous SpacesComments: Accepted at ICML2022 Thirty-ninth International Conference on Machine LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [639] arXiv:2206.08365 [pdf, other]
-
Title: Virtual Correspondence: Humans as a Cue for Extreme-View GeometryComments: CVPR 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [640] arXiv:2206.08367 [pdf, other]
-
Title: SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain AdaptationAuthors: Tao Sun, Mattia Segu, Janis Postels, Yuxuan Wang, Luc Van Gool, Bernt Schiele, Federico Tombari, Fisher YuComments: Published at IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [641] arXiv:2206.08368 [pdf, other]
-
Title: Unbiased 4D: Monocular 4D Reconstruction with a Neural Deformation ModelComments: 26 pages, 17 figures, 8 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [642] arXiv:2206.08405 [pdf, ps, other]
-
Title: Going Deeper than Tracking: a Survey of Computer-Vision Based Recognition of Animal Pain and Affective StatesAuthors: Sofia Broomé, Marcelo Feighelstein, Anna Zamansky, Gabriel Carreira Lencioni, Pia Haubro Andersen, Francisca Pessanha, Marwa Mahmoud, Hedvig Kjellström, Albert Ali SalahSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [643] arXiv:2206.08423 [pdf, other]
-
Title: IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering in Indoor ScenesComments: CVPR 22 camera ready version with supplementarySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [644] arXiv:2206.08427 [pdf, other]
-
Title: SATBench: Benchmarking the speed-accuracy tradeoff in object recognition by humans and dynamic neural networksAuthors: Ajay Subramanian, Sara Price, Omkar Kumbhar, Elena Sizikova, Najib J. Majaj, Denis G. PelliComments: 19 pages, 12 figures. Under Review at NeurIPS Datasets and Benchmarks Track 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [645] arXiv:2206.08428 [pdf, other]
-
Title: EyeNeRF: A Hybrid Representation for Photorealistic Synthesis, Animation and Relighting of Human EyesAuthors: Gengyan Li (1 and 2), Abhimitra Meka (1), Franziska Müller (1), Marcel C. Bühler (2), Otmar Hilliges (2), Thabo Beeler (1) ((1) Google Inc., (2) ETH Zürich)Comments: 16 pages, 16 figures, 1 table, to be published in ACM Transactions on Graphics (TOG) (Volume: 41, Issue: 4), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [646] arXiv:2206.08429 [pdf, other]
-
Title: Scalable Temporal Localization of Sensitive Activities in Movies and TV EpisodesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [647] arXiv:2206.08460 [pdf, other]
-
Title: TUSK: Task-Agnostic Unsupervised KeypointsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [648] arXiv:2206.08462 [pdf, other]
-
Title: Recursive Neural Programs: Variational Learning of Image Grammars and Part-Whole HierarchiesComments: 9 pages, 6 figures. fixed LaTeX typo for algorithm referenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [649] arXiv:2206.08477 [pdf, other]
-
Title: Backdoor Attacks on Vision TransformersAuthors: Akshayvarun Subramanya, Aniruddha Saha, Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Hamed PirsiavashSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [650] arXiv:2206.08488 [pdf, other]
-
Title: Controllable Image EnhancementSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [651] arXiv:2206.08500 [pdf, other]
-
Title: What do navigation agents learn about their environment?Comments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [652] arXiv:2206.08509 [pdf, other]
-
Title: Neural Architecture Adaptation for Object Detection by Searching Channel Dimensions and Mapping Pre-trained ParametersComments: Accepted to ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [653] arXiv:2206.08524 [pdf, other]
-
Title: CDNet: Contrastive Disentangled Network for Fine-Grained Image Categorization of Ocular B-Scan UltrasoundAuthors: Ruilong Dan, Yunxiang Li, Yijie Wang, Gangyong Jia, Ruiquan Ge, Juan Ye, Qun Jin, Yaqi WangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [654] arXiv:2206.08537 [pdf, ps, other]
-
Title: Large-Margin Representation Learning for Texture ClassificationAuthors: Jonathan de Matos, Luiz Eduardo Soares de Oliveira, Alceu de Souza Britto Junior, Alessandro Lameiras KoerichComments: 7 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [655] arXiv:2206.08547 [pdf, other]
-
Title: Texture Generation Using Graph Generative Adversarial Network And Differentiable RenderingComments: 17 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [656] arXiv:2206.08549 [pdf, other]
-
Title: Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [657] arXiv:2206.08566 [pdf, other]
-
Title: Active Data Discovery: Mining Unknown Data using Submodular Information MeasuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [658] arXiv:2206.08567 [pdf, other]
-
Title: Rectify ViT Shortcut Learning by Visual SaliencyAuthors: Chong Ma, Lin Zhao, Yuzhong Chen, David Weizhong Liu, Xi Jiang, Tuo Zhang, Xintao Hu, Dinggang Shen, Dajiang Zhu, Tianming LiuComments: NeurIPS2022 Under ReviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [659] arXiv:2206.08568 [pdf, other]
-
Title: Multi-Contextual Predictions with Vision Transformer for Video Anomaly DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [660] arXiv:2206.08572 [pdf, other]
-
Title: Enhanced Bi-directional Motion Estimation for Video Frame InterpolationComments: Accepted by WACV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [661] arXiv:2206.08585 [pdf, other]
-
Title: HairFIT: Pose-Invariant Hairstyle Transfer via Flow-based Hair Alignment and Semantic-Region-Aware InpaintingAuthors: Chaeyeon Chung, Taewoo Kim, Hyelin Nam, Seunghwan Choi, Gyojung Gu, Sunghyun Park, Jaegul ChooComments: BMVC 2021 Oral PresentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [662] arXiv:2206.08605 [pdf]
-
Title: On Efficient Real-Time Semantic Segmentation: A SurveyComments: 19 pages, 13 figures, 4 tables This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [663] arXiv:2206.08610 [pdf, other]
-
Title: Masked Autoencoders for Generic Event Boundary Detection CVPR'2022 Kinetics-GEBD ChallengeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [664] arXiv:2206.08614 [pdf, other]
-
Title: Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic AssessmentComments: Accepted to NeurIPS Track on Datasets and Benchmarks 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [665] arXiv:2206.08632 [pdf, other]
-
Title: Learning Using Privileged Information for Zero-Shot Action RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [666] arXiv:2206.08638 [pdf, ps, other]
- [667] arXiv:2206.08640 [pdf, other]
-
Title: Uncertainty-aware Evaluation of Time-Series Classification for Online Handwriting Recognition with Domain ShiftAuthors: Andreas Klaß, Sven M. Lorenz, Martin W. Lauer-Schmaltz, David Rügamer, Bernd Bischl, Christopher Mutschler, Felix OttSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [668] arXiv:2206.08641 [pdf, other]
-
Title: Diverse Multiple Trajectory Prediction Using a Two-stage Prediction Network Trained with Lane LossComments: RA-L acceptedJournal-ref: IEEE Robotics and Automation Letters (2022)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [669] arXiv:2206.08645 [pdf, other]
-
Title: Local Slot Attention for Vision-and-Language NavigationComments: ICMR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [670] arXiv:2206.08655 [pdf, other]
-
Title: Learning Implicit Feature Alignment Function for Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [671] arXiv:2206.08657 [pdf, other]
-
Title: BridgeTower: Building Bridges Between Encoders in Vision-Language Representation LearningComments: Accepted by AAAI 2023, OralSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [672] arXiv:2206.08683 [pdf, other]
-
Title: AggNet: Learning to Aggregate Faces for Group Membership VerificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [673] arXiv:2206.08701 [pdf]
-
Title: Towards Real-Time Visual Tracking with Graded Color-names FeaturesComments: 12 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [674] arXiv:2206.08712 [pdf, other]
-
Title: An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping FunctionsComments: Accepted to RAL2022, code at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [675] arXiv:2206.08748 [pdf]
-
Title: ReViSe: Remote Vital Signs Measurement Using Smartphone CameraSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
- [676] arXiv:2206.08749 [pdf, other]
-
Title: From a few Accurate 2D Correspondences to 3D Point CloudsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [677] arXiv:2206.08751 [pdf, other]
-
Title: A Database for Perceived Quality Assessment of User-Generated VR VideosSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [678] arXiv:2206.08778 [pdf, other]
-
Title: CTooth: A Fully Annotated 3D Dataset and Benchmark for Tooth Volume Segmentation on Cone Beam Computed Tomography ImagesAuthors: Weiwei Cui, Yaqi Wang, Qianni Zhang, Huiyu Zhou, Dan Song, Xingyong Zuo, Gangyong Jia, Liaoyuan ZengSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [679] arXiv:2206.08789 [pdf]
-
Title: Reconstructing vehicles from orthographic drawings using deep neural networksAuthors: Robin KlippertComments: 9 PagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [680] arXiv:2206.08791 [pdf, other]
-
Title: DU-Net based Unsupervised Contrastive Learning for Cancer Segmentation in Histology ImagesComments: arXiv admin note: text overlap with arXiv:2002.05709 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [681] arXiv:2206.08792 [pdf, other]
-
Title: FD-CAM: Improving Faithfulness and Discriminability of Visual Explanation for CNNsComments: Accepted by ICPR 2022 and also accepted by CVPR 2022 Explainable Artificial Intelligence for Computer Vision (XAI4CV) WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [682] arXiv:2206.08794 [pdf, other]
-
Title: The Importance of Background Information for Out of Distribution GeneralizationComments: 6 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [683] arXiv:2206.08801 [pdf, other]
-
Title: Video Shadow Detection via Spatio-Temporal Interpolation Consistency TrainingAuthors: Xiao Lu, Yihong Cao, Sheng Liu, Chengjiang Long, Zipei Chen, Xuanyu Zhou, Yimin Yang, Chunxia XiaoComments: Accepted in CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [684] arXiv:2206.08833 [pdf]
-
Title: A Comparative Study of Confidence Calibration in Deep Learning: From Computer Vision to Medical ImagingAuthors: Riqiang Gao, Thomas Li, Yucheng Tang, Zhoubing Xu, Michael Kammer, Sanja L. Antic, Kim Sandler, Fabien Moldonado, Thomas A. Lasko, Bennett LandmanComments: 17 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [685] arXiv:2206.08861 [pdf, other]
-
Title: DGMIL: Distribution Guided Multiple Instance Learning for Whole Slide Image ClassificationComments: accepted by MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [686] arXiv:2206.08880 [pdf, other]
-
Title: Improving Generalization of Metric Learning via Listwise Self-distillationComments: 11 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [687] arXiv:2206.08883 [pdf, other]
-
Title: CtrlFormer: Learning Transferable State Representation for Visual Control via TransformerComments: ICML 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [688] arXiv:2206.08898 [pdf, other]
-
Title: SimA: Simple Softmax-free Attention for Vision TransformersComments: Code is available here: $\href{this https URL}{\text{This https URL}}$Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [689] arXiv:2206.08903 [pdf, other]
-
Title: Colonoscopy 3D Video Dataset with Paired Depth from 2D-3D RegistrationAuthors: Taylor L. Bobrow, Mayank Golhar, Rohan Vijayan, Venkata S. Akshintala, Juan R. Garcia, Nicholas J. DurrSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [690] arXiv:2206.08916 [pdf, other]
-
Title: Unified-IO: A Unified Model for Vision, Language, and Multi-Modal TasksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [691] arXiv:2206.08919 [pdf, other]
-
Title: VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMixSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [692] arXiv:2206.08920 [pdf, other]
-
Title: VectorMapNet: End-to-end Vectorized HD Map LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [693] arXiv:2206.08927 [pdf, other]
-
Title: Cross-task Attention Mechanism for Dense Multi-task LearningComments: 10 figures, 6 tables, 23 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [694] arXiv:2206.08929 [pdf, other]
-
Title: TAVA: Template-free Animatable Volumetric ActorsAuthors: Ruilong Li, Julian Tanke, Minh Vo, Michael Zollhofer, Jurgen Gall, Angjoo Kanazawa, Christoph LassnerSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [695] arXiv:2206.08948 [pdf, other]
-
Title: CMT-DeepLab: Clustering Mask Transformers for Panoptic SegmentationAuthors: Qihang Yu, Huiyu Wang, Dahun Kim, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh ChenComments: CVPR 2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [696] arXiv:2206.08954 [pdf, other]
-
Title: Intra-Instance VICReg: Bag of Self-Supervised Image Patch EmbeddingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [697] arXiv:2206.08970 [pdf, other]
-
Title: MultiEarth 2022 -- The Champion Solution for the Matrix Completion Challenge via Multimodal Regression and GenerationComments: CVPR 2022, MultiEarth 2022, Matrix Completion ChallengeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [698] arXiv:2206.08977 [pdf]
-
Title: BN-HTRd: A Benchmark Dataset for Document Level Offline Bangla Handwritten Text Recognition (HTR) and Line SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [699] arXiv:2206.08990 [pdf, other]
-
Title: Shadows Shed Light on 3D ObjectsComments: 19 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [700] arXiv:2206.09027 [pdf, other]
-
Title: Landscape Learning for Neural Network InversionComments: 15 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [701] arXiv:2206.09038 [pdf, other]
-
Title: Validation of Vector Data using Oblique ImagesComments: In Proceedings of 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM GIS'08)Journal-ref: Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM GIS '08), pp. 1-10. 2008Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [702] arXiv:2206.09055 [src]
-
Title: Augmented Imagefication: A Data-driven Fault Detection Method for Aircraft Air Data SensorsComments: a crucial design defect to acquire flying data by simulationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [703] arXiv:2206.09061 [pdf, other]
-
Title: Design of Supervision-Scalable Learning Systems: Methodology and Performance BenchmarkingComments: 16 pages, 12 figures, 4 tables, under consideration at Pattern RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [704] arXiv:2206.09068 [pdf, other]
-
Title: Attention-based Dynamic Subspace Learners for Medical Image AnalysisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [705] arXiv:2206.09071 [pdf, other]
-
Title: Analysis & Computational Complexity Reduction of Monocular and Stereo Depth Estimation TechniquesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [706] arXiv:2206.09082 [pdf, other]
-
Title: Context-aware Proposal Network for Temporal Action DetectionComments: First place winning solution for temporal action detection task in CVPR-2022 AcitivityNet Challenge. arXiv admin note: substantial text overlap with arXiv:2106.11812Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [707] arXiv:2206.09089 [pdf]
-
Title: A Dynamic Data Driven Approach for Explainable Scene UnderstandingComments: Unpublished draft of book chapterSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [708] arXiv:2206.09106 [pdf, other]
-
Title: Embodied Scene-aware Human Pose EstimationComments: NeurIPS 2022. Project website: this https URL Zhengyi Luo and Shun Iwase contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [709] arXiv:2206.09111 [pdf, other]
-
Title: VReBERT: A Simple and Flexible Transformer for Visual Relationship DetectionComments: Published at International Conference on Pattern Recognition (ICPR) 2022, Montreal QuebecSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [710] arXiv:2206.09114 [pdf, other]
-
Title: Bear the Query in Mind: Visual Grounding with Query-conditioned ConvolutionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [711] arXiv:2206.09132 [pdf, other]
-
Title: Replacing Labeled Real-image Datasets with Auto-generated ContoursAuthors: Hirokatsu Kataoka, Ryo Hayamizu, Ryosuke Yamada, Kodai Nakashima, Sora Takashima, Xinyu Zhang, Edgar Josafat Martinez-Noriega, Nakamasa Inoue, Rio YokotaComments: Accepted to CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [712] arXiv:2206.09148 [pdf, other]
-
Title: Deep Compatible Learning for Partially-Supervised Medical Image SegmentationComments: 16 pages, 13 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [713] arXiv:2206.09178 [pdf, other]
-
Title: REVECA -- Rich Encoder-decoder framework for Video Event CAptionerComments: The IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR). LOng-form VidEo Understanding (LOVEU) workshopSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [714] arXiv:2206.09191 [pdf, other]
-
Title: Gender Artifacts in Visual DatasetsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [715] arXiv:2206.09202 [pdf, other]
-
Title: Camera Adaptation for Fundus-Image-Based CVD Risk EstimationComments: This preprint has not undergone peer review (when applicable) or any post-submission improvements or corrections. The Version of Record of this contribution will be added soonSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [716] arXiv:2206.09221 [pdf]
-
Title: 3D Face Parsing via Surface Parameterization and 2D Semantic Segmentation NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [717] arXiv:2206.09242 [pdf, other]
-
Title: GaLeNet: Multimodal Learning for Disaster Prediction, Management and ReliefAuthors: Rohit Saha, Mengyi Fang, Angeline Yasodhara, Kyryl Truskovskyi, Azin Asgarian, Daniel Homola, Raahil Shah, Frederik Dieleman, Jack Weatheritt, Thomas RogersComments: Accepted to CVPR 2022 Workshop on Multimodal Learning for Earth and EnvironmentSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [718] arXiv:2206.09243 [pdf, other]
-
Title: Structured Light with Redundancy CodesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [719] arXiv:2206.09244 [pdf, other]
-
Title: GAN2X: Non-Lambertian Inverse Rendering of Image GANsComments: Accepted to 3DV 2022. The video demo is available at the project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [720] arXiv:2206.09256 [pdf, other]
-
Title: Multistream Gaze Estimation with Anatomical Eye Region Isolation by Synthetic to Real Transfer LearningComments: 14 pages, 10 figures, 12 tables. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without noticeSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [721] arXiv:2206.09265 [pdf, ps, other]
-
Title: SAViR-T: Spatially Attentive Visual Reasoning with TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [722] arXiv:2206.09293 [pdf, other]
-
Title: Rethinking Bayesian Deep Learning Methods for Semi-Supervised Volumetric Medical Image SegmentationComments: To appear at CVPR 2022, and the supplementary material can be found at the official site. The source codes are at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [723] arXiv:2206.09325 [pdf, other]
-
Title: EATFormer: Improving Vision Transformer Inspired by Evolutionary AlgorithmSubjects: Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET)
- [724] arXiv:2206.09358 [pdf, other]
-
Title: What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text InputsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [725] arXiv:2206.09362 [src]
-
Title: Towards Generalizable Person Re-identification with a Bi-stream Generative ModelComments: There is a mistake of equation 1Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [726] arXiv:2206.09365 [pdf, other]
-
Title: Semi-supervised Change Detection of Small Water Bodies Using RGB and Multispectral Images in Peruvian RainforestsAuthors: Kangning Cui, Seda Camalan, Ruoning Li, Victor P. Pauca, Sarra Alqahtani, Robert J. Plemmons, Miles Silman, Evan N. Dethier, David Lutz, Raymond H. ChanComments: 8 pages, 5 figures. Accepted to Proceedings of IEEE WHISPERS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
- [727] arXiv:2206.09372 [pdf, other]
-
Title: mvHOTA: A multi-view higher order tracking accuracy metric to measure spatial and temporal associations in multi-point detectionAuthors: Lalith Sharan, Halvar Kelm, Gabriele Romano, Matthias Karck, Raffaele De Simone, Sandy EngelhardtComments: 16 pages, 9 figuresJournal-ref: Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization (2022) 1-9Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [728] arXiv:2206.09410 [pdf, other]
-
Title: JPEG Compression-Resistant Low-Mid Adversarial Perturbation against Unauthorized Face Recognition SystemSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [729] arXiv:2206.09414 [pdf, other]
-
Title: Terrain Classification using Transfer Learning on Hyperspectral Images: A Comparative studySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [730] arXiv:2206.09420 [pdf, other]
-
Title: Agricultural Plantation Classification using Transfer Learning Approach based on CNNAuthors: Uphar Singh, Tushar Musale, Ranjana Vyas, O.P.Vyas (Indian Institute of Information Technology, Allahabad, India)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [731] arXiv:2206.09474 [pdf, other]
-
Title: 3D Object Detection for Autonomous Driving: A Review and New OutlooksComments: A survey on 3D object detection for autonomous driving. Project page is at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [732] arXiv:2206.09479 [pdf, other]
-
Title: StudioGAN: A Taxonomy and Benchmark of GANs for Image SynthesisComments: 30 pages, Submitted to journalSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [733] arXiv:2206.09485 [pdf, other]
-
Title: Video frame interpolation for high dynamic range sequences captured with dual-exposure sensorsComments: 13 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [734] arXiv:2206.09500 [pdf, other]
-
Title: Unbiased Teacher v2: Semi-supervised Object Detection for Anchor-free and Anchor-based DetectorsComments: Project Page is at this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [735] arXiv:2206.09504 [pdf, other]
-
Title: A Parallel Implementation of Computing Mean Average PrecisionAuthors: Beinan WangSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [736] arXiv:2206.09509 [pdf]
-
Title: Hybrid Facial Expression Recognition (FER2013) Model for Real-Time Emotion Classification and PredictionComments: 8 Pages, 8 Figures, 5 TablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Robotics (cs.RO)
- [737] arXiv:2206.09541 [pdf, other]
-
Title: DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited AnnotationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [738] arXiv:2206.09548 [pdf, other]
-
Title: Variational Distillation for Multi-View LearningAuthors: Xudong Tian, Zhizhong Zhang, Cong Wang, Wensheng Zhang, Yanyun Qu, Lizhuang Ma, Zongze Wu, Yuan Xie, Dacheng TaoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [739] arXiv:2206.09552 [pdf, other]
-
Title: Dynamic Message Propagation Network for RGB-D Salient Object DetectionComments: 12 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [740] arXiv:2206.09553 [pdf, other]
-
Title: Capturing and Inferring Dense Full-Body Human-Scene ContactAuthors: Chun-Hao P. Huang, Hongwei Yi, Markus Höschle, Matvey Safroshkin, Tsvetelina Alexiadis, Senya Polikovsky, Daniel Scharstein, Michael J. BlackComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [741] arXiv:2206.09554 [pdf, other]
-
Title: Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic SegmentationComments: TMM2022, 11 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [742] arXiv:2206.09564 [pdf, other]
-
Title: A Novel Long-term Iterative Mining Scheme for Video Salient Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [743] arXiv:2206.09575 [pdf, other]
-
Title: C-SENN: Contrastive Self-Explaining Neural NetworkComments: 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [744] arXiv:2206.09581 [pdf]
-
Title: Explicit and implicit models in infrared and visible image fusionComments: 8 pages, 5 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [745] arXiv:2206.09585 [pdf, other]
-
Title: 5th Place Solution for YouTube-VOS Challenge 2022: Video Object SegmentationComments: 5th Place Solution for Video Object Segmentation in the 4th Large-scale Video Object Segmentation Challenge, CVPR 2022 WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [746] arXiv:2206.09592 [pdf, other]
-
Title: DALL-E for Detection: Language-driven Compositional Image Synthesis for Object DetectionComments: v3(same as v2) version, update structure (add foreground generation, stable diffusion), add more experimentsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [747] arXiv:2206.09596 [pdf, other]
-
Title: Efficient and Flexible Sublabel-Accurate Energy MinimizationComments: To be published at ICPR 2022, Copyright 2022 IEEESubjects: Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
- [748] arXiv:2206.09597 [pdf, other]
-
Title: Winning the CVPR'2022 AQTC Challenge: A Two-stage Function-centric ApproachSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [749] arXiv:2206.09604 [pdf, other]
-
Title: Distortion-Aware Network Pruning and Feature Reuse for Real-time Video SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [750] arXiv:2206.09664 [pdf, other]
-
Title: What Can be Seen is What You Get: Structure Aware Point Cloud AugmentationComments: Published in IEEE IV 2022Journal-ref: 33rd IEEE Intelligent Vehicles Symposium, Aachen, Germany, June 5th - June 9th 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [751] arXiv:2206.09667 [pdf, other]
-
Title: MSANet: Multi-Similarity and Attention Guidance for Boosting Few-Shot SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [752] arXiv:2206.09683 [pdf, other]
-
Title: Distribution Regularized Self-Supervised Learning for Domain Adaptation of Semantic SegmentationComments: Accepted for publication at Image and Vision ComputingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [753] arXiv:2206.09731 [pdf, other]
-
Title: Semantic Labeling of High Resolution Images Using EfficientUNets and TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [754] arXiv:2206.09736 [pdf, other]
-
Title: Geo-NI: Geometry-aware Neural Interpolation for Light Field RenderingComments: 13 pages, 8 figures, 4 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [755] arXiv:2206.09742 [pdf]
-
Title: Developing a Free and Open-source Automated Building Exterior Crack Inspection Software for Construction and Facility ManagersSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [756] arXiv:2206.09753 [pdf, other]
-
Title: Visualizing and Understanding Self-Supervised Vision LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [757] arXiv:2206.09756 [pdf, other]
-
Title: Time Gated Convolutional Neural Networks for Crop ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [758] arXiv:2206.09769 [pdf, other]
-
Title: Test-time image-to-image translation ensembling improves out-of-distribution generalization in histopathologyComments: Accepted at MICCAI2022 ConferenceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [759] arXiv:2206.09770 [pdf, other]
-
Title: Real-time Full-stack Traffic Scene Perception for Autonomous Driving with Roadside CamerasAuthors: Zhengxia Zou, Rusheng Zhang, Shengyin Shen, Gaurav Pandey, Punarjay Chakravarty, Armin Parchami, Henry X. LiuComments: This paper is accepted and presented in ICRA 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [760] arXiv:2206.09796 [pdf, other]
-
Title: Knowledge Distillation for Oriented Object Detection on Aerial ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [761] arXiv:2206.09806 [pdf, other]
-
Title: Self-Supervised Consistent Quantization for Fully Unsupervised Image RetrievalComments: 10 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [762] arXiv:2206.09842 [pdf, other]
-
Title: Practical Deepfake Detection: Vulnerabilities in Global ContextsComments: 6 pages, 6 figures, presented as a workshop paper at Responsible AI @ ICLR 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [763] arXiv:2206.09843 [pdf, other]
-
Title: Contextual Squeeze-and-Excitation for Efficient Few-Shot Image ClassificationAuthors: Massimiliano Patacchiola, John Bronskill, Aliaksandra Shysheya, Katja Hofmann, Sebastian Nowozin, Richard E. TurnerComments: Advances in Neural Information Processing Systems (NeurIPS 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [764] arXiv:2206.09852 [pdf, other]
-
Title: M&M Mix: A Multimodal Multiview Transformer EnsembleComments: Technical report for Epic-Kitchens challenge 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [765] arXiv:2206.09853 [pdf, other]
-
Title: DisCoVQA: Temporal Distortion-Content Transformers for Video Quality AssessmentSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [766] arXiv:2206.09885 [pdf, other]
-
Title: KOLOMVERSE: KRISO open large-scale image dataset for object detection in the maritime universeComments: 13 Pages, 12 figures, submitted to NeurIPS 2022 Datasets and Benchmarks Track (Under Review)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [767] arXiv:2206.09900 [pdf, other]
-
Title: Voxel-MAE: Masked Autoencoders for Self-supervised Pre-training Large-scale Point CloudsComments: 10 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [768] arXiv:2206.09907 [pdf, other]
-
Title: ORFD: A Dataset and Benchmark for Off-Road Freespace DetectionComments: Accepted by ICRA2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [769] arXiv:2206.09959 [pdf, other]
-
Title: Global Context Vision TransformersComments: 15 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [770] arXiv:2206.10033 [pdf, other]
-
Title: Test Time Transform Prediction for Open Set Histopathological Image RecognitionAuthors: Adrian Galdran, Katherine J. Hewitt, Narmin L. Ghaffari, Jakob N. Kather, Gustavo Carneiro, Miguel A. González BallesterComments: Accepted to MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [771] arXiv:2206.10041 [pdf, other]
-
Title: MPA: MultiPath++ Based Architecture for Motion PredictionAuthors: Stepan KonevComments: CVPR 2022, Workshop on Autonomous DrivingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [772] arXiv:2206.10059 [pdf, other]
-
Title: Bypass Network for Semantics Driven Image Paragraph CaptioningComments: Under consideration at Computer Vision and Image UnderstandingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [773] arXiv:2206.10066 [pdf, other]
-
Title: RendNet: Unified 2D/3D Recognizer With Latent Space RenderingComments: CVPR 2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [774] arXiv:2206.10075 [pdf, other]
-
Title: Counting Varying Density Crowds Through Density Guided Adaptive Selection CNN and Transformer EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [775] arXiv:2206.10080 [pdf, other]
-
Title: One-stage Action Detection TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [776] arXiv:2206.10082 [pdf, other]
-
Title: Optimally Controllable Perceptual Lossy CompressionComments: ICML 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [777] arXiv:2206.10090 [pdf, other]
-
Title: KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D CorrespondencesJournal-ref: Transaction on Circuits and Systems for Video Technology,2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [778] arXiv:2206.10092 [pdf, other]
-
Title: BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object DetectionAuthors: Yinhao Li, Zheng Ge, Guanyi Yu, Jinrong Yang, Zengran Wang, Yukang Shi, Jianjian Sun, Zeming LiComments: Accepted by AAAI2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [779] arXiv:2206.10095 [pdf, other]
-
Title: Pyramid Region-based Slot Attention Network for Temporal Action Proposal GenerationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [780] arXiv:2206.10096 [pdf]
-
Title: Transformers Improve Breast Cancer Diagnosis from Unregistered Multi-View MammogramsAuthors: Xuxin Chen, Ke Zhang, Neman Abdoli, Patrik W. Gilley, Ximin Wang, Hong Liu, Bin Zheng, Yuchen QiuSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
- [781] arXiv:2206.10098 [pdf, other]
-
Title: Reconstruct from BEV: A 3D Lane Detection Approach based on Geometry Structure PriorComments: Proceedings of the CVPR 2022 Workshop of Autonomous DrivingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [782] arXiv:2206.10107 [pdf, other]
-
Title: Sensitivity of Average Precision to Bounding Box PerturbationsAuthors: Ali BorjiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [783] arXiv:2206.10118 [pdf, other]
-
Title: HOPE: Hierarchical Spatial-temporal Network for Occupancy Flow PredictionAuthors: Yihan Hu, Wenxin Shao, Bo Jiang, Jiajie Chen, Siqi Chai, Zhening Yang, Jingyu Qian, Helong Zhou, Qiang LiuComments: 1st Ranking Solution for the Occupancy and Flow Prediction of the Waymo Open Dataset Challenges 2022 (this http URL)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [784] arXiv:2206.10129 [pdf, other]
-
Title: Automatic Concept Extraction for Concept Bottleneck-based Video ClassificationAuthors: Jeya Vikranth Jeyakumar, Luke Dickens, Luis Garcia, Yu-Hsi Cheng, Diego Ramirez Echavarria, Joseph Noor, Alessandra Russo, Lance Kaplan, Erik Blasch, Mani SrivastavaComments: 10 pages, Appendix: 2 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [785] arXiv:2206.10131 [pdf, other]
-
Title: An Integrated Representation & Compression Scheme Based on Convolutional Autoencoders with 4D DCT Perceptual Encoding for High Dynamic Range Light FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [786] arXiv:2206.10137 [pdf, other]
-
Title: Few-Max: Few-Shot Domain Adaptation for Unsupervised Contrastive Representation LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [787] arXiv:2206.10145 [pdf, other]
- [788] arXiv:2206.10146 [pdf, other]
-
Title: KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute ParsingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [789] arXiv:2206.10155 [pdf, other]
-
Title: Review Neural Networks about Image Transformation Based on IGC Learning Framework with Annotated InformationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [790] arXiv:2206.10157 [pdf, other]
-
Title: Probing Visual-Audio Representation for Video Highlight Detection via Hard-Pairs Guided Contrastive LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [791] arXiv:2206.10177 [pdf, other]
-
Title: TCJA-SNN: Temporal-Channel Joint Attention for Spiking Neural NetworksAuthors: Rui-Jie Zhu, Qihang Zhao, Tianjing Zhang, Haoyu Deng, Yule Duan, Malu Zhang, Liang-Jian DengSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [792] arXiv:2206.10186 [pdf, other]
-
Title: Improving Localization for Semi-Supervised Object DetectionJournal-ref: International Conference on Image Analysis and Processing. Springer, Cham, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [793] arXiv:2206.10192 [pdf, other]
-
Title: LDD: A Dataset for Grape Diseases Object Detection and Instance SegmentationJournal-ref: International Conference on Image Analysis and Processing. Springer, Cham, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [794] arXiv:2206.10207 [pdf, other]
-
Title: SemMAE: Semantic-Guided Masking for Learning Masked AutoencodersComments: Accepted by NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [795] arXiv:2206.10213 [pdf, other]
-
Title: Rethinking Unsupervised Neural Superpixel SegmentationComments: ICIP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [796] arXiv:2206.10225 [pdf, other]
-
Title: Broken News: Making Newspapers Accessible to Print-ImpairedJournal-ref: Extended Abstract at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [797] arXiv:2206.10241 [pdf, other]
-
Title: Deep Active Latent Surfaces for Medical GeometriesComments: 14 pages, 9 figures, submitted for reviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [798] arXiv:2206.10253 [pdf, other]
-
Title: Document Navigability: A Need for Print-ImpairedComments: Published at Accessibility, Vision, and Autonomy Meet, CVPR 2022 WorkshopJournal-ref: Extended Abstract for Poster Session at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [799] arXiv:2206.10254 [pdf, other]
-
Title: Towards Optimizing OCR for AccessibilityJournal-ref: Extended Abstract for Poster Session at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [800] arXiv:2206.10263 [pdf, other]
-
Title: Object Structural Points Representation for Graph-based Semantic Monocular Localization and MappingComments: submitted to IROS 2015 (rejected)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [801] arXiv:2206.10324 [pdf, other]
- [802] arXiv:2206.10329 [pdf, other]
-
Title: SVG Vector Font Generation for Chinese Characters with TransformerComments: Accepted to ICIP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [803] arXiv:2206.10360 [pdf, other]
-
Title: Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal LossComments: 5 pages, 3 figures; Accepted to ICIP2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [804] arXiv:2206.10375 [pdf, other]
-
Title: MEStereo-Du2CNN: A Novel Dual Channel CNN for Learning Robust Depth Estimates from Multi-exposure Stereo Images for HDR 3D ApplicationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [805] arXiv:2206.10411 [pdf, other]
-
Title: Audio-video fusion strategies for active speaker detection in meetingsAuthors: Lionel Pibre, Francisco Madrigal, Cyrille Equoy, Frédéric Lerasle, Thomas Pellegrini, Julien Pinquier, Isabelle FerranéSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [806] arXiv:2206.10436 [pdf, other]
-
Title: Transformer-Based Multi-modal Proposal and Re-Rank for Wikipedia Image-Caption MatchingComments: Accepted for publication at the Wiki-M3L workshop, co-located with ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [807] arXiv:2206.10457 [pdf, other]
-
Title: Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh RecoverySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [808] arXiv:2206.10465 [pdf, other]
-
Title: An Overview of Privacy-enhancing Technologies in Biometric RecognitionComments: 12 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [809] arXiv:2206.10491 [pdf, other]
-
Title: Bi-Calibration Networks for Weakly-Supervised Video Representation LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [810] arXiv:2206.10520 [pdf, other]
-
Title: SFace: Privacy-friendly and Accurate Face Recognition using Synthetic DataSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [811] arXiv:2206.10526 [pdf, other]
-
Title: QuantFace: Towards Lightweight Face Recognition by Synthetic Data Low-bit QuantizationComments: Accepted ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [812] arXiv:2206.10531 [pdf, other]
-
Title: Neural Transformers for Intraductal Papillary Mucosal Neoplasms (IPMN) Classification in MRI imagesAuthors: Federica Proietto Salanitri, Giovanni Bellitto, Simone Palazzo, Ismail Irmakci, Michael B. Wallace, Candice W. Bolan, Megan Engels, Sanne Hoogenboom, Marco Aldinucci, Ulas Bagci, Daniela Giordano, Concetto SpampinatoSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [813] arXiv:2206.10535 [pdf, other]
-
Title: EpiGRAF: Rethinking training of 3D GANsComments: NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [814] arXiv:2206.10536 [pdf, other]
-
Title: HealNet -- Self-Supervised Acute Wound Heal-Stage ClassificationAuthors: Héctor Carrión, Mohammad Jafari, Hsin-Ya Yang, Roslyn Rivkah Isseroff, Marco Rolandi, Marcella Gomez, Narges NorouziSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [815] arXiv:2206.10552 [pdf, other]
-
Title: Vicinity Vision TransformerAuthors: Weixuan Sun, Zhen Qin, Hui Deng, Jianyuan Wang, Yi Zhang, Kaihao Zhang, Nick Barnes, Stan Birchfield, Lingpeng Kong, Yiran ZhongSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [816] arXiv:2206.10555 [pdf, other]
-
Title: Scaling up Kernels in 3D CNNsComments: Code and models will be available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [817] arXiv:2206.10562 [pdf, other]
-
Title: Semantics-Depth-Symbiosis: Deeply Coupled Semi-Supervised Learning of Semantics and DepthSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [818] arXiv:2206.10571 [pdf, other]
-
Title: Toward Unpaired Multi-modal Medical Image Segmentation via Learning Structured Semantic ConsistencySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [819] arXiv:2206.10573 [pdf]
-
Title: H&E-based Computational Biomarker Enables Universal EGFR Screening for Lung AdenocarcinomaAuthors: Gabriele Campanella, David Ho, Ida Häggström, Anton S Becker, Jason Chang, Chad Vanderbilt, Thomas J FuchsSubjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [820] arXiv:2206.10587 [pdf]
-
Title: Guiding Visual Attention in Deep Convolutional Neural Networks Based on Human Eye MovementsComments: 28 pages, 6 figures, 3 supplementary figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [821] arXiv:2206.10589 [pdf, other]
-
Title: EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision ApplicationsAuthors: Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman Khan, Syed Waqas Zamir, Rao Muhammad Anwer, Fahad Shahbaz KhanComments: Accepted at ECCVW 2022 (Oral, CADL: Computational Aspects of Deep Learning)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [822] arXiv:2206.10590 [pdf, other]
-
Title: Temporally Consistent Semantic Video EditingComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [823] arXiv:2206.10665 [pdf, other]
-
Title: BOSS: A Benchmark for Human Belief Prediction in Object-context ScenariosComments: 9 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [824] arXiv:2206.10673 [pdf, ps, other]
-
Title: Natural Backdoor DatasetsAuthors: Emily Wenger, Roma Bhattacharjee, Arjun Nitin Bhagoji, Josephine Passananti, Emilio Andere, Haitao Zheng, Ben Y. ZhaoComments: 18 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [825] arXiv:2206.10690 [pdf, other]
-
Title: Learning Continuous Rotation Canonicalization with Radial Beam SamplingSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [826] arXiv:2206.10692 [pdf, other]
-
Title: Multi-level Domain Adaptation for Lane DetectionComments: Proceedings of the CVPR 2022 Workshop of Autonomous DrivingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [827] arXiv:2206.10698 [pdf, other]
-
Title: TiCo: Transformation Invariance and Covariance Contrast for Self-Supervised Visual Representation LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [828] arXiv:2206.10711 [pdf, other]
-
Title: Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive LearningComments: Accepted to IEEE Transactions on Intelligent Transportation Systems (T-ITS). Extended version of arXiv:2103.00868. The project is at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
- [829] arXiv:2206.10737 [pdf, other]
-
Title: Deep Metric Color Embeddings for Splicing Localization in Severely Degraded ImagesComments: 14 pages, 13 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [830] arXiv:2206.10779 [pdf, other]
-
Title: Not Just Streaks: Towards Ground Truth for Single Image DerainingAuthors: Yunhao Ba, Howard Zhang, Ethan Yang, Akira Suzuki, Arnold Pfahnl, Chethan Chinder Chandrappa, Celso de Melo, Suya You, Stefano Soatto, Alex Wong, Achuta KadambiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [831] arXiv:2206.10789 [pdf, other]
-
Title: Scaling Autoregressive Models for Content-Rich Text-to-Image GenerationAuthors: Jiahui Yu, Yuanzhong Xu, Jing Yu Koh, Thang Luong, Gunjan Baid, Zirui Wang, Vijay Vasudevan, Alexander Ku, Yinfei Yang, Burcu Karagol Ayan, Ben Hutchinson, Wei Han, Zarana Parekh, Xin Li, Han Zhang, Jason Baldridge, Yonghui WuComments: PreprintSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [832] arXiv:2206.10809 [pdf, other]
-
Title: SSMI: How to Make Objects of Interest Disappear without Accessing Object Detectors?Comments: 6 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [833] arXiv:2206.10821 [pdf, other]
-
Title: Coupling Visual Semantics of Artificial Neural Networks and Human Brain Function via Synchronized ActivationsAuthors: Lin Zhao, Haixing Dai, Zihao Wu, Zhenxiang Xiao, Lu Zhang, David Weizhong Liu, Xintao Hu, Xi Jiang, Sheng Li, Dajiang Zhu, Tianming LiuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [834] arXiv:2206.10830 [pdf, other]
-
Title: A Feature Memory Rearrangement Network for Visual Inspection of Textured Surface Defects Toward Edge Intelligent ManufacturingComments: Revision to IEEE transactions on automation science and engineeringSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [835] arXiv:2206.10831 [pdf, other]
-
Title: MultiEarth 2022 Deforestation Challenge -- ForestGumpComments: CVPR 2022, MultiEarth 2022, Deforestation Estimation ChallengeSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [836] arXiv:2206.10845 [pdf, other]
-
Title: Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance SegmentationAuthors: Ming Li, Jie Wu, Jinhang Cai, Jie Qin, Yuxi Ren, Xuefeng Xiao, Min Zheng, Rui Wang, Xin PanComments: The solution of 1st Place in AVA Accessibility Vision and Autonomy Challenge on CVPR 2022 workshop. Website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [837] arXiv:2206.10861 [pdf, other]
-
Title: UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022Comments: 5 pages, 3 figures; technical report for AVA Challenge (see this https URL) at the International Challenge on Activity Recognition (ActivityNet), CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [838] arXiv:2206.10869 [pdf, other]
-
Title: NVIDIA-UNIBZ Submission for EPIC-KITCHENS-100 Action Anticipation Challenge 2022Authors: Tsung-Ming Tai, Oswald Lanz, Giuseppe Fiameni, Yi-Kwan Wong, Sze-Sen Poon, Cheng-Kuang Lee, Ka-Chun Cheung, Simon SeeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [839] arXiv:2206.10878 [pdf, other]
-
Title: Feature Re-calibration based Multiple Instance Learning for Whole Slide Image ClassificationAuthors: Philip Chikontwe, Soo Jeong Nam, Heounjeong Go, Meejeong Kim, Hyun Jung Sung, Sang Hyun ParkComments: MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [840] arXiv:2206.10879 [pdf, other]
-
Title: Symmetric Network with Spatial Relationship Modeling for Natural Language-based Vehicle RetrievalComments: 8 pages, 3 figures, publised to CVPRWJournal-ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022: 3226-3233Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [841] arXiv:2206.10885 [pdf, other]
-
Title: KiloNeuS: A Versatile Neural Implicit Surface Representation for Real-Time RenderingComments: 9 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [842] arXiv:2206.10886 [pdf, other]
-
Title: Optical Flow Regularization of Implicit Neural Representations for Video Frame InterpolationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [843] arXiv:2206.10892 [pdf, other]
-
Title: I^2R-Net: Intra- and Inter-Human Relation Network for Multi-Person Pose EstimationAuthors: Yiwei Ding, Wenjin Deng, Yinglin Zheng, Pengfei Liu, Meihong Wang, Xuan Cheng, Jianmin Bao, Dong Chen, Ming ZengComments: Accepected by IJCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [844] arXiv:2206.10902 [pdf, other]
-
Title: S2TNet: Spatio-Temporal Transformer Networks for Trajectory Prediction in Autonomous DrivingComments: Accepted by ACML2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [845] arXiv:2206.10903 [pdf, ps, other]
-
Title: UniUD-FBK-UB-UniBZ Submission to the EPIC-Kitchens-100 Multi-Instance Retrieval Challenge 2022Comments: Ranked joint 1st place in the Multi-Instance Action Retrieval Challenge organized at EPIC@CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [846] arXiv:2206.10910 [pdf, other]
-
Title: SpA-Former: Transformer image shadow detection and removal via spatial attentionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [847] arXiv:2206.10915 [pdf, other]
-
Title: Understanding the effect of sparsity on neural networks robustnessSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [848] arXiv:2206.10965 [pdf, other]
-
Title: Polar Parametrization for Vision-based Surround-View 3D DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [849] arXiv:2206.10969 [pdf, other]
-
Title: Single Morphing Attack Detection using Siamese Network and Few-shot LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [850] arXiv:2206.10988 [pdf, other]
-
Title: AdvSmo: Black-box Adversarial Attack by Smoothing Linear Structure of TextureComments: 6 pages,3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [851] arXiv:2206.10989 [pdf, other]
-
Title: Identity Documents Authentication based on Forgery Detection of Guilloche PatternSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [852] arXiv:2206.10996 [pdf, other]
-
Title: Prototypical Contrastive Language Image PretrainingComments: PreprintSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [853] arXiv:2206.11011 [pdf, other]
-
Title: Weakly-Supervised Temporal Action Localization by Progressive Complementary LearningAuthors: Jia-Run Du, Jia-Chang Feng, Kun-Yu Lin, Fa-Ting Hong, Xiao-Ming Wu, Zhongang Qi, Ying Shan, Wei-Shi ZhengSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [854] arXiv:2206.11053 [pdf, other]
-
Title: Surgical-VQA: Visual Question Answering in Surgical Scenes using TransformerComments: Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO); Image and Video Processing (eess.IV)
- [855] arXiv:2206.11080 [pdf, other]
-
Title: Motion Gait: Gait Recognition via Motion ExcitationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [856] arXiv:2206.11095 [pdf, other]
-
Title: A High Resolution Multi-exposure Stereoscopic Image & Video Database of Natural ScenesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [857] arXiv:2206.11115 [pdf, other]
-
Title: ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition CanvasAuthors: Prathmesh Madhu, Tilman Marquart, Ronak Kosti, Dirk Suckow, Peter Bell, Andreas Maier, Vincent ChristleinSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [858] arXiv:2206.11134 [pdf, other]
-
Title: Open Vocabulary Object Detection with Proposal Mining and Prediction EqualizationAuthors: Peixian Chen, Kekai Sheng, Mengdan Zhang, Mingbao Lin, Yunhang Shen, Shaohui Lin, Bo Ren, Ke LiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [859] arXiv:2206.11180 [pdf, other]
-
Title: Optimal transport meets noisy label robust loss and MixUp regularization for domain adaptationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [860] arXiv:2206.11203 [pdf, other]
-
Title: Facke: a Survey on Generative Models for Face SwappingSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [861] arXiv:2206.11212 [pdf, other]
-
Title: VisFIS: Visual Feature Importance Supervision with Right-for-the-Right-Reason ObjectivesComments: NeurIPS 2022 (first two authors contributed equally)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [862] arXiv:2206.11215 [pdf, other]
-
Title: Certifiable 3D Object Pose Estimation: Foundations, Learning Models, and Self-TrainingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [863] arXiv:2206.11250 [pdf, other]
-
Title: Depth-aware Glass Surface Detection with Cross-modal Context MiningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [864] arXiv:2206.11253 [pdf, other]
-
Title: Towards Robust Blind Face Restoration with Codebook Lookup TransformerComments: Accepted by NeurIPS 2022. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [865] arXiv:2206.11352 [pdf, ps, other]
-
Title: Doubly Reparameterized Importance Weighted Structure Learning for Scene Graph GenerationComments: arXiv admin note: substantial text overlap with arXiv:2205.07017Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [866] arXiv:2206.11358 [pdf, other]
-
Title: Monocular Spherical Depth Estimation with Explicitly Connected Weak Layout CuesComments: Project page at this https URLJournal-ref: ISPRS Journal of Photogrammetry and Remote Sensing, Volume 183, January 2022, Pages 269-285Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [867] arXiv:2206.11404 [pdf, other]
-
Title: The ArtBench Dataset: Benchmarking Generative Models with ArtworksComments: The first two authors contributed equally to this work. The code and data are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [868] arXiv:2206.11428 [pdf, other]
-
Title: LidarMultiNet: Unifying LiDAR Semantic Segmentation, 3D Object Detection, and Panoptic Segmentation in a Single Multi-task NetworkComments: Official 1st Place Solution for the Waymo Open Dataset Challenges 2022 - 3D Semantic Segmentation. Official leaderboard: this https URL CVPR 2022 Workshop on Autonomous Driving: this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [869] arXiv:2206.11443 [pdf, other]
-
Title: Image-based Stability QuantificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [870] arXiv:2206.11459 [pdf, other]
-
Title: Explore Spatio-temporal Aggregation for Insubstantial Object Detection: Benchmark Dataset and BaselineSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [871] arXiv:2206.11462 [pdf, ps, other]
-
Title: ICME 2022 Few-shot LOGO detection top 9 solutionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [872] arXiv:2206.11473 [pdf, other]
-
Title: Complementary datasets to COCO for object detectionAuthors: Ali BorjiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [873] arXiv:2206.11474 [pdf, other]
-
Title: Entropy-driven Sampling and Training Scheme for Conditional Diffusion GenerationComments: 24 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [874] arXiv:2206.11476 [pdf]
-
Title: Dynamic Scene Deblurring Base on Continuous Cross-Layer Attention TransmissionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [875] arXiv:2206.11493 [pdf, other]
-
Title: Learning to Refactor Action and Co-occurrence Features for Temporal Action LocalizationComments: Accepted by CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [876] arXiv:2206.11499 [pdf, other]
-
Title: Parallel Structure from Motion for UAV Images via Weighted Connected Dominating SetComments: 14 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [877] arXiv:2206.11502 [pdf]
-
Title: A Review of Published Machine Learning Natural Language Processing Applications for Protocolling Radiology ImagingAuthors: Nihal Raju (5), Michael Woodburn (1 and 5), Stefan Kachel (2 and 3), Jack O'Shaughnessy (5), Laurence Sorace (5), Natalie Yang (2), Ruth P Lim (2 and 4) ((1) Harvard University, Extension School, Cambridge, MA, USA, (2) Department of Radiology, The University of Melbourne, Parkville, (3) Department of Radiology, Columbia University in the City of New York, (4) Department of Surgery, Austin, The University of Melbourne, (5) Austin Hospital, Austin Health, Melbourne, Australia)Comments: 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [878] arXiv:2206.11520 [pdf, other]
-
Title: ICOS Protein Expression Segmentation: Can Transformer Networks Give Better Results?Comments: Accepted MIUA conference (Abstract short paper)Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [879] arXiv:2206.11541 [pdf, other]
-
Title: A Neuromorphic Vision-Based Measurement for Robust Relative Localization in Future Space Exploration MissionsAuthors: Mohammed Salah, Mohammed Chehadah, Muhammed Humais, Mohammed Wahbah, Abdulla Ayyad, Rana Azzam, Lakmal Seneviratne, Yahya ZweiriSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [880] arXiv:2206.11589 [pdf, other]
-
Title: Learning Towards the Largest MarginsComments: ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [881] arXiv:2206.11610 [pdf, other]
-
Title: 1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)Comments: Winner of the 2nd RxR-Habitat Competition @ CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [882] arXiv:2206.11629 [pdf, other]
-
Title: Global Sensing and Measurements Reuse for Image Compressed SensingSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [883] arXiv:2206.11653 [pdf, other]
-
Title: Learning To Generate Scene Graph from Head to TailSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [884] arXiv:2206.11657 [pdf, other]
-
Title: Warped Convolutional Networks: Bridge Homography to sl(3) algebra by Group ConvolutionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [885] arXiv:2206.11678 [pdf, other]
-
Title: BlazePose GHUM Holistic: Real-time 3D Human Landmarks and Pose EstimationAuthors: Ivan Grishchenko, Valentin Bazarevsky, Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Zanfir, Richard Yee, Karthik Raveendran, Matsvei Zhdanovich, Matthias Grundmann, Cristian SminchisescuComments: 4 pages, 4 figures; CVPR Workshop on Computer Vision for Augmented and Virtual Reality, New Orleans, LA, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [886] arXiv:2206.11695 [pdf, other]
-
Title: NTIRE 2022 Challenge on Perceptual Image Quality AssessmentComments: This report has been published in CVPR 2022 NTIRE workshop. arXiv admin note: text overlap with arXiv:2105.03072Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [887] arXiv:2206.11723 [pdf, other]
-
Title: Self-Supervised Training with Autoencoders for Visual Anomaly DetectionAuthors: Alexander BauerSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [888] arXiv:2206.11736 [pdf, other]
-
Title: NovelCraft: A Dataset for Novelty Detection and Discovery in Open WorldsAuthors: Patrick Feeney (1), Sarah Schneider (1 and 2), Panagiotis Lymperopoulos (1), Liping Liu (1), Matthias Scheutz (1), Michael C. Hughes (1) ((1) Dept. of Computer Science, Tufts University, (2) Center for Vision, Automation and Control, Austrian Institute of Technology)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [889] arXiv:2206.11739 [pdf, other]
-
Title: Evidence fusion with contextual discounting for multi-modality medical image segmentationComments: MICCAI2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [890] arXiv:2206.11752 [pdf, other]
-
Title: CLAMP: Prompt-based Contrastive Learning for Connecting Language and Animal PoseSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [891] arXiv:2206.11759 [pdf, other]
-
Title: What makes you, you? Analyzing Recognition by Swapping Face PartsComments: Accepted for publication at 26TH International Conference on Pattern Recognition (ICPR), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [892] arXiv:2206.11768 [pdf, other]
-
Title: FitGAN: Fit- and Shape-Realistic Generative Adversarial Networks for FashionComments: 26th International Conference on Pattern Recognition (ICPR) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [893] arXiv:2206.11804 [pdf, other]
-
Title: Rethinking Surgical Instrument Segmentation: A Background Image Can Be All You NeedComments: 10 pages, MICCAI2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [894] arXiv:2206.11808 [pdf, other]
-
Title: Unseen Object 6D Pose Estimation: A Benchmark and BaselinesSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [895] arXiv:2206.11825 [pdf, other]
-
Title: YOLOSA: Object detection based on 2D local feature superimposed self-attentionComments: This paper is under consideration at Pattern Recognition LettersSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [896] arXiv:2206.11826 [pdf, other]
- [897] arXiv:2206.11892 [pdf, other]
-
Title: DDPM-CD: Remote Sensing Change Detection using Denoising Diffusion Probabilistic ModelsComments: Code available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [898] arXiv:2206.11894 [pdf, other]
-
Title: MaskViT: Masked Visual Pre-Training for Video PredictionComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [899] arXiv:2206.11895 [pdf, other]
-
Title: Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D SpaceComments: NeurIPS 2022. Our code is at this https URL Our project page is at this https URL v3, v4 for minor updates on figures and visualizationsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [900] arXiv:2206.11896 [pdf, other]
-
Title: EventNeRF: Neural Radiance Fields from a Single Colour Event CameraComments: 18 pages, 18 figures, 3 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [901] arXiv:2206.11920 [pdf, other]
-
Title: Agriculture-Vision Challenge 2022 -- The Runner-Up Solution for Agricultural Pattern Recognition via Transformer-based ModelsComments: CVPR 2022, Agriculture-Vision Challenge, Remote SensingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [902] arXiv:2206.11927 [pdf, other]
-
Title: Towards Galaxy Foundation Models with Hybrid Contrastive LearningComments: Accepted at the ICML 2022 Workshop on Machine Learning for Astrophysics. Data: www.github.com/mwalmsley/pytorch-galaxy-datasets. Please reach out to share your labelled data - all contributions will be credited in future workSubjects: Computer Vision and Pattern Recognition (cs.CV); Astrophysics of Galaxies (astro-ph.GA)
- [903] arXiv:2206.11952 [pdf, other]
-
Title: UNeRF: Time and Memory Conscious U-Shaped Network for Training Neural Radiance FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [904] arXiv:2206.12035 [pdf, other]
-
Title: The Second Place Solution for The 4th Large-scale Video Object Segmentation Challenge--Track 3: Referring Video Object SegmentationComments: 4 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [905] arXiv:2206.12043 [pdf, other]
-
Title: Protecting President Zelenskyy against Deep FakesSubjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [906] arXiv:2206.12046 [pdf, other]
-
Title: Bilateral Network with Channel Splitting Network and Transformer for Thermal Image Super-ResolutionComments: The second place solution for CVPR2022 PBVS-TISR challengeSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [907] arXiv:2206.12055 [pdf, other]
-
Title: SDF-StyleGAN: Implicit SDF-Based StyleGAN for 3D Shape GenerationComments: Accepted to Computer Graphics Forum (SGP), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [908] arXiv:2206.12063 [src]
-
Title: Mutual Information-guided Knowledge Transfer for Novel Class DiscoveryComments: The derivation of Mutual Information in the manuscript is wrongSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [909] arXiv:2206.12071 [pdf, other]
-
Title: Contrastive Learning of Features between Images and LiDARComments: accepted in CASE2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [910] arXiv:2206.12073 [pdf, other]
-
Title: MaskRange: A Mask-classification Model for Range-view based LiDAR SegmentationComments: Under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [911] arXiv:2206.12099 [pdf]
-
Title: A novel approach for glaucoma classification by wavelet neural networks using graph-based, statisitcal features of qualitatively improved imagesComments: 25 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [912] arXiv:2206.12117 [pdf, other]
-
Title: Self Supervised Learning for Few Shot Hyperspectral Image ClassificationComments: Accepted in IGARSS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [913] arXiv:2206.12123 [pdf]
-
Title: Some theoretical results on discrete contour treesAuthors: Yuqing SongComments: 5 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
- [914] arXiv:2206.12126 [pdf, other]
-
Title: Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [915] arXiv:2206.12128 [pdf, other]
-
Title: Excavating RoI Attention for Underwater Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [916] arXiv:2206.12216 [pdf, other]
-
Title: Optimized Views Photogrammetry: Precision Analysis and A Large-scale Case Study in QingdaoComments: 16 pages, 24 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [917] arXiv:2206.12351 [pdf, other]
-
Title: Megapixel Image Generation with Step-Unrolled Denoising AutoencodersComments: 17 pages + 9 appendix pages. 20 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [918] arXiv:2206.12356 [pdf, other]
-
Title: HM3D-ABO: A Photo-realistic Dataset for Object-centric Multi-view 3D ReconstructionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [919] arXiv:2206.12370 [pdf, other]
-
Title: Online Distillation with Mixed Sample AugmentationComments: 5 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [920] arXiv:2206.12372 [pdf, other]
-
Title: QReg: On Regularization Effects of Quantization