Computer Vision and Pattern Recognition

Authors and titles for cs.CV in Jun 2022

Total of 1594 entries

[1] arXiv:2206.00048 [pdf, other]: Title: PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs

Authors: James Oldfield, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

Comments: Accepted at ICLR 2023. Code available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2] arXiv:2206.00069 [pdf, other]: Title: Comparing feature fusion strategies for Deep Learning-based kidney stone identification

Authors: Elias Villalvazo-Avila, Francisco Lopez-Tiro, Daniel Flores-Araiza, Gilberto Ochoa-Ruiz, Jonathan El-Beze, Jacques Hubert, Christian Daul

Comments: 4 pages, 3 figures, XXVIIIème Colloque Francophone de Traitement du Signal et des Images

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3] arXiv:2206.00092 [pdf, other]: Title: FHIST: A Benchmark for Few-shot Classification of Histological Images

Authors: Fereshteh Shakeri, Malik Boudiaf, Sina Mohammadi, Ivaxi Sheth, Mohammad Havaei, Ismail Ben Ayed, Samira Ebrahimi Kahou

Comments: Code available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[4] arXiv:2206.00100 [pdf, other]: Title: VALHALLA: Visual Hallucination for Machine Translation

Authors: Yi Li, Rameswar Panda, Yoon Kim, Chun-Fu Chen, Rogerio Feris, David Cox, Nuno Vasconcelos

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[5] arXiv:2206.00123 [pdf, other]: Title: Glo-In-One: Holistic Glomerular Detection, Segmentation, and Lesion Characterization with Large-scale Web Image Mining

Authors: Tianyuan Yao, Yuzhe Lu, Jun Long, Aadarsh Jha, Zheyu Zhu, Zuhayr Asad, Haichun Yang, Agnes B. Fogo, Yuankai Huo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[6] arXiv:2206.00148 [pdf, other]: Title: Hands-Up: Leveraging Synthetic Data for Hands-On-Wheel Detection

Authors: Paul Yudkin, Eli Friedman, Orly Zvitia, Gil Elbaz

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[7] arXiv:2206.00162 [pdf, other]: Title: PAGER: Progressive Attribute-Guided Extendable Robust Image Generation

Authors: Zohreh Azizi, C.-C. Jay Kuo

Comments: 19 pages, 12 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[8] arXiv:2206.00171 [pdf, other]: Title: Learning Sequential Contexts using Transformer for 3D Hand Pose Estimation

Authors: Leyla Khaleghi, Joshua Marshall, Ali Etemad

Comments: Accepted to ICPR'22

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[9] arXiv:2206.00181 [pdf, other]: Title: Labeling Where Adapting Fails: Cross-Domain Semantic Segmentation with Point Supervision via Active Selection

Authors: Fei Pan, Francois Rameau, Junsik Kim, In So Kweon

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[10] arXiv:2206.00182 [pdf, other]: Title: Differentiable Soft-Masked Attention

Authors: Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe

Comments: arXiv admin note: text overlap with arXiv:2112.09131

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[11] arXiv:2206.00205 [pdf, other]: Title: CAFA: Class-Aware Feature Alignment for Test-Time Adaptation

Authors: Sanghun Jung, Jungsoo Lee, Nanhee Kim, Amirreza Shaban, Byron Boots, Jaegul Choo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[12] arXiv:2206.00214 [pdf, other]: Title: LiDAR-MIMO: Efficient Uncertainty Estimation for LiDAR-based 3D Object Detection

Authors: Matthew Pitropov, Chengjie Huang, Vahdat Abdelzad, Krzysztof Czarnecki, Steven Waslander

Comments: 8 pages, 4 figures and 5 tables. Accepted in IEEE IV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[13] arXiv:2206.00222 [pdf, other]: Title: Cross-domain Detection Transformer based on Spatial-aware and Semantic-aware Token Alignment

Authors: Jinhong Deng, Xiaoyue Zhang, Wen Li, Lixin Duan

Comments: Technical report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[14] arXiv:2206.00227 [pdf, other]: Title: Rethinking the Augmentation Module in Contrastive Learning: Learning Hierarchical Augmentation Invariance with Expanded Views

Authors: Junbo Zhang, Kaisheng Ma

Comments: Accepted to CVPR 2022

Journal-ref: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[15] arXiv:2206.00244 [pdf, other]: Title: Fair Comparison between Efficient Attentions

Authors: Jiuk Hong, Chaehyeon Lee, Soyoun Bang, Heechul Jung

Comments: 4 pages abstract

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[16] arXiv:2206.00252 [pdf, other]: Title: Interpretable Deep Learning Classifier by Detection of Prototypical Parts on Kidney Stones Images

Authors: Daniel Flores-Araiza, Francisco Lopez-Tiro, Elias Villalvazo-Avila, Jonathan El-Beze, Jacques Hubert, Gilberto Ochoa-Ruiz, Christian Daul

Comments: Extended abstract accepted at LatinX in Computer Vision Research Workshop, at CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[17] arXiv:2206.00272 [pdf, other]: Title: Vision GNN: An Image is Worth Graph of Nodes

Authors: Kai Han, Yunhe Wang, Jianyuan Guo, Yehui Tang, Enhua Wu

Comments: NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[18] arXiv:2206.00274 [pdf, other]: Title: Point-Teaching: Weakly Semi-Supervised Object Detection with Point Annotations

Authors: Yongtao Ge, Qiang Zhou, Xinlong Wang, Zhibin Wang, Hao Li, Chunhua Shen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[19] arXiv:2206.00280 [pdf, other]: Title: Automatic Bounding Box Annotation with Small Training Data Sets for Industrial Manufacturing

Authors: Manuela Geiß, Raphael Wagner, Martin Baresch, Josef Steiner, Michael Zwick

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[20] arXiv:2206.00282 [pdf, other]: Title: Needle In A Haystack, Fast: Benchmarking Image Perceptual Similarity Metrics At Scale

Authors: Cyril Vallez, Andrei Kucharavy, Ljiljana Dolamic

Comments: 26 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
[21] arXiv:2206.00291 [pdf, other]: Title: Efficient Multi-Purpose Cross-Attention Based Image Alignment Block for Edge Devices

Authors: Bahri Batuhan Bilecen, Alparslan Fisne, Mustafa Ayazoglu

Comments: Accepted into Embedded Vision Workshop 2022 of CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[22] arXiv:2206.00309 [pdf, other]: Title: Label-Efficient Online Continual Object Detection in Streaming Video

Authors: Jay Zhangjie Wu, David Junhao Zhang, Wynne Hsu, Mengmi Zhang, Mike Zheng Shou

Comments: ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[23] arXiv:2206.00311 [pdf, other]: Title: MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining

Authors: Pengyuan Lyu, Chengquan Zhang, Shanshan Liu, Meina Qiao, Yangliu Xu, Liang Wu, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[24] arXiv:2206.00343 [pdf, other]: Title: Towards view-invariant vehicle speed detection from driving simulator images

Authors: Antonio Hernández Martínez, David Fernandez Llorca, Iván García Daza

Comments: 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[25] arXiv:2206.00344 [pdf, other]: Title: Self-Supervised Learning as a Means To Reduce the Need for Labeled Data in Medical Image Analysis

Authors: Marin Benčević, Marija Habijan, Irena Galić, Aleksandra Pizurica

Comments: Accepted by 30th European Signal Processing Conference, EUSIPCO 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[26] arXiv:2206.00359 [pdf, other]: Title: DeepCluE: Enhanced Image Clustering via Multi-layer Ensembles in Deep Neural Networks

Authors: Dong Huang, Ding-Hua Chen, Xiangji Chen, Chang-Dong Wang, Jian-Huang Lai

Comments: To appear in IEEE Transactions on Emerging Topics in Computational Intelligence

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[27] arXiv:2206.00364 [pdf, other]: Title: Elucidating the Design Space of Diffusion-Based Generative Models

Authors: Tero Karras, Miika Aittala, Timo Aila, Samuli Laine

Comments: NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[28] arXiv:2206.00384 [pdf, other]: Title: Generalized Supervised Contrastive Learning

Authors: Jaewon Kim, Hyukjong Lee, Jooyoung Chang, Sang Min Park

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[29] arXiv:2206.00386 [pdf, other]: Title: DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder

Authors: Jie Shi, Chenfei Wu, Jian Liang, Xiang Liu, Nan Duan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[30] arXiv:2206.00415 [pdf, other]: Title: Learning Invariant Visual Representations for Compositional Zero-Shot Learning

Authors: Tian Zhang, Kongming Liang, Ruoyi Du, Xian Sun, Zhanyu Ma, Jun Guo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[31] arXiv:2206.00447 [pdf, other]: Title: CD$^2$: Fine-grained 3D Mesh Reconstruction With Twice Chamfer Distance

Authors: Rongfei Zeng, Mai Su, Ruiyun Yu, Xingwei Wang

Comments: Just accepted by TOMM

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[32] arXiv:2206.00468 [pdf, other]: Title: PanopticDepth: A Unified Framework for Depth-aware Panoptic Segmentation

Authors: Naiyu Gao, Fei He, Jian Jia, Yanhu Shan, Haoyang Zhang, Xin Zhao, Kaiqi Huang

Comments: CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[33] arXiv:2206.00481 [pdf, other]: Title: Where are my Neighbors? Exploiting Patches Relations in Self-Supervised Vision Transformer

Authors: Guglielmo Camporese, Elena Izzo, Lamberto Ballan

Comments: Accepted to BMVC 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[34] arXiv:2206.00489 [pdf, other]: Title: Attack-Agnostic Adversarial Detection

Authors: Jiaxin Cheng, Mohamed Hussein, Jay Billa, Wael AbdAlmageed

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[35] arXiv:2206.00491 [pdf, other]: Title: Semantic Room Wireframe Detection from a Single View

Authors: David Gillsjö, Gabrielle Flood, Kalle Åström

Comments: Accepted for ICPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[36] arXiv:2206.00506 [pdf, other]: Title: Proximally Sensitive Error for Anomaly Detection and Feature Learning

Authors: Amogh Gudi, Fritjof Büttner, Jan van Gemert

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[37] arXiv:2206.00515 [pdf, other]: Title: Landslide4Sense: Reference Benchmark Data and Deep Learning Models for Landslide Detection

Authors: Omid Ghorbanzadeh, Yonghao Xu, Pedram Ghamisi, Michael Kopp, David Kreil

Journal-ref: IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-17, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[38] arXiv:2206.00527 [pdf, other]: Title: Amodal Cityscapes: A New Dataset, its Generation, and an Amodal Semantic Segmentation Challenge Baseline

Authors: Jasmin Breitenstein, Tim Fingscheidt

Comments: This paper is accepted at IEEE Intelligent Vehicles Symposium 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[39] arXiv:2206.00535 [pdf, other]: Title: Deepfake Caricatures: Amplifying attention to artifacts increases deepfake detection by humans and machines

Authors: Camilo Fosco, Emilie Josephs, Alex Andonian, Allen Lee, Xi Wang, Aude Oliva

Comments: 9 pages, 5 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Social and Information Networks (cs.SI)
[40] arXiv:2206.00580 [pdf, other]: Title: Dog nose print matching with dual global descriptor based on Contrastive Learning

Authors: Bin Li, Zhongan Wang, Nan Wu, Shuai Shi, Qijun Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41] arXiv:2206.00608 [pdf, other]: Title: On the Choice of Data for Efficient Training and Validation of End-to-End Driving Models

Authors: Marvin Klingner, Konstantin Müller, Mona Mirzaie, Jasmin Breitenstein, Jan-Aike Termöhlen, Tim Fingscheidt

Comments: Accepted at CVPR VDU Workshop 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[42] arXiv:2206.00614 [pdf, other]: Title: Dual-stream spatiotemporal networks with feature sharing for monitoring animals in the home cage

Authors: Ezechukwu I. Nwokedi, Rasneer S. Bains, Luc Bidaut, Xujiong Ye, Sara Wells, James M. Brown

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[43] arXiv:2206.00629 [pdf, other]: Title: CLIP4IDC: CLIP for Image Difference Captioning

Authors: Zixin Guo, Tzu-Jui Julius Wang, Jorma Laaksonen

Comments: Accepted to AACL-IJCNLP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[44] arXiv:2206.00630 [pdf, other]: Title: Unifying Voxel-based Representation with Transformer for 3D Object Detection

Authors: Yanwei Li, Yilun Chen, Xiaojuan Qi, Zeming Li, Jian Sun, Jiaya Jia

Comments: Accepted to NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[45] arXiv:2206.00645 [pdf, other]: Title: Floorplan Restoration by Structure Hallucinating Transformer Cascades

Authors: Sepidehsadat Hosseini, Yasutaka Furukawa

Comments: Published at BMVC 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[46] arXiv:2206.00665 [pdf, other]: Title: MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface Reconstruction

Authors: Zehao Yu, Songyou Peng, Michael Niemeyer, Torsten Sattler, Andreas Geiger

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[47] arXiv:2206.00718 [pdf, other]: Title: Context-Driven Detection of Invertebrate Species in Deep-Sea Video

Authors: R. Austin McEver, Bowen Zhang, Connor Levenson, A S M Iftekhar, B.S. Manjunath

Journal-ref: International Journal of Computer Vision 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[48] arXiv:2206.00735 [pdf, other]: Title: Cascaded Video Generation for Videos In-the-Wild

Authors: Lluis Castrejon, Nicolas Ballas, Aaron Courville

Comments: Accepted to the 26th International Conference on Pattern Recognition (ICPR 2022). arXiv admin note: substantial text overlap with arXiv:2106.02719

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[49] arXiv:2206.00746 [pdf, other]: Title: Residual Multiplicative Filter Networks for Multiscale Reconstruction

Authors: Shayan Shekarforoush, David B. Lindell, David J. Fleet, Marcus A. Brubaker

Comments: NeurIPS 2022, Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[50] arXiv:2206.00771 [pdf, other]: Title: Dynamic Linear Transformer for 3D Biomedical Image Segmentation

Authors: Zheyuan Zhang, Ulas Bagci

Comments: 8 Pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[51] arXiv:2206.00790 [pdf, other]: Title: Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction

Authors: Jun Chen, Ming Hu, Boyang Li, Mohamed Elhoseiny

Comments: Add code

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[52] arXiv:2206.00798 [pdf, other]: Title: Multi-scale frequency separation network for image deblurring

Authors: Yanni Zhang, Qiang Li, Miao Qi, Di Liu, Jun Kong, Jianzhong Wang

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[53] arXiv:2206.00800 [pdf, other]: Title: CcHarmony: Color-checker based Image Harmonization Dataset

Authors: Haoxu Huang, Li Niu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54] arXiv:2206.00806 [pdf, other]: Title: XBound-Former: Toward Cross-scale Boundary Modeling in Transformers

Authors: Jiacheng Wang, Fei Chen, Yuxi Ma, Liansheng Wang, Zhaodong Fei, Jianwei Shuai, Xiangdong Tang, Qichao Zhou, Jing Qin

Comments: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[55] arXiv:2206.00812 [pdf, other]: Title: Modeling sRGB Camera Noise with Normalizing Flows

Authors: Shayan Kousha, Ali Maleky, Michael S. Brown, Marcus A. Brubaker

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[56] arXiv:2206.00859 [pdf, other]: Title: Disentangled Generation Network for Enlarged License Plate Recognition and A Unified Dataset

Authors: Chenglong Li, Xiaobin Yang, Guohao Wang, Aihua Zheng, Chang Tan, Ruoran Jia, Jin Tang

Comments: Submission to CVIU

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[57] arXiv:2206.00878 [pdf, other]: Title: EfficientNeRF: Efficient Neural Radiance Fields

Authors: Tao Hu, Shu Liu, Yilun Chen, Tiancheng Shen, Jiaya Jia

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[58] arXiv:2206.00893 [pdf, other]: Title: Leveraging Systematic Knowledge of 2D Transformations

Authors: Jiachen Kang, Wenjing Jia, Xiangjian He

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[59] arXiv:2206.00897 [pdf, other]: Title: xView3-SAR: Detecting Dark Fishing Activity Using Synthetic Aperture Radar Imagery

Authors: Fernando Paolo, Tsu-ting Tim Lin, Ritwik Gupta, Bryce Goodman, Nirav Patel, Daniel Kuster, David Kroodsma, Jared Dunnmon

Comments: Accepted to NeurIPS 2022. 10 pages (25 with references and supplement)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[60] arXiv:2206.00902 [pdf, other]: Title: MISSU: 3D Medical Image Segmentation via Self-distilling TransUNet

Authors: Nan Wang, Shaohui Lin, Xiaoxiao Li, Ke Li, Yunhang Shen, Yue Gao, Lizhuang Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[61] arXiv:2206.00923 [pdf, other]: Title: Modeling Image Composition for Complex Scene Generation

Authors: Zuopeng Yang, Daqing Liu, Chaoyue Wang, Jie Yang, Dacheng Tao

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[62] arXiv:2206.00924 [pdf, other]: Title: FACM: Intermediate Layer Still Retain Effective Features against Adversarial Examples

Authors: Xiangyuan Yang, Jie Lin, Hanlin Zhang, Xinyu Yang, Peng Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[63] arXiv:2206.00930 [pdf, other]: Title: Predicting Physical Object Properties from Video

Authors: Martin Link, Max Schwarz, Sven Behnke

Comments: accepted for International Joint Conference on Neural Networks (IJCNN) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[64] arXiv:2206.00947 [pdf, other]: Title: A Bhattacharyya Coefficient-Based Framework for Noise Model-Aware Random Walker Image Segmentation

Authors: Dominik Drees, Florian Eilers, Ang Bian, Xiaoyi Jiang

Comments: Dominik Drees and Florian Eilers contributed equally to this work

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[65] arXiv:2206.00960 [pdf, other]: Title: SparseDet: Towards End-to-End 3D Object Detection

Authors: Jianhong Han, Zhaoyi Wan, Zhe Liu, Jie Feng, Bingfeng Zhou

Journal-ref: Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, pp. 781-792. Feb. 6-8, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[66] arXiv:2206.00971 [pdf, other]: Title: CVM-Cervix: A Hybrid Cervical Pap-Smear Image Classification Framework Using CNN, Visual Transformer and Multilayer Perceptron

Authors: Wanli Liu, Chen Li, Ning Xu, Tao Jiang, Md Mamunur Rahaman, Hongzan Sun, Xiangchen Wu, Weiming Hu, Haoyuan Chen, Changhao Sun, Yudong Yao, Marcin Grzegorzek

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[67] arXiv:2206.00997 [pdf, other]: Title: Is Mapping Necessary for Realistic PointGoal Navigation?

Authors: Ruslan Partsey, Erik Wijmans, Naoki Yokoyama, Oles Dobosevych, Dhruv Batra, Oleksandr Maksymets

Comments: Corrected typos in the Abstract

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[68] arXiv:2206.01009 [pdf, other]: Title: Unified Recurrence Modeling for Video Action Anticipation

Authors: Tsung-Ming Tai, Giuseppe Fiameni, Cheng-Kuang Lee, Simon See, Oswald Lanz

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[69] arXiv:2206.01010 [pdf, other]: Title: Long-tailed Recognition by Learning from Latent Categories

Authors: Weide Liu, Zhonghua Wu, Yiming Wang, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[70] arXiv:2206.01014 [pdf, other]: Title: Suggestive Annotation of Brain MR Images with Gradient-guided Sampling

Authors: Chengliang Dai, Shuo Wang, Yuanhan Mo, Elsa Angelini, Yike Guo, Wenjia Bai

Comments: Manuscript accepted by MedIA

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[71] arXiv:2206.01017 [pdf, other]: Title: Structured Two-stream Attention Network for Video Question Answering

Authors: Lianli Gao, Pengpeng Zeng, Jingkuan Song, Yuan-Fang Li, Wu Liu, Tao Mei, Heng Tao Shen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[72] arXiv:2206.01034 [pdf, other]: Title: Adversarial Laser Spot: Robust and Covert Physical-World Attack to DNNs

Authors: Chengyin Hu, Yilong Wang, Kalibinuer Tiliwalidi, Wen Li

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[73] arXiv:2206.01038 [pdf, other]: Title: A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications

Authors: Fei Wu, Qingzhong Wang, Jian Bian, Haoyi Xiong, Ning Ding, Feixiang Lu, Jun Cheng, Dejing Dou

Comments: 26 pages. The toolbox is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[74] arXiv:2206.01061 [pdf, other]: Title: FV-UPatches: Enhancing Universality in Finger Vein Recognition

Authors: Ziyan Chen, Jiazhen Liu, Changwen Cao, Changlong Jin, Hakil Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[75] arXiv:2206.01062 [pdf, other]: Title: DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis

Authors: Birgit Pfitzmann, Christoph Auer, Michele Dolfi, Ahmed S Nassar, Peter W J Staar

Comments: 9 pages, 6 figures, 5 tables. Accepted paper at SIGKDD 2022 conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[76] arXiv:2206.01102 [pdf, other]: Title: A temporal chrominance trigger for clean-label backdoor attack against anti-spoof rebroadcast detection

Authors: Wei Guo, Benedetta Tondi, Mauro Barni

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[77] arXiv:2206.01125 [pdf, other]: Title: Prefix Conditioning Unifies Language and Label Supervision

Authors: Kuniaki Saito, Kihyuk Sohn, Xiang Zhang, Chun-Liang Li, Chen-Yu Lee, Kate Saenko, Tomas Pfister

Comments: CVPR2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[78] arXiv:2206.01127 [pdf, other]: Title: VL-BEiT: Generative Vision-Language Pretraining

Authors: Hangbo Bao, Wenhui Wang, Li Dong, Furu Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[79] arXiv:2206.01136 [pdf, other]: Title: Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

Authors: Jun Li, Junyu Chen, Yucheng Tang, Ce Wang, Bennett A. Landman, S. Kevin Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[80] arXiv:2206.01153 [pdf, other]: Title: Multi-View Active Fine-Grained Recognition

Authors: Ruoyi Du, Wenqing Yu, Heqing Wang, Dongliang Chang, Ting-En Lin, Yongbin Li, Zhanyu Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[81] arXiv:2206.01160 [pdf, other]: Title: DE-Net: Dynamic Text-guided Image Editing Adversarial Networks

Authors: Ming Tao, Bing-Kun Bao, Hao Tang, Fei Wu, Longhui Wei, Qi Tian

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[82] arXiv:2206.01161 [pdf, other]: Title: Optimizing Relevance Maps of Vision Transformers Improves Robustness

Authors: Hila Chefer, Idan Schwartz, Lior Wolf

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83] arXiv:2206.01191 [pdf, other]: Title: EfficientFormer: Vision Transformers at MobileNet Speed

Authors: Yanyu Li, Geng Yuan, Yang Wen, Ju Hu, Georgios Evangelidis, Sergey Tulyakov, Yanzhi Wang, Jian Ren

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[84] arXiv:2206.01198 [pdf, other]: Title: Pruning-as-Search: Efficient Neural Architecture Search via Channel Pruning and Structural Reparameterization

Authors: Yanyu Li, Pu Zhao, Geng Yuan, Xue Lin, Yanzhi Wang, Xin Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[85] arXiv:2206.01201 [pdf, other]: Title: REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering

Authors: Yuanze Lin, Yujia Xie, Dongdong Chen, Yichong Xu, Chenguang Zhu, Lu Yuan

Comments: Accepted by NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[86] arXiv:2206.01202 [pdf, other]: Title: Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features

Authors: Chieh Hubert Lin, Hsin-Ying Lee, Hung-Yu Tseng, Maneesh Singh, Ming-Hsuan Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[87] arXiv:2206.01203 [pdf, other]: Title: Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation Using Bounding Boxes

Authors: Julian Chibane, Francis Engelmann, Tuan Anh Tran, Gerard Pons-Moll

Comments: Project page: this https URL

Journal-ref: European Conference on Computer Vision (ECCV), 2022, Oral Presentation

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88] arXiv:2206.01204 [pdf, other]: Title: Siamese Image Modeling for Self-Supervised Vision Representation Learning

Authors: Chenxin Tao, Xizhou Zhu, Weijie Su, Gao Huang, Bin Li, Jie Zhou, Yu Qiao, Xiaogang Wang, Jifeng Dai

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89] arXiv:2206.01232 [pdf, other]: Title: What Are Expected Queries in End-to-End Object Detection?

Authors: Shilong Zhang, Xinjiang Wang, Jiaqi Wang, Jiangmiao Pang, Kai Chen

Comments: The source code is publicly available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[90] arXiv:2206.01244 [pdf, other]: Title: Real-Time Portrait Stylization on the Edge

Authors: Yanyu Li, Xuan Shen, Geng Yuan, Jiexiong Guan, Wei Niu, Hao Tang, Bin Ren, Yanzhi Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[91] arXiv:2206.01256 [pdf, other]: Title: PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

Authors: Yingfei Liu, Junjie Yan, Fan Jia, Shuailin Li, Aqi Gao, Tiancai Wang, Xiangyu Zhang, Jian Sun

Comments: Adding 3D lane detection results on OpenLane Dataset

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[92] arXiv:2206.01290 [pdf, other]: Title: Points2NeRF: Generating Neural Radiance Fields from 3D point cloud

Authors: D. Zimny, T. Trzciński, P. Spurek

Comments: arXiv admin note: text overlap with arXiv:2003.08934 by other authors

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93] arXiv:2206.01297 [pdf, other]: Title: Lossless Compression of Point Cloud Sequences Using Sequence Optimized CNN Models

Authors: Emre Can Kaya, Ioan Tabus

Comments: 9 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[94] arXiv:2206.01309 [pdf, other]: Title: H-EMD: A Hierarchical Earth Mover's Distance Method for Instance Segmentation

Authors: Peixian Liang, Yizhe Zhang, Yifan Ding, Jianxu Chen, Chinedu S. Madukoma, Tim Weninger, Joshua D. Shrout, Danny Z. Chen

Comments: Accepted at IEEE Transactions On Medical Imaging (TMI)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[95] arXiv:2206.01319 [pdf, other]: Title: Learning Unbiased Transferability for Domain Adaptation by Uncertainty Modeling

Authors: Jian Hu, Haowen Zhong, Junchi Yan, Shaogang Gong, Guile Wu, Fei Yang

Comments: This paper has been accepted by ECCV2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[96] arXiv:2206.01326 [pdf, other]: Title: Improving Fairness in Large-Scale Object Recognition by CrowdSourced Demographic Information

Authors: Zu Kim, André Araujo, Bingyi Cao, Cam Askew, Jack Sim, Mike Green, N'Mah Fodiatu Yilla, Tobias Weyand

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[97] arXiv:2206.01327 [pdf, other]: Title: RELAY: Robotic EyeLink AnalYsis of the EyeLink 1000 using an Artificial Eye

Authors: Anna-Maria Felßberg, Dominykas Strazdas

Comments: 12 Pages, 17 Figures, 2 Tables. Git Repository: this https URL Appendix Repository: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[98] arXiv:2206.01334 [pdf, other]: Title: Long Scale Error Control in Low Light Image and Video Enhancement Using Equivariance

Authors: Sara Aghajanzadeh, David Forsyth

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[99] arXiv:2206.01365 [pdf, other]: Title: Adversarial Attacks on Human Vision

Authors: Victor A. Mateescu, Ivan V. Bajić

Comments: 21 pages, 8 figures, 1 table

Journal-ref: Extended version of IEEE MultiMedia, vol. 23, no. 1, pp. 82-91, Jan.-Mar. 2016

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[100] arXiv:2206.01369 [pdf, other]: Title: Incremental Learning Meets Transfer Learning: Application to Multi-site Prostate MRI Segmentation

Authors: Chenyu You, Jinlin Xiang, Kun Su, Xiaoran Zhang, Siyuan Dong, John Onofrey, Lawrence Staib, James S. Duncan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[101] arXiv:2206.01370 [pdf, other]: Title: Towards Improving the Generation Quality of Autoregressive Slot VAEs

Authors: Patrick Emami, Pan He, Sanjay Ranka, Anand Rangarajan

Comments: Published in Neural Computation. 38 pages, 18 figures. Code and videos available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[102] arXiv:2206.01381 [pdf, other]: Title: CF-YOLO: Cross Fusion YOLO for Object Detection in Adverse Weather with a High-quality Real Snow Dataset

Authors: Qiqi Ding, Peng Li, Xuefeng Yan, Ding Shi, Luming Liang, Weiming Wang, Haoran Xie, Jonathan Li, Mingqiang Wei

Comments: 10 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[103] arXiv:2206.01384 [pdf, ps, other]: Title: End-to-End 3D Hand Pose Estimation from Stereo Cameras

Authors: Yuncheng Li, Zehao Xue, Yingying Wang, Liuhao Ge, Zhou Ren, Jonathan Rodriguez

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[104] arXiv:2206.01408 [pdf, other]: Title: MetaLR: Meta-tuning of Learning Rates for Transfer Learning in Medical Imaging

Authors: Yixiong Chen, Li Liu, Jingxian Li, Hua Jiang, Chris Ding, Zongwei Zhou

Comments: MICCAI 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[105] arXiv:2206.01417 [pdf, other]: Title: Learning an Adaptation Function to Assess Image Visual Similarities

Authors: Olivier Risser-Maroix (LIPADE), Amine Marzouki (LIPADE), Hala Djeghim (LIPADE), Camille Kurtz (LIPADE), Nicolas Lomenie (LIPADE)

Journal-ref: ORASIS 2021, Centre National de la Recherche Scientifique [CNRS], Sep 2021, Saint Ferréol, France

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[106] arXiv:2206.01429 [pdf, other]: Title: Learning rich optical embeddings for privacy-preserving lensless image classification

Authors: Eric Bezzam, Martin Vetterli, Matthieu Simeoni

Comments: 29 pages, 23 figures, under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[107] arXiv:2206.01441 [pdf, other]: Title: Exploring Transformers for Behavioural Biometrics: A Case Study in Gait Recognition

Authors: Paula Delgado-Santos, Ruben Tolosana, Richard Guest, Farzin Deravi, Ruben Vera-Rodriguez

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[108] arXiv:2206.01466 [pdf, other]: Title: Recognition of Unseen Bird Species by Learning from Field Guides

Authors: Andrés C. Rodríguez, Stefano D'Aronco, Rodrigo Caye Daudt, Jan D. Wegner, Konrad Schindler

Comments: Accepted to WACV2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[109] arXiv:2206.01467 [pdf, other]: Title: The Importance of Image Interpretation: Patterns of Semantic Misclassification in Real-World Adversarial Images

Authors: Zhengyu Zhao, Nga Dang, Martha Larson

Comments: International Conference on Multimedia Modeling (MMM) 2023. Resources are publicly available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[110] arXiv:2206.01473 [pdf, other]: Title: Distributional loss for convolutional neural network regression and application to GNSS multi-path estimation

Authors: Thomas Gonzalez, Antoine Blais, Nicolas Couëllan, Christian Ruiz

Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[111] arXiv:2206.01498 [pdf, ps, other]: Title: YOLOv5s-GTB: light-weighted and improved YOLOv5s for bridge crack detection

Authors: Xiao Ruiqiang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[112] arXiv:2206.01524 [pdf, other]: Title: Anomaly detection in surveillance videos using transformer based attention model

Authors: Kapil Deshpande, Narinder Singh Punn, Sanjay Kumar Sonbhadra, Sonali Agarwal

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113] arXiv:2206.01627 [pdf, other]: Title: Pruning for Feature-Preserving Circuits in CNNs

Authors: Chris Hamblin, Talia Konkle, George Alvarez

Comments: Under Review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[114] arXiv:2206.01646 [pdf, other]: Title: Integrating Prior Knowledge in Contrastive Learning with Kernel

Authors: Benoit Dufumier, Carlo Alberto Barbano, Robin Louiset, Edouard Duchesnay, Pietro Gori

Comments: ICML 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[115] arXiv:2206.01651 [pdf, other]: Title: D'ARTAGNAN: Counterfactual Video Generation

Authors: Hadrien Reynaud, Athanasios Vlontzos, Mischa Dombrowski, Ciarán Lee, Arian Beqiri, Paul Leeson, Bernhard Kainz

Comments: Accepted for MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[116] arXiv:2206.01653 [pdf, other]: Title: Metrics reloaded: Recommendations for image analysis validation

Authors: Lena Maier-Hein, Annika Reinke, Patrick Godau, Minu D. Tizabi, Florian Buettner, Evangelia Christodoulou, Ben Glocker, Fabian Isensee, Jens Kleesiek, Michal Kozubek, Mauricio Reyes, Michael A. Riegler, Manuel Wiesenfarth, A. Emre Kavur, Carole H. Sudre, Michael Baumgartner, Matthias Eisenmann, Doreen Heckmann-Nötzel, Tim Rädsch, Laura Acion, Michela Antonelli, Tal Arbel, Spyridon Bakas, Arriel Benis, Matthew Blaschko, M. Jorge Cardoso, Veronika Cheplygina, Beth A. Cimini, Gary S. Collins, Keyvan Farahani, Luciana Ferrer, Adrian Galdran, Bram van Ginneken, Robert Haase, Daniel A. Hashimoto, Michael M. Hoffman, Merel Huisman, Pierre Jannin, Charles E. Kahn, Dagmar Kainmueller, Bernhard Kainz, Alexandros Karargyris, Alan Karthikesalingam, Hannes Kenngott, Florian Kofler, Annette Kopp-Schneider, et al. (28 additional authors not shown)

Comments: Shared first authors: Lena Maier-Hein, Annika Reinke. arXiv admin note: substantial text overlap with arXiv:2104.05642. Published in Nature Methods

Journal-ref: Nature methods, 1-18 (2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[117] arXiv:2206.01658 [pdf, ps, other]: Title: Identification via Retinal Vessels Combining LBP and HOG

Authors: Ali Noori

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[118] arXiv:2206.01661 [pdf, other]: Title: Style-Content Disentanglement in Language-Image Pretraining Representations for Zero-Shot Sketch-to-Image Synthesis

Authors: Jan Zuiderveld

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[119] arXiv:2206.01670 [pdf, other]: Title: Egocentric Video-Language Pretraining

Authors: Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rongcheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou

Comments: Accepted by NeurIPS 2022. Double champions at Ego4D and EPIC-Kitchens, CVPR 2022 challenges. 23 pages, 13 figures, 12 tables. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[120] arXiv:2206.01705 [pdf, other]: Title: Gradient Obfuscation Checklist Test Gives a False Sense of Security

Authors: Nikola Popovic, Danda Pani Paudel, Thomas Probst, Luc Van Gool

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121] arXiv:2206.01714 [pdf, other]: Title: Compositional Visual Generation with Composable Diffusion Models

Authors: Nan Liu, Shuang Li, Yilun Du, Antonio Torralba, Joshua B. Tenenbaum

Comments: ECCV 2022. First three authors contributed equally. Project website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[122] arXiv:2206.01718 [pdf, other]: Title: A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge

Authors: Dustin Schwenk, Apoorv Khandelwal, Christopher Clark, Kenneth Marino, Roozbeh Mottaghi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[123] arXiv:2206.01720 [pdf, other]: Title: Revisiting the "Video" in Video-Language Understanding

Authors: Shyamal Buch, Cristóbal Eyzaguirre, Adrien Gaidon, Jiajun Wu, Li Fei-Fei, Juan Carlos Niebles

Comments: CVPR 2022 (Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[124] arXiv:2206.01724 [pdf, other]: Title: SNAKE: Shape-aware Neural 3D Keypoint Field

Authors: Chengliang Zhong, Peixing You, Xiaoxue Chen, Hao Zhao, Fuchun Sun, Guyue Zhou, Xiaodong Mu, Chuang Gan, Wenbing Huang

Comments: Accepted by NeurIPS 2022. Codes are available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[125] arXiv:2206.01733 [pdf, other]: Title: Adversarial RAW: Image-Scaling Attack Against Imaging Pipeline

Authors: Junjian Li, Honglong Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[126] arXiv:2206.01734 [pdf, ps, other]: Title: Using UAS Imagery and Computer Vision to Support Site-Specific Weed Control in Corn

Authors: Ranjan Sapkota, Paulo Flores

Comments: 16 Figures, 3 Tables. arXiv admin note: substantial text overlap with arXiv:2204.12417

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[127] arXiv:2206.01772 [pdf, other]: Title: Radar Guided Dynamic Visual Attention for Resource-Efficient RGB Object Detection

Authors: Hemant Kumawat, Saibal Mukhopadhyay

Comments: Accepted in International Joint Conference on Neural Networks (IJCNN) 2022

Journal-ref: 2022 International Joint Conference on Neural Networks (IJCNN)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[128] arXiv:2206.01777 [pdf, other]: Title: Real-Time Super-Resolution for Real-World Images on Mobile Devices

Authors: Jie Cai, Zibo Meng, Jiaming Ding, Chiu Man Ho

Comments: arXiv admin note: text overlap with arXiv:2004.13674

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[129] arXiv:2206.01794 [pdf, other]: Title: Additive MIL: Intrinsically Interpretable Multiple Instance Learning for Pathology

Authors: Syed Ashar Javed, Dinkar Juyal, Harshith Padigela, Amaro Taylor-Weiner, Limin Yu, Aaditya Prakash

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[130] arXiv:2206.01813 [pdf, other]: Title: Learning sRGB-to-Raw-RGB De-rendering with Content-Aware Metadata

Authors: Seonghyeon Nam, Abhijith Punnappurath, Marcus A. Brubaker, Michael S. Brown

Comments: CVPR 2022 (GitHub: this https URL)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[131] arXiv:2206.01821 [pdf, other]: Title: EAANet: Efficient Attention Augmented Convolutional Networks

Authors: Runqing Zhang, Tianshu Zhu

Comments: 8 pages, 4 figures. Not published

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[132] arXiv:2206.01831 [pdf, other]: Title: Spatial Feature Mapping for 6DoF Object Pose Estimation

Authors: Jianhan Mei, Xudong Jiang, Henghui Ding

Comments: Pattern Recognition

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133] arXiv:2206.01841 [pdf, other]: Title: Coffee Roast Intelligence

Authors: Sakdipat Ontoum, Thitaree Khemanantakul, Pornphat Sroison, Tuul Triyason, Bunthit Watanapa

Comments: 6 pages, 13 figures, 3 tables, this work was presented at the CSC498 COMPUTER SCIENCE CAPSTONE PROJECT I and CSC499 COMPUTER SCIENCE CAPSTONE PROJECT II courses

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[134] arXiv:2206.01843 [pdf, other]: Title: Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning

Authors: Yujia Xie, Luowei Zhou, Xiyang Dai, Lu Yuan, Nguyen Bach, Ce Liu, Michael Zeng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[135] arXiv:2206.01863 [pdf, other]: Title: Recursive Deformable Image Registration Network with Mutual Attention

Authors: Jian-Qing Zheng, Ziyang Wang, Baoru Huang, Ngee Han Lim, Tonia Vincent, Bartlomiej W. Papiez

Comments: arXiv admin note: text overlap with arXiv:2203.04290

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[136] arXiv:2206.01867 [pdf, other]: Title: SPGNet: Spatial Projection Guided 3D Human Pose Estimation in Low Dimensional Space

Authors: Zihan Wang, Ruimin Chen, Mengxuan Liu, Guanfang Dong, Anup Basu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137] arXiv:2206.01881 [pdf, other]: Title: Face Recognition Accuracy Across Demographics: Shining a Light Into the Problem

Authors: Haiyu Wu, Vítor Albiero, K. S. Krishnapriya, Michael C. King, Kevin W. Bowyer

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[138] arXiv:2206.01884 [pdf, ps, other]: Title: A Superimposed Divide-and-Conquer Image Recognition Method for SEM Images of Nanoparticles on The Surface of Monocrystalline silicon with High Aggregation Degree

Authors: Ruiling Xiao, Jiayang Niu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[139] arXiv:2206.01908 [pdf, other]: Title: Video-based Human-Object Interaction Detection from Tubelet Tokens

Authors: Danyang Tu, Wei Sun, Xiongkuo Min, Guangtao Zhai, Wei Shen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[140] arXiv:2206.01910 [pdf, other]: Title: The Spike Gating Flow: A Hierarchical Structure Based Spiking Neural Network for Online Gesture Recognition

Authors: Zihao Zhao, Yanhong Wang, Qiaosha Zou, Tie Xu, Fangbo Tao, Jiansong Zhang, Xiaoan Wang, C.-J. Richard Shi, Junwen Luo, Yuan Xie

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[141] arXiv:2206.01916 [pdf, other]: Title: Nerfels: Renderable Neural Codes for Improved Camera Pose Estimation

Authors: Gil Avraham, Julian Straub, Tianwei Shen, Tsun-Yi Yang, Hugo Germain, Chris Sweeney, Vasileios Balntas, David Novotny, Daniel DeTone, Richard Newcombe

Comments: Published at CVPRW with supplementary material

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[142] arXiv:2206.01923 [pdf, other]: Title: From Pixels to Objects: Cubic Visual Attention for Visual Question Answering

Authors: Jingkuan Song, Pengpeng Zeng, Lianli Gao, Heng Tao Shen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[143] arXiv:2206.01942 [pdf, other]: Title: Occlusion-Resistant Instance Segmentation of Piglets in Farrowing Pens Using Center Clustering Network

Authors: Endai Huang, Axiu Mao, Junhui Hou, Yongjian Wu, Weitao Xu, Maria Camila Ceballos, Thomas D. Parsons, Kai Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[144] arXiv:2206.01961 [pdf, other]: Title: C$^3$Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in Colonoscopy

Authors: Erez Posner, Adi Zholkover, Netanel Frank, Moshe Bouhnik

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[145] arXiv:2206.01986 [pdf, other]: Title: Delving into the Openness of CLIP

Authors: Shuhuai Ren, Lei Li, Xuancheng Ren, Guangxiang Zhao, Xu Sun

Comments: Accepted by Findings of ACL 2023 (Long Paper). Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[146] arXiv:2206.01988 [pdf, other]: Title: Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation

Authors: Mingjie Li, Wenjia Cai, Karin Verspoor, Shirui Pan, Xiaodan Liang, Xiaojun Chang

Comments: CVPR 2022 (Poster)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[147] arXiv:2206.01992 [pdf, other]: Title: CAINNFlow: Convolutional block Attention modules and Invertible Neural Networks Flow for anomaly detection and localization tasks

Authors: Ruiqing Yan, Fan Zhang, Mengyuan Huang, Wu Liu, Dongyu Hu, Jinfeng Li, Qiang Liu, Jinrong Jiang, Qianjin Guo, Linghan Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[148] arXiv:2206.01999 [pdf, other]: Title: MSR: Making Self-supervised learning Robust to Aggressive Augmentations

Authors: Yingbin Bai, Erkun Yang, Zhaoqing Wang, Yuxuan Du, Bo Han, Cheng Deng, Dadong Wang, Tongliang Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[149] arXiv:2206.02002 [pdf, other]: Title: CVNets: High Performance Library for Computer Vision

Authors: Sachin Mehta, Farzad Abdolhosseini, Mohammad Rastegari

Comments: Technical report

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[150] arXiv:2206.02015 [pdf, other]: Title: APES: Articulated Part Extraction from Sprite Sheets

Authors: Zhan Xu, Matthew Fisher, Yang Zhou, Deepali Aneja, Rushikesh Dudhat, Li Yi, Evangelos Kalogerakis

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[151] arXiv:2206.02027 [pdf, other]: Title: Implicit Neural Representation for Mesh-Free Inverse Obstacle Scattering

Authors: Tin Vlašić, Hieu Nguyen, AmirEhsan Khorashadizadeh, Ivan Dokmanić

Comments: 6 pages, 8 figures, to be published in 2022 Asilomar Conference on Signals, Systems, and Computers

Journal-ref: 2022 Asilomar Conference on Signals, Systems, and Computers

Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[152] arXiv:2206.02029 [pdf, other]: Title: Guided Deep Metric Learning

Authors: Jorge Gonzalez-Zapata, Ivan Reyes-Amezcua, Daniel Flores-Araiza, Mauricio Mendez-Ruiz, Gilberto Ochoa-Ruiz, Andres Mendez-Vazquez

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[153] arXiv:2206.02050 [pdf, other]: Title: Learning Speaker-specific Lip-to-Speech Generation

Authors: Munender Varshney, Ravindra Yadav, Vinay P. Namboodiri, Rajesh M Hegde

Comments: Accepted at ICPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[154] arXiv:2206.02066 [pdf, other]: Title: PIDNet: A Real-time Semantic Segmentation Network Inspired by PID Controllers

Authors: Jiacong Xu, Zixiang Xiong, Shankar P. Bhattacharyya

Comments: 11 pages, 9 figures; This paper will be published by CVPR2023 soon, please refer to the official version then

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[155] arXiv:2206.02070 [pdf, other]: Title: Priors in Deep Image Restoration and Enhancement: A Survey

Authors: Yunfan Lu, Yiqi Lin, Hao Wu, Yunhao Luo, Xu Zheng, Hui Xiong, Lin Wang

Comments: Preprint. Under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[156] arXiv:2206.02082 [pdf, other]: Title: Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval

Authors: Xudong Lin, Simran Tiwari, Shiyuan Huang, Manling Li, Mike Zheng Shou, Heng Ji, Shih-Fu Chang

Comments: To appear in CVPR 2023; The code will be released at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[157] arXiv:2206.02086 [pdf, other]: Title: Towards the Creation of a Nutrition and Food Group Based Image Database

Authors: Zeman Shao, Jiangpeng He, Ya-Yuan Yu, Luotao Lin, Alexandra Cowan, Heather Eicher-Miller, Fengqing Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[158] arXiv:2206.02087 [pdf, other]: Title: Accurate Scoliosis Vertebral Landmark Localization on X-ray Images via Shape-constrained Multi-stage Cascaded CNNs

Authors: Zhiwei Wang, Jinxin Lv, Yunqiao Yang, Yuanhuai Liang, Yi Lin, Qiang Li, Xin Li, Xin Yang

Comments: 9 pages, submitted to IEEE Journal of Biomedical and Health Informatics

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[159] arXiv:2206.02099 [pdf, other]: Title: Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation

Authors: Yuenan Hou, Xinge Zhu, Yuexin Ma, Chen Change Loy, Yikang Li

Comments: CVPR 2022; Our model ranks 1st on Waymo and SemanticKITTI (single-scan) challenges, and ranks 3rd on SemanticKITTI (multi-scan) challenge; Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[160] arXiv:2206.02104 [pdf, other]: Title: ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences

Authors: Christos Tzelepis, James Oldfield, Georgios Tzimiropoulos, Ioannis Patras

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[161] arXiv:2206.02110 [pdf, other]: Title: Computer Vision-based Characterization of Large-scale Jet Flames using a Synthetic Infrared Image Generation Approach

Authors: Carmina Pérez-Guerrero, Jorge Francisco Ciprián-Sánchez, Adriana Palacios, Gilberto Ochoa-Ruiz, Miguel Gonzalez-Mendoza, Vahid Foroughi, Elsa Pastor, Gerardo Rodriguez-Hernandez

Comments: Pre-print submitted to Engineering Science and Technology, an International Journal

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[162] arXiv:2206.02116 [pdf, other]: Title: Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in Videos

Authors: Sukjun Hwang, Miran Heo, Seoung Wug Oh, Seon Joo Kim

Comments: Accepted to CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[163] arXiv:2206.02118 [pdf, other]: Title: ShapePU: A New PU Learning Framework Regularized by Global Consistency for Scribble Supervised Cardiac Segmentation

Authors: Ke Zhang, Xiahai Zhuang

Comments: 11 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[164] arXiv:2206.02120 [pdf, other]: Title: MPANet: Multi-Patch Attention For Infrared Small Target object Detection

Authors: Ao Wang, Wei Li, Xin Wu, Zhanchao Huang, Ran Tao

Comments: 4 pages, 3 figures

Journal-ref: IGARSS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[165] arXiv:2206.02136 [pdf, other]: Title: LDRNet: Enabling Real-time Document Localization on Mobile Devices

Authors: Han Wu, Holland Qian, Huaming Wu, Aad van Moorsel

Comments: ECML-PKDD 2022 this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
[166] arXiv:2206.02146 [pdf, other]: Title: Recurrent Video Restoration Transformer with Guided Deformable Attention

Authors: Jingyun Liang, Yuchen Fan, Xiaoyu Xiang, Rakesh Ranjan, Eddy Ilg, Simon Green, Jiezhang Cao, Kai Zhang, Radu Timofte, Luc Van Gool

Comments: Accepted by NeurIPS 2022. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[167] arXiv:2206.02153 [pdf, other]: Title: HPGNN: Using Hierarchical Graph Neural Networks for Outdoor Point Cloud Processing

Authors: Arulmolivarman Thieshanthan, Amashi Niwarthana, Pamuditha Somarathne, Tharindu Wickremasinghe, Ranga Rodrigo

Comments: Accepted for ICPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[168] arXiv:2206.02158 [pdf, other]: Title: Vanilla Feature Distillation for Improving the Accuracy-Robustness Trade-Off in Adversarial Training

Authors: Guodong Cao, Zhibo Wang, Xiaowei Dong, Zhifei Zhang, Hengchang Guo, Zhan Qin, Kui Ren

Comments: 12 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[169] arXiv:2206.02163 [pdf, other]: Title: MotionCNN: A Strong Baseline for Motion Prediction in Autonomous Driving

Authors: Stepan Konev, Kirill Brodt, Artsiom Sanakoyeu

Comments: CVPR Workshop on Autonomous Driving 2021. Waymo Motion Prediction Challenge 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[170] arXiv:2206.02180 [pdf, other]: Title: Semi-Supervised Learning for Mars Imagery Classification and Segmentation

Authors: Wenjing Wang, Lilang Lin, Zejia Fan, Jiaying Liu

Comments: Accepted by ACM Trans. on Multimedia Computing Communications and Applications (TOMM)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[171] arXiv:2206.02187 [pdf, other]: Title: M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation

Authors: Vishal Chudasama, Purbayan Kar, Ashish Gudmalwar, Nirmesh Shah, Pankaj Wasnik, Naoyuki Onoe

Comments: Accepted for publication in the 5th Multimodal Learning and Applications (MULA) Workshop at CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[172] arXiv:2206.02194 [pdf, other]: Title: FOF: Learning Fourier Occupancy Field for Monocular Real-time Human Reconstruction

Authors: Qiao Feng, Yebin Liu, Yu-Kun Lai, Jingyu Yang, Kun Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[173] arXiv:2206.02200 [pdf, other]: Title: GridShift: A Faster Mode-seeking Algorithm for Image Segmentation and Object Tracking

Authors: Abhishek Kumar, Oladayo S. Ajani, Swagatam Das, Rammohan Mallipeddi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[174] arXiv:2206.02203 [pdf, ps, other]: Title: 3D Convolutional with Attention for Action Recognition

Authors: Labina Shrestha, Shikha Dubey, Farrukh Olimov, Muhammad Aasim Rafique, Moongu Jeon

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[175] arXiv:2206.02220 [pdf, other]: Title: U(1) Symmetry-breaking Observed in Generic CNN Bottleneck Layers

Authors: Louis-François Bouchard, Mohsen Ben Lazreg, Matthew Toews

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[176] arXiv:2206.02234 [pdf, other]: Title: Two Decades of Bengali Handwritten Digit Recognition: A Survey

Authors: A.B.M. Ashikur Rahman, Md. Bakhtiar Hasan, Sabbir Ahmed, Tasnim Ahmed, Md. Hamjajul Ashmafee, Mohammad Ridwan Kabir, Md. Hasanul Kabir

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. 38 pages, 23 figures, 12 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[177] arXiv:2206.02257 [pdf, other]: Title: Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey

Authors: Takehiko Ohkawa, Ryosuke Furuta, Yoichi Sato

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178] arXiv:2206.02260 [pdf, other]: Title: SealID: Saimaa ringed seal re-identification dataset

Authors: Ekaterina Nepovinnykh, Tuomas Eerola, Vincent Biard, Piia Mutka, Marja Niemi, Heikki Kälviäinen, Mervi Kunnasranta

Comments: 15 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Populations and Evolution (q-bio.PE)
[179] arXiv:2206.02261 [pdf, other]: Title: Towards Individual Grevy's Zebra Identification via Deep 3D Fitting and Metric Learning

Authors: Maria Stennett, Daniel I. Rubenstein, Tilo Burghardt

Comments: 4 pages, 5 figures, 1 table; typos corrected, references updated

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[180] arXiv:2206.02270 [pdf, other]: Title: Estimating building energy efficiency from street view imagery, aerial imagery, and land surface temperature data

Authors: Kevin Mayer, Lukas Haas, Tianyuan Huang, Juan Bernabé-Moreno, Ram Rajagopal, Martin Fischer

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[181] arXiv:2206.02281 [pdf, other]: Title: E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles

Authors: Zhenyu Hu, Zhenyu Wu, Pengcheng Pi, Yunhe Xue, Jiayi Shen, Jianchao Tan, Xiangru Lian, Zhangyang Wang, Ji Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[182] arXiv:2206.02288 [pdf, other]: Title: ACT: Semi-supervised Domain-adaptive Medical Image Segmentation with Asymmetric Co-training

Authors: Xiaofeng Liu, Fangxu Xing, Nadya Shusharina, Ruth Lim, C-C Jay Kuo, Georges El Fakhri, Jonghye Woo

Comments: MICCAI 2022 (early accept)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183] arXiv:2206.02295 [pdf, other]: Title: HIFI-Net: A Novel Network for Enhancement to Underwater Images

Authors: Jiajia Zhou, Junbin Zhuang, Yan Zheng, Di Wu

Comments: 7 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[184] arXiv:2206.02307 [pdf, other]: Title: Bootstrapping Semi-supervised Medical Image Segmentation with Anatomical-aware Contrastive Distillation

Authors: Chenyu You, Weicheng Dai, Yifei Min, Lawrence Staib, James S. Duncan

Comments: Accepted at Information Processing in Medical Imaging (IPMI 2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[185] arXiv:2206.02325 [pdf, other]: Title: Evaluation-oriented Knowledge Distillation for Deep Face Recognition

Authors: Yuge Huang, Jiaxiang Wu, Xingkun Xu, Shouhong Ding

Comments: CVPR2022 Oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[186] arXiv:2206.02327 [pdf, other]: Title: JigsawHSI: a network for Hyperspectral Image classification

Authors: Jaime Moraga, H. Sebnem Duzgun

Comments: 7 pages, 7 figures, not peer reviewed

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[187] arXiv:2206.02331 [pdf, ps, other]: Title: MASNet: Improve Performance of Siamese Networks with Mutual-attention for Remote Sensing Change Detection Tasks

Authors: Hongbin Zhou, Yupeng Ren, Qiankun Li, Jun Yin, Yonggang Lin

Comments: XXIV ISPRS Congress

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[188] arXiv:2206.02338 [pdf, other]: Title: OrdinalCLIP: Learning Rank Prompts for Language-Guided Ordinal Regression

Authors: Wanhua Li, Xiaoke Huang, Zheng Zhu, Yansong Tang, Xiu Li, Jie Zhou, Jiwen Lu

Comments: Accepted by NeurIPS2022. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[189] arXiv:2206.02342 [pdf, other]: Title: WHU-Stereo: A Challenging Benchmark for Stereo Matching of High-Resolution Satellite Images

Authors: Shenhong Li, Sheng He, San Jiang, Wanshou Jiang, Lin Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[190] arXiv:2206.02343 [pdf, other]: Title: Contrastive Graph Multimodal Model for Text Classification in Videos

Authors: Ye Liu, Changchong Lu, Chen Lin, Di Yin, Bo Ren

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[191] arXiv:2206.02345 [pdf, other]: Title: Anomaly Detection with Test Time Augmentation and Consistency Evaluation

Authors: Haowei He, Jiaye Teng, Yang Yuan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[192] arXiv:2206.02349 [pdf, other]: Title: Invariant Grounding for Video Question Answering

Authors: Yicong Li, Xiang Wang, Junbin Xiao, Wei Ji, Tat-Seng Chua

Comments: CVPR2022 Oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[193] arXiv:2206.02355 [pdf, other]: Title: Relation Matters: Foreground-aware Graph-based Relational Reasoning for Domain Adaptive Object Detection

Authors: Chaoqi Chen, Jiongcheng Li, Hong-Yu Zhou, Xiaoguang Han, Yue Huang, Xinghao Ding, Yizhou Yu

Comments: Accepted by IEEE T-PAMI

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[194] arXiv:2206.02366 [pdf, other]: Title: Scan2Part: Fine-grained and Hierarchical Part-level Understanding of Real-World 3D Scans

Authors: Alexandr Notchenko, Vladislav Ishimtsev, Alexey Artemov, Vadim Selyutin, Emil Bogomolov, Evgeny Burnaev

Comments: In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[195] arXiv:2206.02373 [pdf, other]: Title: Sports Re-ID: Improving Re-Identification Of Players In Broadcast Videos Of Team Sports

Authors: Bharath Comandur

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[196] arXiv:2206.02374 [pdf, other]: Title: CorticalFlow: A Diffeomorphic Mesh Deformation Module for Cortical Surface Reconstruction

Authors: Léo Lebrat, Rodrigo Santa Cruz, Frédéric de Gournay, Darren Fu, Pierrick Bourgeat, Jurgen Fripp, Clinton Fookes, Olivier Salvado

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[197] arXiv:2206.02377 [pdf, other]: Title: BInGo: Bayesian Intrinsic Groupwise Registration via Explicit Hierarchical Disentanglement

Authors: Xin Wang, Xinzhe Luo, Xiahai Zhuang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[198] arXiv:2206.02392 [pdf, ps, other]: Title: Semi-Supervised Segmentation of Mitochondria from Electron Microscopy Images Using Spatial Continuity

Authors: Yunpeng Xiao, Youpeng Zhao, Ge Yang

Comments: 4 pages of main text, 5 pages of supplementary material and 1 page of references

Journal-ref: 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI). IEEE, 2022: 1-5

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[199] arXiv:2206.02405 [pdf, other]: Title: Image Protection for Robust Cropping Localization and Recovery

Authors: Qichao Ying, Hang Zhou, Xiaoxiao Hu, Zhenxing Qian, Sheng Li, Xinpeng Zhang

Comments: Accepted by IEEE ICME 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[200] arXiv:2206.02424 [pdf, ps, other]: Title: Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehicles

Authors: Hulin Li, Jun Li, Hanbing Wei, Zheng Liu, Zhenfei Zhan, Qiliang Ren

Comments: 18 pages, 12 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[201] arXiv:2206.02452 [pdf, other]: Title: Universal Photometric Stereo Network using Global Lighting Contexts

Authors: Satoshi Ikehata

Comments: Accepted to CVPR2022. Code and Dataset at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[202] arXiv:2206.02454 [pdf, other]: Title: What do CNNs Learn in the First Layer and Why? A Linear Systems Perspective

Authors: Rhea Chowers, Yair Weiss

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[203] arXiv:2206.02498 [pdf, other]: Title: NORPPA: NOvel Ringed seal re-identification by Pelage Pattern Aggregation

Authors: Ekaterina Nepovinnykh, Ilia Chelak, Tuomas Eerola, Heikki Kälviäinen

Comments: 22 pages, 13 figures, 5 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[204] arXiv:2206.02502 [pdf, other]: Title: BehavePassDB: Public Database for Mobile Behavioral Biometrics and Benchmark Evaluation

Authors: Giuseppe Stragapede, Ruben Vera-Rodriguez, Ruben Tolosana, Aythami Morales

Comments: 11 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[205] arXiv:2206.02531 [pdf, other]: Title: 3D-Augmented Contrastive Knowledge Distillation for Image-based Object Pose Estimation

Authors: Zhidan Liu, Zhen Xing, Xiangdong Zhou, Yijiang Chen, Guichun Zhou

Comments: Accepted for presentation at International Conference on Multimedia Retrieval (ICMR '22)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[206] arXiv:2206.02539 [pdf, other]: Title: Robustness Evaluation and Adversarial Training of an Instance Segmentation Model

Authors: Jacob Bond, Andrew Lingg

Comments: 15 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[207] arXiv:2206.02544 [pdf, other]: Title: RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene Generation

Authors: Azimkhon Ostonov, Peter Wonka, Dominik L. Michels

Comments: Accepted at the IEEE Winter Conference on Applications of Computer Vision, WACV 2022

Journal-ref: 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022, pp. 2723-2732

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[208] arXiv:2206.02547 [pdf, ps, other]: Title: Towards retrieving dispersion profiles using quantum-mimic Optical Coherence Tomography and Machine Learning

Authors: Krzysztof A. Maliszewski, Piotr Kolenderski, Varvara Vetrova, Sylwia M. Kolenderska

Comments: 11 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
[209] arXiv:2206.02559 [pdf, other]: Title: Conversation Group Detection With Spatio-Temporal Context

Authors: Stephanie Tan, David M.J. Tax, Hayley Hung

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[210] arXiv:2206.02564 [pdf, other]: Title: Machine Learning for Detection of 3D Features using sparse X-ray data

Authors: Bradley T. Wolfe, Michael J. Falato, Xinhua Zhang, Nga T. T. Nguyen-Fotiadis, J.P. Sauppe, P. M. Kozlowski, P. A. Keiter, R. E. Reinovsky, S. A. Batha, Zhehui Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Data Analysis, Statistics and Probability (physics.data-an)
[211] arXiv:2206.02573 [pdf, other]: Title: Team VI-I2R Technical Report on EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2021

Authors: Yi Cheng, Fen Fang, Ying Sun

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[212] arXiv:2206.02598 [pdf, other]: Title: [Reproducibility Report] Explainable Deep One-Class Classification

Authors: Joao P. C. Bertoldo, Etienne Decencière

Comments: Submitted to the ML Reproducibility Challenge 2021 Fall

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[213] arXiv:2206.02609 [pdf, other]: Title: Real-World Image Super-Resolution by Exclusionary Dual-Learning

Authors: Hao Li, Jinghui Qin, Zhijing Yang, Pengxu Wei, Jinshan Pan, Liang Lin, Yukai Shi

Comments: IEEE TMM 2022; Considering large volume of RealSR datasets, a multi-dataset sampling scheme is developed

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[214] arXiv:2206.02619 [pdf, other]: Title: VPIT: Real-time Embedded Single Object 3D Tracking Using Voxel Pseudo Images

Authors: Illia Oleksiienko, Paraskevi Nousi, Nikolaos Passalis, Anastasios Tefas, Alexandros Iosifidis

Comments: 10 pages, 5 figures, 4 tables. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[215] arXiv:2206.02622 [pdf, other]: Title: Hardware-accelerated Mars Sample Localization via deep transfer learning from photorealistic simulations

Authors: Raúl Castilla-Arquillo, Carlos Jesús Pérez-del-Pulgar, Gonzalo Jesús Paz-Delgado, Levin Gerdes

Comments: Preprint version only. Final version at IEEE Xplore. Accepted for IEEE Robotics and Automation Letters

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[216] arXiv:2206.02647 [pdf, other]: Title: Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning

Authors: Richard J. Chen, Chengkuan Chen, Yicong Li, Tiffany Y. Chen, Andrew D. Trister, Rahul G. Krishnan, Faisal Mahmood

Comments: Accepted to CVPR 2022 (Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[217] arXiv:2206.02664 [pdf, other]: Title: Learning with Capsules: A Survey

Authors: Fabio De Sousa Ribeiro, Kevin Duarte, Miles Everett, Georgios Leontidis, Mubarak Shah

Comments: 29 pages, 43 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[218] arXiv:2206.02680 [pdf, other]: Title: Separable Self-attention for Mobile Vision Transformers

Authors: Sachin Mehta, Mohammad Rastegari

Comments: Technical report

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[219] arXiv:2206.02714 [pdf, other]: Title: FuSS: Fusing Superpixels for Improved Segmentation Consistency

Authors: Ian Nunes, Matheus B. Pereira, Hugo Oliveira, Jefersson A. Dos Santos, Marcus Poggi

Comments: Submitted to IEEE Access. 19 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[220] arXiv:2206.02715 [pdf, other]: Title: Day-to-Night Image Synthesis for Training Nighttime Neural ISPs

Authors: Abhijith Punnappurath, Abdullah Abuolaim, Abdelrahman Abdelhamed, Alex Levinshtein, Michael S. Brown

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[221] arXiv:2206.02717 [pdf, other]: Title: Scene Aware Person Image Generation through Global Contextual Conditioning

Authors: Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein

Comments: Accepted in The International Conference on Pattern Recognition (ICPR) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[222] arXiv:2206.02721 [pdf, other]: Title: Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clustering

Authors: Yongyi Su, Xun Xu, Kui Jia

Comments: NeurIPS 2022 accepted paper

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[223] arXiv:2206.02735 [pdf, other]: Title: People Tracking in Panoramic Video for Guiding Robots

Authors: Alberto Bacchin, Filippo Berno, Emanuele Menegatti, Alberto Pretto

Comments: Accepted to 17th International Conference on Intelligent Autonomous Systems (IAS-17)

Journal-ref: Proceedings of the 17th International Conference on Intelligent Autonomous Systems (IAS 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[224] arXiv:2206.02749 [pdf, other]: Title: CORE: Consistent Representation Learning for Face Forgery Detection

Authors: Yunsheng Ni, Depu Meng, Changqian Yu, Chengbin Quan, Dongchun Ren, Youjian Zhao

Comments: Accepted by CVPRW 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[225] arXiv:2206.02761 [pdf, other]: Title: Dual Decomposition of Convex Optimization Layers for Consistent Attention in Medical Images

Authors: Tom Ron, Michal Weiler-Sagie, Tamir Hazan

Comments: 12 pages, 5 figures. In proceedings of the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022. Copyright 2022 by the author(s)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[226] arXiv:2206.02770 [pdf, other]: Title: Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts

Authors: Basil Mustafa, Carlos Riquelme, Joan Puigcerver, Rodolphe Jenatton, Neil Houlsby

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[227] arXiv:2206.02776 [pdf, other]: Title: Volumetric Disentanglement for 3D Scene Manipulation

Authors: Sagie Benaim, Frederik Warburg, Peter Ebert Christensen, Serge Belongie

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[228] arXiv:2206.02777 [pdf, other]: Title: Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation

Authors: Feng Li, Hao Zhang, Huaizhe Xu, Shilong Liu, Lei Zhang, Lionel M. Ni, Heung-Yeung Shum

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[229] arXiv:2206.02779 [pdf, other]: Title: Blended Latent Diffusion

Authors: Omri Avrahami, Ohad Fried, Dani Lischinski

Comments: Accepted to SIGGRAPH 2023. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[230] arXiv:2206.02780 [pdf, other]: Title: GenSDF: Two-Stage Learning of Generalizable Signed Distance Functions

Authors: Gene Chou, Ilya Chugunov, Felix Heide

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[231] arXiv:2206.02846 [pdf, other]: Title: A Deeper Dive Into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic Information

Authors: Matthew Kowal, Mennatullah Siam, Md Amirul Islam, Neil D. B. Bruce, Richard P. Wildes, Konstantinos G. Derpanis

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[232] arXiv:2206.02850 [pdf, other]: Title: GLF-CR: SAR-Enhanced Cloud Removal with Global-Local Fusion

Authors: Fang Xu, Yilei Shi, Patrick Ebel, Lei Yu, Gui-Song Xia, Wen Yang, Xiao Xiang Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[233] arXiv:2206.02876 [pdf, other]: Title: SpikiLi: A Spiking Simulation of LiDAR based Real-time Object Detection for Autonomous Driving

Authors: Sambit Mohapatra, Thomas Mesquida, Mona Hodaei, Senthil Yogamani, Heinrich Gotzig, Patrick Mader

Comments: Accepted at Workshop on Event Sensing and Neuromorphic Engineering - 8th International Conference on Event-based Control, Communication, and Signal Processing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[234] arXiv:2206.02903 [pdf, other]: Title: Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps

Authors: Seung Wook Kim, Karsten Kreis, Daiqing Li, Antonio Torralba, Sanja Fidler

Comments: CVPR 2022 Oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[235] arXiv:2206.02912 [pdf, ps, other]: Title: Learning Image Representations for Content Based Image Retrieval of Radiotherapy Treatment Plans

Authors: Charles Huang, Varun Vasudevan, Oscar Pastor-Serrano, Md Tauhidul Islam, Yusuke Nomura, Piotr Dubrowski, Jen-Yeu Wang, Joseph B. Schulz, Yong Yang, Lei Xing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[236] arXiv:2206.02967 [pdf, other]: Title: Masked Unsupervised Self-training for Label-free Image Classification

Authors: Junnan Li, Silvio Savarese, Steven C.H. Hoi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[237] arXiv:2206.02977 [pdf, other]: Title: DETR++: Taming Your Multi-Scale Detection Transformer

Authors: Chi Zhang, Lijuan Liu, Xiaoxue Zang, Frederick Liu, Hao Zhang, Xinying Song, Jindong Chen

Comments: T4V: Transformers for Vision workshop @ CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[238] arXiv:2206.02985 [pdf, other]: Title: Structured Context Transformer for Generic Event Boundary Detection

Authors: Congcong Li, Xinyao Wang, Dexiang Hong, Yufei Wang, Libo Zhang, Tiejian Luo, Longyin Wen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[239] arXiv:2206.02997 [pdf, ps, other]: Title: TadML: A fast temporal action detection with Mechanics-MLP

Authors: Bowen Deng, Dongchang Liu

Comments: 8 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[240] arXiv:2206.03001 [pdf, other]: Title: PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System

Authors: Chenxia Li, Weiwei Liu, Ruoyu Guo, Xiaoting Yin, Kaitao Jiang, Yongkun Du, Yuning Du, Lingfeng Zhu, Baohua Lai, Xiaoguang Hu, Dianhai Yu, Yanjun Ma

Comments: arXiv admin note: text overlap with arXiv:2109.03144

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[241] arXiv:2206.03010 [pdf, other]: Title: MS-RNN: A Flexible Multi-Scale Framework for Spatiotemporal Predictive Learning

Authors: Zhifeng Ma, Hao Zhang, Jie Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[242] arXiv:2206.03012 [pdf, other]: Title: TriBYOL: Triplet BYOL for Self-Supervised Representation Learning

Authors: Guang Li, Ren Togo, Takahiro Ogawa, Miki Haseyama

Comments: Published as a conference paper at ICASSP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[243] arXiv:2206.03014 [pdf, other]: Title: The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation

Authors: Lin Li, Long Chen, Yifeng Huang, Zhimeng Zhang, Songyang Zhang, Jun Xiao

Comments: Accepted by CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[244] arXiv:2206.03017 [pdf, other]: Title: Development of Automatic Endotracheal Tube and Carina Detection on Portable Supine Chest Radiographs using Artificial Intelligence

Authors: Chi-Yeh Chen, Min-Hsin Huang, Yung-Nien Sun, Chao-Han Lai

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[245] arXiv:2206.03033 [pdf, other]: Title: Deep Learning Techniques for Visual Counting

Authors: Luca Ciampi

Comments: Version with high-quality images can be found at this https URL. arXiv admin note: text overlap with arXiv:1802.03601, arXiv:1707.01202, arXiv:1809.02165, arXiv:1901.06026, arXiv:1808.01244 by other authors

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[246] arXiv:2206.03048 [pdf, other]: Title: Layered Depth Refinement with Mask Guidance

Authors: Soo Ye Kim, Jianming Zhang, Simon Niklaus, Yifei Fan, Simon Chen, Zhe Lin, Munchurl Kim

Comments: Accepted to CVPR 2022 (camera-ready version)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[247] arXiv:2206.03061 [pdf, other]: Title: Spatial Parsing and Dynamic Temporal Pooling networks for Human-Object Interaction detection

Authors: Hongsheng Li, Guangming Zhu, Wu Zhen, Lan Ni, Peiyi Shen, Liang Zhang, Ning Wang, Cong Hua

Comments: Accepted by IJCNN2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[248] arXiv:2206.03062 [pdf, other]: Title: Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map

Authors: Haodong Yuan, Yudong Zhang, Shengyin Fan, Xue Li, Jian Wang

Comments: 9 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[249] arXiv:2206.03064 [pdf, other]: Title: A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector

Authors: Lin Sui, Chen-Lin Zhang, Lixin Gu, Feng Han

Comments: Accepted by WACV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[250] arXiv:2206.03086 [pdf, other]: Title: Online Deep Clustering with Video Track Consistency

Authors: Alessandra Alfani, Federico Becattini, Lorenzo Seidenari, Alberto Del Bimbo

Comments: Accepted at ICPR2022 as oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[251] arXiv:2206.03087 [pdf, other]: Title: Critical Regularizations for Neural Surface Reconstruction in the Wild

Authors: Jingyang Zhang, Yao Yao, Shiwei Li, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[252] arXiv:2206.03105 [pdf, other]: Title: Dual Swin-Transformer based Mutual Interactive Network for RGB-D Salient Object Detection

Authors: Chao Zeng, Sam Kwong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[253] arXiv:2206.03111 [pdf, other]: Title: Medical Image Registration via Neural Fields

Authors: Shanlin Sun, Kun Han, Hao Tang, Deying Kong, Junayed Naushad, Xiangyi Yan, Xiaohui Xie

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[254] arXiv:2206.03113 [pdf, other]: Title: Wavelet Prior Attention Learning in Axial Inpainting Network

Authors: Chenjie Cao, Chengrong Wang, Yuntao Zhang, Yanwei Fu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[255] arXiv:2206.03149 [pdf, other]: Title: Self-Training of Handwritten Word Recognition for Synthetic-to-Real Adaptation

Authors: Fabian Wolf, Gernot A. Fink

Comments: Accepted for publication in International Conference on Pattern Recognition (ICPR) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[256] arXiv:2206.03164 [pdf, other]: Title: Utility of Equivariant Message Passing in Cortical Mesh Segmentation

Authors: Dániel Unyi, Ferdinando Insalata, Petar Veličković, Bálint Gyires-Tóth

Comments: 13 pages, 3 figures, accepted for MIUA 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[257] arXiv:2206.03196 [pdf, other]: Title: Improving Image Captioning with Control Signal of Sentence Quality

Authors: Zhangzi Zhu, Hong Qu

Comments: Accepted by ICASSP2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[258] arXiv:2206.03207 [pdf, other]: Title: Omnivision forecasting: combining satellite observations with sky images for improved intra-hour solar energy predictions

Authors: Quentin Paletta, Guillaume Arbod, Joan Lasenby

Comments: Submitted to Renewable Energy

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[259] arXiv:2206.03210 [pdf, other]: Title: Deep Neural Patchworks: Coping with Large Segmentation Tasks

Authors: Marco Reisert, Maximilian Russe, Samer Elsheikh, Elias Kellner, Henrik Skibbe

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[260] arXiv:2206.03287 [pdf, other]: Title: NeMF: Neural Motion Fields for Kinematic Animation

Authors: Chengan He, Jun Saito, James Zachary, Holly Rushmeier, Yi Zhou

Comments: Accepted to NeurIPS 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[261] arXiv:2206.03361 [pdf, other]: Title: Hierarchical Similarity Learning for Aliasing Suppression Image Super-Resolution

Authors: Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[262] arXiv:2206.03367 [pdf, other]: Title: Localizing Semantic Patches for Accelerating Image Classification

Authors: Chuanguang Yang, Zhulin An, Yongjun Xu

Comments: Accepted by ICME-2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[263] arXiv:2206.03368 [pdf, other]: Title: IL-MCAM: An interactive learning and multi-channel attention mechanism-based weakly supervised colorectal histopathology image classification approach

Authors: Haoyuan Chen, Chen Li, Xiaoyan Li, Md Mamunur Rahaman, Weiming Hu, Yixin Li, Wanli Liu, Changhao Sun, Hongzan Sun, Xinyu Huang, Marcin Grzegorzek

Journal-ref: Computers in Biology and Medicine, Volume 143, April 2022, 105265

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[264] arXiv:2206.03373 [pdf, other]: Title: Garment Avatars: Realistic Cloth Driving using Pattern Registration

Authors: Oshri Halimi, Fabian Prada, Tuur Stuyck, Donglai Xiang, Timur Bagautdinov, He Wen, Ron Kimmel, Takaaki Shiratori, Chenglei Wu, Yaser Sheikh

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[265] arXiv:2206.03410 [pdf, other]: Title: Fast and Robust Non-Rigid Registration Using Accelerated Majorization-Minimization

Authors: Yuxin Yao, Bailin Deng, Weiwei Xu, Juyong Zhang

Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[266] arXiv:2206.03428 [pdf, other]: Title: Revealing Single Frame Bias for Video-and-Language Learning

Authors: Jie Lei, Tamara L. Berg, Mohit Bansal

Comments: 19 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[267] arXiv:2206.03429 [pdf, other]: Title: Generating Long Videos of Dynamic Scenes

Authors: Tim Brooks, Janne Hellsten, Miika Aittala, Ting-Chun Wang, Timo Aila, Jaakko Lehtinen, Ming-Yu Liu, Alexei A. Efros, Tero Karras

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[268] arXiv:2206.03431 [pdf, other]: Title: Self-supervised Domain Adaptation in Crowd Counting

Authors: Pha Nguyen, Thanh-Dat Truong, Miaoqing Huang, Yi Liang, Ngan Le, Khoa Luu

Comments: Accepted at ICIP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[269] arXiv:2206.03452 [pdf, other]: Title: Can CNNs Be More Robust Than Transformers?

Authors: Zeyu Wang, Yutong Bai, Yuyin Zhou, Cihang Xie

Comments: ICLR2023. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[270] arXiv:2206.03461 [pdf, other]: Title: Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models

Authors: Walter H. L. Pinaya, Mark S. Graham, Robert Gray, Pedro F Da Costa, Petru-Daniel Tudosiu, Paul Wright, Yee H. Mah, Andrew D. MacKinnon, James T. Teo, Rolf Jager, David Werring, Geraint Rees, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM)
[271] arXiv:2206.03480 [pdf, other]: Title: SHRED: 3D Shape Region Decomposition with Learned Local Operations

Authors: R. Kenny Jones, Aalia Habib, Daniel Ritchie

Comments: SIGGRAPH ASIA 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[272] arXiv:2206.03484 [pdf, other]: Title: Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding

Authors: Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang

Comments: CVPR camera ready

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[273] arXiv:2206.03544 [pdf, other]: Title: A Penny for Your (visual) Thoughts: Self-Supervised Reconstruction of Natural Movies from Brain Activity

Authors: Ganit Kupershmidt, Roman Beliy, Guy Gaziv, Michal Irani

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[274] arXiv:2206.03591 [pdf, other]: Title: ObPose: Leveraging Pose for Object-Centric Scene Inference and Generation in 3D

Authors: Yizhe Wu, Oiwi Parker Jones, Ingmar Posner

Comments: 14 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[275] arXiv:2206.03600 [pdf, other]: Title: OneRing: A Simple Method for Source-free Open-partial Domain Adaptation

Authors: Shiqi Yang, Yaxing Wang, Kai Wang, Shangling Jui, Joost van de Weijer

Comments: Updated. It only focuses on source-free open-partial domain adaptation, to avoid any potential misunderstanding

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[276] arXiv:2206.03612 [pdf, other]: Title: Predictive Modeling of Charge Levels for Battery Electric Vehicles using CNN EfficientNet and IGTD Algorithm

Authors: Seongwoo Choi, Chongzhou Fang, David Haddad, Minsung Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Signal Processing (eess.SP)
[277] arXiv:2206.03657 [pdf, other]: Title: Delving into the Pre-training Paradigm of Monocular 3D Object Detection

Authors: Zhuoling Li, Chuanrui Zhang, En Yu, Haoqian Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[278] arXiv:2206.03661 [pdf, other]: Title: One Hyper-Initializer for All Network Architectures in Medical Image Analysis

Authors: Fangxin Shang, Yehui Yang, Dalu Yang, Junde Wu, Xiaorong Wang, Yanwu Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[279] arXiv:2206.03666 [pdf, other]: Title: Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and Tracking

Authors: Longlong Jing, Ruichi Yu, Henrik Kretzschmar, Kang Li, Charles R. Qi, Hang Zhao, Alper Ayvaci, Xu Chen, Dillon Cower, Yingwei Li, Yurong You, Han Deng, Congcong Li, Dragomir Anguelov

Journal-ref: ICRA2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[280] arXiv:2206.03673 [pdf, other]: Title: Unsupervised Learning of 3D Scene Flow from Monocular Camera

Authors: Guangming Wang, Xiaoyu Tian, Ruiqi Ding, Hesheng Wang

Comments: ICRA2021

Journal-ref: 2021 IEEE International Conference on Robotics and Automation (ICRA)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[281] arXiv:2206.03678 [pdf, other]: Title: UHD Image Deblurring via Multi-scale Cubic-Mixer

Authors: Zhuoran Zheng, Xiuyi Jia

Comments: 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[282] arXiv:2206.03680 [pdf, other]: Title: Improving Evaluation of Debiasing in Image Classification

Authors: Jungsoo Lee, Juyoung Lee, Sanghun Jung, Jaegul Choo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[283] arXiv:2206.03687 [pdf, other]: Title: A Unified Model for Multi-class Anomaly Detection

Authors: Zhiyuan You, Lei Cui, Yujun Shen, Kai Yang, Xin Lu, Yu Zheng, Xinyi Le

Comments: Accepted by NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[284] arXiv:2206.03691 [pdf, other]: Title: Robust Deep Ensemble Method for Real-world Image Denoising

Authors: Pengju Liu, Hongzhi Zhang, Jinghui Wang, Yuzhi Wang, Dongwei Ren, Wangmeng Zuo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[285] arXiv:2206.03697 [pdf, other]: Title: Blind Face Restoration: Benchmark Datasets and a Baseline Model

Authors: Puyang Zhang, Kaihao Zhang, Wenhan Luo, Changsheng Li, Guoren Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[286] arXiv:2206.03698 [pdf, other]: Title: What do we learn? Debunking the Myth of Unsupervised Outlier Detection

Authors: Cosmin I. Bercea, Daniel Rueckert, Julia A. Schnabel

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[287] arXiv:2206.03727 [pdf, other]: Title: Wavelet Regularization Benefits Adversarial Training

Authors: Jun Yan, Huilin Yin, Xiaoyang Deng, Ziming Zhao, Wancheng Ge, Hao Zhang, Gerhard Rigoll

Comments: Preprint version

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[288] arXiv:2206.03740 [pdf, other]: Title: Large Loss Matters in Weakly Supervised Multi-Label Classification

Authors: Youngwook Kim, Jae Myung Kim, Zeynep Akata, Jungwoo Lee

Comments: CVPR 2022. First two authors contributed equally

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[289] arXiv:2206.03753 [pdf, other]: Title: Task Agnostic Restoration of Natural Video Dynamics

Authors: Muhammad Kashif Ali, Dongjin Kim, Tae Hyun Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[290] arXiv:2206.03775 [pdf, other]: Title: PixSelect: Less but Reliable Pixels for Accurate and Efficient Localization

Authors: Mohammad Altillawi

Journal-ref: IEEE International Conference on Robotics and Automation (ICRA), May 23-27, 2022. Philadelphia, PA, USA, pp. 4156-4162

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[291] arXiv:2206.03778 [pdf, other]: Title: Learning Digital Terrain Models from Point Clouds: ALS2DTM Dataset and Rasterization-based GAN

Authors: Hoàng-Ân Lê, Florent Guiotte, Minh-Tan Pham, Sébastien Lefèvre, Thomas Corpetti

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[292] arXiv:2206.03789 [pdf, other]: Title: Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation

Authors: Zihan Ding, Tianrui Hui, Junshi Huang, Xiaoming Wei, Jizhong Han, Si Liu

Comments: Accepted by CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[293] arXiv:2206.03799 [pdf, other]: Title: Dyna-DM: Dynamic Object-aware Self-supervised Monocular Depth Maps

Authors: Kieran Saunders, George Vogiatzis, Luis J. Manso

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[294] arXiv:2206.03820 [pdf, ps, other]: Title: SUPER-IVIM-DC: Intra-voxel incoherent motion based Fetal lung maturity assessment from limited DWI data using supervised learning coupled with data-consistency

Authors: Noam Korngut, Elad Rotman, Onur Afacan, Sila Kurugol, Yael Zaffrani-Reznikov, Shira Nemirovsky-Rotman, Simon Warfield, Moti Freiman

Comments: Accepted to the International Conference on Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, to be held during Sept 18-22 in Singapore

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
[295] arXiv:2206.03858 [pdf, other]: Title: Rotation-Equivariant Conditional Spherical Neural Fields for Learning a Natural Illumination Prior

Authors: James A. D. Gardner, Bernhard Egger, William A. P. Smith

Comments: NeurIPS 2022 - Project Website: jadgardner.github.io/RENI

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[296] arXiv:2206.03860 [pdf, other]: Title: Orthonormal Convolutions for the Rotation Based Iterative Gaussianization

Authors: Valero Laparra, Alexander Hepburn, J. Emmanuel Johnson, Jesús Malo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[297] arXiv:2206.03862 [pdf, other]: Title: Perceptual Quality Assessment for Fine-Grained Compressed Images

Authors: Zicheng Zhang, Wei Sun, Wei Wu, Ying Chen, Xiongkuo Min, Guangtao Zhai

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[298] arXiv:2206.03876 [pdf, other]: Title: Progressive GANomaly: Anomaly detection with progressively growing GANs

Authors: Djennifer K. Madzia-Madzou, Hugo J. Kuijf

Comments: SPIE Medical Imaging 2022: Image Processing conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[299] arXiv:2206.03888 [pdf, other]: Title: ConFUDA: Contrastive Fewshot Unsupervised Domain Adaptation for Medical Image Segmentation

Authors: Mingxuan Gu, Sulaiman Vesal, Mareike Thies, Zhaoya Pan, Fabian Wagner, Mirabela Rusu, Andreas Maier, Ronak Kosti

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[300] arXiv:2206.03891 [pdf, other]: Title: PrivHAR: Recognizing Human Actions From Privacy-preserving Lens

Authors: Carlos Hinojosa, Miguel Marquez, Henry Arguello, Ehsan Adeli, Li Fei-Fei, Juan Carlos Niebles

Comments: Oral paper presented at European Conference on Computer Vision (ECCV) 2022, in Tel Aviv, Israel

Journal-ref: Computer Vision--ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23--27, 2022, Proceedings, Part IV

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[301] arXiv:2206.03928 [pdf, other]: Title: Direct Triangulation with Spherical Projection for Omnidirectional Cameras

Authors: Ciarán Eising

Comments: 8 pages, 4 figures, 3 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[302] arXiv:2206.03939 [pdf, other]: Title: Depth-Adapted CNNs for RGB-D Semantic Segmentation

Authors: Zongwei Wu, Guillaume Allibert, Christophe Stolz, Chao Ma, Cédric Demonceaux

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[303] arXiv:2206.03943 [pdf, other]: Title: Robust Environment Perception for Automated Driving: A Unified Learning Pipeline for Visual-Infrared Object Detection

Authors: Mohsen Vadidar, Ali Kariminezhad, Christian Mayr, Laurent Kloeker, Lutz Eckstein

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[304] arXiv:2206.03970 [pdf, other]: Title: Narrowing the Coordinate-frame Gap in Behavior Prediction Models: Distillation for Efficient and Accurate Scene-centric Motion Forecasting

Authors: DiJia Su, Bertrand Douillard, Rami Al-Rfou, Cheolho Park, Benjamin Sapp

Comments: Accepted at ICRA 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[305] arXiv:2206.04003 [pdf, other]: Title: Patch-based Object-centric Transformers for Efficient Video Generation

Authors: Wilson Yan, Ryo Okumura, Stephen James, Pieter Abbeel

Comments: Project Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[306] arXiv:2206.04028 [pdf, other]: Title: CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving

Authors: Runjian Chen, Yao Mu, Runsen Xu, Wenqi Shao, Chenhan Jiang, Hang Xu, Zhenguo Li, Ping Luo

Comments: Pre-trained backbones and fine-tuned downstream models are now available: this https URL. Code will be released

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[307] arXiv:2206.04029 [pdf, other]: Title: Accelerating Score-based Generative Models for High-Resolution Image Synthesis

Authors: Hengyuan Ma, Li Zhang, Xiatian Zhu, Jingfeng Zhang, Jianfeng Feng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[308] arXiv:2206.04040 [pdf, other]: Title: MobileOne: An Improved One millisecond Mobile Backbone

Authors: Pavan Kumar Anasosalu Vasu, James Gabriel, Jeff Zhu, Oncel Tuzel, Anurag Ranjan

Comments: Accepted at CVPR 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[309] arXiv:2206.04042 [pdf, other]: Title: Learning Ego 3D Representation as Ray Tracing

Authors: Jiachen Lu, Zheyuan Zhou, Xiatian Zhu, Hang Xu, Li Zhang

Comments: ECCV 2022. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[310] arXiv:2206.04046 [pdf, other]: Title: Sparse Mixture-of-Experts are Domain Generalizable Learners

Authors: Bo Li, Yifei Shen, Jingkang Yang, Yezhen Wang, Jiawei Ren, Tong Che, Jun Zhang, Ziwei Liu

Comments: ICLR 2023 (accepted as Oral presentation)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[311] arXiv:2206.04124 [pdf, other]: Title: DRHDR: A Dual branch Residual Network for Multi-Bracket High Dynamic Range Imaging

Authors: Juan Marín-Vega, Michael Sloth, Peter Schneider-Kamp, Richard Röttger

Comments: Accepted by CVPRW 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[312] arXiv:2206.04125 [pdf, other]: Title: Towards Self-supervised and Weight-preserving Neural Architecture Search

Authors: Zhuowei Li, Yibo Gao, Zhenzhou Zha, Zhiqiang HU, Qing Xia, Shaoting Zhang, Dimitris N. Metaxas

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[313] arXiv:2206.04158 [pdf, other]: Title: Texture Extraction Methods Based Ensembling Framework for Improved Classification

Authors: Vijay Pandey, Trapti Kalra, Mayank Gubba, Mohammed Faisal

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[314] arXiv:2206.04170 [pdf, other]: Title: CASS: Cross Architectural Self-Supervision for Medical Image Analysis

Authors: Pranav Singh, Elena Sizikova, Jacopo Cirrone

Comments: (27 pages, 14 figures), Accepted at NeurIPS 2022 Workshop: Self-Supervised Learning - Theory and Practice

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[315] arXiv:2206.04176 [pdf, other]: Title: VN-Transformer: Rotation-Equivariant Attention for Vector Neurons

Authors: Serge Assaad, Carlton Downey, Rami Al-Rfou, Nigamaa Nayakanti, Ben Sapp

Comments: Published in Transactions on Machine Learning Research (TMLR), 2023; Previous version appeared in Workshop on Machine Learning for Autonomous Driving, Conference on Neural Information Processing Systems (NeurIPS), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[316] arXiv:2206.04197 [pdf, other]: Title: SCAMPS: Synthetics for Camera Measurement of Physiological Signals

Authors: Daniel McDuff, Miah Wander, Xin Liu, Brian L. Hill, Javier Hernandez, Jonathan Lester, Tadas Baltrusaitis

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[317] arXiv:2206.04231 [pdf, other]: Title: JNMR: Joint Non-linear Motion Regression for Video Frame Interpolation

Authors: Meiqin Liu, Chenming Xu, Chao Yao, Chunyu Lin, Yao Zhao

Comments: Accepted by IEEE Transactions on Image Processing (TIP)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[318] arXiv:2206.04242 [pdf, other]: Title: OOD Augmentation May Be at Odds with Open-Set Recognition

Authors: Mohammad Azizmalayeri, Mohammad Hossein Rohban

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319] arXiv:2206.04246 [pdf, other]: Title: SwinCheX: Multi-label classification on chest X-ray images with transformers

Authors: Sina Taslimi, Soroush Taslimi, Nima Fathi, Mohammadreza Salehi, Mohammad Hossein Rohban

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[320] arXiv:2206.04271 [pdf, other]: Title: DeepVerge: Classification of Roadside Verge Biodiversity and Conservation Potential

Authors: Andrew Perrett, Charlie Barnes, Mark Schofield, Lan Qie, Petra Bosilj, James M. Brown

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[321] arXiv:2206.04281 [pdf, other]: Title: Local Spatiotemporal Representation Learning for Longitudinally-consistent Neuroimage Analysis

Authors: Mengwei Ren, Neel Dey, Martin A. Styner, Kelly Botteron, Guido Gerig

Comments: Accepted at NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[322] arXiv:2206.04295 [pdf, other]: Title: Reconstruct Face from Features Using GAN Generator as a Distribution Constraint

Authors: Xingbo Dong, Zhihui Miao, Lan Ma, Jiajun Shen, Zhe Jin, Zhenhua Guo, Andrew Beng Jin Teoh

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[323] arXiv:2206.04325 [pdf, other]: Title: CFA: Coupled-hypersphere-based Feature Adaptation for Target-Oriented Anomaly Localization

Authors: Sungwook Lee, Seunghyun Lee, Byung Cheol Song

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[324] arXiv:2206.04349 [pdf, other]: Title: Deep radiomic signature with immune cell markers predicts the survival of glioma patients

Authors: Ahmad Chaddad, Paul Daniel Mingli Zhang, Saima Rathore, Paul Sargos, Christian Desrosiers, Tamim Niazi

Journal-ref: Neurocomputing, Volume 469, 16 January 2022, Pages 366-375

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM); Methodology (stat.ME)
[325] arXiv:2206.04365 [pdf, other]: Title: CARLA-GeAR: a Dataset Generator for a Systematic Evaluation of Adversarial Robustness of Vision Models

Authors: Federico Nesti, Giulio Rossolini, Gianluca D'Amico, Alessandro Biondi, Giorgio Buttazzo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[326] arXiv:2206.04374 [pdf, other]: Title: Uncovering bias in the PlantVillage dataset

Authors: Mehmet Alican Noyan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[327] arXiv:2206.04381 [pdf, other]: Title: STIP: A SpatioTemporal Information-Preserving and Perception-Augmented Model for High-Resolution Video Prediction

Authors: Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao

Comments: This journal paper is extended from our previous work accepted in CVPR2022 and has been submitted to IEEE Transactions on Multimedia

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[328] arXiv:2206.04382 [pdf, other]: Title: CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes

Authors: Kim Youwang, Kim Ji-Yeon, Tae-Hyun Oh

Comments: Accepted at ECCV 2022. [Project page] this https URL [Code] this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[329] arXiv:2206.04399 [pdf, ps, other]: Title: Depression Recognition using Remote Photoplethysmography from Facial Videos

Authors: Constantino Álvarez Casado, Manuel Lage Cañellas, Miguel Bordallo López

Comments: 10 pages, 5 figures, 8 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
[330] arXiv:2206.04401 [pdf, other]: Title: Cross-modal Local Shortest Path and Global Enhancement for Visible-Thermal Person Re-Identification

Authors: Xiaohong Wang, Chaoqi Li, Xiangcai Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[331] arXiv:2206.04403 [pdf, other]: Title: VITA: Video Instance Segmentation via Object Token Association

Authors: Miran Heo, Sukjun Hwang, Seoung Wug Oh, Joon-Young Lee, Seon Joo Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[332] arXiv:2206.04406 [pdf, other]: Title: Unsupervised Learning of the Total Variation Flow

Authors: Tamara G. Grossmann, Sören Dittmer, Yury Korolev, Carola-Bibiane Schönlieb

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[333] arXiv:2206.04425 [pdf, other]: Title: Multiple Instance Learning for Digital Pathology: A Review on the State-of-the-Art, Limitations & Future Potential

Authors: Michael Gadermayr, Maximilian Tschuchnig

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[334] arXiv:2206.04449 [pdf, other]: Title: Segmentation Enhanced Lameness Detection in Dairy Cows from RGB and Depth Video

Authors: Eric Arazo, Robin Aly, Kevin McGuinness

Comments: Accepted at the CV4Animals workshop in CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[335] arXiv:2206.04452 [pdf, other]: Title: Draft-and-Revise: Effective Image Generation with Contextual RQ-Transformer

Authors: Doyup Lee, Chiheon Kim, Saehoon Kim, Minsu Cho, Wook-Shin Han

Comments: 20 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[336] arXiv:2206.04453 [pdf, other]: Title: The Missing Link: Finding label relations across datasets

Authors: Jasper Uijlings, Thomas Mensink, Vittorio Ferrari

Comments: ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[337] arXiv:2206.04479 [pdf, ps, other]: Title: BSM loss: A superior way in modeling aleatory uncertainty of fine_grained classification

Authors: Shuang Ge, Kehong Yuan, Maokun Han, Desheng Sun, Huabin Zhang, Qiongyu Ye

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[338] arXiv:2206.04503 [pdf, other]: Title: cycle text2face: cycle text-to-face gan via transformers

Authors: Faezeh Gholamrezaie, Mohammad Manthouri

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[339] arXiv:2206.04511 [pdf, other]: Title: Efficient Human Pose Estimation via 3D Event Point Cloud

Authors: Jiaan Chen, Hao Shi, Yaozu Ye, Kailun Yang, Lei Sun, Kaiwei Wang

Comments: Accepted to 3DV 2022. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[340] arXiv:2206.04531 [pdf, other]: Title: ECLAD: Extracting Concepts with Local Aggregated Descriptors

Authors: Andres Felipe Posada-Moreno, Nikita Surya, Sebastian Trimpe

Comments: 34 pages, under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[341] arXiv:2206.04557 [pdf, other]: Title: SparseFormer: Attention-based Depth Completion Network

Authors: Frederik Warburg, Michael Ramamonjisoa, Manuel López-Antequera

Comments: Accepted at CV4ARVR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[342] arXiv:2206.04558 [pdf, other]: Title: BFS-Net: Weakly Supervised Cell Instance Segmentation from Bright-Field Microscopy Z-Stacks

Authors: Shervin Dehghani, Benjamin Busam, Nassir Navab, Ali Nasseri

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[343] arXiv:2206.04575 [pdf, other]: Title: Transformer based Urdu Handwritten Text Optical Character Reader

Authors: Mohammad Daniyal Shaiq, Musa Dildar Ahmed Cheema, Ali Kamal

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[344] arXiv:2206.04584 [pdf, other]: Title: Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer

Authors: Shaoyu Chen, Tianheng Cheng, Xinggang Wang, Wenming Meng, Qian Zhang, Wenyu Liu

Comments: Tech report. Work in progress

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[345] arXiv:2206.04590 [pdf, other]: Title: GASP: Gated Attention For Saliency Prediction

Authors: Fares Abawi, Tom Weber, Stefan Wermter

Comments: International Joint Conference on Artificial Intelligence (IJCAI-21)

Journal-ref: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (2021) 584-591

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[346] arXiv:2206.04636 [pdf, other]: Title: Spatial Entropy as an Inductive Bias for Vision Transformers

Authors: Elia Peruzzo, Enver Sangineto, Yahui Liu, Marco De Nadai, Wei Bi, Bruno Lepri, Nicu Sebe

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[347] arXiv:2206.04655 [pdf, other]: Title: Towards Layer-wise Image Vectorization

Authors: Xu Ma, Yuqian Zhou, Xingqian Xu, Bin Sun, Valerii Filev, Nikita Orlov, Yun Fu, Humphrey Shi

Comments: Accepted as Oral Presentation at CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[348] arXiv:2206.04656 [pdf, other]: Title: Simple Cues Lead to a Strong Multi-Object Tracker

Authors: Jenny Seidenschwarz, Guillem Brasó, Victor Castro Serrano, Ismail Elezi, Laura Leal-Taixé

Comments: Accepted to CVPR2023!

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[349] arXiv:2206.04662 [pdf, other]: Title: DiSparse: Disentangled Sparsification for Multitask Model Compression

Authors: Xinglong Sun, Ali Hassani, Zhangyang Wang, Gao Huang, Humphrey Shi

Comments: Accepted at CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[350] arXiv:2206.04664 [pdf, other]: Title: On Data Scaling in Masked Image Modeling

Authors: Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Yixuan Wei, Qi Dai, Han Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[351] arXiv:2206.04665 [pdf, other]: Title: AGConv: Adaptive Graph Convolution on 3D Point Clouds

Authors: Mingqiang Wei, Zeyong Wei, Haoran Zhou, Fei Hu, Huajian Si, Zhilei Chen, Zhe Zhu, Jingbo Qiu, Xuefeng Yan, Yanwen Guo, Jun Wang, Jing Qin

Comments: arXiv admin note: substantial text overlap with arXiv:2108.08035

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[352] arXiv:2206.04667 [pdf, other]: Title: Extreme Masking for Learning Instance and Distributed Visual Representations

Authors: Zhirong Wu, Zihang Lai, Xiao Sun, Stephen Lin

Comments: Accepted in TMLR

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353] arXiv:2206.04668 [pdf, other]: Title: GateHUB: Gated History Unit with Background Suppression for Online Action Detection

Authors: Junwen Chen, Gaurav Mittal, Ye Yu, Yu Kong, Mei Chen

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[354] arXiv:2206.04669 [pdf, other]: Title: Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields

Authors: Mingtong Zhang, Shuhong Zheng, Zhipeng Bao, Martial Hebert, Yu-Xiong Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[355] arXiv:2206.04670 [pdf, other]: Title: PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies

Authors: Guocheng Qian, Yuchen Li, Houwen Peng, Jinjie Mai, Hasan Abed Al Kader Hammoud, Mohamed Elhoseiny, Bernard Ghanem

Comments: Accepted by NeurIPS'22. Code and models are available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[356] arXiv:2206.04671 [pdf, other]: Title: Open Challenges in Deep Stereo: the Booster Dataset

Authors: Pierluigi Zama Ramirez, Fabio Tosi, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

Comments: CVPR 2022, New Orleans. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[357] arXiv:2206.04673 [pdf, other]: Title: Neural Prompt Search

Authors: Yuanhan Zhang, Kaiyang Zhou, Ziwei Liu

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[358] arXiv:2206.04674 [pdf, other]: Title: Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs

Authors: Jinguo Zhu, Xizhou Zhu, Wenhai Wang, Xiaohua Wang, Hongsheng Li, Xiaogang Wang, Jifeng Dai

Comments: Code shall be released at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[359] arXiv:2206.04783 [pdf, other]: Title: ReFace: Real-time Adversarial Attacks on Face Recognition Systems

Authors: Shehzeen Hussain, Todd Huster, Chris Mesterharm, Paarth Neekhara, Kevin An, Malhar Jere, Harshvardhan Sikka, Farinaz Koushanfar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[360] arXiv:2206.04785 [pdf, other]: Title: Building Spatio-temporal Transformers for Egocentric 3D Pose Estimation

Authors: Jinman Park, Kimathi Kaai, Saad Hossain, Norikatsu Sumi, Sirisha Rambhatla, Paul Fieguth

Comments: 4 pages, Extended abstract, Joint International Workshop on Egocentric Perception, Interaction and Computing (EPIC) and Ego4D, IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[361] arXiv:2206.04790 [pdf, other]: Title: Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition

Authors: Shreyank N Gowda, Marcus Rohrbach, Frank Keller, Laura Sevilla-Lara

Comments: Accepted to ECCV-2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[362] arXiv:2206.04797 [pdf, other]: Title: Memory-efficient model-based deep learning with convergence and robustness guarantees

Authors: Aniket Pramanik, M. Bridget Zimmerman, Mathews Jacob

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[363] arXiv:2206.04831 [pdf, other]: Title: R4D: Utilizing Reference Objects for Long-Range Distance Estimation

Authors: Yingwei Li, Tiffany Chen, Maya Kabkab, Ruichi Yu, Longlong Jing, Yurong You, Hang Zhao

Comments: ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[364] arXiv:2206.04846 [pdf, other]: Title: Masked Autoencoders are Robust Data Augmentors

Authors: Haohang Xu, Shuangrui Ding, Xiaopeng Zhang, Hongkai Xiong, Qi Tian

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[365] arXiv:2206.04854 [pdf, other]: Title: Heterogeneous Face Recognition via Face Synthesis with Identity-Attribute Disentanglement

Authors: Ziming Yang, Jian Liang, Chaoyou Fu, Mandi Luo, Xiao-Yu Zhang

Comments: Accepted for publication in IEEE Transactions on Information Forensics and Security (TIFS)

Journal-ref: IEEE Transactions on Information Forensics and Security, vol. 17, pp. 1344-1358, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[366] arXiv:2206.04863 [pdf, other]: Title: Symbolic image detection using scene and knowledge graphs

Authors: Nasrin Kalanat, Adriana Kovashka

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[367] arXiv:2206.04867 [pdf, other]: Title: The Gender Gap in Face Recognition Accuracy Is a Hairy Problem

Authors: Aman Bhatta, Vítor Albiero, Kevin W. Bowyer, Michael C. King

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[368] arXiv:2206.04874 [pdf, ps, other]: Title: The 1st Data Science for Pavements Challenge

Authors: Ashkan Behzadian, Tanner Wambui Muturi, Tianjie Zhang, Hongmin Kim, Amanda Mullins, Yang Lu, Neema Jasika Owor, Yaw Adu-Gyamfi, William Buttlar, Majidifard Hamed, Armstrong Aboah, David Mensching, Spragg Robert, Matthew Corrigan, Jack Youtchef, Dave Eshan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[369] arXiv:2206.04879 [pdf, other]: Title: Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion

Authors: Liang Liao, Wenyi Chen, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh

Comments: IEEE Transactions on Image Processing 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[370] arXiv:2206.04901 [pdf, other]: Title: NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors

Authors: Hao-Kang Liu, I-Chao Shen, Bing-Yu Chen

Comments: Hao-Kang Liu and I-Chao Shen contributed equally to the paper. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[371] arXiv:2206.04906 [pdf, other]: Title: Out of Sight, Out of Mind: A Source-View-Wise Feature Aggregation for Multi-View Image-Based Rendering

Authors: Geonho Cha, Chaehun Shin, Sungroh Yoon, Dongyoon Wee

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[372] arXiv:2206.04916 [pdf, other]: Title: PatchComplete: Learning Multi-Resolution Patch Priors for 3D Shape Completion on Unseen Categories

Authors: Yuchen Rao, Yinyu Nie, Angela Dai

Comments: Video link: this https URL ; Project page: this https URL ; Accepted to NeurIPS'22

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[373] arXiv:2206.04927 [pdf, other]: Title: Ego2HandsPose: A Dataset for Egocentric Two-hand 3D Global Pose Estimation

Authors: Fanqing Lin, Tony Martinez

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[374] arXiv:2206.04942 [pdf, other]: Title: Neural Template: Topology-aware Reconstruction and Disentangled Generation of 3D Meshes

Authors: Ka-Hei Hui, Ruihui Li, Jingyu Hu, Chi-Wing Fu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[375] arXiv:2206.04949 [pdf, other]: Title: Deep Multi-View Semi-Supervised Clustering with Sample Pairwise Constraints

Authors: Rui Chen, Yongqiang Tang, Wensheng Zhang, Wenlong Feng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[376] arXiv:2206.04958 [pdf, other]: Title: Self-Supervised Deep Subspace Clustering with Entropy-norm

Authors: Guangyi Zhao, Simin Kou, Xuesong Yin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[377] arXiv:2206.04975 [pdf, other]: Title: NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression Recognition

Authors: Hanting Li, Mingzhe Sui, Zhaoqing Zhu, Feng Zhao

Comments: 10 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[378] arXiv:2206.04979 [pdf, ps, other]: Title: Convolutional layers are equivariant to discrete shifts but not continuous translations

Authors: Nick McGreivy, Ammar Hakim

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[379] arXiv:2206.04981 [pdf, other]: Title: Positional Label for Self-Supervised Vision Transformer

Authors: Zhemin Zhang, Xun Gong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[380] arXiv:2206.05028 [pdf, other]: Title: Spatial Cross-Attention Improves Self-Supervised Visual Representation Learning

Authors: Mehdi Seyfi, Amin Banitalebi-Dehkordi, Yong Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[381] arXiv:2206.05039 [pdf, other]: Title: Image Generation with Multimodal Priors using Denoising Diffusion Probabilistic Models

Authors: Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M Patel

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[382] arXiv:2206.05099 [pdf, other]: Title: SimVP: Simpler yet Better Video Prediction

Authors: Zhangyang Gao, Cheng Tan, Lirong Wu, Stan Z. Li

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[383] arXiv:2206.05102 [pdf, other]: Title: Saccade Mechanisms for Image Classification, Object Detection and Tracking

Authors: Saurabh Farkya, Zachary Daniels, Aswin Nadamuni Raghavan, David Zhang, Michael Piacentino

Comments: 4 Pages, 6 figures, will be presented at CVPR2022-NeuroVision workshop as a Lightning talk

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[384] arXiv:2206.05127 [pdf, other]: Title: Globally-Optimal Contrast Maximisation for Event Cameras

Authors: Xin Peng, Ling Gao, Yifu Wang, Laurent Kneip

Comments: arXiv admin note: substantial text overlap with arXiv:2203.03914

Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[385] arXiv:2206.05128 [pdf, ps, other]: Title: Real-time Hyper-Dimensional Reconfiguration at the Edge using Hardware Accelerators

Authors: Indhumathi Kandaswamy, Saurabh Farkya, Zachary Daniels, Gooitzen van der Wal, Aswin Raghavan, Yuzheng Zhang, Jun Hu, Michael Lomnitz, Michael Isnardi, David Zhang, Michael Piacentino

Comments: 9 pages, 15 figures. Will be presented in Embedded Vision Workshop at CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR)
[386] arXiv:2206.05149 [pdf, other]: Title: Referring Image Matting

Authors: Jizhizi Li, Jing Zhang, Dacheng Tao

Comments: Accepted to CVPR2023. The dataset, code and models are available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[387] arXiv:2206.05158 [pdf, other]: Title: MEAT: Maneuver Extraction from Agent Trajectories

Authors: Julian Schmidt, Julian Jordan, David Raba, Tobias Welz, Klaus Dietmayer

Comments: Accepted at IEEE Intelligent Vehicles Symposium (IV) 2022 2nd Workshop on Autonomy@Scale

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[388] arXiv:2206.05159 [pdf, ps, other]: Title: An Image Processing Pipeline for Camera Trap Time-Lapse Recordings

Authors: Michael L. Hilton, Mark T. Yamane, Leah M. Knezevich

Comments: 5 pages, 2 figures, presented at the CV4Animals workshop of CVIP2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[389] arXiv:2206.05184 [pdf, other]: Title: SERE: Exploring Feature Self-relation for Self-supervised Transformer

Authors: Zhong-Yu Li, Shanghua Gao, Ming-Ming Cheng

Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

Journal-ref: 10.1109/TPAMI.2023.3309979

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[390] arXiv:2206.05194 [pdf, other]: Title: Learning the Space of Deep Models

Authors: Gianluca Berardi, Luca De Luigi, Samuele Salti, Luigi Di Stefano

Comments: Accepted at ICPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[391] arXiv:2206.05225 [pdf, other]: Title: ClamNet: Using contrastive learning with variable depth Unets for medical image segmentation

Authors: Samayan Bhattacharya, Sk Shahnawaz, Avigyan Bhattacharya

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392] arXiv:2206.05252 [pdf, other]: Title: Lost in Transmission: On the Impact of Networking Corruptions on Video Machine Learning Models

Authors: Trenton Chang, Daniel Y. Fu

Comments: 12 pages, 12 figures (with supplemental: 34 pages)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[393] arXiv:2206.05253 [pdf, other]: Title: Rethinking Spatial Invariance of Convolutional Networks for Object Counting

Authors: Zhi-Qi Cheng, Qi Dai, Hong Li, JingKuan Song, Xiao Wu, Alexander G. Hauptmann

Comments: Accepted to CVPR 2022, Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP)
[394] arXiv:2206.05257 [pdf, other]: Title: Explaining Image Classifiers Using Contrastive Counterfactuals in Generative Latent Spaces

Authors: Kamran Alipour, Aditya Lahiri, Ehsan Adeli, Babak Salimi, Michael Pazzani

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[395] arXiv:2206.05259 [pdf, other]: Title: Is Self-Supervised Learning More Robust Than Supervised Learning?

Authors: Yuanyi Zhong, Haoran Tang, Junkun Chen, Jian Peng, Yu-Xiong Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[396] arXiv:2206.05260 [pdf, other]: Title: Balanced Product of Calibrated Experts for Long-Tailed Recognition

Authors: Emanuel Sanchez Aimar, Arvi Jonnarth, Michael Felsberg, Marco Kuhlmann

Comments: Accepted at CVPR 2023, 19 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[397] arXiv:2206.05275 [pdf, other]: Title: Spatial-temporal Concept based Explanation of 3D ConvNets

Authors: Ying Ji, Yu Wang, Kensaku Mori, Jien Kato

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[398] arXiv:2206.05281 [pdf, other]: Title: Less Is More: Linear Layers on CLIP Features as Powerful VizWiz Model

Authors: Fabian Deuser, Konrad Habel, Philipp J. Rösch, Norbert Oswald

Comments: VizWiz Grand Challenge: Describing Images and Videos Taken by Blind People (CVPR Workshop 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[399] arXiv:2206.05282 [pdf, other]: Title: Learning to Estimate Shapley Values with Vision Transformers

Authors: Ian Covert, Chanwoo Kim, Su-In Lee

Comments: ICLR 2023 camera-ready

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[400] arXiv:2206.05291 [pdf, other]: Title: ProActive: Self-Attentive Temporal Point Process Flows for Activity Sequences

Authors: Vinayak Gupta, Srikanta Bedathur

Comments: KDD 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[401] arXiv:2206.05309 [pdf, ps, other]: Title: EigenFairing: 3D Model Fairing using Image Coherence

Authors: Pragyana Mishra, Omead Amidi, Takeo Kanade

Comments: British Machine Vision Conference, BMVC 2004, Kingston, UK, September 7-9, 2004

Journal-ref: Proceedings of the British Machine Vision Conference, pages 1-10, BMVA Press, September 2004

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[402] arXiv:2206.05319 [pdf, other]: Title: Object Instance Identification in Dynamic Environments

Authors: Takuma Yagi, Md Tasnimul Hasan, Yoichi Sato

Comments: Joint 1st Ego4D and 10th EPIC Workshop (EPIC@CVPR2022) Extended Abstract

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[403] arXiv:2206.05375 [pdf, other]: Title: Generalizable Neural Radiance Fields for Novel View Synthesis with Transformer

Authors: Dan Wang, Xinrui Cui, Septimiu Salcudean, Z. Jane Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[404] arXiv:2206.05377 [pdf, other]: Title: Fast building segmentation from satellite imagery and few local labels

Authors: Caleb Robinson, Anthony Ortiz, Hogeun Park, Nancy Lozano Gracia, Jon Kher Kaw, Tina Sederholm, Rahul Dodhia, Juan M. Lavista Ferres

Comments: Accepted at EarthVision 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[405] arXiv:2206.05379 [pdf, other]: Title: A Benchmark for Compositional Visual Reasoning

Authors: Aimen Zerroug, Mohit Vaishnav, Julien Colin, Sebastian Musslick, Thomas Serre

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[406] arXiv:2206.05390 [pdf, other]: Title: Transformer-based Self-Supervised Fish Segmentation in Underwater Videos

Authors: Alzayat Saleh, Marcus Sheaves, Dean Jerry, Mostafa Rahimi Azghadi

Comments: 11 pages, 6 figures. Submitted to the journal, International Journal of Intelligent Systems

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[407] arXiv:2206.05394 [pdf, other]: Title: Applications of Deep Learning in Fish Habitat Monitoring: A Tutorial and Survey

Authors: Alzayat Saleh, Marcus Sheaves, Dean Jerry, Mostafa Rahimi Azghadi

Comments: 26 pages, 7 figures. Submitted to the journal, Expert Systems With Applications

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[408] arXiv:2206.05398 [pdf, other]: Title: E2PN: Efficient SE(3)-Equivariant Point Network

Authors: Minghan Zhu, Maani Ghaffari, William A. Clark, Huei Peng

Comments: CVPR 2023, 16 pages. See this https URL for code

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[409] arXiv:2206.05420 [pdf, other]: Title: VAC2: Visual Analysis of Combined Causality in Event Sequences

Authors: Sujia Zhu, Yue Shen, Zihao Zhu, Wang Xia, Baofeng Chang, Ronghua Liang, Guodao Sun

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[410] arXiv:2206.05422 [pdf, other]: Title: Access Control of Semantic Segmentation Models Using Encrypted Feature Maps

Authors: Hiroki Ito, AprilPyone MaungMaung, Sayaka Shiota, Hitoshi Kiya

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[411] arXiv:2206.05424 [pdf, other]: Title: Precise Affordance Annotation for Egocentric Action Video Datasets

Authors: Zecheng Yu, Yifei Huang, Ryosuke Furuta, Takuma Yagi, Yusuke Goutsu, Yoichi Sato

Comments: Technical report for CVPR 2022 EPIC-Ego4D Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[412] arXiv:2206.05431 [pdf, other]: Title: Learned reconstruction methods with convergence guarantees

Authors: Subhadip Mukherjee, Andreas Hauptmann, Ozan Öktem, Marcelo Pereyra, Carola-Bibiane Schönlieb

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[413] arXiv:2206.05432 [pdf, ps, other]: Title: Luminance-Guided Chrominance Image Enhancement for HEVC Intra Coding

Authors: Hewei Liu, Renwei Yang, Shuyuan Zhu, Xing Wen, Bing Zeng

Comments: ISCAS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[414] arXiv:2206.05488 [pdf, ps, other]: Title: Kaggle Kinship Recognition Challenge: Introduction of Convolution-Free Model to boost conventional

Authors: Mingchuan Tian, Guangway Teng, Yipeng Bao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[415] arXiv:2206.05496 [pdf, other]: Title: An Evaluation of OCR on Egocentric Data

Authors: Valentin Popescu, Dima Damen, Toby Perrett

Comments: Extended Abstract, EPIC workshop at CVPR 22

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[416] arXiv:2206.05498 [pdf, other]: Title: A Review of Causality for Learning Algorithms in Medical Image Analysis

Authors: Athanasios Vlontzos, Daniel Rueckert, Bernhard Kainz

Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) this https URL; Paper ID: 2022:028

Journal-ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); General Literature (cs.GL)
[417] arXiv:2206.05514 [pdf, other]: Title: Toward Real-world Single Image Deraining: A New Benchmark and Beyond

Authors: Wei Li, Qiming Zhang, Jing Zhang, Zhen Huang, Xinmei Tian, Dacheng Tao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[418] arXiv:2206.05520 [pdf, other]: Title: A Two-stage Method for Non-extreme Value Salt-and-Pepper Noise Removal

Authors: Renwei Yang, YiKe Liu, Bing Zeng

Comments: UESTC course project

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[419] arXiv:2206.05539 [pdf, other]: Title: A Simplified Un-Supervised Learning Based Approach for Ink Mismatch Detection in Handwritten Hyper-Spectral Document Images

Authors: Muhammad Farhan Humayun, Hassan Waseem Malik, Ahmed Ahsan Alvi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[420] arXiv:2206.05542 [pdf, other]: Title: Surround-View Cameras based Holistic Visual Perception for Automated Driving

Authors: Varun Ravi Kumar

Comments: Doctoral thesis

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[421] arXiv:2206.05617 [pdf, other]: Title: Federated Learning with Research Prototypes for Multi-Center MRI-based Detection of Prostate Cancer with Diverse Histopathology

Authors: Abhejit Rajagopal, Ekaterina Redekop, Anil Kemisetti, Rushi Kulkarni, Steven Raman, Kirti Magudia, Corey W. Arnold, Peder E. Z. Larson

Comments: under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Tissues and Organs (q-bio.TO)
[422] arXiv:2206.05619 [pdf, other]: Title: Deep Learning Models for Automated Classification of Dog Emotional States from Facial Expressions

Authors: Tali Boneh-Shitrit, Shir Amir, Annika Bremhorst, Daniel S. Mills, Stefanie Riemer, Dror Fried, Anna Zamansky

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[423] arXiv:2206.05641 [pdf, ps, other]: Title: An Unsupervised Deep-Learning Method for Bone Age Assessment

Authors: Hao Zhu, Wan-Jing Nie, Yue-Jie Hou, Qi-Meng Du, Si-Jing Li, Chi-Chun Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[424] arXiv:2206.05648 [pdf, other]: Title: Indirect-Instant Attention Optimization for Crowd Counting in Dense Scenes

Authors: Suyu Han, Guodong Wang, Donghua Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[425] arXiv:2206.05651 [pdf, other]: Title: STD-NET: Search of Image Steganalytic Deep-learning Architecture via Hierarchical Tensor Decomposition

Authors: Shunquan Tan, Qiushi Li, Laiyuan Li, Bin Li, Jiwu Huang

Comments: Submitted to IEEE T-DSC

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[426] arXiv:2206.05683 [pdf, other]: Title: APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking

Authors: Yuxiang Yang, Junjie Yang, Yufei Xu, Jing Zhang, Long Lan, Dacheng Tao

Comments: NeurIPS 2022 Datasets and Benchmarks track

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[427] arXiv:2206.05707 [pdf, other]: Title: DPCN++: Differentiable Phase Correlation Network for Versatile Pose Registration

Authors: Zexi Chen, Yiyi Liao, Haozhe Du, Haodong Zhang, Xuecheng Xu, Haojian Lu, Rong Xiong, Yue Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[428] arXiv:2206.05708 [pdf, other]: Title: Narrowing the Gap: Improved Detector Training with Noisy Location Annotations

Authors: Shaoru Wang, Jin Gao, Bing Li, Weiming Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[429] arXiv:2206.05712 [pdf, other]: Title: Graph-based Spatial Transformer with Memory Replay for Multi-future Pedestrian Trajectory Prediction

Authors: Lihuan Li, Maurice Pagnucco, Yang Song

Comments: This paper has been accepted by CVPR 2022. Reference: Li, L., Pagnucco, M. and Song, Y., 2022. Graph-Based Spatial Transformer With Memory Replay for Multi-Future Pedestrian Trajectory Prediction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 2231-2241)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430] arXiv:2206.05717 [pdf, other]: Title: Crowd Localization from Gaussian Mixture Scoped Knowledge and Scoped Teacher

Authors: Juncheng Wang, Junyu Gao, Yuan Yuan, Qi Wang

Comments: Accepted by IEEE TIP

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[431] arXiv:2206.05730 [pdf, other]: Title: Object Occlusion of Adding New Categories in Objection Detection

Authors: Boyang Deng, Meiyan Lin, Shoulun Long

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[432] arXiv:2206.05737 [pdf, other]: Title: SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse Views

Authors: Xiaoxiao Long, Cheng Lin, Peng Wang, Taku Komura, Wenping Wang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[433] arXiv:2206.05741 [pdf, other]: Title: Bootstrapping Multi-view Representations for Fake News Detection

Authors: Qichao Ying, Xiaoxiao Hu, Yangming Zhou, Zhenxing Qian, Dan Zeng, Shiming Ge

Comments: Authors are from Fudan University, China. Under Review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[434] arXiv:2206.05763 [pdf, other]: Title: SeATrans: Learning Segmentation-Assisted diagnosis model via Transformer

Authors: Junde Wu, Huihui Fang, Fangxin Shang, Dalu Yang, Zhaowei Wang, Jing Gao, Yehui Yang, Yanwu Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[435] arXiv:2206.05765 [pdf, other]: Title: A Semantic Consistency Feature Alignment Object Detection Model Based on Mixed-Class Distribution Metrics

Authors: Lijun Gou, Jinrong Yang, Hangcheng Yu, Pan Wang, Xiaoping Li, Chao Deng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[436] arXiv:2206.05810 [pdf, other]: Title: Analysis of Branch Specialization and its Application in Image Decomposition

Authors: Jonathan Brokman, Guy Gilboa

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[437] arXiv:2206.05833 [pdf, other]: Title: COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition

Authors: Mani Kumar Tellamekala, Shahin Amiriparian, Björn W. Schuller, Elisabeth André, Timo Giesbrecht, Michel Valstar

Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[438] arXiv:2206.05836 [pdf, other]: Title: GLIPv2: Unifying Localization and Vision-Language Understanding

Authors: Haotian Zhang, Pengchuan Zhang, Xiaowei Hu, Yen-Chun Chen, Liunian Harold Li, Xiyang Dai, Lijuan Wang, Lu Yuan, Jenq-Neng Hwang, Jianfeng Gao

Comments: NeurIPS 2022; updated with reviewers' comments addressed; Code is released at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[439] arXiv:2206.05837 [pdf, other]: Title: NeuralODF: Learning Omnidirectional Distance Fields for 3D Shape Representation

Authors: Trevor Houchens, Cheng-You Lu, Shivam Duggal, Rao Fu, Srinath Sridhar

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[440] arXiv:2206.05842 [pdf, ps, other]: Title: Efficiency Comparison of AI classification algorithms for Image Detection and Recognition in Real-time

Authors: Musarrat Saberin Nipun, Rejwan Bin Sulaiman, Amer Kareem

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[441] arXiv:2206.05844 [pdf, other]: Title: FisheyeEX: Polar Outpainting for Extending the FoV of Fisheye Lens

Authors: Kang Liao, Chunyu Lin, Yunchao Wei, Yao Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[442] arXiv:2206.05846 [pdf, other]: Title: InBiaseD: Inductive Bias Distillation to Improve Generalization and Robustness through Shape-awareness

Authors: Shruthi Gowda, Bahram Zonooz, Elahe Arani

Comments: Accepted at 1st Conference on Lifelong Learning Agents (CoLLAs 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[443] arXiv:2206.05853 [pdf, other]: Title: Modeling Generalized Specialist Approach To Train Quality Resilient Snapshot Ensemble

Authors: Ghalib Ahmed Tahir, Chu Kiong Loo, Zongying Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[444] arXiv:2206.05866 [pdf, other]: Title: TC-SfM: Robust Track-Community-Based Structure-from-Motion

Authors: Lei Wang, Linlin Ge, Shan Luo, Zihan Yan, Zhaopeng Cui, Jieqing Feng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[445] arXiv:2206.05896 [pdf, other]: Title: Improve Ranking Correlation of Super-net through Training Scheme from One-shot NAS to Few-shot NAS

Authors: Jiawei Liu, Kaiyu Zhang, Weitai Hu, Qing Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[446] arXiv:2206.05897 [pdf, other]: Title: $\texttt{GradICON}$: Approximate Diffeomorphisms via Gradient Inverse Consistency

Authors: Lin Tian, Hastings Greer, François-Xavier Vialard, Roland Kwitt, Raúl San José Estépar, Richard Jarrett Rushmore, Nikolaos Makris, Sylvain Bouix, Marc Niethammer

Comments: 29 pages, 16 figures, CVPR 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[447] arXiv:2206.05898 [pdf, other]: Title: Pixel to Binary Embedding Towards Robustness for CNNs

Authors: Ikki Kishida, Hideki Nakayama

Comments: Accepted to ICPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[448] arXiv:2206.05903 [pdf, other]: Title: Geometrically Guided Integrated Gradients

Authors: Md Mahfuzur Rahman, Noah Lewis, Sergey Plis

Comments: 19 pages, 23 figures, funding sources added

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[449] arXiv:2206.05912 [pdf, other]: Title: INDIGO: Intrinsic Multimodality for Domain Generalization

Authors: Puneet Mangla, Shivam Chandhok, Milan Aggarwal, Vineeth N Balasubramanian, Balaji Krishnamurthy

Comments: Under Submission

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[450] arXiv:2206.05927 [pdf, other]: Title: LinK3D: Linear Keypoints Representation for 3D LiDAR Point Cloud

Authors: Yunge Cui, Yinlong Zhang, Jiahua Dong, Haibo Sun, Xieyuanli Chen, Feng Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[451] arXiv:2206.05962 [pdf, other]: Title: PRO-TIP: Phantom for RObust automatic ultrasound calibration by TIP detection

Authors: Matteo Ronchetti, Julia Rackerseder, Maria Tirindelli, Mehrdad Salehi, Nassir Navab, Wolfgang Wein, Oliver Zettinig

Comments: This preprint was submitted to MICCAI 2022. The Version of Record of this contribution will be published in Springer LNCS

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[452] arXiv:2206.05963 [pdf, ps, other]: Title: ATDN vSLAM: An all-through Deep Learning-Based Solution for Visual Simultaneous Localization and Mapping

Authors: Mátyás Szántó, György R. Bogár, László Vajta

Comments: Published in Periodica Polytechnica Electrical Engineering, 11 pages

Journal-ref: Periodica Polytechnica Electrical Engineering and Computer Science, 66(3), pp. 236-247, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[453] arXiv:2206.05967 [pdf, other]: Title: GoToNet: Fast Monocular Scene Exposure and Exploration

Authors: Tom Avrech, Evgenii Zheltonozhskii, Chaim Baskin, Ehud Rivlin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[454] arXiv:2206.05970 [pdf, other]: Title: Hypernetwork-Based Adaptive Image Restoration

Authors: Shai Aharon, Gil Ben-Artzi

Comments: 5 pages, 5 Figures, ICASSP 2023

Journal-ref: ICASSP 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[455] arXiv:2206.05981 [pdf, other]: Title: Efficient Human-in-the-loop System for Guiding DNNs Attention

Authors: Yi He, Xi Yang, Chia-Ming Chang, Haoran Xie, Takeo Igarashi

Comments: 13 pages, 11 figures, proceeding of ACM IUI 2023, video this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[456] arXiv:2206.05982 [pdf, other]: Title: Learning Fashion Compatibility from In-the-wild Images

Authors: Additya Popli, Vijay Kumar, Sujit Jos, Saraansh Tandon

Comments: Accepted to ICPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[457] arXiv:2206.06014 [pdf, other]: Title: Exploring and Exploiting Hubness Priors for High-Quality GAN Latent Sampling

Authors: Yuanbang Liang, Jing Wu, Yu-Kun Lai, Yipeng Qin

Comments: Accepted at ICML 2022. Our code is available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[458] arXiv:2206.06023 [pdf, other]: Title: Virtual embeddings and self-consistency for self-supervised learning

Authors: Tariq Bdair, Hossam Abdelhamid, Nassir Navab, Shadi Albarqouni

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[459] arXiv:2206.06067 [pdf, other]: Title: Better Teacher Better Student: Dynamic Prior Knowledge for Knowledge Distillation

Authors: Zengyu Qiu, Xinzhu Ma, Kunlin Yang, Chunya Liu, Jun Hou, Shuai Yi, Wanli Ouyang

Comments: ICLR'23 accepted

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[460] arXiv:2206.06079 [pdf, other]: Title: OHM: GPU Based Occupancy Map Generation

Authors: Kazys Stepanas, Jason Williams, Emili Hernández, Fabio Ruetz, Thomas Hines

Comments: Under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[461] arXiv:2206.06100 [pdf, other]: Title: AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields

Authors: Takuhiro Kaneko

Comments: Accepted to CVPR 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[462] arXiv:2206.06103 [pdf, other]: Title: Learning Feature Disentanglement and Dynamic Fusion for Recaptured Image Forensic

Authors: Shuyu Miao, Lin Zheng, Hong Jin

Comments: Accepted by CVPR2022 workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[463] arXiv:2206.06119 [pdf, other]: Title: Satellite-based high-resolution maps of cocoa planted area for Côte d'Ivoire and Ghana

Authors: Nikolai Kalischek, Nico Lang, Cécile Renier, Rodrigo Caye Daudt, Thomas Addoah, William Thompson, Wilma J. Blaser-Hart, Rachael Garrett, Konrad Schindler, Jan D. Wegner

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[464] arXiv:2206.06120 [pdf, ps, other]: Title: Brain tumour segmentation with incomplete imaging data

Authors: James K Ruffle, Samia Mohinta, Robert J Gray, Harpreet Hyare, Parashkev Nachev

Comments: 26 pages, 8 figures, 4 supplementary tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Tissues and Organs (q-bio.TO)
[465] arXiv:2206.06122 [pdf, other]: Title: Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning

Authors: Yanpeng Sun, Qiang Chen, Xiangyu He, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jian Cheng, Zechao Li, Jingdong Wang

Comments: Accepted to NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[466] arXiv:2206.06168 [pdf, other]: Title: 2nd Place Solution for ICCV 2021 VIPriors Image Classification Challenge: An Attract-and-Repulse Learning Approach

Authors: Yilu Guo, Shicai Yang, Weijie Chen, Liang Ma, Di Xie, Shiliang Pu

Comments: 2nd Place Solution for ICCV 2021 VIPriors Image Classification Challenge

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[467] arXiv:2206.06177 [pdf, other]: Title: Transductive CLIP with Class-Conditional Contrastive Learning

Authors: Junchu Huang, Weijie Chen, Shicai Yang, Di Xie, Shiliang Pu, Yueting Zhuang

Comments: Published in IEEE ICASSP 2022

Journal-ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[468] arXiv:2206.06214 [pdf, other]: Title: Real-World Light Field Image Super-Resolution via Degradation Modulation

Authors: Yingqian Wang, Zhengyu Liang, Longguang Wang, Jungang Yang, Wei An, Yulan Guo

Comments: 15 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[469] arXiv:2206.06219 [pdf, other]: Title: Making Sense of Dependence: Efficient Black-box Explanations Using Dependence Measure

Authors: Paul Novello, Thomas Fel, David Vigouroux

Comments: Accepted to NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML); Other Statistics (stat.OT)
[470] arXiv:2206.06252 [pdf, other]: Title: Transformer Lesion Tracker

Authors: Wen Tang, Han Kang, Haoyue Zhang, Pengxin Yu, Corey W. Arnold, Rongguo Zhang

Comments: Accepted at MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[471] arXiv:2206.06258 [pdf, other]: Title: Featurized Query R-CNN

Authors: Wenqiang Zhang, Tianheng Cheng, Xinggang Wang, Shaoyu Chen, Qian Zhang, Wenyu Liu

Comments: Tech Report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[472] arXiv:2206.06289 [pdf, other]: Title: Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation

Authors: Yingwei Pan, Yehao Li, Yiheng Zhang, Qi Cai, Fuchen Long, Zhaofan Qiu, Ting Yao, Tao Mei

Comments: Accepted by ICLR 2022 Workshop on Generalizable Policy Learning in Physical World. Top-performing systems for both no interaction and no restriction tracks in SAPIEN ManiSkill Challenge 2021. The source code and model are publicly available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Robotics (cs.RO)
[473] arXiv:2206.06291 [pdf, other]: Title: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection

Authors: Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang-Wen Chen

Comments: CVPR 2022; Code is publicly available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[474] arXiv:2206.06292 [pdf, other]: Title: MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing

Authors: Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Tao Mei

Comments: CVPR 2022; Code is publicly available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[475] arXiv:2206.06293 [pdf, other]: Title: Learning Domain Adaptive Object Detection with Probabilistic Teacher

Authors: Meilin Chen, Weijie Chen, Shicai Yang, Jie Song, Xinchao Wang, Lei Zhang, Yunfeng Yan, Donglian Qi, Yueting Zhuang, Di Xie, Shiliang Pu

Comments: To appear in ICML 2022. Code is coming soon: this https URL

Journal-ref: International Conference on Machine Learning (ICML), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[476] arXiv:2206.06323 [pdf, other]: Title: Visual Transformer for Object Detection

Authors: Michael Yang

Comments: In preparation for short paper of conferences. I am using the name Michael Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[477] arXiv:2206.06340 [pdf, other]: Title: SNeS: Learning Probably Symmetric Neural Surfaces from Incomplete Data

Authors: Eldar Insafutdinov, Dylan Campbell, João F. Henriques, Andrea Vedaldi

Comments: First two authors contributed equally

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[478] arXiv:2206.06346 [pdf, ps, other]: Title: Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens

Authors: Elad Ben-Avraham, Roei Herzig, Karttikeya Mangalam, Amir Bar, Anna Rohrbach, Leonid Karlinsky, Trevor Darrell, Amir Globerson

Comments: Tech report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[479] arXiv:2206.06359 [pdf, other]: Title: EnergyMatch: Energy-based Pseudo-Labeling for Semi-Supervised Learning

Authors: Zhuoran Yu, Yin Li, Yong Jae Lee

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[480] arXiv:2206.06360 [pdf, other]: Title: ARF: Artistic Radiance Fields

Authors: Kai Zhang, Nick Kolkin, Sai Bi, Fujun Luan, Zexiang Xu, Eli Shechtman, Noah Snavely

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[481] arXiv:2206.06363 [pdf, other]: Title: Discovering Object Masks with Transformers for Unsupervised Semantic Segmentation

Authors: Wouter Van Gansbeke, Simon Vandenhende, Luc Van Gool

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[482] arXiv:2206.06404 [pdf, other]: Title: Compositional Mixture Representations for Vision and Text

Authors: Stephan Alaniz, Marco Federici, Zeynep Akata

Comments: Workshop on Learning with Limited Labelled Data for Image and Video Understanding (L3D-IVU), CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[483] arXiv:2206.06420 [pdf, other]: Title: GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation

Authors: Wenhao Li, Hong Liu, Tianyu Guo, Runwei Ding, Hao Tang

Comments: Open Sourced

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[484] arXiv:2206.06427 [pdf, other]: Title: A Multi-purpose Realistic Haze Benchmark with Quantifiable Haze Levels and Ground Truth

Authors: Priya Narayanan, Xin Hu, Zhenyu Wu, Matthew D Thielke, John G Rogers, Andre V Harrison, John A D'Agostino, James D Brown, Long P Quang, James R Uplinger, Heesung Kwon, Zhangyang Wang

Comments: This paper has been ACCEPTED for publication as a REGULAR paper in the IEEE Transactions on Image Processing (TIP)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[485] arXiv:2206.06430 [pdf, ps, other]: Title: A Training Method For VideoPose3D With Ideology of Action Recognition

Authors: Hao Bai

Comments: Published by IEEE at the CONF-SPML conference

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[486] arXiv:2206.06435 [pdf, ps, other]: Title: ICP Algorithm: Theory, Practice And Its SLAM-oriented Taxonomy

Authors: Hao Bai

Comments: Accepted by CONF-CDS'22

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[487] arXiv:2206.06461 [pdf, other]: Title: Self-Supervised Representation Learning With MUlti-Segmental Informational Coding (MUSIC)

Authors: Chuang Niu, Ge Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[488] arXiv:2206.06466 [pdf, other]: Title: Revisiting the Shape-Bias of Deep Learning for Dermoscopic Skin Lesion Classification

Authors: Adriano Lucieri, Fabian Schmeisser, Christoph Peter Balada, Shoaib Ahmed Siddiqui, Andreas Dengel, Sheraz Ahmed

Comments: Submitted preprint accepted for MIUA 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[489] arXiv:2206.06481 [pdf, other]: Title: RigNeRF: Fully Controllable Neural 3D Portraits

Authors: ShahRukh Athar, Zexiang Xu, Kalyan Sunkavalli, Eli Shechtman, Zhixin Shu

Comments: The project page can be found here: this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[490] arXiv:2206.06484 [pdf, other]: Title: On Image Segmentation With Noisy Labels: Characterization and Volume Properties of the Optimal Solutions to Accuracy and Dice

Authors: Marcus Nordström, Henrik Hult, Jonas Söderberg, Fredrik Löfman

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[491] arXiv:2206.06487 [pdf, other]: Title: The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation

Authors: Zihui Xue, Zhengqi Gao, Sucheng Ren, Hang Zhao

Comments: Accepted by ICLR 2023 (top-5%). The first three authors contribute equally. Project website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[492] arXiv:2206.06488 [pdf, other]: Title: Multimodal Learning with Transformers: A Survey

Authors: Peng Xu, Xiatian Zhu, David A. Clifton

Comments: This paper is accepted by IEEE TPAMI

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[493] arXiv:2206.06490 [pdf, other]: Title: Learning Task-Independent Game State Representations from Unlabeled Images

Authors: Chintan Trivedi, Konstantinos Makantasis, Antonios Liapis, Georgios N. Yannakakis

Comments: Conference on Games (CoG) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[494] arXiv:2206.06506 [pdf, other]: Title: Spiking Neural Networks for Frame-based and Event-based Single Object Localization

Authors: Sami Barchid, José Mennesson, Jason Eshraghian, Chaabane Djéraba, Mohammed Bennamoun

Comments: 21 pages, 12 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[495] arXiv:2206.06510 [pdf, other]: Title: Generalizable Method for Face Anti-Spoofing with Semi-Supervised Learning

Authors: Nikolay Sergievskiy, Roman Vlasov, Roman Trusov

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[496] arXiv:2206.06518 [pdf, other]: Title: Estimating Pose from Pressure Data for Smart Beds with Deep Image-based Pose Estimators

Authors: Vandad Davoodnia, Saeed Ghorbani, Ali Etemad

Comments: The version of record of this article, first published in Applied Intelligence, is available online at Publisher's website this https URL. arXiv admin note: substantial text overlap with arXiv:1908.08919

Journal-ref: Applied Intelligence (2021): 1-15

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[497] arXiv:2206.06533 [pdf, other]: Title: 3D scene reconstruction from monocular spherical video with motion parallax

Authors: Kenji Tanaka

Comments: 13 pages, 18 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[498] arXiv:2206.06544 [pdf, ps, other]: Title: A Survey of Automated Data Augmentation Algorithms for Deep Learning-based Image Classification Tasks

Authors: Zihan Yang, Richard O. Sinnott, James Bailey, Qiuhong Ke

Comments: 68 pages, 9 figures. Submitted to Knowledge and Information Systems (KAIS)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[499] arXiv:2206.06607 [pdf, other]: Title: Plug-and-Play Pseudo Label Correction Network for Unsupervised Person Re-identification

Authors: Tianyi Yan, Kuan Zhu, Haiyun Guo, Guibo Zhu, Ming Tang, Jinqiao Wang

Comments: 19 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[500] arXiv:2206.06608 [pdf, other]: Title: Label Matching Semi-Supervised Object Detection

Authors: Binbin Chen, Weijie Chen, Shicai Yang, Yunyi Xuan, Jie Song, Di Xie, Shiliang Pu, Mingli Song, Yueting Zhuang

Comments: To appear in CVPR 2022. Code is coming soon: this https URL

Journal-ref: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[501] arXiv:2206.06619 [pdf, other]: Title: TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer

Authors: Jiajun Deng, Zhengyuan Yang, Daqing Liu, Tianlang Chen, Wengang Zhou, Yanyong Zhang, Houqiang Li, Wanli Ouyang

Comments: arXiv admin note: text overlap with arXiv:2104.08541

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[502] arXiv:2206.06620 [pdf, other]: Title: Slimmable Domain Adaptation

Authors: Rang Meng, Weijie Chen, Shicai Yang, Jie Song, Luojun Lin, Di Xie, Shiliang Pu, Xinchao Wang, Mingli Song, Yueting Zhuang

Comments: To appear in CVPR 2022. Code is coming soon: this https URL

Journal-ref: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[503] arXiv:2206.06637 [pdf, other]: Title: RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks

Authors: Shanghua Gao, Zhong-Yu Li, Qi Han, Ming-Ming Cheng, Liang Wang

Comments: Accepted by TPAMI. This paper is a journal extension of our CVPR 2021 paper (arXiv:2101.00910)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[504] arXiv:2206.06640 [pdf, other]: Title: Confidence Score for Source-Free Unsupervised Domain Adaptation

Authors: Jonghyun Lee, Dahuin Jung, Junho Yim, Sungroh Yoon

Comments: ICML 2022 camera ready

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[505] arXiv:2206.06665 [pdf, other]: Title: Online Easy Example Mining for Weakly-supervised Gland Segmentation from Histology Images

Authors: Yi Li, Yiduo Yu, Yiwen Zou, Tianqi Xiang, Xiaomeng Li

Comments: MICCAI 2022 Accepted

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[506] arXiv:2206.06694 [pdf, other]: Title: ISLES 2022: A multi-center magnetic resonance imaging stroke lesion segmentation dataset

Authors: Moritz Roman Hernandez Petzsche, Ezequiel de la Rosa, Uta Hanning, Roland Wiest, Waldo Enrique Valenzuela Pinilla, Mauricio Reyes, Maria Ines Meyer, Sook-Lei Liew, Florian Kofler, Ivan Ezhov, David Robben, Alexander Hutton, Tassilo Friedrich, Teresa Zarth, Johannes Bürkle, The Anh Baran, Bjoern Menze, Gabriel Broocks, Lukas Meyer, Claus Zimmer, Tobias Boeckh-Behrens, Maria Berndt, Benno Ikenberg, Benedikt Wiestler, Jan S. Kirschke

Comments: 12 pages, 2 figures

Journal-ref: Scientific data 9.1 (2022): 762

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[507] arXiv:2206.06712 [pdf, other]: Title: Visual Radial Basis Q-Network

Authors: Julien Hautot, Céline Teuliere, Nourddine Azzaoui

Comments: This paper has been accepted for publication at the 3rd International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2022. © Springer Nature 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[508] arXiv:2206.06714 [pdf, other]: Title: Interpretable Gait Recognition by Granger Causality

Authors: Michal Balazia, Katerina Hlavackova-Schindler, Petr Sojka, Claudia Plant

Comments: Preprint. Full paper accepted at the IEEE/IAPR International Conference on Pattern Recognition (ICPR), Montreal, Canada, August 2022. 7 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[509] arXiv:2206.06715 [pdf, other]: Title: Semi-signed prioritized neural fitting for surface reconstruction from unoriented point clouds

Authors: Runsong Zhu, Di Kang, Ka-Hei Hui, Yue Qian, Xuefei Zhe, Zhen Dong, Linchao Bao, Pheng-Ann Heng, Chi-Wing Fu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[510] arXiv:2206.06731 [pdf, ps, other]: Title: Learning Dense Features for Point Cloud Registration Using a Graph Attention Network

Authors: Quoc Vinh Lai Dang, Sarvar Hussain Nengroo, Hojun Jin

Comments: 15 pages, 3 figures

Journal-ref: Applied Sciences 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[511] arXiv:2206.06741 [pdf, other]: Title: Recurrent Transformer Variational Autoencoders for Multi-Action Motion Synthesis

Authors: Rania Briq, Chuhang Zou, Leonid Pishchulin, Chris Broaddus, Juergen Gall

Comments: accepted at Transformers for Vision workshop at CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[512] arXiv:2206.06743 [pdf, other]: Title: Weakly-Supervised Crack Detection

Authors: Yuki Inoue, Hiroto Nagayoshi

Comments: Submitted to IEEE Transactions on Intelligent Transportation Systems

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[513] arXiv:2206.06761 [pdf, other]: Title: Exploring Adversarial Attacks and Defenses in Vision Transformers trained with DINO

Authors: Javier Rando, Nasib Naimi, Thomas Baumann, Max Mathys

Comments: ICML 2022 Workshop paper accepted at AdvML Frontiers

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[514] arXiv:2206.06801 [pdf, other]: Title: Peripheral Vision Transformer

Authors: Juhong Min, Yucheng Zhao, Chong Luo, Minsu Cho

Comments: Accepted to NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[515] arXiv:2206.06803 [pdf, other]: Title: Asymmetric Dual-Decoder U-Net for Joint Rain and Haze Removal

Authors: Yuan Feng, Yaojun Hu, Pengfei Fang, Yanhong Yang, Sheng Liu, Shengyong Chen

Comments: 12 pages, 35 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[516] arXiv:2206.06829 [pdf, other]: Title: Efficient Decoder-free Object Detection with Transformers

Authors: Peixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua Shen

Comments: Update metadata, 10 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[517] arXiv:2206.06922 [pdf, other]: Title: Object Scene Representation Transformer

Authors: Mehdi S. M. Sajjadi, Daniel Duckworth, Aravindh Mahendran, Sjoerd van Steenkiste, Filip Pavetić, Mario Lučić, Leonidas J. Guibas, Klaus Greff, Thomas Kipf

Comments: Accepted at NeurIPS '22. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[518] arXiv:2206.06923 [pdf, ps, other]: Title: A Multi-task Framework for Infrared Small Target Detection and Segmentation

Authors: Yuhang Chen, Liyuan Li, Xin Liu, Xiaofeng Su, Fansheng Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[519] arXiv:2206.06930 [pdf, other]: Title: Comprehending and Ordering Semantics for Image Captioning

Authors: Yehao Li, Yingwei Pan, Ting Yao, Tao Mei

Comments: CVPR 2022; Code is publicly available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[520] arXiv:2206.06931 [pdf, other]: Title: Stand-Alone Inter-Frame Attention in Video Models

Authors: Fuchen Long, Zhaofan Qiu, Yingwei Pan, Ting Yao, Jiebo Luo, Tao Mei

Comments: CVPR 2022; Code is publicly available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[521] arXiv:2206.06948 [pdf, other]: Title: Monitoring Urban Forests from Auto-Generated Segmentation Maps

Authors: Conrad M Albrecht, Chenying Liu, Yi Wang, Levente Klein, Xiao Xiang Zhu

Comments: accepted for presentation and publication at IGARSS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[522] arXiv:2206.06959 [pdf, other]: Title: AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled Data

Authors: Amin Banitalebi-Dehkordi, Pratik Gujjar, Yong Zhang

Comments: CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[523] arXiv:2206.07011 [pdf, other]: Title: Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention

Authors: Quanzeng You, Jiang Wang, Peng Chu, Andre Abrantes, Zicheng Liu

Comments: 11 pages, 5 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[524] arXiv:2206.07018 [pdf, other]: Title: Turning a Curse into a Blessing: Enabling In-Distribution-Data-Free Backdoor Removal via Stabilized Model Inversion

Authors: Si Chen, Yi Zeng, Jiachen T. Wang, Won Park, Xun Chen, Lingjuan Lyu, Zhuoqing Mao, Ruoxi Jia

Comments: Because of an equation and author informational error, this paper has been withdrawn by the submitter

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[525] arXiv:2206.07028 [pdf, other]: Title: Learning 3D Object Shape and Layout without 3D Supervision

Authors: Georgia Gkioxari, Nikhila Ravi, Justin Johnson

Comments: CVPR 2022, project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[526] arXiv:2206.07036 [pdf, other]: Title: Accurate 3D Body Shape Regression using Metric and Semantic Attributes

Authors: Vasileios Choutas, Lea Muller, Chun-Hao P. Huang, Siyu Tang, Dimitrios Tzionas, Michael J. Black

Comments: First two authors contributed equally

Journal-ref: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[527] arXiv:2206.07038 [pdf, other]: Title: AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos

Authors: Yanze Wu, Xintao Wang, Gen Li, Ying Shan

Comments: NeurIPS 2022. Codes and models are available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[528] arXiv:2206.07045 [pdf, other]: Title: ReCo: Retrieve and Co-segment for Zero-shot Transfer

Authors: Gyungin Shin, Weidi Xie, Samuel Albanie

Comments: Tech report. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[529] arXiv:2206.07047 [pdf, other]: Title: RGB-Multispectral Matching: Dataset, Learning Methodology, Evaluation

Authors: Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

Comments: CVPR 2022, New Orleans. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[530] arXiv:2206.07117 [pdf, other]: Title: TriHorn-Net: A Model for Accurate Depth-Based 3D Hand Pose Estimation

Authors: Mohammad Rezaei, Razieh Rastgoo, Vassilis Athitsos

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[531] arXiv:2206.07125 [pdf, other]: Title: Self-Supervised Pretraining for Differentially Private Learning

Authors: Arash Asadian, Evan Weidner, Lei Jiang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[532] arXiv:2206.07160 [pdf, other]: Title: LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling

Authors: Linjie Li, Zhe Gan, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Ce Liu, Lijuan Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[533] arXiv:2206.07162 [pdf, other]: Title: Category-Agnostic 6D Pose Estimation with Conditional Neural Processes

Authors: Yumeng Li, Ning Gao, Hanna Ziesche, Gerhard Neumann

Comments: Accepted at CVPR2022 workshop: Women in Computer Vision (WiCV)

Journal-ref: CVPR2022 workshop: Women in Computer Vision (WiCV)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[534] arXiv:2206.07163 [pdf, other]: Title: DeepRecon: Joint 2D Cardiac Segmentation and 3D Volume Reconstruction via A Structure-Specific Generative Method

Authors: Qi Chang, Zhennan Yan, Mu Zhou, Di Liu, Khalid Sawalha, Meng Ye, Qilong Zhangli, Mikael Kanski, Subhi Al Aref, Leon Axel, Dimitris Metaxas

Comments: MICCAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[535] arXiv:2206.07171 [pdf, other]: Title: Segmentation in large-scale cellular electron microscopy with deep learning: A literature survey

Authors: Anusha Aswath, Ahmad Alsahaf, Ben N. G. Giepmans, George Azzopardi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[536] arXiv:2206.07198 [pdf, other]: Title: Surgical Phase Recognition in Laparoscopic Cholecystectomy

Authors: Yunfan Li, Vinayak Shenoy, Prateek Prasanna, I.V. Ramakrishnan, Haibin Ling, Himanshu Gupta

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[537] arXiv:2206.07207 [pdf, other]: Title: Beyond Grounding: Extracting Fine-Grained Event Hierarchies Across Modalities

Authors: Hammad A. Ayyubi, Christopher Thomas, Lovish Chum, Rahul Lokesh, Long Chen, Yulei Niu, Xudong Lin, Xuande Feng, Jaywon Koo, Sounak Ray, Shih-Fu Chang

Comments: AAAI 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[538] arXiv:2206.07240 [pdf, other]: Title: Test-Time Adaptation for Visual Document Understanding

Authors: Sayna Ebrahimi, Sercan O. Arik, Tomas Pfister

Comments: Accepted at TMLR 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[539] arXiv:2206.07255 [pdf, other]: Title: GRAM-HD: 3D-Consistent Image Generation at High Resolution with Generative Radiance Manifolds

Authors: Jianfeng Xiang, Jiaolong Yang, Yu Deng, Xin Tong

Comments: ICCV2023 camera ready version (more results and method comparisons). Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[540] arXiv:2206.07259 [pdf, other]: Title: Self-Supervised Learning of Image Scale and Orientation

Authors: Jongmin Lee, Yoonwoo Jeong, Minsu Cho

Comments: Presented in BMVC 2021, code is available on this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[541] arXiv:2206.07267 [pdf, other]: Title: Rethinking Generalization in Few-Shot Classification

Authors: Markus Hiller, Rongkai Ma, Mehrtash Harandi, Tom Drummond

Comments: Accepted at NeurIPS 2022. Code available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[542] arXiv:2206.07272 [pdf, ps, other]: Title: Machine vision for vial positioning detection toward the safe automation of material synthesis

Authors: Leslie Ching Ow Tiong, Hyuk Jun Yoo, Na Yeon Kim, Kwan-Young Lee, Sang Soo Han, Donghun Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[543] arXiv:2206.07282 [pdf, other]: Title: Human Eyes Inspired Recurrent Neural Networks are More Robust Against Adversarial Noises

Authors: Minkyu Choi, Yizhen Zhang, Kuan Han, Xiaokai Wang, Zhongming Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[544] arXiv:2206.07298 [pdf, other]: Title: S$^2$-FPN: Scale-ware Strip Attention Guided Feature Pyramid Network for Real-time Semantic Segmentation

Authors: Mohammed A. M. Elhassan, Chenhui Yang, Chenxi Huang, Tewodros Legesse Munea, Xin Hong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[545] arXiv:2206.07307 [pdf, other]: Title: VCT: A Video Compression Transformer

Authors: Fabian Mentzer, George Toderici, David Minnen, Sung-Jin Hwang, Sergi Caelles, Mario Lucic, Eirikur Agustsson

Comments: NeurIPS'22 Camera Ready Version. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[546] arXiv:2206.07326 [pdf, other]: Title: Recent Advances in Scene Image Representation and Classification

Authors: Chiranjibi Sitaula, Tej Bahadur Shahi, Faezeh Marzbanrad, Jagannath Aryal

Comments: This paper is under review in Multimedia Tools and Applications (Springer) journal. This article may be deleted or updated based on the policies of the journal

Journal-ref: Multimedia Tools and Applications, 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[547] arXiv:2206.07344 [pdf, other]: Title: Automatic Detection of Rice Disease in Images of Various Leaf Sizes

Authors: Kantip Kiratiratanapruk, Pitchayagan Temniranrat, Wasin Sinthupinyo, Sanparith Marukatat, Sujin Patarapuwadol

Comments: 28 pages, 13 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[548] arXiv:2206.07348 [pdf, ps, other]: Title: Unsupervised multi-branch Capsule for Hyperspectral and LiDAR classification

Authors: Quanfeng Xu, Yi Tang, Yumei She

Comments: 10 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[549] arXiv:2206.07349 [pdf, other]: Title: XMorpher: Full Transformer for Deformable Medical Image Registration via Cross Attention

Authors: Jiacheng Shi, Yuting He, Youyong Kong, Jean-Louis Coatrieux, Huazhong Shu, Guanyu Yang, Shuo Li

Comments: accepted by MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[550] arXiv:2206.07352 [pdf, ps, other]: Title: Robust SAR ATR on MSTAR with Deep Learning Models trained on Full Synthetic MOCEM data

Authors: Benjamin Camus, Corentin Le Barbu, Eric Monteux

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Image and Video Processing (eess.IV)
[551] arXiv:2206.07372 [pdf, other]: Title: MonoGround: Detecting Monocular 3D Objects from the Ground

Authors: Zequn Qin, Xi Li

Comments: CVPR22

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[552] arXiv:2206.07389 [pdf, other]: Title: Ultra Fast Deep Lane Detection with Hybrid Anchor Driven Ordinal Classification

Authors: Zequn Qin, Pengyi Zhang, Xi Li

Comments: TPAMI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[553] arXiv:2206.07394 [pdf, other]: Title: Efficient Adaptive Ensembling for Image Classification

Authors: Antonio Bruno, Davide Moroni, Massimo Martinelli

Journal-ref: Expert Systems (2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[554] arXiv:2206.07423 [pdf, other]: Title: Zero-shot object goal visual navigation

Authors: Qianfan Zhao, Lu Zhang, Bin He, Hong Qiao, Zhiyong Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[555] arXiv:2206.07431 [pdf, other]: Title: Physically-admissible polarimetric data augmentation for road-scene analysis

Authors: Cyprien Ruffino, Rachel Blin, Samia Ainouz, Gilles Gasso, Romain Hérault, Fabrice Meriaudeau, Stéphane Canu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[556] arXiv:2206.07434 [pdf, other]: Title: Self-Supervised Implicit Attention: Guided Attention by The Model Itself

Authors: Jinyi Wu, Xun Gong, Zhemin Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[557] arXiv:2206.07435 [pdf, other]: Title: Forecasting of depth and ego-motion with transformers and self-supervision

Authors: Houssem Boulahbal, Adrian Voicila, Andrew Comport

Comments: Accepted in ICPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[558] arXiv:2206.07458 [pdf, other]: Title: VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection

Authors: Joanna Hong, Minsu Kim, Yong Man Ro

Comments: Accepted by ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[559] arXiv:2206.07459 [pdf, other]: Title: READ: Aggregating Reconstruction Error into Out-of-distribution Detection

Authors: Wenyu Jiang, Yuxin Ge, Hao Cheng, Mingcai Chen, Shuai Feng, Chongjun Wang

Comments: Accepted to AAAI 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[560] arXiv:2206.07460 [pdf, other]: Title: Coarse-to-fine Deep Video Coding with Hyperprior-guided Mode Prediction

Authors: Zhihao Hu, Guo Lu, Jinyang Guo, Shan Liu, Wei Jiang, Dong Xu

Comments: CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[561] arXiv:2206.07468 [pdf, ps, other]: Title: PolyU-BPCoMa: A Dataset and Benchmark Towards Mobile Colorized Mapping Using a Backpack Multisensorial System

Authors: Wenzhong Shi, Pengxin Chen, Muyang Wang, Sheng Bao, Haodong Xiang, Yue Yu, Daping Yang

Comments: 11 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[562] arXiv:2206.07510 [pdf, other]: Title: Deep Multi-Task Networks For Occluded Pedestrian Pose Estimation

Authors: Arindam Das, Sudip Das, Ganesh Sistu, Jonathan Horgan, Ujjwal Bhattacharya, Edward Jones, Martin Glavin, Ciarán Eising

Comments: 4 pages, 5 tables, 2 figures

Journal-ref: Proceedings of the 2022 Irish Machine Vision and Image Processing Conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[563] arXiv:2206.07557 [pdf, other]: Title: How to Reduce Change Detection to Semantic Segmentation

Authors: Guo-Hua Wang, Bin-Bin Gao, Chengjie Wang

Comments: Accepted by Pattern Recognition. Code is at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[564] arXiv:2206.07565 [pdf, other]: Title: A Meta-Analysis of Distributionally-Robust Models

Authors: Benjamin Feuer, Ameya Joshi, Chinmay Hegde

Comments: To be presented at ICML Workshop on Principles of Distribution Shift 2022. Copyright 2022 by the author(s)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[565] arXiv:2206.07578 [src]: Title: E2V-SDE: From Asynchronous Events to Fast and Continuous Video Reconstruction via Neural Stochastic Differential Equations

Authors: Jongwan Kim, DongJin Lee, Byunggook Na, Seongsik Park, Jeonghee Jo, Sungroh Yoon

Comments: arXiv admin note: This submission has been withdrawn by arXiv administrators due to inappropriate text overlap with external sources. Additional information at this https URL

Journal-ref: The IEEE / CVF Computer Vision and Pattern Recognition Conference 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[566] arXiv:2206.07580 [pdf, other]: Title: Evaluating object detector ensembles for improving the robustness of artifact detection in endoscopic video streams

Authors: Pedro Esteban Chavarrias-Solano, Carlos Axel Garcia-Vega, Francisco Javier Lopez-Tiro, Gilberto Ochoa-Ruiz, Thomas Bazin, Dominique Lamarque, Christian Daul

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[567] arXiv:2206.07634 [pdf, other]: Title: Real3D-Aug: Point Cloud Augmentation by Placing Real Objects with Occlusion Handling for 3D Detection and Segmentation

Authors: Petr Šebek, Šimon Pokorný, Patrik Vacek, Tomáš Svoboda

Comments: Submitted on 15th June 2022 to IEEE RA-L journal

Journal-ref: Computer Vision Winter Workshop 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[568] arXiv:2206.07643 [pdf, other]: Title: Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

Authors: Zi-Yi Dou, Aishwarya Kamath, Zhe Gan, Pengchuan Zhang, Jianfeng Wang, Linjie Li, Zicheng Liu, Ce Liu, Yann LeCun, Nanyun Peng, Jianfeng Gao, Lijuan Wang

Comments: NeurIPS 2022. Project Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[569] arXiv:2206.07662 [pdf, other]: Title: SP-ViT: Learning 2D Spatial Priors for Vision Transformers

Authors: Yuxuan Zhou, Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, Lei Zhang, Margret Keuper, Xiansheng Hua

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[570] arXiv:2206.07669 [pdf, other]: Title: A Unified Sequence Interface for Vision Tasks

Authors: Ting Chen, Saurabh Saxena, Lala Li, Tsung-Yi Lin, David J. Fleet, Geoffrey Hinton

Comments: The first three authors contributed equally

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[571] arXiv:2206.07684 [pdf, other]: Title: AVATAR: Unconstrained Audiovisual Speech Recognition

Authors: Valentin Gabeur, Paul Hongsuck Seo, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[572] arXiv:2206.07687 [pdf, other]: Title: Structured Sparsity Learning for Efficient Video Super-Resolution

Authors: Bin Xia, Jingwen He, Yulun Zhang, Yitong Wang, Yapeng Tian, Wenming Yang, Luc Van Gool

Comments: Accepted by CVPR2023, code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[573] arXiv:2206.07689 [pdf, other]: Title: Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022

Authors: Elad Ben-Avraham, Roei Herzig, Karttikeya Mangalam, Amir Bar, Anna Rohrbach, Leonid Karlinsky, Trevor Darrell, Amir Globerson

Comments: Ego4D CVPR22 Object State Localization challenge. arXiv admin note: substantial text overlap with arXiv:2206.06346

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[574] arXiv:2206.07690 [pdf, other]: Title: ELUDE: Generating interpretable explanations via a decomposition into labelled and unlabelled features

Authors: Vikram V. Ramaswamy, Sunnie S. Y. Kim, Nicole Meister, Ruth Fong, Olga Russakovsky

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[575] arXiv:2206.07692 [pdf, other]: Title: A Simple Data Mixing Prior for Improving Self-Supervised Learning

Authors: Sucheng Ren, Huiyu Wang, Zhengqi Gao, Shengfeng He, Alan Yuille, Yuyin Zhou, Cihang Xie

Comments: CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[576] arXiv:2206.07695 [pdf, other]: Title: VoxGRAF: Fast 3D-Aware Image Synthesis with Sparse Voxel Grids

Authors: Katja Schwarz, Axel Sauer, Michael Niemeyer, Yiyi Liao, Andreas Geiger

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[577] arXiv:2206.07696 [pdf, other]: Title: Diffusion Models for Video Prediction and Infilling

Authors: Tobias Höppe, Arash Mehrjou, Stefan Bauer, Didrik Nielsen, Andrea Dittadi

Comments: Published in TMLR (11/2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[578] arXiv:2206.07698 [pdf, other]: Title: Neural Deformable Voxel Grid for Fast Optimization of Dynamic View Synthesis

Authors: Xiang Guo, Guanying Chen, Yuchao Dai, Xiaoqing Ye, Jiadai Sun, Xiao Tan, Errui Ding

Comments: Technical Report: 29 pages; project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[579] arXiv:2206.07699 [pdf, other]: Title: Write and Paint: Generative Vision-Language Models are Unified Modal Learners

Authors: Shizhe Diao, Wangchunshu Zhou, Xinsong Zhang, Jiawei Wang

Comments: ICLR 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[580] arXiv:2206.07700 [pdf, other]: Title: Masked Siamese ConvNets

Authors: Li Jing, Jiachen Zhu, Yann LeCun

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[581] arXiv:2206.07704 [pdf, other]: Title: Waymo Open Dataset: Panoramic Video Panoptic Segmentation

Authors: Jieru Mei, Alex Zihao Zhu, Xinchen Yan, Hang Yan, Siyuan Qiao, Yukun Zhu, Liang-Chieh Chen, Henrik Kretzschmar, Dragomir Anguelov

Comments: Our dataset can be found at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[582] arXiv:2206.07705 [pdf, other]: Title: LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D Detection

Authors: Wei-Chih Hung, Henrik Kretzschmar, Vincent Casser, Jyh-Jing Hwang, Dragomir Anguelov

Comments: Find the primary metrics for the 2022 Waymo Open Dataset 3D Camera-Only Detection Challenge at this https URL. Find the code at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[583] arXiv:2206.07706 [pdf, other]: Title: Masked Frequency Modeling for Self-Supervised Visual Pre-Training

Authors: Jiahao Xie, Wei Li, Xiaohang Zhan, Ziwei Liu, Yew Soon Ong, Chen Change Loy

Comments: ICLR 2023. Project page: this https URL Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[584] arXiv:2206.07707 [pdf, other]: Title: Variable Bitrate Neural Fields

Authors: Towaki Takikawa, Alex Evans, Jonathan Tremblay, Thomas Müller, Morgan McGuire, Alec Jacobson, Sanja Fidler

Comments: SIGGRAPH 2022. Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[585] arXiv:2206.07710 [pdf, other]: Title: PlanarRecon: Real-time 3D Plane Detection and Reconstruction from Posed Monocular Videos

Authors: Yiming Xie, Matheus Gadelha, Fengting Yang, Xiaowei Zhou, Huaizu Jiang

Comments: CVPR 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[586] arXiv:2206.07764 [pdf, other]: Title: SAVi++: Towards End-to-End Object-Centric Learning from Real-World Videos

Authors: Gamaleldin F. Elsayed, Aravindh Mahendran, Sjoerd van Steenkiste, Klaus Greff, Michael C. Mozer, Thomas Kipf

Comments: Project page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[587] arXiv:2206.07771 [pdf, other]: Title: Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation

Authors: Ye Zhu, Yu Wu, Kyle Olszewski, Jian Ren, Sergey Tulyakov, Yan Yan

Comments: ICLR 2023. Project at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[588] arXiv:2206.07802 [pdf, other]: Title: Improving generalization by mimicking the human visual diet

Authors: Spandan Madan, You Li, Mengmi Zhang, Hanspeter Pfister, Gabriel Kreiman

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[589] arXiv:2206.07835 [pdf, other]: Title: Disentangling visual and written concepts in CLIP

Authors: Joanna Materzynska, Antonio Torralba, David Bau

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[590] arXiv:2206.07846 [pdf, ps, other]: Title: Action Spotting using Dense Detection Anchors Revisited: Submission to the SoccerNet Challenge 2022

Authors: João V. B. Soares, Avijit Shah

Comments: v2: a few more experiments, more detailed method description

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[591] arXiv:2206.07850 [pdf, other]: Title: HF-NeuS: Improved Surface Reconstruction Using High-Frequency Details

Authors: Yiqun Wang, Ivan Skorokhodov, Peter Wonka

Comments: To appear in NeurIPS 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[592] arXiv:2206.07893 [pdf, other]: Title: PeQuENet: Perceptual Quality Enhancement of Compressed Video with Adaptation- and Attention-based Network

Authors: Saiping Zhang, Luis Herranz, Marta Mrak, Marc Gorriz Blanch, Shuai Wan, Fuzheng Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[593] arXiv:2206.07897 [pdf, other]: Title: NCAGC: A Neighborhood Contrast Framework for Attributed Graph Clustering

Authors: Tong Wang, Guanyu Yang, Qijia He, Zhenquan Zhang, Junhua Wu

Journal-ref: Neurocomputing, 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[594] arXiv:2206.07932 [pdf, other]: Title: Lifelong Wandering: A realistic few-shot online continual learning setting

Authors: Mayank Lunayach, James Smith, Zsolt Kira

Comments: CVPR 2022 Workshop on Continual Learning

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[595] arXiv:2206.07934 [pdf, other]: Title: BANet: Motion Forecasting with Boundary Aware Network

Authors: Chen Zhang, Honglin Sun, Chen Chen, Yandong Guo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[596] arXiv:2206.07953 [pdf, other]: Title: Analysis and Extensions of Adversarial Training for Video Classification

Authors: Kaleab A. Kinfu, René Vidal

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[597] arXiv:2206.07959 [pdf, other]: Title: Simple-BEV: What Really Matters for Multi-Sensor BEV Perception?

Authors: Adam W. Harley, Zhaoyuan Fang, Jie Li, Rares Ambrus, Katerina Fragkiadaki

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[598] arXiv:2206.07967 [pdf, other]: Title: DreamNet: A Deep Riemannian Network based on SPD Manifold Learning for Visual Classification

Authors: Rui Wang, Xiao-Jun Wu, Ziheng Chen, Tianyang Xu, Josef Kittler

Comments: 9 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[599] arXiv:2206.07981 [pdf, other]: Title: Multi-scale Cooperative Multimodal Transformers for Multimodal Sentiment Analysis in Videos

Authors: Lianyang Ma, Yu Yao, Tao Liang, Tongliang Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[600] arXiv:2206.07986 [pdf, other]: Title: Image Captioning based on Feature Refinement and Reflective Decoding

Authors: Ghadah Alabduljabbar, Hafida Benhidour, Said Kerrache

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[601] arXiv:2206.07990 [pdf, other]: Title: Patch-level Representation Learning for Self-supervised Vision Transformers

Authors: Sukmin Yun, Hankook Lee, Jaehyung Kim, Jinwoo Shin

Comments: Accepted to CVPR 2022 (Oral). Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[602] arXiv:2206.07994 [pdf, other]: Title: Joint Class-Affinity Loss Correction for Robust Medical Image Segmentation with Noisy Labels

Authors: Xiaoqing Guo, Yixuan Yuan

Comments: Accepted to MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[603] arXiv:2206.08009 [pdf, other]: Title: Balancing Discriminability and Transferability for Source-Free Domain Adaptation

Authors: Jogendra Nath Kundu, Akshay Kulkarni, Suvaansh Bhambri, Deepesh Mehta, Shreyas Kulkarni, Varun Jampani, R. Venkatesh Babu

Comments: ICML 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[604] arXiv:2206.08016 [pdf, other]: Title: Backbones-Review: Feature Extraction Networks for Deep Learning and Deep Reinforcement Learning Approaches

Authors: Omar Elharrouss, Younes Akbari, Noor Almaadeed, Somaya Al-Maadeed

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[605] arXiv:2206.08026 [pdf, other]: Title: DeepFormableTag: End-to-end Generation and Recognition of Deformable Fiducial Markers

Authors: Mustafa B. Yaldiz, Andreas Meuleman, Hyeonjoong Jang, Hyunho Ha, Min H. Kim

Journal-ref: ACM Transactions on Graphics 40, 4, Article 67 (August 2021)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[606] arXiv:2206.08083 [pdf, other]: Title: CARLANE: A Lane Detection Benchmark for Unsupervised Domain Adaptation from Simulation to multiple Real-World Domains

Authors: Julian Gebele, Bonifaz Stuhr, Johann Haselberger

Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks, 22 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[607] arXiv:2206.08084 [pdf, other]: Title: An Improved Normed-Deformable Convolution for Crowd Counting

Authors: Xin Zhong, Zhaoyi Yan, Jing Qin, Wangmeng Zuo, Weigang Lu

Journal-ref: IEEE Signal Processing Letters 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[608] arXiv:2206.08105 [pdf, other]: Title: A Simple Baseline for Adversarial Domain Adaptation-based Unsupervised Flood Forecasting

Authors: Delong Chen, Ruizhi Zhou, Yanling Pan, Fan Liu

Comments: Technical report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[609] arXiv:2206.08126 [pdf, other]: Title: Channel Importance Matters in Few-Shot Image Classification

Authors: Xu Luo, Jing Xu, Zenglin Xu

Comments: Accepted to ICML 2022; code available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[610] arXiv:2206.08129 [pdf, other]: Title: Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline

Authors: Penghao Wu, Xiaosong Jia, Li Chen, Junchi Yan, Hongyang Li, Yu Qiao

Comments: Accepted at NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[611] arXiv:2206.08150 [pdf, other]: Title: Self-Adaptive Label Augmentation for Semi-supervised Few-shot Classification

Authors: Xueliang Wang, Jianyu Cai, Shuiwang Ji, Houqiang Li, Feng Wu, Jie Wang

Comments: 9 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[612] arXiv:2206.08155 [pdf, other]: Title: Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

Authors: Antoine Yang, Antoine Miech, Josef Sivic, Ivan Laptev, Cordelia Schmid

Comments: NeurIPS 2022 Camera-Ready; Project Webpage: this https URL; 25 pages; 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[613] arXiv:2206.08158 [pdf, other]: Title: Volumetric Supervised Contrastive Learning for Seismic Semantic Segmentation

Authors: Kiran Kokilepersaud, Mohit Prabhushankar, Ghassan AlRegib

Journal-ref: The International Meeting for Applied Geoscience & Energy (IMAGE) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[614] arXiv:2206.08171 [pdf, other]: Title: K-Radar: 4D Radar Object Detection for Autonomous Driving in Various Weather Conditions

Authors: Dong-Hee Paek, Seung-Hyun Kong, Kevin Tirta Wijaya

Comments: Accepted at NeurIPS 2022 Datasets and Benchmarks Track

Journal-ref: Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS Datasets and Benchmarks 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[615] arXiv:2206.08172 [pdf, other]: Title: RefCrowd: Grounding the Target in Crowd with Referring Expressions

Authors: Heqian Qiu, Hongliang Li, Taijin Zhao, Lanxiao Wang, Qingbo Wu, Fanman Meng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[616] arXiv:2206.08176 [pdf, other]: Title: Level 2 Autonomous Driving on a Single Device: Diving into the Devils of Openpilot

Authors: Li Chen, Tutian Tang, Zhitian Cai, Yang Li, Penghao Wu, Hongyang Li, Jianping Shi, Junchi Yan, Yu Qiao

Comments: Tech report. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[617] arXiv:2206.08182 [pdf, other]: Title: Nucleus Segmentation and Analysis in Breast Cancer with the MIScnn Framework

Authors: Adrian Pfleiderer, Dominik Müller, Frank Kramer

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[618] arXiv:2206.08186 [pdf, other]: Title: Asymptotic Soft Cluster Pruning for Deep Neural Networks

Authors: Tao Niu, Yinglei Teng, Panpan Zou

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[619] arXiv:2206.08194 [pdf, other]: Title: Online Segmentation of LiDAR Sequences: Dataset and Algorithm

Authors: Romain Loiseau, Mathieu Aubry, Loïc Landrieu

Comments: Code and data are available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[620] arXiv:2206.08206 [pdf, other]: Title: Selective Multi-Scale Learning for Object Detection

Authors: Junliang Chen, Weizeng Lu, Linlin Shen

Comments: Accepted by ICANN2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[621] arXiv:2206.08219 [pdf, other]: Title: HaGRID - HAnd Gesture Recognition Image Dataset

Authors: Alexander Kapitanov, Karina Kvanchiani, Alexander Nagaev, Roman Kraynov, Andrei Makhliarchuk

Comments: 12 pages, 5 figures, open-source dataset for computer vision

Journal-ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024) 4572-4581

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[622] arXiv:2206.08222 [pdf, other]: Title: Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking Consistency

Authors: Viraj Prabhu, Sriram Yenamandra, Aaditya Singh, Judy Hoffman

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[623] arXiv:2206.08224 [pdf, other]: Title: Multi scale Feature Extraction and Fusion for Online Knowledge Distillation

Authors: Panpan Zou, Yinglei Teng, Tao Niu

Comments: 12 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[624] arXiv:2206.08227 [pdf, other]: Title: Delving into the Scale Variance Problem in Object Detection

Authors: Junliang Chen, Xiaodong Zhao, Linlin Shen

Comments: Accepted by ICTAI2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[625] arXiv:2206.08229 [pdf, other]: Title: Open-Set Recognition with Gradient-Based Representations

Authors: Jinsol Lee, Ghassan AlRegib

Comments: Published at IEEE International Conference on Image Processing (ICIP) 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[626] arXiv:2206.08236 [pdf, other]: Title: Simple and Efficient Architectures for Semantic Segmentation

Authors: Dushyant Mehta, Andrii Skliar, Haitam Ben Yahia, Shubhankar Borse, Fatih Porikli, Amirhossein Habibian, Tijmen Blankevoort

Comments: To be presented at Efficient Deep Learning for Computer Vision Workshop at CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[627] arXiv:2206.08275 [pdf, other]: Title: Rank the triplets: A ranking-based multiple instance learning framework for detecting HPV infection in head and neck cancers using routine H&E images

Authors: Ruoyu Wang, Syed Ali Khurram, Amina Asif, Lawrence Young, Nasir Rajpoot

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[628] arXiv:2206.08304 [pdf, other]: Title: Adversarial Patch Attacks and Defences in Vision-Based Tasks: A Survey

Authors: Abhijith Sharma, Yijun Bian, Phil Munz, Apurva Narayan

Comments: A. Sharma and Y. Bian share equal contribution

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[629] arXiv:2206.08339 [pdf, other]: Title: iBoot: Image-bootstrapped Self-Supervised Video Representation Learning

Authors: Fatemeh Saleh, Fuwen Tan, Adrian Bulat, Georgios Tzimiropoulos, Brais Martinez

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[630] arXiv:2206.08343 [pdf, other]: Title: Realistic One-shot Mesh-based Head Avatars

Authors: Taras Khakhulin, Vanessa Sklyarova, Victor Lempitsky, Egor Zakharov

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[631] arXiv:2206.08345 [pdf, ps, other]: Title: Real-World Single Image Super-Resolution Under Rainy Condition

Authors: Mohammad Shahab Uddin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[632] arXiv:2206.08347 [pdf, other]: Title: Beyond Supervised vs. Unsupervised: Representative Benchmarking and Analysis of Image Representation Learning

Authors: Matthew Gwilliam, Abhinav Shrivastava

Comments: CVPR 2022, project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[633] arXiv:2206.08355 [pdf, other]: Title: FWD: Real-time Novel View Synthesis with Forward Warping and Depth

Authors: Ang Cao, Chris Rockwell, Justin Johnson

Comments: CVPR 2022. Project website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[634] arXiv:2206.08356 [pdf, other]: Title: OmniMAE: Single Model Masked Pretraining on Images and Videos

Authors: Rohit Girdhar, Alaaeldin El-Nouby, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra

Comments: CVPR 2023. Code/models: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[635] arXiv:2206.08357 [pdf, other]: Title: Spatially-Adaptive Multilayer Selection for GAN Inversion and Editing

Authors: Gaurav Parmar, Yijun Li, Jingwan Lu, Richard Zhang, Jun-Yan Zhu, Krishna Kumar Singh

Comments: CVPR 2022. GitHub: this https URL. Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[636] arXiv:2206.08358 [pdf, other]: Title: MixGen: A New Multi-Modal Data Augmentation

Authors: Xiaoshuai Hao, Yi Zhu, Srikar Appalaraju, Aston Zhang, Wanqian Zhang, Bo Li, Mu Li

Comments: First three authors contributed equally. Code is available at this https URL. Oral presentation at WACV 2023 Pretraining Large Vision and Multimodal Models Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[637] arXiv:2206.08361 [pdf, other]: Title: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields

Authors: Keqiang Sun, Shangzhe Wu, Zhaoyang Huang, Ning Zhang, Quan Wang, HongSheng Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[638] arXiv:2206.08362 [pdf, other]: Title: Unified Fourier-based Kernel and Nonlinearity Design for Equivariant Networks on Homogeneous Spaces

Authors: Yinshuang Xu, Jiahui Lei, Edgar Dobriban, Kostas Daniilidis

Comments: Accepted at ICML2022 Thirty-ninth International Conference on Machine Learning

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[639] arXiv:2206.08365 [pdf, other]: Title: Virtual Correspondence: Humans as a Cue for Extreme-View Geometry

Authors: Wei-Chiu Ma, Anqi Joyce Yang, Shenlong Wang, Raquel Urtasun, Antonio Torralba

Comments: CVPR 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[640] arXiv:2206.08367 [pdf, other]: Title: SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain Adaptation

Authors: Tao Sun, Mattia Segu, Janis Postels, Yuxuan Wang, Luc Van Gool, Bernt Schiele, Federico Tombari, Fisher Yu

Comments: Published at IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[641] arXiv:2206.08368 [pdf, other]: Title: Unbiased 4D: Monocular 4D Reconstruction with a Neural Deformation Model

Authors: Erik C.M. Johnson, Marc Habermann, Soshi Shimada, Vladislav Golyanik, Christian Theobalt

Comments: 26 pages, 17 figures, 8 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[642] arXiv:2206.08405 [pdf, ps, other]: Title: Going Deeper than Tracking: a Survey of Computer-Vision Based Recognition of Animal Pain and Affective States

Authors: Sofia Broomé, Marcelo Feighelstein, Anna Zamansky, Gabriel Carreira Lencioni, Pia Haubro Andersen, Francisca Pessanha, Marwa Mahmoud, Hedvig Kjellström, Albert Ali Salah

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[643] arXiv:2206.08423 [pdf, other]: Title: IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering in Indoor Scenes

Authors: Rui Zhu, Zhengqin Li, Janarbek Matai, Fatih Porikli, Manmohan Chandraker

Comments: CVPR 22 camera ready version with supplementary

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[644] arXiv:2206.08427 [pdf, other]: Title: SATBench: Benchmarking the speed-accuracy tradeoff in object recognition by humans and dynamic neural networks

Authors: Ajay Subramanian, Sara Price, Omkar Kumbhar, Elena Sizikova, Najib J. Majaj, Denis G. Pelli

Comments: 19 pages, 12 figures. Under Review at NeurIPS Datasets and Benchmarks Track 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[645] arXiv:2206.08428 [pdf, other]: Title: EyeNeRF: A Hybrid Representation for Photorealistic Synthesis, Animation and Relighting of Human Eyes

Authors: Gengyan Li (1 and 2), Abhimitra Meka (1), Franziska Müller (1), Marcel C. Bühler (2), Otmar Hilliges (2), Thabo Beeler (1) ((1) Google Inc., (2) ETH Zürich)

Comments: 16 pages, 16 figures, 1 table, to be published in ACM Transactions on Graphics (TOG) (Volume: 41, Issue: 4), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[646] arXiv:2206.08429 [pdf, other]: Title: Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes

Authors: Xiang Hao, Jingxiang Chen, Shixing Chen, Ahmed Saad, Raffay Hamid

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[647] arXiv:2206.08460 [pdf, other]: Title: TUSK: Task-Agnostic Unsupervised Keypoints

Authors: Yuhe Jin, Weiwei Sun, Jan Hosang, Eduard Trulls, Kwang Moo Yi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[648] arXiv:2206.08462 [pdf, other]: Title: Recursive Neural Programs: Variational Learning of Image Grammars and Part-Whole Hierarchies

Authors: Ares Fisher, Rajesh P.N. Rao

Comments: 9 pages, 6 figures. Fixed LaTeX typo for algorithm reference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[649] arXiv:2206.08477 [pdf, other]: Title: Backdoor Attacks on Vision Transformers

Authors: Akshayvarun Subramanya, Aniruddha Saha, Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Hamed Pirsiavash

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[650] arXiv:2206.08488 [pdf, other]: Title: Controllable Image Enhancement

Authors: Heewon Kim, Kyoung Mu Lee

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[651] arXiv:2206.08500 [pdf, other]: Title: What do navigation agents learn about their environment?

Authors: Kshitij Dwivedi, Gemma Roig, Aniruddha Kembhavi, Roozbeh Mottaghi

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[652] arXiv:2206.08509 [pdf, other]: Title: Neural Architecture Adaptation for Object Detection by Searching Channel Dimensions and Mapping Pre-trained Parameters

Authors: Harim Jung, Myeong-Seok Oh, Cheoljong Yang, Seong-Whan Lee

Comments: Accepted to ICPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[653] arXiv:2206.08524 [pdf, other]: Title: CDNet: Contrastive Disentangled Network for Fine-Grained Image Categorization of Ocular B-Scan Ultrasound

Authors: Ruilong Dan, Yunxiang Li, Yijie Wang, Gangyong Jia, Ruiquan Ge, Juan Ye, Qun Jin, Yaqi Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[654] arXiv:2206.08537 [pdf, ps, other]: Title: Large-Margin Representation Learning for Texture Classification

Authors: Jonathan de Matos, Luiz Eduardo Soares de Oliveira, Alceu de Souza Britto Junior, Alessandro Lameiras Koerich

Comments: 7 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[655] arXiv:2206.08547 [pdf, other]: Title: Texture Generation Using A Graph Generative Adversarial Network And Differentiable Rendering

Authors: Dharma KC, Clayton T. Morrison, Bradley Walls

Comments: The final publication is available at Springer via this http URL

Journal-ref: Springer.13836.(2023)388-401

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[656] arXiv:2206.08549 [pdf, other]: Title: Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images

Authors: Jiyeon Han, Hwanil Choi, Yunjey Choi, Junho Kim, Jung-Woo Ha, Jaesik Choi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[657] arXiv:2206.08566 [pdf, other]: Title: Active Data Discovery: Mining Unknown Data using Submodular Information Measures

Authors: Suraj Kothawade, Shivang Chopra, Saikat Ghosh, Rishabh Iyer

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[658] arXiv:2206.08567 [pdf, other]: Title: Rectify ViT Shortcut Learning by Visual Saliency

Authors: Chong Ma, Lin Zhao, Yuzhong Chen, David Weizhong Liu, Xi Jiang, Tuo Zhang, Xintao Hu, Dinggang Shen, Dajiang Zhu, Tianming Liu

Comments: NeurIPS2022 Under Review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[659] arXiv:2206.08568 [pdf, other]: Title: Multi-Contextual Predictions with Vision Transformer for Video Anomaly Detection

Authors: Joo-Yeon Lee, Woo-Jeoung Nam, Seong-Whan Lee

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[660] arXiv:2206.08572 [pdf, other]: Title: Enhanced Bi-directional Motion Estimation for Video Frame Interpolation

Authors: Xin Jin, Longhai Wu, Guotao Shen, Youxin Chen, Jie Chen, Jayoon Koo, Cheul-hee Hahm

Comments: Accepted by WACV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[661] arXiv:2206.08585 [pdf, other]: Title: HairFIT: Pose-Invariant Hairstyle Transfer via Flow-based Hair Alignment and Semantic-Region-Aware Inpainting

Authors: Chaeyeon Chung, Taewoo Kim, Hyelin Nam, Seunghwan Choi, Gyojung Gu, Sunghyun Park, Jaegul Choo

Comments: BMVC 2021 Oral Presentation

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[662] arXiv:2206.08605 [pdf, ps, other]: Title: On Efficient Real-Time Semantic Segmentation: A Survey

Authors: Christopher J. Holder, Muhammad Shafique

Comments: 19 pages, 13 figures, 4 tables. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[663] arXiv:2206.08610 [pdf, other]: Title: Masked Autoencoders for Generic Event Boundary Detection CVPR'2022 Kinetics-GEBD Challenge

Authors: Rui He, Yuanxi Sun, Youzeng Li, Zuwei Huang, Feng Hu, Xu Cheng, Jie Tang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[664] arXiv:2206.08614 [pdf, other]: Title: Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic Assessment

Authors: Daniel Vera Nieto, Luigi Celona, Clara Fernandez-Labrador

Comments: Accepted to NeurIPS Track on Datasets and Benchmarks 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[665] arXiv:2206.08632 [pdf, other]: Title: Learning Using Privileged Information for Zero-Shot Action Recognition

Authors: Zhiyi Gao, Yonghong Hou, Wanqing Li, Zihui Guo, Bin Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[666] arXiv:2206.08638 [pdf, ps, other]: Title: Minimum Noticeable Difference based Adversarial Privacy Preserving Image Generation

Authors: Wen Sun, Jian Jin, Weisi Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[667] arXiv:2206.08640 [pdf, other]: Title: Uncertainty-aware Evaluation of Time-Series Classification for Online Handwriting Recognition with Domain Shift

Authors: Andreas Klaß, Sven M. Lorenz, Martin W. Lauer-Schmaltz, David Rügamer, Bernd Bischl, Christopher Mutschler, Felix Ott

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[668] arXiv:2206.08641 [pdf, other]: Title: Diverse Multiple Trajectory Prediction Using a Two-stage Prediction Network Trained with Lane Loss

Authors: Sanmin Kim, Hyeongseok Jeon, Junwon Choi, Dongsuk Kum

Comments: RA-L accepted

Journal-ref: IEEE Robotics and Automation Letters (2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[669] arXiv:2206.08645 [pdf, other]: Title: Local Slot Attention for Vision-and-Language Navigation

Authors: Yifeng Zhuang, Qiang Sun, Yanwei Fu, Lifeng Chen, Xiangyang Xue

Comments: ICMR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[670] arXiv:2206.08655 [pdf, other]: Title: Learning Implicit Feature Alignment Function for Semantic Segmentation

Authors: Hanzhe Hu, Yinbo Chen, Jiarui Xu, Shubhankar Borse, Hong Cai, Fatih Porikli, Xiaolong Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[671] arXiv:2206.08657 [pdf, other]: Title: BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning

Authors: Xiao Xu, Chenfei Wu, Shachar Rosenman, Vasudev Lal, Wanxiang Che, Nan Duan

Comments: Accepted by AAAI 2023, Oral

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[672] arXiv:2206.08683 [pdf, other]: Title: AggNet: Learning to Aggregate Faces for Group Membership Verification

Authors: Marzieh Gheisari, Javad Amirian, Teddy Furon, Laurent Amsaleg

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[673] arXiv:2206.08701 [pdf, ps, other]: Title: Towards Real-Time Visual Tracking with Graded Color-names Features

Authors: Lin Li, Guoli Wang, Xuemei Guo

Comments: 12 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[674] arXiv:2206.08712 [pdf, other]: Title: An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping Functions

Authors: Yijun Yuan, Andreas Nuechter

Comments: Accepted to RAL2022, code at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[675] arXiv:2206.08748 [pdf, ps, other]: Title: ReViSe: Remote Vital Signs Measurement Using Smartphone Camera

Authors: Donghao Qiao, Amtul Haq Ayesha, Farhana Zulkernine, Raihan Masroor, Nauman Jaffar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[676] arXiv:2206.08749 [pdf, other]: Title: From a few Accurate 2D Correspondences to 3D Point Clouds

Authors: Trung-Kien Le, Ping Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[677] arXiv:2206.08751 [pdf, other]: Title: Perceptual Quality Assessment of Virtual Reality Videos in the Wild

Authors: Wen Wen, Mu Li, Yiru Yao, Xiangjie Sui, Yabin Zhang, Long Lan, Yuming Fang, Kede Ma

Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[678] arXiv:2206.08778 [pdf, other]: Title: CTooth: A Fully Annotated 3D Dataset and Benchmark for Tooth Volume Segmentation on Cone Beam Computed Tomography Images

Authors: Weiwei Cui, Yaqi Wang, Qianni Zhang, Huiyu Zhou, Dan Song, Xingyong Zuo, Gangyong Jia, Liaoyuan Zeng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[679] arXiv:2206.08789 [pdf, ps, other]: Title: Reconstructing vehicles from orthographic drawings using deep neural networks

Authors: Robin Klippert

Comments: 9 Pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[680] arXiv:2206.08791 [pdf, other]: Title: DU-Net based Unsupervised Contrastive Learning for Cancer Segmentation in Histology Images

Authors: Yilong Li, Yaqi Wang, Huiyu Zhou, Huaqiong Wang, Gangyong Jia, Qianni Zhang

Comments: arXiv admin note: text overlap with arXiv:2002.05709 by other authors

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[681] arXiv:2206.08792 [pdf, other]: Title: FD-CAM: Improving Faithfulness and Discriminability of Visual Explanation for CNNs

Authors: Hui Li, Zihao Li, Rui Ma, Tieru Wu

Comments: Accepted by ICPR 2022 and also accepted by CVPR 2022 Explainable Artificial Intelligence for Computer Vision (XAI4CV) Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[682] arXiv:2206.08794 [pdf, other]: Title: The Importance of Background Information for Out of Distribution Generalization

Authors: Jupinder Parmar, Khaled Saab, Brian Pogatchnik, Daniel Rubin, Christopher Ré

Comments: 6 pages, 2 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[683] arXiv:2206.08801 [pdf, other]: Title: Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training

Authors: Xiao Lu, Yihong Cao, Sheng Liu, Chengjiang Long, Zipei Chen, Xuanyu Zhou, Yimin Yang, Chunxia Xiao

Comments: Accepted in CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[684] arXiv:2206.08833 [pdf, ps, other]: Title: A Comparative Study of Confidence Calibration in Deep Learning: From Computer Vision to Medical Imaging

Authors: Riqiang Gao, Thomas Li, Yucheng Tang, Zhoubing Xu, Michael Kammer, Sanja L. Antic, Kim Sandler, Fabien Moldonado, Thomas A. Lasko, Bennett Landman

Comments: 17 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[685] arXiv:2206.08861 [pdf, other]: Title: DGMIL: Distribution Guided Multiple Instance Learning for Whole Slide Image Classification

Authors: Linhao Qu, Xiaoyuan Luo, Shaolei Liu, Manning Wang, Zhijian Song

Comments: accepted by MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[686] arXiv:2206.08880 [pdf, other]: Title: Improving Generalization of Metric Learning via Listwise Self-distillation

Authors: Zelong Zeng, Fan Yang, Zheng Wang, Shin'ichi Satoh

Comments: 11 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[687] arXiv:2206.08883 [pdf, other]: Title: CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer

Authors: Yao Mu, Shoufa Chen, Mingyu Ding, Jianyu Chen, Runjian Chen, Ping Luo

Comments: ICML 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[688] arXiv:2206.08898 [pdf, other]: Title: SimA: Simple Softmax-free Attention for Vision Transformers

Authors: Soroush Abbasi Koohpayegani, Hamed Pirsiavash

Comments: Code is available here: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[689] arXiv:2206.08903 [pdf, other]: Title: Colonoscopy 3D Video Dataset with Paired Depth from 2D-3D Registration

Authors: Taylor L. Bobrow, Mayank Golhar, Rohan Vijayan, Venkata S. Akshintala, Juan R. Garcia, Nicholas J. Durr

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[690] arXiv:2206.08916 [pdf, other]: Title: Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks

Authors: Jiasen Lu, Christopher Clark, Rowan Zellers, Roozbeh Mottaghi, Aniruddha Kembhavi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[691] arXiv:2206.08919 [pdf, other]: Title: VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix

Authors: Teng Wang, Wenhao Jiang, Zhichao Lu, Feng Zheng, Ran Cheng, Chengguo Yin, Ping Luo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[692] arXiv:2206.08920 [pdf, other]: Title: VectorMapNet: End-to-end Vectorized HD Map Learning

Authors: Yicheng Liu, Tianyuan Yuan, Yue Wang, Yilun Wang, Hang Zhao

Comments: Accepted by ICML 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[693] arXiv:2206.08927 [pdf, other]: Title: Cross-task Attention Mechanism for Dense Multi-task Learning

Authors: Ivan Lopes, Tuan-Hung Vu, Raoul de Charette

Comments: 10 figures, 6 tables, 23 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[694] arXiv:2206.08929 [pdf, other]: Title: TAVA: Template-free Animatable Volumetric Actors

Authors: Ruilong Li, Julian Tanke, Minh Vo, Michael Zollhofer, Jurgen Gall, Angjoo Kanazawa, Christoph Lassner

Comments: Code: this https URL; Project Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[695] arXiv:2206.08948 [pdf, other]: Title: CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation

Authors: Qihang Yu, Huiyu Wang, Dahun Kim, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen

Comments: CVPR 2022 Oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[696] arXiv:2206.08954 [pdf, other]: Title: Bag of Image Patch Embedding Behind the Success of Self-Supervised Learning

Authors: Yubei Chen, Adrien Bardes, Zengyi Li, Yann LeCun

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[697] arXiv:2206.08970 [pdf, other]: Title: MultiEarth 2022 -- The Champion Solution for the Matrix Completion Challenge via Multimodal Regression and Generation

Authors: Bo Peng, Hongchen Liu, Hang Zhou, Yuchuan Gou, Jui-Hsin Lai

Comments: CVPR 2022, MultiEarth 2022, Matrix Completion Challenge

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[698] arXiv:2206.08977 [pdf, ps, other]: Title: BN-HTRd: A Benchmark Dataset for Document Level Offline Bangla Handwritten Text Recognition (HTR) and Line Segmentation

Authors: Md. Ataur Rahman, Nazifa Tabassum, Mitu Paul, Riya Pal, Mohammad Khairul Islam

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[699] arXiv:2206.08990 [pdf, other]: Title: Shadows Shed Light on 3D Objects

Authors: Ruoshi Liu, Sachit Menon, Chengzhi Mao, Dennis Park, Simon Stent, Carl Vondrick

Comments: 19 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[700] arXiv:2206.09027 [pdf, other]: Title: Landscape Learning for Neural Network Inversion

Authors: Ruoshi Liu, Chengzhi Mao, Purva Tendulkar, Hao Wang, Carl Vondrick

Comments: 15 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[701] arXiv:2206.09038 [pdf, other]: Title: Validation of Vector Data using Oblique Images

Authors: Pragyana Mishra, Eyal Ofek, Gur Kimchi

Comments: In Proceedings of 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM GIS'08)

Journal-ref: Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM GIS '08), pp. 1-10. 2008

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[702] arXiv:2206.09055 [src]: Title: Augmented Imagefication: A Data-driven Fault Detection Method for Aircraft Air Data Sensors

Authors: Hang Zhao, Jinyi Ma, Zhongzhi Li, Yiqun Dong, Jianliang Ai

Comments: a crucial design defect to acquire flying data by simulation

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[703] arXiv:2206.09061 [pdf, other]: Title: Design of Supervision-Scalable Learning Systems: Methodology and Performance Benchmarking

Authors: Yijing Yang, Hongyu Fu, C.-C. Jay Kuo

Comments: 16 pages, 12 figures, 4 tables, under consideration at Pattern Recognition

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[704] arXiv:2206.09068 [pdf, other]: Title: Attention-based Dynamic Subspace Learners for Medical Image Analysis

Authors: Sukesh Adiga V, Jose Dolz, Herve Lombaert

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[705] arXiv:2206.09071 [pdf, other]: Title: Analysis & Computational Complexity Reduction of Monocular and Stereo Depth Estimation Techniques

Authors: Rajeev Patwari, Varo Ly

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[706] arXiv:2206.09082 [pdf, other]: Title: Context-aware Proposal Network for Temporal Action Detection

Authors: Xiang Wang, Huaxin Zhang, Shiwei Zhang, Changxin Gao, Yuanjie Shao, Nong Sang

Comments: First place winning solution for temporal action detection task in CVPR-2022 ActivityNet Challenge. arXiv admin note: substantial text overlap with arXiv:2106.11812

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[707] arXiv:2206.09089 [pdf, ps, other]: Title: A Dynamic Data Driven Approach for Explainable Scene Understanding

Authors: Zachary A Daniels, Dimitris Metaxas

Comments: Unpublished draft of book chapter

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[708] arXiv:2206.09106 [pdf, other]: Title: Embodied Scene-aware Human Pose Estimation

Authors: Zhengyi Luo, Shun Iwase, Ye Yuan, Kris Kitani

Comments: NeurIPS 2022. Project website: this https URL. Zhengyi Luo and Shun Iwase contributed equally

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[709] arXiv:2206.09111 [pdf, other]: Title: VReBERT: A Simple and Flexible Transformer for Visual Relationship Detection

Authors: Yu Cui, Moshiur Farazi

Comments: Published at International Conference on Pattern Recognition (ICPR) 2022, Montreal Quebec

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[710] arXiv:2206.09114 [pdf, other]: Title: Bear the Query in Mind: Visual Grounding with Query-conditioned Convolution

Authors: Chonghan Chen, Qi Jiang, Chih-Hao Wang, Noel Chen, Haohan Wang, Xiang Li, Bhiksha Raj

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[711] arXiv:2206.09132 [pdf, other]: Title: Replacing Labeled Real-image Datasets with Auto-generated Contours

Authors: Hirokatsu Kataoka, Ryo Hayamizu, Ryosuke Yamada, Kodai Nakashima, Sora Takashima, Xinyu Zhang, Edgar Josafat Martinez-Noriega, Nakamasa Inoue, Rio Yokota

Comments: Accepted to CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[712] arXiv:2206.09148 [pdf, other]: Title: Deep Compatible Learning for Partially-Supervised Medical Image Segmentation

Authors: Ke Zhang, Xiahai Zhuang

Comments: 16 pages, 13 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[713] arXiv:2206.09178 [pdf, other]: Title: REVECA -- Rich Encoder-decoder framework for Video Event CAptioner

Authors: Jaehyuk Heo, YongGi Jeong, Sunwoo Kim, Jaehee Kim, Pilsung Kang

Comments: The IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR). LOng-form VidEo Understanding (LOVEU) workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[714] arXiv:2206.09191 [pdf, other]: Title: Gender Artifacts in Visual Datasets

Authors: Nicole Meister, Dora Zhao, Angelina Wang, Vikram V. Ramaswamy, Ruth Fong, Olga Russakovsky

Comments: ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[715] arXiv:2206.09202 [pdf, other]: Title: Camera Adaptation for Fundus-Image-Based CVD Risk Estimation

Authors: Zhihong Lin, Danli Shi, Donghao Zhang, Xianwen Shang, Mingguang He, Zongyuan Ge

Comments: This preprint has not undergone peer review (when applicable) or any post-submission improvements or corrections. The Version of Record of this contribution will be added soon

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[716] arXiv:2206.09221 [pdf, ps, other]: Title: 3D Face Parsing via Surface Parameterization and 2D Semantic Segmentation Network

Authors: Wenyuan Sun, Ping Zhou, Yangang Wang, Zongpu Yu, Jing Jin, Guangquan Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[717] arXiv:2206.09242 [pdf, other]: Title: GaLeNet: Multimodal Learning for Disaster Prediction, Management and Relief

Authors: Rohit Saha, Mengyi Fang, Angeline Yasodhara, Kyryl Truskovskyi, Azin Asgarian, Daniel Homola, Raahil Shah, Frederik Dieleman, Jack Weatheritt, Thomas Rogers

Comments: Accepted to CVPR 2022 Workshop on Multimodal Learning for Earth and Environment

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[718] arXiv:2206.09243 [pdf, other]: Title: Structured Light with Redundancy Codes

Authors: Zhanghao Sun, Yu Zhang, Yicheng Wu, Dong Huo, Yiming Qian, Jian Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[719] arXiv:2206.09244 [pdf, other]: Title: GAN2X: Non-Lambertian Inverse Rendering of Image GANs

Authors: Xingang Pan, Ayush Tewari, Lingjie Liu, Christian Theobalt

Comments: Accepted to 3DV 2022. The video demo is available at the project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[720] arXiv:2206.09256 [pdf, other]: Title: Multistream Gaze Estimation with Anatomical Eye Region Isolation by Synthetic to Real Transfer Learning

Authors: Zunayed Mahmud, Paul Hungler, Ali Etemad

Comments: 15 pages, 7 figures, 14 tables. This work has been accepted to the IEEE Transactions on Artificial Intelligence © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses

Journal-ref: IEEE Transactions on Artificial Intelligence, 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[721] arXiv:2206.09265 [pdf, ps, other]: Title: SAViR-T: Spatially Attentive Visual Reasoning with Transformers

Authors: Pritish Sahu, Kalliopi Basioti, Vladimir Pavlovic

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[722] arXiv:2206.09293 [pdf, other]: Title: Rethinking Bayesian Deep Learning Methods for Semi-Supervised Volumetric Medical Image Segmentation

Authors: Jianfeng Wang, Thomas Lukasiewicz

Comments: To appear at CVPR 2022, and the supplementary material can be found at the official site. The source codes are at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[723] arXiv:2206.09325 [pdf, other]: Title: EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm

Authors: Jiangning Zhang, Xiangtai Li, Yabiao Wang, Chengjie Wang, Yibo Yang, Yong Liu, Dacheng Tao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET)
[724] arXiv:2206.09358 [pdf, other]: Title: What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs

Authors: Tal Shaharabany, Yoad Tewel, Lior Wolf

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[725] arXiv:2206.09362 [src]: Title: Towards Generalizable Person Re-identification with a Bi-stream Generative Model

Authors: Xin Xu, Wei Liu, Zheng Wang, Ruiming Hu, Qi Tian

Comments: There is a mistake in equation 1

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[726] arXiv:2206.09365 [pdf, other]: Title: Semi-supervised Change Detection of Small Water Bodies Using RGB and Multispectral Images in Peruvian Rainforests

Authors: Kangning Cui, Seda Camalan, Ruoning Li, Victor P. Pauca, Sarra Alqahtani, Robert J. Plemmons, Miles Silman, Evan N. Dethier, David Lutz, Raymond H. Chan

Comments: 8 pages, 5 figures. Accepted to Proceedings of IEEE WHISPERS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[727] arXiv:2206.09372 [pdf, other]: Title: mvHOTA: A multi-view higher order tracking accuracy metric to measure spatial and temporal associations in multi-point detection

Authors: Lalith Sharan, Halvar Kelm, Gabriele Romano, Matthias Karck, Raffaele De Simone, Sandy Engelhardt

Comments: 16 pages, 9 figures

Journal-ref: Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization (2022) 1-9

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[728] arXiv:2206.09410 [pdf, other]: Title: Low-Mid Adversarial Perturbation against Unauthorized Face Recognition System

Authors: Jiaming Zhang, Qi Yi, Dongyuan Lu, Jitao Sang

Comments: published in Information Sciences

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[729] arXiv:2206.09414 [pdf, other]: Title: Terrain Classification using Transfer Learning on Hyperspectral Images: A Comparative study

Authors: Uphar Singh, Kumar Saurabh, Neelaksh Trehan, Ranjana Vyas, O.P. Vyas

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[730] arXiv:2206.09420 [pdf, other]: Title: Agricultural Plantation Classification using Transfer Learning Approach based on CNN

Authors: Uphar Singh, Tushar Musale, Ranjana Vyas, O.P. Vyas (Indian Institute of Information Technology, Allahabad, India)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[731] arXiv:2206.09474 [pdf, other]: Title: 3D Object Detection for Autonomous Driving: A Comprehensive Survey

Authors: Jiageng Mao, Shaoshuai Shi, Xiaogang Wang, Hongsheng Li

Comments: Accepted to International Journal of Computer Vision (IJCV). Project page is at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[732] arXiv:2206.09479 [pdf, other]: Title: StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis

Authors: Minguk Kang, Joonghyuk Shin, Jaesik Park

Comments: 32 pages, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, 2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[733] arXiv:2206.09485 [pdf, other]: Title: Video frame interpolation for high dynamic range sequences captured with dual-exposure sensors

Authors: Uğur Çoğalan, Mojtaba Bemana, Hans-Peter Seidel, Karol Myszkowski

Comments: 13 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[734] arXiv:2206.09500 [pdf, other]: Title: Unbiased Teacher v2: Semi-supervised Object Detection for Anchor-free and Anchor-based Detectors

Authors: Yen-Cheng Liu, Chih-Yao Ma, Zsolt Kira

Comments: Project Page is at this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[735] arXiv:2206.09504 [pdf, other]: Title: A Parallel Implementation of Computing Mean Average Precision

Authors: Beinan Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[736] arXiv:2206.09509 [pdf, ps, other]: Title: Hybrid Facial Expression Recognition (FER2013) Model for Real-Time Emotion Classification and Prediction

Authors: Ozioma Collins Oguine, Kanyifeechukwu Jane Oguine, Hashim Ibrahim Bisallah, Daniel Ofuani

Comments: 8 Pages, 8 Figures, 5 Tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Robotics (cs.RO)
[737] arXiv:2206.09541 [pdf, other]: Title: DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations

Authors: Ximeng Sun, Ping Hu, Kate Saenko

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[738] arXiv:2206.09548 [pdf, other]: Title: Variational Distillation for Multi-View Learning

Authors: Xudong Tian, Zhizhong Zhang, Cong Wang, Wensheng Zhang, Yanyun Qu, Lizhuang Ma, Zongze Wu, Yuan Xie, Dacheng Tao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[739] arXiv:2206.09552 [pdf, other]: Title: Dynamic Message Propagation Network for RGB-D Salient Object Detection

Authors: Baian Chen, Zhilei Chen, Xiaowei Hu, Jun Xu, Haoran Xie, Mingqiang Wei, Jing Qin

Comments: 12 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[740] arXiv:2206.09553 [pdf, other]: Title: Capturing and Inferring Dense Full-Body Human-Scene Contact

Authors: Chun-Hao P. Huang, Hongwei Yi, Markus Höschle, Matvey Safroshkin, Tsvetelina Alexiadis, Senya Polikovsky, Daniel Scharstein, Michael J. Black

Comments: CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[741] arXiv:2206.09554 [pdf, other]: Title: Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic Segmentation

Authors: Tao Chen, Yazhou Yao, Lei Zhang, Qiong Wang, Guo-Sen Xie, Fumin Shen

Comments: TMM2022, 11 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[742] arXiv:2206.09564 [pdf, other]: Title: A Novel Long-term Iterative Mining Scheme for Video Salient Object Detection

Authors: Chenglizhao Chen, Hengsen Wang, Yuming Fang, Chong Peng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[743] arXiv:2206.09575 [pdf, other]: Title: C-SENN: Contrastive Self-Explaining Neural Network

Authors: Yoshihide Sawada, Keigo Nakamura

Comments: 10 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[744] arXiv:2206.09581 [pdf, ps, other]: Title: Explicit and implicit models in infrared and visible image fusion

Authors: Zixuan Wang, Bin Sun

Comments: 8 pages, 5 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[745] arXiv:2206.09585 [pdf, other]: Title: 5th Place Solution for YouTube-VOS Challenge 2022: Video Object Segmentation

Authors: Wangwang Yang, Jinming Su, Yiting Duan, Tingyi Guo, Junfeng Luo

Comments: 5th Place Solution for Video Object Segmentation in the 4th Large-scale Video Object Segmentation Challenge, CVPR 2022 Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[746] arXiv:2206.09592 [pdf, other]: Title: DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection

Authors: Yunhao Ge, Jiashu Xu, Brian Nlong Zhao, Neel Joshi, Laurent Itti, Vibhav Vineet

Comments: v3 (same as v2): updated structure (added foreground generation, stable diffusion), added more experiments

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[747] arXiv:2206.09596 [pdf, other]: Title: Efficient and Flexible Sublabel-Accurate Energy Minimization

Authors: Zhakshylyk Nurlanov, Daniel Cremers, Florian Bernard

Comments: To be published at ICPR 2022, Copyright 2022 IEEE

Subjects: Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
[748] arXiv:2206.09597 [pdf, other]: Title: Winning the CVPR'2022 AQTC Challenge: A Two-stage Function-centric Approach

Authors: Shiwei Wu, Weidong He, Tong Xu, Hao Wang, Enhong Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[749] arXiv:2206.09604 [pdf, other]: Title: Distortion-Aware Network Pruning and Feature Reuse for Real-time Video Segmentation

Authors: Hyunsu Rhee, Dongchan Min, Sunil Hwang, Bruno Andreis, Sung Ju Hwang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[750] arXiv:2206.09664 [pdf, other]: Title: What Can be Seen is What You Get: Structure Aware Point Cloud Augmentation

Authors: Frederik Hasecke, Martin Alsfasser, Anton Kummert

Comments: Published in IEEE IV 2022

Journal-ref: 33rd IEEE Intelligent Vehicles Symposium, Aachen, Germany, June 5th - June 9th 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[751] arXiv:2206.09667 [pdf, other]: Title: MSANet: Multi-Similarity and Attention Guidance for Boosting Few-Shot Segmentation

Authors: Ehtesham Iqbal, Sirojbek Safarov, Seongdeok Bang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[752] arXiv:2206.09683 [pdf, other]: Title: Distribution Regularized Self-Supervised Learning for Domain Adaptation of Semantic Segmentation

Authors: Javed Iqbal, Hamza Rawal, Rehan Hafiz, Yu-Tseh Chi, Mohsen Ali

Comments: Accepted for publication at Image and Vision Computing

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[753] arXiv:2206.09731 [pdf, other]: Title: Semantic Labeling of High Resolution Images Using EfficientUNets and Transformers

Authors: Hasan AlMarzouqi, Lyes Saad Saoud

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[754] arXiv:2206.09736 [pdf, other]: Title: Geo-NI: Geometry-aware Neural Interpolation for Light Field Rendering

Authors: Gaochang Wu, Yuemei Zhou, Yebin Liu, Lu Fang, Tianyou Chai

Comments: 13 pages, 8 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[755] arXiv:2206.09742 [pdf, ps, other]: Title: Developing a Free and Open-source Automated Building Exterior Crack Inspection Software for Construction and Facility Managers

Authors: Pi Ko, Samuel A. Prieto, Borja Garcia de Soto

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[756] arXiv:2206.09753 [pdf, other]: Title: Visualizing and Understanding Contrastive Learning

Authors: Fawaz Sammani, Boris Joukovsky, Nikos Deligiannis

Comments: Accepted to IEEE Transactions on Image Processing

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[757] arXiv:2206.09756 [pdf, other]: Title: Time Gated Convolutional Neural Networks for Crop Classification

Authors: Longlong Weng, Yashu Kang, Kezhao Jiang, Chunlei Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[758] arXiv:2206.09769 [pdf, other]: Title: Test-time image-to-image translation ensembling improves out-of-distribution generalization in histopathology

Authors: Marin Scalbert, Maria Vakalopoulou, Florent Couzinié-Devy

Comments: Accepted at MICCAI2022 Conference

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[759] arXiv:2206.09770 [pdf, other]: Title: Real-time Full-stack Traffic Scene Perception for Autonomous Driving with Roadside Cameras

Authors: Zhengxia Zou, Rusheng Zhang, Shengyin Shen, Gaurav Pandey, Punarjay Chakravarty, Armin Parchami, Henry X. Liu

Comments: This paper is accepted and presented in ICRA 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[760] arXiv:2206.09796 [pdf, other]: Title: Knowledge Distillation for Oriented Object Detection on Aerial Images

Authors: Yicheng Xiao, Junpeng Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[761] arXiv:2206.09806 [pdf, other]: Title: Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval

Authors: Guile Wu, Chao Zhang, Stephan Liwicki

Comments: 10 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[762] arXiv:2206.09842 [pdf, other]: Title: Practical Deepfake Detection: Vulnerabilities in Global Contexts

Authors: Yang A. Chuming, Daniel J. Wu, Ken Hong

Comments: 6 pages, 6 figures, presented as a workshop paper at Responsible AI @ ICLR 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[763] arXiv:2206.09843 [pdf, other]: Title: Contextual Squeeze-and-Excitation for Efficient Few-Shot Image Classification

Authors: Massimiliano Patacchiola, John Bronskill, Aliaksandra Shysheya, Katja Hofmann, Sebastian Nowozin, Richard E. Turner

Comments: Advances in Neural Information Processing Systems (NeurIPS 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[764] arXiv:2206.09852 [pdf, other]: Title: M&M Mix: A Multimodal Multiview Transformer Ensemble

Authors: Xuehan Xiong, Anurag Arnab, Arsha Nagrani, Cordelia Schmid

Comments: Technical report for Epic-Kitchens challenge 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[765] arXiv:2206.09853 [pdf, other]: Title: DisCoVQA: Temporal Distortion-Content Transformers for Video Quality Assessment

Authors: Haoning Wu, Chaofeng Chen, Liang Liao, Jingwen Hou, Wenxiu Sun, Qiong Yan, Weisi Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[766] arXiv:2206.09885 [pdf, other]: Title: KOLOMVERSE: KRISO open large-scale image dataset for object detection in the maritime universe

Authors: Abhilasha Nanda, Sung Won Cho, Hyeopwoo Lee, Jin Hyoung Park

Comments: 13 Pages, 12 figures, submitted to NeurIPS 2022 Datasets and Benchmarks Track (Under Review)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[767] arXiv:2206.09900 [pdf, other]: Title: Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy Autoencoders

Authors: Chen Min, Xinli Xu, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai

Comments: Accepted by TIV

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[768] arXiv:2206.09907 [pdf, other]: Title: ORFD: A Dataset and Benchmark for Off-Road Freespace Detection

Authors: Chen Min, Weizhong Jiang, Dawei Zhao, Jiaolong Xu, Liang Xiao, Yiming Nie, Bin Dai

Comments: Accepted by ICRA2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[769] arXiv:2206.09959 [pdf, other]: Title: Global Context Vision Transformers

Authors: Ali Hatamizadeh, Hongxu Yin, Greg Heinrich, Jan Kautz, Pavlo Molchanov

Comments: Accepted to ICML 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[770] arXiv:2206.10033 [pdf, other]: Title: Test Time Transform Prediction for Open Set Histopathological Image Recognition

Authors: Adrian Galdran, Katherine J. Hewitt, Narmin L. Ghaffari, Jakob N. Kather, Gustavo Carneiro, Miguel A. González Ballester

Comments: Accepted to MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[771] arXiv:2206.10041 [pdf, other]: Title: MPA: MultiPath++ Based Architecture for Motion Prediction

Authors: Stepan Konev

Comments: CVPR 2022, Workshop on Autonomous Driving

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[772] arXiv:2206.10059 [pdf, other]: Title: Bypass Network for Semantics Driven Image Paragraph Captioning

Authors: Qi Zheng, Chaoyue Wang, Dadong Wang

Comments: Under consideration at Computer Vision and Image Understanding

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[773] arXiv:2206.10066 [pdf, other]: Title: RendNet: Unified 2D/3D Recognizer With Latent Space Rendering

Authors: Ruoxi Shi, Xinyang Jiang, Caihua Shan, Yansen Wang, Dongsheng Li

Comments: CVPR 2022 Oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[774] arXiv:2206.10075 [pdf, other]: Title: Counting Varying Density Crowds Through Density Guided Adaptive Selection CNN and Transformer Estimation

Authors: Yuehai Chen, Jing Yang, Badong Chen, Shaoyi Du

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[775] arXiv:2206.10080 [pdf, other]: Title: One-stage Action Detection Transformer

Authors: Lijun Li, Li'an Zhuo, Bang Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[776] arXiv:2206.10082 [pdf, other]: Title: Optimally Controllable Perceptual Lossy Compression

Authors: Zeyu Yan, Fei Wen, Peilin Liu

Comments: ICML 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[777] arXiv:2206.10090 [pdf, other]: Title: KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences

Authors: Xuanhan Wang, Lianli Gao, Yixuan Zhou, Jingkuan Song, Meng Wang

Journal-ref: Transactions on Circuits and Systems for Video Technology, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[778] arXiv:2206.10092 [pdf, other]: Title: BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection

Authors: Yinhao Li, Zheng Ge, Guanyi Yu, Jinrong Yang, Zengran Wang, Yukang Shi, Jianjian Sun, Zeming Li

Comments: Accepted by AAAI2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[779] arXiv:2206.10095 [pdf, other]: Title: Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation

Authors: Shuaicheng Li, Feng Zhang, Rui-Wei Zhao, Rui Feng, Kunlin Yang, Lingbo Liu, Jun Hou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[780] arXiv:2206.10096 [pdf, ps, other]: Title: Transformers Improve Breast Cancer Diagnosis from Unregistered Multi-View Mammograms

Authors: Xuxin Chen, Ke Zhang, Neman Abdoli, Patrik W. Gilley, Ximin Wang, Hong Liu, Bin Zheng, Yuchen Qiu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[781] arXiv:2206.10098 [pdf, other]: Title: Reconstruct from BEV: A 3D Lane Detection Approach based on Geometry Structure Prior

Authors: Chenguang Li, Jia Shi, Ya Wang, Guangliang Cheng

Comments: Proceedings of the CVPR 2022 Workshop of Autonomous Driving

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[782] arXiv:2206.10107 [pdf, other]: Title: Sensitivity of Average Precision to Bounding Box Perturbations

Authors: Ali Borji

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[783] arXiv:2206.10118 [pdf, other]: Title: HOPE: Hierarchical Spatial-temporal Network for Occupancy Flow Prediction

Authors: Yihan Hu, Wenxin Shao, Bo Jiang, Jiajie Chen, Siqi Chai, Zhening Yang, Jingyu Qian, Helong Zhou, Qiang Liu

Comments: 1st Ranking Solution for the Occupancy and Flow Prediction of the Waymo Open Dataset Challenges 2022 (this http URL)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[784] arXiv:2206.10129 [pdf, other]: Title: Automatic Concept Extraction for Concept Bottleneck-based Video Classification

Authors: Jeya Vikranth Jeyakumar, Luke Dickens, Luis Garcia, Yu-Hsi Cheng, Diego Ramirez Echavarria, Joseph Noor, Alessandra Russo, Lance Kaplan, Erik Blasch, Mani Srivastava

Comments: 10 pages, Appendix: 2 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[785] arXiv:2206.10131 [pdf, other]: Title: An Integrated Representation & Compression Scheme Based on Convolutional Autoencoders with 4D DCT Perceptual Encoding for High Dynamic Range Light Fields

Authors: Sally Khaidem, Mansi Sharma

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[786] arXiv:2206.10137 [pdf, other]: Title: Few-Max: Few-Shot Domain Adaptation for Unsupervised Contrastive Representation Learning

Authors: Ali Lotfi Rezaabad, Sidharth Kumar, Sriram Vishwanath, Jonathan I. Tamir

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[787] arXiv:2206.10145 [pdf, other]: Title: Deep Learning Eliminates Massive Dust Storms from Images of Tianwen-1

Authors: Hongyu Li, Jia Li, Xin Ren, Long Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[788] arXiv:2206.10146 [pdf, other]: Title: KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing

Authors: Xuanhan Wang, Jingkuan Song, Xiaojia Chen, Lechao Cheng, Lianli Gao, Heng Tao Shen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[789] arXiv:2206.10155 [pdf, other]: Title: Review Neural Networks about Image Transformation Based on IGC Learning Framework with Annotated Information

Authors: Yuanjie Yan, Suorong Yang, Yan Wang, Jian Zhao, Furao Shen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[790] arXiv:2206.10157 [pdf, other]: Title: Probing Visual-Audio Representation for Video Highlight Detection via Hard-Pairs Guided Contrastive Learning

Authors: Shuaicheng Li, Feng Zhang, Kunlin Yang, Lingbo Liu, Shinan Liu, Jun Hou, Shuai Yi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[791] arXiv:2206.10177 [pdf, other]: Title: TCJA-SNN: Temporal-Channel Joint Attention for Spiking Neural Networks

Authors: Rui-Jie Zhu, Malu Zhang, Qihang Zhao, Haoyu Deng, Yule Duan, Liang-Jian Deng

Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[792] arXiv:2206.10186 [pdf, other]: Title: Improving Localization for Semi-Supervised Object Detection

Authors: Leonardo Rossi, Akbar Karimi, Andrea Prati

Journal-ref: International Conference on Image Analysis and Processing. Springer, Cham, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[793] arXiv:2206.10192 [pdf, other]: Title: LDD: A Dataset for Grape Diseases Object Detection and Instance Segmentation

Authors: Leonardo Rossi, Marco Valenti, Sara Elisabetta Legler, Andrea Prati

Journal-ref: International Conference on Image Analysis and Processing. Springer, Cham, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[794] arXiv:2206.10207 [pdf, other]: Title: SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders

Authors: Gang Li, Heliang Zheng, Daqing Liu, Chaoyue Wang, Bing Su, Changwen Zheng

Comments: Accepted by NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[795] arXiv:2206.10213 [pdf, other]: Title: Rethinking Unsupervised Neural Superpixel Segmentation

Authors: Moshe Eliasof, Nir Ben Zikri, Eran Treister

Comments: ICIP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[796] arXiv:2206.10225 [pdf, other]: Title: Broken News: Making Newspapers Accessible to Print-Impaired

Authors: Vishal Agarwal, Tanuja Ganu, Saikat Guha

Journal-ref: Extended Abstract at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[797] arXiv:2206.10241 [pdf, other]: Title: Deep Active Latent Surfaces for Medical Geometries

Authors: Patrick M. Jensen, Udaranga Wickramasinghe, Anders B. Dahl, Pascal Fua, Vedrana A. Dahl

Comments: 14 pages, 9 figures, submitted for review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[798] arXiv:2206.10253 [pdf, other]: Title: Document Navigability: A Need for Print-Impaired

Authors: Anukriti Kumar, Tanuja Ganu, Saikat Guha

Comments: Published at Accessibility, Vision, and Autonomy Meet, CVPR 2022 Workshop

Journal-ref: Extended Abstract for Poster Session at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[799] arXiv:2206.10254 [pdf, other]: Title: Towards Optimizing OCR for Accessibility

Authors: Peya Mowar, Tanuja Ganu, Saikat Guha

Journal-ref: Extended Abstract for Poster Session at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[800] arXiv:2206.10263 [pdf, other]: Title: Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping

Authors: Davide Tateo, Davide Antonio Cucci, Matteo Matteucci, Andrea Bonarini

Comments: submitted to IROS 2015 (rejected)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[801] arXiv:2206.10324 [pdf, other]: Title: Online progressive instance-balanced sampling for weakly supervised object detection

Authors: M. Chen, Y. Tian, Z. Li, E. Li, Z. Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[802] arXiv:2206.10329 [pdf, other]: Title: SVG Vector Font Generation for Chinese Characters with Transformer

Authors: Haruka Aoki, Kiyoharu Aizawa

Comments: Accepted to ICIP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[803] arXiv:2206.10360 [pdf, other]: Title: Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal Loss

Authors: Yikang Ding, Zhenyang Li, Dihe Huang, Zhiheng Li, Kai Zhang

Comments: 5 pages, 3 figures; Accepted to ICIP2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[804] arXiv:2206.10375 [pdf, other]: Title: MEStereo-Du2CNN: A Novel Dual Channel CNN for Learning Robust Depth Estimates from Multi-exposure Stereo Images for HDR 3D Applications

Authors: Rohit Choudhary, Mansi Sharma, Uma T V, Rithvik Anil

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[805] arXiv:2206.10411 [pdf, other]: Title: Audio-video fusion strategies for active speaker detection in meetings

Authors: Lionel Pibre, Francisco Madrigal, Cyrille Equoy, Frédéric Lerasle, Thomas Pellegrini, Julien Pinquier, Isabelle Ferrané

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[806] arXiv:2206.10436 [pdf, other]: Title: Transformer-Based Multi-modal Proposal and Re-Rank for Wikipedia Image-Caption Matching

Authors: Nicola Messina, Davide Alessandro Coccomini, Andrea Esuli, Fabrizio Falchi

Comments: Accepted for publication at the Wiki-M3L workshop, co-located with ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[807] arXiv:2206.10457 [pdf, other]: Title: Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh Recovery

Authors: Zhenzhen Weng, Kuan-Chieh Wang, Angjoo Kanazawa, Serena Yeung

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[808] arXiv:2206.10465 [pdf, other]: Title: An Overview of Privacy-enhancing Technologies in Biometric Recognition

Authors: Pietro Melzi, Christian Rathgeb, Ruben Tolosana, Ruben Vera-Rodriguez, Christoph Busch

Comments: 12 pages, 2 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[809] arXiv:2206.10491 [pdf, other]: Title: Bi-Calibration Networks for Weakly-Supervised Video Representation Learning

Authors: Fuchen Long, Ting Yao, Zhaofan Qiu, Xinmei Tian, Jiebo Luo, Tao Mei

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[810] arXiv:2206.10520 [pdf, other]: Title: SFace: Privacy-friendly and Accurate Face Recognition using Synthetic Data

Authors: Fadi Boutros, Marco Huber, Patrick Siebke, Tim Rieber, Naser Damer

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[811] arXiv:2206.10526 [pdf, other]: Title: QuantFace: Towards Lightweight Face Recognition by Synthetic Data Low-bit Quantization

Authors: Fadi Boutros, Naser Damer, Arjan Kuijper

Comments: Accepted at ICPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[812] arXiv:2206.10531 [pdf, other]: Title: Neural Transformers for Intraductal Papillary Mucosal Neoplasms (IPMN) Classification in MRI images

Authors: Federica Proietto Salanitri, Giovanni Bellitto, Simone Palazzo, Ismail Irmakci, Michael B. Wallace, Candice W. Bolan, Megan Engels, Sanne Hoogenboom, Marco Aldinucci, Ulas Bagci, Daniela Giordano, Concetto Spampinato

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[813] arXiv:2206.10535 [pdf, other]: Title: EpiGRAF: Rethinking training of 3D GANs

Authors: Ivan Skorokhodov, Sergey Tulyakov, Yiqun Wang, Peter Wonka

Comments: NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[814] arXiv:2206.10536 [pdf, other]: Title: HealNet -- Self-Supervised Acute Wound Heal-Stage Classification

Authors: Héctor Carrión, Mohammad Jafari, Hsin-Ya Yang, Roslyn Rivkah Isseroff, Marco Rolandi, Marcella Gomez, Narges Norouzi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[815] arXiv:2206.10552 [pdf, other]: Title: Vicinity Vision Transformer

Authors: Weixuan Sun, Zhen Qin, Hui Deng, Jianyuan Wang, Yi Zhang, Kaihao Zhang, Nick Barnes, Stan Birchfield, Lingpeng Kong, Yiran Zhong

Comments: code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[816] arXiv:2206.10555 [pdf, other]: Title: LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs

Authors: Yukang Chen, Jianhui Liu, Xiangyu Zhang, Xiaojuan Qi, Jiaya Jia

Comments: In CVPR 2023. Code is at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[817] arXiv:2206.10562 [pdf, other]: Title: Semantics-Depth-Symbiosis: Deeply Coupled Semi-Supervised Learning of Semantics and Depth

Authors: Nitin Bansal, Pan Ji, Junsong Yuan, Yi Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[818] arXiv:2206.10571 [pdf, other]: Title: Toward Unpaired Multi-modal Medical Image Segmentation via Learning Structured Semantic Consistency

Authors: Jie Yang, Ye Zhu, Chaoqun Wang, Zhen Li, Ruimao Zhang

Comments: MIDL23

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[819] arXiv:2206.10573 [pdf, ps, other]: Title: H&E-based Computational Biomarker Enables Universal EGFR Screening for Lung Adenocarcinoma

Authors: Gabriele Campanella, David Ho, Ida Häggström, Anton S Becker, Jason Chang, Chad Vanderbilt, Thomas J Fuchs

Subjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[820] arXiv:2206.10587 [pdf, ps, other]: Title: Guiding Visual Attention in Deep Convolutional Neural Networks Based on Human Eye Movements

Authors: Leonard E. van Dyck, Sebastian J. Denzler, Walter R. Gruber

Comments: 28 pages, 6 figures, 3 supplementary figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[821] arXiv:2206.10589 [pdf, other]: Title: EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications

Authors: Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman Khan, Syed Waqas Zamir, Rao Muhammad Anwer, Fahad Shahbaz Khan

Comments: Accepted at ECCVW 2022 (Oral, CADL: Computational Aspects of Deep Learning)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[822] arXiv:2206.10590 [pdf, other]: Title: Temporally Consistent Semantic Video Editing

Authors: Yiran Xu, Badour AlBahar, Jia-Bin Huang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[823] arXiv:2206.10665 [pdf, other]: Title: BOSS: A Benchmark for Human Belief Prediction in Object-context Scenarios

Authors: Jiafei Duan, Samson Yu, Nicholas Tan, Li Yi, Cheston Tan

Comments: 9 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[824] arXiv:2206.10673 [pdf, ps, other]: Title: Natural Backdoor Datasets

Authors: Emily Wenger, Roma Bhattacharjee, Arjun Nitin Bhagoji, Josephine Passananti, Emilio Andere, Haitao Zheng, Ben Y. Zhao

Comments: 18 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[825] arXiv:2206.10690 [pdf, other]: Title: Learning Continuous Rotation Canonicalization with Radial Beam Sampling

Authors: Johann Schmidt, Sebastian Stober

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[826] arXiv:2206.10692 [pdf, other]: Title: Multi-level Domain Adaptation for Lane Detection

Authors: Chenguang Li, Boheng Zhang, Jia Shi, Guangliang Cheng

Comments: Proceedings of the CVPR 2022 Workshop of Autonomous Driving

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[827] arXiv:2206.10698 [pdf, other]: Title: TiCo: Transformation Invariance and Covariance Contrast for Self-Supervised Visual Representation Learning

Authors: Jiachen Zhu, Rafael M. Moraes, Serkan Karakulak, Vlad Sobol, Alfredo Canziani, Yann LeCun

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[828] arXiv:2206.10711 [pdf, other]: Title: Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive Learning

Authors: Alexander Jaus, Kailun Yang, Rainer Stiefelhagen

Comments: Accepted to IEEE Transactions on Intelligent Transportation Systems (T-ITS). Extended version of arXiv:2103.00868. The project is at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[829] arXiv:2206.10737 [pdf, other]: Title: Deep Metric Color Embeddings for Splicing Localization in Severely Degraded Images

Authors: Benjamin Hadwiger, Christian Riess

Comments: 14 pages, 13 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[830] arXiv:2206.10779 [pdf, other]: Title: Not Just Streaks: Towards Ground Truth for Single Image Deraining

Authors: Yunhao Ba, Howard Zhang, Ethan Yang, Akira Suzuki, Arnold Pfahnl, Chethan Chinder Chandrappa, Celso de Melo, Suya You, Stefano Soatto, Alex Wong, Achuta Kadambi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[831] arXiv:2206.10789 [pdf, other]: Title: Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

Authors: Jiahui Yu, Yuanzhong Xu, Jing Yu Koh, Thang Luong, Gunjan Baid, Zirui Wang, Vijay Vasudevan, Alexander Ku, Yinfei Yang, Burcu Karagol Ayan, Ben Hutchinson, Wei Han, Zarana Parekh, Xin Li, Han Zhang, Jason Baldridge, Yonghui Wu

Comments: Preprint

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[832] arXiv:2206.10809 [pdf, other]: Title: SSMI: How to Make Objects of Interest Disappear without Accessing Object Detectors?

Authors: Hui Xia, Rui Zhang, Zi Kang, Shuliang Jiang

Comments: 6 pages, 2 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[833] arXiv:2206.10821 [pdf, other]: Title: Coupling Visual Semantics of Artificial Neural Networks and Human Brain Function via Synchronized Activations

Authors: Lin Zhao, Haixing Dai, Zihao Wu, Zhenxiang Xiao, Lu Zhang, David Weizhong Liu, Xintao Hu, Xi Jiang, Sheng Li, Dajiang Zhu, Tianming Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[834] arXiv:2206.10830 [pdf, other]: Title: A Feature Memory Rearrangement Network for Visual Inspection of Textured Surface Defects Toward Edge Intelligent Manufacturing

Authors: Haiming Yao, Wenyong Yu, Xue Wang

Comments: Revision to IEEE Transactions on Automation Science and Engineering

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[835] arXiv:2206.10831 [pdf, other]: Title: MultiEarth 2022 Deforestation Challenge -- ForestGump

Authors: Dongoo Lee, Yeonju Choi

Comments: CVPR 2022, MultiEarth 2022, Deforestation Estimation Challenge

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[836] arXiv:2206.10845 [pdf, other]: Title: Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance Segmentation

Authors: Ming Li, Jie Wu, Jinhang Cai, Jie Qin, Yuxi Ren, Xuefeng Xiao, Min Zheng, Rui Wang, Xin Pan

Comments: 1st place solution in the AVA Accessibility Vision and Autonomy Challenge at the CVPR 2022 workshop. Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[837] arXiv:2206.10861 [pdf, other]: Title: UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022

Authors: Yuanhang Zhang, Susan Liang, Shuang Yang, Shiguang Shan

Comments: 5 pages, 3 figures; technical report for AVA Challenge (see this https URL) at the International Challenge on Activity Recognition (ActivityNet), CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[838] arXiv:2206.10869 [pdf, other]: Title: NVIDIA-UNIBZ Submission for EPIC-KITCHENS-100 Action Anticipation Challenge 2022

Authors: Tsung-Ming Tai, Oswald Lanz, Giuseppe Fiameni, Yi-Kwan Wong, Sze-Sen Poon, Cheng-Kuang Lee, Ka-Chun Cheung, Simon See

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[839] arXiv:2206.10878 [pdf, other]: Title: Feature Re-calibration based Multiple Instance Learning for Whole Slide Image Classification

Authors: Philip Chikontwe, Soo Jeong Nam, Heounjeong Go, Meejeong Kim, Hyun Jung Sung, Sang Hyun Park

Comments: MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[840] arXiv:2206.10879 [pdf, other]: Title: Symmetric Network with Spatial Relationship Modeling for Natural Language-based Vehicle Retrieval

Authors: Chuyang Zhao, Haobo Chen, Wenyuan Zhang, Junru Chen, Sipeng Zhang, Yadong Li, Boxun Li

Comments: 8 pages, 3 figures, published at CVPRW

Journal-ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022: 3226-3233

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[841] arXiv:2206.10885 [pdf, other]: Title: KiloNeuS: A Versatile Neural Implicit Surface Representation for Real-Time Rendering

Authors: Stefano Esposito, Daniele Baieri, Stefan Zellmann, André Hinkenjann, Emanuele Rodolà

Comments: 9 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[842] arXiv:2206.10886 [pdf, other]: Title: Optical Flow Regularization of Implicit Neural Representations for Video Frame Interpolation

Authors: Weihao Zhuang, Tristan Hascoet, Ryoichi Takashima, Tetsuya Takiguchi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[843] arXiv:2206.10892 [pdf, other]: Title: I^2R-Net: Intra- and Inter-Human Relation Network for Multi-Person Pose Estimation

Authors: Yiwei Ding, Wenjin Deng, Yinglin Zheng, Pengfei Liu, Meihong Wang, Xuan Cheng, Jianmin Bao, Dong Chen, Ming Zeng

Comments: Accepted by IJCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[844] arXiv:2206.10902 [pdf, other]: Title: S2TNet: Spatio-Temporal Transformer Networks for Trajectory Prediction in Autonomous Driving

Authors: Weihuang Chen, Fangfang Wang, Hongbin Sun

Comments: Accepted by ACML2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[845] arXiv:2206.10903 [pdf, ps, other]: Title: UniUD-FBK-UB-UniBZ Submission to the EPIC-Kitchens-100 Multi-Instance Retrieval Challenge 2022

Authors: Alex Falcon, Giuseppe Serra, Sergio Escalera, Oswald Lanz

Comments: Ranked joint 1st place in the Multi-Instance Action Retrieval Challenge organized at EPIC@CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[846] arXiv:2206.10910 [pdf, other]: Title: SpA-Former: Transformer image shadow detection and removal via spatial attention

Authors: Xiao Feng Zhang, Chao Chen Gu, Shan Ying Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[847] arXiv:2206.10915 [pdf, other]: Title: Understanding the effect of sparsity on neural networks robustness

Authors: Lukas Timpl, Rahim Entezari, Hanie Sedghi, Behnam Neyshabur, Olga Saukh

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[848] arXiv:2206.10965 [pdf, other]: Title: Polar Parametrization for Vision-based Surround-View 3D Detection

Authors: Shaoyu Chen, Xinggang Wang, Tianheng Cheng, Qian Zhang, Chang Huang, Wenyu Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[849] arXiv:2206.10969 [pdf, other]: Title: Single Morphing Attack Detection using Siamese Network and Few-shot Learning

Authors: Juan Tapia, Daniel Schulz, Christoph Busch

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[850] arXiv:2206.10988 [pdf, other]: Title: AdvSmo: Black-box Adversarial Attack by Smoothing Linear Structure of Texture

Authors: Hui Xia, Rui Zhang, Shuliang Jiang, Zi Kang

Comments: 6 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[851] arXiv:2206.10989 [pdf, other]: Title: Identity Documents Authentication based on Forgery Detection of Guilloche Pattern

Authors: Musab Al-Ghadi, Zuheng Ming, Petra Gomez-Krämer, Jean-Christophe Burie

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[852] arXiv:2206.10996 [pdf, other]: Title: ProtoCLIP: Prototypical Contrastive Language Image Pretraining

Authors: Delong Chen, Zhao Wu, Fan Liu, Zaiquan Yang, Huaxi Huang, Ying Tan, Erjin Zhou

Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[853] arXiv:2206.11011 [pdf, other]: Title: Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning

Authors: Jia-Run Du, Jia-Chang Feng, Kun-Yu Lin, Fa-Ting Hong, Xiao-Ming Wu, Zhongang Qi, Ying Shan, Wei-Shi Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[854] arXiv:2206.11053 [pdf, other]: Title: Surgical-VQA: Visual Question Answering in Surgical Scenes using Transformer

Authors: Lalithkumar Seenivasan, Mobarakol Islam, Adithya K Krishna, Hongliang Ren

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO); Image and Video Processing (eess.IV)
[855] arXiv:2206.11080 [pdf, other]: Title: Motion Gait: Gait Recognition via Motion Excitation

Authors: Yunpeng Zhang, Zhengyou Wang, Shanna Zhuang, Hui Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[856] arXiv:2206.11095 [pdf, other]: Title: A High Resolution Multi-exposure Stereoscopic Image & Video Database of Natural Scenes

Authors: Rohit Choudhary, Mansi Sharma, Aditya Wadaskar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[857] arXiv:2206.11115 [pdf, other]: Title: ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition Canvas

Authors: Prathmesh Madhu, Tilman Marquart, Ronak Kosti, Dirk Suckow, Peter Bell, Andreas Maier, Vincent Christlein

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[858] arXiv:2206.11134 [pdf, other]: Title: Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization

Authors: Peixian Chen, Kekai Sheng, Mengdan Zhang, Mingbao Lin, Yunhang Shen, Shaohui Lin, Bo Ren, Ke Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[859] arXiv:2206.11180 [pdf, other]: Title: Optimal transport meets noisy label robust loss and MixUp regularization for domain adaptation

Authors: Kilian Fatras, Hiroki Naganuma, Ioannis Mitliagkas

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[860] arXiv:2206.11203 [pdf, other]: Title: Facke: a Survey on Generative Models for Face Swapping

Authors: Wei Jiang, Wentao Dong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[861] arXiv:2206.11212 [pdf, other]: Title: VisFIS: Visual Feature Importance Supervision with Right-for-the-Right-Reason Objectives

Authors: Zhuofan Ying, Peter Hase, Mohit Bansal

Comments: NeurIPS 2022 (first two authors contributed equally)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[862] arXiv:2206.11215 [pdf, other]: Title: Certifiable 3D Object Pose Estimation: Foundations, Learning Models, and Self-Training

Authors: Rajat Talak, Lisa Peng, Luca Carlone

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[863] arXiv:2206.11250 [pdf, other]: Title: Depth-aware Glass Surface Detection with Cross-modal Context Mining

Authors: Jiaying Lin, Yuen Hei Yeung, Rynson W.H. Lau

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[864] arXiv:2206.11253 [pdf, other]: Title: Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Authors: Shangchen Zhou, Kelvin C.K. Chan, Chongyi Li, Chen Change Loy

Comments: Accepted by NeurIPS 2022. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[865] arXiv:2206.11352 [pdf, ps, other]: Title: Doubly Reparameterized Importance Weighted Structure Learning for Scene Graph Generation

Authors: Daqi Liu, Miroslaw Bober, Josef Kittler

Comments: arXiv admin note: substantial text overlap with arXiv:2205.07017

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[866] arXiv:2206.11358 [pdf, other]: Title: Monocular Spherical Depth Estimation with Explicitly Connected Weak Layout Cues

Authors: Nikolaos Zioulis, Federico Alvarez, Dimitrios Zarpalas, Petros Daras

Comments: Project page at this https URL

Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing, Volume 183, January 2022, Pages 269-285

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[867] arXiv:2206.11404 [pdf, other]: Title: The ArtBench Dataset: Benchmarking Generative Models with Artworks

Authors: Peiyuan Liao, Xiuyu Li, Xihui Liu, Kurt Keutzer

Comments: The first two authors contributed equally to this work. The code and data are available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[868] arXiv:2206.11428 [pdf, other]: Title: LidarMultiNet: Unifying LiDAR Semantic Segmentation, 3D Object Detection, and Panoptic Segmentation in a Single Multi-task Network

Authors: Dongqiangzi Ye, Weijia Chen, Zixiang Zhou, Yufei Xie, Yu Wang, Panqu Wang, Hassan Foroosh

Comments: Official 1st Place Solution for the Waymo Open Dataset Challenges 2022 - 3D Semantic Segmentation. Official leaderboard: this https URL. CVPR 2022 Workshop on Autonomous Driving: this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[869] arXiv:2206.11443 [pdf, other]: Title: Image-based Stability Quantification

Authors: Jesse Scott, John Challis, Robert T. Collins, Yanxi Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[870] arXiv:2206.11459 [pdf, other]: Title: Explore Spatio-temporal Aggregation for Insubstantial Object Detection: Benchmark Dataset and Baseline

Authors: Kailai Zhou, Yibo Wang, Tao Lv, Yunqian Li, Linsen Chen, Qiu Shen, Xun Cao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[871] arXiv:2206.11462 [pdf, ps, other]: Title: ICME 2022 Few-shot LOGO detection top 9 solution

Authors: Ka Ho Tong, Ka Wai Cheung, Xiaochuan Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[872] arXiv:2206.11473 [pdf, other]: Title: Complementary datasets to COCO for object detection

Authors: Ali Borji

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[873] arXiv:2206.11474 [pdf, other]: Title: Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation

Authors: Shengming Li, Guangcong Zheng, Hui Wang, Taiping Yao, Yang Chen, Shoudong Ding, Xi Li

Comments: 24 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[874] arXiv:2206.11476 [pdf, other]: Title: Dynamic Scene Deblurring Based on Continuous Cross-Layer Attention Transmission

Authors: Xia Hua, Mingxin Li, Junxiong Fei, Yu Shi, JianGuo Liu, Hanyu Hong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[875] arXiv:2206.11493 [pdf, other]: Title: Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization

Authors: Kun Xia, Le Wang, Sanping Zhou, Nanning Zheng, Wei Tang

Comments: Accepted by CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[876] arXiv:2206.11499 [pdf, other]: Title: Parallel Structure from Motion for UAV Images via Weighted Connected Dominating Set

Authors: San Jiang, Qingquan Li, Wanshou Jiang, Wu Chen

Comments: 14 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[877] arXiv:2206.11502 [pdf, ps, other]: Title: A Review of Published Machine Learning Natural Language Processing Applications for Protocolling Radiology Imaging

Authors: Nihal Raju (5), Michael Woodburn (1 and 5), Stefan Kachel (2 and 3), Jack O'Shaughnessy (5), Laurence Sorace (5), Natalie Yang (2), Ruth P Lim (2 and 4) ((1) Harvard University, Extension School, Cambridge, MA, USA, (2) Department of Radiology, The University of Melbourne, Parkville, (3) Department of Radiology, Columbia University in the City of New York, (4) Department of Surgery, Austin, The University of Melbourne, (5) Austin Hospital, Austin Health, Melbourne, Australia)

Comments: 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[878] arXiv:2206.11520 [pdf, other]: Title: ICOS Protein Expression Segmentation: Can Transformer Networks Give Better Results?

Authors: Vivek Kumar Singh, Paul O Reilly, Jacqueline James, Manuel Salto Tellez, Perry Maxwell

Comments: Accepted MIUA conference (Abstract short paper)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[879] arXiv:2206.11541 [pdf, other]: Title: A Neuromorphic Vision-Based Measurement for Robust Relative Localization in Future Space Exploration Missions

Authors: Mohammed Salah, Mohammed Chehadah, Muhammed Humais, Mohammed Wahbah, Abdulla Ayyad, Rana Azzam, Lakmal Seneviratne, Yahya Zweiri

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[880] arXiv:2206.11589 [pdf, other]: Title: Learning Towards the Largest Margins

Authors: Xiong Zhou, Xianming Liu, Deming Zhai, Junjun Jiang, Xin Gao, Xiangyang Ji

Comments: ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[881] arXiv:2206.11610 [pdf, other]: Title: 1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)

Authors: Dong An, Zun Wang, Yangguang Li, Yi Wang, Yicong Hong, Yan Huang, Liang Wang, Jing Shao

Comments: Winner of the 2nd RxR-Habitat Competition @ CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[882] arXiv:2206.11629 [pdf, other]: Title: Global Sensing and Measurements Reuse for Image Compressed Sensing

Authors: Zi-En Fan, Feng Lian, Jia-Ni Quan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[883] arXiv:2206.11653 [pdf, other]: Title: Learning To Generate Scene Graph from Head to Tail

Authors: Chaofan Zheng, Xinyu Lyu, Yuyu Guo, Pengpeng Zeng, Jingkuan Song, Lianli Gao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[884] arXiv:2206.11657 [pdf, other]: Title: Warped Convolutional Networks: Bridge Homography to sl(3) algebra by Group Convolution

Authors: Xinrui Zhan, Yang Li, Wenyu Liu, Jianke Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[885] arXiv:2206.11678 [pdf, other]: Title: BlazePose GHUM Holistic: Real-time 3D Human Landmarks and Pose Estimation

Authors: Ivan Grishchenko, Valentin Bazarevsky, Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Zanfir, Richard Yee, Karthik Raveendran, Matsvei Zhdanovich, Matthias Grundmann, Cristian Sminchisescu

Comments: 4 pages, 4 figures; CVPR Workshop on Computer Vision for Augmented and Virtual Reality, New Orleans, LA, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[886] arXiv:2206.11695 [pdf, other]: Title: NTIRE 2022 Challenge on Perceptual Image Quality Assessment

Authors: Jinjin Gu, Haoming Cai, Chao Dong, Jimmy S. Ren, Radu Timofte

Comments: This report has been published in CVPR 2022 NTIRE workshop. arXiv admin note: text overlap with arXiv:2105.03072

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[887] arXiv:2206.11723 [pdf, other]: Title: Self-Supervised Training with Autoencoders for Visual Anomaly Detection

Authors: Alexander Bauer, Shinichi Nakajima, Klaus-Robert Müller

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[888] arXiv:2206.11736 [pdf, other]: Title: NovelCraft: A Dataset for Novelty Detection and Discovery in Open Worlds

Authors: Patrick Feeney, Sarah Schneider, Panagiotis Lymperopoulos, Li-Ping Liu, Matthias Scheutz, Michael C. Hughes

Comments: Published in Transactions on Machine Learning Research (03/2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[889] arXiv:2206.11739 [pdf, other]: Title: Evidence fusion with contextual discounting for multi-modality medical image segmentation

Authors: Ling Huang, Thierry Denoeux, Pierre Vera, Su Ruan

Comments: MICCAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[890] arXiv:2206.11752 [pdf, other]: Title: CLAMP: Prompt-based Contrastive Learning for Connecting Language and Animal Pose

Authors: Xu Zhang, Wen Wang, Zhe Chen, Yufei Xu, Jing Zhang, Dacheng Tao

Comments: CVPR2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[891] arXiv:2206.11759 [pdf, other]: Title: What makes you, you? Analyzing Recognition by Swapping Face Parts

Authors: Claudio Ferrari, Matteo Serpentoni, Stefano Berretti, Alberto Del Bimbo

Comments: Accepted for publication at the 26th International Conference on Pattern Recognition (ICPR), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[892] arXiv:2206.11768 [pdf, other]: Title: FitGAN: Fit- and Shape-Realistic Generative Adversarial Networks for Fashion

Authors: Sonia Pecenakova, Nour Karessli, Reza Shirvany

Comments: 26th International Conference on Pattern Recognition (ICPR) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[893] arXiv:2206.11804 [pdf, other]: Title: Rethinking Surgical Instrument Segmentation: A Background Image Can Be All You Need

Authors: An Wang, Mobarakol Islam, Mengya Xu, Hongliang Ren

Comments: 10 pages, MICCAI2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[894] arXiv:2206.11808 [pdf, other]: Title: Unseen Object 6D Pose Estimation: A Benchmark and Baselines

Authors: Minghao Gou, Haolin Pan, Hao-Shu Fang, Ziyuan Liu, Cewu Lu, Ping Tan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[895] arXiv:2206.11825 [pdf, other]: Title: YOLOSA: Object detection based on 2D local feature superimposed self-attention

Authors: Weisheng Li, Lin Huang

Comments: This paper is under consideration at Pattern Recognition Letters

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[896] arXiv:2206.11826 [pdf, other]: Title: Toward Clinically Assisted Colorectal Polyp Recognition via Structured Cross-modal Representation Consistency

Authors: Weijie Ma, Ye Zhu, Ruimao Zhang, Jie Yang, Yiwen Hu, Zhen Li, Li Xiang

Comments: Early Accepted by MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[897] arXiv:2206.11892 [pdf, other]: Title: DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Change Detection

Authors: Wele Gedara Chaminda Bandara, Nithin Gopalakrishnan Nair, Vishal M. Patel

Comments: Code available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[898] arXiv:2206.11894 [pdf, other]: Title: MaskViT: Masked Visual Pre-Training for Video Prediction

Authors: Agrim Gupta, Stephen Tian, Yunzhi Zhang, Jiajun Wu, Roberto Martín-Martín, Li Fei-Fei

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[899] arXiv:2206.11895 [pdf, other]: Title: Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space

Authors: Jinghuan Shang, Srijan Das, Michael S. Ryoo

Comments: NeurIPS 2022. Our code is at this https URL. Our project page is at this https URL. v3, v4 for minor updates on figures and visualizations

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[900] arXiv:2206.11896 [pdf, other]: Title: EventNeRF: Neural Radiance Fields from a Single Colour Event Camera

Authors: Viktor Rudnev, Mohamed Elgharib, Christian Theobalt, Vladislav Golyanik

Comments: 19 pages, 21 figures, 3 tables; CVPR 2023

Journal-ref: Computer Vision and Pattern Recognition (CVPR) 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[901] arXiv:2206.11920 [pdf, other]: Title: Agriculture-Vision Challenge 2022 -- The Runner-Up Solution for Agricultural Pattern Recognition via Transformer-based Models

Authors: Zhicheng Yang, Jui-Hsin Lai, Jun Zhou, Hang Zhou, Chen Du, Zhongcheng Lai

Comments: CVPR 2022, Agriculture-Vision Challenge, Remote Sensing

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[902] arXiv:2206.11927 [pdf, other]: Title: Towards Galaxy Foundation Models with Hybrid Contrastive Learning

Authors: Mike Walmsley, Inigo Val Slijepcevic, Micah Bowles, Anna M. M. Scaife

Comments: Accepted at the ICML 2022 Workshop on Machine Learning for Astrophysics. Data: www.github.com/mwalmsley/pytorch-galaxy-datasets. Please reach out to share your labelled data - all contributions will be credited in future work

Subjects: Computer Vision and Pattern Recognition (cs.CV); Astrophysics of Galaxies (astro-ph.GA)
[903] arXiv:2206.11952 [pdf, other]: Title: UNeRF: Time and Memory Conscious U-Shaped Network for Training Neural Radiance Fields

Authors: Abiramy Kuganesan, Shih-yang Su, James J. Little, Helge Rhodin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[904] arXiv:2206.12035 [pdf, other]: Title: The Second Place Solution for The 4th Large-scale Video Object Segmentation Challenge--Track 3: Referring Video Object Segmentation

Authors: Leilei Cao, Zhuang Li, Bo Yan, Feng Zhang, Fengliang Qi, Yuchen Hu, Hongbin Wang

Comments: 4 pages, 2 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[905] arXiv:2206.12043 [pdf, other]: Title: Protecting President Zelenskyy against Deep Fakes

Authors: Matyáš Boháček, Hany Farid

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[906] arXiv:2206.12046 [pdf, other]: Title: Bilateral Network with Channel Splitting Network and Transformer for Thermal Image Super-Resolution

Authors: Bo Yan, Leilei Cao, Fengliang Qi, Hongbin Wang

Comments: The second place solution for CVPR2022 PBVS-TISR challenge

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[907] arXiv:2206.12055 [pdf, other]: Title: SDF-StyleGAN: Implicit SDF-Based StyleGAN for 3D Shape Generation

Authors: Xin-Yang Zheng, Yang Liu, Peng-Shuai Wang, Xin Tong

Comments: Accepted to Computer Graphics Forum (SGP), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[908] arXiv:2206.12063 [src]: Title: Mutual Information-guided Knowledge Transfer for Novel Class Discovery

Authors: Chuyu Zhang, Chuanyang Hu, Ruijie Xu, Zhitong Gao, Qian He, Xuming He

Comments: The derivation of Mutual Information in the manuscript is wrong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[909] arXiv:2206.12071 [pdf, other]: Title: Contrastive Learning of Features between Images and LiDAR

Authors: Peng Jiang, Srikanth Saripalli

Comments: accepted in CASE2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[910] arXiv:2206.12073 [pdf, other]: Title: MaskRange: A Mask-classification Model for Range-view based LiDAR Segmentation

Authors: Yi Gu, Yuming Huang, Chengzhong Xu, Hui Kong

Comments: Under review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[911] arXiv:2206.12099 [pdf, ps, other]: Title: A novel approach for glaucoma classification by wavelet neural networks using graph-based, statistical features of qualitatively improved images

Authors: N. Krishna Santosh, Dr. Soubhagya Sankar Barpanda

Comments: 25 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[912] arXiv:2206.12117 [pdf, other]: Title: Self Supervised Learning for Few Shot Hyperspectral Image Classification

Authors: Nassim Ait Ali Braham, Lichao Mou, Jocelyn Chanussot, Julien Mairal, Xiao Xiang Zhu

Comments: Accepted in IGARSS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[913] arXiv:2206.12123 [pdf, ps, other]: Title: Some theoretical results on discrete contour trees

Authors: Yuqing Song

Comments: 5 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
[914] arXiv:2206.12126 [pdf, other]: Title: Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning

Authors: Cheng Tan, Zhangyang Gao, Lirong Wu, Yongjie Xu, Jun Xia, Siyuan Li, Stan Z. Li

Comments: Accepted by CVPR 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[915] arXiv:2206.12128 [pdf, other]: Title: Excavating RoI Attention for Underwater Object Detection

Authors: Xutao Liang, Pinhao Song

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[916] arXiv:2206.12216 [pdf, other]: Title: Optimized Views Photogrammetry: Precision Analysis and A Large-scale Case Study in Qingdao

Authors: Qingquan Li, Wenshuai Yu, San Jiang

Comments: 16 pages, 24 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[917] arXiv:2206.12351 [pdf, other]: Title: Megapixel Image Generation with Step-Unrolled Denoising Autoencoders

Authors: Alex F. McKinney, Chris G. Willcocks

Comments: 17 pages + 9 appendix pages. 20 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[918] arXiv:2206.12356 [pdf, other]: Title: HM3D-ABO: A Photo-realistic Dataset for Object-centric Multi-view 3D Reconstruction

Authors: Zhenpei Yang, Zaiwei Zhang, Qixing Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[919] arXiv:2206.12370 [pdf, other]: Title: Mixed Sample Augmentation for Online Distillation

Authors: Yiqing Shen, Liwu Xu, Yuzhe Yang, Yaqian Li, Yandong Guo

Comments: 5 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[920] arXiv:2206.12372 [pdf, other]: Title: QReg: On Regularization Effects of Quantization

Authors: MohammadHossein AskariHemmat, Reyhane Askari Hemmat, Alex Hoffman, Ivan Lazarevich, Ehsan Saboori, Olivier Mastropietro, Sudhakar Sah, Yvon Savaria, Jean-Pierre David

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[921] arXiv:2206.12381 [pdf, other]: Title: Defending Backdoor Attacks on Vision Transformer via Patch Processing

Authors: Khoa D. Doan, Yingjie Lao, Peng Yang, Ping Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[922] arXiv:2206.12396 [pdf, other]: Title: Text-Driven Stylization of Video Objects

Authors: Sebastian Loeschcke, Serge Belongie, Sagie Benaim

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[923] arXiv:2206.12403 [pdf, other]: Title: ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings

Authors: Arjun Majumdar, Gunjan Aggarwal, Bhavika Devnani, Judy Hoffman, Dhruv Batra

Comments: code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[924] arXiv:2206.12455 [pdf, other]: Title: Ev-NeRF: Event Based Neural Radiance Field

Authors: Inwoo Hwang, Junho Kim, Young Min Kim

Comments: Accepted to WACV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[925] arXiv:2206.12458 [pdf, other]: Title: Bag of Tricks for Long-Tail Visual Recognition of Animal Species in Camera-Trap Images

Authors: Fagner Cunha, Eulanda M. dos Santos, Juan G. Colonna

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[926] arXiv:2206.12464 [pdf, other]: Title: Motion Estimation for Large Displacements and Deformations

Authors: Qiao Chen, Charalambos Poullis

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[927] arXiv:2206.12480 [pdf, other]: Title: Attention-Guided Autoencoder for Automated Progression Prediction of Subjective Cognitive Decline with Structural MRI

Authors: Hao Guan, Ling Yue, Pew-Thian Yap, Shifu Xiao, Andrea Bozoki, Mingxia Liu

Comments: 10 pages, 12 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[928] arXiv:2206.12498 [pdf, other]: Title: Optimal and Robust Category-level Perception: Object Pose and Shape Estimation from 2D and 3D Semantic Keypoints

Authors: Jingnan Shi, Heng Yang, Luca Carlone

Comments: arXiv admin note: text overlap with arXiv:2104.08383

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[929] arXiv:2206.12505 [pdf, other]: Title: Stain Based Contrastive Co-training for Histopathological Image Analysis

Authors: Bodong Zhang, Beatrice Knudsen, Deepika Sirohi, Alessandro Ferrero, Tolga Tasdizen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[930] arXiv:2206.12533 [pdf, other]: Title: From Shallow to Deep: Compositional Reasoning over Graphs for Visual Question Answering

Authors: Zihao Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[931] arXiv:2206.12534 [pdf, other]: Title: SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos

Authors: Salar Hosseini Khorasgani, Yuxuan Chen, Florian Shkurti

Comments: CVPR2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[932] arXiv:2206.12558 [pdf, other]: Title: FastBVP-Net: a lightweight pulse extraction network for measuring heart rhythm via facial videos

Authors: Jialiang Zhuang, Yuheng Chen, Yun Zhang, Xiujuan Zheng

Comments: 9 pages, 2 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[933] arXiv:2206.12571 [pdf, other]: Title: CV 3315 Is All You Need : Semantic Segmentation Competition

Authors: Akide Liu, Zihan Wang

Comments: arXiv admin note: text overlap with arXiv:2105.15203 by other authors

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[934] arXiv:2206.12590 [pdf, other]: Title: RSTAM: An Effective Black-Box Impersonation Attack on Face Recognition using a Mobile and Compact Printer

Authors: Xiaoliang Liu, Furao Shen, Jian Zhao, Changhai Nie

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[935] arXiv:2206.12592 [pdf, other]: Title: Asymmetric Transfer Hashing with Adaptive Bipartite Graph Learning

Authors: Jianglin Lu, Jie Zhou, Yudong Chen, Witold Pedrycz, Kwok-Wai Hung

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[936] arXiv:2206.12596 [pdf, ps, other]: Title: Non-iterative Coarse-to-fine Registration based on Single-pass Deep Cumulative Learning

Authors: Mingyuan Meng, Lei Bi, Dagan Feng, Jinman Kim

Comments: Accepted at International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2022)

Journal-ref: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 88-97, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[937] arXiv:2206.12612 [pdf, other]: Title: Learn to Predict How Humans Manipulate Large-sized Objects from Interactive Motions

Authors: Weilin Wan, Lei Yang, Lingjie Liu, Zhuoying Zhang, Ruixing Jia, Yi-King Choi, Jia Pan, Christian Theobalt, Taku Komura, Wenping Wang

Journal-ref: IEEE Robotics and Automation Letters (Volume: 7, Issue: 2, April 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[938] arXiv:2206.12614 [pdf, other]: Title: BokehMe: When Neural Rendering Meets Classical Rendering

Authors: Juewen Peng, Zhiguo Cao, Xianrui Luo, Hao Lu, Ke Xian, Jianming Zhang

Comments: Accepted by CVPR 2022 (Oral); Project: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[939] arXiv:2206.12622 [pdf, other]: Title: SAT: Self-adaptive training for fashion compatibility prediction

Authors: Ling Xiao, Toshihiko Yamasaki

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[940] arXiv:2206.12623 [pdf, other]: Title: Inverted Semantic-Index for Image Retrieval

Authors: Ying Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB)
[941] arXiv:2206.12634 [pdf, other]: Title: SC-Transformer++: Structured Context Transformer for Generic Event Boundary Detection

Authors: Dexiang Hong, Xiaoqi Ma, Xinyao Wang, Congcong Li, Yufei Wang, Longyin Wen

Comments: winner method at LOVEU@CVPR'22 Generic Event Boundary Detection Challenge

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[942] arXiv:2206.12648 [pdf, other]: Title: BIMS-PU: Bi-Directional and Multi-Scale Point Cloud Upsampling

Authors: Yechao Bai, Xiaogang Wang, Marcelo H. Ang Jr, Daniela Rus

Comments: Accepted to RA-L 2022 (IEEE Robotics and Automation Letters)

Journal-ref: IEEE Robotics and Automation Letters, vol. 7, no. 3, pp. 7447-7454, July 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[943] arXiv:2206.12650 [pdf, ps, other]: Title: Machine Learning-based Biological Ageing Estimation Technologies: A Survey

Authors: Zhaonian Zhang, Richard Jiang, Danny Crookes, Paul Chazot

Comments: in Recent Advances in AI-enabled Automated Medical Diagnosis this https URL

Journal-ref: Recent Advances in AI-enabled Automated Medical Diagnosis, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[944] arXiv:2206.12651 [pdf, ps, other]: Title: Review on Social Behavior Analysis of Laboratory Animals: From Methodologies to Applications

Authors: Ziping Jiang, Paul L. Chazot, Richard Jiang

Comments: this https URL

Journal-ref: Recent Advances in AI-enabled Automated Medical Diagnosis, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[945] arXiv:2206.12653 [pdf, ps, other]: Title: Diagnostic Communication and Visual System based on Vehicle UDS Protocol

Authors: Hong Zhang, Ding Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[946] arXiv:2206.12657 [pdf, other]: Title: Enhanced Deep Animation Video Interpolation

Authors: Wang Shen, Cheng Ming, Wenbo Bao, Guangtao Zhai, Li Chen, Zhiyong Gao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[947] arXiv:2206.12675 [pdf, other]: Title: Learning to Infer 3D Shape Programs with Differentiable Renderer

Authors: Yichao Liang

Comments: Technical report written in 2020; 10 pages, 5 figures. arXiv admin note: substantial text overlap with arXiv:1901.02875 by other authors

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[948] arXiv:2206.12681 [pdf, other]: Title: UltraMNIST Classification: A Benchmark to Train CNNs for Very Large Images

Authors: Deepak K. Gupta, Udbhav Bamba, Abhishek Thakur, Akash Gupta, Suraj Sharan, Ertugrul Demir, Dilip K. Prasad

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[949] arXiv:2206.12685 [pdf, ps, other]: Title: Defense against adversarial attacks on deep convolutional neural networks through nonlocal denoising

Authors: Sandhya Aneja, Nagender Aneja, Pg Emeroylariffion Abas, Abdul Ghani Naim

Journal-ref: IAES International Journal of Artificial Intelligence, Vol. 11, No. 3, September 2022, pp. 961-968, ISSN: 2252-8938

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[950] arXiv:2206.12694 [pdf, other]: Title: RandStainNA: Learning Stain-Agnostic Features from Histology Slides by Bridging Stain Augmentation and Normalization

Authors: Yiqing Shen, Yulin Luo, Dinggang Shen, Jing Ke

Comments: 12 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[951] arXiv:2206.12704 [pdf, other]: Title: Anatomy-Guided Weakly-Supervised Abnormality Localization in Chest X-rays

Authors: Ke Yu, Shantanu Ghosh, Zhexiong Liu, Christopher Deible, Kayhan Batmanghelich

Comments: Accepted by MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[952] arXiv:2206.12714 [pdf, other]: Title: Defending Multimodal Fusion Models against Single-Source Adversaries

Authors: Karren Yang, Wan-Yi Lin, Manash Barman, Filipe Condessa, Zico Kolter

Comments: CVPR 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[953] arXiv:2206.12725 [pdf, other]: Title: Empirical Evaluation of Physical Adversarial Patch Attacks Against Overhead Object Detection Models

Authors: Gavin S. Hartnett, Li Ang Zhang, Caolionn O'Connell, Andrew J. Lohn, Jair Aguirre

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[954] arXiv:2206.12738 [pdf, other]: Title: Self-Supervised 3D Monocular Object Detection by Recycling Bounding Boxes

Authors: Sugirtha T, Sridevi M, Khailash Santhakumar, Hao Liu, B Ravi Kiran, Thomas Gauthier, Senthil Yogamani

Comments: Published at ICCVW-SSLAD 2021. arXiv admin note: substantial text overlap with arXiv:2104.10786

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[955] arXiv:2206.12740 [pdf, other]: Title: Multi Visual Modality Fall Detection Dataset

Authors: Stefan Denkovski, Shehroz S. Khan, Brandon Malamis, Sae Young Moon, Bing Ye, Alex Mihailidis

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[956] arXiv:2206.12745 [pdf, ps, other]: Title: Sequential image recovery using joint hierarchical Bayesian learning

Authors: Yao Xiao, Jan Glaubitz

Comments: 24 pages, 15 figures

Journal-ref: J Sci Comput 96, 4 (2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
[957] arXiv:2206.12755 [pdf, other]: Title: Training Your Sparse Neural Network Better with Any Mask

Authors: Ajay Jaiswal, Haoyu Ma, Tianlong Chen, Ying Ding, Zhangyang Wang

Comments: Accepted by ICML 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[958] arXiv:2206.12772 [pdf, other]: Title: Exploiting Transformation Invariance and Equivariance for Self-supervised Sound Localisation

Authors: Jinxiang Liu, Chen Ju, Weidi Xie, Ya Zhang

Comments: Camera-ready Version for ACMMM 2022, Project page is this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[959] arXiv:2206.12788 [pdf, other]: Title: Representative Teacher Keys for Knowledge Distillation Model Compression Based on Attention Mechanism for Image Classification

Authors: Jun-Teng Yang, Sheng-Che Kao, Scott C.-H. Huang

Comments: eight pages, six figures, three tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[960] arXiv:2206.12794 [pdf, other]: Title: CTMQ: Cyclic Training of Convolutional Neural Networks with Multiple Quantization Steps

Authors: HyunJin Kim, Jungwoo Shin, Alberto A. Del Barrio

Comments: submitted to NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[961] arXiv:2206.12798 [pdf, other]: Title: Multiple Instance Learning with Mixed Supervision in Gleason Grading

Authors: Hao Bian, Zhuchen Shao, Yang Chen, Yifeng Wang, Haoqian Wang, Jian Zhang, Yongbing Zhang

Comments: Accepted by MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[962] arXiv:2206.12837 [pdf, other]: Title: Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer

Authors: Ailin Huang, Zhewei Huang, Shuchang Zhou

Comments: Ailin and Zhewei contributed equally to this work. ACM MM22 workshop paper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[963] arXiv:2206.12845 [pdf, other]: Title: RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval

Authors: Burak Satar, Hongyuan Zhu, Hanwang Zhang, Joo Hwee Lim

Comments: Preprint, under review in TCSVT Journal

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[964] arXiv:2206.12849 [pdf, other]: Title: Semantic Role Aware Correlation Transformer for Text to Video Retrieval

Authors: Burak Satar, Hongyuan Zhu, Xavier Bresson, Joo Hwee Lim

Comments: Camera-ready for ICIP 2021

Journal-ref: IEEE International Conference on Image Processing (ICIP), 2021, pp. 1334-1338

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[965] arXiv:2206.12869 [pdf, other]: Title: Image Aesthetics Assessment Using Graph Attention Network

Authors: Koustav Ghosal, Aljosa Smolic

Comments: International Conference on Pattern Recognition (ICPR), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[966] arXiv:2206.12885 [pdf, ps, other]: Title: FingerGAN: A Constrained Fingerprint Generation Scheme for Latent Fingerprint Enhancement

Authors: Yanming Zhu, Xuefei Yin, Jiankun Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[967] arXiv:2206.12912 [pdf, other]: Title: Woodscape Fisheye Object Detection for Autonomous Driving -- CVPR 2022 OmniCV Workshop Challenge

Authors: Saravanabalagi Ramachandran, Ganesh Sistu, Varun Ravi Kumar, John McDonald, Senthil Yogamani

Comments: Workshop on Omnidirectional Computer Vision (OmniCV) at Conference on Computer Vision and Pattern Recognition (CVPR) 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[968] arXiv:2206.12914 [pdf, other]: Title: Video Anomaly Detection via Prediction Network with Enhanced Spatio-Temporal Memory Exchange

Authors: Guodong Shen, Yuqi Ouyang, Victor Sanchez

Comments: Accepted at ICASSP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[969] arXiv:2206.12921 [pdf, other]: Title: Non-Parametric Style Transfer

Authors: Jeong-Sik Lee, Hyun-Chul Choi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[970] arXiv:2206.12923 [pdf, other]: Title: Video Activity Localisation with Uncertainties in Temporal Boundary

Authors: Jiabo Huang, Hailin Jin, Shaogang Gong, Yang Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[971] arXiv:2206.12925 [pdf, other]: Title: Vision Transformer for Contrastive Clustering

Authors: Hua-Bao Ling, Bowen Zhu, Dong Huang, Ding-Hua Chen, Chang-Dong Wang, Jian-Huang Lai

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[972] arXiv:2206.12930 [pdf, other]: Title: SVBR-NET: A Non-Blind Spatially Varying Defocus Blur Removal Network

Authors: Ali Karaali, Claudio Rosito Jung

Comments: Accepted to ICIP2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[973] arXiv:2206.12943 [pdf, other]: Title: Multi-view Feature Augmentation with Adaptive Class Activation Mapping

Authors: Xiang Gao, Yingjie Tian, Zhiquan Qi

Comments: An arxiv version of the paper published in Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI-21). See this https URL

Journal-ref: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence. Main Track. 2021. Pages 678-684

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[974] arXiv:2206.12946 [pdf, other]: Title: AFT-VO: Asynchronous Fusion Transformers for Multi-View Visual Odometry Estimation

Authors: Nimet Kaygusuz, Oscar Mendez, Richard Bowden

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[975] arXiv:2206.12952 [pdf, other]: Title: Nonwatertight Mesh Reconstruction

Authors: Partha Ghosh

Comments: arXiv admin note: text overlap with arXiv:2106.03452 by other authors

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[976] arXiv:2206.12958 [pdf, ps, other]: Title: Szloca: towards a framework for full 3D tracking through a single camera in context of interactive arts

Authors: Sahaj Garg

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[977] arXiv:2206.12959 [pdf, other]: Title: Probabilistic PolarGMM: Unsupervised Cluster Learning of Very Noisy Projection Images of Unknown Pose

Authors: Supawit Chockchowwat, Chandrajit L. Bajaj

Comments: 13 pages, including appendices

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[978] arXiv:2206.12963 [pdf, other]: Title: Self-Healing Robust Neural Networks via Closed-Loop Control

Authors: Zhuotong Chen, Qianxiao Li, Zheng Zhang

Comments: 48 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[979] arXiv:2206.12972 [pdf, other]: Title: VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

Authors: Kashu Yamazaki, Sang Truong, Khoa Vo, Michael Kidd, Chase Rainwater, Khoa Luu, Ngan Le

Comments: accepted by The 29th IEEE International Conference on Image Processing (IEEE ICIP) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[980] arXiv:2206.12994 [pdf, other]: Title: Automatic Generation of Product-Image Sequence in E-commerce

Authors: Xiaochuan Fan, Chi Zhang, Yong Yang, Yue Shang, Xueying Zhang, Zhen He, Yun Xiao, Bo Long, Lingfei Wu

Comments: Accepted by KDD 2022 ADS

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[981] arXiv:2206.13028 [pdf, other]: Title: Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition

Authors: Zhan Chen, Sicheng Li, Bing Yang, Qinghan Li, Hong Liu

Comments: 10 pages, 4 figures, accepted by AAAI 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[982] arXiv:2206.13042 [pdf, other]: Title: A Strategy Optimized Pix2pix Approach for SAR-to-Optical Image Translation Task

Authors: Fujian Cheng, Yashu Kang, Chunlei Chen, Kezhao Jiang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[983] arXiv:2206.13076 [pdf, other]: Title: SearchMorph: Multi-scale Correlation Iterative Network for Deformable Registration

Authors: Xiao Fan, Shuxin Zhuang, Zhemin Zhuang, Ye Yuan, Shunmin Qiu, Alex Noel Joseph Raj, Yibiao Rong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[984] arXiv:2206.13078 [pdf, other]: Title: Video2StyleGAN: Encoding Video in Latent Space for Manipulation

Authors: Jiyang Yu, Jingen Liu, Jing Huang, Wei Zhang, Tao Mei

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[985] arXiv:2206.13079 [pdf, other]: Title: Dynamic Bank Learning for Semi-supervised Federated Image Diagnosis with Class Imbalance

Authors: Meirui Jiang, Hongzheng Yang, Xiaoxiao Li, Quande Liu, Pheng-Ann Heng, Qi Dou

Comments: Early accepted by 25th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI'22)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[986] arXiv:2206.13082 [pdf, ps, other]: Title: PST: Plant segmentation transformer for 3D point clouds of rapeseed plants at the podding stage

Authors: Ruiming Du, Zhihong Ma, Pengyao Xie, Yong He, Haiyan Cen

Comments: 46 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[987] arXiv:2206.13115 [pdf, other]: Title: Lesion-Aware Contrastive Representation Learning for Histopathology Whole Slide Images Analysis

Authors: Jun Li, Yushan Zheng, Kun Wu, Jun Shi, Fengying Xie, Zhiguo Jiang

Comments: accepted for MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[988] arXiv:2206.13117 [src]: Title: SARNet: Semantic Augmented Registration of Large-Scale Urban Point Clouds

Authors: Chao Liu, Jianwei Guo, Dong-Ming Yan, Zhirong Liang, Xiaopeng Zhang, Zhanglin Cheng

Comments: Author information changes

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[989] arXiv:2206.13142 [pdf, other]: Title: Representing motion as a sequence of latent primitives, a flexible approach for human motion modelling

Authors: Mathieu Marsot, Stefanie Wuhrer, Jean-Sebastien Franco, Anne Hélène Olivier

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[990] arXiv:2206.13155 [pdf, other]: Title: Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding

Authors: Chuwei Luo, Guozhi Tang, Qi Zheng, Cong Yao, Lianwen Jin, Chenliang Li, Yang Xue, Luo Si

Comments: Under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[991] arXiv:2206.13156 [pdf, other]: Title: Kernel Attention Transformer (KAT) for Histopathology Whole Slide Image Classification

Authors: Yushan Zheng, Jun Li, Jun Shi, Fengying Xie, Zhiguo Jiang

Comments: accepted for MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[992] arXiv:2206.13188 [pdf, other]: Title: Self-supervised Learning in Remote Sensing: A Review

Authors: Yi Wang, Conrad M Albrecht, Nassim Ait Ali Braham, Lichao Mou, Xiao Xiang Zhu

Comments: Accepted by IEEE Geoscience and Remote Sensing Magazine. 32 pages, 22 content pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[993] arXiv:2206.13199 [pdf, other]: Title: MGNet: Monocular Geometric Scene Understanding for Autonomous Driving

Authors: Markus Schön, Michael Buchholz, Klaus Dietmayer

Journal-ref: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 15784-15795

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[994] arXiv:2206.13263 [pdf, other]: Title: Learning with Weak Annotations for Robust Maritime Obstacle Detection

Authors: Lojze Žust, Matej Kristan

Comments: Published in MDPI Sensors, 23 pages, 8 figures

Journal-ref: Sensors 2022, 22, 9139

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[995] arXiv:2206.13282 [pdf, other]: Title: Monocular Depth Decomposition of Semi-Transparent Volume Renderings

Authors: Dominik Engel, Sebastian Hartwig, Timo Ropinski

Comments: accepted at IEEE TVCG 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[996] arXiv:2206.13294 [pdf, other]: Title: LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation

Authors: Florent Bartoccioni, Éloi Zablocki, Andrei Bursuc, Patrick Pérez, Matthieu Cord, Karteek Alahari

Journal-ref: CoRL 2022 https://openreview.net/forum?id=abd_D-iVjk0

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[997] arXiv:2206.13296 [pdf, other]: Title: Consistency-preserving Visual Question Answering in Medical Imaging

Authors: Sergio Tascon-Morales, Pablo Márquez-Neila, Raphael Sznitman

Comments: Appears in Medical Image Computing and Computer Assisted Interventions (MICCAI), 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[998] arXiv:2206.13304 [pdf, other]: Title: PARTICUL: Part Identification with Confidence measure using Unsupervised Learning

Authors: Romain Xu-Darme (LSL, MRIM), Georges Quénot (MRIM), Zakaria Chihani (LSL), Marie-Christine Rousset (SLIDE)

Comments: Accepted at XAIE: 2nd Workshop on Explainable and Ethical AI -- ICPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[999] arXiv:2206.13317 [pdf, other]: Title: Automatic identification of segmentation errors for radiotherapy using geometric learning

Authors: Edward G. A. Henderson, Andrew F. Green, Marcel van Herk, Eliana M. Vasquez Osorio

Comments: Accepted in 25th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2022). This preprint has not undergone peer review or any post-submission improvements or corrections

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1000] arXiv:2206.13318 [pdf, other]: Title: Key-frame Guided Network for Thyroid Nodule Recognition using Ultrasound Videos

Authors: Yuchen Wang, Zhongyu Li, Xiangxiang Cui, Liangliang Zhang, Xiang Luo, Meng Yang, Shi Chang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1001] arXiv:2206.13329 [pdf, other]: Title: Prior-Guided One-shot Neural Architecture Search

Authors: Peijie Dong, Xin Niu, Lujun Li, Linzhen Xie, Wenbin Zou, Tian Ye, Zimian Wei, Hengyue Pan

Comments: Official 3rd Place Solution for the Second workshop Neural Architecture Search Second lightweight NAS Challenge 2022 - Track1 Supernet Track. Official leaderboard: this https URL. CVPR 2022 Workshop: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1002] arXiv:2206.13342 [pdf, other]: Title: Open Set Classification of Untranscribed Handwritten Documents

Authors: José Ramón Prieto, Juan José Flores, Enrique Vidal, Alejandro H. Toselli, David Garrido, Carlos Alonso

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[1003] arXiv:2206.13346 [pdf, other]: Title: Distributional Gaussian Processes Layers for Out-of-Distribution Detection

Authors: Sebastian G. Popescu, David J. Sharp, James H. Cole, Konstantinos Kamnitsas, Ben Glocker

Comments: Published in Journal of Machine Learning for Biomedical Imaging: Special Issue: Information Processing in Medical Imaging (IPMI) 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1004] arXiv:2206.13356 [pdf, other]: Title: iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition

Authors: Xu Yang, Daoyuan Wu, Xiao Yi, Jimmy H. M. Lee, Tan Lee

Comments: This is a technical report from the Chinese University of Hong Kong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1005] arXiv:2206.13381 [pdf, other]: Title: TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform Mask

Authors: Yuchen Su, Zhiwen Shao, Yong Zhou, Fanrong Meng, Hancheng Zhu, Bing Liu, Rui Yao

Comments: This paper has been accepted by IEEE Transactions on Multimedia

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1006] arXiv:2206.13383 [pdf, ps, other]: Title: Mushroom image recognition and distance generation based on attention-mechanism model and genetic information

Authors: Wenbin Liao, Jiewen Xiao, Chengbo Zhao, Yonggong Han, ZhiJie Geng, Jianxin Wang, Yihua Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1007] arXiv:2206.13386 [pdf, other]: Title: Uncovering variability in human driving behavior through automatic extraction of similar traffic scenes from large naturalistic datasets

Authors: Olger Siebinga, Arkady Zgonnikov, David Abbink

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1008] arXiv:2206.13388 [pdf, ps, other]: Title: Rotated Digit Recognition by Variational Autoencoders with Fixed Output Distributions

Authors: David Yevick

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1009] arXiv:2206.13389 [pdf, other]: Title: UI Layers Merger: Merging UI layers via Visual Learning and Boundary Prior

Authors: Yun-nong Chen, Yan-kun Zhen, Chu-ning Shi, Jia-zhi Li, Liu-qing Chen, Ze-jian Li, Ling-yun Sun, Ting-ting Zhou, Yan-fang Chang

Comments: 15 pages, accepted to Frontiers of Information Technology & Electronic Engineering. This is a preprint version; the copyright belongs to the Springer Nature journals

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1010] arXiv:2206.13390 [pdf, other]: Title: A Comprehensive Survey on Video Saliency Detection with Auditory Information: the Audio-visual Consistency Perceptual is the Key!

Authors: Chenglizhao Chen, Mengke Song, Wenfeng Song, Li Guo, Muwei Jian

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1011] arXiv:2206.13391 [pdf, other]: Title: Deep reinforced active learning for multi-class image classification

Authors: Emma Slade, Kim M. Branson

Comments: 10 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1012] arXiv:2206.13392 [pdf, ps, other]: Title: Remote Sensing Image Classification using Transfer Learning and Attention Based Deep Neural Network

Authors: Lam Pham, Khoa Tran, Dat Ngo, Jasmin Lampert, Alexander Schindler

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1013] arXiv:2206.13395 [pdf, other]: Title: Gait Cycle Reconstruction and Human Identification from Occluded Sequences

Authors: Abhishek Paul, Manav Mukesh Jain, Jinesh Jain, Pratik Chattopadhyay

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1014] arXiv:2206.13396 [pdf, other]: Title: A Simple Approach for Visual Rearrangement: 3D Mapping and Semantic Search

Authors: Brandon Trabucco, Gunnar Sigurdsson, Robinson Piramuthu, Gaurav S. Sukhatme, Ruslan Salakhutdinov

Comments: Winner of the Rearrangement Challenge at CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[1015] arXiv:2206.13397 [pdf, other]: Title: Generative Modelling With Inverse Heat Dissipation

Authors: Severi Rissanen, Markus Heinonen, Arno Solin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1016] arXiv:2206.13398 [pdf, other]: Title: An Efficient Industrial Federated Learning Framework for AIoT: A Face Recognition Application

Authors: Youlong Ding, Xueyang Wu, Zhitao Li, Zeheng Wu, Shengqi Tan, Qian Xu, Weike Pan, Qiang Yang

Comments: FL-IJCAI'22 Accepted Paper

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1017] arXiv:2206.13413 [pdf, other]: Title: RES: A Robust Framework for Guiding Visual Explanation

Authors: Yuyang Gao, Tong Steven Sun, Guangji Bai, Siyi Gu, Sungsoo Ray Hong, Liang Zhao

Comments: Published in KDD 2022

Journal-ref: In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), August 14-18, 2022, Washington, DC, USA

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1018] arXiv:2206.13434 [pdf, other]: Title: ContraReg: Contrastive Learning of Multi-modality Unsupervised Deformable Image Registration

Authors: Neel Dey, Jo Schlemper, Seyed Sadegh Mohseni Salehi, Bo Zhou, Guido Gerig, Michal Sofka

Comments: Accepted by MICCAI 2022. 13 pages, 6 figures, and 1 table

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1019] arXiv:2206.13454 [pdf, other]: Title: Optimizing Video Prediction via Video Frame Interpolation

Authors: Yue Wu, Qiang Wen, Qifeng Chen

Comments: Accepted by CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1020] arXiv:2206.13462 [pdf, other]: Title: Learn Fast, Segment Well: Fast Object Segmentation Learning on the iCub Robot

Authors: Federico Ceola, Elisa Maiettini, Giulia Pasquale, Giacomo Meanti, Lorenzo Rosasco, Lorenzo Natale

Comments: © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1021] arXiv:2206.13500 [pdf, other]: Title: Neural Neural Textures Make Sim2Real Consistent

Authors: Ryan Burgert, Jinghuan Shang, Xiang Li, Michael Ryoo

Comments: 9 pages, 10 figures (without references or appendix); 16 pages, 16 figures (with appendix)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
[1022] arXiv:2206.13502 [pdf, other]: Title: Programmatic Concept Learning for Human Motion Description and Synthesis

Authors: Sumith Kulal, Jiayuan Mao, Alex Aiken, Jiajun Wu

Comments: CVPR 2022. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1023] arXiv:2206.13559 [pdf, other]: Title: ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning

Authors: Junting Pan, Ziyi Lin, Xiatian Zhu, Jing Shao, Hongsheng Li

Comments: Accepted in NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1024] arXiv:2206.13577 [pdf, other]: Title: A View Independent Classification Framework for Yoga Postures

Authors: Mustafa Chasmai, Nirjhar Das, Aman Bhardwaj, Rahul Garg

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1025] arXiv:2206.13597 [pdf, other]: Title: NeuRIS: Neural Reconstruction of Indoor Scenes Using Normal Priors

Authors: Jiepeng Wang, Peng Wang, Xiaoxiao Long, Christian Theobalt, Taku Komura, Lingjie Liu, Wenping Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1026] arXiv:2206.13608 [pdf, other]: Title: Reducing Annotation Need in Self-Explanatory Models for Lung Nodule Diagnosis

Authors: Jiahao Lu, Chong Yin, Oswin Krause, Kenny Erleben, Michael Bachmann Nielsen, Sune Darkner

Comments: 10 pages, 4 figures, 2 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1027] arXiv:2206.13626 [pdf, other]: Title: Patch Selection for Melanoma Classification

Authors: Guillaume Lachaud, Patricia Conde-Cespedes, Maria Trocan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1028] arXiv:2206.13628 [pdf, other]: Title: Multi-scale Network with Attentional Multi-resolution Fusion for Point Cloud Semantic Segmentation

Authors: Yuyan Li, Ye Duan

Comments: ICPR 2022, poster

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1029] arXiv:2206.13644 [pdf, other]: Title: Feature Refinement to Improve High Resolution Image Inpainting

Authors: Prakhar Kulshreshtha, Brian Pugh, Salma Jiddi

Comments: 5 pages, 5 figures, Published in CVPR Workshop on Computer Vision for Augmented and Virtual Reality, New Orleans, LA, 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1030] arXiv:2206.13673 [pdf, other]: Title: How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying Pixels

Authors: Tobias Fischer, Michael Milford

Comments: 8 pages

Journal-ref: IEEE Robotics and Automation Letters 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[1031] arXiv:2206.13677 [pdf, other]: Title: Towards Global-Scale Crowd+AI Techniques to Map and Assess Sidewalks for People with Disabilities

Authors: Maryam Hosseini, Mikey Saugstad, Fabio Miranda, Andres Sevtsuk, Claudio T. Silva, Jon E. Froehlich

Comments: CVPR 2022 AVA (Accessibility, Vision, and Autonomy Meet) Workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[1032] arXiv:2206.13718 [pdf, other]: Title: The Third Place Solution for CVPR2022 AVA Accessibility Vision and Autonomy Challenge

Authors: Bo Yan, Leilei Cao, Zhuang Li, Hongbin Wang

Comments: The third place solution for CVPR2022 AVA Accessibility Vision and Autonomy Challenge

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1033] arXiv:2206.13728 [pdf, ps, other]: Title: Boosting R-CNN: Reweighting R-CNN Samples by RPN's Error for Underwater Object Detection

Authors: Pinhao Song, Pengteng Li, Linhui Dai, Tao Wang, Zhan Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1034] arXiv:2206.13732 [pdf, other]: Title: A Comprehensive Survey on Deep Gait Recognition: Algorithms, Datasets and Challenges

Authors: Chuanfu Shen, Shiqi Yu, Jilong Wang, George Q. Huang, Liang Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1035] arXiv:2206.13737 [pdf, other]: Title: Adversarial Consistency for Single Domain Generalization in Medical Image Segmentation

Authors: Yanwu Xu, Shaoan Xie, Maxwell Reynolds, Matthew Ragoza, Mingming Gong, Kayhan Batmanghelich

Comments: MICCAI 2022 accepted

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1036] arXiv:2206.13785 [pdf, other]: Title: 3D Multi-Object Tracking with Differentiable Pose Estimation

Authors: Dominik Schmauser, Zeju Qiu, Norman Müller, Matthias Nießner

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1037] arXiv:2206.13803 [pdf, other]: Title: FedIIC: Towards Robust Federated Learning for Class-Imbalanced Medical Image Classification

Authors: Nannan Wu, Li Yu, Xin Yang, Kwang-Ting Cheng, Zengqiang Yan

Comments: This paper has been accepted by MICCAI 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1038] arXiv:2206.13829 [pdf, other]: Title: Cross-Forgery Analysis of Vision Transformers and CNNs for Deepfake Image Detection

Authors: Davide Alessandro Coccomini, Roberto Caldelli, Fabrizio Falchi, Claudio Gennaro, Giuseppe Amato

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1039] arXiv:2206.13850 [pdf, other]: Title: When the Sun Goes Down: Repairing Photometric Losses for All-Day Depth Estimation

Authors: Madhu Vankadari, Stuart Golodetz, Sourav Garg, Sangyun Shin, Andrew Markham, Niki Trigoni

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1040] arXiv:2206.13858 [pdf, other]: Title: Accurate and Real-time Pseudo Lidar Detection: Is Stereo Neural Network Really Necessary?

Authors: Haitao Meng, Changcai Li, Gang Chen, Alois Knoll

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1041] arXiv:2206.13887 [pdf, other]: Title: Generating near-infrared facial expression datasets with dimensional affect labels

Authors: Calvin Chen, Stefan Winkler

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1042] arXiv:2206.13951 [pdf, other]: Title: Robustifying Vision Transformer without Retraining from Scratch by Test-Time Class-Conditional Feature Alignment

Authors: Takeshi Kojima, Yutaka Matsuo, Yusuke Iwasawa

Comments: Accepted to IJCAI-ECAI2022. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1043] arXiv:2206.13962 [src]: Title: Multi-Prior Learning via Neural Architecture Search for Blind Face Restoration

Authors: Yanjiang Yu, Puyang Zhang, Kaihao Zhang, Wenhan Luo, Changsheng Li, Ye Yuan, Guoren Wang

Comments: We found some problems with the article and need to withdraw it

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1044] arXiv:2206.13963 [pdf, other]: Title: Primitive Graph Learning for Unified Vector Mapping

Authors: Lei Wang, Min Dai, Jianan He, Jingwei Huang, Mingwei Sun

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1045] arXiv:2206.13964 [pdf, other]: Title: Learning Gait Representation from Massive Unlabelled Walking Videos: A Benchmark

Authors: Chao Fan, Saihui Hou, Jilong Wang, Yongzhen Huang, Shiqi Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1046] arXiv:2206.13996 [pdf, other]: Title: Detecting tiny objects in aerial images: A normalized Wasserstein distance and a new benchmark

Authors: Chang Xu, Jinwang Wang, Wen Yang, Huai Yu, Lei Yu, Gui-Song Xia

Comments: Accepted by ISPRS Journal of Photogrammetry and Remote Sensing

Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing (2022) 190:79-93

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1047] arXiv:2206.14009 [pdf, other]: Title: Show Me Your Face, And I'll Tell You How You Speak

Authors: Christen Millerdurai, Lotfy Abdel Khaliq, Timon Ulrich

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
[1048] arXiv:2206.14011 [pdf, ps, other]: Title: Taxonomy and evolution predicting using deep learning in images

Authors: Jiewen Xiao, Wenbin Liao, Ming Zhang, Jing Wang, Jianxin Wang, Yihua Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1049] arXiv:2206.14020 [pdf, other]: Title: Rethinking Adversarial Examples for Location Privacy Protection

Authors: Trung-Nghia Le, Ta Gu, Huy H. Nguyen, Isao Echizen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1050] arXiv:2206.14116 [pdf, other]: Title: SSL-Lanes: Self-Supervised Learning for Motion Forecasting in Autonomous Driving

Authors: Prarthana Bhattacharyya, Chengjie Huang, Krzysztof Czarnecki

Comments: Accepted to CoRL-2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[1051] arXiv:2206.14164 [pdf, ps, other]: Title: Visualizing and Alleviating the Effect of Radial Distortion on Camera Calibration Using Principal Lines

Authors: Jen-Hui Chuang, Hsin-Yi Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1052] arXiv:2206.14180 [pdf, other]: Title: High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions

Authors: Sangyun Lee, Gyojung Gu, Sunghyun Park, Seunghwan Choi, Jaegul Choo

Comments: Accepted to ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1053] arXiv:2206.14195 [pdf, other]: Title: Pedestrian 3D Bounding Box Prediction

Authors: Saeed Saadatnejad, Yi Zhou Ju, Alexandre Alahi

Comments: Accepted and published in hEART2022 (the 10th Symposium of the European Association for Research in Transportation): this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1054] arXiv:2206.14245 [pdf, other]: Title: SImProv: Scalable Image Provenance Framework for Robust Content Attribution

Authors: Alexander Black, Tu Bui, Simon Jenni, Zhifei Zhang, Viswanathan Swaminanthan, John Collomosse

Comments: Under consideration at Computer Vision and Image Understanding

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1055] arXiv:2206.14263 [pdf, other]: Title: ZoDIAC: Zoneout Dropout Injection Attention Calculation

Authors: Zanyar Zohourianshahzadi, Jugal Kalita

Comments: This work has been submitted to SN-AIRE journal and is currently under review

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1056] arXiv:2206.14302 [pdf, ps, other]: Title: Reinforcement Learning in Medical Image Analysis: Concepts, Applications, Challenges, and Future Directions

Authors: Mingzhe Hu, Jiahan Zhang, Luke Matkovic, Tian Liu, Xiaofeng Yang

Comments: 30 pages, 13 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1057] arXiv:2206.14314 [pdf, other]: Title: Generative Neural Articulated Radiance Fields

Authors: Alexander W. Bergman, Petr Kellnhofer, Wang Yifan, Eric R. Chan, David B. Lindell, Gordon Wetzstein

Comments: Project website: this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1058] arXiv:2206.14344 [pdf, other]: Title: A New Adjacency Matrix Configuration in GCN-based Models for Skeleton-based Action Recognition

Authors: Zheng Fang, Xiongwei Zhang, Tieyong Cao, Yunfei Zheng, Meng Sun

Comments: 19 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1059] arXiv:2206.14350 [pdf, ps, other]: Title: Convolutional Neural Network Based Partial Face Detection

Authors: Md. Towfiqul Islam, Tanzim Ahmed, A.B.M. Raihanur Rashid, Taminul Islam, Md. Sadekur Rahman, Md. Tarek Habib

Comments: Accepted in 7th International Conference for Convergence in Technology (I2CT), 2022, 6 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1060] arXiv:2206.14355 [pdf, other]: Title: EBMs vs. CL: Exploring Self-Supervised Visual Pretraining for Visual Question Answering

Authors: Violetta Shevchenko, Ehsan Abbasnejad, Anthony Dick, Anton van den Hengel, Damien Teney

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[1061] arXiv:2206.14381 [pdf, other]: Title: Exploiting Semantic Role Contextualized Video Features for Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022

Authors: Burak Satar, Hongyuan Zhu, Hanwang Zhang, Joo Hwee Lim

Comments: Ranked joint 3rd place in the Multi-Instance Retrieval Challenge at EPIC@CVPR2022. (v2: ref error is corrected)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[1062] arXiv:2206.14409 [pdf, ps, other]: Title: BATFormer: Towards Boundary-Aware Lightweight Transformer for Efficient Medical Image Segmentation

Authors: Xian Lin, Li Yu, Kwang-Ting Cheng, Zengqiang Yan

Comments: Accepted by IEEE Journal of Biomedical and Health Informatics. The source code is publicly available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1063] arXiv:2206.14413 [pdf, other]: Title: The Lighter The Better: Rethinking Transformers in Medical Image Segmentation Through Adaptive Pruning

Authors: Xian Lin, Li Yu, Kwang-Ting Cheng, Zengqiang Yan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1064] arXiv:2206.14437 [pdf, other]: Title: MaNi: Maximizing Mutual Information for Nuclei Cross-Domain Unsupervised Segmentation

Authors: Yash Sharma, Sana Syed, Donald E. Brown

Comments: Accepted at MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1065] arXiv:2206.14451 [pdf, other]: Title: SRCN3D: Sparse R-CNN 3D for Compact Convolutional Multi-View 3D Object Detection and Tracking

Authors: Yining Shi, Jingyan Shen, Yifan Sun, Yunlong Wang, Jiaxin Li, Shiqi Sun, Kun Jiang, Diange Yang

Comments: Accepted to Vision-centric Autonomous Driving (VCAD) Workshop at CVPR 2023. For more details, refer to this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1066] arXiv:2206.14467 [pdf, other]: Title: Single-domain Generalization in Medical Image Segmentation via Test-time Adaptation from Shape Dictionary

Authors: Quande Liu, Cheng Chen, Qi Dou, Pheng-Ann Heng

Comments: Accepted to AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1067] arXiv:2206.14475 [pdf, other]: Title: Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning

Authors: Xiangyu Li, Xu Yang, Kun Wei, Cheng Deng, Muli Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1068] arXiv:2206.14538 [pdf, other]: Title: vMFNet: Compositionality Meets Domain-generalised Segmentation

Authors: Xiao Liu, Spyridon Thermos, Pedro Sanchez, Alison Q. O'Neil, Sotirios A. Tsaftaris

Comments: Accepted by MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1069] arXiv:2206.14554 [pdf, other]: Title: Uncertainty-aware Panoptic Segmentation

Authors: Kshitij Sirohi, Sajad Marvi, Daniel Büscher, Wolfram Burgard

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1070] arXiv:2206.14555 [pdf, other]: Title: Technical Report for CVPR 2022 LOVEU AQTC Challenge

Authors: Hyeonyu Kim, Jongeun Kim, Jeonghun Kang, Sanguk Park, Dongchan Park, Taehwan Kim

Comments: 4 pages, 3 figures, technical report for track3 of CVPR 2022 LOVEU challenge

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1071] arXiv:2206.14651 [pdf, other]: Title: BoT-SORT: Robust Associations Multi-Pedestrian Tracking

Authors: Nir Aharon, Roy Orfaig, Ben-Zion Bobrovsky

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1072] arXiv:2206.14702 [pdf, other]: Title: Interventional Contrastive Learning with Meta Semantic Regularizer

Authors: Wenwen Qiang, Jiangmeng Li, Changwen Zheng, Bing Su, Hui Xiong

Comments: Accepted by ICML 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1073] arXiv:2206.14718 [pdf, other]: Title: LViT: Language meets Vision Transformer in Medical Image Segmentation

Authors: Zihan Li, Yunxiang Li, Qingde Li, Puyang Wang, Dazhou Guo, Le Lu, Dakai Jin, You Zhang, Qingqi Hong

Comments: Accepted by IEEE Transactions on Medical Imaging (TMI)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1074] arXiv:2206.14735 [pdf, other]: Title: GO-Surf: Neural Feature Grid Optimization for Fast, High-Fidelity RGB-D Surface Reconstruction

Authors: Jingwen Wang, Tymoteusz Bleja, Lourdes Agapito

Comments: 3DV2022 (Oral), first two authors contributed equally. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1075] arXiv:2206.14797 [pdf, other]: Title: 3D-Aware Video Generation

Authors: Sherwin Bahmani, Jeong Joon Park, Despoina Paschalidou, Hao Tang, Gordon Wetzstein, Leonidas Guibas, Luc Van Gool, Radu Timofte

Comments: TMLR 2023; Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1076] arXiv:2206.14841 [pdf, other]: Title: Causality for Inherently Explainable Transformers: CAT-XPLAIN

Authors: Subash Khanal, Benjamin Brodie, Xin Xing, Ai-Ling Lin, Nathan Jacobs

Comments: Accepted for spotlight presentation at the Explainable Artificial Intelligence for Computer Vision Workshop at CVPR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1077] arXiv:2206.14892 [pdf, other]: Title: Semantic Unfolding of StyleGAN Latent Space

Authors: Mustafa Shukor, Xu Yao, Bharath Bushan Damodaran, Pierre Hellier

Comments: Accepted at ICIP22

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1078] arXiv:2206.14923 [pdf, other]: Title: On Non-Random Missing Labels in Semi-Supervised Learning

Authors: Xinting Hu, Yulei Niu, Chunyan Miao, Xian-Sheng Hua, Hanwang Zhang

Journal-ref: ICLR 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1079] arXiv:2206.14938 [pdf, other]: Title: Regularization of NeRFs using differential geometry

Authors: Thibaud Ehret, Roger Marí, Gabriele Facciolo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1080] arXiv:2206.14971 [pdf, other]: Title: Boosting 3D Object Detection by Simulating Multimodality on Point Clouds

Authors: Wu Zheng, Mingxuan Hong, Li Jiang, Chi-Wing Fu

Comments: Published in CVPR 2022 as Oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1081] arXiv:2206.14973 [pdf, other]: Title: Benchmarking the Robustness of Deep Neural Networks to Common Corruptions in Digital Pathology

Authors: Yunlong Zhang, Yuxuan Sun, Honglin Li, Sunyi Zheng, Chenglu Zhu, Lin Yang

Comments: MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1082] arXiv:2206.14989 [pdf, other]: Title: A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA

Authors: Yangyang Guo, Liqiang Nie, Yongkang Wong, Yibing Liu, Zhiyong Cheng, Mohan Kankanhalli

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1083] arXiv:2206.14996 [pdf, other]: Title: Cross-domain Federated Object Detection

Authors: Shangchao Su, Bin Li, Chengzhi Zhang, Mingzhao Yang, Xiangyang Xue

Comments: ICME 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1084] arXiv:2206.15002 [pdf, other]: Title: Spatial Transformer Network with Transfer Learning for Small-scale Fine-grained Skeleton-based Tai Chi Action Recognition

Authors: Lin Yuan, Zhen He, Qiang Wang, Leiyang Xu, Xiang Ma

Comments: 6 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1085] arXiv:2206.15015 [pdf, other]: Title: Exploring Temporally Dynamic Data Augmentation for Video Recognition

Authors: Taeoh Kim, Jinhyung Kim, Minho Shim, Sangdoo Yun, Myunggu Kang, Dongyoon Wee, Sangyoun Lee

Comments: Technical Report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1086] arXiv:2206.15031 [pdf, other]: Title: Timestamp-Supervised Action Segmentation with Graph Convolutional Networks

Authors: Hamza Khan, Sanjay Haresh, Awais Ahmed, Shakeeb Siddiqui, Andrey Konin, M. Zeeshan Zia, Quoc-Huy Tran

Comments: Accepted to IROS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1087] arXiv:2206.15083 [pdf, other]: Title: UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical Mask Calibration

Authors: Jingyi Zhang, Jiaxing Huang, Xiaoqin Zhang, Shijian Lu

Comments: Accepted to CVPR2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1088] arXiv:2206.15085 [pdf, other]: Title: Skeleton-based Action Recognition via Adaptive Cross-Form Learning

Authors: Xuanhan Wang, Yan Dai, Lianli Gao, Jingkuan Song

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1089] arXiv:2206.15109 [pdf, ps, other]: Title: MKIoU Loss: Towards Accurate Oriented Object Detection in Aerial Images

Authors: Xinyi Yu, Jiangping Lu, Xinyi Yu, Mi Lin, Linlin Ou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1090] arXiv:2206.15128 [pdf, other]: Title: Detecting and Recovering Adversarial Examples from Extracting Non-robust and Highly Predictive Adversarial Perturbations

Authors: Mingyu Dong, Jiahao Chen, Diqun Yan, Jingxing Gao, Li Dong, Rangding Wang

Comments: 10 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1091] arXiv:2206.15138 [pdf, other]: Title: DFGC 2022: The Second DeepFake Game Competition

Authors: Bo Peng, Wei Xiang, Yue Jiang, Wei Wang, Jing Dong, Zhenan Sun, Zhen Lei, Siwei Lyu

Comments: Accepted by IJCB 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1092] arXiv:2206.15154 [pdf, other]: Title: BoxGraph: Semantic Place Recognition and Pose Estimation from 3D LiDAR

Authors: Georgi Pramatarov, Daniele De Martini, Matthew Gadd, Paul Newman

Comments: Accepted for publication at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1093] arXiv:2206.15157 [pdf, other]: Title: HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection

Authors: Tim Broedermann (1), Christos Sakaridis (1), Dengxin Dai (2), Luc Van Gool (1 and 3) ((1) ETH Zurich, (2) MPI for Informatics, (3) KU Leuven)

Comments: IEEE International Conference on Intelligent Transportation Systems (ITSC) 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1094] arXiv:2206.15186 [pdf, other]: Title: Out-of-Distribution Detection for Long-tailed and Fine-grained Skin Lesion Images

Authors: Deval Mehta, Yaniv Gal, Adrian Bowling, Paul Bonnington, Zongyuan Ge

Comments: Accepted to MICCAI 2022 (top 13% paper; early accept)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1095] arXiv:2206.15189 [pdf, other]: Title: Multi-Granularity Regularized Re-Balancing for Class Incremental Learning

Authors: Huitong Chen, Yu Wang, Qinghua Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1096] arXiv:2206.15248 [pdf, other]: Title: CTrGAN: Cycle Transformers GAN for Gait Transfer

Authors: Shahar Mahpod, Noam Gaash, Hay Hoffman, Gil Ben-Artzi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1097] arXiv:2206.15255 [pdf, other]: Title: Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery

Authors: Yuehao Wang, Yonghao Long, Siu Hin Fan, Qi Dou

Comments: 11 pages, 4 figures, conference

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1098] arXiv:2206.15258 [pdf, other]: Title: Neural Surface Reconstruction of Dynamic Scenes with Monocular RGB-D Camera

Authors: Hongrui Cai, Wanquan Feng, Xuetao Feng, Yan Wang, Juyong Zhang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1099] arXiv:2206.15268 [pdf, other]: Title: Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach

Authors: Jiaqi Tang, Zhaoyang Liu, Jing Tan, Chen Qian, Wayne Wu, Limin Wang

Comments: arXiv admin note: text overlap with arXiv:2112.04771

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1100] arXiv:2206.15275 [pdf, other]: Title: Multiclass-SGCN: Sparse Graph-based Trajectory Prediction with Agent Class Embedding

Authors: Ruochen Li, Stamos Katsigiannis, Hubert P. H. Shum

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1101] arXiv:2206.15282 [pdf, other]: Title: TINC: Temporally Informed Non-Contrastive Learning for Disease Progression Modeling in Retinal OCT Volumes

Authors: Taha Emre, Arunava Chakravarty, Antoine Rivail, Sophie Riedl, Ursula Schmidt-Erfurth, Hrvoje Bogunović

Comments: Accepted at MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1102] arXiv:2206.15296 [pdf, other]: Title: Self-SuperFlow: Self-supervised Scene Flow Prediction in Stereo Sequences

Authors: Katharina Bendig, René Schuster, Didier Stricker

Comments: Accepted at ICIP 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1103] arXiv:2206.15328 [pdf, other]: Title: Neural Annotation Refinement: Development of a New 3D Dataset for Adrenal Gland Analysis

Authors: Jiancheng Yang, Rui Shi, Udaranga Wickramasinghe, Qikui Zhu, Bingbing Ni, Pascal Fua

Comments: MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1104] arXiv:2206.15349 [pdf, other]: Title: Revisiting Competitive Coding Approach for Palmprint Recognition: A Linear Discriminant Analysis Perspective

Authors: Lingfei Song, Hua Huang

Comments: 12 pages, 14 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1105] arXiv:2206.15351 [pdf, ps, other]: Title: Deep Learning to See: Towards New Foundations of Computer Vision

Authors: Alessandro Betti, Marco Gori, Stefano Melacci

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1106] arXiv:2206.15353 [pdf, other]: Title: Learning Underrepresented Classes from Decentralized Partially Labeled Medical Images

Authors: Nanqing Dong, Michael Kampffmeyer, Irina Voiculescu

Comments: Accepted by MICCAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1107] arXiv:2206.15369 [pdf, other]: Title: No Reason for No Supervision: Improved Generalization in Supervised Models

Authors: Mert Bulent Sariyildiz, Yannis Kalantidis, Karteek Alahari, Diane Larlus

Comments: Accepted to ICLR 2023 (spotlight)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1108] arXiv:2206.15398 [pdf, other]: Title: PolarFormer: Multi-camera 3D Object Detection with Polar Transformer

Authors: Yanqin Jiang, Li Zhang, Zhenwei Miao, Xiatian Zhu, Jin Gao, Weiming Hu, Yu-Gang Jiang

Comments: Accepted to AAAI2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1109] arXiv:2206.15415 [pdf, other]: Title: MEAD: A Multi-Armed Approach for Evaluation of Adversarial Examples Detectors

Authors: Federica Granese, Marine Picot, Marco Romanelli, Francisco Messina, Pablo Piantanida

Comments: This paper has been accepted to appear in the Proceedings of the 2022 European Conference on Machine Learning and Data Mining (ECML-PKDD), 19th to the 23rd of September, Grenoble, France

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1110] arXiv:2206.15436 [pdf, other]: Title: Category-Level 6D Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and A New Dataset

Authors: Yang Fu, Xiaolong Wang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1111] arXiv:2206.15462 [pdf, other]: Title: Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations

Authors: Ziyan Yang, Kushal Kafle, Franck Dernoncourt, Vicente Ordonez

Comments: CVPR 2023. Fix ReferIt results. Code: this https URL. Project webpage: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[1112] arXiv:2206.15472 [pdf, other]: Title: On-Device Training Under 256KB Memory

Authors: Ji Lin, Ligeng Zhu, Wei-Ming Chen, Wei-Chen Wang, Chuang Gan, Song Han

Comments: NeurIPS 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1113] arXiv:2206.00169 (cross-list from cs.LG) [pdf, other]: Title: Discovering the Hidden Vocabulary of DALLE-2

Authors: Giannis Daras, Alexandros G. Dimakis

Comments: 6 pages, 4 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1114] arXiv:2206.00266 (cross-list from cs.RO) [pdf, other]: Title: PaGO-LOAM: Robust Ground-Optimized LiDAR Odometry

Authors: Dong-Uk Seo, Hyungtae Lim, Seungjae Lee, Hyun Myung

Comments: 7 pages, 5 figures, conference

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1115] arXiv:2206.00380 (cross-list from cs.LG) [pdf, other]: Title: Strongly Augmented Contrastive Clustering

Authors: Xiaozhi Deng, Dong Huang, Ding-Hua Chen, Chang-Dong Wang, Jian-Huang Lai

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1116] arXiv:2206.00393 (cross-list from cs.SD) [pdf, other]: Title: Towards Generalisable Audio Representations for Audio-Visual Navigation

Authors: Shunqi Mao, Chaoyi Zhang, Heng Wang, Weidong Cai

Comments: CVPR 2022 Embodied AI Workshop

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO); Audio and Speech Processing (eess.AS)
[1117] arXiv:2206.00432 (cross-list from cs.RO) [pdf, ps, other]: Title: Evaluating Gaussian Grasp Maps for Generative Grasping Models

Authors: William Prew, Toby P. Breckon, Magnus Bordewich, Ulrik Beierholm

Comments: 9 pages, 6 figures, to be published in IJCNN 2022

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1118] arXiv:2206.00471 (cross-list from cs.LG) [pdf, other]: Title: Augmentation Component Analysis: Modeling Similarity via the Augmentation Overlaps

Authors: Lu Han, Han-Jia Ye, De-Chuan Zhan

Comments: Accepted to ICLR 2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1119] arXiv:2206.00606 (cross-list from cs.LG) [pdf, other]: Title: Topological Deep Learning: Going Beyond Graph Data

Authors: Mustafa Hajij, Ghada Zamzmi, Theodore Papamarkou, Nina Miolane, Aldo Guzmán-Sáenz, Karthikeyan Natesan Ramamurthy, Tolga Birdal, Tamal K. Dey, Soham Mukherjee, Shreyas N. Samaga, Neal Livesay, Robin Walters, Paul Rosen, Michael T. Schaub

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Social and Information Networks (cs.SI); Algebraic Topology (math.AT); Machine Learning (stat.ML)
[1120] arXiv:2206.00621 (cross-list from cs.CL) [pdf, other]: Title: Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training

Authors: Yan Zeng, Wangchunshu Zhou, Ao Luo, Ziming Cheng, Xinsong Zhang

Comments: ACL 2023

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1121] arXiv:2206.00719 (cross-list from cs.LG) [pdf, other]: Title: Dataset Distillation using Neural Feature Regression

Authors: Yongchao Zhou, Ehsan Nezhadarya, Jimmy Ba

Comments: NeurIPS 2022 camera-ready version

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1122] arXiv:2206.00785 (cross-list from cs.DL) [pdf, other]: Title: Delivering Document Conversion as a Cloud Service with High Throughput and Responsiveness

Authors: Christoph Auer (1), Michele Dolfi (1), André Carvalho (2), Cesar Berrospi Ramis (1), Peter W. J. Staar (1) ((1) IBM Research, (2) SoftINSA Lda.)

Comments: 11 pages, 7 figures, to be published in IEEE CLOUD 2022

Subjects: Digital Libraries (cs.DL); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[1123] arXiv:2206.00809 (cross-list from cs.MM) [pdf, other]: Title: Distilling Knowledge from Object Classification to Aesthetics Assessment

Authors: Jingwen Hou, Henghui Ding, Weisi Lin, Weide Liu, Yuming Fang

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1124] arXiv:2206.00843 (cross-list from cs.LG) [pdf, other]: Title: DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks

Authors: Yonggan Fu, Haichuan Yang, Jiayi Yuan, Meng Li, Cheng Wan, Raghuraman Krishnamoorthi, Vikas Chandra, Yingyan Lin

Comments: Accepted at ICML 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1125] arXiv:2206.00845 (cross-list from cs.LG) [pdf, other]: Title: Hyperspherical Consistency Regularization

Authors: Cheng Tan, Zhangyang Gao, Lirong Wu, Siyuan Li, Stan Z. Li

Comments: Accepted by CVPR 2022

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1126] arXiv:2206.00913 (cross-list from cs.LG) [pdf, other]: Title: Improving the Robustness and Generalization of Deep Neural Network with Confidence Threshold Reduction

Authors: Xiangyuan Yang, Jie Lin, Hanlin Zhang, Xinyu Yang, Peng Zhao

Comments: Under review

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1127] arXiv:2206.00941 (cross-list from cs.LG) [pdf, other]: Title: Improving Diffusion Models for Inverse Problems using Manifold Constraints

Authors: Hyungjin Chung, Byeongsu Sim, Dohoon Ryu, Jong Chul Ye

Comments: NeurIPS 2022 camera-ready; 29 pages, 16 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1128] arXiv:2206.00944 (cross-list from cs.LG) [pdf, other]: Title: Feature Space Particle Inference for Neural Network Ensembles

Authors: Shingo Yashima, Teppei Suzuki, Kohta Ishikawa, Ikuro Sato, Rei Kawakami

Comments: ICML2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1129] arXiv:2206.00991 (cross-list from cs.RO) [pdf, ps, other]: Title: StopNet: Scalable Trajectory and Occupancy Prediction for Urban Autonomous Driving

Authors: Jinkyu Kim, Reza Mahjourian, Scott Ettinger, Mayank Bansal, Brandyn White, Ben Sapp, Dragomir Anguelov

Journal-ref: IEEE International Conference on Robotics and Automation 2022

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1130] arXiv:2206.01002 (cross-list from cs.LG) [pdf, other]: Title: Introducing One Sided Margin Loss for Solving Classification Problems in Deep Networks

Authors: Ali Karimi, Zahra Mousavi Kouzehkanan, Reshad Hosseini, Hadi Asheri

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1131] arXiv:2206.01094 (cross-list from cs.MM) [pdf, ps, other]: Title: A DTCWT-SVD Based Video Watermarking resistant to frame rate conversion

Authors: Yifei Wang, Qichao Ying, Zhenxing Qian, Sheng Li, Xinpeng Zhang

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1132] arXiv:2206.01178 (cross-list from cs.LG) [pdf, other]: Title: Discretization Invariant Networks for Learning Maps between Neural Fields

Authors: Clinton J. Wang, Polina Golland

Comments: Published in Transactions on Machine Learning Research 2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1133] arXiv:2206.01197 (cross-list from cs.LG) [pdf, other]: Title: Hard Negative Sampling Strategies for Contrastive Representation Learning

Authors: Afrina Tabassum, Muntasir Wahed, Hoda Eldardiry, Ismini Lourentzou

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1134] arXiv:2206.01251 (cross-list from cs.LG) [pdf, other]: Title: Using Representation Expressiveness and Learnability to Evaluate Self-Supervised Learning Methods

Authors: Yuchen Lu, Zhen Liu, Aristide Baratin, Romain Laroche, Aaron Courville, Alessandro Sordoni

Journal-ref: TMLR 2023 -- Transactions on Machine Learning Research, 11/2023

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1135] arXiv:2206.01366 (cross-list from cs.LG) [pdf, other]: Title: Supernet Training for Federated Image Classification under System Heterogeneity

Authors: Taehyeon Kim, Se-Young Yun

Comments: Oral paper at the ICML 2022 Workshop "Dynamic Neural Networks"; under review

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1136] arXiv:2206.01382 (cross-list from cs.DS) [pdf, ps, other]: Title: Falconn++: A Locality-sensitive Filtering Approach for Approximate Nearest Neighbor Search

Authors: Ninh Pham, Tao Liu

Comments: To appear in NeurIPS 2022

Subjects: Data Structures and Algorithms (cs.DS); Computer Vision and Pattern Recognition (cs.CV)
[1137] arXiv:2206.01612 (cross-list from cs.LG) [pdf, other]: Title: OmniXAI: A Library for Explainable AI

Authors: Wenzhuo Yang, Hung Le, Tanmay Laud, Silvio Savarese, Steven C.H. Hoi

Comments: Github repo: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1138] arXiv:2206.01634 (cross-list from cs.LG) [pdf, other]: Title: Reinforcement Learning with Neural Radiance Fields

Authors: Danny Driess, Ingmar Schubert, Pete Florence, Yunzhu Li, Marc Toussaint

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1139] arXiv:2206.01690 (cross-list from cs.LG) [pdf, other]: Title: Dynamic Kernel Selection for Improved Generalization and Memory Efficiency in Meta-learning

Authors: Arnav Chavan, Rishabh Tiwari, Udbhav Bamba, Deepak K. Gupta

Comments: Published at CVPR 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1140] arXiv:2206.01829 (cross-list from cs.LG) [pdf, other]: Title: Drawing out of Distribution with Neuro-Symbolic Generative Models

Authors: Yichao Liang, Joshua B. Tenenbaum, Tuan Anh Le, N. Siddharth

Comments: Preprint. Under review. 25 pages

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Symbolic Computation (cs.SC)
[1141] arXiv:2206.01898 (cross-list from cs.LG) [pdf, other]: Title: Saliency Attack: Towards Imperceptible Black-box Adversarial Attack

Authors: Zeyu Dai, Shengcai Liu, Ke Tang, Qing Li

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1142] arXiv:2206.02102 (cross-list from cs.LG) [pdf, other]: Title: AUTM Flow: Atomic Unrestricted Time Machine for Monotonic Normalizing Flows

Authors: Difeng Cai, Yuliang Ji, Huan He, Qiang Ye, Yuanzhe Xi

Comments: 20 pages, 3 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
[1143] arXiv:2206.02131 (cross-list from cs.LG) [pdf, other]: Title: Federated Adversarial Training with Transformers

Authors: Ahmed Aldahdooh, Wassim Hamidouche, Olivier Déforges

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1144] arXiv:2206.02183 (cross-list from cs.LG) [pdf, other]: Title: Functional Ensemble Distillation

Authors: Coby Penso, Idan Achituve, Ethan Fetaya

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1145] arXiv:2206.02284 (cross-list from cs.SD) [pdf, other]: Title: Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator

Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Jiachen Zhuo, Maureen Stone, Georges El Fakhri, Jonghye Woo

Comments: MICCAI 2022 (early accept, Oral Presentation ~3%)

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1146] arXiv:2206.02286 (cross-list from cs.LG) [pdf, other]: Title: AugLoss: A Robust Augmentation-based Fine Tuning Methodology

Authors: Kyle Otstot, Andrew Yang, John Kevin Cava, Lalitha Sankar

Comments: 10 pages, 6 figures, 6 tables

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1147] arXiv:2206.02353 (cross-list from cs.LG) [pdf, other]: Title: Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal Data

Authors: Shohreh Deldari, Hao Xue, Aaqib Saeed, Jiayuan He, Daniel V. Smith, Flora D. Salim

Comments: 36 pages, 5 figures, 9 tables, Survey paper

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1148] arXiv:2206.02409 (cross-list from cs.AI) [pdf, other]: Title: Is More Data All You Need? A Causal Exploration

Authors: Athanasios Vlontzos, Hadrien Reynaud, Bernhard Kainz

Comments: 10 pages

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1149] arXiv:2206.02574 (cross-list from cs.LG) [pdf, other]: Title: On the duality between contrastive and non-contrastive self-supervised learning

Authors: Quentin Garrido (FAIR, LIGM), Yubei Chen (FAIR), Adrien Bardes (FAIR, WILLOW), Laurent Najman (LIGM), Yann LeCun (FAIR, CIMS)

Comments: The Eleventh International Conference on Learning Representations, 2023, Kigali, Rwanda

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1150] arXiv:2206.02659 (cross-list from cs.LG) [pdf, other]: Title: Robust Fine-Tuning of Deep Neural Networks with Hessian-based Generalization Guarantees

Authors: Haotian Ju, Dongyue Li, Hongyang R. Zhang

Comments: 38 pages. Appeared in ICML 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Statistics Theory (math.ST); Machine Learning (stat.ML)
[1151] arXiv:2206.02671 (cross-list from cs.SD) [pdf, ps, other]: Title: Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Audio-Visual Hearing Aids

Authors: Leandro A. Passos, João Paulo Papa, Amir Hussain, Ahsan Adeel

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1152] arXiv:2206.02792 (cross-list from cs.LG) [pdf, other]: Title: FIFA: Making Fairness More Generalizable in Classifiers Trained on Imbalanced Data

Authors: Zhun Deng, Jiayao Zhang, Linjun Zhang, Ting Ye, Yates Coley, Weijie J. Su, James Zou

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (stat.ML)
[1153] arXiv:2206.02840 (cross-list from cs.RO) [pdf, other]: Title: Spatial Acoustic Projection for 3D Imaging Sonar Reconstruction

Authors: Sascha Arnold, Bilal Wehbe

Comments: Preprint

Journal-ref: IEEE International Conference on Robotics and Automation (ICRA) 2022

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1154] arXiv:2206.02881 (cross-list from cs.RO) [pdf, other]: Title: Mesh-based Dynamics with Occlusion Reasoning for Cloth Manipulation

Authors: Zixuan Huang, Xingyu Lin, David Held

Comments: RSS 2022. Project website: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1155] arXiv:2206.02916 (cross-list from cs.LG) [pdf, other]: Title: Remember the Past: Distilling Datasets into Addressable Memories for Neural Networks

Authors: Zhiwei Deng, Olga Russakovsky

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1156] arXiv:2206.02958 (cross-list from cs.LG) [pdf, other]: Title: Saliency Cards: A Framework to Characterize and Compare Saliency Methods

Authors: Angie Boggust, Harini Suresh, Hendrik Strobelt, John V. Guttag, Arvind Satyanarayan

Comments: Published at FAccT 2023, 19 pages, 8 figures, 2 tables

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1157] arXiv:2206.03083 (cross-list from cs.RO) [pdf, other]: Title: Pushing the Limits of Learning-based Traversability Analysis for Autonomous Driving on CPU

Authors: Daniel Fusaro, Emilio Olivastri, Daniele Evangelista, Marco Imperoli, Emanuele Menegatti, Alberto Pretto

Comments: Accepted to 17th International Conference on Intelligent Autonomous Systems (IAS-17)

Journal-ref: Proceedings of the 17th International Conference on Intelligent Autonomous Systems (IAS 2022)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1158] arXiv:2206.03271 (cross-list from cs.LG) [pdf, other]: Title: On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning

Authors: Zhao Mandi, Pieter Abbeel, Stephen James

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1159] arXiv:2206.03354 (cross-list from cs.CL) [pdf, other]: Title: cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation

Authors: Kshitij Gupta, Devansh Gautam, Radhika Mamidi

Comments: Accepted at ICPR 2022; 9 pages

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1160] arXiv:2206.03380 (cross-list from cs.GR) [pdf, other]: Title: Shape, Light, and Material Decomposition from Images using Monte Carlo Rendering and Denoising

Authors: Jon Hasselgren, Nikolai Hofmann, Jacob Munkberg

Comments: Project website: this https URL

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1161] arXiv:2206.03382 (cross-list from cs.DC) [pdf, other]: Title: Tutel: Adaptive Mixture-of-Experts at Scale

Authors: Changho Hwang, Wei Cui, Yifan Xiong, Ziyue Yang, Ze Liu, Han Hu, Zilong Wang, Rafael Salas, Jithin Jose, Prabhat Ram, Joe Chau, Peng Cheng, Fan Yang, Mao Yang, Yongqiang Xiong

Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1162] arXiv:2206.03398 (cross-list from cs.LG) [pdf, other]: Title: Towards a General Purpose CNN for Long Range Dependencies in $N$D

Authors: David W. Romero, David M. Knigge, Albert Gu, Erik J. Bekkers, Efstratios Gavves, Jakub M. Tomczak, Mark Hoogendoorn

Comments: First two authors contributed equally to this work

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1163] arXiv:2206.03430 (cross-list from cs.RO) [pdf, other]: Title: Robot Self-Calibration Using Actuated 3D Sensors

Authors: Arne Peters

Comments: 15 pages, 9 figures

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1164] arXiv:2206.03491 (cross-list from cs.AI) [pdf, other]: Title: EiX-GNN: Concept-level eigencentrality explainer for graph neural networks

Authors: Adrien Raison (XLIM-ASALI), Pascal Bourdon (XLIM-ASALI), David Helbert (XLIM-ASALI)

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1165] arXiv:2206.03583 (cross-list from cs.CR) [pdf, other]: Title: Contributor-Aware Defenses Against Adversarial Backdoor Attacks

Authors: Glenn Dawson, Muhammad Umer, Robi Polikar

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1166] arXiv:2206.03584 (cross-list from cs.CR) [pdf, ps, other]: Title: White-box Membership Attack Against Machine Learning Based Retinopathy Classification

Authors: Mounia Hamidouche, Reda Bellafqira, Gwenolé Quellec, Gouenou Coatrieux

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1167] arXiv:2206.03596 (cross-list from cs.LG) [pdf, other]: Title: Neural Network Compression via Effective Filter Analysis and Hierarchical Pruning

Authors: Ziqi Zhou, Li Lian, Yilong Yin, Ze Wang

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1168] arXiv:2206.03739 (cross-list from cs.AI) [pdf, other]: Title: Disentangled Ontology Embedding for Zero-shot Learning

Authors: Yuxia Geng, Jiaoyan Chen, Wen Zhang, Yajing Xu, Zhuo Chen, Jeff Z. Pan, Yufeng Huang, Feiyu Xiong, Huajun Chen

Comments: Accepted by KDD'22

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1169] arXiv:2206.03826 (cross-list from cs.LG) [pdf, other]: Title: Towards Understanding Why Mask-Reconstruction Pretraining Helps in Downstream Tasks

Authors: Jiachun Pan, Pan Zhou, Shuicheng Yan

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[1170] arXiv:2206.04006 (cross-list from cs.SD) [pdf, other]: Title: Few-Shot Audio-Visual Learning of Environment Acoustics

Authors: Sagnik Majumder, Changan Chen, Ziad Al-Halah, Kristen Grauman

Comments: Accepted to NeurIPS 2022

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1171] arXiv:2206.04016 (cross-list from cs.NE) [pdf, other]: Title: SYNERgy between SYNaptic consolidation and Experience Replay for general continual learning

Authors: Fahad Sarfraz, Elahe Arani, Bahram Zonooz

Comments: Accepted at 1st Conference on Lifelong Learning Agents (CoLLAs 2022)

Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1172] arXiv:2206.04129 (cross-list from cs.RO) [pdf, other]: Title: Receding Moving Object Segmentation in 3D LiDAR Data Using Sparse 4D Convolutions

Authors: Benedikt Mersch, Xieyuanli Chen, Ignacio Vizzo, Lucas Nunes, Jens Behley, Cyrill Stachniss

Comments: Accepted for RA-L

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1173] arXiv:2206.04310 (cross-list from cs.LG) [pdf, other]: Title: GSmooth: Certified Robustness against Semantic Transformations via Generalized Randomized Smoothing

Authors: Zhongkai Hao, Chengyang Ying, Yinpeng Dong, Hang Su, Jun Zhu, Jian Song

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1174] arXiv:2206.04318 (cross-list from cs.MM) [pdf, other]: Title: Blind Surveillance Image Quality Assessment via Deep Neural Network Combined with the Visual Saliency

Authors: Wei Lu, Wei Sun, Wenhan Zhu, Xiongkuo Min, Zicheng Zhang, Tao Wang, Guangtao Zhai

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1175] arXiv:2206.04363 (cross-list from cs.MM) [pdf, other]: Title: Deep Neural Network for Blind Visual Quality Assessment of 4K Content

Authors: Wei Lu, Wei Sun, Xiongkuo Min, Wenhan Zhu, Quan Zhou, Jun He, Qiyuan Wang, Zicheng Zhang, Tao Wang, Guangtao Zhai

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1176] arXiv:2206.04459 (cross-list from cs.LG) [pdf, other]: Title: SDQ: Stochastic Differentiable Quantization with Mixed Precision

Authors: Xijie Huang, Zhiqiang Shen, Shichao Li, Zechun Liu, Xianghong Hu, Jeffry Wicaksana, Eric Xing, Kwang-Ting Cheng

Comments: ICML 2022

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1177] arXiv:2206.04523 (cross-list from cs.CL) [pdf, other]: Title: Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos

Authors: Alexander Waibel, Moritz Behr, Fevziye Irem Eyiokur, Dogucan Yaman, Tuan-Nam Nguyen, Carlos Mullov, Mehmet Arif Demirtas, Alperen Kantarcı, Stefan Constantin, Hazım Kemal Ekenel

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
[1178] arXiv:2206.04530 (cross-list from cs.LG) [pdf, other]: Title: DORA: Exploring Outlier Representations in Deep Neural Networks

Authors: Kirill Bykov, Mayukh Deb, Dennis Grinwald, Klaus-Robert Müller, Marina M.-C. Höhne

Comments: 24 pages, 18 figures

Journal-ref: Published in Transactions on Machine Learning Research (06/2023)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1179] arXiv:2206.04625 (cross-list from cs.LG) [pdf, other]: Title: AttX: Attentive Cross-Connections for Fusion of Wearable Signals in Emotion Recognition

Authors: Anubhav Bhatti, Behnam Behinaein, Paul Hungler, Ali Etemad

Comments: 13 pages, 8 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1180] arXiv:2206.04676 (cross-list from cs.LG) [pdf, other]: Title: Extending Momentum Contrast with Cross Similarity Consistency Regularization

Authors: Mehdi Seyfi, Amin Banitalebi-Dehkordi, Yong Zhang

Comments: IEEE Transactions on Circuits and Systems for Video Technology

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1181] arXiv:2206.04677 (cross-list from cs.CR) [pdf, other]: Title: On the Permanence of Backdoors in Evolving Models

Authors: Huiying Li, Arjun Nitin Bhagoji, Yuxin Chen, Haitao Zheng, Ben Y. Zhao

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1182] arXiv:2206.04679 (cross-list from cs.LG) [pdf, other]: Title: POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples

Authors: Duong H. Le, Khoi D. Nguyen, Khoi Nguyen, Quoc-Huy Tran, Rang Nguyen, Binh-Son Hua

Comments: Accepted at NeurIPS 2021 (First two authors contribute equally)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1183] arXiv:2206.04756 (cross-list from cs.LG) [pdf, other]: Title: An Empirical Study on Disentanglement of Negative-free Contrastive Learning

Authors: Jinkun Cao, Ruiqian Nai, Qing Yang, Jialei Huang, Yang Gao

Comments: Accepted to NeurIPS 2022; 10 pages main text + 15 pages appendix

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1184] arXiv:2206.04776 (cross-list from cs.LG) [pdf, other]: Title: What should AI see? Using the Public's Opinion to Determine the Perception of an AI

Authors: Robin Chan, Radin Dardashti, Meike Osinski, Matthias Rottmann, Dominik Brüggemann, Cilia Rücker, Peter Schlicht, Fabian Hüger, Nikol Rummel, Hanno Gottschalk

Comments: 26 pages, 12 figures

Journal-ref: AI and Ethics (2023)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[1185] arXiv:2206.04779 (cross-list from cs.LG) [pdf, other]: Title: Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

Authors: Cong Lu, Philip J. Ball, Tim G. J. Rudner, Jack Parker-Holder, Michael A. Osborne, Yee Whye Teh

Comments: Published at TMLR, 2023

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1186] arXiv:2206.04881 (cross-list from cs.CR) [pdf, other]: Title: Enhancing Clean Label Backdoor Attack with Two-phase Specific Triggers

Authors: Nan Luo, Yuanzhang Li, Yajie Wang, Shangbo Wu, Yu-an Tan, Quanxin Zhang

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1187] arXiv:2206.04888 (cross-list from cs.MM) [pdf, other]: Title: AntPivot: Livestream Highlight Detection via Hierarchical Attention Mechanism

Authors: Yang Zhao, Xuan Lin, Wenqiang Xu, Maozong Zheng, Zhengyong Liu, Zhou Zhao

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1188] arXiv:2206.05008 (cross-list from cs.GR) [pdf, other]: Title: Subjective Quality Assessment for Images Generated by Computer Graphics

Authors: Tao Wang, Zicheng Zhang, Wei Sun, Xiongkuo Min, Wei Lu, Guangtao Zhai

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1189] arXiv:2206.05093 (cross-list from cs.LG) [pdf, other]: Title: Federated Momentum Contrastive Clustering

Authors: Runxuan Miao, Erdem Koyuncu

Comments: Originally submitted March 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1190] arXiv:2206.05263 (cross-list from cs.LG) [pdf, other]: Title: Causal Balancing for Domain Generalization

Authors: Xinyi Wang, Michael Saxon, Jiachen Li, Hongyang Zhang, Kun Zhang, William Yang Wang

Comments: Published at ICLR 2023

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1191] arXiv:2206.05266 (cross-list from cs.LG) [pdf, other]: Title: Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?

Authors: Xiang Li, Jinghuan Shang, Srijan Das, Michael S. Ryoo

Comments: NeurIPS 2022. Code for ELo-SACv3 is at this https URL and code for ELo-Rainbow is at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1192] arXiv:2206.05323 (cross-list from cs.LG) [pdf, other]: Title: Memory Classifiers: Two-stage Classification for Robustness in Machine Learning

Authors: Souradeep Dutta, Yahan Yang, Elena Bernardis, Edgar Dobriban, Insup Lee

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1193] arXiv:2206.05344 (cross-list from cs.GR) [pdf, other]: Title: Differentiable Rendering of Neural SDFs through Reparameterization

Authors: Sai Praveen Bangaru, Michaël Gharbi, Tzu-Mao Li, Fujun Luan, Kalyan Sunkavalli, Miloš Hašan, Sai Bi, Zexiang Xu, Gilbert Bernstein, Frédo Durand

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1194] arXiv:2206.05365 (cross-list from cs.LG) [pdf, ps, other]: Title: Object Detection, Recognition, Deep Learning, and the Universal Law of Generalization

Authors: Faris B. Rustom, Haluk Öğmen, Arash Yazdanbakhsh

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1195] arXiv:2206.05400 (cross-list from cs.RO) [pdf, ps, other]: Title: High-Definition Map Generation Technologies For Autonomous Driving

Authors: Zhibin Bao, Sabir Hossain, Haoxiang Lang, Xianke Lin

Comments: 25 pages, 17 figures, submitted to a journal

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1196] arXiv:2206.05555 (cross-list from cs.CL) [pdf, other]: Title: A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-training

Authors: Zhihao Fan, Zhongyu Wei, Jingjing Chen, Siyuan Wang, Zejun Li, Jiarong Xu, Xuanjing Huang

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1197] arXiv:2206.05625 (cross-list from cs.AI) [pdf, ps, other]: Title: Exploring the Intersection between Neural Architecture Search and Continual Learning

Authors: Mohamed Shahawy, Elhadj Benkhelifa, David White

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1198] arXiv:2206.05649 (cross-list from cs.GR) [pdf, other]: Title: TileGen: Tileable, Controllable Material Generation and Capture

Authors: Xilong Zhou, Miloš Hašan, Valentin Deschaintre, Paul Guerrero, Kalyan Sunkavalli, Nima Kalantari

Comments: 18 pages, 19 figures

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1199] arXiv:2206.05687 (cross-list from cs.HC) [pdf, other]: Title: DRNet: Decomposition and Reconstruction Network for Remote Physiological Measurement

Authors: Yuhang Dong, Gongping Yang, Yilong Yin

Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1200] arXiv:2206.05751 (cross-list from cs.LG) [pdf, other]: Title: Consistent Attack: Universal Adversarial Perturbation on Embodied Vision Navigation

Authors: Chengyang Ying, You Qiaoben, Xinning Zhou, Hang Su, Wenbo Ding, Jianyong Ai

Journal-ref: Pattern Recognition Letters (PRL), 2023

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1201] arXiv:2206.05859 (cross-list from cs.LG) [pdf, ps, other]: Title: A Directed-Evolution Method for Sparsification and Compression of Neural Networks with Application to Object Identification and Segmentation and considerations of optimal quantization using small number of bits

Authors: Luiz M Franca-Neto

Comments: 12 pages total, 5 figures, 2 appendices

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1202] arXiv:2206.05893 (cross-list from cs.LG) [pdf, other]: Title: Deploying Convolutional Networks on Untrusted Platforms Using 2D Holographic Reduced Representations

Authors: Mohammad Mahmudul Alam, Edward Raff, Tim Oates, James Holt

Comments: To appear in the Proceedings of the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1203] arXiv:2206.05930 (cross-list from cs.LG) [pdf, other]: Title: Faster Optimization-Based Meta-Learning Adaptation Phase

Authors: Kostiantyn Khabarlak

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1204] arXiv:2206.06173 (cross-list from eess.SY) [pdf, other]: Title: LiVeR: Lightweight Vehicle Detection and Classification in Real-Time

Authors: Chandra Shekhar, Jagnyashini Debadarshini, Sudipta Saha

Subjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV)
[1205] arXiv:2206.06273 (cross-list from cs.CG) [pdf, other]: Title: Learning Joint Surface Atlases

Authors: Theo Deprelle, Thibault Groueix, Noam Aigerman, Vladimir G. Kim, Mathieu Aubry

Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[1206] arXiv:2206.06489 (cross-list from cs.AI) [pdf, other]: Title: BEHAVIOR in Habitat 2.0: Simulator-Independent Logical Task Description for Benchmarking Embodied AI Agents

Authors: Ziang Liu, Roberto Martín-Martín, Fei Xia, Jiajun Wu, Li Fei-Fei

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1207] arXiv:2206.06522 (cross-list from cs.CL) [pdf, other]: Title: LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning

Authors: Yi-Lin Sung, Jaemin Cho, Mohit Bansal

Comments: NeurIPS 2022 (our code is available at: this https URL)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1208] arXiv:2206.06553 (cross-list from cs.RO) [pdf, other]: Title: Safe Output Feedback Motion Planning from Images via Learned Perception Modules and Contraction Theory

Authors: Glen Chou, Necmiye Ozay, Dmitry Berenson

Comments: Workshop on the Algorithmic Foundations of Robotics (WAFR) XV, 2022, College Park, MD, USA

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[1209] arXiv:2206.06577 (cross-list from cs.GR) [pdf, other]: Title: Physics Informed Neural Fields for Smoke Reconstruction with Sparse Data

Authors: Mengyu Chu, Lingjie Liu, Quan Zheng, Erik Franz, Hans-Peter Seidel, Christian Theobalt, Rhaleb Zayer

Comments: accepted to ACM Transactions On Graphics (SIGGRAPH 2022), further info: this https URL

Journal-ref: ACM Trans. Graph. 41, 4 (2022), 119:1-119:14

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1210] arXiv:2206.06662 (cross-list from cs.LG) [pdf, other]: Title: Learning Best Combination for Efficient N:M Sparsity

Authors: Yuxin Zhang, Mingbao Lin, Zhihang Lin, Yiting Luo, Ke Li, Fei Chao, Yongjian Wu, Rongrong Ji

Comments: Accepted by 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1211] arXiv:2206.06737 (cross-list from cs.LG) [pdf, other]: Title: Adversarial Vulnerability of Randomized Ensembles

Authors: Hassan Dbouk, Naresh R. Shanbhag

Comments: Published as a conference paper in ICML 2022

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1212] arXiv:2206.06854 (cross-list from cs.AI) [pdf, other]: Title: On the explainable properties of 1-Lipschitz Neural Networks: An Optimal Transport Perspective

Authors: Mathieu Serrurier (IRIT-ADRIA, UT), Franck Mamalet (UT), Thomas Fel (UT), Louis Béthune (UT3, UT, IRIT-ADRIA), Thibaut Boissin (UT)

Journal-ref: Conference on Neural Information Processing Systems (NeurIPS), Neural Information Processing Systems Foundation, Dec 2023, New Orleans (Louisiana), United States

Subjects: Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1213] arXiv:2206.06994 (cross-list from cs.AI) [pdf, other]: Title: ProcTHOR: Large-Scale Embodied AI Using Procedural Generation

Authors: Matt Deitke, Eli VanderBilt, Alvaro Herrasti, Luca Weihs, Jordi Salvador, Kiana Ehsani, Winson Han, Eric Kolve, Ali Farhadi, Aniruddha Kembhavi, Roozbeh Mottaghi

Comments: ProcTHOR website: this https URL

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1214] arXiv:2206.07081 (cross-list from cs.LG) [pdf, ps, other]: Title: Applications of Generative Adversarial Networks in Neuroimaging and Clinical Neuroscience

Authors: Rongguang Wang, Vishnu Bashyam, Zhijian Yang, Fanyang Yu, Vasiliki Tassopoulou, Sai Spandana Chintapalli, Ioanna Skampardoni, Lasya P. Sreepada, Dushyant Sahoo, Konstantina Nikita, Ahmed Abdulkadir, Junhao Wen, Christos Davatzikos

Journal-ref: NeuroImage 269:119898 (2023)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1215] arXiv:2206.07136 (cross-list from cs.LG) [pdf, other]: Title: Automatic Clipping: Differentially Private Deep Learning Made Easier and Stronger

Authors: Zhiqi Bu, Yu-Xiang Wang, Sheng Zha, George Karypis

Comments: accepted to NeurIPS 2023

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1216] arXiv:2206.07137 (cross-list from cs.LG) [pdf, other]: Title: Prioritized Training on Points that are Learnable, Worth Learning, and Not Yet Learnt

Authors: Sören Mindermann, Jan Brauner, Muhammed Razzak, Mrinank Sharma, Andreas Kirsch, Winnie Xu, Benedikt Höltgen, Aidan N. Gomez, Adrien Morisot, Sebastian Farquhar, Yarin Gal

Comments: ICML 2022

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1217] arXiv:2206.07148 (cross-list from cs.MM) [pdf, other]: Title: It's Time for Artistic Correspondence in Music and Video

Authors: Didac Suris, Carl Vondrick, Bryan Russell, Justin Salamon

Comments: CVPR 2022

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1218] arXiv:2206.07155 (cross-list from cs.LG) [pdf, other]: Title: Self-Supervision on Images and Text Reduces Reliance on Visual Shortcut Features

Authors: Anil Palepu, Andrew L Beam

Comments: 4 pages, 2 figures, spotlight talk at SCIS workshop, ICML 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1219] arXiv:2206.07173 (cross-list from cs.CY) [pdf, other]: Title: Measuring Representational Harms in Image Captioning

Authors: Angelina Wang, Solon Barocas, Kristen Laird, Hanna Wallach

Comments: ACM Conference on Fairness, Accountability, and Transparency (FAccT) 2022

Subjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV)
[1220] arXiv:2206.07179 (cross-list from cs.LG) [pdf, other]: Title: Proximal Splitting Adversarial Attacks for Semantic Segmentation

Authors: Jérôme Rony, Jean-Christophe Pesquet, Ismail Ben Ayed

Comments: CVPR 2023. Code available at: this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1221] arXiv:2206.07260 (cross-list from cs.LG) [pdf, other]: Title: On Enforcing Better Conditioned Meta-Learning for Rapid Few-Shot Adaptation

Authors: Markus Hiller, Mehrtash Harandi, Tom Drummond

Comments: Accepted at NeurIPS 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1222] arXiv:2206.07290 (cross-list from cs.LG) [pdf, other]: Title: Differentiable Top-k Classification Learning

Authors: Felix Petersen, Hilde Kuehne, Christian Borgelt, Oliver Deussen

Comments: Published at ICML 2022, Code @ this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1223] arXiv:2206.07387 (cross-list from cs.LG) [pdf, other]: Title: The Manifold Hypothesis for Gradient-Based Explanations

Authors: Sebastian Bordt, Uddeshya Upadhyay, Zeynep Akata, Ulrike von Luxburg

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1224] arXiv:2206.07538 (cross-list from cs.RO) [pdf, other]: Title: Body Gesture Recognition to Control a Social Robot

Authors: Javier Laplaza, Joan Jaume Oliver, Ramón Romero, Alberto Sanfeliu, Anaís Garrell

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[1225] arXiv:2206.07736 (cross-list from cs.LG) [pdf, other]: Title: Improving Diversity with Adversarially Learned Transformations for Domain Generalization

Authors: Tejas Gokhale, Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Chitta Baral, Yezhou Yang

Comments: WACV 2023. Code: this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1226] arXiv:2206.07741 (cross-list from cs.LG) [pdf, other]: Title: Edge Inference with Fully Differentiable Quantized Mixed Precision Neural Networks

Authors: Clemens JS Schaefer, Siddharth Joshi, Shan Li, Raul Blazquez

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1227] arXiv:2206.07758 (cross-list from cs.LG) [pdf, other]: Title: Reconstructing Training Data from Trained Neural Networks

Authors: Niv Haim, Gal Vardi, Gilad Yehudai, Ohad Shamir, Michal Irani

Comments: Fixed a typo in the acknowledgements

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[1228] arXiv:2206.07795 (cross-list from cs.LG) [pdf, other]: Title: On Calibrated Model Uncertainty in Deep Learning

Authors: Biraja Ghoshal, Allan Tucker

Comments: The European Conference on Machine Learning (ECML PKDD 2020). arXiv admin note: text overlap with arXiv:2103.11214

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1229] arXiv:2206.07898 (cross-list from cs.AI) [pdf, other]: Title: Multimodal Dialogue State Tracking

Authors: Hung Le, Nancy F. Chen, Steven C.H. Hoi

Comments: Accepted at NAACL 2022 (Oral)

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1230] arXiv:2206.08010 (cross-list from cs.GR) [pdf, other]: Title: MoDi: Unconditional Motion Synthesis from Diverse Data

Authors: Sigal Raab, Inbal Leibovitch, Peizhuo Li, Kfir Aberman, Olga Sorkine-Hornung, Daniel Cohen-Or

Comments: Video: this https URL, Project page: this https URL, Code: this https URL

Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1231] arXiv:2206.08076 (cross-list from cs.HC) [pdf, other]: Title: Learning Effect of Lay People in Gesture-Based Locomotion in Virtual Reality

Authors: Alexander Schäfer, Gerd Reis, Didier Stricker

Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
[1232] arXiv:2206.08077 (cross-list from cs.RO) [pdf, other]: Title: Neural Scene Representation for Locomotion on Structured Terrain

Authors: David Hoeller, Nikita Rudin, Christopher Choy, Animashree Anandkumar, Marco Hutter

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1233] arXiv:2206.08138 (cross-list from cs.LG) [pdf, other]: Title: Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification

Authors: Adrian El Baz, Ihsan Ullah, Edesio Alcobaça, André C. P. L. F. Carvalho, Hong Chen, Fabio Ferreira, Henry Gouk, Chaoyu Guan, Isabelle Guyon, Timothy Hospedales, Shell Hu, Mike Huisman, Frank Hutter, Zhengying Liu, Felix Mohr, Ekrem Öztürk, Jan N. van Rijn, Haozhe Sun, Xin Wang, Wenwu Zhu

Comments: version 2 is the correct version, including supplementary material at the end

Journal-ref: NeurIPS 2021 Competition and Demonstration Track, Dec 2021, On-line, United States

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1234] arXiv:2206.08213 (cross-list from cs.LG) [pdf, other]: Title: A Closer Look at Smoothness in Domain Adversarial Training

Authors: Harsh Rangwani, Sumukh K Aithal, Mayank Mishra, Arihant Jain, R. Venkatesh Babu

Comments: ICML 2022. Code: this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1235] arXiv:2206.08242 (cross-list from cs.LG) [pdf, other]: Title: Catastrophic overfitting can be induced with discriminative non-robust features

Authors: Guillermo Ortiz-Jiménez, Pau de Jorge, Amartya Sanyal, Adel Bibi, Puneet K. Dokania, Pascal Frossard, Gregory Rogéz, Philip H.S. Torr

Comments: Published in Transactions on Machine Learning Research (TMLR)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1236] arXiv:2206.08255 (cross-list from cs.LG) [pdf, other]: Title: Gradient-Based Adversarial and Out-of-Distribution Detection

Authors: Jinsol Lee, Mohit Prabhushankar, Ghassan AlRegib

Comments: International Conference on Machine Learning (ICML) Workshop on New Frontiers in Adversarial Machine Learning, July 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1237] arXiv:2206.08312 (cross-list from cs.SD) [pdf, other]: Title: SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning

Authors: Changan Chen, Carl Schissler, Sanchit Garg, Philip Kobernik, Alexander Clegg, Paul Calamia, Dhruv Batra, Philip W Robinson, Kristen Grauman

Comments: Camera-ready version. Website: this https URL Project page: this https URL

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1238] arXiv:2206.08316 (cross-list from cs.LG) [pdf, other]: Title: Boosting the Adversarial Transferability of Surrogate Models with Dark Knowledge

Authors: Dingcheng Yang, Zihao Xiao, Wenjian Yu

Comments: Accepted at 2023 International Conference on Tools with Artificial Intelligence (ICTAI)

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1239] arXiv:2206.08422 (cross-list from cs.GR) [pdf, ps, other]: Title: Real-time motion amplification on mobile devices

Authors: Henning U. Voss

Comments: Supplemental data at this https URL Changes to v1: Inclusion of offline video processing

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1240] arXiv:2206.08476 (cross-list from cs.LG) [pdf, other]: Title: Zero-Shot AutoML with Pretrained Models

Authors: Ekrem Öztürk, Fabio Ferreira, Hadi S. Jomaa, Lars Schmidt-Thieme, Josif Grabocka, Frank Hutter

Journal-ref: International Conference on Machine Learning 2022

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1241] arXiv:2206.08497 (cross-list from cs.GR) [pdf, other]: Title: Unsupervised Kinematic Motion Detection for Part-segmented 3D Shape Collections

Authors: Xianghao Xu, Yifan Ruan, Srinath Sridhar, Daniel Ritchie

Comments: SIGGRAPH 2022

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1242] arXiv:2206.08517 (cross-list from cs.RO) [pdf, other]: Title: ECTLO: Effective Continuous-time Odometry Using Range Image for LiDAR with Small FoV

Authors: Xin Zheng, Jianke Zhu

Comments: 8 pages, 5 figures. Accepted for publication in the Proceedings of the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1243] arXiv:2206.08522 (cross-list from cs.RO) [pdf, other]: Title: VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation

Authors: Kaizhi Zheng, Xiaotong Chen, Odest Chadwicke Jenkins, Xin Eric Wang

Subjects: Robotics (cs.RO); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1244] arXiv:2206.08653 (cross-list from cs.LG) [pdf, other]: Title: All Mistakes Are Not Equal: Comprehensive Hierarchy Aware Multi-label Predictions (CHAMP)

Authors: Ashwin Vaswani, Gaurav Aggarwal, Praneeth Netrapalli, Narayan G Hegde

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1245] arXiv:2206.08684 (cross-list from cs.LG) [pdf, other]: Title: Sparse Double Descent: Where Network Pruning Aggravates Overfitting

Authors: Zheng He, Zeke Xie, Quanzhi Zhu, Zengchang Qin

Comments: ICML 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1246] arXiv:2206.08704 (cross-list from cs.LG) [pdf, other]: Title: Maximum Class Separation as Inductive Bias in One Matrix

Authors: Tejaswi Kasarla, Gertjan J. Burghouts, Max van Spengler, Elise van der Pol, Rita Cucchiara, Pascal Mettes

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1247] arXiv:2206.08802 (cross-list from cs.LG) [pdf, other]: Title: Open-Sampling: Exploring Out-of-Distribution data for Re-balancing Long-tailed datasets

Authors: Hongxin Wei, Lue Tao, Renchunzi Xie, Lei Feng, Bo An

Comments: Accepted by ICML 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1248] arXiv:2206.08826 (cross-list from cs.LG) [pdf, other]: Title: Multimodal Attention-based Deep Learning for Alzheimer's Disease Diagnosis

Authors: Michal Golovanevsky, Carsten Eickhoff, Ritambhara Singh

Comments: 11 pages, 5 figures

Journal-ref: Journal of the American Medical Informatics Association, 2022; ocac168

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1249] arXiv:2206.08842 (cross-list from cs.MM) [pdf, other]: Title: Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval

Authors: Xiao Dong, Xunlin Zhan, Yunchao Wei, Xiaoyong Wei, Yaowei Wang, Minlong Lu, Xiaochun Cao, Xiaodan Liang

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Information Retrieval (cs.IR)
[1250] arXiv:2206.08853 (cross-list from cs.LG) [pdf, other]: Title: MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Authors: Linxi Fan, Guanzhi Wang, Yunfan Jiang, Ajay Mandlekar, Yuncong Yang, Haoyi Zhu, Andrew Tang, De-An Huang, Yuke Zhu, Anima Anandkumar

Comments: Outstanding Paper Award at NeurIPS 2022. Project website: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1251] arXiv:2206.08869 (cross-list from cs.LG) [pdf, other]: Title: Fast Lossless Neural Compression with Integer-Only Discrete Flows

Authors: Siyu Wang, Jianfei Chen, Chongxuan Li, Jun Zhu, Bo Zhang

Comments: Accepted as a conference paper at International Conference on Machine Learning (ICML) 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[1252] arXiv:2206.08882 (cross-list from cs.MA) [pdf, other]: Title: Edge-Aided Sensor Data Sharing in Vehicular Communication Networks

Authors: Rui Song, Anupama Hegde, Numan Senel, Alois Knoll, Andreas Festag

Comments: Accepted for IEEE 95th Vehicular Technology Conference (VTC2022-Spring)

Subjects: Multiagent Systems (cs.MA); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1253] arXiv:2206.08890 (cross-list from cs.LG) [pdf, other]: Title: Disentangling Model Multiplicity in Deep Learning

Authors: Ari Heljakka, Martin Trapp, Juho Kannala, Arno Solin

Comments: 13 pages, 6 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1254] arXiv:2206.08965 (cross-list from cs.AI) [pdf, other]: Title: KitBit: A New AI Model for Solving Intelligence Tests and Numerical Series

Authors: Víctor Corsino, José Manuel Gilpérez, Luis Herrera

Comments: 11 pages

Journal-ref: Corsino, V., Gilperez, J. M., & Herrera, L. (2023). "KitBit: A New AI Model for Solving Intelligence Tests and Numerical Series." IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(11), 13893-13903

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1255] arXiv:2206.09012 (cross-list from cs.LG) [pdf, other]: Title: Diffusion models as plug-and-play priors

Authors: Alexandros Graikos, Nikolay Malkin, Nebojsa Jojic, Dimitris Samaras

Comments: NeurIPS 2022; code: this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1256] arXiv:2206.09034 (cross-list from cs.LG) [pdf, other]: Title: Towards Better Selective Classification

Authors: Leo Feng, Mohamed Osama Ahmed, Hossein Hajimirsadeghi, Amir Abdi

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1257] arXiv:2206.09059 (cross-list from cs.CL) [pdf, other]: Title: CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks

Authors: Tejas Srinivasan, Ting-Yun Chang, Leticia Leonor Pinto Alva, Georgios Chochlakis, Mohammad Rostami, Jesse Thomason

Comments: Accepted to NeurIPS 2022 Datasets and Benchmarks track

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1258] arXiv:2206.09203 (cross-list from cs.AI) [pdf, other]: Title: Interactive Visual Reasoning under Uncertainty

Authors: Manjie Xu, Guangyuan Jiang, Wei Liang, Chi Zhang, Yixin Zhu

Comments: Accepted at NeurIPS 2023 (Datasets and Benchmarks)

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1259] arXiv:2206.09272 (cross-list from cs.CR) [pdf, other]: Title: DECK: Model Hardening for Defending Pervasive Backdoors

Authors: Guanhong Tao, Yingqi Liu, Siyuan Cheng, Shengwei An, Zhuo Zhang, Qiuling Xu, Guangyu Shen, Xiangyu Zhang

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1260] arXiv:2206.09286 (cross-list from cs.GR) [pdf, other]: Title: From Universal Humanoid Control to Automatic Physically Valid Character Creation

Authors: Zhengyi Luo, Ye Yuan, Kris M. Kitani

Comments: Project page: this https URL

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1261] arXiv:2206.09359 (cross-list from cs.LG) [pdf, other]: Title: Productive Reproducible Workflows for DNNs: A Case Study for Industrial Defect Detection

Authors: Perry Gibson, José Cano

Comments: 7 pages, 5 figures, AccML 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF); Software Engineering (cs.SE)
[1262] arXiv:2206.09378 (cross-list from cs.CL) [pdf, ps, other]: Title: A Self-Guided Framework for Radiology Report Generation

Authors: Jun Li, Shibo Li, Ying Hu, Huiren Tao

Comments: 11 pages, 3 figures, accepted by Medical Image Computing and Computer Assisted Intervention 2022 (MICCAI 2022)

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1263] arXiv:2206.09386 (cross-list from cs.LG) [pdf, other]: Title: Scalable Neural Data Server: A Data Recommender for Transfer Learning

Authors: Tianshi Cao, Sasha Doubov, David Acuna, Sanja Fidler

Comments: NeurIPS 2021

Journal-ref: Advances in Neural Information Processing Systems, Volume 34, pages 8984-8997, year 2021

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1264] arXiv:2206.09391 (cross-list from cs.LG) [pdf, other]: Title: Towards Adversarial Attack on Vision-Language Pre-training Models

Authors: Jiaming Zhang, Qi Yi, Jitao Sang

Comments: Accepted by ACM MM 2022. Code is available on GitHub

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1265] arXiv:2206.09449 (cross-list from cs.NE) [pdf, other]: Title: SNN2ANN: A Fast and Memory-Efficient Training Framework for Spiking Neural Networks

Authors: Jianxiong Tang, Jianhuang Lai, Xiaohua Xie, Lingxiao Yang, Wei-Shi Zheng

Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1266] arXiv:2206.09570 (cross-list from cs.HC) [pdf, other]: Title: Guardian Angel: A Novel Walking Aid for the Visually Impaired

Authors: Ko-Wei Tai, HuaYen Lee, Hsin-Huei Chen, Jeng-Sheng Yeh, Ming Ouhyoung

Comments: 2 pages, 1 figure

Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
[1267] arXiv:2206.09616 (cross-list from cs.LG) [pdf, other]: Title: Revisiting lp-constrained Softmax Loss: A Comprehensive Study

Authors: Chintan Trivedi, Konstantinos Makantasis, Antonios Liapis, Georgios N. Yannakakis

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1268] arXiv:2206.09628 (cross-list from cs.LG) [pdf, other]: Title: Diversified Adversarial Attacks based on Conjugate Gradient Method

Authors: Keiichiro Yamamura, Haruki Sato, Nariaki Tateiwa, Nozomi Hata, Toru Mitsutake, Issa Oe, Hiroki Ishikura, Katsuki Fujisawa

Comments: Proceedings of the 39th International Conference on Machine Learning (ICML 2022)

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1269] arXiv:2206.09699 (cross-list from cs.CG) [pdf, other]: Title: FoR$^2$M: Recognition and Repair of Foldings in Mesh Surfaces. Application to 3D Object Degradation

Authors: K. Sfikas, P. Perakis, T. Theoharis

Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1270] arXiv:2206.09811 (cross-list from cs.LG) [pdf, other]: Title: Shapley-NAS: Discovering Operation Contribution for Neural Architecture Search

Authors: Han Xiao, Ziwei Wang, Zheng Zhu, Jie Zhou, Jiwen Lu

Comments: Accepted to CVPR 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1271] arXiv:2206.09868 (cross-list from cs.LG) [pdf, other]: Title: Understanding Robust Learning through the Lens of Representation Similarities

Authors: Christian Cianfarani, Arjun Nitin Bhagoji, Vikash Sehwag, Ben Y. Zhao, Prateek Mittal, Haitao Zheng

Comments: 35 pages, 29 figures; Accepted to NeurIPS 2022

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1272] arXiv:2206.09880 (cross-list from cs.LG) [pdf, ps, other]: Title: Breaking Down Out-of-Distribution Detection: Many Methods Based on OOD Training Data Estimate a Combination of the Same Core Quantities

Authors: Julian Bitterwolf, Alexander Meinke, Maximilian Augustin, Matthias Hein

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1273] arXiv:2206.09946 (cross-list from cs.CY) [pdf, ps, other]: Title: Short Video Uprising: How #BlackLivesMatter Content on TikTok Challenges the Protest Paradigm

Authors: Yanru Jiang, Xin Jin, Qinhao Deng

Comments: Workshop Proceedings of the 16th International AAAI Conference on Web and Social Media

Subjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV)
[1274] arXiv:2206.10011 (cross-list from cs.LG) [pdf, other]: Title: When Does Re-initialization Work?

Authors: Sheheryar Zaidi, Tudor Berariu, Hyunjik Kim, Jörg Bornschein, Claudia Clopath, Yee Whye Teh, Razvan Pascanu

Comments: Published in PMLR Volume 187; spotlight presentation at I Can't Believe It's Not Better Workshop at NeurIPS 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1275] arXiv:2206.10244 (cross-list from cs.RO) [pdf, other]: Title: Experimental Evaluation of Pose Initialization Methods for Relative Navigation Between Non-Cooperative Satellites

Authors: Sebastiano Chiodini, Marco Pertile, Pierdomenico Fracchiolla, Andrea Valmorbida, Enrico Lorenzini, Stefano Debei

Comments: To be presented at the 2022 IEEE International Workshop on Metrology for AeroSpace

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1276] arXiv:2206.10249 (cross-list from cs.HC) [pdf, other]: Title: Incorporating Voice Instructions in Model-Based Reinforcement Learning for Self-Driving Cars

Authors: Mingze Wang, Ziyang Zhang, Grace Hui Yang

Comments: NeurIPS 2021 Workshop on Machine Learning for Autonomous Driving

Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1277] arXiv:2206.10255 (cross-list from eess.SY) [pdf, other]: Title: GNN-PMB: A Simple but Effective Online 3D Multi-Object Tracker without Bells and Whistles

Authors: Jianan Liu, Liping Bai, Yuxuan Xia, Tao Huang, Bing Zhu, Qing-Long Han

Comments: accepted by IEEE Transactions on Intelligent Vehicles

Subjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV)
[1278] arXiv:2206.10274 (cross-list from cs.RO) [pdf, other]: Title: Attention-driven Active Vision for Efficient Reconstruction of Plants and Targeted Plant Parts

Authors: Akshay K. Burusa, Eldert J. van Henten, Gert Kootstra

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1279] arXiv:2206.10326 (cross-list from cs.HC) [pdf, other]: Title: The Metaverse Data Deluge: What Can We Do About It?

Authors: Beng Chin Ooi, Gang Chen, Mike Zheng Shou, Kian-Lee Tan, Anthony Tung, Xiaokui Xiao, James Wei Luen Yip, Meihui Zhang

Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[1280] arXiv:2206.10352 (cross-list from cs.HC) [pdf, other]: Title: Psychologically-Inspired, Unsupervised Inference of Perceptual Groups of GUI Widgets from GUI Images

Authors: Mulong Xie, Zhenchang Xing, Sidong Feng, Chunyang Chen, Liming Zhu, Xiwei Xu

Comments: 12 pages, accepted to ESEC/FSE 2022

Journal-ref: In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2022)

Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Software Engineering (cs.SE)
[1281] arXiv:2206.10365 (cross-list from cs.LG) [pdf, other]: Title: A Flexible Diffusion Model

Authors: Weitao Du, Tao Yang, He Zhang, Yuanqi Du

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1282] arXiv:2206.10421 (cross-list from cs.SD) [pdf, other]: Title: Rethinking Audio-visual Synchronization for Active Speaker Detection

Authors: Abudukelimu Wuerkaixi, You Zhang, Zhiyao Duan, Changshui Zhang

Comments: Accepted by IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2022)

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1283] arXiv:2206.10480 (cross-list from cs.LG) [pdf, other]: Title: Learning to Estimate and Refine Fluid Motion with Physical Dynamics

Authors: Mingrui Zhang, Jianhong Wang, James Tlhomole, Matthew D. Piggott

Comments: published at ICML 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
[1284] arXiv:2206.10620 (cross-list from cs.LG) [pdf, other]: Title: CoCoPIE XGen: A Full-Stack AI-Oriented Optimizing Framework

Authors: Xiaofeng Li, Bin Ren, Xipeng Shen, Yanzhi Wang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Programming Languages (cs.PL)
[1285] arXiv:2206.10670 (cross-list from cs.RO) [pdf, other]: Title: SCIM: Simultaneous Clustering, Inference, and Mapping for Open-World Semantic Scene Understanding

Authors: Hermann Blum, Marcus G. Müller, Abel Gawel, Roland Siegwart, Cesar Cadena

Comments: accepted at ISRR 2022

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1286] arXiv:2206.10797 (cross-list from cs.LG) [pdf, other]: Title: Imitation Learning for Generalizable Self-driving Policy with Sim-to-real Transfer

Authors: Zoltán Lőrincz, Márton Szemenyei, Róbert Moni

Comments: Accepted by ICLR 2022 Workshop on Generalizable Policy Learning in Physical World. Source code is available at: this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1287] arXiv:2206.10816 (cross-list from cs.LG) [pdf, other]: Title: Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming

Authors: Chuan Wen, Jianing Qian, Jierui Lin, Jiaye Teng, Dinesh Jayaraman, Yang Gao

Comments: 28 pages, 13 figures, ICML 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1288] arXiv:2206.10843 (cross-list from cs.LG) [pdf, other]: Title: Learning Debiased Classifier with Biased Committee

Authors: Nayeong Kim, Sehyun Hwang, Sungsoo Ahn, Jaesik Park, Suha Kwak

Comments: Conference on Neural Information Processing Systems (NeurIPS), New Orleans, 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1289] arXiv:2206.10935 (cross-list from cs.LG) [pdf, other]: Title: A Study on the Evaluation of Generative Models

Authors: Eyal Betzalel, Coby Penso, Aviv Navon, Ethan Fetaya

Comments: 13 pages

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1290] arXiv:2206.11073 (cross-list from cs.NE) [pdf, other]: Title: A Unified and Biologically-Plausible Relational Graph Representation of Vision Transformers

Authors: Yuzhong Chen, Yu Du, Zhenxiang Xiao, Lin Zhao, Lu Zhang, David Weizhong Liu, Dajiang Zhu, Tuo Zhang, Xintao Hu, Tianming Liu, Xi Jiang

Comments: 11 pages, 7 figures, submitted to 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1291] arXiv:2206.11141 (cross-list from cs.RO) [pdf, other]: Title: Hybrid Physical Metric For 6-DoF Grasp Pose Detection

Authors: Yuhao Lu, Beixing Deng, Zhenyu Wang, Peiyuan Zhi, Yali Li, Shengjin Wang

Comments: 7 pages, 7 figures, accepted by ICRA 2022

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1292] arXiv:2206.11229 (cross-list from cs.IR) [pdf, other]: Title: Business Document Information Extraction: Towards Practical Benchmarks

Authors: Matyáš Skalický, Štěpán Šimsa, Michal Uřičář, Milan Šulc

Comments: Accepted to CLEF 2022

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1293] arXiv:2206.11251 (cross-list from cs.LG) [pdf, other]: Title: Behavior Transformers: Cloning $k$ modes with one stone

Authors: Nur Muhammad Mahi Shafiullah, Zichen Jeff Cui, Ariuntuya Altanzaya, Lerrel Pinto

Comments: Code and data available at this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1294] arXiv:2206.11260 (cross-list from cs.SD) [pdf, other]: Title: Few-shot Long-Tailed Bird Audio Recognition

Authors: Marcos V. Conde, Ui-Jin Choi

Comments: LifeCLEF2022 (best paper award)

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1295] arXiv:2206.11376 (cross-list from cs.RO) [pdf, other]: Title: Real-Time Online Skeleton Extraction and Gesture Recognition on Pepper

Authors: Axel Lefrant, Jean-Marc Montanier

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1296] arXiv:2206.11461 (cross-list from cs.GR) [pdf, other]: Title: Towards Better User Studies in Computer Graphics and Vision

Authors: Zoya Bylinskii, Laura Herman, Aaron Hertzmann, Stefanie Hutka, Yile Zhang

Comments: 18 pages of text, 6 pages of references, 3 figures, 1 table

Journal-ref: Foundations and Trends in Computer Graphics and Vision (2023). Vol. 15: No. 3, pp 201-252

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[1297] arXiv:2206.11481 (cross-list from cs.CG) [pdf, ps, other]: Title: A Novel Algorithm for Exact Concave Hull Extraction

Authors: Kevin Christopher VanHorn, Murat Can Çobanoğlu

Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[1298] arXiv:2206.11488 (cross-list from cs.LG) [pdf, other]: Title: On the Importance and Applicability of Pre-Training for Federated Learning

Authors: Hong-You Chen, Cheng-Hao Tu, Ziwei Li, Han-Wei Shen, Wei-Lun Chao

Comments: Accepted to ICLR 2023

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1299] arXiv:2206.11602 (cross-list from cs.LG) [pdf, other]: Title: Prototype-Anchored Learning for Learning with Imperfect Annotations

Authors: Xiong Zhou, Xianming Liu, Deming Zhai, Junjun Jiang, Xin Gao, Xiangyang Ji

Comments: ICML 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1300] arXiv:2206.11623 (cross-list from cs.RO) [pdf, other]: Title: Waypoint Generation in Row-based Crops with Deep Learning and Contrastive Clustering

Authors: Francesco Salvetti, Simone Angarano, Mauro Martini, Simone Cerrato, Marcello Chiaberge

Comments: Accepted at ECML PKDD 2022

Journal-ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022. Lecture Notes in Computer Science, vol 13718, Springer

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1301] arXiv:2206.11849 (cross-list from cs.LG) [pdf, other]: Title: Sample Condensation in Online Continual Learning

Authors: Mattia Sangermano, Antonio Carta, Andrea Cossu, Davide Bacciu

Comments: Accepted as a conference paper at 2022 International Joint Conference on Neural Networks (IJCNN 2022). Part of 2022 IEEE World Congress on Computational Intelligence (IEEE WCCI 2022)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1302] arXiv:2206.12139 (cross-list from cs.NI) [pdf, other]: Title: HARU: Haptic Augmented Reality-Assisted User-Centric Industrial Network Planning

Authors: Qi Liao, Tianlun Hu, Nikolaj Marchenko, Peter Kulics, Lutz Ewe

Subjects: Networking and Internet Architecture (cs.NI); Computer Vision and Pattern Recognition (cs.CV)
[1303] arXiv:2206.12145 (cross-list from cs.RO) [pdf, other]: Title: Efficient and Robust Training of Dense Object Nets for Multi-Object Robot Manipulation

Authors: David B. Adrian, Andras Gabor Kupcsik, Markus Spies, Heiko Neumann

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1304] arXiv:2206.12251 (cross-list from cs.CR) [pdf, other]: Title: Adversarial Zoom Lens: A Novel Physical-World Attack to DNNs

Authors: Chengyin Hu, Weiwen Shi

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1305] arXiv:2206.12292 (cross-list from cs.LG) [pdf, other]: Title: InfoAT: Improving Adversarial Training Using the Information Bottleneck Principle

Authors: Mengting Xu, Tao Zhang, Zhongnian Li, Daoqiang Zhang

Comments: Published in: IEEE Transactions on Neural Networks and Learning Systems (Early Access)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1306] arXiv:2206.12322 (cross-list from cs.LG) [pdf, other]: Title: How to train accurate BNNs for embedded systems?

Authors: Floran de Putter, Henk Corporaal

Journal-ref: Embedded Machine Learning for Cyber-Physical, IoT, and Edge Computing (2023)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1307] arXiv:2206.12484 (cross-list from cs.LG) [pdf, other]: Title: An Intensity and Phase Stacked Analysis of Phase-OTDR System using Deep Transfer Learning and Recurrent Neural Networks

Authors: Ceyhun Efe Kayan, Kivilcim Yuksel Aldogan, Abdurrahman Gumus

Comments: 15 pages, 9 figures. Title updated

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1308] arXiv:2206.12649 (cross-list from cs.CL) [pdf, other]: Title: Sentiment Analysis with R: Natural Language Processing for Semi-Automated Assessments of Qualitative Data

Authors: Dennis Klinkhammer

Comments: 14 pages, 6 figures

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[1309] arXiv:2206.12705 (cross-list from cs.LG) [pdf, other]: Title: p-Meta: Towards On-device Deep Model Adaptation

Authors: Zhongnan Qu, Zimu Zhou, Yongxin Tong, Lothar Thiele

Comments: Published in SIGKDD 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1310] arXiv:2206.12753 (cross-list from cs.DB) [pdf, other]: Title: Spatiotemporal Data Mining: A Survey

Authors: Arun Sharma, Zhe Jiang, Shashi Shekhar

Subjects: Databases (cs.DB); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[1311] arXiv:2206.12941 (cross-list from cs.RO) [pdf, ps, other]: Title: Object Detection and Tracking with Autonomous UAV

Authors: A. Huzeyfe Demir, Berke Yavas, Mehmet Yazici, Dogukan Aksu, M. Ali Aydin

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1312] arXiv:2206.13043 (cross-list from cs.LG) [pdf, other]: Title: Automated Systems For Diagnosis of Dysgraphia in Children: A Survey and Novel Framework

Authors: Jayakanth Kunhoth, Somaya Al-Maadeed, Suchithra Kunhoth, Younus Akbari

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1313] arXiv:2206.13387 (cross-list from cs.AI) [pdf, other]: Title: ScePT: Scene-consistent, Policy-based Trajectory Predictions for Planning

Authors: Yuxiao Chen, Boris Ivanovic, Marco Pavone

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[1314] arXiv:2206.13399 (cross-list from cs.LG) [pdf, other]: Title: Transfer Learning via Test-Time Neural Networks Aggregation

Authors: Bruno Casella, Alessio Barbaro Chisari, Sebastiano Battiato, Mario Valerio Giuffrida

Comments: 8 pages

Journal-ref: Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2022, Volume 5: VISAPP, Online Streaming, February 6-8, 2022, pp. 642-649

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1315] arXiv:2206.13406 (cross-list from cs.RO) [pdf, other]: Title: Explicitly incorporating spatial information to recurrent networks for agriculture

Authors: Claus Smitt, Michael Halstead, Alireza Ahmadi, Chris McCool

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1316] arXiv:2206.13491 (cross-list from cs.LG) [pdf, other]: Title: Effective training-time stacking for ensembling of deep neural networks

Authors: Polina Proscura, Alexey Zaytsev

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1317] arXiv:2206.13497 (cross-list from cs.LG) [pdf, other]: Title: Robustness Implies Generalization via Data-Dependent Generalization Bounds

Authors: Kenji Kawaguchi, Zhun Deng, Kyle Luh, Jiaoyang Huang

Comments: Accepted by ICML 2022, and selected for ICML long presentation (top 2% of submissions)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Probability (math.PR); Machine Learning (stat.ML)
[1318] arXiv:2206.13498 (cross-list from cs.LG) [pdf, other]: Title: Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous Behavior

Authors: Jean-Stanislas Denain, Jacob Steinhardt

Comments: Fixed backdoor localization results, made changes to abstract and introduction

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1319] arXiv:2206.13499 (cross-list from cs.LG) [pdf, other]: Title: Prompting Decision Transformer for Few-Shot Policy Generalization

Authors: Mengdi Xu, Yikang Shen, Shun Zhang, Yuchen Lu, Ding Zhao, Joshua B. Tenenbaum, Chuang Gan

Comments: ICML 2022. Project page: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1320] arXiv:2206.13630 (cross-list from cs.AI) [pdf, ps, other]: Title: Toward an ImageNet Library of Functions for Global Optimization Benchmarking

Authors: Boris Yazmir, Ofer M. Shir

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1321] arXiv:2206.13687 (cross-list from cs.LG) [pdf, other]: Title: POEM: Out-of-Distribution Detection with Posterior Sampling

Authors: Yifei Ming, Ying Fan, Yixuan Li

Comments: ICML 2022 (Long Talk); First two authors contributed equally

Journal-ref: Thirty-ninth International Conference on Machine Learning (2022)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1322] arXiv:2206.13883 (cross-list from cs.RO) [pdf, other]: Title: Improving Worst Case Visual Localization Coverage via Place-specific Sub-selection in Multi-camera Systems

Authors: Stephen Hausler, Ming Xu, Sourav Garg, Punarjay Chakravarty, Shubham Shrivastava, Ankit Vora, Michael Milford

Comments: 8 pages, 5 figures, To be published in RA-L 2022

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1323] arXiv:2206.13932 (cross-list from cs.LG) [pdf, other]: Title: Discrete Morse Sandwich: Fast Computation of Persistence Diagrams for Scalar Data -- An Algorithm and A Benchmark

Authors: Pierre Guillou, Jules Vidal, Julien Tierny

Subjects: Machine Learning (cs.LG); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
[1324] arXiv:2206.13968 (cross-list from cs.LG) [pdf, other]: Title: Information Entropy Initialized Concrete Autoencoder for Optimal Sensor Placement and Reconstruction of Geophysical Fields

Authors: Nikita Turko, Alexander Lobashev, Konstantin Ushakov, Maxim Kaurkin, Rashit Ibrayev

Comments: 18 pages, 6 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Atmospheric and Oceanic Physics (physics.ao-ph)
[1325] arXiv:2206.13991 (cross-list from cs.LG) [pdf, other]: Title: Increasing Confidence in Adversarial Robustness Evaluations

Authors: Roland S. Zimmermann, Wieland Brendel, Florian Tramer, Nicholas Carlini

Comments: Oral at CVPR 2022 Workshop (Art of Robustness). Project website: this https URL

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1326] arXiv:2206.14056 (cross-list from cs.LG) [pdf, ps, other]: Title: Deep Neural Networks pruning via the Structured Perspective Regularization

Authors: Matteo Cacciola, Antonio Frangioni, Xinlin Li, Andrea Lodi

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
[1327] arXiv:2206.14085 (cross-list from cs.LG) [pdf, other]: Title: Continual Learning with Transformers for Image Classification

Authors: Beyza Ermis, Giovanni Zappella, Martin Wistuba, Aditya Rawal, Cedric Archambeau

Comments: Appeared in CVPR CLVision workshop. arXiv admin note: substantial text overlap with arXiv:2203.04640

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1328] arXiv:2206.14098 (cross-list from cs.LG) [pdf, other]: Title: RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network

Authors: Vitaliy Chiley, Vithursan Thangarasa, Abhay Gupta, Anshul Samar, Joel Hestness, Dennis DeCoste

Comments: Presented at MLSys 2023. Code available from Cerebras Systems: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1329] arXiv:2206.14137 (cross-list from cs.NE) [pdf, ps, other]: Title: aSTDP: A More Biologically Plausible Learning

Authors: Shiyuan Li

Comments: 17 pages, 6 figures. arXiv admin note: text overlap with arXiv:1912.00009

Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
[1330] arXiv:2206.14244 (cross-list from cs.RO) [pdf, other]: Title: Masked World Models for Visual Control

Authors: Younggyo Seo, Danijar Hafner, Hao Liu, Fangchen Liu, Stephen James, Kimin Lee, Pieter Abbeel

Comments: Project website: this https URL. Accepted to CoRL 2022

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1331] arXiv:2206.14256 (cross-list from cs.LG) [pdf, other]: Title: GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning

Authors: Doğay Kamar (1), Nazım Kemal Üre (1 and 2), Gözde Ünal (1 and 2) ((1) Faculty of Computer and Informatics, Istanbul Technical University (2) Artificial Intelligence and Data Science Research Center, Istanbul Technical University)

Journal-ref: International Conference on Agents and Artificial Intelligence - ICAART, Volume 2, 264-272 (2022)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1332] arXiv:2206.14372 (cross-list from cs.RO) [pdf, other]: Title: Formalizing and Evaluating Requirements of Perception Systems for Automated Vehicles using Spatio-Temporal Perception Logic

Authors: Mohammad Hekmatnejad, Bardh Hoxha, Jyotirmoy V. Deshmukh, Yezhou Yang, Georgios Fainekos

Comments: 32 pages, 11 figures, 6 tables, 4 algorithms, 2 appendixes

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Formal Languages and Automata Theory (cs.FL)
[1333] arXiv:2206.14486 (cross-list from cs.LG) [pdf, other]: Title: Beyond neural scaling laws: beating power law scaling via data pruning

Authors: Ben Sorscher, Robert Geirhos, Shashank Shekhar, Surya Ganguli, Ari S. Morcos

Comments: Outstanding Paper Award @ NeurIPS 2022. Added github link to metric scores

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1334] arXiv:2206.14502 (cross-list from cs.LG) [pdf, other]: Title: RegMixup: Mixup as a Regularizer Can Surprisingly Improve Accuracy and Out Distribution Robustness

Authors: Francesco Pinto, Harry Yang, Ser-Nam Lim, Philip H.S. Torr, Puneet K. Dokania

Comments: 22 pages, 18 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1335] arXiv:2206.14528 (cross-list from cs.RO) [pdf, other]: Title: Procrustes Analysis with Deformations: A Closed-Form Solution by Eigenvalue Decomposition

Authors: Fang Bai, Adrien Bartoli

Comments: Published in International Journal of Computer Vision (IJCV) 2022

Journal-ref: International Journal of Computer Vision 130, no. 2 (2022): 567-593

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[1336] arXiv:2206.14541 (cross-list from cs.LG) [pdf, other]: Title: Why patient data cannot be easily forgotten?

Authors: Ruolin Su, Xiao Liu, Sotirios A. Tsaftaris

Comments: Ruolin Su and Xiao Liu contributed equally. Accepted by MICCAI 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1337] arXiv:2206.14579 (cross-list from cs.CL) [pdf, other]: Title: Competence-based Multimodal Curriculum Learning for Medical Report Generation

Authors: Fenglin Liu, Shen Ge, Yuexian Zou, Xian Wu

Comments: Accepted by ACL 2021 (Oral)

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1338] arXiv:2206.14581 (cross-list from cs.ET) [pdf, other]: Title: On-device Synaptic Memory Consolidation using Fowler-Nordheim Quantum-tunneling

Authors: Mustafizur Rahman, Subhankar Bose, Shantanu Chakrabartty

Subjects: Emerging Technologies (cs.ET); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1339] arXiv:2206.14617 (cross-list from cs.GR) [pdf, other]: Title: Perspective (In)consistency of Paint by Text

Authors: Hany Farid

Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[1340] arXiv:2206.14658 (cross-list from cs.LG) [pdf, other]: Title: Cut Inner Layers: A Structured Pruning Strategy for Efficient U-Net GANs

Authors: Bo-Kyeong Kim, Shinkook Choi, Hancheol Park

Comments: ICML Workshop on Hardware Aware Efficient Training, 2022

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1341] arXiv:2206.14687 (cross-list from cs.LG) [pdf, other]: Title: Multi-scale Physical Representations for Approximating PDE Solutions with Graph Neural Operators

Authors: Léon Migus, Yuan Yin, Jocelyn Ahmed Mazari, Patrick Gallinari

Comments: ICLR 2022 Workshop on Geometrical and Topological Representation Learning

Journal-ref: ICLR 2022 Workshop on Geometrical and Topological Representation Learning

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1342] arXiv:2206.14709 (cross-list from cs.LG) [pdf, other]: Title: An extensible Benchmarking Graph-Mesh dataset for studying Steady-State Incompressible Navier-Stokes Equations

Authors: Florent Bonnet, Jocelyn Ahmed Mazari, Thibaut Munzer, Pierre Yser, Patrick Gallinari

Comments: ICLR 2022 Workshop on Geometrical and Topological Representation Learning

Journal-ref: ICLR 2022 Workshop on Geometrical and Topological Representation Learning

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
[1343] arXiv:2206.14854 (cross-list from cs.RO) [pdf, other]: Title: Neural Motion Fields: Encoding Grasp Trajectories as Implicit Value Functions

Authors: Yun-Chun Chen, Adithyavairavan Murali, Balakumar Sundaralingam, Wei Yang, Animesh Garg, Dieter Fox

Comments: RSS 2022 Workshop on Implicit Representations for Robotic Manipulation

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1344] arXiv:2206.14868 (cross-list from cs.LG) [pdf, other]: Title: Teach me how to Interpolate a Myriad of Embeddings

Authors: Shashanka Venkataramanan, Ewa Kijak, Laurent Amsaleg, Yannis Avrithis

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1345] arXiv:2206.15007 (cross-list from cs.CL) [pdf, other]: Title: GSCLIP : A Framework for Explaining Distribution Shifts in Natural Language

Authors: Zhiying Zhu, Weixin Liang, James Zou

Comments: Accepted by ICML 2022 DataPerf

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1346] arXiv:2206.15170 (cross-list from cs.AI) [pdf, other]: Title: LiDAR-as-Camera for End-to-End Driving

Authors: Ardi Tampuu, Romet Aidla, Jan Are van Gent, Tambet Matiisen

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1347] arXiv:2206.15316 (cross-list from cs.LG) [pdf, other]: Title: Anomaly Detection in Echocardiograms with Dynamic Variational Trajectory Models

Authors: Alain Ryser, Laura Manduchi, Fabian Laumer, Holger Michel, Sven Wellmann, Julia E. Vogt

Journal-ref: Proceedings of the 7th Machine Learning for Healthcare Conference, PMLR 182:425-458, 2022

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computation (stat.CO); Machine Learning (stat.ML)
[1348] arXiv:2206.15469 (cross-list from cs.RO) [pdf, other]: Title: Watch and Match: Supercharging Imitation with Regularized Optimal Transport

Authors: Siddhant Haldar, Vaibhav Mathur, Denis Yarats, Lerrel Pinto

Comments: Code and robot videos are available on this https URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1349] arXiv:2206.15470 (cross-list from cs.GR) [pdf, other]: Title: Dressing Avatars: Deep Photorealistic Appearance for Physically Simulated Clothing

Authors: Donglai Xiang, Timur Bagautdinov, Tuur Stuyck, Fabian Prada, Javier Romero, Weipeng Xu, Shunsuke Saito, Jingfan Guo, Breannan Smith, Takaaki Shiratori, Yaser Sheikh, Jessica Hodgins, Chenglei Wu

Comments: SIGGRAPH Asia 2022 (ACM ToG) camera ready. The supplementary video can be found on this https URL

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1350] arXiv:2206.00002 (cross-list from eess.IV) [pdf, other]: Title: Calibrated Bagging Deep Learning for Image Semantic Segmentation: A Case Study on COVID-19 Chest X-ray Image

Authors: Lucy Nwosu, Xiangfang Li, Lijun Qian, Seungchan Kim, Xishuang Dong

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1351] arXiv:2206.00041 (cross-list from eess.IV) [pdf, ps, other]: Title: Characterization of 3D Printers and X-Ray Computerized Tomography

Authors: Sunita Khod, Akshay Dvivedi, Mayank Goswami

Comments: Total 13 Pages, 11 Figures, 5 Tables, 10 References

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1352] arXiv:2206.00105 (cross-list from eess.IV) [pdf, other]: Title: Deep learning pipeline for image classification on mobile phones

Authors: Muhammad Muneeb, Samuel F. Feng, Andreas Henschel

Comments: 20 pages

Journal-ref: 9th International Conference on Artificial Intelligence and Applications (AIAPP 2022)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1353] arXiv:2206.00305 (cross-list from eess.IV) [pdf, ps, other]: Title: Supervised Denoising of Diffusion-Weighted Magnetic Resonance Images Using a Convolutional Neural Network and Transfer Learning

Authors: Jakub Jurek, Andrzej Materka, Kamil Ludwisiak, Agata Majos, Kamil Gorczewski, Kamil Cepuch, Agata Zawadzka

Comments: Preprint submitted to NeuroImage

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1354] arXiv:2206.00338 (cross-list from eess.IV) [pdf, other]: Title: CellCentroidFormer: Combining Self-attention and Convolution for Cell Detection

Authors: Royden Wagner, Karl Rohr

Comments: Accepted at MIUA 2022; Added experiments with CircleNets and extended figure captions

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1355] arXiv:2206.00356 (cross-list from eess.IV) [pdf, other]: Title: A Survey on Deep Learning for Skin Lesion Segmentation

Authors: Zahra Mirikharaji, Kumar Abhishek, Alceu Bissoto, Catarina Barata, Sandra Avila, Eduardo Valle, M. Emre Celebi, Ghassan Hamarneh

Comments: Published in Medical Image Analysis (2023); 55 pages, 10 figures; Mirikharaji and Abhishek: Joint first authors; Celebi and Hamarneh: Joint senior authors

Journal-ref: Medical Image Analysis (2023): 102863

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1356] arXiv:2206.00389 (cross-list from eess.IV) [pdf, other]: Title: A comparative study between vision transformers and CNNs in digital pathology

Authors: Luca Deininger, Bernhard Stimpel, Anil Yuce, Samaneh Abbasi-Sureshjani, Simon Schönenberger, Paolo Ocampo, Konstanty Korski, Fabien Gaire

Comments: 8 pages, 2 figures, accepted for workshop T4Vision (CVPR 2022)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1357] arXiv:2206.00455 (cross-list from q-bio.QM) [pdf, ps, other]: Title: A robust and lightweight deep attention multiple instance learning algorithm for predicting genetic alterations

Authors: Bangwei Guo, Xingyu Li, Miaomiao Yang, Hong Zhang, Xu Steven Xu

Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Genomics (q-bio.GN)
[1358] arXiv:2206.00536 (cross-list from eess.IV) [pdf, other]: Title: Impact of loss function in Deep Learning methods for accurate retinal vessel segmentation

Authors: Daniela Herrera, Gilberto Ochoa-Ruiz, Miguel Gonzalez-Mendoza, Christian Mata

Comments: Paper submitted to MICAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1359] arXiv:2206.00566 (cross-list from eess.IV) [pdf, ps, other]: Title: The Fully Convolutional Transformer for Medical Image Segmentation

Authors: Athanasios Tragakis, Chaitanya Kaul, Roderick Murray-Smith, Dirk Husmeier

Journal-ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023, pp. 3660-3669

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1360] arXiv:2206.00831 (cross-list from eess.IV) [pdf, other]: Title: Dynamic Cardiac MRI Reconstruction Using Combined Tensor Nuclear Norm and Casorati Matrix Nuclear Norm Regularizations

Authors: Yinghao Zhang, Yue Hu

Comments: 4 pages, 3 figures, 1 table, accepted in IEEE ISBI 2022

Journal-ref: 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI). IEEE, 2022: 1-4

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1361] arXiv:2206.00850 (cross-list from eess.IV) [pdf, other]: Title: Dynamic MRI using Learned Transform-based Tensor Low-Rank Network (LT$^2$LR-Net)

Authors: Yinghao Zhang, Peng Li, Yue Hu

Comments: 4 pages, 2 figures, 1 table, accepted by IEEE ISBI 2023 Conference

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1362] arXiv:2206.01088 (cross-list from eess.IV) [pdf, other]: Title: Machine Learning-based Lung and Colon Cancer Detection using Deep Feature Extraction and Ensemble Learning

Authors: Md. Alamin Talukder, Md. Manowarul Islam, Md Ashraf Uddin, Arnisha Akhter, Khondokar Fida Hasan, Mohammad Ali Moni

Comments: Accepted for publication in the Special Issue of Expert Systems with Applications (IF:6.954, Cite:12.70) How to Cite: Md. Alamin Talukder, Md. Manowarul Islam, Md Ashraf Uddin, Arnisha Akhter, Khondokar Fida Hasan, Mohammad Ali Moni. "Machine Learning-based Lung and Colon Cancer Detection using Deep Feature Extraction and Ensemble Learning", Expert Systems with Applications. 2022 Jun 1

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1363] arXiv:2206.01096 (cross-list from eess.IV) [pdf, ps, other]: Title: A Dual-fusion Semantic Segmentation Framework With GAN For SAR Images

Authors: Donghui Li, Jia Liu, Fang Liu, Wenhua Zhang, Andi Zhang, Wenfei Gao, Jiao Shi

Comments: 4 pages, 4 figures, 2022 IEEE International Geoscience and Remote Sensing Symposium

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1364] arXiv:2206.01103 (cross-list from eess.IV) [pdf, other]: Title: Noise2NoiseFlow: Realistic Camera Noise Modeling without Clean Images

Authors: Ali Maleky, Shayan Kousha, Michael S. Brown, Marcus A. Brubaker

Comments: CVPR 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1365] arXiv:2206.01118 (cross-list from eess.IV) [pdf, ps, other]: Title: Comparing Conventional and Deep Feature Models for Classifying Fundus Photography of Hemorrhages

Authors: Tamoor Aziz, Chalie Charoenlarpnopparut, Srijidtra Mahapakulchai

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1366] arXiv:2206.01344 (cross-list from eess.IV) [pdf, ps, other]: Title: Detecting Pulmonary Embolism from Computed Tomography Using Convolutional Neural Network

Authors: Chia-Hung Yang, Yun-Chien Cheng, Chin Kuo

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1367] arXiv:2206.01397 (cross-list from physics.optics) [pdf, other]: Title: Dynamic Structured Illumination Microscopy with a Neural Space-time Model

Authors: Ruiming Cao, Fanglin Linda Liu, Li-Hao Yeh, Laura Waller

Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[1368] arXiv:2206.01430 (cross-list from eess.IV) [pdf, other]: Title: LenslessPiCam: A Hardware and Software Platform for Lensless Computational Imaging with a Raspberry Pi

Authors: Eric Bezzam, Sepand Kashani, Martin Vetterli, Matthieu Simeoni

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1369] arXiv:2206.01644 (cross-list from quant-ph) [pdf, ps, other]: Title: Mirror modular cloning and fast quantum associative retrieval

Authors: M. C. Diamantini, C. A. Trugenberger

Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1370] arXiv:2206.01728 (cross-list from eess.IV) [pdf, ps, other]: Title: A review of machine learning approaches, challenges and prospects for computational tumor pathology

Authors: Liangrui Pan, Zhichao Feng, Shaoliang Peng

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[1371] arXiv:2206.01731 (cross-list from eess.IV) [pdf, other]: Title: Empirical Study of Quality Image Assessment for Synthesis of Fetal Head Ultrasound Imaging with DCGANs

Authors: Thea Bautista, Jacqueline Matthew, Hamideh Kerdegari, Laura Peralta Pereira, Miguel Xochicale

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1372] arXiv:2206.01735 (cross-list from eess.IV) [pdf, other]: Title: Examining the behaviour of state-of-the-art convolutional neural networks for brain tumor detection with and without transfer learning

Authors: Md. Atik Ahamed, Rabeya Tus Sadia

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1373] arXiv:2206.01736 (cross-list from eess.IV) [pdf, other]: Title: Adaptive Adversarial Training to Improve Adversarial Robustness of DNNs for Medical Image Segmentation and Detection

Authors: Linhai Ma, Liang Liang

Comments: 17 pages

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1374] arXiv:2206.01737 (cross-list from eess.IV) [pdf, other]: Title: MaxStyle: Adversarial Style Composition for Robust Medical Image Segmentation

Authors: Chen Chen, Zeju Li, Cheng Ouyang, Matt Sinclair, Wenjia Bai, Daniel Rueckert

Comments: Early accepted by MICCAI 2022 (Camera-ready version)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1375] arXiv:2206.01738 (cross-list from eess.IV) [pdf, other]: Title: RIDDLE: Lidar Data Compression with Range Image Deep Delta Encoding

Authors: Xuanyu Zhou, Charles R. Qi, Yin Zhou, Dragomir Anguelov

Comments: 14 pages, 10 figures; CVPR 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1376] arXiv:2206.01739 (cross-list from eess.IV) [pdf, ps, other]: Title: Mutual- and Self- Prototype Alignment for Semi-supervised Medical Image Segmentation

Authors: Zhenxi Zhang, Chunna Tian, Zhicheng Jiao

Comments: 11 pages, 3 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1377] arXiv:2206.01740 (cross-list from eess.IV) [pdf, other]: Title: Denoising Fast X-Ray Fluorescence Raster Scans of Paintings

Authors: Henry Chopp, Alicia McGeachy, Matthias Alfeld, Oliver Cossairt, Marc Walton, Aggelos Katsaggelos

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1378] arXiv:2206.01741 (cross-list from eess.IV) [pdf, other]: Title: Patcher: Patch Transformers with Mixture of Experts for Precise Medical Image Segmentation

Authors: Yanglan Ou, Ye Yuan, Xiaolei Huang, Stephen T.C. Wong, John Volpi, James Z. Wang, Kelvin Wong

Comments: MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1379] arXiv:2206.01742 (cross-list from eess.IV) [pdf, other]: Title: Learning Probabilistic Topological Representations Using Discrete Morse Theory

Authors: Xiaoling Hu, Dimitris Samaras, Chao Chen

Comments: 16 pages, 11 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1380] arXiv:2206.01743 (cross-list from eess.IV) [pdf, other]: Title: Orthogonal Transform based Generative Adversarial Network for Image Dehazing

Authors: Ahlad Kumar, Mantra Sanathra, Manish Khare, Vijeta Khare

Comments: 12 pages, 14 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1381] arXiv:2206.01745 (cross-list from eess.IV) [pdf, ps, other]: Title: Detection of Fibrosis in Cine Magnetic Resonance Images Using Artificial Intelligence Techniques

Authors: Ariel H. Curiale, Facundo Cabrera, Pablo Jimenez, Jorgelina Medus, Germán Mato, Matías E. Calandrelli

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1382] arXiv:2206.01746 (cross-list from eess.IV) [pdf, ps, other]: Title: Automatic Quantification of Volumes and Biventricular Function in Cardiac Resonance. Validation of a New Artificial Intelligence Approach

Authors: Ariel H. Curiale, Matías E. Calandrelli, Lucca Dellazoppa, Mariano Trevisan, Jorge Luis Bocián, Juan Pablo Bonifacio, Germán Mato

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1383] arXiv:2206.01774 (cross-list from eess.IV) [pdf, other]: Title: Monkeypox Image Data collection

Authors: Md Manjurul Ahsan, Muhammad Ramiz Uddin, Shahana Akter Luna

Comments: This is an attempt to create a monkeypox image dataset collected from various sources; it will continue to be updated by collecting samples from journals and other public-access domains

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1384] arXiv:2206.01793 (cross-list from eess.IV) [pdf, ps, other]: Title: R2U++: A Multiscale Recurrent Residual U-Net with Dense Skip Connections for Medical Image Segmentation

Authors: Mehreen Mubashar, Hazrat Ali, Christer Gronlund, Shoaib Azmat

Comments: Paper accepted in Neural Computing and Applications (2022). Please cite the final version available from Springer website this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1385] arXiv:2206.01826 (cross-list from stat.ME) [pdf, other]: Title: The Gamma Generalized Normal Distribution: A Descriptor of SAR Imagery

Authors: G. M. Cordeiro, R. J. Cintra, L. C. Rêgo, A. D. C. Nascimento

Comments: 21 pages, 6 figures, 6 tables

Journal-ref: Journal of Computational and Applied Mathematics, vol. 347, pages 257-272, February 2019

Subjects: Methodology (stat.ME); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Statistics Theory (math.ST); Data Analysis, Statistics and Probability (physics.data-an)
[1386] arXiv:2206.01856 (cross-list from eess.IV) [pdf, other]: Title: Poisson2Sparse: Self-Supervised Poisson Denoising From a Single Image

Authors: Calvin-Khang Ta, Abhishek Aich, Akash Gupta, Amit K. Roy-Chowdhury

Comments: Accepted to MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1387] arXiv:2206.01862 (cross-list from eess.IV) [pdf, other]: Title: Image Data collection and implementation of deep learning-based model in detecting Monkeypox disease using modified VGG16

Authors: Md Manjurul Ahsan, Muhammad Ramiz Uddin, Mithila Farjana, Ahmed Nazmus Sakib, Khondhaker Al Momin, Shahana Akter Luna

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1388] arXiv:2206.01897 (cross-list from eess.IV) [pdf, other]: Title: Modeling of Textures to Predict Immune Cell Status and Survival of Brain Tumour Patients

Authors: Ahmad Chaddad, Mingli Zhang, Lama Hassan, Tamim Niazi

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM); Methodology (stat.ME)
[1389] arXiv:2206.01903 (cross-list from eess.IV) [pdf, other]: Title: Deep Radiomic Analysis for Predicting Coronavirus Disease 2019 in Computerized Tomography and X-ray Images

Authors: Ahmad Chaddad, Lama Hassan, Christian Desrosiers

Journal-ref: IEEE Trans Neural Netw Learn Syst. 2022 Jan;33(1):3-11

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1390] arXiv:2206.02061 (cross-list from eess.SP) [pdf, other]: Title: Low Power Neuromorphic EMG Gesture Classification

Authors: Sai Sukruth Bezugam, Ahmed Shaban, Manan Suri

Comments: 3 Pages, 5 figures, 1 table

Subjects: Signal Processing (eess.SP); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Neural and Evolutionary Computing (cs.NE)
[1391] arXiv:2206.02225 (cross-list from eess.IV) [pdf, other]: Title: Physically Inspired Constraint for Unsupervised Regularized Ultrasound Elastography

Authors: Ali K. Z. Tehrani, Hassan Rivaz

Comments: Accepted in MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1392] arXiv:2206.02278 (cross-list from eess.IV) [pdf, other]: Title: Autoregressive Model for Multi-Pass SAR Change Detection Based on Image Stacks

Authors: B. G. Palm, D. I. Alves, V. T. Vu, M. I. Pettersson, F. M. Bayer, R. J. Cintra, R. Machado, P. Dammert, H. Hellsten

Comments: 9 pages, 10 figures

Journal-ref: Proceedings Volume 10789, Image and Signal Processing for Remote Sensing XXIV; 1078916 (2018)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP); Methodology (stat.ME)
[1393] arXiv:2206.02358 (cross-list from eess.SP) [pdf, other]: Title: Implementation of a Modified U-Net for Medical Image Segmentation on Edge Devices

Authors: Owais Ali, Hazrat Ali, Syed Ayaz Ali Shah, Aamir Shahzad

Comments: Preprint of paper accepted in IEEE Transactions on Circuits and Systems II: Express Briefs

Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[1394] arXiv:2206.02425 (cross-list from eess.IV) [pdf, other]: Title: mmFormer: Multimodal Medical Transformer for Incomplete Multimodal Learning of Brain Tumor Segmentation

Authors: Yao Zhang, Nanjun He, Jiawei Yang, Yuexiang Li, Dong Wei, Yawen Huang, Yang Zhang, Zhiqiang He, Yefeng Zheng

Comments: Accepted to MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1395] arXiv:2206.02510 (cross-list from physics.optics) [pdf, other]: Title: Single pixel imaging at high pixel resolutions

Authors: Rafał Stojek, Anna Pastuszczak, Piotr Wróbel, Rafał Kotyński

Comments: Paper accepted to Optics Express on 23/05/2022

Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1396] arXiv:2206.02558 (cross-list from q-bio.NC) [pdf, other]: Title: Binding Dancers Into Attractors

Authors: Franziska Kaltenberger, Sebastian Otte, Martin V. Butz

Subjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1397] arXiv:2206.02748 (cross-list from eess.IV) [pdf, other]: Title: Compound Multi-branch Feature Fusion for Real Image Restoration

Authors: Chi-Mao Fan, Tsung-Jung Liu, Kuan-Hsien Liu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1398] arXiv:2206.02797 (cross-list from eess.AS) [pdf, ps, other]: Title: FedNST: Federated Noisy Student Training for Automatic Speech Recognition

Authors: Haaris Mehmood, Agnieszka Dobrowolska, Karthikeyan Saravanan, Mete Ozay

Comments: Accepted at Interspeech 2022

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[1399] arXiv:2206.02837 (cross-list from eess.IV) [pdf, other]: Title: EVAC+: Multi-scale V-net with Deep Feature CRF Layers for Brain Extraction

Authors: Jong Sung Park, Shreyas Fadnavis, Eleftherios Garyfallidis

Comments: Replaced with advancements in the model and results

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1400] arXiv:2206.02838 (cross-list from eess.IV) [pdf, other]: Title: Invertible Sharpening Network for MRI Reconstruction Enhancement

Authors: Siyuan Dong, Eric Z. Chen, Lin Zhao, Xiao Chen, Yikang Liu, Terrence Chen, Shanhui Sun

Comments: Accepted by MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1401] arXiv:2206.02959 (cross-list from eess.IV) [pdf, other]: Title: HMRNet: High and Multi-Resolution Network with Bidirectional Feature Calibration for Brain Structure Segmentation in Radiotherapy

Authors: Hao Fu, Guotai Wang, Wenhui Lei, Wei Xu, Qianfei Zhao, Shichuan Zhang, Kang Li, Shaoting Zhang

Comments: 11 pages, 6 figures, Accepted by IEEE JBHI

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1402] arXiv:2206.03003 (cross-list from eess.IV) [pdf, other]: Title: Transformer-based Personalized Attention Mechanism for Medical Images with Clinical Records

Authors: Yusuke Takagi, Noriaki Hashimoto, Hiroki Masuda, Hiroaki Miyoshi, Koichi Ohshima, Hidekata Hontani, Ichiro Takeuchi

Journal-ref: Takagi, Yusuke, et al. "Transformer-based personalized attention mechanism for medical images with clinical records." Journal of Pathology Informatics (2023): 100185

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1403] arXiv:2206.03009 (cross-list from eess.IV) [pdf, other]: Title: Self-Knowledge Distillation based Self-Supervised Learning for Covid-19 Detection from Chest X-Ray Images

Authors: Guang Li, Ren Togo, Takahiro Ogawa, Miki Haseyama

Comments: Published as a conference paper at ICASSP 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1404] arXiv:2206.03043 (cross-list from eess.IV) [pdf, other]: Title: COVIDx CT-3: A Large-scale, Multinational, Open-Source Benchmark Dataset for Computer-aided COVID-19 Screening from Chest CT Images

Authors: Hayden Gunraj, Tia Tuinstra, Alexander Wong

Comments: 6 pages, MED-NeurIPS 2022 workshop

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1405] arXiv:2206.03049 (cross-list from eess.IV) [pdf, other]: Title: Siamese Encoder-based Spatial-Temporal Mixer for Growth Trend Prediction of Lung Nodules on CT Scans

Authors: Jiansheng Fang, Jingwen Wang, Anwei Li, Yuguang Yan, Yonghe Hou, Chao Song, Hongbo Liu, Jiang Liu

Comments: MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1406] arXiv:2206.03066 (cross-list from quant-ph) [pdf, other]: Title: Recent Advances for Quantum Neural Networks in Generative Learning

Authors: Jinkai Tian, Xiaoyu Sun, Yuxuan Du, Shanshan Zhao, Qing Liu, Kaining Zhang, Wei Yi, Wanrong Huang, Chaoyue Wang, Xingyao Wu, Min-Hsiu Hsieh, Tongliang Liu, Wenjing Yang, Dacheng Tao

Comments: The first two authors contributed equally to this work

Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1407] arXiv:2206.03247 (cross-list from eess.IV) [pdf, other]: Title: Towards better Interpretable and Generalizable AD detection using Collective Artificial Intelligence

Authors: Huy-Dung Nguyen, Michaël Clément, Boris Mansencal, Pierrick Coupé

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1408] arXiv:2206.03336 (cross-list from eess.IV) [pdf, other]: Title: Parotid Gland MRI Segmentation Based on Swin-Unet and Multimodal Images

Authors: Zi'an Xu, Yin Dai, Fayu Liu, Siqi Li, Sheng Liu, Lifu Shi, Jun Fu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1409] arXiv:2206.03359 (cross-list from eess.IV) [pdf, other]: Title: An efficient semi-supervised quality control system trained using physics-based MRI-artefact generators and adversarial training

Authors: Daniele Ravi (for the Alzheimer's Disease Neuroimaging Initiative), Frederik Barkhof, Daniel C. Alexander, Lemuel Puglisi, Geoffrey JM Parker, Arman Eshaghi

Journal-ref: Medical Image Analysis 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1410] arXiv:2206.03413 (cross-list from physics.med-ph) [pdf, ps, other]: Title: Deep Learning based Direct Segmentation Assisted by Deformable Image Registration for Cone-Beam CT based Auto-Segmentation for Adaptive Radiotherapy

Authors: Xiao Liang, Howard Morgan, Ti Bai, Michael Dohopolski, Dan Nguyen, Steve Jiang

Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
[1411] arXiv:2206.03603 (cross-list from eess.IV) [pdf, ps, other]: Title: A new method incorporating deep learning with shape priors for left ventricular segmentation in myocardial perfusion SPECT images

Authors: Fubao Zhu, Jinyu Zhao, Chen Zhao, Shaojie Tang, Jiaofen Nan, Yanting Li, Zhongqiang Zhao, Jianzhou Shi, Zenghong Chen, Zhixin Jiang, Weihua Zhou

Comments: 21 pages, 14 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1412] arXiv:2206.03671 (cross-list from eess.IV) [pdf, other]: Title: COVIDx CXR-3: A Large-Scale, Open-Source Benchmark Dataset of Chest X-ray Images for Computer-Aided COVID-19 Diagnostics

Authors: Maya Pavlova, Tia Tuinstra, Hossein Aboutalebi, Andy Zhao, Hayden Gunraj, Alexander Wong

Comments: 5 pages, MED-NeurIPS 2022 workshop

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1413] arXiv:2206.03709 (cross-list from eess.IV) [pdf, other]: Title: Hypernetwork-based Personalized Federated Learning for Multi-Institutional CT Imaging

Authors: Ziyuan Yang, Wenjun Xia, Zexin Lu, Yingyu Chen, Xiaoxiao Li, Yi Zhang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1414] arXiv:2206.03803 (cross-list from eess.IV) [pdf, other]: Title: Dual Windows Are Significant: Learning from Mediastinal Window and Focusing on Lung Window

Authors: Qiuli Wang, Xin Tan, Chen Liu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1415] arXiv:2206.03830 (cross-list from eess.IV) [pdf, other]: Title: Generative Myocardial Motion Tracking via Latent Space Exploration with Biomechanics-informed Prior

Authors: Chen Qin, Shuo Wang, Chen Chen, Wenjia Bai, Daniel Rueckert

Comments: Under review

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1416] arXiv:2206.03900 (cross-list from eess.IV) [pdf, other]: Title: Unsupervised Deformable Image Registration with Absent Correspondences in Pre-operative and Post-Recurrence Brain Tumor MRI Scans

Authors: Tony C. W. Mok, Albert C. S. Chung

Comments: Accepted by MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1417] arXiv:2206.03935 (cross-list from eess.IV) [pdf, other]: Title: Dual-Distribution Discrepancy for Anomaly Detection in Chest X-Rays

Authors: Yu Cai, Hao Chen, Xin Yang, Yu Zhou, Kwang-Ting Cheng

Comments: Early Accepted to MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1418] arXiv:2206.03955 (cross-list from stat.ML) [pdf, other]: Title: Out-of-Distribution Detection with Class Ratio Estimation

Authors: Mingtian Zhang, Andi Zhang, Tim Z. Xiao, Yitong Sun, Steven McDonagh

Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1419] arXiv:2206.04056 (cross-list from eess.IV) [pdf, ps, other]: Title: An Improved Deep Convolutional Neural Network by Using Hybrid Optimization Algorithms to Detect and Classify Brain Tumor Using Augmented MRI Images

Authors: Shko M. Qader, Bryar A. Hassan, Tarik A. Rashid

Comments: Multimed Tools Appl (2022)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1420] arXiv:2206.04145 (cross-list from eess.IV) [pdf, other]: Title: Deep Estimation of Speckle Statistics Parametric Images

Authors: Ali K. Z. Tehrani, Ivan M. Rosado-Mendez, Hassan Rivaz

Comments: Accepted in EMBC 2022

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1421] arXiv:2206.04238 (cross-list from eess.IV) [pdf, other]: Title: Cardiac Adipose Tissue Segmentation via Image-Level Annotations

Authors: Ziyi Huang, Yu Gan, Theresa Lye, Yanchen Liu, Haofeng Zhang, Andrew Laine, Elsa Angelini, Christine Hendon

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1422] arXiv:2206.04272 (cross-list from cond-mat.mes-hall) [pdf, ps, other]: Title: STEM image analysis based on deep learning: identification of vacancy defects and polymorphs of ${MoS_2}$

Authors: Kihyun Lee, Jinsub Park, Soyeon Choi, Yangjin Lee, Sol Lee, Joowon Jung, Jong-Young Lee, Farman Ullah, Zeeshan Tahir, Yong Soo Kim, Gwan-Hyoung Lee, Kwanpyo Kim

Comments: 24 pages, 5 figures

Journal-ref: Nano Letters, 2022

Subjects: Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Materials Science (cond-mat.mtrl-sci); Computer Vision and Pattern Recognition (cs.CV)
[1423] arXiv:2206.04289 (cross-list from eess.IV) [pdf, other]: Title: A No-Reference Deep Learning Quality Assessment Method for Super-resolution Images Based on Frequency Maps

Authors: Zicheng Zhang, Wei Sun, Xiongkuo Min, Wenhan Zhu, Tao Wang, Wei Lu, Guangtao Zhai

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1424] arXiv:2206.04328 (cross-list from eess.IV) [pdf, other]: Title: Novel projection schemes for graph-based Light Field coding

Authors: Bach Gia Nguyen, Chanh Minh Tran, Tho Nguyen Duc, Tan Xuan Phan, Kamioka Eiji

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1425] arXiv:2206.04336 (cross-list from eess.IV) [pdf, other]: Title: Joint Modeling of Image and Label Statistics for Enhancing Model Generalizability of Medical Image Segmentation

Authors: Shangqi Gao, Hangqi Zhou, Yibo Gao, Xiahai Zhuang

Comments: MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1426] arXiv:2206.04341 (cross-list from eess.IV) [pdf, other]: Title: How Asynchronous Events Encode Video

Authors: Karen Adam, Adam Scholefield, Martin Vetterli

Comments: 6 pages, 4 figures

Journal-ref: 2021 55th Asilomar Conference on Signals, Systems, and Computers

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1427] arXiv:2206.04431 (cross-list from eess.IV) [pdf, other]: Title: Cross-boosting of WNNM Image Denoising method by Directional Wavelet Packets

Authors: Amir Averbuch, Pekka Neittaanmäki, Valery Zheludev, Moshe Salhov, Jonathan Hauser

Comments: 30 pages, 28 figures. arXiv admin note: substantial text overlap with arXiv:2008.11595; text overlap with arXiv:2001.04899

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1428] arXiv:2206.04514 (cross-list from eess.IV) [pdf, ps, other]: Title: SAR Despeckling using a Denoising Diffusion Probabilistic Model

Authors: Malsha V. Perera, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

Comments: Our code is available at this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1429] arXiv:2206.04548 (cross-list from eess.IV) [pdf, other]: Title: Classification of COVID-19 in Chest X-ray Images Using Fusion of Deep Features and LightGBM

Authors: Hamid Nasiri, Ghazal Kheyroddin, Morteza Dorrigiv, Mona Esmaeili, Amir Raeisi Nafchi, Mohsen Haji Ghorbani, Payman Zarkesh-Ha

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1430] arXiv:2206.04647 (cross-list from eess.IV) [pdf, other]: Title: VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution

Authors: Zeyuan Chen, Yinbo Chen, Jingwen Liu, Xingqian Xu, Vidit Goel, Zhangyang Wang, Humphrey Shi, Xiaolong Wang

Comments: Accepted to CVPR 2022. Project page: this http URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1431] arXiv:2206.04681 (cross-list from eess.IV) [pdf, other]: Title: Gaussian Fourier Pyramid for Local Laplacian Filter

Authors: Yuto Sumiya, Tomoki Otsuka, Yoshihiro Maeda, Norishige Fukushima

Journal-ref: IEEE Signal Processing Letters (SPL), vol. 29, pp. 11-15, 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1432] arXiv:2206.04682 (cross-list from eess.IV) [pdf, other]: Title: RT-DNAS: Real-time Constrained Differentiable Neural Architecture Search for 3D Cardiac Cine MRI Segmentation

Authors: Qing Lu, Xiaowei Xu, Shunjie Dong, Cong Hao, Lei Yang, Cheng Zhuo, Yiyu Shi

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1433] arXiv:2206.04684 (cross-list from eess.IV) [pdf, other]: Title: Structure-consistent Restoration Network for Cataract Fundus Image Enhancement

Authors: Heng Li, Haofeng Liu, Huazhu Fu, Hai Shu, Yitian Zhao, Xiaoling Luo, Yan Hu, Jiang Liu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1434] arXiv:2206.04689 (cross-list from eess.IV) [pdf, ps, other]: Title: AI-based Clinical Assessment of Optic Nerve Head Robustness Superseding Biomechanical Testing

Authors: Fabian A. Braeu, Thanadet Chuangsuwanich, Tin A. Tun, Alexandre H. Thiery, Tin Aung, George Barbastathis, Michaël J.A. Girard

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1435] arXiv:2206.04732 (cross-list from eess.IV) [pdf, other]: Title: AI-MIA: COVID-19 Detection & Severity Analysis through Medical Imaging

Authors: Dimitrios Kollias, Anastasios Arsenos, Stefanos Kollias

Comments: arXiv admin note: substantial text overlap with arXiv:2106.07524

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1436] arXiv:2206.04877 (cross-list from eess.IV) [pdf, other]: Title: Efficient Per-Shot Convex Hull Prediction By Recurrent Learning

Authors: Somdyuti Paul, Andrey Norkin, Alan C. Bovik

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1437] arXiv:2206.05047 (cross-list from eess.IV) [pdf, other]: Title: A GPU-Accelerated Light-field Super-resolution Framework Based on Mixed Noise Model and Weighted Regularization

Authors: Trung-Hieu Tran, Kaicong Sun, Sven Simon

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
[1438] arXiv:2206.05049 (cross-list from eess.IV) [pdf, other]: Title: Denoising Generalized Expectation-Consistent Approximation for MR Image Recovery

Authors: Saurav K. Shastri, Rizwan Ahmad, Christopher A. Metzler, Philip Schniter

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[1439] arXiv:2206.05054 (cross-list from eess.IV) [pdf, other]: Title: A No-reference Quality Assessment Metric for Point Cloud Based on Captured Video Sequences

Authors: Yu Fan, Zicheng Zhang, Wei Sun, Xiongkuo Min, Wei Lu, Tao Wang, Ning Liu, Guangtao Zhai

Comments: Accepted to IEEE 24th International Workshop on Multimedia Signal Processing, 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1440] arXiv:2206.05092 (cross-list from eess.IV) [pdf, other]: Title: Learning self-calibrated optic disc and cup segmentation from multi-rater annotations

Authors: Junde Wu, Huihui Fang, Fangxin Shang, Zhaowei Wang, Dalu Yang, Wenshuo Zhou, Yehui Yang, Yanwu Xu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1441] arXiv:2206.05148 (cross-list from eess.IV) [pdf, other]: Title: Weakly-supervised segmentation using inherently-explainable classification models and their application to brain tumour classification

Authors: Soumick Chatterjee, Hadya Yassin, Florian Dubost, Andreas Nürnberger, Oliver Speck

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1442] arXiv:2206.05236 (cross-list from physics.optics) [pdf, ps, other]: Title: Optical Diffraction Tomography based on 3D Physics-Inspired Neural Network (PINN)

Authors: Ahmed B. Ayoub, Amirhossein Saba, Carlo Gigli, Demetri Psaltis

Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV)
[1443] arXiv:2206.05277 (cross-list from eess.IV) [pdf, other]: Title: Superresolution and Segmentation of OCT scans using Multi-Stage adversarial Guided Attention Training

Authors: Paria Jeihouni, Omid Dehzangi, Annahita Amireskandari, Ali Dabouei, Ali Rezai, Nasser M. Nasrabadi

Comments: 5 pages, conference

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1444] arXiv:2206.05278 (cross-list from eess.IV) [pdf, other]: Title: Dual-Branch Squeeze-Fusion-Excitation Module for Cross-Modality Registration of Cardiac SPECT and CT

Authors: Xiongchao Chen, Bo Zhou, Huidong Xie, Xueqi Guo, Jiazhen Zhang, Albert J. Sinusas, John A. Onofrey, Chi Liu

Comments: 10 pages, 4 figures, accepted at MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1445] arXiv:2206.05279 (cross-list from eess.IV) [pdf, other]: Title: PILC: Practical Image Lossless Compression with an End-to-end GPU Oriented Neural Framework

Authors: Ning Kang, Shanzhao Qiu, Shifeng Zhang, Zhenguo Li, Shutao Xia

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[1446] arXiv:2206.05283 (cross-list from eess.IV) [pdf, other]: Title: Poissonian Blurred Image Deconvolution by Framelet based Local Minimal Prior

Authors: Reza Parvaz

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[1447] arXiv:2206.05284 (cross-list from eess.IV) [pdf, other]: Title: Decoupling Predictions in Distributed Learning for Multi-Center Left Atrial MRI Segmentation

Authors: Zheyao Gao, Lei Li, Fuping Wu, Sihan Wang, Xiahai Zhuang

Comments: Accepted by MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1448] arXiv:2206.05288 (cross-list from eess.IV) [pdf, other]: Title: From Labels to Priors in Capsule Endoscopy: A Prior Guided Approach for Improving Generalization with Few Labels

Authors: Anuja Vats, Ahmed Mohammed, Marius Pedersen

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1449] arXiv:2206.05289 (cross-list from eess.IV) [pdf, other]: Title: Localized adversarial artifacts for compressed sensing MRI

Authors: Rima Alaifari, Giovanni S. Alberti, Tandri Gauksson

Comments: 14 pages, 7 figures

Journal-ref: SIAM Journal on Imaging Sciences, 16(4):SC14-SC26, 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1450] arXiv:2206.05472 (cross-list from eess.IV) [pdf, other]: Title: Differentiable Projection from Optical Coherence Tomography B-Scan without Retinal Layer Segmentation Supervision

Authors: Dingyi Rong, Jiancheng Yang, Bingbing Ni, Bilian Ke

Comments: ISBI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1451] arXiv:2206.05516 (cross-list from eess.IV) [pdf, other]: Title: Deep Learning-Based MR Image Re-parameterization

Authors: Abhijeet Narang, Abhigyan Raj, Mihaela Pop, Mehran Ebrahimi

Comments: A. Narang, A. Raj, M. Pop and M. Ebrahimi, "Deep Learning-Based MR Image Re-parameterization," 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE), Las Vegas, NV, USA, 2023, pp. 536-541, doi: 10.1109/CSCE60160.2023.00094

Journal-ref: 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1452] arXiv:2206.05575 (cross-list from eess.IV) [pdf, ps, other]: Title: MammoFL: Mammographic Breast Density Estimation using Federated Learning

Authors: Ramya Muthukrishnan, Angelina Heyler, Keshava Katti, Sarthak Pati, Walter Mankowski, Aprupa Alahari, Michael Sanborn, Emily F. Conant, Christopher Scott, Stacey Winham, Celine Vachon, Pratik Chaudhari, Despina Kontos, Spyridon Bakas

Comments: Deep learning, federated learning, mammography, breast density, risk assessment

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[1453] arXiv:2206.05615 (cross-list from eess.IV) [pdf, other]: Title: Machine learning approaches for COVID-19 detection from chest X-ray imaging: A Systematic Review

Authors: Harold Brayan Arteaga-Arteaga (1), Melissa delaPava (1), Alejandro Mora-Rubio (1), Mario Alejandro Bravo-Ortíz (1), Jesus Alejandro Alzate-Grisales (1), Daniel Arias-Garzón (1), Luis Humberto López-Murillo (2), Felipe Buitrago-Carmona (3), Juan Pablo Villa-Pulgarín (1), Esteban Mercado-Ruiz (1), Simon Orozco-Arias (3 and 4), M. Hassaballah (5), Maria de la Iglesia-Vaya (6), Oscar Cardona-Morales (1), Reinel Tabares-Soto (1) ((1) Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Colombia, (2) Department of Chemical Engineering, Universidad Nacional de Colombia, Manizales, Colombia, (3) Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Colombia, (4) Department of Systems and informatics, Universidad de Caldas, Manizales, Colombia, (5) Faculty of Computers and Information, South Valley University, Qena, Egypt, (6) Unidad Mixta de Imagen Biomédica FISABIO-CIPF, Fundación para el Fomento de la Investigación Sanitario y Biomédica de la Comunidad Valenciana, Valencia, Spain)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1454] arXiv:2206.05618 (cross-list from physics.med-ph) [pdf, other]: Title: Synthetic PET via Domain Translation of 3D MRI

Authors: Abhejit Rajagopal, Yutaka Natsuaki, Kristen Wangerin, Mahdjoub Hamdi, Hongyu An, John J. Sunderland, Richard Laforest, Paul E. Kinahan, Peder E.Z. Larson, Thomas A. Hope

Comments: under review

Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
[1455] arXiv:2206.05647 (cross-list from eess.IV) [pdf, other]: Title: A Fast Alternating Minimization Algorithm for Coded Aperture Snapshot Spectral Imaging Based on Sparsity and Deep Image Priors

Authors: Qile Zhao, Xianhong Zhao, Xu Ma, Xudong Chen, Gonzalo R. Arce

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
[1456] arXiv:2206.05650 (cross-list from eess.IV) [pdf, other]: Title: Preprocessing Enhanced Image Compression for Machine Vision

Authors: Guo Lu, Xingtong Ge, Tianxiong Zhong, Jing Geng, Qiang Hu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1457] arXiv:2206.05695 (cross-list from eess.IV) [pdf, ps, other]: Title: PD-DWI: Predicting response to neoadjuvant chemotherapy in invasive breast cancer with Physiologically-Decomposed Diffusion-Weighted MRI machine-learning model

Authors: Maya Gilad, Moti Freiman

Comments: Accepted to Medical Image Computing and Computer Assisted Intervention - MICCAI 2022 to be held during Sept 18-22 in Singapore

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1458] arXiv:2206.05782 (cross-list from eess.IV) [pdf, other]: Title: DSCA: A Dual-Stream Network with Cross-Attention on Whole-Slide Image Pyramids for Cancer Prognosis

Authors: Pei Liu, Bo Fu, Feng Ye, Rui Yang, Bin Xu, Luping Ji

Comments: 12 pages, 6 figures, 7 tables

Journal-ref: Expert Systems with Applications, 120280 (2023)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1459] arXiv:2206.05935 (cross-list from eess.IV) [pdf, other]: Title: Fluorescence angiography classification in colorectal surgery -- A preliminary report

Authors: Antonio S Soares, Sophia Bano, Neil T Clancy, Laurence B Lovat, Danail Stoyanov, Manish Chand

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1460] arXiv:2206.06065 (cross-list from eess.IV) [pdf, ps, other]: Title: Deep ensemble learning for segmenting tuberculosis-consistent manifestations in chest radiographs

Authors: Sivaramakrishnan Rajaraman, Feng Yang, Ghada Zamzmi, Peng Guo, Zhiyun Xue, Sameer K Antani

Comments: 13 pages, 6 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1461] arXiv:2206.06070 (cross-list from eess.IV) [pdf, other]: Title: Annular Computational Imaging: Capture Clear Panoramic Images through Simple Lens

Authors: Qi Jiang, Hao Shi, Lei Sun, Shaohua Gao, Kailun Yang, Kaiwei Wang

Comments: Accepted to IEEE Transactions on Computational Imaging (TCI). Code and datasets are publicly available at this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
[1462] arXiv:2206.06127 (cross-list from eess.IV) [pdf, other]: Title: SyntheX: Scaling Up Learning-based X-ray Image Analysis Through In Silico Experiments

Authors: Cong Gao, Benjamin D. Killeen, Yicheng Hu, Robert B. Grupp, Russell H. Taylor, Mehran Armand, Mathias Unberath

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1463] arXiv:2206.06235 (cross-list from eess.IV) [pdf, ps, other]: Title: Prostate Cancer Malignancy Detection and localization from mpMRI using auto-Deep Learning: One Step Closer to Clinical Utilization

Authors: Weiwei Zong, Eric Carver, Simeng Zhu, Eric Schaff, Daniel Chapman, Joon Lee, Hassan Bagher Ebadian, Indrin Chetty, Benjamin Movsas, Winston Wen, Tarik Alafif, Xiangyun Zong

Comments: arXiv admin note: text overlap with arXiv:1903.12331

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1464] arXiv:2206.06253 (cross-list from eess.IV) [pdf, ps, other]: Title: RPLHR-CT Dataset and Transformer Baseline for Volumetric Super-Resolution from CT Scans

Authors: Pengxin Yu, Haoyue Zhang, Han Kang, Wen Tang, Corey W. Arnold, Rongguo Zhang

Comments: Accepted MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1465] arXiv:2206.06264 (cross-list from eess.IV) [pdf, other]: Title: Automatic Polyp Segmentation with Multiple Kernel Dilated Convolution Network

Authors: Nikhil Kumar Tomar, Abhishek Srivastava, Ulas Bagci, Debesh Jha

Journal-ref: Published CBMS 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1466] arXiv:2206.06267 (cross-list from eess.IV) [pdf, other]: Title: MMMNA-Net for Overall Survival Time Prediction of Brain Tumor Patients

Authors: Wen Tang, Haoyue Zhang, Pengxin Yu, Han Kang, Rongguo Zhang

Comments: Accepted EMBC 2022

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1467] arXiv:2206.06341 (cross-list from eess.IV) [pdf, other]: Title: Unsupervised inter-frame motion correction for whole-body dynamic PET using convolutional long short-term memory in a convolutional neural network

Authors: Xueqi Guo, Bo Zhou, David Pigg, Bruce Spottiswoode, Michael E. Casey, Chi Liu, Nicha C. Dvornek

Comments: Preprint submitted to Medical Image Analysis

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP); Applications (stat.AP)
[1468] arXiv:2206.06445 (cross-list from eess.IV) [pdf, other]: Title: Fitting Segmentation Networks on Varying Image Resolutions using Splatting

Authors: Mikael Brudfors, Yael Balbastre, John Ashburner, Geraint Rees, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

Comments: Accepted for MIUA 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1469] arXiv:2206.06448 (cross-list from eess.IV) [pdf, ps, other]: Title: Assessing Privacy Leakage in Synthetic 3-D PET Imaging using Transversal GAN

Authors: Robert V. Bergen, Jean-Francois Rajotte, Fereshteh Yousefirizi, Arman Rahmim, Raymond T. Ng

Comments: arXiv admin note: text overlap with arXiv:2111.01866

Subjects: Image and Video Processing (eess.IV); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1470] arXiv:2206.06541 (cross-list from eess.IV) [pdf, other]: Title: Pixel-by-pixel Mean Opinion Score (pMOS) for No-Reference Image Quality Assessment

Authors: Wook-Hyung Kim, Cheul-hee Hahm, Anant Baijal, Namuk Kim, Ilhyun Cho, Jayoon Koo

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1471] arXiv:2206.06575 (cross-list from eess.IV) [pdf, other]: Title: Med-DANet: Dynamic Architecture Network for Efficient Medical Volumetric Segmentation

Authors: Wenxuan Wang, Chen Chen, Jing Wang, Sen Zha, Yan Zhang, Jiangyun Li

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1472] arXiv:2206.06598 (cross-list from eess.IV) [pdf, other]: Title: CorticalFlow$^{++}$: Boosting Cortical Surface Reconstruction Accuracy, Regularity, and Interoperability

Authors: Rodrigo Santa Cruz, Léo Lebrat, Darren Fu, Pierrick Bourgeat, Jurgen Fripp, Clinton Fookes, Olivier Salvado

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1473] arXiv:2206.06623 (cross-list from eess.IV) [pdf, other]: Title: ULTRA: Uncertainty-aware Label Distribution Learning for Breast Tumor Cellularity Assessment

Authors: Xiangyu Li, Xinjie Liang, Gongning Luo, Wei Wang, Kuanquan Wang, Shuo Li

Comments: Paper accepted by MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1474] arXiv:2206.06654 (cross-list from eess.IV) [pdf, other]: Title: The Kidneys Are Not All Normal: Investigating the Speckle Distributions of Transplanted Kidneys

Authors: Rohit Singla, Ricky Hu, Cailin Ringstrom, Victoria Lessoway, Janice Reid, Christopher Nguan, Robert Rohling

Comments: 25 pages, 2 figures, 3 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[1475] arXiv:2206.06657 (cross-list from eess.IV) [pdf, other]: Title: The Open Kidney Ultrasound Data Set

Authors: Rohit Singla, Cailin Ringstrom, Grace Hu, Victoria Lessoway, Janice Reid, Christopher Nguan, Robert Rohling

Comments: 21 pages, 1 figure, 5 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1476] arXiv:2206.06663 (cross-list from q-bio.QM) [pdf, ps, other]: Title: Quantitative Imaging Principles Improves Medical Image Learning

Authors: Lambert T. Leong, Michael C. Wong, Yannik Glaser, Thomas Wolfgruber, Steven B. Heymsfield, Peter Sadowski, John A. Shepherd

Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1477] arXiv:2206.06701 (cross-list from eess.IV) [pdf, other]: Title: CNN-based Classification Framework for Lung Tissues with Auxiliary Information

Authors: Huafeng Hu, Ruijie Ye, Jeyarajan Thiyagalingam, Frans Coenen, Jionglong Su

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1478] arXiv:2206.06725 (cross-list from eess.IV) [pdf, other]: Title: Automated SSIM Regression for Detection and Quantification of Motion Artefacts in Brain MR Images

Authors: Alessandro Sciarra, Soumick Chatterjee, Max Dünnwald, Giuseppe Placidi, Andreas Nürnberger, Oliver Speck, Steffen Oeltze-Jafra

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1479] arXiv:2206.06730 (cross-list from eess.IV) [pdf, other]: Title: Automated Precision Localization of Peripherally Inserted Central Catheter Tip through Model-Agnostic Multi-Stage Networks

Authors: Subin Park, Yoon Ki Cha, Soyoung Park, Kyung-Su Kim, Myung Jin Chung

Comments: Subin Park and Yoon Ki Cha have contributed equally to this work as the co-first author. Kyung-Su Kim (kskim.doc@gmail.com) and Myung Jin Chung (mj1.chung@samsung.com) have contributed equally to this work as the co-corresponding author

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1480] arXiv:2206.06813 (cross-list from eess.IV) [pdf, other]: Title: Learning towards Synchronous Network Memorizability and Generalizability for Continual Segmentation across Multiple Sites

Authors: Jingyang Zhang, Peng Xue, Ran Gu, Yuning Gu, Mianxin Liu, Yongsheng Pan, Zhiming Cui, Jiawei Huang, Lei Ma, Dinggang Shen

Comments: Early accepted in MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1481] arXiv:2206.06862 (cross-list from q-bio.QM) [pdf, other]: Title: Evaluating histopathology transfer learning with ChampKit

Authors: Jakub R. Kaczmarzyk, Tahsin M. Kurc, Shahira Abousamra, Rajarsi Gupta, Joel H. Saltz, Peter K. Koo

Comments: Submitted to NeurIPS 2022 Track on Datasets and Benchmarks. Source code available at this https URL

Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1482] arXiv:2206.06947 (cross-list from eess.IV) [pdf, other]: Title: K-Space Transformer for Undersampled MRI Reconstruction

Authors: Ziheng Zhao, Tianjiao Zhang, Weidi Xie, Yanfeng Wang, Ya Zhang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1483] arXiv:2206.07122 (cross-list from stat.ML) [pdf, other]: Title: Loss Functions for Classification using Structured Entropy

Authors: Brian Lucena

Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (cs.LG)
[1484] arXiv:2206.07156 (cross-list from eess.IV) [pdf, other]: Title: Federated Multi-organ Segmentation with Inconsistent Labels

Authors: Xuanang Xu, Hannah H. Deng, Jaime Gateno, Pingkun Yan

Comments: v1: 10 pages, 5 figures; v2: 14 pages, 5 figures, accepted by IEEE Transactions on Medical Imaging (TMI), published version available at this https URL, source code available at this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1485] arXiv:2206.07219 (cross-list from eess.IV) [pdf, ps, other]: Title: A Projection-Based K-space Transformer Network for Undersampled Radial MRI Reconstruction with Limited Training Subjects

Authors: Chang Gao, Shu-Fu Shih, J. Paul Finn, Xiaodong Zhong

Comments: Accepted at MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1486] arXiv:2206.07280 (cross-list from eess.IV) [pdf, ps, other]: Title: ERNAS: An Evolutionary Neural Architecture Search for Magnetic Resonance Image Reconstructions

Authors: Samira Vafay Eslahi, Jian Tao, Jim Ji

Comments: 11 pages, 9 figures, and 4 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1487] arXiv:2206.07281 (cross-list from physics.optics) [pdf, ps, other]: Title: Super-resolution image display using diffractive decoders

Authors: Cagatay Isil, Deniz Mengu, Yifan Zhao, Anika Tabassum, Jingxi Li, Yi Luo, Mona Jarrahi, Aydogan Ozcan

Comments: 26 Pages, 9 Figures

Journal-ref: Science Advances (2022)

Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Applied Physics (physics.app-ph)
[1488] arXiv:2206.07364 (cross-list from eess.IV) [pdf, other]: Title: Seeking Common Ground While Reserving Differences: Multiple Anatomy Collaborative Framework for Undersampled MRI Reconstruction

Authors: Jiangpeng Yan, Chenghui Yu, Hanbo Chen, Zhe Xu, Junzhou Huang, Xiu Li, Jianhua Yao

Comments: submitted to an IEEE journal

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1489] arXiv:2206.07388 (cross-list from physics.geo-ph) [pdf, ps, other]: Title: Subsurface Depths Structure Maps Reconstruction with Generative Adversarial Networks

Authors: Dmitry Ivlev

Comments: 12 pages, 12 figures, 1 table

Subjects: Geophysics (physics.geo-ph); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1490] arXiv:2206.07417 (cross-list from eess.IV) [pdf, other]: Title: Interpretable differential diagnosis for Alzheimer's disease and Frontotemporal dementia

Authors: Huy-Dung Nguyen, Michaël Clément, Boris Mansencal, Pierrick Coupé

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1491] arXiv:2206.07422 (cross-list from eess.IV) [pdf, other]: Title: Deep Neural Network Pruning for Nuclei Instance Segmentation in Hematoxylin & Eosin-Stained Histological Images

Authors: Amirreza Mahbod, Rahim Entezari, Isabella Ellinger, Olga Saukh

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1492] arXiv:2206.07481 (cross-list from eess.SP) [pdf, ps, other]: Title: A Survey of Detection Methods for Die Attachment and Wire Bonding Defects in Integrated Circuit Manufacturing

Authors: Lamia Alam, Nasser Kehtarnavaz

Comments: 13 pages, 9 figures, 8 tables

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1493] arXiv:2206.07494 (cross-list from cond-mat.stat-mech) [pdf, other]: Title: Counting Phases and Faces Using Bayesian Thermodynamic Integration

Authors: Alexander Lobashev, Mikhail V. Tamm

Comments: 20 pages, 9 figures, plus appendix with additional figures

Subjects: Statistical Mechanics (cond-mat.stat-mech); Disordered Systems and Neural Networks (cond-mat.dis-nn); Computer Vision and Pattern Recognition (cs.CV); Data Analysis, Statistics and Probability (physics.data-an)
[1494] arXiv:2206.07542 (cross-list from q-bio.NC) [pdf, other]: Title: A Deep Generative Model of Neonatal Cortical Surface Development

Authors: Abdulah Fawaz, Logan Z. Williams, A. David Edwards, Emma Robinson

Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1495] arXiv:2206.07595 (cross-list from eess.IV) [pdf, ps, other]: Title: BIO-CXRNET: A Robust Multimodal Stacking Machine Learning Technique for Mortality Risk Prediction of COVID-19 Patients using Chest X-Ray Images and Clinical Data

Authors: Tawsifur Rahman, Muhammad E. H. Chowdhury, Amith Khandakar, Zaid Bin Mahbub, Md Sakib Abrar Hossain, Abraham Alhatou, Eynas Abdalla, Sreekumar Muthiyal, Khandaker Farzana Islam, Saad Bin Abul Kashem, Muhammad Salman Khan, Susu M. Zughaier, Maqsud Hossain

Comments: 25 pages, 8 Tables, 10 Figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1496] arXiv:2206.07599 (cross-list from eess.IV) [pdf, other]: Title: How GNNs Facilitate CNNs in Mining Geometric Information from Large-Scale Medical Images

Authors: Yiqing Shen, Bingxin Zhou, Xinye Xiong, Ruitian Gao, Yu Guang Wang

Comments: 21 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1497] arXiv:2206.07664 (cross-list from eess.IV) [pdf, other]: Title: CRISP - Reliable Uncertainty Estimation for Medical Image Segmentation

Authors: Thierry Judge, Olivier Bernard, Mihaela Porumb, Agis Chartsias, Arian Beqiri, Pierre-Marc Jodoin

Comments: 9 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1498] arXiv:2206.08019 (cross-list from eess.IV) [pdf, other]: Title: Multi-View Imputation and Cross-Attention Network Based on Incomplete Longitudinal and Multimodal Data for Conversion Prediction of Mild Cognitive Impairment

Authors: Tao Wang, Xiumei Chen, Xiaoling Zhang, Shuoling Zhou, Qianjin Feng, Meiyan Huang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1499] arXiv:2206.08023 (cross-list from eess.IV) [pdf, other]: Title: AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation

Authors: Yuanfeng Ji, Haotian Bai, Jie Yang, Chongjian Ge, Ye Zhu, Ruimao Zhang, Zhen Li, Lingyan Zhang, Wanling Ma, Xiang Wan, Ping Luo

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1500] arXiv:2206.08078 (cross-list from eess.IV) [pdf, other]: Title: U-PET: MRI-based Dementia Detection with Joint Generation of Synthetic FDG-PET Images

Authors: Marcel Kollovieh, Matthias Keicher, Stephan Wunderlich, Hendrik Burwinkel, Thomas Wendler, Nassir Navab

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1501] arXiv:2206.08272 (cross-list from eess.IV) [pdf, other]: Title: Longitudinal detection of new MS lesions using Deep Learning

Authors: Reda Abdellah Kamraoui, Boris Mansencal, José V Manjon, Pierrick Coupé

Comments: preprint

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1502] arXiv:2206.08277 (cross-list from astro-ph.EP) [pdf, other]: Title: A machine-generated catalogue of Charon's craters and implications for the Kuiper belt

Authors: Mohamad Ali-Dib

Comments: 16 pages, 2 figures, accepted for publication in Icarus

Subjects: Earth and Planetary Astrophysics (astro-ph.EP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1503] arXiv:2206.08298 (cross-list from eess.IV) [pdf, other]: Title: Video Capsule Endoscopy Classification using Focal Modulation Guided Convolutional Neural Network

Authors: Abhishek Srivastava, Nikhil Kumar Tomar, Ulas Bagci, Debesh Jha

Journal-ref: CBMS 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1504] arXiv:2206.08308 (cross-list from eess.IV) [pdf, ps, other]: Title: Deepfake histological images for enhancing digital pathology

Authors: Kianoush Falahkheirkhah, Saumya Tiwari, Kevin Yeh, Sounak Gupta, Loren Herrera-Hernandez, Michael R. McCarthy, Rafael E. Jimenez, John C. Cheville, Rohit Bhargava

Subjects: Image and Video Processing (eess.IV); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1505] arXiv:2206.08398 (cross-list from eess.IV) [pdf, other]: Title: Learning Generic Lung Ultrasound Biomarkers for Decoupling Feature Extraction from Downstream Tasks

Authors: Gautam Rajendrakumar Gare, Tom Fox, Pete Lowery, Kevin Zamora, Hai V. Tran, Laura Hutchins, David Montgomery, Amita Krishnan, Deva Kannan Ramanan, Ricardo Luis Rodriguez, Bennett P deBoisblanc, John Michael Galeotti

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1506] arXiv:2206.08439 (cross-list from eess.IV) [pdf, other]: Title: OpenSRH: optimizing brain tumor surgery using intraoperative stimulated Raman histology

Authors: Cheng Jiang, Asadur Chowdury, Xinhai Hou, Akhil Kondepudi, Christian W. Freudiger, Kyle Conway, Sandra Camelo-Piragua, Daniel A. Orringer, Honglak Lee, Todd C. Hollon

Comments: Neural Information Processing Systems (NeurIPS) 2022 Datasets and Benchmarks Track

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1507] arXiv:2206.08481 (cross-list from eess.IV) [pdf, other]: Title: Orientation-guided Graph Convolutional Network for Bone Surface Segmentation

Authors: Aimon Rahman, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Ilker Hacihaliloglu, Vishal M Patel

Comments: Accepted at MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1508] arXiv:2206.08543 (cross-list from eess.IV) [pdf, ps, other]: Title: Multi-Classification of Brain Tumor Images Using Transfer Learning Based Deep Neural Network

Authors: Pramit Dutta, Khaleda Akhter Sathi, Md. Saiful Islam

Comments: 7 pages, 4 figures, 2 tables, International Virtual Conference on ARTIFICIAL INTELLIGENCE FOR SMART COMMUNITY, Malaysia

Journal-ref: Conference proceedings © 2023 International Conference on Artificial Intelligence for Smart Community

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1509] arXiv:2206.08557 (cross-list from eess.IV) [pdf, other]: Title: COVID-19 Detection using Transfer Learning with Convolutional Neural Network

Authors: Pramit Dutta, Tanny Roy, Nafisa Anjum

Comments: 4 pages, 4 figures, 2nd International Conference on Robotics, Electrical and Signal Processing Techniques (ICREST), DHAKA, Bangladesh

Journal-ref: 2nd International Conference on Robotics, Electrical and Signal Processing Techniques (ICREST), DHAKA, Bangladesh, 2021, pp. 429-432

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1510] arXiv:2206.08612 (cross-list from eess.IV) [pdf, other]: Title: OADAT: Experimental and Synthetic Clinical Optoacoustic Data for Standardized Image Processing

Authors: Firat Ozdemir, Berkan Lafci, Xosé Luís Deán-Ben, Daniel Razansky, Fernando Perez-Cruz

Comments: Accepted to TMLR. 32 pages, 24 figures, 9 tables

Journal-ref: Transactions on Machine Learning Research (2023) 2835-8856

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1511] arXiv:2206.08671 (cross-list from stat.ML) [pdf, other]: Title: FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification

Authors: Aliaksandra Shysheya, John Bronskill, Massimiliano Patacchiola, Sebastian Nowozin, Richard E Turner

Journal-ref: The Eleventh International Conference on Learning Representations (ICLR 2023)

Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1512] arXiv:2206.08787 (cross-list from eess.IV) [pdf, other]: Title: Leveraging Uncertainty in Deep Learning for Pancreatic Adenocarcinoma Grading

Authors: Biraja Ghoshal, Bhargab Ghoshal, Allan Tucker

Comments: 26th UK Conference on Medical Image Understanding and Analysis; 27 - 29 July 2022; University of Cambridge, UK. arXiv admin note: text overlap with arXiv:2003.10769

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1513] arXiv:2206.08885 (cross-list from eess.IV) [pdf, other]: Title: Incorporating intratumoral heterogeneity into weakly-supervised deep learning models via variance pooling

Authors: Iain Carmichael, Andrew H. Song, Richard J. Chen, Drew F.K. Williamson, Tiffany Y. Chen, Faisal Mahmood

Comments: MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Methodology (stat.ME)
[1514] arXiv:2206.08936 (cross-list from eess.IV) [pdf, other]: Title: Simultaneous Bone and Shadow Segmentation Network using Task Correspondence Consistency

Authors: Aimon Rahman, Jeya Maria Jose Valanarasu, Ilker Hacihaliloglu, Vishal M Patel

Comments: Accepted at MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1515] arXiv:2206.08984 (cross-list from eess.IV) [pdf, other]: Title: Multi-scale Super-resolution Magnetic Resonance Spectroscopic Imaging with Adjustable Sharpness

Authors: Siyuan Dong, Gilbert Hangel, Wolfgang Bogner, Georg Widhalm, Karl Rössler, Siegfried Trattnig, Chenyu You, Robin de Graaf, John Onofrey, James Duncan

Comments: Accepted by MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1516] arXiv:2206.08985 (cross-list from eess.IV) [pdf, other]: Title: TransResU-Net: Transformer based ResU-Net for Real-Time Colonoscopy Polyp Segmentation

Authors: Nikhil Kumar Tomar, Annie Shergill, Brandon Rieders, Ulas Bagci, Debesh Jha

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1517] arXiv:2206.08994 (cross-list from stat.ML) [pdf, other]: Title: Robust Group Synchronization via Quadratic Programming

Authors: Yunpeng Shi, Cole Wyeth, Gilad Lerman

Comments: Accepted to ICML 2022

Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[1518] arXiv:2206.09065 (cross-list from eess.IV) [pdf, ps, other]: Title: Free-form Lesion Synthesis Using a Partial Convolution Generative Adversarial Network for Enhanced Deep Learning Liver Tumor Segmentation

Authors: Yingao Liu, Fei Yang, Yidong Yang

Comments: The paper is under review by JACMP-Journal of Applied Medical Physics

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1519] arXiv:2206.09128 (cross-list from eess.IV) [pdf, other]: Title: A Combined PCA-MLP Network for Early Breast Cancer Detection

Authors: Md. Wahiduzzaman Khan Arnob, Arunima Dey Pooja, Md. Saif Hassan Onim

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1520] arXiv:2206.09146 (cross-list from eess.IV) [pdf, other]: Title: A Perceptually Optimized and Self-Calibrated Tone Mapping Operator

Authors: Peibei Cao, Chenyang Le, Yuming Fang, Kede Ma

Comments: 15 pages, 17 figures

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1521] arXiv:2206.09193 (cross-list from eess.IV) [pdf, ps, other]: Title: Multi-Modality Image Super-Resolution using Generative Adversarial Networks

Authors: Aref Abedjooy, Mehran Ebrahimi

Comments: to be published in the Proceedings of 16th International Conference on Computer Graphics, Visualization, Computer Vision and Image Processing (CGVCVIP), Lisbon, Portugal, July 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1522] arXiv:2206.09210 (cross-list from eess.IV) [pdf, other]: Title: Multi-Modality Image Inpainting using Generative Adversarial Networks

Authors: Aref Abedjooy, Mehran Ebrahimi

Comments: to be published in the Proceedings of 26th Int'l Conf on Image Processing, Computer Vision, & Pattern Recognition (IPCV), July 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1523] arXiv:2206.09309 (cross-list from eess.IV) [pdf, other]: Title: TBraTS: Trusted Brain Tumor Segmentation

Authors: Ke Zou, Xuedong Yuan, Xiaojing Shen, Meng Wang, Huazhu Fu

Comments: 11 pages, 4 figures, Accepted by MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1524] arXiv:2206.09611 (cross-list from eess.IV) [pdf, other]: Title: SJ-HD^2R: Selective Joint High Dynamic Range and Denoising Imaging for Dynamic Scenes

Authors: Wei Li, Shuai Xiao, Tianhong Dai, Shanxin Yuan, Tao Wang, Cheng Li, Fenglong Song

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1525] arXiv:2206.09867 (cross-list from eess.SP) [pdf, other]: Title: WiFi-based Spatiotemporal Human Action Perception

Authors: Yanling Hao, Zhiyuan Shi, Yuanwei Liu

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[1526] arXiv:2206.10152 (cross-list from physics.optics) [pdf, ps, other]: Title: Diffractive Interconnects: All-Optical Permutation Operation Using Diffractive Networks

Authors: Deniz Mengu, Yifan Zhao, Anika Tabassum, Mona Jarrahi, Aydogan Ozcan

Comments: 22 Pages, 6 Figures

Journal-ref: Nanophotonics (2022)

Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1527] arXiv:2206.10183 (cross-list from eess.IV) [pdf, ps, other]: Title: covEcho Resource constrained lung ultrasound image analysis tool for faster triaging and active learning

Authors: Jinu Joseph, Mahesh Raveendranatha Panicker, Yale Tung Chen, Kesavadas Chandrasekharan, Vimal Chacko Mondy, Anoop Ayyappan, Jineesh Valakkada, Kiran Vishnu Narayan

Comments: Submitted to Elsevier CMPBUP on Dec 1, 2021

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1528] arXiv:2206.10286 (cross-list from eess.IV) [pdf, other]: Title: Position-prior Clustering-based Self-attention Module for Knee Cartilage Segmentation

Authors: Dong Liang, Jun Liu, Kuanquan Wang, Gongning Luo, Wei Wang, Shuo Li

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1529] arXiv:2206.10294 (cross-list from eess.IV) [pdf, other]: Title: Using the Polar Transform for Efficient Deep Learning-Based Aorta Segmentation in CTA Images

Authors: Marin Benčević, Marija Habijan, Irena Galić, Danilo Babin

Comments: Accepted to 64th International Symposium ELMAR-2022, Zadar, Croatia

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1530] arXiv:2206.10357 (cross-list from eess.IV) [pdf, other]: Title: Confidence-Guided Unsupervised Domain Adaptation for Cerebellum Segmentation

Authors: Xuan Li, Paule-J Toussaint, Alan Evans, Xue Liu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1531] arXiv:2206.10455 (cross-list from eess.IV) [src]: Title: Automated Coronary Calcium Scoring using U-Net Models through Semi-supervised Learning on Non-Gated CT Scans

Authors: Sanskriti Singh

Comments: There is no correlation between gated and non-gated CT scans causing the points used in the training and results to be flawed. It was inaccurately assumed that there was a correlation between the scans

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1532] arXiv:2206.10543 (cross-list from eess.IV) [pdf, other]: Title: Faster Diffusion Cardiac MRI with Deep Learning-based breath hold reduction

Authors: Michael Tanzer, Pedro Ferreira, Andrew Scott, Zohya Khalique, Maria Dwornik, Dudley Pennell, Guang Yang, Daniel Rueckert, Sonia Nielles-Vallespin

Comments: 15 pages, 1 figure, 2 tables. To be published in MIUA22

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1533] arXiv:2206.10750 (cross-list from eess.SP) [pdf, other]: Title: Floor Map Reconstruction Through Radio Sensing and Learning By a Large Intelligent Surface

Authors: Cristian J. Vaca-Rubio, Roberto Pereira, Xavier Mestre, David Gregoratti, Zheng-Hua Tan, Elisabeth de Carvalho, Petar Popovski

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[1534] arXiv:2206.10802 (cross-list from eess.IV) [pdf, other]: Title: SVoRT: Iterative Transformer for Slice-to-Volume Registration in Fetal Brain MRI

Authors: Junshen Xu, Daniel Moyer, P. Ellen Grant, Polina Golland, Juan Eugenio Iglesias, Elfar Adalsteinsson

Comments: Accepted by MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1535] arXiv:2206.10810 (cross-list from eess.IV) [pdf, other]: Title: A Simple Baseline for Video Restoration with Grouped Spatial-temporal Shift

Authors: Dasong Li, Xiaoyu Shi, Yi Zhang, Ka Chun Cheung, Simon See, Xiaogang Wang, Hongwei Qin, Hongsheng Li

Comments: Accepted to CVPR2023

Journal-ref: 2023 Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1536] arXiv:2206.10911 (cross-list from eess.IV) [pdf, other]: Title: Influence of uncertainty estimation techniques on false-positive reduction in liver lesion detection

Authors: Ishaan Bhat, Josien P. W. Pluim, Max A. Viergever, Hugo J. Kuijf

Comments: Accepted for publication in the Journal of Machine Learning for Biomedical Imaging (MELBA)

Journal-ref: https://www.melba-journal.org/papers/2022:030.html

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1537] arXiv:2206.10912 (cross-list from eess.IV) [pdf, ps, other]: Title: AI-based software for lung nodule detection in chest X-rays -- Time for a second reader approach?

Authors: Susanne Ohlmann-Knafo, Naglis Ramanauskas, Sebastian Huettinger, Emil Johnson Jeyakumar, Darius Barušauskas, Neringa Bielskienė, Vytautas Naujalis, Jonas Bialopetravičius, Jonas Ražanskas, Artūras Samuilis, Jūratė Dementavičienė, Dirk Pickuth

Comments: This paper is in submission process to the European Radiology journal

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1538] arXiv:2206.11048 (cross-list from eess.IV) [pdf, other]: Title: Automated GI tract segmentation using deep learning

Authors: Manhar Sharma

Comments: 8 pages, 9 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1539] arXiv:2206.11127 (cross-list from eess.IV) [pdf, ps, other]: Title: CNN-based fully automatic wrist cartilage volume quantification in MR Image

Authors: Nikita Vladimirov, Ekaterina Brui, Anatoliy Levchuk, Vladimir Fokin, Aleksandr Efimtcev, David Bendahan

Comments: 17 pages, 6 Figures, 6 Tables, 1 Supplementary

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1540] arXiv:2206.11458 (cross-list from eess.IV) [pdf, other]: Title: Weighted Concordance Index Loss-based Multimodal Survival Modeling for Radiation Encephalopathy Assessment in Nasopharyngeal Carcinoma Radiotherapy

Authors: Jiansheng Fang, Anwei Li, Pu-Yun OuYang, Jiajian Li, Jingwen Wang, Hongbo Liu, Fang-Yun Xie, Jiang Liu

Comments: 11 pages, 3 figures, MICCAI2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1541] arXiv:2206.11501 (cross-list from eess.IV) [pdf, other]: Title: A novel adversarial learning strategy for medical image classification

Authors: Zong Fan, Xiaohui Zhang, Jacob A. Gasienica, Jennifer Potts, Su Ruan, Wade Thorstad, Hiram Gay, Pengfei Song, Xiaowei Wang, Hua Li

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1542] arXiv:2206.11599 (cross-list from eess.IV) [pdf, other]: Title: Universal Learned Image Compression With Low Computational Cost

Authors: Bowen Li, Yao Xin, Youneng Bao, Fanyang Meng, Yongsheng Liang, Wen Tan

Comments: 5 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1543] arXiv:2206.11669 (cross-list from physics.ao-ph) [pdf, other]: Title: Short-range forecasts of global precipitation using deep learning-augmented numerical weather prediction

Authors: Manmeet Singh, Vaisakh S B, Nachiketa Acharya, Aditya Grover, Suryachandra A Rao, Bipin Kumar, Zong-Liang Yang, Dev Niyogi

Comments: Accepted at Tackling Climate Change with Machine Learning: workshop at NeurIPS 2022

Subjects: Atmospheric and Oceanic Physics (physics.ao-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1544] arXiv:2206.11943 (cross-list from eess.IV) [pdf, other]: Title: TIAger: Tumor-Infiltrating Lymphocyte Scoring in Breast Cancer for the TiGER Challenge

Authors: Adam Shephard, Mostafa Jahanifar, Ruoyu Wang, Muhammad Dawood, Simon Graham, Kastytis Sidlauskas, Syed Ali Khurram, Nasir Rajpoot, Shan E Ahmed Raza

Comments: TiGER Challenge entry

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1545] arXiv:2206.12112 (cross-list from eess.IV) [pdf, other]: Title: Dissecting U-net for Seismic Application: An In-Depth Study on Deep Learning Multiple Removal

Authors: Ricard Durall, Ammar Ghanim, Norman Ettrich, Janis Keuper

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1546] arXiv:2206.12136 (cross-list from eess.IV) [pdf, other]: Title: Feature Representation Learning for Robust Retinal Disease Detection from Optical Coherence Tomography Images

Authors: Sharif Amit Kamran, Khondker Fariha Hossain, Alireza Tavakkoli, Stewart Lee Zuckerbrod, Salah A. Baker

Comments: Accepted to MICCAI2022 Ophthalmic Medical Image Analysis (OMIA) Workshop

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1547] arXiv:2206.12300 (cross-list from eess.IV) [pdf, ps, other]: Title: Automatic extraction of coronary arteries using deep learning in invasive coronary angiograms

Authors: Yinghui Meng, Zhenglong Du, Chen Zhao, Minghao Dong, Drew Pienta, Zhihui Xu, Weihua Zhou

Comments: 22 pages, 5 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1548] arXiv:2206.12344 (cross-list from eess.IV) [pdf, other]: Title: Segmentation-free PVC for Cardiac SPECT using a Densely-connected Multi-dimensional Dynamic Network

Authors: Huidong Xie, Zhao Liu, Luyao Shi, Kathleen Greco, Xiongchao Chen, Bo Zhou, Attila Feher, John C. Stendahl, Nabil Boutagy, Tassos C. Kyriakides, Ge Wang, Albert J. Sinusas, Chi Liu

Comments: 12 pages, 11 figures. Accepted for publication at IEEE Transactions on Medical Imaging

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1549] arXiv:2206.12407 (cross-list from eess.IV) [pdf, ps, other]: Title: Independent evaluation of state-of-the-art deep networks for mammography

Authors: Osvaldo Matias Velarde, Lucas Parra

Comments: 17 pages, 8 figures, 4 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1550] arXiv:2206.12417 (cross-list from eess.IV) [pdf, other]: Title: Deep embedded clustering algorithm for clustering PACS repositories

Authors: Teo Manojlović, Matija Milanič, Ivan Štajduhar

Journal-ref: Proceedings of the 2021 IEEE 34th International Symposium on Computer-Based Medical Systems (CBMS)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1551] arXiv:2206.12512 (cross-list from eess.IV) [pdf, other]: Title: Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings

Authors: Sophia Bano, Alessandro Casella, Francisco Vasconcelos, Abdul Qayyum, Abdesslam Benzinou, Moona Mazher, Fabrice Meriaudeau, Chiara Lena, Ilaria Anita Cintorrino, Gaia Romana De Paolis, Jessica Biagioli, Daria Grechishnikova, Jing Jiao, Bizhe Bai, Yanyan Qiao, Binod Bhattarai, Rebati Raman Gaire, Ronast Subedi, Eduard Vazquez, Szymon Płotka, Aneta Lisowska, Arkadiusz Sitek, George Attilakos, Ruwan Wimalasundera, Anna L David, Dario Paladini, Jan Deprest, Elena De Momi, Leonardo S Mattos, Sara Moccia, Danail Stoyanov

Comments: Accepted at MedIA (Medical Image Analysis)

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1552] arXiv:2206.12809 (cross-list from eess.SP) [pdf, other]: Title: Role and Integration of Image Processing Systems in Maritime Target Tracking

Authors: Yassir Zardoua, Bilal Sebbar, Moussab Chbeine, Abdelali Astito, Mohammed Boulaala

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[1553] arXiv:2206.12815 (cross-list from eess.IV) [pdf, other]: Title: Breast Cancer Classification using Deep Learned Features Boosted with Handcrafted Features

Authors: Unaiza Sajid, Rizwan Ahmed Khan, Shahid Munir Shah, Sheeraz Arif

Journal-ref: Biomedical Signal Processing and Control 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1554] arXiv:2206.12980 (cross-list from eess.IV) [pdf, ps, other]: Title: Detecting Schizophrenia with 3D Structural Brain MRI Using Deep Learning

Authors: Junhao Zhang, Vishwanatha M. Rao, Ye Tian, Yanting Yang, Nicolas Acosta, Zihan Wan, Pin-Yu Lee, Chloe Zhang, Lawrence S. Kegeles, Scott A. Small, Jia Guo

Comments: 13 pages, 6 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1555] arXiv:2206.13086 (cross-list from stat.ML) [pdf, other]: Title: RankSEG: A Consistent Ranking-based Framework for Segmentation

Authors: Ben Dai, Chunlin Li

Comments: 50 pages

Journal-ref: Journal of Machine Learning Research, 24(224), 1-50 (2023)

Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Statistics Theory (math.ST)
[1556] arXiv:2206.13123 (cross-list from eess.IV) [pdf, other]: Title: Unsupervised Domain Adaptation Using Feature Disentanglement And GCNs For Medical Image Classification

Authors: Dwarikanath Mahapatra

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1557] arXiv:2206.13173 (cross-list from eess.IV) [pdf, ps, other]: Title: Context-Aware Transformers For Spinal Cancer Detection and Radiological Grading

Authors: Rhydian Windsor, Amir Jamaludin, Timor Kadir, Andrew Zisserman

Comments: Pre-print of paper accepted to MICCAI 2022. 15 pages, 7 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1558] arXiv:2206.13295 (cross-list from eess.IV) [pdf, other]: Title: Diffusion Deformable Model for 4D Temporal Medical Image Generation

Authors: Boah Kim, Jong Chul Ye

Comments: Accepted for MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1559] arXiv:2206.13385 (cross-list from eess.IV) [pdf, other]: Title: 3D unsupervised anomaly detection and localization through virtual multi-view projection and reconstruction: Clinical validation on low-dose chest computed tomography

Authors: Kyung-Su Kim, Seong Je Oh, Ju Hwan Lee, Myung Jin Chung

Comments: Kyung-Su Kim and Seong Je Oh have contributed equally to this work as the co-first author. Kyung-Su Kim (kskim.doc@gmail.com) and Myung Jin Chung (mj1.chung@samsung.com) have contributed equally to this work as the co-corresponding author

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1560] arXiv:2206.13393 (cross-list from eess.IV) [pdf, other]: Title: Cross-Modal Transformer GAN: A Brain Structure-Function Deep Fusing Framework for Alzheimer's Disease

Authors: Junren Pan, Shuqiang Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[1561] arXiv:2206.13394 (cross-list from eess.IV) [pdf, other]: Title: CS$^2$: A Controllable and Simultaneous Synthesizer of Images and Annotations with Minimal Human Intervention

Authors: Xiaodan Xing, Jiahao Huang, Yang Nan, Yinzhe Wu, Chengjia Wang, Zhifan Gao, Simon Walsh, Guang Yang

Comments: 11 figures, Accepted by MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1562] arXiv:2206.13419 (cross-list from eess.IV) [pdf, other]: Title: DeStripe: A Self2Self Spatio-Spectral Graph Neural Network with Unfolded Hessian for Stripe Artifact Removal in Light-sheet Microscopy

Authors: Yu Liu, Kurt Weiss, Nassir Navab, Carsten Marr, Jan Huisken, Tingying Peng

Comments: Accepted by 25th International Conference on Medical Image Computing and Computer Assisted Intervention

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1563] arXiv:2206.13455 (cross-list from eess.IV) [pdf, other]: Title: IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments

Authors: Abanob Soliman, Fabien Bonardi, Désiré Sidibé, Samia Bouchafa

Comments: Accepted for publication in the Journal of Intelligent & Robotic Systems

Journal-ref: J Intell Robot Syst 106, 53 (2022)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1564] arXiv:2206.13468 (cross-list from math.AG) [pdf, ps, other]: Title: An Atlas for the Pinhole Camera

Authors: Sameer Agarwal, Timothy Duff, Max Lieblich, Rekha Thomas

Comments: 47 pages with references and appendices, final version

Journal-ref: JFoCM, 2022

Subjects: Algebraic Geometry (math.AG); Computer Vision and Pattern Recognition (cs.CV); Commutative Algebra (math.AC)
[1565] arXiv:2206.13504 (cross-list from eess.IV) [pdf, other]: Title: AI-based computer-aided diagnostic system of chest digital tomography synthesis: Demonstrating comparative advantage with X-ray-based AI systems

Authors: Kyung-Su Kim, Ju Hwan Lee, Seong Je Oh, Myung Jin Chung

Comments: Kyung-Su Kim, Ju Hwan Lee, and Seong Je Oh have contributed equally to this work as the co-first author. Kyung-Su Kim (kskim.doc@gmail.com) and Myung Jin Chung (mj1.chung@samsung.com) have contributed equally to this work as the co-corresponding author

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1566] arXiv:2206.13505 (cross-list from eess.IV) [pdf, other]: Title: Deep Learning-Based Defect Classification and Detection in SEM Images

Authors: Bappaditya Dey, Dipam Goswami, Sandip Halder, Kasem Khalil, Philippe Leray, Magdy A. Bayoumi

Journal-ref: In Metrology, Inspection, and Process Control XXXVI, SPIE (2022)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1567] arXiv:2206.13506 (cross-list from eess.IV) [pdf, other]: Title: Tensor Recovery Based on A Novel Non-convex Function Minimax Logarithmic Concave Penalty Function

Authors: Hongbing Zhang, Xinyi Liu, Chang Liu, Hongtao Fan, Yajing Li, Xinyun Zhu

Comments: arXiv admin note: substantial text overlap with arXiv:2201.12709, arXiv:2109.12257

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1568] arXiv:2206.13613 (cross-list from eess.IV) [pdf, other]: Title: Flexible-Rate Learned Hierarchical Bi-Directional Video Compression With Motion Refinement and Frame-Level Bit Allocation

Authors: Eren Cetin, M. Akin Yilmaz, A. Murat Tekalp

Comments: Accepted for publication in IEEE International Conference on Image Processing (ICIP 2022)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1569] arXiv:2206.13632 (cross-list from eess.IV) [pdf, other]: Title: Omni-Seg: A Scale-aware Dynamic Network for Renal Pathological Image Segmentation

Authors: Ruining Deng, Quan Liu, Can Cui, Tianyuan Yao, Jun Long, Zuhayr Asad, R. Michael Womick, Zheyu Zhu, Agnes B. Fogo, Shilin Zhao, Haichun Yang, Yuankai Huo

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1570] arXiv:2206.13740 (cross-list from eess.IV) [pdf, other]: Title: GAN-based Super-Resolution and Segmentation of Retinal Layers in Optical coherence tomography Scans

Authors: Paria Jeihouni, Omid Dehzangi, Annahita Amireskandari, Ali Rezai, Nasser M. Nasrabadi

Comments: 5 pages, 7 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1571] arXiv:2206.13872 (cross-list from stat.ML) [pdf, other]: Title: When are Post-hoc Conceptual Explanations Identifiable?

Authors: Tobias Leemann, Michael Kirchhof, Yao Rong, Enkelejda Kasneci, Gjergji Kasneci

Comments: v5: UAI2023 camera-ready including supplementary material. The first two authors contributed equally

Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1572] arXiv:2206.13903 (cross-list from eess.IV) [pdf, other]: Title: AS-IntroVAE: Adversarial Similarity Distance Makes Robust IntroVAE

Authors: Changjie Lu, Shen Zheng, Zirui Wang, Omar Dib, Gaurav Gupta

Comments: ACML conference paper

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1573] arXiv:2206.14305 (cross-list from eess.IV) [pdf, ps, other]: Title: Multistep Automated Data Labelling Procedure (MADLaP) for Thyroid Nodules on Ultrasound: An Artificial Intelligence Approach for Automating Image Annotation

Authors: Jikai Zhang, Maciej M. Mazurowski, Brian C. Allen, Benjamin Wildman-Torbiner

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[1574] arXiv:2206.14678 (cross-list from eess.IV) [pdf, other]: Title: BiometryNet: Landmark-based Fetal Biometry Estimation from Standard Ultrasound Planes

Authors: Netanell Avisdris, Leo Joskowicz, Brian Dromey, Anna L. David, Donald M. Peebles, Danail Stoyanov, Dafna Ben Bashat, Sophia Bano

Comments: 13 pages, 6 figures, Accepted to MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1575] arXiv:2206.14713 (cross-list from eess.IV) [pdf, other]: Title: CONVIQT: Contrastive Video Quality Estimator

Authors: Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1576] arXiv:2206.14746 (cross-list from eess.IV) [pdf, other]: Title: Placenta Segmentation in Ultrasound Imaging: Addressing Sources of Uncertainty and Limited Field-of-View

Authors: Veronika A. Zimmer, Alberto Gomez, Emily Skelton, Robert Wright, Gavin Wheeler, Shujie Deng, Nooshin Ghavami, Karen Lloyd, Jacqueline Matthew, Bernhard Kainz, Daniel Rueckert, Joseph V. Hajnal, Julia A. Schnabel

Comments: 21 pages (18 + appendix), 13 figures (9 + appendix)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1577] arXiv:2206.14820 (cross-list from astro-ph.CO) [pdf, other]: Title: Strong Lensing Source Reconstruction Using Continuous Neural Fields

Authors: Siddharth Mishra-Sharma, Ge Yang

Comments: 9+2 pages, 3+2 figures, Spotlight at the Machine Learning for Astrophysics Workshop at ICML 2022; v2, references added

Subjects: Cosmology and Nongalactic Astrophysics (astro-ph.CO); Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1578] arXiv:2206.14847 (cross-list from eess.IV) [pdf, other]: Title: Deep Reinforcement Learning for Small Bowel Path Tracking using Different Types of Annotations

Authors: Seung Yeon Shin, Ronald M. Summers

Comments: Accepted to MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1579] arXiv:2206.14861 (cross-list from eess.IV) [pdf, other]: Title: Two-Stage COVID19 Classification Using BERT Features

Authors: Weijun Tan, Qi Yao, Jingfeng Liu

Comments: arXiv admin note: text overlap with arXiv:2106.14403

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1580] arXiv:2206.14903 (cross-list from eess.IV) [pdf, other]: Title: CIRDataset: A large-scale Dataset for Clinically-Interpretable lung nodule Radiomics and malignancy prediction

Authors: Wookjin Choi, Navdeep Dahiya, Saad Nadeem

Comments: MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1581] arXiv:2206.14919 (cross-list from eess.IV) [pdf, other]: Title: Identifying and Combating Bias in Segmentation Networks by leveraging multiple resolutions

Authors: Leonie Henschel, David Kügler, Derek S Andrews, Christine W Nordahl, Martin Reuter

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1582] arXiv:2206.14951 (cross-list from eess.IV) [pdf, other]: Title: CLTS-GAN: Color-Lighting-Texture-Specular Reflection Augmentation for Colonoscopy

Authors: Shawn Mathew, Saad Nadeem, Arie Kaufman

Comments: MICCAI 2022. First two authors contributed equally

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1583] arXiv:2206.15069 (cross-list from eess.IV) [pdf, other]: Title: PVT-COV19D: Pyramid Vision Transformer for COVID-19 Diagnosis

Authors: Lilang Zheng, Jiaxuan Fang, Xiaorun Tang, Hanzhang Li, Jiaxin Fan, Tianyi Wang, Rui Zhou, Zhaoyan Yan

Comments: 8 pages, 1 figure

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1584] arXiv:2206.15073 (cross-list from eess.IV) [pdf, other]: Title: COVID Detection and Severity Prediction with 3D-ConvNeXt and Custom Pretrainings

Authors: Daniel Kienzle, Julian Lorenz, Robin Schön, Katja Ludwig, Rainer Lienhart

Comments: 17 pages, 3 figures, information about challenge submission

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1585] arXiv:2206.15134 (cross-list from eess.IV) [pdf, other]: Title: InsMix: Towards Realistic Generative Data Augmentation for Nuclei Instance Segmentation

Authors: Yi Lin, Zeyu Wang, Kwang-Ting Cheng, Hao Chen

Comments: Accepted by MICCAI 2022 (early accepted)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1586] arXiv:2206.15179 (cross-list from eess.IV) [src]: Title: A Medical Image Fusion Method based on MDLatLRRv2

Authors: Xu Song, Xiao-Jun Wu, Hui Li

Comments: There are some errors that need to be corrected

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1587] arXiv:2206.15182 (cross-list from eess.IV) [pdf, other]: Title: The (de)biasing effect of GAN-based augmentation methods on skin lesion images

Authors: Agnieszka Mikołajczyk, Sylwia Majchrowska, Sandra Carrasco Limeros

Comments: Accepted to MICCAI2022

Journal-ref: In: Wang, L., Dou, Q., Fletcher, P.T., Speidel, S., Li, S. (eds) Medical Image Computing and Computer Assisted Intervention - MICCAI 2022. MICCAI 2022. Lecture Notes in Computer Science, vol 13438. Springer, Cham

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1588] arXiv:2206.15217 (cross-list from eess.IV) [pdf, other]: Title: Implicit U-Net for volumetric medical image segmentation

Authors: Sergio Naval Marimont, Giacomo Tarroni

Comments: 11 pages, 4 figures, Accepted MIUA 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1589] arXiv:2206.15254 (cross-list from eess.IV) [pdf, other]: Title: Localizing the Recurrent Laryngeal Nerve via Ultrasound with a Bayesian Shape Framework

Authors: Haoran Dou, Luyi Han, Yushuang He, Jun Xu, Nishant Ravikumar, Ritse Mann, Alejandro F. Frangi, Pew-Thian Yap, Yunzhi Huang

Comments: Early Accepted by MICCAI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1590] arXiv:2206.15274 (cross-list from eess.IV) [pdf, other]: Title: Augment like there's no tomorrow: Consistently performing neural networks for medical imaging

Authors: Joona Pohjonen, Carolin Stürenberg, Atte Föhr, Reija Randen-Brady, Lassi Luomala, Jouni Lohi, Esa Pitkänen, Antti Rannikko, Tuomas Mirtti

Comments: Code for the paper is available from this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1591] arXiv:2206.15431 (cross-list from eess.IV) [pdf, other]: Title: Ensemble CNN models for Covid-19 Recognition and Severity Perdition From 3D CT-scan

Authors: Fares Bougourzi, Cosimo Distante, Fadi Dornaika, Abdelmalik Taleb-Ahmed

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
