Computer Vision and Pattern Recognition
Authors and titles for cs.CV in Nov 2021
[ total of 1539 entries: 1-1539 ][ showing 1539 entries per page: fewer | more ]
- [1] arXiv:2111.00006 [pdf, other]
-
Title: Adaptive Hierarchical Similarity Metric Learning with Noisy LabelsComments: 11 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [2] arXiv:2111.00007 [pdf, other]
-
Title: Domain Agnostic Few-Shot Learning For Document IntelligenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [3] arXiv:2111.00038 [pdf, other]
-
Title: On-device Real-time Hand Gesture RecognitionAuthors: George Sung, Kanstantsin Sokal, Esha Uboweja, Valentin Bazarevsky, Jonathan Baccash, Eduard Gabriel Bazavan, Chuo-Ling Chang, Matthias GrundmannComments: 5 pages, 6 figures; ICCV Workshop on Computer Vision for Augmented and Virtual Reality, Montreal, Canada, 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [4] arXiv:2111.00042 [pdf, other]
-
Title: CvS: Classification via Segmentation For Small DatasetsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [5] arXiv:2111.00056 [pdf, other]
-
Title: Generalized Data Weighting via Class-level Gradient ManipulationComments: 17 pages, 8 figures, accepted by NeurIPS 2021 for a poster session, camera-ready version, initial submission to arXivSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (cs.LG)
- [6] arXiv:2111.00063 [pdf, other]
-
Title: Polyline Generative Navigable Space Segmentation for Autonomous Visual NavigationSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [7] arXiv:2111.00079 [pdf, other]
-
Title: Deep Deterministic Uncertainty for Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [8] arXiv:2111.00112 [pdf, ps, other]
-
Title: Classification of jujube fruit based on several pricing factors using machine learning methodsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [9] arXiv:2111.00116 [pdf, other]
-
Title: Visual Explanations for Convolutional Neural Networks via Latent Traversal of Generative Adversarial NetworksComments: 2 pages, 2 figures, to appear as extended abstract at AAAI-22Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [10] arXiv:2111.00121 [pdf, other]
-
Title: Longitudinal Analysis of Mask and No-Mask on Child Face RecognitionComments: 5 Pages, 3 FigureSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [11] arXiv:2111.00131 [pdf, other]
-
Title: Three approaches to facilitate DNN generalization to objects in out-of-distribution orientations and illuminationsAuthors: Akira Sakai, Taro Sunagawa, Spandan Madan, Kanata Suzuki, Takashi Katoh, Hiromichi Kobashi, Hanspeter Pfister, Pawan Sinha, Xavier Boix, Tomotake SasakiSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [12] arXiv:2111.00140 [pdf, other]
-
Title: DIB-R++: Learning to Predict Lighting and Material with a Hybrid Differentiable RendererAuthors: Wenzheng Chen, Joey Litalien, Jun Gao, Zian Wang, Clement Fuji Tsang, Sameh Khamis, Or Litany, Sanja FidlerSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [13] arXiv:2111.00164 [pdf, other]
-
Title: HIERMATCH: Leveraging Label Hierarchies for Improving Semi-Supervised LearningComments: 11 pages, 1 figure, Accepted in WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [14] arXiv:2111.00176 [pdf, ps, other]
-
Title: Iris Recognition Based on SIFT FeaturesAuthors: Fernando Alonso-Fernandez, Pedro Tome-Gonzalez, Virginia Ruiz-Albacete, Javier Ortega-GarciaComments: Published at IEEE International Conference on Biometrics, Identity and Security (BIdS)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [15] arXiv:2111.00178 [pdf, other]
-
Title: Direct attacks using fake images in iris verificationAuthors: Virginia Ruiz-Albacete, Pedro Tome-Gonzalez, Fernando Alonso-Fernandez, Javier Galbally, Julian Fierrez, Javier Ortega-GarciaComments: Published at European Workshop on Biometrics and Identity Management (BIOID)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [16] arXiv:2111.00184 [pdf, other]
-
Title: Geometry-Aware Hierarchical Bayesian Learning on ManifoldsComments: Published in WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [17] arXiv:2111.00190 [pdf, other]
-
Title: Leveraging SE(3) Equivariance for Self-Supervised Category-Level Object Pose EstimationComments: 20 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [18] arXiv:2111.00201 [pdf, other]
-
Title: A Comparative Review of Recent Few-Shot Object Detection AlgorithmsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [19] arXiv:2111.00203 [pdf, other]
-
Title: Imitating Arbitrary Talking Style for Realistic Audio-DrivenTalking Face SynthesisComments: Accepted by MM2021, code available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Multimedia (cs.MM)
- [20] arXiv:2111.00207 [pdf, other]
-
Title: PatchFormer: An Efficient Point Transformer with Patch AttentionComments: 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [21] arXiv:2111.00228 [pdf, other]
-
Title: whu-nercms at trecvid2021:instance search taskAuthors: Yanrui Niu, Jingyao Yang, Ankang Lu, Baojin Huang, Yue Zhang, Ji Huang, Shishi Wen, Dongshu Xu, Chao Liang, Zhongyuan Wang, Jun ChenComments: 9 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Multimedia (cs.MM)
- [22] arXiv:2111.00231 [pdf, other]
-
Title: Two Heads are Better than One: Geometric-Latent Attention for Point Cloud Classification and SegmentationComments: Accepted in BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [23] arXiv:2111.00232 [pdf, other]
-
Title: MFNet: Multi-class Few-shot Segmentation Network with Pixel-wise Metric LearningComments: Accepted on IEEE Transactions on Circuits and Systems for Video TechnologySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [24] arXiv:2111.00298 [pdf, other]
-
Title: A fast accurate fine-grain object detection model based on YOLOv4 deep neural networkSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [25] arXiv:2111.00312 [pdf, other]
-
Title: 3DP3: 3D Scene Perception via Probabilistic ProgrammingAuthors: Nishad Gothoskar, Marco Cusumano-Towner, Ben Zinberg, Matin Ghavamizadeh, Falk Pollok, Austin Garrett, Joshua B. Tenenbaum, Dan Gutfreund, Vikash K. MansinghkaComments: NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [26] arXiv:2111.00398 [pdf, other]
-
Title: A Simple Approach to Image Tilt Correction with Self-Attention MobileNet for SmartphonesComments: Accepted - British Machine vision Conference 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [27] arXiv:2111.00406 [pdf, other]
-
Title: PANet: Perspective-Aware Network with Dynamic Receptive Fields and Self-Distilling Supervision for Crowd CountingComments: The paper is under consideration at Computer Vision and Image UnderstandingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [28] arXiv:2111.00440 [pdf, other]
-
Title: Loop closure detection using local 3D deep descriptorsComments: This work is accepted for publication in IEEE Robotics and Automation LettersSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [29] arXiv:2111.00454 [pdf, other]
-
Title: Gaussian Kernel Mixture Network for Single Image Defocus DeblurringComments: Accepted by NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [30] arXiv:2111.00485 [pdf, other]
-
Title: Learned Image Compression with Separate Hyperprior DecodersComments: This paper has been accepted by IEEE Open Journal of Circuits and SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [31] arXiv:2111.00487 [pdf, other]
-
Title: Smart(Sampling)Augment: Optimal and Efficient Data Augmentation for Semantic SegmentationComments: Negassi and Wagner provided an equal contributionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [32] arXiv:2111.00500 [pdf, other]
-
Title: DPNET: Dual-Path Network for Efficient Object Detectioj with Lightweight Self-AttentionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [33] arXiv:2111.00508 [pdf, other]
-
Title: Fully convolutional Siamese neural networks for buildings damage assessment from satellite imagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [34] arXiv:2111.00509 [pdf, other]
-
Title: DRBANET: A Lightweight Dual-Resolution Network for Semantic Segmentation with Boundary AuxiliarySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [35] arXiv:2111.00531 [pdf, other]
-
Title: Learning Debiased and Disentangled Representations for Semantic SegmentationComments: Accepted by NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [36] arXiv:2111.00538 [pdf, other]
-
Title: From Face to Gait: Weakly-Supervised Learning of Gender Information from Walking PatternsComments: Accepted at Face & Gesture Recognition 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [37] arXiv:2111.00598 [pdf, other]
-
Title: The 5th Recognizing Families in the Wild Data Challenge: Predicting Kinship from FacesComments: 2021 IEEE Conference on Automatic Face and Gesture RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [38] arXiv:2111.00643 [pdf, other]
-
Title: Learning Distilled Collaboration Graph for Multi-Agent PerceptionComments: Accepted to 35th Conference on Neural Information Processing Systems (NeurIPS 2021)Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [39] arXiv:2111.00648 [pdf, other]
-
Title: Accurate Point Cloud Registration with Robust Optimal TransportAuthors: Zhengyang Shen, Jean Feydy, Peirong Liu, Ariel Hernán Curiale, Ruben San Jose Estepar, Raul San Jose Estepar, Marc NiethammerComments: Accepted in NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
- [40] arXiv:2111.00657 [pdf, other]
-
Title: TriVoC: Efficient Voting-based Consensus Maximization for Robust Point Cloud Registration with Extreme Outlier RatiosJournal-ref: IEEE Robotics and Automation Letters (Volume: 7, Issue: 2, April 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [41] arXiv:2111.00659 [pdf, other]
-
Title: Feature Aggregation and Refinement Network for 2D AnatomicalLandmark DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [42] arXiv:2111.00660 [pdf, other]
-
Title: Evaluation of Human and Machine Face Detection using a Novel Distinctive Human Appearance DatasetSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [43] arXiv:2111.00674 [pdf, other]
-
Title: Distilling Object Detectors with Feature RichnessComments: Accepted in NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [44] arXiv:2111.00687 [pdf, other]
-
Title: RMNet: Equivalently Removing Residual Connection from NetworksComments: Equivalently removing residual connection from ResBlock with non-linear layer inside it, towards an efficient plain modelSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [45] arXiv:2111.00728 [pdf, other]
-
Title: Learning Iterative Robust Transformation SynchronizationComments: To appear in 3DV2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [46] arXiv:2111.00754 [pdf, other]
-
Title: Few-shot learning with improved local representations via bias rectify moduleSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [47] arXiv:2111.00763 [pdf, other]
-
Title: Monocular 3D Reconstruction of Interacting Hands via Collision-Aware Factorized RefinementsComments: Accepted to 3DV 2021. Code and demo is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [48] arXiv:2111.00770 [pdf, other]
-
Title: Dense Prediction with Attentive Feature AggregationComments: 20 pages, 14 figures, WACV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [49] arXiv:2111.00772 [pdf, other]
-
Title: AdaPool: Exponential Adaptive Pooling for Information-Retaining DownsamplingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [50] arXiv:2111.00775 [pdf, other]
-
Title: PP-ShiTu: A Practical Lightweight Image Recognition SystemAuthors: Shengyu Wei, Ruoyu Guo, Cheng Cui, Bin Lu, Shuilong Dong, Tingquan Gao, Yuning Du, Ying Zhou, Xueying Lyu, Qiwen Liu, Xiaoguang Hu, Dianhai Yu, Yanjun MaComments: 9 pages, 5 figures, 9 tables. arXiv admin note: text overlap with arXiv:2109.03144Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [51] arXiv:2111.00791 [pdf, other]
-
Title: A New Look at Spike-Timing-Dependent Plasticity Networks for Spatio-Temporal Feature LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [52] arXiv:2111.00794 [pdf, other]
-
Title: Geodesic Models with Convexity Shape PriorComments: This paper has been accepted by TPAMISubjects: Computer Vision and Pattern Recognition (cs.CV)
- [53] arXiv:2111.00801 [pdf, other]
-
Title: Livestock Monitoring with TransformerAuthors: Bhavesh Tangirala, Ishan Bhandari, Daniel Laszlo, Deepak K. Gupta, Rajat M. Thomas, Devanshu AryaComments: Accepted at BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [54] arXiv:2111.00823 [pdf, other]
-
Title: LSTA-Net: Long short-term Spatio-Temporal Aggregation Network for Skeleton-based Action RecognitionComments: Accepted by BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [55] arXiv:2111.00861 [pdf, other]
-
Title: A Frequency Perspective of Adversarial RobustnessAuthors: Shishira R Maiya, Max Ehrlich, Vatsal Agarwal, Ser-Nam Lim, Tom Goldstein, Abhinav ShrivastavaSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [56] arXiv:2111.00865 [pdf, other]
-
Title: MEmoBERT: Pre-training Model with Prompt-based Learning for Multimodal Emotion RecognitionComments: 4 papges, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [57] arXiv:2111.00869 [pdf, other]
-
Title: DetectorNet: Transformer-enhanced Spatial Temporal Graph Neural Network for Traffic PredictionAuthors: He Li, Shiyu Zhang, Xuejiao Li, Liangcai Su, Hongjie Huang, Duo Jin, Linghao Chen, Jianbing Huang, Jaesoo YooComments: The 29th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL 2021)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [58] arXiv:2111.00880 [pdf, other]
-
Title: Benchmarks for Corruption Invariant Person Re-identificationComments: Accepted by NeurIPS 2021 Track on Datasets and Benchmarks. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [59] arXiv:2111.00892 [pdf, other]
-
Title: Hierarchical Image Classification with A Literally Toy DatasetSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [60] arXiv:2111.00899 [pdf, other]
-
Title: Equivariant Contrastive LearningAuthors: Rumen Dangovski, Li Jing, Charlotte Loh, Seungwook Han, Akash Srivastava, Brian Cheung, Pulkit Agrawal, Marin SoljačićSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Applied Physics (physics.app-ph)
- [61] arXiv:2111.00902 [pdf, other]
-
Title: PP-PicoDet: A Better Real-Time Object Detector on Mobile DevicesAuthors: Guanghua Yu, Qinyao Chang, Wenyu Lv, Chang Xu, Cheng Cui, Wei Ji, Qingqing Dang, Kaipeng Deng, Guanzhong Wang, Yuning Du, Baohua Lai, Qiwen Liu, Xiaoguang Hu, Dianhai Yu, Yanjun MaComments: 9 pages, 3 figures, 5 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [62] arXiv:2111.00905 [pdf, ps, other]
-
Title: Smart Fashion: A Review of AI Applications in the Fashion & Apparel IndustryAuthors: Seyed Omid Mohammadi, Ahmad Kalhor (University of Tehran, College of Engineering, School of Electrical and Computer Engineering, Tehran, Iran)Comments: 99 Pages, 79 Figures, 24 Tables, Full length manuscriptSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [63] arXiv:2111.00919 [pdf, other]
-
Title: DFCANet: Dense Feature Calibration-Attention Guided Network for Cross Domain Iris Presentation Attack DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [64] arXiv:2111.00928 [pdf, other]
-
Title: Combating Noise: Semi-supervised Learning by Region Uncertainty QuantificationComments: Accepted by NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [65] arXiv:2111.00931 [pdf, ps, other]
-
Title: Structure Information is the Key: Self-Attention RoI Feature Extractor in 3D Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [66] arXiv:2111.00941 [pdf, other]
-
Title: Turning Traffic Monitoring Cameras into Intelligent Sensors for Traffic Density EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [67] arXiv:2111.00943 [pdf, other]
-
Title: SVBRDF Recovery From a Single Image With Highlights using a Pretrained Generative Adversarial NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [68] arXiv:2111.00950 [pdf, other]
-
Title: Higher-Order Implicit Fairing Networks for 3D Human Pose EstimationJournal-ref: British Machine Vision Conference, 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [69] arXiv:2111.00966 [pdf, other]
-
Title: VPFNet: Voxel-Pixel Fusion Network for Multi-class 3D Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [70] arXiv:2111.00969 [pdf, other]
-
Title: Generative Occupancy Fields for 3D Surface-Aware Image SynthesisComments: Accepted to NeurIPS2021. We propose Generative Occupancy Fields(GOF), a 3D-aware generative model which could synthesize realistic images with 3D consistency and simultaneously learn compact object surfacesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [71] arXiv:2111.00993 [pdf, other]
-
Title: Egocentric Human Trajectory Forecasting with a Wearable Camera and Multi-Modal FusionAuthors: Jianing Qiu, Lipeng Chen, Xiao Gu, Frank P.-W. Lo, Ya-Yen Tsai, Jiankai Sun, Jiaqi Liu, Benny LoJournal-ref: IEEE Robotics and Automation Letters, June, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [72] arXiv:2111.00995 [pdf, other]
-
Title: Sign-to-Speech Model for Sign Language Understanding: A Case Study of Nigerian Sign LanguageJournal-ref: 2022, Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence Demo Track. Pages 5924-5927Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [73] arXiv:2111.01004 [pdf, other]
-
Title: Improving Contrastive Learning on Imbalanced Seed Data via Open-World SamplingComments: Neurips 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [74] arXiv:2111.01007 [pdf, other]
-
Title: Projected GANs Converge FasterComments: To appear in NeurIPS 2021. Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [75] arXiv:2111.01024 [pdf, other]
-
Title: With a Little Help from my Temporal Context: Multimodal Egocentric Action RecognitionComments: Accepted at BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [76] arXiv:2111.01026 [pdf, other]
-
Title: Introspective Distillation for Robust Question AnsweringComments: Accepted by NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [77] arXiv:2111.01029 [pdf, other]
-
Title: Render In-between: Motion Guided Video Synthesis for Action InterpolationComments: 12 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [78] arXiv:2111.01035 [pdf, other]
-
Title: A Unified View of cGANs with and without ClassifiersComments: Accepted by NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [79] arXiv:2111.01048 [pdf, other]
-
Title: MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image ManipulationAuthors: Safa C. Medin, Bernhard Egger, Anoop Cherian, Ye Wang, Joshua B. Tenenbaum, Xiaoming Liu, Tim K. MarksSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
- [80] arXiv:2111.01082 [pdf, other]
-
Title: FaceScape: 3D Facial Dataset and Benchmark for Single-View 3D Face ReconstructionAuthors: Hao Zhu, Haotian Yang, Longwei Guo, Yidi Zhang, Yanru Wang, Mingkai Huang, Menghua Wu, Qiu Shen, Ruigang Yang, Xun CaoComments: Accepted to T-PAMI 2023; Extension of FaceScape(CVPR 2020); Code & data are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [81] arXiv:2111.01105 [pdf, ps, other]
-
Title: FREGAN : an application of generative adversarial networks in enhancing the frame rate of videosSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [82] arXiv:2111.01118 [pdf, other]
-
Title: Rebooting ACGAN: Auxiliary Classifier GANs with Stable TrainingComments: 34 pages, 26 figures, 35th Conference on Neural Information Processing Systems (NeurIPS 2021)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [83] arXiv:2111.01124 [pdf, other]
-
Title: When Does Contrastive Learning Preserve Adversarial Robustness from Pretraining to Finetuning?Comments: NeurIPS 2021. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [84] arXiv:2111.01215 [pdf, other]
-
Title: Gradient Frequency Modulation for Visually Explaining Video Understanding ModelsComments: Accepted by BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [85] arXiv:2111.01236 [pdf, other]
-
Title: Multi-Scale High-Resolution Vision Transformer for Semantic SegmentationAuthors: Jiaqi Gu, Hyoukjun Kwon, Dilin Wang, Wei Ye, Meng Li, Yu-Hsin Chen, Liangzhen Lai, Vikas Chandra, David Z. PanComments: 8 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [86] arXiv:2111.01253 [pdf, other]
-
Title: Neural Scene Flow PriorComments: accepted by NeurIPS 2021 as "spotlight"Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [87] arXiv:2111.01261 [pdf, other]
-
Title: Joint Detection of Motion Boundaries and OcclusionsJournal-ref: The British Machine Vision Conference (BMVC), 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [88] arXiv:2111.01300 [pdf, other]
-
Title: Masking Modalities for Cross-modal Video RetrievalComments: Accepted at WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [89] arXiv:2111.01323 [pdf, other]
-
Title: Exploring the Semi-supervised Video Object Segmentation Problem from a Cyclic PerspectiveComments: modified version to appear in IJCV. arXiv admin note: substantial text overlap with arXiv:2010.12176Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [90] arXiv:2111.01325 [pdf, other]
-
Title: Attribute-Based Deep Periocular Recognition: Leveraging Soft Biometrics to Improve Periocular RecognitionComments: Accepted to be published in WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [91] arXiv:2111.01353 [pdf, other]
-
Title: Can Vision Transformers Perform Convolution?Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [92] arXiv:2111.01396 [pdf, other]
-
Title: Boundary Distribution Estimation for Precise Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [93] arXiv:2111.01418 [pdf, other]
-
Title: A Pixel-Level Meta-Learner for Weakly Supervised Few-Shot Semantic SegmentationComments: Accepted to WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [94] arXiv:2111.01440 [pdf, other]
-
Title: HHP-Net: A light Heteroscedastic neural network for Head Pose estimation with uncertaintyComments: Accepted at WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [95] arXiv:2111.01590 [pdf, other]
-
Title: Detect-and-Segment: a Deep Learning Approach to Automate Wound Image SegmentationAuthors: Gaetano Scebba, Jia Zhang, Sabrina Catanzaro, Carina Mihai, Oliver Distler, Martin Berli, Walter KarlenSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [96] arXiv:2111.01591 [pdf, other]
-
Title: Estimating 3D Motion and Forces of Human-Object Interactions from Internet VideosComments: arXiv admin note: substantial text overlap with arXiv:1904.02683Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [97] arXiv:2111.01604 [pdf, ps, other]
-
Title: A Critical Study on the Recent Deep Learning Based Semi-Supervised Video Anomaly Detection MethodsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [98] arXiv:2111.01606 [pdf, other]
-
Title: PolyTrack: Tracking with Bounding PolygonsComments: NeurIPS 2021 Machine Learning for Autonomous Driving WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [99] arXiv:2111.01619 [pdf, other]
-
Title: StyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGANSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [100] arXiv:2111.01623 [pdf, other]
-
Title: A Tri-attention Fusion Guided Multi-modal Segmentation NetworkComments: 33 pages, 11 figures, accepted by Pattern Recognition on 01 November 2021. arXiv admin note: substantial text overlap with arXiv:2102.03111Journal-ref: Pattern Recognition 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [101] arXiv:2111.01628 [pdf, other]
-
Title: Human Attention in Fine-grained ClassificationComments: 19 pages, 9 figuresJournal-ref: British Machine Vision Conference (BMVC) 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [102] arXiv:2111.01673 [pdf, other]
-
Title: Relational Self-Attention: What's Missing in Attention for Video UnderstandingComments: Accepted to NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [103] arXiv:2111.01677 [pdf, other]
-
Title: Top1 Solution of QQ Browser 2021 Ai Algorithm Competition Track 1 : Multimodal Video SimilaritySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [104] arXiv:2111.01681 [pdf, ps, other]
-
Title: Saliency detection with moving camera via background model completionSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [105] arXiv:2111.01683 [pdf, other]
-
Title: Using Synthetic Images To Uncover Population Biases In Facial Landmarks DetectionComments: to be published in DCAI workshop / NEURIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [106] arXiv:2111.01684 [pdf, other]
-
Title: Rethinking the Knowledge Distillation From the Perspective of Model CalibrationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [107] arXiv:2111.01715 [pdf, ps, other]
-
Title: Absolute distance prediction based on deep learning object detection and monocular depth estimation modelsAuthors: Armin Masoumian, David G. F. Marei, Saddam Abdulwahab, Julian Cristiano, Domenec Puig, Hatem A. RashwanComments: 10 pages, Submitted to 23rd International Conference of the Catalan Association for Artificial Intelligence (CCIA 2021)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [108] arXiv:2111.01717 [pdf, other]
-
Title: MixFace: Improving Face Verification Focusing on Fine-grained ConditionsComments: 9 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [109] arXiv:2111.01723 [pdf, other]
-
Title: CPSeg: Cluster-free Panoptic Segmentation of 3D LiDAR Point CloudsComments: Accepted at ICRA 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [110] arXiv:2111.01740 [pdf, other]
-
Title: Personalized One-Shot Lipreading for an ALS PatientJournal-ref: BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [111] arXiv:2111.01785 [pdf, other]
-
Title: PatchGame: Learning to Signal Mid-level Patches in Referential GamesAuthors: Kamal Gupta, Gowthami Somepalli, Anubhav Gupta, Vinoj Jayasundara, Matthias Zwicker, Abhinav ShrivastavaComments: To appear at NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [112] arXiv:2111.01884 [pdf, other]
-
Title: Body Size and Depth Disambiguation in Multi-Person Reconstruction from Single ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [113] arXiv:2111.01888 [pdf, ps, other]
-
Title: A dataset for multi-sensor drone detectionComments: Published at Elsevier Data in Brief journal. arXiv admin note: text overlap with arXiv:2007.07396Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [114] arXiv:2111.01898 [pdf, ps, other]
-
Title: A high performance fingerprint liveness detection method based on quality related featuresComments: Published at Elsevier Future Generation Computer Systems journalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [115] arXiv:2111.01930 [pdf, other]
-
Title: Deep learning for identification and face, gender, expression recognition under constraintsAuthors: Ahmad B. Hassanat, Abeer Albustanji, Ahmad S. Tarawneh, Malek Alrashidi, Hani Alharbi, Mohammed Alanazi, Mansoor Alghamdi, Ibrahim S Alkhazi, V. B. Surya PrasathComments: Submitted to International Journal of BiometricsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [116] arXiv:2111.01936 [pdf, other]
-
Title: Revisiting spatio-temporal layouts for compositional action recognitionComments: Published in BMVC 2021 (Oral)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [117] arXiv:2111.01965 [pdf, ps, other]
-
Title: Adversarially Perturbed Wavelet-based Morphed Face GenerationAuthors: Kelsey O'Haire, Sobhan Soleymani, Baaria Chaudhary, Poorya Aghdaie, Jeremy Dawson, Nasser M. NasrabadiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [118] arXiv:2111.02018 [pdf, other]
-
Title: Multi-Glimpse Network: A Robust and Efficient Classification Architecture based on Recurrent Downsampled AttentionComments: Accepted at BMVC 2021Journal-ref: The British Machine Vision Conference (BMVC) 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [119] arXiv:2111.02042 [pdf, other]
-
Title: Recent Advancements in Self-Supervised Paradigms for Visual Feature RepresentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [120] arXiv:2111.02045 [pdf, other]
-
Title: Deep Point Set Resampling via Gradient FieldsComments: arXiv admin note: text overlap with arXiv:2107.10981Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [121] arXiv:2111.02058 [pdf, ps, other]
-
Title: Rethinking the Image Feature Biases Exhibited by Deep CNN ModelsComments: 15 pages, 15 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [122] arXiv:2111.02061 [pdf, other]
-
Title: Deep-Learning-Based Single-Image Height Reconstruction from Very-High-Resolution SAR Intensity DataComments: 19 pages, 14 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [123] arXiv:2111.02064 [pdf, ps, other]
-
Title: Event and Activity Recognition in Video Surveillance for Cyber-Physical SystemsComments: This is a preprint of the chapter:S Bhaumik, P Jana, PP Mohanta, Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems, published in Emergence of Cyber Physical System.., edited by KK Singh et al, 2021, Springer reproduced with permission of Springer Nature Switzerland AG. The final authenticated version is available online at this http URLJournal-ref: Emergence of Cyber Physical System and IoT in Smart Automation and Robotics. Springer, Cham, 2021. 51-68Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [124] arXiv:2111.02073 [pdf, other]
-
Title: Dual Progressive Prototype Network for Generalized Zero-Shot LearningComments: Accepted by NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [125] arXiv:2111.02078 [pdf, ps, other]
-
Title: FaceQvec: Vector Quality Assessment for Face Biometrics based on ISO ComplianceAuthors: Javier Hernandez-Ortega, Julian Fierrez, Luis F. Gomez, Aythami Morales, Jose Luis Gonzalez-de-Suso, Francisco Zamora-MartinezSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [126] arXiv:2111.02079 [pdf, ps, other]
-
Title: Influence of image noise on crack detection performance of deep convolutional neural networksComments: 8 pages, 16 figures, 4 tablesJournal-ref: 10th International Conference on Structural Health Monitoring of Intelligent Infrastructure, SHMII 10, 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [127] arXiv:2111.02114 [pdf, other]
-
Title: LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text PairsAuthors: Christoph Schuhmann, Richard Vencu, Romain Beaumont, Robert Kaczmarczyk, Clayton Mullis, Aarush Katta, Theo Coombes, Jenia Jitsev, Aran KomatsuzakiComments: Short version. Accepted at Data Centric AI NeurIPS Workshop 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [128] arXiv:2111.02135 [pdf, other]
-
Title: Efficient 3D Deep LiDAR OdometryComments: 17 pages, 13 figures. Accepted by PAMI 2022. arXiv admin note: substantial text overlap with arXiv:2012.00972Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [129] arXiv:2111.02139 [pdf, other]
-
Title: An Entropy-guided Reinforced Partial Convolutional Network for Zero-Shot LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [130] arXiv:2111.02144 [pdf, other]
-
Title: Beyond PRNU: Learning Robust Device-Specific Fingerprint for Source Camera IdentificationComments: 11 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Image and Video Processing (eess.IV)
- [131] arXiv:2111.02172 [pdf, other]
-
Title: A cross-modal fusion network based on self-attention and residual structure for multimodal emotion recognitionComments: 5 pages, 1 figure, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
- [132] arXiv:2111.02175 [pdf, other]
-
Title: Discriminator Synthesis: On reusing the other half of Generative Adversarial NetworksAuthors: Diego PorresComments: 7 pages, 4 figures, NeurIPS Workshop on Machine Learning for Creativity and Design 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [133] arXiv:2111.02273 [pdf, other]
-
Title: Multi-Cue Adaptive Emotion Recognition NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
- [134] arXiv:2111.02322 [pdf, ps, other]
-
Title: A Comparison of Deep Learning Models for the Prediction of Hand Hygiene VideosAuthors: Rashmi BakshiComments: arXiv admin note: substantial text overlap with arXiv:2110.02842Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [135] arXiv:2111.02331 [pdf, other]
-
Title: LTD: Low Temperature Distillation for Robust Adversarial TrainingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [136] arXiv:2111.02333 [pdf, other]
-
Title: HS3: Learning with Proper Task Complexity in Hierarchically Supervised Semantic SegmentationComments: Accepted to BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [137] arXiv:2111.02358 [pdf, other]
-
Title: VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-ExpertsAuthors: Hangbo Bao, Wenhui Wang, Li Dong, Qiang Liu, Owais Khan Mohammed, Kriti Aggarwal, Subhojit Som, Furu WeiComments: Work in progressSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [138] arXiv:2111.02360 [pdf, other]
-
Title: Subpixel Heatmap Regression for Facial Landmark LocalizationComments: Accepted at BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [139] arXiv:2111.02368 [pdf, other]
-
Title: Video Salient Object Detection via Contrastive Features and Attention ModulesComments: Accepted in WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [140] arXiv:2111.02387 [pdf, other]
-
Title: An Empirical Study of Training End-to-End Vision-and-Language TransformersAuthors: Zi-Yi Dou, Yichong Xu, Zhe Gan, Jianfeng Wang, Shuohang Wang, Lijuan Wang, Chenguang Zhu, Pengchuan Zhang, Lu Yuan, Nanyun Peng, Zicheng Liu, Michael ZengComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [141] arXiv:2111.02394 [pdf, other]
-
Title: FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel RepresentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [142] arXiv:2111.02444 [pdf, other]
-
Title: Panoptic 3D Scene Reconstruction From a Single RGB ImageSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [143] arXiv:2111.02447 [pdf, other]
-
Title: On the Frequency Bias of Generative ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [144] arXiv:2111.02450 [pdf, other]
-
Title: Unified 3D Mesh Recovery of Humans and Animals by Learning Animal ExerciseComments: BMVC 2021, 10 pages excluding reference pageSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [145] arXiv:2111.02500 [pdf, other]
-
Title: Improving Pose Estimation through Contextual Activity FusionComments: 8 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [146] arXiv:2111.02521 [pdf, other]
-
Title: Sequence-to-Sequence Modeling for Action Identification at High Temporal ResolutionAuthors: Aakash Kaku, Kangning Liu, Avinash Parnandi, Haresh Rengaraj Rajamohan, Kannan Venkataramanan, Anita Venkatesan, Audre Wirtanen, Natasha Pandit, Heidi Schambra, Carlos Fernandez-GrandaComments: Under review as a conference paper at ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [147] arXiv:2111.02548 [pdf, other]
-
Title: Understanding Cross Domain Presentation Attack Detection for Visible Face RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [148] arXiv:2111.02586 [pdf, other]
-
Title: Building Damage Mapping with Self-PositiveUnlabeled LearningComments: 7 pages, 1 figure, Artificial Intelligence for Humanitarian Assistance and Disaster Response Workshop, NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [149] arXiv:2111.02651 [pdf, other]
-
Title: Temporal Fusion Based Mutli-scale Semantic Segmentation for Detecting Concealed Baggage ThreatsComments: Accepted in IEEE SMC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [150] arXiv:2111.02668 [pdf, other]
-
Title: LVIS Challenge Track Technical Report 1st Place Solution: Distribution Balanced and Boundary Refinement for Large Vocabulary Instance SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [151] arXiv:2111.02679 [pdf, other]
-
Title: MixSiam: A Mixture-based Approach to Self-supervised Representation LearningComments: 9 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [152] arXiv:2111.02682 [pdf, other]
-
Title: TimeMatch: Unsupervised Cross-Region Adaptation by Temporal Shift EstimationJournal-ref: ISPRS Journal of Photogrammetry and Remote Sensing, Volume 188, June 2022, Pages 301-313Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [153] arXiv:2111.02703 [pdf, ps, other]
-
Title: Towards Smart Monitored AM: Open Source in-Situ Layer-wise 3D Printing Image Anomaly Detection Using Histograms of Oriented Gradients and a Physics-Based Rendering EngineSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [154] arXiv:2111.02717 [pdf, ps, other]
-
Title: Facial Emotion Recognition using Deep Residual Networks in Real-World EnvironmentsSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [155] arXiv:2111.02724 [pdf, ps, other]
-
Title: Tea Chrysanthemum Detection under Unstructured Environments Using the TC-YOLO ModelAuthors: Chao Qi (1), Junfeng Gao (2), Simon Pearson (2), Helen Harman (2), Kunjie Chen (1), Lei Shu (1) ((1) Nanjing Agricultural University, (2) University of Lincoln)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [156] arXiv:2111.02741 [pdf, other]
-
Title: Multi-scale 2D Representation Learning for weakly-supervised moment retrievalComments: 8 pages, 4 figuers. Accepted for publication in 2020 25th International Conference on Pattern Recognition (ICPR)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [157] arXiv:2111.02751 [pdf, other]
-
Title: FEAFA+: An Extended Well-Annotated Dataset for Facial Expression Analysis and 3D Facial AnimationJournal-ref: Proc. SPIE 12342, Fourteenth International Conference on Digital Image Processing (ICDIP 2022), 1234211 (12 October 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [158] arXiv:2111.02757 [pdf, other]
-
Title: Online Continual Learning via Multiple Deep Metric Learning and Uncertainty-guided Episodic Memory Replay -- 3rd Place Solution for ICCV 2021 Workshop SSLAD Track 3A Continual Object ClassificationComments: 6 pages, 2 figures, 3 algorithms, 1 tableSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [159] arXiv:2111.02847 [pdf, ps, other]
-
Title: Stable and Compact Face Recognition via Unlabeled Data Driven Sparse Representation-Based ClassificationComments: 43 pages, 10 figures, 3 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [160] arXiv:2111.02901 [pdf, other]
-
Title: Certainty Volume Prediction for Unsupervised Domain AdaptationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [161] arXiv:2111.03039 [pdf, other]
-
Title: Towards Panoptic 3D Parsing for Single Image in the WildSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [162] arXiv:2111.03042 [pdf, other]
-
Title: Unsupervised Learning of Compositional Energy ConceptsComments: NeurIPS 2021, website and code at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [163] arXiv:2111.03056 [pdf, other]
-
Title: Bootstrap Your Object Detector via Mixed TrainingJournal-ref: NeurIPS 2021, SpotlightSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [164] arXiv:2111.03098 [pdf, other]
-
Title: Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single ImageComments: NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [165] arXiv:2111.03106 [pdf, other]
-
Title: Skeleton-Split Framework using Spatial Temporal Graph Convolutional Networks for Action RecogntionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [166] arXiv:2111.03129 [pdf, other]
-
Title: Attention on Classification for Fire SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [167] arXiv:2111.03133 [pdf, other]
-
Title: StyleCLIPDraw: Coupling Content and Style in Text-to-Drawing SynthesisComments: Superseded by arXiv:2202.12362Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [168] arXiv:2111.03186 [pdf, other]
-
Title: EditGAN: High-Precision Semantic Image EditingSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [169] arXiv:2111.03195 [pdf, other]
-
Title: Addressing Multiple Salient Object Detection via Dual-Space Long-Range DependenciesComments: 10 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [170] arXiv:2111.03216 [pdf, other]
-
Title: Fast Camouflaged Object Detection via Edge-based Reversible Re-calibration NetworkComments: 35 pages, 7 figures, 5 tables (Accepted by Pattern Recognition 2022)Journal-ref: Pattern Recognition 123 (2022): 108414Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [171] arXiv:2111.03225 [pdf, other]
-
Title: Technical Report: Disentangled Action Parsing Networks for Accurate Part-level Action ParsingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [172] arXiv:2111.03260 [pdf, ps, other]
-
Title: Remote Sensing Image Super-resolution and Object Detection: Benchmark and State of the ArtAuthors: Yi Wang, Syed Muhammad Arsalan Bashir, Mahrukh Khan, Qudrat Ullah, Rui Wang, Yilin Song, Zhe Guo, Yilong NiuComments: 39 pages, 15 figures, 5 tables. Submitted to Elsevier journal for reviewJournal-ref: Expert Systems with Applications, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [173] arXiv:2111.03281 [pdf, other]
-
Title: Recognizing Vector Graphics without RasterizationComments: Accepted by NeurIPS2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [174] arXiv:2111.03286 [pdf, other]
-
Title: FBNet: Feature Balance Network for Urban-Scene SegmentationComments: Tech ReportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [175] arXiv:2111.03319 [pdf, other]
-
Title: KORSAL: Key-point Detection based Online Real-Time Spatio-Temporal Action LocalizationAuthors: Kalana Abeywardena, Shechem Sumanthiran, Sakuna Jayasundara, Sachira Karunasena, Ranga Rodrigo, Peshala JayasekaraComments: 7 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [176] arXiv:2111.03349 [pdf, other]
-
Title: Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text RetrievalSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR)
- [177] arXiv:2111.03384 [pdf, other]
-
Title: Seamless Satellite-image SynthesisComments: 12 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [178] arXiv:2111.03388 [pdf, other]
-
Title: A Deep Learning Generative Model Approach for Image Synthesis of Plant LeavesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [179] arXiv:2111.03392 [pdf, other]
-
Title: SSA: Semantic Structure Aware Inference for Weakly Pixel-Wise Dense Predictions without CostSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [180] arXiv:2111.03408 [pdf, other]
-
Title: MSC-VO: Exploiting Manhattan and Structural Constraints for Visual OdometryComments: Submitted to RAL + ICRA 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [181] arXiv:2111.03414 [pdf, other]
-
Title: Structure-aware Image Inpainting with Two Parallel StreamsComments: 9 pages, 8 figures, rejected by IJCAI 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [182] arXiv:2111.03420 [pdf, other]
-
Title: Sampling Equivariant Self-attention Networks for Object Detection in Aerial ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [183] arXiv:2111.03421 [pdf, other]
-
Title: Solving Traffic4Cast Competition with U-Net and Temporal Domain AdaptationComments: Conference on Neural Information Processing Systems (NeurIPS 2021) Traffic4cast CompetitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [184] arXiv:2111.03443 [pdf, ps, other]
-
Title: Nondestructive Testing of Composite Fibre Materials with Hyperspectral Imaging : Evaluative Studies in the EU H2020 FibreEUse ProjectAuthors: Yijun Yan, Jinchang Ren, Huan Zhao, James F.C. Windmill, Winifred Ijomah, Jesper de Wit, Justus von FreedenComments: 11 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [185] arXiv:2111.03480 [pdf, other]
-
Title: DriveGuard: Robustification of Automated Driving Systems with Deep Spatio-Temporal Convolutional AutoencoderComments: 2021 IEEE Winter Conference on Applications of Computer Vision Workshops (WACVW)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [186] arXiv:2111.03481 [pdf, other]
-
Title: Improving Visual Quality of Image Synthesis by A Token-based Generator with TransformersComments: NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [187] arXiv:2111.03483 [pdf, other]
-
Title: Event-based Motion Segmentation by Cascaded Two-Level Multi-Model FittingComments: Accepted for presentation at the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2021)Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [188] arXiv:2111.03505 [pdf, other]
-
Title: Visualizing the Emergence of Intermediate Visual Patterns in DNNsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [189] arXiv:2111.03522 [pdf, other]
-
Title: Semantically Consistent Image-to-Image Translation for Unsupervised Domain AdaptationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [190] arXiv:2111.03549 [pdf, other]
-
Title: Interpreting Representation Quality of DNNs for 3D Point Cloud ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [191] arXiv:2111.03552 [pdf, other]
-
Title: SmartDepthSync: Open Source Synchronized Video Recording System of Smartphone RGB and Depth Camera Range Image Frames with Sub-millisecond PrecisionAuthors: Marsel Faizullin, Anastasiia Kornilova, Azat Akhmetyanov, Konstantin Pakulev, Andrey Sadkov, Gonzalo FerrerComments: IEEE Sensors Journal paperSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [192] arXiv:2111.03574 [pdf, other]
-
Title: Spatial-Temporal Residual Aggregation for High Resolution Video InpaintingComments: Accepted by BMVC 2021. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [193] arXiv:2111.03580 [pdf, other]
-
Title: AGPCNet: Attention-Guided Pyramid Context Networks for Infrared Small Target DetectionComments: 12 pages, 13 figures, 8 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [194] arXiv:2111.03605 [pdf, other]
-
Title: Edge Tracing using Gaussian Process RegressionComments: 15 pages, 6 figures. Accepted to be published in IEEE Transactions on Image Processing. Github repository: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [195] arXiv:2111.03615 [pdf, other]
-
Title: Single Image Deraining Network with Rain Embedding Consistency and Layered LSTMComments: Accepted by WACV2022, January 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [196] arXiv:2111.03635 [pdf, other]
-
Title: BBC-Oxford British Sign Language DatasetAuthors: Samuel Albanie, Gül Varol, Liliane Momeni, Hannah Bull, Triantafyllos Afouras, Himel Chowdhury, Neil Fox, Bencie Woll, Rob Cooper, Andrew McParland, Andrew ZissermanSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [197] arXiv:2111.03643 [pdf, other]
-
Title: TermiNeRF: Ray Termination Prediction for Efficient Neural RenderingComments: 3DV 2021; Project page with videos: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [198] arXiv:2111.03649 [pdf, other]
-
Title: Normalizing Flow as a Flexible Fidelity Objective for Photo-Realistic Super-resolutionJournal-ref: WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [199] arXiv:2111.03651 [pdf, other]
-
Title: The Curious Layperson: Fine-Grained Image Recognition without Expert LabelsComments: To appear in BMVC 2021 (Oral). Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [200] arXiv:2111.03690 [pdf, ps, other]
-
Title: Do we still need ImageNet pre-training in remote sensing scene classification?Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [201] arXiv:2111.03693 [pdf, other]
-
Title: Disaster mapping from satellites: damage detection with crowdsourced point labelsComments: 3rd Workshop on Artificial Intelligence for Humanitarian Assistance and Disaster Response at NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [202] arXiv:2111.03789 [pdf, other]
-
Title: Generation of microbial colonies dataset with deep learning style transferComments: 13 pages, 9 figures, 2 tablesJournal-ref: Scientific Reports 12, 5212 (2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
- [203] arXiv:2111.03819 [pdf, other]
-
Title: Will You Ever Become Popular? Learning to Predict Virality of Dance ClipsComments: Accepted by TOMMSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [204] arXiv:2111.03821 [pdf, ps, other]
-
Title: ROFT: Real-Time Optical Flow-Aided 6D Object Pose and Velocity TrackingComments: To cite this work, please refer to the journal reference entry. For more information, code, pictures and video please visit this https URLJournal-ref: IEEE Robotics and Automation Letters Volume 7, Issue 1, Jan. 2022, pp 159-166Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [205] arXiv:2111.03824 [pdf, other]
-
Title: Neural Implicit Event Generator for Motion TrackingComments: Submitted to ICRA 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [206] arXiv:2111.03845 [pdf, other]
-
Title: Multi-modal land cover mapping of remote sensing images using pyramid attention and gated fusion networksComments: 24 pages, 11 figures, submitted to IJRSSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [207] arXiv:2111.03861 [pdf, other]
-
Title: What augmentations are sensitive to hyper-parameters and why?Comments: 10 pages, 17 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [208] arXiv:2111.03874 [pdf, other]
-
Title: Towards Calibrated Model for Long-Tailed Visual Recognition from Prior PerspectiveComments: Accepted at NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [209] arXiv:2111.03882 [pdf, other]
-
Title: Action Recognition using Transfer Learning and Majority Voting for CSGOSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [210] arXiv:2111.03911 [pdf, other]
-
Title: Domain Attention Consistency for Multi-Source Domain AdaptationComments: Accepted to BMVC 2021 as oral presentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [211] arXiv:2111.03930 [pdf, other]
-
Title: Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language ModelingAuthors: Renrui Zhang, Rongyao Fang, Wei Zhang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng LiComments: preprintsSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [212] arXiv:2111.03940 [pdf, ps, other]
-
Title: Convolutional Gated MLP: Combining Convolutions & gMLPComments: ConferenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [213] arXiv:2111.03952 [pdf, other]
-
Title: CALText: Contextual Attention Localization for Offline Handwritten TextComments: 25 pages, 15 figures and 6 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [214] arXiv:2111.03993 [pdf, other]
-
Title: Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [215] arXiv:2111.04012 [pdf, ps, other]
-
Title: A-PixelHop: A Green, Robust and Explainable Fake-Image DetectorSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [216] arXiv:2111.04017 [pdf, other]
-
Title: Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online AdaptationComments: 14 pages, 13 figures; code repositoty: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [217] arXiv:2111.04026 [pdf, other]
-
Title: SL-CycleGAN: Blind Motion Deblurring in Cycles using Sparse LearningComments: 12 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [218] arXiv:2111.04028 [pdf, other]
-
Title: Style Transfer with Target Feature Palette and Attention ColoringSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [219] arXiv:2111.04053 [pdf, other]
-
Title: Registration Techniques for Deformable ObjectsAuthors: Alireza AhmadiSubjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
- [220] arXiv:2111.04060 [pdf, other]
-
Title: Are we ready for a new paradigm shift? A Survey on Visual Deep MLPComments: With the development of MLP, the survey has been updated to the latest version in AprilSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [221] arXiv:2111.04076 [pdf, other]
-
Title: Direct Multi-view Multi-person 3D Pose EstimationComments: NeurIPS-2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [222] arXiv:2111.04080 [pdf, other]
-
Title: Cross-modal Zero-shot Hashing by Label Attributes EmbeddingComments: 7 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
- [223] arXiv:2111.04123 [pdf, other]
-
Title: NeurInt : Learning to Interpolate through Neural ODEsComments: Accepted (Spotlight paper) at the NeurIPS 2021 Workshop on the Symbiosis of Deep Learning and Differential Equations (DLDE)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [224] arXiv:2111.04129 [pdf, other]
-
Title: Global-Local Attention for Emotion RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [225] arXiv:2111.04138 [pdf, other]
-
Title: Look at the Variance! Efficient Black-box Explanations with Sobol-based Sensitivity AnalysisComments: NeurIPS2021Journal-ref: Conference on Neural Information Processing Systems (NeurIPS), Dec 2022, Sydney, AustraliaSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [226] arXiv:2111.04204 [pdf, other]
-
Title: Natural Adversarial ObjectsJournal-ref: Advances in Neural Information Processing Systems Data Centric AI workshop 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [227] arXiv:2111.04226 [pdf, ps, other]
-
Title: Rethinking Deconvolution for 2D Human Pose Estimation Light yet Accurate Model for Real-time Edge ComputingComments: IEEE International Conference on Automatic Face and Gesture Recognition 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [228] arXiv:2111.04228 [pdf, other]
-
Title: Practical, Fast and Robust Point Cloud Registration for 3D Scene Stitching and Object LocalizationAuthors: Lei SunSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [229] arXiv:2111.04230 [pdf, other]
-
Title: A Study of the Human Perception of Synthetic FacesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [230] arXiv:2111.04237 [pdf, other]
-
Title: Template NeRF: Towards Modeling Dense Shape Correspondences from Category-Specific Object ImagesComments: 10 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [231] arXiv:2111.04264 [pdf, other]
-
Title: Cross-Modal Object Tracking: Modality-Aware Representations and A Unified BenchmarkComments: In SubmissionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [232] arXiv:2111.04266 [pdf, other]
- [233] arXiv:2111.04276 [pdf, other]
-
Title: Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape SynthesisSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [234] arXiv:2111.04310 [pdf, other]
-
Title: Residual-Guided Learning Representation for Self-Supervised Monocular Depth EstimationComments: 5 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [235] arXiv:2111.04316 [pdf, other]
-
Title: SEGA: Semantic Guided Attention on Visual Prototype for Few-Shot LearningComments: 11 pages, 7 figures, 4 tables. Accepted by WACV2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [236] arXiv:2111.04321 [pdf, other]
-
Title: Towards Debiasing Temporal Sentence Grounding in VideoComments: 13 pages, 6 figures, 11 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [237] arXiv:2111.04331 [pdf, other]
-
Title: Enhancing Prototypical Few-Shot Learning by Leveraging the Local-Level StrategyComments: 5 pages, 4 figures, submitted to ICASSP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [238] arXiv:2111.04336 [pdf, other]
-
Title: Partial Attack Supervision and Regional Weighted Inference for Masked Face Presentation Attack DetectionComments: Accepted at the 16th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [239] arXiv:2111.04352 [pdf, other]
-
Title: Grassmannian learning mutual subspace method for image set recognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [240] arXiv:2111.04371 [pdf, other]
-
Title: Geometrically Adaptive Dictionary Attack on Face RecognitionComments: Accepted at WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [241] arXiv:2111.04397 [pdf, other]
-
Title: GROWL: Group Detection With Link PredictionSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [242] arXiv:2111.04426 [pdf, other]
-
Title: 3D Siamese Voxel-to-BEV Tracker for Sparse Point CloudsComments: Accepted by NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [243] arXiv:2111.04506 [pdf, other]
-
Title: Self-Supervised Intrinsic Image Decomposition Network Considering Reflectance ConsistencySubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [244] arXiv:2111.04554 [pdf, other]
-
Title: Tensor-based Subspace Factorization for StyleGANComments: Accepted for FG2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [245] arXiv:2111.04647 [pdf, other]
-
Title: Composition and Style Attributes Guided Image Aesthetic AssessmentJournal-ref: IEEE Transactions on Image Processing, 31 (2022) 5009-5024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [246] arXiv:2111.04673 [pdf, other]
-
Title: Information-Theoretic Bias Assessment Of Learned Representations Of Pretrained Face RecognitionComments: IEEE International Conference on Automatic Face and Gesture Recognition 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [247] arXiv:2111.04731 [pdf, other]
-
Title: Survey of Deep Learning Methods for Inverse ProblemsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [248] arXiv:2111.04780 [pdf, other]
-
Title: Frustum Fusion: Pseudo-LiDAR and LiDAR Fusion for 3D DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [249] arXiv:2111.04785 [pdf, other]
-
Title: Visual Question Answering based on Formal LogicSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [250] arXiv:2111.04839 [pdf, other]
-
Title: Evolving Evocative 2D Views of Generated 3D ObjectsAuthors: Eric ChuJournal-ref: NeurIPS 2021 Workshop on Machine Learning for Creativity and DesignSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [251] arXiv:2111.04845 [pdf, other]
-
Title: Hybrid BYOL-ViT: Efficient approach to deal with small datasetsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [252] arXiv:2111.04862 [pdf, other]
-
Title: Explaining Face Presentation Attack Detection Using Natural LanguageAuthors: Hengameh Mirzaalian, Mohamed E. Hussein, Leonidas Spinoulas, Jonathan May, Wael Abd-AlmageedComments: To Appear in the Proceedings of the IEEE International Conference on Automatic Face and Gesture Recognition 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
- [253] arXiv:2111.04875 [pdf, other]
-
Title: LiMoSeg: Real-time Bird's Eye View based LiDAR Motion SegmentationAuthors: Sambit Mohapatra, Mona Hodaei, Senthil Yogamani, Stefan Milz, Heinrich Gotzig, Martin Simon, Hazem Rashed, Patrick MaederComments: Accepted for Presentation at International Conference on Computer Vision Theory and Applications (VISAPP 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [254] arXiv:2111.04927 [pdf, other]
-
Title: Self-Interpretable Model with TransformationEquivariant InterpretationComments: Accepted by NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [255] arXiv:2111.04928 [pdf, other]
-
Title: SAFA: Structure Aware Face AnimationComments: Accepted at 3DV2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [256] arXiv:2111.04945 [pdf, ps, other]
-
Title: PREMA: Part-based REcurrent Multi-view Aggregation Network for 3D Shape RetrievalComments: Accepted by ICCSMT 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [257] arXiv:2111.04946 [pdf, other]
-
Title: Graph-Based Depth Denoising & Dequantization for Point Cloud EnhancementComments: 16 pages,14 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [258] arXiv:2111.04982 [pdf, other]
-
Title: Dual Prototypical Contrastive Learning for Few-shot Semantic SegmentationComments: 8 pages, 7 figures, this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [259] arXiv:2111.04987 [pdf, other]
-
Title: Video Text Tracking With a Spatio-Temporal Complementary ModelComments: update Fig.7, in the third row of part (c), the second and third frame is wrong and we update the right picturesJournal-ref: [J]. IEEE Transactions on Image Processing, 2021, 30: 9321-9331Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [260] arXiv:2111.04993 [pdf, other]
-
Title: Incremental Meta-Learning via Episodic Replay Distillation for Few-Shot Image RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [261] arXiv:2111.05059 [pdf, other]
-
Title: MMD-ReID: A Simple but Effective Solution for Visible-Thermal Person ReIDComments: Accepted in BMVC 2021 (Oral)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [262] arXiv:2111.05060 [pdf, other]
-
Title: View Birdification in the Crowd: Ground-Plane Localization from Perceived MovementsComments: Extended journal version of the original paper at BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [263] arXiv:2111.05066 [pdf, ps, other]
-
Title: Deep Convolution Network Based Emotion Analysis for Automatic Detection of Mild Cognitive Impairment in the ElderlyComments: 17 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [264] arXiv:2111.05080 [pdf, other]
-
Title: Residual Quantity in Percentage of Factory Machines Using Computer Vision and Mathematical MethodsComments: 4 pages, 13 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [265] arXiv:2111.05149 [pdf, other]
-
Title: Ethically aligned Deep Learning: Unbiased Facial Aesthetic PredictionComments: Peer reviewed and accepted at CEPE/IACAP 2021 as Extended AbstractSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [266] arXiv:2111.05170 [pdf, ps, other]
-
Title: Exploiting Robust Unsupervised Video Person Re-identificationComments: Preprint version; Accepted by IET Image ProcessingJournal-ref: IET Image Processing 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [267] arXiv:2111.05191 [pdf, other]
-
Title: Does Thermal data make the detection systems more reliable?Comments: Accepted at NeurIPS 2021 - ML4AD workshop (The code for this research is available at: this https URL)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [268] arXiv:2111.05222 [pdf, other]
-
Title: Cross Attentional Audio-Visual Fusion for Dimensional Emotion RecognitionComments: Accepted in FG2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
- [269] arXiv:2111.05283 [pdf, other]
-
Title: Unsupervised Spiking Instance Segmentation on Event Data using STDPComments: 20 Pages, 13 FiguresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [270] arXiv:2111.05297 [pdf, other]
-
Title: Sliced Recursive TransformerComments: ECCV 2022, 31 pages with Appendix. Code and models are available at this https URL (v3: update license and fix arxiv timestamp)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [271] arXiv:2111.05319 [pdf, other]
-
Title: Monocular Human Shape and Pose with Dense Mesh-borne Local Image FeaturesComments: FG 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [272] arXiv:2111.05328 [pdf, other]
-
Title: Data Augmentation Can Improve RobustnessAuthors: Sylvestre-Alvise Rebuffi, Sven Gowal, Dan A. Calian, Florian Stimberg, Olivia Wiles, Timothy MannComments: Accepted at NeurIPS 2021. arXiv admin note: substantial text overlap with arXiv:2103.01946; text overlap with arXiv:2110.09468Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [273] arXiv:2111.05329 [pdf, other]
-
Title: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal SynchronicityComments: Accepted in AAAI 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [274] arXiv:2111.05409 [pdf, other]
-
Title: Pipeline for 3D reconstruction of the human body from AR/VR headset mounted egocentric camerasComments: 11 pages, 12 figures and 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [275] arXiv:2111.05448 [pdf, other]
-
Title: Towards Active Vision for Action Localization with Reactive Control and Predictive LearningComments: To appear at WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [276] arXiv:2111.05464 [pdf, other]
-
Title: Are Transformers More Robust Than CNNs?Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [277] arXiv:2111.05468 [pdf, other]
-
Title: Sparse Adversarial Video Attacks with Spatial TransformationsComments: The short version of this work will appear in the BMVC 2021 conferenceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [278] arXiv:2111.05471 [pdf, ps, other]
-
Title: Analysis of PDE-based binarization model for degraded document imagesAuthors: Uche A. NnolimComments: 11 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [279] arXiv:2111.05476 [pdf, other]
-
Title: Learning to Disentangle Scenes for Person Re-identificationComments: Preprint Version; Accepted by Image and Vision ComputingJournal-ref: Image and Vision Computing 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [280] arXiv:2111.05483 [pdf, ps, other]
-
Title: Handwritten Digit Recognition Using Improved Bounding Box Recognition TechniqueComments: 41 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [281] arXiv:2111.05485 [pdf, other]
-
Title: A Structure Feature Algorithm for Multi-modal Forearm RegistrationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [282] arXiv:2111.05506 [pdf, ps, other]
-
Title: Automated Pulmonary Embolism Detection from CTPA Images Using an End-to-End Convolutional Neural NetworkComments: Accepted to MICCAI 2019Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [283] arXiv:2111.05526 [pdf, other]
-
Title: Space-Time Memory Network for Sounding Object Localization in VideosComments: Accepted to BMVC2021. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [284] arXiv:2111.05541 [pdf, ps, other]
-
Title: 3D modelling of survey scene from images enhanced with a multi-exposure fusionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [285] arXiv:2111.05547 [pdf, other]
-
Title: ICDAR 2021 Competition on Document VisualQuestion AnsweringSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [286] arXiv:2111.05548 [pdf, other]
-
Title: Deep Attention-guided Graph Clustering with Dual Self-supervisionComments: Accepted by IEEE TCSVTSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [287] arXiv:2111.05562 [pdf, ps, other]
-
Title: TomoSLAM: factor graph optimization for rotation angle refinement in microtomographySubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Robotics (cs.RO)
- [288] arXiv:2111.05610 [pdf, other]
-
Title: CLIP2TV: Align, Match and Distill for Video-Text RetrievalSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [289] arXiv:2111.05615 [pdf, other]
-
Title: Leveraging Geometry for Shape Estimation from a Single RGB ImageSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [290] arXiv:2111.05684 [pdf, other]
-
Title: Learning to ignore: rethinking attention in CNNsComments: accepted to BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [291] arXiv:2111.05688 [pdf, other]
-
Title: Robust reconstructions by multi-scale/irregular tangential coveringSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [292] arXiv:2111.05700 [pdf, other]
-
Title: Multi-Scale Single Image Dehazing Using Laplacian and Gaussian PyramidsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [293] arXiv:2111.05701 [pdf, other]
-
Title: Single image dehazing via combining the prior knowledge and CNNsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [294] arXiv:2111.05759 [pdf, other]
-
Title: Multimodal Transformer with Variable-length Memory for Vision-and-Language NavigationComments: ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [295] arXiv:2111.05826 [pdf, other]
-
Title: Palette: Image-to-Image Diffusion ModelsAuthors: Chitwan Saharia, William Chan, Huiwen Chang, Chris A. Lee, Jonathan Ho, Tim Salimans, David J. Fleet, Mohammad NorouziSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [296] arXiv:2111.05890 [pdf, ps, other]
-
Title: Multimodal End-to-End Group Emotion Recognition using Cross-Modal AttentionAuthors: Lev EvtodienkoSubjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [297] arXiv:2111.05901 [pdf, other]
-
Title: An Extensive Study of User Identification via Eye Movements across Multiple DatasetsComments: 11 pages, 5 figures, submitted to Signal Processing: Image CommunicationJournal-ref: Signal Processing: Image Communication, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [298] arXiv:2111.05916 [pdf, other]
-
Title: Dance In the Wild: Monocular Human Animation with Neural Dynamic Appearance SynthesisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [299] arXiv:2111.05943 [pdf, other]
-
Title: Self-Supervised Multi-Object Tracking with Cross-Input ConsistencyComments: NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [300] arXiv:2111.05956 [pdf, other]
-
Title: Feature Generation for Long-tail ClassificationComments: Accepted at ICVGIP'21. Code available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [301] arXiv:2111.05980 [pdf, other]
-
Title: Self-Supervised Real-time Video StabilizationComments: BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [302] arXiv:2111.05990 [pdf, ps, other]
-
Title: Traffic4cast -- Large-scale Traffic Prediction using 3DResNet and Sparse-UNetSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [303] arXiv:2111.06016 [pdf, other]
-
Title: Synthetic Document Generator for Annotation-free Layout RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [304] arXiv:2111.06020 [pdf, other]
-
Title: csBoundary: City-scale Road-boundary Detection in Aerial Images for High-definition MapsComments: Accepted by IEEE Robotics and Automation Letters and IEEE International Conference on Robotics and Automation (ICRA) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [305] arXiv:2111.06021 [pdf, other]
-
Title: Probabilistic Contrastive Learning for Domain AdaptationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [306] arXiv:2111.06031 [pdf, other]
-
Title: FINO: Flow-based Joint Image and Noise ModelSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [307] arXiv:2111.06038 [pdf, other]
-
Title: Hybrid Saturation Restoration for LDR Images of HDR ScenesComments: arXiv admin note: text overlap with arXiv:2007.02042Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [308] arXiv:2111.06054 [pdf, other]
-
Title: Indian Licence Plate Dataset in the wildComments: 5 pages, 4 figures, 3 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [309] arXiv:2111.06075 [pdf, other]
-
Title: Graph Relation Transformer: Incorporating pairwise object features into the Transformer architectureComments: Presented as poster in CVPR 2021 Visual Question Answering WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [310] arXiv:2111.06091 [pdf, other]
-
Title: A Survey of Visual TransformersAuthors: Yang Liu, Yao Zhang, Yixin Wang, Feng Hou, Jin Yuan, Jiang Tian, Yang Zhang, Zhongchao Shi, Jianping Fan, Zhiqiang HeComments: Accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [311] arXiv:2111.06098 [pdf, other]
-
Title: Open surgery tool classification and hand utilization using a multi-camera systemComments: 12 pages, 3 figures, submitted to IPCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [312] arXiv:2111.06119 [pdf, other]
-
Title: Fine-Grained Image Analysis with Deep Learning: A SurveyAuthors: Xiu-Shen Wei, Yi-Zhe Song, Oisin Mac Aodha, Jianxin Wu, Yuxin Peng, Jinhui Tang, Jian Yang, Serge BelongieComments: Accepted by IEEE TPAMISubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [313] arXiv:2111.06123 [pdf, other]
-
Title: Spatio-Temporal Scene-Graph Embedding for Autonomous Vehicle Collision PredictionAuthors: Arnav V. Malawade, Shih-Yuan Yu, Brandon Hsu, Deepan Muthirayan, Pramod P. Khargonekar, Mohammad A. Al FaruqueSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [314] arXiv:2111.06162 [pdf, other]
-
Title: Clicking Matters:Towards Interactive Human ParsingComments: Human parsing, interactive segmentation, semantic segmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [315] arXiv:2111.06195 [pdf, other]
-
Title: Towards Domain-Independent and Real-Time Gesture Recognition Using mmWave SignalAuthors: Yadong Li, Dongheng Zhang, Jinbo Chen, Jinwei Wan, Dong Zhang, Yang Hu, Qibin Sun, Yan ChenComments: This paper has been accepted by IEEE Transactions on Mobile Computing (2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [316] arXiv:2111.06265 [pdf, other]
-
Title: Dense Unsupervised Learning for Video SegmentationComments: To appear at NeurIPS*2021. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [317] arXiv:2111.06276 [pdf, ps, other]
-
Title: 6D Pose Estimation with Combined Deep Learning and 3D Vision Techniques for a Fast and Accurate Object GraspingSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [318] arXiv:2111.06306 [pdf, other]
-
Title: Automatically identifying a mobile phone user's position within a vehicleComments: 4 pages, 1 figureSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [319] arXiv:2111.06349 [pdf, other]
-
Title: Unsupervised Part Discovery from Contrastive ReconstructionComments: NeurIPS 2021. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [320] arXiv:2111.06377 [pdf, other]
-
Title: Masked Autoencoders Are Scalable Vision LearnersComments: Tech report. arXiv v2: add more transfer learning results; v3: add robustness evaluationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [321] arXiv:2111.06394 [pdf, other]
-
Title: The Emergence of Objectness: Learning Zero-Shot Segmentation from VideosComments: This paper has been accepted to NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [322] arXiv:2111.06500 [pdf, other]
-
Title: Dynamic Iterative Refinement for Efficient 3D Hand Pose EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [323] arXiv:2111.06575 [pdf, other]
-
Title: Self-supervised GAN DetectorSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [324] arXiv:2111.06636 [pdf, other]
-
Title: Closed-Loop Data Transcription to an LDR via Minimaxing Rate ReductionAuthors: Xili Dai, Shengbang Tong, Mingyang Li, Ziyang Wu, Michael Psenka, Kwan Ho Ryan Chan, Pengyuan Zhai, Yaodong Yu, Xiaojun Yuan, Heung Yeung Shum, Yi MaComments: 41 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [325] arXiv:2111.06638 [pdf, other]
-
Title: Meta-Teacher For Face Anti-SpoofingComments: Accepted by IEEE TPAMI-2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [326] arXiv:2111.06639 [pdf, other]
-
Title: Attention Guided Cosine Margin For Overcoming Class-Imbalance in Few-Shot Road Object DetectionComments: 8 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [327] arXiv:2111.06660 [pdf, other]
-
Title: Frequency learning for structured CNN filters with Gaussian fractional derivativesComments: Accepted at BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [328] arXiv:2111.06662 [pdf, other]
-
Title: A comprehensive study of clustering a class of 2D shapesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [329] arXiv:2111.06670 [pdf, other]
-
Title: Robust Analytics for Video-Based Gait BiometricsAuthors: Ebenezer R.H.P. IsaacComments: Ph.D. Thesis, Anna University, Chennai, Feb. 2018Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [330] arXiv:2111.06677 [pdf, other]
-
Title: AlphaRotate: A Rotation Detection Benchmark using TensorFlowComments: 7 pages, 1 figure, 1 tableSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [331] arXiv:2111.06738 [pdf, other]
-
Title: Improving Structured Text Recognition with Regular Expression BiasingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [332] arXiv:2111.06754 [pdf, other]
-
Title: Monte Carlo dropout increases model repeatabilityAuthors: Andreanne Lemay, Katharina Hoebel, Christopher P. Bridge, Didem Egemen, Ana Cecilia Rodriguez, Mark Schiffman, John Peter Campbell, Jayashree Kalpathy-CramerComments: Machine Learning for Health (ML4H) at NeurIPS 2021 - Extended AbstractSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [333] arXiv:2111.06762 [pdf, other]
-
Title: Diversity-Promoting Human Motion Interpolation via Conditional Variational Auto-EncoderSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [334] arXiv:2111.06774 [pdf, other]
-
Title: Identifying On-road Scenarios Predictive of ADHD usingDriving Simulator Time Series DataSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [335] arXiv:2111.06812 [pdf, other]
-
Title: Sci-Net: Scale Invariant Model for Buildings Segmentation from Aerial ImagerySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [336] arXiv:2111.06827 [pdf, ps, other]
-
Title: NRC-GAMMA: Introducing a Novel Large Gas Meter Image DatasetComments: 12 pages, 7 figures, 1 tableSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [337] arXiv:2111.06830 [pdf, other]
-
Title: Small or Far Away? Exploiting Deep Super-Resolution and Altitude Data for Aerial Animal SurveillanceComments: 11 pages, 7 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [338] arXiv:2111.06838 [pdf, other]
-
Title: Temporally-Consistent Surface Reconstruction using Metrically-Consistent AtlasesAuthors: Jan Bednarik, Noam Aigerman, Vladimir G. Kim, Siddhartha Chaudhuri, Shaifali Parashar, Mathieu Salzmann, Pascal FuaComments: 21 pages. arXiv admin note: substantial text overlap with arXiv:2104.06950Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [339] arXiv:2111.06839 [pdf, other]
-
Title: The self-supervised spectral-spatial attention-based transformer network for automated, accurate prediction of crop nitrogen status from UAV imagerySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [340] arXiv:2111.06849 [pdf, other]
-
Title: Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited DataSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [341] arXiv:2111.06881 [pdf, other]
-
Title: Multimodal Virtual Point 3D DetectionComments: NeurIPS 2021, code available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [342] arXiv:2111.06913 [pdf, other]
-
Title: Visual Intelligence through Human InteractionComments: This is a preprint of the following chapter: Ranjay Krishna, Mitchell Gordon, Li Fei-Fei, Michael Bernstein, Visual Intelligence through Human Interaction, published in Artificial Intelligence for Human Computer Interaction: A Modern Approach, edited by Yang Li and Otmar Hilliges, 2021, Springer reproduced with permission of Springer Nature. arXiv admin note: substantial text overlap with arXiv:1602.04506, arXiv:1904.01121Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [343] arXiv:2111.06925 [pdf, other]
-
Title: Action2video: Generating Videos of Human 3D ActionsComments: Accepted by IJCVSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [344] arXiv:2111.06934 [pdf, other]
-
Title: Contrastive Feature Loss for Image PredictionComments: Appeared in Advances in Image Manipulation Workshop at ICCV 2021. GitHub: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [345] arXiv:2111.06959 [pdf, other]
-
Title: Through-Foliage Tracking with Airborne Optical SectioningComments: 9 Pages, 9 Figures, 1 Table and supplementary videos and materialSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [346] arXiv:2111.06994 [pdf, other]
-
Title: Learning Online for Unified Segmentation and Tracking ModelsJournal-ref: International Joint Conference on Neural Networks (IJCNN), 2021, pp. 1-8Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [347] arXiv:2111.06995 [pdf, other]
-
Title: A Central Difference Graph Convolutional Operator for Skeleton-Based Action RecognitionComments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [348] arXiv:2111.07009 [pdf, other]
-
Title: Leveraging Unsupervised Image Registration for Discovery of Landmark Shape DescriptorComments: Published in Medical Image AnalysisSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [349] arXiv:2111.07039 [pdf, other]
-
Title: UET-Headpose: A sensor-based top-view head pose datasetSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
- [350] arXiv:2111.07044 [pdf, other]
-
Title: Hyperspectral Mixed Noise Removal via Subspace Representation and Weighted Low-rank Tensor RegularizationSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
- [351] arXiv:2111.07047 [pdf, other]
-
Title: Facial Landmark Points Detection Using Knowledge Distillation-Based Neural NetworksComments: Accepted in Computer Vision and Image Understanding JournalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [352] arXiv:2111.07048 [pdf, other]
-
Title: Image Classification with Consistent Supporting EvidenceComments: 13 pages, 6 figures, proceedings of the Machine Learning for Health NeurIPS Workshop, 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [353] arXiv:2111.07072 [pdf, other]
-
Title: Factorial Convolution Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [354] arXiv:2111.07090 [pdf, other]
-
Title: D$^2$LV: A Data-Driven and Local-Verification Approach for Image Copy DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [355] arXiv:2111.07102 [pdf, other]
-
Title: Deep Neural Networks for Automatic Grain-matrix Segmentation in Plane and Cross-polarized Sandstone PhotomicrographsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [356] arXiv:2111.07117 [pdf, other]
-
Title: Learning Object-Centric Representations of Multi-Object Scenes from Multiple ViewsComments: Accepted at NeurIPS 2020 (Spotlight)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [357] arXiv:2111.07129 [pdf, other]
-
Title: Visual Understanding of Complex Table Structures from Document ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [358] arXiv:2111.07139 [pdf, other]
-
Title: Full-attention based Neural Architecture Search using Context Auto-regressionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [359] arXiv:2111.07145 [pdf, other]
-
Title: New Performance Measures for Object Tracking under Complex EnvironmentsAuthors: Ajoy MondalSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [360] arXiv:2111.07156 [pdf, ps, other]
-
Title: Developing a Novel Approach for Periapical Dental Radiographs SegmentationComments: Accepted in 2013 5th Conference on Information and Knowledge Technology (IKT) this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [361] arXiv:2111.07169 [pdf, other]
-
Title: Where to Look: A Unified Attention Model for Visual Recognition with Reinforcement LearningAuthors: Gang ChenComments: 11 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [362] arXiv:2111.07195 [pdf, other]
-
Title: PhysXNet: A Customizable Approach for LearningCloth Dynamics on Dressed PeopleSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [363] arXiv:2111.07224 [pdf, other]
-
Title: Local Multi-Head Channel Self-Attention for Facial Expression RecognitionComments: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [364] arXiv:2111.07239 [pdf, other]
-
Title: Robust and Accurate Object Detection via Self-Knowledge DistillationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [365] arXiv:2111.07248 [pdf, other]
-
Title: Background-Aware 3D Point Cloud Segmentationwith Dynamic Point Feature AggregationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [366] arXiv:2111.07258 [pdf, other]
-
Title: Sign Language Translation with Hierarchical Spatio-TemporalGraph Neural NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [367] arXiv:2111.07279 [pdf, other]
-
Title: Auxiliary Loss Reweighting for Image InpaintingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [368] arXiv:2111.07283 [pdf, other]
-
Title: Novel Intensity Mapping Functions: Weighted Histogram AveragingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [369] arXiv:2111.07344 [pdf, other]
-
Title: Towards Privacy-Preserving Affect Recognition: A Two-Level Deep Learning ArchitectureAuthors: Jimiama M. Mase, Natalie Leesakul, Fan Yang, Grazziela P. Figueredo, Mercedes Torres TorresComments: 8 pages, 6 figures, 4 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [370] arXiv:2111.07370 [pdf, other]
-
Title: Co-segmentation Inspired Attention Module for Video-based Computer Vision TasksAuthors: Arulkumar Subramaniam, Jayesh Vaidya, Muhammed Abdul Majeed Ameen, Athira Nambiar, Anurag MittalComments: 26 pages, 14 figures, Preprint submitted to CVIU journalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [371] arXiv:2111.07383 [pdf, other]
-
Title: Sparse Steerable Convolutions: An Efficient Learning of SE(3)-Equivariant Features for Estimation and Tracking of Object Poses in 3D SpaceComments: Accepted by NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [372] arXiv:2111.07418 [pdf, other]
-
Title: TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view StereoComments: CoRL 2021. The manuscript contains the main paper and the supplementary materials. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [373] arXiv:2111.07424 [pdf, other]
-
Title: Generating Band-Limited Adversarial Surfaces Using Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [374] arXiv:2111.07426 [pdf, other]
-
Title: Unsupervised Action Localization Crop in Video Retargeting for 3D ConvNetsComments: Accepted for Publication in Proceedings of 2021 IEEE Region 10 Conference (TENCON), 7-10 December 2021, Auckland, New ZealandSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [375] arXiv:2111.07432 [pdf, ps, other]
-
Title: A Comparative Study of Fingerprint Image-Quality Estimation MethodsAuthors: Fernando Alonso-Fernandez, Julian Fierrez, Javier Ortega-Garcia, Joaquin Gonzalez-Rodriguez, Hartwig Fronthaler, Klaus Kollreider, Josef BigunComments: Published at IEEE Transactions on Information Forensics and SecuritySubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [376] arXiv:2111.07468 [pdf, other]
-
Title: Impact of Benign Modifications on Discriminative Performance of Deepfake DetectorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [377] arXiv:2111.07492 [pdf, other]
-
Title: Finding Optimal Tangent Points for Reducing Distortions of Hard-label AttacksComments: Accepted at NeurIPS 2021. The missing square term in Eqn.(13), as well as many other mistakes of the previous version, have been fixed in the current versionSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [378] arXiv:2111.07499 [pdf, other]
-
Title: Reinforcement Learning of Self Enhancing Camera Image and Signal ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [379] arXiv:2111.07529 [pdf, other]
-
Title: Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance SegmentationComments: Accepted at CVPR RVSU Workshop 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [380] arXiv:2111.07534 [pdf, other]
-
Title: A Probabilistic Hard Attention Model For Sequentially Observed ScenesComments: Accepted to BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [381] arXiv:2111.07547 [pdf, other]
-
Title: Searching for TrioNet: Combining Convolution with Local and Global Self-AttentionComments: BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [382] arXiv:2111.07548 [pdf, other]
-
Title: Unsupervised Lightweight Single Object Tracking with UHP-SOT++Comments: updated content: comparison with state-of-the-art deep unsupervised methodsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [383] arXiv:2111.07556 [pdf, other]
-
Title: High-Quality Real Time Facial Capture Based on Single CameraComments: arXiv admin note: text overlap with arXiv:1609.06536 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [384] arXiv:2111.07593 [pdf, ps, other]
-
Title: Weakly-Supervised Dense Action AnticipationComments: BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [385] arXiv:2111.07597 [pdf, other]
-
Title: DFC: Deep Feature Consistency for Robust Point Cloud RegistrationComments: 12 pages, 7 figures, 6 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [386] arXiv:2111.07601 [pdf, other]
-
Title: FakeTransformer: Exposing Face Forgery From Spatial-Temporal Representation Modeled By Facial Pixel VariationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [387] arXiv:2111.07620 [pdf, other]
-
Title: Fingerprint Presentation Attack Detection by Channel-wise Feature DenoisingComments: 15 pages, 8 figures, Accepted by TIFSJournal-ref: IEEE Transactions on Information Forensics and Security, vol. 17, pp. 2963-2976, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [388] arXiv:2111.07624 [pdf, other]
-
Title: Attention Mechanisms in Computer Vision: A SurveyAuthors: Meng-Hao Guo, Tian-Xing Xu, Jiang-Jiang Liu, Zheng-Ning Liu, Peng-Tao Jiang, Tai-Jiang Mu, Song-Hai Zhang, Ralph R. Martin, Ming-Ming Cheng, Shi-Min HuComments: 27 pages, 9 figuresJournal-ref: Computational Visual Media, 2022, Vol. 8, No. 3, 331-368Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [389] arXiv:2111.07625 [pdf, ps, other]
-
Title: On the validation of pansharpening methodsAuthors: Gintautas PalubinskasComments: 18 pages, 5 figures, 6 tables. arXiv admin note: substantial text overlap with arXiv:2103.03062Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [390] arXiv:2111.07632 [pdf, ps, other]
-
Title: CoReS: Compatible Representations via StationarityComments: in IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [391] arXiv:2111.07646 [pdf, other]
-
Title: Multimodal Generalized Zero Shot Learning for Gleason Grading using Self-Supervised LearningAuthors: Dwarikanath MahapatraSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [392] arXiv:2111.07677 [pdf, other]
-
Title: FastFlow: Unsupervised Anomaly Detection and Localization via 2D Normalizing FlowsComments: 11 pages,8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [393] arXiv:2111.07716 [pdf, other]
-
Title: Interactive Medical Image Segmentation with Self-Adaptive Confidence CalibrationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [394] arXiv:2111.07722 [pdf, other]
-
Title: Stacked BNAS: Rethinking Broad Convolutional Neural Network for Neural Architecture SearchComments: 12 pages, 10 figures, 5 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [395] arXiv:2111.07746 [pdf, other]
-
Title: Real-time Emotion and Gender Classification using Ensemble CNNSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [396] arXiv:2111.07749 [pdf, other]
-
Title: Fast Computation of Hahn Polynomials for High Order MomentsComments: This first of this paper is submitted to Pattern Recognition at June 15, 2020 This version of the manuscript is submitted to PLOS ONEJournal-ref: IEEE Access, 10 (2022) 48719-48732Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [397] arXiv:2111.07774 [pdf, other]
-
Title: D^2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in VideosComments: Accepted to WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [398] arXiv:2111.07783 [pdf, other]
-
Title: FILIP: Fine-grained Interactive Language-Image Pre-TrainingAuthors: Lewei Yao, Runhui Huang, Lu Hou, Guansong Lu, Minzhe Niu, Hang Xu, Xiaodan Liang, Zhenguo Li, Xin Jiang, Chunjing XuSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [399] arXiv:2111.07832 [pdf, other]
-
Title: iBOT: Image BERT Pre-Training with Online TokenizerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [400] arXiv:2111.07837 [pdf, other]
-
Title: Multi-View Motion Synthesis via Applying Rotated Dual-Pixel Blur KernelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [401] arXiv:2111.07839 [pdf, other]
-
Title: Learnable Locality-Sensitive Hashing for Video Anomaly DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [402] arXiv:2111.07846 [pdf, other]
-
Title: Multi-Task Classification of Sewer Pipe Defects and Properties using a Cross-Task Graph Neural Network DecoderComments: WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [403] arXiv:2111.07868 [pdf, other]
-
Title: Tracking People with 3D RepresentationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [404] arXiv:2111.07898 [pdf, other]
-
Title: Category-orthogonal object features guide information processing in recurrent neural networks trained for object categorizationComments: 13 pages, 9 figures, peer-reviewed and accepted at the SVRHM 2021 workshop at NeurIPS (+ 2 additional sections in the Appendix presenting newer supplementary results). SVRHM 2021 Workshop@ NeurIPS. 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [405] arXiv:2111.07900 [pdf, other]
-
Title: Volumetric Parameterization of the Placenta to a Flattened TemplateAuthors: S. Mazdak Abulnaga, Esra Abaci Turk, Mikhail Bessmeltsev, P. Ellen Grant, Justin Solomon, Polina GollandComments: Accepted to IEEE TMI ( (c) IEEE). This manuscript expands the MICCAI 2019 paper (arXiv:1903.05044) by developing additional template models and extensions to improve robustness, expanded evaluation on a significantly larger dataset, and experiments and discussion demonstrating utility for clinical research. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [406] arXiv:2111.07902 [pdf, other]
-
Title: Deep Semantic Manipulation of Facial VideosComments: 4th Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW), European Conference on Computer Vision (ECCV), Tel Aviv, Israel, October 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [407] arXiv:2111.07910 [pdf, other]
-
Title: Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image ReconstructionAuthors: Yuanhao Cai, Jing Lin, Xiaowan Hu, Haoqian Wang, Xin Yuan, Yulun Zhang, Radu Timofte, Luc Van GoolComments: CVPR 2022; The first Transformer-based method for snapshot compressive imagingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [408] arXiv:2111.07945 [pdf, other]
-
Title: Large-Scale Hyperspectral Image Clustering Using Contrastive LearningComments: Under review by IEEE Trans. xxxSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [409] arXiv:2111.07950 [pdf, other]
-
Title: Occluded Video Instance Segmentation: Dataset and ICCV 2021 ChallengeAuthors: Jiyang Qi, Yan Gao, Yao Hu, Xinggang Wang, Xiaoyu Liu, Xiang Bai, Serge Belongie, Alan Yuille, Philip H.S. Torr, Song BaiComments: Accepted by NeurIPS 2021 Datasets and Benchmarks Track. arXiv admin note: text overlap with arXiv:2102.01558Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [410] arXiv:2111.07954 [pdf, other]
- [411] arXiv:2111.07971 [pdf, other]
-
Title: Towards Optimal Strategies for Training Self-Driving Perception Models in SimulationComments: NeurIPS 2021; Project website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [412] arXiv:2111.07991 [pdf, other]
-
Title: LiT: Zero-Shot Transfer with Locked-image text TuningAuthors: Xiaohua Zhai, Xiao Wang, Basil Mustafa, Andreas Steiner, Daniel Keysers, Alexander Kolesnikov, Lucas BeyerComments: Xiaohua, Xiao, Basil, Andreas and Lucas contributed equally; CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [413] arXiv:2111.08004 [pdf, other]
-
Title: Bag of Tricks and A Strong baseline for Image Copy DetectionComments: arXiv admin note: substantial text overlap with arXiv:2111.07090Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [414] arXiv:2111.08046 [pdf, other]
-
Title: Beyond Mono to Binaural: Generating Binaural Audio from Mono Audio with Depth and Cross Modal AttentionComments: To appear in WACV 2022. arXiv admin note: text overlap with arXiv:2108.04906Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [415] arXiv:2111.08062 [pdf, other]
-
Title: Synthetic Unknown Class Learning for Learning UnknownsAuthors: Jaeyeon JangComments: 11 pages, 7 figures, 4 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [416] arXiv:2111.08069 [pdf, other]
-
Title: Two-dimensional Deep Regression for Early Yield Prediction of Winter WheatComments: Accepted to appear in the SPIE Future Sensing Technologies 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [417] arXiv:2111.08094 [pdf, other]
-
Title: LIMEcraft: Handcrafted superpixel selection and inspection for Visual eXplanationsJournal-ref: Machine Learning (2022) 1-18Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [418] arXiv:2111.08174 [pdf, other]
-
Title: ShapeY: Measuring Shape Recognition Capacity Using Nearest Neighbor MatchingComments: 6 pages, 5 figures, Accepted to NeurIPS: ImageNet Past, Present, and FutureSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [419] arXiv:2111.08176 [pdf, other]
-
Title: Coarse-to-fine Animal Pose and Shape EstimationComments: Accepted by Neurips2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [420] arXiv:2111.08184 [pdf, other]
- [421] arXiv:2111.08243 [pdf, other]
-
Title: CAR -- Cityscapes Attributes Recognition A Multi-category Attributes Dataset for Autonomous VehiclesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [422] arXiv:2111.08249 [pdf, other]
-
Title: Bengali Handwritten Grapheme Classification: Deep Learning ApproachComments: 8 pages, 15 figures, pre-printSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [423] arXiv:2111.08251 [pdf, other]
-
Title: Enabling equivariance for arbitrary Lie groupsComments: Oral presentation at the Conference on Computer Vision and Pattern Recognition (CVPR), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [424] arXiv:2111.08259 [pdf, ps, other]
-
Title: Pose Recognition in the Wild: Animal pose estimation using Agglomerative Clustering and Contrastive LearningComments: 9 pages, 4 figures, 3 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [425] arXiv:2111.08270 [pdf, other]
-
Title: Data Augmentation using Random Image Cropping for High-resolution Virtual Try-On (VITON-CROP)Comments: 4 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [426] arXiv:2111.08279 [pdf, other]
-
Title: Keypoint Message Passing for Video-based Person Re-IdentificationComments: To appear in AAAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [427] arXiv:2111.08282 [pdf, other]
-
Title: Self-supervised Re-renderable Facial Albedo Reconstruction from Single ImageSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [428] arXiv:2111.08313 [pdf, other]
-
Title: Towards Comprehensive Monocular Depth Estimation: Multiple Heads Are Better Than OneAuthors: Shuwei Shao, Ran Li, Zhongcai Pei, Zhong Liu, Weihai Chen, Wentao Zhu, Xingming Wu, Baochang ZhangComments: Accepted by TMM 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [429] arXiv:2111.08314 [pdf, other]
-
Title: TRIG: Transformer-Based Text Recognizer with Initial Embedding GuidanceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [430] arXiv:2111.08318 [pdf, other]
-
Title: DRINet++: Efficient Voxel-as-point Point Cloud SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [431] arXiv:2111.08320 [pdf, other]
-
Title: Which CNNs and Training Settings to Choose for Action Unit Detection? A Study Based on a Large-Scale DatasetComments: FG 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [432] arXiv:2111.08324 [pdf, other]
-
Title: Choose Settings Carefully: Comparing Action Unit detection at Different Settings Using a Large-Scale DatasetComments: ICIP 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [433] arXiv:2111.08334 [pdf, other]
-
Title: Pansharpening by convolutional neural networks in the full resolution frameworkJournal-ref: IEEE Transactions on Geoscience and Remote SensingSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [434] arXiv:2111.08349 [pdf, other]
-
Title: SEnSeI: A Deep Learning Module for Creating Sensor Independent Cloud MasksComments: 22 pages, 7 figures. This is an accepted version of work to be published in the IEEE Transactions on Geoscience and Remote SensingSubjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [435] arXiv:2111.08370 [pdf, other]
-
Title: Fight Detection from Still Images in the WildComments: Accepted for publication at Winter Conference of Applications on Computer Vision Workshops (WACV-W 2022), Workshop on Real-World Surveillance: Applications and ChallengesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [436] arXiv:2111.08383 [pdf, other]
-
Title: Single Image Object Counting and Localizing using Active-LearningComments: Published in IEEE Winter Conference on Applications of Computer Vision (WACV) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [437] arXiv:2111.08401 [pdf, other]
-
Title: Weakly-supervised fire segmentation by visualizing intermediate CNN layersSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [438] arXiv:2111.08413 [pdf, other]
-
Title: Improved Robustness of Vision Transformer via PreLayerNorm in Patch EmbeddingComments: 7 pages, 8 figures. Work in ProgressSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [439] arXiv:2111.08419 [pdf, other]
-
Title: Delta-GAN-Encoder: Encoding Semantic Changes for Explicit Image Editing, using Few Synthetic SamplesComments: 8 pages, 13 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [440] arXiv:2111.08434 [pdf, other]
-
Title: Robust 3D Scene Segmentation through Hierarchical and Learnable Part-FusionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [441] arXiv:2111.08468 [pdf, other]
-
Title: Point detection through multi-instance deep heatmap regression for sutures in endoscopyAuthors: Lalith Sharan, Gabriele Romano, Julian Brand, Halvar Kelm, Matthias Karck, Raffaele De Simone, Sandy EngelhardtComments: Accepted to International Journal of Computer Assisted Radiology and Surgery, 15 pages, 5 figuresJournal-ref: Int J CARS (2021) 1861-6429Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [442] arXiv:2111.08485 [pdf, other]
-
Title: Consistent Semantic Attacks on Optical FlowComments: Paper and supplementary materialSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [443] arXiv:2111.08492 [pdf, other]
-
Title: Real-time 3D human action recognition based on Hyperpoint sequenceComments: The paper has been published in IEEE Transactions on Industrial Informatics. [1]Li X, Huang Q, Wang Z, et al. Real-Time 3D Human Action Recognition Based on Hyperpoint Sequence[J]. IEEE Transactions on Industrial Informatics, 2022. The code of this paper has been made public at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [444] arXiv:2111.08521 [pdf, other]
-
Title: Learning Intrinsic Images for ClothingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [445] arXiv:2111.08531 [pdf, other]
-
Title: Language bias in Visual Question Answering: A Survey and TaxonomyAuthors: Desen YuanComments: 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [446] arXiv:2111.08557 [pdf, other]
-
Title: Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-Person Human Pose EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [447] arXiv:2111.08567 [pdf, other]
-
Title: Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face VideosComments: 21 pages, 15 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [448] arXiv:2111.08614 [pdf, other]
-
Title: IKEA Object State Dataset: A 6DoF object pose estimation dataset and benchmark for multi-state assembly objectsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [449] arXiv:2111.08620 [pdf, ps, other]
-
Title: A Data-Driven Approach for Linear and Nonlinear Damage Detection Using Variational Mode Decomposition and GARCH ModelAuthors: Vahid Reza Gharehbaghi, Hashem Kalbkhani, Ehsan Noroozinejad Farsangi, T.Y. Yang, Seyedali MirjaliliComments: 30 Pages, 12 Figures and 8 Tables, Submitted Journal: Engineering with Computers, SpringerJournal-ref: Engineering with Computers (2022)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [450] arXiv:2111.08644 [pdf, other]
-
Title: UBnormal: New Benchmark for Supervised Open-Set Video Anomaly DetectionAuthors: Andra Acsintoae, Andrei Florescu, Mariana-Iuliana Georgescu, Tudor Mare, Paul Sumedrea, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak ShahComments: Accepted at CVPR 2022. Paper + supplementary (15 pages, 9 figures)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [451] arXiv:2111.08651 [pdf, ps, other]
-
Title: Diversified Multi-prototype Representation for Semi-supervised SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [452] arXiv:2111.08687 [pdf, other]
-
Title: INTERN: A New Learning Paradigm Towards General VisionAuthors: Jing Shao, Siyu Chen, Yangguang Li, Kun Wang, Zhenfei Yin, Yinan He, Jianing Teng, Qinghong Sun, Mengya Gao, Jihao Liu, Gengshi Huang, Guanglu Song, Yichao Wu, Yuming Huang, Fenggang Liu, Huan Peng, Shuo Qin, Chengyu Wang, Yujie Wang, Conghui He, Ding Liang, Yu Liu, Fengwei Yu, Junjie Yan, Dahua Lin, Xiaogang Wang, Yu QiaoSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [453] arXiv:2111.08702 [pdf, ps, other]
-
Title: The Multiscenario Multienvironment BioSecure Multimodal Database (BMDB)Authors: Javier Ortega-Garcia, Julian Fierrez, Fernando Alonso-Fernandez, Javier Galbally, Manuel R Freire, Joaquin Gonzalez-Rodriguez, Carmen Garcia-Mateo, Jose-Luis Alba-Castro, Elisardo Gonzalez-Agulla, Enrique Otero-Muras, Sonia Garcia-Salicetti, Lorene Allano, Bao Ly-Van, Bernadette Dorizzi, Josef Kittler, Thirimachos Bourlai, Norman Poh, Farzin Deravi, Ming NR Ng, Michael Fairhurst, Jean Hennebert, Andreas Humm, Massimo Tistarelli, Linda Brodo, Jonas Richiardi, Andrezj Drygajlo, Harald Ganster, Federico M Sukno, Sri-Kaushik Pavani, Alejandro Frangi, Lale Akarun, Arman SavranComments: Published at IEEE Transactions on Pattern Analysis and Machine Intelligence journalSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [454] arXiv:2111.08703 [pdf, ps, other]
-
Title: Benchmarking Quality-Dependent and Cost-Sensitive Score-Level Multimodal Biometric Fusion AlgorithmsAuthors: Norman Poh, Thirimachos Bourlai, Josef Kittler, Lorene Allano, Fernando Alonso-Fernandez, Onkar Ambekar, John Baker, Bernadette Dorizzi, Omolara Fatukasi, Julian Fierrez, Harald Ganster, Javier Ortega-Garcia, Donald Maurer, Albert Ali Salah, Tobias Scheidat, Claus VielhauerComments: Published at IEEE Transactions on Information Forensics and Security journalSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [455] arXiv:2111.08704 [pdf, ps, other]
-
Title: Quality Measures in Biometric SystemsComments: Published at IEEE Security & Privacy journalSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [456] arXiv:2111.08738 [pdf, other]
-
Title: Synthesis-Guided Feature Learning for Cross-Spectral Periocular RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [457] arXiv:2111.08755 [pdf, other]
-
Title: Learning Scene Dynamics from Point Cloud SequencesComments: Accepted for publication in International Journal of Computer Vision, Special Issue on 3D Computer Vision. Code and data: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [458] arXiv:2111.08772 [pdf, other]
-
Title: Computer Vision for Supporting Image SearchAuthors: Alan F. SmeatonComments: 10 pagesJournal-ref: Advances in Visual Informatics. H. Badioze Zaman et al (Eds). IVIC 2021, LNCS 13051, pp1-10, 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
- [459] arXiv:2111.08774 [pdf, other]
-
Title: Film Trailer Generation via Task DecompositionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [460] arXiv:2111.08785 [pdf, ps, other]
-
Title: Detecting AutoAttack Perturbations in the Frequency DomainComments: accepted at ICML 2021 workshop for robustnessSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [461] arXiv:2111.08799 [pdf, other]
-
Title: DeltaConv: Anisotropic Operators for Geometric Deep Learning on Point CloudsComments: 8 pages, 5 figures, 7 tables; ACM Transactions on Graphics 41, 4, Article 105 (SIGGRAPH 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [462] arXiv:2111.08826 [pdf, other]
-
Title: A Benchmark for Modeling Violation-of-Expectation in Physical Reasoning Across Event CategoriesAuthors: Arijit Dasgupta, Jiafei Duan, Marcelo H. Ang Jr, Yi Lin, Su-hua Wang, Renée Baillargeon, Cheston TanComments: arXiv admin note: text overlap with arXiv:2110.05836Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [463] arXiv:2111.08831 [pdf, other]
-
Title: HARA: A Hierarchical Approach for Robust Rotation AveragingComments: Accepted to CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [464] arXiv:2111.08867 [pdf, other]
-
Title: TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in VideoAuthors: Mario Alberto Duran-Vega, Miguel Gonzalez-Mendoza, Leonardo Chang, Cuauhtemoc Daniel Suarez-RamirezSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [465] arXiv:2111.08869 [pdf, other]
-
Title: Enhanced Correlation Matching based Video Frame InterpolationComments: Accepted to WACV 2022, equal contribution from first two authorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [466] arXiv:2111.08872 [pdf, other]
-
Title: TorchGeo: Deep Learning With Geospatial DataAuthors: Adam J. Stewart, Caleb Robinson, Isaac A. Corley, Anthony Ortiz, Juan M. Lavista Ferres, Arindam BanerjeeSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [467] arXiv:2111.08892 [pdf, other]
-
Title: SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive DerainingComments: Accepted by WACV2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [468] arXiv:2111.08897 [pdf, other]
-
Title: ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D DataAuthors: Gilad Baruch, Zhuoyuan Chen, Afshin Dehghan, Tal Dimry, Yuri Feigin, Peter Fu, Thomas Gebauer, Brandon Joffe, Daniel Kurz, Arik Schwartz, Elad ShulmanSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [469] arXiv:2111.08913 [pdf, other]
-
Title: Hierarchical Knowledge Guided Learning for Real-world Retinal Diseases RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [470] arXiv:2111.08918 [pdf, other]
-
Title: Local Texture Estimator for Implicit Representation FunctionComments: CVPR 2022 camera-ready version (this https URL)Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [471] arXiv:2111.08919 [pdf, other]
-
Title: EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding MatchingComments: cvpr2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [472] arXiv:2111.08927 [pdf, other]
-
Title: Protection of SVM Model with Secret Key from Unauthorized AccessComments: To appear in IWAIT 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [473] arXiv:2111.08954 [pdf, other]
-
Title: Tracklet-Switch Adversarial Attack against Pedestrian Multi-Object Tracking TrackersSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [474] arXiv:2111.08960 [pdf, other]
-
Title: Compositional Transformers for Scene GenerationComments: Published as a conference paper at NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [475] arXiv:2111.08973 [pdf, other]
-
Title: Generating Unrestricted 3D Adversarial Point CloudsSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [476] arXiv:2111.08974 [pdf, other]
-
Title: Pedestrian Detection by Exemplar-Guided Contrastive LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [477] arXiv:2111.08994 [pdf, ps, other]
-
Title: Nonlinear Intensity Sonar Image Matching based on Deep Convolution FeaturesComments: 5 pages, 9 figures. The final manuscript we submitted is a research under the original title. Compared with the previous papers, we adopted a more novel research method and experimental designSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [478] arXiv:2111.09006 [pdf, other]
-
Title: Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [479] arXiv:2111.09027 [pdf, other]
- [480] arXiv:2111.09034 [pdf, other]
-
Title: Using Convolutional Neural Networks to Detect Compression AlgorithmsAuthors: Shubham BharadwajComments: Pre-print Under ReviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [481] arXiv:2111.09056 [pdf, other]
-
Title: Improving Person Re-Identification with Temporal ConstraintsComments: 10 pages, RWS @ WACV2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Multimedia (cs.MM)
- [482] arXiv:2111.09091 [pdf, other]
-
Title: Motion Detection using CSI from Raspberry Pi 4Comments: 8 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [483] arXiv:2111.09094 [pdf, other]
-
Title: STEEX: Steering Counterfactual Explanations with SemanticsComments: ECCV 2022 --- 14 pages + supplementarySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [484] arXiv:2111.09099 [pdf, other]
-
Title: Self-Supervised Predictive Convolutional Attentive Block for Anomaly DetectionAuthors: Nicolae-Catalin Ristea, Neelu Madan, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak ShahComments: Accepted at CVPR 2022. Paper + supplementary (14 pages, 9 figures)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [485] arXiv:2111.09113 [pdf, ps, other]
-
Title: 2nd Place Solution to Facebook AI Image Similarity Challenge Matching TrackAuthors: SeungKee JeonSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [486] arXiv:2111.09117 [pdf, ps, other]
-
Title: Image-based monitoring of bolt loosening through deep-learning-based integrated detection and trackingComments: 15 pages, 11 figures, accepted in Journal of Computer Aided Civil and Infrastructure Engineering, 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [487] arXiv:2111.09136 [pdf, other]
-
Title: IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Shot Network QuantizationAuthors: Yunshan Zhong, Mingbao Lin, Gongrui Nan, Jianzhuang Liu, Baochang Zhang, Yonghong Tian, Rongrong JiComments: CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [488] arXiv:2111.09137 [pdf, other]
-
Title: Two-Face: Adversarial Audit of Commercial Face Recognition SystemsComments: This work has been accepted for publication at ICWSM 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [489] arXiv:2111.09155 [pdf, ps, other]
-
Title: Oil and Gas Pipeline Monitoring during COVID-19 Pandemic via Unmanned Aerial VehicleComments: 14th International Conference of Education, Research and Innovation (ICERI2021)Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Computers and Society (cs.CY)
- [490] arXiv:2111.09162 [pdf, other]
-
Title: It's About Time: Analog Clock Reading in the WildComments: CVPR 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [491] arXiv:2111.09171 [pdf, ps, other]
-
Title: Automated Approach for Computer Vision-based Vehicle Movement Classification at Traffic IntersectionsAuthors: Udita Jana, Jyoti Prakash Das Karmakar, Pranamesh Chakraborty, Tingting Huang, Dave Ness, Duane Ritcher, Anuj SharmaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [492] arXiv:2111.09267 [pdf, other]
-
Title: DiverGAN: An Efficient and Effective Single-Stage Framework for Diverse Text-to-Image GenerationJournal-ref: Neurocomputing 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [493] arXiv:2111.09276 [pdf, other]
-
Title: Induce, Edit, Retrieve: Language Grounded Multimodal Schema for Instructional Video RetrievalSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [494] arXiv:2111.09297 [pdf, other]
-
Title: Learning to Compose Visual RelationsComments: NeurIPS 2021 (Spotlight), first three authors contributed equally, Website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO); Machine Learning (stat.ML)
- [495] arXiv:2111.09298 [pdf, other]
-
Title: SeCGAN: Parallel Conditional Generative Adversarial Networks for Face Editing via Semantic ConsistencyComments: Accepted by AI for Content Creation (AI4CC) workshop at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [496] arXiv:2111.09301 [pdf, other]
-
Title: Learning to Align Sequential Actions in the WildSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [497] arXiv:2111.09303 [pdf, ps, other]
-
Title: Facial Information Analysis Technology for Gender and Age EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [498] arXiv:2111.09337 [pdf, other]
-
Title: Temporally Consistent Online Depth Estimation in Dynamic ScenesAuthors: Zhaoshuo Li, Wei Ye, Dilin Wang, Francis X. Creighton, Russell H. Taylor, Ganesh Venkatesh, Mathias UnberathComments: WACV 2023, project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [499] arXiv:2111.09378 [pdf, other]
-
Title: MPF6D: Masked Pyramid Fusion 6D Pose EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [500] arXiv:2111.09383 [pdf, other]
-
Title: DeepCurrents: Learning Implicit Representations of Shapes with BoundariesSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [501] arXiv:2111.09403 [pdf, ps, other]
-
Title: Fine-Grained Vehicle Classification in Urban Traffic Scenes using Deep LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [502] arXiv:2111.09406 [pdf, other]
-
Title: Rethinking Drone-Based Search and Rescue with Aerial Person DetectionComments: 10 pages, 5 figures, 3 tables, 1 algorithmSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [503] arXiv:2111.09450 [pdf, other]
-
Title: See Eye to Eye: A Lidar-Agnostic 3D Detection Framework for Unsupervised Multi-Target Domain AdaptationComments: Published in RAL and presented in IROS 2022. Code is available at this https URLJournal-ref: IEEE Robotics and Automation Letters (2022)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [504] arXiv:2111.09451 [pdf, other]
-
Title: Benchmarking and scaling of deep learning models for land cover image classificationAuthors: Ioannis Papoutsis, Nikolaos-Ioannis Bountos, Angelos Zavras, Dimitrios Michail, Christos TryfonopoulosComments: 25 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [505] arXiv:2111.09452 [pdf, other]
-
Title: Open Vocabulary Object Detection with Pseudo Bounding-Box LabelsComments: ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [506] arXiv:2111.09485 [pdf, other]
-
Title: 3D Lip Event Detection via Interframe Motion Divergence at Multiple Temporal ResolutionsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [507] arXiv:2111.09492 [pdf, other]
-
Title: Reference-based Magnetic Resonance Image Reconstruction Using Texture TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [508] arXiv:2111.09496 [pdf, ps, other]
-
Title: Developing a Machine Learning Algorithm-Based Classification Models for the Detection of High-Energy Gamma ParticlesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [509] arXiv:2111.09499 [pdf, other]
-
Title: Dynamically pruning segformer for efficient semantic segmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [510] arXiv:2111.09503 [pdf, other]
-
Title: Blind VQA on 360° Video via Progressively Learning from Pixels, Frames and VideoComments: Under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [511] arXiv:2111.09515 [pdf, other]
-
Title: Range-Aware Attention Network for LiDAR-based 3D Object Detection with Auxiliary Point Density Level EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [512] arXiv:2111.09526 [pdf, other]
-
Title: Learning Modified Indicator Functions for Surface ReconstructionComments: Accepted by Computers & Graphics from SMI 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [513] arXiv:2111.09539 [pdf, other]
-
Title: Deep neural networks-based denoising models for CT imaging and their efficacyComments: 13 pages, 9 figures, SPIE proceedingJournal-ref: Prabhat KC, Rongping Zeng, M. Mehdi Farhangi, Kyle J. Myers, "Deep neural networks-based denoising models for CT imaging and their efficacy," Proc. SPIE 11595, Medical Imaging 2021: Physics of Medical Imaging, 115950H (15 February 2021)Subjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [514] arXiv:2111.09560 [pdf, other]
-
Title: Adaptive Shrink-Mask for Text DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [515] arXiv:2111.09571 [pdf, other]
-
Title: Person Re-identification Method Based on Color Attack and Joint DefenceComments: Accepted by CVPR2022 Workshops (this https URL)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [516] arXiv:2111.09621 [pdf, other]
-
Title: SimpleTrack: Understanding and Rethinking 3D Multi-object TrackingSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [517] arXiv:2111.09624 [pdf, other]
-
Title: IMFNet: Interpretable Multimodal Fusion for Point Cloud RegistrationComments: Technical reportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [518] arXiv:2111.09635 [pdf, other]
-
Title: Automatic Neural Network Pruning that Efficiently Preserves the Model AccuracyComments: 11 pages, 6 figures, 5 tables, accepted in AAAI2023 Workshop (Practical AI)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [519] arXiv:2111.09641 [pdf, other]
-
Title: Evaluating Transformers for Lightweight Action RecognitionComments: pre-printSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [520] arXiv:2111.09692 [pdf, other]
-
Title: SUB-Depth: Self-distillation and Uncertainty Boosting Self-supervised Monocular Depth EstimationComments: bmvc versionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [521] arXiv:2111.09733 [pdf, other]
-
Title: Perceiving and Modeling Density is All You Need for Image DehazingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [522] arXiv:2111.09734 [pdf, other]
-
Title: ClipCap: CLIP Prefix for Image CaptioningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [523] arXiv:2111.09740 [pdf, other]
-
Title: Interactive segmentation using U-Net with weight map and dynamic user interactionsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [524] arXiv:2111.09748 [pdf, other]
-
Title: The Way to my Heart is through Contrastive Learning: Remote Photoplethysmography from Unlabelled VideoComments: Code available at this https URLJournal-ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 3995-4004Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [525] arXiv:2111.09779 [pdf, other]
-
Title: Wiggling Weights to Improve the Robustness of ClassifiersSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [526] arXiv:2111.09797 [pdf, other]
-
Title: Boosting Supervised Learning Performance with Co-trainingComments: 2021 IEEE Intelligent Vehicles SymposiumSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [527] arXiv:2111.09799 [pdf, other]
-
Title: LiDAR Cluster First and Camera Inference Later: A New Perspective Towards Autonomous DrivingAuthors: Jiyang Chen, Simon Yu, Rohan Tabish, Ayoosh Bansal, Shengzhong Liu, Tarek Abdelzaher, Lui ShaComments: 6 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [528] arXiv:2111.09833 [pdf, other]
-
Title: TransMix: Attend to Mix for Vision TransformersComments: Code will be made publicly available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [529] arXiv:2111.09847 [pdf, other]
-
Title: Edge-preserving Domain Adaptation for semantic segmentation of Medical ImagesComments: 5 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [530] arXiv:2111.09862 [pdf, ps, other]
-
Title: Postdisaster image-based damage detection and repair cost estimation of reinforced concrete buildings using dual convolutional neural networksComments: 16 pages, 21 figuresJournal-ref: Computer Aided Civil and Infrastructure Engineering, 2020Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [531] arXiv:2111.09876 [pdf, other]
-
Title: One-Shot Generative Domain AdaptationComments: Technical ReportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [532] arXiv:2111.09881 [pdf, other]
-
Title: Restormer: Efficient Transformer for High-Resolution Image RestorationAuthors: Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan YangComments: Accepted at CVPR 2022. #CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [533] arXiv:2111.09883 [pdf, other]
-
Title: Swin Transformer V2: Scaling Up Capacity and ResolutionAuthors: Ze Liu, Han Hu, Yutong Lin, Zhuliang Yao, Zhenda Xie, Yixuan Wei, Jia Ning, Yue Cao, Zheng Zhang, Li Dong, Furu Wei, Baining GuoJournal-ref: CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [534] arXiv:2111.09886 [pdf, other]
-
Title: SimMIM: A Simple Framework for Masked Image ModelingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [535] arXiv:2111.09887 [pdf, other]
-
Title: PyTorchVideo: A Deep Learning Library for Video UnderstandingAuthors: Haoqi Fan, Tullie Murrell, Heng Wang, Kalyan Vasudev Alwala, Yanghao Li, Yilei Li, Bo Xiong, Nikhila Ravi, Meng Li, Haichuan Yang, Jitendra Malik, Ross Girshick, Matt Feiszli, Aaron Adcock, Wan-Yen Lo, Christoph FeichtenhoferComments: Technical reportSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [536] arXiv:2111.09888 [pdf, other]
-
Title: Simple but Effective: CLIP Embeddings for Embodied AIComments: Published in CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [537] arXiv:2111.09950 [pdf, other]
-
Title: Correcting Face Distortion in Wide-Angle VideosComments: Project website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [538] arXiv:2111.09957 [pdf, other]
-
Title: Rethinking Dilated Convolution for Real-time Semantic SegmentationAuthors: Roland GaoComments: CVPR 2023 Efficient CV workshopSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [539] arXiv:2111.09976 [pdf, other]
-
Title: M2A: Motion Aware Attention for Accurate Video Action RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [540] arXiv:2111.09985 [pdf, other]
-
Title: DeMFI: Deep Joint Deblurring and Multi-Frame Interpolation with Flow-Guided Attentive Correlation and Recursive BoostingComments: 18 pages, 16 figures, 4 tables, GitHub page: this https URLJournal-ref: ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [541] arXiv:2111.09996 [pdf, other]
-
Title: LOLNeRF: Learn from One LookComments: See this https URL for additional resultsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [542] arXiv:2111.09999 [pdf, other]
-
Title: TnT Attacks! Universal Naturalistic Adversarial Patches Against Deep Neural Network SystemsComments: Accepted for publication in the IEEE Transactions on Information Forensics & Security (TIFS)Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [543] arXiv:2111.10007 [pdf, other]
-
Title: FBNetV5: Neural Architecture Search for Multiple Tasks in One RunAuthors: Bichen Wu, Chaojian Li, Hang Zhang, Xiaoliang Dai, Peizhao Zhang, Matthew Yu, Jialiang Wang, Yingyan Lin, Peter VajdaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [544] arXiv:2111.10014 [pdf, other]
-
Title: CoCAtt: A Cognitive-Conditioned Driver Attention DatasetComments: 10 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [545] arXiv:2111.10017 [pdf, ps, other]
-
Title: Rethinking Query, Key, and Value Embedding in Vision Transformer under Tiny Model ConstraintsJournal-ref: Mathematics 2023, 11, 1933Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [546] arXiv:2111.10023 [pdf, other]
-
Title: UFO: A UniFied TransfOrmer for Vision-Language Representation LearningAuthors: Jianfeng Wang, Xiaowei Hu, Zhe Gan, Zhengyuan Yang, Xiyang Dai, Zicheng Liu, Yumao Lu, Lijuan WangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [547] arXiv:2111.10032 [pdf, other]
-
Title: Meta Clustering Learning for Large-scale Unsupervised Person Re-identificationAuthors: Xin Jin, Tianyu He, Xu Shen, Tongliang Liu, Xinchao Wang, Jianqiang Huang, Zhibo Chen, Xian-Sheng HuaComments: Accepted by ACMMM2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [548] arXiv:2111.10056 [pdf, other]
-
Title: Medical Visual Question Answering: A SurveyAuthors: Zhihong Lin, Donghao Zhang, Qingyi Tao, Danli Shi, Gholamreza Haffari, Qi Wu, Mingguang He, Zongyuan GeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [549] arXiv:2111.10075 [pdf, other]
-
Title: Enhanced countering adversarial attacks via input denoising and feature restoringSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [550] arXiv:2111.10079 [pdf, other]
-
Title: Evaluating Self and Semi-Supervised Methods for Remote Sensing Segmentation TasksSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [551] arXiv:2111.10101 [pdf, other]
-
Title: Deep Domain Adaptation for Pavement Crack DetectionComments: Published on IEEE Transactions on Intelligent Transportation SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [552] arXiv:2111.10127 [pdf, other]
-
Title: Neural Image Beauty Predictor Based on Bradley-Terry ModelComments: 16 pages 18 fiuguresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [553] arXiv:2111.10135 [pdf, other]
-
Title: Grounded Situation Recognition with TransformersComments: Accepted to BMVC 2021, Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [554] arXiv:2111.10137 [pdf, other]
-
Title: Learning to Detect Instance-level Salient Objects Using Complementary Image LabelsComments: to appear IJCV. arXiv admin note: text overlap with arXiv:2009.13898Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [555] arXiv:2111.10139 [pdf, other]
-
Title: More than Words: In-the-Wild Visually-Driven Prosody for Text-to-SpeechAuthors: Michael Hassid, Michelle Tadmor Ramanovich, Brendan Shillingford, Miaosen Wang, Ye Jia, Tal RemezSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [556] arXiv:2111.10146 [pdf, other]
-
Title: DVCFlow: Modeling Information Flow Towards Human-like Video CaptioningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [557] arXiv:2111.10204 [pdf, other]
-
Title: Augmentation of base classifier performance via HMMs on a handwritten character data setSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [558] arXiv:2111.10221 [pdf, other]
-
Title: Semi-Supervised Domain Generalization with Evolving Intermediate DomainComments: 13 pages, 13 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [559] arXiv:2111.10233 [pdf, other]
-
Title: Xp-GAN: Unsupervised Multi-object Controllable Video GenerationComments: 8 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [560] arXiv:2111.10250 [pdf, ps, other]
-
Title: Panoptic Segmentation: A ReviewAuthors: Omar Elharrouss, Somaya Al-Maadeed, Nandhini Subramanian, Najmath Ottakath, Noor Almaadeed, Yassine HimeurSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [561] arXiv:2111.10265 [pdf, other]
-
Title: ClevrTex: A Texture-Rich Benchmark for Unsupervised Multi-Object SegmentationComments: NeurIPS 2021 Datasets and BenchmarksSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [562] arXiv:2111.10293 [pdf, ps, other]
-
Title: A 3D 2D convolutional Neural Network Model for Hyperspectral Image ClassificationComments: arXiv admin note: text overlap with arXiv:1902.06701 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [563] arXiv:2111.10296 [pdf, other]
-
Title: Probabilistic Regression with Huber DistributionsComments: to be published at BMVC, 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [564] arXiv:2111.10320 [pdf, other]
-
Title: Toward Compact Parameter Representations for Architecture-Agnostic Neural Network CompressionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [565] arXiv:2111.10326 [pdf, other]
-
Title: Factorisation-based Image LabellingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [566] arXiv:2111.10332 [pdf, other]
-
Title: DSPoint: Dual-scale Point Cloud Recognition with High-frequency FusionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [567] arXiv:2111.10337 [pdf, other]
-
Title: Advancing High-Resolution Video-Language Representation with Large-Scale Video TranscriptionsAuthors: Hongwei Xue, Tiankai Hang, Yanhong Zeng, Yuchong Sun, Bei Liu, Huan Yang, Jianlong Fu, Baining GuoJournal-ref: published in CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [568] arXiv:2111.10339 [pdf, other]
-
Title: Bi-Mix: Bidirectional Mixing for Domain Adaptive Nighttime Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [569] arXiv:2111.10346 [pdf, other]
-
Title: Global and Local Alignment Networks for Unpaired Image-to-Image TranslationAuthors: Guanglei Yang, Hao Tang, Humphrey Shi, Mingli Ding, Nicu Sebe, Radu Timofte, Luc Van Gool, Elisa RicciSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [570] arXiv:2111.10399 [pdf, other]
-
Title: What Stops Learning-based 3D Registration from Working in the Real World?Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [571] arXiv:2111.10427 [pdf, other]
-
Title: DIVeR: Real-time and Accurate Neural Radiance Fields with Deterministic Integration for Volume RenderingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [572] arXiv:2111.10481 [pdf, other]
-
Title: PatchCensor: Patch Robustness Certification for Transformers via Exhaustive TestingComments: This paper has been accepted by ACM Transactions on Software Engineering and Methodology (TOSEM'23) in "Continuous Special Section: AI and SE." Please include TOSEM for any citationsJournal-ref: ACM Trans. Softw. Eng. Methodol. (2023)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
- [573] arXiv:2111.10493 [pdf, other]
-
Title: Discrete Representations Strengthen Vision Transformer RobustnessSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [574] arXiv:2111.10502 [pdf, other]
-
Title: CamLiFlow: Bidirectional Camera-LiDAR Fusion for Joint Optical Flow and Scene Flow EstimationComments: Accepted to CVPR 2022 (Oral)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [575] arXiv:2111.10520 [pdf, other]
-
Title: StylePart: Image-based Shape Part ManipulationComments: 10 pages, Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [576] arXiv:2111.10524 [pdf, other]
-
Title: ACR-Pose: Adversarial Canonical Representation Reconstruction Network for Category Level 6D Object Pose EstimationComments: 13 pages, 9 figures, 7 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [577] arXiv:2111.10531 [pdf, other]
-
Title: FAMINet: Learning Real-time Semi-supervised Video Object Segmentation with Steepest Optimized Optical FlowComments: Accepted by TIM (IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [578] arXiv:2111.10533 [pdf, other]
-
Title: Temporal-MPI: Enabling Multi-Plane Images for Dynamic Scene Modelling via Temporal Basis LearningComments: In ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [579] arXiv:2111.10544 [pdf, other]
-
Title: Towards Scalable Unpaired Virtual Try-On via Patch-Routed Spatially-Adaptive GANComments: 12 pages, 8 figures, 35th Conference on Neural Information Processing SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [580] arXiv:2111.10546 [pdf, ps, other]
-
Title: Delving into Rectifiers in Style-Based Image TranslationComments: 14 pages,14 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [581] arXiv:2111.10561 [pdf, other]
-
Title: Teacher-Student Training and Triplet Loss to Reduce the Effect of Drastic Face OcclusionComments: Accepted in Machine Vision and Applications. arXiv admin note: text overlap with arXiv:2008.01003Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [582] arXiv:2111.10563 [pdf, other]
-
Title: A Deeper Look into DeepCapComments: arXiv admin note: substantial text overlap with arXiv:2003.08325Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [583] arXiv:2111.10591 [pdf, other]
-
Title: AGA-GAN: Attribute Guided Attention Generative Adversarial Network with U-Net for Face HallucinationComments: 27 pages, 9 FiguresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [584] arXiv:2111.10602 [pdf, other]
-
Title: Unsupervised Domain Adaptation for Device-free Gesture RecognitionComments: The paper is submitted to the journal of IEEE Transactions on Mobile Computing. And it is still under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [585] arXiv:2111.10605 [pdf, other]
-
Title: Exploiting Multi-Scale Fusion, Spatial Attention and Patch Interaction Techniques for Text-Independent Writer IdentificationComments: 14 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [586] arXiv:2111.10612 [pdf, ps, other]
-
Title: A photosensor employing data-driven binning for ultrafast image recognitionAuthors: Lukas Mennel, Aday J. Molina-Mendoza, Matthias Paur, Dmitry K. Polyushkin, Dohyun Kwak, Miriam Giparakis, Maximilian Beiser, Aaron Maxwell Andrews, Thomas MuellerComments: 10 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Applied Physics (physics.app-ph)
- [587] arXiv:2111.10617 [pdf, other]
-
Title: Extracting Deformation-Aware Local Features by Learning to DeformComments: To appear in Proceedings of the Thirty-fifth Annual Conference on Neural Information Processing Systems (NeurIPS) 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [588] arXiv:2111.10621 [pdf, other]
-
Title: FlowVOS: Weakly-Supervised Visual Warping for Detail-Preserving and Temporally Consistent Single-Shot Video Object SegmentationComments: To appear at BMVC 2021; 13 pages, 4 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [589] arXiv:2111.10633 [pdf, other]
-
Title: Sparse Tensor-based Multiscale Representation for Point Cloud Geometry CompressionComments: 17 pages, 15 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [590] arXiv:2111.10634 [pdf, other]
-
Title: Identity-Preserving Pose-Robust Face Hallucination Through Face Subspace PriorComments: A shorter version of this paper has been submitted to IEEE Transactions on Image ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [591] arXiv:2111.10650 [pdf, ps, other]
-
Title: Simulated LiDAR Repositioning: a novel point cloud data augmentation methodComments: 10 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [592] arXiv:2111.10653 [pdf, ps, other]
-
Title: Real-time Human Detection Model for Edge DevicesComments: 19 pages 6 figures 9 TablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [593] arXiv:2111.10659 [pdf, other]
-
Title: Are Vision Transformers Robust to Patch Perturbations?Journal-ref: European Conference on Computer Vision (ECCV) , 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [594] arXiv:2111.10677 [pdf, other]
-
Title: VideoPose: Estimating 6D object pose from videosSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [595] arXiv:2111.10686 [pdf, other]
-
Title: Representing Prior Knowledge Using Randomly, Weighted Feature Networks for Visual Relationship DetectionComments: 9 pages, 2 figures, Accepted to CLeaR2022 at AAAI2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [596] arXiv:2111.10701 [pdf, other]
-
Title: Self-Supervised Point Cloud Completion via InpaintingComments: BMVC 2021 (Oral)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [597] arXiv:2111.10737 [pdf, other]
-
Title: 3D Visual Tracking Framework with Deep Learning for Asteroid ExplorationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [598] arXiv:2111.10747 [pdf, other]
-
Title: MaIL: A Unified Mask-Image-Language Trimodal Network for Referring Image SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [599] arXiv:2111.10759 [pdf, other]
-
Title: Adversarial Mask: Real-World Universal Adversarial Attack on Face Recognition ModelComments: 16 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [600] arXiv:2111.10780 [pdf, ps, other]
-
Title: FCOSR: A Simple Anchor-free Rotated Detector for Aerial Object DetectionComments: 10 pages, 6 tables, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [601] arXiv:2111.10794 [pdf, other]
-
Title: HoughCL: Finding Better Positive Pairs in Dense Self-supervised LearningComments: Accepted to ICML 2021 Workshop: Self-Supervised Learning for Reasoning and PerceptionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [602] arXiv:2111.10817 [pdf, other]
-
Title: Understanding Pixel-level 2D Image Semantics with 3D Keypoint Knowledge EngineAuthors: Yang You, Chengkun Li, Yujing Lou, Zhoujun Cheng, Liangwei Li, Lizhuang Ma, Weiming Wang, Cewu LuComments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence; To appear in upcoming issuesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [603] arXiv:2111.10844 [pdf, other]
-
Title: Denoised Internal Models: a Brain-Inspired Autoencoder against Adversarial AttacksAuthors: Kaiyuan Liu, Xingyu Li, Yurui Lai, Ge Zhang, Hang Su, Jiachen Wang, Chunxu Guo, Jisong Guan, Yi ZhouComments: 16 pages, 3 figuresJournal-ref: Machine Intelligence Research, vol. 19, no. 5, pp.456-471, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [604] arXiv:2111.10854 [pdf, other]
-
Title: XnODR and XnIDR: Two Accurate and Fast Fully Connected Layers For Convolutional Neural NetworksComments: 19 pages, 5 figures, 9 tables, 2 algorithmsJournal-ref: J Intell Robot Syst 109, 17 (2023)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [605] arXiv:2111.10866 [pdf, other]
-
Title: CpT: Convolutional Point Transformer for 3D Point Cloud ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [606] arXiv:2111.10882 [pdf, other]
-
Title: Geometry-Aware Multi-Task Learning for Binaural Audio Generation from VideoComments: Published in BMVC 2021, project page: this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [607] arXiv:2111.10916 [pdf, other]
-
Title: Video Content Swapping Using GANComments: 9 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [608] arXiv:2111.10917 [pdf, other]
-
Title: Deep Reinforced Attention Regression for Partial Sketch Based Image RetrievalComments: 2021 IEEE International Conference on Data Mining (ICDM)Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [609] arXiv:2111.10932 [pdf, other]
-
Title: Self-supervised Semi-supervised Learning for Data Labeling and Quality EvaluationComments: Accepted to NeurIPS 2021 DCAI WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [610] arXiv:2111.10943 [pdf, other]
-
Title: Model-Based Single Image Deep DehazingJournal-ref: 2022 IEEE International Conference on Image ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [611] arXiv:2111.10958 [pdf, other]
-
Title: MUM : Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object DetectionComments: Accept to CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [612] arXiv:2111.10961 [pdf, ps, other]
-
Title: MidNet: An Anchor-and-Angle-Free Detector for Oriented Ship Detection in Aerial ImagesComments: 9 pages, 5 figures, 5 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [613] arXiv:2111.10969 [pdf, ps, other]
-
Title: Medical Aegis: Robust adversarial protectors for medical imagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [614] arXiv:2111.10971 [pdf, other]
-
Title: Tracking Grow-Finish Pigs Across Large Pens Using Multiple CamerasAuthors: Aniket Shirke, Aziz Saifuddin, Achleshwar Luthra, Jiangong Li, Tawni Williams, Xiaodan Hu, Aneesh Kotnana, Okan Kocabalkanli, Narendra Ahuja, Angela Green-Miller, Isabella Condotta, Ryan N. Dilger, Matthew CaesarComments: 6 pages, 4 figures, Accepted at the CVPR 2021 CV4Animals workshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [615] arXiv:2111.10974 [pdf, other]
-
Title: Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask ArchitectureAuthors: Daria Bakshandaeva, Denis Dimitrov, Vladimir Arkhipkin, Alex Shonenkov, Mark Potanin, Denis Karachev, Andrey Kuznetsov, Anton Voronov, Vera Davydova, Elena Tutubalina, Aleksandr PetiushkoSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [616] arXiv:2111.10978 [pdf, other]
- [617] arXiv:2111.10984 [pdf, other]
-
Title: Topological Regularization for Dense PredictionSubjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG); Algebraic Topology (math.AT)
- [618] arXiv:2111.10985 [pdf, other]
-
Title: Efficient Non-Compression Auto-Encoder for Driving Noise-based Road Surface Anomaly DetectionComments: 8 pages, 5 figures, 6 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [619] arXiv:2111.10989 [pdf, other]
-
Title: Exploring Feature Representation Learning for Semi-supervised Medical Image SegmentationComments: Accepted by TNNLS (IEEE Transactions on Neural Networks and Learning Systems). Code available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [620] arXiv:2111.10990 [pdf, other]
-
Title: Imperceptible Transfer Attack and Defense on 3D Point Cloud ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [621] arXiv:2111.11011 [pdf, other]
-
Title: CDistNet: Perceiving Multi-Domain Character Distance for Robust Text RecognitionComments: Paper accepted for publication at IJCV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [622] arXiv:2111.11029 [pdf, other]
-
Title: Auto-Encoding Score Distribution Regression for Action Quality AssessmentSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [623] arXiv:2111.11044 [pdf, other]
-
Title: Exploring Segment-level Semantics for Online Phase Recognition from Surgical VideosComments: Appear in IEEE TMISubjects: Computer Vision and Pattern Recognition (cs.CV)
- [624] arXiv:2111.11046 [pdf, other]
-
Title: FRT-PAD: Effective Presentation Attack Detection Driven by Face Related TaskComments: Accepted by ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [625] arXiv:2111.11051 [pdf, other]
-
Title: Contrast-reconstruction Representation Learning for Self-supervised Skeleton-based Action RecognitionComments: Publised in IEEE TIP. (this https URL)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [626] arXiv:2111.11055 [pdf, other]
-
Title: Dense Uncertainty Estimation via an Ensemble-based Conditional Latent Variable ModelSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [627] arXiv:2111.11056 [pdf, other]
-
Title: Evaluating Adversarial Attacks on ImageNet: A Reality Check on Misclassification ClassesComments: Accepted for publication in 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Workshop on ImageNet: Past,Present, and FutureSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [628] arXiv:2111.11057 [pdf, other]
-
Title: Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing ImagesComments: Accepted to IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [629] arXiv:2111.11066 [pdf, other]
-
Title: FedCV: A Federated Learning Framework for Diverse Computer Vision TasksAuthors: Chaoyang He, Alay Dilipbhai Shah, Zhenheng Tang, Di Fan1Adarshan Naiynar Sivashunmugam, Keerti Bhogaraju, Mita Shimpi, Li Shen, Xiaowen Chu, Mahdi Soltanolkotabi, Salman AvestimehrComments: Federated Learning for Computer Vision, an application of FedML Ecosystem (fedml.ai)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [630] arXiv:2111.11067 [pdf, other]
-
Title: Semi-Supervised Vision TransformersComments: 16 pages, 4 figures, ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [631] arXiv:2111.11089 [pdf, other]
-
Title: Monocular Road Planar Parallax EstimationComments: Accepted by IEEE TIPSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [632] arXiv:2111.11103 [pdf, other]
-
Title: Improving Semantic Image Segmentation via Label Fusion in Semantically Textured MeshesAuthors: Florian Fervers, Timo Breuer, Gregor Stachowiak, Sebastian Bullinger, Christoph Bodensteiner, Michael ArensSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [633] arXiv:2111.11114 [pdf, other]
-
Title: Depth-aware Object Segmentation and Grasp Detection for Robotic Picking TasksSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [634] arXiv:2111.11124 [pdf, other]
-
Title: Mesa: A Memory-saving Training Framework for TransformersComments: Tech reportSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [635] arXiv:2111.11127 [pdf, other]
-
Title: Myope Models -- Are face presentation attack detection models short-sighted?Comments: Accepted at the 2ND WORKSHOP ON EXPLAINABLE & INTERPRETABLE ARTIFICIAL INTELLIGENCE FOR BIOMETRICS AT WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [636] arXiv:2111.11133 [pdf, other]
-
Title: L-Verse: Bidirectional Generation Between Image and TextAuthors: Taehoon Kim, Gwangmo Song, Sihaeng Lee, Sangyun Kim, Yewon Seo, Soonyoung Lee, Seung Hwan Kim, Honglak Lee, Kyunghoon BaeComments: Accepted to CVPR 2022 as Oral Presentation (18 pages, 14 figures, 4 tables)Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [637] arXiv:2111.11141 [pdf, other]
-
Title: Learning Generalized Visual Odometry Using Position-Aware Optical Flow and Geometric Bundle AdjustmentComments: 35 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [638] arXiv:2111.11186 [pdf, other]
-
Title: GB-CosFace: Rethinking Softmax-based Face Recognition from the Perspective of Open Set ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [639] arXiv:2111.11187 [pdf, other]
-
Title: PointMixer: MLP-Mixer for Point Cloud UnderstandingComments: Accepted to ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [640] arXiv:2111.11215 [pdf, other]
-
Title: Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields ReconstructionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [641] arXiv:2111.11250 [pdf, other]
-
Title: Action Recognition with Domain Invariant Features of Skeleton ImageSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
- [642] arXiv:2111.11260 [pdf, ps, other]
-
Title: MiNet: A Convolutional Neural Network for Identifying and Categorising MineralsJournal-ref: Agangiba, M.A., Asiedu, E.B. and Aikins, D., 2020. MiNet: A Convolutional Neural Network for Identifying and Categorising Minerals. Ghana Journal of Technology, 5(1), pp.86-92Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [643] arXiv:2111.11280 [pdf, other]
-
Title: Point Cloud Color ConstancyComments: 10 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [644] arXiv:2111.11288 [pdf, other]
-
Title: SSR: An Efficient and Robust Framework for Learning with Unknown Label NoiseComments: Accepted to BMVC2022Journal-ref: https://bmvc2022.mpi-inf.mpg.de/372/Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [645] arXiv:2111.11322 [pdf, other]
-
Title: Contour-guided Image Completion with Perceptual GroupingAuthors: Morteza Rezanejad, Sidharth Gupta, Chandra Gummaluru, Ryan Marten, John Wilder, Michael Gruninger, Dirk B. WaltherSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [646] arXiv:2111.11326 [pdf, other]
-
Title: DyTox: Transformers for Continual Learning with DYnamic TOken eXpansionComments: CVPR 2022, Code at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [647] arXiv:2111.11348 [pdf, other]
-
Title: Paris-CARLA-3D: A Real and Synthetic Outdoor Point Cloud Dataset for Challenging Tasks in 3D MappingAuthors: Jean-Emmanuel Deschaud, David Duque, Jean Pierre Richa, Santiago Velasco-Forero, Beatriz Marcotegui, and François GouletteComments: 24 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [648] arXiv:2111.11350 [pdf, ps, other]
-
Title: ShufaNet: Classification method for calligraphers who have reached the professional levelComments: 10pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [649] arXiv:2111.11366 [pdf, other]
-
Title: FFNB: Forgetting-Free Neural Blocks for Deep Continual Visual LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [650] arXiv:2111.11368 [pdf, other]
-
Title: Adversarial Examples on Segmentation Models Can be Easy to TransferSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [651] arXiv:2111.11388 [pdf, other]
-
Title: Conifer Seedling Detection in UAV-Imagery with RGB-Depth InformationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [652] arXiv:2111.11397 [pdf, other]
-
Title: Lebanon Solar Rooftop Potential Assessment using Buildings Segmentation from Aerial ImagesAuthors: Hasan Nasrallah, Abed Ellatif Samhat, Yilei Shi, Xiaoxiang Zhu, Ghaleb Faour, Ali J. GhandourSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [653] arXiv:2111.11398 [pdf, other]
-
Title: Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream TasksComments: Code available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [654] arXiv:2111.11418 [pdf, other]
-
Title: MetaFormer Is Actually What You Need for VisionAuthors: Weihao Yu, Mi Luo, Pan Zhou, Chenyang Si, Yichen Zhou, Xinchao Wang, Jiashi Feng, Shuicheng YanComments: CVPR 2022 (Oral). Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [655] arXiv:2111.11426 [pdf, other]
-
Title: Neural Fields in Visual Computing and BeyondAuthors: Yiheng Xie, Towaki Takikawa, Shunsuke Saito, Or Litany, Shiqin Yan, Numair Khan, Federico Tombari, James Tompkin, Vincent Sitzmann, Srinath SridharComments: Equal advising: Vincent Sitzmann and Srinath SridharSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [656] arXiv:2111.11429 [pdf, other]
-
Title: Benchmarking Detection Transfer Learning with Vision TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [657] arXiv:2111.11430 [pdf, other]
-
Title: Class-agnostic Object Detection with Multi-modal TransformerAuthors: Muhammad Maaz, Hanoona Rasheed, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, Ming-Hsuan YangComments: Accepted at ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [658] arXiv:2111.11431 [pdf, other]
-
Title: RedCaps: web-curated image-text data created by the people, for the peopleComments: NeurIPS 2021 Datasets and Benchmarks. Website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [659] arXiv:2111.11432 [pdf, other]
-
Title: Florence: A New Foundation Model for Computer VisionAuthors: Lu Yuan, Dongdong Chen, Yi-Ling Chen, Noel Codella, Xiyang Dai, Jianfeng Gao, Houdong Hu, Xuedong Huang, Boxin Li, Chunyuan Li, Ce Liu, Mengchen Liu, Zicheng Liu, Yumao Lu, Yu Shi, Lijuan Wang, Jianfeng Wang, Bin Xiao, Zhen Xiao, Jianwei Yang, Michael Zeng, Luowei Zhou, Pengchuan ZhangSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [660] arXiv:2111.11433 [pdf, other]
-
Title: Towards Tokenized Human Dynamics RepresentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [661] arXiv:2111.11481 [pdf, ps, other]
-
Title: Real-time ground filtering algorithm of cloud points acquired using Terrestrial Laser Scanner (TLS)Comments: 25 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [662] arXiv:2111.11491 [pdf, other]
-
Title: Image Based Reconstruction of Liquids from 2D Surface DetectionsComments: 14 pages, 11 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [663] arXiv:2111.11535 [pdf, other]
-
Title: Ice hockey player identification via transformers and weakly supervised learningComments: CVSports 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [664] arXiv:2111.11546 [pdf, other]
-
Title: Lightweight Transformer Backbone for Medical Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [665] arXiv:2111.11547 [pdf, other]
-
Title: Camera Measurement of Physiological Vital SignsAuthors: Daniel McDuffSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
- [666] arXiv:2111.11567 [pdf, other]
-
Title: ATLANTIS: A Benchmark for Semantic Segmentation of Waterbody ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [667] arXiv:2111.11578 [pdf, other]
-
Title: Generative Adversarial Networks for Astronomical Images GenerationSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [668] arXiv:2111.11591 [pdf, other]
-
Title: Efficient Video Transformers with Spatial-Temporal Token SelectionComments: Accepted by ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [669] arXiv:2111.11595 [pdf, other]
-
Title: Semi-Supervised Learning with Taxonomic LabelsComments: BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [670] arXiv:2111.11604 [pdf, other]
-
Title: Simultaneous face detection and 360 degree headpose estimationComments: Accepted at The 13th International Conference on Knowledge and Systems Engineering (KSE 2021), 7 pages, 2 figures, 3 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [671] arXiv:2111.11615 [pdf, other]
-
Title: PointCrack3D: Crack Detection in Unstructured Environments using a 3D-Point-Cloud-Based Deep Neural NetworkComments: submitted, to be publishedSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [672] arXiv:2111.11616 [pdf, other]
-
Title: Using mixup as regularization and tuning hyper-parameters for ResNetsAuthors: Venkata Bhanu Teja PallakondaComments: 6 pages, 7 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [673] arXiv:2111.11625 [pdf, other]
-
Title: Learning Dynamic Compact Memory Embedding for Deformable Visual Object TrackingAuthors: Pengfei Zhu, Hongtao Yu, Kaihua Zhang, Yu Wang, Shuai Zhao, Lei Wang, Tianzhu Zhang, Qinghua HuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [674] arXiv:2111.11629 [pdf, other]
-
Title: Uncertainty-Aware Deep Co-training for Semi-supervised Medical Image SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [675] arXiv:2111.11631 [pdf, other]
-
Title: Self-Regulated Learning for Egocentric Video Activity AnticipationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [676] arXiv:2111.11646 [pdf, other]
-
Title: CytoImageNet: A large-scale pretraining dataset for bioimage transfer learningComments: Accepted paper at NeurIPS 2021 Learning Meaningful Representations for Life (LMRL) WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
- [677] arXiv:2111.11653 [pdf, other]
-
Title: Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video AnalysisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [678] arXiv:2111.11656 [pdf, other]
-
Title: Few-Shot Object Detection via Association and DIscriminationComments: NeurIPS 2021 Camera ReadySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [679] arXiv:2111.11660 [pdf, other]
-
Title: Non-invasive hemodynamic analysis for aortic regurgitation using computational fluid dynamics and deep learningAuthors: Derek Long, Cameron McMurdo, Edward Ferdian, Charlene A. Mauger, David Marlevi, Alistair A. Young, Martyn P. NashSubjects: Computer Vision and Pattern Recognition (cs.CV); Computational Physics (physics.comp-ph)
- [680] arXiv:2111.11672 [pdf, other]
-
Title: Few-shot Image Generation with Mixup-based Distance LearningComments: ECCV 2022, 27 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [681] arXiv:2111.11691 [pdf, other]
-
Title: HybridGazeNet: Geometric model guided Convolutional Neural Networks for gaze estimationComments: 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [682] arXiv:2111.11704 [pdf, other]
-
Title: Deep Point Cloud ReconstructionComments: ICLR 2022 acceptedSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [683] arXiv:2111.11709 [pdf, other]
-
Title: A Multi-Stage model based on YOLOv3 for defect detection in PV panels based on IR and Visible Imaging by Unmanned Aerial VehicleComments: Submitted to Elsevier. Under ReviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [684] arXiv:2111.11718 [pdf, other]
- [685] arXiv:2111.11720 [pdf, ps, other]
-
Title: Gait Identification under Surveillance Environment based on Human SkeletonSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [686] arXiv:2111.11723 [pdf, ps, other]
-
Title: A new dynamical model for solving rotation averaging problemSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [687] arXiv:2111.11734 [pdf, other]
-
Title: IR Motion DeblurringSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [688] arXiv:2111.11736 [pdf, other]
-
Title: Tensor Component Analysis for Interpreting the Latent Space of GANsComments: BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [689] arXiv:2111.11739 [pdf, other]
-
Title: AdaFusion: Visual-LiDAR Fusion with Adaptive Weights for Place RecognitionComments: 8 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [690] arXiv:2111.11745 [pdf, other]
-
Title: Intriguing Findings of Frequency Selection for Image DeblurringComments: AAAI 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [691] arXiv:2111.11747 [pdf, other]
-
Title: Semi-Online Knowledge DistillationComments: Accepted to BMVC2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [692] arXiv:2111.11759 [pdf, other]
-
Title: ReGroup: Recursive Neural Networks for Hierarchical Grouping of Vector Graphic PrimitivesSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [693] arXiv:2111.11771 [pdf, other]
-
Title: A self-training framework for glaucoma grading in OCT B-scansComments: 5 pages, 4 figures, 3 tables, 2 algorithms, international conferenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [694] arXiv:2111.11783 [pdf, other]
-
Title: GenReg: Deep Generative Method for Fast Point Cloud RegistrationComments: Technical reportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [695] arXiv:2111.11794 [pdf, ps, other]
-
Title: Introduction to Presentation Attack Detection in Face Biometrics and Recent AdvancesComments: Chapter of the Handbook of Biometric Anti-Spoofing (Third Edition)Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [696] arXiv:2111.11802 [pdf, other]
-
Title: Pruning Self-attentions into Convolutional Layers in Single PathComments: Accepted by TPAMI 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [697] arXiv:2111.11821 [pdf, other]
-
Title: Learning Representation for Clustering via Prototype Scattering and Positive SamplingComments: Accepted by TPAMI 2022Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [698] arXiv:2111.11827 [pdf, other]
-
Title: A General Divergence Modeling Strategy for Salient Object DetectionComments: Code is available at: this https URLJournal-ref: ACCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [699] arXiv:2111.11837 [pdf, other]
-
Title: Focal and Global Knowledge Distillation for DetectorsComments: Accept by CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [700] arXiv:2111.11843 [pdf, other]
-
Title: U-shape Transformer for Underwater Image EnhancementComments: under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [701] arXiv:2111.11854 [pdf, ps, other]
-
Title: Compresion y analisis de imagenes por medio de algoritmos para la ganaderia de precisionComments: in SpanishSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [702] arXiv:2111.11856 [pdf, other]
-
Title: Spatio-Temporal Split Learning for Autonomous Aerial Surveillance using Urban Air Mobility (UAM) NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Networking and Internet Architecture (cs.NI)
- [703] arXiv:2111.11862 [pdf, other]
-
Title: Inferring User Facial Affect in Work-like SettingsSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [704] arXiv:2111.11863 [pdf, other]
-
Title: Explainable Deep Image Classifiers for Skin Lesion DiagnosisAuthors: Carlo Metta, Andrea Beretta, Riccardo Guidotti, Yuan Yin, Patrick Gallinari, Salvatore Rinzivillo, Fosca GiannottiSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [705] arXiv:2111.11870 [pdf, other]
-
Title: DBIA: Data-free Backdoor Injection Attack against Transformer NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [706] arXiv:2111.11879 [pdf, other]
-
Title: Weakly-Supervised Cloud Detection with Fixed-Point GANsComments: Accepted to the 3rd IEEE Workshop on Machine Learning for Big Data Analytics in Remote SensingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [707] arXiv:2111.11892 [pdf, other]
-
Title: LMGP: Lifted Multicut Meets Geometry Projections for Multi-Camera Multi-Object TrackingComments: Official version for CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [708] arXiv:2111.11899 [pdf, ps, other]
-
Title: Results of improved fractional/integer order PDE-based binarization modelAuthors: Uche A. NnolimComments: 10 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [709] arXiv:2111.11927 [pdf, other]
-
Title: Hierarchical Graph Networks for 3D Human Pose EstimationAuthors: Han Li, Bowen Shi, Wenrui Dai, Yabo Chen, Botao Wang, Yu Sun, Min Guo, Chenlin Li, Junni Zou, Hongkai XiongComments: accepted by BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [710] arXiv:2111.11940 [pdf, other]
-
Title: PAM: Pose Attention Module for Pose-Invariant Face RecognitionComments: 10 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [711] arXiv:2111.11952 [pdf, other]
-
Title: Leveraging Selective Prediction for Reliable Image GeolocationComments: Accepted to the 28th International Conference on MultiMedia Modeling (MMM' 22)Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [712] arXiv:2111.11969 [pdf, other]
-
Title: Lifting 2D Human Pose to 3D with Domain Adapted 3D Body ConceptComments: 15 pages, a paper submitted to IJCVJournal-ref: Int J Comput Vis 131 (2023) 1250 - 1268Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [713] arXiv:2111.11976 [pdf, other]
-
Title: KTNet: Knowledge Transfer for Unpaired 3D Shape CompletionJournal-ref: AAAI2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [714] arXiv:2111.11992 [pdf, ps, other]
-
Title: Sparse Fusion for Multimodal TransformersComments: 11 pages, 4 figures, 5 tables, Yi Ding and Alex Rich contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [715] arXiv:2111.12073 [pdf, other]
-
Title: Multi-Person 3D Motion Prediction with Multi-Range TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [716] arXiv:2111.12077 [pdf, other]
-
Title: Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance FieldsComments: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [717] arXiv:2111.12082 [pdf, other]
-
Title: PhysFormer: Facial Video-based Physiological Measurement with Temporal Difference TransformerComments: Accepted by CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [718] arXiv:2111.12084 [pdf, other]
-
Title: Self-Supervised Pre-Training for Transformer-Based Person Re-IdentificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [719] arXiv:2111.12085 [pdf, other]
-
Title: UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language ModelingAuthors: Zhengyuan Yang, Zhe Gan, Jianfeng Wang, Xiaowei Hu, Faisal Ahmed, Zicheng Liu, Yumao Lu, Lijuan WangComments: ECCV 2022 (Oral Presentation)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [720] arXiv:2111.12115 [pdf, other]
-
Title: Algorithmic Fairness in Face Morphing Attack DetectionComments: Accepted to WACVW2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [721] arXiv:2111.12122 [pdf, ps, other]
-
Title: Bounding Box-Free Instance Segmentation Using Semi-Supervised Learning for Generating a City-Scale Vehicle DatasetAuthors: Osmar Luiz Ferreira de Carvalho, Osmar Abílio de Carvalho Júnior, Anesmar Olino de Albuquerque, Nickolas Castro Santana, Dibio Leandro Borges, Roberto Arnaldo Trancoso Gomes, Renato Fontes GuimarãesComments: 38 pages, 10 figures, submitted to journalSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Databases (cs.DB)
- [722] arXiv:2111.12123 [pdf, other]
-
Title: MICS : Multi-steps, Inverse Consistency and Symmetric deep learning registration networkAuthors: Théo Estienne, Maria Vakalopoulou, Enzo Battistella, Theophraste Henry, Marvin Lerousseau, Amaury Leroy, Nikos Paragios, Eric DeutschComments: In submissionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [723] arXiv:2111.12126 [pdf, other]
-
Title: Panoptic Segmentation Meets Remote SensingAuthors: Osmar Luiz Ferreira de Carvalho, Osmar Abílio de Carvalho Júnior, Cristiano Rosa e Silva, Anesmar Olino de Albuquerque, Nickolas Castro Santana, Dibio Leandro Borges, Roberto Arnaldo Trancoso Gomes, Renato Fontes GuimarãesComments: 40 pages, 10 figures, submitted to journalSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Databases (cs.DB)
- [724] arXiv:2111.12155 [pdf, ps, other]
-
Title: In-field early disease recognition of potato late blight based on deep learning and proximal hyperspectral imagingAuthors: Chao Qi (1 and 2), Murilo Sandroni (3), Jesper Cairo Westergaard (4), Ea Høegh Riis Sundmark (5), Merethe Bagge (5), Erik Alexandersson (3), Junfeng Gao (1 and 6) ((1) Lincoln Agri-Robotics, Lincoln Institute for Agri-Food Technology, University of Lincoln, Lincoln, UK, (2) College of Engineering, Nanjing Agricultural University, Nanjing 210031, China, (3) Department of Plant Protection Biology, Swedish University of Agricultural Sciences, Alnarp, Sweden, (4) Department of Plant and Environmental Sciences, University of Copenhagen, Taastrup, Denmark, (5) Danespo Breeding Company, Give, Denmark, (6) Lincoln Centre for Autonomous System, University of Lincoln, Lincoln, UK)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [725] arXiv:2111.12167 [pdf, other]
-
Title: PT-VTON: an Image-Based Virtual Try-On Network with Progressive Pose Attention TransferComments: Short Version with 4 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [726] arXiv:2111.12172 [pdf, other]
-
Title: Multi-label Iterated Learning for Image Classification with Label AmbiguitySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [727] arXiv:2111.12221 [pdf, ps, other]
-
Title: Source-free unsupervised domain adaptation for cross-modality abdominal multi-organ segmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [728] arXiv:2111.12231 [pdf, other]
-
Title: Universal Deep Network for Steganalysis of Color Image based on Channel RepresentationComments: To be improved versionSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [729] arXiv:2111.12232 [pdf, other]
-
Title: PMSSC: Parallelizable multi-subset based self-expressive model for subspace clusteringSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [730] arXiv:2111.12233 [pdf, other]
-
Title: Scaling Up Vision-Language Pre-training for Image CaptioningSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [731] arXiv:2111.12242 [pdf, other]
-
Title: PU-Transformer: Point Cloud Upsampling TransformerComments: ACCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [732] arXiv:2111.12263 [pdf, other]
-
Title: APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic SegmentationComments: 12 pages, 7 figures, Accepted to IEEE Trans. on Multimedia. arXiv admin note: substantial text overlap with arXiv:2104.09216Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [733] arXiv:2111.12264 [pdf, other]
-
Title: Pixel-wise Energy-biased Abstention Learning for Anomaly Segmentation on Complex Urban Driving ScenesComments: ECCV 2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [734] arXiv:2111.12265 [pdf, other]
-
Title: Distribution Estimation to Automate Transformation Policies for Self-SupervisionComments: NeurIPS 2021 Workshop: Self-Supervised Learning - Theory and PracticeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [735] arXiv:2111.12273 [pdf, other]
-
Title: Sharpness-aware Quantization for Deep Neural NetworksComments: Tech reportSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [736] arXiv:2111.12276 [pdf, other]
-
Title: Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor LanguagesAuthors: Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo MasumuraComments: Accept as short paper at ACM MMAsia 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [737] arXiv:2111.12289 [pdf, ps, other]
-
Title: Real-time smart vehicle surveillance systemSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [738] arXiv:2111.12290 [pdf, other]
-
Title: Attention-based Dual-stream Vision Transformer for Radar Gait RecognitionComments: Under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [739] arXiv:2111.12292 [pdf, other]
-
Title: Improved Fine-Tuning by Better Leveraging Pre-Training DataSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [740] arXiv:2111.12293 [pdf, other]
-
Title: PTQ4ViT: Post-Training Quantization Framework for Vision Transformers with Twin Uniform QuantizationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [741] arXiv:2111.12294 [pdf, other]
-
Title: An Image Patch is a Wave: Phase-Aware Vision MLPComments: This paper is accepted by CVPR 2022 (oral presentation)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [742] arXiv:2111.12296 [pdf, other]
-
Title: Spatial-context-aware deep neural network for multi-class image classificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [743] arXiv:2111.12301 [pdf, other]
-
Title: Two-stage Rule-induction Visual Reasoning on RPMs with an Application to Video PredictionComments: Under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [744] arXiv:2111.12309 [pdf, other]
-
Title: RegionCL: Can Simple Region Swapping Contribute to Contrastive Learning?Comments: ECCV2022, 15 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [745] arXiv:2111.12315 [pdf, other]
-
Title: Dynamic Texture Recognition using PDV Hashing and Dictionary Learning on Multi-scale Volume Local Binary PatternComments: 5 pages, 1 figureSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [746] arXiv:2111.12320 [pdf, other]
-
Title: Consistency Regularization for Deep Face Anti-SpoofingAuthors: Zezheng Wang, Zitong Yu, Xun Wang, Yunxiao Qin, Jiahong Li, Chenxu Zhao, Zhen Lei, Xin Liu, Size Li, Zhongyuan WangComments: 10 tables, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [747] arXiv:2111.12325 [pdf, other]
-
Title: MonoPLFlowNet: Permutohedral Lattice FlowNet for Real-Scale 3D Scene FlowEstimation with Monocular ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [748] arXiv:2111.12330 [pdf, other]
-
Title: Hidden-Fold Networks: Random Recurrent Residuals Using Sparse SupermasksComments: 13 pages, 7 figures. Accepted to the British Machine Vision Conference (BMVC) 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [749] arXiv:2111.12341 [pdf, other]
-
Title: EvDistill: Asynchronous Events to End-task Learning via Bidirectional Reconstruction-guided Cross-modal Knowledge DistillationComments: CVPR 2021 (updated references in this version)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [750] arXiv:2111.12346 [pdf, other]
-
Title: Arbitrary Virtual Try-On Network: Characteristics Preservation and Trade-off between Body and ClothingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [751] arXiv:2111.12351 [pdf, other]
-
Title: Decoupling Visual-Semantic Feature Learning for Robust Scene Text RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [752] arXiv:2111.12358 [pdf, other]
-
Title: SPCL: A New Framework for Domain Adaptive Semantic Segmentation via Semantic Prototype-based Contrastive LearningComments: 23 pages, 9 figures; The code is publicly available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [753] arXiv:2111.12374 [pdf, other]
-
Title: MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video ParsingComments: ACM MM 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [754] arXiv:2111.12375 [pdf, other]
-
Title: Human Activity Recognition Using 3D Orthogonally-projected EfficientNet on Radar Time-Range-Doppler SignatureSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [755] arXiv:2111.12379 [pdf, other]
-
Title: Efficient Anomaly Detection Using Self-Supervised Multi-Cue TasksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [756] arXiv:2111.12385 [pdf, other]
-
Title: Space-Partitioning RANSACSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [757] arXiv:2111.12386 [pdf, other]
-
Title: One to Transfer All: A Universal Transfer Framework for Vision Foundation Model with Few DataComments: Technical ReportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [758] arXiv:2111.12389 [pdf, other]
-
Title: Track Boosting and Synthetic Data Aided Drone DetectionComments: Published at AVSS 2021Journal-ref: 2021 17th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [759] arXiv:2111.12405 [pdf, other]
-
Title: An Attack on Facial Soft-biometric Privacy EnhancementAuthors: Dailé Osorio-Roig, Christian Rathgeb, Pawel Drozdowski, Philipp Terhörst, Vitomir Štruc, Christoph BuschSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [760] arXiv:2111.12406 [pdf, other]
-
Title: Auto robust relative radiometric normalization via latent change noise modellingAuthors: Shiqi Liu, Lu Wang, Jie Lian, Ting chen, Cong Liu, Xuchen Zhan, Jintao Lu, Jie Liu, Ting Wang, Dong Geng, Hongwei Duan, Yuze TianSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [761] arXiv:2111.12417 [pdf, other]
-
Title: NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [762] arXiv:2111.12419 [pdf, other]
-
Title: NAM: Normalization-based Attention ModuleComments: 3 pages, 2 figures, 2 tables, 2 tables in the appendixSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [763] arXiv:2111.12448 [pdf, other]
-
Title: 3D Shape Variational Autoencoder Latent Disentanglement via Mini-Batch Feature Swapping for Bodies and FacesComments: Accepted for publication at CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [764] arXiv:2111.12449 [pdf, other]
-
Title: Background-Click Supervision for Temporal Action LocalizationComments: To appear at TPAMISubjects: Computer Vision and Pattern Recognition (cs.CV)
- [765] arXiv:2111.12460 [pdf, other]
-
Title: ViCE: Improving Dense Representation Learning by Superpixelization and Contrasting Cluster AssignmentAuthors: Robin Karlsson, Tomoki Hayashi, Keisuke Fujii, Alexander Carballo, Kento Ohtani, Kazuya TakedaComments: Accepted for BMVC 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [766] arXiv:2111.12465 [pdf, ps, other]
-
Title: Introduction to Presentation Attack Detection in Iris Biometrics and Recent AdvancesComments: Chapter of the Handbook of Biometric Anti-Spoofing (Third Edition)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [767] arXiv:2111.12476 [pdf, other]
-
Title: Hierarchical Modular Network for Video CaptioningComments: Accepted by CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [768] arXiv:2111.12480 [pdf, other]
-
Title: Octree Transformer: Autoregressive 3D Shape Generation on Hierarchically Structured SequencesSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [769] arXiv:2111.12485 [pdf, other]
-
Title: Understanding the Dynamics of DNNs Using Graph ModularityComments: Accepted by ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [770] arXiv:2111.12488 [pdf, other]
-
Title: Intuitive Shape Editing in Latent SpaceSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [771] arXiv:2111.12498 [pdf, other]
-
Title: Meta Mask Correction for Nuclei Segmentation in Histopathological ImageComments: Accepted by BIBM 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [772] arXiv:2111.12502 [pdf, other]
-
Title: TriStereoNet: A Trinocular Framework for Multi-baseline Disparity EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [773] arXiv:2111.12503 [pdf, other]
-
Title: Extracting Triangular 3D Models, Materials, and Lighting From ImagesAuthors: Jacob Munkberg, Jon Hasselgren, Tianchang Shen, Jun Gao, Wenzheng Chen, Alex Evans, Thomas Müller, Sanja FidlerComments: Project website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [774] arXiv:2111.12525 [pdf, other]
-
Title: Causality-inspired Single-source Domain Generalization for Medical Image SegmentationComments: This is an early, non-peer-reviewed version. For the final peer-reviewed full version that has been substantially revised, please find: this https URL Please find the code at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [775] arXiv:2111.12527 [pdf, other]
-
Title: MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation LearningAuthors: David Junhao Zhang, Kunchang Li, Yali Wang, Yunpeng Chen, Shashwat Chandra, Yu Qiao, Luoqi Liu, Mike Zheng ShouComments: ECCV2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [776] arXiv:2111.12544 [pdf, other]
-
Title: LDDMM meets GANs: Generative Adversarial Networks for diffeomorphic registrationSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [777] arXiv:2111.12577 [pdf, other]
-
Title: A Method for Evaluating Deep Generative Models of Images via Assessing the Reproduction of High-order Spatial ContextComments: The paper is under consideration at Pattern Recognition Letters. Early version with preliminary results was accepted for poster presentation at SPIE-MI 2022. This version on arXiv contains new and updated designs of stochastic models, their mathematical representations and the corresponding results. Data from the designed ensembles available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
- [778] arXiv:2111.12580 [pdf, other]
-
Title: UDA-COPE: Unsupervised Domain Adaptation for Category-level Object Pose EstimationAuthors: Taeyeop Lee, Byeong-Uk Lee, Inkyu Shin, Jaesung Choe, Ukcheol Shin, In So Kweon, Kuk-Jin YoonComments: Accepted to CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [779] arXiv:2111.12583 [pdf, other]
-
Title: Optimizing Latent Space Directions For GAN-based Local Image EditingComments: 4 pages, 5 figures, 1 tableSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [780] arXiv:2111.12591 [pdf, other]
-
Title: Lepard: Learning partial point cloud matching in rigid and deformable scenesComments: Accepted to CVPR'2022. Code and data: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [781] arXiv:2111.12594 [pdf, other]
-
Title: Conditional Object-Centric Learning from VideoAuthors: Thomas Kipf, Gamaleldin F. Elsayed, Aravindh Mahendran, Austin Stone, Sara Sabour, Georg Heigold, Rico Jonschkowski, Alexey Dosovitskiy, Klaus GreffComments: Published at ICLR 2022. Project page at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [782] arXiv:2111.12602 [pdf, other]
-
Title: Hierarchical Graph-Convolutional Variational AutoEncoding for Generative Modelling of Human MotionAuthors: Anthony Bourached, Robert Gray, Xiaodong Guan, Ryan-Rhys Griffiths, Ashwani Jha, Parashkev NachevComments: Under ReviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Probability (math.PR)
- [783] arXiv:2111.12608 [pdf, other]
-
Title: Cerberus Transformer: Joint Semantic, Affordance and Attribute ParsingComments: Accepted to CVPR 2022. code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [784] arXiv:2111.12609 [pdf, other]
-
Title: GreedyNASv2: Greedier Search with a Greedy Path FilterComments: Accepted to CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [785] arXiv:2111.12624 [pdf, other]
-
Title: Self-slimmed Vision TransformerComments: Accepted by ECCV 2022. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [786] arXiv:2111.12631 [pdf, other]
-
Title: Unity is strength: Improving the Detection of Adversarial Examples with Ensemble ApproachesAuthors: Francesco Craighero, Fabrizio Angaroni, Fabio Stella, Chiara Damiani, Marco Antoniotti, Alex GraudenziComments: Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [787] arXiv:2111.12643 [pdf, other]
-
Title: SM3D: Simultaneous Monocular Mapping and 3D DetectionComments: This paper is published on 2021 IEEE International Conference on Image Processing (ICIP 2021), this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [788] arXiv:2111.12661 [pdf, ps, other]
-
Title: Analysing Statistical methods for Automatic Detection of Image ForgerySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [789] arXiv:2111.12664 [pdf, other]
-
Title: MIO : Mutual Information Optimization using Self-Supervised Binary Contrastive LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [790] arXiv:2111.12681 [pdf, other]
-
Title: VIOLET : End-to-End Video-Language Transformers with Masked Visual-token ModelingComments: Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [791] arXiv:2111.12685 [pdf, other]
-
Title: EgoRenderer: Rendering Human Avatars from Egocentric Camera ImagesComments: ICCV 2021. this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [792] arXiv:2111.12696 [pdf, other]
-
Title: A Lightweight Graph Transformer Network for Human Mesh Reconstruction from 2D Human PoseComments: ACM Multimedia 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
- [793] arXiv:2111.12698 [pdf, other]
-
Title: Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-LabelingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [794] arXiv:2111.12701 [pdf, other]
-
Title: Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized CodesComments: 19 pages, 14 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [795] arXiv:2111.12702 [pdf, other]
- [796] arXiv:2111.12704 [pdf, other]
-
Title: Investigating Tradeoffs in Real-World Video Super-ResolutionComments: Tech report, 14 pages, 14 figures. Code can be found at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [797] arXiv:2111.12705 [pdf, other]
-
Title: MixSyn: Learning Composition and Style for Multi-Source Image SynthesisSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [798] arXiv:2111.12707 [pdf, other]
-
Title: MHFormer: Multi-Hypothesis Transformer for 3D Human Pose EstimationComments: Accepted by CVPR 2022. Open SourcedSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [799] arXiv:2111.12710 [pdf, other]
-
Title: PeCo: Perceptual Codebook for BERT Pre-training of Vision TransformersAuthors: Xiaoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu, Baining GuoComments: To appear at AAAI 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [800] arXiv:2111.12727 [pdf, other]
-
Title: Generating More Pertinent Captions by Leveraging Semantics and Style on Multi-Source DatasetsComments: Accepted to IJCVSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
- [801] arXiv:2111.12728 [pdf, other]
-
Title: Online Adaptation for Implicit Object Tracking and Shape Reconstruction in the WildComments: Accepted to RA-L 2022 & IROS 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [802] arXiv:2111.12731 [pdf, other]
-
Title: Human Pose Manipulation and Novel View Synthesis using Differentiable RenderingComments: Accepted at Face and Gesture 2021, 8 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [803] arXiv:2111.12747 [pdf, other]
-
Title: Layered Controllable Video GenerationComments: This paper has been accepted to ECCV 2022 as an Oral paperSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [804] arXiv:2111.12757 [pdf, other]
-
Title: ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image RetrievalComments: the paper is accepted by IEEE Transactions on Circuits and Systems for Video Technology, please refer this https URL for an updated versionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [805] arXiv:2111.12764 [pdf, other]
-
Title: Towards an Efficient Semantic Segmentation Method of ID Cards for Verification SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [806] arXiv:2111.12780 [pdf, other]
-
Title: Transferability Estimation using Bhattacharyya Class SeparabilityComments: Accepted for CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [807] arXiv:2111.12782 [pdf, other]
-
Title: Fast mesh denoising with data driven normal filtering using deep variational autoencodersComments: 12 pages, 12 figuresJournal-ref: IEEE Transactions on Industrial Informatics, Volume: 17, Issue: 2, Feb. 2021, Pages: 980 - 990Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [808] arXiv:2111.12792 [pdf, other]
-
Title: Improving the Perceptual Quality of 2D Animation InterpolationComments: published at ECCV2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [809] arXiv:2111.12805 [pdf, other]
-
Title: Application of deep learning to camera trap data for ecologists in planning / engineering -- Can captivity imagery train a model which generalises to the wild?Comments: Submitted to Big Data 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [810] arXiv:2111.12824 [pdf, other]
-
Title: Cross Your Body: A Cognitive Assessment System for ChildrenComments: Accepted in ISVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [811] arXiv:2111.12853 [pdf, other]
-
Title: Domain Prompt Learning for Efficiently Adapting CLIP to Unseen DomainsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [812] arXiv:2111.12855 [pdf, other]
-
Title: Robust Equivariant Imaging: a fully unsupervised framework for learning to image from noisy and partial measurementsComments: CVPR 2022. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
- [813] arXiv:2111.12866 [pdf, other]
-
Title: Uncertainty Aware Proposal Segmentation for Unknown Object DetectionComments: Accepted to WACV 2022 DNOW WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [814] arXiv:2111.12872 [pdf, other]
-
Title: Less is More: Generating Grounded Navigation Instructions from LandmarksAuthors: Su Wang, Ceslee Montgomery, Jordi Orbay, Vighnesh Birodkar, Aleksandra Faust, Izzeddin Gur, Natasha Jaques, Austin Waters, Jason Baldridge, Peter AndersonComments: CVPR 2022 Camera-readySubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [815] arXiv:2111.12873 [pdf, other]
-
Title: Quantised Transforming Auto-Encoders: Achieving Equivariance to Arbitrary Transformations in Deep NetworksComments: BMVC 2021 | Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [816] arXiv:2111.12878 [pdf, other]
-
Title: Multiway Non-rigid Point Cloud Registration via Learned Functional Map SynchronizationJournal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [817] arXiv:2111.12880 [pdf, other]
-
Title: Active Learning at the ImageNet ScaleAuthors: Zeyad Ali Sami Emam, Hong-Min Chu, Ping-Yeh Chiang, Wojciech Czaja, Richard Leapman, Micah Goldblum, Tom GoldsteinSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [818] arXiv:2111.12888 [pdf, other]
-
Title: Effectiveness of Detection-based and Regression-based Approaches for Estimating Mask-Wearing RatioSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [819] arXiv:2111.12890 [pdf, other]
-
Title: V2C: Visual Voice CloningComments: 15 pages, 14 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [820] arXiv:2111.12892 [pdf, other]
-
Title: Attend to Who You Are: Supervising Self-Attention for Keypoint Detection and Instance-Aware AssociationAuthors: Sen Yang, Zhicheng Wang, Ze Chen, Yanjie Li, Shoukui Zhang, Zhibin Quan, Shu-Tao Xia, Yiping Bao, Erjin Zhou, Wankou YangComments: 16 pages, 9 figures, 7 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [821] arXiv:2111.12903 [pdf, other]
-
Title: Perturbed and Strict Mean Teachers for Semi-supervised Semantic SegmentationComments: CVPR 2022 camera-readySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [822] arXiv:2111.12905 [pdf, other]
-
Title: CIRCLE: Convolutional Implicit Reconstruction and Completion for Large-scale Indoor SceneSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
- [823] arXiv:2111.12911 [pdf, other]
-
Title: Human and Scene Motion Deblurring using Pseudo-blur SynthesizerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [824] arXiv:2111.12912 [pdf, other]
-
Title: A War Beyond Deepfake: Benchmarking Facial Counterfeits and CountermeasuresAuthors: Minh Tam Pham, Thanh Trung Huynh, Van Vinh Tong, Thanh Tam Nguyen, Thanh Thi Nguyen, Hongzhi Yin, Quoc Viet Hung NguyenSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [825] arXiv:2111.12918 [pdf, other]
-
Title: ACPL: Anti-curriculum Pseudo-labelling for Semi-supervised Medical Image ClassificationComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [826] arXiv:2111.12924 [pdf, other]
-
Title: Joint stereo 3D object detection and implicit surface reconstructionSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Robotics (cs.RO)
- [827] arXiv:2111.12925 [pdf, other]
-
Title: ContourletNet: A Generalized Rain Removal Architecture Using Multi-Direction Hierarchical RepresentationComments: This paper is accepted by BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
- [828] arXiv:2111.12927 [pdf, other]
-
Title: Rethinking Generic Camera Models for Deep Single Image Camera Calibration to Recover Rotation and Fisheye DistortionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [829] arXiv:2111.12928 [pdf, other]
-
Title: Facial Depth and Normal Estimation using Single Dual-Pixel CameraComments: Github page : this https URL To be appeared in ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [830] arXiv:2111.12933 [pdf, other]
-
Title: ML-Decoder: Scalable and Versatile Classification HeadSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [831] arXiv:2111.12940 [pdf, other]
-
Title: Towards Fewer Annotations: Active Learning via Region Impurity and Prediction Uncertainty for Domain Adaptive Semantic SegmentationComments: CVPR 2022 camera-ready version. The code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [832] arXiv:2111.12941 [pdf, other]
-
Title: Exploiting Both Domain-specific and Invariant Knowledge via a Win-win Transformer for Unsupervised Domain AdaptationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [833] arXiv:2111.12958 [pdf, other]
-
Title: Self-Distilled Self-Supervised Representation LearningComments: WACV 23, 11 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [834] arXiv:2111.12960 [pdf, other]
-
Title: Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A BenchmarkComments: This paper has been accepted by IEEE Transactions on Geoscience and Remote Sensing. Qian Yin and Qingyong Hu have equal contributions to this work and are co-first authors. The dataset is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [835] arXiv:2111.12971 [pdf, other]
-
Title: Natural & Adversarial Bokeh Rendering via Circle-of-Confusion Predictive NetworkComments: 11 pages, accepted by TMMSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [836] arXiv:2111.12982 [pdf, other]
-
Title: CDNet is all you need: Cascade DCN based underwater object detection RCNNAuthors: Di ChangComments: 6 pages, 6 figures. arXiv admin note: text overlap with arXiv:1906.09756 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [837] arXiv:2111.12983 [pdf, other]
-
Title: Investigation of domain gap problem in several deep-learning-based CT metal artefact reduction methodsSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [838] arXiv:2111.12993 [pdf, other]
-
Title: PolyViT: Co-training Vision Transformers on Images, Videos and AudioAuthors: Valerii Likhosherstov, Anurag Arnab, Krzysztof Choromanski, Mario Lucic, Yi Tay, Adrian Weller, Mostafa DehghaniSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [839] arXiv:2111.12994 [pdf, other]
-
Title: NomMer: Nominate Synergistic Context in Vision Transformer for Visual RecognitionComments: Accepted to CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [840] arXiv:2111.13010 [pdf, other]
-
Title: Attribute-specific Control Units in StyleGAN for Fine-grained Image ManipulationComments: ACM MultiMedia 2021.Project: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [841] arXiv:2111.13011 [pdf, other]
-
Title: Transferability Metrics for Selecting Source Model EnsemblesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [842] arXiv:2111.13023 [pdf, other]
-
Title: Rotation Equivariant 3D Hand Mesh Generation from a Single RGB ImageSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [843] arXiv:2111.13063 [pdf, other]
-
Title: MegLoc: A Robust and Accurate Visual Localization PipelineSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [844] arXiv:2111.13065 [pdf, other]
-
Title: Robust Object Detection with Multi-input Multi-output Faster R-CNNSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [845] arXiv:2111.13069 [pdf, other]
-
Title: Continual Active Learning Using Pseudo-Domains for Limited Labelling Resources and Changing Acquisition CharacteristicsComments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [846] arXiv:2111.13074 [pdf, other]
-
Title: Exploring Versatile Prior for Human Motion via Motion Frequency GuidanceComments: Accepted by 3DV2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [847] arXiv:2111.13078 [pdf, other]
-
Title: A Close Look at Few-shot Real Image Super-resolution from the Distortion Relation PerspectiveComments: 12 pages, first paper for few-shot real image super-resolutionSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [848] arXiv:2111.13087 [pdf, other]
-
Title: BoxeR: Box-Attention for 2D and 3D TransformersComments: In Proceeding of CVPR'2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [849] arXiv:2111.13089 [pdf, other]
-
Title: GeomNet: A Neural Network Based on Riemannian Geometries of SPD Matrix Space and Cholesky Space for 3D Skeleton-Based Interaction RecognitionAuthors: Xuan Son NguyenComments: Accepted in ICCV 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [850] arXiv:2111.13111 [pdf, other]
-
Title: Surface Segmentation Using Implicit Divergence Constraint Between Adjacent Minimal PathsSubjects: Computer Vision and Pattern Recognition (cs.CV); Functional Analysis (math.FA)
- [851] arXiv:2111.13112 [pdf, other]
-
Title: VaxNeRF: Revisiting the Classic for Voxel-Accelerated Neural Radiance FieldAuthors: Naruya Kondo, Yuya Ikeda, Andrea Tagliasacchi, Yutaka Matsuo, Yoichi Ochiai, Shixiang Shane GuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [852] arXiv:2111.13122 [pdf, other]
-
Title: GPR1200: A Benchmark for General-Purpose Content-Based Image RetrievalSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
- [853] arXiv:2111.13131 [pdf, other]
-
Title: Scene Graph Generation with Geometric ContextComments: Paper accepted at 6th IAPR International Conference on Computer Vision & Image Processing (CVIP2021), IIT Ropar, IndiaSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [854] arXiv:2111.13152 [pdf, other]
-
Title: Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene RepresentationsAuthors: Mehdi S. M. Sajjadi, Henning Meyer, Etienne Pot, Urs Bergmann, Klaus Greff, Noha Radwan, Suhani Vora, Mario Lucic, Daniel Duckworth, Alexey Dosovitskiy, Jakob Uszkoreit, Thomas Funkhouser, Andrea TagliasacchiComments: Accepted to CVPR 2022, Project website: this https URLJournal-ref: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
- [855] arXiv:2111.13154 [pdf, other]
-
Title: Country-wide Retrieval of Forest Structure From Optical and SAR Satellite Imagery With Deep EnsemblesAuthors: Alexander Becker, Stefania Russo, Stefano Puliti, Nico Lang, Konrad Schindler, Jan Dirk WegnerJournal-ref: ISPRS Journal of Photogrammetry and Remote Sensing, Volume 195, January 2023, Pages 269-286Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [856] arXiv:2111.13156 [pdf, other]
-
Title: Global Interaction Modelling in Vision Transformer via Super TokensSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [857] arXiv:2111.13157 [pdf, other]
-
Title: DA$^{\textbf{2}}$-Net : Diverse & Adaptive Attention Convolutional Neural NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [858] arXiv:2111.13163 [pdf, other]
-
Title: Semantic-Aware Generation for Self-Supervised Visual Representation LearningAuthors: Yunjie Tian, Lingxi Xie, Xiaopeng Zhang, Jiemin Fang, Haohang Xu, Wei Huang, Jianbin Jiao, Qi Tian, Qixiang YeComments: 13 pages, 5 figures, 11 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [859] arXiv:2111.13175 [pdf, other]
-
Title: Homogeneous Low-Resolution Face Recognition Method based Correlation FeaturesAuthors: Xuan ZhaoComments: 8 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [860] arXiv:2111.13176 [pdf, other]
-
Title: Using Color To Identify Insider ThreatsAuthors: Sameer KhannaSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [861] arXiv:2111.13184 [pdf, other]
-
Title: Multiple target tracking with interaction using an MCMC MRF Particle FilterSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [862] arXiv:2111.13196 [pdf, other]
-
Title: SwinBERT: End-to-End Transformers with Sparse Attention for Video CaptioningAuthors: Kevin Lin, Linjie Li, Chung-Ching Lin, Faisal Ahmed, Zhe Gan, Zicheng Liu, Yumao Lu, Lijuan WangComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [863] arXiv:2111.13213 [pdf, other]
-
Title: OTB-morph: One-Time Biometrics via Morphing applied to Face TemplatesSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [864] arXiv:2111.13216 [pdf, other]
-
Title: Cross-Domain Adaptive Teacher for Object DetectionAuthors: Yu-Jhe Li, Xiaoliang Dai, Chih-Yao Ma, Yen-Cheng Liu, Kan Chen, Bichen Wu, Zijian He, Kris Kitani, Peter VajdaComments: 10 pages including references. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [865] arXiv:2111.13230 [pdf, other]
-
Title: FedDropoutAvg: Generalizable federated learning for histopathology image classificationComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [866] arXiv:2111.13233 [pdf, other]
-
Title: Look at here : Utilizing supervision to attend subtle key regionsComments: Under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [867] arXiv:2111.13241 [pdf, other]
-
Title: Learning from Temporal Gradient for Semi-supervised Action RecognitionAuthors: Junfei Xiao, Longlong Jing, Lin Zhang, Ju He, Qi She, Zongwei Zhou, Alan Yuille, Yingwei LiComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [868] arXiv:2111.13244 [pdf, other]
-
Title: Going Grayscale: The Road to Understanding and Improving Unlearnable ExamplesAuthors: Zhuoran Liu, Zhengyu Zhao, Alex Kolmus, Tijn Berns, Twan van Laarhoven, Tom Heskes, Martha LarsonSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [869] arXiv:2111.13260 [pdf, other]
-
Title: NeSF: Neural Semantic Fields for Generalizable Semantic Segmentation of 3D ScenesAuthors: Suhani Vora, Noha Radwan, Klaus Greff, Henning Meyer, Kyle Genova, Mehdi S. M. Sajjadi, Etienne Pot, Andrea Tagliasacchi, Daniel DuckworthComments: Project website: this https URL Updated with minor edits to textSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [870] arXiv:2111.13279 [pdf, other]
-
Title: Disentangled Unsupervised Image Translation via Restricted Information FlowSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [871] arXiv:2111.13280 [pdf, other]
-
Title: Efficient Self-Ensemble for Semantic SegmentationAuthors: Walid Bousselham, Guillaume Thibault, Lucas Pagano, Archana Machireddy, Joe Gray, Young Hwan Chang, Xubo SongComments: Code available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [872] arXiv:2111.13285 [pdf, other]
-
Title: 3D Pose Estimation and Future Motion Prediction from 2D ImagesComments: Accepted by Pattern RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [873] arXiv:2111.13295 [pdf, other]
-
Title: Medial Spectral Coordinates for 3D Shape AnalysisAuthors: Morteza Rezanejad, Mohammad Khodadad, Hamidreza Mahyar, Herve Lombaert, Michael Gruninger, Dirk B. Walther, Kaleem SiddiqiSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [874] arXiv:2111.13307 [pdf, other]
-
Title: Self-supervised Correlation Mining Network for Person Image GenerationJournal-ref: A modified version compared with CVPR2022 versionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [875] arXiv:2111.13309 [pdf, other]
-
Title: Data Augmented 3D Semantic Scene Completion with 2D Segmentation PriorsComments: 10 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [876] arXiv:2111.13324 [pdf, ps, other]
-
Title: Hierarchical Motion Encoder-Decoder Network for Trajectory ForecastingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [877] arXiv:2111.13327 [pdf, other]
-
Title: Traditional Chinese Synthetic Datasets Verified with Labeled Data for Scene Text RecognitionComments: Accepted in ICPR Workshop DLVDR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [878] arXiv:2111.13333 [pdf, other]
-
Title: Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language ModelAuthors: Zipeng Xu, Tianwei Lin, Hao Tang, Fu Li, Dongliang He, Nicu Sebe, Radu Timofte, Luc Van Gool, Errui DingComments: To appear in CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [879] arXiv:2111.13336 [pdf, other]
- [880] arXiv:2111.13353 [pdf, other]
-
Title: Contrastive Vicinal Space for Unsupervised Domain AdaptationComments: 10 pages, 7 figures, 5 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [881] arXiv:2111.13359 [pdf, other]
-
Title: Neural Collaborative Graph Machines for Table Structure RecognitionComments: Accepted to CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [882] arXiv:2111.13362 [pdf, other]
-
Title: Data Invariants to Understand Unsupervised Out-of-Distribution DetectionComments: ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [883] arXiv:2111.13363 [pdf, other]
-
Title: PicArrange -- Visually Sort, Search, and Explore Private Images on a Mac ComputerComments: 5 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [884] arXiv:2111.13386 [pdf, other]
-
Title: POEM: 1-bit Point-wise Operations based on Expectation-Maximization for Efficient Point Cloud ProcessingComments: Accepted by BMVC 2021. arXiv admin note: text overlap with arXiv:2010.05501 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [885] arXiv:2111.13406 [pdf, other]
-
Title: Reinforcement Explanation LearningComments: Accepted in NeurIPS 2021 workshop on eXplainable AI approaches for debugging and diagnosis. Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [886] arXiv:2111.13410 [pdf, other]
-
Title: Modeling Annotator Preference and Stochastic Annotation Error for Medical Image SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [887] arXiv:2111.13424 [pdf, other]
-
Title: ContIG: Self-supervised Multimodal Contrastive Learning for Medical Imaging with GeneticsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [888] arXiv:2111.13439 [pdf, other]
-
Title: Towards Explainable End-to-End Prostate Cancer Relapse Prediction from H&E Images Combining Self-Attention Multiple Instance Learning with a Recurrent Neural NetworkAuthors: Esther Dietrich, Patrick Fuhlert, Anne Ernst, Guido Sauter, Maximilian Lennartz, H. Siegfried Stiehl, Marina Zimmermann, Stefan BonnComments: Accepted as a regular conference paper at ML4H 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [889] arXiv:2111.13445 [pdf, other]
-
Title: How Well Do Sparse Imagenet Models Transfer?Comments: Accepted to CVPR'22. This version: 25 pages, 9 figures (including appendix). **Includes extended upstream training results, which are not present in the CVPR version.**Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [890] arXiv:2111.13460 [pdf, ps, other]
-
Title: Morphology Decoder: A Machine Learning Guided 3D Vision Quantifying Heterogenous Rock Permeability for Planetary Surveillance and Robotic FunctionsComments: 1- Added Affiliation section. 2- Added Remark section. 3- Applied grammar correctionsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Geometric Topology (math.GT); Geophysics (physics.geo-ph)
- [891] arXiv:2111.13470 [pdf, other]
-
Title: TDAM: Top-Down Attention Module for Contextually Guided Feature Selection in CNNsComments: ECCV 2022 Camera ReadySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [892] arXiv:2111.13475 [pdf, other]
-
Title: QMagFace: Simple and Accurate Quality-Aware Face RecognitionAuthors: Philipp Terhörst, Malte Ihlefeld, Marco Huber, Naser Damer, Florian Kirchbuchner, Kiran Raja, Arjan KuijperSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [893] arXiv:2111.13489 [pdf, other]
-
Title: SurfEmb: Dense and Continuous Correspondence Distributions for Object Pose Estimation with Learnt Surface EmbeddingsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [894] arXiv:2111.13495 [pdf, other]
-
Title: SQUID: Deep Feature In-Painting for Unsupervised Anomaly DetectionAuthors: Tiange Xiang, Yixiao Zhang, Yongyi Lu, Alan L. Yuille, Chaoyi Zhang, Weidong Cai, Zongwei ZhouComments: CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [895] arXiv:2111.13517 [pdf, other]
-
Title: Not All Relations are Equal: Mining Informative Labels for Scene Graph GenerationComments: 16 pagesJournal-ref: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [896] arXiv:2111.13539 [pdf, other]
-
Title: GeoNeRF: Generalizing NeRF with Geometry PriorsComments: CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [897] arXiv:2111.13546 [pdf, other]
-
Title: Inside Out Visual Place RecognitionComments: Accepted at British Machine Vision Conference (BMVC) 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [898] arXiv:2111.13550 [pdf, other]
-
Title: Using Fictitious Class Representations to Boost Discriminative Zero-Shot LearnersSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [899] arXiv:2111.13579 [pdf, other]
-
Title: VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual RecognitionComments: Accepted by ECCV 2022; 14 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [900] arXiv:2111.13587 [pdf, other]
-
Title: Adaptive Fourier Neural Operators: Efficient Token Mixers for TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [901] arXiv:2111.13651 [pdf, other]
-
Title: Contrastive Object-level Pre-training with Spatial Noise Curriculum LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [902] arXiv:2111.13652 [pdf, other]
-
Title: Gradient-SDF: A Semi-Implicit Surface Representation for 3D ReconstructionComments: First two authors contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [903] arXiv:2111.13656 [pdf, other]
-
Title: Towards Low-Cost and Efficient Malaria DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [904] arXiv:2111.13663 [pdf, other]
-
Title: 3D shape sensing and deep learning-based segmentation of strawberriesComments: 14 pages, 13 figures, accepted to Computers and Electronics in AgricultureJournal-ref: Computers and Electronics in Agriculture, Volume 190, November 2021, 106374Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [905] arXiv:2111.13672 [pdf, other]
-
Title: Immortal Tracker: Tracklet Never DiesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [906] arXiv:2111.13673 [pdf, other]
-
Title: Mask Transfiner for High-Quality Instance SegmentationComments: Project page: this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [907] arXiv:2111.13674 [pdf, other]
-
Title: Neural Fields as Learnable Kernels for 3D ReconstructionAuthors: Francis Williams, Zan Gojcic, Sameh Khamis, Denis Zorin, Joan Bruna, Sanja Fidler, Or LitanySubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [908] arXiv:2111.13675 [pdf, other]
-
Title: Weakly-guided Self-supervised Pretraining for Temporal Activity DetectionComments: Published as a conference paper at AAAI 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [909] arXiv:2111.13677 [pdf, other]
-
Title: SWAT: Spatial Structure Within and Among TokensComments: Accepted to be published at IJCAI23Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [910] arXiv:2111.13679 [pdf, other]
-
Title: NeRF in the Dark: High Dynamic Range View Synthesis from Noisy Raw ImagesAuthors: Ben Mildenhall, Peter Hedman, Ricardo Martin-Brualla, Pratul Srinivasan, Jonathan T. BarronComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
- [911] arXiv:2111.13680 [pdf, other]
-
Title: GMFlow: Learning Optical Flow via Global MatchingComments: CVPR 2022, OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [912] arXiv:2111.13681 [pdf, other]
-
Title: ManiFest: Manifold Deformation for Few-shot Image TranslationComments: ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [913] arXiv:2111.13738 [pdf, other]
-
Title: The Implicit Values of A Good Hand Shake: Handheld Multi-Frame Neural Depth RefinementComments: Project github: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [914] arXiv:2111.13769 [pdf, other]
-
Title: Unsupervised MKL in Multi-layer Kernel MachinesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [915] arXiv:2111.13790 [pdf, other]
-
Title: Benchmarking Shadow Removal for Facial Landmark Detection and BeyondSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [916] arXiv:2111.13792 [pdf, other]
-
Title: LAFITE: Towards Language-Free Training for Text-to-Image GenerationAuthors: Yufan Zhou, Ruiyi Zhang, Changyou Chen, Chunyuan Li, Chris Tensmeyer, Tong Yu, Jiuxiang Gu, Jinhui Xu, Tong SunComments: Accepted by CVPR 2022, this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [917] arXiv:2111.13809 [pdf, other]
-
Title: Document Layout Analysis with Aesthetic-Guided Image AugmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [918] arXiv:2111.13813 [pdf, ps, other]
-
Title: Video Content Classification using Deep LearningComments: for assosiated Dataset check :- this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [919] arXiv:2111.13817 [pdf, other]
-
Title: Video Frame Interpolation TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [920] arXiv:2111.13818 [pdf, ps, other]
-
Title: Recognition and Co-Analysis of Pedestrian Activities in Different Parts of Road using Traffic Camera VideoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [921] arXiv:2111.13824 [pdf, other]
-
Title: FQ-ViT: Post-Training Quantization for Fully Quantized Vision TransformerComments: Accepted by IJCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [922] arXiv:2111.13838 [pdf, other]
-
Title: DSC: Deep Scan Context Descriptor for Large-Scale Place RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [923] arXiv:2111.13841 [pdf, other]
-
Title: Adaptive Perturbation for Adversarial AttackComments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). 18 pages, 7 figures, 14 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [924] arXiv:2111.13844 [pdf, other]
-
Title: Adaptive Image Transformations for Transfer-based Adversarial AttackComments: 34 pages, 7 figures, 11 tables. Accepted by ECCV2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [925] arXiv:2111.13850 [pdf, other]
- [926] arXiv:2111.13876 [pdf, other]
-
Title: Learning Discriminative Shrinkage Deep Networks for Image DeconvolutionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [927] arXiv:2111.13888 [pdf, other]
-
Title: Head and Body: Unified Detector and Graph Network for Person Search in MediaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [928] arXiv:2111.13907 [pdf, other]
-
Title: Pose Representations for Deep Skeletal AnimationComments: Presented at the ACM SIGGRAPH/Eurographics Symposium on Computer Animation, SCA'22Journal-ref: Computer Graphics Forum, Volume 41, Issue 8, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [929] arXiv:2111.13920 [pdf, ps, other]
-
Title: Sparse Subspace Clustering Friendly Deep Dictionary Learning for Hyperspectral Image ClassificationComments: IEEE Geoscience And Remote Sensing LettersSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [930] arXiv:2111.13924 [pdf, other]
-
Title: A Practical Contrastive Learning Framework for Single-Image Super-ResolutionComments: Accepted by IEEE Transactions on Neural Networks and Learning SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [931] arXiv:2111.13945 [pdf, other]
-
Title: Calibrated Feature Decomposition for Generalizable Person Re-IdentificationComments: Technical report, Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [932] arXiv:2111.13958 [pdf, other]
-
Title: Safe Screening for Sparse Conditional Random FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [933] arXiv:2111.13970 [pdf, other]
-
Title: Label Assistant: A Workflow for Assisted Data Annotation in Image Segmentation TasksAuthors: Marcel P. Schilling, Luca Rettenberger, Friedrich Münke, Haijun Cui, Anna A. Popova, Pavel A. Levkin, Ralf Mikut, Markus ReischlJournal-ref: Proceedings - 31. Workshop Computational Intelligence, 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [934] arXiv:2111.13997 [pdf, other]
-
Title: Learning Continuous Environment Fields via Implicit FunctionsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [935] arXiv:2111.13998 [pdf, other]
-
Title: Targeted Supervised Contrastive Learning for Long-Tailed RecognitionAuthors: Tianhong Li, Peng Cao, Yuan Yuan, Lijie Fan, Yuzhe Yang, Rogerio Feris, Piotr Indyk, Dina KatabiComments: The first two authors contributed equally to this paperSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [936] arXiv:2111.14014 [pdf, other]
-
Title: Unsupervised Domain Adaptive Person Re-Identification via Human Learning ImitationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [937] arXiv:2111.14021 [pdf, other]
-
Title: AI-supported Framework of Semi-Automatic Monoplotting for Monocular Oblique Visual Data AnalysisSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [938] arXiv:2111.14055 [pdf, other]
-
Title: ESGN: Efficient Stereo Geometry Network for Fast 3D Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [939] arXiv:2111.14059 [pdf, other]
-
Title: NoFADE: Analyzing Diminishing Returns on CO2 InvestmentAuthors: Andre Fu, Justin Tran, Andy Xie, Jonathan Spraggett, Elisa Ding, Chang-Won Lee, Kanav Singla, Mahdi S. Hosseini, Konstantinos N. PlataniotisComments: Climate Change with Machine Learning workshop at 35th Conference on Neural Information Processing Systems (NeurIPS2021-CCAI)Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
- [940] arXiv:2111.14060 [pdf, other]
-
Title: Detection of E-scooter Riders in Naturalistic ScenesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [941] arXiv:2111.14067 [pdf, other]
- [942] arXiv:2111.14075 [pdf, ps, other]
-
Title: Image preprocessing and modified adaptive thresholding for improving OCRAuthors: Rohan Lal KshetryComments: 5 pages, 7 figuesJournal-ref: Stoli\'nski, Sebastian & Bieniecki, Wojciech. (2011). Application of OCR systems to processing and digitization of paper documents. WULS Press Warszawa 2011. 102-111Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [943] arXiv:2111.14093 [pdf, other]
-
Title: Adaptive Reordering Sampler with Neurally Guided MAGSACSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [944] arXiv:2111.14096 [pdf, other]
-
Title: Gated SwitchGAN for multi-domain facial image translationComments: Accepted in IEEE TRANSACTIONS ON MULTIMEDIA(TMM)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [945] arXiv:2111.14103 [pdf, other]
-
Title: CHARTER: heatmap-based multi-type chart data extractionAuthors: Joseph Shtok, Sivan Harary, Ophir Azulai, Adi Raz Goldfarb, Assaf Arbelle, Leonid KarlinskyComments: Joseph Shtok, Sivan Harary and Leonid Karlinsky had equal contributionJournal-ref: Document Intelligence workshop at KDD 2021 conferenceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [946] arXiv:2111.14122 [pdf, other]
-
Title: Cross-Task Consistency Learning Framework for Multi-Task LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [947] arXiv:2111.14131 [pdf, other]
-
Title: Learning a Weight Map for Weakly-Supervised LocalizationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [948] arXiv:2111.14145 [pdf, other]
-
Title: FashionSearchNet-v2: Learning Attribute Representations with Localization for Image Retrieval with Attribute ManipulationComments: 15 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [949] arXiv:2111.14157 [pdf, other]
-
Title: Implicit Equivariance in Convolutional NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [950] arXiv:2111.14160 [pdf, other]
-
Title: Learning To Segment Dominant Object Motion From Watching VideosComments: DICTA 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [951] arXiv:2111.14173 [pdf, other]
-
Title: CDGNet: Class Distribution Guided Network for Human ParsingComments: Accepted at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [952] arXiv:2111.14176 [pdf, other]
-
Title: UAV-based Crowd Surveillance in Post COVID-19 EraComments: Accepted for publication in IEEE Access; 14 pages, 13 figures, 5 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
- [953] arXiv:2111.14182 [pdf, other]
-
Title: Make an Omelette with Breaking Eggs: Zero-Shot Learning for Novel Attribute SynthesisComments: Accepted by the 36th Conference on Neural Information Processing Systems (NeurIPS 2022). (* Yu-Hsuan Li and Tzu-Yin Chao contributed equally to this work.)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [954] arXiv:2111.14243 [pdf, ps, other]
-
Title: EffCNet: An Efficient CondenseNet for Image Classification on NXP BlueBoxComments: 11 pages, 10 figures, published in American Journal of Electrical and Computer EngineeringJournal-ref: Vol. 5, No. 2, 2021, pp. 77-87Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [955] arXiv:2111.14267 [pdf, other]
-
Title: Explore the Potential Performance of Vision-and-Language Navigation Model: a Snapshot Ensemble MethodComments: 7 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
- [956] arXiv:2111.14270 [pdf, other]
-
Title: Automated Detection of Patients in Hospital Video RecordingsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [957] arXiv:2111.14271 [pdf, other]
-
Title: ExCon: Explanation-driven Supervised Contrastive Learning for Image ClassificationAuthors: Zhibo Zhang, Jongseong Jang, Chiheb Trabelsi, Ruiwen Li, Scott Sanner, Yeonjeong Jeong, Dongsub ShimSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [958] arXiv:2111.14290 [pdf, other]
-
Title: TAL: Two-stream Adaptive Learning for Generalizable Person Re-identificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [959] arXiv:2111.14292 [pdf, other]
- [960] arXiv:2111.14297 [pdf, other]
-
Title: Data Augmentation For Medical MR Image Using Generative Adversarial NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [961] arXiv:2111.14302 [pdf, other]
-
Title: Feature-Gate Coupling for Dynamic Network PruningComments: 31 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [962] arXiv:2111.14311 [pdf, other]
-
Title: The CSIRO Crown-of-Thorn Starfish Detection DatasetAuthors: Jiajun Liu, Brano Kusy, Ross Marchant, Brendan Do, Torsten Merz, Joey Crosswell, Andy Steven, Nic Heaney, Karl von Richter, Lachlan Tychsen-Smith, David Ahmedt-Aristizabal, Mohammad Ali Armin, Geoffrey Carlin, Russ Babcock, Peyman Moghadam, Daniel Smith, Tim Davis, Kemal El Moujahid, Martin Wicke, Megha MalpaniSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [963] arXiv:2111.14316 [pdf, other]
-
Title: Learning Context-Aware Embedding for Person SearchSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [964] arXiv:2111.14319 [pdf, other]
-
Title: TinyDefectNet: Highly Compact Deep Neural Network Architecture for High-Throughput Manufacturing Visual Quality InspectionComments: 6 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [965] arXiv:2111.14330 [pdf, other]
-
Title: Sparse DETR: Efficient End-to-End Object Detection with Learnable SparsityComments: ICLR 2022. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [966] arXiv:2111.14338 [pdf, other]
-
Title: Improving Deep Learning Interpretability by Saliency Guided TrainingJournal-ref: Thirty-fifth Conference on Neural Information Processing Systems 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [967] arXiv:2111.14339 [pdf, ps, other]
-
Title: Heterogeneous Visible-Thermal and Visible-Infrared Face Recognition using Unit-Class Loss and Cross-Modality DiscriminatorSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
- [968] arXiv:2111.14340 [pdf, other]
-
Title: Attention-based Feature Decomposition-Reconstruction Network for Scene Text DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [969] arXiv:2111.14341 [pdf, other]
-
Title: OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural ImagesAuthors: Bingchen Zhao, Shaozuo Yu, Wufei Ma, Mingxin Yu, Shenxiao Mei, Angtian Wang, Ju He, Alan Yuille, Adam KortylewskiComments: Project webpage: this http URL, this work is accepted as Oral at ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [970] arXiv:2111.14343 [pdf, other]
-
Title: Anomaly-Aware Semantic Segmentation by Leveraging Synthetic-Unknown DataSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [971] arXiv:2111.14349 [pdf, ps, other]
-
Title: First Power Linear Unit with SignAuthors: Boxi DuanSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [972] arXiv:2111.14353 [pdf, other]
-
Title: Semi-supervised Domain Adaptation via Sample-to-Sample Self-DistillationComments: Accepted to WACV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [973] arXiv:2111.14356 [pdf, other]
-
Title: Improved Knowledge Distillation via Adversarial CollaborationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [974] arXiv:2111.14358 [pdf, other]
-
Title: IDR: Self-Supervised Image Denoising via Iterative Data RefinementComments: CVPR2022; code & dataset: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [975] arXiv:2111.14382 [pdf, other]
-
Title: VPFNet: Improving 3D Object Detection with Virtual Point based LiDAR and Stereo Data FusionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [976] arXiv:2111.14396 [pdf, other]
-
Title: Lightweight Deep Learning Architecture for MPI Correction and Transient ReconstructionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [977] arXiv:2111.14411 [pdf, ps, other]
-
Title: PGGANet: Pose Guided Graph Attention Network for Person Re-identificationComments: 22 pages, 9 figures, 5 tables, 60 referencesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [978] arXiv:2111.14420 [pdf, other]
-
Title: IB-MVS: An Iterative Algorithm for Deep Multi-View Stereo based on Binary DecisionsAuthors: Christian Sormann (1), Mattia Rossi (2), Andreas Kuhn (2), Friedrich Fraundorfer (1) ((1) Graz University of Technology, (2) Sony Europe B.V.)Comments: accepted at BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [979] arXiv:2111.14422 [pdf, other]
-
Title: Agent-Centric Relation Graph for Object Visual NavigationComments: 16 pages, 13 figures, 7 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [980] arXiv:2111.14438 [pdf, ps, other]
-
Title: K-nearest neighbour and dynamic time warping for online signature verificationComments: 2nd International Conference on Machine Learning Techniques and Data Science (MLDS 2021) ISPN: 978-1-925953-52-7Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [981] arXiv:2111.14447 [pdf, other]
-
Title: ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic ArithmeticComments: To appear in CVPR'22Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [982] arXiv:2111.14448 [pdf, other]
-
Title: AVA-AVD: Audio-Visual Speaker Diarization in the WildComments: ACMMM 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
- [983] arXiv:2111.14451 [pdf, other]
-
Title: HDR-NeRF: High Dynamic Range Neural Radiance FieldsComments: Accepted to CVPR 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [984] arXiv:2111.14458 [pdf, ps, other]
-
Title: Decoupled Low-light Image EnhancementComments: This paper has been accepted in the ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [985] arXiv:2111.14465 [pdf, other]
-
Title: Motion-from-Blur: 3D Shape and Motion Estimation of Motion-blurred Objects in VideosComments: CVPR 2022 camera-readyJournal-ref: 2022 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
- [986] arXiv:2111.14482 [pdf, other]
-
Title: High Quality Segmentation for Ultra High-resolution ImagesAuthors: Tiancheng Shen, Yuechen Zhang, Lu Qi, Jason Kuen, Xingyu Xie, Jianlong Wu, Zhe Lin, Jiaya JiaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [987] arXiv:2111.14485 [pdf, other]
-
Title: CoNIC: Colon Nuclei Identification and Counting Challenge 2022Authors: Simon Graham, Mostafa Jahanifar, Quoc Dang Vu, Giorgos Hadjigeorghiou, Thomas Leech, David Snead, Shan E Ahmed Raza, Fayyaz Minhas, Nasir RajpootSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [988] arXiv:2111.14493 [pdf, other]
-
Title: On the Effectiveness of Neural Ensembles for Image Classification with Small DatasetsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [989] arXiv:2111.14507 [pdf, other]
-
Title: SPIN: Simplifying Polar Invariance for Neural networks Application to vision-based irradiance forecastingComments: CVPR 2022 - OmniCV workshop (oral)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [990] arXiv:2111.14517 [pdf, other]
-
Title: Robust and Accurate Superquadric Recovery: a Probabilistic ApproachComments: Accepted to CVPR2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [991] arXiv:2111.14547 [pdf, other]
-
Title: LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question AnsweringComments: 11 pages, 5 figures, Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [992] arXiv:2111.14549 [pdf, other]
-
Title: MeshUDF: Fast and Differentiable Meshing of Unsigned Distance Field NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [993] arXiv:2111.14556 [pdf, other]
-
Title: On the Integration of Self-Attention and ConvolutionComments: Accepted to CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [994] arXiv:2111.14557 [pdf, other]
-
Title: Image Segmentation to Identify Safe Landing Zones for Unmanned Aerial VehiclesComments: 12 pages, to appear in Proceedings of the 29th Irish Conference on Artificial Intelligence and Cognitive Science AICS'2021, December 2021Journal-ref: CEUR Workshop Proceedings Volume 3105, pp.235-247 urn:nbn:de:0074-3105-7 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [995] arXiv:2111.14562 [pdf, other]
-
Title: Instance-wise Occlusion and Depth Orders in Natural ScenesComments: Accepted to CVPR 2022. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [996] arXiv:2111.14564 [pdf, other]
-
Title: MedRDF: A Robust and Retrain-Less Diagnostic Framework for Medical Pretrained Models Against Adversarial AttackComments: TMI under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [997] arXiv:2111.14576 [pdf, other]
-
Title: Recurrent Vision Transformer for Solving Visual Reasoning ProblemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [998] arXiv:2111.14582 [pdf, other]
-
Title: Multi-instance Point Cloud Registration by Efficient Correspondence ClusteringComments: Accepted by CVPRSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [999] arXiv:2111.14585 [pdf, other]
-
Title: Similarity Contrastive Estimation for Self-Supervised Soft Contrastive LearningComments: Accepted to IEEE Winter Conference on Applications of Computer Vision (WACV) 2023Journal-ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [1000] arXiv:2111.14595 [pdf, other]
-
Title: Overcoming the Domain Gap in Contrastive Learning of Neural Action RepresentationsComments: Accepted into NeurIPS 2021 Workshop: Self-Supervised Learning - Theory and PracticeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1001] arXiv:2111.14600 [pdf, other]
-
Title: TransMVSNet: Global Context-aware Multi-view Stereo Network with TransformersAuthors: Yikang Ding, Wentao Yuan, Qingtian Zhu, Haotian Zhang, Xiangyue Liu, Yuanjiang Wang, Xiao LiuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1002] arXiv:2111.14605 [pdf, other]
-
Title: Weakly-supervised Generative Adversarial Networks for medical image classificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1003] arXiv:2111.14637 [pdf, other]
-
Title: ILabel: Interactive Neural Scene LabellingAuthors: Shuaifeng Zhi, Edgar Sucar, Andre Mouton, Iain Haughton, Tristan Laidlow, Andrew J. DavisonSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1004] arXiv:2111.14643 [pdf, other]
-
Title: Urban Radiance FieldsAuthors: Konstantinos Rematas, Andrew Liu, Pratul P. Srinivasan, Jonathan T. Barron, Andrea Tagliasacchi, Thomas Funkhouser, Vittorio FerrariComments: Project: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [1005] arXiv:2111.14646 [pdf, other]
-
Title: MUNet: Motion Uncertainty-aware Semi-supervised Video Object SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1006] arXiv:2111.14650 [pdf, other]
-
Title: Buildings Classification using Very High Resolution Satellite ImageryAuthors: Mohammad Dimassi, Abed Ellatif Samhat, Mohammad Zaraket, Jamal Haidar, Mustafa Shukor, Ali J. GhandourSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1007] arXiv:2111.14658 [pdf, other]
-
Title: diffConv: Analyzing Irregular Point Clouds with an Irregular ViewComments: Accepted by ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1008] arXiv:2111.14672 [pdf, other]
-
Title: Human Performance Capture from Monocular Video in the WildSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1009] arXiv:2111.14673 [pdf, other]
-
Title: 3D Compositional Zero-shot Learning with DeCompositional ConsensusSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1010] arXiv:2111.14680 [pdf, other]
-
Title: Graph Embedding via High Dimensional Model Representation for Hyperspectral ImagesComments: This is an accepted version of work to be published in the IEEE Transactions on Geoscience and Remote Sensing. 11 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [1011] arXiv:2111.14690 [pdf, other]
-
Title: DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse MotionComments: add change logSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1012] arXiv:2111.14707 [pdf, ps, other]
-
Title: Real-time Attention Span Tracking in Online EducationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1013] arXiv:2111.14725 [pdf, other]
-
Title: Searching the Search Space of Vision TransformerAuthors: Minghao Chen, Kan Wu, Bolin Ni, Houwen Peng, Bei Liu, Jianlong Fu, Hongyang Chao, Haibin LingComments: Accepted to NIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1014] arXiv:2111.14726 [pdf, other]
-
Title: Do Invariances in Deep Neural Networks Align with Human Perception?Authors: Vedant Nanda, Ayan Majumdar, Camila Kolling, John P. Dickerson, Krishna P. Gummadi, Bradley C. Love, Adrian WellerComments: AAAI 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [1015] arXiv:2111.14741 [pdf, other]
-
Title: Domain Adaptation of Networks for Camera Pose Estimation: Learning Camera Pose Estimation Without Pose LabelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1016] arXiv:2111.14745 [pdf, other]
-
Title: A Simple Long-Tailed Recognition Baseline via Vision-Language ModelSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1017] arXiv:2111.14762 [pdf, other]
-
Title: Riemannian Functional Map Synchronization for Probabilistic Partial Correspondence in Shape NetworksComments: 16 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [1018] arXiv:2111.14777 [pdf, other]
-
Title: Deep Decomposition for Stochastic Normal-Abnormal TransportComments: Accepted as ORAL to CVPR 2022 (15 pages, 5 figures)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1019] arXiv:2111.14791 [pdf, other]
-
Title: Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image AnalysisAuthors: Yucheng Tang, Dong Yang, Wenqi Li, Holger Roth, Bennett Landman, Daguang Xu, Vishwesh Nath, Ali HatamizadehComments: CVPR'22 Accepted PaperSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [1020] arXiv:2111.14792 [pdf, other]
-
Title: Classification-Regression for Chart ComprehensionComments: ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1021] arXiv:2111.14798 [pdf, other]
-
Title: Semi-supervised Implicit Scene Completion from Sparse LiDARComments: code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1022] arXiv:2111.14799 [pdf, other]
-
Title: UBoCo : Unsupervised Boundary Contrastive Learning for Generic Event Boundary DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1023] arXiv:2111.14806 [pdf, other]
-
Title: Coarse-To-Fine Incremental Few-Shot LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1024] arXiv:2111.14813 [pdf, other]
-
Title: TransWeather: Transformer-based Restoration of Images Degraded by Adverse Weather ConditionsComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1025] arXiv:2111.14818 [pdf, other]
-
Title: Blended Diffusion for Text-driven Editing of Natural ImagesComments: CVPR 2022. Code is available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [1026] arXiv:2111.14819 [pdf, other]
-
Title: Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point ModelingComments: Accepted to CVPR 2022, Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [1027] arXiv:2111.14821 [pdf, other]
-
Title: End-to-End Referring Video Object Segmentation with Multimodal TransformersComments: Accepted to CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [1028] arXiv:2111.14822 [pdf, other]
-
Title: Vector Quantized Diffusion Model for Text-to-Image SynthesisAuthors: Shuyang Gu, Dong Chen, Jianmin Bao, Fang Wen, Bo Zhang, Dongdong Chen, Lu Yuan, Baining GuoSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1029] arXiv:2111.14824 [pdf, other]
-
Title: Learning to Fit Morphable ModelsComments: ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1030] arXiv:2111.14825 [pdf, other]
-
Title: Latent Transformations via NeuralODEs for GAN-based Image EditingComments: Published at ICCV 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [1031] arXiv:2111.14826 [pdf, other]
-
Title: Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through EstimationComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [1032] arXiv:2111.14831 [pdf, ps, other]
-
Title: Multi-domain Integrative Swin Transformer network for Sparse-View Tomographic ReconstructionComments: 30 pages, 11 figures, 10 tables, 54 referencesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1033] arXiv:2111.14887 [pdf, other]
-
Title: DAFormer: Improving Network Architectures and Training Strategies for Domain-Adaptive Semantic SegmentationComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1034] arXiv:2111.14893 [pdf, other]
-
Title: Learning Multiple Dense Prediction Tasks from Partially Annotated DataComments: CVPR2022, Multi-task Partially-supervised Learning, Code will be available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1035] arXiv:2111.14923 [pdf, other]
-
Title: Equitable modelling of brain imaging by counterfactual augmentation with morphologically constrained 3D deep generative modelsAuthors: Guilherme Pombo, Robert Gray, Jorge Cardoso, Sebastien Ourselin, Geraint Rees, John Ashburner, Parashkev NachevSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1036] arXiv:2111.14931 [pdf, other]
-
Title: How Facial Features Convey Attention in Stationary EnvironmentsAuthors: Janelle DomantaySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1037] arXiv:2111.14943 [pdf, other]
-
Title: Morph Detection Enhanced by Structured Group SparsitySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1038] arXiv:2111.14948 [pdf, ps, other]
-
Title: Image denoising by Super Neurons: Why go deep?Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1039] arXiv:2111.14973 [pdf, other]
-
Title: MultiPath++: Efficient Information Fusion and Trajectory Aggregation for Behavior PredictionAuthors: Balakrishnan Varadarajan, Ahmed Hefny, Avikalp Srivastava, Khaled S. Refaat, Nigamaa Nayakanti, Andre Cornman, Kan Chen, Bertrand Douillard, Chi Pang Lam, Dragomir Anguelov, Benjamin SappSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [1040] arXiv:2111.15000 [pdf, other]
-
Title: Deformable ProtoPNet: An Interpretable Image Classifier Using Deformable PrototypesComments: This was published in CVPR 2022Journal-ref: 2022 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [1041] arXiv:2111.15015 [pdf, other]
-
Title: Neural Attention for Image Captioning: Review of Outstanding MethodsComments: This is the accepted version, which we are allowed to publish on arxiv based on Springer Nature policies. For the published version please refer to Springer Nature Artificial Intelligence Review Journal. DOI number is attached. For Citation refer to AIRE journal using DOI linkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1042] arXiv:2111.15018 [pdf, other]
-
Title: Hyperspectral Image Segmentation based on Graph Processing over Multilayer NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [1043] arXiv:2111.15047 [pdf, other]
-
Title: Adaptive Gating for Single-Photon 3D ImagingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1044] arXiv:2111.15050 [pdf, other]
-
Title: AssistSR: Task-oriented Video Segment Retrieval for Personal AI AssistantAuthors: Stan Weixian Lei, Difei Gao, Yuxuan Wang, Dongxing Mao, Zihan Liang, Lingmin Ran, Mike Zheng ShouComments: 20 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1045] arXiv:2111.15056 [pdf, other]
-
Title: Camera Distortion-aware 3D Human Pose Estimation in Video with Optimization-based Meta-LearningComments: Accepted to ICCV 2021 (poster)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1046] arXiv:2111.15064 [pdf, ps, other]
-
Title: Hole-robust Wireframe DetectionComments: To appear in Proceedings of the 2022 IEEE Winter Conference on Applications of Computer Vision (WACV 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1047] arXiv:2111.15077 [pdf, other]
-
Title: Unsupervised Domain Generalization for Person Re-identification: A Domain-specific Adaptive FrameworkComments: Accepted to Pattern Recognition (PR)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1048] arXiv:2111.15078 [pdf, other]
-
Title: SketchEdit: Mask-Free Local Image Manipulation with Partial SketchesSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [1049] arXiv:2111.15097 [pdf, other]
-
Title: EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANsComments: Accepted in ECCV2022, Guohao Yin and Xin He contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [1050] arXiv:2111.15111 [pdf, ps, other]
-
Title: Automatic tracing of mandibular canal pathways using deep learningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1051] arXiv:2111.15113 [pdf, other]
-
Title: LatentHuman: Shape-and-Pose Disentangled Latent Representation for Human BodiesAuthors: Sandro Lombardi, Bangbang Yang, Tianxing Fan, Hujun Bao, Guofeng Zhang, Marc Pollefeys, Zhaopeng CuiComments: Accepted to 3DV 2021. Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1052] arXiv:2111.15114 [pdf, other]
-
Title: ePose: Let's Make EfficientPose More Generally ApplicableComments: 7 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1053] arXiv:2111.15119 [pdf, other]
-
Title: Aerial Images Meet Crowdsourced Trajectories: A New Approach to Robust Road ExtractionComments: This work has been accepted by IEEE Transactions on Neural Networks and Learning SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1054] arXiv:2111.15121 [pdf, other]
-
Title: Pyramid Adversarial Training Improves ViT PerformanceAuthors: Charles Herrmann, Kyle Sargent, Lu Jiang, Ramin Zabih, Huiwen Chang, Ce Liu, Dilip Krishnan, Deqing SunComments: Accepted to CVPR22 (oral, best paper finalist). 33 pages, including references & supplementary materialSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1055] arXiv:2111.15124 [pdf, other]
-
Title: In-Bed Human Pose Estimation from Unseen and Privacy-Preserving Image DomainsComments: In the IEEE International Symposium on Biomedical Imaging (ISBI)Journal-ref: ISBI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1056] arXiv:2111.15127 [pdf, other]
-
Title: A Unified Pruning Framework for Vision TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1057] arXiv:2111.15129 [pdf, other]
-
Title: Anonymization for Skeleton Action RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1058] arXiv:2111.15140 [pdf, other]
-
Title: Robust 3D Garment Digitization from Monocular 2D Images for 3D Virtual Try-On SystemsAuthors: Sahib Majithia, Sandeep N. Parameswaran, Sadbhavana Babar, Vikram Garg, Astitva Srivastava, Avinash SharmaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1059] arXiv:2111.15143 [pdf, other]
-
Title: HEAT: Holistic Edge Attention Transformer for Structured ReconstructionComments: CVPR 2022 camera-readySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1060] arXiv:2111.15150 [pdf, other]
-
Title: AirObject: A Temporally Evolving Graph Embedding for Object IdentificationComments: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022Journal-ref: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1061] arXiv:2111.15157 [pdf, other]
-
Title: MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking BenchmarkAuthors: Xiaotian Han, Quanzeng You, Chunyu Wang, Zhizheng Zhang, Peng Chu, Houdong Hu, Jiang Wang, Zicheng LiuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1062] arXiv:2111.15158 [pdf, other]
-
Title: A Dataset-Dispersion Perspective on Reconstruction Versus Recognition in Single-View 3D Reconstruction NetworksComments: Accepted to 3DV 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1063] arXiv:2111.15162 [pdf, other]
-
Title: CLIP Meets Video Captioning: Concept-Aware Representation Learning Does MatterComments: to appear in the 5th Chinese Conference on Pattern Recognition and Computer Vision (PRCV 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1064] arXiv:2111.15171 [pdf, other]
-
Title: Generative Convolution Layer for Image GenerationComments: Submitted to Neural NetworksJournal-ref: Neural Networks 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1065] arXiv:2111.15174 [pdf, other]
-
Title: CRIS: CLIP-Driven Referring Image SegmentationComments: 10 pages, 5 figures, Accepted by CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1066] arXiv:2111.15181 [pdf, other]
-
Title: Zero-Shot Semantic Segmentation via Spatial and Multi-Scale Aware Visual Class EmbeddingComments: Under Review on Pattern Recognition LettersSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1067] arXiv:2111.15185 [pdf, other]
-
Title: SamplingAug: On the Importance of Patch Sampling Augmentation for Single Image Super-ResolutionComments: BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1068] arXiv:2111.15192 [pdf, other]
-
Title: PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense ReconstructionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1069] arXiv:2111.15193 [pdf, other]
-
Title: Shunted Self-Attention via Multi-Scale Token AggregationComments: CVPR2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1070] arXiv:2111.15199 [pdf, other]
-
Title: Semi-Supervised 3D Hand Shape and Pose Estimation with Label PropagationComments: DICTA 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1071] arXiv:2111.15207 [pdf, other]
-
Title: NeeDrop: Self-supervised Shape Representation from Sparse Point Clouds using Needle DroppingComments: 22 pagesJournal-ref: International Conference on 3D Vision (3DV), 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG); Machine Learning (cs.LG)
- [1072] arXiv:2111.15208 [pdf, ps, other]
-
Title: HRNET: AI on Edge for mask detection and social distancingJournal-ref: SN Computer Science, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1073] arXiv:2111.15210 [pdf, other]
-
Title: Point Cloud Instance Segmentation with Semi-supervised Bounding-Box MiningComments: IEEE Trans on Pattern Analysis and Machine IntelligenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1074] arXiv:2111.15213 [pdf, other]
-
Title: Using a GAN to Generate Adversarial Examples to Facial Image RecognitionComments: 8 pages, to appear at the Media Watermarking, Security, and Forensics Conference at Electronic Imaging, January, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1075] arXiv:2111.15234 [pdf, other]
-
Title: NeRFReN: Neural Radiance Fields with ReflectionsComments: Accepted to CVPR 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [1076] arXiv:2111.15242 [pdf, other]
-
Title: ConDA: Unsupervised Domain Adaptation for LiDAR Segmentation via Regularized Domain ConcatenationComments: 8 pages, 6 figures, 4 tables; ICRA 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [1077] arXiv:2111.15246 [pdf, other]
-
Title: Hallucinated Neural Radiance Fields in the WildComments: Accepted by CVPR 2022. Project website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1078] arXiv:2111.15257 [pdf, other]
-
Title: ARTSeg: Employing Attention for Thermal images Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1079] arXiv:2111.15263 [pdf, other]
-
Title: Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic FeaturesComments: Accepted for publication at ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1080] arXiv:2111.15264 [pdf, other]
-
Title: EdiBERT, a generative model for image editingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1081] arXiv:2111.15266 [pdf, other]
-
Title: Two-stage Temporal Modelling Framework for Video-based Depression Recognition using Graph RepresentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1082] arXiv:2111.15271 [pdf, other]
-
Title: Affect-DML: Context-Aware One-Shot Recognition of Human Affect using Deep Metric LearningAuthors: Kunyu Peng, Alina Roitberg, David Schneider, Marios Koulakis, Kailun Yang, Rainer StiefelhagenComments: Accepted to IEEE International Conference on Automatic Face and Gesture Recognition 2021 (FG2021). Benchmark, models, and code are at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1083] arXiv:2111.15288 [pdf, other]
-
Title: Revisiting Temporal Alignment for Video RestorationComments: 15 pages. 17 figures, 10 tables/Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1084] arXiv:2111.15300 [pdf, other]
-
Title: TridentAdapt: Learning Domain-invariance via Source-Target Confrontation and Self-induced Cross-domain AugmentationJournal-ref: 32nd British Machine Vision Conference 2021, BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1085] arXiv:2111.15318 [pdf, other]
-
Title: DiffSDFSim: Differentiable Rigid-Body Dynamics With Implicit ShapesComments: 22 pages, 23 Figures (including supplementary material). Presented 3DV 2021. Project website: this https URLJournal-ref: 2021 International Conference on 3D Vision (3DV)Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
- [1086] arXiv:2111.15340 [pdf, other]
-
Title: MC-SSL0.0: Towards Multi-Concept Self-Supervised LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1087] arXiv:2111.15341 [pdf, other]
-
Title: ZZ-Net: A Universal Rotation Equivariant Architecture for 2D Point CloudsComments: CVPR 2022 camera readySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1088] arXiv:2111.15361 [pdf, other]
-
Title: Seeking Salient Facial Regions for Cross-Database Micro-Expression RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1089] arXiv:2111.15362 [pdf, other]
-
Title: ISNAS-DIP: Image-Specific Neural Architecture Search for Deep Image PriorSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1090] arXiv:2111.15363 [pdf, other]
-
Title: Voint Cloud: Multi-View Point Cloud Representation for 3D UnderstandingComments: Accepted at ICLR 2023. The code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1091] arXiv:2111.15376 [pdf, ps, other]
-
Title: Reconstruction Student with Attention for Student-Teacher Pyramid MatchingComments: 10 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1092] arXiv:2111.15400 [pdf, other]
-
Title: CT-block: a novel local and global features extractor for point cloudComments: 15 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1093] arXiv:2111.15404 [pdf, other]
-
Title: Probabilistic Estimation of 3D Human Shape and Pose with a Semantic Local Parametric ModelComments: BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1094] arXiv:2111.15416 [pdf, other]
-
Title: Worst-Case Morphs: a Theoretical and a Practical ApproachSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1095] arXiv:2111.15430 [pdf, other]
-
Title: The Devil is in the Margin: Margin-based Label Smoothing for Network CalibrationComments: CVPR 2022. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1096] arXiv:2111.15438 [pdf, other]
-
Title: FMD-cGAN: Fast Motion Deblurring using Conditional Generative Adversarial NetworksComments: International Conference on Computer Vision and Image Processing 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1097] arXiv:2111.15449 [pdf, other]
- [1098] arXiv:2111.15451 [pdf, other]
-
Title: Large-Scale Video Analytics through Object-Level ConsolidationSubjects: Computer Vision and Pattern Recognition (cs.CV); Networking and Internet Architecture (cs.NI)
- [1099] arXiv:2111.15454 [pdf, other]
-
Title: Boosting Discriminative Visual Representation Learning with Scenario-Agnostic MixupComments: Preprint version v3 with 9 pages main body and 8 pages appendix. The source code is available at \url{this https URL}Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1100] arXiv:2111.15463 [pdf, other]
-
Title: Consensus Synergizes with Memory: A Simple Approach for Anomaly Segmentation in Urban ScenesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1101] arXiv:2111.15475 [pdf, ps, other]
-
Title: Natural Scene Text Editing Based on AIAuthors: Yujie ZhangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1102] arXiv:2111.15479 [pdf, ps, other]
-
Title: Analysis of Multiscale Wavelet-based Fractional Gradient-Anisotropic Diffusion Fusion for single hazy and underwater image enhancementAuthors: Uche A. NnolimComments: 15 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1103] arXiv:2111.15483 [pdf, other]
-
Title: ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame InterpolationComments: Accepted in CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1104] arXiv:2111.15490 [pdf, other]
-
Title: FENeRF: Face Editing in Neural Radiance FieldsComments: Accepted to CVPR 2022. Project: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1105] arXiv:2111.15491 [pdf, other]
-
Title: PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1106] arXiv:2111.15509 [pdf, other]
-
Title: Regularized directional representations for medical image registrationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1107] arXiv:2111.15510 [pdf, other]
-
Title: ESL: Event-based Structured LightJournal-ref: IEEE International Conference on 3D Vision (3DV), 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1108] arXiv:2111.15513 [pdf, other]
-
Title: RADU: Ray-Aligned Depth Update Convolutions for ToF Data DenoisingComments: Accepted at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1109] arXiv:2111.15514 [pdf, ps, other]
-
Title: Nonlinear Intensity Underwater Sonar Image Matching Method Based on Phase Information and Deep Convolution FeaturesComments: 6 pages, letters, 9 figures. arXiv admin note: substantial text overlap with arXiv:2111.08994Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1110] arXiv:2111.15552 [pdf, other]
-
Title: NeuSample: Neural Sample Field for Efficient View SynthesisComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [1111] arXiv:2111.15557 [pdf, other]
-
Title: Low-light Image Enhancement via Breaking Down the DarknessComments: 9 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1112] arXiv:2111.15581 [pdf, other]
-
Title: Automated Damage Inspection of Power Transmission Towers from UAV ImagesComments: 8 pages, 10 figures, accepted for VISAPP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1113] arXiv:2111.15592 [pdf, other]
-
Title: MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at ScaleComments: 13 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1114] arXiv:2111.15603 [pdf, other]
-
Title: Human Imperceptible Attacks and Applications to Improve FairnessSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1115] arXiv:2111.15606 [pdf, other]
-
Title: Robust Partial-to-Partial Point Cloud Registration in a Full RangeComments: 15 pages, 9 figures. Github Website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1116] arXiv:2111.15613 [pdf, other]
-
Title: The MIS Check-Dam Dataset for Object Detection and Instance Segmentation TasksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1117] arXiv:2111.15615 [pdf, other]
-
Title: Semi-Local Convolutions for LiDAR Scan ProcessingComments: arXiv admin note: text overlap with arXiv:2004.11803Journal-ref: ICBINB Workshop at NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1118] arXiv:2111.15624 [pdf, other]
-
Title: Image Style Transfer and Content-Style DisentanglementComments: 10 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1119] arXiv:2111.15637 [pdf, ps, other]
-
Title: Building extraction with vision transformerComments: Submitted to TGRSSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1120] arXiv:2111.15639 [pdf, other]
-
Title: DeDUCE: Generating Counterfactual Explanations EfficientlyComments: Presented at the 1st Workshop on eXplainable AI approaches for debugging and diagnosis (XAI4Debugging@NeurIPS2021)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [1121] arXiv:2111.15640 [pdf, other]
-
Title: Diffusion Autoencoders: Toward a Meaningful and Decodable RepresentationComments: Please visit our project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1122] arXiv:2111.15651 [pdf, other]
-
Title: Leveraging The Topological Consistencies of Learning in Deep Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1123] arXiv:2111.15656 [pdf, other]
-
Title: Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1124] arXiv:2111.15666 [pdf, other]
-
Title: HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image EditingComments: Accepted to CVPR 2022; Project page available at this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1125] arXiv:2111.15667 [pdf, other]
-
Title: Adaptive Token Sampling For Efficient Vision TransformersAuthors: Mohsen Fayyaz, Soroush Abbasi Koohpayegani, Farnoush Rezaei Jafari, Sunando Sengupta, Hamid Reza Vaezi Joze, Eric Sommerlade, Hamed Pirsiavash, Juergen GallComments: ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1126] arXiv:2111.15668 [pdf, other]
-
Title: AdaViT: Adaptive Vision Transformers for Efficient Image RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1127] arXiv:2111.15669 [pdf, other]
-
Title: 360MonoDepth: High-Resolution 360° Monocular Depth EstimationComments: CVPR 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1128] arXiv:2111.15672 [pdf, other]
-
Title: Unsupervised Domain Adaptation: A Reality CheckSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1129] arXiv:2111.00110 (cross-list from cs.LG) [pdf, other]
-
Title: FC2T2: The Fast Continuous Convolutional Taylor Transform with Applications in Vision and GraphicsSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1130] arXiv:2111.00124 (cross-list from cs.LG) [pdf, other]
-
Title: Predicting Atlantic Multidecadal VariabilityComments: 7 pages, 3 figuresJournal-ref: Tackling Climate Change with Machine Learning workshop at NeurIPS 2021Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Atmospheric and Oceanic Physics (physics.ao-ph)
- [1131] arXiv:2111.00210 (cross-list from cs.LG) [pdf, other]
-
Title: Mastering Atari Games with Limited DataComments: Published at NeurIPS 2021; Homepage: this https URLSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1132] arXiv:2111.00295 (cross-list from cs.LG) [pdf, other]
-
Title: Get Fooled for the Right Reason: Improving Adversarial Robustness through a Teacher-guided Curriculum Learning ApproachComments: 16 pages, 9 figures, Accepted at NeurIPS 2021, Code at this https URLSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1133] arXiv:2111.00417 (cross-list from cs.MM) [pdf, other]
-
Title: Hierarchical Deep Residual Reasoning for Temporal Moment LocalizationSubjects: Multimedia (cs.MM); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
- [1134] arXiv:2111.00619 (cross-list from cs.LG) [pdf, other]
-
Title: PIE: Pseudo-Invertible EncoderSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1135] arXiv:2111.00629 (cross-list from cs.MM) [pdf, other]
-
Title: Distantly Supervised Semantic Text Detection and Recognition for Broadcast Sports Videos UnderstandingComments: 9 pages, 7 figures and 6 tables. To be published in the proceedings of ACM Multimedia 21, Industrial Track, held from October 20-24 in ChinaSubjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [1136] arXiv:2111.00743 (cross-list from cs.LG) [pdf, other]
-
Title: Towards the Generalization of Contrastive Self-Supervised LearningComments: Accepted by ICLR 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1137] arXiv:2111.00758 (cross-list from cs.IR) [pdf, ps, other]
-
Title: Single-Item Fashion Recommender: Towards Cross-Domain RecommendationsAuthors: Seyed Omid Mohammadi, Hossein Bodaghi, Ahmad Kalhor (University of Tehran, College of Engineering, School of Electrical and Computer Engineering, Tehran, Iran)Comments: 5 Pages, 6 Figures, 1 TableJournal-ref: IEEE 2022 30th International Conference on Electrical Engineering (ICEE), 2022, pp. 12-16Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1138] arXiv:2111.00909 (cross-list from cs.LG) [pdf, other]
-
Title: Multi-Attribute Balanced Sampling for Disentangled GAN ControlsAuthors: Perla Doubinsky (CEDRIC - VERTIGO, CNAM), Nicolas Audebert (CEDRIC - VERTIGO, CNAM), Michel Crucianu (CEDRIC - VERTIGO, CNAM), Hervé Le Borgne (LIST)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1139] arXiv:2111.00947 (cross-list from cs.LG) [pdf, other]
-
Title: Nested Multiple Instance Learning with Attention MechanismsComments: Submitted to ICIP 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1140] arXiv:2111.01067 (cross-list from cs.GR) [pdf, other]
-
Title: OctField: Hierarchical Implicit Functions for 3D ModelingComments: 13 pages, 9 figures, NeurIPS 2021Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [1141] arXiv:2111.01135 (cross-list from cs.LG) [pdf, other]
-
Title: Arch-Net: Model Distillation for Architecture Agnostic Model DeploymentComments: 15 pages, 6 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1142] arXiv:2111.01245 (cross-list from cs.RO) [pdf, other]
-
Title: Learning Eye-in-Hand Camera Calibration from a Single ImageComments: Published at the 2021 Conference on Robot Learning (CoRL). Webpage and video: this https URLSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1143] arXiv:2111.01549 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Overcoming Catastrophic Forgetting in Incremental Few-Shot Learning by Finding Flat MinimaSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1144] arXiv:2111.01584 (cross-list from cs.LG) [pdf, other]
-
Title: Fitness Landscape Footprint: A Framework to Compare Neural Architecture Search ProblemsComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [1145] arXiv:2111.01592 (cross-list from cs.RO) [pdf, other]
-
Title: Trajectory Prediction with Graph-based Dual-scale Context FusionComments: Accepted by IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022. Code: this https URLSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1146] arXiv:2111.01674 (cross-list from cs.RO) [pdf, other]
-
Title: Minimizing Energy Consumption Leads to the Emergence of Gaits in Legged RobotsComments: CoRL 2021. Website at this https URLSubjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1147] arXiv:2111.01714 (cross-list from cs.LG) [pdf, other]
-
Title: Meta-Learning the Search Distribution of Black-Box Random Search Based Adversarial AttacksComments: accepted at NeurIPS 2021; updated the numbers in Table 5 and added references; added acknowledgementsSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1148] arXiv:2111.01742 (cross-list from cs.LG) [pdf, ps, other]
-
Title: LogAvgExp Provides a Principled and Performant Global Pooling OperatorSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1149] arXiv:2111.01760 (cross-list from cs.NE) [pdf, other]
-
Title: Increasing Liquid State Machine Performance with Edge-of-Chaos Dynamics Organized by Astrocyte-modulated PlasticityComments: 23 pages, 9 figures, NeurIPS 2021Journal-ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
- [1150] arXiv:2111.02044 (cross-list from cs.AI) [pdf, other]
-
Title: Categorical Difference and Related Brain Regions of the Attentional Blink EffectComments: Accepted in PhotonIcs and Electromagnetics Research Symposium (PIERS) 2021Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
- [1151] arXiv:2111.02168 (cross-list from cs.LG) [pdf, other]
-
Title: The Klarna Product Page Dataset: Web Element Nomination with Graph Neural Networks and Large Language ModelsComments: 12 pages, 8 figures, 3 tables, under reviewSubjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR)
- [1152] arXiv:2111.02327 (cross-list from cs.HC) [pdf, other]
-
Title: ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving VehicleJournal-ref: In Proceedings of the 2021 International Conference on Multimodal Interaction, pp. 318-327. 2021Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
- [1153] arXiv:2111.02399 (cross-list from cs.LG) [pdf, other]
-
Title: Learning Pruned Structure and Weights Simultaneously from Scratch: an Attention based ApproachSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1154] arXiv:2111.02400 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Deep AUC Maximization for Medical Image Classification: Challenges and OpportunitiesAuthors: Tianbao YangComments: Medical Imaging meets NeurIPS 2021 workshopSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Optimization and Control (math.OC)
- [1155] arXiv:2111.02452 (cross-list from cs.CY) [pdf, other]
-
Title: Slapping Cats, Bopping Heads, and Oreo Shakes: Understanding Indicators of Virality in TikTok Short VideosSubjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV)
- [1156] arXiv:2111.02625 (cross-list from cs.LG) [pdf, other]
-
Title: Qimera: Data-free Quantization with Synthetic Boundary Supporting SamplesComments: Accepted to Neurips 2021Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1157] arXiv:2111.02732 (cross-list from cs.LG) [pdf, other]
-
Title: When Neural Networks Using Different Sensors Create Similar FeaturesJournal-ref: EAI MobiCase 2021, Nov 2021, Online, ChinaSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1158] arXiv:2111.02736 (cross-list from cs.LG) [pdf, other]
-
Title: Deep Learning Methods for Daily Wildfire Danger ForecastingAuthors: Ioannis Prapas, Spyros Kondylatos, Ioannis Papoutsis, Gustau Camps-Valls, Michele Ronco, Miguel-Ángel Fernández-Torres, Maria Piles Guillem, Nuno CarvalhaisComments: Accepted to the workshop on Artificial Intelligence for Humanitarian Assistance and Disaster Response at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1159] arXiv:2111.02865 (cross-list from cs.LG) [pdf, other]
-
Title: Testing using Privileged Information by Adapting Features with Statistical DependenceComments: Published at ICCV 2021. Webpage: this http URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1160] arXiv:2111.02870 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Extended Abstract Version: CNN-based Human Detection System for UAVs in Search and RescueAuthors: Nikite MesvanComments: 3 pages, 5 figures. arXiv admin note: substantial text overlap with arXiv:2110.01930Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1161] arXiv:2111.02995 (cross-list from cs.LG) [pdf, other]
-
Title: Unsupervised Change Detection of Extreme Events Using ML On-BoardAuthors: Vít Růžička, Anna Vaughan, Daniele De Martini, James Fulton, Valentina Salvatelli, Chris Bridges, Gonzalo Mateo-Garcia, Valentina ZantedeschiComments: 5 pages (+2 in appendix), 5 figures (+1 in appendix), 2 tables (+3 in appendix), NeurIPS Workshop on Artificial Intelligence for Humanitarian Assistance and Disaster Response Workshop (AI+HADR), 2021Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1162] arXiv:2111.03062 (cross-list from cs.RO) [pdf, other]
-
Title: Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task LearningComments: Website at this https URLSubjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
- [1163] arXiv:2111.03162 (cross-list from cs.LG) [pdf, other]
-
Title: GraN-GAN: Piecewise Gradient Normalization for Generative Adversarial NetworksComments: WACV 2022 Main Conference Paper (Submitted: 18 Aug 2021, Accepted: 4 Oct 2021)Journal-ref: 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022, pp. 2432-2441Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1164] arXiv:2111.03472 (cross-list from cs.CR) [pdf, ps, other]
-
Title: BiosecurID: a multimodal biometric databaseAuthors: Julian Fierrez, Javier Galbally, Javier Ortega-Garcia, Manuel R Freire, Fernando Alonso-Fernandez, Daniel Ramos, Doroteo Torre Toledano, Joaquin Gonzalez-Rodriguez, Juan A Siguenza, Javier Garrido-Salas, E Anguiano, Guillermo Gonzalez-de-Rivera, Ricardo Ribalda, Marcos Faundez-Zanuy, JA Ortega, Valentín Cardeñoso-Payo, A Viloria, Carlos E Vivaracho, Q Isaac Moro, Juan J Igarza, J Sanchez, Inmaculada Hernaez, Carlos Orrite-Urunuela, Francisco Martinez-Contreras, Juan José Gracia-RocheComments: Published at Pattern Analysis and Applications journalSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1165] arXiv:2111.03536 (cross-list from cs.LG) [src]
-
Title: A Unified Game-Theoretic Interpretation of Adversarial RobustnessAuthors: Jie Ren, Die Zhang, Yisen Wang, Lu Chen, Zhanpeng Zhou, Yiting Chen, Xu Cheng, Xin Wang, Meng Zhou, Jie Shi, Quanshi ZhangComments: the previous version is arXiv:2103.07364, but I mistakenly apply a new ID for the paperSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1166] arXiv:2111.03702 (cross-list from cs.LG) [pdf, other]
-
Title: Reconstructing Training Data from Diverse ML Models by Ensemble InversionComments: 9 pages, 8 figures, WACV 2022Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1167] arXiv:2111.03759 (cross-list from cs.LG) [pdf, other]
-
Title: MQBench: Towards Reproducible and Deployable Model Quantization BenchmarkAuthors: Yuhang Li, Mingzhu Shen, Jian Ma, Yan Ren, Mingxin Zhao, Qi Zhang, Ruihao Gong, Fengwei Yu, Junjie YanComments: Accepted by 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and BenchmarksSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1168] arXiv:2111.03797 (cross-list from cs.GR) [pdf, other]
-
Title: Neural BRDFs: Representation and OperationsSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [1169] arXiv:2111.03987 (cross-list from cs.RO) [pdf, other]
-
Title: V-MAO: Generative Modeling for Multi-Arm Manipulation of Articulated ObjectsComments: CoRL 2021Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1170] arXiv:2111.03994 (cross-list from cs.HC) [src]
-
Title: NarrationBot and InfoBot: A Hybrid System for Automated Video DescriptionAuthors: Shasta Ihorn, Yue-Ting Siu, Aditya Bodi, Lothar Narins, Jose M. Castanon, Yash Kant, Abhishek Das, Ilmi Yoon, Pooyan FazliComments: arXiv admin note: This article has been withdrawn by arXiv administration due to an unresolvable authorship disputeSubjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1171] arXiv:2111.04045 (cross-list from cs.CL) [pdf, other]
-
Title: Information Extraction from Visually Rich Documents with Font Style EmbeddingsAuthors: Ismail Oussaid, William Vanhuffel, Pirashanth Ratnamogan, Mhamed Hajaiej, Alexis Mathey, Thomas GillesJournal-ref: 26th International Conference on Pattern Recognition (ICPR), 2022Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1172] arXiv:2111.04064 (cross-list from cs.HC) [pdf, other]
-
Title: Can viewer proximity be a behavioural marker for Autism Spectrum Disorder?Authors: Rahul Bishain, Bhismadev Chakrabarti, Jayashree Dasgupta, Indu Dubey, Sharat Chandran (on behalf of the START consortium)Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
- [1173] arXiv:2111.04096 (cross-list from cs.RO) [pdf, other]
-
Title: Online Mutual Adaptation of Deep Depth Prediction and Visual SLAMComments: 11 pages, 6 figuresSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1174] arXiv:2111.04101 (cross-list from cs.RO) [pdf, other]
-
Title: Hierarchical Segment-based Optimization for SLAMComments: IROS 2021Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1175] arXiv:2111.04265 (cross-list from cs.CG) [pdf, other]
-
Title: Adaptive area-preserving parameterization of open and closed anatomical surfacesJournal-ref: Computers in Biology and Medicine, 105715 (2022)Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [1176] arXiv:2111.04313 (cross-list from cs.LG) [pdf, other]
-
Title: A Relational Model for One-Shot ClassificationComments: Published at ESANN 2021Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1177] arXiv:2111.04318 (cross-list from cs.LG) [pdf, other]
-
Title: Auto-Encoding Knowledge Graph for Unsupervised Medical Report GenerationSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1178] arXiv:2111.04345 (cross-list from cs.LG) [pdf, other]
-
Title: Off-policy Imitation Learning from Visual InputsSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1179] arXiv:2111.04394 (cross-list from cs.CR) [pdf, other]
-
Title: Get a Model! Model Hijacking Attack Against Machine Learning ModelsComments: To Appear in NDSS 2022Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1180] arXiv:2111.04578 (cross-list from cs.LG) [pdf, other]
-
Title: Improved Regularization and Robustness for Fine-tuning in Neural NetworksComments: 22 pages, 6 figures, 11 tablesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1181] arXiv:2111.04625 (cross-list from cs.CR) [pdf, other]
-
Title: DeepSteal: Advanced Model Extractions Leveraging Efficient Weight Stealing in MemoriesSubjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1182] arXiv:2111.04639 (cross-list from cs.LG) [pdf, other]
-
Title: S3RP: Self-Supervised Super-Resolution and Prediction for Advection-Diffusion ProcessComments: 9 pages, 8 figuresJournal-ref: Neural Information Processing Systems (NeurIPS 2021) WorkshopSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Computational Physics (physics.comp-ph)
- [1183] arXiv:2111.04670 (cross-list from cs.LG) [pdf, other]
-
Title: Approximate Neural Architecture Search via Operation Distribution LearningComments: WACV 2022. 10 pages, 3 figures and 5 tables (15 pages, 7 figures and 6 tables including appendices)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1184] arXiv:2111.04682 (cross-list from cs.LG) [pdf, other]
-
Title: SMU: smooth activation function for deep networks using smoothing maximum techniqueComments: 7 pagesSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [1185] arXiv:2111.04710 (cross-list from cs.CR) [pdf, other]
-
Title: OMD: Orthogonal Malware Detection Using Audio, Image, and Static FeaturesAuthors: Lakshmanan Nataraj, Tajuddin Manhar Mohammed, Tejaswi Nanjundaswamy, Satish Chikkagoudar, Shivkumar Chandrasekaran, B.S. ManjunathComments: Submitted version - MILCOM 2021 IEEE Military Communications ConferenceSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
- [1186] arXiv:2111.04724 (cross-list from cs.LG) [pdf, other]
-
Title: SustainBench: Benchmarks for Monitoring the Sustainable Development Goals with Machine LearningAuthors: Christopher Yeh, Chenlin Meng, Sherrie Wang, Anne Driscoll, Erik Rozi, Patrick Liu, Jihyeon Lee, Marshall Burke, David B. Lobell, Stefano ErmonComments: NeurIPS 2021 (Track on Datasets and Benchmarks)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1187] arXiv:2111.04798 (cross-list from cs.LG) [pdf, other]
-
Title: TAGLETS: A System for Automatic Semi-Supervised Learning with Auxiliary DataAuthors: Wasu Piriyakulkij, Cristina Menghini, Ross Briden, Nihal V. Nayak, Jeffrey Zhu, Elaheh Raisi, Stephen H. BachComments: Paper published at MLSys 2022. It passed the artifact evaluation earning two ACM badges: (1) Artifacts Evaluated Functional v1.1 and (2) Artifacts Available v1.1Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1188] arXiv:2111.04823 (cross-list from cs.CL) [pdf, other]
-
Title: Cascaded Multilingual Audio-Visual Learning from VideosAuthors: Andrew Rouditchenko, Angie Boggust, David Harwath, Samuel Thomas, Hilde Kuehne, Brian Chen, Rameswar Panda, Rogerio Feris, Brian Kingsbury, Michael Picheny, James GlassComments: Presented at Interspeech 2021. This version contains updated results using the YouCook-Japanese datasetSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
- [1189] arXiv:2111.04901 (cross-list from cs.LG) [pdf, other]
-
Title: Label-Aware Distribution Calibration for Long-tailed ClassificationComments: 9 pagesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1190] arXiv:2111.05073 (cross-list from cs.LG) [pdf, other]
-
Title: MixACM: Mixup-Based Robustness Transfer via Distillation of Activated Channel MapsComments: Accepted by NeurIPS 2021Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1191] arXiv:2111.05308 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Using The Feedback of Dynamic Active-Pixel Vision Sensor (Davis) to Prevent Slip in Real TimeAuthors: Armin Masoumian, Pezhman kazemi, Mohammad Chehreghani Montazer, Hatem A. Rashwan, Domenec Puig VallsComments: 5 pages, Accepted for The 6th International Conference on Mechatronics and Robotics Engineering (ICMRE 2020)Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1192] arXiv:2111.05309 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Designing and Analyzing the PID and Fuzzy Control System for an Inverted PendulumAuthors: Armin Masoumian, Pezhman kazemi, Mohammad Chehreghani Montazer, Hatem A. Rashwan, Domenec Puig VallsComments: 5 pages, Accepted for The 6th International Conference on Mechatronics and Robotics Engineering (ICMRE 2020)Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1193] arXiv:2111.05393 (cross-list from cs.LG) [pdf, other]
-
Title: Object-Centric Representation Learning with Generative Spatial-Temporal FactorizationComments: Accepted at NeurIPS 2021Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1194] arXiv:2111.05423 (cross-list from cs.LG) [pdf, other]
-
Title: Efficient Data Compression for 3D Sparse TPC via Bicephalous Convolutional AutoencoderComments: 6 pages, 6 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1195] arXiv:2111.05529 (cross-list from cs.LG) [pdf, other]
-
Title: Understanding the Generalization Benefit of Model Invariance from a Data PerspectiveComments: Accepted to NeurIPS 2021. Version 2 includes several content clarifications and image format revisionsSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1196] arXiv:2111.05623 (cross-list from cs.RO) [pdf, other]
-
Title: FabricFlowNet: Bimanual Cloth Manipulation with a Flow-based PolicyComments: CoRL 2021Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1197] arXiv:2111.05663 (cross-list from cs.CG) [pdf, other]
-
Title: The Impact of Changes in Resolution on the Persistent Homology of ImagesAuthors: Teresa Heiss, Sarah Tymochko, Brittany Story, Adélie Garin, Hoa Bui, Bea Bleile, Vanessa RobinsComments: accepted for the IEEE Big Data 2021 workshop: Applications of Topological Data Analysis to 'Big Data'Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Algebraic Topology (math.AT)
- [1198] arXiv:2111.05685 (cross-list from cs.LG) [pdf, other]
-
Title: Efficient Neural Network Training via Forward and Backward Propagation SparsificationSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1199] arXiv:2111.05736 (cross-list from cs.IR) [pdf, other]
-
Title: Multimodal Approach for Metadata Extraction from German Scientific PublicationsComments: 8 pages, 5 figures, 4 tablesSubjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1200] arXiv:2111.05778 (cross-list from cs.GR) [pdf, other]
-
Title: Theoretical and Empirical Analysis of a Fast Algorithm for Extracting Polygons from Signed Distance BoundsSubjects: Graphics (cs.GR); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
- [1201] arXiv:2111.05794 (cross-list from cs.HC) [pdf, other]
-
Title: PIMIP: An Open Source Platform for Pathology Information Management and IntegrationAuthors: Jialun Wu, Anyu Mao, Xinrui Bao, Haichuan Zhang, Zeyu Gao, Chunbao Wang, Tieliang Gong, Chen LiComments: BIBM 2021 accepted, including 8 pages, 8 figuresSubjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1202] arXiv:2111.05814 (cross-list from cs.LG) [pdf, other]
-
Title: SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal RetrievalAuthors: Minyoung KimSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1203] arXiv:2111.05846 (cross-list from cs.SD) [pdf, other]
-
Title: Structure from Silence: Learning Scene Structure from Ambient SoundComments: Accepted to CoRL 2021 (Oral Presentation)Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Robotics (cs.RO); Audio and Speech Processing (eess.AS)
- [1204] arXiv:2111.05849 (cross-list from cs.GR) [pdf, other]
-
Title: Advances in Neural RenderingAuthors: Ayush Tewari, Justus Thies, Ben Mildenhall, Pratul Srinivasan, Edgar Tretschk, Yifan Wang, Christoph Lassner, Vincent Sitzmann, Ricardo Martin-Brualla, Stephen Lombardi, Tomas Simon, Christian Theobalt, Matthias Niessner, Jonathan T. Barron, Gordon Wetzstein, Michael Zollhoefer, Vladislav GolyanikComments: 33 pages, 14 figures, 5 tables; State of the Art Report at EUROGRAPHICS 2022Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [1205] arXiv:2111.05934 (cross-list from cs.RO) [pdf, other]
-
Title: A soft thumb-sized vision-based sensor with accurate all-round force perceptionComments: 1 table, 5 figures, 24 pages for the main manuscript. 5 tables, 12 figures, 27 pages for the supplementary material. 8 supplementary videosSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
- [1206] arXiv:2111.05950 (cross-list from cs.LG) [pdf, other]
-
Title: Self-Compression in Bayesian Neural NetworksComments: submitted to 2020 IEEE International Workshop on Machine Learning for Signal ProcessingSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1207] arXiv:2111.05953 (cross-list from cs.LG) [pdf, other]
-
Title: Robust Learning via Ensemble Density Propagation in Deep Neural NetworksComments: submitted to 2020 IEEE International Workshop on Machine Learning for Signal ProcessingSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Probability (math.PR)
- [1208] arXiv:2111.05955 (cross-list from cs.LG) [pdf, other]
-
Title: Keys to Accurate Feature Extraction Using Residual Spiking Neural NetworksComments: 17 pages, 6 figures, 17 tablesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1209] arXiv:2111.06155 (cross-list from cs.LG) [pdf, ps, other]
-
Title: A Novel Approach for Deterioration and Damage Identification in Building Structures Based on Stockwell-Transform and Deep Convolutional Neural NetworkAuthors: Vahidreza Gharehbaghi, Hashem Kalbkhani, Ehsan Noroozinejad Farsangi, T.Y. Yang, Andy Nguyen, Seyedali Mirjalili, C. Malaga-ChuquitaypeComments: 11 figures and 11 Tables, Accepted in Journal of Structural Integrity and MaintenanceSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1210] arXiv:2111.06206 (cross-list from cs.LG) [pdf, other]
-
Title: Defining and Quantifying the Emergence of Sparse Concepts in DNNsSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1211] arXiv:2111.06236 (cross-list from cs.LG) [pdf, other]
-
Title: Discovering and Explaining the Representation Bottleneck of DNNsSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1212] arXiv:2111.06263 (cross-list from cs.NI) [pdf, other]
-
Title: Towards Live Video Analytics with On-Drone Deeper-yet-Compatible CompressionSubjects: Networking and Internet Architecture (cs.NI); Computer Vision and Pattern Recognition (cs.CV)
- [1213] arXiv:2111.06387 (cross-list from cs.LG) [pdf, other]
-
Title: Learning Signal-Agnostic Manifolds of Neural FieldsComments: NeurIPS 2021, additional results and code at this https URLSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1214] arXiv:2111.06389 (cross-list from cs.RO) [pdf, other]
-
Title: Full-Body Visual Self-Modeling of Robot MorphologiesComments: Project website: this https URLSubjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
- [1215] arXiv:2111.06449 (cross-list from cs.AI) [pdf, other]
-
Title: Expert Human-Level Driving in Gran Turismo Sport Using Deep Reinforcement Learning with Image-based RepresentationComments: Accepted at Deep Reinforcement Learning Workshop at Neural Information Processing Systems 2021Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1216] arXiv:2111.06517 (cross-list from cs.GR) [pdf, other]
-
Title: Neuromuscular Control of the Face-Head-Neck Biomechanical Complex With Learning-Based Expression Transfer From Images and VideosComments: 12 pages, 7 figures, 2 tablesSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [1217] arXiv:2111.06628 (cross-list from cs.LG) [pdf, other]
-
Title: Learning to Break Deep Perceptual Hashing: The Use Case NeuralHashComments: Accepted by ACM FAccT 2022 as OralSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1218] arXiv:2111.06643 (cross-list from cs.SD) [pdf, other]
-
Title: Fully Automatic Page Turning on Real ScoresComments: ISMIR 2021 Late Breaking/DemoSubjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
- [1219] arXiv:2111.06889 (cross-list from cs.LG) [pdf, other]
-
Title: DriverGym: Democratising Reinforcement Learning for Autonomous DrivingComments: Accepted to NeurIPS 2021 ML4AD WorkshopSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1220] arXiv:2111.06977 (cross-list from cs.LG) [pdf, other]
-
Title: Scalable Diverse Model Selection for Accessible Transfer LearningComments: NeurIPS 2021 camera ready v2 + Appendix. Added a missing citation and fixed Table 4 header. Table 1 is still purple. No, I do not know whySubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1221] arXiv:2111.07074 (cross-list from cs.LG) [pdf, other]
-
Title: Memotion Analysis through the Lens of Joint EmbeddingComments: Accepted as Student Abstract at AAAI-22Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1222] arXiv:2111.07228 (cross-list from cs.LG) [pdf, other]
-
Title: Curriculum Learning for Vision-and-Language NavigationComments: Accepted by NeurIPS 2021Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1223] arXiv:2111.07346 (cross-list from cs.IR) [pdf, ps, other]
-
Title: A Study on the Efficient Product Search Service for the Damaged Image InformationAuthors: Yonghyun KimComments: 5 pages, 8 figuresSubjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV)
- [1224] arXiv:2111.07640 (cross-list from cs.AI) [pdf, other]
-
Title: AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head ReenactmentComments: 40 pages; Accepted to ECCV 2022; code and dataset URL addedSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1225] arXiv:2111.07668 (cross-list from cs.LG) [pdf, other]
-
Title: Fast Axiomatic Attribution for Neural NetworksComments: To appear at NeurIPS*2021. Project page and code: this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1226] arXiv:2111.07737 (cross-list from cs.LG) [pdf, other]
-
Title: Progress in Self-Certified Neural NetworksAuthors: Maria Perez-Ortiz, Omar Rivasplata, Emilio Parrado-Hernandez, Benjamin Guedj, John Shawe-TaylorJournal-ref: Published at NeurIPS 2021 workshop: Bayesian Deep LearningSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1227] arXiv:2111.07775 (cross-list from cs.LG) [pdf, other]
-
Title: Learning Representations for Pixel-based Control: What Matters and Why?Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1228] arXiv:2111.07785 (cross-list from cs.NE) [pdf, other]
-
Title: Spiking CapsNet: A Spiking Neural Network With A Biologically Plausible Routing Rule Between CapsulesSubjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
- [1229] arXiv:2111.07928 (cross-list from cs.LG) [pdf, other]
-
Title: Target Layer Regularization for Continual Learning Using Cramer-Wold GeneratorComments: The paper is under consideration at Computer Vision and Image UnderstandingSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1230] arXiv:2111.07942 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Fully Linear Graph Convolutional Networks for Semi-Supervised Learning and ClusteringComments: Under review by IEEE Trans. xxxSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1231] arXiv:2111.07975 (cross-list from cs.RO) [pdf, other]
-
Title: Semantically Grounded Object Matching for Robust Robotic Scene RearrangementComments: 8 pages, 5 figuresSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1232] arXiv:2111.08163 (cross-list from cs.LG) [pdf, other]
-
Title: An Underexplored Dilemma between Confidence and Calibration in Quantized Neural NetworksComments: Accepted at I (Still) Can't Believe It's Not Better Workshop at NeurIPS 2021Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1233] arXiv:2111.08165 (cross-list from cs.LG) [pdf, other]
-
Title: RapidRead: Global Deployment of State-of-the-art Radiology AI for a Large Veterinary Teleradiology PracticeAuthors: Michael Fitzke, Conrad Stack, Andre Dourson, Rodrigo M. B. Santana, Diane Wilson, Lisa Ziemer, Arjun Soin, Matthew P. Lungren, Paul Fisher, Mark ParkinsonSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1234] arXiv:2111.08226 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Comparative Analysis of Machine Learning Models for Predicting Travel TimeSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1235] arXiv:2111.08276 (cross-list from cs.CL) [pdf, other]
-
Title: Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual ConceptsComments: ICML 2022Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1236] arXiv:2111.08398 (cross-list from cs.RO) [pdf, other]
-
Title: 2.5D Vehicle Odometry EstimationAuthors: Ciaran Eising, Leroy-Francisco Pereira, Jonathan Horgan, Anbuchezhiyan Selvaraju, John McDonald, Paul MoranComments: 13 pages, 16 figures, 2 tablesJournal-ref: IET Intelligent Transport Systems, 2020Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1237] arXiv:2111.08409 (cross-list from cs.LG) [pdf, other]
-
Title: Grounding Psychological Shape Space in Convolutional Neural NetworksComments: accepted at CIFMA2021 (this https URL)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1238] arXiv:2111.08429 (cross-list from cs.CR) [pdf, other]
-
Title: An Overview of Backdoor Attacks Against Deep Neural Networks and Possible DefencesSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1239] arXiv:2111.08566 (cross-list from cs.DB) [pdf, other]
-
Title: SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor SearchAuthors: Qi Chen, Bing Zhao, Haidong Wang, Mingqin Li, Chuanjie Liu, Zengzhong Li, Mao Yang, Jingdong WangComments: Accepted to NeurIPS 2021Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [1240] arXiv:2111.08575 (cross-list from cs.RO) [pdf, other]
-
Title: GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous DrivingSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1241] arXiv:2111.08591 (cross-list from cs.LG) [pdf, other]
-
Title: Robustness of Bayesian Neural Networks to White-Box Adversarial AttacksComments: Accepted at the fourth IEEE International Conference on Artificial Intelligence and Knowledge Engineering (AIKE 2021)Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1242] arXiv:2111.08851 (cross-list from cs.LG) [pdf, other]
-
Title: Deep Neural Networks for Rank-Consistent Ordinal Regression Based On Conditional ProbabilitiesComments: Accepted for publication in Pattern Analysis and ApplicationsJournal-ref: Pattern Analysis and Applications 2023Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1243] arXiv:2111.08896 (cross-list from cs.CL) [pdf, other]
-
Title: Achieving Human Parity on Visual Question AnsweringAuthors: Ming Yan, Haiyang Xu, Chenliang Li, Junfeng Tian, Bin Bi, Wei Wang, Weihua Chen, Xianzhe Xu, Fan Wang, Zheng Cao, Zhicheng Zhang, Qiyu Zhang, Ji Zhang, Songfang Huang, Fei Huang, Luo Si, Rong JinSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1244] arXiv:2111.08940 (cross-list from cs.CL) [pdf, other]
-
Title: Transparent Human Evaluation for Image CaptioningAuthors: Jungo Kasai, Keisuke Sakaguchi, Lavinia Dunagan, Jacob Morrison, Ronan Le Bras, Yejin Choi, Noah A. SmithComments: Proc. of NAACL 2022Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1245] arXiv:2111.09030 (cross-list from cs.LG) [pdf, other]
-
Title: Trustworthy Long-Tailed ClassificationComments: IEEE Conference on Computer Vision and Pattern Recognition 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1246] arXiv:2111.09076 (cross-list from cs.LG) [pdf, other]
-
Title: To Trust or Not To Trust Prediction Scores for Membership Inference AttacksComments: 15 pages, 8 figures, 10 tablesSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1247] arXiv:2111.09204 (cross-list from cs.RO) [pdf, other]
-
Title: Tiny Obstacle Discovery by Occlusion-Aware Multilayer RegressionComments: Published in Transaction on Image Processing 2021Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1248] arXiv:2111.09389 (cross-list from cs.LG) [pdf, other]
-
Title: Low Precision Decentralized Distributed Training over IID and non-IID DataComments: 11 pages, 7 figures, 9 tablesJournal-ref: Neural Networks 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
- [1249] arXiv:2111.09463 (cross-list from cs.LG) [pdf, other]
-
Title: Self-Attending Task Generative Adversarial Network for Realistic Satellite Image CreationComments: 9 pages, 11 figures, 1 table, to be published in IEEE Aerospace 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1250] arXiv:2111.09497 (cross-list from cs.RO) [pdf, other]
-
Title: Lidar with Velocity: Correcting Moving Objects Point Cloud Distortion from Oscillating Scanning Lidars by Fusion with CameraSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1251] arXiv:2111.09613 (cross-list from cs.LG) [pdf, other]
-
Title: Improving Transferability of Representations via Augmentation-Aware Self-SupervisionComments: Accepted to NeurIPS 2021Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1252] arXiv:2111.09642 (cross-list from cs.SD) [pdf, other]
-
Title: Towards Intelligibility-Oriented Audio-Visual Speech EnhancementComments: 6 pages, 4 figuresSubjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
- [1253] arXiv:2111.09793 (cross-list from cs.RO) [pdf, other]
-
Title: Unsupervised Online Learning for Robotic Interestingness with Visual MemoryComments: Accepted to The IEEE Transactions on Robotics (T-RO). A substantial extension of the ECCV 2020 paper arXiv:2005.08829Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1254] arXiv:2111.09808 (cross-list from cs.LG) [pdf, other]
-
Title: Exploring the Limits of Epistemic Uncertainty Quantification in Low-Shot SettingsAuthors: Matias Valdenegro-ToroComments: 7 pages, 3 figures, with supplementary material. LatinX in AI Research Workshop @ NeurIPS 2021Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1255] arXiv:2111.09858 (cross-list from cs.LG) [pdf, other]
-
Title: Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement LearningComments: NeurIPS 2021. Video and code at this https URLSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1256] arXiv:2111.10050 (cross-list from cs.LG) [pdf, other]
-
Title: Combined Scaling for Zero-shot Transfer LearningAuthors: Hieu Pham, Zihang Dai, Golnaz Ghiasi, Kenji Kawaguchi, Hanxiao Liu, Adams Wei Yu, Jiahui Yu, Yi-Ting Chen, Minh-Thang Luong, Yonghui Wu, Mingxing Tan, Quoc V. LeSubjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1257] arXiv:2111.10130 (cross-list from cs.LG) [pdf, other]
-
Title: Fooling Adversarial Training with Inducing NoiseSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1258] arXiv:2111.10144 (cross-list from cs.LG) [pdf, other]
-
Title: Positional Encoder Graph Neural Networks for Geographic DataComments: AISTATS 2023Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1259] arXiv:2111.10245 (cross-list from cs.LG) [pdf, other]
-
Title: Ubi-SleepNet: Advanced Multimodal Fusion Techniques for Three-stage Sleep Classification Using Ubiquitous SensingComments: Accepted in IMWUT for 2021 Dec issueSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1260] arXiv:2111.10291 (cross-list from cs.LG) [pdf, other]
-
Title: Meta Adversarial PerturbationsComments: Published in AAAI 2022 WorkshopSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1261] arXiv:2111.10622 (cross-list from cs.LG) [pdf, other]
-
Title: SPINE: Soft Piecewise Interpretable Neural EquationsComments: 31 pages, 23 figures, was submitted to NeurIPS 2020Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [1262] arXiv:2111.10734 (cross-list from cs.LG) [pdf, other]
-
Title: Deep Probability EstimationAuthors: Sheng Liu, Aakash Kaku, Weicheng Zhu, Matan Leibovich, Sreyas Mohan, Boyang Yu, Haoxiang Huang, Laure Zanna, Narges Razavian, Jonathan Niles-Weed, Carlos Fernandez-GrandaComments: SL, AK, WZ, ML, SM contributed equally to this work; 36 pages, 17 figures, 12 tablesJournal-ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:13746-13781, 2022Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1263] arXiv:2111.10752 (cross-list from cs.LG) [pdf, other]
-
Title: Stochastic Variance Reduced Ensemble Adversarial Attack for Boosting the Adversarial TransferabilityComments: 11 pages, 6 figures, accepted by CVPR 2022Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1264] arXiv:2111.10756 (cross-list from cs.CL) [pdf, other]
-
Title: TraVLR: Now You See It, Now You Don't! A Bimodal Dataset for Evaluating Visio-Linguistic ReasoningComments: The first two authors contributed equallySubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1265] arXiv:2111.10763 (cross-list from cs.LG) [pdf, other]
-
Title: Decentralized Unsupervised Learning of Visual RepresentationsSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1266] arXiv:2111.10937 (cross-list from cs.LG) [pdf, other]
-
Title: Adaptive Transfer Learning: a simple but effective transfer learningAuthors: Jung H Lee, Henry J Kvinge, Scott Howland, Zachary New, John Buckheit, Lauren A. Phillips, Elliott Skomski, Jessica Hibler, Courtney D. Corley, Nathan O. HodasComments: 10 pages, 7 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1267] arXiv:2111.10991 (cross-list from cs.CR) [pdf, other]
-
Title: Backdoor Attack through Frequency DomainSubjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1268] arXiv:2111.11099 (cross-list from cs.RO) [pdf, other]
-
Title: Talk-to-Resolve: Combining scene understanding and spatial dialogue to resolve granular task ambiguity for a collocated robotComments: Accepted in Elsevier Journal of Robotics and Autonomous Systems (RAS)Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1269] arXiv:2111.11236 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Nanorobot queue: Cooperative treatment of cancer based on team member communication and image processingAuthors: Xinyu ZhouComments: 7pages,2figuresSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1270] arXiv:2111.11576 (cross-list from cs.LG) [pdf, other]
-
Title: Building Goal-Oriented Dialogue Systems with Situated Visual ContextSubjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1271] arXiv:2111.11581 (cross-list from cs.LG) [pdf, other]
-
Title: Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile AccelerationAuthors: Yifan Gong, Geng Yuan, Zheng Zhan, Wei Niu, Zhengang Li, Pu Zhao, Yuxuan Cai, Sijia Liu, Bin Ren, Xue Lin, Xulong Tang, Yanzhi WangSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
- [1272] arXiv:2111.11652 (cross-list from cs.LG) [pdf, other]
-
Title: CoDiM: Learning with Noisy Labels via Contrastive Semi-Supervised LearningAuthors: Xin Zhang, Zixuan Liu, Kaiwen Xiao, Tian Shen, Junzhou Huang, Wei Yang, Dimitris Samaras, Xiao HanComments: 19 Pages, 9 figures, conference paperSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1273] arXiv:2111.11828 (cross-list from cs.LG) [pdf, other]
-
Title: Variance Reduction in Deep Learning: More Momentum is All You NeedComments: 23 pages, 8 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1274] arXiv:2111.12062 (cross-list from cs.LG) [pdf, other]
-
Title: DABS: A Domain-Agnostic Benchmark for Self-Supervised LearningComments: NeurIPS 2021Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1275] arXiv:2111.12083 (cross-list from cs.RO) [pdf, other]
-
Title: VISTA 2.0: An Open, Data-driven Simulator for Multimodal Sensing and Policy Learning for Autonomous VehiclesAuthors: Alexander Amini, Tsun-Hsuan Wang, Igor Gilitschenski, Wilko Schwarting, Zhijian Liu, Song Han, Sertac Karaman, Daniela RusComments: First two authors contributed equally. Code and project website is available here: this https URLSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1276] arXiv:2111.12137 (cross-list from cs.RO) [pdf, other]
-
Title: Learning Interactive Driving Policies via Data-driven SimulationAuthors: Tsun-Hsuan Wang, Alexander Amini, Wilko Schwarting, Igor Gilitschenski, Sertac Karaman, Daniela RusComments: The first two authors contributed equally to this this work. Code is available here: this http URLSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1277] arXiv:2111.12170 (cross-list from cs.LG) [pdf, other]
-
Title: Domain-Agnostic Clustering with Self-DistillationComments: NeurIPS 2021 Workshop: Self-Supervised Learning - Theory and PracticeSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1278] arXiv:2111.12427 (cross-list from cs.LG) [pdf, other]
-
Title: Challenges of Adversarial Image AugmentationsComments: To appear at the ICBINB 2021 Neurips WorkshopSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1279] arXiv:2111.12579 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Water Care: Water Surface Cleaning Bot and Water Body Surveillance SystemAuthors: Harsh Sankar Naicker, Yash Srivastava, Akshara Pramod, Niket Paresh Ganatra, Deepakshi Sood, Saumya Singh, Velmathi GuruviahComments: This paper was presented in RIACT 2021, an international conference, and was selected for publication in springer special issue 2021Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1280] arXiv:2111.12772 (cross-list from cs.LG) [pdf, other]
-
Title: JoinABLe: Learning Bottom-up Assembly of Parametric CAD JointsAuthors: Karl D.D. Willis, Pradeep Kumar Jayaraman, Hang Chu, Yunsheng Tian, Yifei Li, Daniele Grandi, Aditya Sanghi, Linh Tran, Joseph G. Lambourne, Armando Solar-Lezama, Wojciech MatusikSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [1281] arXiv:2111.12798 (cross-list from cs.LG) [pdf, other]
-
Title: Geometric Priors for Scientific Generative Models in Inertial Confinement FusionAuthors: Ankita Shukla, Rushil Anirudh, Eugene Kur, Jayaraman J. Thiagarajan, Peer-Timo Bremer, Brian K. Spears, Tammy Ma, Pavan TuragaComments: 5 pages, 4 figures, Fourth Workshop on Machine Learning and the Physical Sciences, NeurIPS 2021Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1282] arXiv:2111.12965 (cross-list from cs.CR) [pdf, other]
-
Title: Towards Practical Deployment-Stage Backdoor Attack on Deep Neural NetworksSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1283] arXiv:2111.12990 (cross-list from cs.AI) [pdf, other]
-
Title: Learning Algebraic Representation for Systematic Generalization in Abstract ReasoningSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1284] arXiv:2111.13094 (cross-list from cs.GR) [pdf, other]
-
Title: Path Guiding Using Spatio-Directional Mixture ModelsComments: 17 pagesSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [1285] arXiv:2111.13129 (cross-list from cs.RO) [pdf, other]
-
Title: Robot Skill Adaptation via Soft Actor-Critic Gaussian Mixture ModelsAuthors: Iman Nematollahi, Erick Rosete-Beas, Adrian Röfer, Tim Welschehold, Abhinav Valada, Wolfram BurgardComments: Accepted at the 2022 IEEE International Conference on Robotics and Automation (ICRA)Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1286] arXiv:2111.13171 (cross-list from cs.LG) [pdf, other]
-
Title: Intrinsic Dimension, Persistent Homology and Generalization in Neural NetworksComments: Appears at NeurIPS 2021Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); General Topology (math.GN); Machine Learning (stat.ML)
- [1287] arXiv:2111.13236 (cross-list from cs.LG) [pdf, other]
-
Title: Joint inference and input optimization in equilibrium networksComments: Neurips 2021Journal-ref: Neurips 2021Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1288] arXiv:2111.13282 (cross-list from cs.LG) [pdf, other]
-
Title: Generative Adversarial Networks and Adversarial Autoencoders: Tutorial and SurveyComments: To appear as a part of an upcoming textbook on dimensionality reduction and manifold learningSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
- [1289] arXiv:2111.13330 (cross-list from cs.LG) [pdf, other]
-
Title: ArchRepair: Block-Level Architecture-Oriented Repairing for Deep Neural NetworksComments: 33 pages, 7 figuresSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1290] arXiv:2111.13350 (cross-list from cs.LG) [pdf, other]
-
Title: Jointly Learning Agent and Lane Information for Multimodal Trajectory PredictionSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1291] arXiv:2111.13420 (cross-list from cs.LG) [pdf, other]
-
Title: Confounder Identification-free Causal Visual Feature LearningComments: 21 pagesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1292] arXiv:2111.13545 (cross-list from cs.LG) [pdf, other]
-
Title: $μ$NCA: Texture Generation with Ultra-Compact Neural Cellular AutomataSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [1293] arXiv:2111.13606 (cross-list from cs.LG) [pdf, other]
-
Title: Conditional Image Generation with Score-Based Diffusion ModelsSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1294] arXiv:2111.13650 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Latent Space Smoothing for Individually Fair RepresentationsComments: ECCV 2022Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1295] arXiv:2111.13826 (cross-list from cs.RO) [pdf, other]
-
Title: Average Outward Flux Skeletons for Environment Mapping and Topology MatchingAuthors: Morteza Rezanejad, Babak Samari, Elham Karimi, Ioannis Rekleitis, Gregory Dudek, Kaleem SiddiqiSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1296] arXiv:2111.13839 (cross-list from cs.LG) [pdf, other]
-
Title: Towards Principled Disentanglement for Domain GeneralizationComments: CVPR 2022 OralSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1297] arXiv:2111.13984 (cross-list from cs.LG) [pdf, other]
-
Title: NCVX: A User-Friendly and Scalable Package for Nonconvex Optimization in Machine LearningComments: NCVX is available at this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Mathematical Software (cs.MS); Signal Processing (eess.SP); Optimization and Control (math.OC)
- [1298] arXiv:2111.14210 (cross-list from cs.CL) [pdf, other]
-
Title: Emergent Graphical Conventions in a Visual Communication GameSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1299] arXiv:2111.14213 (cross-list from cs.LG) [pdf, other]
-
Title: Local Learning Matters: Rethinking Data Heterogeneity in Federated LearningComments: CVPR 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
- [1300] arXiv:2111.14262 (cross-list from cs.HC) [pdf, ps, other]
-
Title: Customizing an Affective Tutoring System Based on Facial Expression and Head Pose EstimationSubjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1301] arXiv:2111.14309 (cross-list from cs.LG) [pdf, other]
-
Title: A General Framework for Defending Against Backdoor Attacks via Influence GraphSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1302] arXiv:2111.14426 (cross-list from cs.LG) [pdf, other]
-
Title: Improving traffic sign recognition by active searchComments: 6 pages, 7 FiguresJournal-ref: DAGM GCPR 2022 Pattern Recognition pp. 594--606 (2022)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1303] arXiv:2111.14581 (cross-list from cs.LG) [pdf, other]
-
Title: Learning Fair Classifiers with Partially Annotated Group LabelsComments: Accepted to CVPR 2022; Code is available at this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [1304] arXiv:2111.14693 (cross-list from cs.RO) [pdf, other]
-
Title: SAGCI-System: Towards Sample-Efficient, Generalizable, Compositional, and Incremental Robot LearningComments: Accepted to IEEE International Conference on Robotics and Automation (ICRA) 2022Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1305] arXiv:2111.14755 (cross-list from cs.GR) [pdf, ps, other]
-
Title: FaceAtlasAR: Atlas of Facial Acupuncture Points in Augmented RealityJournal-ref: Computer Science & Information Technology 2021Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [1306] arXiv:2111.14820 (cross-list from cs.LG) [pdf, other]
-
Title: Towards Robust and Adaptive Motion Forecasting: A Causal Representation PerspectiveComments: CVPR 2022. Code is available at this https URL v4: fixed typoSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1307] arXiv:2111.14843 (cross-list from cs.SD) [pdf, other]
-
Title: Catch Me If You Hear Me: Audio-Visual Navigation in Complex Unmapped Environments with Moving SoundsComments: This paper has been accepted for publication at IEEE ROBOTICS AND AUTOMATION LETTERSSubjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO); Audio and Speech Processing (eess.AS)
- [1308] arXiv:2111.14934 (cross-list from cs.GR) [pdf, other]
-
Title: Generative Adversarial Networks with Conditional Neural Movement Primitives for An Interactive Generative Drawing ToolComments: 9 pages, 10 figuresSubjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [1309] arXiv:2111.15099 (cross-list from cs.LG) [pdf, other]
-
Title: Trust the Critics: Generatorless and Multipurpose WGANs with Initial Convergence GuaranteesComments: 20 pages, 8 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Optimization and Control (math.OC)
- [1310] arXiv:2111.15133 (cross-list from cs.LG) [pdf, other]
-
Title: LossPlot: A Better Way to Visualize Loss LandscapesComments: 5 pages; 2 large figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [1311] arXiv:2111.15179 (cross-list from cs.LG) [pdf, other]
-
Title: A Highly Effective Low-Rank Compression of Deep Neural Networks with Modified Beam-Search and Modified Stable RankComments: 8 pages, 8 figures, 2 tablesSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1312] arXiv:2111.15186 (cross-list from cs.LG) [pdf, other]
-
Title: Automatic Synthesis of Diverse Weak Supervision Sources for Behavior AnalysisComments: 8 pages, to appear at CVPR 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1313] arXiv:2111.15373 (cross-list from cs.RO) [pdf, other]
-
Title: ColibriDoc: An Eye-in-Hand Autonomous Trocar Docking SystemAuthors: Shervin Dehghani, Michael Sommersperger, Junjie Yang, Benjamin Busam, Kai Huang, Peter Gehlbach, Iulian Iordachita, Nassir Navab, M. Ali NasseriSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1314] arXiv:2111.15542 (cross-list from cs.LG) [pdf, other]
-
Title: Learning to Transfer for Traffic Forecasting via Multi-task LearningAuthors: Yichao LuSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1315] arXiv:2111.15646 (cross-list from cs.LG) [pdf, other]
-
Title: The Exponentially Tilted Gaussian Prior for Variational AutoencodersSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1316] arXiv:2111.00077 (cross-list from physics.med-ph) [pdf, ps, other]
-
Title: DeepDoseNet: A Deep Learning model for 3D Dose Prediction in Radiation TherapyAuthors: Mumtaz Hussain Soomro, Victor Gabriel Leandro Alves, Hamidreza Nourzadeh, Jeffrey V. SiebersSubjects: Medical Physics (physics.med-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [1317] arXiv:2111.00102 (cross-list from eess.IV) [pdf, other]
-
Title: Fetal MRI by robust deep generative prior reconstruction and diffeomorphic registration: application to gestational age predictionAuthors: Lucilio Cordero-Grande, Juan Enrique Ortuño-Fisac, Alena Uus, Maria Deprez, Andrés Santos, Joseph V. Hajnal, María Jesús Ledesma-CarbayoComments: 23 pages, 15 figures, 1 tableSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1318] arXiv:2111.00193 (cross-list from eess.IV) [pdf, other]
-
Title: M2MRF: Many-to-Many Reassembly of Features for Tiny Lesion Segmentation in Fundus ImagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1319] arXiv:2111.00219 (cross-list from eess.IV) [pdf, other]
-
Title: Unpaired Learning for High Dynamic Range Image Tone MappingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1320] arXiv:2111.00273 (cross-list from eess.IV) [pdf, other]
-
Title: Cross-Modality Fusion Transformer for Multispectral Object DetectionComments: 9 figures, 5 tables, submitted to IMAGE AND VISION COMPUTINGSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1321] arXiv:2111.00361 (cross-list from eess.IV) [pdf, other]
-
Title: Functional Neural Networks for Parametric Image Restoration ProblemsComments: NeurIPS 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1322] arXiv:2111.00390 (cross-list from eess.IV) [pdf, other]
-
Title: Dual Attention Network for Heart Rate and Respiratory Rate EstimationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [1323] arXiv:2111.00395 (cross-list from physics.flu-dyn) [pdf, other]
-
Title: A robust single-pixel particle image velocimetry based on fully convolutional networks with cross-correlation embeddedSubjects: Fluid Dynamics (physics.flu-dyn); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1324] arXiv:2111.00484 (cross-list from eess.IV) [pdf, other]
-
Title: IGCN: Image-to-graph Convolutional Network for 2D/3D Deformable RegistrationJournal-ref: IEEE Transactions on Medical Imaging, 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1325] arXiv:2111.00528 (cross-list from eess.IV) [pdf, other]
-
Title: Calibrating the Dice loss to handle neural network overconfidence for biomedical image segmentationSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
- [1326] arXiv:2111.00533 (cross-list from eess.IV) [pdf, other]
-
Title: Incorporating Boundary Uncertainty into loss functions for biomedical image segmentationSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
- [1327] arXiv:2111.00534 (cross-list from eess.IV) [pdf, other]
-
Title: Focal Attention Networks: optimising attention for biomedical image segmentationSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
- [1328] arXiv:2111.00551 (cross-list from eess.SP) [pdf, other]
-
Title: Learning to Detect Open Carry and Concealed Object with 77GHz RadarComments: 12 pagesJournal-ref: IEEE Journal of Selected Topics in Signal Processing, 2022Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
- [1329] arXiv:2111.00595 (cross-list from eess.IV) [pdf, other]
-
Title: TorchXRayVision: A library of chest X-ray datasets and modelsAuthors: Joseph Paul Cohen, Joseph D. Viviano, Paul Bertin, Paul Morrison, Parsa Torabian, Matteo Guarrera, Matthew P Lungren, Akshay Chaudhari, Rupert Brooks, Mohammad Hashir, Hadrien BertrandComments: Library source code: this https URLSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1330] arXiv:2111.00666 (cross-list from eess.IV) [pdf, other]
-
Title: Self-Verification in Image DenoisingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1331] arXiv:2111.00698 (cross-list from eess.IV) [pdf, other]
-
Title: Influential Prototypical Networks for Few Shot Learning: A Dermatological Case StudyComments: Computer Vision and Pattern RecognitionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1332] arXiv:2111.00742 (cross-list from eess.IV) [pdf, other]
-
Title: Redundancy Reduction in Semantic Segmentation of 3D Brain Tumor MRIsComments: BraTS 2021, BrainLes, MICCAI 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1333] arXiv:2111.00837 (cross-list from eess.IV) [pdf, other]
-
Title: Simulating Realistic MRI variations to Improve Deep Learning model and visual explanations using GradCAMAuthors: Muhammad Ilyas Patel, Shrey Singla, Razeem Ahmad Ali Mattathodi, Sumit Sharma, Deepam Gautam, Srinivasa Rao KundetiComments: 8 pages, 9 figures, IEEE-CCEM 2021 conferenceSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1334] arXiv:2111.00939 (cross-list from physics.comp-ph) [pdf, other]
-
Title: IRA: A shape matching approach for recognition and comparison of generic atomic patternsComments: 18 pages, 19 figuresSubjects: Computational Physics (physics.comp-ph); Computer Vision and Pattern Recognition (cs.CV)
- [1335] arXiv:2111.00961 (cross-list from astro-ph.GA) [pdf, other]
-
Title: Robustness of deep learning algorithms in astronomy -- galaxy morphology studiesAuthors: A. Ćiprijanović, D. Kafkes, G. N. Perdue, K. Pedro, G. Snyder, F. J. Sánchez, S. Madireddy, S. M. Wild, B. NordComments: Accepted in: Fourth Workshop on Machine Learning and the Physical Sciences (35th Conference on Neural Information Processing Systems; NeurIPS2021); final versionSubjects: Astrophysics of Galaxies (astro-ph.GA); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1336] arXiv:2111.01093 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Correlation between image quality metrics of magnetic resonance images and the neural network segmentation accuracyAuthors: Rajarajeswari Muthusivarajan, Adrian Celaya, Joshua P. Yung, Satish Viswanath, Daniel S. Marcus, Caroline Chung, David FuentesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1337] arXiv:2111.01134 (cross-list from eess.IV) [pdf, other]
-
Title: Comparing Bayesian Models for Organ Contouring in Head and Neck RadiotherapyAuthors: Prerak Mody, Nicolas Chaves-de-Plaza, Klaus Hildebrandt, Rene van Egmond, Huib de Ridder, Marius StaringComments: 10 pages, 5 figures, To be published in "SPIE Medical Imaging 2022"Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1338] arXiv:2111.01338 (cross-list from eess.IV) [pdf, other]
-
Title: Federated Split Vision Transformer for COVID-19 CXR Diagnosis using Task-Agnostic TrainingComments: Accepted for NeurIPS 2021Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1339] arXiv:2111.01350 (cross-list from eess.IV) [pdf, other]
-
Title: Constructing High-Order Signed Distance Maps from Computed Tomography Data with Application to Bone MorphometryComments: 14 pages, 14 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1340] arXiv:2111.01505 (cross-list from eess.IV) [pdf, other]
-
Title: Out of distribution detection for skin and malaria imagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1341] arXiv:2111.01511 (cross-list from eess.IV) [pdf, other]
-
Title: ISP-Agnostic Image Reconstruction for Under-Display CamerasSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1342] arXiv:2111.01544 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Comprehensive and Clinically Accurate Head and Neck Organs at Risk Delineation via Stratified Deep Learning: A Large-scale Multi-Institutional StudyAuthors: Dazhou Guo, Jia Ge, Xianghua Ye, Senxiang Yan, Yi Xin, Yuchen Song, Bing-shen Huang, Tsung-Min Hung, Zhuotun Zhu, Ling Peng, Yanping Ren, Rui Liu, Gong Zhang, Mengyuan Mao, Xiaohua Chen, Zhongjie Lu, Wenxiang Li, Yuzhen Chen, Lingyun Huang, Jing Xiao, Adam P. Harrison, Le Lu, Chien-Yu Lin, Dakai Jin, Tsung-Ying HoSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1343] arXiv:2111.01556 (cross-list from eess.IV) [pdf, other]
-
Title: Accounting for Dependencies in Deep Learning Based Multiple Instance Learning for Whole Slide ImagingComments: MICCAI 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [1344] arXiv:2111.01557 (cross-list from eess.IV) [pdf, other]
-
Title: PointNu-Net: Keypoint-assisted Convolutional Neural Network for Simultaneous Multi-tissue Histology Nuclei Segmentation and ClassificationComments: 12 pages,7 figures, journalSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [1345] arXiv:2111.01561 (cross-list from eess.IV) [pdf, other]
-
Title: Sub-cortical structure segmentation database for young populationAuthors: Jayanthi Sivaswamy, Alphin J Thottupattu, Mythri V, Raghav Mehta, R Sheelakumari, Chandrasekharan KesavadasSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1346] arXiv:2111.01665 (cross-list from eess.IV) [pdf, other]
-
Title: Explainable Medical Image Segmentation via Generative Adversarial Networks and Layer-wise Relevance PropagationComments: Nordic Machine IntelligenceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1347] arXiv:2111.01682 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Progressive observation of Covid-19 vaccination effects on skin-cellular structures by use of Intelligent Laser Speckle Classification (ILSC)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1348] arXiv:2111.01866 (cross-list from eess.IV) [pdf, ps, other]
-
Title: 3-D PET Image Generation with tumour masks using TGANAuthors: Robert V Bergen, Jean-Francois Rajotte, Fereshteh Yousefirizi, Ivan S Klyuzhin, Arman Rahmim, Raymond T. NgSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1349] arXiv:2111.02249 (cross-list from eess.IV) [pdf, other]
-
Title: Learned Image Compression for Machine PerceptionComments: 13 pages, 6 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1350] arXiv:2111.02398 (cross-list from eess.IV) [pdf, other]
-
Title: Transparency of Deep Neural Networks for Medical Image Analysis: A Review of Interpretability MethodsSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1351] arXiv:2111.02402 (cross-list from eess.IV) [pdf, other]
-
Title: Skin Cancer Classification using Inception Network and Transfer LearningAuthors: Priscilla Benedetti, Damiano Perri, Marco Simonetti, Osvaldo Gervasi, Gianluca Reali, Mauro FemminellaComments: International Conference on Computational Science and Its Applications, ICCSA 2020Journal-ref: LNCS, volume 12249, 2020Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1352] arXiv:2111.02403 (cross-list from eess.IV) [pdf, other]
-
Title: WORD: A large scale dataset, benchmark and clinical applicable study for abdominal organ segmentation from CT imageAuthors: Xiangde Luo, Wenjun Liao, Jianghong Xiao, Jieneng Chen, Tao Song, Xiaofan Zhang, Kang Li, Dimitris N. Metaxas, Guotai Wang, Shaoting ZhangComments: Accepted to Medical Image Analysis, dataset at: this https URL (we corrected the results or description in this version.)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1353] arXiv:2111.02408 (cross-list from eess.IV) [pdf, other]
-
Title: Partial supervision for the FeTA challenge 2021Authors: Lucas Fidon, Michael Aertsen, Suprosanna Shit, Philippe Demaerel, Sébastien Ourselin, Jan Deprest, Tom VercauterenComments: Accepted as a poster at the MICCAI 2021 Perinatal, Preterm and Paediatric Image Analysis (PIPPI) workshopSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1354] arXiv:2111.02409 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Breast Cancer Classification Using: Pixel InterpolationComments: 9 pages, 9 figures, Acta Scientific Computer SciencesJournal-ref: Acta Scientific Open Access, Publication date 2021/10/28Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1355] arXiv:2111.02461 (cross-list from eess.IV) [pdf, other]
-
Title: Automatic ultrasound vessel segmentation with deep spatiotemporal context learningSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1356] arXiv:2111.02493 (cross-list from eess.SP) [pdf, ps, other]
-
Title: Roadmap on Signal Processing for Next Generation Measurement SystemsAuthors: D.K. Iakovidis, M. Ooi, Y.C. Kuang, S. Demidenko, A. Shestakov, V. Sinitsin, M. Henry, A. Sciacchitano, A. Discetti, S. Donati, M. Norgia, A. Menychtas, I. Maglogiannis, S.C. Wriessnegger, L.A. Barradas Chacon, G. Dimas, D. Filos, A.H. Aletras, J. Töger, F. Dong, S. Ren, A. Uhl, J. Paziewski, J. Geng, F. Fioranelli, R.M. Narayanan, C. Fernandez, C. Stiller, K. Malamousi, S. Kamnis, K. Delibasis, D. Wang, J. Zhang, R.X. GaoComments: 48 pages, this https URLJournal-ref: Measurement Science and Technology 33(1) (2022) 1-48Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Instrumentation and Detectors (physics.ins-det)
- [1357] arXiv:2111.02520 (cross-list from eess.IV) [pdf, other]
-
Title: Resampling and super-resolution of hexagonally sampled images using deep learningComments: 31 pages, 16 figures, 5 tables. \c{opyright} 2021 Society of Photo-Optical Instrumentation Engineers (SPIE)Journal-ref: Optical Engineering 60(10), 103105 (29 October 2021)Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1358] arXiv:2111.02676 (cross-list from physics.med-ph) [pdf, ps, other]
-
Title: A semi-automatic ultrasound image analysis system for the grading diagnosis of COVID-19 pneumoniaSubjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1359] arXiv:2111.02710 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Towards dynamic multi-modal phenotyping using chest radiographs and physiological dataComments: Accepted in medical imaging meets NeurIPS 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1360] arXiv:2111.02771 (cross-list from eess.IV) [pdf, other]
-
Title: The role of MRI physics in brain segmentation CNNs: achieving acquisition invariance and instructive uncertaintiesAuthors: Pedro Borges, Richard Shaw, Thomas Varsavsky, Kerstin Klaser, David Thomas, Ivana Drobnjak, Sebastien Ourselin, M Jorge CardosoComments: 10 pages, 3 figures, published in: Simulation and Synthesis in Medical Imaging 6th International Workshop, SASHIMI 2021, Held in Conjunction with MICCAI 2021, Strasbourg, France, September 27, 2021, ProceedingsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1361] arXiv:2111.03047 (cross-list from astro-ph.IM) [pdf, other]
-
Title: A deep ensemble approach to X-ray polarimetryComments: Fourth Workshop on Machine Learning and the Physical Sciences (NeurIPS 2021)Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1362] arXiv:2111.03063 (cross-list from eess.IV) [pdf, other]
-
Title: PDBL: Improving Histopathological Tissue Classification with Plug-and-Play Pyramidal Deep-Broad LearningAuthors: Jiatai Lin, Guoqiang Han, Xipeng Pan, Hao Chen, Danyi Li, Xiping Jia, Zhenwei Shi, Zhizhen Wang, Yanfen Cui, Haiming Li, Changhong Liang, Li Liang, Zaiyi Liu, Chu HanComments: 10 pages, 5 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [1363] arXiv:2111.03231 (cross-list from eess.IV) [pdf, other]
-
Title: Multi-Spectral Multi-Image Super-Resolution of Sentinel-2 with Radiometric Consistency Losses and Its Effect on Building DelineationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1364] arXiv:2111.03258 (cross-list from eess.SP) [pdf, ps, other]
-
Title: Learning of Time-Frequency Attention Mechanism for Automatic Modulation RecognitionSubjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
- [1365] arXiv:2111.03274 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Pathological Analysis of Blood Cells Using Deep Learning TechniquesComments: 6 Page, 3 Table and 6 FiguresJournal-ref: Recent Advances in Computer Science and Communications(Formerly Recent Patents on Computer Science),04 September,2020, Article ID e140921185564Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
- [1366] arXiv:2111.03301 (cross-list from eess.IV) [pdf, other]
-
Title: Frequency-Aware Physics-Inspired Degradation Model for Real-World Image Super-ResolutionComments: 22 pages,12 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1367] arXiv:2111.03368 (cross-list from eess.IV) [pdf, other]
-
Title: Hepatic vessel segmentation based on 3D swin-transformer with inductive biased multi-head self-attentionComments: 20 pages, 6 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1368] arXiv:2111.03370 (cross-list from eess.IV) [pdf, other]
-
Title: Segmentation of 2D Brain MR ImagesAuthors: Angad Ripudaman Singh BajwaSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1369] arXiv:2111.03386 (cross-list from eess.IV) [pdf, other]
-
Title: Versatile Learned Video CompressionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1370] arXiv:2111.03404 (cross-list from eess.IV) [pdf, ps, other]
-
Title: A bone suppression model ensemble to improve COVID-19 detection in chest X-raysComments: 29 pages, 10 figures, 4 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1371] arXiv:2111.03433 (cross-list from physics.soc-ph) [pdf, ps, other]
-
Title: Numerisation D'un Siecle de Paysage Ferroviaire Français : recul du rail, conséquences territoriales et coût environnementalAuthors: Robert Jeansoulin (LIGM)Comments: in French. Territoire(s) et Num\'erique : innovations, mutations et d\'ecision, ASRDLF (Association de Sciences R\'egionales de Langue Fran\c{c}aise), Sep 2021, Avignon, FranceSubjects: Physics and Society (physics.soc-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB)
- [1372] arXiv:2111.03452 (cross-list from eess.IV) [pdf, other]
-
Title: Generalized Radiograph Representation Learning via Cross-supervision between Images and Free-text Radiology ReportsComments: Accepted by Nature Machine Intelligence. The official version is at this https URL Codes are available at this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1373] arXiv:2111.03459 (cross-list from physics.soc-ph) [pdf, other]
-
Title: ProSTformer: Pre-trained Progressive Space-Time Self-attention Model for Traffic Flow ForecastingSubjects: Physics and Society (physics.soc-ph); Computer Vision and Pattern Recognition (cs.CV)
- [1374] arXiv:2111.03485 (cross-list from eess.IV) [pdf, other]
-
Title: Cross Modality 3D Navigation Using Reinforcement Learning and Neural Style TransferSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1375] arXiv:2111.03663 (cross-list from eess.IV) [pdf, ps, other]
-
Title: First steps on Gamification of Lung Fluid Cells Annotations in the Flower DomainAuthors: Sonja Kunzmann, Christian Marzahl, Felix Denzinger, Christof A. Bertram, Robert Klopfleisch, Katharina Breininger, Vincent Christlein, Andreas MaierComments: 6 pages, 4 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1376] arXiv:2111.03708 (cross-list from eess.IV) [pdf, other]
-
Title: Damage Estimation and Localization from Sparse Aerial ImageryComments: Version presented at NeurIPS 2021 AI+HADR workshopSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1377] arXiv:2111.03729 (cross-list from eess.IV) [pdf, other]
-
Title: Explaining neural network predictions of material strengthSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1378] arXiv:2111.03780 (cross-list from eess.IV) [pdf, other]
-
Title: Artifact- and content-specific quality assessment for MRI with image rulersSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1379] arXiv:2111.03815 (cross-list from eess.IV) [pdf, other]
-
Title: Order-Guided Disentangled Representation Learning for Ulcerative Colitis Classification with Limited LabelsComments: Accepted by MICCAI 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1380] arXiv:2111.03848 (cross-list from eess.IV) [pdf, other]
-
Title: Multimodal PET/CT Tumour Segmentation and Prediction of Progression-Free Survival using a Full-Scale UNet with AttentionComments: 13 pages, 3 figures, 2 tables. To appear in Head and Neck Tumor Segmentation in PET/CT: The HECKTOR Challenge,Valentin Oreiller et al., Medical Image Analysis,2021, HECKTOR 2021, Lecture Notes in Computer Science, SpringerSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1381] arXiv:2111.03853 (cross-list from eess.IV) [pdf, other]
-
Title: A new baseline for retinal vessel segmentation: Numerical identification and correction of methodological inconsistencies affecting 100+ papersSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1382] arXiv:2111.03890 (cross-list from eess.IV) [pdf, other]
-
Title: Demystifying Deep Learning Models for Retinal OCT Disease Classification using Explainable AISubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1383] arXiv:2111.03997 (cross-list from eess.IV) [pdf, ps, other]
-
Title: The Three-Dimensional Structural Configuration of the Central Retinal Vessel Trunk and Branches as a Glaucoma BiomarkerAuthors: Satish K. Panda, Haris Cheong, Tin A. Tun, Thanadet Chuangsuwanich, Aiste Kadziauskiene, Vijayalakshmi Senthil, Ramaswami Krishnadas, Martin L. Buist, Shamira Perera, Ching-Yu Cheng, Tin Aung, Alexandre H. Thiery, Michael J. A. GirardSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1384] arXiv:2111.04019 (cross-list from eess.IV) [pdf, other]
-
Title: Multi-Fake Evolutionary Generative Adversarial Networks for Imbalance Hyperspectral Image ClassificationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1385] arXiv:2111.04069 (cross-list from eess.IV) [pdf, other]
-
Title: Texture-enhanced Light Field Super-resolution with Spatio-Angular Decomposition KernelsComments: Accepted by IEEE TIMSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1386] arXiv:2111.04094 (cross-list from eess.IV) [pdf, other]
-
Title: Acquisition-invariant brain MRI segmentation with informative uncertaintiesAuthors: Pedro Borges, Richard Shaw, Thomas Varsavsky, Kerstin Klaser, David Thomas, Ivana Drobnjak, Sebastien Ourselin, M Jorge CardosoComments: 25 pages, 8 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1387] arXiv:2111.04212 (cross-list from eess.IV) [pdf, other]
-
Title: Dense Representative Tooth Landmark/axis Detection Network on 3D ModelAuthors: Guangshun Wei, Zhiming Cui, Jie Zhu, Lei Yang, Yuanfeng Zhou, Pradeep Singh, Min Gu, Wenping WangComments: 11pages,27figuresSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [1388] arXiv:2111.04459 (cross-list from eess.IV) [pdf, other]
-
Title: Triple-level Model Inferred Collaborative Network Architecture for Video DerainingComments: Accepted at IEEE Transactions on Image ProcessingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1389] arXiv:2111.04612 (cross-list from physics.geo-ph) [pdf, ps, other]
-
Title: Machine Learning Guided 3D Image Recognition for Carbonate Pore and Mineral Volumes DeterminationComments: 1- Added Affiliation section. 2- Updated the Acknowledgement sectionSubjects: Geophysics (physics.geo-ph); Computer Vision and Pattern Recognition (cs.CV)
- [1390] arXiv:2111.04699 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Automated pharyngeal phase detection and bolus localization in videofluoroscopic swallowing study: Killing two birds with one stone?Journal-ref: Computer methods and programs in biomedicine 225 (2022): 107058Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1391] arXiv:2111.04733 (cross-list from eess.IV) [pdf, other]
-
Title: Real-time landmark detection for precise endoscopic submucosal dissection via shape-aware relation networkAuthors: Jiacheng Wang, Yueming Jin, Shuntian Cai, Hongzhi Xu, Pheng-Ann Heng, Jing Qin, Liansheng WangSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1392] arXiv:2111.04734 (cross-list from eess.IV) [pdf, other]
-
Title: Mixed Transformer U-Net For Medical Image SegmentationAuthors: Hongyi Wang, Shiao Xie, Lanfen Lin, Yutaro Iwamoto, Xian-Hua Han, Yen-Wei Chen, Ruofeng TongSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1393] arXiv:2111.04735 (cross-list from eess.IV) [pdf, other]
-
Title: Feature-enhanced Generation and Multi-modality Fusion based Deep Neural Network for Brain Tumor Segmentation with Missing MR ModalitiesComments: 30 pages, 7 figuresJournal-ref: Neurocomputing 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1394] arXiv:2111.04736 (cross-list from eess.IV) [pdf, other]
-
Title: Multi-Modality Cardiac Image Analysis with Deep LearningComments: Under review as a chapter of book 'Deep Learning for Medical Image Analysis, 2E'Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1395] arXiv:2111.04737 (cross-list from eess.IV) [pdf, other]
-
Title: Synthetic magnetic resonance images for domain adaptation: Application to fetal brain tissue segmentationAuthors: Priscille de Dumast, Hamza Kebiri, Kelly Payette, Andras Jakab, Hélène Lajous, Meritxell Bach CuadraComments: 4 pages, 4 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1396] arXiv:2111.04738 (cross-list from q-bio.QM) [pdf, ps, other]
-
Title: HEROHE Challenge: assessing HER2 status in breast cancer without immunohistochemistry or in situ hybridizationAuthors: Eduardo Conde-Sousa, João Vale, Ming Feng, Kele Xu, Yin Wang, Vincenzo Della Mea, David La Barbera, Ehsan Montahaei, Mahdieh Soleymani Baghshah, Andreas Turzynski, Jacob Gildenblat, Eldad Klaiman, Yiyu Hong, Guilherme Aresta, Teresa Araújo, Paulo Aguiar, Catarina Eloy, António PolóniaSubjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1397] arXiv:2111.04739 (cross-list from eess.IV) [pdf, other]
-
Title: DR-VNet: Retinal Vessel Segmentation via Dense Residual UNetComments: Accepted to ICPRAI 2022 - 3rd International Conference on Pattern Recognition and Artificial IntelligenceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1398] arXiv:2111.04740 (cross-list from q-bio.QM) [pdf, other]
-
Title: BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology ImagesAuthors: Nadia Brancati, Anna Maria Anniciello, Pushpak Pati, Daniel Riccio, Giosuè Scognamiglio, Guillaume Jaume, Giuseppe De Pietro, Maurizio Di Bonito, Antonio Foncubierta, Gerardo Botti, Maria Gabrani, Florinda Feroce, Maria FrucciComments: 10 pages, 3 figures, 8 tables, 30 referencesSubjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1399] arXiv:2111.04742 (cross-list from astro-ph.IM) [pdf, other]
-
Title: E(2) Equivariant Self-Attention for Radio AstronomyComments: Accepted in: Fourth Workshop on Machine Learning and the Physical Sciences (35th Conference on Neural Information Processing Systems; NeurIPS2021); final version; 7 pages, 3 figuresSubjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV)
- [1400] arXiv:2111.04807 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Unsupervised Approaches for Out-Of-Distribution Dermoscopic Lesion DetectionAuthors: Max Torop, Sandesh Ghimire, Wenqian Liu, Dana H. Brooks, Octavia Camps, Milind Rajadhyaksha, Jennifer Dy, Kivanc KoseComments: NeurIPS: Medical Imaging Meets NeurIPS WorkshopSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1401] arXiv:2111.04881 (cross-list from cond-mat.quant-gas) [pdf, other]
-
Title: Combining machine learning with physics: A framework for tracking and sorting multiple dark solitonsComments: 13 pages, 9 figuresJournal-ref: Phys. Rev. Research 4, 023163 (2022)Subjects: Quantum Gases (cond-mat.quant-gas); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantum Physics (quant-ph)
- [1402] arXiv:2111.04885 (cross-list from eess.IV) [pdf, other]
-
Title: Lymph Node Detection in T2 MRI with TransformersAuthors: Tejas Sudharshan Mathai, Sungwon Lee, Daniel C. Elton, Thomas C. Shen, Yifan Peng, Zhiyong Lu, Ronald M. SummersComments: Accepted at SPIE 2022Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1403] arXiv:2111.04886 (cross-list from eess.IV) [pdf, other]
-
Title: Universal Lesion Detection in CT Scans using Neural Network EnsemblesComments: Accepted at SPIE 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1404] arXiv:2111.04893 (cross-list from eess.IV) [pdf, other]
-
Title: Mitigating domain shift in AI-based tuberculosis screening with unsupervised domain adaptationAuthors: Nishanjan Ravin, Sourajit Saha, Alan Schweitzer, Ameena Elahi, Farouk Dako, Daniel Mollura, David ChapmanSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1405] arXiv:2111.04911 (cross-list from eess.IV) [pdf, other]
-
Title: Real-time Instance Segmentation of Surgical Instruments using Attention and Multi-scale Feature FusionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1406] arXiv:2111.04985 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Bilinear pooling and metric learning network for early Alzheimer's disease identification with FDG-PET imagesAuthors: Wenju Cui, Caiying Yan, Zhuangzhi Yan, Yunsong Peng, Yilin Leng, Chenlu Liu, Shuangqing Chen, Xi JiangSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1407] arXiv:2111.05014 (cross-list from eess.IV) [pdf, other]
-
Title: GDCA: GAN-based single image super resolution with Dual discriminators and Channel AttentionJournal-ref: Korean Association of Artificial Intelligence 2019Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1408] arXiv:2111.05055 (cross-list from eess.IV) [pdf, other]
-
Title: MAC-ReconNet: A Multiple Acquisition Context based Convolutional Neural Network for MR Image Reconstruction using Dynamic Weight PredictionJournal-ref: Proceedings of the Third Conference on Medical Imaging with Deep Learning, PMLR 121:696-708, 2020Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1409] arXiv:2111.05100 (cross-list from eess.SP) [pdf, other]
-
Title: EEGEyeNet: a Simultaneous Electroencephalography and Eye-tracking Dataset and Benchmark for Eye Movement PredictionAuthors: Ard Kastrati, Martyna Beata Płomecka, Damián Pascual, Lukas Wolf, Victor Gillioz, Roger Wattenhofer, Nicolas LangerComments: Published at NeurIPS 2021 Datasets and Benchmarks TrackSubjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1410] arXiv:2111.05125 (cross-list from eess.IV) [pdf, other]
-
Title: Segmentation of Multiple Myeloma Plasma Cells in Microscopy Images with Noisy LabelsComments: Accepted to SPIE Medical Imaging conferenceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [1411] arXiv:2111.05133 (cross-list from eess.IV) [pdf, other]
-
Title: Approaching the Limit of Image Rescaling via Flow GuidanceComments: BMVC 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1412] arXiv:2111.05194 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Deep Learning Adapted Acceleration for Limited-view Photoacoustic Computed TomographyComments: submitted the journal versionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1413] arXiv:2111.05226 (cross-list from eess.IV) [pdf, other]
-
Title: Leveraging blur information for plenoptic camera calibrationComments: arXiv admin note: text overlap with arXiv:2004.07745Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1414] arXiv:2111.05315 (cross-list from q-bio.QM) [pdf, ps, other]
-
Title: Stain-free Detection of Embryo Polarization using Deep LearningSubjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Biological Physics (physics.bio-ph)
- [1415] arXiv:2111.05408 (cross-list from eess.IV) [pdf, other]
-
Title: Robust deep learning-based semantic organ segmentation in hyperspectral imagesAuthors: Silvia Seidlitz (1 and 2), Jan Sellner (1 and 2), Jan Odenthal (3), Berkin Özdemir (3 and 4), Alexander Studier-Fischer (3 and 4), Samuel Knödler (3 and 4), Leonardo Ayala (1 and 4), Tim J. Adler (1 and 6), Hannes G. Kenngott (2 and 3), Minu Tizabi (1), Martin Wagner (2 and 3 and 4), Felix Nickel (2 and 3 and 4), Beat P. Müller-Stich (3 and 4), Lena Maier-Hein (1 and 2 and 4 and 5 and 6) ((1) Division of Intelligent Medical Systems, German Cancer Research Center (DKFZ), Heidelberg, Germany, (2) Helmholtz Information and Data Science School for Health, Karlsruhe/Heidelberg, Germany, (3) Department of General, Visceral, and Transplantation Surgery, Heidelberg University Hospital, Heidelberg, Germany, (4) Medical Faculty, Heidelberg University, Heidelberg, Germany, (5) HIP Helmholtz Imaging Platform, German Cancer Research Center (DKFZ), Heidelberg, Germany, (6) Faculty of Mathematics and Computer Science, Heidelberg University, Germany)Comments: The first two authors (Silvia Seidlitz and Jan Sellner) contributed equally to this paperJournal-ref: Medical Image Analysis, Volume 80, 2022, 102488, ISSN 1361-8415Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1416] arXiv:2111.05679 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Explanatory Analysis and Rectification of the Pitfalls in COVID-19 DatasetsAuthors: Samyak Prajapati, Japman Singh Monga, Shaanya Singh, Amrit Raj, Yuvraj Singh Champawat, Chandra PrakashSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1417] arXiv:2111.05789 (cross-list from eess.IV) [pdf, other]
-
Title: Evaluation of Deep Learning Topcoders Method for Neuron Individualization in Histological Macaque Brain SectionAuthors: Huaqian Wu, Nicolas Souedet, Zhenzhen You, Caroline Jan, Cédric Clouchoux, Thierry DelzescauxSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1418] arXiv:2111.05790 (cross-list from eess.IV) [pdf, other]
-
Title: Early Myocardial Infarction Detection over Multi-view EchocardiographySubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1419] arXiv:2111.05882 (cross-list from q-bio.QM) [pdf, ps, other]
-
Title: A Histopathology Study Comparing Contrastive Semi-Supervised and Fully Supervised LearningAuthors: Lantian Zhang (1 and 2), Mohamed Amgad (2), Lee A.D. Cooper (2) ((1) North Shore Country Day, Winnetka, IL, USA, (2) Department of Pathology, Northwestern University, Chicago, IL, USA)Comments: 7 pages, 4 figures, 4 tablesSubjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1420] arXiv:2111.05959 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Advancing Brain Metastases Detection in T1-Weighted Contrast-Enhanced 3D MRI using Noisy Student-based TrainingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1421] arXiv:2111.05978 (cross-list from eess.IV) [pdf, other]
-
Title: SUPER-Net: Trustworthy Medical Image Segmentation with Uncertainty Propagation in Encoder-Decoder NetworksAuthors: Giuseppina Carannante, Dimah Dera, Nidhal C.Bouaynaya, Hassan M. Fathallah-Shaykh, Ghulam RasoolSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1422] arXiv:2111.06063 (cross-list from stat.ML) [pdf, other]
-
Title: On the Equivalence between Neural Network and Support Vector MachineComments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Optimization and Control (math.OC)
- [1423] arXiv:2111.06069 (cross-list from eess.IV) [pdf, other]
-
Title: CodEx: A Modular Framework for Joint Temporal De-blurring and Tomographic ReconstructionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1424] arXiv:2111.06254 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Detecting COVID-19 from Chest Computed Tomography Scans using AI-Driven Android ApplicationJournal-ref: Computers in Biology and Medicine, 143 (2022), 105298Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1425] arXiv:2111.06291 (cross-list from eess.IV) [src]
-
Title: Related Work on Image Quality AssessmentAuthors: Dongxu WangComments: This is just my reading notesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1426] arXiv:2111.06398 (cross-list from eess.IV) [pdf, other]
-
Title: A Multi-attribute Controllable Generative Model for Histopathology Image SynthesisComments: MICCAI 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1427] arXiv:2111.06399 (cross-list from eess.IV) [pdf, other]
-
Title: Selective Synthetic Augmentation with HistoGAN for Improved Histopathology Image ClassificationAuthors: Yuan Xue, Jiarong Ye, Qianying Zhou, Rodney Long, Sameer Antani, Zhiyun Xue, Carl Cornwell, Richard Zaino, Keith Cheng, Xiaolei HuangComments: Elsevier Medical Image Analysis Best Paper Award runner up. arXiv admin note: substantial text overlap with arXiv:1912.03837Journal-ref: Medical Image Analysis 67 (2021): 101816Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1428] arXiv:2111.06400 (cross-list from eess.IV) [pdf, other]
-
Title: Fast T2w/FLAIR MRI Acquisition by Optimal Sampling of Information Complementary to Pre-acquired T1w MRISubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1429] arXiv:2111.06401 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Stacked U-Nets with Self-Assisted Priors Towards Robust Correction of Rigid Motion Artifact in Brain MRIAuthors: Mohammed A. Al-masni, Seul Lee, Jaeuk Yi, Sewook Kim, Sung-Min Gho, Young Hun Choi, Dong-Hyun KimComments: 24 pages, 10 figures, 3 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1430] arXiv:2111.06425 (cross-list from eess.IV) [pdf, other]
-
Title: Multiple Hypothesis Hypergraph Tracking for Posture Identification in Embryonic Caenorhabditis elegansSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Cell Behavior (q-bio.CB)
- [1431] arXiv:2111.06693 (cross-list from q-bio.QM) [pdf, other]
-
Title: Deep-learning in the bioimaging wild: Handling ambiguous data with deepflash2Authors: Matthias Griebel, Dennis Segebarth, Nikolai Stein, Nina Schukraft, Philip Tovote, Robert Blum, Christoph M. FlathSubjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV)
- [1432] arXiv:2111.06707 (cross-list from eess.IV) [pdf, other]
-
Title: Transformer-based Image CompressionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1433] arXiv:2111.06890 (cross-list from eess.IV) [pdf, other]
-
Title: Impact of loss functions on the performance of a deep neural network designed to restore low-dose digital mammographyAuthors: Hongming Shan, Rodrigo de Barros Vimieiro, Lucas Rodrigues Borges, Marcelo Andrade da Costa Vieira, Ge WangComments: 15 pages, 12 figuresJournal-ref: Artificial Intelligence In Medicine, 142(2023), 102555, 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1434] arXiv:2111.06894 (cross-list from eess.IV) [pdf, other]
-
Title: Convolutional Nets Versus Vision Transformers for Diabetic Foot Ulcer ClassificationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1435] arXiv:2111.07031 (cross-list from eess.IV) [pdf, other]
-
Title: Improving the Otsu Thresholding Method of Global Binarization Using Ring Theory for Ultrasonographies of Congestive Heart FailureSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Rings and Algebras (math.RA); Medical Physics (physics.med-ph)
- [1436] arXiv:2111.07104 (cross-list from eess.IV) [pdf, other]
-
Title: A strong baseline for image and video quality assessmentSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1437] arXiv:2111.07254 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Moment Transform-Based Compressive Sensing in Image ProcessingComments: 12 pages, 13 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1438] arXiv:2111.07281 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Joint Demosaicing and High Dynamic Range Imaging within a Single ShotComments: 15 pages, 17 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1439] arXiv:2111.07355 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Fracture Detection in Wrist X-ray Images Using Deep Learning-Based Object Detection ModelsAuthors: Fırat Hardalaç, Fatih Uysal, Ozan Peker, Murat Çiçeklidağ, Tolga Tolunay, Nil Tokgöz, Uğurhan Kutbay, Boran Demirciler, Fatih MertComments: This paper is accepted at Sensors, MDPI, 2022, 22, 1285. Section: "Sensing and Imaging"Journal-ref: Sensors, MDPI, 2022, 22, 1285. Section: "Sensing and Imaging"Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1440] arXiv:2111.07369 (cross-list from eess.IV) [pdf, other]
-
Title: Estimation of Acetabular Version from Anteroposterior Pelvic Radiograph Employing Deep LearningAuthors: Ata Jodeiri, Hadi Seyedarabi, Fatemeh Shahbazi, Seyed Mohammad Mahdi Hashemi, Seyyedhossein ShafieiComments: 12 pages, 8 figuresSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1441] arXiv:2111.07535 (cross-list from eess.IV) [pdf, other]
-
Title: T-AutoML: Automated Machine Learning for Lesion Segmentation using Transformers in 3D Medical ImagingComments: Accepted at ICCV 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1442] arXiv:2111.07634 (cross-list from eess.IV) [pdf, other]
-
Title: Pseudo-domains in imaging data improve prediction of future disease status in multi-center studiesAuthors: Matthias Perkonigg, Peter Mesenbrink, Alexander Goehler, Miljen Martic, Ahmed Ba-Ssalamah, Georg LangsComments: Accepted at Medical Imaging Meets NeurIPS 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1443] arXiv:2111.07892 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Data privacy protection in microscopic image analysis for material data miningComments: 14 pagesSubjects: Image and Video Processing (eess.IV); Materials Science (cond-mat.mtrl-sci); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1444] arXiv:2111.07918 (cross-list from eess.IV) [pdf, other]
-
Title: Transformer for Polyp DetectionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1445] arXiv:2111.08005 (cross-list from eess.IV) [pdf, other]
-
Title: Solving Inverse Problems in Medical Imaging with Score-Based Generative ModelsComments: Published at ICLR 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [1446] arXiv:2111.08006 (cross-list from eess.IV) [pdf, other]
-
Title: Disparities in Dermatology AI: Assessments Using Diverse Clinical ImagesAuthors: Roxana Daneshjou, Kailas Vodrahalli, Weixin Liang, Roberto A Novoa, Melissa Jenkins, Veronica Rotemberg, Justin Ko, Susan M Swetter, Elizabeth E Bailey, Olivier Gevaert, Pritam Mukherjee, Michelle Phung, Kiana Yekrang, Bradley Fong, Rachna Sahasrabudhe, James Zou, Albert ChiouComments: Machine Learning for Health (ML4H) - Extended AbstractSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1447] arXiv:2111.08256 (cross-list from eess.IV) [pdf, other]
-
Title: Online Meta Adaptation for Variable-Rate Learned Image CompressionComments: 9 pages, 7 figuresJournal-ref: CVPRW on NTIRE 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1448] arXiv:2111.08362 (cross-list from eess.IV) [pdf, other]
-
Title: Image-specific Convolutional Kernel Modulation for Single Image Super-resolutionComments: 13 pages, submitted to IEEE Transactions, codes are available at this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1449] arXiv:2111.08430 (cross-list from q-bio.QM) [pdf, ps, other]
-
Title: Code-free development and deployment of deep segmentation models for digital pathologyAuthors: Henrik Sahlin Pettersen, Ilya Belevich, Elin Synnøve Røyset, Erik Smistad, Eija Jokitalo, Ingerid Reinertsen, Ingunn Bakke, André PedersenComments: 18 pages, 4 figures, 2 tablesSubjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1450] arXiv:2111.08502 (cross-list from eess.SP) [pdf, other]
-
Title: Human-error-potential Estimation based on Wearable Biometric SensorsComments: Accepted by KDIR 2021 : 13th International Conference on Knowledge Discovery and Information Retrieval. (ISBN 978-989-758-533-3; ISSN 2184-3228, this https URL&t=1)Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Applications (stat.AP)
- [1451] arXiv:2111.08570 (cross-list from physics.plasm-ph) [pdf, other]
-
Title: Tracking Blobs in the Turbulent Edge Plasma of a Tokamak Fusion DeviceAuthors: Woonghee Han, Randall A. Pietersen, Rafael Villamor-Lora, Matthew Beveridge, Nicola Offeddu, Theodore Golfinopoulos, Christian Theiler, James L. Terry, Earl S. Marmar, Iddo DroriComments: 14 pages, 9 figuresSubjects: Plasma Physics (physics.plasm-ph); Computer Vision and Pattern Recognition (cs.CV)
- [1452] arXiv:2111.08597 (cross-list from eess.IV) [pdf, ps, other]
-
Title: A layer-stress learning framework universally augments deep neural network tasksSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1453] arXiv:2111.08606 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Advancement of Deep Learning in Pneumonia and Covid-19 Classification and Localization: A Qualitative and Quantitative AnalysisComments: 20 pages, 5 figures, 5 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1454] arXiv:2111.08685 (cross-list from eess.IV) [pdf, other]
-
Title: A Latent Encoder Coupled Generative Adversarial Network (LE-GAN) for Efficient Hyperspectral Image Super-resolutionComments: 18 pages, 10 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1455] arXiv:2111.08705 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Automated Atlas-based Segmentation of Single Coronal Mouse Brain Slices using Linear 2D-2D RegistrationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1456] arXiv:2111.08708 (cross-list from eess.IV) [pdf, other]
-
Title: Automated skin lesion segmentation using multi-scale feature extraction scheme and dual-attention mechanismJournal-ref: In 2021 3rd International Conference on Advances in Computing, Communication Control and Networking (ICAC3N) (pp. 1763-1771). IEEE (2021)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1457] arXiv:2111.08710 (cross-list from eess.IV) [pdf, other]
-
Title: CNN Filter Learning from Drawn Markers for the Detection of Suggestive Signs of COVID-19 in CT ImagesComments: 4 pages. To be published in the 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology SocietySubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1458] arXiv:2111.08711 (cross-list from eess.IV) [pdf, other]
-
Title: Two-step adversarial debiasing with partial learning -- medical image case-studiesAuthors: Ramon Correa, Jiwoong Jason Jeong, Bhavik Patel, Hari Trivedi, Judy W. Gichoya, Imon BanerjeeSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1459] arXiv:2111.08712 (cross-list from eess.IV) [pdf, other]
-
Title: Automatic Semantic Segmentation of the Lumbar Spine: Clinical Applicability in a Multi-parametric and Multi-centre Study on Magnetic Resonance ImagesAuthors: Jhon Jairo Saenz-Gamboa (1), Julio Domenech (2), Antonio Alonso-Manjarrés (3), Jon A. Gómez (4), Maria de la Iglesia-Vayá (1 and 5) ((1) FISABIO-CIPF Joint Research Unit in Biomedical Imaging - València Spain, (2) Orthopedic Surgery Department Hospital Arnau de Vilanova - València Spain, (3) Radiology Department Hospital Arnau de Vilanova - València Spain, (4) Pattern Recognition and Human Language Technology research center - Universitat Politècnica de València, (5) Regional ministry of Universal Health and Public Health in Valencia)Comments: 19 pages, 9 Figures, 8 Tables; Supplementary Material: 6 pages, 8 TablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1460] arXiv:2111.09013 (cross-list from eess.IV) [pdf, other]
-
Title: Image Super-Resolution Using T-Tetromino PixelsComments: 10 pages, 9 figures, 4 tables. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1461] arXiv:2111.09103 (cross-list from eess.IV) [pdf, other]
-
Title: Fast and Light-Weight Network for Single Frame Structured Illumination Microscopy Super-ResolutionComments: 9 pagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1462] arXiv:2111.09114 (cross-list from q-bio.QM) [pdf, other]
-
Title: Cryo-shift: Reducing domain shift in cryo-electron subtomograms with unsupervised domain adaptation and randomizationAuthors: Hmrishav Bandyopadhyay, Zihao Deng, Leiting Ding, Sinuo Liu, Mostofa Rafid Uddin, Xiangrui Zeng, Sima Behpour, Min XuComments: 14 pagesJournal-ref: Bioinformatics 2021Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1463] arXiv:2111.09118 (cross-list from q-bio.NC) [pdf, ps, other]
-
Title: The Neural Correlates of Image Texture in the Human Vision Using MagnetoencephalographySubjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1464] arXiv:2111.09126 (cross-list from stat.AP) [pdf, ps, other]
-
Title: Identifying the Factors that Influence Urban Public Transit DemandSubjects: Applications (stat.AP); Computer Vision and Pattern Recognition (cs.CV)
- [1465] arXiv:2111.09172 (cross-list from eess.IV) [pdf, other]
-
Title: End-to-end optimized image compression with competition of prior distributionsJournal-ref: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1466] arXiv:2111.09212 (cross-list from eess.IV) [pdf, other]
-
Title: Single-pass Object-adaptive Data Undersampling and Reconstruction for MRIJournal-ref: in IEEE Transactions on Computational Imaging, vol. 8, pp. 333-345, 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1467] arXiv:2111.09262 (cross-list from eess.IV) [pdf, other]
-
Title: Segmentation of Lung Tumor from CT Images using Deep SupervisionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1468] arXiv:2111.09460 (cross-list from eess.IV) [pdf, other]
-
Title: Large-scale Building Height Retrieval from Single SAR Imagery based on Bounding Box Regression NetworksSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1469] arXiv:2111.09631 (cross-list from stat.AP) [pdf, other]
-
Title: Neural Network Kalman filtering for 3D object tracking from linear array ultrasound dataAuthors: Arttu Arjas, Erwin J. Alles, Efthymios Maneas, Simon Arridge, Adrien Desjardins, Mikko J. Sillanpää, Andreas HauptmannComments: 13 pages, 8 figuresSubjects: Applications (stat.AP); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Signal Processing (eess.SP); Machine Learning (stat.ML)
- [1470] arXiv:2111.09639 (cross-list from eess.IV) [pdf, other]
-
Title: Recurrent Variational Network: A Deep Learning Inverse Problem Solver applied to the task of Accelerated MRI ReconstructionComments: 18 pages, 10 figures, 3 tables, CVPR 22Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1471] arXiv:2111.09696 (cross-list from math.OC) [pdf, ps, other]
-
Title: Casting graph isomorphism as a point set registration problem using a simplex embedding and samplingAuthors: Yigit OktarSubjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV)
- [1472] arXiv:2111.09701 (cross-list from eess.IV) [pdf, other]
-
Title: Visual design intuition: Predicting dynamic properties of beams from raw cross-section imagesComments: Accepted for publication in Journal Of The Royal Society InterfaceSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1473] arXiv:2111.09708 (cross-list from eess.IV) [pdf, other]
-
Title: A Trainable Spectral-Spatial Sparse Coding Model for Hyperspectral Image RestorationAuthors: Théo Bodrito (Thoth, Inria, UGA, CNRS, Grenoble INP, LJK), Alexandre Zouaoui (Thoth, Inria, UGA, CNRS, Grenoble INP, LJK), Jocelyn Chanussot (Thoth, Inria, UGA, CNRS, Grenoble INP, LJK), Julien Mairal (Thoth, Inria, UGA, CNRS, Grenoble INP, LJK)Journal-ref: 2021 Conference on Neural Information Processing Systems, Dec 2021, Sydney, AustraliaSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1474] arXiv:2111.09972 (cross-list from eess.IV) [pdf, other]
-
Title: COVID-19 Detection on Chest X-Ray Images: A comparison of CNN architectures and ensemblesAuthors: Fabricio BreveComments: 23 pages, 2 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1475] arXiv:2111.10255 (cross-list from eess.IV) [pdf, other]
-
Title: An Analysis of the Influence of Transfer Learning When Measuring the Tortuosity of Blood VesselsComments: Correction of typos. Change of mathematical notation. Addition of new sections, appendix, and supplementary materialSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1476] arXiv:2111.10270 (cross-list from math.OC) [pdf, other]
-
Title: FastDOG: Fast Discrete Optimization on GPUComments: Published at CVPR 2022. Alert before printing: last 10 pages just contains detailed results tableSubjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Computer Science and Game Theory (cs.GT)
- [1477] arXiv:2111.10302 (cross-list from eess.IV) [pdf, other]
-
Title: Instance-Adaptive Video Compression: Improving Neural Codecs by Training on the Test SetAuthors: Ties van Rozendaal, Johann Brehmer, Yunfan Zhang, Reza Pourreza, Auke Wiggers, Taco S. CohenComments: Matches version published in TMLRSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1478] arXiv:2111.10371 (cross-list from eess.IV) [pdf, other]
-
Title: ColDE: A Depth Estimation Framework for Colonoscopy ReconstructionAuthors: Yubo Zhang, Jan-Michael Frahm, Samuel Ehrenstein, Sarah K. McGill, Julian G. Rosenman, Shuxian Wang, Stephen M. PizerComments: 13 pages, 5 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1479] arXiv:2111.10372 (cross-list from eess.IV) [pdf, other]
-
Title: Resistance-Time Co-Modulated PointNet for Temporal Super-Resolution Simulation of Blood Vessel FlowsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1480] arXiv:2111.10374 (cross-list from q-bio.QM) [pdf, other]
-
Title: Urine Microscopic Image DatasetComments: 7 pages, 1 imageSubjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1481] arXiv:2111.10376 (cross-list from eess.IV) [pdf, other]
-
Title: Diabetic Foot Ulcer Grand Challenge 2021: Evaluation and SummaryAuthors: Bill Cassidy, Connah Kendrick, Neil D. Reeves, Joseph M. Pappachan, Claire O'Shea, David G. Armstrong, Moi Hoon YapComments: 17 pages, 4 figures, Preprint (author copy) to be published in MICCAI DFUC2021 ProceedingsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1482] arXiv:2111.10443 (cross-list from physics.med-ph) [pdf, other]
-
Title: Evaluation of automated airway morphological quantification for assessing fibrosing lung diseaseAuthors: Ashkan Pakzad, Wing Keung Cheung, Kin Quan, Nesrin Mogulkoc, Coline H.M. Van Moorsel, Brian J. Bartholmai, Hendrik W. Van Es, Alper Ezircan, Frouke Van Beek, Marcel Veltkamp, Ronald Karwoski, Tobias Peikert, Ryan D. Clay, Finbar Foley, Cassandra Braun, Recep Savas, Carole Sudre, Tom Doel, Daniel C. Alexander, Peter Wijeratne, David Hawkes, Yipeng Hu, John R Hurst, Joseph JacobComments: 14 pages, 8 Figures, for associated source code, see this https URLSubjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
- [1483] arXiv:2111.10480 (cross-list from eess.IV) [pdf, other]
-
Title: TransMorph: Transformer for unsupervised medical image registrationComments: Accepted to Medical Image Analysis ((c) MedIA). Code available at this https URL | This version: Several typographical errors were fixedJournal-ref: Medical Image Analysis (2022) 102615Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1484] arXiv:2111.10610 (cross-list from eess.IV) [pdf, other]
-
Title: Constrained Deep One-Class Feature Learning For Classifying Imbalanced Medical ImagesComments: Corrected inaccurate information in affiliation and acknowledgmentSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1485] arXiv:2111.10614 (cross-list from eess.IV) [pdf, other]
-
Title: GMSRF-Net: An improved generalizability with global multi-scale residual fusion network for polyp segmentationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1486] arXiv:2111.10618 (cross-list from eess.IV) [pdf, other]
-
Title: PAANet: Progressive Alternating Attention for Automatic Medical Image SegmentationAuthors: Abhishek Srivastava, Sukalpa Chanda, Debesh Jha, Michael A. Riegler, Pål Halvorsen, Dag Johansen, Umapada PalSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1487] arXiv:2111.10620 (cross-list from eess.IV) [pdf, other]
-
Title: Medical Knowledge-Guided Deep Learning for Imbalanced Medical Image ClassificationComments: Corrected inaccurate information in affiliation and acknowledgmentSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1488] arXiv:2111.10683 (cross-list from eess.IV) [pdf, ps, other]
-
Title: A Review on The Division of Magnetic Resonant Prostate Images with Deep LearningSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Medical Physics (physics.med-ph)
- [1489] arXiv:2111.10755 (cross-list from math.ST) [pdf, ps, other]
-
Title: Generalized Inversion of Nonlinear OperatorsComments: A significant extension of the SSVM 2023 conference paper (see also v2 here), in particular, new sections 7--9Journal-ref: J Math Imaging Vision, 2024; L. Calatroni et al. (Eds.): SSVM 2023, LNCS 14009, pp. 29--41, 2023Subjects: Statistics Theory (math.ST); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP); Functional Analysis (math.FA)
- [1490] arXiv:2111.10762 (cross-list from eess.IV) [pdf, other]
-
Title: COVID-19 Detection through Deep Feature ExtractionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1491] arXiv:2111.10773 (cross-list from eess.IV) [pdf, other]
-
Title: One-shot Weakly-Supervised Segmentation in Medical ImagesAuthors: Wenhui Lei, Qi Su, Ran Gu, Na Wang, Xinglong Liu, Guotai Wang, Xiaofan Zhang, Shaoting ZhangSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1492] arXiv:2111.10790 (cross-list from eess.IV) [pdf, other]
-
Title: DuDoTrans: Dual-Domain Transformer Provides More Attention for Sinogram Restoration in Sparse-View CT ReconstructionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1493] arXiv:2111.10800 (cross-list from eess.IV) [pdf, other]
-
Title: FreqNet: A Frequency-domain Image Super-Resolution Network with Dicrete Cosine TransformSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1494] arXiv:2111.10803 (cross-list from eess.IV) [pdf, other]
-
Title: Structure-Preserving Graph Kernel for Brain Network ClassificationAuthors: Jun Yu, Zhaoming Kong, Aditya Kendre, Hao Peng, Carl Yang, Lichao Sun, Alex Leow, Lifang HeSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1495] arXiv:2111.10827 (cross-list from eess.IV) [pdf, other]
-
Title: Domain Generalization for Mammography Detection via Multi-style and Multi-view Contrastive LearningAuthors: Zheren Li, Zhiming Cui, Sheng Wang, Yuji Qi, Xi Ouyang, Qitian Chen, Yuezhi Yang, Zhong Xue, Dinggang Shen, Jie-Zhi ChengComments: Pages 98-108Journal-ref: International Conference on Medical Image Computing and Computer-Assisted Intervention 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1496] arXiv:2111.10887 (cross-list from eess.IV) [pdf, other]
-
Title: Dynamic imaging using motion-compensated smoothness regularization on manifolds (MoCo-SToRM)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1497] arXiv:2111.10889 (cross-list from eess.IV) [pdf, other]
-
Title: Joint alignment and reconstruction of multislice dynamic MRI using variational manifold learningSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1498] arXiv:2111.10892 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Image Prior using Stein's Unbiased Risk Estimator: SURE-DIPSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1499] arXiv:2111.10988 (cross-list from eess.IV) [pdf, other]
-
Title: Local-Selective Feature Distillation for Single Image Super-ResolutionComments: in reviewSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1500] arXiv:2111.11191 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Deep Learning Based Automated COVID-19 Classification from Computed Tomography ImagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1501] arXiv:2111.11269 (cross-list from eess.IV) [pdf, other]
-
Title: Automated cross-sectional view selection in CT angiography of aortic dissections with uncertainty awareness and retrospective clinical annotationsAuthors: Antonio Pepe, Jan Egger, Marina Codari, Martin J. Willemink, Christina Gsaxner, Jianning Li, Peter M. Roth, Gabriel Mistelbauer, Dieter Schmalstieg, Dominik FleischmannComments: 28 pagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1502] arXiv:2111.11394 (cross-list from eess.IV) [pdf, other]
-
Title: 4D iterative reconstruction of brain fMRI in the moving fetusAuthors: Athena Taymourtash, Hamza Kebiri, Sébastien Tourbier, Ernst Schwartz, Karl-Heinz Nenning, Roxane Licandro, Daniel Sobotka, Hélène Lajous, Priscille de Dumast, Meritxell Bach Cuadra, Georg LangsComments: 5 pages, 3 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1503] arXiv:2111.11419 (cross-list from eess.IV) [pdf, ps, other]
-
Title: FAZSeg: A New User-Friendly Software for Quantification of the Foveal Avascular ZoneComments: Submitted to the Clinical Ophthalmology JournalJournal-ref: 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1504] arXiv:2111.11439 (cross-list from eess.IV) [pdf, other]
-
Title: Image prediction of disease progression by style-based manifold extrapolationAuthors: Tianyu Han, Jakob Nikolas Kather, Federico Pedersoli, Markus Zimmermann, Sebastian Keil, Maximilian Schulze-Hagen, Marc Terwoelbeck, Peter Isfort, Christoph Haarburger, Fabian Kiessling, Volkmar Schulz, Christiane Kuhl, Sven Nebelung, Daniel TruhnSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1505] arXiv:2111.11602 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Unsupervised COVID-19 Lesion Segmentation in CT Using Cycle Consistent Generative Adversarial NetworkAuthors: Chengyijue Fang, Yingao Liu, Mengqiu Liu, Xiaohui Qiu, Ying Liu, Yang Li, Jie Wen, Yidong YangComments: It has been submitted to Medical Physics for peer-review on July 26, 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1506] arXiv:2111.11658 (cross-list from eess.IV) [pdf, other]
-
Title: The RETA Benchmark for Retinal Vascular Tree AnalysisComments: 13 pages,6 figures, 4 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1507] arXiv:2111.11665 (cross-list from eess.IV) [pdf, other]
-
Title: RadFusion: Benchmarking Performance and Fairness for Multimodal Pulmonary Embolism Detection from CT and EHRAuthors: Yuyin Zhou, Shih-Cheng Huang, Jason Alan Fries, Alaa Youssef, Timothy J. Amrhein, Marcello Chang, Imon Banerjee, Daniel Rubin, Lei Xing, Nigam Shah, Matthew P. LungrenComments: RadFusion dataset: this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1508] arXiv:2111.11873 (cross-list from eess.IV) [pdf, other]
-
Title: Deformable image registration with deep network priors: a study on longitudinal PET imagesAuthors: Constance Fourcade, Ludovic Ferrer, Noemie Moreau, Gianmarco Santini, Aishlinn Brennan, Caroline Rousseau, Marie Lacombe, Vincent Fleury, Mathilde Colombié, Pascal Jézéquel, Mario Campone, Mathieu Rubeaux, Diana MateusComments: 11 pages 3 figures in the main article 2 tables in the main article 2 figures in supplementary materialSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1509] arXiv:2111.11893 (cross-list from eess.IV) [pdf, other]
-
Title: Extending the Unmixing methods to Multispectral ImagesComments: 6 pages, CIC29 conferenceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1510] arXiv:2111.11926 (cross-list from eess.IV) [pdf, other]
-
Title: An Educated Warm Start For Deep Image Prior-Based Micro CT ReconstructionAuthors: Riccardo Barbano, Johannes Leuschner, Maximilian Schmidt, Alexander Denker, Andreas Hauptmann, Peter Maaß, Bangti JinJournal-ref: in IEEE Transactions on Computational Imaging, vol. 8, pp. 1210-1222, 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1511] arXiv:2111.12138 (cross-list from eess.IV) [pdf, other]
-
Title: Multi-Modality Microscopy Image Style Transfer for Nuclei SegmentationComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [1512] arXiv:2111.12215 (cross-list from eess.IV) [pdf, other]
-
Title: Explainable multiple abnormality classification of chest CT volumesComments: Published in Artificial Intelligence in Medicine, 2022. 8 figures, 7 tablesJournal-ref: Artificial Intelligence in Medicine, August 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1513] arXiv:2111.12483 (cross-list from eess.IV) [pdf, other]
-
Title: LDP-Net: An Unsupervised Pansharpening Network Based on Learnable Degradation ProcessesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1514] arXiv:2111.12541 (cross-list from astro-ph.IM) [pdf, other]
-
Title: Rethinking the modeling of the instrumental response of telescopes with a differentiable optical modelComments: 10 pages. Accepted for the Fourth Workshop on Machine Learning and the Physical Sciences (NeurIPS 2021)Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
- [1515] arXiv:2111.12854 (cross-list from physics.med-ph) [pdf, ps, other]
-
Title: Extending the Relative Seriality Formalism for Interpretable Deep Learning of Normal Tissue Complication Probability ModelsAuthors: Tahir I. YusufalySubjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Biological Physics (physics.bio-ph); Data Analysis, Statistics and Probability (physics.data-an); Tissues and Organs (q-bio.TO)
- [1516] arXiv:2111.12862 (cross-list from eess.IV) [pdf, other]
-
Title: Coded Illumination for Improved Lensless ImagingComments: Supplementary material, codes, and data are available at this https URLJournal-ref: IEEE Transactions on Computational Imaging, 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1517] arXiv:2111.12886 (cross-list from eess.IV) [pdf, other]
-
Title: Morphological feature visualization of Alzheimer's disease via Multidirectional Perception GANAuthors: Wen Yu, Baiying Lei, Yanyan Shen, Shuqiang Wang, Yong Liu, Zhiguang Feng, Yong Hu, Michael K. NgSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1518] arXiv:2111.12991 (cross-list from eess.IV) [pdf, other]
-
Title: Non Parametric Data Augmentations Improve Deep-Learning based Brain Tumor SegmentationJournal-ref: 2021 IEEE COMCASSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1519] arXiv:2111.13105 (cross-list from eess.IV) [pdf, other]
-
Title: A Novel Framework for Image-to-image Translation and Image CompressionComments: 14 pages, 15 figures, accepted by NeurocomputingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1520] arXiv:2111.13299 (cross-list from eess.IV) [pdf, other]
-
Title: Exploiting full Resolution Feature Context for Liver Tumor and Vessel Segmentation via Integrate Framework: Application to Liver Tumor and Vessel 3D Reconstruction under embedded microprocessorAuthors: Xiangyu Meng, Xudong Zhang, Gan Wang, Ying Zhang, Xin Shi, Huanhuan Dai, Zixuan Wang, Xun WangComments: 11 pages, 6 FiguresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1521] arXiv:2111.13300 (cross-list from eess.IV) [pdf, other]
-
Title: A Robust Volumetric Transformer for Accurate 3D Tumor SegmentationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1522] arXiv:2111.13537 (cross-list from q-bio.NC) [pdf, other]
-
Title: A model of semantic completion in generative episodic memoryComments: 15 pages, 9 figures, 58 referencesSubjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1523] arXiv:2111.13630 (cross-list from eess.IV) [pdf, other]
-
Title: Efficient Multi-Organ Segmentation Using SpatialConfiguration-Net with Low GPU Memory RequirementsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1524] arXiv:2111.13905 (cross-list from eess.IV) [pdf, other]
-
Title: AdaDM: Enabling Normalization for Image Super-ResolutionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1525] arXiv:2111.13923 (cross-list from eess.IV) [pdf, other]
-
Title: Learning A 3D-CNN and Transformer Prior for Hyperspectral Image Super-ResolutionComments: 10 pages, 5 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1526] arXiv:2111.14239 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Low-complexity Rounded KLT Approximation for Image CompressionComments: 10 pages, 7 figures, 3 tablesJournal-ref: J Real-Time Image Proc (2021)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP); Numerical Analysis (math.NA); Methodology (stat.ME)
- [1527] arXiv:2111.14250 (cross-list from q-bio.NC) [pdf, other]
-
Title: Learning a model of shape selectivity in V4 cells reveals shape encoding mechanisms in the brainComments: 20 pages, 7 figuresSubjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV)
- [1528] arXiv:2111.14259 (cross-list from eess.IV) [pdf, ps, other]
-
Title: 3D High-Quality Magnetic Resonance Image Restoration in Clinics Using Deep LearningComments: 16 pages, 10 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1529] arXiv:2111.14320 (cross-list from eess.IV) [pdf, other]
-
Title: SwiftSRGAN -- Rethinking Super-Resolution for Efficient and Real-time InferenceComments: 6 pages, 3 figures, "to be published in" International Conference on Intelligent Cybernetics Technology & Applications 2021 (ICICyTA)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1530] arXiv:2111.14362 (cross-list from eess.IV) [pdf, other]
-
Title: Unsupervised Image Denoising with Frequency Domain KnowledgeComments: Accepted to BMVC 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1531] arXiv:2111.14388 (cross-list from eess.IV) [pdf, other]
-
Title: Enhanced Transfer Learning Through Medical Imaging and Patient Demographic Data FusionAuthors: Spencer A. ThomasSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1532] arXiv:2111.14474 (cross-list from eess.IV) [pdf, other]
-
Title: Learning-Based Video Coding with Joint Deep Compression and EnhancementComments: 10 pages, 9 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1533] arXiv:2111.14804 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Unsupervised cross domain learning with applications to 7 layer segmentation of OCTsAuthors: Yue Wu, Abraham Olvera Barrios, Ryan Yanagihara, Irene Leung, Marian Blazes, Adnan Tufail, Aaron LeeSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1534] arXiv:2111.14953 (cross-list from eess.IV) [pdf, other]
-
Title: Localized Perturbations For Weakly-Supervised Segmentation of Glioma Brain TumoursSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1535] arXiv:2111.14959 (cross-list from eess.IV) [pdf, other]
-
Title: Improving the Segmentation of Pediatric Low-Grade Gliomas through Multitask LearningSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1536] arXiv:2111.15200 (cross-list from eess.IV) [pdf, other]
-
Title: Contrastive Learning for Local and Global Learning MRI ReconstructionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1537] arXiv:2111.15409 (cross-list from eess.IV) [pdf, other]
-
Title: Fully Automatic Deep Learning Framework for Pancreatic Ductal Adenocarcinoma Detection on Computed TomographyAuthors: Natália Alves, Megan Schuurmans, Geke Litjens, Joeran S. Bosma, John Hermans, Henkjan HuismanJournal-ref: lves, N.; Schuurmans, M.;Litjens, G.; Bosma, J.S.; Hermans, J.;Huisman, H. Fully Automatic DeepLearning Framework for PancreaticDuctal Adenocarcinoma Detection onComputed Tomography.Cancers2022,14, 376Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [1538] arXiv:2111.15498 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Assessment of Data Consistency through Cascades of Independently Recurrent Inference Machines for fast and robust accelerated MRI reconstructionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1539] arXiv:2111.15519 (cross-list from eess.IV) [pdf, other]
-
Title: Gram Barcodes for Histopathology Tissue Texture RetrievalSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[ showing 1539 entries per page: fewer | more ]
Disable MathJax (What is MathJax?)
Links to: arXiv, form interface, find, cs, 2404, contact, help (Access key information)