Computer Vision and Pattern Recognition
Authors and titles for cs.CV in Jan 2022
- [1] arXiv:2201.00043 [pdf, other]
  Title: Multi-Dimensional Model Compression of Vision Transformer
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [2] arXiv:2201.00059 [pdf, other]
  Title: iCaps: Iterative Category-level Object Pose and Shape Estimation
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [3] arXiv:2201.00080 [pdf, other]
  Title: PatchTrack: Multiple Object Tracking Using Frame Patches
  Comments: 11 pages, 4 figures, 2 tables
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [4] arXiv:2201.00095 [pdf, other]
  Title: Computer Vision Based Parking Optimization System
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [5] arXiv:2201.00096 [pdf, other]
  Title: SalyPath360: Saliency and Scanpath Prediction Framework for Omnidirectional Images
  Comments: Accepted at Electronic Imaging Symposium 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [6] arXiv:2201.00097 [pdf, other]
  Title: Adversarial Attack via Dual-Stage Network Erosion
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [7] arXiv:2201.00103 [pdf, other]
  Title: Robust Region Feature Synthesizer for Zero-Shot Object Detection
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [8] arXiv:2201.00107 [pdf, other]
  Title: Quality-aware Part Models for Occluded Person Re-identification
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [9] arXiv:2201.00112 [pdf, other]
  Title: SurfGen: Adversarial 3D Shape Synthesis with Explicit Surface Discriminators
  Comments: ICCV 2021. Project page: this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [10] arXiv:2201.00132 [pdf, other]
  Title: SAFL: A Self-Attention Scene Text Recognizer with Focal Loss
  Authors: Bao Hieu Tran, Thanh Le-Cong, Huu Manh Nguyen, Duc Anh Le, Thanh Hung Nguyen, Phi Le Nguyen
  Comments: Accepted to ICMLA 2020
  Journal-ref: 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA)
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [11] arXiv:2201.00177 [pdf, other]
  Title: Adaptive Image Inpainting
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [12] arXiv:2201.00220 [pdf, other]
  Title: Turath-150K: Image Database of Arab Heritage
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [13] arXiv:2201.00239 [pdf, other]
  Title: SporeAgent: Reinforced Scene-level Plausibility for Object Pose Refinement
  Comments: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [14] arXiv:2201.00267 [pdf, other]
  Title: On the Cross-dataset Generalization in License Plate Recognition
  Comments: Accepted for presentation at the International Conference on Computer Vision Theory and Applications (VISAPP) 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [15] arXiv:2201.00323 [pdf, other]
  Title: V-LinkNet: Learning Contextual Inpainting Across Latent Space of Generative Adversarial Network
  Comments: 13 pages including references, 9 figures and 4 tables
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [16] arXiv:2201.00346 [pdf, other]
  Title: Detail-Preserving Transformer for Light Field Image Super-Resolution
  Comments: AAAI2022, Code: this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [17] arXiv:2201.00377 [pdf, other]
  Title: Parkour Spot ID: Feature Matching in Satellite and Street view images using Deep Learning
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [18] arXiv:2201.00392 [pdf, other]
  Title: Fast and High-Quality Image Denoising via Malleable Convolutions
  Authors: Yifan Jiang, Bartlomiej Wronski, Ben Mildenhall, Jonathan T. Barron, Zhangyang Wang, Tianfan Xue
  Comments: Accepted by ECCV 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [19] arXiv:2201.00411 [pdf, other]
  Title: The Introspective Agent: Interdependence of Strategy, Physiology, and Sensing for Embodied Agents
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [20] arXiv:2201.00424 [pdf, other]
  Title: Splicing ViT Features for Semantic Appearance Transfer
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [21] arXiv:2201.00434 [pdf, other]
  Title: TVNet: Temporal Voting Network for Action Localization
  Comments: 9 pages, 7 figures, 11 tables
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [22] arXiv:2201.00439 [pdf, other]
  Title: Salient Object Detection by LTP Texture Characterization on Opposing Color Pairs under SLICO Superpixel Constraint
  Journal-ref: J. Imaging 2022, 8(4), 110
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [23] arXiv:2201.00443 [pdf, other]
  Title: Scene Graph Generation: A Comprehensive Survey
  Authors: Guangming Zhu, Liang Zhang, Youliang Jiang, Yixuan Dang, Haoran Hou, Peiyi Shen, Mingtao Feng, Xia Zhao, Qiguang Miao, Syed Afaq Ali Shah, Mohammed Bennamoun
  Comments: Submitted to TPAMI
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [24] arXiv:2201.00454 [pdf, other]
  Title: Memory-Guided Semantic Learning Network for Temporal Sentence Grounding
  Comments: Accepted by AAAI2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [25] arXiv:2201.00457 [pdf, other]
  Title: Exploring Motion and Appearance Information for Temporal Sentence Grounding
  Comments: Accepted by AAAI2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [26] arXiv:2201.00461 [pdf, other]
  Title: Biometrics in the Time of Pandemic: 40% Masked Face Recognition Degradation can be Reduced to 2%
  Comments: 11 pages, 8 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [27] arXiv:2201.00462 [pdf, other]
  Title: D-Former: A U-shaped Dilated Transformer for 3D Medical Image Segmentation
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [28] arXiv:2201.00467 [pdf, other]
  Title: maskGRU: Tracking Small Objects in the Presence of Large Background Motions
  Comments: 12 pages, 3 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [29] arXiv:2201.00471 [pdf, other]
  Title: Revisiting Open World Object Detection
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [30] arXiv:2201.00475 [pdf, other]
  Title: CaFT: Clustering and Filter on Tokens of Transformer for Weakly Supervised Object Localization
  Authors: Ming Li
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [31] arXiv:2201.00487 [pdf, other]
  Title: Language as Queries for Referring Video Object Segmentation
  Comments: 14 pages, accepted by CVPR2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [32] arXiv:2201.00504 [pdf, ps, other]
  Title: R-Theta Local Neighborhood Pattern for Unconstrained Facial Image Recognition and Retrieval
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [33] arXiv:2201.00509 [pdf, ps, other]
  Title: Local Gradient Hexa Pattern: A Descriptor for Face Recognition and Retrieval
  Journal-ref: IEEE Transactions on Circuits and Systems for Video Technology, vol-28, no-1, pp. 171-180, (2018). ISSN/ISBN: 1051-8215
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [34] arXiv:2201.00518 [pdf, ps, other]
  Title: Cascaded Asymmetric Local Pattern: A Novel Descriptor for Unconstrained Facial Image Recognition and Retrieval
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [35] arXiv:2201.00520 [pdf, other]
  Title: Vision Transformer with Deformable Attention
  Comments: Accepted by CVPR2022 (12 pages, 7 figures)
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [36] arXiv:2201.00531 [pdf, other]
  Title: Novelty-based Generalization Evaluation for Traffic Light Detection
  Comments: Accepted/Presented at ICMLA 2021
  Journal-ref: 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [37] arXiv:2201.00572 [pdf, other]
  Title: Enabling Verification of Deep Neural Networks in Perception Tasks Using Fuzzy Logic and Concept Embeddings
  Comments: 32 pages (including 14 pages supplemental material), 11 Figures, 8 Tables
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
- [38] arXiv:2201.00577 [pdf, other]
  Title: Semantically Grounded Visual Embeddings for Zero-Shot Learning
  Comments: Accepted at CVPRW
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [39] arXiv:2201.00625 [pdf, other]
  Title: GAT-CADNet: Graph Attention Network for Panoptic Symbol Spotting in CAD Drawings
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [40] arXiv:2201.00672 [pdf, other]
  Title: Compression-Resistant Backdoor Attack against Deep Neural Networks
  Journal-ref: Applied Intelligence, 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [41] arXiv:2201.00708 [pdf, other]
  Title: Multiview point cloud registration with anisotropic and space-varying localization noise
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [42] arXiv:2201.00714 [pdf, other]
  Title: Multi-view Data Classification with a Label-driven Auto-weighted Strategy
  Comments: 11 pages, 8 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [43] arXiv:2201.00770 [pdf, other]
  Title: FaceQgen: Semi-Supervised Deep Learning for Face Image Quality Assessment
  Journal-ref: IEEE International Conference on Automatic Face and Gesture Recognition 2021
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [44] arXiv:2201.00785 [pdf, other]
  Title: Implicit Autoencoder for Point-Cloud Self-Supervised Representation Learning
  Authors: Siming Yan, Zhenpei Yang, Haoxiang Li, Chen Song, Li Guan, Hao Kang, Gang Hua, Qixing Huang
  Comments: Published in ICCV 2023. The code is available at this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [45] arXiv:2201.00791 [pdf, other]
  Title: DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [46] arXiv:2201.00814 [pdf, other]
  Title: Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space
  Comments: CVPR 2022. Code is available at this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [47] arXiv:2201.00848 [pdf, ps, other]
  Title: Runway Extraction and Improved Mapping from Space Imagery
  Authors: David A. Noever
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [48] arXiv:2201.00877 [pdf, other]
  Title: Gaussian-Hermite Moment Invariants of General Multi-Channel Functions
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [49] arXiv:2201.00893 [pdf, other]
  Title: Rice Diseases Detection and Classification Using Attention Based Neural Network and Bayesian Optimization
  Journal-ref: Expert Systems with Applications, 178, 114770. (2021)
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [50] arXiv:2201.00947 [pdf, other]
  Title: HWRCNet: Handwritten Word Recognition in JPEG Compressed Domain using CNN-BiLSTM Network
  Comments: Accepted in International Conference on Data Analytics and Learning, 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [51] arXiv:2201.00966 [pdf, ps, other]
  Title: AI visualization in Nanoscale Microscopy
  Authors: Rajagopal A (1), Nirmala V (2), Andrew J (3), Arun Muthuraj Vedamanickam. ((1) Indian Institute of Technology Madras, (2) Queen Marys College, (3) Karunya Institute of Technology and Sciences. India)
  Comments: Best paper award at International Conference On Big Data, Machine Learning and Applications 2021. this http URL In Springer Proceedings 2021
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
- [52] arXiv:2201.00969 [pdf, ps, other]
  Title: Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety
  Comments: In Springer Proceedings. International Conference On Big Data, Machine Learning and Applications 2021. this http URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [53] arXiv:2201.00975 [pdf, other]
  Title: StyleM: Stylized Metrics for Image Captioning Built with Contrastive N-grams
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [54] arXiv:2201.00977 [pdf, other]
  Title: Underwater Object Classification and Detection: first results and open challenges
  Journal-ref: In Proceedings of OCEANS 2022 Chennai, February 21-24, pp. 1-6
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [55] arXiv:2201.00978 [pdf, other]
  Title: PyramidTNT: Improved Transformer-in-Transformer Baselines with Pyramid Architecture
  Comments: Tech Report. An extension of "Transformer in Transformer" (arXiv:2103.00112)
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [56] arXiv:2201.00985 [pdf, other]
  Title: Variational Stacked Local Attention Networks for Diverse Video Captioning
  Authors: Tonmoay Deb, Akib Sadmanee, Kishor Kumar Bhaumik, Amin Ahsan Ali, M Ashraful Amin, A K M Mahbubur Rahman
  Comments: To be published in Winter Conference on Applications of Computer Vision 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [57] arXiv:2201.01001 [pdf, other]
  Title: Attention Mechanism Meets with Hybrid Dense Network for Hyperspectral Image Classification
  Authors: Muhammad Ahmad, Adil Mehmood Khan, Manuel Mazzara, Salvatore Distefano, Swalpa Kumar Roy, Xin Wu
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [58] arXiv:2201.01002 [pdf, ps, other]
  Title: Multi-Representation Adaptation Network for Cross-domain Image Classification
  Comments: Neural Networks regular paper. Transfer Learning, Domain Adaptation
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [59] arXiv:2201.01008 [pdf, other]
  Title: Learning to Generate Novel Classes for Deep Metric Learning
  Comments: Accepted to BMVC 2021
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [60] arXiv:2201.01016 [pdf, other]
  Title: Detailed Facial Geometry Recovery from Multi-View Images by Learning an Implicit Function
  Comments: AAAI 2022 Oral, updated to camera ready version
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [61] arXiv:2201.01029 [pdf, other]
  Title: Weakly-supervised continual learning for class-incremental segmentation
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [62] arXiv:2201.01030 [pdf, other]
  Title: A Robust Visual Sampling Model Inspired by Receptive Field
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [63] arXiv:2201.01046 [pdf, other]
  Title: Sound and Visual Representation Learning with Multiple Pretraining Tasks
  Comments: 11 pages, 3 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [64] arXiv:2201.01047 [pdf, other]
  Title: DIAL: Deep Interactive and Active Learning for Semantic Segmentation in Remote Sensing
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [65] arXiv:2201.01073 [pdf, other]
  Title: Towards Unsupervised Open World Semantic Segmentation
  Comments: UAI 2022, published in PMLR, Proceedings of the Thirty-Eighth Conference on Uncertainty in Artificial Intelligence
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [66] arXiv:2201.01080 [pdf, other]
  Title: Towards Understanding and Harnessing the Effect of Image Transformation in Adversarial Detection
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [67] arXiv:2201.01081 [pdf, ps, other]
  Title: Identifying the exterior image of buildings on a 3D map and extracting elevation information using deep learning and digital image processing
  Comments: 16 pages, 10 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [68] arXiv:2201.01087 [pdf, other]
  Title: Learning Quality-aware Representation for Multi-person Pose Regression
  Comments: Accepted by AAAI2022; Slightly different compared with the camera-ready version
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [69] arXiv:2201.01090 [pdf, other]
  Title: Short Range Correlation Transformer for Occluded Person Re-Identification
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [70] arXiv:2201.01102 [pdf, other]
  Title: Towards Transferable Unrestricted Adversarial Examples with Minimum Changes
  Comments: Accepted at SaTML 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [71] arXiv:2201.01115 [pdf, other]
  Title: Data Augmentation for Depression Detection Using Skeleton-Based Gait Information
  Comments: 10 pages, 10 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [72] arXiv:2201.01191 [pdf, ps, other]
  Title: Automated 3D reconstruction of LoD2 and LoD1 models for all 10 million buildings of the Netherlands
  Comments: Submitted to Journal of Photogrammetric Engineering & Remote Sensing
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [73] arXiv:2201.01275 [pdf, ps, other]
  Title: Local Quadruple Pattern: A Novel Descriptor for Facial Image Recognition and Retrieval
  Journal-ref: Computers & Electrical Engineering, vol-62, pp. 92-104, (2017). (Elsevier) ISSN/ISBN: 0045-7906
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [74] arXiv:2201.01276 [pdf, ps, other]
  Title: Local Directional Gradient Pattern: A Local Descriptor for Face Recognition
  Journal-ref: Multimedia Tools and Applications, vol-76, no-1, pp. 1201-1216, (2017). (Springer) ISSN/ISBN: 1573-7721
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [75] arXiv:2201.01283 [pdf, other]
  Title: Self-supervised Learning from 100 Million Medical Images
  Authors: Florin C. Ghesu, Bogdan Georgescu, Awais Mansoor, Youngjin Yoo, Dominik Neumann, Pragneshkumar Patel, R.S. Vishwanath, James M. Balter, Yue Cao, Sasa Grbic, Dorin Comaniciu
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [76] arXiv:2201.01293 [pdf, other]
  Title: A Transformer-Based Siamese Network for Change Detection
  Comments: Accepted to International Geoscience and Remote Sensing Symposium (IGARSS), 2022. 4 pages, 2 figures. Code & trained models are available at this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [77] arXiv:2201.01294 [pdf, other]
  Title: 3DVSR: 3D EPI Volume-based Approach for Angular and Spatial Light field Image Super-resolution
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [78] arXiv:2201.01297 [pdf, other]
  Title: Online Multi-Object Tracking with Unsupervised Re-Identification Learning and Occlusion Estimation
  Comments: To Appear at Neurocomputing 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [79] arXiv:2201.01391 [pdf, other]
  Title: Self-Supervised Approach to Addressing Zero-Shot Learning Problem
  Authors: Ademola Okerinde, Sam Hoggatt, Divya Vani Lakkireddy, Nolan Brubaker, William Hsu, Lior Shamir, Brian Spiesman
  Journal-ref: The 4th International Conference on Computing and Data Science (CONF-CDS 2022)
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [80] arXiv:2201.01399 [pdf, other]
  Title: Corrupting Data to Remove Deceptive Perturbation: Using Preprocessing Method to Improve System Robustness
  Comments: CSCI 2021
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [81] arXiv:2201.01408 [pdf, other]
  Title: Fusing Convolutional Neural Network and Geometric Constraint for Image-based Indoor Localization
  Comments: Accepted by IEEE robotics and automation letters
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [82] arXiv:2201.01410 [pdf, other]
  Title: Synthesizing Tensor Transformations for Visual Self-attention
  Comments: 13 pages, 3 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [83] arXiv:2201.01415 [pdf, other]
  Title: Problem-dependent attention and effort in neural networks with applications to image resolution and model selection
  Authors: Chris Rohlfs
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [84] arXiv:2201.01416 [pdf, other]
  Title: Latent Vector Expansion using Autoencoder for Anomaly Detection
  Comments: 3 pages, 2 figures, In Proceedings of the 34th Workshop on Image Processing and Image Understanding (IPIU 2022)
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [85] arXiv:2201.01427 [pdf, other]
  Title: Attention-based Dual Supervised Decoder for RGBD Semantic Segmentation
  Comments: 12 pages, 6 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [86] arXiv:2201.01486 [pdf, ps, other]
  Title: Sign Language Recognition System using TensorFlow Object Detection API
  Comments: 14 pages, 5 figures, ANTIC 2021
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
- [87] arXiv:2201.01494 [pdf, other]
  Title: Improving Object Detection, Multi-object Tracking, and Re-Identification for Disaster Response Drones
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [88] arXiv:2201.01501 [pdf, other]
  Title: Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation
  Comments: CVPR 2022 Accepted
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [89] arXiv:2201.01503 [pdf, other]
  Title: Towards Uniform Point Distribution in Feature-preserving Point Cloud Filtering
  Comments: This paper is accepted to CVM
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
- [90] arXiv:2201.01565 [pdf, other]
  Title: Culture-to-Culture Image Translation and User Evaluation
  Comments: 31 pages (bibliography excluded), 4 figures, 6 Tables
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [91] arXiv:2201.01592 [pdf, other]
  Title: Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network with Graph Representation Learning
  Authors: Xingqun Qi, Muyi Sun, Zijian Wang, Jiaming Liu, Qi Li, Fang Zhao, Shanghang Zhang, Caifeng Shan
  Comments: Accepted to IEEE TNNLS
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [92] arXiv:2201.01603 [pdf, other]
  Title: Deep Probabilistic Graph Matching
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [93] arXiv:2201.01609 [pdf, other]
  Title: All You Need In Sign Language Production
  Comments: arXiv admin note: substantial text overlap with arXiv:2103.15910
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [94] arXiv:2201.01615 [pdf, other]
  Title: Lawin Transformer: Improving Semantic Segmentation Transformer with Multi-Scale Representations via Large Window Attention
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [95] arXiv:2201.01636 [pdf, other]
  Title: Tackling the Class Imbalance Problem of Deep Learning Based Head and Neck Organ Segmentation
  Comments: 10 pages, 3 figures, 1 table, submitted to the International Journal of Computer Assisted Radiology and Surgery
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [96] arXiv:2201.01654 [pdf, other]
  Title: TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets
  Comments: accepted in the AAAI-22 Workshop on Scientific Document Understanding at the Thirty-Sixth AAAI Conference on Artificial Intelligence (SDU@AAAI-22)
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [97] arXiv:2201.01661 [pdf, ps, other]
  Title: Evaluation of Thermal Imaging on Embedded GPU Platforms for Application in Vehicular Assistance Systems
  Comments: 14 pages, 9 tables, and 27 figures
  Journal-ref: Published in IEEE-TIV Journal in 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [98] arXiv:2201.01683 [pdf, other]
  Title: Surface-Aligned Neural Radiance Fields for Controllable 3D Human Synthesis
  Comments: CVPR 2022. Project page: this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [99] arXiv:2201.01699 [pdf, ps, other]
  Title: An Investigation of "Benford's" Law Divergence and Machine Learning Techniques for "Intra-Class" Separability of Fingerprint Images
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [100] arXiv:2201.01703 [pdf, other]
  Title: Probing TryOnGAN
  Comments: 5 pages, to appear in the proceedings of the 9th ACM IKDD CODS and 27th COMAD (CODS-COMAD '22)
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [101] arXiv:2201.01709 [pdf, other]
  Title: The Effect of Model Compression on Fairness in Facial Expression Recognition
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [102] arXiv:2201.01783 [pdf, ps, other]
  Title: Automated Scoring of Graphical Open-Ended Responses Using Artificial Neural Networks
  Comments: 23 pages
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP)
- [103] arXiv:2201.01823 [pdf, other]
  Title: Learning Semantic Ambiguities for Zero-Shot Learning
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [104] arXiv:2201.01831 [pdf, other]
  Title: POCO: Point Convolution for Surface Reconstruction
  Comments: Accepted at Conference on Computer Vision and Pattern Recognition (CVPR), 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG); Machine Learning (cs.LG)
- [105] arXiv:2201.01850 [pdf, other]
  Title: On the Real-World Adversarial Robustness of Real-Time Semantic Segmentation Models for Autonomous Driving
  Authors: Giulio Rossolini, Federico Nesti, Gianluca D'Amico, Saasha Nair, Alessandro Biondi, Giorgio Buttazzo
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [106] arXiv:2201.01857 [pdf, other]
  Title: Multi-Grid Redundant Bounding Box Annotation for Accurate Object Detection
  Comments: Will appear on "The 19th IEEE International Conference on Pervasive Intelligence and Computing (PICom 2021)". Conference Held on 25 - 28 October 2021
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [107] arXiv:2201.01858 [pdf, other]
  Title: Towards realistic symmetry-based completion of previously unseen point clouds
  Authors: Taras Rumezhak, Oles Dobosevych, Rostyslav Hryniv, Vladyslav Selotkin, Volodymyr Karpiv, Mykola Maksymenko
  Journal-ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, October, 2021, 2542-2550
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [108] arXiv:2201.01883 [pdf, other]
  Title: Memory-guided Image De-raining Using Time-Lapse Data
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [109] arXiv:2201.01901 [pdf, other]
  Title: Incremental Object Grounding Using Scene Graphs
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [110] arXiv:2201.01928 [pdf, other]
  Title: Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [111] arXiv:2201.01929 [pdf, other]
  Title: Decompose to Adapt: Cross-domain Object Detection via Feature Disentanglement
  Authors: Dongnan Liu, Chaoyi Zhang, Yang Song, Heng Huang, Chenyu Wang, Michael Barnett, Weidong Cai
  Comments: Accepted to appear in IEEE Transactions on Multimedia; source code: this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [112] arXiv:2201.01953 [pdf, other]
  Title: Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [113] arXiv:2201.01961 [pdf, other]
  Title: Diversity-boosted Generalization-Specialization Balancing for Zero-shot Learning
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [114] arXiv:2201.01971 [pdf, other]
  Title: Multi-Label Classification on Remote-Sensing Images
  Comments: The report consists of 95 Pages, 45 Figures, 31 Tables, 85 References
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [115] arXiv:2201.01976 [pdf, other]
  Title: SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object Detection
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [116] arXiv:2201.01983 [pdf, other]
  Title: Multi-Domain Joint Training for Person Re-Identification
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [117] arXiv:2201.01984 [pdf, other]
  Title: Compact Bidirectional Transformer for Image Captioning
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [118] arXiv:2201.02001 [pdf, other]
  Title: TransVPR: Transformer-based place recognition with multi-level attention aggregation
  Comments: CVPR 2022 oral
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [119] arXiv:2201.02010 [pdf, other]
  Title: Self-Training Vision Language BERTs with a Unified Conditional Model
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [120] arXiv:2201.02011 [pdf, other]
  Title: An unambiguous cloudiness index for nonwovens
  Journal-ref: Journal of Mathematics in Industry, 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [121] arXiv:2201.02017 [pdf, other]
  Title: Enhancing Egocentric 3D Pose Estimation with Third Person Views
  Authors: Ameya Dhamanaskar, Mariella Dimiccoli, Enric Corona, Albert Pumarola, Francesc Moreno-Noguer
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [122] arXiv:2201.02028 [pdf, other]
  Title: A Light in the Dark: Deep Learning Practices for Industrial Computer Vision
  Authors: Maximilian Harl, Marvin Herchenbach, Sven Kruschel, Nico Hambauer, Patrick Zschech, Mathias Kraus
  Comments: Preprint accepted for archival and presentation at the 17th International Conference on Wirtschaftsinformatik 2022. 14 pages, 5 figures, 4 tables
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [123] arXiv:2201.02052 [pdf, other]
  Title: A Unified Framework for Attention-Based Few-Shot Object Detection
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [124] arXiv:2201.02065 [pdf, other]
  Title: ASL-Skeleton3D and ASL-Phono: Two Novel Datasets for the American Sign Language
  Journal-ref: The paper is under consideration at Pattern Recognition Letters (2022) (under the manuscript number PRLETTERS-D-22-00140)
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [125] arXiv:2201.02074 [pdf, other]
-
Title: EM-driven unsupervised learning for efficient motion segmentationComments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [126] arXiv:2201.02093 [pdf, ps, other]
-
Title: Deep Learning Based Classification System For Recognizing Local SpinachAuthors: Mirajul Islam, Nushrat Jahan Ria, Jannatul Ferdous Ani, Abu Kaisar Mohammad Masum, Sheikh Abujar, Syed Akhter HossainComments: 10 pages, 4 figures, supplemental materials. Accepted in 2nd International Conference on Deep Learning, Artificial Intelligence and Robotics (ICDLAIR) 2020Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [127] arXiv:2201.02107 [pdf, other]
-
Title: HyperionSolarNet: Solar Panel Detection from Aerial ImagesAuthors: Poonam Parhar, Ryan Sawasaki, Alberto Todeschini, Colorado Reed, Hossein Vahabi, Nathan Nusaputra, Felipe VergaraSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [128] arXiv:2201.02110 [pdf, other]
-
Title: Eye Know You Too: A DenseNet Architecture for End-to-end Eye Movement BiometricsComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [129] arXiv:2201.02149 [pdf, other]
-
Title: Bio-inspired Min-Nets Improve the Performance and Robustness of Deep NetworksJournal-ref: Gruening, P., & Barth, E. (2021, October). Bio-inspired Min-Nets Improve the Performance and Robustness of Deep Networks. In SVRHM 2021 Workshop@ NeurIPSSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [130] arXiv:2201.02193 [pdf, other]
-
Title: Realistic Full-Body Anonymization with Surface-Guided GANsComments: 8 pages, 7 figures, 6 tables. Source code and appendix available at: this https URL Published at WACV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [131] arXiv:2201.02233 [pdf, other]
-
Title: Consistent Style TransferComments: 10 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [132] arXiv:2201.02260 [pdf, other]
-
Title: CitySurfaces: City-Scale Semantic Segmentation of Sidewalk MaterialsComments: Sustainable Cities and Society journal (accepted); Model: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [133] arXiv:2201.02263 [pdf, other]
-
Title: ITSA: An Information-Theoretic Approach to Automatic Shortcut Avoidance and Domain Generalization in Stereo Matching NetworksComments: 11 pages, 4 figures. Accepted by CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [134] arXiv:2201.02279 [pdf, other]
-
Title: De-rendering 3D Objects in the WildJournal-ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 18490-18499Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [135] arXiv:2201.02280 [pdf, other]
-
Title: Repurposing Existing Deep Networks for Caption and Aesthetic-Guided Image CroppingAuthors: Nora Horanyi, Kedi Xia, Kwang Moo Yi, Abhishake Kumar Bojja, Ales Leonardis, Hyung Jin ChangJournal-ref: Pattern Recognition, 2022, 108485, ISSN 0031-3203Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [136] arXiv:2201.02302 [pdf, other]
-
Title: Extending One-Stage Detection with Open-World ProposalsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [137] arXiv:2201.02304 [pdf, other]
-
Title: Budget-aware Few-shot Learning via Graph Convolutional NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [138] arXiv:2201.02365 [pdf, other]
-
Title: Motion Prediction via Joint Dependency Modeling in Phase SpaceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [139] arXiv:2201.02366 [pdf, other]
-
Title: Uncertainty-Aware Cascaded Dilation Filtering for High-Efficiency DerainingComments: 14 pages, 10 figures, 10 tables. This is the extension of our conference version this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [140] arXiv:2201.02369 [pdf, other]
-
Title: Deep Generative Framework for Interactive 3D Terrain Authoring and ManipulationSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
- [141] arXiv:2201.02396 [pdf, other]
-
Title: Detecting Human-to-Human-or-Object (H2O) Interactions with DIABOLOComments: ACCEPTED in IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [142] arXiv:2201.02494 [pdf, other]
-
Title: Progressive Video Summarization via Multimodal Self-supervised LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [143] arXiv:2201.02495 [pdf, other]
-
Title: Sign Language Video Retrieval with Free-Form Textual QueriesComments: In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [144] arXiv:2201.02503 [pdf, ps, other]
-
Title: A Review of Deep Learning Techniques for Markerless Human Motion on Synthetic DatasetsComments: 11 pages, 5 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [145] arXiv:2201.02526 [pdf, other]
-
Title: Learning Target-aware Representation for Visual Tracking via Informative InteractionsComments: 9 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [146] arXiv:2201.02533 [pdf, other]
-
Title: NeROIC: Neural Rendering of Objects from Online Image CollectionsAuthors: Zhengfei Kuang, Kyle Olszewski, Menglei Chai, Zeng Huang, Panos Achlioptas, Sergey TulyakovComments: SIGGRAPH 2022 (Journal Track). Project page: this https URL Code repository: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [147] arXiv:2201.02560 [pdf, other]
-
Title: A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband ItemsComments: IEEE Transactions on Systems, Man, and Cybernetics: Systems, Source code is available at this https URLJournal-ref: IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [148] arXiv:2201.02588 [pdf, other]
-
Title: FogAdapt: Self-Supervised Domain Adaptation for Semantic Segmentation of Foggy ImagesComments: Accepted at Elsevier Journal of NeurocomputingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [149] arXiv:2201.02593 [pdf, other]
-
Title: Equalized Focal Loss for Dense Long-Tailed Object DetectionComments: Accepted by the IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [150] arXiv:2201.02605 [pdf, other]
-
Title: Detecting Twenty-thousand Classes using Image-level SupervisionComments: ECCV 2022 camera ready. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [151] arXiv:2201.02609 [pdf, other]
-
Title: Generalized Category DiscoveryComments: CVPR 22. Changes from pre-print highlighted in GitHub repoSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [152] arXiv:2201.02639 [pdf, other]
-
Title: MERLOT Reserve: Neural Script Knowledge through Vision and Language and SoundAuthors: Rowan Zellers, Jiasen Lu, Ximing Lu, Youngjae Yu, Yanpeng Zhao, Mohammadreza Salehi, Aditya Kusupati, Jack Hessel, Ali Farhadi, Yejin ChoiComments: CVPR 2022. Project page at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [153] arXiv:2201.02698 [pdf, other]
-
Title: Development of Automatic Tree Counting Software from UAV Based Aerial Images With Machine LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [154] arXiv:2201.02714 [pdf, other]
-
Title: Pseudo-labelling and Meta Reweighting Learning for Image Aesthetic Quality AssessmentComments: 10 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [155] arXiv:2201.02726 [pdf, ps, other]
-
Title: Real-time Rail Recognition Based on 3D Point CloudsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [156] arXiv:2201.02767 [pdf, other]
-
Title: QuadTree Attention for Vision TransformersComments: ICLR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [157] arXiv:2201.02772 [pdf, other]
-
Title: A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal RetrievalSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
- [158] arXiv:2201.02779 [pdf, other]
-
Title: A Baseline Statistical Method For Robust User-Assisted Multiple SegmentationAuthors: Huseyin AfserComments: Submitted to IEEE Signal Processing Letters. Is a continuation to our work: H. Afşer, "Statistical Classification via Robust Hypothesis Testing: Non-Asymptotic and Simple Bounds," in IEEE Signal Processing Letters, vol. 28, pp. 2112-2116, 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Signal Processing (eess.SP)
- [159] arXiv:2201.02784 [pdf, other]
-
Title: Relieving Long-tailed Instance Segmentation via Pairwise Class BalanceComments: Accepted to CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [160] arXiv:2201.02798 [pdf, other]
-
Title: RARA: Zero-shot Sim2Real Visual Navigation with Following Foreground CuesComments: 7 pages, submitted to IROS, code: github.com/kkelchte/fgbgSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [161] arXiv:2201.02799 [pdf, other]
-
Title: Counteracting Dark Web Text-Based CAPTCHA with Generative Adversarial Learning for Proactive Cyber Threat IntelligenceComments: Accepted by ACM Transactions on Management Information SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [162] arXiv:2201.02836 [pdf, ps, other]
-
Title: Self-aligned Spatial Feature Extraction Network for UAV Vehicle Re-identificationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [163] arXiv:2201.02837 [pdf, other]
-
Title: Mushrooms Detection, Localization and 3D Pose Estimation using RGB-D Sensor for Robotic-picking ApplicationsSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [164] arXiv:2201.02848 [pdf, other]
-
Title: Learning Sample Importance for Cross-Scenario Video Temporal GroundingComments: 7 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
- [165] arXiv:2201.02849 [pdf, other]
-
Title: Spatio-Temporal Tuples Transformer for Skeleton-Based Action RecognitionComments: 14 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [166] arXiv:2201.02850 [pdf, other]
-
Title: Image-based Automatic Dial Meter Reading in Unconstrained ScenariosJournal-ref: Measurement, vol. 204, p. 112025, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [167] arXiv:2201.02853 [pdf, ps, other]
-
Title: Fake Hilsa Fish Detection Using Machine VisionComments: 12 pages, 8 figures, International Joint Conference on Advances in Computational Intelligence (IJCACI 2020)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [168] arXiv:2201.02861 [pdf, other]
-
Title: Decoupling Makes Weakly Supervised Local Feature BetterComments: CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [169] arXiv:2201.02885 [pdf, other]
-
Title: Agricultural Plant Cataloging and Establishment of a Data Framework from UAV-based Crop Images by Computer VisionAuthors: Maurice Günder, Facundo R. Ispizua Yamati, Jana Kierdorf, Ribana Roscher, Anne-Katrin Mahlein, Christian BauckhageComments: Preprint submitted to GigaScienceJournal-ref: GigaScience, Volume 11, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
- [170] arXiv:2201.02946 [pdf, other]
-
Title: Resolving Camera Position for a Practical Application of Gaze Estimation on Edge DevicesComments: 6 pages, 11 figures, conference paperJournal-ref: ICAIIC 2022 (The 4th International Conference on Artificial Intelligence in Information and Communication February 21 (Mon.) ~ 24 (Thur.), 2022, Guam, USA & Virtual Conference)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [171] arXiv:2201.02963 [pdf, other]
-
Title: Box2Seg: Learning Semantics of 3D Point Clouds with Box-Level SupervisionComments: 9 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [172] arXiv:2201.02980 [pdf, other]
-
Title: Invariance encoding in sliced-Wasserstein space for image classification with limited training dataAuthors: Mohammad Shifat E Rabbi, Yan Zhuang, Shiying Li, Abu Hasnat Mohammad Rubaiyat, Xuwang Yin, Gustavo K. RohdeSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [173] arXiv:2201.02991 [pdf, other]
-
Title: A Survey on Face Recognition SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [174] arXiv:2201.03002 [pdf, other]
-
Title: MaskMTL: Attribute prediction in masked facial images with deep multitask learningAuthors: Prerana Mukherjee, Vinay Kaushik, Ronak Gupta, Ritika Jha, Daneshwari Kankanwadi, Brejesh LallComments: In Proceedings of 9th International Conference on Pattern Recognition and Machine Intelligence (PReMI 2021), Kolkata, IndiaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [175] arXiv:2201.03013 [pdf, ps, other]
-
Title: ThreshNet: An Efficient DenseNet Using Threshold Mechanism to Reduce ConnectionsComments: IEEE AccessSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [176] arXiv:2201.03014 [pdf, other]
-
Title: Glance and Focus Networks for Dynamic Visual RecognitionComments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI). Journal version of arXiv:2010.05300 (NeurIPS 2020). The first two authors contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [177] arXiv:2201.03018 [pdf, other]
-
Title: Self-Supervised Feature Learning from Partial Point Clouds via Pose DisentanglementComments: 10 pages, 4 figures and 6 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [178] arXiv:2201.03043 [pdf, other]
-
Title: Semantics-driven Attentive Few-shot Learning over Clean and Noisy SamplesComments: 25 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [179] arXiv:2201.03045 [pdf, other]
-
Title: Applying Artificial Intelligence for Age Estimation in Digital Forensic InvestigationsSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [180] arXiv:2201.03080 [pdf, other]
-
Title: The State of Aerial Surveillance: A SurveyAuthors: Kien Nguyen, Clinton Fookes, Sridha Sridharan, Yingli Tian, Feng Liu, Xiaoming Liu, Arun RossSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [181] arXiv:2201.03101 [pdf, other]
-
Title: ImageSubject: A Large-scale Dataset for Subject DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [182] arXiv:2201.03141 [pdf, other]
-
Title: Multi-Level Attention for Unsupervised Person Re-IdentificationAuthors: Yi ZhengSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [183] arXiv:2201.03170 [pdf, other]
-
Title: Thai Finger Spelling Recognition: Investigating MediaPipe Hands PotentialsComments: 19 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [184] arXiv:2201.03176 [pdf, other]
-
Title: Pedestrian Detection: Domain Generalization, CNNs, Transformers and BeyondComments: 13 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [185] arXiv:2201.03178 [pdf, ps, other]
-
Title: Swin Transformer coupling CNNs Makes Strong Contextual Encoders for VHR Image Road ExtractionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [186] arXiv:2201.03180 [pdf, other]
-
Title: Transfer Learning for Scene Text Recognition in Indian LanguagesComments: 16 pages, 5 figuresJournal-ref: ICDAR 2021: Document Analysis and Recognition, ICDAR 2021 Workshops, pp 182-197Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [187] arXiv:2201.03185 [pdf, other]
-
Title: Towards Boosting the Accuracy of Non-Latin Scene Text RecognitionComments: 12 pages, 6 figuresJournal-ref: ICDAR 2021: Document Analysis and Recognition, ICDAR 2021 Workshops, pp 282-293Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [188] arXiv:2201.03194 [pdf, other]
-
Title: Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [189] arXiv:2201.03212 [pdf, other]
-
Title: Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [190] arXiv:2201.03243 [pdf, ps, other]
-
Title: Small Object Detection using Deep LearningComments: 21 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [191] arXiv:2201.03246 [pdf, other]
-
Title: Vision in adverse weather: Augmentation using CycleGANs with various object detectors for robust perception in autonomous racingAuthors: Izzeddin Teeti, Valentina Musat, Salman Khan, Alexander Rast, Fabio Cuzzolin, Andrew BradleyComments: ICML 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [192] arXiv:2201.03297 [pdf, other]
-
Title: GhostNets on Heterogeneous Devices via Cheap OperationsComments: Accepted by IJCV 2022. Extension of GhostNet CVPR2020 paper (arXiv:1911.11907). arXiv admin note: substantial text overlap with arXiv:1911.11907Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [193] arXiv:2201.03299 [pdf, other]
-
Title: Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural NetworksComments: 27 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [194] arXiv:2201.03323 [pdf, other]
-
Title: Gait Recognition Based on Deep Learning: A SurveyAuthors: Claudio Filipi Gonçalves dos Santos, Diego de Souza Oliveira, Leandro A. Passos, Rafael Gonçalves Pires, Daniel Felipe Silva Santos, Lucas Pascotti Valem, Thierry P. Moreira, Marcos Cleison S. Santana, Mateus Roder, João Paulo Papa, Danilo ColomboSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [195] arXiv:2201.03342 [pdf, other]
-
Title: COIN: Counterfactual Image Generation for VQA InterpretationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
- [196] arXiv:2201.03353 [pdf, other]
-
Title: GMFIM: A Generative Mask-guided Facial Image Manipulation Model for Privacy PreservationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [197] arXiv:2201.03454 [pdf, other]
-
Title: 3D Face Morphing Attacks: Generation, Vulnerability and DetectionComments: The paper is accepted at IEEE Transactions on Biometrics, Behavior and Identity ScienceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [198] arXiv:2201.03545 [pdf, other]
-
Title: A ConvNet for the 2020sComments: CVPR 2022; Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [199] arXiv:2201.03546 [pdf, other]
-
Title: Language-driven Semantic SegmentationComments: ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [200] arXiv:2201.03556 [pdf, other]
-
Title: Reproducing BowNet: Learning Representations by Predicting Bags of Visual WordsComments: This is a reproducibility project. Original work is by Gidaris et al. published in CVPR 2020. Pytorch implementation is public on Github. v2 clarifies comments regarding communication with original authorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [201] arXiv:2201.03597 [pdf, other]
-
Title: Cross-Modality Sub-Image Retrieval using Contrastive Multimodal Image RepresentationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [202] arXiv:2201.03639 [pdf, other]
-
Title: Multi-Query Video RetrievalComments: ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [203] arXiv:2201.03674 [pdf, other]
-
Title: PrintsGAN: Synthetic Fingerprint GeneratorSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [204] arXiv:2201.03686 [pdf, other]
-
Title: NFANet: A Novel Method for Weakly Supervised Water Extraction from High-Resolution Remote Sensing ImagerySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [205] arXiv:2201.03746 [pdf, other]
-
Title: TSA-Net: Tube Self-Attention Network for Action Quality AssessmentComments: 9 pages, 7 figures, conference paperSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [206] arXiv:2201.03786 [pdf, other]
-
Title: Drone Object Detection Using RGB/IR FusionComments: Accepted to Electronic Imaging Symposium, Computational Imaging XX Conference, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [207] arXiv:2201.03791 [pdf, other]
-
Title: Classification of Beer Bottles using Object Detection and Transfer LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [208] arXiv:2201.03794 [pdf, other]
-
Title: Efficient Non-Local Contrastive Attention for Image Super-ResolutionComments: Code is available at this https URLJournal-ref: AAAI2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [209] arXiv:2201.03803 [pdf, other]
-
Title: Unsupervised Domain Adaptive Person Re-id with Local-enhance and Prototype Dictionary LearningAuthors: Haopeng HouSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [210] arXiv:2201.03808 [pdf, other]
-
Title: MobileFaceSwap: A Lightweight Framework for Video Face SwappingComments: AAAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [211] arXiv:2201.03859 [pdf, other]
-
Title: On Exploring Pose Estimation as an Auxiliary Learning Task for Visible-Infrared Person Re-identificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [212] arXiv:2201.03891 [pdf, ps, other]
-
Title: A Saliency based Feature Fusion Model for EEG Emotion EstimationAuthors: Victor Delvigne, Antoine Facchini, Hazem Wannous, Thierry Dutoit, Laurence Ris, Jean-Philippe VandeborreSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [213] arXiv:2201.03902 [pdf, other]
-
Title: Where Is My Mind (looking at)? Predicting Visual Attention from Brain ActivityAuthors: Victor Delvigne, Noé Tits, Luca La Fisca, Nathan Hubens, Antoine Maiorca, Hazem Wannous, Thierry Dutoit, Jean-Philippe VandeborreSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Signal Processing (eess.SP); Neurons and Cognition (q-bio.NC)
- [214] arXiv:2201.03965 [pdf, other]
-
Title: On the Efficacy of Co-Attention Transformer Layers in Visual Question AnsweringSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [215] arXiv:2201.03993 [pdf, other]
-
Title: A Novel Home-Built Metrology to Analyze Oral Fluid Droplets and Quantify the Efficacy of MasksAuthors: Ava Tan BhowmikComments: 9 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [216] arXiv:2201.04011 [pdf, other]
-
Title: Similarity-based Gray-box Adversarial Attack Against Deep Face RecognitionComments: ACCEPTED in IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [217] arXiv:2201.04019 [pdf, other]
-
Title: Pyramid Fusion Transformer for Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [218] arXiv:2201.04021 [pdf, other]
-
Title: Optimization Planning for 3D ConvNetsComments: ICML 2021; Code is publicly available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [219] arXiv:2201.04022 [pdf, other]
-
Title: Condensing a Sequence to One Informative Frame for Video RecognitionComments: ICCV 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [220] arXiv:2201.04023 [pdf, other]
-
Title: Boosting Video Representation Learning with Multi-Faceted IntegrationComments: CVPR 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [221] arXiv:2201.04024 [pdf, other]
-
Title: Smart Director: An Event-Driven Directing System for Live BroadcastingComments: ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [222] arXiv:2201.04026 [pdf, other]
-
Title: Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-trainingComments: ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
- [223] arXiv:2201.04027 [pdf, other]
-
Title: Representing Videos as Discriminative Sub-graphs for Action RecognitionComments: CVPR 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [224] arXiv:2201.04029 [pdf, other]
-
Title: Motion-Focused Contrastive Learning of Video RepresentationsComments: ICCV 2021 (Oral); Code is publicly available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [225] arXiv:2201.04039 [pdf, other]
-
Title: MobilePhys: Personalized Mobile Camera-Based Contactless Physiological SensingComments: Published paper: this https URLJournal-ref: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, Volume Issue 1, March 2022, Article No.: 24Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [226] arXiv:2201.04042 [pdf, other]
-
Title: Towards Lightweight Neural Animation: Exploration of Neural Network Pruning in Mixture of Experts-based Animation ModelsComments: 8 pages, 4 figures, 2 tables, 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 1: GRAPP, ISBN 978-989-758-555-5, ISSN 2184-4321, pages 286-293Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [227] arXiv:2201.04063 [pdf, ps, other]
-
Title: Identification of chicken egg fertility using SVM classifier based on first-order statistical feature extractionComments: 9 Pages, 5 Figures, 2 TablesJournal-ref: ILKOM Jurnal Ilmiah, 13(3), (2021), 285-293Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [228] arXiv:2201.04114 [pdf, other]
-
Title: DM-VIO: Delayed Marginalization Visual-Inertial OdometrySubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [229] arXiv:2201.04123 [pdf, other]
-
Title: gDNA: Towards Generative Detailed Neural AvatarsAuthors: Xu Chen, Tianjian Jiang, Jie Song, Jinlong Yang, Michael J. Black, Andreas Geiger, Otmar HilligesComments: Camera-ready for CVPR 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [230] arXiv:2201.04127 [pdf, other]
-
Title: HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular VideoAuthors: Chung-Yi Weng, Brian Curless, Pratul P. Srinivasan, Jonathan T. Barron, Ira Kemelmacher-ShlizermanComments: CVPR 2022 (oral). Project page with videos: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [231] arXiv:2201.04212 [pdf, other]
-
Title: MDPose: Human Skeletal Motion Reconstruction Using WiFi Micro-Doppler SignaturesSubjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [232] arXiv:2201.04214 [pdf, other]
-
Title: Region-based Layout Analysis of Music Score ImagesComments: Submitted to Expert Systems with ApplicationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [233] arXiv:2201.04236 [pdf, other]
-
Title: Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidentsAuthors: Ethan Weber, Dim P. Papadopoulos, Agata Lapedriza, Ferda Ofli, Muhammad Imran, Antonio TorralbaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [234] arXiv:2201.04279 [pdf, other]
-
Title: Dynamical Audio-Visual Navigation: Catching Unheard Moving Sound Sources in Unmapped 3D EnvironmentsAuthors: Abdelrahman YounesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [235] arXiv:2201.04288 [pdf, other]
-
Title: Multiview Transformers for Video RecognitionComments: CVPR 2022; arXiv v4: update results on Epic-Kitchens-100Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [236] arXiv:2201.04309 [pdf, other]
-
Title: Robust Contrastive Learning against Noisy ViewsAuthors: Ching-Yao Chuang, R Devon Hjelm, Xin Wang, Vibhav Vineet, Neel Joshi, Antonio Torralba, Stefanie Jegelka, Yale SongSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [237] arXiv:2201.04329 [pdf, other]
-
Title: Neural Residual Flow Fields for Efficient Video RepresentationsComments: Accepted for ACCV 2022, codes are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [238] arXiv:2201.04341 [pdf, other]
-
Title: MDS-Net: A Multi-scale Depth Stratification Based Monocular 3D Object Detection AlgorithmComments: 9 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [239] arXiv:2201.04358 [pdf, ps, other]
-
Title: Coarse-to-Fine Embedded PatchMatch and Multi-Scale Dynamic Aggregation for Reference-based Super-ResolutionComments: code is available at this https URLJournal-ref: AAAI2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [240] arXiv:2201.04364 [pdf, other]
-
Title: SCSNet: An Efficient Paradigm for Learning Simultaneously Image Colorization and Super-ResolutionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [241] arXiv:2201.04388 [pdf, other]
-
Title: OCSampler: Compressing Videos to One Clip with Single-step SamplingComments: Video Understanding, Efficient Action RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [242] arXiv:2201.04402 [pdf, other]
-
Title: MoViDNN: A Mobile Platform for Evaluating Video Quality Enhancement with Deep Neural NetworksComments: 8 pages, 3 figuresJournal-ref: MMM 2022: MultiMedia Modeling pp 465-472Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [243] arXiv:2201.04435 [pdf, other]
-
Title: Beyond the Visible: A Survey on Cross-spectral Face RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [244] arXiv:2201.04494 [pdf, other]
-
Title: SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point CloudsComments: Accepted by IJCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [245] arXiv:2201.04532 [pdf, other]
-
Title: Structure and position-aware graph neural network for airway labelingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [246] arXiv:2201.04620 [pdf, other]
-
Title: SparseDet: Improving Sparsely Annotated Object Detection with Pseudo-positive MiningComments: Accepted at ICCV2023. Project webpage: this https URL. The first two authors contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [247] arXiv:2201.04623 [pdf, other]
-
Title: Virtual Elastic ObjectsAuthors: Hsiao-yu Chen, Edgar Tretschk, Tuur Stuyck, Petr Kadlecek, Ladislav Kavan, Etienne Vouga, Christoph LassnerSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [248] arXiv:2201.04676 [pdf, other]
-
Title: UniFormer: Unified Transformer for Efficient Spatiotemporal Representation LearningComments: Published as a conference paper at ICLR 2022; 19 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [249] arXiv:2201.04684 [pdf, other]
-
Title: BigDatasetGAN: Synthesizing ImageNet with Pixel-wise AnnotationsAuthors: Daiqing Li, Huan Ling, Seung Wook Kim, Karsten Kreis, Adela Barriuso, Sanja Fidler, Antonio TorralbaComments: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [250] arXiv:2201.04706 [pdf, other]
-
Title: Semantic Labeling of Human Action For Visually Impaired And Blind People Scene InteractionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [251] arXiv:2201.04755 [pdf, ps, other]
-
Title: Spatial-Temporal Map Vehicle Trajectory Detection Using Dynamic Mode Decomposition and Res-UNet+ Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [252] arXiv:2201.04756 [pdf, ps, other]
-
Title: Roadside Lidar Vehicle Detection and Tracking Using Range And Intensity Background SubtractionJournal-ref: Journal of Advanced Transportation, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [253] arXiv:2201.04766 [pdf, other]
-
Title: Collision Detection: An Improved Deep Learning Approach Using SENet and ResNextComments: 8 pages, 5 figures, submitted to IEEE-SMC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [254] arXiv:2201.04771 [pdf, other]
-
Title: Unlocking large-scale crop field delineation in smallholder farming systems with transfer learning and weak supervisionComments: Under submissionSubjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
- [255] arXiv:2201.04777 [pdf, other]
-
Title: A Survey on Masked Facial Detection Methods and Datasets for Fighting Against COVID-19Comments: 21 pages, 9 figures, 5 tables. IEEE Transactions on Artificial Intelligence, 2021, early accessSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [256] arXiv:2201.04788 [pdf, other]
-
Title: Trusted Media Challenge Dataset and User StudySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [257] arXiv:2201.04796 [pdf, other]
-
Title: CFNet: Learning Correlation Functions for One-Stage Panoptic SegmentationAuthors: Yifeng Chen, Wenqing Chu, Fangfang Wang, Ying Tai, Ran Yi, Zhenye Gan, Liang Yao, Chengjie Wang, Xi LiComments: Tech reportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [258] arXiv:2201.04797 [pdf, other]
-
Title: Scalable Cluster-Consistency Statistics for Robust Multi-Object MatchingComments: accepted to International Conference on 3D Vision (3DV) 2021, Oral PresentationJournal-ref: Proceedings of the 2021 International Conference on 3D Vision (3DV), 2021, pp. 352-360Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [259] arXiv:2201.04806 [pdf, other]
-
Title: RealGait: Gait Recognition for Person Re-IdentificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [260] arXiv:2201.04809 [pdf, other]
-
Title: Conditional Variational Autoencoder with Balanced Pre-training for Generative Adversarial NetworksAuthors: Yuchong Yao, Xiaohui Wangr, Yuanbang Ma, Han Fang, Jiaying Wei, Liyuan Chen, Ali Anaissi, Ali BrayteeSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [261] arXiv:2201.04819 [pdf, other]
-
Title: Deep Rank-Consistent Pyramid Model for Enhanced Crowd CountingAuthors: Jiaqi Gao, Zhizhong Huang, Yiming Lei, Hongming Shan, James Z. Wang, Fei-Yue Wang, Junping ZhangComments: Accepted by IEEE Transactions on Neural Networks and Learning SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [262] arXiv:2201.04833 [pdf, other]
-
Title: SnapshotNet: Self-supervised Feature Learning for Point Cloud Data Segmentation Using Minimal Labeled DataJournal-ref: Computer Vision and Image Understanding, Volume 216, 2022, 103339, ISSN 1077-3142Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [263] arXiv:2201.04850 [pdf, other]
- [264] arXiv:2201.04851 [pdf, other]
-
Title: MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [265] arXiv:2201.04866 [pdf, other]
-
Title: Weakly Supervised Scene Text Detection using Deep Reinforcement LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [266] arXiv:2201.04873 [pdf, other]
-
Title: VoLux-GAN: A Generative Model for 3D Face Synthesis with HDRI RelightingAuthors: Feitong Tan, Sean Fanello, Abhimitra Meka, Sergio Orts-Escolano, Danhang Tang, Rohit Pandey, Jonathan Taylor, Ping Tan, Yinda ZhangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [267] arXiv:2201.04898 [pdf, other]
-
Title: Flexible Style Image Super-Resolution using Conditional ObjectiveComments: Will be presented in IEEE ACCESS. Code and trained models will be available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [268] arXiv:2201.04906 [pdf, other]
-
Title: Hand-Object Interaction ReasoningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [269] arXiv:2201.04924 [pdf, other]
-
Title: Technical Report for ICCV 2021 Challenge SSLAD-Track3B: Transformers Are Better Continual LearnersComments: Rank 1st on ICCV2021 SSLAD-Track 3BSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [270] arXiv:2201.04945 [pdf, other]
-
Title: Learning Semantic Abstraction of Shape via 3D Region of InterestSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [271] arXiv:2201.05007 [pdf, ps, other]
-
Title: Multi-granularity Association Learning Framework for on-the-fly Fine-Grained Sketch-based Image RetrievalComments: 17 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [272] arXiv:2201.05020 [pdf, other]
-
Title: Automatic Sparse Connectivity Learning for Neural NetworksComments: Accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [273] arXiv:2201.05022 [pdf, other]
-
Title: Self-semantic contour adaptation for cross modality brain tumor segmentationComments: Accepted to ISBI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [274] arXiv:2201.05023 [pdf, other]
-
Title: Stereo Magnification with Multi-Layer ImagesAuthors: Taras Khakhulin, Denis Korzhenkov, Pavel Solovev, Gleb Sterkin, Timotei Ardelean, Victor LempitskyComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [275] arXiv:2201.05047 [pdf, other]
-
Title: TransVOD: End-to-End Video Object Detection with Spatial-Temporal TransformersAuthors: Qianyu Zhou, Xiangtai Li, Lu He, Yibo Yang, Guangliang Cheng, Yunhai Tong, Lizhuang Ma, Dacheng TaoComments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), extended version of arXiv:2105.10920Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [276] arXiv:2201.05057 [pdf, other]
-
Title: On Adversarial Robustness of Trajectory Prediction for Autonomous VehiclesComments: 13 pages, 13 figures, accepted by CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [277] arXiv:2201.05078 [pdf, other]
-
Title: CLIP-Event: Connecting Text and Images with Event StructuresAuthors: Manling Li, Ruochen Xu, Shuohang Wang, Luowei Zhou, Xudong Lin, Chenguang Zhu, Michael Zeng, Heng Ji, Shih-Fu ChangJournal-ref: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [278] arXiv:2201.05119 [pdf, other]
-
Title: Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet?Authors: Nenad Tomasev, Ioana Bica, Brian McWilliams, Lars Buesing, Razvan Pascanu, Charles Blundell, Jovana MitrovicSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [279] arXiv:2201.05120 [pdf, other]
-
Title: SeamlessGAN: Self-Supervised Synthesis of Tileable Texture MapsComments: 12 pages. To be published in Transactions on Visualizations and Computer Graphics. Project website: this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
- [280] arXiv:2201.05121 [pdf, other]
-
Title: STEdge: Self-training Edge Detection with Multi-layer Teaching and RegularizationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [281] arXiv:2201.05131 [pdf, other]
-
Title: SimReg: Regression as a Simple Yet Effective Tool for Self-supervised Knowledge DistillationComments: In BMVC 2021. Code available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [282] arXiv:2201.05151 [pdf, other]
-
Title: Beyond Simple Meta-Learning: Multi-Purpose Models for Multi-Domain, Active and Continual Few-Shot LearningAuthors: Peyman Bateni, Jarred Barber, Raghav Goyal, Vaden Masrani, Jan-Willem van de Meent, Leonid Sigal, Frank WoodSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [283] arXiv:2201.05275 [pdf, ps, other]
-
Title: Deep Leaning-Based Ultra-Fast Stair DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [284] arXiv:2201.05277 [pdf, other]
-
Title: Boundary-aware Self-supervised Learning for Video Scene SegmentationComments: The code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [285] arXiv:2201.05290 [pdf, other]
-
Title: Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube ProposalsSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [286] arXiv:2201.05297 [pdf, other]
-
Title: MMNet: Muscle motion-guided network for micro-expression recognitionComments: 8 pages, 4 figuresJournal-ref: Proc. 31st Int'l Joint Conf. Artificial Intelligence (IJCAI), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [287] arXiv:2201.05299 [pdf, other]
-
Title: A Thousand Words Are Worth More Than a Picture: Natural Language-Centric Outside-Knowledge Visual Question AnsweringSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR)
- [288] arXiv:2201.05307 [pdf, other]
-
Title: Unsupervised Temporal Video Grounding with Deep Semantic ClusteringComments: Accepted by AAAI2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [289] arXiv:2201.05314 [pdf, other]
-
Title: A Novel Skeleton-Based Human Activity Discovery Using Particle Swarm Optimization with Gaussian MutationSubjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO)
- [290] arXiv:2201.05346 [pdf, ps, other]
-
Title: Arbitrary Handwriting Image Style TransferSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [291] arXiv:2201.05386 [pdf, other]
-
Title: SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditionsComments: 11 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [292] arXiv:2201.05479 [pdf, other]
-
Title: HardBoost: Boosting Zero-Shot Learning with Hard ClassesComments: 15 pages, 8 figures, submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence on Sep. 16, 2021. This work is an extended version of our CVPR2021 work, "Hardness sampling for self-training based transductive zero-shot learning" (arXiv:2106.00264)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [293] arXiv:2201.05489 [pdf, other]
-
Title: Emergence of Machine Language: Towards Symbolic Intelligence with Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [294] arXiv:2201.05514 [pdf, other]
-
Title: Determination of building flood risk maps from LiDAR mobile mapping dataJournal-ref: Computers, Environment and Urban Systems, Vol. 93, April 2022, 101759Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [295] arXiv:2201.05541 [pdf, other]
-
Title: ViT2Hash: Unsupervised Information-Preserving HashingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [296] arXiv:2201.05545 [pdf, ps, other]
-
Title: Multimodal registration of FISH and nanoSIMS images using convolutional neural network modelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [297] arXiv:2201.05585 [pdf, other]
-
Title: Domain Adaptation in LiDAR Semantic Segmentation via Alternating Skip Connections and Hybrid LearningComments: 1) Introduced Fig 1, 2) Simplified Fig. 2 diagram, 3) Fixed typos in losses, 4) Introduced Fig. 3, 5) Updated evaluation results, included evaluation on SemanticPOSS, 6) Introduced Table 3 - effects on covariance matrix and mean, 7) Updated Fig. 5, 8) Added more references. Improved writing in general, especially the motivation and description of each element and contribution from the methodSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [298] arXiv:2201.05675 [pdf, other]
-
Title: Transformers in Action: Weakly Supervised Action SegmentationComments: Under ReviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [299] arXiv:2201.05706 [pdf, other]
-
Title: Perspective Transformation LayerComments: This paper has been accepted for publication by the 2022 International Conference on Computational Science & Computational Intelligence (CSCI'22), Research Track on Signal & Image Processing, Computer Vision & Pattern RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [300] arXiv:2201.05718 [pdf, other]
-
Title: Parameter-free Online Test-time AdaptationComments: CVPR 2022 (oral). Code available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [301] arXiv:2201.05723 [pdf, other]
-
Title: Learning Temporally and Semantically Consistent Unpaired Video-to-video Translation Through Pseudo-Supervision From Synthetic Optical FlowSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [302] arXiv:2201.05729 [pdf, other]
-
Title: CLIP-TD: CLIP Targeted Distillation for Vision-Language TasksAuthors: Zhecan Wang, Noel Codella, Yen-Chun Chen, Luowei Zhou, Jianwei Yang, Xiyang Dai, Bin Xiao, Haoxuan You, Shih-Fu Chang, Lu YuanComments: This paper is greatly modified and updated to be re-submitted to another conference. The new paper is under the name "Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks", this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
- [303] arXiv:2201.05730 [pdf, other]
-
Title: Learning Hierarchical Graph Representation for Image Manipulation DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [304] arXiv:2201.05739 [pdf, other]
-
Title: Real-World Graph Convolution Networks (RW-GCNs) for Action Recognition in Smart Video SurveillanceSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [305] arXiv:2201.05761 [pdf, other]
-
Title: A Survey on RGB-D DatasetsComments: This paper was published at Computer Vision and Image Understanding. Access the final paper using the DOI: this https URLJournal-ref: Computer Vision and Image Understanding 222 (2022) 103489Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [306] arXiv:2201.05772 [pdf, other]
-
Title: Asymmetric Hash Code Learning for Remote Sensing Image RetrievalComments: 14 pages, 12 figures, and 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
- [307] arXiv:2201.05775 [pdf, other]
-
Title: Explainability Tools Enabling Deep Learning in Future In-Situ Real-Time Planetary ExplorationsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [308] arXiv:2201.05776 [pdf, other]
-
Title: Uncertainty-Aware Multi-View Representation LearningComments: AAAI 2021 published paperSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [309] arXiv:2201.05778 [pdf, other]
-
Title: Semantic decoupled representation learning for remote sensing image change detectionComments: Submitted to IEEE for possible publication. 4 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [310] arXiv:2201.05781 [pdf, other]
-
Title: OneDConv: Generalized Convolution For Transform-Invariant RepresentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [311] arXiv:2201.05816 [pdf, other]
-
Title: A Critical Analysis of Image-based Camera Pose Estimation TechniquesSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [312] arXiv:2201.05820 [pdf, other]
-
Title: Offline-Online Associated Camera-Aware Proxies for Unsupervised Person Re-identificationComments: Accepted to TIPSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [313] arXiv:2201.05829 [pdf, other]
-
Title: Multi-View representation learning in Multi-Task SceneComments: 32 pagesJournal-ref: Neural Computing and Applications (2020)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [314] arXiv:2201.05834 [pdf, other]
-
Title: Tailor Versatile Multi-modal Learning for Multi-label Emotion RecognitionComments: To be published in AAAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [315] arXiv:2201.05858 [pdf, other]
-
Title: Smart Parking Space Detection under Hazy conditions using Convolutional Neural Networks: A Novel ApproachComments: 20 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [316] arXiv:2201.05869 [src]
-
Title: Prototype Guided Network for Anomaly SegmentationComments: Need for edit, and improve the method for better performanceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [317] arXiv:2201.05887 [pdf, other]
-
Title: Domain Adaptation via Bidirectional Cross-Attention TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [318] arXiv:2201.05914 [pdf, other]
-
Title: Towards Zero-shot Sign Language RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [319] arXiv:2201.05916 [pdf, other]
-
Title: Multi-level Second-order Few-shot LearningComments: IEEE Transactions on MultimediaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [320] arXiv:2201.05951 [pdf, ps, other]
-
Title: Global Regular Network for Writer IdentificationAuthors: Shiyu WangSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [321] arXiv:2201.05958 [pdf, ps, other]
-
Title: Cross-Centroid Ripple Pattern for Facial Expression RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [322] arXiv:2201.05972 [pdf, other]
-
Title: Sparse Cross-scale Attention Network for Efficient LiDAR Panoptic SegmentationComments: Accepted by the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [323] arXiv:2201.05986 [pdf, other]
-
Title: Audio-Driven Talking Face Video Generation with Dynamic Convolution KernelsAuthors: Zipeng Ye, Mengfei Xia, Ran Yi, Juyong Zhang, Yu-Kun Lai, Xuwei Huang, Guoxin Zhang, Yong-jin LiuComments: in IEEE Transactions on MultimediaSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [324] arXiv:2201.05989 [pdf, other]
-
Title: Instant Neural Graphics Primitives with a Multiresolution Hash EncodingComments: To appear in ACM Transactions on Graphics (SIGGRAPH 2022). 15 pages, 13 figures, 3 tablesJournal-ref: ACM Trans. Graph. 41, 4, Article 102 (July 2022), 15 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [325] arXiv:2201.05991 [pdf, other]
-
Title: Video Transformers: A SurveyAuthors: Javier Selva, Anders S. Johansen, Sergio Escalera, Kamal Nasrollahi, Thomas B. Moeslund, Albert ClapésSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [326] arXiv:2201.06030 [pdf, ps, other]
-
Title: Fully Convolutional Change Detection Framework with Generative Adversarial Network for Unsupervised, Weakly Supervised and Regional Supervised Change DetectionComments: 13 pages, 19 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
- [327] arXiv:2201.06037 [pdf, other]
-
Title: Pursuing 3D Scene Structures with Optical Satellite Images from Affine Reconstruction to Euclidean ReconstructionComments: 11 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [328] arXiv:2201.06061 [pdf, other]
-
Title: PETS-SWINF: A regression method that considers images with metadata based Neural Network for pawpularity prediction on 2021 Kaggle Competition "PetFinder.my"Comments: 8 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [329] arXiv:2201.06070 [pdf, other]
-
Title: ALA: Naturalness-aware Adversarial Lightness AttackAuthors: Yihao Huang, Liangru Sun, Qing Guo, Felix Juefei-Xu, Jiayi Zhu, Jincao Feng, Yang Liu, Geguang PuComments: 9 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [330] arXiv:2201.06098 [pdf, other]
-
Title: An Edge Map based Ensemble Solution to Detect Water Level in StreamSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [331] arXiv:2201.06159 [pdf, other]
-
Title: YOLO -- You only look 10647 timesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [332] arXiv:2201.06164 [pdf, other]
-
Title: Synthesis and Reconstruction of Fingerprints using Generative Adversarial NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [333] arXiv:2201.06174 [pdf, other]
-
Title: A novel attention model for salient structure detection in seismic volumesComments: Published in Applied Computing and Intelligence, Nov. 2021Journal-ref: Applied Computing and Intelligence, vol. 1, no. 1, pp. 31-45, Nov. 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [334] arXiv:2201.06176 [pdf, ps, other]
-
Title: A fast and accurate iris segmentation method using an LoG filter and its zero-crossingsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [335] arXiv:2201.06192 [pdf, other]
-
Title: Fooling the Eyes of Autonomous Vehicles: Robust Physical Adversarial Examples Against Traffic Sign Recognition SystemsComments: 17 pages, 15 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [336] arXiv:2201.06207 [pdf, other]
-
Title: Discourse Analysis for Evaluating Coherence in Video Paragraph CaptionsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [337] arXiv:2201.06220 [pdf, ps, other]
-
Title: Face Detection in Extreme Conditions: A Machine-learning ApproachAuthors: Sameer Aqib HashmiComments: 6 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [338] arXiv:2201.06260 [pdf, other]
-
Title: Towards Realistic Visual Dubbing with Heterogeneous SourcesAuthors: Tianyi Xie, Liucheng Liao, Cheng Bi, Benlai Tang, Xiang Yin, Jianfei Yang, Mingjie Wang, Jiali Yao, Yang Zhang, Zejun MaComments: 9 pages (including references), 7 figures, Accepted in ACM Multimedia, 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [339] arXiv:2201.06289 [pdf, other]
-
Title: The CLEAR Benchmark: Continual LEArning on Real-World ImageryComments: Project site: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [340] arXiv:2201.06304 [pdf, other]
-
Title: Action Keypoint Network for Efficient Video RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [341] arXiv:2201.06311 [pdf, other]
-
Title: Graph Neural Networks for Cross-Camera Data AssociationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [342] arXiv:2201.06346 [pdf, other]
-
Title: Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?Comments: Accepted at IJCAI-2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [343] arXiv:2201.06357 [pdf, other]
-
Title: Disentangled Latent Transformer for Interpretable Monocular Height EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [344] arXiv:2201.06374 [pdf, other]
-
Title: RestoreFormer: High-Quality Blind Face Restoration from Undegraded Key-Value PairsComments: Accepted by CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [345] arXiv:2201.06376 [pdf, other]
-
Title: UWC: Unit-wise Calibration Towards Rapid Network CompressionComments: Accepted by BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [346] arXiv:2201.06390 [pdf, other]
-
Title: SwinUNet3D -- A Hierarchical Architecture for Deep Traffic Prediction using Shifted Window TransformersComments: 7 pages, 1 figureSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [347] arXiv:2201.06415 [pdf, other]
-
Title: Improving Performance of Semantic Segmentation CycleGANs by Noise Injection into the Latent Segmentation SpaceSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [348] arXiv:2201.06427 [pdf, other]
-
Title: Masked Faces with Faced MasksComments: 8 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [349] arXiv:2201.06435 [pdf, other]
-
Title: FourierNet: Shape-Preserving Network for Henle's Fiber Layer Segmentation in Optical Coherence Tomography ImagesAuthors: Selahattin Cansiz, Cem Kesim, Sevval Nur Bektas, Zeynep Kulali, Murat Hasanreisoglu, Cigdem Gunduz-DemirJournal-ref: IEEE Journal of Biomedical and Health Informatics, vol. 27, no. 2, pp. 1036-1047, Feb. 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [350] arXiv:2201.06459 [pdf, other]
-
Title: A Novel Framework to Jointly Compress and Index Remote Sensing Images for Efficient Content-Based RetrievalComments: Accepted at IEEE International Geoscience and Remote Sensing Symposium (IGARSS) 2022. Our code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [351] arXiv:2201.06493 [pdf, other]
-
Title: AutoAlign: Pixel-Instance Feature Aggregation for Multi-Modal 3D Object DetectionAuthors: Zehui Chen, Zhenyu Li, Shiquan Zhang, Liangji Fang, Qinghong Jiang, Feng Zhao, Bolei Zhou, Hang ZhaoComments: Accepted to IJCAI2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [352] arXiv:2201.06569 [pdf, other]
-
Title: Automatic Quantification and Visualization of Street TreesComments: Accepted at ICVGIP 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [353] arXiv:2201.06570 [pdf, other]
-
Title: BDA-SketRet: Bi-Level Domain Adaptation for Zero-Shot SBIRSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [354] arXiv:2201.06578 [pdf, other]
-
Title: Collapse by Conditioning: Training Class-conditional GANs with Limited DataSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [355] arXiv:2201.06594 [pdf, other]
-
Title: Using Machine Learning to Detect Rotational Symmetries from Reflectional Symmetries in 2D ImagesComments: 8 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [356] arXiv:2201.06629 [pdf, other]
-
Title: Validation of object detection in UAV-based images using synthetic dataAuthors: Eung-Joo Lee, Damon M. Conover, Shuvra S. Bhattacharyyaa, Heesung Kwon, Jason Hill, Kenneth EvensenSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [357] arXiv:2201.06644 [pdf, other]
-
Title: HydraFusion: Context-Aware Selective Sensor Fusion for Robust and Efficient Autonomous Vehicle PerceptionComments: Accepted to be published in the 13th ACM/IEEE International Conference on Cyber-Physical Systems (ICCPS 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [358] arXiv:2201.06648 [pdf, other]
-
Title: OmniPrint: A Configurable Printed Character SynthesizerComments: Accepted at 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks. this https URLJournal-ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and BenchmarksSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [359] arXiv:2201.06686 [pdf, ps, other]
-
Title: Unpaired Referring Expression Grounding via Bidirectional Cross-Modal MatchingComments: 9 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [360] arXiv:2201.06696 [pdf, other]
-
Title: ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP CuesComments: 10 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [361] arXiv:2201.06734 [pdf, other]
-
Title: Cross-modal Contrastive Distillation for Instructional Activity AnticipationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [362] arXiv:2201.06740 [pdf, other]
-
Title: Convolutional Cobweb: A Model of Incremental Learning from 2D ImagesComments: 14 pages, 6 figures, Presented at Advances in Cognitive Systems 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [363] arXiv:2201.06750 [pdf, other]
-
Title: DDU-Net: Dual-Decoder-U-Net for Road Extraction Using High-Resolution Remote Sensing ImagesAuthors: Ying Wang, Yuexing Peng, Xinran Liu, Wei Li, George C. Alexandropoulos, Junchuan Yu, Daqing Ge, Wei XiangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [364] arXiv:2201.06775 [pdf, other]
-
Title: Deformable One-Dimensional Object Detection for Routing and ManipulationComments: Accepted to IEEE Robotics and Automation Letters, January 2022. 8 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [365] arXiv:2201.06776 [pdf, other]
-
Title: Pruning-aware Sparse Regularization for Network PruningComments: MIR 2023Journal-ref: Machine Intelligence Research, 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [366] arXiv:2201.06781 [pdf, other]
-
Title: When Facial Expression Recognition Meets Few-Shot Learning: A Joint and Alternate Learning FrameworkComments: 9 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [367] arXiv:2201.06794 [pdf, other]
-
Title: Resistance Training using Prior Bias: toward Unbiased Scene Graph GenerationComments: Accepted by AAAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [368] arXiv:2201.06799 [pdf, other]
-
Title: Pistol: Pupil Invisible Supportive Tool to extract Pupil, Iris, Eye Opening, Eye Movements, Pupil and Iris Gaze Vector, and 2D as well as 3D GazeSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [369] arXiv:2201.06823 [pdf, other]
-
Title: Adaptive Weighted Guided Image Filtering for Depth Enhancement in Shape-From-FocusSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [370] arXiv:2201.06824 [pdf, ps, other]
-
Title: STURE: Spatial-Temporal Mutual Representation Learning for Robust Data Association in Online Multi-Object TrackingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [371] arXiv:2201.06825 [pdf, other]
-
Title: Deep Learning Based Framework for Iranian License Plate Detection and RecognitionComments: 20 pages, journalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [372] arXiv:2201.06845 [pdf, other]
-
Title: Taylor3DNet: Fast 3D Shape Inference With Landmark Points Based Taylor SeriesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [373] arXiv:2201.06857 [pdf, other]
-
Title: RePre: Improving Self-Supervised Vision Transformer with Reconstructive Pre-trainingSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [374] arXiv:2201.06888 [pdf, other]
-
Title: Autoencoding Video Latents for Adversarial Video GenerationAuthors: Sai Hemanth KasaraneniComments: preprintSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [375] arXiv:2201.06889 [pdf, other]
-
Title: Boosting Robustness of Image Matting with Context Assembling and Strong Data AugmentationComments: 19 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [376] arXiv:2201.06933 [pdf, other]
-
Title: Context-Aware Scene Prediction Network (CASPNet)Comments: 9 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [377] arXiv:2201.06945 [pdf, ps, other]
-
Title: It's All in the Head: Representation Knowledge Distillation through Classifier SharingAuthors: Emanuel Ben-Baruch, Matan Karklinsky, Yossi Biton, Avi Ben-Cohen, Hussam Lawen, Nadav ZamirSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [378] arXiv:2201.06974 [pdf, other]
-
Title: Continual Coarse-to-Fine Domain Adaptation in Semantic SegmentationComments: 24 pages, 9 figures, 6 tables, under submissionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [379] arXiv:2201.06978 [pdf, other]
-
Title: ASOCEM: Automatic Segmentation Of Contaminations in cryo-EMSubjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
- [380] arXiv:2201.07021 [pdf, other]
-
Title: MuSCLe: A Multi-Strategy Contrastive Learning Framework for Weakly Supervised Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [381] arXiv:2201.07070 [pdf, other]
-
Title: Attention-based Proposals Refinement for 3D Object DetectionComments: Accepted for IV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [382] arXiv:2201.07106 [pdf, other]
-
Title: Variational Inference for Quantifying Inter-observer Variability in Segmentation of Anatomical StructuresComments: SPIE Medical Imaging 2022 (Oral)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [383] arXiv:2201.07120 [pdf, other]
-
Title: Contextual road lane and symbol generation for autonomous drivingSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [384] arXiv:2201.07124 [pdf, other]
-
Title: Attentional Feature Refinement and Alignment Network for Aircraft Detection in SAR ImageryComments: A raw version as the same as the early access published in TGRS. Personal use of this material is permitted. Permission from IEEE must be obtained for all other usesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [385] arXiv:2201.07131 [pdf, other]
-
Title: Leveraging Real Talking Faces via Self-Supervision for Robust Forgery DetectionComments: CVPR 2022. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [386] arXiv:2201.07189 [pdf, other]
-
Title: MUSE-VAE: Multi-Scale VAE for Environment-Aware Long Term Trajectory PredictionAuthors: Mihee Lee, Samuel S. Sohn, Seonghyeon Moon, Sejong Yoon, Mubbasir Kapadia, Vladimir PavlovicSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [387] arXiv:2201.07200 [pdf, other]
-
Title: Optimizing Active Learning for Low Annotation BudgetsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [388] arXiv:2201.07202 [pdf, other]
-
Title: GANmouflage: 3D Object Nondetection with Texture FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [389] arXiv:2201.07264 [pdf, other]
-
Title: Exploring Kervolutional Neural NetworksAuthors: Nicolas PerezComments: 5 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [390] arXiv:2201.07309 [pdf, other]
-
Title: OSSID: Online Self-Supervised Instance Detection by (and for) Pose EstimationComments: 10 pages, 6 figures. RA-L and ICRA 2022Journal-ref: IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 3022-3029, April 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [391] arXiv:2201.07366 [pdf, other]
-
Title: TriCoLo: Trimodal Contrastive Loss for Text to Shape RetrievalComments: Accepted by WACV 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [392] arXiv:2201.07384 [pdf, other]
-
Title: Swin-Pose: Swin Transformer Based Human Pose EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [393] arXiv:2201.07394 [pdf, other]
-
Title: KappaFace: Adaptive Additive Angular Margin Loss for Deep Face RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [394] arXiv:2201.07412 [pdf, other]
-
Title: Poseur: Direct Human Pose Regression with TransformersAuthors: Weian Mao, Yongtao Ge, Chunhua Shen, Zhi Tian, Xinlong Wang, Zhibin Wang, Anton van den HengelComments: Accepted to Proc. Eur. Conf. Comp. Vision (ECCV) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [395] arXiv:2201.07422 [pdf, other]
-
Title: Self-Supervised Deep Blind Video Super-ResolutionComments: Project website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [396] arXiv:2201.07425 [pdf, other]
-
Title: WebUAV-3M: A Benchmark for Unveiling the Power of Million-Scale Deep UAV TrackingAuthors: Chunhui Zhang, Guanjie Huang, Li Liu, Shan Huang, Yinan Yang, Xiang Wan, Shiming Ge, Dacheng TaoComments: 25 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [397] arXiv:2201.07428 [pdf, ps, other]
-
Title: Variable Augmented Network for Invertible MR Coil CompressionSubjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [398] arXiv:2201.07436 [pdf, other]
-
Title: Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepthComments: 11 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [399] arXiv:2201.07451 [pdf, other]
-
Title: TransFuse: A Unified Transformer-based Image Fusion Framework using Self-supervised LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [400] arXiv:2201.07459 [pdf, other]
-
Title: PT4AL: Using Self-Supervised Pretext Tasks for Active LearningComments: Code is available at this https URL. Updated for ECCV 2022 submissionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [401] arXiv:2201.07486 [pdf, other]
-
Title: High-fidelity 3D Model Compression based on Key SpheresComments: Accepted in Data Compression Conference (DCC) 2022 as a full paperSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [402] arXiv:2201.07495 [pdf, other]
-
Title: Weakly Supervised Semantic Segmentation of Remote Sensing Images for Tree Species Classification Based on Explanation MethodsComments: 4 pages, 1 figure, submitted to IEEE Geosciences and Remote Sensing Symposium (2022)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [403] arXiv:2201.07540 [pdf, ps, other]
-
Title: Virtual Coil Augmentation Technology for MR Coil Extrapolation via Deep LearningComments: arXiv admin note: text overlap with arXiv:2103.15061, arXiv:1907.03063, arXiv:1807.03039 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [404] arXiv:2201.07572 [pdf, other]
-
Title: Superpixel Pre-Segmentation of HER2 Slides for Efficient AnnotationAuthors: Mathias Öttl, Jana Mönius, Christian Marzahl, Matthias Rübner, Carol I. Geppert, Arndt Hartmann, Matthias W. Beckmann, Peter Fasching, Andreas Maier, Ramona Erber, Katharina BreiningerSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [405] arXiv:2201.07583 [pdf, ps, other]
-
Title: DMF-Net: Dual-Branch Multi-Scale Feature Fusion Network for copy forgery identification of anti-counterfeiting QR codeComments: 17 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [406] arXiv:2201.07594 [pdf, ps, other]
-
Title: Real-time Recognition of Yoga Poses using computer Vision for Smart Health CareSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [407] arXiv:2201.07609 [pdf, other]
-
Title: A Confidence-based Iterative Solver of Depths and Surface Normals for Deep Multi-view StereoComments: 17 pages, 13 figures, 7 tables. ICCV 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [408] arXiv:2201.07619 [pdf, other]
-
Title: CAST: Character labeling in Animation using Self-supervision by TrackingComments: Published as a conference paper at EuroGraphics 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [409] arXiv:2201.07661 [pdf, other]
-
Title: Open Source Handwritten Text Recognition on Medieval Manuscripts using Mixed Models and Document-Specific FinetuningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [410] arXiv:2201.07665 [pdf, other]
-
Title: Semi-automatic 3D Object Keypoint Annotation and Detection for the MassesComments: Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [411] arXiv:2201.07676 [pdf, other]
-
Title: Neighborhood Spatial Aggregation MC Dropout for Efficient Uncertainty-aware Semantic Segmentation in Point CloudsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [412] arXiv:2201.07692 [pdf, other]
-
Title: GroupGazer: A Tool to Compute the Gaze per Participant in Groups with integrated Calibration to Map the Gaze Online to a Screen or Beamer ProjectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [413] arXiv:2201.07703 [pdf, other]
-
Title: Q-ViT: Fully Differentiable Quantization for Vision TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [414] arXiv:2201.07706 [pdf, ps, other]
-
Title: Object Detection in Autonomous Vehicles: Status and Open ChallengesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [415] arXiv:2201.07734 [pdf, other]
-
Title: Towards holistic scene understanding: Semantic segmentation and beyondAuthors: Panagiotis MeletisComments: PhD Thesis, Eindhoven University of Technology, October 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [416] arXiv:2201.07756 [pdf, other]
-
Title: A pipeline for automated processing of Corona KH-4 (1962-1972) stereo imageryComments: 24 Pages, 16 FiguresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [417] arXiv:2201.07781 [pdf, other]
-
Title: Towards a General Deep Feature Extractor for Facial Expression RecognitionComments: Published in: 2021 IEEE International Conference on Image Processing (ICIP). arXiv admin note: text overlap with arXiv:2103.09154Journal-ref: IEEE International Conference on Image Processing (ICIP), 2021, pp. 2339-2342Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [418] arXiv:2201.07788 [pdf, other]
-
Title: ConDor: Self-Supervised Canonicalization of 3D Pose for Partial ShapesAuthors: Rahul Sajnani, Adrien Poulenard, Jivitesh Jain, Radhika Dua, Leonidas J. Guibas, Srinath SridharComments: Accepted to CVPR 2022, New Orleans, Louisiana. For project page and code, see this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [419] arXiv:2201.07894 [pdf, other]
-
Title: Enhanced Performance of Pre-Trained Networks by Matched Augmentation DistributionsAuthors: Touqeer Ahmad, Mohsen Jafarzadeh, Akshay Raj Dhamija, Ryan Rabinowitz, Steve Cruz, Chunchun Li, Terrance E. BoultSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [420] arXiv:2201.07906 [pdf, other]
- [421] arXiv:2201.07927 [pdf, other]
-
Title: Learning-by-Novel-View-Synthesis for Full-Face Appearance-Based 3D Gaze EstimationComments: Camera-ready version for CVPR 2022 Workshop (GAZE 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [422] arXiv:2201.07929 [pdf, other]
-
Title: Estimating Egocentric 3D Human Pose in the Wild with External Weak SupervisionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [423] arXiv:2201.07931 [pdf, other]
-
Title: Experimental Large-Scale Jet Flames' Geometrical Features Extraction for Risk Management Using Infrared Images and Deep Learning Segmentation MethodsAuthors: Carmina Pérez-Guerrero, Adriana Palacios, Gilberto Ochoa-Ruiz, Christian Mata, Joaquim Casal, Miguel Gonzalez-Mendoza, Luis Eduardo Falcón-MoralesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [424] arXiv:2201.07937 [pdf, other]
-
Title: GASCN: Graph Attention Shape Completion NetworkComments: International Conference on 3D Vision (3DV)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [425] arXiv:2201.07989 [pdf, other]
-
Title: Self-supervised Video Representation Learning with Cascade Positive RetrievalComments: To appear in CVPR 2022 L3D-IVU WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [426] arXiv:2201.08001 [pdf, other]
-
Title: CELESTIAL: Classification Enabled via Labelless Embeddings with Self-supervised Telescope Image Analysis LearningComments: COSPAR 2021 Cross-Disciplinary Workshop on Machine Learning for Space Sciences, Sydney, AustraliaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [427] arXiv:2201.08002 [pdf, other]
-
Title: PRMI: A Dataset of Minirhizotron Images for Diverse Plant Root StudyAuthors: Weihuang Xu, Guohao Yu, Yiming Cui, Romain Gloaguen, Alina Zare, Jason Bonnette, Joel Reyes-Cabrera, Ashish Rajurkar, Diane Rowland, Roser Matamala, Julie D. Jastrow, Thomas E. Juenger, Felix B. FritschiComments: The 36th AAAI Conference on the AI for Agriculture and Food Systems (AIAFS) WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [428] arXiv:2201.08027 [pdf, ps, other]
-
Title: A Joint Morphological Profiles and Patch Tensor Change Detection for Hyperspectral ImagerySubjects: Computer Vision and Pattern Recognition (cs.CV); Methodology (stat.ME)
- [429] arXiv:2201.08029 [pdf, other]
-
Title: Domain Generalization via Frequency-domain-based Feature Disentanglement and InteractionComments: The paper is accepted by ACM Multimedia 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [430] arXiv:2201.08049 [pdf, other]
-
Title: Lightweight Salient Object Detection in Optical Remote Sensing Images via Feature CorrelationComments: 11 pages, 6 figures, Accepted by IEEE Transactions on Geoscience and Remote Sensing 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [431] arXiv:2201.08050 [pdf, other]
-
Title: TerViT: An Efficient Ternary Vision TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [432] arXiv:2201.08051 [pdf, other]
-
Title: Predicting Vegetation Stratum Occupancy from Airborne LiDAR Data with Deep LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [433] arXiv:2201.08071 [pdf, other]
-
Title: Temporal Sentence Grounding in Videos: A Survey and Future DirectionsComments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
- [434] arXiv:2201.08093 [pdf, other]
-
Title: AirPose: Multi-View Fusion Network for Aerial 3D Human Pose and Shape EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [435] arXiv:2201.08098 [pdf, other]
-
Title: What can we learn from misclassified ImageNet images?Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [436] arXiv:2201.08122 [pdf, other]
-
Title: A Computational Model for Machine ThinkingAuthors: Slimane LarabiComments: Internal report, RIIMA LaboratorySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [437] arXiv:2201.08125 [pdf, other]
-
Title: Deep Unsupervised Contrastive Hashing for Large-Scale Cross-Modal Text-Image Retrieval in Remote SensingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [438] arXiv:2201.08131 [pdf, other]
-
Title: GeoFill: Reference-Based Image Inpainting with Better Geometric UnderstandingAuthors: Yunhan Zhao, Connelly Barnes, Yuqian Zhou, Eli Shechtman, Sohrab Amirghodsi, Charless FowlkesComments: Accepted to WACV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [439] arXiv:2201.08141 [pdf, other]
-
Title: SPAMs: Structured Implicit Parametric ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [440] arXiv:2201.08157 [pdf, other]
-
Title: WPPNets and WPPFlows: The Power of Wasserstein Patch Priors for SuperresolutionJournal-ref: SIAM Journal on Imaging Sciences, vol. 16(3), pp. 1033-1067, 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [441] arXiv:2201.08158 [pdf, other]
-
Title: HDhuman: High-quality Human Novel-view Rendering from Sparse ViewsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [442] arXiv:2201.08215 [pdf, other]
-
Title: CP-Net: Contour-Perturbed Reconstruction Network for Self-Supervised Point Cloud LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [443] arXiv:2201.08217 [pdf, other]
-
Title: Watermarking Pre-trained Encoders in Contrastive LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [444] arXiv:2201.08264 [pdf, other]
-
Title: End-to-end Generative Pretraining for Multimodal Video CaptioningJournal-ref: Proceedings of Conference on Computer Vision and Pattern Recognition (CVPR) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
- [445] arXiv:2201.08295 [pdf, other]
-
Title: DIVA-DAF: A Deep Learning Framework for Historical Document Image AnalysisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [446] arXiv:2201.08361 [pdf, other]
-
Title: Stitch it in Time: GAN-Based Facial Editing of Real VideosComments: Project website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [447] arXiv:2201.08371 [pdf, other]
-
Title: Revisiting Weakly Supervised Pre-Training of Visual Perception ModelsAuthors: Mannat Singh, Laura Gustafson, Aaron Adcock, Vinicius de Freitas Reis, Bugra Gedik, Raj Prateek Kosaraju, Dhruv Mahajan, Ross Girshick, Piotr Dollár, Laurens van der MaatenComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [448] arXiv:2201.08377 [pdf, other]
-
Title: Omnivore: A Single Model for Many Visual ModalitiesAuthors: Rohit Girdhar, Mannat Singh, Nikhila Ravi, Laurens van der Maaten, Armand Joulin, Ishan MisraComments: Accepted at CVPR 2022 (Oral Presentation)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [449] arXiv:2201.08379 [pdf, other]
-
Title: Learning Pixel Trajectories with Multiscale Contrastive Random WalksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [450] arXiv:2201.08383 [pdf, other]
-
Title: MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video RecognitionAuthors: Chao-Yuan Wu, Yanghao Li, Karttikeya Mangalam, Haoqi Fan, Bo Xiong, Jitendra Malik, Christoph FeichtenhoferComments: Technical report. arXiv v2: add link to codeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [451] arXiv:2201.08425 [pdf, other]
-
Title: FaceOcc: A Diverse, High-quality Face Occlusion Dataset for Human Face ExtractionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [452] arXiv:2201.08465 [pdf, other]
-
Title: An Empirical Investigation of Model-to-Model Distribution Shifts in Trained Convolutional FiltersSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [453] arXiv:2201.08550 [pdf, other]
-
Title: What Can Machine Vision Do for Lymphatic Histopathology Image Analysis: A Comprehensive ReviewAuthors: Xiaoqi Li, Haoyuan Chen, Chen Li, Md Mamunur Rahaman, Xintong Li, Jian Wu, Xiaoyan Li, Hongzan Sun, Marcin GrzegorzekSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [454] arXiv:2201.08574 [pdf, other]
-
Title: Classroom Slide Narration SystemJournal-ref: CVIP 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [455] arXiv:2201.08613 [pdf, other]
-
Title: Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint LocalizationComments: To appear at ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [456] arXiv:2201.08619 [pdf, other]
-
Title: Dangerous Cloaking: Natural Trigger based Backdoor Attacks on Object Detectors in the Physical WorldAuthors: Hua Ma, Yinshan Li, Yansong Gao, Alsharif Abuadbba, Zhi Zhang, Anmin Fu, Hyoungshick Kim, Said F. Al-Sarawi, Nepal Surya, Derek AbbottSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
- [457] arXiv:2201.08625 [pdf, other]
-
Title: VIPriors 2: Visual Inductive Priors for Data-Efficient Deep Learning ChallengesAuthors: Attila Lengyel, Robert-Jan Bruintjes, Marcos Baptista Rios, Osman Semih Kayhan, Davide Zambrano, Nergis Tomen, Jan van GemertComments: 11 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [458] arXiv:2201.08633 [pdf, other]
-
Title: Multi-view Monocular Depth and Uncertainty Prediction with Deep SfM in Dynamic EnvironmentsComments: 20 pages, 5 figures, 3 tables, submitted to ICPRAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [459] arXiv:2201.08636 [pdf, ps, other]
-
Title: Conceptor Learning for Class Activation MappingSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [460] arXiv:2201.08657 [pdf, other]
-
Title: Enhancing Pseudo Label Quality for Semi-Supervised Domain-Generalized Medical Image SegmentationComments: Accepted by AAAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [461] arXiv:2201.08663 [pdf, other]
- [462] arXiv:2201.08669 [pdf, other]
-
Title: Dynamic Deep Convolutional Candlestick LearnerComments: 11 pages, 9 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [463] arXiv:2201.08673 [pdf, other]
-
Title: Exploring Fusion Strategies for Accurate RGBT Visual Object TrackingAuthors: Zhangyong Tang (1), Tianyang Xu (1), Hui Li (1), Xiao-Jun Wu (1), Xuefeng Zhu (1), Josef Kittler (2) ((1) Jiangnan University, Wuxi, China, (2) University of Surrey, UK)Comments: 13 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [464] arXiv:2201.08683 [pdf, other]
-
Title: A Comprehensive Study of Vision Transformers on Dense Prediction TasksAuthors: Kishaan Jeeveswaran, Senthilkumar Kathiresan, Arnav Varma, Omar Magdy, Bahram Zonooz, Elahe AraniComments: 17th International Conference on Computer Vision Theory and Applications (VISAPP, 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [465] arXiv:2201.08746 [pdf, other]
-
Title: ERS: a novel comprehensive endoscopy image dataset for machine learning, compliant with the MST 3.0 specificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [466] arXiv:2201.08763 [pdf, other]
-
Title: Object Detection in Aerial Images: What Improves the Accuracy?Comments: 8 pages, 14 FiguresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [467] arXiv:2201.08779 [pdf, other]
-
Title: Contrastive and Selective Hidden Embeddings for Medical Image SegmentationAuthors: Zhuowei Li, Zihao Liu, Zhiqiang Hu, Qing Xia, Ruiqin Xiong, Shaoting Zhang, Dimitris Metaxas, Tingting JiangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [468] arXiv:2201.08789 [pdf, other]
-
Title: AiTLAS: Artificial Intelligence Toolbox for Earth ObservationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [469] arXiv:2201.08812 [pdf, ps, other]
-
Title: DeepMix: Mobility-aware, Lightweight, and Hybrid 3D Object Detection for HeadsetsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [470] arXiv:2201.08813 [pdf, other]
-
Title: Active Predictive Coding Networks: A Neural Solution to the Problem of Learning Reference Frames and Part-Whole HierarchiesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [471] arXiv:2201.08815 [pdf, other]
-
Title: Learning from One and Only One ShotSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [472] arXiv:2201.08816 [pdf, other]
-
Title: Skyline variations allow estimating distance to trees on landscape photos using semantic segmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Applications (stat.AP)
- [473] arXiv:2201.08831 [pdf, other]
-
Title: Reliable Detection of Doppelgängers based on Deep Face RepresentationsComments: accepted in IET BiometricsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [474] arXiv:2201.08845 [pdf, other]
-
Title: Point-NeRF: Point-based Neural Radiance FieldsAuthors: Qiangeng Xu, Zexiang Xu, Julien Philip, Sai Bi, Zhixin Shu, Kalyan Sunkavalli, Ulrich NeumannComments: Accepted to CVPR 2022 (Oral)Journal-ref: In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 5438-5448) (2022)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [475] arXiv:2201.08887 [pdf, other]
-
Title: Image-to-Video Re-Identification via Mutual Discriminative Knowledge TransferComments: accepted by ICASSP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [476] arXiv:2201.08893 [pdf, other]
-
Title: Signal Strength and Noise Drive Feature Preference in CNN Image ClassifiersComments: Accepted at SVRHM 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [477] arXiv:2201.08901 [pdf, ps, other]
-
Title: An Ensemble Model for Face Liveness DetectionComments: Accepted and presented at MLDM 2022. To be published in Lattice journalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [478] arXiv:2201.08938 [pdf, other]
-
Title: Adaptive DropBlock Enhanced Generative Adversarial Networks for Hyperspectral Image ClassificationJournal-ref: in IEEE Transactions on Geoscience and Remote Sensing, vol. 59, no. 6, pp. 5040-5053, June 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [479] arXiv:2201.08949 [pdf, other]
-
Title: Temporal Aggregation for Adaptive RGBT TrackingComments: 12 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [480] arXiv:2201.08951 [pdf, other]
-
Title: Visual Representation Learning with Self-Supervised Attention for Low-Label High-data RegimeComments: Accepted to ICASSP-2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [481] arXiv:2201.08953 [pdf, other]
-
Title: FedMed-GAN: Federated Domain Translation on Unsupervised Cross-Modality Brain Image SynthesisSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [482] arXiv:2201.08954 [pdf, other]
-
Title: Change Detection from Synthetic Aperture Radar Images via Graph-Based Knowledge Supplement NetworkComments: Accepted by IEEE JSTARSSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [483] arXiv:2201.08958 [pdf, other]
-
Title: Learning Efficient Representations for Enhanced Object Detection on Large-scene SAR ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [484] arXiv:2201.08959 [pdf, other]
-
Title: Few-shot Object Counting with Similarity-Aware Feature EnhancementComments: Accepted by WACV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [485] arXiv:2201.08962 [pdf, other]
-
Title: Collaborative Representation for SPD Matrices with Application to Image-Set ClassificationComments: 9 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [486] arXiv:2201.08970 [pdf, other]
-
Title: Parallel Rectangle Flip Attack: A Query-based Black-box Attack against Object DetectionComments: 8 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [487] arXiv:2201.08977 [pdf, other]
-
Title: Semi-Supervised Adversarial Recognition of Refined Window Structures for Inverse Procedural Façade ModelingAuthors: Han Hu, Xinrong Liang, Yulin Ding, Qisen Shang, Bo Xu, Xuming Ge, Min Chen, Ruofei Zhong, Qing ZhuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [488] arXiv:2201.08983 [pdf, other]
-
Title: BBA-net: A bi-branch attention network for crowd countingJournal-ref: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [489] arXiv:2201.08992 [pdf, other]
-
Title: Enhancing and Dissecting Crowd Counting By Synthetic DataSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [490] arXiv:2201.08996 [pdf, other]
-
Title: Linear Array Network for Low-light Image EnhancementAuthors: Keqi Wang, Ziteng Cui, Jieru Jia, Hao Xu, Ge Wu, Yin Zhuang, Lu Chen, Zhiguo Hu, Yuhua QianSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [491] arXiv:2201.09023 [pdf, other]
-
Title: Content-aware Warping for View SynthesisComments: arXiv admin note: text overlap with arXiv:2108.07408Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [492] arXiv:2201.09041 [pdf, other]
-
Title: Inter-Semantic Domain Adversarial in Histopathological ImagesAuthors: Nicolas Dumas, Valentin Derangère, Laurent Arnould, Sylvain Ladoire, Louis-Oscar Morel, Nathan VinçonComments: 8 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [493] arXiv:2201.09042 [pdf, other]
-
Title: Uncertainty-aware deep learning methods for robust diabetic retinopathy classificationAuthors: Joel Jaskari, Jaakko Sahlsten, Theodoros Damoulas, Jeremias Knoblauch, Simo Särkkä, Leo Kärkkäinen, Kustaa Hietala, Kimmo KaskiSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [494] arXiv:2201.09048 [pdf, other]
- [495] arXiv:2201.09049 [pdf, other]
-
Title: LTC-SUM: Lightweight Client-driven Personalized Video Summarization Framework Using 2D CNNComments: 14Journal-ref: in IEEE Access, vol. 10, pp. 103041-103055, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [496] arXiv:2201.09061 [pdf, other]
-
Title: Explore the Expression: Facial Expression Generation using Auxiliary Classifier Generative Adversarial NetworkAuthors: J. Rafid SiddiquiSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [497] arXiv:2201.09077 [pdf, other]
-
Title: LTC-GIF: Attracting More Clicks on Feature-length Sports VideosSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [498] arXiv:2201.09079 [pdf, other]
-
Title: Implicit Bias of Projected Subgradient Method Gives Provable Robust Recovery of Subspaces of Unknown CodimensionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [499] arXiv:2201.09089 [pdf, ps, other]
-
Title: A Comprehensive Study on Occlusion Invariant Face Recognition under Face Mask OcclusionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [500] arXiv:2201.09109 [pdf, other]
-
Title: Robust Unpaired Single Image Super-Resolution of FacesComments: 8 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [501] arXiv:2201.09120 [pdf, other]
-
Title: Investigating the Potential of Auxiliary-Classifier GANs for Image Classification in Low Data RegimesComments: 4 pages content, 1 page references, 3 figures, 2 tables, to appear in ICASSP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [502] arXiv:2201.09135 [pdf, other]
-
Title: MIDAS: Deep learning human action intention prediction from natural eye movement patternsSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [503] arXiv:2201.09139 [pdf, other]
-
Title: Dual-Flattening Transformers through Decomposed Row and Column Queries for Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [504] arXiv:2201.09144 [pdf, other]
-
Title: Background Invariant Classification on Infrared Imagery by Data Efficient Training and Reducing Bias in CNNsComments: Accepted in AAAI-22 WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [505] arXiv:2201.09152 [pdf, other]
-
Title: Generative Adversarial Network Applications in Creating a Meta-UniverseComments: Computational Science and Computational Intelligence; 2021 International Conference on IEEE CPS (IEEE XPLORE, Scopus), IEEE, 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [506] arXiv:2201.09153 [pdf, other]
-
Title: An Integrated Approach for Video Captioning and ApplicationsComments: The 2021 World Congress in Computer Science, Computer Engineering, and Applied Computing (CSCE'21), IEEE, 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [507] arXiv:2201.09156 [pdf, other]
-
Title: LSNet: Extremely Light-Weight Siamese Network For Change Detection in Remote Sensing ImageSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [508] arXiv:2201.09167 [pdf, other]
-
Title: Mixed X-Ray Image Separation for Artworks with Concealed DesignsAuthors: Wei Pu, Jun-Jie Huang, Barak Sober, Nathan Daly, Catherine Higgitt, Ingrid Daubechies, Pier Luigi Dragotti, Miguel RodiguesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [509] arXiv:2201.09168 [pdf, other]
-
Title: Reading-strategy Inspired Visual Representation Learning for Text-to-Video RetrievalComments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [510] arXiv:2201.09169 [pdf, other]
-
Title: Rich Action-semantic Consistent Knowledge for Early Action PredictionComments: Accepted by IEEE TIP, 15 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [511] arXiv:2201.09193 [pdf, other]
-
Title: Learning to Minimize the Remainder in Supervised LearningComments: Accepted to IEEE TMMSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [512] arXiv:2201.09201 [pdf, other]
-
Title: Vision-Based UAV Self-Positioning in Low-Altitude Urban EnvironmentsComments: 13 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [513] arXiv:2201.09205 [pdf, other]
-
Title: Deeply Explain CNN via Hierarchical DecompositionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [514] arXiv:2201.09206 [pdf, other]
-
Title: A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-LocalizationComments: 14 pages, 13 figures, IEEE Transactions on Circuits and Systems for Video TechnologySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [515] arXiv:2201.09207 [src]
-
Title: Visual Object Tracking on Multi-modal RGB-D Videos: A ReviewComments: I prefer not to present this paper due to its subpar qualitySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [516] arXiv:2201.09208 [pdf, ps, other]
-
Title: Design of Sensor Fusion Driver Assistance System for Active Pedestrian SafetyAuthors: I-Hsi Kao, Ya-Zhu Yian, Jian-An Su, Yi-Horng Lai, Jau-Woei Perng, Tung-Li Hsieh, Yi-Shueh Tsai, Min-Shiu HsiehComments: The 14th International Conference on Automation Technology (Automation 2017), December 8-10, 2017, Kaohsiung, TaiwanSubjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [517] arXiv:2201.09213 [pdf, ps, other]
-
Title: FN-Net: Remove the Outliers by Filtering the NoiseAuthors: Kai LvComments: 6 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [518] arXiv:2201.09246 [pdf, other]
-
Title: Face recognition via compact second order image gradient orientationsComments: 26 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [519] arXiv:2201.09271 [pdf, other]
-
Title: Wavelet-Attention CNN for Image ClassificationAuthors: Zhao XiangyuComments: 17 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [520] arXiv:2201.09286 [pdf, other]
-
Title: How to scale hyperparameters for quickshift image segmentationAuthors: Damien GarreauComments: 33 pages, 16 figures. Accepted to AISTATS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [521] arXiv:2201.09296 [pdf, other]
-
Title: A Survey for Deep RGBT TrackingComments: 7 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [522] arXiv:2201.09302 [pdf, ps, other]
-
Title: 1000x Faster Camera and Machine Vision with Ordinary DevicesAuthors: Tiejun Huang, Yajing Zheng, Zhaofei Yu, Rui Chen, Yuan Li, Ruiqin Xiong, Lei Ma, Junwei Zhao, Siwei Dong, Lin Zhu, Jianing Li, Shanshan Jia, Yihua Fu, Boxin Shi, Si Wu, Yonghong TianSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [523] arXiv:2201.09308 [pdf, other]
-
Title: Basket-based SoftmaxSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [524] arXiv:2201.09318 [pdf, other]
-
Title: Sparse-view Cone Beam CT Reconstruction using Data-consistent Supervised and Adversarial Learning from Scarce Training DataSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
- [525] arXiv:2201.09352 [pdf, other]
-
Title: Out of Distribution Detection on ImageNet-OSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [526] arXiv:2201.09354 [pdf, other]
-
Title: Survey and Systematization of 3D Object Detection Models and MethodsComments: accepted at "The Visual Computer"Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [527] arXiv:2201.09355 [pdf, ps, other]
-
Title: Transformer-based SAR Image DespecklingAuthors: Malsha V. Perera, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. PatelComments: Submitted to International Geoscience and Remote Sensing Symposium (IGARSS), 2022. Our code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [528] arXiv:2201.09373 [pdf, other]
-
Title: Unsupervised Severely Deformed Mesh Reconstruction (DMR) from a Single-View ImageAuthors: Jie Mei, Jingxi Yu, Suzanne Romain, Craig Rose, Kelsey Magrane, Graeme LeeSon, Jenq-Neng HwangComments: Under ReviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [529] arXiv:2201.09381 [pdf, other]
-
Title: vCLIMB: A Novel Video Class Incremental Learning BenchmarkAuthors: Andrés Villa, Kumail Alhamoud, Juan León Alcázar, Fabian Caba Heilbron, Victor Escorcia, Bernard GhanemComments: An updated version of our CVPR 2022 paper (oral); v2 adds minor text changes. The code of our benchmark can be found at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [530] arXiv:2201.09384 [pdf, ps, other]
-
Title: A Comprehensive Survey on Federated Learning: Concept and ApplicationsJournal-ref: Lecture Notes on Data Engineering and Communications Technologies 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [531] arXiv:2201.09388 [pdf, ps, other]
-
Title: A Survey on Patients Privacy Protection with Steganography and Visual EncryptionSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [532] arXiv:2201.09390 [pdf, other]
-
Title: AttentionHTR: Handwritten Text Recognition Based on Attention Encoder-Decoder NetworksComments: 15th IAPR International Workshop on Document Analysis System (DAS)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [533] arXiv:2201.09395 [pdf, ps, other]
-
Title: MISeval: a Metric Library for Medical Image Segmentation EvaluationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [534] arXiv:2201.09396 [pdf, other]
-
Title: Dynamic Label Assignment for Object Detection by Combining Predicted IoUs and Anchor IoUsJournal-ref: Journal of Imaging 2022, 8(7), 193Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [535] arXiv:2201.09405 [pdf, other]
-
Title: Improving Chest X-Ray Report Generation by Leveraging Warm StartingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [536] arXiv:2201.09407 [pdf, other]
-
Title: Cross-Domain Document Layout Analysis via Unsupervised Document Style GuideSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [537] arXiv:2201.09421 [pdf, other]
- [538] arXiv:2201.09450 [pdf, other]
-
Title: UniFormer: Unifying Convolution and Self-attention for Visual RecognitionAuthors: Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu QiaoComments: 18 pages, 10 figures, 23 tables. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [539] arXiv:2201.09548 [pdf, other]
-
Title: Consistent 3D Hand Reconstruction in Video via self-supervised LearningComments: arXiv admin note: substantial text overlap with arXiv:2103.11703Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence. 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [540] arXiv:2201.09563 [pdf, ps, other]
-
Title: Debiasing pipeline improves deep learning model generalization for X-ray based lung nodule detectionAuthors: Michael Horry, Subrata Chakraborty, Biswajeet Pradhan, Manoranjan Paul, Jing Zhu, Hui Wen Loh, Prabal Datta Barua, U. Rajendra ArharyaComments: 32 pages, 17 figures, 4 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [541] arXiv:2201.09574 [pdf, other]
-
Title: Multi-Scale Iterative Refinement Network for RGB-D Salient Object DetectionComments: 40 pagesJournal-ref: Engineering Applications of Artificial Intelligence (2021)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [542] arXiv:2201.09575 [pdf, other]
-
Title: Importance of Textlines in Historical Document ClassificationComments: 13 pages, 7 figures, 5 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [543] arXiv:2201.09594 [pdf, other]
-
Title: Describe me if you can! Characterized Instance-level Human ParsingComments: 5 pagesJournal-ref: Published in: 2021 IEEE International Conference on Image Processing (ICIP)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [544] arXiv:2201.09604 [pdf, other]
-
Title: End-to-end Person Search Sequentially Trained on Aggregated DatasetComments: 5 pagesJournal-ref: Published in: 2019 IEEE International Conference on Image Processing (ICIP)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [545] arXiv:2201.09613 [pdf, other]
-
Title: SEN12MS-CR-TS: A Remote Sensing Data Set for Multi-modal Multi-temporal Cloud RemovalJournal-ref: IEEE Transactions on Geoscience and Remote Sensing, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [546] arXiv:2201.09633 [pdf, other]
-
Title: Paired Image to Image Translation for Strikethrough Removal From Handwritten WordsComments: accepted at DAS2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [547] arXiv:2201.09639 [pdf, other]
-
Title: Question Generation for Evaluating Cross-Dataset Shifts in Multi-modal GroundingAuthors: Arjun R. AkulaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [548] arXiv:2201.09689 [pdf, other]
-
Title: Which Style Makes Me Attractive? Interpretable Control Discovery and Counterfactual Explanation on StyleGANSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
- [549] arXiv:2201.09700 [pdf, ps, other]
-
Title: Feature transforms for image data augmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [550] arXiv:2201.09701 [pdf, other]
-
Title: Learning Semantics for Visual Place Recognition through Multi-Scale AttentionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [551] arXiv:2201.09717 [pdf, other]
-
Title: Keeping Deep Lithography Simulators Updated: Global-Local Shape-Based Novelty Detection and Active LearningAuthors: Hao-Chiang Shao, Hsing-Lei Ping, Kuo-shiuan Chen, Weng-Tai Su, Chia-Wen Lin, Shao-Yun Fang, Pin-Yian Tsai, Yan-Hsiu LiuSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [552] arXiv:2201.09724 [pdf, other]
-
Title: Hot-Refresh Model Upgrades with Regression-Alleviating Compatible Training in Image RetrievalComments: Accepted to ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [553] arXiv:2201.09792 [pdf, other]
-
Title: Patches Are All You Need?Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [554] arXiv:2201.09799 [pdf, other]
-
Title: Neural Architecture Searching for Facial Attributes-based Depression RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [555] arXiv:2201.09822 [pdf, ps, other]
-
Title: Spectral-PQ: A Novel Spectral Sensitivity-Orientated Perceptual Compression Technique for RGB 4:4:4 Video DataComments: arXiv admin note: text overlap with arXiv:2005.07928Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [556] arXiv:2201.09846 [pdf, other]
-
Title: A Novel Mix-normalization Method for Generalizable Multi-source Person Re-identificationComments: Accepted by IEEE Transactions on Multimedia (TMM)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [557] arXiv:2201.09865 [pdf, other]
-
Title: RePaint: Inpainting using Denoising Diffusion Probabilistic ModelsComments: We missed out on other diffusion models that work on inpainting. We corrected that and apologize for this mistakeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [558] arXiv:2201.09933 [pdf, other]
-
Title: Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear DevicesAuthors: Yingying Zhao, Yuhu Chang, Yutian Lu, Yujiang Wang, Mingzhi Dong, Qin Lv, Robert P. Dick, Fan Yang, Tun Lu, Ning Gu, Li ShangComments: The EMO-Film dataset is available at: this https URLJournal-ref: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), Volume 6, Issue 1, Article 38. March 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [559] arXiv:2201.09935 [pdf, other]
-
Title: What is the cost of adding a constraint in linear least squares?Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
- [560] arXiv:2201.09967 [pdf, other]
-
Title: Attacks and Defenses for Free-Riders in Multi-Discriminator GANSubjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
- [561] arXiv:2201.09968 [pdf, other]
-
Title: ImpliCity: City Modeling from Satellite Images with Deep Implicit Occupancy FieldsComments: Accepted for publication in the International Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences (camera-ready version including keywords + supplementary material)Journal-ref: ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci., V-2-2022, 193-201, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [562] arXiv:2201.09973 [pdf, ps, other]
-
Title: The Vehicle Trajectory Prediction Based on ResNet and EfficientNet ModelSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [563] arXiv:2201.10015 [pdf, ps, other]
-
Title: Automatic Recognition and Digital Documentation of Cultural Heritage Hemispherical Domes using ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [564] arXiv:2201.10029 [pdf, other]
-
Title: PONI: Potential Functions for ObjectGoal Navigation with Interaction-free LearningAuthors: Santhosh Kumar Ramakrishnan, Devendra Singh Chaplot, Ziad Al-Halah, Jitendra Malik, Kristen GraumanComments: 8 pages + supplementary. Accepted in CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [565] arXiv:2201.10034 [pdf, other]
-
Title: Self-Supervised Point Cloud Registration with Deep Versatile DescriptorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [566] arXiv:2201.10047 [src]
-
Title: Are Commercial Face Detection Models as Biased as Academic Models?Comments: This preprint and arXiv:2108.12508 were combined and a more rigorous analysis added to result in the NeurIPS Datasets & Benchmark 2022 paper arXiv:2211.15937Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
- [567] arXiv:2201.10060 [pdf, other]
-
Title: ViT-HGR: Vision Transformer-based Hand Gesture Recognition from High Density Surface EMG SignalsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
- [568] arXiv:2201.10075 [pdf, other]
-
Title: Splatting-based Synthesis for Video Frame InterpolationComments: WACV 2023, this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [569] arXiv:2201.10079 [pdf, ps, other]
-
Title: Real-time automatic polyp detection in colonoscopy using feature enhancement module and spatiotemporal similarity correlation unitAuthors: Jianwei Xu, Ran Zhao, Yizhou Yu, Qingwei Zhang, Xianzhang Bian, Jun Wang, Zhizheng Ge, Dahong QianComments: This paper has been accepted by Biomedical Signal Processing and Control. Please cite the paper as Xu, J., Zhao, R., Yu, Y., Zhang, Q., Bian, X., Wang, J., Ge, Z., Qian, D., 2021. Real-time automatic polyp detection in colonoscopy using feature enhancement module and spatiotemporal similarity correlation unit. Biomedical Signal Processing and Control 66, 102503Journal-ref: Biomedical Signal Processing and Control, vol. 66, p. 102503, Apr. 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [570] arXiv:2201.10084 [pdf, other]
-
Title: Revisiting L1 Loss in Super-Resolution: A Probabilistic View and BeyondComments: Technical reportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [571] arXiv:2201.10102 [pdf, ps, other]
-
Title: A Classical Approach to Handcrafted Feature Extraction Techniques for Bangla Handwritten Digit RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [572] arXiv:2201.10107 [pdf, other]
-
Title: ARPD: Anchor-free Rotation-aware People Detection using Topview Fisheye CameraComments: 2021 17th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [573] arXiv:2201.10110 [pdf, other]
-
Title: A Hybrid Quantum-Classical Algorithm for Robust FittingComments: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [574] arXiv:2201.10138 [pdf, other]
-
Title: SURDS: Self-Supervised Attention-guided Reconstruction and Dual Triplet Loss for Writer Independent Offline Signature VerificationComments: Accepted at ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [575] arXiv:2201.10145 [pdf, other]
-
Title: Riemannian Local Mechanism for SPD Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [576] arXiv:2201.10147 [pdf, other]
-
Title: TGFuse: An Infrared and Visible Image Fusion Approach Based on Transformer and Generative Adversarial NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [577] arXiv:2201.10152 [pdf, other]
-
Title: Unsupervised Image Fusion Method based on Feature Mutual MappingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [578] arXiv:2201.10162 [pdf, other]
-
Title: Semantically Video Coding: Instill Static-Dynamic Clues into Structured Bitstream for AI TasksComments: 21 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [579] arXiv:2201.10168 [pdf, other]
-
Title: Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding in VideosComments: Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [580] arXiv:2201.10175 [pdf, other]
-
Title: RFMask: A Simple Baseline for Human Silhouette Segmentation with Radio SignalsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [581] arXiv:2201.10182 [pdf, other]
-
Title: Pre-Trained Language Transformers are Universal Image ClassifiersAuthors: Rahul Goel, Modar Sulaiman, Kimia Noorbakhsh, Mahdi Sharifi, Rajesh Sharma, Pooyan Jamshidi, Kallol RoySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [582] arXiv:2201.10184 [pdf, other]
-
Title: Estimating the Direction and Radius of Pipe from GPR Image by Ellipse Inversion ModelSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [583] arXiv:2201.10185 [pdf, other]
-
Title: Zero-Shot Sketch Based Image Retrieval using Graph TransformerComments: Accepted at ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [584] arXiv:2201.10210 [pdf, ps, other]
-
Title: Universal Generative Modeling for Calibration-free Parallel Mr ImagingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [585] arXiv:2201.10212 [pdf, other]
-
Title: Feature Diversity Learning with Sample Dropout for Unsupervised Domain Adaptive Person Re-identificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [586] arXiv:2201.10243 [pdf, other]
-
Title: BERTHA: Video Captioning Evaluation Via Transfer-Learned Human AssessmentComments: In press in Language Resources and Evaluation Conference (LREC) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [587] arXiv:2201.10252 [pdf, other]
-
Title: DocEnTr: An End-to-End Document Image Enhancement TransformerAuthors: Mohamed Ali Souibgui, Sanket Biswas, Sana Khamekhem Jemni, Yousri Kessentini, Alicia Fornés, Josep Lladós, Umapada PalComments: submitted to ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [588] arXiv:2201.10271 [pdf, other]
-
Title: Convolutional Xformers for VisionComments: 9 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [589] arXiv:2201.10276 [pdf, other]
-
Title: City3D: Large-Scale Building Reconstruction from Airborne LiDAR Point CloudsSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [590] arXiv:2201.10326 [pdf, other]
-
Title: ShapeFormer: Transformer-based Shape Completion via Sparse RepresentationComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [591] arXiv:2201.10366 [pdf, other]
-
Title: ADAPT: An Open-Source sUAS Payload for Real-Time Disaster Prediction and Response with AIAuthors: Daniel Davila, Joseph VanPelt, Alexander Lynch, Adam Romlein, Peter Webley, Matthew S. BrownComments: To be published in Workshop on Practical Deep Learning in the Wild at AAAI Conference on Artificial Intelligence 2022, 9 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [592] arXiv:2201.10369 [pdf, ps, other]
-
Title: Winograd Convolution for Deep Neural Networks: Efficient Point SelectionComments: 19 pages, 3 figures, 9 tables and 32 equationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [593] arXiv:2201.10389 [pdf, other]
-
Title: BLDNet: A Semi-supervised Change Detection Building Damage Framework using Graph Convolutional Networks and Urban Domain KnowledgeComments: 16 pages, 15 figures, submitted to IEEE Transactions on Geoscience and Remote SensingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [594] arXiv:2201.10394 [pdf, other]
-
Title: Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action RecognitionComments: BMVC 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [595] arXiv:2201.10395 [pdf, other]
-
Title: Towards Cross-Disaster Building Damage Assessment with Graph Convolutional NetworksComments: 5 pages, 3 figures, submitted to IEEE IGARSS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [596] arXiv:2201.10410 [pdf, other]
-
Title: Comparison of Evaluation Metrics for Landmark Detection in CMR ImagesAuthors: Sven Koehler, Lalith Sharan, Julian Kuhm, Arman Ghanaat, Jelizaveta Gordejeva, Nike K. Simon, Niko M. Grell, Florian André, Sandy EngelhardtComments: Accepted at Bildverarbeitung für die Medizin (BVM), Informatik aktuell. Springer Vieweg, Wiesbaden 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [597] arXiv:2201.10423 [pdf, other]
-
Title: Rayleigh EigenDirections (REDs): GAN latent space traversals for multidimensional featuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [598] arXiv:2201.10431 [pdf, other]
-
Title: Main Product Detection with Graph Networks for FashionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [599] arXiv:2201.10439 [pdf, other]
-
Title: Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person VideoComments: 5 pages, 3 figures, published at Interspeech 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [600] arXiv:2201.10448 [pdf, other]
-
Title: How Low Can We Go? Pixel Annotation for Semantic SegmentationComments: Paper and SupplementarySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [601] arXiv:2201.10489 [pdf, other]
-
Title: Sphere2Vec: Multi-Scale Representation Learning over a Spherical Surface for Geospatial PredictionsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [602] arXiv:2201.10520 [pdf, ps, other]
-
Title: Adaptive Activation-based Structured PruningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [603] arXiv:2201.10521 [pdf, other]
-
Title: A Review of Deep Learning Based Image Super-resolution TechniquesAuthors: Fangyuan ZhuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [604] arXiv:2201.10522 [pdf, other]
-
Title: Blind Image Deblurring: a ReviewAuthors: Zhengrong XueSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [605] arXiv:2201.10523 [pdf, ps, other]
-
Title: Interpretability in Convolutional Neural Networks for Building Damage Classification in Satellite ImageryAuthors: Thomas Y. ChenComments: 8 pages; presented as Spotlight Talk at NeurIPS - Tackling Climate Change with Machine Learning workshop 2020Journal-ref: NeurIPS 2020 Workshop on Tackling Climate Change with Machine LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Geophysics (physics.geo-ph)
- [606] arXiv:2201.10526 [pdf, other]
-
Title: MonarchNet: Differentiating Monarch Butterflies from Butterflies Species with Similar PhenotypesAuthors: Thomas Y. ChenComments: 5 pages, 2 figures, Proceedings of NeurIPS 2020 - Learning Meaningful Representations of Life (LMRL) Workshop. The FASEB JournalJournal-ref: CVPR 2021 Workshop on CV4Animals (Computer Vision for Animal Behavior Tracking and Modeling)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Populations and Evolution (q-bio.PE); Applications (stat.AP)
- [607] arXiv:2201.10602 [pdf, other]
-
Title: Jacobian Computation for Cumulative B-Splines on SE(3) and Application to Continuous-Time Object TrackingComments: Accepted at IEEE Robotics and Automation LettersSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [608] arXiv:2201.10647 [pdf, other]
-
Title: Unsupervised Domain Adaptation for Vestibular Schwannoma and Cochlea Segmentation via Semi-supervised Learning and Label FusionComments: Accepted by MICCAI 2021 BrainLes Workshop. arXiv admin note: substantial text overlap with arXiv:2109.06274Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [609] arXiv:2201.10649 [pdf, other]
-
Title: Attentive Task Interaction Network for Multi-Task LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [610] arXiv:2201.10650 [pdf, other]
-
Title: Beyond Visual Image: Automated Diagnosis of Pigmented Skin Lesions Combining Clinical Image Features with Patient DataComments: 33 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [611] arXiv:2201.10654 [pdf, ps, other]
-
Title: SA-VQA: Structured Alignment of Visual and Semantic Representations for Visual Question AnsweringSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [612] arXiv:2201.10656 [pdf, ps, other]
-
Title: MGA-VQA: Multi-Granularity Alignment for Visual Question AnsweringSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [613] arXiv:2201.10664 [pdf, other]
-
Title: Do Neural Networks for Segmentation Understand Insideness?Authors: Kimberly Villalobos, Vilim Štih, Amineh Ahmadinejad, Shobhita Sundaram, Jamell Dozier, Andrew Francl, Frederico Azevedo, Tomotake Sasaki, Xavier BoixJournal-ref: Neural Computation 33 (2021) 2511-2549Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
- [614] arXiv:2201.10665 [pdf, other]
-
Title: Writer Recognition Using Off-line Handwritten Single Block CharactersComments: Accepted for publication at IEEE International Workshop on Biometrics and Forensics IWBF 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [615] arXiv:2201.10675 [pdf, ps, other]
-
Title: Virtual Adversarial Training for Semi-supervised Breast Mass ClassificationAuthors: Xuxin Chen, Ximin Wang, Ke Zhang, Kar-Ming Fung, Theresa C. Thai, Kathleen Moore, Robert S. Mannel, Hong Liu, Bin Zheng, Yuchen QiuComments: To appear in the conference Biophotonics and Immune Responses of SPIESubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
- [616] arXiv:2201.10695 [pdf, other]
-
Title: Estimation of Spectral Biophysical Skin Properties from Captured RGB AlbedoComments: 11 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [617] arXiv:2201.10700 [pdf, other]
-
Title: Deep Image Deblurring: A SurveyAuthors: Kaihao Zhang, Wenqi Ren, Wenhan Luo, Wei-Sheng Lai, Bjorn Stenger, Ming-Hsuan Yang, Hongdong LiComments: To appear in International Journal of Computer Vision (IJCV)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [618] arXiv:2201.10703 [pdf, other]
-
Title: Anomaly Detection via Reverse Distillation from One-Class EmbeddingComments: 10 pages, 7 figuresJournal-ref: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [619] arXiv:2201.10712 [pdf, other]
-
Title: Toward Data-Driven STAP RadarAuthors: Shyam Venkatasubramanian, Chayut Wongkamthong, Mohammadreza Soltani, Bosung Kang, Sandeep Gogineni, Ali Pezeshki, Muralidhar Rangaswamy, Vahid TarokhComments: 5 pages, 4 figures. Submitted to 2022 IEEE Radar Conference (RadarConf)Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [620] arXiv:2201.10725 [pdf, other]
-
Title: Image Generation with Self Pixel-wise NormalizationComments: 13 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [621] arXiv:2201.10728 [pdf, other]
-
Title: Training Vision Transformers with Only 2040 ImagesComments: 11 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [622] arXiv:2201.10734 [pdf, other]
-
Title: CrossRectify: Leveraging Disagreement for Semi-supervised Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [623] arXiv:2201.10736 [pdf, other]
-
Title: A Joint Convolution Auto-encoder Network for Infrared and Visible Image FusionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [624] arXiv:2201.10737 [pdf, other]
-
Title: Class-Aware Adversarial Transformers for Medical Image SegmentationAuthors: Chenyu You, Ruihan Zhao, Fenglin Liu, Siyuan Dong, Sandeep Chinchali, Ufuk Topcu, Lawrence Staib, James S. DuncanSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [625] arXiv:2201.10739 [pdf, other]
-
Title: Infrared and visible image fusion based on Multi-State Contextual Hidden Markov ModelSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [626] arXiv:2201.10753 [pdf, other]
-
Title: Interactive Image Inpainting Using Semantic GuidanceSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [627] arXiv:2201.10766 [pdf, other]
-
Title: A Comprehensive Study of Image Classification Model Sensitivity to Foregrounds, Backgrounds, and Visual AttributesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [628] arXiv:2201.10781 [pdf, other]
-
Title: ASFD: Automatic and Scalable Face DetectorAuthors: Jian Li, Bin Zhang, Yabiao Wang, Ying Tai, ZhenYu Zhang, Chengjie Wang, Jilin Li, Xiaoming Huang, Yili XiaComments: ACM MM2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [629] arXiv:2201.10788 [pdf, other]
-
Title: Self-supervised 3D Semantic Representation Learning for Vision-and-Language NavigationSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [630] arXiv:2201.10801 [pdf, other]
-
Title: When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention MechanismComments: accepted by AAAI-22Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [631] arXiv:2201.10830 [pdf, other]
-
Title: MonoDistill: Learning Spatial Features for Monocular 3D Object DetectionComments: Accepted by ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [632] arXiv:2201.10836 [pdf, other]
-
Title: PARS: Pseudo-Label Aware Robust Sample Selection for Learning with Noisy LabelsComments: 16 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [633] arXiv:2201.10848 [pdf, other]
-
Title: Comparison of Depth Estimation Setups from Stereo Endoscopy and Optical Tracking for Point MeasurementsAuthors: Lukas Burger, Lalith Sharan, Samantha Fischer, Julian Brand, Maximillian Hehl, Gabriele Romano, Matthias Karck, Raffaele De Simone, Ivo Wolf, Sandy EngelhardtComments: Accepted at Bildverarbeitung fuer die Medizin (BVM), Informatik aktuell. Springer Vieweg, Wiesbaden 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [634] arXiv:2201.10865 [pdf, ps, other]
-
Title: On the Issues of TrueDepth Sensor Data for Computer Vision Tasks Across Different iPad GenerationsComments: 17 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [635] arXiv:2201.10873 [pdf, other]
-
Title: TransPPG: Two-stream Transformer for Remote Heart Rate EstimateSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [636] arXiv:2201.10937 [pdf, other]
-
Title: Boosting 3D Adversarial Attacks with Attacking On FrequencyComments: 8 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [637] arXiv:2201.10938 [pdf, other]
-
Title: Projective Urban TexturingJournal-ref: International Conference on 3D Vision 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [638] arXiv:2201.10943 [pdf, other]
-
Title: Event-based Video Reconstruction via Potential-assisted Spiking Neural NetworkComments: Accepted by CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [639] arXiv:2201.10953 [pdf, other]
-
Title: Dual-Tasks Siamese Transformer Framework for Building Damage AssessmentComments: IGARSS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [640] arXiv:2201.10963 [pdf, other]
-
Title: Learning to Compose Diversified Prompts for Image Emotion ClassificationComments: 10 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [641] arXiv:2201.10972 [pdf, other]
-
Title: How Robust are Discriminatively Trained Zero-Shot Learning Models?Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [642] arXiv:2201.10985 [pdf, other]
-
Title: Jalisco's multiclass land cover analysis and classification using a novel lightweight convnet with real-world multispectral and relief dataAuthors: Alexander Quevedo, Abraham Sánchez, Raul Nancláres, Diana P. Montoya, Juan Pacho, Jorge Martínez, E. Ulises Moya-SánchezComments: 12 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [643] arXiv:2201.10990 [pdf, other]
-
Title: Learning To Recognize Procedural Activities with Distant SupervisionAuthors: Xudong Lin, Fabio Petroni, Gedas Bertasius, Marcus Rohrbach, Shih-Fu Chang, Lorenzo TorresaniComments: CVPR 2022. Code will be released here this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [644] arXiv:2201.11006 [pdf, other]
-
Title: An Overview of Compressible and Learnable Image Transformation with Secret Key and Its ApplicationsSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [645] arXiv:2201.11014 [pdf, other]
-
Title: Language-biased image classification: evaluation based on semantic representationsAuthors: Yoann Lemesle, Masataka Sawayama, Guillermo Valle-Perez, Maxime Adolphe, Hélène Sauzéon, Pierre-Yves OudeyerComments: Accepted at ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [646] arXiv:2201.11091 [pdf, ps, other]
-
Title: Momentum Capsule NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [647] arXiv:2201.11092 [pdf, ps, other]
-
Title: Self-Attention Neural Bag-of-FeaturesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [648] arXiv:2201.11095 [pdf, other]
-
Title: Self-attention fusion for audiovisual emotion recognition with incomplete dataSubjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [649] arXiv:2201.11097 [pdf, other]
-
Title: Adaptive Instance Distillation for Object Detection in Autonomous DrivingComments: 6 pages, 3 figuresJournal-ref: 2022 26th International Conference on Pattern Recognition (ICPR), Montreal, QC, Canada, 2022, pp. 4559-4565Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [650] arXiv:2201.11103 [pdf, other]
-
Title: Auto-Compressing Subset Pruning for Semantic Image SegmentationComments: 10 pages, 5 figures, 1 table, appendixSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [651] arXiv:2201.11114 [pdf, other]
-
Title: Natural Language Descriptions of Deep Visual FeaturesAuthors: Evan Hernandez, Sarah Schwettmann, David Bau, Teona Bagashvili, Antonio Torralba, Jacob AndreasComments: To be published as a conference paper at ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [652] arXiv:2201.11187 [pdf, other]
-
Title: DIREG3D: DIrectly REGress 3D Hands from Multiple CamerasJournal-ref: ICCV 2021 Fifth Workshop on Computer Vision for AR/VRSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Robotics (cs.RO); Image and Video Processing (eess.IV)
- [653] arXiv:2201.11192 [pdf, other]
-
Title: ReforesTree: A Dataset for Estimating Tropical Forest Carbon Stock with Deep Learning and Aerial ImageryAuthors: Gyri Reiersen, David Dao, Björn Lütjens, Konstantin Klemmer, Kenza Amara, Attila Steinegger, Ce Zhang, Xiaoxiang ZhuComments: Accepted paper for the AI for Social Impact Track at the AAAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [654] arXiv:2201.11197 [pdf, ps, other]
-
Title: Challenges and Opportunities for Machine Learning Classification of Behavior and Mental State from ImagesAuthors: Peter Washington, Cezmi Onur Mutlu, Aaron Kline, Kelley Paskov, Nate Tyler Stockham, Brianna Chrisman, Nick Deveau, Mourya Surhabi, Nick Haber, Dennis P. WallComments: 30 pages, 1 figure, 1 tableSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [655] arXiv:2201.11228 [pdf, other]
-
Title: Continuous Examination by Automatic Quiz Assessment Using Spiral Codes and Image ProcessingComments: Accepted at 13th IEEE Global Engineering Education Conference, EDUCON, Tunis, Tunisia, 28-31 March 2022 (Educational Conference)Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [656] arXiv:2201.11279 [pdf, other]
-
Title: Revisiting RCAN: Improved Training for Image Super-ResolutionAuthors: Zudi Lin, Prateek Garg, Atmadeep Banerjee, Salma Abdel Magid, Deqing Sun, Yulun Zhang, Luc Van Gool, Donglai Wei, Hanspeter PfisterComments: 13 pages with 10 tables and 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [657] arXiv:2201.11284 [pdf, other]
-
Title: Interactive 3D Character Modeling from 2D Orthogonal Drawings with AnnotationsComments: 6 pages, 4 figures, accepted in Proceedings of International Workshop on Advanced Image Technology 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [658] arXiv:2201.11296 [pdf, ps, other]
-
Title: Efficient divide-and-conquer registration of UAV and ground LiDAR point clouds through canopy shape contextSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [659] arXiv:2201.11307 [pdf, other]
-
Title: Dissecting the impact of different loss functions with gradient surgerySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [660] arXiv:2201.11316 [pdf, other]
-
Title: Transformer Module Networks for Systematic Generalization in Visual Question AnsweringSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [661] arXiv:2201.11319 [pdf, other]
-
Title: Dynamic Rectification Knowledge DistillationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [662] arXiv:2201.11345 [pdf, other]
-
Title: Exploring Global Diversity and Local Context for Video SummarizationComments: Accepted by IEEE AccessSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [663] arXiv:2201.11351 [pdf, other]
-
Title: Effective Shortcut Technique for GANComments: arXiv admin note: text overlap with arXiv:2112.14968Journal-ref: Applied Intelligence 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [664] arXiv:2201.11379 [pdf, other]
-
Title: Deep Confidence Guided Distance for 3D Partial Shape RegistrationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [665] arXiv:2201.11388 [pdf, other]
-
Title: Contrastive Embedding Distribution Refinement and Entropy-Aware Attention for 3D Point Cloud ClassificationComments: 15 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [666] arXiv:2201.11403 [pdf, other]
-
Title: Generalised Image Outpainting with U-TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [667] arXiv:2201.11407 [pdf, other]
-
Title: Non-linear Motion Estimation for Video Frame Interpolation using Space-time ConvolutionsComments: Accepted at CLIC workshop, CVPR 2022. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [668] arXiv:2201.11438 [pdf, other]
-
Title: DocSegTr: An Instance-Level End-to-End Document Image Segmentation TransformerComments: PreprintSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [669] arXiv:2201.11440 [pdf, ps, other]
-
Title: An Analysis on Ensemble Learning optimized Medical Image Classification with Deep Convolutional Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [670] arXiv:2201.11450 [pdf, other]
-
Title: In Defense of Kalman Filtering for Polyp Tracking from Colonoscopy VideosComments: Paper accepted to the International Symposium on Biomedical Imaging (ISBI) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [671] arXiv:2201.11460 [pdf, other]
-
Title: RelTR: Relation Transformer for Scene Graph GenerationComments: accepted by IEEE Transactions on Pattern Analysis and Machine IntelligenceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [672] arXiv:2201.11479 [pdf, other]
-
Title: Eye-focused Detection of Bell's Palsy in VideosComments: Published in the Proceedings of the 34th Canadian Conference on Artificial Intelligence. Please cite this paper in the following manner: S. A. Ansari, K. R. Jerripothula, P. Nagpal, and A. Mittal. "Eye-focused Detection of Bell's Palsy in Videos". In: Proceedings of the 34th Canadian Conference on Artificial Intelligence (June 8, 2021). doi: 10.21428/594757db.d2f8342bSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Image and Video Processing (eess.IV)
- [673] arXiv:2201.11500 [pdf, other]
-
Title: Head and eye egocentric gesture recognition for human-robot interaction using eyewear camerasComments: Copyright 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other worksJournal-ref: IEEE Robotics and Automation Letters, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Robotics (cs.RO)
- [674] arXiv:2201.11506 [pdf, other]
-
Title: Anomaly Detection in Retinal Images using Multi-Scale Deep Feature Sparse CodingComments: Accepted to ISBI 2022. © IEEESubjects: Computer Vision and Pattern Recognition (cs.CV)
- [675] arXiv:2201.11523 [pdf, other]
-
Title: ResiDualGAN: Resize-Residual DualGAN for Cross-Domain Remote Sensing Images Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [676] arXiv:2201.11528 [pdf, other]
-
Title: Beyond ImageNet Attack: Towards Crafting Adversarial Examples for Black-box DomainsComments: Accepted by ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [677] arXiv:2201.11547 [pdf, other]
-
Title: ASOC: Adaptive Self-aware Object Co-localizationComments: Published in IEEE ICME 2021. Please cite this paper in the following manner: K. R. Jerripothula and P. Mukherjee, "ASOC: Adaptive Self-Aware Object Co-Localization," 2021 IEEE International Conference on Multimedia and Expo (ICME), 2021, pp. 1-6, doi: 10.1109/ICME51207.2021.9428191Journal-ref: 2021 IEEE International Conference on Multimedia and Expo (ICME), 2021, pp. 1-6Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
- [678] arXiv:2201.11608 [pdf, ps, other]
-
Title: A Probabilistic Framework for Dynamic Object Recognition in 3D Environment With A Novel Continuous Ground Estimation MethodAuthors: Pouria MehrabiComments: Master's Thesis Submitted in Partial Fulfillment of The Requirements For The Degree of Master of Science in Electrical EngineeringSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [679] arXiv:2201.11620 [pdf, ps, other]
-
Title: Domain generalization in deep learning-based mass detection in mammography: A large-scale multi-center studySubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [680] arXiv:2201.11632 [pdf, other]
-
Title: Deep Video Prior for Video Consistency and PropagationComments: Accepted by TPAMI in Dec 2021; extension of NeurIPS2020 Blind Video Temporal Consistency via Deep Video Prior. arXiv admin note: substantial text overlap with arXiv:2010.11838Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [681] arXiv:2201.11664 [pdf, other]
-
Title: Team Yao at Factify 2022: Utilizing Pre-trained Models and Co-attention Networks for Multi-Modal Fact VerificationComments: Accepted by AAAI 2022 De-Factify Workshop: First Workshop on Multimodal Fact-Checking and Hate Speech DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
- [682] arXiv:2201.11674 [pdf, other]
-
Title: Vision Checklist: Towards Testable Error Analysis of Image Models to Help System Designers Interrogate Model CapabilitiesAuthors: Xin Du, Benedicte Legastelois, Bhargavi Ganesh, Ajitha Rajan, Hana Chockler, Vaishak Belle, Stuart Anderson, Subramanian RamamoorthyComments: 17 pages, 18 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [683] arXiv:2201.11697 [pdf, other]
-
Title: Constrained Structure Learning for Scene Graph GenerationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [684] arXiv:2201.11736 [pdf, other]
-
Title: Ranking Info Noise Contrastive Estimation: Boosting Contrastive Learning via Ranked PositivesComments: AAAI 2022 (Main Track)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [685] arXiv:2201.11760 [pdf, other]
-
Title: Unsupervised Denoising of Retinal OCT with Diffusion Probabilistic ModelComments: SPIE medical imaging, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [686] arXiv:2201.11782 [pdf, other]
-
Title: An Empirical Analysis of Recurrent Learning Algorithms In Neural Lossy Image Compression SystemsComments: Accepted at DCC 2021, 15 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [687] arXiv:2201.11794 [pdf, other]
-
Title: A Survey on Visual Transfer Learning using Knowledge GraphsComments: Semantic Web Journal (SWJ)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [688] arXiv:2201.11808 [pdf, other]
-
Title: LAP: An Attention-Based Module for Concept Based Self-Interpretation and Knowledge Injection in Convolutional Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [689] arXiv:2201.11828 [pdf, other]
-
Title: Pressure Eye: In-bed Contact Pressure Estimation via Contact-less ImagingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [690] arXiv:2201.11843 [pdf, other]
-
Title: Discriminative Supervised Subspace Learning for Cross-modal RetrievalSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [691] arXiv:2201.11852 [pdf, other]
-
Title: Towards an Automatic Diagnosis of Peripheral and Central Palsy Using Machine Learning on Facial FeaturesComments: 9 pages, 10 tables, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [692] arXiv:2201.11871 [pdf, other]
-
Title: Infrastructure-Based Object Detection and Tracking for Cooperative Driving Automation: A SurveySubjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [693] arXiv:2201.11898 [pdf, other]
-
Title: Indicative Image Retrieval: Turning Blackbox Learning into GreyAuthors: Xulu Zhang (1), Zhenqun Yang (2), Hao Tian (1), Qing Li (3), Xiaoyong Wei (1 and 3) ((1) Sichuan University, (2) Chinese University of Hong Kong, (3) Hong Kong Polytechnic University)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
- [694] arXiv:2201.11937 [pdf, other]
-
Title: Stereo Matching with Cost Volume based Sparse Disparity PropagationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [695] arXiv:2201.11963 [pdf, other]
-
Title: Shuffle Augmentation of Features from Unlabeled Data for Unsupervised Domain AdaptationComments: 17 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [696] arXiv:2201.11975 [pdf, other]
-
Title: Generalized Visual Quality Assessment of GAN-Generated Face ImagesComments: 12 pages, 8 figures, journal paperSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [697] arXiv:2201.11995 [pdf, other]
-
Title: Hybrid Contrastive Learning with Cluster Ensemble for Unsupervised Person Re-identificationComments: accepted by ACPR2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [698] arXiv:2201.12010 [pdf, other]
-
Title: Unfolding a blurred imageComments: arXiv admin note: substantial text overlap with arXiv:1804.02913Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [699] arXiv:2201.12047 [src]
-
Title: Exploring Object-Aware Attention Guided Frame Association for RGB-D SLAMAuthors: Ali Caglayan, Nevrez Imamoglu, Oguzhan Guclu, Ali Osman Serhatoglu, Weimin Wang, Ahmet Burak Can, Ryosuke NakamuraComments: This article has been removed by arXiv administrators because the submitter did not have the authority to grant the license at the time of submissionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [700] arXiv:2201.12051 [pdf, ps, other]
-
Title: Detection of fake faces in videosComments: 5 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [701] arXiv:2201.12078 [pdf, other]
-
Title: You Only Cut Once: Boosting Data Augmentation with a Single CutAuthors: Junlin Han, Pengfei Fang, Weihao Li, Jie Hong, Mohammad Ali Armin, Ian Reid, Lars Petersson, Hongdong LiComments: ICML 2022, Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [702] arXiv:2201.12083 [pdf, other]
-
Title: DynaMixer: A Vision MLP Architecture with Dynamic MixingComments: ICML 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [703] arXiv:2201.12084 [pdf, other]
-
Title: Psychophysical Evaluation of Human Performance in Detecting Digital Face Image ManipulationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [704] arXiv:2201.12086 [pdf, other]
-
Title: BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and GenerationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [705] arXiv:2201.12089 [pdf, other]
-
Title: Label uncertainty-guided multi-stream model for disease screeningComments: To appear in ISBI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [706] arXiv:2201.12094 [pdf, other]
-
Title: Leveraging Inlier Correspondences Proportion for Point Cloud RegistrationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [707] arXiv:2201.12099 [pdf, other]
-
Title: Detecting Owner-member Relationship with Graph Convolution Network in Fisheye Camera SystemComments: Accepted by Pattern Recognition. arXiv admin note: substantial text overlap with arXiv:2103.16099Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [708] arXiv:2201.12133 [pdf, other]
-
Title: O-ViT: Orthogonal Vision TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [709] arXiv:2201.12170 [pdf, other]
-
Title: Unsupervised Single-shot Depth Estimation using Perceptual ReconstructionAuthors: Christoph Angermann, Matthias Schwab, Markus Haltmeier, Christian Laubichler, Steinbjörn JónssonComments: arXiv admin note: text overlap with arXiv:2103.16938Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [710] arXiv:2201.12184 [pdf, other]
-
Title: A tomographic workflow to enable deep learning for X-ray based foreign object detectionAuthors: Mathé T. Zeegers, Tristan van Leeuwen, Daniël M. Pelt, Sophia Bethany Coban, Robert van Liere, Kees Joost BatenburgComments: This paper is under consideration at Expert Systems with Applications. 22 pages, 15 figuresJournal-ref: Expert Systems with Applications 206 (2022) 117768Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [711] arXiv:2201.12212 [pdf, other]
-
Title: Möbius Convolutions for Spherical CNNsComments: SIGGRAPH 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Representation Theory (math.RT)
- [712] arXiv:2201.12216 [pdf, other]
-
Title: Self-paced learning to improve text row detection in historical documents with missing labelsComments: Accepted at ECCV Workshop on Text in Everything (TiE 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [713] arXiv:2201.12265 [pdf, other]
-
Title: 3D-FlowNet: Event-based optical flow estimation with 3D representationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [714] arXiv:2201.12269 [pdf, ps, other]
-
Title: HSADML: Hyper-Sphere Angular Deep Metric based Learning for Brain Tumor ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [715] arXiv:2201.12285 [pdf, other]
-
Title: Benchmarking Conventional Vision Models on Neuromorphic Fall Detection and Action Recognition DatasetComments: 6 Pages, 2 FiguresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [716] arXiv:2201.12288 [pdf, other]
-
Title: VRT: A Video Restoration TransformerAuthors: Jingyun Liang, Jiezhang Cao, Yuchen Fan, Kai Zhang, Rakesh Ranjan, Yawei Li, Radu Timofte, Luc Van GoolComments: add results on VFI and STVSR; SOTA results (+up to 2.16dB) on video SR, video deblurring, video denoising, video frame interpolation and space-time video super-resolution. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [717] arXiv:2201.12329 [pdf, other]
-
Title: DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETRComments: Accepted to ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [718] arXiv:2201.12346 [pdf, other]
-
Title: DiriNet: A network to estimate the spatial and spectral degradation functionsAuthors: Ting HuSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [719] arXiv:2201.12384 [pdf, ps, other]
-
Title: Developing a Machine-Learning Algorithm to Diagnose Age-Related Macular DegenerationAuthors: Ananya Dua, Pham Hung Minh, Sajid Fahmid, Shikhar Gupta, Sophia Zheng, Vanessa Moyo, Yanran Elisa XueComments: 7 pages, 7 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [720] arXiv:2201.12385 [pdf, other]
-
Title: A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noiseComments: SPIE Medical Imaging 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
- [721] arXiv:2201.12386 [pdf, other]
-
Title: Few-shot Unsupervised Domain Adaptation for Multi-modal Cardiac Image SegmentationComments: Accepted to BVM2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [722] arXiv:2201.12425 [pdf, other]
-
Title: CoordX: Accelerating Implicit Neural Representation with a Split MLP ArchitectureSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [723] arXiv:2201.12437 [pdf, other]
-
Title: Mobile Robot Manipulation using Pure Object DetectionAuthors: Brent GriffinComments: WACV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [724] arXiv:2201.12467 [pdf, other]
-
Title: Improving Federated Learning Face Recognition via Privacy-Agnostic ClustersComments: ICLR2022, SpotlightSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [725] arXiv:2201.12499 [pdf, other]
-
Title: Reconstruction of Power Lines from Point CloudsComments: 15 pages, 8 figures, 1 tableSubjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
- [726] arXiv:2201.12506 [pdf, other]
-
Title: 2D+3D facial expression recognition via embedded tensor manifold regularizationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [727] arXiv:2201.12525 [pdf, other]
-
Title: Spherical Convolution empowered FoV Prediction in 360-degree Video Multicast with Limited FoV FeedbackSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [728] arXiv:2201.12527 [pdf, other]
-
Title: Scale-Invariant Adversarial Attack for Evaluating and Enhancing Adversarial DefensesComments: TDSC under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [729] arXiv:2201.12528 [pdf, ps, other]
-
Title: SupWMA: Consistent and Efficient Tractography Parcellation of Superficial White Matter with Deep LearningAuthors: Tengfei Xue, Fan Zhang, Chaoyi Zhang, Yuqian Chen, Yang Song, Nikos Makris, Yogesh Rathi, Weidong Cai, Lauren J. O'DonnellComments: ISBI 2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [730] arXiv:2201.12533 [pdf, other]
-
Title: Light field Rectification based on relative pose estimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [731] arXiv:2201.12543 [pdf, other]
-
Title: Fast Differentiable Matrix Square Root and Inverse Square RootComments: T-PAMI 2022. arXiv admin note: substantial text overlap with arXiv:2201.08663Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [732] arXiv:2201.12558 [pdf, other]
-
Title: The KFIoU Loss for Rotated Object DetectionAuthors: Xue Yang, Yue Zhou, Gefan Zhang, Jirui Yang, Wentao Wang, Junchi Yan, Xiaopeng Zhang, Qi TianComments: 18 pages, 6 figures, 8 tables, accepted by ICLR 2023, TensorFlow code: this https URL, PyTorch code: this https URL, Jittor code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [733] arXiv:2201.12559 [pdf, other]
-
Title: Rebalancing Batch Normalization for Exemplar-based Class-Incremental LearningComments: CVPR 2023 camera readySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [734] arXiv:2201.12576 [pdf, other]
-
Title: Scale-arbitrary Invertible Image DownscalingSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [735] arXiv:2201.12592 [pdf, other]
-
Title: Exact Decomposition of Joint Low Rankness and Local Smoothness Plus Sparse MatricesComments: 15 pages, 14 figures, 4 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [736] arXiv:2201.12596 [pdf, other]
-
Title: MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage LearningComments: Accepted by ACM MM22Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [737] arXiv:2201.12599 [pdf, other]
-
Title: Semantic-assisted image compressionAuthors: Qizheng Sun (1), Caili Guo (1), Yang Yang (1), Jiujiu Chen (1), Xijun Xue (2) ((1) bupt.edu.cn, (2) chinatelecom.cn )Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [738] arXiv:2201.12622 [pdf, ps, other]
-
Title: Hand Gesture Recognition of Dumb Person Using one Against All Neural NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [739] arXiv:2201.12625 [pdf, ps, other]
-
Title: ADC-Net: An Open-Source Deep Learning Network for Automated Dispersion Compensation in Optical Coherence TomographyAuthors: Shaiban Ahmed (1), David Le (1), Taeyoon Son (1), Tobiloba Adejumo (1), Xincheng Yao (1,2) (1) Department of Biomedical Engineering, University of Illinois at Chicago (2) Department of Ophthalmology, Visual Science, University of Illinois at ChicagoComments: 18 pages, 5 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Tissues and Organs (q-bio.TO)
- [740] arXiv:2201.12626 [pdf, other]
-
Title: Assessing Cross-dataset Generalization of Pedestrian Crossing PredictorsComments: Submitted to the 33rd IEEE Intelligent Vehicles SymposiumSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [741] arXiv:2201.12633 [pdf, other]
-
Title: Image Classification using Graph Neural Network and Multiscale Wavelet SuperpixelsComments: 17 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [742] arXiv:2201.12646 [pdf, other]
-
Title: Self Semi Supervised Neural Architecture Search for Semantic SegmentationComments: 21 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [743] arXiv:2201.12649 [pdf, ps, other]
-
Title: Transfer Learning for Estimation of Pendubot Angular Position Using Deep Neural NetworksAuthors: Sina KhanaghaSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Systems and Control (eess.SY)
- [744] arXiv:2201.12693 [pdf, other]
-
Title: Extracting Built Environment Features for Planning Research with Computer Vision: A Review and Discussion of State-of-the-Art ApproachesComments: CUPUM 2021 (The 17th International Conference on Computational Urban Planning and Urban Management)Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [745] arXiv:2201.12705 [pdf, ps, other]
-
Title: A Robust Framework for Deep Learning Approaches to Facial Emotion Recognition and EvaluationSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [746] arXiv:2201.12709 [pdf, other]
-
Title: Low-Rank Tensor Completion Based on Bivariate Equivalent Minimax-Concave PenaltyComments: arXiv admin note: text overlap with arXiv:2109.12257Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [747] arXiv:2201.12712 [pdf, other]
-
Title: Win the Lottery Ticket via Fourier Analysis: Frequencies Guided Network PruningComments: accepted to ICASSP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [748] arXiv:2201.12723 [pdf, other]
-
Title: A Frustratingly Simple Approach for End-to-End Image CaptioningComments: Work in progressSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [749] arXiv:2201.12725 [pdf, other]
-
Title: Generalized Global Ranking-Aware Neural Architecture Ranker for Efficient Image Classifier SearchSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [750] arXiv:2201.12728 [pdf, other]
-
Title: Video-based Facial Micro-Expression Analysis: A Survey of Datasets, Features and AlgorithmsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [751] arXiv:2201.12733 [pdf, other]
- [752] arXiv:2201.12763 [pdf, other]
-
Title: RIM-Net: Recursive Implicit Fields for Unsupervised Learning of Hierarchical Shape StructuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [753] arXiv:2201.12765 [pdf, other]
-
Title: Improving Robustness by Enhancing Weak SubnetsComments: To appear in ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [754] arXiv:2201.12769 [pdf, other]
-
Title: MVP-Net: Multiple View Pointwise Semantic Segmentation of Large-Scale Point CloudsJournal-ref: 30. International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision (WSCG), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [755] arXiv:2201.12771 [pdf, other]
-
Title: Self-Supervised Moving Vehicle Detection from Audio-Visual CuesComments: 8 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [756] arXiv:2201.12792 [pdf, other]
-
Title: SelfRecon: Self Reconstruction Your Digital Avatar from Monocular VideoComments: CVPR 2022, Oral. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [757] arXiv:2201.12805 [pdf, other]
-
Title: Automatic Segmentation of Left Ventricle in Cardiac Magnetic Resonance ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [758] arXiv:2201.12813 [pdf, other]
-
Title: Contrastive Learning from DemonstrationsJournal-ref: IEEE Robotic Computing, Naples, Italy, December 5-7, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [759] arXiv:2201.12826 [pdf, other]
-
Title: OptG: Optimizing Gradient-driven Criteria in Network SparsityComments: 11 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [760] arXiv:2201.12828 [pdf, other]
-
Title: Comprehensive Saliency Fusion for Object Co-segmentationComments: Published in IEEE ISM 2021. Please cite this paper in the following manner. H. S. Chhabra and K. Rao Jerripothula, "Comprehensive Saliency Fusion for Object Co-segmentation," 2021 IEEE International Symposium on Multimedia (ISM), 2021, pp. 107-110, doi: 10.1109/ISM52913.2021.00026Journal-ref: 2021 IEEE International Symposium on Multimedia (ISM), 2021, pp. 107-110Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
- [761] arXiv:2201.12888 [pdf, other]
-
Title: A Dataset for Medical Instructional Video Classification and Question AnsweringSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [762] arXiv:2201.12903 [pdf, other]
-
Title: Aggregating Global Features into Local Vision TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [763] arXiv:2201.12944 [pdf, other]
-
Title: Deep Learning Approaches on Image Captioning: A ReviewComments: 41 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [764] arXiv:2201.12961 [pdf, other]
-
Title: Plug-In Inversion: Model-Agnostic Inversion for Vision with Data AugmentationsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [765] arXiv:2201.13013 [pdf, other]
-
Title: A Simple And Effective Filtering Scheme For Improving Neural FieldsAuthors: Yixin ZhuangComments: Accepted to Computational Visual MediaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [766] arXiv:2201.13027 [pdf, other]
-
Title: BOAT: Bilateral Local Attention Vision TransformerComments: BMVC2022 oralSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [767] arXiv:2201.13063 [pdf, other]
-
Title: NeuralTailor: Reconstructing Sewing Pattern Structures from 3D Point Clouds of GarmentsComments: Updated to the version accepted to SIGGRAPH 2022 (Journal Track)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
- [768] arXiv:2201.13065 [pdf, other]
-
Title: Rigidity Preserving Image Transformations and Equivariance in PerspectiveComments: v2: Substantially revised version. Among other things, experiments with the PixLoc model addedSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [769] arXiv:2201.13066 [pdf, ps, other]
-
Title: Single Object Tracking: A Survey of Methods, Datasets, and Evaluation MetricsComments: 15 pages. This paper reviews object tracking methods. It was first published at the ICCKE2019 conference and is extended in this new versionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [770] arXiv:2201.13078 [pdf, other]
-
Title: Lymphoma segmentation from 3D PET-CT images using a deep evidential networkComments: Preprint submitted to International Journal of Approximate ReasoningJournal-ref: International Journal of Approximate Reasoning, Volume 149, 2022, Pages 39-60Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [771] arXiv:2201.13081 [pdf, other]
-
Title: Unsupervised Anomaly Detection in 3D Brain MRI using Deep Learning with Multi-Task Brain Age PredictionAuthors: Marcel Bengs, Finn Behrendt, Max-Heinrich Laves, Julia Krüger, Roland Opfer, Alexander SchlaeferComments: Accepted at SPIE Medical Imaging 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [772] arXiv:2201.13084 [pdf, other]
-
Title: Crowd-powered Face Manipulation Detection: Fusing Human Examiner DecisionsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [773] arXiv:2201.13100 [pdf, other]
-
Title: Adversarial Masking for Self-Supervised LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [774] arXiv:2201.13164 [pdf, other]
-
Title: Imperceptible and Multi-channel Backdoor Attack against Deep Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [775] arXiv:2201.13178 [pdf, other]
-
Title: Few-Shot Backdoor Attacks on Visual Object TrackingComments: This work is accepted by the ICLR 2022. The first two authors contributed equally to this work. In this version, we fix some typos and errors contained in the last one. 21 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [776] arXiv:2201.13182 [pdf, other]
-
Title: Learning Super-Features for Image RetrievalComments: ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [777] arXiv:2201.13229 [pdf, other]
-
Title: Network-level Safety Metrics for Overall Traffic Safety Assessment: A Case StudyAuthors: Xiwen Chen, Hao Wang, Abolfazl Razi, Brendan Russo, Jason Pacheco, John Roberts, Jeffrey Wishart, Larry Head, Alonso Granados BacaSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
- [778] arXiv:2201.13271 [pdf, other]
-
Title: StRegA: Unsupervised Anomaly Detection in Brain MRIs using a Compact Context-encoding Variational AutoencoderAuthors: Soumick Chatterjee, Alessandro Sciarra, Max Dünnwald, Pavan Tummala, Shubham Kumar Agrawal, Aishwarya Jauhari, Aman Kalra, Steffen Oeltze-Jafra, Oliver Speck, Andreas NürnbergerJournal-ref: Computers in Biology and Medicine, 106093 (2022)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [779] arXiv:2201.13278 [pdf, other]
-
Title: Combining Local and Global Pose Estimation for Precise Tracking of Similar ObjectsComments: Accepted at VISAPP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [780] arXiv:2201.13279 [pdf, other]
-
Title: UQGAN: A Unified Model for Uncertainty Quantification of Deep Classifiers trained via Conditional GANsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [781] arXiv:2201.13291 [pdf, other]
-
Title: Metrics for saliency map evaluation of deep learning explanation methodsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [782] arXiv:2201.13312 [pdf, other]
-
Title: On scale-invariant properties in natural images and their simulationsComments: 7 pages, 13 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [783] arXiv:2201.13322 [pdf, other]
-
Title: Learning to Hash Naturally SortsComments: IJCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [784] arXiv:2201.13338 [pdf, other]
-
Title: Modeling the Background for Incremental and Weakly-Supervised Semantic SegmentationComments: Accepted by T-PAMI (this https URL). arXiv admin note: substantial text overlap with arXiv:2002.00718Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [785] arXiv:2201.13392 [src]
-
Title: MHSnet: Multi-head and Spatial Attention Network with False-Positive Reduction for Pulmonary Nodules DetectionAuthors: Juanyun Mai, Minghao Wang, Jiayin Zheng, Yanbo Shao, Zhaoqi Diao, Xinliang Fu, Yulong Chen, Jianyu Xiao, Jian You, Airu Yin, Yang Yang, Xiangcheng Qiu, Jinsheng Tao, Bo Wang, Hua JiComments: We have to revise the experiment results and conclusionsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [786] arXiv:2201.13433 [pdf, other]
-
Title: Third Time's the Charm? Image and Video Editing with StyleGAN3Authors: Yuval Alaluf, Or Patashnik, Zongze Wu, Asif Zamir, Eli Shechtman, Dani Lischinski, Daniel Cohen-OrComments: Project page available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [787] arXiv:2201.00063 (cross-list from eess.SY) [pdf, other]
-
Title: Croesus: Multi-Stage Processing and Transactions for Video-Analytics in Edge-Cloud SystemsComments: Published in ICDE2022Subjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
- [788] arXiv:2201.00148 (cross-list from cs.LG) [pdf, other]
-
Title: Rethinking Feature Uncertainty in Stochastic Neural Networks for Adversarial RobustnessSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [789] arXiv:2201.00168 (cross-list from cs.LG) [pdf, other]
-
Title: Self-attention Multi-view Representation Learning with Diversity-promoting ComplementaritySubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [790] arXiv:2201.00171 (cross-list from cs.LG) [pdf, other]
-
Title: Multi-view Subspace Adaptive Learning via Autoencoder and AttentionSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [791] arXiv:2201.00308 (cross-list from cs.LG) [pdf, other]
-
Title: DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional LatentsComments: 12 pages main content. Camera-Ready version accepted at Transactions on Machine Learning ResearchSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [792] arXiv:2201.00511 (cross-list from cs.MM) [pdf, ps, other]
-
Title: Centre Symmetric Quadruple Pattern: A Novel Descriptor for Facial Image Recognition and RetrievalComments: arXiv admin note: text overlap with arXiv:2201.00504Journal-ref: Pattern Recognition Letters, vol-115, pp.50-58, (2018). (Elsevier) ISSN/ISBN: 0167-8655Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
- [793] arXiv:2201.00596 (cross-list from cs.RO) [pdf, other]
-
Title: LiDAR Point--to--point Correspondences for Rigorous Registration of Kinematic Scanning in Dynamic NetworksSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [794] arXiv:2201.00604 (cross-list from cs.LG) [pdf, other]
-
Title: An analysis of over-sampling labeled data in semi-supervised learning with FixMatchAuthors: Miquel Martí i Rabadán, Sebastian Bujwid, Alessandro Pieropan, Hossein Azizpour, Atsuto MakiComments: 10 pages, 3 figures. Published at NLDL 2022Journal-ref: Vol. 3 (2022): Proceedings of the Northern Lights Deep Learning Workshop 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [795] arXiv:2201.00693 (cross-list from cs.IR) [pdf, other]
-
Title: Multimodal Entity Tagging with Multimodal Knowledge BaseComments: 11 pages, 4 figuresSubjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [796] arXiv:2201.00849 (cross-list from cs.LG) [pdf, other]
-
Title: Delving into Sample Loss Curve to Embrace Noisy and Imbalanced DataComments: Accepted by AAAI-2022Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [797] arXiv:2201.01003 (cross-list from cs.LG) [pdf, other]
-
Title: Aligning Domain-specific Distribution and Classifier for Cross-domain Classification from Multiple SourcesComments: AAAI 2019 long paper. Multi-source Domain AdaptationSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [798] arXiv:2201.01155 (cross-list from cs.LG) [pdf, other]
-
Title: DeepVisualInsight: Time-Travelling Visualization for Spatio-Temporal Causality of Deep Classification TrainingComments: Accepted in AAAI'22Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Software Engineering (cs.SE)
- [799] arXiv:2201.01222 (cross-list from cs.LG) [pdf, other]
-
Title: The cluster structure functionSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [800] arXiv:2201.01230 (cross-list from cs.LG) [pdf, other]
-
Title: Robust Semi-supervised Federated Learning for Images Automatic Recognition in Internet of DronesAuthors: Zhe Zhang, Shiyao Ma, Zhaohui Yang, Zehui Xiong, Jiawen Kang, Yi Wu, Kejia Zhang, Dusit NiyatoComments: arXiv admin note: text overlap with arXiv:2110.13388Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [801] arXiv:2201.01250 (cross-list from cs.LG) [pdf, other]
-
Title: Transfer Learning for Retinal Vascular Disease Detection: A Pilot Study with Diabetic Retinopathy and Retinopathy of PrematuritySubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [802] arXiv:2201.01353 (cross-list from cs.LG) [pdf, other]
-
Title: Linear Variational State-Space FilteringComments: 18 pages, 6 figures. Fixed proof in appendix. For associated code, see this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
- [803] arXiv:2201.01367 (cross-list from cs.RO) [pdf, other]
-
Title: DenseTact: Optical Tactile Sensor for Dense Shape ReconstructionSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [804] arXiv:2201.01466 (cross-list from cs.AI) [pdf, ps, other]
-
Title: Challenges of Artificial Intelligence -- From Machine Learning and Computer Vision to Emotional IntelligenceComments: 234 pages. Published as an electronic publication at the University of Oulu, Finland, in December 2021, ISBN: 978-952-62-3199-0 link this http URLSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [805] arXiv:2201.01488 (cross-list from cs.LG) [pdf, other]
-
Title: Exemplar-free Class Incremental Learning via Discriminative and Comparable One-class ClassifiersJournal-ref: Pattern Recognition, 2023, 140: 109561Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [806] arXiv:2201.01490 (cross-list from cs.LG) [pdf, other]
-
Title: Debiased Learning from Naturally Imbalanced Pseudo-LabelsComments: Accepted by CVPR 2022Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [807] arXiv:2201.01760 (cross-list from cs.RO) [pdf, other]
-
Title: Multi-Robot Collaborative Perception with Graph Neural NetworksComments: 8 pages, 10 figures, 3 tables, Accepted at the IEEE Robotics and Automation Letters (RAL) and the IEEE International Conference on Robotics and Automation (ICRA), 2022Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [808] arXiv:2201.01763 (cross-list from cs.SD) [pdf, other]
-
Title: Robust Self-Supervised Audio-Visual Speech RecognitionComments: Interspeech 2022Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
- [809] arXiv:2201.01806 (cross-list from cs.LG) [pdf, other]
-
Title: Revisiting Deep Subspace Alignment for Unsupervised Domain AdaptationComments: arXiv admin note: text overlap with arXiv:1906.04338Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [810] arXiv:2201.01819 (cross-list from cs.LG) [pdf, other]
-
Title: Formal Analysis of Art: Proxy Learning of Visual Concepts from Style Through Language ModelsComments: 23 pages, This paper is an extended version of a paper that will be published at the 36th AAAI Conference on Artificial Intelligence, to be held in Vancouver, BC, Canada, February 22 - March 1, 2022Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [811] arXiv:2201.01873 (cross-list from cs.GR) [pdf, other]
-
Title: NeuralMLS: Geometry-Aware Control Point DeformationComments: Eurographics 2022 Short PapersSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [812] arXiv:2201.01922 (cross-list from cs.LG) [pdf, other]
-
Title: Contrastive Neighborhood AlignmentAuthors: Pengkai Zhu, Zhaowei Cai, Yuanjun Xiong, Zhuowen Tu, Luis Goncalves, Vijay Mahadevan, Stefano SoattoComments: 10 pages, 7 tables, 3 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [813] arXiv:2201.01978 (cross-list from cs.LG) [pdf, other]
-
Title: An Abstraction-Refinement Approach to Verifying Convolutional Neural NetworksSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Logic in Computer Science (cs.LO)
- [814] arXiv:2201.02057 (cross-list from cs.LG) [pdf, other]
-
Title: GLAN: A Graph-based Linear Assignment NetworkSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [815] arXiv:2201.02478 (cross-list from cs.LG) [pdf, other]
-
Title: Bayesian Neural Networks for Reversible SteganographyAuthors: Ching-Chun ChangJournal-ref: IEEE Access (2022), vol. 10, pp. 36327-36334Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [816] arXiv:2201.02610 (cross-list from cs.GR) [pdf, other]
-
Title: Embodied Hands: Modeling and Capturing Hands and Bodies TogetherComments: SIGGRAPH ASIA 2017Journal-ref: ACM Transactions on Graphics, Vol. 36, No. 6, Article 245. Publication date: November 2017Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [817] arXiv:2201.02620 (cross-list from cs.LG) [pdf, other]
-
Title: Compressing Models with Few Samples: Mimicking then ReplacingComments: 12 pages, 3 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [818] arXiv:2201.02693 (cross-list from cs.LG) [pdf, other]
-
Title: BottleFit: Learning Compressed Representations in Deep Neural Networks for Effective and Efficient Split ComputingComments: Accepted to IEEE WoWMoM 2022. Code and models are available at this https URLJournal-ref: 2022 IEEE 23rd International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
- [819] arXiv:2201.02711 (cross-list from cs.LG) [pdf, other]
-
Title: Block Walsh-Hadamard Transform Based Binary Layers in Deep Neural NetworksComments: This paper has been accepted by ACM Transactions on Embedded Computing SystemsSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [820] arXiv:2201.03102 (cross-list from cs.LG) [pdf, other]
-
Title: Preserving Domain Private Representation via Mutual Information MaximizationSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [821] arXiv:2201.03215 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Handwriting recognition and automatic scoring for descriptive answers in Japanese language testsComments: Keywords: handwritten Japanese answers, handwriting recognition, automatic scoring, ensemble recognition, deep neural networks; Reported in IEICE technical report, PRMU2021-32, pp.45-50 (2021.12) Published after peer review and Presented in ICFHR2022, Lecture Notes in Computer Science, vol. 13639, pp. 274-284 (2022.11)Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [822] arXiv:2201.03364 (cross-list from cs.RO) [pdf, other]
-
Title: High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAMComments: 6 pages plus references, 5 figuresSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [823] arXiv:2201.03446 (cross-list from cs.GR) [pdf, ps, other]
-
Title: Two Methods for Iso-Surface Extraction from Volumetric Data and Their ComparisonJournal-ref: Machine Graphics & Vision, No.1/2, Vol.9, pp.149-166, Poland Academy of Sciences, Poland, ISSN 1230-0535, 2000Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [824] arXiv:2201.03529 (cross-list from cs.LG) [pdf, other]
-
Title: Head2Toe: Utilizing Intermediate Representations for Better Transfer LearningComments: presented at ICML 2022 (Oral)Journal-ref: ICML 2022, Proceedings of the 39th International Conference on Machine LearningSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [825] arXiv:2201.03668 (cross-list from cs.LG) [pdf, other]
-
Title: Towards Group Robustness in the presence of Partial Group LabelsAuthors: Vishnu Suresh Lokhande, Kihyuk Sohn, Jinsung Yoon, Madeleine Udell, Chen-Yu Lee, Tomas PfisterSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [826] arXiv:2201.03942 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Feature Extraction Framework based on Contrastive Learning with Adaptive Positive and Negative SamplesAuthors: Hongjie ZhangSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [827] arXiv:2201.03969 (cross-list from cs.LG) [pdf, other]
-
Title: Multimodal Representations Learning Based on Mutual Information Maximization and Minimization and Identity Embedding for Multimodal Sentiment AnalysisSubjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [828] arXiv:2201.04014 (cross-list from cs.CR) [pdf, other]
-
Title: Captcha Attack: Turning Captchas Against HumanityComments: Currently under submissionSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [829] arXiv:2201.04100 (cross-list from cs.HC) [pdf, other]
-
Title: Learning to Denoise Raw Mobile UI Layouts for Improving Datasets at ScaleComments: Accepted to ACM CHI 2022Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [830] arXiv:2201.04122 (cross-list from cs.LG) [pdf, other]
-
Title: In Defense of the Unitary Scalarization for Deep Multi-Task LearningComments: NeurIPS 2022 camera-ready version, fixed training loss y axis scaleSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [831] arXiv:2201.04182 (cross-list from cs.LG) [pdf, other]
-
Title: HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot LearningSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [832] arXiv:2201.04194 (cross-list from cs.LG) [pdf, other]
-
Title: Neural Capacitance: A New Perspective of Neural Network Selection via Edge DynamicsComments: 19 pages, 7 figures, neural architecture search, mean-fieldSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [833] arXiv:2201.04235 (cross-list from cs.DC) [pdf, other]
-
Title: SmartDet: Context-Aware Dynamic Control of Edge Task Offloading for Mobile Object DetectionSubjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
- [834] arXiv:2201.04387 (cross-list from cs.RO) [pdf, other]
-
Title: Maximizing Self-supervision from Thermal Image for Effective Self-supervised Learning of Depth and Ego-motionComments: 8 pages, Accepted by IEEE Robotics and Automation Letters (RA-L) with IROS 2022 optionSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [835] arXiv:2201.04439 (cross-list from cs.GR) [pdf, other]
-
Title: Real-Time Style Modelling of Human Locomotion via Feature-Wise Transformations and Local Motion PhasesSubjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [836] arXiv:2201.04473 (cross-list from cs.RO) [pdf, other]
-
Title: Globally Optimal Multi-Scale Monocular Hand-Eye Calibration Using Dual QuaternionsJournal-ref: 2021 International Conference on 3D Vision (3DV)Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [837] arXiv:2201.04569 (cross-list from cs.CR) [pdf, other]
-
Title: Get your Foes Fooled: Proximal Gradient Split Learning for Defense against Model Inversion Attacks on IoMT dataAuthors: Sunder Ali Khowaja, Ik Hyun Lee, Kapal Dev, Muhammad Aslam Jarwar, Nawab Muhammad Faseeh QureshiComments: 10 pages, 5 figures, 2 tablesJournal-ref: IEEE Transactions on Network Science and Engineering, 2022Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [838] arXiv:2201.04733 (cross-list from cs.LG) [pdf, other]
-
Title: Adversarially Robust Classification by Conditional Generative Model InversionSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [839] arXiv:2201.04813 (cross-list from cs.LG) [pdf, other]
-
Title: Recursive Least Squares for Training and Pruning Convolutional Neural NetworksSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [840] arXiv:2201.04990 (cross-list from cs.LG) [pdf, other]
-
Title: Toddler-Guidance Learning: Impacts of Critical Period on Multimodal AI AgentsAuthors: Junseok Park, Kwanyoung Park, Hyunseok Oh, Ganghun Lee, Minsu Lee, Youngki Lee, Byoung-Tak ZhangComments: ICMI2021 Oral Presentation, 9 pages, 9 figuresSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [841] arXiv:2201.05026 (cross-list from cs.AI) [pdf, other]
-
Title: Fantastic Data and How to Query ThemJournal-ref: NeurIPS Data-Centric AI Workshop 2021Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB)
- [842] arXiv:2201.05071 (cross-list from cs.CR) [pdf, other]
-
Title: Evaluation of Neural Networks Defenses and Attacks using NDCG and Reciprocal Rank MetricsComments: 12 pages, 5 figuresJournal-ref: International Journal of Information Security 2022Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [843] arXiv:2201.05125 (cross-list from cs.LG) [pdf, other]
-
Title: GradMax: Growing Neural Networks using Gradient InformationComments: ICLR 2022Journal-ref: International Conference on Learning Representations, 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [844] arXiv:2201.05217 (cross-list from cs.LG) [pdf, other]
-
Title: Learning Enhancement of CNNs via Separation Index Maximizing at the First Convolutional LayerSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [845] arXiv:2201.05279 (cross-list from cs.LG) [pdf, other]
-
Title: Manifoldron: Direct Space Partition via Manifold DiscoveryAuthors: Dayang Wang, Feng-Lei Fan, Bo-Jian Hou, Hao Zhang, Zhen Jia, Boce Zhou, Rongjie Lai, Hengyong Yu, Fei WangSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [846] arXiv:2201.05610 (cross-list from cs.LG) [pdf, other]
-
Title: When less is more: Simplifying inputs aids neural network understandingSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [847] arXiv:2201.05809 (cross-list from cs.LG) [pdf, other]
-
Title: Weighting and Pruning based Ensemble Deep Random Vector Functional Link Network for Tabular Data ClassificationComments: 8 tables, 8 figures, 31 pagesSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [848] arXiv:2201.05938 (cross-list from cs.LG) [pdf, other]
-
Title: GradTail: Learning Long-Tailed Data Using Gradient-based Sample WeightingComments: 15 pages (including Appendix), 8 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [849] arXiv:2201.05977 (cross-list from cs.RO) [pdf, other]
-
Title: Lightweight Object-level Topological Semantic Mapping and Long-term Global Localization based on Graph MatchingComments: 9 pages, 12 figures, 23 referencesSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [850] arXiv:2201.05996 (cross-list from cs.CR) [pdf, ps, other]
-
Title: Hardware Implementation of Multimodal Biometric using Fingerprint and IrisAuthors: Tariq M KhanSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [851] arXiv:2201.06173 (cross-list from cs.LG) [pdf, other]
-
Title: SunCast: Solar Irradiance Nowcasting from Geosynchronous Satellite DataAuthors: Dhileeban Kumaresan, Richard Wang, Ernesto Martinez, Richard Cziva, Alberto Todeschini, Colorado J Reed, Hossein VahabiSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [852] arXiv:2201.06268 (cross-list from cs.AI) [pdf, other]
-
Title: Continual Transformers: Redundancy-Free Attention for Online InferenceComments: 16 pages, 6 figures, 7 tablesJournal-ref: International Conference on Learning Representations, 2023Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [853] arXiv:2201.06321 (cross-list from cs.LG) [pdf, other]
-
Title: Landscape of Neural Architecture Search across sensors: how much do they differ?Comments: This work is under review for a conference publicationSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [854] arXiv:2201.06378 (cross-list from cs.AI) [pdf, other]
-
Title: Self-Supervised Anomaly Detection by Self-Distillation and Negative SamplingAuthors: Nima Rafiee, Rahil Gholamipoorfard, Nikolas Adaloglou, Simon Jaxy, Julius Ramakers, Markus KollmannSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [855] arXiv:2201.06406 (cross-list from cs.AI) [pdf, ps, other]
-
Title: Deep Learning-based Quality Assessment of Clinical Protocol Adherence in Fetal Ultrasound Dating ScansComments: 13 pages, 2 figures, 3 tables. Proceedings of Machine Learning Research, Under Review. Full Paper MIDL 2022 submissionSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [856] arXiv:2201.06494 (cross-list from cs.AI) [pdf, other]
-
Title: AugLy: Data Augmentations for RobustnessSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [857] arXiv:2201.06505 (cross-list from cs.AI) [pdf, ps, other]
-
Title: Data Harmonisation for Information Fusion in Digital Healthcare: A State-of-the-Art Systematic Review, Meta-Analysis and Future Research DirectionsAuthors: Yang Nan, Javier Del Ser, Simon Walsh, Carola Schönlieb, Michael Roberts, Ian Selby, Kit Howard, John Owen, Jon Neville, Julien Guiot, Benoit Ernst, Ana Pastor, Angel Alberich-Bayarri, Marion I. Menzel, Sean Walsh, Wim Vos, Nina Flerin, Jean-Paul Charbonnier, Eva van Rikxoort, Avishek Chatterjee, Henry Woodruff, Philippe Lambin, Leonor Cerdá-Alberich, Luis Martí-Bonmatí, Francisco Herrera, Guang YangComments: 54 pages, 14 figures, accepted by the Information Fusion journalSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [858] arXiv:2201.06599 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Who supervises the supervisor? Model monitoring in production using deep feature embeddings with applications to workpiece inspectionSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [859] arXiv:2201.06618 (cross-list from cs.LG) [pdf, other]
-
Title: VAQF: Fully Automatic Software-Hardware Co-Design Framework for Low-Bit Vision TransformerAuthors: Mengshu Sun, Haoyu Ma, Guoliang Kang, Yifan Jiang, Tianlong Chen, Xiaolong Ma, Zhangyang Wang, Yanzhi WangSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [860] arXiv:2201.06640 (cross-list from cs.LG) [pdf, other]
-
Title: Towards Adversarial Evaluations for Inexact Machine UnlearningAuthors: Shashwat Goel, Ameya Prabhu, Amartya Sanyal, Ser-Nam Lim, Philip Torr, Ponnurangam KumaraguruComments: Tech ReportSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [861] arXiv:2201.07207 (cross-list from cs.LG) [pdf, other]
-
Title: Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied AgentsComments: Project website at this https URLSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [862] arXiv:2201.07383 (cross-list from cs.LG) [pdf, other]
-
Title: Online Deep Learning based on Auto-EncoderComments: 30 pagesJournal-ref: Applied Intelligence (2021)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [863] arXiv:2201.07544 (cross-list from cs.LG) [pdf, other]
-
Title: Simpler is better: spectral regularization and up-sampling techniques for variational autoencodersComments: Submitted to ICASSP 2022, 2022 IEEE International Conference on Acoustics, Speech and Signal ProcessingSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [864] arXiv:2201.07646 (cross-list from cs.LG) [pdf, other]
-
Title: A Survey on Training Challenges in Generative Adversarial Networks for Biomedical Image AnalysisComments: Submitted to the AI Review JournalSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [865] arXiv:2201.07698 (cross-list from cs.HC) [pdf, other]
-
Title: Visualization and Analysis of Wearable Health Data From COVID-19 PatientsAuthors: Susanne K. Suter, Georg R. Spinner, Bianca Hoelz, Sofia Rey, Sujeanthraa Thanabalasingam, Jens Eckstein, Sven HirschComments: 17 pages, 9 figures, conferenceSubjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
- [866] arXiv:2201.07779 (cross-list from cs.RO) [pdf, other]
-
Title: Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic ManipulationComments: Accepted in Robotics and Automation Letters Journal (RA-L 2022). Website at this https URL. 8 pagesSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [867] arXiv:2201.07823 (cross-list from cs.MM) [pdf, ps, other]
-
Title: BLINC: Lightweight Bimodal Learning for Low-Complexity VVC Intra CodingJournal-ref: Journal of Real-Time Image Processing (2022)Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [868] arXiv:2201.07863 (cross-list from cs.RO) [pdf, other]
-
Title: ROS georegistration: Aerial Multi-spectral Image Simulator for the Robot Operating SystemSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [869] arXiv:2201.07882 (cross-list from cs.RO) [pdf, ps, other]
-
Title: An Automated Robotic Arm: A Machine Learning ApproachSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [870] arXiv:2201.07899 (cross-list from cs.CL) [pdf, ps, other]
-
Title: ASL Video Corpora & Sign Bank: Resources Available through the American Sign Language Linguistic Research Project (ASLLRP)Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [871] arXiv:2201.07935 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Towards deep observation: A systematic survey on artificial intelligence techniques to monitor fetus via Ultrasound ImagesAuthors: Mahmood Alzubaidi, Marco Agus, Khalid Alyafei, Khaled A Althelaya, Uzair Shah, Alaa Abd-Alrazaq, Mohammed Anbar, Michel Makhlouf, Mowafa HousehComments: 25 pages, 4 figures, submitted to Artificial Intelligence in MedicineJournal-ref: iScience, Volume 25, Issue 8, 19 August 2022, 104713Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Image and Video Processing (eess.IV)
- [872] arXiv:2201.08142 (cross-list from cs.RO) [pdf, other]
-
Title: Physically Embodied Deep Image OptimisationJournal-ref: 5th Workshop on Machine Learning for Creativity and Design of the Neural Information Processing Systems (NeurIPS) 2021 ConferenceSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [873] arXiv:2201.08266 (cross-list from cs.GR) [src]
-
Title: A Real-Time Rendering Method for Light Field DisplayAuthors: Quanzhen WanComments: We are reminded by our supervisors and peers that we have not taken many potential influential factors into consideration, which might lead to a rather different outcome. If the whole idea will be certified correctly in the future, we will resubmit our updated version at that timeSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [874] arXiv:2201.08279 (cross-list from cs.CG) [pdf, other]
-
Title: Modeling and hexahedral meshing of cerebral arterial networks from centerlinesSubjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
- [875] arXiv:2201.08429 (cross-list from cs.LG) [pdf, other]
-
Title: A Visual Analytics Approach to Building Logistic Regression Models and its Application to Health RecordsComments: 16 pages and 13 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [876] arXiv:2201.08676 (cross-list from cs.LG) [pdf, other]
-
Title: Distance-Ratio-Based Formulation for Metric LearningComments: 17 pages. Code for our experiments is available at this https URL. We may write a new version with experiments using normalized embeddings and common metric learning performance metricsSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [877] arXiv:2201.09130 (cross-list from cs.AI) [pdf, ps, other]
-
Title: Artificial Intelligence for Suicide Assessment using Audiovisual Cues: A ReviewComments: Manuscript submitted to Artificial Intelligence Reviews (2022)Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [878] arXiv:2201.09165 (cross-list from cs.MM) [pdf, other]
-
Title: A Pre-trained Audio-Visual Transformer for Emotion RecognitionComments: Accepted by IEEE ICASSP 2022Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [879] arXiv:2201.09196 (cross-list from cs.LG) [pdf, other]
-
Title: Learning to Predict Gradients for Semi-Supervised Continual LearningComments: Accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS)Journal-ref: IEEE Transactions on Neural Networks and Learning Systems, 2024Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [880] arXiv:2201.09243 (cross-list from cs.CR) [pdf, other]
-
Title: Increasing the Cost of Model Extraction with Calibrated Proof of WorkComments: Published as a conference paper at ICLR 2022 (Spotlight - 5% of submitted papers)Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [881] arXiv:2201.09367 (cross-list from cs.GR) [pdf, other]
-
Title: Sketch2PQ: Freeform Planar Quadrilateral Mesh Design via a Single SketchComments: To appear in IEEE Transactions on Visualization and Computer GraphicsSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [882] arXiv:2201.09463 (cross-list from cs.SE) [pdf, other]
-
Title: Cyber Mobility Mirror for Enabling Cooperative Driving Automation in Mixed Traffic: A Co-Simulation PlatformComments: Accepted by the IEEE Intelligent Transportation Systems MagazineJournal-ref: IEEE Intelligent Transportation Systems Magazine 2022Subjects: Software Engineering (cs.SE); Computer Vision and Pattern Recognition (cs.CV)
- [883] arXiv:2201.09487 (cross-list from cs.CR) [pdf, ps, other]
-
Title: Forgery Attack Detection in Surveillance Video Streams Using Wi-Fi Channel State InformationComments: To appear in IEEE Transactions on Wireless Communications. arXiv admin note: text overlap with arXiv:2101.00848Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [884] arXiv:2201.09671 (cross-list from cs.LG) [pdf, other]
-
Title: Analyzing Multispectral Satellite Imagery of South American Wildfires Using Deep LearningAuthors: Christopher SunComments: IEEE International Conference on Applied Artificial Intelligence (May 2022)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [885] arXiv:2201.09679 (cross-list from cs.LG) [pdf, other]
-
Title: A Review of Deep Transfer Learning and Recent AdvancementsComments: 18 pages, 2 figures, 1 tableJournal-ref: Technologies 2023, 11, 40Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [886] arXiv:2201.09725 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Machine Learning Algorithms for Prediction of Penetration Depth and Geometrical Analysis of Weld in Friction Stir Spot Welding ProcessSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [887] arXiv:2201.09765 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Generative Planning for Temporally Coordinated Exploration in Reinforcement LearningComments: Spotlight paper at the 10th International Conference on Learning Representations (ICLR 2022)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [888] arXiv:2201.09828 (cross-list from cs.LG) [pdf, other]
-
Title: MMLatch: Bottom-up Top-down Fusion for Multimodal Sentiment AnalysisComments: Accepted, ICASSP 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [889] arXiv:2201.09884 (cross-list from cs.LG) [pdf, other]
-
Title: AutoMC: Automated Model Compression based on Domain Knowledge and Progressive search strategySubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [890] arXiv:2201.10000 (cross-list from cs.LG) [pdf, other]
-
Title: Neural Manifold Clustering and EmbeddingSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [891] arXiv:2201.10266 (cross-list from cs.AI) [pdf, other]
-
Title: Combining Commonsense Reasoning and Knowledge Acquisition to Guide Deep Learning in RoboticsComments: 37 pages, 17 figures, 5 tablesSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Logic in Computer Science (cs.LO); Robotics (cs.RO)
- [892] arXiv:2201.10353 (cross-list from cs.LG) [pdf, ps, other]
-
Title: A Multi-modal Fusion Framework Based on Multi-task Correlation Learning for Cancer Prognosis PredictionSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [893] arXiv:2201.10444 (cross-list from cs.LG) [pdf, other]
-
Title: AggMatch: Aggregating Pseudo Labels for Semi-Supervised LearningAuthors: Jiwon Kim, Kwangrok Ryoo, Gyuseong Lee, Seokju Cho, Junyoung Seo, Daehwan Kim, Hansang Cho, Seungryong KimSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [894] arXiv:2201.10859 (cross-list from cs.LG) [pdf, other]
-
Title: Visualizing the Diversity of Representations Learned by Bayesian Neural NetworksComments: 16 pages, 18 figuresJournal-ref: Published in Transactions on Machine Learning Research (11/2023)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [895] arXiv:2201.10890 (cross-list from cs.LG) [pdf, other]
-
Title: One Student Knows All Experts Know: From Sparse to DenseSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [896] arXiv:2201.10899 (cross-list from cs.LG) [pdf, other]
-
Title: Speeding up Heterogeneous Federated Learning with Sequentially Trained SuperclientsComments: Published at the 26th International Conference on Pattern Recognition (ICPR), 2022, pp. 3376-3382Journal-ref: 26th International Conference on Pattern Recognition (ICPR), 2022, pp. 3376-3382Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [897] arXiv:2201.10947 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Enabling Deep Learning on Edge Devices through Filter Pruning and Knowledge TransferSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [898] arXiv:2201.11259 (cross-list from cs.LG) [pdf, other]
-
Title: Controlling Directions Orthogonal to a ClassifierComments: accepted by ICLR 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [899] arXiv:2201.11511 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Density-Aware Hyper-Graph Neural Networks for Graph-based Semi-supervised Node ClassificationSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [900] arXiv:2201.11613 (cross-list from cs.LG) [pdf, other]
-
Title: Domain-Invariant Representation Learning from EEG with Private EncodersAuthors: David Bethge, Philipp Hallgarten, Tobias Grosse-Puppendahl, Mohamed Kari, Ralf Mikut, Albrecht Schmidt, Ozan ÖzdenizciComments: 5 pages, 1 figureSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [901] arXiv:2201.11678 (cross-list from cs.LG) [pdf, other]
-
Title: Unsupervised Change Detection using DRE-CUSUMSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
- [902] arXiv:2201.11679 (cross-list from cs.LG) [pdf, other]
-
Title: DropNAS: Grouped Operation Dropout for Differentiable Architecture SearchSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [903] arXiv:2201.11706 (cross-list from cs.LG) [pdf, other]
-
Title: A Systematic Study of Bias AmplificationSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [904] arXiv:2201.11732 (cross-list from cs.CL) [pdf, other]
-
Title: IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and LanguagesAuthors: Emanuele Bugliarello, Fangyu Liu, Jonas Pfeiffer, Siva Reddy, Desmond Elliott, Edoardo Maria Ponti, Ivan VulićComments: ICML 2022Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [905] arXiv:2201.11812 (cross-list from cs.CR) [pdf, other]
-
Title: A Transfer Learning and Optimized CNN Based Intrusion Detection System for Internet of VehiclesComments: Accepted and to appear in IEEE International Conference on Communications (ICC); Code is available at Github link: this https URLSubjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
- [906] arXiv:2201.11844 (cross-list from cs.CR) [pdf, ps, other]
-
Title: Speckle-based optical cryptosystem and its application for human face recognition via deep learningAuthors: Qi Zhao, Huanhao Li, Zhipeng Yu, Chi Man Woo, Tianting Zhong, Shengfu Cheng, Yuanjin Zheng, Honglin Liu, Jie Tian, Puxiang LaiSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
- [907] arXiv:2201.11857 (cross-list from cs.LG) [pdf, other]
-
Title: Using Shape Metrics to Describe 2D Data PointsAuthors: William Franz LambertiSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [908] arXiv:2201.11944 (cross-list from cs.RO) [pdf, other]
-
Title: DICP: Doppler Iterative Closest Point AlgorithmComments: Accepted at Robotics: Science and Systems (RSS) 2022Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [909] arXiv:2201.11999 (cross-list from cs.SD) [pdf, other]
-
Title: Dual Learning Music Composition and Dance ChoreographyComments: ACMMM 2021 (Oral)Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
- [910] arXiv:2201.12107 (cross-list from cs.AI) [pdf, ps, other]
-
Title: Feature Visualization within an Automated Design Assessment leveraging Explainable Artificial Intelligence MethodsComments: CIRP Design 2021, 10.1016/j.procir.2021.05.075Journal-ref: 2021, Procedia CIRP 100(7):331-336Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [911] arXiv:2201.12114 (cross-list from cs.LG) [pdf, other]
-
Title: Rethinking Attention-Model Explainability through Faithfulness Violation TestComments: Accepted to ICML 2022Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [912] arXiv:2201.12123 (cross-list from cs.LG) [pdf, other]
-
Title: DELAUNAY: a dataset of abstract art for psychophysical and machine learning researchSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
- [913] arXiv:2201.12179 (cross-list from cs.LG) [pdf, other]
-
Title: Plug & Play Attacks: Towards Robust and Flexible Model Inversion AttacksAuthors: Lukas Struppek, Dominik Hintersdorf, Antonio De Almeida Correia, Antonia Adler, Kristian KerstingComments: Accepted by ICML 2022Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [914] arXiv:2201.12240 (cross-list from cs.LG) [pdf, other]
-
Title: Continuous Deep Equilibrium Models: Training Neural ODEs faster by integrating them to InfinitySubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Dynamical Systems (math.DS)
- [915] arXiv:2201.12296 (cross-list from cs.LG) [pdf, other]
-
Title: Benchmarking Robustness of 3D Point Cloud Recognition Against Common CorruptionsComments: Codebase and dataset are included in this https URLSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [916] arXiv:2201.12311 (cross-list from cs.LG) [pdf, ps, other]
-
Title: REET: Robustness Evaluation and Enhancement Toolbox for Computational PathologySubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [917] arXiv:2201.12351 (cross-list from cs.LG) [pdf, other]
-
Title: Low-rank features based double transformation matrices learning for image classificationSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [918] arXiv:2201.12382 (cross-list from cs.AI) [pdf, other]
-
Title: Deep Learning Methods for Abstract Visual Reasoning: A Survey on Raven's Progressive MatricesSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [919] arXiv:2201.12406 (cross-list from cs.LG) [pdf, other]
-
Title: Syfer: Neural Obfuscation for Private Data ReleaseAuthors: Adam Yala, Victor Quach, Homa Esfahanizadeh, Rafael G. L. D'Oliveira, Ken R. Duffy, Muriel Médard, Tommi S. Jaakkola, Regina BarzilaySubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [920] arXiv:2201.12577 (cross-list from cs.CR) [pdf, other]
-
Title: Volley Revolver: A Novel Matrix-Encoding Method for Privacy-Preserving Neural Networks (Inference)Authors: John ChiangComments: The encoding method we proposed in this work, $\texttt{Volley Revolver}$, is particularly tailored for privacy-preserving neural networks. There is a good chance that it can be used to assist the private neural networks training, in which case for the backpropagation algorithm of the fully-connected layer the first matrix $A$ is revolved while the second matrix $B$ is settled to be stillSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [921] arXiv:2201.12604 (cross-list from cs.LG) [pdf, other]
-
Title: Learning Fast, Learning Slow: A General Continual Learning Method based on Complementary Learning SystemComments: Published as a conference paper at ICLR 2022 (camera-ready version)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [922] arXiv:2201.12678 (cross-list from cs.LG) [pdf, ps, other]
-
Title: A Stochastic Bundle Method for Interpolating NetworksSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [923] arXiv:2201.12680 (cross-list from cs.LG) [pdf, other]
-
Title: Understanding Deep Contrastive Learning via Coordinate-wise OptimizationAuthors: Yuandong TianComments: Add code linksSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [924] arXiv:2201.12716 (cross-list from cs.RO) [pdf, other]
-
Title: You Only Demonstrate Once: Category-Level Manipulation from Single Visual DemonstrationJournal-ref: Robotics: Science and Systems (RSS) 2022Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
- [925] arXiv:2201.12803 (cross-list from cs.LG) [pdf, other]
-
Title: Generalizing similarity in noisy setups: the DIBS phenomenonComments: v3: version accepted at ECAI 2023 + Supplementary MaterialSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [926] arXiv:2201.12896 (cross-list from cs.LG) [pdf, other]
-
Title: Augmenting Novelty Search with a Surrogate Model to Engineer Meta-Diversity in Ensembles of ClassifiersComments: 16 pages, 4 figures, 3 tables, EvoStar 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [927] arXiv:2201.12904 (cross-list from cs.LG) [pdf, other]
-
Title: COIN++: Neural Compression Across ModalitiesAuthors: Emilien Dupont, Hrushikesh Loya, Milad Alizadeh, Adam Goliński, Yee Whye Teh, Arnaud DoucetComments: TMLR camera readySubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
- [928] arXiv:2201.12910 (cross-list from cs.LG) [pdf, other]
-
Title: Sparse Centroid-Encoder: A Nonlinear Model for Feature SelectionComments: 13 pages, 56 figures, 5 tables. Used 12 data sets and 5 state-of-the-art models for comparisonSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [929] arXiv:2201.12926 (cross-list from cs.CL) [pdf, other]
-
Title: Compositionality as Lexical SymmetryComments: ACL2023 Final VersionSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [930] arXiv:2201.13168 (cross-list from cs.GR) [pdf, other]
-
Title: SPAGHETTI: Editing Implicit Shapes Through Part Aware GenerationSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [931] arXiv:2201.13190 (cross-list from cs.GR) [pdf, other]
-
Title: Differentiable Neural RadiositySubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [932] arXiv:2201.13361 (cross-list from cs.LG) [pdf, other]
-
Title: Signing the Supermask: Keep, Hide, InvertComments: ICLR 2022 camera readySubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [933] arXiv:2201.00084 (cross-list from eess.IV) [pdf, other]
-
Title: Performance Comparison of Deep Learning Architectures for Artifact Removal in Gastrointestinal Endoscopic ImagingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [934] arXiv:2201.00100 (cross-list from eess.IV) [pdf, other]
-
Title: Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB ImagesComments: Accepted by IEEE TIPSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [935] arXiv:2201.00155 (cross-list from eess.IV) [pdf, other]
-
Title: Adaptive Single Image DeblurringComments: arXiv admin note: substantial text overlap with arXiv:2004.05343Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [936] arXiv:2201.00163 (cross-list from eess.IV) [pdf, other]
-
Title: Development of Diabetic Foot Ulcer Datasets: An OverviewAuthors: Moi Hoon Yap, Connah Kendrick, Neil D. Reeves, Manu Goyal, Joseph M. Pappachan, Bill CassidyComments: Preprint (author copy) to be published in MICCAI DFUC2021 ProceedingsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [937] arXiv:2201.00169 (cross-list from eess.IV) [pdf, other]
-
Title: Dynamic Scene Video Deblurring using Non-Local AttentionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [938] arXiv:2201.00187 (cross-list from eess.IV) [pdf, other]
-
Title: Image Restoration using Feature-guidanceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [939] arXiv:2201.00227 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Deep Learning Applications for Lung Cancer Diagnosis: A systematic reviewComments: 32 pages, 14 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [940] arXiv:2201.00259 (cross-list from eess.IV) [pdf, other]
-
Title: Subspace modeling for fast and high-sensitivity X-ray chemical imagingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [941] arXiv:2201.00317 (cross-list from eess.IV) [pdf, other]
-
Title: Recurrent Feature Propagation and Edge Skip-Connections for Automatic Abdominal Organ SegmentationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [942] arXiv:2201.00337 (cross-list from eess.IV) [pdf, other]
-
Title: Riemannian Nearest-Regularized Subspace Classification for Polarimetric SAR imagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [943] arXiv:2201.00404 (cross-list from q-bio.NC) [pdf, other]
-
Title: MHATC: Autism Spectrum Disorder identification utilizing multi-head attention encoder along with temporal consolidation modulesSubjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [944] arXiv:2201.00414 (cross-list from eess.IV) [pdf, ps, other]
-
Title: FUSeg: The Foot Ulcer Segmentation ChallengeAuthors: Chuanbo Wang, Amirreza Mahbod, Isabella Ellinger, Adrian Galdran, Sandeep Gopalakrishnan, Jeffrey Niezgoda, Zeyun YuSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [945] arXiv:2201.00429 (cross-list from eess.IV) [pdf, other]
-
Title: Image Denoising with Control over Deep Network HallucinationComments: Published in Electronic Imaging 2022, code available at this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [946] arXiv:2201.00458 (cross-list from eess.IV) [pdf, other]
-
Title: Lung-Originated Tumor Segmentation from Computed Tomography Scan (LOTUS) BenchmarkAuthors: Parnian Afshar, Arash Mohammadi, Konstantinos N. Plataniotis, Keyvan Farahani, Justin Kirby, Anastasia Oikonomou, Amir Asif, Leonard Wee, Andre Dekker, Xin Wu, Mohammad Ariful Haque, Shahruk Hossain, Md. Kamrul Hasan, Uday Kamal, Winston Hsu, Jhih-Yuan Lin, M. Sohel Rahman, Nabil Ibtehaz, Sh. M. Amir Foisol, Kin-Man Lam, Zhong Guang, Runze Zhang, Sumohana S. Channappayya, Shashank Gupta, Chander DevSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [947] arXiv:2201.00466 (cross-list from eess.IV) [pdf, other]
-
Title: RFormer: Transformer-based Generative Adversarial Network for Real Fundus Image Restoration on A New Clinical BenchmarkAuthors: Zhuo Deng, Yuanhao Cai, Lu Chen, Zheng Gong, Qiqi Bao, Xue Yao, Dong Fang, Shaochong Zhang, Lan MaComments: IEEE J-BHI 2022; The First Benchmark and First Transformer-based Method for Real Clinical Fundus Image RestorationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [948] arXiv:2201.00636 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Improving Feature Extraction from Histopathological Images Through A Fine-tuning ImageNet ModelSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [949] arXiv:2201.00767 (cross-list from eess.IV) [pdf, other]
-
Title: BDG-Net: Boundary Distribution Guided Network for Accurate Polyp SegmentationComments: Accepted by SPIE Medical Imaging 2022Journal-ref: Proc. SPIE 12032, Medical Imaging 2022: Image Processing, 1203230 (4 April 2022)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [950] arXiv:2201.00820 (cross-list from eess.IV) [pdf, other]
-
Title: Low dosage 3D volume fluorescence microscopy imaging using compressive sensingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Data Analysis, Statistics and Probability (physics.data-an); Instrumentation and Detectors (physics.ins-det); Optics (physics.optics)
- [951] arXiv:2201.00895 (cross-list from eess.IV) [pdf, other]
-
Title: A Gradient Mapping Guided Explainable Deep Neural Network for Extracapsular Extension Identification in 3D Head and Neck Cancer Computed Tomography ImagesAuthors: Yibin Wang, Abdur Rahman, W. Neil. Duggar, P. Russell Roberts, Toms V. Thomas, Linkan Bian, Haifeng WangSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [952] arXiv:2201.00942 (cross-list from eess.IV) [pdf, other]
-
Title: External Attention Assisted Multi-Phase Splenic Vascular Injury Segmentation with Limited DataComments: IEEE TMISubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [953] arXiv:2201.00957 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Stain Normalized Breast Histopathology Image Recognition using Convolutional Neural Networks for Cancer DetectionComments: 26 pages, 11 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [954] arXiv:2201.01014 (cross-list from eess.IV) [pdf, other]
-
Title: Local Motion and Contrast Priors Driven Deep Network for Infrared Small Target Super-ResolutionJournal-ref: JSTARS 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [955] arXiv:2201.01034 (cross-list from eess.IV) [pdf, other]
-
Title: Uncovering the Over-smoothing Challenge in Image Super-Resolution: Entropy-based Quantification and Contrastive OptimizationAuthors: Tianshuo Xu, Lijiang Li, Peng Mi, Xiawu Zheng, Fei Chao, Rongrong Ji, Yonghong Tian, Qiang ShenComments: Accepted in IEEE Transactions on Pattern Analysis and Machine IntelligenceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [956] arXiv:2201.01173 (cross-list from eess.IV) [pdf, other]
-
Title: DeepFGS: Fine-Grained Scalable Coding for Learned Image CompressionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [957] arXiv:2201.01266 (cross-list from eess.IV) [pdf, other]
-
Title: Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI ImagesComments: 13 pages, 3 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [958] arXiv:2201.01380 (cross-list from eess.IV) [pdf, other]
-
Title: Image Processing Methods for Coronal Hole Segmentation, Matching, and Map ClassificationJournal-ref: IEEE Transactions on Image Processing 29 (2019): 1641-1653Subjects: Image and Video Processing (eess.IV); Solar and Stellar Astrophysics (astro-ph.SR); Computer Vision and Pattern Recognition (cs.CV)
- [959] arXiv:2201.01426 (cross-list from eess.IV) [pdf, other]
-
Title: Advancing 3D Medical Image Analysis with Variable Dimension Transform based Supervised 3D Pre-trainingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [960] arXiv:2201.01443 (cross-list from eess.IV) [pdf, other]
-
Title: Neural KEM: A Kernel Method with Deep Coefficient Prior for PET Image ReconstructionComments: arXiv admin note: text overlap with arXiv:2110.01174Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [961] arXiv:2201.01449 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Learning-Based Sparse Whole-Slide Image Analysis for the Diagnosis of Gastric Intestinal MetaplasiaSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [962] arXiv:2201.01453 (cross-list from eess.IV) [pdf, other]
-
Title: Robust photon-efficient imaging using a pixel-wise residual shrinkage networkJournal-ref: Optics Express 30(11):18856-18873, 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [963] arXiv:2201.01458 (cross-list from eess.IV) [pdf, other]
-
Title: Cross-SRN: Structure-Preserving Super-Resolution Network with Cross ConvolutionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [964] arXiv:2201.01492 (cross-list from eess.IV) [pdf, other]
-
Title: FAVER: Blind Quality Prediction of Variable Frame Rate VideosComments: 12 pages, 8 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [965] arXiv:2201.01586 (cross-list from eess.IV) [pdf, other]
-
Title: Learning True Rate-Distortion-Optimization for End-To-End Image CompressionComments: Accepted to DCC as PosterSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [966] arXiv:2201.01778 (cross-list from quant-ph) [pdf, other]
-
Title: Quantum Capsule NetworksComments: 7 pages (main text) + 8 pages (supplementary information), 8 figuresJournal-ref: Quantum Sci. Technol. 8 015016 (2022)Subjects: Quantum Physics (quant-ph); Disordered Systems and Neural Networks (cond-mat.dis-nn); Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [967] arXiv:2201.01832 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Multiple Sclerosis Lesions Segmentation using Attention-Based CNNs in FLAIR ImagesSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [968] arXiv:2201.01838 (cross-list from eess.IV) [pdf, other]
-
Title: Lumbar Bone Mineral Density Estimation from Chest X-ray Images: Anatomy-aware Attentive Multi-ROI ModelingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [969] arXiv:2201.01893 (cross-list from eess.IV) [pdf, other]
-
Title: Flow-Guided Sparse Transformer for Video DeblurringAuthors: Jing Lin, Yuanhao Cai, Xiaowan Hu, Haoqian Wang, Youliang Yan, Xueyi Zou, Henghui Ding, Yulun Zhang, Radu Timofte, Luc Van GoolComments: ICML 2022; The First Transformer-based method for Video DeblurringSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [970] arXiv:2201.02184 (cross-list from eess.AS) [pdf, other]
-
Title: Learning Audio-Visual Speech Representation by Masked Multimodal Cluster PredictionComments: ICLR 2022Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
- [971] arXiv:2201.02198 (cross-list from eess.IV) [pdf, other]
-
Title: 3D Intracranial Aneurysm Classification and Segmentation via Unsupervised Dual-branch LearningComments: under review (corresponding: {xuequan.lu@deakin.edu.au})Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [972] arXiv:2201.02242 (cross-list from eess.IV) [pdf, other]
-
Title: A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image RegistrationAuthors: Aline Sindel (1), Bettina Hohberger (2), Sebastian Fassihi Dehcordi (2), Christian Mardin (2), Robert Lämmer (2), Andreas Maier (1), Vincent Christlein (1) ((1) Pattern Recognition Lab, FAU Erlangen-Nürnberg, (2) Department of Ophthalmology, Universitätsklinikum Erlangen)Comments: 6 pages, 4 figures, 1 table, accepted to BVM 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [973] arXiv:2201.02295 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Persistent Homology for Breast Tumor Classification using Mammogram ScansComments: 14 pagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Algebraic Topology (math.AT)
- [974] arXiv:2201.02309 (cross-list from eess.IV) [pdf, other]
-
Title: A three-dimensional dual-domain deep network for high-pitch and sparse helical CT reconstructionComments: 13 pages, 5 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [975] arXiv:2201.02314 (cross-list from eess.IV) [pdf, other]
-
Title: RestoreDet: Degradation Equivariant Representation for Object Detection in Low Resolution ImagesAuthors: Ziteng Cui, Yingying Zhu, Lin Gu, Guo-Jun Qi, Xiaoxiao Li, Peng Gao, Zenghui Zhang, Tatsuya HaradaComments: 11 pages, 3 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [976] arXiv:2201.02350 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Multiresolution Fully Convolutional Networks to detect Clouds and Snow through Optical Satellite ImagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [977] arXiv:2201.02356 (cross-list from eess.IV) [pdf, other]
-
Title: Cross-Modality Deep Feature Learning for Brain Tumor SegmentationComments: published in Pattern Recognition 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [978] arXiv:2201.02409 (cross-list from eess.IV) [pdf, other]
-
Title: Amplitude SAR Imagery Splicing LocalizationComments: The manuscript has been published in IEEE Access. Changes include the full citation to the IEEE published versionJournal-ref: in IEEE Access, vol. 10, pp. 33882-33899, 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [979] arXiv:2201.02420 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Auto-Weighted Layer Representation Based View Synthesis Distortion Estimation for 3-D Video CodingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [980] arXiv:2201.02428 (cross-list from eess.IV) [pdf, other]
-
Title: Effect of Prior-based Losses on Segmentation Performance: A BenchmarkComments: To be submitted to SPIE: Journal of Medical ImagingSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [981] arXiv:2201.02445 (cross-list from eess.IV) [pdf, other]
-
Title: Negative Evidence Matters in Interpretable Histology Image ClassificationComments: 9 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [982] arXiv:2201.02475 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Domain Adversarial Adaptation for Photon-efficient ImagingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [983] arXiv:2201.02574 (cross-list from eess.IV) [pdf, other]
-
Title: An Incremental Learning Approach to Automatically Recognize Pulmonary Diseases from the Multi-vendor Chest RadiographsComments: Computers in Biology and MedicineJournal-ref: Computers in Biology and Medicine, 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [984] arXiv:2201.02624 (cross-list from eess.IV) [pdf, other]
-
Title: Microdosing: Knowledge Distillation for GAN based CompressionAuthors: Leonhard Helminger, Roberto Azevedo, Abdelaziz Djelouah, Markus Gross, Christopher SchroersComments: BMVC 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [985] arXiv:2201.02625 (cross-list from eess.IV) [pdf, other]
-
Title: FlexHDR: Modelling Alignment and Exposure Uncertainties for Flexible HDR ImagingAuthors: Sibi Catley-Chandar, Thomas Tanay, Lucas Vandroux, Aleš Leonardis, Gregory Slabaugh, Eduardo Pérez-PelliteroComments: Accepted to IEEE Transactions on Image Processing (TIP) 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [986] arXiv:2201.02627 (cross-list from eess.IV) [pdf, other]
-
Title: Learning with Less Labels in Digital Pathology via Scribble Supervision from Natural ImagesComments: To appear in IEEE International Symposium on Biomedical Imaging (ISBI) 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [987] arXiv:2201.02629 (cross-list from eess.IV) [pdf, other]
-
Title: United adversarial learning for liver tumor segmentation and detection of multi-modality non-contrast MRISubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [988] arXiv:2201.02656 (cross-list from eess.IV) [pdf, other]
-
Title: GPU-Net: Lightweight U-Net with more diverse featuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [989] arXiv:2201.02689 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Video Coding for Machines: Partial transmission of SIFT featuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [990] arXiv:2201.02746 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Expert Knowledge-guided Geometric Representation Learning for Magnetic Resonance Imaging-based Glioma GradingComments: 10 pages, 9 figures, 2 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [991] arXiv:2201.02771 (cross-list from eess.IV) [pdf, other]
-
Title: A Sneak Attack on Segmentation of Medical Images Using Deep Neural Network ClassifiersComments: 8 pages, 10 figures. Accepted by IEEE AIPR 2021 (Oral)Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [992] arXiv:2201.02812 (cross-list from eess.IV) [pdf, other]
-
Title: Hyperspectral Image Denoising Using Non-convex Local Low-rank and Sparse Separation with Spatial-Spectral Total Variation RegularizationAuthors: Chong Peng, Yang Liu, Yongyong Chen, Xinxin Wu, Andrew Cheng, Zhao Kang, Chenglizhao Chen, Qiang ChengSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [993] arXiv:2201.02821 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Classification of Hyperspectral Images by Using Spectral Data and Fully Connected Neural NetworkSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [994] arXiv:2201.02831 (cross-list from eess.IV) [pdf, other]
-
Title: CrossMoDA 2021 challenge: Benchmark of Cross-Modality Domain Adaptation techniques for Vestibular Schwannoma and Cochlea SegmentationAuthors: Reuben Dorent, Aaron Kujawa, Marina Ivory, Spyridon Bakas, Nicola Rieke, Samuel Joutard, Ben Glocker, Jorge Cardoso, Marc Modat, Kayhan Batmanghelich, Arseniy Belkov, Maria Baldeon Calisto, Jae Won Choi, Benoit M. Dawant, Hexin Dong, Sergio Escalera, Yubo Fan, Lasse Hansen, Mattias P. Heinrich, Smriti Joshi, Victoriya Kashtanova, Hyeon Gyu Kim, Satoshi Kondo, Christian N. Kruse, Susana K. Lai-Yuen, Hao Li, Han Liu, Buntheng Ly, Ipek Oguz, Hyungseob Shin, Boris Shirokikh, Zixian Su, Guotai Wang, Jianghao Wu, Yanwu Xu, Kai Yao, Li Zhang, Sebastien Ourselin, Jonathan Shapey, Tom VercauterenComments: In Medical Image AnalysisSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [995] arXiv:2201.02832 (cross-list from eess.IV) [pdf, other]
-
Title: SGUIE-Net: Semantic Attention Guided Underwater Image Enhancement with Multi-Scale PerceptionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [996] arXiv:2201.02833 (cross-list from eess.IV) [pdf, other]
-
Title: Weighted Encoding Optimization for Dynamic Single-pixel Imaging and SensingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [997] arXiv:2201.02867 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Generative Modeling for Volume Reconstruction in Cryo-Electron MicroscopySubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
- [998] arXiv:2201.02876 (cross-list from eess.IV) [pdf, other]
-
Title: Defocus Deblur Microscopy via Head-to-Tail Cross-scale FusionComments: published at ICIP 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [999] arXiv:2201.02973 (cross-list from eess.IV) [pdf, other]
-
Title: MAXIM: Multi-Axis MLP for Image ProcessingAuthors: Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan Bovik, Yinxiao LiComments: CVPR 2022 Oral; Code: this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1000] arXiv:2201.02979 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Enhanced total variation minimization for stable image reconstructionComments: 29 pages, 8 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Numerical Analysis (math.NA)
- [1001] arXiv:2201.03016 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Learning from Synthetic InSAR with Vision Transformers: The case of volcanic unrest detectionComments: This work has been submitted to the IEEE for possible publicationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1002] arXiv:2201.03050 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Lung infection and normal region segmentation from CT volumes of COVID-19 casesAuthors: Masahiro Oda, Yuichiro Hayashi, Yoshito Otake, Masahiro Hashimoto, Toshiaki Akashi, Kensaku MoriComments: Accepted paper as a poster presentation at SPIE Medical Imaging 2021Journal-ref: Proceedings of SPIE Medical Imaging 2021: Computer-Aided Diagnosis, Vol.11597, 115972X-1-6Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1003] arXiv:2201.03053 (cross-list from eess.IV) [pdf, other]
-
Title: COVID-19 Infection Segmentation from Chest CT Images Based on Scale UncertaintyAuthors: Masahiro Oda, Tong Zheng, Yuichiro Hayashi, Yoshito Otake, Masahiro Hashimoto, Toshiaki Akashi, Shigeki Aoki, Kensaku MoriComments: Accepted paper as an oral presentation at CLIP 2021, 10th MICCAI CLIP WorkshopJournal-ref: DCL 2021, PPML 2021, LL-COVID19 2021, CLIP 2021, Lecture Notes in Computer Science (LNCS) 12969, pp.88-97Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1004] arXiv:2201.03114 (cross-list from eess.SP) [pdf, other]
-
Title: Signal Reconstruction from Quantized Noisy Samples of the Discrete Fourier TransformSubjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1005] arXiv:2201.03131 (cross-list from astro-ph.GA) [pdf, other]
-
Title: Systematic biases when using deep neural networks for annotating large catalogs of astronomical imagesComments: A&C, acceptedSubjects: Astrophysics of Galaxies (astro-ph.GA); Cosmology and Nongalactic Astrophysics (astro-ph.CO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1006] arXiv:2201.03145 (cross-list from eess.IV) [pdf, other]
-
Title: Enhancing Low-Light Images in Real World via Cross-Image DisentanglementSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1007] arXiv:2201.03186 (cross-list from eess.IV) [pdf, other]
-
Title: MyoPS: A Benchmark of Myocardial Pathology Segmentation Combining Three-Sequence Cardiac Magnetic Resonance ImagesAuthors: Lei Li, Fuping Wu, Sihan Wang, Xinzhe Luo, Carlos Martin-Isla, Shuwei Zhai, Jianpeng Zhang, Yanfei Liu, Zhen Zhang, Markus J. Ankenbrand, Haochuan Jiang, Xiaoran Zhang, Linhong Wang, Tewodros Weldebirhan Arega, Elif Altunok, Zhou Zhao, Feiyan Li, Jun Ma, Xiaoping Yang, Elodie Puybareau, Ilkay Oksuz, Stephanie Bricq, Weisheng Li, Kumaradevan Punithakumar, Sotirios A. Tsaftaris, Laura M. Schreiber, Mingjing Yang, Guocai Liu, Yong Xia, Guotai Wang, Sergio Escalera, Xiahai ZhuangSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1008] arXiv:2201.03195 (cross-list from eess.IV) [pdf, other]
- [1009] arXiv:2201.03210 (cross-list from eess.IV) [pdf, other]
-
Title: Model-Based Image Signal Processors via Learnable DictionariesComments: AAAI 2022Journal-ref: Vol. 36 No. 1: AAAI-22 Technical Tracks 1 (2022) 481-489Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1010] arXiv:2201.03230 (cross-list from eess.IV) [pdf, other]
-
Title: Swin Transformer for Fast MRIAuthors: Jiahao Huang, Yingying Fang, Yinzhe Wu, Huanjun Wu, Zhifan Gao, Yang Li, Javier Del Ser, Jun Xia, Guang YangComments: 55 pages, 19 figures, submitted to Neurocomputing journalSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1011] arXiv:2201.03288 (cross-list from eess.IV) [pdf, other]
-
Title: A statistical shape model for radiation-free assessment and classification of craniosynostosisAuthors: Matthias Schaufelberger, Reinald Peter Kühle, Andreas Wachter, Frederic Weichel, Niclas Hagen, Friedemann Ringwald, Urs Eisenmann, Jürgen Hoffmann, Michael Engel, Christian Freudlsperger, Werner NahmSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1012] arXiv:2201.03319 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Comparison of Representation Learning Techniques for Tracking in time resolved 3D UltrasoundComments: Presented at Medical Imaging with Deep Learning (MIDL) 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1013] arXiv:2201.03481 (cross-list from eess.IV) [pdf, other]
-
Title: Learning Population-level Shape Statistics and Anatomy Segmentation From Images: A Joint Deep Learning ModelSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1014] arXiv:2201.03559 (cross-list from eess.IV) [pdf, other]
-
Title: Demonstrating The Risk of Imbalanced Datasets in Chest X-ray Image-based Diagnostics by Prototypical Relevance PropagationComments: To appear in ISBI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1015] arXiv:2201.03560 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Iterative training of robust k-space interpolation networks for improved image reconstruction with limited scan specific training samplesAuthors: Peter Dawood, Felix Breuer, Paul R. Burd, István Homolya, Johannes Oberberger, Peter M. Jakob, Martin BlaimerComments: Submitted to Magnetic Resonance in MedicineSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1016] arXiv:2201.03644 (cross-list from eess.IV) [pdf, other]
-
Title: 3D Segmentation with Fully Trainable Gabor Kernels and Pearson's Correlation CoefficientComments: This paper was accepted by the International Workshop on Machine Learning in Medical Imaging (MLMI 2022)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1017] arXiv:2201.03669 (cross-list from eess.IV) [pdf, other]
-
Title: Neuroplastic graph attention networks for nuclei segmentation in histopathology imagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [1018] arXiv:2201.03715 (cross-list from eess.IV) [pdf, other]
-
Title: An analysis of reconstruction noise from undersampled 4D flow MRISubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph); Applications (stat.AP); Methodology (stat.ME)
- [1019] arXiv:2201.03777 (cross-list from eess.IV) [pdf, other]
-
Title: Reciprocal Adversarial Learning for Brain Tumor Segmentation: A Solution to BraTS Challenge 2021 Segmentation TaskSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1020] arXiv:2201.03795 (cross-list from eess.IV) [pdf, other]
-
Title: COROLLA: An Efficient Multi-Modality Fusion Framework with Supervised Contrastive Learning for Glaucoma GradingComments: 5 pages, To be published in ISBI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1021] arXiv:2201.03992 (cross-list from eess.IV) [pdf, other]
-
Title: Image quality measurements and denoising using Fourier Ring CorrelationsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1022] arXiv:2201.04138 (cross-list from eess.IV) [pdf, other]
-
Title: Overview of the HECKTOR Challenge at MICCAI 2021: Automatic Head and Neck Tumor Segmentation and Outcome Prediction in PET/CT ImagesAuthors: Vincent Andrearczyk, Valentin Oreiller, Sarah Boughdad, Catherine Chez Le Rest, Hesham Elhalawani, Mario Jreige, John O. Prior, Martin Vallières, Dimitris Visvikis, Mathieu Hatt, Adrien DepeursingeSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1023] arXiv:2201.04229 (cross-list from q-bio.NC) [pdf, other]
-
Title: Brain Signals Analysis Based Deep Learning Methods: Recent advances in the study of non-invasive brain signalsComments: 18 pagesSubjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1024] arXiv:2201.04318 (cross-list from eess.IV) [pdf, other]
-
Title: Knee Cartilage Defect Assessment by Graph Representation and Surface ConvolutionAuthors: Zixu Zhuang, Liping Si, Sheng Wang, Kai Xuan, Xi Ouyang, Yiqiang Zhan, Zhong Xue, Lichi Zhang, Dinggang Shen, Weiwu Yao, Qian WangComments: 10 pages, 4 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1025] arXiv:2201.04370 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Predicting Alzheimer's Disease Using 3DMgNetSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1026] arXiv:2201.04397 (cross-list from eess.IV) [pdf, other]
-
Title: Towards Adversarially Robust Deep Image DenoisingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1027] arXiv:2201.04416 (cross-list from eess.IV) [pdf, other]
-
Title: Optimizing Prediction of MGMT Promoter Methylation from MRI Scans using Adversarial LearningAuthors: Sauman DasSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1028] arXiv:2201.04485 (cross-list from eess.IV) [pdf, other]
-
Title: Depth Estimation from Single-shot Monocular Endoscope Image Using Image Domain Adaptation And Edge-Aware Depth EstimationAuthors: Masahiro Oda, Hayato Itoh, Kiyohito Tanaka, Hirotsugu Takabatake, Masaki Mori, Hiroshi Natori, Kensaku MoriComments: Accepted paper as an oral presentation at Joint MICCAI workshop 2021, AE-CAI/CARE/OR2.0Journal-ref: Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization, 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1029] arXiv:2201.04584 (cross-list from eess.IV) [pdf, other]
-
Title: ECONet: Efficient Convolutional Online Likelihood Network for Scribble-based Interactive SegmentationComments: Accepted at MIDL 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1030] arXiv:2201.04631 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Early Diagnosis of Parkinsons Disease by Analyzing Magnetic Resonance Imaging Brain Scans and Patient CharacteristicsAuthors: Sabrina ZhuSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1031] arXiv:2201.04714 (cross-list from astro-ph.IM) [pdf, other]
-
Title: Partial-Attribution Instance Segmentation for Astronomical Source Detection and DeblendingComments: Accepted to the Fourth Workshop on Machine Learning and the Physical Sciences, NeurIPS 2021, 6 pages, 1 figureSubjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Astrophysics of Galaxies (astro-ph.GA); Computer Vision and Pattern Recognition (cs.CV)
- [1032] arXiv:2201.04769 (cross-list from eess.IV) [pdf, other]
-
Title: MAg: a simple learning-based patient-level aggregation method for detecting microsatellite instability from whole-slide imagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1033] arXiv:2201.04795 (cross-list from eess.IV) [pdf, ps, other]
-
Title: EMT-NET: Efficient multitask network for computer-aided diagnosis of breast cancerSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1034] arXiv:2201.04812 (cross-list from eess.IV) [pdf, other]
-
Title: Unsupervised Domain Adaptation for Cross-Modality Retinal Vessel Segmentation via Disentangling Representation Style Transfer and Collaborative Consistency LearningComments: To be published in ISBI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1035] arXiv:2201.04918 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Realistic Endoscopic Image Generation Method Using Virtual-to-real Image-domain TranslationAuthors: Masahiro Oda, Kiyohito Tanaka, Hirotsugu Takabatake, Masaki Mori, Hiroshi Natori, Kensaku MoriComments: Accepted paper as an oral presentation at the Joint MICCAI workshop MIAR | AE-CAI | CARE 2019Journal-ref: Healthcare Technology Letters, Vol.6, No.6, pp.214-219, 2019Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1036] arXiv:2201.05145 (cross-list from astro-ph.IM) [pdf, other]
-
Title: Fully Adaptive Bayesian Algorithm for Data Analysis, FABADAComments: 13 pages, 6 figures. Accepted for publication in RAS Techniques and InstrumentsSubjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Astrophysics of Galaxies (astro-ph.GA); Solar and Stellar Astrophysics (astro-ph.SR); Computer Vision and Pattern Recognition (cs.CV); Data Analysis, Statistics and Probability (physics.data-an)
- [1037] arXiv:2201.05233 (cross-list from physics.flu-dyn) [pdf, other]
-
Title: Density reconstruction from schlieren images through Bayesian nonparametric modelsAuthors: Bryn Noel Ubald (1), Pranay Seshadri (1 and 2), Andrew Duncan (1 and 2) ((1) The Alan Turing Institute, (2) Imperial College London)Subjects: Fluid Dynamics (physics.flu-dyn); Computer Vision and Pattern Recognition (cs.CV)
- [1038] arXiv:2201.05331 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Semi-automated Virtual Unfolded View Generation Method of Stomach from CT VolumesAuthors: Masahiro Oda, Tomoaki Suito, Yuichiro Hayashi, Takayuki Kitasaka, Kazuhiro Furukawa, Ryoji Miyahara, Yoshiki Hirooka, Hidemi Goto, Gen Iinuma, Kazunari Misawa, Shigeru Nawano, Kensaku MoriComments: Accepted paper as a poster presentation at MICCAI 2013 (International Conference on Medical Image Computing and Computer-Assisted Intervention), Nagoya, JapanJournal-ref: Published in Proceedings of MICCAI 2013, LNCS 8149, pp.332-339, 2013Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [1039] arXiv:2201.05344 (cross-list from eess.IV) [pdf, other]
-
Title: AWSnet: An Auto-weighted Supervision Attention Network for Myocardial Scar and Edema Segmentation in Multi-sequence Cardiac Magnetic Resonance ImagesAuthors: Kai-Ni Wang, Xin Yang, Juzheng Miao, Lei Li, Jing Yao, Ping Zhou, Wufeng Xue, Guang-Quan Zhou, Xiahai Zhuang, Dong NiComments: 19 pages, 10 figures, accepted by Medical Image AnalysisSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1040] arXiv:2201.05373 (cross-list from eess.IV) [pdf, ps, other]
-
Title: A New Deep Hybrid Boosted and Ensemble Learning-based Brain Tumor Analysis using MRIComments: 26 pages, 9 figures, 8 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1041] arXiv:2201.05650 (cross-list from eess.IV) [pdf, other]
-
Title: Disentanglement enables cross-domain Hippocampus SegmentationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1042] arXiv:2201.05768 (cross-list from eess.IV) [pdf, other]
-
Title: Spectral Compressive Imaging Reconstruction Using Convolution and Contextual TransformerSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1043] arXiv:2201.05810 (cross-list from eess.IV) [pdf, other]
-
Title: Two-Stage is Enough: A Concise Deep Unfolding Reconstruction Network for Flexible Video Compressive SensingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1044] arXiv:2201.05865 (cross-list from eess.IV) [pdf, ps, other]
-
Title: SDT-DCSCN for Simultaneous Super-Resolution and Deblurring of Text ImagesAuthors: Hala Neji, Mohamed Ben Halima, Javier Nogueras-Iso, Tarek. M. Hamdani, Abdulrahman M. Qahtani, Omar Almutiry, Habib Dhahri, Adel M. AlimiSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1045] arXiv:2201.05905 (cross-list from eess.IV) [pdf, other]
-
Title: SS-3DCapsNet: Self-supervised 3D Capsule Networks for Medical Segmentation on Less Labeled DataComments: Accepted to ISBI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1046] arXiv:2201.05920 (cross-list from eess.IV) [pdf, other]
-
Title: ViTBIS: Vision Transformer for Biomedical Image SegmentationAuthors: Abhinav SagarComments: Published at Clinical Image-Based Procedures, Distributed and Collaborative Learning, Artificial Intelligence for Combating COVID-19 and Secure and Privacy-Preserving Machine Learning workshop at MICCAI 2021Journal-ref: Springer, Cham 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1047] arXiv:2201.05963 (cross-list from eess.IV) [pdf, ps, other]
-
Title: A Residual Encoder-Decoder Network for Segmentation of Retinal Image-Based Exudates in Diabetic Retinopathy ScreeningSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1048] arXiv:2201.06045 (cross-list from eess.IV) [pdf, other]
-
Title: CISRNet: Compressed Image Super-Resolution NetworkSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1049] arXiv:2201.06052 (cross-list from eess.IV) [pdf, other]
-
Title: Self-Supervision and Multi-Task Learning: Challenges in Fine-Grained COVID-19 Multi-Class Classification from Chest X-raysAuthors: Muhammad Ridzuan, Ameera Ali Bawazir, Ivo Gollini Navarette, Ibrahim Almakky, Mohammad YaqubComments: Accepted to Conference on Medical Image Understanding and Analysis (MIUA) 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1050] arXiv:2201.06086 (cross-list from eess.IV) [pdf, other]
-
Title: Is it Possible to Predict MGMT Promoter Methylation from Brain Tumor MRI Scans using Deep Learning Models?Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1051] arXiv:2201.06133 (cross-list from stat.ML) [pdf, other]
-
Title: On Maximum-a-Posteriori estimation with Plug & Play priors and stochastic gradient descentAuthors: Rémi Laumont, Valentin de Bortoli, Andrés Almansa, Julie Delon, Alain Durmus, Marcelo PereyraSubjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Optimization and Control (math.OC)
- [1052] arXiv:2201.06143 (cross-list from eess.IV) [pdf, other]
-
Title: Robust Scatterer Number Density Segmentation of Ultrasound ImagesComments: Accepted in IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency ControlSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1053] arXiv:2201.06250 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Improving Clinical Diagnosis Performance with Automated X-ray Scan Quality Enhancement AlgorithmsComments: Presented and Accepted in International Conference on Advances in Systems, Control and Computing (AISCC-2020) at Malaviya National Institute of Technology, Jaipur, India, February 27-28, 2020Journal-ref: International Conference on Advances in Systems, Control and Computing (AISCC-2020) at Malaviya National Institute of Technology, Jaipur, India, February 27-28, 2020Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1054] arXiv:2201.06251 (cross-list from eess.IV) [pdf, other]
-
Title: Automatic Segmentation of Head and Neck Tumor: How Powerful Transformers Are?Comments: 8 pages, 2 figures (3 more figures in Appendix), 2 tables; accepted to MIDL conferenceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1055] arXiv:2201.06259 (cross-list from eess.IV) [pdf, other]
-
Title: Segmentation of the Carotid Lumen and Vessel Wall using Deep Learning and Location PriorsAuthors: Florian Thamm, Felix Denzinger, Leonhard Rist, Celia Martin Vicario, Florian Kordon, Andreas MaierComments: Challenge Report - PreprintSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1056] arXiv:2201.06329 (cross-list from eess.IV) [pdf, other]
-
Title: H&E-adversarial network: a convolutional neural network to learn stain-invariant features through Hematoxylin & Eosin regressionAuthors: Niccoló Marini, Manfredo Atzori, Sebastian Otálora, Stephane Marchand-Maillet, Henning MüllerComments: Errata corrige Proceedings of the IEEE/CVF International Conference on Computer Vision 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1057] arXiv:2201.06358 (cross-list from eess.IV) [pdf, other]
-
Title: Few-shot image segmentation for cross-institution male pelvic organs using registration-assisted prototypical learningAuthors: Yiwen Li, Yunguan Fu, Qianye Yang, Zhe Min, Wen Yan, Henkjan Huisman, Dean Barratt, Victor Adrian Prisacariu, Yipeng HuComments: To appear in the proceedings of the IEEE International Symposium on Biomedical Imaging (ISBI) 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1058] arXiv:2201.06383 (cross-list from eess.IV) [pdf, other]
-
Title: Dual Perceptual Loss for Single Image Super-Resolution Using ESRGANSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1059] arXiv:2201.06574 (cross-list from eess.IV) [pdf, other]
-
Title: Neural Computed TomographyComments: this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1060] arXiv:2201.06931 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Equilibrium Models for Video Snapshot Compressive ImagingComments: 9 pages, 7 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1061] arXiv:2201.07066 (cross-list from eess.IV) [pdf, other]
-
Title: Joint denoising and HDR for RAW video sequencesComments: arXiv admin note: text overlap with arXiv:1812.11207Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1062] arXiv:2201.07219 (cross-list from eess.IV) [pdf, other]
-
Title: Contrastive Pretraining for Echocardiography Segmentation with Limited DataSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1063] arXiv:2201.07227 (cross-list from eess.IV) [pdf, other]
-
Title: Explainable Ensemble Machine Learning for Breast Cancer Diagnosis based on Ultrasound Image Texture FeaturesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1064] arXiv:2201.07231 (cross-list from eess.IV) [pdf, other]
-
Title: AI-based Carcinoma Detection and Classification Using Histopathological Images: A Systematic ReviewComments: accepted to Computers in Biology and MedicineSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1065] arXiv:2201.07344 (cross-list from eess.IV) [pdf, other]
-
Title: Lung Swapping Autoencoder: Learning a Disentangled Structure-texture Representation of Chest RadiographsAuthors: Lei Zhou, Joseph Bae, Huidong Liu, Gagandeep Singh, Jeremy Green, Amit Gupta, Dimitris Samaras, Prateek PrasannaComments: Extended version of the MICCAI 2021 paper this https URL The code is available at this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1066] arXiv:2201.07357 (cross-list from eess.IV) [pdf, other]
-
Title: Weakly Supervised Contrastive Learning for Better Severity Scoring of Lung UltrasoundAuthors: Gautam Rajendrakumar Gare, Hai V. Tran, Bennett P deBoisblanc, Ricardo Luis Rodriguez, John Michael GaleottiComments: Under Review for MIDL 2022 conferenceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1067] arXiv:2201.07368 (cross-list from eess.IV) [pdf, other]
-
Title: The Role of Pleura and Adipose in Lung Ultrasound AIAuthors: Gautam Rajendrakumar Gare, Wanwen Chen, Alex Ling Yu Hung, Edward Chen, Hai V. Tran, Tom Fox, Pete Lowery, Kevin Zamora, Bennett P deBoisblanc, Ricardo Luis Rodriguez, John Michael GaleottiComments: Published in MICCAI 2021 workshop on Lessons Learned from the development and application of medical imaging-based AI technologies for combating COVID-19 (LL-COVID19). The first two authors contributed equally to this workJournal-ref: LL-COVID19 2021. Lecture Notes in Computer Science, vol 12969. Springer, ChamSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1068] arXiv:2201.07562 (cross-list from eess.IV) [pdf, other]
-
Title: Learned Cone-Beam CT Reconstruction Using Neural Ordinary Differential EquationsComments: 6 pagesJournal-ref: 7th International Conference on Image Formation in X-Ray Computed Tomography, Proc. Vol. 12304 (2022)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1069] arXiv:2201.07610 (cross-list from math.OC) [pdf, other]
-
Title: Nonlinear Unknown Input Observability and Unknown Input Reconstruction: The General Analytical SolutionAuthors: Agostino MartinelliComments: This paper was published by the journal of Information FusionJournal-ref: Journal of Information Fusion, Volume 85, September 2022, Pages 23-51Subjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV)
- [1070] arXiv:2201.07890 (cross-list from eess.SP) [pdf, other]
-
Title: Convolutional Neural Networks for Spherical Signal Processing via Spherical Haar Tight FrameletsSubjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Functional Analysis (math.FA)
- [1071] arXiv:2201.07891 (cross-list from eess.SP) [pdf, other]
-
Title: Homogenization of Existing Inertial-Based Datasets to Support Human Activity RecognitionSubjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1072] arXiv:2201.08385 (cross-list from eess.IV) [pdf, other]
-
Title: Improving Specificity in Mammography Using Cross-correlation between Wavelet and Fourier TransformAuthors: Liuhua ZhangSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1073] arXiv:2201.08388 (cross-list from eess.IV) [pdf, other]
-
Title: Steerable Pyramid Transform Enables Robust Left Ventricle QuantificationComments: 10 pages, 13 figures, journal paperSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1074] arXiv:2201.08418 (cross-list from eess.IV) [pdf, other]
-
Title: SoftDropConnect (SDC) -- Effective and Efficient Quantification of the Network Uncertainty in Deep MR Image AnalysisSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1075] arXiv:2201.08512 (cross-list from eess.SP) [pdf, other]
-
Title: Vertical Federated Edge Learning with Distributed Integrated Sensing and CommunicationComments: 5 pages, 7 figures, accepted by IEEE Communications LettersSubjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Networking and Internet Architecture (cs.NI)
- [1076] arXiv:2201.08582 (cross-list from eess.IV) [pdf, other]
-
Title: SegTransVAE: Hybrid CNN -- Transformer with Regularization for medical image segmentationAuthors: Quan-Dung Pham (1), Hai Nguyen-Truong (1, 2 and 3), Nam Nguyen Phuong (1), Khoa N. A. Nguyen (1, 2 and 3) ((1) VinBrain JSC., Vietnam, (2) University of Science, Ho Chi Minh City, Vietnam, (3) Vietnam National University, Ho Chi Minh City, Vietnam)Journal-ref: 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1077] arXiv:2201.08706 (cross-list from eess.IV) [pdf, other]
-
Title: SparseAlign: A Super-Resolution Algorithm for Automatic Marker Localization and Deformation Estimation in Cryo-Electron TomographyAuthors: Poulami Somanya Ganguly, Felix Lucka, Holger Kohr, Erik Franken, Hermen Jan Hupkes, K Joost BatenburgSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA); Optimization and Control (math.OC); Quantitative Methods (q-bio.QM)
- [1078] arXiv:2201.08741 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Improving Across-Dataset Brain Tissue Segmentation Using TransformerAuthors: Vishwanatha M. Rao, Zihan Wan, Soroush Arabshahi, David J. Ma, Pin-Yu Lee, Ye Tian, Xuzhe Zhang, Andrew F. Laine, Jia GuoSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1079] arXiv:2201.08865 (cross-list from eess.IV) [pdf, other]
-
Title: On the in vivo recognition of kidney stones using machine learningAuthors: Francisco Lopez-Tiro, Vincent Estrade, Jacques Hubert, Daniel Flores-Araiza, Miguel Gonzalez-Mendoza, Gilberto Ochoa-Ruiz, Christian DaulComments: Paper submitted to IEEE AccessSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1080] arXiv:2201.08935 (cross-list from eess.IV) [pdf, other]
-
Title: SAR Image Change Detection Based on Multiscale Capsule NetworkJournal-ref: in IEEE Geoscience and Remote Sensing Letters, vol. 18, no. 3, pp. 484-488, March 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1081] arXiv:2201.08944 (cross-list from eess.IV) [pdf, other]
-
Title: DCNGAN: A Deformable Convolutional-Based GAN with QP Adaptation for Perceptual Quality Enhancement of Compressed VideoComments: 5 pages, 4 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [1082] arXiv:2201.08955 (cross-list from eess.IV) [pdf, other]
-
Title: Modality Bank: Learn multi-modality images across data centers without sharing medical dataComments: arXiv admin note: substantial text overlap with arXiv:2012.08604Journal-ref: 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 2022, pp. 4758-4763Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1083] arXiv:2201.08964 (cross-list from physics.optics) [pdf, ps, other]
-
Title: Diffractive all-optical computing for quantitative phase imagingComments: 23 Pages, 5 FiguresJournal-ref: Advanced Optical Materials (2022)Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Applied Physics (physics.app-ph)
- [1084] arXiv:2201.09163 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Pulmonary Fissure Segmentation in CT Images Based on ODoS Filter and Shape FeaturesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1085] arXiv:2201.09240 (cross-list from eess.IV) [pdf, other]
-
Title: Learning-Driven Lossy Image Compression; A Comprehensive SurveySubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [1086] arXiv:2201.09267 (cross-list from stat.ML) [pdf, other]
-
Title: Spectral, Probabilistic, and Deep Metric Learning: Tutorial and SurveyComments: To appear as a part of an upcoming textbook on dimensionality reduction and manifold learningSubjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1087] arXiv:2201.09314 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Perceptual cGAN for MRI Super-resolutionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1088] arXiv:2201.09360 (cross-list from eess.IV) [pdf, other]
-
Title: POTHER: Patch-Voted Deep Learning-Based Chest X-ray Bias Analysis for COVID-19 DetectionComments: Accepted at International Conference on Computational Science (ICCS) 2022, LondonSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1089] arXiv:2201.09376 (cross-list from eess.IV) [pdf, other]
-
Title: ReconFormer: Accelerated MRI Reconstruction Using Recurrent TransformerSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1090] arXiv:2201.09400 (cross-list from eess.IV) [pdf, other]
-
Title: Fast MRI Reconstruction: How Powerful Transformers Are?Comments: 5 pages, 5 figures, EMBC 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1091] arXiv:2201.09522 (cross-list from eess.SP) [pdf, other]
-
Title: Accelerated Intravascular Ultrasound Imaging using Deep Reinforcement LearningAuthors: Tristan S.W. Stevens, Nishith Chennakeshava, Frederik J. de Bruijn, Martin Pekař, Ruud J.G. van SlounComments: 5 pages, 3 figures, conferenceJournal-ref: ICASSP 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1092] arXiv:2201.09579 (cross-list from eess.IV) [pdf, other]
-
Title: AutoSeg -- Steering the Inductive Biases for Automatic Pathology SegmentationComments: 8 pages, 3 figures, part of the MICCAI MOOD Challenge 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1093] arXiv:2201.09693 (cross-list from eess.IV) [pdf, other]
-
Title: Shape-consistent Generative Adversarial Networks for multi-modal Medical segmentation mapsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1094] arXiv:2201.09851 (cross-list from eess.IV) [pdf, other]
-
Title: Hyperspectral Image Super-resolution with Deep Priors and Degradation Model InversionComments: Proc. IEEE Int. Conf. on Acoust, Speech, Signal Process. (ICASSP), to be published. Manuscript submitted October 6th, 2021; revised January 8th, 2022; accepted January 22nd, 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1095] arXiv:2201.09867 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Importance of Preprocessing in Histopathology Image Classification Using Deep Convolutional Neural NetworkComments: 6 PagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1096] arXiv:2201.09873 (cross-list from eess.IV) [pdf, other]
-
Title: Transformers in Medical Imaging: A SurveyAuthors: Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, Huazhu FuComments: 41 pages, this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1097] arXiv:2201.09929 (cross-list from math.DG) [pdf, other]
-
Title: Euclidean and Affine Curve ReconstructionComments: This paper is a result of an REU project conducted at the North Carolina State University in the Summer and Fall 2020. This version has several minor correctionsJournal-ref: Involve 17 (2024) 29-63Subjects: Differential Geometry (math.DG); Computer Vision and Pattern Recognition (cs.CV)
- [1098] arXiv:2201.09952 (cross-list from eess.IV) [pdf, ps, other]
-
Title: A Deep Learning Approach for the Detection of COVID-19 from Chest X-Ray Images using Convolutional Neural NetworksSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1099] arXiv:2201.09972 (cross-list from eess.IV) [pdf, ps, other]
-
Title: COVID-19 Detection Using CT Image Based On YOLOv5 NetworkSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1100] arXiv:2201.10166 (cross-list from eess.IV) [pdf, other]
-
Title: Dense Pixel-Labeling for Reverse-Transfer and Diagnostic Learning on Lung Ultrasound for COVID-19 and Pneumonia DetectionAuthors: Gautam Rajendrakumar Gare, Andrew Schoenling, Vipin Philip, Hai V Tran, Bennett P deBoisblanc, Ricardo Luis Rodriguez, John Michael GaleottiComments: Published in 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI) © 2021 IEEEJournal-ref: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), 2021, pp. 1406-1410Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1101] arXiv:2201.10294 (cross-list from eess.IV) [pdf, other]
-
Title: S2MS: Self-Supervised Learning Driven Multi-Spectral CT Image EnhancementSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1102] arXiv:2201.10305 (cross-list from eess.IV) [pdf, other]
-
Title: Mutual information neural estimation for unsupervised multi-modal registration of brain imagesAuthors: Gerard Snaauw (1), Michele Sasdelli (1), Gabriel Maicas (1), Stephan Lau (1 and 2), Johan Verjans (1 and 2), Mark Jenkinson (1 and 2), Gustavo Carneiro (1) ((1) Australian Institute for Machine Learning (AIML), University of Adelaide, Adelaide, Australia, (2) South Australian Health and Medical Research Institute (SAHMRI), Adelaide, Australia)Comments: 4 pages, 4 figures, 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), oral presentationJournal-ref: 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 2022, pp. 3510-3513Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1103] arXiv:2201.10324 (cross-list from eess.IV) [pdf, other]
-
Title: Addressing the Intra-class Mode Collapse Problem using Adaptive Input Image Normalization in GAN-based X-ray ImagesComments: Accepted to the IEEE EMBC22 ConferenceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1104] arXiv:2201.10345 (cross-list from eess.IV) [pdf, other]
-
Title: Ultra Low-Parameter Denoising: Trainable Bilateral Filter Layers in Computed TomographyAuthors: Fabian Wagner, Mareike Thies, Mingxuan Gu, Yixing Huang, Sabrina Pechmann, Mayank Patwari, Stefan Ploner, Oliver Aust, Stefan Uderhardt, Georg Schett, Silke Christiansen, Andreas MaierJournal-ref: Med.Phys. 49 (2022) 5107-5120Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1105] arXiv:2201.10360 (cross-list from eess.SP) [pdf, other]
-
Title: Resource-efficient Deep Neural Networks for Automotive Radar Interference MitigationComments: 15 pages; published in IEEE Journal of Selected Topics in Signal Processing, Special Issue on Recent Advances in Automotive Radar Signal Processing, Volume: 15, Issue: 4, June 2021. arXiv admin note: text overlap with arXiv:2011.12706Journal-ref: IEEE Journal of Selected Topics in Signal Processing, vol. 15, no. 4, pp. 927-940, June 2021Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
- [1106] arXiv:2201.10424 (cross-list from eess.IV) [pdf, other]
-
Title: Improving segmentation of calcified and non-calcified plaques on CCTA-CPR scans via masking of the artery wallAuthors: Antonio Tejero-de-Pablos, Hiroaki Yamane, Yusuke Kurose, Junichi Iho, Youji Tokunaga, Makoto Horie, Keisuke Nishizawa, Yusaku Hayashi, Yasushi Koyama, Tatsuya HaradaComments: Extended abstract (see SPIE for final published version)Journal-ref: SPIE 12465, Medical Imaging 2023: Computer-Aided DiagnosisSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1107] arXiv:2201.10511 (cross-list from eess.IV) [pdf, other]
-
Title: Initial Investigations Towards Non-invasive Monitoring of Chronic Wound Healing Using Deep Learning and Ultrasound Imaging
Authors: Maja Schlereth (1,2), Daniel Stromer (2), Yash Mantri (3), Jason Tsujimoto (3), Katharina Breininger (1), Andreas Maier (2), Caesar Anderson (4), Pranav S. Garimella (5), Jesse V. Jokerst (6) ((1) Department Artificial Intelligence in Biomedical Engineering, FAU Erlangen-Nürnberg, Erlangen, (2) Pattern Recognition Lab, FAU Erlangen-Nürnberg, Erlangen, (3) Department of Bioengineering, University of California, San Diego, (4) Department of Emergency Medicine, San Diego, (5) Division of Nephrology and Hypertension, Department of Medicine, San Diego, (6) Department of Nanoengineering, University of California, San Diego)
Comments: 6 pages, 2 figures, accepted by BVM conference proceedings 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1108] arXiv:2201.10747 (cross-list from eess.IV) [pdf, other]
-
Title: Learning Multiple Probabilistic Degradation Generators for Unsupervised Real World Image Super Resolution
Comments: Accepted to ECCVW 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1109] arXiv:2201.10776 (cross-list from eess.IV) [pdf, other]
-
Title: DSFormer: A Dual-domain Self-supervised Transformer for Accelerated Multi-contrast MRI Reconstruction
Authors: Bo Zhou, Neel Dey, Jo Schlemper, Seyed Sadegh Mohseni Salehi, Chi Liu, James S. Duncan, Michal Sofka
Comments: Accepted at WACV 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1110] arXiv:2201.10849 (cross-list from eess.IV) [pdf, other]
-
Title: Predicting Knee Osteoarthritis Progression from Structural MRI using Deep Learning
Comments: © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1111] arXiv:2201.10885 (cross-list from eess.IV) [pdf, other]
-
Title: Hyperparameter Optimization for COVID-19 Chest X-Ray Classification
Comments: 15 pages, 13 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
- [1112] arXiv:2201.10910 (cross-list from eess.IV) [pdf, other]
-
Title: A Bayesian Based Deep Unrolling Algorithm for Single-Photon Lidar Systems
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1113] arXiv:2201.10981 (cross-list from eess.IV) [pdf, other]
-
Title: Joint Liver and Hepatic Lesion Segmentation in MRI using a Hybrid CNN with Transformer Layers
Authors: Georg Hille, Shubham Agrawal, Pavan Tummala, Christian Wybranski, Maciej Pech, Alexey Surov, Sylvia Saalfeld
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1114] arXiv:2201.11000 (cross-list from eess.IV) [pdf, other]
-
Title: One shot PACS: Patient specific Anatomic Context and Shape prior aware recurrent registration-segmentation of longitudinal thoracic cone beam CTs
Comments: This manuscript is currently under minor revision at IEEE Transactions on Medical Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1115] arXiv:2201.11002 (cross-list from eess.IV) [pdf, other]
-
Title: A Multi-rater Comparative Study of Automatic Target Localization Methods for Epilepsy Deep Brain Stimulation Procedures
Comments: Accepted by SPIE Medical Imaging 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1116] arXiv:2201.11037 (cross-list from eess.IV) [pdf, other]
-
Title: RTNet: Relation Transformer Network for Diabetic Retinopathy Multi-lesion Segmentation
Comments: IEEE Transactions on Medical Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1117] arXiv:2201.11246 (cross-list from eess.IV) [pdf, other]
-
Title: HistoKT: Cross Knowledge Transfer in Computational Pathology
Authors: Ryan Zhang, Jiadai Zhu, Stephen Yang, Mahdi S. Hosseini, Angelo Genovese, Lina Chen, Corwyn Rowsell, Savvas Damaskinos, Sonal Varma, Konstantinos N. Plataniotis
Comments: Accepted in ICASSP2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1118] arXiv:2201.11333 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Few-shot Transfer Learning for Holographic Image Reconstruction using a Recurrent Neural Network
Comments: 10 Pages, 3 Figures
Journal-ref: APL Photonics (2022)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1119] arXiv:2201.11389 (cross-list from eess.IV) [pdf, other]
-
Title: Multi-Frame Quality Enhancement On Compressed Video Using Quantised Data of Deep Belief Networks
Comments: 7 pages, 11 figures and 3 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1120] arXiv:2201.11446 (cross-list from eess.IV) [pdf, other]
-
Title: Pan-tumor CAnine cuTaneous Cancer Histology (CATCH) dataset
Authors: Frauke Wilm, Marco Fragoso, Christian Marzahl, Jingna Qiu, Chloé Puget, Laura Diehl, Christof A. Bertram, Robert Klopfleisch, Andreas Maier, Katharina Breininger, Marc Aubreville
Comments: Submitted to Scientific Data. 15 pages, 9 figures, 6 tables
Journal-ref: Scientific Data vol. 9 (2022)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1121] arXiv:2201.11630 (cross-list from eess.IV) [pdf, other]
-
Title: Automatic Classification of Neuromuscular Diseases in Children Using Photoacoustic Imaging
Authors: Maja Schlereth, Daniel Stromer, Katharina Breininger, Alexandra Wagner, Lina Tan, Andreas Maier, Ferdinand Knieling
Comments: accepted by BVM conference proceedings 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1122] arXiv:2201.11700 (cross-list from eess.IV) [pdf, other]
-
Title: Matched Illumination
Comments: 15 pages, 7 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1123] arXiv:2201.11737 (cross-list from eess.IV) [pdf, ps, other]
-
Title: PRNU Based Source Camera Identification for Webcam and Smartphone Videos
Comments: 4 pages, 5 figures, 4 tables. arXiv admin note: substantial text overlap with arXiv:2107.01885
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1124] arXiv:2201.11793 (cross-list from eess.IV) [pdf, other]
-
Title: Denoising Diffusion Restoration Models
Comments: Project page: this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1125] arXiv:2201.11795 (cross-list from eess.IV) [pdf, other]
-
Title: Neural JPEG: End-to-End Image Compression Leveraging a Standard JPEG Encoder-Decoder
Comments: Accepted in DCC 2022, 11 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1126] arXiv:2201.11864 (cross-list from eess.IV) [pdf, other]
-
Title: Classification of White Blood Cell Leukemia with Low Number of Interpretable and Explainable Features
Authors: William Franz Lamberti
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1127] arXiv:2201.11866 (cross-list from eess.IV) [pdf, other]
-
Title: Calibrating Histopathology Image Classifiers using Label Smoothing
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1128] arXiv:2201.11987 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Computer-aided Recognition and Assessment of a Porous Bioelastomer on Ultrasound Images for Regenerative Medicine Applications
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1129] arXiv:2201.11996 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Networks for Image and Video Super-Resolution
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1130] arXiv:2201.11998 (cross-list from eess.IV) [pdf, other]
-
Title: Image Superresolution using Scale-Recurrent Dense Network
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1131] arXiv:2201.12152 (cross-list from eess.IV) [pdf, other]
-
Title: Carotid artery wall segmentation in ultrasound image sequences using a deep convolutional neural network
Comments: 5 pages, 4 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1132] arXiv:2201.12260 (cross-list from eess.IV) [pdf, other]
-
Title: A Review on Deep-Learning Algorithms for Fetal Ultrasound-Image Analysis
Authors: Maria Chiara Fiorentino, Francesca Pia Villani, Mariachiara Di Cosmo, Emanuele Frontoni, Sara Moccia
Journal-ref: Medical Image Analysis 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1133] arXiv:2201.12389 (cross-list from eess.IV) [pdf, other]
-
Title: DoubleU-Net++: Architecture with Exploit Multiscale Features for Vertebrae Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1134] arXiv:2201.12589 (cross-list from eess.IV) [pdf, other]
-
Title: FedMed-ATL: Misaligned Unpaired Brain Image Synthesis via Affine Transform Loss
Comments: arXiv admin note: text overlap with arXiv:2201.08953
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1135] arXiv:2201.12773 (cross-list from eess.IV) [pdf, other]
-
Title: Practical Noise Simulation for RGB Images
Comments: Reference paper for the code
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1136] arXiv:2201.12785 (cross-list from eess.IV) [pdf, other]
-
Title: TransBTSV2: Towards Better and More Efficient Volumetric Segmentation of Medical Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1137] arXiv:2201.13256 (cross-list from math.OC) [pdf, other]
-
Title: Proximal Denoiser for Convergent Plug-and-Play Optimization with Nonconvex Regularization
Comments: 21 pages. arXiv admin note: text overlap with arXiv:2110.03220
Subjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV)
- [1138] arXiv:2201.13309 (cross-list from physics.data-an) [pdf, other]
-
Title: Accelerating Laue Depth Reconstruction Algorithm with CUDA
Comments: 2015 IEEE International Conference on Cluster Computing
Subjects: Data Analysis, Statistics and Probability (physics.data-an); Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)