Computer Vision and Pattern Recognition
Authors and titles for cs.CV in Jan 2023
Total of 1138 entries
- [1] arXiv:2301.00023 [pdf, other]
  Title: Imitator: Personalized Speech-driven 3D Facial Animation
  Authors: Balamurugan Thambiraja, Ikhsanul Habibie, Sadegh Aliakbarian, Darren Cosker, Christian Theobalt, Justus Thies
  Comments: this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [2] arXiv:2301.00060 [pdf, other]
  Title: Morphology-based non-rigid registration of coronary computed tomography and intravascular images through virtual catheter path optimization
  Authors: Karim Kadry, Abhishek Karmakar, Andreas Schuh, Kersten Peterson, Michiel Schaap, David Marlevi, Charles Taylor, Elazer Edelman, Farhad Nezami
  Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [3] arXiv:2301.00114 [pdf, other]
  Title: Skeletal Video Anomaly Detection using Deep Learning: Survey, Challenges and Future Directions
  Comments: This work has been accepted by IEEE Transactions on Emerging Topics in Computational Intelligence
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [4] arXiv:2301.00122 [pdf, ps, other]
  Title: Hair and Scalp Disease Detection using Machine Learning and Image Processing
  Journal-ref: EJ-Compute.2023;3(1):7-13
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [5] arXiv:2301.00131 [pdf, other]
  Title: Guided Hybrid Quantization for Object detection in Multimodal Remote Sensing Imagery via One-to-one Self-teaching
  Comments: This article has been delivered to TRGS and is under review
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [6] arXiv:2301.00135 [pdf, other]
  Title: TeViS:Translating Text Synopses to Video Storyboards
  Comments: Accepted to ACM Multimedia 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [7] arXiv:2301.00145 [pdf, other]
  Title: Attentional Graph Convolutional Network for Structure-aware Audio-Visual Scene Classification
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [8] arXiv:2301.00146 [pdf, other]
  Title: Peer Learning for Unbiased Scene Graph Generation
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [9] arXiv:2301.00149 [pdf, other]
  Title: Rethinking Rotation Invariance with Point Cloud Registration
  Comments: Accepted by AAAI23
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [10] arXiv:2301.00157 [pdf, other]
  Title: Ponder: Point Cloud Pre-training via Neural Rendering
  Comments: Project page: this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [11] arXiv:2301.00182 [pdf, other]
  Title: Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
  Comments: Accepted by CVPR 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [12] arXiv:2301.00184 [pdf, other]
  Title: Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
  Comments: Accepted by CVPR 2023. Selected as a Highlight (Top 2.5% of ALL submissions)
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [13] arXiv:2301.00190 [src]
  Title: Tracking Passengers and Baggage Items using Multiple Overhead Cameras at Security Checkpoints
  Comments: Mistaken upload. See arXiv:2007.07924 for the latest version
  Journal-ref: IEEE Transactions on Systems, Man, and Cybernetics: Systems, Early Access, 14 December 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [14] arXiv:2301.00230 [pdf, other]
  Title: Disjoint Masking with Joint Distillation for Efficient Masked Image Modeling
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [15] arXiv:2301.00236 [pdf, other]
  Title: DiRaC-I: Identifying Diverse and Rare Training Classes for Zero-Shot Learning
  Comments: 22 pages, 10 Figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [16] arXiv:2301.00250 [pdf, other]
  Title: DensePose From WiFi
  Comments: 13 pages, 10 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [17] arXiv:2301.00264 [pdf, ps, other]
  Title: Application Of ADNN For Background Subtraction In Smart Surveillance System
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [18] arXiv:2301.00265 [pdf, other]
  Title: Source-Free Unsupervised Domain Adaptation: A Survey
  Comments: 19 pages, 10 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [19] arXiv:2301.00330 [pdf, other]
  Title: Efficient On-device Training via Gradient Filtering
  Comments: CVPR2023, 19 pages, 13 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [20] arXiv:2301.00345 [pdf, other]
  Title: MTNeuro: A Benchmark for Evaluating Representations of Brain Structure Across Multiple Levels of Abstraction
  Authors: Jorge Quesada, Lakshmi Sathidevi, Ran Liu, Nauman Ahad, Joy M. Jackson, Mehdi Azabou, Jingyun Xiao, Christopher Liding, Matthew Jin, Carolina Urzay, William Gray-Roncal, Erik C. Johnson, Eva L. Dyer
  Comments: 10 pages, 4 figures, Accepted at NeurIPS 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [21] arXiv:2301.00363 [pdf, ps, other]
  Title: Mapping smallholder cashew plantations to inform sustainable tree crop expansion in Benin
  Authors: Leikun Yin, Rahul Ghosh, Chenxi Lin, David Hale, Christoph Weigl, James Obarowski, Junxiong Zhou, Jessica Till, Xiaowei Jia, Troy Mao, Vipin Kumar, Zhenong Jin
  Journal-ref: Remote Sensing of Environment, 295, p.113695 (2023)
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Applications (stat.AP)
- [22] arXiv:2301.00366 [pdf, other]
  Title: SS-CPGAN: Self-Supervised Cut-and-Pasting Generative Adversarial Network for Object Segmentation
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [23] arXiv:2301.00371 [pdf, other]
  Title: Robust Domain Adaptive Object Detection with Unified Multi-Granularity Alignment
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [24] arXiv:2301.00394 [pdf, other]
- [25] arXiv:2301.00406 [pdf, other]
  Title: Curvature regularization for Non-line-of-sight Imaging from Under-sampled Data
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [26] arXiv:2301.00409 [pdf, other]
  Title: Diffusion Model based Semi-supervised Learning on Brain Hemorrhage Images for Efficient Midline Shift Quantification
  Authors: Shizhan Gong, Cheng Chen, Yuqi Gong, Nga Yan Chan, Wenao Ma, Calvin Hoi-Kwan Mak, Jill Abrigo, Qi Dou
  Comments: 12 pages, 5 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [27] arXiv:2301.00411 [pdf, other]
  Title: Detachable Novel Views Synthesis of Dynamic Scenes Using Distribution-Driven Neural Radiance Fields
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [28] arXiv:2301.00424 [pdf, other]
  Title: GoogLe2Net: Going Transverse with Convolutions
  Authors: Yuanpeng He
  Comments: 33 pages, 7 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [29] arXiv:2301.00436 [pdf, other]
  Title: Hierarchical Explanations for Video Action Recognition
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [30] arXiv:2301.00447 [pdf, other]
  Title: Image To Tree with Recursive Prompting
  Comments: 12 pages, 5 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [31] arXiv:2301.00493 [pdf, other]
  Title: Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting
  Authors: Benjamin Wilson, William Qi, Tanmay Agarwal, John Lambert, Jagjeet Singh, Siddhesh Khandelwal, Bowen Pan, Ratnesh Kumar, Andrew Hartnett, Jhony Kaesemodel Pontes, Deva Ramanan, Peter Carr, James Hays
  Comments: Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [32] arXiv:2301.00514 [pdf, other]
  Title: Rethinking the Video Sampling and Reasoning Strategies for Temporal Sentence Grounding
  Authors: Jiahao Zhu, Daizong Liu, Pan Zhou, Xing Di, Yu Cheng, Song Yang, Wenzheng Xu, Zichuan Xu, Yao Wan, Lichao Sun, Zeyu Xiong
  Comments: Accepted by EMNLP Findings, 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [33] arXiv:2301.00524 [pdf, other]
  Title: Learning Confident Classifiers in the Presence of Label Noise
  Authors: Asma Ahmed Hashmi, Aigerim Zhumabayeva, Nikita Kotelevskii, Artem Agafonov, Mohammad Yaqub, Maxim Panov, Martin Takáč
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [34] arXiv:2301.00527 [pdf, other]
  Title: Diffusion Probabilistic Models for Scene-Scale 3D Categorical Data
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [35] arXiv:2301.00531 [pdf, other]
  Title: Multi-Stage Spatio-Temporal Aggregation Transformer for Video Person Re-identification
  Comments: This manuscript was just accepted for publication as a regular paper in the IEEE Transactions on Multimedia. We have uploaded source PdfLateX files this time
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [36] arXiv:2301.00555 [pdf, other]
  Title: Task-specific Scene Structure Representations
  Comments: 10 pages, 9 figures, Accepted on AAAI 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [37] arXiv:2301.00580 [pdf, other]
  Title: Urban Visual Intelligence: Studying Cities with AI and Street-level Imagery
  Authors: Fan Zhang, Arianna Salazar Miranda, Fábio Duarte, Lawrence Vale, Gary Hack, Min Chen, Yu Liu, Michael Batty, Carlo Ratti
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [38] arXiv:2301.00592 [pdf, other]
  Title: Edge Enhanced Image Style Transfer via Transformers
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [39] arXiv:2301.00596 [pdf, other]
  Title: A contrastive learning approach for individual re-identification in a wild fish population
  Authors: Ørjan Langøy Olsen, Tonje Knutsen Sørdalen, Morten Goodwin, Ketil Malde, Kristian Muri Knausgård, Kim Tallaksen Halvorsen
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [40] arXiv:2301.00618 [pdf, other]
  Title: An Event-based Algorithm for Simultaneous 6-DOF Camera Pose Tracking and Mapping
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [41] arXiv:2301.00620 [pdf, other]
  Title: Dynamically Modular and Sparse General Continual Learning
  Comments: Camera ready version - 18th International Conference on Computer Vision Theory and Applications (VISAPP 2023)
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [42] arXiv:2301.00622 [pdf, other]
  Title: Credible Remote Sensing Scene Classification Using Evidential Fusion on Aerial-Ground Dual-view Images
  Comments: 16 pages, 16 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [43] arXiv:2301.00695 [pdf, other]
  Title: Image-Coupled Volume Propagation for Stereo Matching
  Comments: two-columns, 8 pages, 7 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [44] arXiv:2301.00704 [pdf, other]
  Title: Muse: Text-To-Image Generation via Masked Generative Transformers
  Authors: Huiwen Chang, Han Zhang, Jarred Barber, AJ Maschinot, Jose Lezama, Lu Jiang, Ming-Hsuan Yang, Kevin Murphy, William T. Freeman, Michael Rubinstein, Yuanzhen Li, Dilip Krishnan
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [45] arXiv:2301.00714 [pdf, other]
  Title: Learning Road Scene-level Representations via Semantic Region Prediction
  Comments: 18 pages
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [46] arXiv:2301.00725 [pdf, other]
  Title: Learning Invariance from Generated Variance for Unsupervised Person Re-identification
  Comments: Extension of conference paper arXiv:2012.09071. Accepted to TPAMI. Project page: this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [47] arXiv:2301.00740 [pdf, other]
  Title: P3DC-Shot: Prior-Driven Discrete Data Calibration for Nearest-Neighbor Few-Shot Classification
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [48] arXiv:2301.00746 [pdf, other]
  Title: NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory
  Comments: 13 pages, 7 figures, appearing in CVPR 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [49] arXiv:2301.00772 [pdf, other]
  Title: PCRLv2: A Unified Visual Information Preservation Framework for Self-supervised Pre-training in Medical Image Analysis
  Comments: Accepted by IEEE TPAMI. Codes and pre-trained models are available at this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [50] arXiv:2301.00794 [pdf, other]
  Title: STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos
  Comments: Accepted at ICCV 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [51] arXiv:2301.00805 [pdf, other]
  Title: Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
  Authors: Jianzong Wu, Xiangtai Li, Henghui Ding, Xia Li, Guangliang Cheng, Yunhai Tong, Chen Change Loy
  Comments: ICCV-2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [52] arXiv:2301.00808 [pdf, other]
  Title: ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders
  Authors: Sanghyun Woo, Shoubhik Debnath, Ronghang Hu, Xinlei Chen, Zhuang Liu, In So Kweon, Saining Xie
  Comments: Code and models available at this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [53] arXiv:2301.00812 [pdf, ps, other]
  Title: One-shot domain adaptation in video-based assessment of surgical skills
  Comments: 12 pages (+9 pages of Supplementary Materials), 4 figures (+2 Supplementary Figures), 2 tables (+5 Supplementary Tables)
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [54] arXiv:2301.00896 [pdf, other]
  Title: Efficient Robustness Assessment via Adversarial Spatial-Temporal Focus on Videos
  Comments: accepted by TPAMI2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [55] arXiv:2301.00950 [pdf, other]
  Title: Class-Continuous Conditional Generative Neural Radiance Field
  Comments: BMVC 2023 (Accepted)
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [56] arXiv:2301.00954 [pdf, other]
  Title: PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
  Authors: Xiangtai Li, Shilin Xu, Yibo Yang, Haobo Yuan, Guangliang Cheng, Yunhai Tong, Zhouchen Lin, Ming-Hsuan Yang, Dacheng Tao
  Comments: Extension of PanopticPartFormer (ECCV 2022). Code: this https URL Update Results
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [57] arXiv:2301.00965 [pdf, other]
  Title: OccluMix: Towards De-Occlusion Virtual Try-on by Semantically-Guided Mixup
  Comments: To be published in IEEE T-MM; Code is available at: this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [58] arXiv:2301.00970 [pdf, other]
  Title: Benchmarking the Robustness of LiDAR Semantic Segmentation Models
  Comments: IJCV-2024. The benchmark will be made available at this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [59] arXiv:2301.00973 [pdf, other]
  Title: Detecting Severity of Diabetic Retinopathy from Fundus Images using Ensembled Transformers
  Comments: 9 pages
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [60] arXiv:2301.00975 [pdf, other]
  Title: Surveillance Face Anti-spoofing
  Comments: 15 pages, 9 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [61] arXiv:2301.00985 [pdf, other]
  Title: More is Better: A Database for Spontaneous Micro-Expression with High Frame Rates
  Authors: Sirui Zhao, Huaying Tang, Xinglong Mao, Shifeng Liu, Hanqing Tao, Hao Wang, Tong Xu, Enhong Chen
  Comments: 16 pages, 7 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [62] arXiv:2301.00986 [pdf, other]
  Title: Look, Listen, and Attack: Backdoor Attacks Against Video Action Recognition
  Authors: Hasan Abed Al Kader Hammoud, Shuming Liu, Mohammed Alkhrashi, Fahad AlBalawi, Bernard Ghanem
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [63] arXiv:2301.00989 [pdf, ps, other]
  Title: A New Perspective to Boost Vision Transformer for Medical Image Classification
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [64] arXiv:2301.00998 [pdf, other]
  Title: Vocabulary-informed Zero-shot and Open-set Learning
  Comments: 17 pages, 8 figures. TPAMI 2019 extended from CVPR 2016 (arXiv:1604.07093)
  Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (2019)
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [65] arXiv:2301.01006 [pdf, other]
  Title: Policy Pre-training for Autonomous Driving via Self-supervised Geometric Modeling
  Comments: ICLR2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [66] arXiv:2301.01015 [pdf, other]
  Title: Semi-Structured Object Sequence Encoders
  Authors: Rudra Murthy V, Riyaz Bhat, Chulaka Gunasekara, Siva Sankalp Patel, Hui Wan, Tejas Indulal Dhamecha, Danish Contractor, Marina Danilevsky
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [67] arXiv:2301.01019 [pdf, other]
  Title: Correlation Loss: Enforcing Correlation between Classification and Localization
  Comments: Accepted to AAAI 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [68] arXiv:2301.01033 [pdf, other]
  Title: Dissecting Continual Learning a Structural and Data Analysis
  Authors: Francesco Pelosin
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [69] arXiv:2301.01036 [pdf, other]
  Title: High-Quality Real-Time Rendering Using Subpixel Sampling Reconstruction
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [70] arXiv:2301.01057 [pdf, other]
  Title: BS3D: Building-scale 3D Reconstruction from RGB-D Images
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [71] arXiv:2301.01060 [pdf, other]
  Title: Knowledge-guided Causal Intervention for Weakly-supervised Object Localization
  Comments: 13 pages, 7 figures, 7 tables
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [72] arXiv:2301.01081 [pdf, other]
  Title: StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles
  Authors: Yifeng Ma, Suzhen Wang, Zhipeng Hu, Changjie Fan, Tangjie Lv, Yu Ding, Zhidong Deng, Xin Yu
  Comments: Accepted at AAAI2023 as Oral. Demo: this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [73] arXiv:2301.01100 [pdf, other]
  Title: Understanding Imbalanced Semantic Segmentation Through Neural Collapse
  Comments: Technical Report
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [74] arXiv:2301.01123 [pdf, other]
  Title: MGTAB: A Multi-Relational Graph-Based Twitter Account Detection Benchmark
  Comments: 14 pages, 7 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [75] arXiv:2301.01143 [pdf, other]
  Title: Asymmetric Co-teaching with Multi-view Consensus for Noisy Label Learning
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [76] arXiv:2301.01146 [pdf, other]
  Title: Rethinking Mobile Block for Efficient Attention-based Models
  Authors: Jiangning Zhang, Xiangtai Li, Jian Li, Liang Liu, Zhucun Xue, Boshen Zhang, Zhengkai Jiang, Tianxin Huang, Yabiao Wang, Chengjie Wang
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [77] arXiv:2301.01147 [pdf, other]
  Title: 4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions
  Comments: arXiv admin note: substantial text overlap with arXiv:2009.06364
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [78] arXiv:2301.01149 [pdf, other]
  Title: I2F: A Unified Image-to-Feature Approach for Domain Adaptive Semantic Segmentation
  Comments: To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [79] arXiv:2301.01156 [pdf, other]
  Title: Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation
  Authors: Yue Han, Jiangning Zhang, Zhucun Xue, Chao Xu, Xintian Shen, Yabiao Wang, Chengjie Wang, Yong Liu, Xiangtai Li
  Comments: 10 pages, 5 figures, under review
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [80] arXiv:2301.01161 [pdf, other]
  Title: Procedural Humans for Computer Vision
  Authors: Charlie Hewitt, Tadas Baltrušaitis, Erroll Wood, Lohit Petikam, Louis Florentin, Hanz Cuevas Velasquez
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [81] arXiv:2301.01200 [pdf, other]
  Title: Common Practices and Taxonomy in Deep Multi-view Fusion for Remote Sensing Applications
  Comments: appendix with additional tables. Preprint submitted to journal
  Journal-ref: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, pp. 1-21 (2024)
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [82] arXiv:2301.01201 [pdf, other]
  Title: Uncertainty in Real-Time Semantic Segmentation on Embedded Systems
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [83] arXiv:2301.01202 [pdf, ps, other]
  Title: DGNet: Distribution Guided Efficient Learning for Oil Spill Image Segmentation
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [84] arXiv:2301.01206 [pdf, other]
  Title: Speed up the inference of diffusion models via shortcut MCMC sampling
  Authors: Gang Chen
  Comments: 9
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [85] arXiv:2301.01208 [pdf, other]
  Title: Mask Matching Transformer for Few-Shot Segmentation
  Authors: Siyu Jiao, Gengwei Zhang, Shant Navasardyan, Ling Chen, Yao Zhao, Yunchao Wei, Humphrey Shi
  Comments: 14 pages, 6 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [86] arXiv:2301.01211 [pdf, other]
  Title: Generative appearance replay for continual unsupervised domain adaptation
  Comments: Fixed typos
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [87] arXiv:2301.01216 [pdf, ps, other]
  Title: An end-to-end multi-scale network for action prediction in videos
  Comments: 12 pages, 6 figures
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [88] arXiv:2301.01283 [pdf, other]
  Title: Cross Modal Transformer: Towards Fast and Robust 3D Object Detection
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [89] arXiv:2301.01296 [pdf, other]
  Title: TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models
  Comments: Code is available at this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [90] arXiv:2301.01343 [pdf, other]
  Title: Explainability and Robustness of Deep Visual Classification Models
  Authors: Jindong Gu
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [91] arXiv:2301.01380 [pdf, other]
  Title: Ego-Only: Egocentric Action Detection without Exocentric Transferring
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [92] arXiv:2301.01413 [pdf, other]
  Title: Attribute-Centric Compositional Text-to-Image Generation
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [93] arXiv:2301.01431 [pdf, other]
  Title: Semi-MAE: Masked Autoencoders for Semi-supervised Vision Transformers
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [94] arXiv:2301.01441 [pdf, other]
  Title: Automatically Prepare Training Data for YOLO Using Robotic In-Hand Observation and Synthesis
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [95] arXiv:2301.01449 [pdf, other]
  Title: Building Coverage Estimation with Low-resolution Remote Sensing Imagery
  Authors: Enci Liu, Chenlin Meng, Matthew Kolodner, Eun Jee Sung, Sihang Chen, Marshall Burke, David Lobell, Stefano Ermon
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [96] arXiv:2301.01456 [pdf, other]
  Title: Audio-Visual Efficient Conformer for Robust Speech Recognition
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [97] arXiv:2301.01481 [pdf, other]
  Title: On Fairness of Medical Image Classification with Multiple Sensitive Attributes via Learning Orthogonal Representations
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
- [98] arXiv:2301.01482 [pdf, other]
  Title: Underwater Object Tracker: UOSTrack for Marine Organism Grasping of Underwater Vehicles
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [99] arXiv:2301.01490 [pdf, other]
  Title: Towards a Pipeline for Real-Time Visualization of Faces for VR-based Telepresence and Live Broadcasting Utilizing Neural Rendering
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [100] arXiv:2301.01501 [pdf, other]
  Title: Towards Edge-Cloud Architectures for Personal Protective Equipment Detection
  Authors: Jaroslaw Legierski, Kajetan Rachwal, Piotr Sowinski, Wojciech Niewolski, Przemyslaw Ratuszek, Zbigniew Kopertowski, Marcin Paprzycki, Maria Ganzha
  Comments: Presented on the 4th International Conference on Information Management and Machine Intelligence (ICIMMI 2022). In print
  Journal-ref: ICIMMI 2022: Proceedings of the 4th International Conference on Information Management & Machine Intelligence
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [101] arXiv:2301.01531 [pdf, other]
  Title: MoBYv2AL: Self-supervised Active Learning for Image Classification
  Comments: Poster accepted at BMVC 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [102] arXiv:2301.01583 [pdf, other]
  Title: Why Capsule Neural Networks Do Not Scale: Challenging the Dynamic Parse-Tree Assumption
  Comments: To appear in AAAI 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [103] arXiv:2301.01615 [pdf, other]
  Title: StereoDistill: Pick the Cream from LiDAR for Distilling Stereo-based 3D Object Detection
  Comments: Accepted by AAAI-2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [104] arXiv:2301.01635 [pdf, other]
  Title: SPTS v2: Single-Point Scene Text Spotting
  Authors: Yuliang Liu, Jiaxin Zhang, Dezhi Peng, Mingxin Huang, Xinyu Wang, Jingqun Tang, Can Huang, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin
  Comments: Accepted for publication in TPAMI 2023. arXiv admin note: text overlap with arXiv:2112.07917
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [105] arXiv:2301.01661 [pdf, other]
  Title: RecRecNet: Rectangling Rectified Wide-Angle Images by Thin-Plate Spline Model and DoF-based Curriculum Learning
  Comments: Accepted to ICCV 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [106] arXiv:2301.01767 [pdf, other]
  Title: Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
  Comments: CVPR 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [107] arXiv:2301.01795 [pdf, other]
  Title: PACO: Parts and Attributes of Common Objects
  Authors: Vignesh Ramanathan, Anmol Kalia, Vladan Petrovic, Yi Wen, Baixue Zheng, Baishan Guo, Rui Wang, Aaron Marquez, Rama Kovvuri, Abhishek Kadian, Amir Mousavi, Yiwen Song, Abhimanyu Dubey, Dhruv Mahajan
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [108] arXiv:2301.01802 [pdf, other]
  Title: MonoEdge: Monocular 3D Object Detection Using Local Perspectives
  Comments: WACV 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [109] arXiv:2301.01841 [pdf, ps, other]
  Title: Classification of Single Tree Decay Stages from Combined Airborne LiDAR Data and CIR Imagery
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [110] arXiv:2301.01842 [pdf, other]
  Title: Detecting Neighborhood Gentrification at Scale via Street-level Visual Data
  Authors: Tianyuan Huang, Timothy Dai, Zhecheng Wang, Hesu Yoon, Hao Sheng, Andrew Y. Ng, Ram Rajagopal, Jackelyn Hwang
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [111] arXiv:2301.01871 [pdf, other]
  Title: Hypotheses Tree Building for One-Shot Temporal Sentence Localization
  Comments: Accepted by AAAI2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [112] arXiv:2301.01879 [pdf, other]
  Title: Learning Feature Recovery Transformer for Occluded Person Re-identification
  Journal-ref: IEEE Transactions on Image Processing, vol. 31, pp. 4651-4662, 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [113] arXiv:2301.01882 [pdf, other]
  Title: InsPro: Propagating Instance Query and Proposal for Online Video Instance Segmentation
  Comments: NeurIPS 2022
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [114] arXiv:2301.01893 [pdf, other]
  Title: GIVL: Improving Geographical Inclusivity of Vision-Language Models with Pre-Training Methods
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [115] arXiv:2301.01914 [pdf, ps, other]
  Title: Accuracy and Fidelity Comparison of Luna and DALL-E 2 Diffusion-Based Image Generation Systems
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [116] arXiv:2301.01917 [pdf, other]
  Title: Flying Bird Object Detection Algorithm in Surveillance Video Based on Motion Information
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [117] arXiv:2301.01922 [pdf, other]
  Title: Open-Set Face Identification on Few-Shot Gallery by Fine-Tuning
  Journal-ref: 2022 26th International Conference on Pattern Recognition (ICPR), 2022, pp. 1026-1032
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [118] arXiv:2301.01928 [pdf, other]
  Title: Event Camera Data Pre-training
  Comments: Accepted by ICCV 2023
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [119] arXiv:2301.01931 [pdf, ps, other]
  Title: Reduced Deep Convolutional Activation Features (R-DeCAF) in Histopathology Images to Improve the Classification Performance for Breast Cancer Diagnosis
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [120] arXiv:2301.01932 [pdf, other]
  Title: PA-GM: Position-Aware Learning of Embedding Networks for Deep Graph Matching
  Comments: for dataset link, see this https URL
  Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [121] arXiv:2301.01953 [pdf, other]
  Title: Learning Trajectory-Word Alignments for Video-Language Tasks
  Authors: Xu Yang, Zhangzikang Li, Haiyang Xu, Hanwang Zhang, Qinghao Ye, Chenliang Li, Ming Yan, Yu Zhang, Fei Huang, Songfang Huang
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [122] arXiv:2301.01955 [pdf, other]
  Title: Adaptively Clustering Neighbor Elements for Image Captioning
  Authors: Zihua Wang, Xu Yang, Haiyang Xu, Hanwang Zhang, and Qinghao Ye, Chenliang Li, and Weiwei Sun, Ming Yan, Songfang Huang, Fei Huang, Yu Zhang
  Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [123] arXiv:2301.01956 [pdf, other]
-
Title: High-level semantic feature matters few-shot unsupervised domain adaptationComments: AAAI 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [124] arXiv:2301.01970 [pdf, other]
-
Title: CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object DetectionComments: CVPR 2023 camera-ready versionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [125] arXiv:2301.02008 [pdf, other]
-
Title: Expressive Speech-driven Facial Animation with controllable emotionsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [126] arXiv:2301.02009 [pdf, other]
-
Title: Learning by Sorting: Self-supervised Learning with Group Ordering ConstraintsComments: Published at ICCV 2023, Code @ this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [127] arXiv:2301.02031 [pdf, other]
-
Title: DLGSANet: Lightweight Dynamic Local and Global Self-Attention Networks for Image Super-ResolutionComments: More information is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [128] arXiv:2301.02064 [pdf, other]
-
Title: Single-round Self-supervised Distributed Learning using Vision TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [129] arXiv:2301.02074 [pdf, other]
-
Title: Test of Time: Instilling Video-Language Models with a Sense of TimeComments: Accepted for publication at CVPR 2023. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [130] arXiv:2301.02086 [pdf, other]
-
Title: A Probabilistic Framework for Visual Localization in Ambiguous ScenesSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [131] arXiv:2301.02092 [pdf, other]
-
Title: DepthP+P: Metric Accurate Monocular Depth Estimation using Planar and ParallaxSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [132] arXiv:2301.02110 [pdf, other]
-
Title: FICE: Text-Conditioned Fashion Image Editing With Guided GAN InversionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [133] arXiv:2301.02126 [pdf, other]
-
Title: CRADL: Contrastive Representations for Unsupervised Anomaly Detection and LocalizationAuthors: Carsten T. Lüth, David Zimmerer, Gregor Koehler, Paul F. Jaeger, Fabian Isensee, Jens Petersen, Klaus H. Maier-HeinSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [134] arXiv:2301.02145 [pdf, other]
-
Title: Domain Generalization via Ensemble Stacking for Face Presentation Attack DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [135] arXiv:2301.02160 [pdf, other]
-
Title: ANNA: Abstractive Text-to-Image Synthesis with Filtered News CaptionsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [136] arXiv:2301.02184 [pdf, other]
-
Title: Chat2Map: Efficient Scene Mapping from Multi-Ego ConversationsAuthors: Sagnik Majumder, Hao Jiang, Pierre Moulon, Ethan Henderson, Paul Calamia, Kristen Grauman, Vamsi Krishna IthapuComments: Accepted to CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [137] arXiv:2301.02217 [pdf, other]
-
Title: EgoDistill: Egocentric Head Motion Distillation for Efficient Video UnderstandingComments: Tech report. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [138] arXiv:2301.02229 [pdf, other]
-
Title: All in Tokens: Unifying Output Space of Visual Tasks via Soft TokenSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [139] arXiv:2301.02232 [pdf, other]
-
Title: CA$^2$T-Net: Category-Agnostic 3D Articulation Transfer from Single ImageComments: 8 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
- [140] arXiv:2301.02238 [pdf, other]
-
Title: HyperReel: High-Fidelity 6-DoF Video with Ray-Conditioned SamplingAuthors: Benjamin Attal, Jia-Bin Huang, Christian Richardt, Michael Zollhoefer, Johannes Kopf, Matthew O'Toole, Changil KimComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [141] arXiv:2301.02239 [pdf, other]
-
Title: Robust Dynamic Radiance FieldsAuthors: Yu-Lun Liu, Chen Gao, Andreas Meuleman, Hung-Yu Tseng, Ayush Saraf, Changil Kim, Yung-Yu Chuang, Johannes Kopf, Jia-Bin HuangComments: CVPR 2023. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [142] arXiv:2301.02240 [pdf, other]
-
Title: Skip-Attention: Improving Vision Transformers by Paying Less AttentionAuthors: Shashanka Venkataramanan, Amir Ghodrati, Yuki M. Asano, Fatih Porikli, Amirhossein HabibianSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [143] arXiv:2301.02241 [pdf, other]
-
Title: CiT: Curation in Training for Effective Vision-Language DataAuthors: Hu Xu, Saining Xie, Po-Yao Huang, Licheng Yu, Russell Howes, Gargi Ghosh, Luke Zettlemoyer, Christoph FeichtenhoferComments: Technical ReportSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [144] arXiv:2301.02277 [pdf, ps, other]
-
Title: LostNet: A smart way for lost and findSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
- [145] arXiv:2301.02280 [pdf, other]
-
Title: Filtering, Distillation, and Hard Negatives for Vision-Language Pre-TrainingAuthors: Filip Radenovic, Abhimanyu Dubey, Abhishek Kadian, Todor Mihaylov, Simon Vandenhende, Yash Patel, Yi Wen, Vignesh Ramanathan, Dhruv MahajanComments: CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [146] arXiv:2301.02307 [pdf, other]
-
Title: What You Say Is What You Show: Visual Narration Detection in Instructional VideosComments: Technical ReportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [147] arXiv:2301.02310 [pdf, other]
-
Title: PressureVision++: Estimating Fingertip Pressure from Diverse RGB ImagesAuthors: Patrick Grady, Jeremy A. Collins, Chengcheng Tang, Christopher D. Twigg, Kunal Aneja, James Hays, Charles C. KempComments: WACV 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [148] arXiv:2301.02311 [pdf, other]
-
Title: HierVL: Learning Hierarchical Video-Language EmbeddingsComments: CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [149] arXiv:2301.02315 [pdf, other]
-
Title: TempSAL -- Uncovering Temporal Information for Deep Saliency PredictionComments: 10 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [150] arXiv:2301.02364 [pdf, other]
-
Title: Object as Query: Lifting any 2D Object Detector to 3D DetectionComments: Accepted by ICCV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [151] arXiv:2301.02371 [pdf, other]
-
Title: Anchor3DLane: Learning to Regress 3D Anchors for Monocular 3D Lane DetectionAuthors: Shaofei Huang, Zhenwei Shen, Zehao Huang, Zi-han Ding, Jiao Dai, Jizhong Han, Naiyan Wang, Si LiuComments: Accepted by CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [152] arXiv:2301.02379 [pdf, other]
-
Title: CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion PriorSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [153] arXiv:2301.02403 [pdf, other]
-
Title: CyberLoc: Towards Accurate Long-term Visual LocalizationAuthors: Liu Liu, Yukai Lin, Xiao Liang, Qichao Xu, Miao Jia, Yangdong Liu, Yuxiang Wen, Wei Luo, Jiangwei LiComments: MLAD-ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [154] arXiv:2301.02419 [pdf, other]
-
Title: Exploring Efficient Few-shot Adaptation for Vision TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [155] arXiv:2301.02440 [pdf, ps, other]
-
Title: An Image captioning algorithm based on the Hybrid Deep Learning Technique (CNN+GRU)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [156] arXiv:2301.02484 [pdf, other]
-
Title: Graph-Collaborated Auto-Encoder Hashing for Multi-view Binary ClusteringSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [157] arXiv:2301.02508 [pdf, other]
-
Title: End-to-End 3D Dense Captioning with Vote2Cap-DETRSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [158] arXiv:2301.02524 [pdf, other]
-
Title: Tackling Data Bias in Painting Classification with Style TransferComments: International Conference on Computer Vision Theory and Applications (VISAPP), 2023, 12 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [159] arXiv:2301.02560 [pdf, other]
-
Title: GeoDE: a Geographically Diverse Evaluation Dataset for Object RecognitionAuthors: Vikram V. Ramaswamy, Sing Yu Lin, Dora Zhao, Aaron B. Adcock, Laurens van der Maaten, Deepti Ghadiyaram, Olga RussakovskySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [160] arXiv:2301.02562 [pdf, other]
-
Title: Super Sparse 3D Object DetectionComments: Extension of Fully Sparse 3D Object Detection [arXiv:2207.10035]Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [161] arXiv:2301.02642 [pdf, other]
-
Title: Triple-stream Deep Metric Learning of Great Ape Behavioural ActionsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [162] arXiv:2301.02650 [pdf, other]
-
Title: Model-Agnostic Hierarchical Attention for 3D Object DetectionAuthors: Manli Shu, Le Xue, Ning Yu, Roberto Martín-Martín, Juan Carlos Niebles, Caiming Xiong, Ran XuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [163] arXiv:2301.02657 [pdf, other]
-
Title: TarViS: A Unified Approach for Target-based Video SegmentationComments: Accepted to CVPR'23 (Highlight). Code is available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [164] arXiv:2301.02667 [pdf, other]
-
Title: Locomotion-Action-Manipulation: Synthesizing Human-Scene Interactions in Complex 3D EnvironmentsComments: Accepted to ICCV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Robotics (cs.RO)
- [165] arXiv:2301.02693 [pdf, ps, other]
-
Title: Design of Arabic Sign Language Recognition ModelSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [166] arXiv:2301.02700 [pdf, other]
-
Title: 3DAvatarGAN: Bridging Domains for Personalized Editable AvatarsAuthors: Rameen Abdal, Hsin-Ying Lee, Peihao Zhu, Menglei Chai, Aliaksandr Siarohin, Peter Wonka, Sergey TulyakovComments: Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [167] arXiv:2301.02703 [pdf, other]
-
Title: RUPNet: Residual upsampling network for real-time polyp segmentationComments: Accepted SPIE Medical Imaging 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [168] arXiv:2301.02726 [pdf, other]
-
Title: Augmenting Ego-Vehicle for Traffic Near-Miss and Accident Classification Dataset using Manipulating Conditional Style TranslationComments: 8 pages, conferenceJournal-ref: International Conference on Digital Image Computing: Techniques and Applications (DICTA) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [169] arXiv:2301.02778 [pdf, other]
-
Title: Lightweight Salient Object Detection in Optical Remote-Sensing Images via Semantic Matching and Edge AlignmentComments: 11 pages, 4 figures, Accepted by IEEE Transactions on Geoscience and Remote Sensing 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [170] arXiv:2301.02789 [pdf, other]
-
Title: CGI-Stereo: Accurate and Real-Time Stereo Matching via Context and Geometry InteractionComments: 10 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [171] arXiv:2301.02830 [pdf, other]
-
Title: Image Data Augmentation Approaches: A Comprehensive Survey and Future directionsComments: We need to make a lot changes to make its quality betterSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [172] arXiv:2301.02836 [pdf, other]
- [173] arXiv:2301.02869 [pdf, ps, other]
-
Title: Deep Learning-Based UAV Aerial Triangulation without Image Control PointsComments: Accepted to the 42nd Asian Conference on Remote Sensing 2021 (ACRS2021)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [174] arXiv:2301.02911 [pdf, other]
-
Title: Towards early prediction of neurodevelopmental disorders: Computational model for Face Touch and Self-adaptors in InfantsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [175] arXiv:2301.02925 [pdf, other]
-
Title: Multiclass Semantic Segmentation to Identify Anatomical Sub-Regions of Brain and Measure Neuronal Health in Parkinson's DiseaseAuthors: Hosein Barzekar, Hai Ngu, Han Hui Lin, Mohsen Hejrati, Steven Ray Valdespino, Sarah Chu, Baris Bingol, Somaye Hashemifar, Soumitra GhoshSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [176] arXiv:2301.02933 [pdf, other]
-
Title: Weakly Supervised Joint Whole-Slide Segmentation and Classification in Prostate CancerAuthors: Pushpak Pati, Guillaume Jaume, Zeineb Ayadi, Kevin Thandiackal, Behzad Bozorgtabar, Maria Gabrani, Orcun GokselSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [177] arXiv:2301.02934 [pdf, ps, other]
-
Title: Advancing 3D finger knuckle recognition via deep feature learningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [178] arXiv:2301.02969 [pdf, ps, other]
-
Title: Multi-scale multi-modal micro-expression recognition algorithm based on transformerSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [179] arXiv:2301.02979 [pdf, other]
-
Title: CameraPose: Weakly-Supervised Monocular 3D Human Pose Estimation by Leveraging In-the-wild 2D AnnotationsAuthors: Cheng-Yen Yang, Jiajia Luo, Lu Xia, Yuyin Sun, Nan Qiao, Ke Zhang, Zhongyu Jiang, Jenq-Neng HwangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [180] arXiv:2301.02989 [pdf, other]
-
Title: Fair Multi-Exit Framework for Facial Attribute ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [181] arXiv:2301.02993 [pdf, other]
-
Title: DeepMatcher: A Deep Transformer-based Network for Robust and Accurate Local Feature MatchingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [182] arXiv:2301.03033 [pdf, other]
-
Title: RGB-T Multi-Modal Crowd Counting Based on TransformerJournal-ref: BMVC2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [183] arXiv:2301.03036 [pdf, other]
-
Title: HRTransNet: HRFormer-Driven Two-Modality Salient Object DetectionJournal-ref: TCSVT2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [184] arXiv:2301.03039 [pdf, ps, other]
-
Title: Equivalence of Two Expressions of Principal LineComments: 5 pages, 3 figures, 47 equationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [185] arXiv:2301.03041 [pdf, other]
-
Title: Learning the Relation between Similarity Loss and Clustering Loss in Self-Supervised LearningAuthors: Jidong Ge, Yuxiang Liu, Jie Gui, Lanting Fang, Ming Lin, James Tin-Yau Kwok, LiGuo Huang, Bin LuoComments: This paper is accepted by IEEE Transactions on Image ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [186] arXiv:2301.03045 [pdf, other]
-
Title: Seamless Multimodal Biometrics for Continuous Personalised Wellbeing MonitoringAuthors: João Ribeiro PintoComments: Doctoral thesis presented and approved on the 21st of December 2022 to the University of PortoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [187] arXiv:2301.03046 [pdf, other]
-
Title: STPrivacy: Spatio-Temporal Privacy-Preserving Action RecognitionAuthors: Ming Li, Xiangyu Xu, Hehe Fan, Pan Zhou, Jun Liu, Jia-Wei Liu, Jiahe Li, Jussi Keppo, Mike Zheng Shou, Shuicheng YanSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [188] arXiv:2301.03110 [pdf, other]
-
Title: RobArch: Designing Robust Architectures against Adversarial AttacksAuthors: ShengYun Peng, Weilin Xu, Cory Cornelius, Kevin Li, Rahul Duggal, Duen Horng Chau, Jason MartinSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [189] arXiv:2301.03130 [pdf, ps, other]
-
Title: SFI-Swin: Symmetric Face Inpainting with Swin Transformer by Distinctly Learning Face Components DistributionsAuthors: MohammadReza Naderi, MohammadHossein Givkashi, Nader Karimi, Shahram Shirani, Shadrokh SamaviComments: 13 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [190] arXiv:2301.03155 [pdf, other]
-
Title: Instance Segmentation Based Graph Extraction for Handwritten Circuit Diagram ImagesComments: As submitted to ICPRAM23Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [191] arXiv:2301.03160 [pdf, other]
-
Title: Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding NetworkComments: 9 pages, 5 figures, accepted by AAAI23Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [192] arXiv:2301.03164 [pdf, other]
-
Title: Cursive Caption Text Detection in VideosComments: 19 pages, 16 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [193] arXiv:2301.03169 [pdf, other]
-
Title: A Study on the Generality of Neural Network Structures for Monocular Depth EstimationComments: Accepted in TPAMISubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [194] arXiv:2301.03178 [pdf, other]
-
Title: Deep Planar Parallax for Monocular Depth EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [195] arXiv:2301.03182 [pdf, other]
-
Title: Structure-Informed Shadow Removal NetworksComments: IEEE TIPSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [196] arXiv:2301.03194 [pdf, other]
-
Title: Few-shot Semantic Segmentation with Support-induced Graph Convolutional NetworkComments: Accepted in BMVC2022 as oral presentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [197] arXiv:2301.03198 [pdf, ps, other]
-
Title: The Algonauts Project 2023 Challenge: How the Human Brain Makes Sense of Natural ScenesAuthors: A. T. Gifford, B. Lahner, S. Saba-Sadiya, M. G. Vilas, A. Lascelles, A. Oliva, K. Kay, G. Roig, R. M. CichyComments: 5 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
- [198] arXiv:2301.03213 [pdf, other]
-
Title: EgoTracks: A Long-term Egocentric Visual Object Tracking DatasetSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [199] arXiv:2301.03322 [pdf, other]
-
Title: Simplifying Open-Set Video Domain Adaptation with Contrastive LearningComments: Currently under review at Computer Vision and Image Understanding (CVIU) journalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [200] arXiv:2301.03330 [pdf, other]
-
Title: HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot Action RecognitionComments: An extended version of a paper arXiv:2204.13423 published in CVPR 2022. This work has been submitted to the Springer for possible publicationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [201] arXiv:2301.03331 [pdf, other]
-
Title: A Specific Task-oriented Semantic Image Communication System for substation patrol inspectionComments: 9 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
- [202] arXiv:2301.03393 [pdf, other]
-
Title: Difference of Anisotropic and Isotropic TV for Segmentation under Blur and Poisson NoiseComments: Accepted to Frontiers in Computer Science: this https URL; Arxiv version has clearer images best for zooming inSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [203] arXiv:2301.03396 [pdf, other]
-
Title: Diffused Heads: Diffusion Models Beat GANs on Talking-Face GenerationAuthors: Michał Stypułkowski, Konstantinos Vougioukas, Sen He, Maciej Zięba, Stavros Petridis, Maja PanticSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [204] arXiv:2301.03407 [pdf, other]
-
Title: On Advantages of Mask-level Recognition for Outlier-aware SegmentationComments: Accepted to CVPR 2023 workshop on Visual Anomaly and Novelty Detection (VAND)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [205] arXiv:2301.03410 [pdf, other]
-
Title: In Defense of Structural Symbolic Representation for Video Event-Relation PredictionComments: CVPRW 23, Learning with Limited Labelled DataSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [206] arXiv:2301.03426 [pdf, other]
-
Title: LTS-NET: End-to-end Unsupervised Learning of Long-Term 3D Stable objectsSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [207] arXiv:2301.03432 [pdf, other]
-
Title: High-Resolution Cloud Removal with Multi-Modal and Multi-Resolution Data Fusion: A New Baseline and BenchmarkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [208] arXiv:2301.03461 [pdf, other]
-
Title: DeMT: Deformable Mixer Transformer for Multi-Task Learning of Dense PredictionComments: Accepted by AAAI2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [209] arXiv:2301.03495 [pdf, other]
-
Title: On the challenges to learn from Natural Data StreamsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [210] arXiv:2301.03505 [pdf, other]
-
Title: Advances in Medical Image Analysis with Vision Transformers: A Comprehensive ReviewAuthors: Reza Azad, Amirhossein Kazerouni, Moein Heidari, Ehsan Khodapanah Aghdam, Amirali Molaei, Yiwei Jia, Abin Jose, Rijo Roy, Dorit MerhofComments: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [211] arXiv:2301.03510 [pdf, other]
-
Title: Parallel Reasoning Network for Human-Object Interaction DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [212] arXiv:2301.03512 [pdf, other]
-
Title: SCENE: Reasoning about Traffic Scenes using Heterogeneous Graph Neural NetworksAuthors: Thomas Monninger, Julian Schmidt, Jan Rupprecht, David Raba, Julian Jordan, Daniel Frank, Steffen Staab, Klaus DietmayerComments: Thomas Monninger and Julian Schmidt are co-first authors. The order was determined alphabeticallyJournal-ref: IEEE Robotics and Automation Letters (RA-L), 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [213] arXiv:2301.03561 [pdf, other]
-
Title: Ancilia: Scalable Intelligent Video Surveillance for the Artificial Intelligence of ThingsAuthors: Armin Danesh Pazho, Christopher Neff, Ghazal Alinezhad Noghre, Babak Rahimi Ardabili, Shanle Yao, Mohammadreza Baharani, Hamed TabkhiSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
- [214] arXiv:2301.03563 [pdf, other]
-
Title: An Impartial Transformer for Story VisualizationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [215] arXiv:2301.03580 [pdf, other]
-
Title: Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked ModelingComments: v2: fixed some formatting errorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [216] arXiv:2301.03767 [pdf, other]
-
Title: Online Backfilling with No Regret for Large-Scale Image RetrievalAuthors: Seonguk Seo, Mustafa Gokhan Uzunbas, Bohyung Han, Sara Cao, Joena Zhang, Taipeng Tian, Ser-Nam LimSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [217] arXiv:2301.03769 [pdf, other]
-
Title: Learning from What is Already Out There: Few-shot Sign Language Recognition with Online DictionariesComments: 6 pages, 2 figures, IEEE Face & Gestures 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [218] arXiv:2301.03786 [pdf, other]
-
Title: DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits AnimationComments: Project page this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [219] arXiv:2301.03796 [pdf, other]
-
Title: Assessing the applicability of common performance metrics for real-world infrared small-target detectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [220] arXiv:2301.03826 [pdf, other]
-
Title: CDA: Contrastive-adversarial Domain AdaptationAuthors: Nishant Yadav, Mahbubul Alam, Ahmed Farahat, Dipanjan Ghosh, Chetan Gupta, Auroop R. GangulySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [221] arXiv:2301.03831 [pdf, other]
-
Title: Dynamic Grained Encoder for Vision TransformersAuthors: Lin Song, Songyang Zhang, Songtao Liu, Zeming Li, Xuming He, Hongbin Sun, Jian Sun, Nanning ZhengComments: Accepted by NeurIPS2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [222] arXiv:2301.03832 [pdf, other]
-
Title: Video Semantic Segmentation with Inter-Frame Feature Fusion and Inner-Frame Feature RefinementSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [223] arXiv:2301.03843 [pdf, other]
-
Title: A Privacy Preserving Method with a Random Orthogonal Matrix for ConvMixer ModelsComments: To appear in 2023 RISP International Workshop on Nonlinear Circuits, Communications and Signal ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [224] arXiv:2301.03949 [pdf, other]
-
Title: Modiff: Action-Conditioned 3D Motion Generation with Denoising Diffusion Probabilistic ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [225] arXiv:2301.03966 [pdf, other]
-
Title: AdvBiom: Adversarial Attacks on Biometric MatchersComments: arXiv admin note: text overlap with arXiv:1908.05008Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [226] arXiv:2301.03976 [pdf, ps, other]
-
Title: Semi-Supervised Learning with Pseudo-Negative Labels for Image ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [227] arXiv:2301.03992 [pdf, other]
-
Title: Vision Transformers Are Good Mask Auto-LabelersSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
- [228] arXiv:2301.04011 [pdf, other]
-
Title: Learning Support and Trivial Prototypes for Interpretable Image ClassificationAuthors: Chong Wang, Yuyuan Liu, Yuanhong Chen, Fengbei Liu, Yu Tian, Davis J. McCarthy, Helen Frazer, Gustavo CarneiroComments: ICCV 2023, Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [229] arXiv:2301.04019 [pdf, other]
-
Title: FGAHOI: Fine-Grained Anchors for Human-Object Interaction DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [230] arXiv:2301.04037 [pdf, other]
-
Title: ROBUSfT: Robust Real-Time Shape-from-Template, a C++ LibraryComments: This is the arXiv version of an article published in Image and Vision Computing. Please cite the accepted version: M. Shetab-Bushehri, M. Aranda, E. Ozgur, Y. Mezouar and Adrien Bartoli "ROBUSfT: Robust Real-Time Shape-from-Template, a C++ Library," in Image and Vision Computing, doi: 10.1016/j.imavis.2023.104867Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [231] arXiv:2301.04058 [pdf, other]
-
Title: Rethinking Voxelization and Classification for 3D Object DetectionComments: Accepted in ICONIP 2022. arXiv admin note: text overlap with arXiv:1902.06326 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [232] arXiv:2301.04075 [pdf, other]
-
Title: Benchmarking Robustness in Neural Radiance FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [233] arXiv:2301.04101 [pdf, other]
-
Title: Neural Radiance Field CodebooksAuthors: Matthew Wallingford, Aditya Kusupati, Alex Fang, Vivek Ramanujan, Aniruddha Kembhavi, Roozbeh Mottaghi, Ali FarhadiComments: 19 pages, 8 figures, 9 tablesJournal-ref: International Conference on Learning Representations 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [234] arXiv:2301.04212 [pdf, other]
-
Title: Deep Learning based Multi-Label Image Classification of Protest ActivitiesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [235] arXiv:2301.04218 [pdf, other]
-
Title: Leveraging Diffusion For Strong and High Quality Face Morphing AttacksComments: Under ReviewJournal-ref: IEEE Transactions on Biometrics, Behavior, and Identity Science (Volume: 6, Issue: 1, January 2024)Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [236] arXiv:2301.04221 [pdf, other]
-
Title: Explaining Deep Models through Forgettable Learning DynamicsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [237] arXiv:2301.04224 [pdf, other]
-
Title: Pix2Map: Cross-modal Retrieval for Inferring Street Maps from ImagesComments: 12 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [238] arXiv:2301.04233 [pdf, other]
- [239] arXiv:2301.04243 [pdf, other]
-
Title: Robust Human Identity Anonymization using Pose EstimationComments: Source code will be available at this https URLJournal-ref: 2022 IEEE 18th International Conference on Automation Science and Engineering (CASE), Mexico City, Mexico, 2022, pp. 619-626Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [240] arXiv:2301.04258 [pdf, other]
-
Title: CARD: Semantic Segmentation with Efficient Class-Aware Regularized DecoderAuthors: Ye Huang, Di Kang, Liang Chen, Wenjing Jia, Xiangjian He, Lixin Duan, Xuefei Zhe, Linchao BaoComments: Tech report, text extended from arXiv:2203.07160Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [241] arXiv:2301.04265 [pdf, other]
-
Title: Adversarial Alignment for Source Free Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [242] arXiv:2301.04275 [pdf, other]
-
Title: LENet: Lightweight And Efficient LiDAR Semantic Segmentation Using Multi-Scale Convolution AttentionAuthors: Ben DingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [243] arXiv:2301.04288 [pdf, other]
-
Title: Generic Event Boundary Detection in Video with Pyramid FeaturesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [244] arXiv:2301.04352 [pdf, ps, other]
-
Title: Graph based Environment Representation for Vision-and-Language Navigation in Continuous EnvironmentsComments: 10 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [245] arXiv:2301.04410 [pdf, other]
-
Title: GraVIS: Grouping Augmented Views from Independent Sources for Dermatology AnalysisComments: Accepted by IEEE Transactions on Medical Imaging. The code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [246] arXiv:2301.04414 [pdf, ps, other]
-
Title: How Does Traffic Environment Quantitatively Affect the Autonomous Driving Prediction?Comments: 16 pages, 15 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [247] arXiv:2301.04422 [pdf, other]
-
Title: Optical Flow for Autonomous Driving: Applications, Challenges and ImprovementsComments: Accepted for publication in Electronic Imaging, Autonomous Vehicles and Machines 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [248] arXiv:2301.04447 [pdf, other]
-
Title: VS-Net: Multiscale Spatiotemporal Features for Lightweight Video Salient Document DetectionJournal-ref: https://ictai.computer.org/2022/ Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [249] arXiv:2301.04454 [pdf, other]
-
Title: Allo-centric Occupancy Grid Prediction for Urban Traffic Scene Using Video Prediction NetworksComments: ICARCV 2022-17th International Conference on Control, Automation, Robotics and VisionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [250] arXiv:2301.04460 [pdf, other]
-
Title: Fast spline detection in high density microscopy dataJournal-ref: Nature Communications Biology, 6 (2023) 754Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
- [251] arXiv:2301.04465 [pdf, ps, other]
-
Title: Co-training with High-Confidence Pseudo Labels for Semi-supervised Medical Image SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [252] arXiv:2301.04467 [pdf, other]
-
Title: FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D DetectionComments: Accepted to CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [253] arXiv:2301.04470 [pdf, other]
-
Title: InstaGraM: Instance-level Graph Modeling for Vectorized HD Map LearningComments: Workshop on Vision-Centric Autonomous Driving (VCAD) at Conference on Computer Vision and Pattern Recognition (CVPR) 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [254] arXiv:2301.04474 [pdf, other]
-
Title: Speech Driven Video Editing via an Audio-Conditioned Diffusion ModelAuthors: Dan Bigioi, Shubhajit Basak, Michał Stypułkowski, Maciej Zięba, Hugh Jordan, Rachel McDonnell, Peter CorcoranComments: 8 Pages, code and project page available here: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [255] arXiv:2301.04494 [pdf, other]
-
Title: Multi-label Image Classification using Adaptive Graph Convolutional Networks: from a Single Domain to Multiple DomainsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [256] arXiv:2301.04497 [pdf, other]
-
Title: Dynamic Background Reconstruction via MAE for Infrared Small Target DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [257] arXiv:2301.04502 [pdf, other]
-
Title: Pruning Compact ConvNets for Efficient InferenceAuthors: Sayan Ghosh, Karthik Prasad, Xiaoliang Dai, Peizhao Zhang, Bichen Wu, Graham Cormode, Peter VajdaSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [258] arXiv:2301.04517 [pdf, other]
-
Title: A new sampling methodology for defining heterogeneous subsets of samples for training image segmentation algorithmsAuthors: Matheus Viana da Silva, Natália de Carvalho Santos, Julie Ouellette, Baptiste Lacoste, Cesar Henrique CominComments: 10 pages, 9 figures. Under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [259] arXiv:2301.04545 [pdf, other]
-
Title: AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [260] arXiv:2301.04554 [pdf, other]
-
Title: Universal Detection of Backdoor Attacks via Density-based Clustering and Centroids AnalysisJournal-ref: IEEE TIFS 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [261] arXiv:2301.04558 [pdf, other]
-
Title: Learning to Exploit Temporal Structure for Biomedical Vision-Language ProcessingAuthors: Shruthi Bannur, Stephanie Hyland, Qianchu Liu, Fernando Pérez-García, Maximilian Ilse, Daniel C. Castro, Benedikt Boecking, Harshita Sharma, Kenza Bouzid, Anja Thieme, Anton Schwaighofer, Maria Wetscherek, Matthew P. Lungren, Aditya Nori, Javier Alvarez-Valle, Ozan OktayComments: To appear in CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [262] arXiv:2301.04581 [pdf, other]
-
Title: Elevation Estimation-Driven Building 3D Reconstruction from Single-View Remote Sensing ImageryAuthors: Yongqiang Mao, Kaiqiang Chen, Liangjin Zhao, Wei Chen, Deke Tang, Wenjie Liu, Zhirui Wang, Wenhui Diao, Xian Sun, Kun FuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [263] arXiv:2301.04604 [pdf, other]
-
Title: LinkGAN: Linking GAN Latents to Pixels for Controllable Image SynthesisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [264] arXiv:2301.04608 [pdf, other]
-
Title: Padding Module: Learning the Padding in Deep Neural NetworksComments: This paper has been accepted for publication by the IEEE AccessSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [265] arXiv:2301.04612 [pdf, other]
-
Title: Generative-Contrastive Learning for Self-Supervised Latent Representations of 3D Shapes from Multi-Modal Euclidean InputSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [266] arXiv:2301.04613 [pdf, other]
-
Title: Object Detection in 3D Point Clouds via Local Correlation-Aware Point EmbeddingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [267] arXiv:2301.04619 [pdf, other]
-
Title: TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders using Hierarchical Maps DistillationAuthors: Feiyan Hu, Simone Palazzo, Federica Proietto Salanitri, Giovanni Bellitto, Morteza Moradi, Concetto Spampinato, Kevin McGuinnessComments: WACV2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [268] arXiv:2301.04623 [pdf, other]
-
Title: Enhancing ResNet Image Classification Performance by using Parameterized Hypercomplex MultiplicationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [269] arXiv:2301.04626 [pdf, other]
-
Title: Deep Axial Hypercomplex NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [270] arXiv:2301.04628 [pdf, other]
-
Title: Face Attribute Editing with Disentangled Latent VectorsComments: See this https URL for the project webpage. arXiv admin note: substantial text overlap with arXiv:2207.03411Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [271] arXiv:2301.04631 [pdf, other]
-
Title: Deep Residual Axial NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [272] arXiv:2301.04634 [pdf, other]
-
Title: Street-View Image Generation from a Bird's-Eye View LayoutSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [273] arXiv:2301.04644 [pdf, other]
-
Title: Does progress on ImageNet transfer to real-world datasets?Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [274] arXiv:2301.04647 [pdf, other]
-
Title: EXIF as Language: Learning Cross-Modal Associations Between Images and Camera MetadataComments: CVPR 2023 (Highlight). Project link: this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [275] arXiv:2301.04648 [pdf, other]
-
Title: Head-Free Lightweight Semantic Segmentation with Linear TransformerComments: Accepted by AAAI2023; codes and models are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [276] arXiv:2301.04650 [pdf, other]
-
Title: Geometry-biased Transformers for Novel View SynthesisComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [277] arXiv:2301.04685 [pdf, other]
-
Title: SHUNIT: Style Harmonization for Unpaired Image-to-Image TranslationComments: Accepted to AAAI 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [278] arXiv:2301.04695 [pdf, other]
-
Title: Learning Continuous Mesh Representation with Spherical Implicit SurfaceAuthors: Zhongpai GaoComments: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [279] arXiv:2301.04705 [pdf, other]
-
Title: Inverse Quantum Fourier Transform Inspired Algorithm for Unsupervised Image SegmentationComments: 8 pages, 10 figures, conferenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantum Physics (quant-ph)
- [280] arXiv:2301.04733 [pdf, ps, other]
-
Title: AGMN: Association Graph-based Graph Matching Network for Coronary Artery Semantic Labeling on Invasive Coronary AngiogramsAuthors: Chen Zhao, Zhihui Xu, Jingfeng Jiang, Michele Esposito, Drew Pienta, Guang-Uei Hung, Weihua ZhouComments: 26 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [281] arXiv:2301.04742 [pdf, other]
-
Title: HADA: A Graph-based Amalgamation Framework in Image-text RetrievalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [282] arXiv:2301.04748 [pdf, other]
-
Title: LSDM: Long-Short Diffeomorphic Motion for Weakly-Supervised Ultrasound Landmark TrackingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [283] arXiv:2301.04751 [pdf, other]
-
Title: Artificial Intelligence Generated Coins for Size ComparisonAuthors: Gerald ArtnerJournal-ref: Mitteilungen der Österreichischen Numismatischen Gesellschaft, vol. 62, no. 2, pp. 9-16, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [284] arXiv:2301.04783 [pdf, other]
-
Title: Predictive World Models from Real-World Partial ObservationsComments: Best Paper Award at IEEE MOST 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [285] arXiv:2301.04795 [pdf, other]
-
Title: 1st Place Solution for ECCV 2022 OOD-CV Challenge Image Classification TrackComments: Tech ReportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [286] arXiv:2301.04796 [pdf, other]
-
Title: 1st Place Solution for ECCV 2022 OOD-CV Challenge Object Detection TrackComments: Tech ReportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [287] arXiv:2301.04799 [pdf, ps, other]
-
Title: Adaptive Context Selection for Polyp SegmentationComments: Accepted by MICCAI2020Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [288] arXiv:2301.04805 [pdf, other]
-
Title: DEA-Net: Single image dehazing based on detail-enhanced convolution and content-guided attentionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [289] arXiv:2301.04811 [pdf, ps, other]
-
Title: Deformation measurement of a soil mixing retaining wall using terrestrial laser scanningComments: 22 pagesJournal-ref: Lasers in Engineering: Volume 54, Number 1-3 (2023)Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [290] arXiv:2301.04842 [pdf, other]
-
Title: Towards High Performance One-Stage Human Pose EstimationComments: 5 pages, 5 figures, accepted at ACM Multimedia Asia (MMAsia) 2022Journal-ref: ACM Multimedia Asia (MMAsia '22), December 13-16, 2022, Tokyo, JapanSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [291] arXiv:2301.04847 [pdf, other]
-
Title: Real-time FPGA implementation of the Semi-Global Matching stereo vision algorithm for a 4K/UHD video streamComments: Paper accepted for the DASIP 2023 workshop in conjunction with HiPEAC 2023Journal-ref: Design and Architecture for Signal and Image Processing. DASIP 2023. Lecture Notes in Computer Science, vol 13879. Springer, ChamSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [292] arXiv:2301.04860 [pdf, other]
-
Title: Edge Preserving Implicit Surface Representation of Point CloudsSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [293] arXiv:2301.04866 [pdf, other]
-
Title: Self-Supervised Correction Learning for Semi-Supervised Biomedical Image SegmentationComments: Accepted by MICCAI2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [294] arXiv:2301.04870 [pdf, other]
-
Title: Semantic Segmentation via Pixel-to-Center Similarity CalculationComments: 13 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [295] arXiv:2301.04882 [pdf, other]
-
Title: ZScribbleSeg: Zen and the Art of Scribble Supervised Medical Image SegmentationComments: 31 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [296] arXiv:2301.04926 [pdf, other]
-
Title: CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIPAuthors: Runnan Chen, Youquan Liu, Lingdong Kong, Xinge Zhu, Yuexin Ma, Yikang Li, Yuenan Hou, Yu Qiao, Wenping WangComments: CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [297] arXiv:2301.04937 [pdf, other]
-
Title: Density-based clustering with fully-convolutional networks for crowd flow detection from dronesComments: Accepted manuscriptJournal-ref: Neurocomputing (2023)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [298] arXiv:2301.04944 [pdf, other]
-
Title: ViTs for SITS: Vision Transformers for Satellite Image Time SeriesComments: 11 pages, 5 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [299] arXiv:2301.04956 [pdf, other]
-
Title: Graph Laplacian for Semi-Supervised LearningComments: 12 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
- [300] arXiv:2301.04970 [pdf, other]
-
Title: Hierarchical Dynamic Masks for Visual Explanation of Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [301] arXiv:2301.05012 [pdf, other]
-
Title: Fairly Private: Investigating The Fairness of Visual Privacy Preservation AlgorithmsComments: Camera-ready version for the PPAI-23 workshop of the AAAI23Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [302] arXiv:2301.05027 [pdf, other]
-
Title: SynMotor: A Benchmark Suite for Object Attribute Regression and Multi-task LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [303] arXiv:2301.05033 [pdf, other]
-
Title: Sim2real Transfer Learning for Point Cloud Segmentation: An Industrial Application Case on Autonomous DisassemblySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [304] arXiv:2301.05065 [pdf, other]
-
Title: Toward Building General Foundation Models for Language, Vision, and Vision-Language Understanding TasksSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [305] arXiv:2301.05070 [pdf, ps, other]
-
Title: Wildfire Smoke Detection with Computer VisionAuthors: Eldan R. DanielSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [306] arXiv:2301.05124 [pdf, other]
-
Title: Poses of People in Art: A Data Set for Human Pose Estimation in Digital Art HistorySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [307] arXiv:2301.05158 [pdf, other]
-
Title: SemPPL: Predicting pseudo-labels for better contrastive representationsAuthors: Matko Bošnjak, Pierre H. Richemond, Nenad Tomasev, Florian Strub, Jacob C. Walker, Felix Hill, Lars Holger Buesing, Razvan Pascanu, Charles Blundell, Jovana MitrovicComments: Published as a conference paper at ICLR 2023. For checkpoints and source code see this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [308] arXiv:2301.05175 [pdf, other]
-
Title: Scene-Aware 3D Multi-Human Motion Capture from a Single CameraComments: Accepted to Eurographics 2023. See also github: this https URL project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [309] arXiv:2301.05187 [pdf, other]
-
Title: WIRE: Wavelet Implicit Neural RepresentationsAuthors: Vishwanath Saragadam, Daniel LeJeune, Jasper Tan, Guha Balakrishnan, Ashok Veeraraghavan, Richard G. BaraniukSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
- [310] arXiv:2301.05191 [pdf, other]
-
Title: Event-Based Frame Interpolation with Ad-hoc DeblurringAuthors: Lei Sun, Christos Sakaridis, Jingyun Liang, Peng Sun, Jiezhang Cao, Kai Zhang, Qi Jiang, Kaiwei Wang, Luc Van GoolSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [311] arXiv:2301.05211 [pdf, other]
-
Title: Accidental Light ProbesAuthors: Hong-Xing Yu, Samir Agarwala, Charles Herrmann, Richard Szeliski, Noah Snavely, Jiajun Wu, Deqing SunComments: CVPR2023. Project website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [312] arXiv:2301.05213 [pdf, other]
-
Title: Learning to Summarize Videos by Contrasting ClipsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [313] arXiv:2301.05219 [pdf, other]
-
Title: Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network PruningComments: 17 pages, v2, corrected wrong referencesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [314] arXiv:2301.05221 [pdf, other]
-
Title: Open-vocabulary Object Segmentation with Diffusion ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [315] arXiv:2301.05225 [pdf, other]
-
Title: Domain Expansion of Image GeneratorsAuthors: Yotam Nitzan, Michaël Gharbi, Richard Zhang, Taesung Park, Jun-Yan Zhu, Daniel Cohen-Or, Eli ShechtmanComments: Project Page and code are available at this https URL CVPR 2023 Camera-ReadySubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [316] arXiv:2301.05226 [pdf, other]
-
Title: See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual ReasoningComments: The first two authors contributed equally to this workSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [317] arXiv:2301.05246 [pdf, other]
-
Title: Online Class-Incremental Learning For Real-World Food Image ClassificationComments: Accepted at IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [318] arXiv:2301.05315 [pdf, other]
-
Title: GH-Feat: Learning Versatile Generative Hierarchical Features from GANsComments: Accepted by TPAMI 2022. arXiv admin note: text overlap with arXiv:2007.10379Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [319] arXiv:2301.05323 [pdf, other]
-
Title: Salient Object Detection for Images Taken by People With Vision ImpairmentsComments: Computer Vision and Pattern RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [320] arXiv:2301.05372 [pdf, other]
-
Title: Text to Point Cloud Localization with Relation-Enhanced TransformerComments: 9 pages, 5 figures, accepted to AAAI-2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [321] arXiv:2301.05392 [pdf, other]
-
Title: Multi-Target Landmark Detection with Incomplete Images via Reinforcement Learning and Shape PriorAuthors: Kaiwen Wan, Lei Li, Dengqiang Jia, Shangqi Gao, Wei Qian, Yingzhi Wu, Huandong Lin, Xiongzheng Mu, Xin Gao, Sijia Wang, Fuping Wu, Xiahai ZhuangComments: 29 pages, 13 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [322] arXiv:2301.05421 [pdf, other]
-
Title: Anti-aliasing Predictive Coding Network for Future Video Frame PredictionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [323] arXiv:2301.05434 [pdf, other]
-
Title: LVRNet: Lightweight Image Restoration for Aerial Images under Low VisibilitySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [324] arXiv:2301.05435 [pdf, other]
-
Title: Towards Single Camera Human 3D-KinematicsAuthors: Marian Bittner, Wei-Tse Yang, Xucong Zhang, Ajay Seth, Jan van Gemert, Frans C. T. van der HelmComments: Published in the MDPI Sensors special Issue "Sensors and Musculoskeletal Dynamics to Evaluate Human Movement" on December 28, 2022Journal-ref: Sensors 2023, 23(1), 341Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [325] arXiv:2301.05440 [pdf, ps, other]
-
Title: Learnable Heterogeneous Convolution: Learning both topology and strengthComments: Published in Neural Networks journalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [326] arXiv:2301.05465 [pdf, other]
-
Title: Explicit Temporal Embedding in Deep Generative Latent Models for Longitudinal Medical Image SynthesisSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [327] arXiv:2301.05489 [pdf, other]
-
Title: A Residual Diffusion Model for High Perceptual Quality Codec AugmentationComments: v1: 26 pages, 13 figures v2: corrected typo in first author name in arxiv metadata v3: major paper update to add base codecs and lpips lossSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [328] arXiv:2301.05496 [pdf, other]
-
Title: Learning Transformations To Reduce the Geometric Shift in Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [329] arXiv:2301.05499 [pdf, other]
-
Title: CLIP the Gap: A Single Domain Generalization Approach for Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [330] arXiv:2301.05500 [pdf, other]
-
Title: RCPS: Rectified Contrastive Pseudo Supervision for Semi-Supervised Medical Image SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [331] arXiv:2301.05526 [pdf, other]
-
Title: Self-Training Guided Disentangled Adaptation for Cross-Domain Remote Sensing Image Semantic SegmentationComments: 19 pages, 9 figures, 8 tables, 22 formulasSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [332] arXiv:2301.05528 [pdf, ps, other]
-
Title: Development of a Prototype Application for Rice Disease Detection Using Convolutional Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [333] arXiv:2301.05565 [pdf, ps, other]
-
Title: DINF: Dynamic Instance Noise Filter for Occluded Pedestrian DetectionComments: 15 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [334] arXiv:2301.05575 [pdf, other]
-
Title: Deep learning-based approaches for human motion decoding in smart walkers for rehabilitationAuthors: Carolina Gonçalves, João M. Lopes, Sara Moccia, Daniele Berardini, Lucia Migliorelli, Cristina P. SantosSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [335] arXiv:2301.05586 [pdf, other]
-
Title: YOLOv6 v3.0: A Full-Scale ReloadingAuthors: Chuyi Li, Lulu Li, Yifei Geng, Hongliang Jiang, Meng Cheng, Bo Zhang, Zaidan Ke, Xiaoming Xu, Xiangxiang ChuComments: Tech Report. arXiv admin note: text overlap with arXiv:2209.02976Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [336] arXiv:2301.05623 [pdf, ps, other]
-
Title: Reworking geometric morphometrics into a methodology of transformation gridsAuthors: Fred L. BooksteinComments: 44 pages including 19 figures; under review by Evolutionary BiologySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [337] arXiv:2301.05624 [pdf, other]
-
Title: Layout-guided Indoor Panorama Inpainting with Plane-aware NormalizationComments: Accepted by ACCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [338] arXiv:2301.05709 [pdf, other]
-
Title: Self-Supervised Image-to-Point Distillation via Semantically Tolerant Contrastive LossComments: Accepted in CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [339] arXiv:2301.05711 [pdf, other]
-
Title: OA-BEV: Bringing Object Awareness to Bird's-Eye-View Representation for Multi-Camera 3D Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [340] arXiv:2301.05747 [pdf, other]
-
Title: Laser: Latent Set Representations for 3D Generative ModelingAuthors: Pol Moreno, Adam R. Kosiorek, Heiko Strathmann, Daniel Zoran, Rosalia G. Schneider, Björn Winckler, Larisa Markeeva, Théophane Weber, Danilo J. RezendeComments: See this https URL for video resultsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [341] arXiv:2301.05768 [pdf, other]
-
Title: RxRx1: A Dataset for Evaluating Experimental Batch Correction MethodsAuthors: Maciej Sypetkowski, Morteza Rezanejad, Saber Saberian, Oren Kraus, John Urbanik, James Taylor, Ben Mabey, Mason Victors, Jason Yosinski, Alborz Rezazadeh Sereshkeh, Imran Haque, Berton EarnshawSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [342] arXiv:2301.05776 [pdf, other]
-
Title: Young Labeled Faces in the Wild (YLFW): A Dataset for Children Faces RecognitionComments: 11 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [343] arXiv:2301.05792 [pdf, other]
-
Title: RMM: Reinforced Memory Management for Class-Incremental LearningComments: NeurIPS 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [344] arXiv:2301.05796 [pdf, other]
-
Title: Learning Trajectory-Conditioned Relations to Predict Pedestrian Crossing BehaviorSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [345] arXiv:2301.05804 [pdf, ps, other]
-
Title: Salient Sign Detection In Safe Autonomous Driving: AI Which Reasons Over Full Visual ContextSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [346] arXiv:2301.05805 [pdf, ps, other]
-
Title: Safe Control Transitions: Machine Vision Based Observable Readiness Index and Data-Driven Takeover Time PredictionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
- [347] arXiv:2301.05819 [pdf, other]
-
Title: Deepfake Detection using Biological Features: A SurveySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
- [348] arXiv:2301.05838 [pdf, ps, other]
-
Title: (Safe) SMART Hands: Hand Activity Analysis and Distraction Alerts Using a Multi-Camera FrameworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
- [349] arXiv:2301.05839 [pdf, other]
-
Title: NCP: Neural Correspondence Prior for Effective Unsupervised Shape MatchingComments: NeurIPS 2022, 10 pages, 9 figuresJournal-ref: 2022 Advances in Neural Information Processing Systems (NeurIPS)Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [350] arXiv:2301.05842 [pdf, ps, other]
-
Title: CHAMP: Crowdsourced, History-Based Advisory of Mapped Pedestrians for Safer Driver Assistance SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [351] arXiv:2301.05845 [pdf, other]
-
Title: ${S}^{2}$Net: Accurate Panorama Depth Estimation on Spherical SurfaceComments: Accepted by IEEE Robotics and Automation LettersSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Robotics (cs.RO)
- [352] arXiv:2301.05856 [pdf, other]
-
Title: EARL: An Elliptical Distribution aided Adaptive Rotation Label Assignment for Oriented Object Detection in Remote Sensing ImagesJournal-ref: IEEE Transactions on Geoscience and Remote Sensing, vol. 61, pp. 1-15, 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [353] arXiv:2301.05858 [pdf, other]
-
Title: Robust Remote Sensing Scene Classification with Multi-View Voting and Entropy RankingComments: Paper accepted by the 4th International Conference on Machine Learning for Cyber Security (ML4CS 2022), Guangzhou, ChinaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [354] arXiv:2301.05865 [pdf, other]
-
Title: Gated Self-supervised Learning For Improving Supervised LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [355] arXiv:2301.05871 [pdf, other]
-
Title: Dyna-DepthFormer: Multi-frame Transformer for Self-Supervised Depth Estimation in Dynamic ScenesComments: ICRA 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [356] arXiv:2301.05892 [pdf, other]
-
Title: Object Detection performance variation on compressed satellite image datasets with iquaflowSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [357] arXiv:2301.05897 [pdf, ps, other]
-
Title: Model-based Transfer Learning for Automatic Optical Inspection based on domain discrepancyAuthors: Erik Isai Valle Salgado, Haoxin Yan, Yue Hong, Peiyuan Zhu, Shidong Zhu, Chengwei Liao, Yanxiang Wen, Xiu Li, Xiang Qian, Xiaohao Wang, Xinghui LiComments: This is a fix of the published paper "Relational-based transfer learning for automatic optical inspection based on domain discrepancy"Journal-ref: Proc. SPIE 12317, Optoelectronic Imaging and Multimedia Technology IX, 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [358] arXiv:2301.05935 [pdf, other]
-
Title: End-to-End Page-Level Assessment of Handwritten Text RecognitionComments: Published in Pattern RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [359] arXiv:2301.05938 [pdf, ps, other]
-
Title: Deep Learning Provides Rapid Screen for Breast Cancer Metastasis with Sentinel Lymph NodesAuthors: Kareem Allam, Xiaohong Iris Wang, Songlin Zhang, Jianmin Ding, Kevin Chiu, Karan Saluja, Amer Wahed, Hongxia Sun, Andy N.D. NguyenComments: 9 pages, 3 figures, 5 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
- [360] arXiv:2301.05957 [pdf, other]
-
Title: Towards Spatial Equilibrium Object DetectionComments: Our source codes are publicly available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [361] arXiv:2301.05993 [pdf, other]
-
Title: Empirical study of the modulus as activation function in computer vision applicationsAuthors: Iván Vallés-Pérez, Emilio Soria-Olivas, Marcelino Martínez-Sober, Antonio J. Serrano-López, Joan Vila-Francés, Juan Gómez-SanchísComments: Accepted at Engineering Applications of AISubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [362] arXiv:2301.05994 [pdf, other]
-
Title: Min-Max-Jump distance and its applicationsAuthors: Gangli LiuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [363] arXiv:2301.05997 [pdf, other]
-
Title: Exploiting Auxiliary Caption for Video GroundingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [364] arXiv:2301.06002 [pdf, other]
-
Title: ACTIVE: A Deep Model for Sperm and Impurity Detection in Microscopic VideosAuthors: Ao Chen, Jinghua Zhang, Md Mamunur Rahaman, Hongzan Sun, M.D., Tieyong Zeng, Marcin Grzegorzek, Feng-Lei Fan, Chen LiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [365] arXiv:2301.06013 [pdf, other]
-
Title: Rethinking Precision of Pseudo Label: Test-Time Adaptation via Complementary LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [366] arXiv:2301.06015 [pdf, other]
-
Title: Diffusion-based Generation, Optimization, and Planning in 3D ScenesAuthors: Siyuan Huang, Zan Wang, Puhao Li, Baoxiong Jia, Tengyu Liu, Yixin Zhu, Wei Liang, Song-Chun ZhuComments: 20 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [367] arXiv:2301.06018 [pdf, other]
-
Title: CMAE-V: Contrastive Masked Autoencoders for Video Action RecognitionComments: Technical ReportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [368] arXiv:2301.06020 [pdf, other]
-
Title: Delving Deep into Pixel Alignment Feature for Accurate Multi-view Human Mesh RecoveryComments: Project Page: this https URLJournal-ref: Proceedings of the AAAI Conference on Artificial Intelligence, 37(1), 989-997 (2023)Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [369] arXiv:2301.06051 [pdf, other]
-
Title: DSVT: Dynamic Sparse Voxel Transformer with Rotated SetsAuthors: Haiyang Wang, Chen Shi, Shaoshuai Shi, Meng Lei, Sen Wang, Di He, Bernt Schiele, Liwei WangComments: Accepted by CVPR2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [370] arXiv:2301.06052 [pdf, other]
-
Title: T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete RepresentationsAuthors: Jianrong Zhang, Yangsong Zhang, Xiaodong Cun, Shaoli Huang, Yong Zhang, Hongwei Zhao, Hongtao Lu, Xi ShenComments: Accepted to CVPR 2023. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [371] arXiv:2301.06077 [pdf, ps, other]
-
Title: MN-Pair Contrastive Damage Representation and Clustering for Prognostic ExplanationComments: 8 pages, 10 figures, 3 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [372] arXiv:2301.06080 [pdf, ps, other]
-
Title: Comprehensive Literature Survey on Deep Learning used in Image Memorability Prediction and ModificationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [373] arXiv:2301.06082 [pdf, other]
-
Title: A Survey on Human Action RecognitionAuthors: Zhou ShuchangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [374] arXiv:2301.06083 [pdf, other]
-
Title: Discrete Point-wise Attack Is Not Enough: Generalized Manifold Adversarial Attack for Face RecognitionComments: Accepted by CVPR2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [375] arXiv:2301.06084 [pdf, ps, other]
-
Title: Scattering-induced entropy boost for highly-compressed optical sensing and encryptionAuthors: Liheng Bian, Xinrui Zhan, Xuyang Chang, Daoyu Li, Rong Yan, Yinuo Zhang, Haowen Ruan, Jun ZhangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [376] arXiv:2301.06103 [pdf, other]
-
Title: Learning Sparse Temporal Video Mapping for Action Quality Assessment in Floor GymnasticsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [377] arXiv:2301.06115 [pdf, other]
-
Title: Learning to Compress Unmanned Aerial Vehicle (UAV) Captured Video: Benchmark and AnalysisComments: MPAI End-to-end Video group progress report, DCC 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [378] arXiv:2301.06116 [pdf, other]
-
Title: Maximally Compact and Separated Features with Regular Polytope NetworksComments: DEEPVISION 2019 CVPR 2019, LONG BEACH Sunday, 16th June @ Room "Terrace Theater" this https URL this https URL arXiv admin note: text overlap with arXiv:1902.10441Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [379] arXiv:2301.06122 [pdf, other]
-
Title: CORE: Learning Consistent Ordinal REpresentations for Image Ordinal EstimationComments: 13 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [380] arXiv:2301.06132 [pdf, other]
-
Title: Deep Diversity-Enhanced Feature Representation of Hyperspectral ImagesComments: 15 pages, 12 figures. arXiv admin note: substantial text overlap with arXiv:2207.04266Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [381] arXiv:2301.06143 [pdf, other]
-
Title: Multi-Camera Lighting Estimation for Photorealistic Front-Facing Mobile Augmented RealitySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [382] arXiv:2301.06152 [pdf, ps, other]
-
Title: Inpainting borehole images using Generative Adversarial NetworksComments: 4 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [383] arXiv:2301.06184 [pdf, other]
-
Title: LitAR: Visually Coherent Lighting for Mobile Augmented RealitySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [384] arXiv:2301.06187 [pdf, ps, other]
-
Title: CNN-Based Action Recognition and Pose Estimation for Classifying Animal Behavior from Videos: A SurveyComments: 29 pages, 20 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [385] arXiv:2301.06190 [pdf, other]
-
Title: BuildSeg: A General Framework for the Segmentation of BuildingsSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [386] arXiv:2301.06262 [pdf, other]
-
Title: Collaborative Perception in Autonomous Driving: Methods, Datasets and ChallengesComments: 18 pages, 6 figures. Accepted by IEEE Intelligent Transportation Systems Magazine. URL: this https URLJournal-ref: IEEE Intelligent Transportation Systems MagazineSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [387] arXiv:2301.06267 [pdf, other]
-
Title: Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal ModelsComments: CVPR 2023. Project website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [388] arXiv:2301.06269 [pdf, other]
-
Title: DarkVision: A Benchmark for Low-light Image/Video PerceptionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [389] arXiv:2301.06281 [pdf, other]
-
Title: DPE: Disentanglement of Pose and Expression for General Video Portrait EditingComments: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [390] arXiv:2301.06286 [pdf, other]
-
Title: Meta Generative Attack on Person ReidentificationAuthors: A V SubramanyamSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [391] arXiv:2301.06293 [pdf, other]
-
Title: Representation Learning for Tablet and Paper Domain Adaptation in Favor of Online Handwriting RecognitionComments: Accepted at IAPR Intl. Workshop on Multimodal Pattern Recognition of Social Signals in Human Computer Interaction (MPRSS), Montreal, Canada, August 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [392] arXiv:2301.06309 [pdf, other]
-
Title: UATVR: Uncertainty-Adaptive Text-Video RetrievalAuthors: Bo Fang, Wenhao Wu, Chang Liu, Yu Zhou, Yuxin Song, Weiping Wang, Xiangbo Shu, Xiangyang Ji, Jingdong WangComments: To appear at ICCV2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [393] arXiv:2301.06324 [pdf, other]
-
Title: Img2Tab: Automatic Class Relevant Concept Discovery from StyleGAN Features for Explainable Image ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [394] arXiv:2301.06358 [pdf, other]
-
Title: Post-Train Adaptive U-Net for Image SegmentationAuthors: Kostiantyn KhabarlakSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [395] arXiv:2301.06372 [pdf, other]
-
Title: Disambiguation of One-Shot Visual Classification Tasks: A Simplex-Based ApproachSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [396] arXiv:2301.06392 [pdf, other]
-
Title: I See-Through You: A Framework for Removing Foreground Occlusion in Both Sparse and Dense Light Field ImagesComments: WACV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [397] arXiv:2301.06429 [pdf, other]
-
Title: Linguistic Query-Guided Mask Generation for Referring Image SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [398] arXiv:2301.06442 [pdf, other]
-
Title: Modeling Uncertain Feature Representation for Domain GeneralizationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [399] arXiv:2301.06443 [pdf, ps, other]
-
Title: Sparse resultant based minimal solvers in computer vision and their connection with the action matrixComments: arXiv admin note: text overlap with arXiv:1912.10268Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [400] arXiv:2301.06567 [pdf, other]
-
Title: Scalable Surface Water Mapping up to Fine-scale using Geometric Features of Water from Topographic Airborne LiDAR DataSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [401] arXiv:2301.06624 [pdf, other]
-
Title: TAAL: Test-time Augmentation for Active Learning in Medical Image SegmentationComments: Accepted to MICCAI-DALI 2022 (LNCS Proceedings, vol.13567), 11 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [402] arXiv:2301.06629 [pdf, other]
-
Title: Diverse Multimedia Layout Generation with Multi Choice LearningComments: 9 pagesJournal-ref: Proceedings of the 29th ACM International Conference on Multimedia 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [403] arXiv:2301.06648 [pdf, other]
-
Title: Neuromorphic High-Frequency 3D Dancing Pose Estimation in Dynamic EnvironmentAuthors: Zhongyang Zhang, Kaidong Chai, Haowen Yu, Ramzi Majaj, Francesca Walsh, Edward Wang, Upal Mahbub, Hava Siegelmann, Donghyun Kim, Tauhidur RahmanJournal-ref: Neurocomputing, Volume 547, 2023, 126388Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [404] arXiv:2301.06657 [src]
-
Title: Free Lunch for Generating Effective Outlier SupervisionComments: We have rewritten this paper, and published as "Image Background Serves as Good Proxy for Out-of-distribution Data" arXiv:2307.00519Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [405] arXiv:2301.06675 [pdf, other]
-
Title: Artificial intelligence as a gateway to scientific discovery: Uncovering features in retinal fundus imagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [406] arXiv:2301.06678 [pdf, other]
-
Title: Feature-based Image Matching for Identifying Individual KākāComments: 42 pages, honours report from Victoria University of WellingtonSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [407] arXiv:2301.06679 [pdf, other]
-
Title: Rethinking Lightweight Salient Object Detection via Network Depth-Width TradeoffSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [408] arXiv:2301.06680 [pdf, other]
-
Title: DIGITOUR: Automatic Digital Tours for Real-Estate PropertiesComments: Published at CODS-COMAD '23Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [409] arXiv:2301.06683 [pdf, other]
-
Title: Surgical Aggregation: Federated Class-Heterogeneous LearningComments: 9 pages, 7 figures, 4 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [410] arXiv:2301.06685 [pdf, other]
-
Title: Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image RetrievalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [411] arXiv:2301.06690 [pdf, other]
-
Title: Audio2Gestures: Generating Diverse Gestures from AudioComments: arXiv admin note: substantial text overlap with arXiv:2108.06720Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [412] arXiv:2301.06715 [pdf, other]
-
Title: SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded NetworkComments: ICRA 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [413] arXiv:2301.06719 [pdf, other]
-
Title: FemtoDet: An Object Detection Baseline for Energy Versus Performance TradeoffsComments: ICCV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [414] arXiv:2301.06733 [pdf, other]
-
Title: Face Inverse Rendering via Hierarchical DecouplingJournal-ref: IEEE Transactions on Image Processing, Volume: 31; Year: 2022; Page: 5748 - 5761Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [415] arXiv:2301.06782 [pdf, other]
-
Title: A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene ReconstructionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [416] arXiv:2301.06844 [pdf, other]
-
Title: USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text RetrievalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [417] arXiv:2301.06855 [pdf, other]
-
Title: Event-based Shape from PolarizationComments: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [418] arXiv:2301.06866 [pdf, other]
-
Title: Building Scalable Video Understanding Benchmarks through SportsAuthors: Aniket Agarwal, Alex Zhang, Karthik Narasimhan, Igor Gilitschenski, Vishvak Murahari, Yash KantSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [419] arXiv:2301.06869 [pdf, other]
-
Title: SAT: Size-Aware Transformer for 3D Point Cloud Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [420] arXiv:2301.06874 [pdf, ps, other]
-
Title: Training Methods of Multi-label Prediction Classifiers for Hyperspectral Remote Sensing ImagesComments: 1- Added references. 2- Updated methodology figure and added new figures to visualise the different training schemes. 3- Corrected typos. 4- Revised introduction; no change in results or discussionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [421] arXiv:2301.06892 [pdf, ps, other]
-
Title: Cooperation Learning Enhanced Colonic Polyp Segmentation Based on Transformer-CNN FusionComments: This paper has been submitted to a journalSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [422] arXiv:2301.06910 [pdf, other]
-
Title: BSNet: Lane Detection via Draw B-spline Curves NearbyComments: 10 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [423] arXiv:2301.06936 [pdf, other]
-
Title: The use of Octree in point cloud analysis with application to cultural heritageComments: 6 pages, 12 figures, 7 citationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [424] arXiv:2301.06944 [pdf, other]
-
Title: DR-WLC: Dimensionality Reduction cognition for object detection and pose estimation by Watching, Learning and CheckingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [425] arXiv:2301.06958 [pdf, other]
-
Title: RILS: Masked Visual Reconstruction in Language Semantic SpaceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [426] arXiv:2301.06962 [pdf, other]
-
Title: Long Range Pooling for 3D Large-Scale Scene UnderstandingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [427] arXiv:2301.06975 [pdf, other]
-
Title: Vision Based Machine Learning Algorithms for Out-of-Distribution GeneralisationComments: Computing Conference, 22-23 June 2023, London, United Kingdom. 15 pages, 5 Figures, 3 TablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [428] arXiv:2301.07002 [pdf, other]
-
Title: Opti-CAM: Optimizing saliency maps for interpretabilitySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [429] arXiv:2301.07037 [pdf, other]
-
Title: Explain What You See: Open-Ended Segmentation and Recognition of Occluded 3D ObjectsComments: Accepted at ICRA 2023 ConferenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [430] arXiv:2301.07053 [pdf, ps, other]
-
Title: Preserving Privacy in Surgical Video Analysis Using Artificial Intelligence: A Deep Learning Classifier to Identify Out-of-Body Scenes in Endoscopic VideosAuthors: Joël L. Lavanchy, Armine Vardazaryan, Pietro Mascagni, AI4SafeChole Consortium, Didier Mutter, Nicolas PadoyComments: Joël L. Lavanchy and Armine Vardazaryan contributed equally and share first co-authorshipJournal-ref: Scientific Reports 13, 9235 (2023)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [431] arXiv:2301.07074 [pdf, other]
-
Title: SegViz: A federated-learning based framework for multi-organ segmentation on heterogeneous data sets with partial annotationsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [432] arXiv:2301.07088 [pdf, other]
-
Title: Vision Learners Meet Web Image-Text PairsComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [433] arXiv:2301.07093 [pdf, other]
-
Title: GLIGEN: Open-Set Grounded Text-to-Image GenerationAuthors: Yuheng Li, Haotian Liu, Qingyang Wu, Fangzhou Mu, Jianwei Yang, Jianfeng Gao, Chunyuan Li, Yong Jae LeeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Graphics (cs.GR); Machine Learning (cs.LG)
- [434] arXiv:2301.07094 [pdf, other]
-
Title: Learning Customized Visual Models with Retrieval-Augmented KnowledgeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [435] arXiv:2301.07174 [pdf, other]
-
Title: Creating awareness about security and safety on highways to mitigate wildlife-vehicle collisions by detecting and recognizing wildlife fences using deep learning and drone technologyAuthors: Irene Nandutu, Marcellin Atemkeng, Patrice Okouma, Nokubonga Mgqatsa, Jean Louis Ebongue Kedieng Fendji, Franklin TchakounteSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [436] arXiv:2301.07178 [pdf, other]
-
Title: Using Large Text-to-Image Models with Structured Prompts for Skin Disease Identification: A Case StudySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [437] arXiv:2301.07213 [pdf, other]
-
Title: SCARP: 3D Shape Completion in ARbitrary Poses for Improved GraspingComments: Accepted at ICRA 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [438] arXiv:2301.07236 [pdf, other]
-
Title: Effective End-to-End Vision Language Pretraining with Semantic Visual LossSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [439] arXiv:2301.07247 [pdf, other]
-
Title: Tailor: Altering Skip Connections for Resource-Efficient InferenceAuthors: Olivia Weng, Gabriel Marcano, Vladimir Loncar, Alireza Khodamoradi, Nojan Sheybani, Andres Meza, Farinaz Koushanfar, Kristof Denolf, Javier Mauricio Duarte, Ryan KastnerSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [440] arXiv:2301.07266 [pdf, ps, other]
-
Title: ACQ: Improving Generative Data-free Quantization Via Attention CorrectionAuthors: Jixing Li, Xiaozhou Guo, Benzhe Dai, Guoliang Gong, Min Jin, Gang Chen, Wenyu Mao, Huaxiang LuSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [441] arXiv:2301.07283 [pdf, other]
-
Title: Contrastive Learning for Self-Supervised Pre-Training of Point Cloud Segmentation Networks With Image DataComments: In Proceedings of the Conference on Robots and Vision (CRV'23), Montreal, Canada, Jun. 6-8, 2023. arXiv admin note: substantial text overlap with arXiv:2211.11801Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [442] arXiv:2301.07301 [pdf, other]
-
Title: PTA-Det: Point Transformer Associating Point cloud and Image for 3D Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [443] arXiv:2301.07315 [pdf, other]
-
Title: Face Recognition in the age of CLIP & Billion image datasetsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [444] arXiv:2301.07316 [pdf, other]
-
Title: Adaptively Integrated Knowledge Distillation and Prediction Uncertainty for Continual LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [445] arXiv:2301.07320 [pdf, other]
-
Title: Robust Knowledge Adaptation for Federated Unsupervised Person ReIDSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [446] arXiv:2301.07322 [pdf, other]
-
Title: HSTFormer: Hierarchical Spatial-Temporal Transformers for 3D Human Pose EstimationComments: The first two authors have equal contributionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [447] arXiv:2301.07329 [pdf, other]
-
Title: Deep Dynamic Scene Deblurring from Optical FlowAuthors: Jiawei Zhang, Jinshan Pan, Daoye Wang, Shangchen Zhou, Xing Wei, Furong Zhao, Jianbo Liu, Jimmy RenComments: Accepted by TCSVTSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [448] arXiv:2301.07330 [pdf, other]
-
Title: FPANet: Frequency-based Video Demoireing using Frame-level Post AlignmentSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [449] arXiv:2301.07336 [pdf, other]
-
Title: Class Enhancement Losses with Pseudo Labels for Zero-shot Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [450] arXiv:2301.07340 [pdf, other]
-
Title: Semi-Supervised Semantic Segmentation via Gentle Teaching AssistantComments: NeurIPS2022 camera readySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [451] arXiv:2301.07354 [pdf, other]
-
Title: MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation SegmentationAuthors: Munan Ning, Donghuan Lu, Yujia Xie, Dongdong Chen, Dong Wei, Yefeng Zheng, Yonghong Tian, Shuicheng Yan, Li YuanComments: Accepted by TPAMI-IEEE Transactions on Pattern Analysis and Machine Intelligence. arXiv admin note: substantial text overlap with arXiv:2108.08012Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [452] arXiv:2301.07382 [pdf, other]
-
Title: ViT-AE++: Improving Vision Transformer Autoencoder for Self-supervised Medical Image RepresentationsAuthors: Chinmay Prabhakar, Hongwei Bran Li, Jiancheng Yang, Suprosana Shit, Benedikt Wiestler, Bjoern MenzeComments: Accepted in MIDL 2023. C. Prabhakar and H. B. Li contribute equally. Codes here: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [453] arXiv:2301.07385 [pdf, other]
-
Title: Three-dimensional reconstruction and characterization of bladder deformationsComments: 17 pages, 7 figures, full article paperSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [454] arXiv:2301.07389 [pdf, other]
-
Title: Towards Models that Can See and ReadSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [455] arXiv:2301.07405 [pdf, other]
-
Title: HiDAnet: RGB-D Salient Object Detection via Hierarchical Depth AwarenessSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [456] arXiv:2301.07407 [pdf, other]
-
Title: TAME: Attention Mechanism Based Feature Fusion for Generating Explanation Maps of Convolutional Neural NetworksComments: Accepted for publication in the proceedings of IEEE Int. Symposium on Multimedia (ISM), Naples, Italy, Dec. 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [457] arXiv:2301.07409 [pdf, other]
-
Title: Representing Noisy Image Without DenoisingSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [458] arXiv:2301.07431 [pdf, other]
-
Title: Sharp Eyes: A Salient Object Detector Working The Same Way as Human Visual CharacteristicsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [459] arXiv:2301.07463 [pdf, other]
-
Title: Temporal Perceiving Video-Language Pre-trainingSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [460] arXiv:2301.07464 [pdf, other]
-
Title: CLIPTER: Looking at the Bigger Picture in Scene Text RecognitionAuthors: Aviad Aberdam, David Bensaïd, Alona Golts, Roy Ganz, Oren Nuriel, Royee Tichauer, Shai Mazor, Ron LitmanComments: Accepted for publication by ICCV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [461] arXiv:2301.07468 [pdf, other]
-
Title: Model-based inexact graph matching on top of CNNs for semantic scene understandingComments: 27 pages, 9 figures, 11 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [462] arXiv:2301.07485 [pdf, other]
-
Title: Image Embedding for Denoising Generative ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [463] arXiv:2301.07499 [pdf, other]
-
Title: A Comprehensive Review of Modern Object Segmentation ApproachesComments: 173 pages, 49 figures, published in Foundations and Trends in Computer Graphics and Vision on 10/4/22. Authors retain copyrightJournal-ref: Foundations and Trends in Computer Graphics and Vision: Vol. 13: No. 2-3, pp 111-283Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [464] arXiv:2301.07519 [pdf, other]
-
Title: Site-specific weed management in corn using UAS imagery analysis and computer vision techniquesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [465] arXiv:2301.07525 [pdf, other]
-
Title: OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and GenerationAuthors: Tong Wu, Jiarui Zhang, Xiao Fu, Yuxin Wang, Jiawei Ren, Liang Pan, Wayne Wu, Lei Yang, Jiaqi Wang, Chen Qian, Dahua Lin, Ziwei LiuComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [466] arXiv:2301.07533 [pdf, other]
-
Title: A Multi-Scale Framework for Out-of-Distribution Detection in Dermoscopic ImagesComments: Paper accepted by the 4th International Conference on Machine Learning for Cyber Security (ML4CS 2022), Guangzhou, ChinaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [467] arXiv:2301.07565 [pdf, other]
-
Title: Gated-ViGAT: Efficient Bottom-Up Event Recognition and Explanation Using a New Frame Selection Policy and Gating MechanismComments: Accepted for publication in the proceedings of IEEE Int. Symposium on Multimedia (ISM), Naples, Italy, Dec. 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [468] arXiv:2301.07581 [pdf, other]
-
Title: Blur Invariants for Image RecognitionComments: 15 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [469] arXiv:2301.07583 [pdf, other]
-
Title: A Survey of Advanced Computer Vision Techniques for SportsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [470] arXiv:2301.07584 [pdf, other]
-
Title: Joint Representation Learning for Text and 3D Point CloudSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [471] arXiv:2301.07613 [pdf, ps, other]
-
Title: Development, Optimization, and Deployment of Thermal Forward Vision Systems for Advance Vehicular Applications on Edge DevicesComments: The paper is accepted and in the publication phase at ICMV 2022 Conference. Link: this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [472] arXiv:2301.07627 [pdf, ps, other]
-
Title: A novel dataset and a two-stage mitosis nuclei detection method based on hybrid anchor branchComments: 22 pages, 10 figures, 8 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [473] arXiv:2301.07634 [pdf, other]
-
Title: Training Semantic Segmentation on Heterogeneous DatasetsComments: Submitted 2021 (under review)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [474] arXiv:2301.07650 [pdf, ps, other]
-
Title: Facial Thermal and Blood Perfusion Patterns of Human Emotions: Proof-of-ConceptAuthors: Victor H. Aristizabal-Tique (1), Marcela Henao-Perez (2), Diana Carolina Lopez-Medina (2), Renato Zambrano-Cruz (3), Gloria Diaz-Londoñod (4) ((1) School of Engineering - Universidad Cooperativa de Colombia - Medellin - Colombia, (2) School of Medicine - Universidad Cooperativa de Colombia - Medellin - Colombia, (3) School of Psychology - Universidad Cooperativa de Colombia - Medellin - Colombia, (4) School of Science - Universidad Nacional de Colombia - Medellin - Colombia)Comments: 22 pages, 9 figuresJournal-ref: Journal of Thermal Biology, Vol. 112, 2023, pp. 103464Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [475] arXiv:2301.07652 [pdf, other]
-
Title: HMDO: Markerless Multi-view Hand Manipulation Capture with Deformable ObjectsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [476] arXiv:2301.07666 [pdf, other]
-
Title: DDS: Decoupled Dynamic Scene-Graph Generation NetworkComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [477] arXiv:2301.07668 [pdf, other]
-
Title: Behind the Scenes: Density Fields for Single View ReconstructionComments: Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [478] arXiv:2301.07670 [pdf, other]
-
Title: Active learning for medical image segmentation with stochastic batchesComments: Accepted to Medical Image Analysis, 17 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [479] arXiv:2301.07673 [pdf, other]
-
Title: OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD ModelsComments: Accepted to NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [480] arXiv:2301.07700 [pdf, ps, other]
-
Title: Attention2Minority: A salient instance inference-based multiple instance learning for classifying small lesions in whole slide imagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [481] arXiv:2301.07702 [pdf, other]
-
Title: Learning 3D-aware Image Synthesis with Unknown Pose DistributionAuthors: Zifan Shi, Yujun Shen, Yinghao Xu, Sida Peng, Yiyi Liao, Sheng Guo, Qifeng Chen, Dit-Yan YeungComments: CVPR 2023. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [482] arXiv:2301.07805 [pdf, other]
-
Title: Multi-target multi-camera vehicle tracking using transformer-based camera link model and spatial-temporal informationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [483] arXiv:2301.07807 [pdf, other]
-
Title: Measuring uncertainty in human visual segmentationComments: 32 pages, 9 figures, 5 appendix, 5 figures in appendixSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
- [484] arXiv:2301.07836 [pdf, other]
-
Title: Masked Autoencoding Does Not Help Natural Language Supervision at ScaleComments: Accepted at CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [485] arXiv:2301.07845 [pdf, other]
-
Title: Foresee What You Will Learn: Data Augmentation for Domain Generalization in Non-stationary EnvironmentComments: 12 pages, 6 figures, accepted by AAAI 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [486] arXiv:2301.07861 [pdf, other]
-
Title: Improving Food Detection For Images From a Wearable Egocentric CameraComments: 6 pages, 6 figures, Conference Paper for Imaging and Multimedia Analytics in a Web and Mobile World Conference, IS&T Electronic Imaging Symposium, Burlingame, CA (Virtual), January, 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [487] arXiv:2301.07868 [pdf, other]
-
Title: Multimodal Video Adapter for Parameter Efficient Video Text RetrievalAuthors: Bowen Zhang, Xiaojie Jin, Weibo Gong, Kai Xu, Zhao Zhang, Peng Wang, Xiaohui Shen, Jiashi FengSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [488] arXiv:2301.07870 [pdf, other]
-
Title: Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View PerceptionAuthors: Bin Huang, Yangguang Li, Enze Xie, Feng Liang, Luya Wang, Mingzhu Shen, Fenggang Liu, Tianqi Wang, Ping Luo, Jing ShaoComments: Accepted by NeurIPS2022_ML4AD on October 22, 2022Journal-ref: NeurIPS2022_ML4ADSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [489] arXiv:2301.07879 [pdf, other]
-
Title: Unposed: Unsupervised Pose Estimation based Product Image RecommendationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [490] arXiv:2301.07921 [pdf, other]
-
Title: Spatio-Temporal Context Modeling for Road Obstacle DetectionComments: Paper accepted by the 4th International Conference on Machine Learning for Cyber Security (ML4CS 2022), Guangzhou, ChinaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [491] arXiv:2301.07923 [pdf, ps, other]
-
Title: Human-Scene Network: A Novel Baseline with Self-rectifying Loss for Weakly supervised Video Anomaly DetectionAuthors: Snehashis Majhi, Rui Dai, Quan Kong, Lorenzo Garattoni, Gianpiero Francesca, Francois BremondSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [492] arXiv:2301.07927 [pdf, other]
-
Title: Exploiting Style Transfer-based Task Augmentation for Cross-Domain Few-Shot LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [493] arXiv:2301.07944 [pdf, other]
-
Title: Revisiting the Spatial and Temporal Modeling for Few-shot Action RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [494] arXiv:2301.07947 [pdf, other]
-
Title: Point Cloud Data Simulation and Modelling with Aize WorkspaceComments: Extended abstract, Northern Lights Deep Learning Conference, 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [495] arXiv:2301.07958 [pdf, other]
-
Title: RecolorNeRF: Layer Decomposed Radiance Fields for Efficient Color Editing of 3D ScenesComments: To appear in ACM Multimedia 2023. Project website is accessible at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [496] arXiv:2301.07969 [pdf, other]
-
Title: Fast Inference in Denoising Diffusion Models via MMD FinetuningSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [497] arXiv:2301.08044 [pdf, other]
-
Title: Reference Guided Image Inpainting using Facial AttributesComments: BMVC 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [498] arXiv:2301.08064 [pdf, other]
-
Title: Position Regression for Unsupervised Anomaly DetectionJournal-ref: Proceedings of The 5th International Conference on Medical Imaging with Deep Learning, PMLR 172:160-172, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [499] arXiv:2301.08067 [pdf, ps, other]
-
Title: Interpreting CNN Predictions using Conditional Generative Adversarial NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [500] arXiv:2301.08072 [pdf, other]
-
Title: Dif-Fusion: Towards High Color Fidelity in Infrared and Visible Image Fusion with Diffusion ModelsComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [501] arXiv:2301.08092 [pdf, other]
-
Title: RNAS-CL: Robust Neural Architecture Search by Cross-Layer Knowledge DistillationComments: 17 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [502] arXiv:2301.08113 [pdf, other]
-
Title: Soft Thresholding for Visual Image EnhancementAuthors: Christoph DalitzComments: 6 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [503] arXiv:2301.08125 [pdf, other]
-
Title: Diagnose Like a Pathologist: Transformer-Enabled Hierarchical Attention-Guided Multiple Instance Learning for Whole Slide Image ClassificationComments: Accepted to IJCAI2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [504] arXiv:2301.08140 [pdf, other]
-
Title: Regularising disparity estimation via multi task learning with structured light reconstructionJournal-ref: Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [505] arXiv:2301.08141 [pdf, other]
-
Title: Self-supervised Learning for Segmentation and Quantification of Dopamine Neurons in Parkinson's DiseaseAuthors: Fatemeh Haghighi, Soumitra Ghosh, Hai Ngu, Sarah Chu, Han Lin, Mohsen Hejrati, Baris Bingol, Somaye HashemifarSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [506] arXiv:2301.08147 [pdf, other]
-
Title: RGB-D-Based Categorical Object Pose and Shape Estimation: Methods, Datasets, and EvaluationComments: arXiv admin note: substantial text overlap with arXiv:2202.10346Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [507] arXiv:2301.08153 [pdf, other]
-
Title: SwiftAvatar: Efficient Auto-Creation of Parameterized Stylized Character on Arbitrary Avatar EnginesAuthors: Shizun Wang, Weihong Zeng, Xu Wang, Hao Yang, Li Chen, Yi Yuan, Yunzhao Zeng, Min Zheng, Chuang Zhang, Ming WuComments: AAAI 2023 OralSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [508] arXiv:2301.08160 [pdf, other]
-
Title: FECANet: Boosting Few-Shot Semantic Segmentation with Feature-Enhanced Context-Aware NetworkComments: accepted by IEEE Transactions on MultimediaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [509] arXiv:2301.08189 [pdf, other]
-
Title: Benchmarking YOLOv5 and YOLOv7 models with DeepSORT for droplet tracking applicationsAuthors: Mihir Durve, Sibilla Orsini, Adriano Tiribocchi, Andrea Montessori, Jean-Michel Tucny, Marco Lauricella, Andrea Camposeo, Dario Pisignano, Sauro SucciComments: 13 pages, 4 figures, 3 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Fluid Dynamics (physics.flu-dyn)
- [510] arXiv:2301.08229 [pdf, other]
-
Title: Estimating Remaining Lifespan from the FaceAuthors: Amir FekrazadComments: 15 pages, 15 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [511] arXiv:2301.08237 [pdf, other]
-
Title: LoCoNet: Long-Short Context Network for Active Speaker DetectionComments: tech reportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [512] arXiv:2301.08243 [pdf, other]
-
Title: Self-Supervised Learning from Images with a Joint-Embedding Predictive ArchitectureAuthors: Mahmoud Assran, Quentin Duval, Ishan Misra, Piotr Bojanowski, Pascal Vincent, Michael Rabbat, Yann LeCun, Nicolas BallasComments: 2023 IEEE/CVF International Conference on Computer VisionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [513] arXiv:2301.08245 [pdf, other]
-
Title: Booster: a Benchmark for Depth from Images of Specular and Transparent SurfacesAuthors: Pierluigi Zama Ramirez, Alex Costanzino, Fabio Tosi, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di StefanoComments: Extension of the paper "Open Challenges in Deep Stereo: the Booster Dataset" presented at CVPR 2022. Accepted at TPAMISubjects: Computer Vision and Pattern Recognition (cs.CV)
- [514] arXiv:2301.08247 [pdf, other]
-
Title: Multiview Compressive Coding for 3D ReconstructionComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [515] arXiv:2301.08317 [pdf, other]
-
Title: Ultrasound Plane Pose Regression: Assessing Generalized Pose Coordinates in the Fetal BrainAuthors: Chiara Di Vece, Maela Le Lous, Brian Dromey, Francisco Vasconcelos, Anna L David, Donald Peebles, Danail StoyanovComments: 13 pages, 9 figures, 2 tables. This article has been accepted for publication in IEEE Transactions on Medical Robotics and Bionics. This is the author's version which has not been fully edited and content may change prior to final publication. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [516] arXiv:2301.08390 [pdf, other]
-
Title: Open-Set Likelihood Maximization for Few-Shot LearningAuthors: Malik Boudiaf, Etienne Bennequin, Myriam Tami, Antoine Toubhans, Pablo Piantanida, Céline Hudelot, Ismail Ben AyedComments: CVPR 2023. Supersedes arXiv:2206.09236Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [517] arXiv:2301.08408 [pdf, other]
-
Title: Identity masking effectiveness and gesture recognition: Effects of eye enhancement in seeing through the maskComments: 8 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [518] arXiv:2301.08413 [pdf, other]
-
Title: Chaos to Order: A Label Propagation Perspective on Source-Free Domain AdaptationComments: Accepted by ACM MM2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [519] arXiv:2301.08414 [pdf, other]
-
Title: FG-Depth: Flow-Guided Unsupervised Monocular Depth EstimationComments: Accepted by ICRA2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [520] arXiv:2301.08433 [pdf, other]
-
Title: Unsupervised Light Field Depth Estimation via Multi-view Feature Matching with Occlusion PredictionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [521] arXiv:2301.08443 [pdf, other]
-
Title: DIFAI: Diverse Facial Inpainting using StyleGAN InversionComments: ICIP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [522] arXiv:2301.08455 [pdf, other]
-
Title: Spatial Steerability of GANs via Self-Supervision from DiscriminatorComments: This manuscript is a journal extension of our previous conference work (arXiv:2112.00718), submitted to TPAMISubjects: Computer Vision and Pattern Recognition (cs.CV)
- [523] arXiv:2301.08555 [pdf, other]
-
Title: Hybrid Open-set Segmentation with Synthetic Negative DataSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [524] arXiv:2301.08590 [pdf, other]
-
Title: Improving Sketch Colorization using Adversarial Segmentation ConsistencyComments: Under review at Pattern Recognition Letters. arXiv admin note: substantial text overlap with arXiv:2102.06192Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [525] arXiv:2301.08647 [pdf, other]
-
Title: Image Memorability Prediction with Vision TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [526] arXiv:2301.08664 [pdf, other]
-
Title: AccDecoder: Accelerated Decoding for Neural-enhanced Video AnalyticsComments: Accepted by 2023 IEEE INFOCOMSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
- [527] arXiv:2301.08669 [pdf, other]
-
Title: Holistically Explainable Vision TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [528] arXiv:2301.08730 [pdf, other]
-
Title: Novel-View Acoustic SynthesisAuthors: Changan Chen, Alexander Richard, Roman Shapovalov, Vamsi Krishna Ithapu, Natalia Neverova, Kristen Grauman, Andrea VedaldiComments: Accepted at CVPR 2023. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [529] arXiv:2301.08739 [pdf, other]
-
Title: FlatFormer: Flattened Window Attention for Efficient Point Cloud TransformerComments: CVPR 2023. The first two authors contributed equally to this work. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [530] arXiv:2301.08783 [pdf, other]
-
Title: An Asynchronous Intensity Representation for Framed and Event Video SourcesComments: 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [531] arXiv:2301.08800 [pdf, other]
-
Title: In-situ Water quality monitoring in Oil and Gas operationsComments: 15 pages, 8 figures, SPIE Defense + Commercial: Algorithms, Technologies, and Applications for Multispectral and Hyperspectral Imaging XXIXSubjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP); Computation (stat.CO); Methodology (stat.ME)
- [532] arXiv:2301.08802 [pdf, other]
-
Title: Impact of PCA-based preprocessing and different CNN structures on deformable registration of sonogramsComments: 8 pages, 7 figures. Presented at WSCG 2022Journal-ref: Computer Science Research Notes, CSRN 3201, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [533] arXiv:2301.08849 [pdf, other]
-
Title: CADA-GAN: Context-Aware GAN with Data AugmentationComments: Submitted to ETHDL2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [534] arXiv:2301.08874 [pdf, other]
-
Title: Improving Zero-Shot Action Recognition using Human Instruction with Text DescriptionComments: 18 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [535] arXiv:2301.08880 [pdf, other]
-
Title: A Large-scale Film Style Dataset for Learning Multi-frequency Driven Film EnhancementComments: Accepted by International Joint Conference on Artificial Intelligence (IJCAI 2023)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [536] arXiv:2301.08898 [pdf, other]
-
Title: Recurrent Generic Contour-based Instance Segmentation with Progressive LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [537] arXiv:2301.08915 [pdf, other]
-
Title: Improving Deep Regression with Ordinal EntropyComments: Accepted to ICLR 2023. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [538] arXiv:2301.08930 [pdf, other]
-
Title: Dense RGB SLAM with Neural Implicit MapsComments: Accepted by ICLR 2023; Camera-Ready Version; The code is at poptree.github.io/DIM-SLAMSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [539] arXiv:2301.08939 [pdf, other]
-
Title: Counterfactual Explanation and Instance-Generation using Cycle-Consistent Generative Adversarial NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [540] arXiv:2301.08951 [pdf, other]
-
Title: Time-Conditioned Generative Modeling of Object-Centric Representations for Video Decomposition and PredictionJournal-ref: Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence (UAI-23), pp.613-623, 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [541] arXiv:2301.08957 [pdf, other]
-
Title: Slice Transformer and Self-supervised Learning for 6DoF Localization in 3D Point Cloud MapsComments: Accepted in IEEE International Conference on Robotics and Automation (ICRA), 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [542] arXiv:2301.08965 [pdf, other]
-
Title: Raw or Cooked? Object Detection on RAW ImagesComments: SCIA 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [543] arXiv:2301.09007 [pdf, other]
-
Title: MultiNet with Transformers: A Model for Cancer Diagnosis Using ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [544] arXiv:2301.09015 [pdf, other]
-
Title: E$^3$Pose: Energy-Efficient Edge-assisted Multi-camera System for Multi-human 3D Pose EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
- [545] arXiv:2301.09045 [pdf, other]
-
Title: Champion Solution for the WSDM2023 Toloka VQA ChallengeComments: Technical report in WSDM Cup 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [546] arXiv:2301.09055 [pdf, ps, other]
-
Title: Resource-constrained FPGA Design for Satellite Component Feature ExtractionAuthors: Andrew Ekblad, Trupti Mahendrakar, Ryan T. White, Markus Wilde, Isaac Silver, Brooke WheelerComments: 9 pages, 7 figures, 4 tables, Accepted at IEEE Aerospace Conference 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [547] arXiv:2301.09060 [pdf, ps, other]
-
Title: 3D Reconstruction of Non-cooperative Resident Space Objects using Instant NGP-accelerated NeRF and D-NeRFComments: Presented at AAS/AIAA Spaceflight Mechanics Conference 2023, 14 pages, 10 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [548] arXiv:2301.09063 [pdf, other]
-
Title: DASTSiam: Spatio-Temporal Fusion and Discriminative Augmentation for Improved Siamese TrackingSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [549] arXiv:2301.09071 [pdf, other]
-
Title: Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal GroundingAuthors: Juncheng Li, Siliang Tang, Linchao Zhu, Wenqiao Zhang, Yi Yang, Tat-Seng Chua, Fei Wu, Yueting ZhuangComments: arXiv admin note: substantial text overlap with arXiv:2203.13049Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [550] arXiv:2301.09077 [pdf, other]
-
Title: Unleash the Potential of Image Branch for Cross-modal 3D Object DetectionComments: Accepted to NeurIPS 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [551] arXiv:2301.09091 [pdf, other]
-
Title: BallGAN: 3D-aware Image Synthesis with a Spherical BackgroundAuthors: Minjung Shin, Yunji Seo, Jeongmin Bae, Young Sun Choi, Hyunsu Kim, Hyeran Byun, Youngjung UhComments: ICCV 2023, Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [552] arXiv:2301.09120 [src]
-
Title: Causality-based Dual-Contrastive Learning Framework for Domain GeneralizationComments: Inadequate proof of the effectiveness of the methodSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [553] arXiv:2301.09121 [pdf, other]
-
Title: Learning Open-vocabulary Semantic Segmentation Models From Natural Language SupervisionComments: Accepted by CVPR2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [554] arXiv:2301.09123 [pdf, ps, other]
-
Title: Face Generation from Textual Features using Conditionally Trained Inputs to Generative Adversarial NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [555] arXiv:2301.09174 [pdf, other]
-
Title: MATT: Multimodal Attention Level Estimation for e-learning PlatformsAuthors: Roberto Daza, Luis F. Gomez, Aythami Morales, Julian Fierrez, Ruben Tolosana, Ruth Cobos, Javier Ortega-GarciaComments: Preprint of the paper presented to the Workshop on Artificial Intelligence for Education (AI4EDU) of AAAI 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [556] arXiv:2301.09190 [pdf, ps, other]
-
Title: Apples and Oranges? Assessing Image Quality over Content RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [557] arXiv:2301.09209 [pdf, other]
-
Title: Summarize the Past to Predict the Future: Natural Language Descriptions of Context Boost Multimodal Object Interaction AnticipationAuthors: Razvan-George Pasca, Alexey Gavryushin, Muhammad Hamza, Yen-Ling Kuo, Kaichun Mo, Luc Van Gool, Otmar Hilliges, Xi WangSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [558] arXiv:2301.09219 [pdf, other]
-
Title: Applied Deep Learning to Identify and Localize Polyps from Endoscopic ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [559] arXiv:2301.09249 [pdf, other]
-
Title: Exploring Active 3D Object Detection from a Generalization PerspectiveComments: To appear in ICLR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [560] arXiv:2301.09253 [pdf, other]
-
Title: CircNet: Meshing 3D Point Clouds with Circumcenter DetectionComments: accepted to ICLR2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [561] arXiv:2301.09254 [pdf, other]
-
Title: Learning to Linearize Deep Neural Networks for Secure and Efficient Private InferenceComments: 15 pages, 10 figures, 11 tables. Accepted as a conference paper at ICLR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [562] arXiv:2301.09255 [pdf, other]
-
Title: Combined Use of Federated Learning and Image Encryption for Privacy-Preserving Image Classification with Vision TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [563] arXiv:2301.09257 [pdf, other]
-
Title: Real-Time Simultaneous Localization and Mapping with LiDAR intensityComments: Accepted by ICRA 2023, code released: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [564] arXiv:2301.09266 [pdf, other]
-
Title: FInC Flow: Fast and Invertible $k \times k$ Convolutions for Normalizing FlowsComments: accepted: VISAPP'23Journal-ref: VISIGRAPP, 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [565] arXiv:2301.09268 [pdf, other]
-
Title: PCBDet: An Efficient Deep Neural Network Object Detection Architecture for Automatic PCB Component Detection on the EdgeAuthors: Brian Li (1), Steven Palayew (1), Francis Li (1), Saad Abbasi (1 and 2), Saeejith Nair (2), Alexander Wong (1 and 2) ((1) DarwinAI, (2) University of Waterloo)Comments: 7 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [566] arXiv:2301.09299 [pdf, other]
-
Title: Self-Supervised Image Representation Learning: Transcending Masking with Paired Image OverlaySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [567] arXiv:2301.09315 [pdf, ps, other]
-
Title: AI-Based Framework for Understanding Car Following Behaviors of Drivers in A Naturalistic Driving EnvironmentSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [568] arXiv:2301.09318 [pdf, other]
-
Title: Toward Foundation Models for Earth Monitoring: Generalizable Deep Learning Models for Natural Hazard SegmentationComments: Accepted at IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2023)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [569] arXiv:2301.09338 [pdf, other]
-
Title: Employing similarity to highlight differences: On the impact of anatomical assumptions in chest X-ray registration methodsAuthors: Astrid Berg, Eva Vandersmissen, Maria Wimmer, David Major, Theresa Neubauer, Dimitrios Lenis, Jeroen Cant, Annemiek Snoeckx, Katja BühlerJournal-ref: Computers in Biology and Medicine, Volume 154, 2023, 106543, ISSN 0010-4825Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [570] arXiv:2301.09339 [pdf, ps, other]
-
Title: Computer Vision for a Camel-Vehicle Collision Mitigation SystemSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [571] arXiv:2301.09376 [pdf, other]
-
Title: Crowd3D: Towards Hundreds of People Reconstruction from a Single ImageComments: Accepted by CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [572] arXiv:2301.09416 [pdf, other]
-
Title: Towards Robust Video Instance Segmentation with Temporal-Aware TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [573] arXiv:2301.09430 [pdf, other]
-
Title: RainDiffusion: When Unsupervised Learning Meets Diffusion Models for Real-world Image DerainingComments: 9 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [574] arXiv:2301.09451 [pdf, other]
-
Title: A Simple Recipe for Competitive Low-compute Self supervised Vision ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [575] arXiv:2301.09460 [pdf, other]
-
Title: HRVQA: A Visual Question Answering Benchmark for High-Resolution Aerial ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [576] arXiv:2301.09461 [pdf, other]
-
Title: Study on the identification limits of craniofacial superimpositionComments: 7 pages, 4 figures. To be submitted to Scientific ReportsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [577] arXiv:2301.09489 [pdf, other]
-
Title: Contracting Skeletal Kinematics for Human-Related Video Anomaly DetectionAuthors: Alessandro Flaborea, Guido D'Amely, Stefano D'Arrigo, Marco Aurelio Sterpa, Alessio Sampieri, Fabio GalassoComments: Submitted to Pattern Recognition JournalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [578] arXiv:2301.09498 [pdf, other]
-
Title: Triplet Contrastive Representation Learning for Unsupervised Vehicle Re-identificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [579] arXiv:2301.09506 [pdf, other]
-
Title: OvarNet: Towards Open-vocabulary Object Attribute RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [580] arXiv:2301.09522 [pdf, other]
-
Title: Optimising Event-Driven Spiking Neural Network with Regularisation and CutoffSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [581] arXiv:2301.09542 [pdf, other]
-
Title: Improving Presentation Attack Detection for ID Cards on Remote Verification SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [582] arXiv:2301.09595 [pdf, other]
-
Title: Zorro: the masked multimodal transformerAuthors: Adrià Recasens, Jason Lin, João Carreira, Drew Jaegle, Luyu Wang, Jean-baptiste Alayrac, Pauline Luc, Antoine Miech, Lucas Smaira, Ross Hemsley, Andrew ZissermanSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [583] arXiv:2301.09602 [pdf, other]
-
Title: Adapting the Hypersphere Loss Function from Anomaly Detection to Anomaly SegmentationComments: Submitted to the 2023 IEEE International Conference on Image Processing (ICIP 2023)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [584] arXiv:2301.09617 [pdf, other]
-
Title: Fully transformer-based biomarker prediction from colorectal cancer histology: a large-scale multicentric studyAuthors: Sophia J. Wagner, Daniel Reisenbüchler, Nicholas P. West, Jan Moritz Niehues, Gregory Patrick Veldhuizen, Philip Quirke, Heike I. Grabsch, Piet A. van den Brandt, Gordon G. A. Hutchins, Susan D. Richman, Tanwei Yuan, Rupert Langer, Josien Christina Anna Jenniskens, Kelly Offermans, Wolfram Mueller, Richard Gray, Stephen B. Gruber, Joel K. Greenson, Gad Rennert, Joseph D. Bonner, Daniel Schmolze, Jacqueline A. James, Maurice B. Loughrey, Manuel Salto-Tellez, Hermann Brenner, Michael Hoffmeister, Daniel Truhn, Julia A. Schnabel, Melanie Boxberg, Tingying Peng, Jakob Nikolas KatherComments: Updated Figure 2 and Table A.5Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [585] arXiv:2301.09620 [pdf, other]
-
Title: Tracking the industrial growth of modern China with high-resolution panchromatic imagery: A sequential convolutional approachComments: Fixed typosSubjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
- [586] arXiv:2301.09629 [pdf, other]
-
Title: LEGO-Net: Learning Regular Rearrangements of Objects in RoomsAuthors: Qiuhong Anna Wei, Sijie Ding, Jeong Joon Park, Rahul Sajnani, Adrien Poulenard, Srinath Sridhar, Leonidas GuibasComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [587] arXiv:2301.09632 [pdf, other]
-
Title: HexPlane: A Fast Representation for Dynamic ScenesComments: CVPR 2023, Camera Ready. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [588] arXiv:2301.09637 [pdf, other]
-
Title: InfiniCity: Infinite-Scale City SynthesisAuthors: Chieh Hubert Lin, Hsin-Ying Lee, Willi Menapace, Menglei Chai, Aliaksandr Siarohin, Ming-Hsuan Yang, Sergey TulyakovSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
- [589] arXiv:2301.09667 [pdf, other]
-
Title: Improving Performance of Object Detection using the Mechanisms of Visual Recognition in HumansSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [590] arXiv:2301.09724 [pdf, other]
-
Title: Long-tail Detection with Effective Class-MarginsComments: ECCV 2022 Oral. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [591] arXiv:2301.09850 [pdf, other]
-
Title: RD-NAS: Enhancing One-shot Supernet Ranking Ability via Ranking Distillation from Zero-cost ProxiesAuthors: Peijie Dong, Xin Niu, Lujun Li, Zhiliang Tian, Xiaodong Wang, Zimian Wei, Hengyue Pan, Dongsheng LiComments: 6 pages, 2 figures, 4 tables, ICASSP 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [592] arXiv:2301.09858 [pdf, other]
-
Title: PowerQuant: Automorphism Search for Non-Uniform QuantizationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [593] arXiv:2301.09869 [pdf, other]
-
Title: Image Super-Resolution using Efficient Striped Window TransformerAuthors: Jinpeng Shi, Hui Li, Tianle Liu, Yulong Liu, Mingjian Zhang, Jinchen Zhu, Ling Zheng, Shizhuang WengComments: SOTA lightweight super-resolution transformer. 8 pages, 9 figures and 6 tables. The Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [594] arXiv:2301.09878 [pdf, other]
-
Title: ODOR: The ICPR2022 ODeuropa Challenge on Olfactory Object RecognitionAuthors: Mathias Zinnen, Prathmesh Madhu, Ronak Kosti, Peter Bell, Andreas Maier, Vincent ChristleinComments: 6 pages, 6 figuresJournal-ref: 2022 26th International Conference on Pattern Recognition (ICPR), Montreal, QC, Canada, 2022, pp. 4989-4994Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [595] arXiv:2301.09879 [pdf, other]
-
Title: Data Augmentation Alone Can Improve Adversarial TrainingComments: published at conference ICLR2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [596] arXiv:2301.09906 [pdf, other]
-
Title: Transfer Learning for Olfactory Object DetectionComments: 6 pages, 4 figuresJournal-ref: 2022 Digital Humanities Conference, Tokyo, Japan, 2022, pp.409-413Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [597] arXiv:2301.09914 [pdf, other]
-
Title: Multimodal Interactive Lung Lesion Segmentation: A Framework for Annotating PET/CT Images based on Physiological and Anatomical CuesAuthors: Verena Jasmin Hallitschke, Tobias Schlumberger, Philipp Kataliakos, Zdravko Marinov, Moon Kim, Lars Heiliger, Constantin Seibold, Jens Kleesiek, Rainer StiefelhagenComments: Accepted at ISBI 2023; 5 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [598] arXiv:2301.09964 [pdf, other]
-
Title: Uncertainty-Aware Distillation for Semi-Supervised Few-Shot Class-Incremental LearningComments: Submitted to IEEE Transactions on Neural Networks and Learning SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [599] arXiv:2301.10008 [pdf, other]
-
Title: Few-shot Font Generation by Learning Style Difference and SimilarityComments: 11 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [600] arXiv:2301.10018 [pdf, other]
-
Title: GyroFlow+: Gyroscope-Guided Unsupervised Deep Homography and Optical Flow LearningComments: 12 pages. arXiv admin note: substantial text overlap with arXiv:2103.13725Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [601] arXiv:2301.10038 [pdf, other]
-
Title: Progressive Meta-Pooling Learning for Lightweight Image Classification ModelAuthors: Peijie Dong, Xin Niu, Zhiliang Tian, Lujun Li, Xiaodong Wang, Zimian Wei, Hengyue Pan, Dongsheng LiComments: 5 pages, 2 figures, ICASSP23Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [602] arXiv:2301.10048 [pdf, other]
-
Title: Exploiting Optical Flow Guidance for Transformer-Based Video InpaintingComments: Accepted to TPAMI. This manuscript is a journal extension of our ECCV 2022 paper (arXiv:2208.06768)Journal-ref: Zhang K, Peng J, Fu J, et al. Exploiting optical flow guidance for transformer-based video inpainting[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [603] arXiv:2301.10051 [pdf, other]
-
Title: Wise-IoU: Bounding Box Regression Loss with Dynamic Focusing MechanismSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [604] arXiv:2301.10052 [pdf, other]
-
Title: Event Detection in Football using Graph Convolutional NetworksAuthors: Aditya Sangram Singh RanaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [605] arXiv:2301.10057 [pdf, other]
-
Title: Planar Object Tracking via Weighted Optical FlowComments: WACV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [606] arXiv:2301.10062 [pdf, other]
-
Title: Proceedings of the 1st International Workshop on Reading Music SystemsComments: Proceedings edited by Jorge Calvo-Zaragoza, Jan Hajič jr. and Alexander PachaSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [607] arXiv:2301.10092 [pdf, other]
-
Title: Model soups to increase inference without increasing compute timeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [608] arXiv:2301.10100 [pdf, other]
-
Title: Using a Waffle Iron for Automotive Point Cloud Semantic SegmentationComments: Accepted at ICCV23. Code available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [609] arXiv:2301.10134 [pdf, other]
-
Title: Bipartite Graph Diffusion Model for Human Interaction GenerationComments: accepted in WACV 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [610] arXiv:2301.10208 [pdf, other]
-
Title: A Simple Adaptive Unfolding Network for Hyperspectral Image ReconstructionComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [611] arXiv:2301.10222 [pdf, other]
-
Title: RangeViT: Towards Vision Transformers for 3D Semantic Segmentation in Autonomous DrivingComments: CVPR 2023. Code at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [612] arXiv:2301.10241 [pdf, other]
-
Title: K-Planes: Explicit Radiance Fields in Space, Time, and AppearanceComments: Project page this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [613] arXiv:2301.10293 [pdf, ps, other]
-
Title: A Fast Feature Point Matching Algorithm Based on IMU SensorAuthors: Lu CaoComments: 6 pages, 4 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [614] arXiv:2301.10295 [pdf, other]
-
Title: Object Segmentation with Audio ContextComments: Research project for Introduction to Deep Learning (11785) at Carnegie Mellon UniversitySubjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [615] arXiv:2301.10351 [pdf, other]
-
Title: Few-Shot Learning Enables Population-Scale Analysis of Leaf Traits in Populus trichocarpaAuthors: John Lagergren, Mirko Pavicic, Hari B. Chhetri, Larry M. York, P. Doug Hyatt, David Kainer, Erica M. Rutter, Kevin Flores, Jack Bailey-Bale, Marie Klein, Gail Taylor, Daniel Jacobson, Jared StreichSubjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [616] arXiv:2301.10413 [pdf, other]
-
Title: Local Feature Extraction from Salient Regions by Feature Map TransformationComments: British Machine Vision Conference (BMVC) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [617] arXiv:2301.10431 [pdf, other]
-
Title: Bias-Compensated Integral Regression for Human Pose EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [618] arXiv:2301.10441 [pdf, other]
-
Title: Learning Trustworthy Model from Noisy Labels based on Rough Set for Surface Defect DetectionComments: 12 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [619] arXiv:2301.10460 [pdf, other]
-
Title: HAL3D: Hierarchical Active Learning for Fine-Grained 3D Part LabelingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [620] arXiv:2301.10473 [pdf, other]
-
Title: Aircraft Skin Inspections: Towards a New Model for Dent EvaluationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [621] arXiv:2301.10492 [pdf, other]
-
Title: Flow-guided Semi-supervised Video Object SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [622] arXiv:2301.10531 [pdf, other]
-
Title: 3D Tooth Mesh Segmentation with Simplified Mesh Cell RepresentationComments: accepted at IEEE ISBI 2023 International Symposium on Biomedical ImagingSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [623] arXiv:2301.10540 [pdf, other]
-
Title: Modelling Long Range Dependencies in $N$D: From Task-Specific to a General Purpose CNNAuthors: David M. Knigge, David W. Romero, Albert Gu, Efstratios Gavves, Erik J. Bekkers, Jakub M. Tomczak, Mark Hoogendoorn, Jan-Jakob SonkeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [624] arXiv:2301.10551 [pdf, other]
-
Title: Variation-Aware Semantic Image SynthesisComments: 12 pages, 3 figures, 5 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [625] arXiv:2301.10559 [pdf, other]
-
Title: Tracking Different Ant Species: An Unsupervised Domain Adaptation Framework and a Dataset for Multi-object TrackingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [626] arXiv:2301.10575 [pdf, other]
-
Title: Trainable Loss Weights in Super-ResolutionComments: 9 pages, 6 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [627] arXiv:2301.10583 [pdf, other]
-
Title: An Efficient Approximate Method for Online Convolutional Dictionary LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [628] arXiv:2301.10584 [pdf, other]
-
Title: A Method For Eliminating Contour Errors In Self-Encoder Reconstructed ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [629] arXiv:2301.10593 [pdf, other]
-
Title: Faster DAN: Multi-target Queries with Document Positional Encoding for End-to-end Handwritten Document RecognitionJournal-ref: International Conference on Document Analysis and Recognition - ICDAR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [630] arXiv:2301.10608 [pdf, other]
-
Title: Connecting metrics for shape-texture knowledge in computer visionComments: 7 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [631] arXiv:2301.10611 [pdf, other]
-
Title: Discriminator-free Unsupervised Domain Adaptation for Multi-label Image ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [632] arXiv:2301.10625 [pdf, other]
-
Title: Navigating the Pitfalls of Active Learning Evaluation: A Systematic Framework for Meaningful Performance AssessmentComments: Accepted at NeurIPS 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [633] arXiv:2301.10670 [pdf, other]
-
Title: Towards Arbitrary Text-driven Image Manipulation via Space AlignmentComments: 8 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [634] arXiv:2301.10732 [pdf, other]
-
Title: An Efficient Semi-Automated Scheme for Infrastructure LiDAR AnnotationComments: Submitted to IEEE Intelligent Transportation Systems TransactionsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [635] arXiv:2301.10750 [pdf, ps, other]
-
Title: Out of Distribution Performance of State of Art Vision ModelComments: incomplete work - need to complete itSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [636] arXiv:2301.10759 [pdf, other]
-
Title: Efficient Flow-Guided Multi-frame De-fencingComments: 16 pages, 12 figures. Published at the Winter Conference on Application of Computer Vision (WACV) 2023Journal-ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023, pp. 1838-1847Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [637] arXiv:2301.10766 [pdf, other]
-
Title: On the Adversarial Robustness of Camera-based 3D Object DetectionComments: Transactions on Machine Learning Research, 2024. ISSN 2835-8856Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [638] arXiv:2301.10847 [pdf, other]
-
Title: Enhancing Medical Image Segmentation with TransCeption: A Multi-Scale Feature Fusion ApproachComments: Submitted to IEEE TMI JournalSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [639] arXiv:2301.10863 [pdf, other]
-
Title: Shape Reconstruction from Thoracoscopic Images using Self-supervised Virtual LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [640] arXiv:2301.10877 [pdf, other]
-
Title: The Projection-Enhancement Network (PEN)Comments: Main text: 14 pages, 5 figures; Supplementary text: 4 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [641] arXiv:2301.10900 [pdf, other]
-
Title: Graph Contrastive Learning for Skeleton-based Action RecognitionAuthors: Xiaohu Huang, Hao Zhou, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang, Xinggang Wang, Wenyu Liu, Bin FengComments: Accepted by ICLR2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [642] arXiv:2301.10906 [pdf, other]
-
Title: Facial Expression Recognition using Squeeze and Excitation-powered Swin TransformersComments: arXiv admin note: text overlap with arXiv:2103.14030 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [643] arXiv:2301.10916 [pdf, other]
-
Title: ITstyler: Image-optimized Text-based Style TransferComments: 8 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [644] arXiv:2301.10922 [pdf, other]
-
Title: Detecting Building Changes with Off-Nadir Aerial ImagesJournal-ref: SCIENCE CHINA Information Sciences (SCIS) 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [645] arXiv:2301.10931 [pdf, other]
-
Title: Towards Continual Egocentric Activity Recognition: A Multi-modal Egocentric Activity Dataset for Continual LearningAuthors: Linfeng Xu, Qingbo Wu, Lili Pan, Fanman Meng, Hongliang Li, Chiyuan He, Hanxin Wang, Shaoxu Cheng, Yu DaiComments: IEEE Transactions on MultimediaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [646] arXiv:2301.10938 [pdf, other]
-
Title: Compact Transformer Tracker with Correlative Masked ModelingComments: AAAI2023 oralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [647] arXiv:2301.10939 [pdf, other]
-
Title: Affective Faces for Goal-Driven Dyadic CommunicationSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [648] arXiv:2301.10941 [pdf, other]
-
Title: GeCoNeRF: Few-shot Neural Radiance Fields via Geometric ConsistencyComments: ICML 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [649] arXiv:2301.10951 [pdf, other]
-
Title: Cross Modal Global Local Representation Learning from Radiology Reports and X-Ray Chest ImagesComments: Accepted to Computer-Aided Diagnosis, SPIE Medical Imaging 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [650] arXiv:2301.10957 [pdf, other]
-
Title: Neurorehab: An Interface for RehabilitationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [651] arXiv:2301.10972 [pdf, other]
-
Title: On the Importance of Noise Scheduling for Diffusion ModelsAuthors: Ting ChenComments: tech reportSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
- [652] arXiv:2301.11015 [pdf, other]
-
Title: Explore the Power of Dropout on Few-shot LearningComments: arXiv admin note: substantial text overlap with arXiv:2210.06409Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [653] arXiv:2301.11022 [pdf, other]
-
Title: Semantic Segmentation Enhanced Transformer Model for Human Attention PredictionAuthors: Shuo ZhangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [654] arXiv:2301.11063 [pdf, other]
-
Title: Rewarded meta-pruning: Meta Learning with Rewards for Channel PruningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [655] arXiv:2301.11093 [pdf, other]
-
Title: Simple diffusion: End-to-end diffusion for high resolution imagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [656] arXiv:2301.11100 [pdf, other]
-
Title: Vision-Language Models Performing Zero-Shot Tasks Exhibit Gender-based DisparitiesSubjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC)
- [657] arXiv:2301.11116 [pdf, other]
-
Title: Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge TransferringSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [658] arXiv:2301.11145 [pdf, other]
-
Title: Learning from Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Machine Learning (stat.ML)
- [659] arXiv:2301.11154 [pdf, other]
-
Title: Multitemporal and multispectral data fusion for super-resolution of Sentinel-2 imagesAuthors: Tomasz Tarasiewicz, Jakub Nalepa, Reuben A. Farrugia, Gianluca Valentino, Mang Chen, Johann A. Briffa, Michal KawulokComments: Submitted to IEEE Transactions On Geoscience And Remote SensingSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [660] arXiv:2301.11174 [pdf, other]
-
Title: Semi-Supervised Image Captioning by Adversarially Propagating Labeled DataComments: Journal extension of our EMNLP 2019 paper (arXiv:1909.02201)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [661] arXiv:2301.11180 [pdf, other]
-
Title: Low-Rank Winograd Transformation for 3D Convolutional Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [662] arXiv:2301.11233 [pdf, other]
-
Title: BiBench: Benchmarking and Analyzing Network BinarizationAuthors: Haotong Qin, Mingyuan Zhang, Yifu Ding, Aoyu Li, Zhongang Cai, Ziwei Liu, Fisher Yu, Xianglong LiuJournal-ref: 2023 International Conference on Machine LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [663] arXiv:2301.11274 [pdf, other]
-
Title: Self-Supervised RGB-T Tracking with Cross-Input ConsistencyComments: 12 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [664] arXiv:2301.11280 [pdf, other]
-
Title: Text-To-4D Dynamic Scene GenerationAuthors: Uriel Singer, Shelly Sheynin, Adam Polyak, Oron Ashual, Iurii Makarov, Filippos Kokkinos, Naman Goyal, Andrea Vedaldi, Devi Parikh, Justin Johnson, Yaniv TaigmanSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [665] arXiv:2301.11310 [pdf, other]
-
Title: Learning Good Features to Transfer Across Tasks and DomainsAuthors: Pierluigi Zama Ramirez, Adriano Cardace, Luca De Luigi, Alessio Tonioni, Samuele Salti, Luigi Di StefanoComments: Extended version of the paper "Learning Across Tasks and Domains" presented at ICCV 2019. Accepted at TPAMISubjects: Computer Vision and Pattern Recognition (cs.CV)
- [666] arXiv:2301.11315 [pdf, other]
-
Title: Evaluate underdiagnosis and overdiagnosis bias of deep learning model on primary open-angle glaucoma diagnosis in under-served patient populationsAuthors: Mingquan Lin, Yuyun Xiao, Bojian Hou, Tingyi Wanyan, Mohit Manoj Sharma, Zhangyang Wang, Fei Wang, Sarah Van Tassel, Yifan PengComments: 9 pages, 2 figures, Accepted by AMIA 2023 Informatics SummitJournal-ref: AMIA 2023 Informatics SummitSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [667] arXiv:2301.11320 [pdf, other]
-
Title: Cut and Learn for Unsupervised Object Detection and Instance SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [668] arXiv:2301.11326 [pdf, other]
-
Title: Unsupervised Volumetric AnimationAuthors: Aliaksandr Siarohin, Willi Menapace, Ivan Skorokhodov, Kyle Olszewski, Jian Ren, Hsin-Ying Lee, Menglei Chai, Sergey TulyakovSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [669] arXiv:2301.11357 [pdf, other]
-
Title: Multimodal Event Transformer for Image-guided Story Ending GenerationComments: EACL 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [670] arXiv:2301.11360 [pdf, other]
-
Title: The Power of Linear Combinations: Learning with Random ConvolutionsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [671] arXiv:2301.11362 [pdf, other]
-
Title: Improving Cross-modal Alignment for Text-Guided Image InpaintingComments: EACL 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [672] arXiv:2301.11367 [pdf, other]
-
Title: Style-Aware Contrastive Learning for Multi-Style Image CaptioningComments: Findings of EACL 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [673] arXiv:2301.11387 [pdf, other]
-
Title: Universal Domain Adaptation for Remote Sensing Image Scene ClassificationComments: 15 pages, 6 figures, IEEE Transactions on Geoscience and Remote SensingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [674] arXiv:2301.11417 [pdf, other]
-
Title: Are Labels Needed for Incremental Instance Learning?Comments: Accepted at CVPRW on CLVISION (Oral)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [675] arXiv:2301.11418 [pdf, other]
-
Title: Parkinson gait modelling from an anomaly deep representationComments: Journal not submitted to any editorialSubjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
- [676] arXiv:2301.11422 [pdf, other]
-
Title: RMSim: Controlled Respiratory Motion Simulation on Static Patient ScansComments: Physics in Medicine & Biology 2023. Last two authors contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [677] arXiv:2301.11431 [pdf, other]
-
Title: Semidefinite Relaxations for Robust Multiview TriangulationSubjects: Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
- [678] arXiv:2301.11445 [pdf, other]
-
Title: 3DShape2VecSet: A 3D Shape Representation for Neural Fields and Generative Diffusion ModelsComments: Accepted by SIGGRAPH 2023 (Journal Track), Project website: this https URL, Project demo: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [679] arXiv:2301.11454 [pdf, other]
-
Title: Boundary Aware U-Net for Glacier SegmentationComments: Vol. 4 (2023): Proceedings of the Northern Lights Deep Learning Workshop 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [680] arXiv:2301.11457 [pdf, other]
- [681] arXiv:2301.11495 [pdf, other]
-
Title: Skeleton-based Action Recognition through Contrasting Two-Stream Spatial-Temporal NetworksComments: 14 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [682] arXiv:2301.11497 [pdf, other]
-
Title: D$^2$CSG: Unsupervised Learning of Compact CSG Trees with Dual Complements and DropoutsComments: 9 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [683] arXiv:2301.11499 [pdf, ps, other]
-
Title: Dual-View Selective Instance Segmentation Network for Unstained Live Adherent Cells in Differential Interference Contrast ImagesAuthors: Fei Pan, Yutong Wu, Kangning Cui, Shuxun Chen, Yanfang Li, Yaofang Liu, Adnan Shakoor, Han Zhao, Beijia Lu, Shaohua Zhi, Raymond Chan, Dong SunComments: 13 pages, 5 figures, 3 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [684] arXiv:2301.11507 [pdf, other]
-
Title: Semi-Parametric Video-Grounded Text GenerationComments: Preprint (16 pages, 5 figures)Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [685] arXiv:2301.11513 [pdf, other]
-
Title: CellMix: A General Instance Relationship based Method for Data Augmentation Towards Pathology Image ClassificationAuthors: Tianyi Zhang, Zhiling Yan, Chunhui Li, Nan Ying, Yanli Lei, Yunlu Feng, Yu Zhao, Guanglei ZhangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [686] arXiv:2301.11514 [pdf, other]
-
Title: Deep Industrial Image Anomaly Detection: A SurveySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [687] arXiv:2301.11525 [pdf, other]
-
Title: Mixed Attention Network for Hyperspectral Image DenoisingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [688] arXiv:2301.11551 [pdf, other]
-
Title: Harmonizing Flows: Unsupervised MR harmonization based on normalizing flowsComments: 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [689] arXiv:2301.11553 [pdf, other]
-
Title: Robust Transformer with Locality Inductive Bias and Feature NormalizationAuthors: Omid Nejati Manzari, Hossein Kashiani, Hojat Asgarian Dehkordi, Shahriar Baradaran ShokouhiComments: 9 pages, 3 Figures, 6 TablesJournal-ref: Engineering Science and Technology, an International Journal, 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [690] arXiv:2301.11558 [pdf, other]
-
Title: Accelerating Guided Diffusion Sampling with Splitting Numerical MethodsComments: Code now available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [691] arXiv:2301.11630 [pdf, other]
-
Title: Joint Geometry and Attribute Upsampling of Point Clouds Using Frequency-Selective Models with Overlapped SupportComments: 10 pages, 10 figures, Under Review at IEEE TMM Special Issue on Point Cloud Processing and UnderstandingSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
- [692] arXiv:2301.11631 [pdf, other]
-
Title: HyperNeRFGAN: Hypernetwork approach to 3D NeRF GANSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [693] arXiv:2301.11650 [pdf, other]
-
Title: Fast Region of Interest Proposals on Maritime UAVsComments: 6+2 pages, accepted for publication at ICRA 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [694] arXiv:2301.11663 [pdf, other]
-
Title: Deep Residual Compensation Convolutional Network without BackpropagationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [695] arXiv:2301.11726 [pdf, other]
-
Title: GAN-Based Object Removal in High-Resolution Satellite ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [696] arXiv:2301.11752 [pdf, ps, other]
-
Title: Inter-View Depth Consistency Testing in Depth Difference SubspaceSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
- [697] arXiv:2301.11753 [pdf, ps, other]
-
Title: Détection d'Objets dans les documents numérisés par réseaux de neurones profondsAuthors: Mélodie BoilletComments: Ph.D Thesis, in French languageSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [698] arXiv:2301.11785 [pdf, other]
-
Title: Dual Diffusion Architecture for Fisheye Image Rectification: Synthetic-to-Real GeneralizationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [699] arXiv:2301.11790 [pdf, other]
-
Title: Leveraging the Third Dimension in Contrastive LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [700] arXiv:2301.11806 [pdf, other]
-
Title: PCV: A Point Cloud-Based Network VerifierComments: 11 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Software Engineering (cs.SE)
- [701] arXiv:2301.11880 [pdf, other]
-
Title: Optical Flow Estimation in 360$^\circ$ Videos: Dataset, Model and ApplicationComments: 20 pages, 14 figures, conference extension. arXiv admin note: substantial text overlap with arXiv:2208.03620Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [702] arXiv:2301.11915 [pdf, other]
-
Title: Understanding Self-Supervised Pretraining with Part-Aware Representation LearningAuthors: Jie Zhu, Jiyang Qi, Mingyu Ding, Xiaokang Chen, Ping Luo, Xinggang Wang, Wenyu Liu, Leye Wang, Jingdong WangComments: Accepted by TMLRSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [703] arXiv:2301.11932 [pdf, other]
-
Title: RGB Arabic Alphabets Sign Language DatasetAuthors: Muhammad Al-Barham, Adham Alsharkawi, Musa Al-Yaman, Mohammad Al-Fetyani, Ashraf Elnagar, Ahmad Abu SaAleek, Mohammad Al-OdatComments: Reference for the dataset that has inspired us to create our dataset: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [704] arXiv:2301.11986 [pdf, ps, other]
-
Title: Enhancing Face Recognition with Latent Space Data Augmentation and Facial Posture ReconstructionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [705] arXiv:2301.12025 [pdf, other]
-
Title: Cross-Architectural Positive Pairs improve the effectiveness of Self-Supervised LearningComments: 24 pages, 14 figures, Under Review. arXiv admin note: text overlap with arXiv:2206.04170Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [706] arXiv:2301.12032 [pdf, other]
-
Title: BinaryVQA: A Versatile Test Set to Evaluate the Out-of-Distribution Generalization of VQA ModelsAuthors: Ali BorjiSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [707] arXiv:2301.12046 [pdf, other]
-
Title: Semantic Adversarial Attacks on Face Recognition through Significant AttributesComments: 13 pages, 8 figures, 3 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [708] arXiv:2301.12048 [pdf, other]
- [709] arXiv:2301.12053 [pdf, other]
- [710] arXiv:2301.12057 [pdf, other]
-
Title: Object Preserving Siamese Network for Single Object Tracking on Point CloudsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [711] arXiv:2301.12058 [pdf, ps, other]
- [712] arXiv:2301.12073 [pdf, other]
-
Title: Towards Equitable Representation in Text-to-Image Synthesis Models with the Cross-Cultural Understanding Benchmark (CCUB) DatasetAuthors: Zhixuan Liu, Youeun Shin, Beverley-Claire Okogwu, Youngsik Yun, Lia Coleman, Peter Schaldenbrand, Jihie Kim, Jean OhComments: Still on going workSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [713] arXiv:2301.12077 [pdf, other]
-
Title: ALIM: Adjusting Label Importance Mechanism for Noisy Partial Label LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [714] arXiv:2301.12082 [pdf, other]
-
Title: Pushing the Limits of Fewshot Anomaly Detection in Industry Vision: GraphcoreSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [715] arXiv:2301.12093 [pdf, other]
-
Title: Local Contrast and Global Contextual Information Make Infrared Small Object Salient AgainSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [716] arXiv:2301.12135 [pdf, other]
-
Title: AdaSfM: From Coarse Global to Fine Incremental Adaptive Structure from MotionComments: accepted by ICRA 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
- [717] arXiv:2301.12141 [pdf, other]
-
Title: What Decreases Editing Capability? Domain-Specific Hybrid Refinement for Improved GAN InversionComments: Accepted by WACV 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [718] arXiv:2301.12149 [pdf, other]
-
Title: POSTER++: A simpler and stronger facial expression recognition networkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [719] arXiv:2301.12159 [pdf, ps, other]
-
Title: ClusterFuG: Clustering Fully connected Graphs by MulticutComments: ICML 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [720] arXiv:2301.12165 [pdf, other]
-
Title: Dynamic Point Cloud Geometry Compression Using Multiscale Inter Conditional CodingComments: 5 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [721] arXiv:2301.12171 [pdf, other]
-
Title: ZegOT: Zero-shot Segmentation Through Optimal Transport of Text PromptsComments: 18 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [722] arXiv:2301.12219 [pdf, other]
-
Title: Towards Accurate Acne Detection via Decoupled Sequential Detection HeadComments: 9 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [723] arXiv:2301.12247 [pdf, other]
-
Title: SEGA: Instructing Text-to-Image Models using Semantic GuidanceAuthors: Manuel Brack, Felix Friedrich, Dominik Hintersdorf, Lukas Struppek, Patrick Schramowski, Kristian KerstingComments: arXiv admin note: text overlap with arXiv:2212.06013 Proceedings of the Advances in Neural Information Processing Systems: Annual Conference on Neural Information Processing Systems (NeurIPS)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [724] arXiv:2301.12257 [pdf, other]
-
Title: Few-shot Face Image Translation via GAN Prior DistillationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [725] arXiv:2301.12276 [pdf, other]
-
Title: ProtoSeg: Interpretable Semantic Segmentation with Prototypical PartsJournal-ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023, pp. 1481-1492Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [726] arXiv:2301.12332 [pdf, other]
- [727] arXiv:2301.12352 [pdf, other]
-
Title: Maximal Cliques on Multi-Frame Proposal Graph for Unsupervised Video Object SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [728] arXiv:2301.12416 [pdf, other]
-
Title: Deep Learning for Human Parsing: A SurveySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [729] arXiv:2301.12429 [pdf, other]
-
Title: Debiased Fine-Tuning for Vision-language Models by Prompt RegularizationComments: AAAI2023 acceptedSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [730] arXiv:2301.12436 [pdf, other]
-
Title: Team VI-I2R Technical Report on EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [731] arXiv:2301.12439 [pdf, other]
-
Title: Unsupervised Domain Adaptation on Person Re-Identification via Dual-level Asymmetric Mutual LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [732] arXiv:2301.12459 [pdf, other]
-
Title: The Influences of Color and Shape Features in Visual Contrastive LearningAuthors: Xiaoqi ZhuangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [733] arXiv:2301.12470 [pdf, ps, other]
-
Title: Gesture Control of Micro-drone: A Lightweight-Net with Domain Randomization and Trajectory GeneratorsAuthors: Isaac Osei Agyemang, Isaac Adjei Mensah, Sophyani Banaamwini Yussif, Fiasam Linda Delali, Bernard Cobinnah Mawuli, Bless Lord Y. Agbley, Collins Sey, Joshua BerkohdSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [734] arXiv:2301.12511 [pdf, other]
-
Title: Fast-BEV: A Fast and Strong Bird's-Eye View Perception BaselineAuthors: Yangguang Li, Bin Huang, Zeren Chen, Yufeng Cui, Feng Liang, Mingzhu Shen, Fenggang Liu, Enze Xie, Lu Sheng, Wanli Ouyang, Jing ShaoComments: submitted to TPAMI. arXiv admin note: substantial text overlap with arXiv:2301.07870Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [735] arXiv:2301.12515 [pdf, other]
-
Title: LiDAR-CS Dataset: LiDAR Point Cloud Dataset with Cross-Sensors for 3D Object DetectionAuthors: Jin Fang, Dingfu Zhou, Jingjing Zhao, Chenming Wu, Chulin Tang, Cheng-Zhong Xu, Liangjun ZhangComments: 8 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [736] arXiv:2301.12519 [src]
-
Title: 3D Object Detection in LiDAR Point Clouds using Graph Neural NetworksComments: Errors in the results section. Experiments are carried out to rectify the resultsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [737] arXiv:2301.12527 [pdf, other]
-
Title: Diverse, Difficult, and Odd Instances (D2O): A New Test Set for Object ClassificationAuthors: Ali BorjiSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [738] arXiv:2301.12541 [pdf, ps, other]
-
Title: Supervised and Contrastive Self-Supervised In-Domain Representation Learning for Dense Prediction Problems in Remote SensingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [739] arXiv:2301.12589 [pdf, other]
-
Title: Confidence-Aware Calibration and Scoring Functions for Curriculum LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [740] arXiv:2301.12597 [pdf, other]
-
Title: BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [741] arXiv:2301.12613 [pdf, other]
-
Title: AudioEar: Single-View Ear Reconstruction for Personalized Spatial AudioComments: Accepted by Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023)Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [742] arXiv:2301.12637 [pdf, other]
-
Title: Lateralized Learning for Multi-Class Visual Classification TasksComments: 13 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [743] arXiv:2301.12643 [pdf, other]
-
Title: Adversarial Style Augmentation for Domain GeneralizationComments: Initially finished in March 2022; Code will be available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [744] arXiv:2301.12644 [pdf, other]
-
Title: Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text RetrievalComments: Accepted to AAAI 2023 (Oral)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [745] arXiv:2301.12682 [pdf, other]
-
Title: Image Contrast Enhancement using Fuzzy Technique with Parameter Determination using MetaheuristicsComments: 14 pages, 7 figures, Image Processing, Computer Vision, Evolutionary ComputationSubjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [746] arXiv:2301.12689 [pdf, other]
-
Title: Edge-guided Multi-domain RGB-to-TIR image Translation for Training Vision Tasks with Challenging LabelsComments: Accepted Contributed Paper to 2023 IEEE International Conference on Robotics and Automation (ICRA)Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [747] arXiv:2301.12698 [pdf, other]
-
Title: Robust Meta Learning for Image based tasksComments: IEEE International Conference on Robotics and Automation SRL workshop 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [748] arXiv:2301.12739 [pdf, other]
- [749] arXiv:2301.12744 [pdf, other]
-
Title: PointSmile: Point Self-supervised Learning via Curriculum Mutual InformationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [750] arXiv:2301.12796 [pdf, other]
-
Title: Rendering the Directional TSDF for Tracking and Multi-Sensor Registration with Point-To-Plane Scale ICPComments: Published in Robotics and Autonomous Systems, 2023. arXiv admin note: substantial text overlap with arXiv:2108.08115Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [751] arXiv:2301.12798 [pdf, other]
-
Title: Reliable Federated Disentangling Network for Non-IID Domain FeatureAuthors: Meng Wang, Kai Yu, Chun-Mei Feng, Yiming Qian, Ke Zou, Lianyu Wang, Rick Siow Mong Goh, Yong Liu, Huazhu FuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [752] arXiv:2301.12799 [pdf, ps, other]
-
Title: Eye Image-based Algorithms to Estimate Percentage Closure of Eye and Saccadic Ratio for Alertness DetectionAuthors: Supratim GuptaSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [753] arXiv:2301.12827 [pdf, other]
-
Title: YOLO-based Object Detection in Industry 4.0 Fischertechnik Model EnvironmentAuthors: Slavomira Schneidereit, Ashkan Mansouri Yarahmadi, Toni Schneidereit, Michael Breuß, Marc GebauerSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [754] arXiv:2301.12891 [pdf, ps, other]
-
Title: Half of an image is enough for quality assessmentJournal-ref: IEEE Int. Conference on Image Processing, 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [755] arXiv:2301.12914 [pdf, other]
-
Title: PromptMix: Text-to-image diffusion models enhance the performance of lightweight networksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [756] arXiv:2301.12943 [pdf, other]
-
Title: Factors that affect Camera based Self-Monitoring of Vitals in the WildAuthors: Nikhil S. Narayan, Shashanka B. R., Rohit Damodaran, Dr. Chandrashekhar Jayaram, Dr. M. A. Kareem, Dr. Mamta P., Dr. Saravanan K. R., Dr. Monu Krishnan, Dr. Raja IndanaComments: 10 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [757] arXiv:2301.12959 [pdf, other]
-
Title: GALIP: Generative Adversarial CLIPs for Text-to-Image SynthesisComments: 11 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [758] arXiv:2301.12972 [pdf, other]
-
Title: Human Vision Based 3D Point Cloud Semantic Segmentation of Large-Scale Outdoor SceneSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [759] arXiv:2301.12993 [pdf, other]
-
Title: Benchmarking Robustness to Adversarial Image ObfuscationsAuthors: Florian Stimberg, Ayan Chakrabarti, Chun-Ta Lu, Hussein Hazimeh, Otilia Stretcu, Wei Qiao, Yintao Liu, Merve Kaya, Cyrus Rashtchian, Ariel Fuxman, Mehmet Tek, Sven GowalSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [760] arXiv:2301.13007 [pdf, other]
-
Title: EuclidNet: Deep Visual Reasoning for Constructible Problems in GeometryComments: Accepted by 2nd MATH-AI Workshop at NeurIPS'22Journal-ref: Adv. Artif. Intell. Mach. Learn.(2023), 3(1):839-852Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [761] arXiv:2301.13012 [pdf, other]
-
Title: Key Feature Replacement of In-Distribution Samples for Out-of-Distribution DetectionComments: Accepted to the 37th AAAI Conference on Artificial Intelligence (AAAI 2023) Main TrackSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [762] arXiv:2301.13013 [pdf, other]
-
Title: RFPose-OT: RF-Based 3D Human Pose Estimation via Optimal Transport TheorySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [763] arXiv:2301.13014 [pdf, other]
-
Title: Attribute-Guided Multi-Level Attention Network for Fine-Grained Fashion RetrievalComments: arXiv admin note: substantial text overlap with arXiv:2002.02814, arXiv:2104.02429 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [764] arXiv:2301.13081 [pdf, other]
-
Title: STAIR: Learning Sparse Text and Image Representation in Grounded TokensAuthors: Chen Chen, Bowen Zhang, Liangliang Cao, Jiguang Shen, Tom Gunter, Albin Madappally Jose, Alexander Toshev, Jonathon Shlens, Ruoming Pang, Yinfei YangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [765] arXiv:2301.13082 [pdf, other]
-
Title: PaCaNet: A Study on CycleGAN with Transfer Learning for Diversifying Fused Chinese Painting and CalligraphyAuthors: Zuhao Yang, Huajun Bai, Zhang Luo, Yang Xu, Wei Pang, Yue Wang, Yisheng Yuan, Yingfang YuanSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [766] arXiv:2301.13090 [pdf, other]
-
Title: Action Capsules: Human Skeleton Action RecognitionComments: 11 pages, 11 figuresJournal-ref: Computer Vision and Image Understanding Volume 233, August 2023, 103722Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [767] arXiv:2301.13096 [pdf, other]
-
Title: Language-Driven Anchors for Zero-Shot Adversarial RobustnessComments: Accepted by CVPR 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [768] arXiv:2301.13104 [pdf, other]
-
Title: Equivariant Differentially Private Deep Learning: Why DP-SGD Needs Sparser ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [769] arXiv:2301.13141 [pdf, other]
-
Title: Consistency Regularisation in Varying Contexts and Feature Perturbations for Semi-Supervised Semantic Segmentation of Histology ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [770] arXiv:2301.13155 [pdf, other]
-
Title: Advancing Radiograph Representation Learning with Masked Record ModelingComments: Camera ready at ICLR 2023. Code and models are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [771] arXiv:2301.13156 [pdf, other]
-
Title: SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic SegmentationComments: ICLR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [772] arXiv:2301.13173 [pdf, other]
-
Title: Shape-aware Text-driven Layered Video EditingComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [773] arXiv:2301.13186 [pdf, ps, other]
-
Title: Accurate Gaze Estimation using an Active-gaze Morphable ModelSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [774] arXiv:2301.13190 [pdf, other]
-
Title: Audio-Visual Segmentation with SemanticsAuthors: Jinxing Zhou, Xuyang Shen, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Lingpeng Kong, Meng Wang, Yiran ZhongComments: Submitted to TPAMI as a journal extension of ECCV 2022. Jinxing Zhou, Xuyang Shen, and Jianyuan Wang contribute equally to this work. Meng Wang and Yiran Zhong are the corresponding authors. Code is available at this https URL. Online benchmark is available at this http URL. arXiv admin note: substantial text overlap with arXiv:2207.05042Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [775] arXiv:2301.13254 [pdf, other]
-
Title: Deep Monocular Hazard Detection for Safe Small Body LandingComments: Presented at the AAS/AIAA Space Flight Mechanics Meeting, January 14-19, 2023, Austin, TX, USASubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
- [776] arXiv:2301.13319 [pdf, other]
-
Title: ParticleSeg3D: A Scalable Out-of-the-Box Deep Learning Segmentation Solution for Individual Particle Characterization from Micro CT Images in Mineral Processing and RecyclingAuthors: Karol Gotkowski, Shuvam Gupta, Jose R. A. Godinho, Camila G. S. Tochtrop, Klaus H. Maier-Hein, Fabian IsenseeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [777] arXiv:2301.13335 [pdf, other]
-
Title: Multi-modal Large Language Model Enhanced Pseudo 3D Perception Framework for Visual Commonsense ReasoningSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [778] arXiv:2301.13337 [src]
-
Title: DAFD: Domain Adaptation via Feature Disentanglement for Image ClassificationComments: Update the experimental resultsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [779] arXiv:2301.13356 [pdf, other]
-
Title: Inference Time Evidences of Adversarial Attacks for Forensic on TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [780] arXiv:2301.13358 [pdf, other]
-
Title: Hierarchical Disentangled Representation for Invertible Image Denoising and BeyondComments: Technical ReportSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [781] arXiv:2301.13359 [pdf, other]
-
Title: IM-IAD: Industrial Image Anomaly Detection Benchmark in ManufacturingAuthors: Guoyang Xie, Jinbao Wang, Jiaqi Liu, Jiayi Lyu, Yong Liu, Chengjie Wang, Feng Zheng, Yaochu JinSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [782] arXiv:2301.13360 [pdf, other]
-
Title: Skeleton-based Human Action Recognition via Convolutional Neural Networks (CNN)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [783] arXiv:2301.13361 [pdf, other]
-
Title: Iterative Loop Method Combining Active and Semi-Supervised Learning for Domain Adaptive Semantic SegmentationComments: 10 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [784] arXiv:2301.13384 [pdf, other]
-
Title: GaitSADA: Self-Aligned Domain Adaptation for mmWave Gait RecognitionAuthors: Ekkasit Pinyoanuntapong, Ayman Ali, Kalvik Jakkala, Pu Wang, Minwoo Lee, Qucheng Peng, Chen Chen, Zhi SunComments: Submitted to IEEE MASS 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [785] arXiv:2301.13385 [pdf, ps, other]
-
Title: Fisheye traffic data set of point center markersAuthors: Chung-I Huang, Wei-Yu Chen, Wei Jan Ko, Jih-Sheng Chang, Chen-Kai Sun, Hui Hung Yu, Fang-Pang LinComments: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [786] arXiv:2301.13402 [pdf, other]
-
Title: ReGANIE: Rectifying GAN Inversion Errors for Accurate Real Image EditingSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [787] arXiv:2301.13403 [pdf, other]
-
Title: A Modular Multi-stage Lightweight Graph Transformer Network for Human Pose and Shape Estimation from 2D Human PoseSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [788] arXiv:2301.13411 [pdf, other]
-
Title: Few-Shot Object Detection via Variational Feature AggregationComments: Accepted by AAAI2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [789] arXiv:2301.13416 [pdf, other]
-
Title: Structure Flow-Guided Network for Real Depth Super-ResolutionComments: Accepted by AAAI-2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [790] arXiv:2301.13418 [pdf, other]
-
Title: BRAIxDet: Learning to Detect Malignant Breast Lesion with Incomplete AnnotationsAuthors: Yuanhong Chen, Yuyuan Liu, Chong Wang, Michael Elliott, Chun Fung Kwok, Carlos Pena-Solorzano, Yu Tian, Fengbei Liu, Helen Frazer, Davis J. McCarthy, Gustavo CarneiroComments: Under ReviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [791] arXiv:2301.13419 [pdf, other]
-
Title: Recurrent Structure Attention Guidance for Depth Super-ResolutionComments: Accepted by AAAI-2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [792] arXiv:2301.13422 [pdf, ps, other]
-
Title: Anomaly Segmentation for High-Resolution Remote Sensing Images Based on Pixel DescriptorsComments: Accepted in AAAI2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [793] arXiv:2301.13428 [pdf, other]
-
Title: Contrast and Clustering: Learning Neighborhood Pair Representation for Source-free Domain AdaptationComments: Journal articlesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [794] arXiv:2301.13430 [pdf, other]
-
Title: GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face SynthesisComments: Accepted by ICLR2023. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [795] arXiv:2301.13444 [pdf, other]
-
Title: Rethinking Soft Label in Label Distribution Learning PerspectiveComments: 11 pages main manuscript + references and 11 pages supplementary materialsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [796] arXiv:2301.13445 [pdf, other]
-
Title: A Survey of Explainable AI in Deep Visual Modeling: Methods and MetricsAuthors: Naveed AkhtarComments: Short accessible survey (9pgs)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [797] arXiv:2301.13459 [pdf, other]
-
Title: Learning Generalized Hybrid Proximity Representation for Image RecognitionComments: The paper has been accepted by the IEEE ICTAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Probability (math.PR); Machine Learning (stat.ML)
- [798] arXiv:2301.13473 [pdf, other]
-
Title: CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [799] arXiv:2301.13487 [pdf, other]
-
Title: Adversarial Training of Self-supervised Monocular Depth Estimation against Physical-World AttacksComments: Initially accepted at ICLR2023 (Spotlight)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [800] arXiv:2301.13504 [pdf, ps, other]
-
Title: Transfer Learning and Class Decomposition for Detecting the Cognitive Decline of Alzheimer DiseaseComments: 12 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [801] arXiv:2301.13510 [pdf, other]
-
Title: 3D Former: Monocular Scene Reconstruction with 3D SDF TransformersComments: Accepted to ICLR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [802] arXiv:2301.13514 [pdf, other]
-
Title: Fourier Sensitivity and Regularization of Computer Vision ModelsComments: Published in TMLR, this https URLJournal-ref: TMLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [803] arXiv:2301.13538 [pdf, other]
-
Title: AMD: Adaptive Masked Distillation for Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [804] arXiv:2301.13549 [pdf, other]
-
Title: Review of methods for automatic cerebral microbleeds detectionAuthors: Maria Ferlin, Zuzanna Klawikowska, Michał Grochowski, Małgorzata Grzywińska, Edyta SzurowskaComments: 32 pages, 6 figures, 3 tables, 174 referencesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
- [805] arXiv:2301.13554 [pdf, other]
-
Title: NoiseTransfer: Image Noise Generation with Contrastive EmbeddingsComments: ACCV 2022 oralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [806] arXiv:2301.13558 [pdf, other]
-
Title: Lidar Upsampling with Sliced Wasserstein DistanceJournal-ref: in IEEE Robotics and Automation Letters, vol. 8, no. 1, pp. 392-399, Jan. 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [807] arXiv:2301.13569 [pdf, other]
-
Title: NP-Match: Towards a New Probabilistic Model for Semi-Supervised LearningComments: An extended version of our previous ICML 2022 paper arXiv:2207.01066 with more experimentsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [808] arXiv:2301.13591 [src]
-
Title: Zero3D: Semantic-Driven Multi-Category 3D Shape GenerationComments: work in progressSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [809] arXiv:2301.13592 [pdf, other]
-
Title: Priors are Powerful: Improving a Transformer for Multi-camera 3D Detection with 2D PriorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [810] arXiv:2301.13606 [pdf, other]
-
Title: Multi-video Moment Ranking with Multimodal ClueComments: 9 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [811] arXiv:2301.13656 [pdf, other]
-
Title: A Survey and Benchmark of Automatic Surface Reconstruction from Point CloudsComments: 20 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
- [812] arXiv:2301.13659 [pdf, other]
-
Title: Spyker: High-performance Library for Spiking Deep Neural NetworksComments: 11 pages, 6 figures, 6 listingsSubjects: Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
- [813] arXiv:2301.13670 [pdf, other]
-
Title: What Makes Good Examples for Visual In-Context Learning?Comments: Code and models: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [814] arXiv:2301.13721 [pdf, other]
-
Title: DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic ModelsComments: Accepted by NeurIPS 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [815] arXiv:2301.13741 [pdf, other]
-
Title: UPop: Unified and Progressive Pruning for Compressing Vision-Language TransformersComments: ICML 2023. Website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [816] arXiv:2301.13743 [pdf, other]
-
Title: Zero-shot-Learning Cross-Modality Data Translation Through Mutual Information Guided Stochastic DiffusionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [817] arXiv:2301.13803 [pdf, other]
-
Title: Fairness-aware Vision Transformer via Debiased Self-AttentionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [818] arXiv:2301.13817 [pdf, other]
-
Title: Patch Gradient Descent: Training Neural Networks on Very Large ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [819] arXiv:2301.13826 [pdf, other]
-
Title: Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion ModelsComments: Accepted to SIGGRAPH 2023; Project page available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Graphics (cs.GR); Machine Learning (cs.LG)
- [820] arXiv:2301.13865 [pdf, other]
-
Title: From Semi-supervised to Omni-supervised Room Layout Estimation Using Point CloudsAuthors: Huan-ang Gao, Beiwen Tian, Pengfei Li, Xiaoxue Chen, Hao Zhao, Guyue Zhou, Yurong Chen, Hongbin ZhaComments: Accepted to ICRA2023. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [821] arXiv:2301.00142 (cross-list from cs.HC) [pdf, other]
-
Title: Computational Charisma -- A Brick by Brick Blueprint for Building Charismatic Artificial IntelligenceAuthors: Björn W. Schuller, Shahin Amiriparian, Anton Batliner, Alexander Gebhard, Maurice Gerzcuk, Vincent Karas, Alexander Kathan, Lennart Seizer, Johanna LöchnerSubjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [822] arXiv:2301.00243 (cross-list from cs.LG) [pdf, other]
-
Title: Approaching Peak Ground TruthAuthors: Florian Kofler, Johannes Wahle, Ivan Ezhov, Sophia Wagner, Rami Al-Maskari, Emilia Gryska, Mihail Todorov, Christina Bukas, Felix Meissen, Tingying Peng, Ali Ertürk, Daniel Rueckert, Rolf Heckemann, Jan Kirschke, Claus Zimmer, Benedikt Wiestler, Bjoern Menze, Marie PiraudComments: 7 pages, 2 figures (minor corrections to text, affiliations and layout)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [823] arXiv:2301.00254 (cross-list from cs.MM) [pdf, other]
-
Title: Depression Diagnosis and Analysis via Multimodal Multi-order Factor FusionSubjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [824] arXiv:2301.00314 (cross-list from cs.LG) [pdf, other]
-
Title: Causal Deep Learning: Causal Capsules and Tensor TransformersAuthors: M. Alex O. VasilescuComments: The document contains both the article and the supplemental materialJournal-ref: Proceedings of the 26th International Conference on Pattern Recognition (ICPR 2022) Montreal, Canada, Aug. 21-25, 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [825] arXiv:2301.00364 (cross-list from cs.LG) [pdf, other]
-
Title: Generalizable Black-Box Adversarial Attack with Meta LearningComments: T-PAMI 2022. Project Page is at this https URLSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [826] arXiv:2301.00383 (cross-list from cs.LG) [pdf, other]
-
Title: Discriminative Radial Domain AdaptationComments: 13 pages, 14 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [827] arXiv:2301.00433 (cross-list from cs.AI) [pdf, other]
-
Title: Optimization of Image Transmission in a Cooperative Semantic Communication NetworksComments: 29 pages, 10 figuresSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
- [828] arXiv:2301.00452 (cross-list from cs.RO) [pdf, other]
-
Title: Human-in-the-loop Embodied Intelligence with Interactive Simulation Environment for Surgical Robot LearningSubjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [829] arXiv:2301.00545 (cross-list from cs.LG) [pdf, other]
-
Title: Knockoffs-SPR: Clean Sample Selection in Learning with Noisy LabelsComments: update: final version, to appear in TPAMISubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [830] arXiv:2301.00750 (cross-list from cs.GR) [pdf, other]
-
Title: Interactive Control over Temporal Consistency while Stylizing Video StreamsSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [831] arXiv:2301.00752 (cross-list from cs.NI) [pdf, other]
-
Title: Point Cloud-based Proactive Link Quality Prediction for Millimeter-wave CommunicationsJournal-ref: IEEE Transactions on Machine Learning in Communications and Networking, vol. 1, pp. 258-276, 2023Subjects: Networking and Internet Architecture (cs.NI); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [832] arXiv:2301.00897 (cross-list from cs.NE) [pdf, other]
-
Title: Game of Intelligent LifeSubjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [833] arXiv:2301.01087 (cross-list from cs.GR) [pdf, other]
-
Title: Neural Point Catacaustics for Novel-View Synthesis of ReflectionsComments: SIGGRAPH Asia 2022 (ToG) this https URLJournal-ref: ACM Transactions on Graphics, Vol. 41, No. 6, Article 201 (2022)Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [834] arXiv:2301.01088 (cross-list from cs.LG) [pdf, other]
-
Title: Explaining Imitation Learning through FramesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [835] arXiv:2301.01134 (cross-list from cs.MM) [pdf, other]
-
Title: Ring That Bell: A Corpus and Method for Multimodal Metaphor Detection in VideosComments: Figlang 2022Subjects: Multimedia (cs.MM); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [836] arXiv:2301.01218 (cross-list from cs.CR) [pdf, other]
-
Title: Tracing the Origin of Adversarial Attack for Forensic Investigation and DeterrenceSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [837] arXiv:2301.01224 (cross-list from cs.SE) [pdf, other]
-
Title: An Empirical Investigation into the Use of Image Captioning for Automated Software DocumentationAuthors: Kevin Moran, Ali Yachnes, George Purnell, Junayed Mahmud, Michele Tufano, Carlos Bernal-Cárdenas, Denys Poshyvanyk, Zach H'DoublerComments: Published in the Proceedings of the 29th IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER'22), Honolulu, Hawaii, March 15-18, 2022, pp. 514-525Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [838] arXiv:2301.01350 (cross-list from cs.RO) [pdf, other]
-
Title: LunarNav: Crater-based Localization for Long-range Autonomous Lunar Rover NavigationAuthors: Shreyansh Daftry, Zhanlin Chen, Yang Cheng, Scott Tepsuporn, Brian Coltin, Ussama Naam, Lanssie Mingyue Ma, Shehryar Khattak, Matthew Deans, Larry MatthiesComments: IEEE Aerospace Conference 2023. arXiv admin note: text overlap with arXiv:2203.10073Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [839] arXiv:2301.01352 (cross-list from cs.LG) [pdf, other]
-
Title: WLD-Reg: A Data-dependent Within-layer Diversity RegularizerComments: accepted at AAAI 2023. arXiv admin note: substantial text overlap with arXiv:2106.06012Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [840] arXiv:2301.01424 (cross-list from cs.GR) [pdf, other]
-
Title: Scene Synthesis from Human MotionComments: 9 pages, 8 figures. Published in SIGGRAPH Asia 2022. Sifan Ye and Yixing Wang share equal contribution. Huazhe Xu and Jiajun Wu share equal contributionSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [841] arXiv:2301.01454 (cross-list from cs.AR) [pdf, other]
-
Title: Accurate, Low-latency, Efficient SAR Automatic Target Recognition on FPGASubjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [842] arXiv:2301.01495 (cross-list from cs.LG) [pdf, other]
-
Title: Beckman DefenseAuthors: A. V. SubramanyamSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [843] arXiv:2301.01520 (cross-list from cs.LG) [pdf, other]
-
Title: Towards Explainable Land Cover Mapping: a Counterfactual-based StrategySubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [844] arXiv:2301.01569 (cross-list from cs.LG) [pdf, other]
-
Title: Learning Decorrelated Representations Efficiently Using Fast Fourier TransformComments: Accepted for CVPR 2023Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [845] arXiv:2301.01758 (cross-list from cs.DC) [pdf, other]
-
Title: An Ensemble Mobile-Cloud Computing Method for Affordable and Accurate Glucometer ReadoutComments: 12 pages, 12 figures, 8 tablesSubjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computer Vision and Pattern Recognition (cs.CV)
- [846] arXiv:2301.01805 (cross-list from cs.LG) [pdf, other]
-
Title: Unsupervised Manifold Linearizing and ClusteringSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [847] arXiv:2301.01947 (cross-list from cs.LG) [pdf, ps, other]
-
Title: StitchNet: Composing Neural Networks from Pre-Trained FragmentsSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [848] arXiv:2301.01949 (cross-list from cs.CL) [pdf, other]
-
Title: SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout GraphAuthors: Yuxing Long, Binyuan Hui, Fulong Ye, Yanyang Li, Zhuoxin Han, Caixia Yuan, Yongbin Li, Xiaojie WangComments: AAAI 2023Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
- [849] arXiv:2301.02051 (cross-list from cs.RO) [pdf, other]
-
Title: A Distance-Geometric Method for Recovering Robot Joint Angles From an RGB ImageComments: IFAC 2023Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [850] arXiv:2301.02211 (cross-list from cs.CY) [pdf, other]
-
Title: Teaching Computer Vision for EcologyAuthors: Elijah Cole, Suzanne Stathatos, Björn Lütjens, Tarun Sharma, Justin Kay, Jason Parham, Benjamin Kellenberger, Sara BeerySubjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV)
- [851] arXiv:2301.02363 (cross-list from cs.MM) [pdf, other]
-
Title: Text2Poster: Laying out Stylized Texts on Retrieved ImagesComments: 5 pages, Accepted to ICASSP 2022Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
- [852] arXiv:2301.02464 (cross-list from cs.LG) [pdf, other]
-
Title: Architect, Regularize and Replay (ARR): a Flexible Hybrid Approach for Continual LearningComments: Book Chapter Preprint: 15 pages, 7 figures, 2 tables. arXiv admin note: text overlap with arXiv:1912.01100Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [853] arXiv:2301.02615 (cross-list from cs.CR) [pdf, other]
-
Title: Silent Killer: A Stealthy, Clean-Label, Black-Box Backdoor AttackSubjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [854] arXiv:2301.02761 (cross-list from cs.LG) [pdf, other]
-
Title: Active Learning Guided by Efficient Surrogate LearnersSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [855] arXiv:2301.02903 (cross-list from cs.LG) [pdf, other]
-
Title: Transferring Pre-trained Multimodal Representations with Cross-modal Similarity MatchingComments: 20 pages, 10 figures, NeurIPS 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [856] arXiv:2301.03064 (cross-list from cs.CR) [pdf, other]
-
Title: Deepfake CAPTCHA: A Method for Preventing Fake CallsSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [857] arXiv:2301.03127 (cross-list from cs.CL) [pdf, other]
-
Title: Logically at Factify 2: A Multi-Modal Fact Checking System Based on Evidence Retrieval techniques and Transformer Encoder ArchitectureComments: Accepted in AAAI'23: Second Workshop on Multimodal Fact-Checking and Hate Speech Detection, February 2023, Washington, DC, USASubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [858] arXiv:2301.03167 (cross-list from cs.CG) [pdf, ps, other]
-
Title: Machining feature recognition using descriptors with range constraints for mechanical 3D modelsSubjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
- [859] arXiv:2301.03344 (cross-list from cs.CL) [pdf, other]
-
Title: Universal Multimodal Representation for Language UnderstandingComments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [860] arXiv:2301.03439 (cross-list from eess.SY) [pdf, other]
-
Title: Generalized adaptive smoothing based neural network architecture for traffic state estimationJournal-ref: 2023 The 22nd World Congress of the International Federation of Automatic Control (IFAC). 56(2):3483-3490Subjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [861] arXiv:2301.03553 (cross-list from cs.SE) [pdf, other]
-
Title: FedDebug: Systematic Debugging for Federated Learning ApplicationsComments: Published at ICSE 2023. Link this https URLJournal-ref: In 2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE) (pp. 456-789). IEEE (2023)Subjects: Software Engineering (cs.SE); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
- [862] arXiv:2301.03573 (cross-list from cs.LG) [pdf, other]
-
Title: Balance is Essence: Accelerating Sparse Training via Adaptive Gradient CorrectionSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [863] arXiv:2301.03730 (cross-list from cs.LG) [pdf, other]
-
Title: Learning to Perceive in Deep Model-Free Reinforcement LearningComments: 8 pages; 7 figures; fixed author name; added link for codeSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [864] arXiv:2301.03829 (cross-list from cs.LG) [pdf, other]
-
Title: From Plate to Prevention: A Dietary Nutrient-aided Platform for Health Promotion in SingaporeAuthors: Kaiping Zheng, Thao Nguyen, Jesslyn Hwei Sing Chong, Charlene Enhui Goh, Melanie Herschel, Hee Hoon Lee, Changshuo Liu, Beng Chin Ooi, Wei Wang, James YipSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Multimedia (cs.MM)
- [865] arXiv:2301.03844 (cross-list from cs.LG) [pdf, other]
-
Title: Look Beyond Bias with Entropic Adversarial Data AugmentationAuthors: Thomas Duboudin (imagine), Emmanuel Dellandréa, Corentin Abgrall, Gilles Hénaff, Liming ChenJournal-ref: International Conference on Pattern Recognition 2022, Aug 2022, Montréal, CanadaSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [866] arXiv:2301.03867 (cross-list from cs.RO) [pdf, other]
-
Title: Sentiment-based Engagement Strategies for intuitive Human-Robot InteractionComments: Camera ready version - 18th International Conference on Computer Vision Theory and Applications (VISAPP 2023)Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [867] arXiv:2301.03926 (cross-list from cs.HC) [pdf, other]
-
Title: Video Surveillance System Incorporating Expert Decision-making Process: A Case Study on Detecting Calving Signs in CattleSubjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [868] arXiv:2301.03947 (cross-list from cs.RO) [pdf, other]
-
Title: Autonomous Strawberry Picking Robotic System (Robofruit)Comments: To appear in the Journal of Field Robotics (Accepted) Please watch the video at this https URLSubjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [869] arXiv:2301.04261 (cross-list from cs.LG) [pdf, other]
-
Title: Towards Microstructural State Variables in Materials SystemsSubjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci); Computer Vision and Pattern Recognition (cs.CV)
- [870] arXiv:2301.04272 (cross-list from cs.LG) [pdf, other]
-
Title: Data Distillation: A SurveyComments: Accepted at TMLR '23. 21 pages, 4 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
- [871] arXiv:2301.04371 (cross-list from cs.GR) [pdf, other]
-
Title: Recognising geometric primitives in 3D point clouds of mechanical CAD objectsJournal-ref: Computer-Aided Design 157 (2023) 103479Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [872] arXiv:2301.04402 (cross-list from cs.CR) [pdf, ps, other]
-
Title: Secure access system using signature verification over tablet PCAuthors: Fernando Alonso-Fernandez, Julian Fierrez-Aguilar, Javier Ortega-Garcia, Joaquin Gonzalez-RodriguezComments: Published at IEEE Aerospace and Electronic Systems MagazineSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [873] arXiv:2301.04421 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Failure Detection for Motion Prediction of Autonomous Driving: An Uncertainty PerspectiveComments: Accepted by 2023 IEEE International Conference on Robotics and Automation (ICRA 2023)Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [874] arXiv:2301.04451 (cross-list from cs.LG) [pdf, other]
-
Title: Heterogeneous Tri-stream Clustering NetworkComments: To appear in Neural Processing LettersSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [875] arXiv:2301.04506 (cross-list from cs.LG) [pdf, other]
-
Title: A Distinct Unsupervised Reference Model From The Environment Helps Continual LearningAuthors: Seyyed AmirHossein Ameli Kalkhoran, Mohammadamin Banayeeanzade, Mahdi Samiei, Mahdieh Soleymani BaghshahComments: 22 pages, 3 figures, and 18 tablesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [876] arXiv:2301.04584 (cross-list from cs.LG) [pdf, other]
-
Title: Continual Few-Shot Learning Using HyperTransformersSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [877] arXiv:2301.04630 (cross-list from cs.RO) [pdf, other]
-
Title: ShadowNav: Crater-Based Localization for Nighttime and Permanently Shadowed Region Lunar NavigationAuthors: Abhishek Cauligi, R. Michael Swan, Masahiro Ono, Shreyansh Daftry, John Elliott, Larry Matthies, Deegan AthaComments: IEEE Aerospace Conference 2023Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [878] arXiv:2301.04746 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Switchable Lightweight Anti-symmetric Processing (SLAP) with CNN Outspeeds Data Augmentation by Smaller Sample -- Application in Gomoku Reinforcement LearningComments: In Berndt Müller (Ed.), Proceedings of AISB Convention 2023 (pp. 69-75). AISBSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [879] arXiv:2301.04802 (cross-list from cs.LG) [pdf, other]
-
Title: Diffusion-based Data Augmentation for Skin Disease Classification: Impact Across Original Medical Datasets to Fully Synthetic ImagesAuthors: Mohamed Akrout, Bálint Gyepesi, Péter Holló, Adrienn Poór, Blága Kincső, Stephen Solis, Katrina Cirone, Jeremy Kawahara, Dekker Slade, Latif Abid, Máté Kovács, István FazekasSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [880] arXiv:2301.04875 (cross-list from cs.CR) [pdf, other]
-
Title: Color-NeuraCrypt: Privacy-Preserving Color-Image Classification Using Extended Random Neural NetworksSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [881] arXiv:2301.04883 (cross-list from cs.CL) [pdf, other]
-
Title: SlideVQA: A Dataset for Document Visual Question Answering on Multiple ImagesComments: Accepted by AAAI2023Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [882] arXiv:2301.04954 (cross-list from cs.LG) [pdf, other]
-
Title: Reaching the Edge of the Edge: Image Analysis in SpaceSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
- [883] arXiv:2301.05028 (cross-list from cs.RO) [pdf, other]
-
Title: MotorFactory: A Blender Add-on for Large Dataset Generation of Small Electric MotorsAuthors: Chengzhi Wu, Kanran Zhou, Jan-Philipp Kaiser, Norbert Mitschke, Jan-Felix Klein, Julius Pfrommer, Jürgen Beyerer, Gisela Lanza, Michael Heizmann, Kai FurmansSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [884] arXiv:2301.05032 (cross-list from cs.LG) [pdf, other]
-
Title: Online Hyperparameter Optimization for Class-Incremental LearningComments: AAAI 2023 Oral. Code is available at this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [885] arXiv:2301.05058 (cross-list from cs.NE) [pdf, other]
-
Title: Sparse Coding in a Dual Memory System for Lifelong LearningComments: Camera ready version - "Thirty-Seventh AAAI Conference on Artificial Intelligence" (AAAI-2023)Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [886] arXiv:2301.05106 (cross-list from cs.LG) [pdf, other]
-
Title: Forgetful Active Learning with Switch Events: Efficient Sampling for Out-of-Distribution DataSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [887] arXiv:2301.05169 (cross-list from cs.LG) [pdf, other]
-
Title: Causal Triplet: An Open Challenge for Intervention-centric Causal Representation LearningAuthors: Yuejiang Liu, Alexandre Alahi, Chris Russell, Max Horn, Dominik Zietlow, Bernhard Schölkopf, Francesco LocatelloComments: Conference on Causal Learning and Reasoning (CLeaR) 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [888] arXiv:2301.05174 (cross-list from cs.IR) [pdf, other]
-
Title: Scene-centric vs. Object-centric Image-Text Cross-modal Retrieval: A Reproducibility StudyComments: 18 pages, accepted as a reproducibility paper at ECIR 2023Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
- [889] arXiv:2301.05180 (cross-list from cs.LG) [pdf, other]
- [890] arXiv:2301.05206 (cross-list from cs.RO) [pdf, other]
-
Title: ImMesh: An Immediate LiDAR Localization and Meshing FrameworkAuthors: Jiarong Lin, Chongjiang Yuan, Yixi Cai, Haotian Li, Yunfan Ren, Yuying Zou, Xiaoping Hong, Fu ZhangSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [891] arXiv:2301.05339 (cross-list from cs.GR) [pdf, other]
-
Title: A Comprehensive Review of Data-Driven Co-Speech Gesture GenerationComments: Accepted for EUROGRAPHICS 2023Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [892] arXiv:2301.05345 (cross-list from cs.AI) [pdf, other]
-
Title: GOHSP: A Unified Framework of Graph and Optimization-based Heterogeneous Structured Pruning for Vision TransformerComments: This manuscript was accepted to AAAI 2023 Main TrackSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [893] arXiv:2301.05506 (cross-list from cs.CR) [pdf, other]
-
Title: On the feasibility of attacking Thai LPR systems with adversarial examplesSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [894] arXiv:2301.05609 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Co-manipulation of soft-materials estimating deformation from depth imagesComments: Pre-print, Accepted to Robotics and Computer Integrated ManufacturingSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [895] arXiv:2301.05908 (cross-list from cs.SD) [pdf, other]
-
Title: An Order-Complexity Model for Aesthetic Quality Assessment of Symbolic Homophony Music ScoresSubjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
- [896] arXiv:2301.06059 (cross-list from cs.GR) [pdf, other]
-
Title: Learning Audio-Driven Viseme Dynamics for 3D Face AnimationComments: Project page: this https URLSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [897] arXiv:2301.06133 (cross-list from cs.LG) [pdf, other]
-
Title: Improving Reliability of Fine-tuning with Block-wise OptimisationComments: 10 pagesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [898] arXiv:2301.06160 (cross-list from cs.DL) [pdf, other]
-
Title: TextileNet: A Material Taxonomy-based Fashion Textile DatasetComments: 10 pages, 4 figures, 2 tablesSubjects: Digital Libraries (cs.DL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [899] arXiv:2301.06180 (cross-list from cs.CR) [pdf, other]
-
Title: Secure Video Streaming Using Dedicated HardwareSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [900] arXiv:2301.06193 (cross-list from cs.LG) [pdf, other]
-
Title: RedBit: An End-to-End Flexible Framework for Evaluating the Accuracy of Quantized CNNsComments: 17 pages, 4 figures, 14 tablesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [901] arXiv:2301.06230 (cross-list from cs.RO) [pdf, other]
-
Title: Swarm-SLAM : Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot SystemsComments: Code: this https URLSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [902] arXiv:2301.06363 (cross-list from cs.NI) [pdf, other]
-
Title: A$^2$-UAV: Application-Aware Content and Network Optimization of Edge-Assisted UAV SystemsAuthors: Andrea Coletta, Flavio Giorgi, Gaia Maselli, Matteo Prata, Domenicomichele Silvestri, Jonathan Ashdown, Francesco RestucciaComments: Accepted to INFOCOM 2023Subjects: Networking and Internet Architecture (cs.NI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [903] arXiv:2301.06375 (cross-list from cs.MM) [pdf, other]
-
Title: OLKAVS: An Open Large-Scale Korean Audio-Visual Speech DatasetAuthors: Jeongkyun Park, Jung-Wook Hwang, Kwanghee Choi, Seung-Hyun Lee, Jun Hwan Ahn, Rae-Hong Park, Hyung-Min ParkSubjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD)
- [904] arXiv:2301.06489 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Simplex AutoencodersSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [905] arXiv:2301.06626 (cross-list from cs.LG) [src]
-
Title: Masked Vector QuantizationComments: A newer version of this manuscript was archived under 2312.11735Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [906] arXiv:2301.06730 (cross-list from cs.HC) [pdf, other]
-
Title: Bag of States: A Non-sequential Approach to Video-based Engagement MeasurementSubjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [907] arXiv:2301.06871 (cross-list from cs.LG) [pdf, other]
-
Title: Denoising Diffusion Probabilistic Models as a Defense against Adversarial AttacksSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [908] arXiv:2301.06882 (cross-list from cs.CR) [pdf, other]
-
Title: Multi-Biometric Fuzzy Vault based on Face and FingerprintsAuthors: Christian Rathgeb, Benjamin Tams, Johannes Merkle, Vanessa Nesterowicz, Ulrike Korte, Matthias NeuSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [909] arXiv:2301.07147 (cross-list from cs.RO) [pdf, other]
-
Title: COVINS-G: A Generic Back-end for Collaborative Visual-Inertial SLAMComments: 6+1 Pages, 5 Figures, 3 Tables, Accepted at ICRA 2023, LondonSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
- [910] arXiv:2301.07150 (cross-list from cs.RO) [pdf, other]
-
Title: Embodied Agents for Efficient Exploration and Smart Scene DescriptionComments: Accepted by IEEE International Conference on Robotics and Automation (ICRA 2023)Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [911] arXiv:2301.07204 (cross-list from cs.RO) [pdf, other]
-
Title: Robotic Navigation Autonomy for Subretinal Injection via Intelligent Real-Time Virtual iOCT Volume SlicingAuthors: Shervin Dehghani, Michael Sommersperger, Peiyao Zhang, Alejandro Martin-Gomez, Benjamin Busam, Peter Gehlbach, Nassir Navab, M. Ali Nasseri, Iulian IordachitaSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [912] arXiv:2301.07279 (cross-list from cs.RO) [pdf, other]
-
Title: SensorX2car: Sensors-to-car calibration for autonomous driving in road scenariosComments: 8 pages, 12 figuresSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [913] arXiv:2301.07294 (cross-list from cs.LG) [pdf, other]
-
Title: Enhancing Self-Training MethodsAuthors: Aswathnarayan Radhakrishnan, Jim Davis, Zachary Rabin, Benjamin Lewis, Matthew Scherreik, Roman IlinSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [914] arXiv:2301.07306 (cross-list from cs.LG) [pdf, other]
-
Title: Improve Noise Tolerance of Robust Loss via Noise-AwarenessComments: arXiv admin note: text overlap with arXiv:2002.06482Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [915] arXiv:2301.07502 (cross-list from cs.LG) [pdf, other]
-
Title: Multimodal Side-Tuning for Document ClassificationComments: 2020 25th International Conference on Pattern Recognition (ICPR)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [916] arXiv:2301.07681 (cross-list from cs.MM) [pdf, other]
-
Title: Reduced-Reference Quality Assessment of Point Clouds via Content-Oriented Saliency ProjectionSubjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
- [917] arXiv:2301.07799 (cross-list from cs.LG) [pdf, other]
-
Title: A Domain-Agnostic Approach for Characterization of Lifelong Learning SystemsAuthors: Megan M. Baker, Alexander New, Mario Aguilar-Simon, Ziad Al-Halah, Sébastien M. R. Arnold, Ese Ben-Iwhiwhu, Andrew P. Brna, Ethan Brooks, Ryan C. Brown, Zachary Daniels, Anurag Daram, Fabien Delattre, Ryan Dellana, Eric Eaton, Haotian Fu, Kristen Grauman, Jesse Hostetler, Shariq Iqbal, Cassandra Kent, Nicholas Ketz, Soheil Kolouri, George Konidaris, Dhireesha Kudithipudi, Erik Learned-Miller, Seungwon Lee, Michael L. Littman, Sandeep Madireddy, Jorge A. Mendez, Eric Q. Nguyen, Christine D. Piatko, Praveen K. Pilly, Aswin Raghavan, Abrar Rahman, Santhosh Kumar Ramakrishnan, Neale Ratzlaff, Andrea Soltoggio, Peter Stone, Indranil Sur, Zhipeng Tang, Saket Tiwari, Kyle Vedder, Felix Wang, Zifan Xu, Angel Yanguas-Gil, Harel Yedidsion, Shangqun Yu, Gautam K. VallabhaComments: To appear in Neural NetworksSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [918] arXiv:2301.08174 (cross-list from cs.RO) [pdf, other]
-
Title: Collaborative Robotic Ultrasound Tissue Scanning for Surgical Resection Guidance in NeurosurgeryAuthors: Alistair Weld, Michael Dyck, Julian Klodmann, Giulio Anichini, Luke Dixon, Sophie Camp, Alin Albu-Schäffer, Stamatia GiannarouJournal-ref: Proceedings of the Hamlyn Symposium 2022Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [919] arXiv:2301.08387 (cross-list from cs.RO) [pdf, other]
-
Title: Occlusion Reasoning for Skeleton Extraction of Self-Occluded Tree CanopiesComments: 7 pages, 10 figures, submitted to ICRA 2023Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [920] arXiv:2301.08529 (cross-list from eess.SY) [pdf, ps, other]
-
Title: Exploration of Various Fractional Order Derivatives in Parkinson's Disease Dysgraphia AnalysisAuthors: Jan Mucha, Zoltan Galaz, Jiri Mekyska, Marcos Faundez-Zanuy, Vojtech Zvoncak, Zdenek Smekal, Lubos Brabenec, Irena RektorovaComments: Print ISBN 978-3-031-19744-4Journal-ref: In: Carmona-Duarte, C., Diaz, M., Ferrer, M.A., Morales, A. (eds) Intertwining Graphonomics with Human Movements. IGS 2022. Lecture Notes in Computer Science, vol 13424. Springer, ChamSubjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV)
- [921] arXiv:2301.08556 (cross-list from cs.LG) [pdf, other]
-
Title: NeRF in the Palm of Your Hand: Corrective Augmentation for Robotics via Novel-View SynthesisSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [922] arXiv:2301.08571 (cross-list from cs.CL) [pdf, other]
-
Title: Visual Writing Prompts: Character-Grounded Story Generation with Curated Image SequencesComments: Paper accepted by Transactions of the Association for Computational Linguistics (TACL). This is a pre-MIT Press publication version. 15 pages, 6 figuresSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [923] arXiv:2301.08784 (cross-list from cs.CL) [pdf, other]
-
Title: Visual Semantic Relatedness Dataset for Image CaptioningComments: Project Page: bit.ly/project-page-paperSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [924] arXiv:2301.08794 (cross-list from cs.RO) [pdf, other]
-
Title: Robot Skill Learning Via Classical Robotics-Based Generated Datasets: Advantages, Disadvantages, and Future ImprovementAuthors: Batu Kaan OezenSubjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Systems and Control (eess.SY)
- [925] arXiv:2301.08846 (cross-list from cs.LG) [pdf, other]
-
Title: Regeneration Learning: A Learning Paradigm for Data GenerationSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
- [926] arXiv:2301.09056 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Performance Study of YOLOv5 and Faster R-CNN for Autonomous Navigation around Non-Cooperative TargetsAuthors: Trupti Mahendrakar, Andrew Ekblad, Nathan Fischer, Ryan T. White, Markus Wilde, Brian Kish, Isaac SilverComments: 12 pages, 10 figures, 9 tables, IEEE Aerospace Conference 2022Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [927] arXiv:2301.09059 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Autonomous Rendezvous with Non-cooperative Target Objects with Swarm Chasers and ObserversAuthors: Trupti Mahendrakar, Steven Holmberg, Andrew Ekblad, Emma Conti, Ryan T. White, Markus Wilde, Isaac SilverComments: Presented at AAS/AIAA Spaceflight Mechanics Meeting 2023, 17 pages, 9 figures, 3 tablesSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [928] arXiv:2301.09164 (cross-list from cs.LG) [pdf, other]
-
Title: Unifying Synergies between Self-supervised Learning and Dynamic ComputationAuthors: Tarun Krishna, Ayush K Rai, Alexandru Drimbarean, Eric Arazo, Paul Albert, Alan F Smeaton, Kevin McGuinness, Noel E O'ConnorComments: Accepted in BMVC 2023Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [929] arXiv:2301.09213 (cross-list from cs.RO) [pdf, other]
-
Title: FRAME: Fast and Robust Autonomous 3D point cloud Map-merging for Egocentric multi-robot explorationComments: to be publishedSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [930] arXiv:2301.09264 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Efficient Training Under Limited ResourcesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [931] arXiv:2301.09420 (cross-list from cs.LG) [pdf, other]
-
Title: On Multi-Agent Deep Deterministic Policy Gradients and their Explainability for SMARTS EnvironmentComments: 6 pages, 5 figuresSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [932] arXiv:2301.09422 (cross-list from cs.LG) [pdf, other]
-
Title: HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural NetworksAuthors: Jinqi Xiao, Chengming Zhang, Yu Gong, Miao Yin, Yang Sui, Lizhi Xiang, Dingwen Tao, Bo YuanComments: AAAI-23Journal-ref: Proceedings of the AAAI Conference on Artificial Intelligence. 37, 9 (Jun. 2023), 10464-10472Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [933] arXiv:2301.09515 (cross-list from cs.LG) [pdf, other]
-
Title: StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image SynthesisComments: Project page: this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [934] arXiv:2301.09544 (cross-list from cs.RO) [pdf, other]
-
Title: Learning to View: Decision Transformers for Active Object DetectionAuthors: Wenhao Ding, Nathalie Majcherczyk, Mohit Deshpande, Xuewei Qi, Ding Zhao, Rajasimman Madhivanan, Arnie SenComments: Accepted to ICRA 2023Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [935] arXiv:2301.10047 (cross-list from cs.GR) [pdf, other]
-
Title: DiffMotion: Speech-Driven Gesture Synthesis Using Denoising Diffusion ModelComments: 13 pages, 3 figuresSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
- [936] arXiv:2301.10056 (cross-list from cs.CR) [pdf, ps, other]
-
Title: Side Eye: Characterizing the Limits of POV Acoustic Eavesdropping from Smartphone Cameras with Rolling Shutters and Movable LensesJournal-ref: 2023 IEEE Symposium on Security and PrivacySubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [937] arXiv:2301.10127 (cross-list from cs.LG) [pdf, other]
-
Title: Improving Open-Set Semi-Supervised Learning with Self-SupervisionComments: WACV2024Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [938] arXiv:2301.10327 (cross-list from cs.LG) [pdf, other]
-
Title: Generating Multidimensional Clusters With Support LinesComments: The peer-reviewed version of this paper is published in Knowledge-Based Systems at this https URL. This version is typeset by the author and differs only in pagination and typographical detailJournal-ref: Knowledge-Based Systems, 277, 110836, 2023Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Programming Languages (cs.PL)
- [939] arXiv:2301.10418 (cross-list from cs.LG) [pdf, other]
-
Title: DEJA VU: Continual Model Generalization For Unseen DomainsComments: Published as a conference paper at ICLR 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [940] arXiv:2301.10454 (cross-list from cs.LG) [pdf, other]
-
Title: A Data-Centric Approach for Improving Adversarial Training Through the Lens of Out-of-Distribution DetectionAuthors: Mohammad Azizmalayeri, Arman Zarei, Alireza Isavand, Mohammad Taghi Manzuri, Mohammad Hossein RohbanComments: Accepted to CSICC 2023Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [941] arXiv:2301.10876 (cross-list from cs.LG) [pdf, other]
-
Title: Reef-insight: A framework for reef habitat mapping with clustering methods via remote sensingJournal-ref: Information, 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [942] arXiv:2301.10908 (cross-list from cs.LG) [pdf, other]
-
Title: Distilling Cognitive Backdoor Patterns within an ImageComments: ICLR2023Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [943] arXiv:2301.10921 (cross-list from cs.LG) [pdf, other]
-
Title: SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised LearningAuthors: Hao Chen, Ran Tao, Yue Fan, Yidong Wang, Jindong Wang, Bernt Schiele, Xing Xie, Bhiksha Raj, Marios SavvidesComments: Accepted by ICLR 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [944] arXiv:2301.11065 (cross-list from cs.LG) [pdf, other]
-
Title: Inspecting class hierarchies in classification-based metric learning modelsComments: The main manuscript is 22 pages. The whole paper is 49 pages. The code for our experiments will be available at this https URL. The plankton datasets are available from the Norwegian Marine Data Center (MicroS: this https URL, MicroL: this https URL, MesoZ: this https URL)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [945] arXiv:2301.11104 (cross-list from cs.LG) [pdf, other]
-
Title: Discovering and Mitigating Visual Biases through Keyword ExplanationComments: CVPR 2024. First two authors contributed equallySubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [946] arXiv:2301.11275 (cross-list from cs.RO) [pdf, other]
-
Title: A Review of Scene Representations for Robot ManipulatorsAuthors: Carter SiffermanComments: 23 pages, 7 figures, 2 tablesSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [947] arXiv:2301.11316 (cross-list from cs.LG) [pdf, other]
-
Title: Open Problems in Applied Deep LearningAuthors: Maziar RaissiSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR)
- [948] arXiv:2301.11405 (cross-list from cs.LG) [pdf, other]
-
Title: Discriminative Entropy Clustering and its Relation to K-means and SVMSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [949] arXiv:2301.11494 (cross-list from cs.LG) [pdf, other]
-
Title: Learning Vortex Dynamics for Fluid Inference and PredictionComments: ICLR 2023, project webpage: this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [950] arXiv:2301.11520 (cross-list from cs.LG) [pdf, other]
-
Title: SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement LearningComments: ICML 2023. First two authors contributed equally. Order was determined by coin flipSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [951] arXiv:2301.11522 (cross-list from cs.AI) [pdf, other]
-
Title: A Comparison of Tiny-nerf versus Spatial Representations for 3d ReconstructionSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [952] arXiv:2301.11564 (cross-list from cs.RO) [pdf, other]
-
Title: Learning 6-DoF Fine-grained Grasp Detection Based on Part Affordance GroundingComments: 10 pages, 3 figures, 7 tablesSubjects: Robotics (cs.RO); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [953] arXiv:2301.11699 (cross-list from cs.LG) [pdf, other]
-
Title: Image Restoration with Mean-Reverting Stochastic Differential EquationsComments: Accepted by ICML 2023; Project page: this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [954] arXiv:2301.11706 (cross-list from cs.LG) [pdf, other]
-
Title: Input Perturbation Reduces Exposure Bias in Diffusion ModelsComments: Accepted by ICML 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [955] arXiv:2301.11745 (cross-list from cs.CR) [pdf, other]
- [956] arXiv:2301.11779 (cross-list from cs.LG) [pdf, other]
-
Title: Invariant Meta Learning for Out-of-Distribution GeneralizationComments: IEEE Conference on Computer Vision and Pattern Recognition 2022 The Ninth Workshop on Fine-Grained Visual CategorizationSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [957] arXiv:2301.11810 (cross-list from cs.LG) [pdf, other]
-
Title: BOMP-NAS: Bayesian Optimization Mixed Precision NASSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [958] arXiv:2301.11823 (cross-list from cs.RO) [pdf, other]
-
Title: HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual CameraAuthors: Mostafa Ahmadi, Amin Alizadeh Naeini, Mohammad Moein Sheikholeslami, Zahra Arjmandi, Yujia Zhang, Gunho SohnComments: 8 pages, 3 figures, To be published in IEEE International Conference on Automation Science and Engineering (CASE) 2023Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [959] arXiv:2301.11843 (cross-list from cs.CL) [pdf, other]
-
Title: Reading and Reasoning over Chart Images for Evidence-based Automated Fact-CheckingComments: Accepted to EACL 2023 (Findings)Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [960] arXiv:2301.11892 (cross-list from cs.LG) [pdf, other]
-
Title: Streaming LifeLong Learning With Any-Time InferenceComments: arXiv admin note: substantial text overlap with arXiv:2110.10741Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [961] arXiv:2301.11990 (cross-list from cs.LG) [pdf, other]
-
Title: Alignment with human representations supports robust few-shot learningComments: Spotlight at NeurIPS 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (stat.ML)
- [962] arXiv:2301.12003 (cross-list from cs.LG) [pdf, other]
-
Title: Minimizing Trajectory Curvature of ODE-based Generative ModelsComments: ICML 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [963] arXiv:2301.12006 (cross-list from cs.LG) [pdf, other]
-
Title: Improved knowledge distillation by utilizing backward pass knowledge in neural networksSubjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [964] arXiv:2301.12067 (cross-list from cs.LG) [pdf, other]
-
Title: Learning Optimal Features via Partial InvarianceComments: Presented at the 37th AAAI Conference on Artificial Intelligence, 2023Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [965] arXiv:2301.12168 (cross-list from cs.LG) [pdf, other]
-
Title: Anticipate, Ensemble and Prune: Improving Convolutional Neural Networks via Aggregated Early ExitsSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [966] arXiv:2301.12246 (cross-list from cs.LG) [pdf, other]
-
Title: A Closer Look at Few-shot Classification AgainComments: Accepted at ICML 2023Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [967] arXiv:2301.12269 (cross-list from cs.CY) [pdf, other]
-
Title: Methods and Tools for Monitoring Driver's BehaviorAuthors: Muhammad Tanveer Jan, Sonia Moshfeghi, Joshua William Conniff, Jinwoo Jang, Kwangsoo Yang, Jiannan Zhai, Monica Rosselli, David Newman, Ruth Tappen, Borko FurhtSubjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV)
- [968] arXiv:2301.12293 (cross-list from cs.AI) [pdf, other]
-
Title: ACL-Fig: A Dataset for Scientific Figure ClassificationComments: 6 pages, 4 figures, accepted by the AAAI-23 Workshop on Scientific Document UnderstandingSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Digital Libraries (cs.DL)
- [969] arXiv:2301.12334 (cross-list from cs.LG) [pdf, other]
-
Title: Don't Play Favorites: Minority Guidance for Diffusion ModelsComments: ICLR 2024Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [970] arXiv:2301.12356 (cross-list from cs.NE) [pdf, other]
-
Title: Exploiting High Performance Spiking Neural Networks with Efficient Spiking PatternsSubjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
- [971] arXiv:2301.12378 (cross-list from cs.LG) [pdf, other]
-
Title: Towards Inference Efficient Deep Ensemble LearningComments: 11 pages, accepted in AAAI 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [972] arXiv:2301.12456 (cross-list from cs.LG) [pdf, other]
-
Title: Towards Verifying the Geometric Robustness of Large-scale Neural NetworksSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [973] arXiv:2301.12549 (cross-list from cs.LG) [pdf, other]
-
Title: Unlocking Deterministic Robustness Certification on ImageNetSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [974] arXiv:2301.12554 (cross-list from cs.LG) [pdf, other]
-
Title: Improving the Accuracy-Robustness Trade-Off of Classifiers via Adaptive SmoothingSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [975] arXiv:2301.12592 (cross-list from cs.LG) [pdf, other]
-
Title: Ensemble Learning for Fusion of Multiview Vision with Occlusion and Missing Information: Framework and Evaluations with Real-World Data and Applications in Driver Hand Activity RecognitionSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [976] arXiv:2301.12614 (cross-list from cs.RO) [pdf, other]
-
Title: RREx-BoT: Remote Referring Expressions with a Bag of TricksSubjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [977] arXiv:2301.12667 (cross-list from cs.LG) [pdf, other]
-
Title: NeSyFOLD: Neurosymbolic Framework for Interpretable Image ClassificationSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [978] arXiv:2301.12686 (cross-list from cs.LG) [pdf, other]
-
Title: GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion RestorationAuthors: Naoki Murata, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano ErmonSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [979] arXiv:2301.12688 (cross-list from cs.GR) [pdf, other]
-
Title: Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video ProductionComments: Project page: this https URLSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM); Image and Video Processing (eess.IV)
- [980] arXiv:2301.12831 (cross-list from cs.MM) [pdf, other]
-
Title: M3FAS: An Accurate and Robust MultiModal Mobile Face Anti-Spoofing SystemSubjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
- [981] arXiv:2301.12900 (cross-list from cs.AI) [pdf, other]
-
Title: DepGraph: Towards Any Structural PruningSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [982] arXiv:2301.12935 (cross-list from cs.LG) [pdf, other]
-
Title: ERA-Solver: Error-Robust Adams Solver for Fast Sampling of Diffusion Probabilistic ModelsComments: 16 pages, 12 figuresSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [983] arXiv:2301.12995 (cross-list from cs.LG) [pdf, other]
-
Title: FedFA: Federated Feature AugmentationComments: Accepted to ICLR 2023Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [984] arXiv:2301.13018 (cross-list from cs.LG) [pdf, other]
-
Title: DELTA: degradation-free fully test-time adaptationSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [985] arXiv:2301.13166 (cross-list from cs.AI) [pdf, other]
-
Title: ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object NavigationAuthors: Kaiwen Zhou, Kaizhi Zheng, Connor Pryor, Yilin Shen, Hongxia Jin, Lise Getoor, Xin Eric WangSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [986] arXiv:2301.13188 (cross-list from cs.CR) [pdf, other]
-
Title: Extracting Training Data from Diffusion ModelsAuthors: Nicholas Carlini, Jamie Hayes, Milad Nasr, Matthew Jagielski, Vikash Sehwag, Florian Tramèr, Borja Balle, Daphne Ippolito, Eric WallaceSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [987] arXiv:2301.13195 (cross-list from cs.LG) [pdf, other]
-
Title: Adaptive Computation with Elastic Input SequenceSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [988] arXiv:2301.13197 (cross-list from cs.LG) [pdf, other]
-
Title: Unlocking Slot Attention by Changing Optimal Transport CostsComments: Published at International Conference on Machine Learning (ICML) 2023Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [989] arXiv:2301.13244 (cross-list from cs.RO) [pdf, other]
-
Title: Mono-STAR: Mono-camera Scene-level Tracking and ReconstructionComments: This paper has been accepted by ICRA 2023Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [990] arXiv:2301.13261 (cross-list from cs.AI) [pdf, other]
-
Title: Emergence of Maps in the Memories of Blind Navigation AgentsComments: Accepted to ICLR 2023Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [991] arXiv:2301.13330 (cross-list from cs.LG) [pdf, other]
-
Title: Efficient and Effective Methods for Mixed Precision Neural Network Quantization for Faster, Energy-efficient InferenceAuthors: Deepika Bablani, Jeffrey L. Mckinstry, Steven K. Esser, Rathinakumar Appuswamy, Dharmendra S. ModhaSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [992] arXiv:2301.13338 (cross-list from cs.LG) [pdf, other]
-
Title: Continuous Spatiotemporal TransformersComments: Updated version, after reviewsSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [993] arXiv:2301.13343 (cross-list from cs.LG) [pdf, other]
-
Title: Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement LearningComments: The 2022 International Joint Conference on Neural Networks (IJCNN2022)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [994] arXiv:2301.13376 (cross-list from cs.LG) [pdf, other]
-
Title: Quantized Neural Networks for Low-Precision Accumulation with Guaranteed Overflow AvoidanceSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV)
- [995] arXiv:2301.13381 (cross-list from cs.LG) [pdf, other]
-
Title: When Source-Free Domain Adaptation Meets Learning with Noisy LabelsAuthors: Li Yi, Gezheng Xu, Pengcheng Xu, Jiaqi Li, Ruizhi Pu, Charles Ling, A. Ian McLeod, Boyu WangComments: ICLR 2023 camera-readySubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [996] arXiv:2301.13530 (cross-list from cs.LG) [pdf, other]
-
Title: Domain-Generalizable Multiple-Domain ClusteringComments: 13 pages, 3 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [997] arXiv:2301.13576 (cross-list from cs.AI) [pdf, other]
-
Title: Sport Task: Fine Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2022Authors: Pierre-Etienne Martin (MPI-EVA), Jordan Calandre (MIA), Boris Mansencal (LaBRI), Jenny Benois-Pineau (LaBRI), Renaud Péteri (MIA), Laurent Mascarilla (MIA), Julien MorlierComments: MediaEval 2022 Workshop, Jan 2023, Bergen, Norway. arXiv admin note: substantial text overlap with arXiv:2112.11384Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Multimedia (cs.MM)
- [998] arXiv:2301.13622 (cross-list from cs.LG) [pdf, other]
-
Title: Learning Data Representations with Joint Diffusion ModelsComments: Code: this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [999] arXiv:2301.13809 (cross-list from cs.RO) [pdf, other]
-
Title: A Prototype System for High Frame Rate Ultrasound Imaging based Prosthetic Arm ControlSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1000] arXiv:2301.13823 (cross-list from cs.CL) [pdf, other]
-
Title: Grounding Language Models to Images for Multimodal Inputs and OutputsComments: Published in ICML 2023. Project page: this https URLSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1001] arXiv:2301.13838 (cross-list from cs.CR) [pdf, other]
-
Title: Image Shortcut Squeezing: Countering Perturbative Availability Poisons with CompressionComments: To appear at ICML 2023. Our code is available at this https URLSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1002] arXiv:2301.13862 (cross-list from cs.LG) [pdf, other]
-
Title: Salient Conditional Diffusion for Defending Against Backdoor AttacksComments: 14 pages, 5 figures. Edit: Added new baselinesSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1003] arXiv:2301.00127 (cross-list from eess.IV) [pdf, other]
-
Title: Spatiotemporal implicit neural representation for unsupervised dynamic MRI reconstructionComments: 9 pages, 5 figures; corrected the code availability description for arXivSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1004] arXiv:2301.00326 (cross-list from math.OC) [pdf, other]
-
Title: Yuille-Poggio's Flow and Global Minimizer of polynomials through convexification by Heat EvolutionAuthors: Qiao WangSubjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV)
- [1005] arXiv:2301.00349 (cross-list from eess.IV) [pdf, other]
-
Title: EvidenceCap: Towards trustworthy medical image segmentation via evidential identity capAuthors: Ke Zou, Xuedong Yuan, Xiaojing Shen, Yidi Chen, Meng Wang, Rick Siow Mong Goh, Yong Liu, Huazhu FuComments: 38 pages, 6 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1006] arXiv:2301.00504 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Spectral Bandwidth Recovery of Optical Coherence Tomography Images using Deep LearningSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [1007] arXiv:2301.00765 (cross-list from eess.IV) [pdf, other]
-
Title: Segmentation based tracking of cells in 2D+time microscopy images of macrophagesComments: Computers in Biology and Medicine, Volume 153, 106499 (2023)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Cell Behavior (q-bio.CB)
- [1008] arXiv:2301.00785 (cross-list from eess.IV) [pdf, other]
-
Title: CLIP-Driven Universal Model for Organ Segmentation and Tumor DetectionAuthors: Jie Liu, Yixiao Zhang, Jie-Neng Chen, Junfei Xiao, Yongyi Lu, Bennett A. Landman, Yixuan Yuan, Alan Yuille, Yucheng Tang, Zongwei ZhouComments: ICCV-2023; Rank first in Medical Segmentation Decathlon (MSD) CompetitionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1009] arXiv:2301.00934 (cross-list from eess.IV) [pdf, other]
-
Title: Finding the Most Transferable Tasks for Brain Image SegmentationComments: Accepted by BIBM 2022Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1010] arXiv:2301.01054 (cross-list from eess.IV) [pdf, other]
-
Title: Benchmarking common uncertainty estimation methods with histopathological images under domain shift and label noiseComments: 22 pages, 5 figures, 5 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
- [1011] arXiv:2301.01069 (cross-list from eess.IV) [pdf, other]
-
Title: Saliency-Aware Spatio-Temporal Artifact Detection for Compressed Video Quality AssessmentSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
- [1012] arXiv:2301.01079 (cross-list from eess.IV) [pdf, other]
-
Title: Fine-Grained Hard Negative Mining: Generalizing Mitosis Detection with a Fifth of the MIDOG 2022 DatasetSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1013] arXiv:2301.01182 (cross-list from eess.IV) [pdf, other]
-
Title: PMT-IQA: Progressive Multi-task Learning for Blind Image Quality AssessmentSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1014] arXiv:2301.01355 (cross-list from eess.IV) [pdf, other]
-
Title: Holistic Multi-Slice Framework for Dynamic Simultaneous Multi-Slice MRI ReconstructionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1015] arXiv:2301.01369 (cross-list from eess.IV) [pdf, other]
-
Title: Brain Tissue Segmentation Across the Human Lifespan via Supervised Contrastive LearningAuthors: Xiaoyang Chen, Jinjian Wu, Wenjiao Lyu, Yicheng Zou, Kim-Han Thung, Siyuan Liu, Ye Wu, Sahar Ahmad, Pew-Thian YapSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1016] arXiv:2301.01448 (cross-list from eess.IV) [pdf, other]
-
Title: A deep local attention network for pre-operative lymph node metastasis prediction in pancreatic cancer via multiphase CT imagingAuthors: Zhilin Zheng, Xu Fang, Jiawen Yao, Mengmeng Zhu, Le Lu, Lingyun Huang, Jing Xiao, Yu Shi, Hong Lu, Jianping Lu, Ling Zhang, Chengwei Shao, Yun BianComments: 14 pages, 5 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1017] arXiv:2301.01679 (cross-list from eess.IV) [pdf, other]
-
Title: COVID-Net USPro: An Open-Source Explainable Few-Shot Deep Prototypical Network to Monitor and Detect COVID-19 Infection from Point-of-Care Ultrasound ImagesComments: 12 pages, 5 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1018] arXiv:2301.01732 (cross-list from eess.IV) [pdf, ps, other]
-
Title: UNAEN: Unsupervised Abnormality Extraction Network for MRI Motion Artifact ReductionAuthors: Yusheng Zhou, Hao Li, Jianan Liu, Zhengmin Kong, Tao Huang, Euijoon Ahn, Zhihan Lv, Jinman Kim, David Dagan FengSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1019] arXiv:2301.01791 (cross-list from eess.IV) [pdf, other]
-
Title: Fully Automated Artery-Vein ratio and vascular tortuosity measurement in retinal fundus imagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1020] arXiv:2301.01814 (cross-list from physics.soc-ph) [pdf, ps, other]
-
Title: Living Images: A Recursive Approach to Computing the Structural Beauty of Images or the Livingness of SpaceComments: 19 pages, 10 figures, 6 tablesSubjects: Physics and Society (physics.soc-ph); Computer Vision and Pattern Recognition (cs.CV)
- [1021] arXiv:2301.01911 (cross-list from eess.IV) [pdf, ps, other]
-
Title: TractGraphCNN: anatomically informed graph CNN for classification using diffusion MRI tractographyAuthors: Yuqian Chen, Fan Zhang, Leo R. Zekelman, Tengfei Xue, Chaoyi Zhang, Yang Song, Nikos Makris, Yogesh Rathi, Weidong Cai, Lauren J. O'DonnellComments: 5 pages, 3 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1022] arXiv:2301.01940 (cross-list from eess.IV) [pdf, other]
-
Title: Enabling Augmented Segmentation and Registration in Ultrasound-Guided Spinal Surgery via Realistic Ultrasound Synthesis from Diagnostic CT VolumeComments: Submitted to IEEE Transactions on Automation Science and Engineering. Copyright may be transferred without notice, after which this version may no longer be accessible. Note that the abstract is shorter than that in the pdf file due to character limitationsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1023] arXiv:2301.02069 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Learning for Breast MRI Style Transfer with Limited Training DataComments: preprint version, accepted in the Journal of Digital Imaging (JDIM). 16 pages (+ author names + references + supplementary), 6 figuresJournal-ref: J Digit Imaging (2022)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1024] arXiv:2301.02166 (cross-list from eess.IV) [pdf, other]
-
Title: Identification of lung nodules CT scan using YOLOv5 based on convolution neural networkComments: 14 pages, 10 Postscript figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1025] arXiv:2301.02181 (cross-list from eess.IV) [pdf, other]
-
Title: A Critical Appraisal of Data Augmentation Methods for Imaging-Based Medical Diagnosis ApplicationsAuthors: Tara M. Pattilachan, Ugur Demir, Elif Keles, Debesh Jha, Derk Klatte, Megan Engels, Sanne Hoogenboom, Candice Bolan, Michael Wallace, Ulas BagciSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1026] arXiv:2301.02228 (cross-list from eess.IV) [pdf, other]
-
Title: MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training in RadiologySubjects: Image and Video Processing (eess.IV); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1027] arXiv:2301.02268 (cross-list from math.OC) [pdf, other]
-
Title: Restarts subject to approximate sharpness: A parameter-free and optimal scheme for first-order methodsSubjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Numerical Analysis (math.NA)
- [1028] arXiv:2301.02317 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Convolutional XGBoost (C-XGBOOST) Model for Brain Tumor DetectionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1029] arXiv:2301.02341 (cross-list from eess.IV) [pdf, ps, other]
-
Title: A survey on Organoid Image Analysis PlatformsComments: 19 pages, 10 figures, 5 tables, research reviewSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1030] arXiv:2301.02388 (cross-list from eess.IV) [pdf, other]
-
Title: Generating corneal panoramic images from contact specular microscope imagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1031] arXiv:2301.02390 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Deep-learning models in medical image analysis: Detection of esophagitis from the Kvasir DatasetSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1032] arXiv:2301.02393 (cross-list from eess.IV) [pdf, other]
-
Title: Graph Convolution Based Cross-Network Multi-Scale Feature Fusion for Deep Vessel SegmentationAuthors: Gangming Zhao, Kongming Liang, Chengwei Pan, Fandong Zhang, Xianpeng Wu, Xinyang Hu, Yizhou YuSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1033] arXiv:2301.02437 (cross-list from stat.ML) [pdf, other]
-
Title: Valid P-Value for Deep Learning-Driven Salient RegionSubjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1034] arXiv:2301.02468 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Deep Learning For Classification Of Chest X-Ray Images (Covid 19)Comments: 6 pages, 3 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1035] arXiv:2301.02488 (cross-list from eess.SP) [pdf, ps, other]
-
Title: TWR-MCAE: A Data Augmentation Method for Through-the-Wall Radar Human Motion RecognitionComments: Publisher: IEEE Transactions on Geoscience and Remote Sensing (Volume: 60). Total Pages: 17. Total Figures: 17Journal-ref: in IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-17, 2022, Art no. 5118617Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [1036] arXiv:2301.02608 (cross-list from eess.IV) [pdf, other]
-
Title: A CAD System for Colorectal Cancer from WSI: A Clinically Validated Interpretable ML-based PrototypeAuthors: Pedro C. Neto, Diana Montezuma, Sara P. Oliveira, Domingos Oliveira, João Fraga, Ana Monteiro, João Monteiro, Liliana Ribeiro, Sofia Gonçalves, Stefan Reinhard, Inti Zlobec, Isabel M. Pinto, Jaime S. CardosoComments: Under ReviewSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1037] arXiv:2301.02735 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Designing an Improved Deep Learning-based Model for COVID-19 Recognition in Chest X-ray Images: A Knowledge Distillation ApproachComments: 25 pages, 3 figures, 5 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1038] arXiv:2301.02757 (cross-list from physics.ins-det) [pdf, other]
-
Title: Mimicking non-ideal instrument behavior for hologram processing using neural style translationComments: 23 pages, 9 figuresSubjects: Instrumentation and Detectors (physics.ins-det); Computer Vision and Pattern Recognition (cs.CV)
- [1039] arXiv:2301.03027 (cross-list from eess.IV) [pdf, other]
-
Title: Annealed Score-Based Diffusion Model for MR Motion Artifact ReductionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1040] arXiv:2301.03047 (cross-list from eess.IV) [pdf, other]
-
Title: Large-scale Global Low-rank Optimization for Computational Compressed ImagingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
- [1041] arXiv:2301.03081 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Automatic Diagnosis of Carotid Atherosclerosis Using a Portable Freehand 3D Ultrasound Imaging SystemAuthors: Jiawen Li, Yunqian Huang, Sheng Song, Hongbo Chen, Junni Shi, Duo Xu, Haibin Zhang, Man Chen, Rui ZhengSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [1042] arXiv:2301.03162 (cross-list from physics.optics) [pdf, ps, other]
-
Title: eFIN: Enhanced Fourier Imager Network for generalizable autofocusing and pixel super-resolution in holographic imagingComments: 10 Pages, 4 FiguresJournal-ref: IEEE Journal of Selected Topics in Quantum Electronics (2023)Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1043] arXiv:2301.03202 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Integrating features from lymph node stations for metastatic lymph node detectionJournal-ref: Computerized Medical Imaging and Graphics, Volume 101, 2022, 102108, ISSN 0895-6111Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1044] arXiv:2301.03281 (cross-list from eess.IV) [pdf, other]
-
Title: The state-of-the-art 3D anisotropic intracranial hemorrhage segmentation on non-contrast head CT: The INSTANCE challengeAuthors: Xiangyu Li, Gongning Luo, Kuanquan Wang, Hongyu Wang, Jun Liu, Xinjie Liang, Jie Jiang, Zhenghao Song, Chunyue Zheng, Haokai Chi, Mingwang Xu, Yingte He, Xinghua Ma, Jingwen Guo, Yifan Liu, Chuanpu Li, Zeli Chen, Md Mahfuzur Rahman Siddiquee, Andriy Myronenko, Antoine P. Sanner, Anirban Mukhopadhyay, Ahmed E. Othman, Xingyu Zhao, Weiping Liu, Jinhuang Zhang, Xiangyuan Ma, Qinghui Liu, Bradley J. MacIntosh, Wei Liang, Moona Mazher, Abdul Qayyum, Valeriia Abramova, Xavier Lladó, Shuo LiComments: Summarized paper for the MICCAI INSTANCE 2022 ChallengeSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1045] arXiv:2301.03335 (cross-list from eess.IV) [pdf, other]
-
Title: Nearest Neighbor-Based Contrastive Learning for Hyperspectral and LiDAR Data ClassificationComments: IEEE TGRS 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1046] arXiv:2301.03362 (cross-list from eess.IV) [pdf, other]
-
Title: Image Denoising: The Deep Learning Revolution and Beyond -- A Survey Paper --Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1047] arXiv:2301.03367 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Leukemia detection based on microscopic blood smear images using deep learningSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1048] arXiv:2301.03418 (cross-list from eess.IV) [pdf, other]
-
Title: Nuclear Segmentation and Classification: On Color & Compression GeneralizationAuthors: Quoc Dang Vu, Robert Jewsbury, Simon Graham, Mostafa Jahanifar, Shan E Ahmed Raza, Fayyaz Minhas, Abhir Bhalerao, Nasir RajpootComments: Oral presentation at MICCAI MLMI 2022, 7 pages, 6 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1049] arXiv:2301.03589 (cross-list from eess.IV) [pdf, other]
-
Title: Explainable, Physics Aware, Trustworthy AI Paradigm Shift for Synthetic Aperture RadarSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1050] arXiv:2301.03701 (cross-list from eess.IV) [pdf, other]
-
Title: Artificial Intelligence Model for Tumoral Clinical Decision Support SystemsAuthors: Guillermo Iglesias, Edgar Talavera, Jesús Troya Garcìa, Alberto Díaz-Álvarez, Miguel Gracía-RemesalComments: 16 pages, 7 figures, 3 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
- [1051] arXiv:2301.03711 (cross-list from q-bio.NC) [pdf, other]
-
Title: 3D Shape Perception Integrates Intuitive Physics and Analysis-by-SynthesisSubjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1052] arXiv:2301.03914 (cross-list from eess.IV) [pdf, other]
-
Title: Learning with minimal effort: leveraging in silico labeling for cell and nucleus segmentationAuthors: Thomas Bonte, Maxence Philbert, Emeline Coleno, Edouard Bertrand, Arthur Imbert, Thomas WalterSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1053] arXiv:2301.04032 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Does image resolution impact chest X-ray based fine-grained Tuberculosis-consistent lesion segmentation?Comments: 17 pages, 7 figures, 5 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1054] arXiv:2301.04168 (cross-list from astro-ph.IM) [pdf, other]
-
Title: Pixelated Reconstruction of Foreground Density and Background Surface Brightness in Gravitational Lensing Systems using Recurrent Inference MachinesComments: 13+7 pages, 13 figures; Accepted by The Astrophysical Journal. arXiv admin note: text overlap with arXiv:2207.01073Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV)
- [1055] arXiv:2301.04401 (cross-list from eess.IV) [pdf, other]
-
Title: An atrium segmentation network with location guidance and siamese adjustmentComments: 17 pages, 9 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1056] arXiv:2301.04416 (cross-list from q-bio.QM) [pdf, other]
-
Title: pyssam -- a Python library for statistical modelling of biomedical shape and appearanceComments: 5 pages, 3 figures, Journal of Open Source Software submissionSubjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1057] arXiv:2301.04423 (cross-list from eess.IV) [pdf, other]
-
Title: Multi-Scanner Canine Cutaneous Squamous Cell Carcinoma Histopathology DatasetAuthors: Frauke Wilm, Marco Fragoso, Christof A. Bertram, Nikolas Stathonikos, Mathias Öttl, Jingna Qiu, Robert Klopfleisch, Andreas Maier, Katharina Breininger, Marc AubrevilleComments: 6 pages, 3 figures, 1 table, accepted at BVM workshop 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1058] arXiv:2301.04525 (cross-list from eess.IV) [pdf, other]
-
Title: Clustering disease trajectories in contrastive feature space for biomarker discovery in age-related macular degenerationAuthors: Robbie Holland, Oliver Leingang, Christopher Holmes, Philipp Anders, Rebecca Kaye, Sophie Riedl, Johannes C. Paetzold, Ivan Ezhov, Hrvoje Bogunović, Ursula Schmidt-Erfurth, Lars Fritsche, Hendrik P. N. Scholl, Sobha Sivaprasad, Andrew J. Lotery, Daniel Rueckert, Martin J. MentenComments: Submitted to MICCAI2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1059] arXiv:2301.04791 (cross-list from stat.ML) [pdf, other]
-
Title: Self-Attention Amortized Distributional Projection Optimization for Sliced Wasserstein Point-Cloud ReconstructionComments: Accepted to ICML 2023, 23 pages, 6 figures, 9 tablesSubjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [1060] arXiv:2301.04904 (cross-list from eess.IV) [pdf, other]
-
Title: Lesion-aware Dynamic Kernel for Polyp SegmentationComments: Accepted by MICCAI2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1061] arXiv:2301.06043 (cross-list from eess.IV) [pdf, other]
-
Title: Unsupervised Cardiac Segmentation Utilizing Synthesized Images from Anatomical LabelsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1062] arXiv:2301.06081 (cross-list from eess.IV) [pdf, other]
-
Title: A Hyper-weight Network for Hyperspectral Image DenoisingComments: 16 pagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1063] arXiv:2301.06226 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Learning based Novel Cascaded Approach for Skin Lesion AnalysisComments: Accepted to be published in 7th International Conference, CVIP 2022, Nagpur, India November 04-06, 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1064] arXiv:2301.06304 (cross-list from eess.IV) [pdf, ps, other]
-
Title: LYSTO: The Lymphocyte Assessment Hackathon and Benchmark DatasetAuthors: Yiping Jiao, Jeroen van der Laak, Shadi Albarqouni, Zhang Li, Tao Tan, Abhir Bhalerao, Jiabo Ma, Jiamei Sun, Johnathan Pocock, Josien P.W. Pluim, Navid Alemi Koohbanani, Raja Muhammad Saad Bashir, Shan E Ahmed Raza, Sibo Liu, Simon Graham, Suzanne Wetstein, Syed Ali Khurram, Thomas Watson, Nasir Rajpoot, Mitko Veta, Francesco CiompiComments: will be submitted to IEEE-JBHISubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1065] arXiv:2301.06366 (cross-list from eess.IV) [pdf, other]
-
Title: Evaluating clinical diversity and plausibility of synthetic capsule endoscopic imagesComments: 13 pages, 10 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1066] arXiv:2301.06496 (cross-list from physics.optics) [pdf, ps, other]
-
Title: Efficient data transport over multimode light-pipes with Megapixel images using differentiable ray tracing and Machine-learningAuthors: Joowon Lim, Jannes Gladrow, Douglas Kelly, Greg O'Shea, Govert Verkes, Ioan Stefanovici, Sebastian Nowozin, Benn ThomsenComments: 21 pages, 5 figuresSubjects: Optics (physics.optics); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1067] arXiv:2301.06673 (cross-list from eess.IV) [pdf, other]
-
Title: Multi Kernel Positional Embedding ConvNeXt for Polyp SegmentationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1068] arXiv:2301.06681 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Cross-domain Self-supervised Framework for Photoacoustic Computed Tomography Image ReconstructionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1069] arXiv:2301.06793 (cross-list from eess.IV) [pdf, other]
-
Title: Acute ischemic stroke lesion segmentation in non-contrast CT images using 3D convolutional neural networksAuthors: A.V. Dobshik, S.K. Verbitskiy, I.A. Pestunov, K.M. Sherman, Yu.N. Sinyavskiy, A.A. Tulupov, V.B. BerikovComments: 18 pages, 4 figures, 2 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1070] arXiv:2301.06943 (cross-list from eess.IV) [pdf, other]
-
Title: Self-supervised Domain Adaptation for Breaking the Limits of Low-quality Fundus Image Quality EnhancementSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1071] arXiv:2301.06961 (cross-list from eess.IV) [pdf, other]
-
Title: Composite Deep Network with Feature Weighting for Improved Delineation of COVID Infection in Lung CTSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1072] arXiv:2301.07030 (cross-list from eess.IV) [pdf, other]
-
Title: Computational Pathology for Brain DisordersComments: Machine Learning for Brain Disorders, 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1073] arXiv:2301.07234 (cross-list from eess.IV) [pdf, other]
-
Title: DRIMET: Deep Registration for 3D Incompressible Motion Estimation in Tagged-MRI with Application to the TongueAuthors: Zhangxing Bian, Fangxu Xing, Jinglun Yu, Muhan Shao, Yihao Liu, Aaron Carass, Jiachen Zhuo, Jonghye Woo, Jerry L. PrinceComments: Accepted to MIDL 2023 (oral)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1074] arXiv:2301.07286 (cross-list from eess.IV) [pdf, other]
-
Title: Reslicing Ultrasound Images for Data Augmentation and Vessel ReconstructionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [1075] arXiv:2301.07475 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Curvilinear object segmentation in medical images based on ODoS filter and deep learning networkComments: 20 pages, 8 figures. Applied Intelligence, 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1076] arXiv:2301.07541 (cross-list from physics.flu-dyn) [pdf, other]
-
Title: Generative Adversarial Networks to infer velocity components in rotating turbulent flowsJournal-ref: Eur. Phys. J. E 46, 31 (2023)Subjects: Fluid Dynamics (physics.flu-dyn); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Chaotic Dynamics (nlin.CD); Data Analysis, Statistics and Probability (physics.data-an)
- [1077] arXiv:2301.07895 (cross-list from eess.IV) [pdf, other]
-
Title: Spatially Covariant Lesion SegmentationComments: 9 pages, 7 figures, and 2 tablesJournal-ref: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI 2023), pp. 1713-1721Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [1078] arXiv:2301.08157 (cross-list from eess.IV) [pdf, other]
-
Title: SoftEnNet: Symbiotic Monocular Depth Estimation and Lumen Segmentation for Colonoscopy EndorobotsSubjects: Image and Video Processing (eess.IV); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
- [1079] arXiv:2301.08187 (cross-list from stat.ML) [pdf, other]
-
Title: A Multi-Resolution Framework for U-Nets with Applications to Hierarchical VAEsAuthors: Fabian Falck, Christopher Williams, Dominic Danks, George Deligiannidis, Christopher Yau, Chris Holmes, Arnaud Doucet, Matthew WillettsComments: NeurIPS 2022 (selected as oral)Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
- [1080] arXiv:2301.08252 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Evaluation of the potential of Near Infrared Hyperspectral Imaging for monitoring the invasive brown marmorated stink bugAuthors: Veronica Ferrari, Rosalba Calvini, Bas Boom, Camilla Menozzi, Aravind Krishnaswamy Rangarajan, Lara Maistrello, Peter Offermans, Alessandro UlriciComments: Accepted manuscriptJournal-ref: Chemometrics and Intelligent Laboratory Systems, 2023, 234, 104751Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1081] arXiv:2301.08330 (cross-list from eess.IV) [pdf, other]
-
Title: The role of noise in denoising models for anomaly detection in medical imagesAuthors: Antanas Kascenas, Pedro Sanchez, Patrick Schrempf, Chaoyang Wang, William Clackett, Shadia S. Mikhael, Jeremy P. Voisey, Keith Goatman, Alexander Weir, Nicolas Pugeault, Sotirios A. Tsaftaris, Alison Q. O'NeilComments: Submitted to Medical Image Analysis special issue for MIDL 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1082] arXiv:2301.08365 (cross-list from eess.IV) [pdf, other]
-
Title: On Retrospective k-space Subsampling schemes For Deep MRI ReconstructionComments: 22 pages, 12 figures, 5 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1083] arXiv:2301.08448 (cross-list from eess.SP) [pdf, other]
-
Title: Source-free Subject Adaptation for EEG-based Visual RecognitionComments: Accepted by the 11th IEEE International Winter Conference on Brain-Computer Interface (BCI 2023). Code is available at this https URLSubjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1084] arXiv:2301.08479 (cross-list from eess.IV) [pdf, other]
-
Title: Pneumonia Detection in Chest X-Ray Images : Handling Class ImbalanceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1085] arXiv:2301.08534 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Prodromal Diagnosis of Lewy Body Diseases Based on the Assessment of Graphomotor and Handwriting DifficultiesAuthors: Zoltan Galaz, Jiri Mekyska, Jan Mucha, Vojtech Zvoncak, Zdenek Smekal, Marcos Faundez-Zanuy, Lubos Brabenec, Ivona Moravkova, Irena RektorovaComments: Print ISBN 978-3-031-19744-4Journal-ref: In: Carmona-Duarte, C., Diaz, M., Ferrer, M.A., Morales, A. (eds) Intertwining Graphonomics with Human Movements. IGS 2022. Lecture Notes in Computer Science, vol 13424. Springer, ChamSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1086] arXiv:2301.08605 (cross-list from eess.IV) [pdf, other]
-
Title: A Deep Learning Approach for SAR Tomographic Imaging of Forested AreasComments: Submitted to IEEE Geoscience and Remote Sensing Letters, January 2023Journal-ref: IEEE Geoscience and Remote Sensing Letters, vol. 20, pp. 1-5, 2023, Art no. 4007405Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1087] arXiv:2301.08654 (cross-list from cond-mat.mes-hall) [pdf, other]
-
Title: Automated extraction of capacitive coupling for quantum dot systemsComments: 9 pages, 5 figuresJournal-ref: Phys. Rev. Applied 19, 054077 (2023)Subjects: Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantum Physics (quant-ph)
- [1088] arXiv:2301.08782 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Estimation of mitral valve hinge point coordinates -- deep neural net for echocardiogram segmentationComments: 8 pages, 11 figures. Presented at WSCG 2022Journal-ref: Computer Science Research Notes, CSRN 3201, 2022, ISSN 2464-4617Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1089] arXiv:2301.08798 (cross-list from eess.IV) [pdf, ps, other]
-
Title: DeepCOVID-Fuse: A Multi-modality Deep Learning Model Fusing Chest X-Radiographs and Clinical Variables to Predict COVID-19 Risk LevelsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1090] arXiv:2301.08815 (cross-list from eess.IV) [pdf, other]
-
Title: DiffusionCT: Latent Diffusion Model for CT Image StandardizationComments: 6 pages, 3 figures and 1 tableSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1091] arXiv:2301.08868 (cross-list from eess.IV) [pdf, other]
-
Title: Computationally Efficient 3D MRI Reconstruction with Adaptive MLPComments: MICCAI 2023 early acceptSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1092] arXiv:2301.08888 (cross-list from eess.IV) [pdf, other]
-
Title: Pre-text Representation Transfer for Deep Learning with Limited Imbalanced Data : Application to CT-based COVID-19 DetectionComments: Best paper at IVCNZSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1093] arXiv:2301.08959 (cross-list from eess.IV) [pdf, other]
-
Title: Successive Subspace Learning for Cardiac Disease Classification with Two-phase Deformation Fields from Cine MRIComments: ISBI 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1094] arXiv:2301.09282 (cross-list from eess.IV) [pdf, other]
-
Title: Classification of Luminal Subtypes in Full Mammogram Images Using Transfer LearningComments: Submitted to IEEE ISBI 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1095] arXiv:2301.09322 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Learning-Based Assessment of Cerebral Microbleeds in COVID-19Authors: Neus Rodeja Ferrer, Malini Vendela Sagar, Kiril Vadimovic Klein, Christina Kruuse, Mads Nielsen, Mostafa Mehdipour GhaziComments: International Symposium on Biomedical Imaging (ISBI) 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1096] arXiv:2301.09431 (cross-list from eess.IV) [pdf, other]
-
Title: Multi-domain stain normalization for digital pathology: A cycle-consistent adversarial network for whole slide imagesComments: 19 pages, 11 figures, 3 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1097] arXiv:2301.09452 (cross-list from eess.IV) [pdf, other]
-
Title: Fast and robust single particle reconstruction in 3D fluorescence microscopyAuthors: Thibaut Eloy, Etienne Baudrier, Marine Laporte, Virginie Hamel, Paul Guichard, Denis FortunSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1098] arXiv:2301.09525 (cross-list from eess.IV) [pdf, other]
-
Title: DeepFEL: Deep Fastfood Ensemble Learning for Histopathology Image AnalysisAuthors: Nima HatamiComments: arXiv admin note: substantial text overlap with arXiv:2104.00669Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1099] arXiv:2301.09624 (cross-list from eess.IV) [pdf, other]
-
Title: Maximum Mean Discrepancy Kernels for Predictive and Prognostic Modeling of Whole Slide ImagesComments: * Joint first authorship Accepted: IEEE - ISBI 2023 International Symposium on Biomedical ImagingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1100] arXiv:2301.09673 (cross-list from physics.med-ph) [pdf, ps, other]
-
Title: Prostate Lesion Estimation using Prostate Masks from Biparametric MRIAuthors: Ahmet Karagoz, Mustafa Ege Seker, Mert Yergin, Tarkan Atak Kan, Mustafa Said Kartal, Ercan Karaarslan, Deniz Alis, Ilkay OksuzSubjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1101] arXiv:2301.09702 (cross-list from eess.IV) [pdf, other]
-
Title: Illumination Variation Correction Using Image Synthesis For Unsupervised Domain Adaptive Person Re-IdentificationComments: 10 pages, 5 figures, 5 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1102] arXiv:2301.09733 (cross-list from physics.med-ph) [pdf, other]
-
Title: Minimally Invasive Live Tissue High-fidelity Thermophysical Modeling using Real-time ThermographyAuthors: Hamza El-Kebir, Junren Ran, Yongseok Lee, Leonardo P. Chamorro, Martin Ostoja-Starzewski, Richard Berlin, Gabriela M. Aguiluz Cornejo, Enrico Benedetti, Pier C. Giulianotti, Joseph BentsmanComments: Accepted for publication in the IEEE Transactions on Biomedical Engineering. Research reported in this publication was supported by the National Institute of Biomedical Imaging and Bioengineering of the National Institutes of Health under award number R01EB029766Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1103] arXiv:2301.09799 (cross-list from eess.IV) [pdf, other]
-
Title: LDMIC: Learning-based Distributed Multi-view Image CodingComments: Accepted by ICLR 2023Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
- [1104] arXiv:2301.09887 (cross-list from eess.IV) [pdf, other]
-
Title: Deep learning-based method for segmenting epithelial layer of tubules in histopathological images of testicular tissueAuthors: Azadeh Fakhrzadeh, Pouya Karimian, Mahsa Meyari, Cris L. Luengo Hendriks, Lena Holm, Christian Sonne, Rune Dietz, Ellinor Spörndly-NeesComments: submitted to Journal of Medical Imaging, 16 pages, 5 figuresJournal-ref: J. Med. Imag. 10(3) 037501 (3 May 2023)Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1105] arXiv:2301.10187 (cross-list from eess.IV) [pdf, other]
-
Title: Enhanced Sharp-GAN For Histopathology Image SynthesisSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1106] arXiv:2301.10218 (cross-list from eess.IV) [pdf, other]
-
Title: Detecting and measuring human gastric peristalsis using magnetically controlled capsule endoscopeComments: 5 pages, 5 figures, accepted by IEEE ISBI 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1107] arXiv:2301.10227 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Denoising Diffusion Probabilistic Models for Generation of Realistic Fully-Annotated Microscopy Image Data SetsAuthors: Dennis Eschweiler, Rüveyda Yilmaz, Matisse Baumann, Ina Laube, Rijo Roy, Abin Jose, Daniel Brückner, Johannes StegmaierComments: 9 pages, 2 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1108] arXiv:2301.10365 (cross-list from eess.IV) [pdf, other]
-
Title: Data Consistent Deep Rigid MRI Motion CorrectionAuthors: Nalini M. Singh, Neel Dey, Malte Hoffmann, Bruce Fischl, Elfar Adalsteinsson, Robert Frost, Adrian V. Dalca, Polina GollandComments: Presented at MIDL 2023. 14 pages, 6 figures. Keywords: motion correction, magnetic resonance imaging, deep learningSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1109] arXiv:2301.10455 (cross-list from eess.IV) [pdf, other]
-
Title: Rate-Perception Optimized Preprocessing for Video CodingAuthors: Chengqian Ma, Zhiqiang Wu, Chunlei Cai, Pengwei Zhang, Yi Wang, Long Zheng, Chao Chen, Quan ZhouSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [1110] arXiv:2301.10520 (cross-list from eess.IV) [pdf, other]
-
Title: Ultra-NeRF: Neural Radiance Fields for Ultrasound ImagingAuthors: Magdalena Wysocki, Mohammad Farid Azampour, Christine Eilers, Benjamin Busam, Mehrdad Salehi, Nassir NavabComments: accepted for oral presentation at MIDL 2023 (this https URL)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1111] arXiv:2301.10687 (cross-list from eess.IV) [pdf, other]
-
Title: Self-Supervised Curricular Deep Learning for Chest X-Ray Image ClassificationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1112] arXiv:2301.10829 (cross-list from eess.IV) [pdf, other]
-
Title: TranSOP: Transformer-based Multimodal Classification for Stroke Treatment Outcome PredictionComments: Accepted at IEEE ISBI 2023, 5 pagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1113] arXiv:2301.10958 (cross-list from stat.ML) [pdf, ps, other]
-
Title: Learning Large Scale Sparse ModelsSubjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1114] arXiv:2301.11189 (cross-list from eess.IV) [pdf, other]
-
Title: Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood ModelsComments: Upload camera-ready to arXiv. Official version available at this https URLJournal-ref: Proceedings of the 40th International Conference on Machine Learning (2023) 25426-25443Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
- [1115] arXiv:2301.11198 (cross-list from eess.IV) [pdf, other]
-
Title: I-24 MOTION: An instrument for freeway traffic scienceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1116] arXiv:2301.11201 (cross-list from math.OC) [pdf, ps, other]
-
Title: Relative-Interior Solution for (Incomplete) Linear Assignment Problem with Applications to Quadratic Assignment ProblemSubjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV)
- [1117] arXiv:2301.11329 (cross-list from eess.IV) [pdf, other]
-
Title: Anatomy-aware and acquisition-agnostic joint registration with SynthMorphComments: 33 pages, 22 figures, 4 tables, affine registration, deformable registration, deep learning, hypernetwork, domain shift, neuroimagingSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1118] arXiv:2301.11468 (cross-list from eess.IV) [pdf, other]
-
Title: Multi-limb Split Learning for Tumor Classification on Vertically Distributed DataJournal-ref: 2021 Tenth International Conference on Intelligent Computing and Information Systems (ICICIS) (pp. 88-92). IEEESubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1119] arXiv:2301.11482 (cross-list from eess.IV) [src]
-
Title: Diffusion Denoising for Low-Dose-CT ModelAuthors: Runyi LiComments: The method and experiments of this paper contain some errors, and we need to revise themSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1120] arXiv:2301.11798 (cross-list from eess.IV) [pdf, other]
-
Title: MedSegDiff-V2: Diffusion based Medical Image Segmentation with TransformerComments: Code will be released at this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1121] arXiv:2301.11813 (cross-list from eess.IV) [pdf, other]
-
Title: Biomedical Image Reconstruction: A SurveyAuthors: Samuel CahyawijayaSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1122] arXiv:2301.11871 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Exploiting the Generative Adversarial Network Approach to Create a Synthetic Topography Corneal ImageAuthors: Samer Kais Jameel, Sezgin Aydin, Nebras H. Ghaeb, Jafar Majidpour, Tarik A. Rashid, Sinan Q. Salih, P. S. JosephNgComments: 13 pagesJournal-ref: Biomolecules, 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1123] arXiv:2301.12176 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Neural Gas Network Image Features and Segmentation for Brain Tumor Detection Using Magnetic Resonance Imaging DataAuthors: S. Muhammad Hossein MousaviComments: 7 pagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1124] arXiv:2301.12291 (cross-list from eess.IV) [pdf, other]
-
Title: CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT ScansAuthors: Jieneng Chen, Yingda Xia, Jiawen Yao, Ke Yan, Jianpeng Zhang, Le Lu, Fakai Wang, Bo Zhou, Mingyan Qiu, Qihang Yu, Mingze Yuan, Wei Fang, Yuxing Tang, Minfeng Xu, Jian Zhou, Yuqian Zhao, Qifeng Wang, Xianghua Ye, Xiaoli Yin, Yu Shi, Xin Chen, Jingren Zhou, Alan Yuille, Zaiyi Liu, Ling ZhangComments: ICCV 2023 Camera Ready VersionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1125] arXiv:2301.12340 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Incremental Value and Interpretability of Radiomics Features of Both Lung and Epicardial Adipose Tissue for Detecting the Severity of COVID-19 InfectionAuthors: Ni Yao, Yanhui Tian, Daniel Gama das Neves, Chen Zhao, Claudio Tinoco Mesquita, Wolney de Andrade Martins, Alair Augusto Sarmet Moreira Damas dos Santos, Yanting Li, Chuang Han, Fubao Zhu, Neng Dai, Weihua ZhouComments: 20 pages, 7 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1126] arXiv:2301.12531 (cross-list from eess.IV) [pdf, ps, other]
-
Title: PhyCV: The First Physics-inspired Computer Vision LibrarySubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1127] arXiv:2301.12588 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Development of Machine learning algorithms to identify the Cobb angle in adolescents with idiopathic scoliosis based on lumbosacral joint efforts during gait (Case study)
Comments: 30 pages, 2 Figures, 4 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1128] arXiv:2301.12636 (cross-list from eess.IV) [pdf, other]
-
Title: Exploring Image Augmentations for Siamese Representation Learning with Chest X-Rays
Comments: Equal contributions. Oral paper at MIDL 2023. Additional experiments in appendix in V2. Keywords: Data Augmentations, Self-Supervised Learning, Medical Imaging, Chest X-rays, Siamese Representation Learning
Journal-ref: Proceedings of Machine Learning Research, MIDL 2023
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1129] arXiv:2301.12939 (cross-list from eess.SP) [pdf, other]
-
Title: Data-driven soiling detection in PV modules
Authors: Alexandros Kalimeris, Ioannis Psarros, Giorgos Giannopoulos, Manolis Terrovitis, George Papastefanatos, Gregory Kotsis
Comments: 12 pages, 4 figures
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1130] arXiv:2301.13098 (cross-list from eess.IV) [pdf, other]
-
Title: CHeart: A Conditional Spatio-Temporal Generative Model for Cardiac Anatomy
Authors: Mengyun Qiao, Shuo Wang, Huaqi Qiu, Antonio de Marvao, Declan P. O'Regan, Daniel Rueckert, Wenjia Bai
Comments: Accepted by IEEE Transactions on Medical Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1131] arXiv:2301.13128 (cross-list from eess.IV) [pdf, other]
-
Title: Standardized CycleGAN training for unsupervised stain adaptation in invasive carcinoma classification for breast histopathology
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1132] arXiv:2301.13151 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Convolutional Neural Network-Based Automatic Classification of Colorectal and Prostate Tumor Biopsies Using Multispectral Imagery: System Development Study
Authors: Remy Peyret, Duaa alSaeed, Fouad Khelifi, Nadia Al-Ghreimil, Heyam Al-Baity, Ahmed Bouridane
Journal-ref: JMIR Bioinform Biotech 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1133] arXiv:2301.13366 (cross-list from eess.IV) [pdf, ps, other]
-
Title: CaraNet: Context Axial Reverse Attention Network for Segmentation of Small Medical Objects
Comments: arXiv admin note: text overlap with arXiv:2108.07368
Journal-ref: Journal of Medical Imaging 10(1), 014005 (18 February 2023)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1134] arXiv:2301.13371 (cross-list from stat.ML) [pdf, other]
-
Title: Demystifying Disagreement-on-the-Line in High Dimensions
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1135] arXiv:2301.13648 (cross-list from eess.IV) [pdf, other]
-
Title: CSDN: Combing Shallow and Deep Networks for Accurate Real-time Segmentation of High-definition Intravascular Ultrasound Images
Comments: 5 pages, 2 figures, 1 table, submitted to the 20th IEEE International Symposium on Biomedical Imaging (IEEE ISBI 2023)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1136] arXiv:2301.13674 (cross-list from eess.IV) [pdf, other]
-
Title: Improved distinct bone segmentation in upper-body CT through multi-resolution networks
Authors: Eva Schnider, Julia Wolleb, Antal Huck, Mireille Toranelli, Georg Rauter, Magdalena Müller-Gerbl, Philippe C. Cattin
Comments: Under submission
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1137] arXiv:2301.13731 (cross-list from stat.ML) [pdf, other]
-
Title: A relaxed proximal gradient descent algorithm for convergent plug-and-play with proximal denoiser
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Optimization and Control (math.OC)
- [1138] arXiv:2301.13786 (cross-list from cs.CV) [pdf, other]
-
Title: Deep learning-based lung segmentation and automatic regional template in chest X-ray images for pediatric tuberculosis
Authors: Daniel Capellán-Martín, Juan J. Gómez-Valverde, Ramon Sanchez-Jacob, David Bermejo-Peláez, Lara García-Delgado, Elisa López-Varela, Maria J. Ledesma-Carbayo
Comments: This work has been accepted at the SPIE Medical Imaging 2023, Image Processing conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)