Computer Vision and Pattern Recognition
Authors and titles for cs.CV in Jun 2022
[ total of 1594 entries: 1-1591 | 1592-1594 ][ showing 1591 entries per page: fewer | more | all ]
- [1] arXiv:2206.00048 [pdf, other]
-
Title: PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANsComments: Accepted at ICLR 2023. Code available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [2] arXiv:2206.00069 [pdf, other]
-
Title: Comparing feature fusion strategies for Deep Learning-based kidney stone identificationAuthors: Elias Villalvazo-Avila, Francisco Lopez-Tiro, Daniel Flores-Araiza, Gilberto Ochoa-Ruiz, Jonathan El-Beze, Jacques Hubert, Christian DaulComments: 4 pages, 3 figures, XXVIII\`eme Colloque Francophone de Traitement du Signal et des ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [3] arXiv:2206.00092 [pdf, other]
-
Title: FHIST: A Benchmark for Few-shot Classification of Histological ImagesAuthors: Fereshteh Shakeri, Malik Boudiaf, Sina Mohammadi, Ivaxi Sheth, Mohammad Havaei, Ismail Ben Ayed, Samira Ebrahimi KahouComments: Code available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [4] arXiv:2206.00100 [pdf, other]
-
Title: VALHALLA: Visual Hallucination for Machine TranslationComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [5] arXiv:2206.00123 [pdf, other]
-
Title: Glo-In-One: Holistic Glomerular Detection, Segmentation, and Lesion Characterization with Large-scale Web Image MiningAuthors: Tianyuan Yao, Yuzhe Lu, Jun Long, Aadarsh Jha, Zheyu Zhu, Zuhayr Asad, Haichun Yang, Agnes B. Fogo, Yuankai HuoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [6] arXiv:2206.00148 [pdf, other]
-
Title: Hands-Up: Leveraging Synthetic Data for Hands-On-Wheel DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [7] arXiv:2206.00162 [pdf, other]
-
Title: PAGER: Progressive Attribute-Guided Extendable Robust Image GenerationComments: 19 pages, 12 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [8] arXiv:2206.00171 [pdf, other]
-
Title: Learning Sequential Contexts using Transformer for 3D Hand Pose EstimationComments: Accepted to ICPR'22Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [9] arXiv:2206.00181 [pdf, other]
-
Title: Labeling Where Adapting Fails: Cross-Domain Semantic Segmentation with Point Supervision via Active SelectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [10] arXiv:2206.00182 [pdf, other]
-
Title: Differentiable Soft-Masked AttentionComments: arXiv admin note: text overlap with arXiv:2112.09131Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [11] arXiv:2206.00205 [pdf, other]
-
Title: CAFA: Class-Aware Feature Alignment for Test-Time AdaptationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [12] arXiv:2206.00214 [pdf, other]
-
Title: LiDAR-MIMO: Efficient Uncertainty Estimation for LiDAR-based 3D Object DetectionComments: 8 pages, 4 figures and 5 tables. Accepted in IEEE IV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [13] arXiv:2206.00222 [pdf, other]
-
Title: Cross-domain Detection Transformer based on Spatial-aware and Semantic-aware Token AlignmentComments: Technical reportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [14] arXiv:2206.00227 [pdf, other]
-
Title: Rethinking the Augmentation Module in Contrastive Learning: Learning Hierarchical Augmentation Invariance with Expanded ViewsComments: Accepted to CVPR 2022Journal-ref: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [15] arXiv:2206.00244 [pdf, other]
-
Title: Fair Comparison between Efficient AttentionsComments: 4 pages abstractSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [16] arXiv:2206.00252 [pdf, other]
-
Title: Interpretable Deep Learning Classifier by Detection of Prototypical Parts on Kidney Stones ImagesAuthors: Daniel Flores-Araiza, Francisco Lopez-Tiro, Elias Villalvazo-Avila, Jonathan El-Beze, Jacques Hubert, Gilberto Ochoa-Ruiz, Christian DaulComments: Extended abstract accepted at LatinX in Computer Vision Research Workshop, at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [17] arXiv:2206.00272 [pdf, other]
-
Title: Vision GNN: An Image is Worth Graph of NodesComments: NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [18] arXiv:2206.00274 [pdf, other]
-
Title: Point-Teaching: Weakly Semi-Supervised Object Detection with Point AnnotationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [19] arXiv:2206.00280 [pdf, other]
-
Title: Automatic Bounding Box Annotation with Small Training Data Sets for Industrial ManufacturingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [20] arXiv:2206.00282 [pdf, other]
-
Title: Needle In A Haystack, Fast: Benchmarking Image Perceptual Similarity Metrics At ScaleComments: 26 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
- [21] arXiv:2206.00291 [pdf, other]
-
Title: Efficient Multi-Purpose Cross-Attention Based Image Alignment Block for Edge DevicesComments: Accepted into Embedded Vision Workshop 2022 of CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [22] arXiv:2206.00309 [pdf, other]
-
Title: Label-Efficient Online Continual Object Detection in Streaming VideoComments: ICCV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [23] arXiv:2206.00311 [pdf, other]
-
Title: MaskOCR: Text Recognition with Masked Encoder-Decoder PretrainingAuthors: Pengyuan Lyu, Chengquan Zhang, Shanshan Liu, Meina Qiao, Yangliu Xu, Liang Wu, Kun Yao, Junyu Han, Errui Ding, Jingdong WangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [24] arXiv:2206.00343 [pdf, other]
-
Title: Towards view-invariant vehicle speed detection from driving simulator imagesComments: 14th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [25] arXiv:2206.00344 [pdf, other]
-
Title: Self-Supervised Learning as a Means To Reduce the Need for Labeled Data in Medical Image AnalysisComments: Accepted by 30th European Signal Processing Conference, EUSIPCO 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [26] arXiv:2206.00359 [pdf, other]
-
Title: DeepCluE: Enhanced Image Clustering via Multi-layer Ensembles in Deep Neural NetworksComments: To appear in IEEE Transactions on Emerging Topics in Computational IntelligenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [27] arXiv:2206.00364 [pdf, other]
-
Title: Elucidating the Design Space of Diffusion-Based Generative ModelsComments: NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
- [28] arXiv:2206.00384 [pdf, other]
-
Title: Generalized Supervised Contrastive LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [29] arXiv:2206.00386 [pdf, other]
-
Title: DiVAE: Photorealistic Images Synthesis with Denoising Diffusion DecoderSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [30] arXiv:2206.00415 [pdf, other]
-
Title: Learning Invariant Visual Representations for Compositional Zero-Shot LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [31] arXiv:2206.00447 [pdf, other]
-
Title: CD$^2$: Fine-grained 3D Mesh Reconstruction With Twice Chamfer DistanceComments: Just accepted by TOMMSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [32] arXiv:2206.00468 [pdf, other]
-
Title: PanopticDepth: A Unified Framework for Depth-aware Panoptic SegmentationComments: CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [33] arXiv:2206.00481 [pdf, other]
-
Title: Where are my Neighbors? Exploiting Patches Relations in Self-Supervised Vision TransformerComments: Accepted to BMVC 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [34] arXiv:2206.00489 [pdf, other]
-
Title: Attack-Agnostic Adversarial DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [35] arXiv:2206.00491 [pdf, other]
-
Title: Semantic Room Wireframe Detection from a Single ViewComments: Accepted for ICPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [36] arXiv:2206.00506 [pdf, other]
-
Title: Proximally Sensitive Error for Anomaly Detection and Feature LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [37] arXiv:2206.00515 [pdf, other]
-
Title: Landslide4Sense: Reference Benchmark Data and Deep Learning Models for Landslide DetectionJournal-ref: IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-17, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [38] arXiv:2206.00527 [pdf, other]
-
Title: Amodal Cityscapes: A New Dataset, its Generation, and an Amodal Semantic Segmentation Challenge BaselineComments: This paper is accepted at IEEE Intelligent Vehicles Symposium 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [39] arXiv:2206.00535 [pdf, other]
-
Title: Deepfake Caricatures: Amplifying attention to artifacts increases deepfake detection by humans and machinesComments: 9 pages, 5 figures, 4 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Social and Information Networks (cs.SI)
- [40] arXiv:2206.00580 [pdf, other]
-
Title: Dog nose print matching with dual global descriptor based on Contrastive LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [41] arXiv:2206.00608 [pdf, other]
-
Title: On the Choice of Data for Efficient Training and Validation of End-to-End Driving ModelsAuthors: Marvin Klingner, Konstantin Müller, Mona Mirzaie, Jasmin Breitenstein, Jan-Aike Termöhlen, Tim FingscheidtComments: Accepted at CVPR VDU Workshop 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [42] arXiv:2206.00614 [pdf, other]
-
Title: Dual-stream spatiotemporal networks with feature sharing for monitoring animals in the home cageSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [43] arXiv:2206.00629 [pdf, other]
-
Title: CLIP4IDC: CLIP for Image Difference CaptioningComments: Accepted to AACL-IJCNLP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [44] arXiv:2206.00630 [pdf, other]
-
Title: Unifying Voxel-based Representation with Transformer for 3D Object DetectionComments: Accepted to NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [45] arXiv:2206.00645 [pdf, other]
-
Title: Floorplan Restoration by Structure Hallucinating Transformer CascadesComments: Published at BMVC 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [46] arXiv:2206.00665 [pdf, other]
-
Title: MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface ReconstructionComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [47] arXiv:2206.00718 [pdf, other]
-
Title: Context-Driven Detection of Invertebrate Species in Deep-Sea VideoJournal-ref: International Journal of Computer Vision 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [48] arXiv:2206.00735 [pdf, other]
-
Title: Cascaded Video Generation for Videos In-the-WildComments: Accepted to the 26th International Conference on Pattern Recognition (ICPR 2022). arXiv admin note: substantial text overlap with arXiv:2106.02719Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [49] arXiv:2206.00746 [pdf, other]
-
Title: Residual Multiplicative Filter Networks for Multiscale ReconstructionComments: NeurIPS 2022, Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [50] arXiv:2206.00771 [pdf, other]
-
Title: Dynamic Linear Transformer for 3D Biomedical Image SegmentationComments: 8 PagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [51] arXiv:2206.00790 [pdf, other]
-
Title: Efficient Self-supervised Vision Pretraining with Local Masked ReconstructionComments: Add codeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [52] arXiv:2206.00798 [pdf, other]
-
Title: Multi-scale frequency separation network for image deblurringComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [53] arXiv:2206.00800 [pdf, other]
-
Title: CcHarmony: Color-checker based Image Harmonization DatasetSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [54] arXiv:2206.00806 [pdf, other]
-
Title: XBound-Former: Toward Cross-scale Boundary Modeling in TransformersAuthors: Jiacheng Wang, Fei Chen, Yuxi Ma, Liansheng Wang, Zhaodong Fei, Jianwei Shuai, Xiangdong Tang, Qichao Zhou, Jing QinComments: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [55] arXiv:2206.00812 [pdf, other]
-
Title: Modeling sRGB Camera Noise with Normalizing FlowsComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [56] arXiv:2206.00859 [pdf, other]
-
Title: Disentangled Generation Network for Enlarged License Plate Recognition and A Unified DatasetComments: Submission to CVIUSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [57] arXiv:2206.00878 [pdf, other]
-
Title: EfficientNeRF: Efficient Neural Radiance FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [58] arXiv:2206.00893 [pdf, other]
-
Title: Leveraging Systematic Knowledge of 2D TransformationsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [59] arXiv:2206.00897 [pdf, other]
-
Title: xView3-SAR: Detecting Dark Fishing Activity Using Synthetic Aperture Radar ImageryAuthors: Fernando Paolo, Tsu-ting Tim Lin, Ritwik Gupta, Bryce Goodman, Nirav Patel, Daniel Kuster, David Kroodsma, Jared DunnmonComments: Accepted to NeurIPS 2022. 10 pages (25 with references and supplement)Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [60] arXiv:2206.00902 [pdf, other]
-
Title: MISSU: 3D Medical Image Segmentation via Self-distilling TransUNetSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [61] arXiv:2206.00923 [pdf, other]
-
Title: Modeling Image Composition for Complex Scene GenerationComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [62] arXiv:2206.00924 [pdf, other]
-
Title: FACM: Intermediate Layer Still Retain Effective Features against Adversarial ExamplesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [63] arXiv:2206.00930 [pdf, other]
-
Title: Predicting Physical Object Properties from VideoComments: accepted for International Joint Conference on Neural Networks (IJCNN) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [64] arXiv:2206.00947 [pdf, other]
-
Title: A Bhattacharyya Coefficient-Based Framework for Noise Model-Aware Random Walker Image SegmentationComments: Dominik Drees and Florian Eilers contributed equally to this workSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [65] arXiv:2206.00960 [pdf, other]
-
Title: SparseDet: Towards End-to-End 3D Object DetectionJournal-ref: Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, pp. 781- 792. Feb. 6-8, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [66] arXiv:2206.00971 [pdf, other]
-
Title: CVM-Cervix: A Hybrid Cervical Pap-Smear Image Classification Framework Using CNN, Visual Transformer and Multilayer PerceptronAuthors: Wanli Liu, Chen Li, Ning Xu, Tao Jiang, Md Mamunur Rahaman, Hongzan Sun, Xiangchen Wu, Weiming Hu, Haoyuan Chen, Changhao Sun, Yudong Yao, Marcin GrzegorzekSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [67] arXiv:2206.00997 [pdf, other]
-
Title: Is Mapping Necessary for Realistic PointGoal Navigation?Authors: Ruslan Partsey, Erik Wijmans, Naoki Yokoyama, Oles Dobosevych, Dhruv Batra, Oleksandr MaksymetsComments: Corrected typos in the AbstractSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [68] arXiv:2206.01009 [pdf, other]
-
Title: Unified Recurrence Modeling for Video Action AnticipationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [69] arXiv:2206.01010 [pdf, other]
-
Title: Long-tailed Recognition by Learning from Latent CategoriesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [70] arXiv:2206.01014 [pdf, other]
-
Title: Suggestive Annotation of Brain MR Images with Gradient-guided SamplingComments: Manuscript accepted by MedIASubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [71] arXiv:2206.01017 [pdf, other]
-
Title: Structured Two-stream Attention Network for Video Question AnsweringSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [72] arXiv:2206.01034 [pdf, other]
-
Title: Adversarial Laser Spot: Robust and Covert Physical-World Attack to DNNsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [73] arXiv:2206.01038 [pdf, other]
-
Title: A Survey on Video Action Recognition in Sports: Datasets, Methods and ApplicationsAuthors: Fei Wu, Qingzhong Wang, Jian Bian, Haoyi Xiong, Ning Ding, Feixiang Lu, Jun Cheng, Dejing DouComments: 26 pages. The toolbox is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [74] arXiv:2206.01061 [pdf, other]
-
Title: FV-UPatches: Enhancing Universality in Finger Vein RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [75] arXiv:2206.01062 [pdf, other]
-
Title: DocLayNet: A Large Human-Annotated Dataset for Document-Layout AnalysisComments: 9 pages, 6 figures, 5 tables. Accepted paper at SIGKDD 2022 conferenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [76] arXiv:2206.01102 [pdf, other]
-
Title: A temporal chrominance trigger for clean-label backdoor attack against anti-spoof rebroadcast detectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
- [77] arXiv:2206.01125 [pdf, other]
-
Title: Prefix Conditioning Unifies Language and Label SupervisionAuthors: Kuniaki Saito, Kihyuk Sohn, Xiang Zhang, Chun-Liang Li, Chen-Yu Lee, Kate Saenko, Tomas PfisterComments: CVPR2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [78] arXiv:2206.01127 [pdf, other]
-
Title: VL-BEiT: Generative Vision-Language PretrainingSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [79] arXiv:2206.01136 [pdf, other]
-
Title: Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectivesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [80] arXiv:2206.01153 [pdf, other]
-
Title: Multi-View Active Fine-Grained RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [81] arXiv:2206.01160 [pdf, other]
-
Title: DE-Net: Dynamic Text-guided Image Editing Adversarial NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [82] arXiv:2206.01161 [pdf, other]
-
Title: Optimizing Relevance Maps of Vision Transformers Improves RobustnessSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [83] arXiv:2206.01191 [pdf, other]
-
Title: EfficientFormer: Vision Transformers at MobileNet SpeedAuthors: Yanyu Li, Geng Yuan, Yang Wen, Ju Hu, Georgios Evangelidis, Sergey Tulyakov, Yanzhi Wang, Jian RenSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [84] arXiv:2206.01198 [pdf, other]
- [85] arXiv:2206.01201 [pdf, other]
-
Title: REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question AnsweringComments: Accepted by NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [86] arXiv:2206.01202 [pdf, other]
-
Title: Unveiling The Mask of Position-Information Pattern Through the Mist of Image FeaturesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [87] arXiv:2206.01203 [pdf, other]
-
Title: Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation Using Bounding BoxesComments: Project page: this https URLJournal-ref: European Conference on Computer Vision (ECCV), 2022, Oral PresentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [88] arXiv:2206.01204 [pdf, other]
-
Title: Siamese Image Modeling for Self-Supervised Vision Representation LearningAuthors: Chenxin Tao, Xizhou Zhu, Weijie Su, Gao Huang, Bin Li, Jie Zhou, Yu Qiao, Xiaogang Wang, Jifeng DaiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [89] arXiv:2206.01232 [pdf, other]
-
Title: What Are Expected Queries in End-to-End Object Detection?Comments: The source code is publicly available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [90] arXiv:2206.01244 [pdf, other]
-
Title: Real-Time Portrait Stylization on the EdgeSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [91] arXiv:2206.01256 [pdf, other]
-
Title: PETRv2: A Unified Framework for 3D Perception from Multi-Camera ImagesAuthors: Yingfei Liu, Junjie Yan, Fan Jia, Shuailin Li, Aqi Gao, Tiancai Wang, Xiangyu Zhang, Jian SunComments: Adding 3D lane detection results on OpenLane DatasetSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [92] arXiv:2206.01290 [pdf, other]
-
Title: Points2NeRF: Generating Neural Radiance Fields from 3D point cloudComments: arXiv admin note: text overlap with arXiv:2003.08934 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [93] arXiv:2206.01297 [pdf, other]
-
Title: Lossless Compression of Point Cloud Sequences Using Sequence Optimized CNN ModelsComments: 9 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [94] arXiv:2206.01309 [pdf, other]
-
Title: H-EMD: A Hierarchical Earth Mover's Distance Method for Instance SegmentationAuthors: Peixian Liang, Yizhe Zhang, Yifan Ding, Jianxu Chen, Chinedu S. Madukoma, Tim Weninger, Joshua D. Shrout, Danny Z. ChenComments: Accepted at IEEE Transactions On Medical Imaging (TMI)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [95] arXiv:2206.01319 [pdf, other]
-
Title: Learning Unbiased Transferability for Domain Adaptation by Uncertainty ModelingComments: This paper has been accepted by ECCV2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [96] arXiv:2206.01326 [pdf, other]
-
Title: Improving Fairness in Large-Scale Object Recognition by CrowdSourced Demographic InformationAuthors: Zu Kim, André Araujo, Bingyi Cao, Cam Askew, Jack Sim, Mike Green, N'Mah Fodiatu Yilla, Tobias WeyandSubjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
- [97] arXiv:2206.01327 [pdf, other]
-
Title: RELAY: Robotic EyeLink AnalYsis of the EyeLink 1000 using an Artificial EyeComments: 12 Pages, 17 Figures, 2 Tables. Git Repository: this https URL Appendix Repository: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [98] arXiv:2206.01334 [pdf, other]
-
Title: Long Scale Error Control in Low Light Image and Video Enhancement Using EquivarianceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [99] arXiv:2206.01365 [pdf, other]
-
Title: Adversarial Attacks on Human VisionComments: 21 pages, 8 figures, 1 tableJournal-ref: Extended version of IEEE MultiMedia, vol. 23, no. 1, pp. 82-91, Jan.-Mar. 2016Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [100] arXiv:2206.01369 [pdf, other]
-
Title: Incremental Learning Meets Transfer Learning: Application to Multi-site Prostate MRI SegmentationAuthors: Chenyu You, Jinlin Xiang, Kun Su, Xiaoran Zhang, Siyuan Dong, John Onofrey, Lawrence Staib, James S. DuncanSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [101] arXiv:2206.01370 [pdf, other]
-
Title: Towards Improving the Generation Quality of Autoregressive Slot VAEsComments: Published in Neural Computation. 38 pages, 18 figures. Code and videos available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [102] arXiv:2206.01381 [pdf, other]
-
Title: CF-YOLO: Cross Fusion YOLO for Object Detection in Adverse Weather with a High-quality Real Snow DatasetAuthors: Qiqi Ding, Peng Li, Xuefeng Yan, Ding Shi, Luming Liang, Weiming Wang, Haoran Xie, Jonathan Li, Mingqiang WeiComments: 10pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [103] arXiv:2206.01384 [pdf, ps, other]
-
Title: End-to-End 3D Hand Pose Estimation from Stereo CamerasSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [104] arXiv:2206.01408 [pdf, other]
-
Title: MetaLR: Meta-tuning of Learning Rates for Transfer Learning in Medical ImagingComments: MICCAI 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [105] arXiv:2206.01417 [pdf, other]
-
Title: Learning an Adaptation Function to Assess Image Visual SimilaritiesAuthors: Olivier Risser-Maroix (LIPADE), Amine Marzouki (LIPADE), Hala Djeghim (LIPADE), Camille Kurtz (LIPADE), Nicolas Lomenie (LIPADE)Journal-ref: ORASIS 2021, Centre National de la Recherche Scientifique [CNRS], Sep 2021, Saint Ferr{\'e}ol, FranceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [106] arXiv:2206.01429 [pdf, other]
-
Title: Learning rich optical embeddings for privacy-preserving lensless image classificationComments: 29 pages, 23 figures, under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [107] arXiv:2206.01441 [pdf, other]
-
Title: Exploring Transformers for Behavioural Biometrics: A Case Study in Gait RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [108] arXiv:2206.01466 [pdf, other]
-
Title: Recognition of Unseen Bird Species by Learning from Field GuidesComments: Accepted to WACV2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [109] arXiv:2206.01467 [pdf, other]
-
Title: The Importance of Image Interpretation: Patterns of Semantic Misclassification in Real-World Adversarial ImagesComments: International Conference on Multimedia Modeling (MMM) 2023. Resources are publicly available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [110] arXiv:2206.01473 [pdf, other]
-
Title: Distributional loss for convolutional neural network regression and application to GNSS multi-path estimationSubjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [111] arXiv:2206.01498 [pdf, ps, other]
-
Title: YOLOv5s-GTB: light-weighted and improved YOLOv5s for bridge crack detectionAuthors: Xiao RuiqiangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [112] arXiv:2206.01524 [pdf, other]
-
Title: Anomaly detection in surveillance videos using transformer based attention modelSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [113] arXiv:2206.01627 [pdf, other]
-
Title: Pruning for Feature-Preserving Circuits in CNNsComments: Under ReviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [114] arXiv:2206.01646 [pdf, other]
-
Title: Integrating Prior Knowledge in Contrastive Learning with KernelComments: ICML 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [115] arXiv:2206.01651 [pdf, other]
-
Title: D'ARTAGNAN: Counterfactual Video GenerationAuthors: Hadrien Reynaud, Athanasios Vlontzos, Mischa Dombrowski, Ciarán Lee, Arian Beqiri, Paul Leeson, Bernhard KainzComments: Accepted for MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [116] arXiv:2206.01653 [pdf, other]
-
Title: Metrics reloaded: Recommendations for image analysis validationAuthors: Lena Maier-Hein, Annika Reinke, Patrick Godau, Minu D. Tizabi, Florian Buettner, Evangelia Christodoulou, Ben Glocker, Fabian Isensee, Jens Kleesiek, Michal Kozubek, Mauricio Reyes, Michael A. Riegler, Manuel Wiesenfarth, A. Emre Kavur, Carole H. Sudre, Michael Baumgartner, Matthias Eisenmann, Doreen Heckmann-Nötzel, Tim Rädsch, Laura Acion, Michela Antonelli, Tal Arbel, Spyridon Bakas, Arriel Benis, Matthew Blaschko, M. Jorge Cardoso, Veronika Cheplygina, Beth A. Cimini, Gary S. Collins, Keyvan Farahani, Luciana Ferrer, Adrian Galdran, Bram van Ginneken, Robert Haase, Daniel A. Hashimoto, Michael M. Hoffman, Merel Huisman, Pierre Jannin, Charles E. Kahn, Dagmar Kainmueller, Bernhard Kainz, Alexandros Karargyris, Alan Karthikesalingam, Hannes Kenngott, Florian Kofler, Annette Kopp-Schneider, et al. (28 additional authors not shown)Comments: Shared first authors: Lena Maier-Hein, Annika Reinke. arXiv admin note: substantial text overlap with arXiv:2104.05642 Published in Nature MethodsJournal-ref: Nature methods, 1-18 (2024)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [117] arXiv:2206.01658 [pdf, ps, other]
-
Title: Identification via Retinal Vessels Combining LBP and HOGAuthors: Ali NooriSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [118] arXiv:2206.01661 [pdf, other]
-
Title: Style-Content Disentanglement in Language-Image Pretraining Representations for Zero-Shot Sketch-to-Image SynthesisAuthors: Jan ZuiderveldSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [119] arXiv:2206.01670 [pdf, other]
-
Title: Egocentric Video-Language PretrainingAuthors: Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rongcheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng ShouComments: Accepted by NeurIPS 2022. Double champions at Ego4D and EPIC-Kitchens, CVPR 2022 challenges. 23 pages, 13 figures, 12 tables. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [120] arXiv:2206.01705 [pdf, other]
-
Title: Gradient Obfuscation Checklist Test Gives a False Sense of SecuritySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [121] arXiv:2206.01714 [pdf, other]
-
Title: Compositional Visual Generation with Composable Diffusion ModelsComments: ECCV 2022. First three authors contributed equally. Project website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [122] arXiv:2206.01718 [pdf, other]
-
Title: A-OKVQA: A Benchmark for Visual Question Answering using World KnowledgeSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [123] arXiv:2206.01720 [pdf, other]
-
Title: Revisiting the "Video" in Video-Language UnderstandingAuthors: Shyamal Buch, Cristóbal Eyzaguirre, Adrien Gaidon, Jiajun Wu, Li Fei-Fei, Juan Carlos NieblesComments: CVPR 2022 (Oral)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [124] arXiv:2206.01724 [pdf, other]
-
Title: SNAKE: Shape-aware Neural 3D Keypoint FieldAuthors: Chengliang Zhong, Peixing You, Xiaoxue Chen, Hao Zhao, Fuchun Sun, Guyue Zhou, Xiaodong Mu, Chuang Gan, Wenbing HuangComments: Accepted by NeurIPS 2022. Codes are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [125] arXiv:2206.01733 [pdf, other]
-
Title: Adversarial RAW: Image-Scaling Attack Against Imaging PipelineSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [126] arXiv:2206.01734 [pdf, ps, other]
-
Title: Using UAS Imagery and Computer Vision to Support Site-Specific Weed Control in CornComments: 16 Figures, 3 Tables,. arXiv admin note: substantial text overlap with arXiv:2204.12417Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
- [127] arXiv:2206.01772 [pdf, other]
-
Title: Radar Guided Dynamic Visual Attention for Resource-Efficient RGB Object DetectionComments: Accepted in International Joint Conference on Neural Networks (IJCNN) 2022Journal-ref: 2022 International Joint Conference on Neural Networks (IJCNN)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [128] arXiv:2206.01777 [pdf, other]
-
Title: Real-Time Super-Resolution for Real-World Images on Mobile DevicesComments: arXiv admin note: text overlap with arXiv:2004.13674Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [129] arXiv:2206.01794 [pdf, other]
-
Title: Additive MIL: Intrinsically Interpretable Multiple Instance Learning for PathologyAuthors: Syed Ashar Javed, Dinkar Juyal, Harshith Padigela, Amaro Taylor-Weiner, Limin Yu, Aaditya PrakashSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [130] arXiv:2206.01813 [pdf, other]
-
Title: Learning sRGB-to-Raw-RGB De-rendering with Content-Aware MetadataComments: CVPR 2022 (GitHub: this https URL)Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [131] arXiv:2206.01821 [pdf, other]
-
Title: EAANet: Efficient Attention Augmented Convolutional NetworksComments: 8 pages, 4 figures. Not publishedSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [132] arXiv:2206.01831 [pdf, other]
-
Title: Spatial Feature Mapping for 6DoF Object Pose EstimationComments: Pattern RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [133] arXiv:2206.01841 [pdf, other]
-
Title: Coffee Roast IntelligenceComments: 6 pages, 13 figures, 3 tables, this work was presented at the CSC498 COMPUTER SCIENCE CAPSTONE PROJECT I and CSC499 COMPUTER SCIENCE CAPSTONE PROJECT II coursesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [134] arXiv:2206.01843 [pdf, other]
-
Title: Visual Clues: Bridging Vision and Language Foundations for Image Paragraph CaptioningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [135] arXiv:2206.01863 [pdf, other]
-
Title: Recursive Deformable Image Registration Network with Mutual AttentionAuthors: Jian-Qing Zheng, Ziyang Wang, Baoru Huang, Ngee Han Lim, Tonia Vincent, Bartlomiej W. PapiezComments: arXiv admin note: text overlap with arXiv:2203.04290Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [136] arXiv:2206.01867 [pdf, other]
-
Title: SPGNet: Spatial Projection Guided 3D Human Pose Estimation in Low Dimensional SpaceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [137] arXiv:2206.01881 [pdf, other]
-
Title: Face Recognition Accuracy Across Demographics: Shining a Light Into the ProblemSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [138] arXiv:2206.01884 [pdf, ps, other]
-
Title: A Superimposed Divide-and-Conquer Image Recognition Method for SEM Images of Nanoparticles on The Surface of Monocrystalline silicon with High Aggregation DegreeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [139] arXiv:2206.01908 [pdf, other]
-
Title: Video-based Human-Object Interaction Detection from Tubelet TokensSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [140] arXiv:2206.01910 [pdf, other]
-
Title: The Spike Gating Flow: A Hierarchical Structure Based Spiking Neural Network for Online Gesture RecognitionAuthors: Zihao Zhao, Yanhong Wang, Qiaosha Zou, Tie Xu, Fangbo Tao, Jiansong Zhang, Xiaoan Wang, C.-J. Richard Shi, Junwen Luo, Yuan XieSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [141] arXiv:2206.01916 [pdf, other]
-
Title: Nerfels: Renderable Neural Codes for Improved Camera Pose EstimationAuthors: Gil Avraham, Julian Straub, Tianwei Shen, Tsun-Yi Yang, Hugo Germain, Chris Sweeney, Vasileios Balntas, David Novotny, Daniel DeTone, Richard NewcombeComments: Published at CVPRW with supplementary materialSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [142] arXiv:2206.01923 [pdf, other]
-
Title: From Pixels to Objects: Cubic Visual Attention for Visual Question AnsweringSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [143] arXiv:2206.01942 [pdf, other]
-
Title: Occlusion-Resistant Instance Segmentation of Piglets in Farrowing Pens Using Center Clustering NetworkAuthors: Endai Huang, Axiu Mao, Junhui Hou, Yongjian Wu, Weitao Xu, Maria Camila Ceballos, Thomas D. Parsons, Kai LiuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [144] arXiv:2206.01961 [pdf, other]
-
Title: C$^3$Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in ColonoscopySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [145] arXiv:2206.01986 [pdf, other]
-
Title: Delving into the Openness of CLIPComments: Accepted by Findings of ACL 2023 (Long Paper). Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [146] arXiv:2206.01988 [pdf, other]
-
Title: Cross-modal Clinical Graph Transformer for Ophthalmic Report GenerationComments: CVPR 2022 (Poster)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [147] arXiv:2206.01992 [pdf, other]
-
Title: CAINNFlow: Convolutional block Attention modules and Invertible Neural Networks Flow for anomaly detection and localization tasksAuthors: Ruiqing Yan, Fan Zhang, Mengyuan Huang, Wu Liu, Dongyu Hu, Jinfeng Li, Qiang Liu, Jinrong Jiang, Qianjin Guo, Linghan ZhengSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [148] arXiv:2206.01999 [pdf, other]
-
Title: MSR: Making Self-supervised learning Robust to Aggressive AugmentationsAuthors: Yingbin Bai, Erkun Yang, Zhaoqing Wang, Yuxuan Du, Bo Han, Cheng Deng, Dadong Wang, Tongliang LiuSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [149] arXiv:2206.02002 [pdf, other]
-
Title: CVNets: High Performance Library for Computer VisionComments: Technical reportSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [150] arXiv:2206.02015 [pdf, other]
-
Title: APES: Articulated Part Extraction from Sprite SheetsAuthors: Zhan Xu, Matthew Fisher, Yang Zhou, Deepali Aneja, Rushikesh Dudhat, Li Yi, Evangelos KalogerakisSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [151] arXiv:2206.02027 [pdf, other]
-
Title: Implicit Neural Representation for Mesh-Free Inverse Obstacle ScatteringComments: 6 pages, 8 figures, to be published in 2022 Asilomar Conference on Signals, Systems, and ComputersJournal-ref: 2022 Asilomar Conference on Signals, Systems, and ComputersSubjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [152] arXiv:2206.02029 [pdf, other]
-
Title: Guided Deep Metric LearningAuthors: Jorge Gonzalez-Zapata, Ivan Reyes-Amezcua, Daniel Flores-Araiza, Mauricio Mendez-Ruiz, Gilberto Ochoa-Ruiz, Andres Mendez-VazquezSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [153] arXiv:2206.02050 [pdf, other]
-
Title: Learning Speaker-specific Lip-to-Speech GenerationComments: Accepted at ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [154] arXiv:2206.02066 [pdf, other]
-
Title: PIDNet: A Real-time Semantic Segmentation Network Inspired by PID ControllersComments: 11 pages, 9 figures; This paper will be published by CVPR2023 soon, please refer to the official version thenSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [155] arXiv:2206.02070 [pdf, other]
- [156] arXiv:2206.02082 [pdf, other]
-
Title: Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language RetrievalAuthors: Xudong Lin, Simran Tiwari, Shiyuan Huang, Manling Li, Mike Zheng Shou, Heng Ji, Shih-Fu ChangComments: To appear in CVPR 2023; The code will be released at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [157] arXiv:2206.02086 [pdf, other]
-
Title: Towards the Creation of a Nutrition and Food Group Based Image DatabaseAuthors: Zeman Shao, Jiangpeng He, Ya-Yuan Yu, Luotao Lin, Alexandra Cowan, Heather Eicher-Miller, Fengqing ZhuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [158] arXiv:2206.02087 [pdf, other]
-
Title: Accurate Scoliosis Vertebral Landmark Localization on X-ray Images via Shape-constrained Multi-stage Cascaded CNNsComments: 9 pages, submitted to IEEE Journal of Biomedical and Health InformaticsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [159] arXiv:2206.02099 [pdf, other]
-
Title: Point-to-Voxel Knowledge Distillation for LiDAR Semantic SegmentationComments: CVPR 2022; Our model ranks 1st on Waymo and SemanticKITTI (single-scan) challenges, and ranks 3rd on SemanticKITTI (multi-scan) challenge; Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [160] arXiv:2206.02104 [pdf, other]
-
Title: ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentencesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [161] arXiv:2206.02110 [pdf, other]
-
Title: Computer Vision-based Characterization of Large-scale Jet Flames using a Synthetic Infrared Image Generation ApproachAuthors: Carmina Pérez-Guerrero, Jorge Francisco Ciprián-Sánchez, Adriana Palacios, Gilberto Ochoa-Ruiz, Miguel Gonzalez-Mendoza, Vahid Foroughi, Elsa Pastor, Gerardo Rodriguez-HernandezComments: Pre-print submitted to Engineering Science and Technology, an International JournalSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [162] arXiv:2206.02116 [pdf, other]
-
Title: Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in VideosComments: Accepted to CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [163] arXiv:2206.02118 [pdf, other]
-
Title: ShapePU: A New PU Learning Framework Regularized by Global Consistency for Scribble Supervised Cardiac SegmentationComments: 11 pages,4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [164] arXiv:2206.02120 [pdf, other]
-
Title: MPANet: Multi-Patch Attention For Infrared Small Target object DetectionComments: 4 pages 3 figuresJournal-ref: 2022IGARSSSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [165] arXiv:2206.02136 [pdf, other]
-
Title: LDRNet: Enabling Real-time Document Localization on Mobile DevicesComments: ECML-PKDD 2022 this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
- [166] arXiv:2206.02146 [pdf, other]
-
Title: Recurrent Video Restoration Transformer with Guided Deformable AttentionAuthors: Jingyun Liang, Yuchen Fan, Xiaoyu Xiang, Rakesh Ranjan, Eddy Ilg, Simon Green, Jiezhang Cao, Kai Zhang, Radu Timofte, Luc Van GoolComments: Accepted by NeurIPS 2022. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [167] arXiv:2206.02153 [pdf, other]
-
Title: HPGNN: Using Hierarchical Graph Neural Networks for Outdoor Point Cloud ProcessingAuthors: Arulmolivarman Thieshanthan, Amashi Niwarthana, Pamuditha Somarathne, Tharindu Wickremasinghe, Ranga RodrigoComments: Accepted for ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [168] arXiv:2206.02158 [pdf, other]
-
Title: Vanilla Feature Distillation for Improving the Accuracy-Robustness Trade-Off in Adversarial TrainingComments: 12 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [169] arXiv:2206.02163 [pdf, other]
-
Title: MotionCNN: A Strong Baseline for Motion Prediction in Autonomous DrivingComments: CVPR Workshop on Autonomous Driving 2021. Waymo Motion Prediction Challenge 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [170] arXiv:2206.02180 [pdf, other]
-
Title: Semi-Supervised Learning for Mars Imagery Classification and SegmentationComments: Accepted by ACM Trans. on Multimedia Computing Communications and Applications (TOMM)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [171] arXiv:2206.02187 [pdf, other]
-
Title: M2FNet: Multi-modal Fusion Network for Emotion Recognition in ConversationAuthors: Vishal Chudasama, Purbayan Kar, Ashish Gudmalwar, Nirmesh Shah, Pankaj Wasnik, Naoyuki OnoeComments: Accepted for publication in the 5th Multimodal Learning and Applications (MULA) Workshop at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [172] arXiv:2206.02194 [pdf, other]
-
Title: FOF: Learning Fourier Occupancy Field for Monocular Real-time Human ReconstructionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [173] arXiv:2206.02200 [pdf, other]
-
Title: GridShift: A Faster Mode-seeking Algorithm for Image Segmentation and Object TrackingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [174] arXiv:2206.02203 [pdf, ps, other]
-
Title: 3D Convolutional with Attention for Action RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [175] arXiv:2206.02220 [pdf, other]
-
Title: U(1) Symmetry-breaking Observed in Generic CNN Bottleneck LayersSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [176] arXiv:2206.02234 [pdf, other]
-
Title: Two Decades of Bengali Handwritten Digit Recognition: A SurveyAuthors: A.B.M. Ashikur Rahman, Md. Bakhtiar Hasan, Sabbir Ahmed, Tasnim Ahmed, Md. Hamjajul Ashmafee, Mohammad Ridwan Kabir, Md. Hasanul KabirComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. 38 pages, 23 figures, 12 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [177] arXiv:2206.02257 [pdf, other]
-
Title: Efficient Annotation and Learning for 3D Hand Pose Estimation: A SurveySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [178] arXiv:2206.02260 [pdf, other]
-
Title: SealID: Saimaa ringed seal re-identification datasetAuthors: Ekaterina Nepovinnykh, Tuomas Eerola, Vincent Biard, Piia Mutka, Marja Niemi, Heikki Kälviäinen, Mervi KunnasrantaComments: 15 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Populations and Evolution (q-bio.PE)
- [179] arXiv:2206.02261 [pdf, other]
-
Title: Towards Individual Grevy's Zebra Identification via Deep 3D Fitting and Metric LearningComments: 4 pages, 5 figures, 1 table; typos corrected, references updatedSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [180] arXiv:2206.02270 [pdf, other]
-
Title: Estimating building energy efficiency from street view imagery, aerial imagery, and land surface temperature dataAuthors: Kevin Mayer, Lukas Haas, Tianyuan Huang, Juan Bernabé-Moreno, Ram Rajagopal, Martin FischerSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [181] arXiv:2206.02281 [pdf, other]
-
Title: E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial VehiclesAuthors: Zhenyu Hu, Zhenyu Wu, Pengcheng Pi, Yunhe Xue, Jiayi Shen, Jianchao Tan, Xiangru Lian, Zhangyang Wang, Ji LiuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [182] arXiv:2206.02288 [pdf, other]
-
Title: ACT: Semi-supervised Domain-adaptive Medical Image Segmentation with Asymmetric Co-trainingAuthors: Xiaofeng Liu, Fangxu Xing, Nadya Shusharina, Ruth Lim, C-C Jay Kuo, Georges El Fakhri, Jonghye WooComments: MICCAI 2022 (early accept)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [183] arXiv:2206.02295 [pdf, other]
-
Title: HIFI-Net: A Novel Network for Enhancement to Underwater ImagesComments: 7 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [184] arXiv:2206.02307 [pdf, other]
-
Title: Bootstrapping Semi-supervised Medical Image Segmentation with Anatomical-aware Contrastive DistillationComments: Accepted at Information Processing in Medical Imaging (IPMI 2023)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [185] arXiv:2206.02325 [pdf, other]
-
Title: Evaluation-oriented Knowledge Distillation for Deep Face RecognitionComments: CVPR2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [186] arXiv:2206.02327 [pdf, other]
-
Title: JigsawHSI: a network for Hyperspectral Image classificationComments: 7 pages, 7 figures, not peer reviewedSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [187] arXiv:2206.02331 [pdf, ps, other]
-
Title: MASNet:Improve Performance of Siamese Networks with Mutual-attention for Remote Sensing Change Detection TasksComments: XXIV ISPRS CongressSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [188] arXiv:2206.02338 [pdf, other]
-
Title: OrdinalCLIP: Learning Rank Prompts for Language-Guided Ordinal RegressionComments: Accepted by NeurIPS2022. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [189] arXiv:2206.02342 [pdf, other]
-
Title: WHU-Stereo: A Challenging Benchmark for Stereo Matching of High-Resolution Satellite ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [190] arXiv:2206.02343 [pdf, other]
-
Title: Contrastive Graph Multimodal Model for Text Classification in VideosSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [191] arXiv:2206.02345 [pdf, other]
-
Title: Anomaly Detection with Test Time Augmentation and Consistency EvaluationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [192] arXiv:2206.02349 [pdf, other]
-
Title: Invariant Grounding for Video Question AnsweringComments: CVPR2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [193] arXiv:2206.02355 [pdf, other]
-
Title: Relation Matters: Foreground-aware Graph-based Relational Reasoning for Domain Adaptive Object DetectionAuthors: Chaoqi Chen, Jiongcheng Li, Hong-Yu Zhou, Xiaoguang Han, Yue Huang, Xinghao Ding, Yizhou YuComments: Accepted by IEEE T-PAMISubjects: Computer Vision and Pattern Recognition (cs.CV)
- [194] arXiv:2206.02366 [pdf, other]
-
Title: Scan2Part: Fine-grained and Hierarchical Part-level Understanding of Real-World 3D ScansAuthors: Alexandr Notchenko, Vladislav Ishimtsev, Alexey Artemov, Vadim Selyutin, Emil Bogomolov, Evgeny BurnaevComments: In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and ApplicationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [195] arXiv:2206.02373 [pdf, other]
-
Title: Sports Re-ID: Improving Re-Identification Of Players In Broadcast Videos Of Team SportsAuthors: Bharath ComandurSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [196] arXiv:2206.02374 [pdf, other]
-
Title: CorticalFlow: A Diffeomorphic Mesh Deformation Module for Cortical Surface ReconstructionAuthors: Léo Lebrat, Rodrigo Santa Cruz, Frédéric de Gournay, Darren Fu, Pierrick Bourgeat, Jurgen Fripp, Clinton Fookes, Olivier SalvadoSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [197] arXiv:2206.02377 [pdf, other]
-
Title: BInGo: Bayesian Intrinsic Groupwise Registration via Explicit Hierarchical DisentanglementSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [198] arXiv:2206.02392 [pdf, ps, other]
-
Title: Semi-Supervised Segmentation of Mitochondria from Electron Microscopy Images Using Spatial ContinuityComments: 4 pages of main text, 5 pages of supplementary material and 1 page of referencesJournal-ref: 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI). IEEE, 2022: 1-5Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [199] arXiv:2206.02405 [pdf, other]
-
Title: Image Protection for Robust Cropping Localization and RecoveryComments: Accepted by IEEE ICME 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [200] arXiv:2206.02424 [pdf, ps, other]
-
Title: Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehiclesComments: 18 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [201] arXiv:2206.02452 [pdf, other]
-
Title: Universal Photometric Stereo Network using Global Lighting ContextsAuthors: Satoshi IkehataComments: Accepted to CVPR2022. Code and Dataset at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [202] arXiv:2206.02454 [pdf, other]
-
Title: What do CNNs Learn in the First Layer and Why? A Linear Systems PerspectiveSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [203] arXiv:2206.02498 [pdf, other]
-
Title: NORPPA: NOvel Ringed seal re-identification by Pelage Pattern AggregationComments: 22 pages, 13 figures, 5 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [204] arXiv:2206.02502 [pdf, other]
-
Title: BehavePassDB: Public Database for Mobile Behavioral Biometrics and Benchmark EvaluationComments: 11 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [205] arXiv:2206.02531 [pdf, other]
-
Title: 3D-Augmented Contrastive Knowledge Distillation for Image-based Object Pose EstimationComments: Accepted for presentation at International Conference on Multimedia Retrieval (ICMR '22)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [206] arXiv:2206.02539 [pdf, other]
-
Title: Robustness Evaluation and Adversarial Training of an Instance Segmentation ModelComments: 15 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [207] arXiv:2206.02544 [pdf, other]
-
Title: RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene GenerationComments: Accepted at the IEEE Winter Conference on Applications of Computer Vision, WACV 2022Journal-ref: 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022, pp. 2723-2732Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [208] arXiv:2206.02547 [pdf, ps, other]
-
Title: Towards retrieving dispersion profiles using quantum-mimic Optical Coherence Tomography and Machine LearninComments: 11 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
- [209] arXiv:2206.02559 [pdf, other]
-
Title: Conversation Group Detection With Spatio-Temporal ContextSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [210] arXiv:2206.02564 [pdf, other]
-
Title: Machine Learning for Detection of 3D Features using sparse X-ray dataAuthors: Bradley T. Wolfe, Michael J. Falato, Xinhua Zhang, Nga T. T. Nguyen-Fotiadis, J.P. Sauppe, P. M. Kozlowski, P. A. Keiter, R. E. Reinovsky, S. A. Batha, Zhehui WangSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Data Analysis, Statistics and Probability (physics.data-an)
- [211] arXiv:2206.02573 [pdf, other]
- [212] arXiv:2206.02598 [pdf, other]
-
Title: [Reproducibility Report] Explainable Deep One-Class ClassificationComments: Submitted to the ML Reproducibility Challenge 2021 FallSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [213] arXiv:2206.02609 [pdf, other]
-
Title: Real-World Image Super-Resolution by Exclusionary Dual-LearningComments: IEEE TMM 2022; Considering large volume of RealSR datasets, a multi-dataset sampling scheme is developedSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [214] arXiv:2206.02619 [pdf, other]
-
Title: VPIT: Real-time Embedded Single Object 3D Tracking Using Voxel Pseudo ImagesAuthors: Illia Oleksiienko, Paraskevi Nousi, Nikolaos Passalis, Anastasios Tefas, Alexandros IosifidisComments: 10 pages, 5 figures, 4 tables. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [215] arXiv:2206.02622 [pdf, other]
-
Title: Hardware-accelerated Mars Sample Localization via deep transfer learning from photorealistic simulationsAuthors: Raúl Castilla-Arquillo, Carlos Jesús Pérez-del-Pulgar, Gonzalo Jesús Paz-Delgado, Levin GerdesComments: Preprint version only. Final version at IEEE Xplore. Accepted for IEEE Robotics and Automation LettersSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [216] arXiv:2206.02647 [pdf, other]
-
Title: Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised LearningAuthors: Richard J. Chen, Chengkuan Chen, Yicong Li, Tiffany Y. Chen, Andrew D. Trister, Rahul G. Krishnan, Faisal MahmoodComments: Accepted to CVPR 2022 (Oral)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [217] arXiv:2206.02664 [pdf, other]
-
Title: Learning with Capsules: A SurveyComments: 29 pages, 43 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [218] arXiv:2206.02680 [pdf, other]
-
Title: Separable Self-attention for Mobile Vision TransformersComments: Technical reportSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [219] arXiv:2206.02714 [pdf, other]
-
Title: FuSS: Fusing Superpixels for Improved Segmentation ConsistencyComments: submitted to IEEEACCESS. 19 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [220] arXiv:2206.02715 [pdf, other]
-
Title: Day-to-Night Image Synthesis for Training Nighttime Neural ISPsAuthors: Abhijith Punnappurath, Abdullah Abuolaim, Abdelrahman Abdelhamed, Alex Levinshtein, Michael S. BrownSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [221] arXiv:2206.02717 [pdf, other]
-
Title: Scene Aware Person Image Generation through Global Contextual ConditioningComments: Accepted in The International Conference on Pattern Recognition (ICPR) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [222] arXiv:2206.02721 [pdf, other]
- [223] arXiv:2206.02735 [pdf, other]
-
Title: People Tracking in Panoramic Video for Guiding RobotsComments: Accepted to 17th International Conference on Intelligent Autonomous Systems (IAS-17)Journal-ref: Proceedings of the 17th International Conference on Intelligent Autonomous Systems (IAS 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [224] arXiv:2206.02749 [pdf, other]
-
Title: CORE: Consistent Representation Learning for Face Forgery DetectionComments: Accepted by CVPRW 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [225] arXiv:2206.02761 [pdf, other]
-
Title: Dual Decomposition of Convex Optimization Layers for Consistent Attention in Medical ImagesComments: 12 pages, 5 figures. In proceedings of the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022. Copyright 2022 by the author(s)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [226] arXiv:2206.02770 [pdf, other]
-
Title: Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of ExpertsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [227] arXiv:2206.02776 [pdf, other]
-
Title: Volumetric Disentanglement for 3D Scene ManipulationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [228] arXiv:2206.02777 [pdf, other]
-
Title: Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [229] arXiv:2206.02779 [pdf, other]
-
Title: Blended Latent DiffusionComments: Accepted to SIGGRAPH 2023. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [230] arXiv:2206.02780 [pdf, other]
-
Title: GenSDF: Two-Stage Learning of Generalizable Signed Distance FunctionsSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [231] arXiv:2206.02846 [pdf, other]
-
Title: A Deeper Dive Into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic InformationAuthors: Matthew Kowal, Mennatullah Siam, Md Amirul Islam, Neil D. B. Bruce, Richard P. Wildes, Konstantinos G. DerpanisComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [232] arXiv:2206.02850 [pdf, other]
-
Title: GLF-CR: SAR-Enhanced Cloud Removal with Global-Local FusionSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [233] arXiv:2206.02876 [pdf, other]
-
Title: SpikiLi: A Spiking Simulation of LiDAR based Real-time Object Detection for Autonomous DrivingAuthors: Sambit Mohapatra, Thomas Mesquida, Mona Hodaei, Senthil Yogamani, Heinrich Gotzig, Patrick MaderComments: Accepted at Workshop on Event Sensing and Neuromorphic Engineering - 8th International Conference on Event-based Control, Communication, and Signal ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [234] arXiv:2206.02903 [pdf, other]
-
Title: Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph MapsComments: CVPR 2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [235] arXiv:2206.02912 [pdf, ps, other]
-
Title: Learning Image Representations for Content Based Image Retrieval of Radiotherapy Treatment PlansAuthors: Charles Huang, Varun Vasudevan, Oscar Pastor-Serrano, Md Tauhidul Islam, Yusuke Nomura, Piotr Dubrowski, Jen-Yeu Wang, Joseph B. Schulz, Yong Yang, Lei XingSubjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [236] arXiv:2206.02967 [pdf, other]
-
Title: Masked Unsupervised Self-training for Label-free Image ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [237] arXiv:2206.02977 [pdf, other]
-
Title: DETR++: Taming Your Multi-Scale Detection TransformerComments: T4V: Transformers for Vision workshop @ CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [238] arXiv:2206.02985 [pdf, other]
-
Title: Structured Context Transformer for Generic Event Boundary DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [239] arXiv:2206.02997 [pdf, ps, other]
-
Title: TadML: A fast temporal action detection with Mechanics-MLPComments: 8 pages,3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [240] arXiv:2206.03001 [pdf, other]
-
Title: PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR SystemAuthors: Chenxia Li, Weiwei Liu, Ruoyu Guo, Xiaoting Yin, Kaitao Jiang, Yongkun Du, Yuning Du, Lingfeng Zhu, Baohua Lai, Xiaoguang Hu, Dianhai Yu, Yanjun MaComments: arXiv admin note: text overlap with arXiv:2109.03144Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [241] arXiv:2206.03010 [pdf, other]
-
Title: MS-RNN: A Flexible Multi-Scale Framework for Spatiotemporal Predictive LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [242] arXiv:2206.03012 [pdf, other]
-
Title: TriBYOL: Triplet BYOL for Self-Supervised Representation LearningComments: Published as a conference paper at ICASSP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [243] arXiv:2206.03014 [pdf, other]
-
Title: The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph GenerationComments: Accepted by CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [244] arXiv:2206.03017 [pdf, other]
-
Title: Development of Automatic Endotracheal Tube and Carina Detection on Portable Supine Chest Radiographs using Artificial IntelligenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [245] arXiv:2206.03033 [pdf, other]
-
Title: Deep Learning Techniques for Visual CountingAuthors: Luca CiampiComments: Version with high-quality images can be found at this https URL arXiv admin note: text overlap with arXiv:1802.03601, arXiv:1707.01202, arXiv:1809.02165, arXiv:1901.06026, arXiv:1808.01244 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [246] arXiv:2206.03048 [pdf, other]
-
Title: Layered Depth Refinement with Mask GuidanceComments: Accepted to CVPR 2022 (camera-ready version)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [247] arXiv:2206.03061 [pdf, other]
-
Title: Spatial Parsing and Dynamic Temporal Pooling networks for Human-Object Interaction detectionComments: Accepted by IJCNN2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [248] arXiv:2206.03062 [pdf, other]
-
Title: Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud MapComments: 9 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [249] arXiv:2206.03064 [pdf, other]
-
Title: A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action DetectorComments: Accepted By WACV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [250] arXiv:2206.03086 [pdf, other]
-
Title: Online Deep Clustering with Video Track ConsistencyComments: Accepted at ICPR2022 as oralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [251] arXiv:2206.03087 [pdf, other]
-
Title: Critical Regularizations for Neural Surface Reconstruction in the WildComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [252] arXiv:2206.03105 [pdf, other]
- [253] arXiv:2206.03111 [pdf, other]
-
Title: Medical Image Registration via Neural FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [254] arXiv:2206.03113 [pdf, other]
-
Title: Wavelet Prior Attention Learning in Axial Inpainting NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [255] arXiv:2206.03149 [pdf, other]
-
Title: Self-Training of Handwritten Word Recognition for Synthetic-to-Real AdaptationComments: Accepted for publication in International Conference on Pattern Recognition (ICPR) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [256] arXiv:2206.03164 [pdf, other]
-
Title: Utility of Equivariant Message Passing in Cortical Mesh SegmentationComments: 13 pages, 3 figures, accepted for MIUA 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [257] arXiv:2206.03196 [pdf, other]
-
Title: Improving Image Captioning with Control Signal of Sentence QualityComments: Accepted by ICASSP2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [258] arXiv:2206.03207 [pdf, other]
-
Title: Omnivision forecasting: combining satellite observations with sky images for improved intra-hour solar energy predictionsComments: Submitted to Renewable EnergySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [259] arXiv:2206.03210 [pdf, other]
-
Title: Deep Neural Patchworks: Coping with Large Segmentation TasksSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [260] arXiv:2206.03287 [pdf, other]
-
Title: NeMF: Neural Motion Fields for Kinematic AnimationComments: Accepted to NeurIPS 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [261] arXiv:2206.03361 [pdf, other]
-
Title: Hierarchical Similarity Learning for Aliasing Suppression Image Super-ResolutionComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [262] arXiv:2206.03367 [pdf, other]
-
Title: Localizing Semantic Patches for Accelerating Image ClassificationComments: Accepted by ICME-2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [263] arXiv:2206.03368 [pdf, other]
-
Title: IL-MCAM: An interactive learning and multi-channel attention mechanism-based weakly supervised colorectal histopathology image classification approachAuthors: Haoyuan Chen, Chen Li, Xiaoyan Li, Md Mamunur Rahaman, Weiming Hu, Yixin Li, Wanli Liu, Changhao Sun, Hongzan Sun, Xinyu Huang, Marcin GrzegorzekJournal-ref: Computers in Biology and Medicine, Volume 143, April 2022, 105265Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [264] arXiv:2206.03373 [pdf, other]
-
Title: Garment Avatars: Realistic Cloth Driving using Pattern RegistrationAuthors: Oshri Halimi, Fabian Prada, Tuur Stuyck, Donglai Xiang, Timur Bagautdinov, He Wen, Ron Kimmel, Takaaki Shiratori, Chenglei Wu, Yaser SheikhSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [265] arXiv:2206.03410 [pdf, other]
-
Title: Fast and Robust Non-Rigid Registration Using Accelerated Majorization-MinimizationComments: Accepted to IEEE Transactions on Pattern Analysis and Machine IntelligenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [266] arXiv:2206.03428 [pdf, other]
-
Title: Revealing Single Frame Bias for Video-and-Language LearningComments: 19 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [267] arXiv:2206.03429 [pdf, other]
-
Title: Generating Long Videos of Dynamic ScenesAuthors: Tim Brooks, Janne Hellsten, Miika Aittala, Ting-Chun Wang, Timo Aila, Jaakko Lehtinen, Ming-Yu Liu, Alexei A. Efros, Tero KarrasSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [268] arXiv:2206.03431 [pdf, other]
-
Title: Self-supervised Domain Adaptation in Crowd CountingComments: Accepted at ICIP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [269] arXiv:2206.03452 [pdf, other]
-
Title: Can CNNs Be More Robust Than Transformers?Comments: ICLR2023. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [270] arXiv:2206.03461 [pdf, other]
-
Title: Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion ModelsAuthors: Walter H. L. Pinaya, Mark S. Graham, Robert Gray, Pedro F Da Costa, Petru-Daniel Tudosiu, Paul Wright, Yee H. Mah, Andrew D. MacKinnon, James T. Teo, Rolf Jager, David Werring, Geraint Rees, Parashkev Nachev, Sebastien Ourselin, M. Jorge CardosoSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM)
- [271] arXiv:2206.03480 [pdf, other]
-
Title: SHRED: 3D Shape Region Decomposition with Learned Local OperationsComments: SIGGRAPH ASIA 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [272] arXiv:2206.03484 [pdf, other]
-
Title: Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language EmbeddingAuthors: Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang JiangComments: CVPR camera readySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [273] arXiv:2206.03544 [pdf, other]
-
Title: A Penny for Your (visual) Thoughts: Self-Supervised Reconstruction of Natural Movies from Brain ActivitySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [274] arXiv:2206.03591 [pdf, other]
-
Title: ObPose: Leveraging Pose for Object-Centric Scene Inference and Generation in 3DComments: 14 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [275] arXiv:2206.03600 [pdf, other]
-
Title: OneRing: A Simple Method for Source-free Open-partial Domain AdaptationComments: Updated. It only focuses on source-free open-partial domain adaptation, to avoid any potential misunderstandingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [276] arXiv:2206.03612 [pdf, other]
-
Title: Predictive Modeling of Charge Levels for Battery Electric Vehicles using CNN EfficientNet and IGTD AlgorithmSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Signal Processing (eess.SP)
- [277] arXiv:2206.03657 [pdf, other]
-
Title: Delving into the Pre-training Paradigm of Monocular 3D Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [278] arXiv:2206.03661 [pdf, other]
-
Title: One Hyper-Initializer for All Network Architectures in Medical Image AnalysisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [279] arXiv:2206.03666 [pdf, other]
-
Title: Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and TrackingAuthors: Longlong Jing, Ruichi Yu, Henrik Kretzschmar, Kang Li, Charles R. Qi, Hang Zhao, Alper Ayvaci, Xu Chen, Dillon Cower, Yingwei Li, Yurong You, Han Deng, Congcong Li, Dragomir AnguelovJournal-ref: ICRA2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [280] arXiv:2206.03673 [pdf, other]
-
Title: Unsupervised Learning of 3D Scene Flow from Monocular CameraComments: ICRA2021Journal-ref: 2021 IEEE International Conference on Robotics and Automation (ICRA)Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [281] arXiv:2206.03678 [pdf, other]
-
Title: UHD Image Deblurring via Multi-scale Cubic-MixerComments: 8 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [282] arXiv:2206.03680 [pdf, other]
-
Title: Improving Evaluation of Debiasing in Image ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [283] arXiv:2206.03687 [pdf, other]
-
Title: A Unified Model for Multi-class Anomaly DetectionComments: Accepted by NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [284] arXiv:2206.03691 [pdf, other]
-
Title: Robust Deep Ensemble Method for Real-world Image DenoisingSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [285] arXiv:2206.03697 [pdf, other]
-
Title: Blind Face Restoration: Benchmark Datasets and a Baseline ModelSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [286] arXiv:2206.03698 [pdf, other]
-
Title: What do we learn? Debunking the Myth of Unsupervised Outlier DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [287] arXiv:2206.03727 [pdf, other]
-
Title: Wavelet Regularization Benefits Adversarial TrainingComments: Preprint versionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [288] arXiv:2206.03740 [pdf, other]
-
Title: Large Loss Matters in Weakly Supervised Multi-Label ClassificationComments: CVPR 2022. First two authors contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [289] arXiv:2206.03753 [pdf, other]
-
Title: Task Agnostic Restoration of Natural Video DynamicsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [290] arXiv:2206.03775 [pdf, other]
-
Title: PixSelect: Less but Reliable Pixels for Accurate and Efficient LocalizationAuthors: Mohammad AltillawiJournal-ref: IEEE International Conference on Robotics and Automation (ICRA), May 23-27, 2022. Philadelphia, PA, USA, p 4156-4162Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [291] arXiv:2206.03778 [pdf, other]
-
Title: Learning Digital Terrain Models from Point Clouds: ALS2DTM Dataset and Rasterization-based GANSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [292] arXiv:2206.03789 [pdf, other]
-
Title: Language-Bridged Spatial-Temporal Interaction for Referring Video Object SegmentationComments: Accepted by CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [293] arXiv:2206.03799 [pdf, other]
-
Title: Dyna-DM: Dynamic Object-aware Self-supervised Monocular Depth MapsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [294] arXiv:2206.03820 [pdf, ps, other]
-
Title: SUPER-IVIM-DC: Intra-voxel incoherent motion based Fetal lung maturity assessment from limited DWI data using supervised learning coupled with data-consistencyAuthors: Noam Korngut, Elad Rotman, Onur Afacan, Sila Kurugol, Yael Zaffrani-Reznikov, Shira Nemirovsky-Rotman, Simon Warfield, Moti FreimanComments: Accepted to the International Conference on Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, to be held during Sept 18-22 in SingaporeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
- [295] arXiv:2206.03858 [pdf, other]
-
Title: Rotation-Equivariant Conditional Spherical Neural Fields for Learning a Natural Illumination PriorComments: NeurIPS 2022 - Project Website: jadgardner.github.io/RENISubjects: Computer Vision and Pattern Recognition (cs.CV)
- [296] arXiv:2206.03860 [pdf, other]
-
Title: Orthonormal Convolutions for the Rotation Based Iterative GaussianizationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [297] arXiv:2206.03862 [pdf, other]
-
Title: Perceptual Quality Assessment for Fine-Grained Compressed ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [298] arXiv:2206.03876 [pdf, other]
-
Title: Progressive GANomaly: Anomaly detection with progressively growing GANsComments: SPIE Medical Imaging 2022: Image Processing conferenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [299] arXiv:2206.03888 [pdf, other]
-
Title: ConFUDA: Contrastive Fewshot Unsupervised Domain Adaptation for Medical Image SegmentationAuthors: Mingxuan Gu, Sulaiman Vesal, Mareike Thies, Zhaoya Pan, Fabian Wagner, Mirabela Rusu, Andreas Maier, Ronak KostiSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [300] arXiv:2206.03891 [pdf, other]
-
Title: PrivHAR: Recognizing Human Actions From Privacy-preserving LensAuthors: Carlos Hinojosa, Miguel Marquez, Henry Arguello, Ehsan Adeli, Li Fei-Fei, Juan Carlos NieblesComments: Oral paper presented at European Conference on Computer Vision (ECCV) 2022, in Tel Aviv, IsraelJournal-ref: Computer Vision--ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23--27, 2022, Proceedings, Part IVSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [301] arXiv:2206.03928 [pdf, other]
-
Title: Direct Triangulation with Spherical Projection for Omnidirectional CamerasAuthors: Ciarán EisingComments: 8 pages, 4 figures, 3 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [302] arXiv:2206.03939 [pdf, other]
-
Title: Depth-Adapted CNNs for RGB-D Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [303] arXiv:2206.03943 [pdf, other]
-
Title: Robust Environment Perception for Automated Driving: A Unified Learning Pipeline for Visual-Infrared Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
- [304] arXiv:2206.03970 [pdf, other]
-
Title: Narrowing the Coordinate-frame Gap in Behavior Prediction Models: Distillation for Efficient and Accurate Scene-centric Motion ForecastingComments: Accepted at ICRA 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [305] arXiv:2206.04003 [pdf, other]
-
Title: Patch-based Object-centric Transformers for Efficient Video GenerationComments: Project Website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [306] arXiv:2206.04028 [pdf, other]
-
Title: CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous DrivingComments: Pre-trained backbones and fine-tuned downstream models are now available: this https URL Code will be releasedSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [307] arXiv:2206.04029 [pdf, other]
-
Title: Accelerating Score-based Generative Models for High-Resolution Image SynthesisSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [308] arXiv:2206.04040 [pdf, other]
-
Title: MobileOne: An Improved One millisecond Mobile BackboneComments: Accepted at CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [309] arXiv:2206.04042 [pdf, other]
-
Title: Learning Ego 3D Representation as Ray TracingComments: ECCV 2022. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [310] arXiv:2206.04046 [pdf, other]
-
Title: Sparse Mixture-of-Experts are Domain Generalizable LearnersComments: ICLR 2023 (accepted as Oral presentation)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [311] arXiv:2206.04124 [pdf, other]
-
Title: DRHDR: A Dual branch Residual Network for Multi-Bracket High Dynamic Range ImagingComments: Accepted by CVPRW 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [312] arXiv:2206.04125 [pdf, other]
-
Title: Towards Self-supervised and Weight-preserving Neural Architecture SearchAuthors: Zhuowei Li, Yibo Gao, Zhenzhou Zha, Zhiqiang HU, Qing Xia, Shaoting Zhang, Dimitris N. MetaxasSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [313] arXiv:2206.04158 [pdf, other]
-
Title: Texture Extraction Methods Based Ensembling Framework for Improved ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [314] arXiv:2206.04170 [pdf, other]
-
Title: CASS: Cross Architectural Self-Supervision for Medical Image AnalysisComments: (27 pages, 14 figures), Accepted at NeurIPS 2022 Workshop: Self-Supervised Learning - Theory and PracticeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [315] arXiv:2206.04176 [pdf, other]
-
Title: VN-Transformer: Rotation-Equivariant Attention for Vector NeuronsComments: Published in Transactions on Machine Learning Research (TMLR), 2023; Previous version appeared in Workshop on Machine Learning for Autonomous Driving, Conference on Neural Information Processing Systems (NeurIPS), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [316] arXiv:2206.04197 [pdf, other]
-
Title: SCAMPS: Synthetics for Camera Measurement of Physiological SignalsAuthors: Daniel McDuff, Miah Wander, Xin Liu, Brian L. Hill, Javier Hernandez, Jonathan Lester, Tadas BaltrusaitisSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [317] arXiv:2206.04231 [pdf, other]
-
Title: JNMR: Joint Non-linear Motion Regression for Video Frame InterpolationComments: Accepted by IEEE Transactions on Image Processing (TIP)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [318] arXiv:2206.04242 [pdf, other]
-
Title: OOD Augmentation May Be at Odds with Open-Set RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [319] arXiv:2206.04246 [pdf, other]
-
Title: SwinCheX: Multi-label classification on chest X-ray images with transformersSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [320] arXiv:2206.04271 [pdf, other]
-
Title: DeepVerge: Classification of Roadside Verge Biodiversity and Conservation PotentialSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [321] arXiv:2206.04281 [pdf, other]
-
Title: Local Spatiotemporal Representation Learning for Longitudinally-consistent Neuroimage AnalysisComments: Accepted at NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [322] arXiv:2206.04295 [pdf, other]
-
Title: Reconstruct Face from Features Using GAN Generator as a Distribution ConstraintSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [323] arXiv:2206.04325 [pdf, other]
-
Title: CFA: Coupled-hypersphere-based Feature Adaptation for Target-Oriented Anomaly LocalizationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [324] arXiv:2206.04349 [pdf, other]
-
Title: Deep radiomic signature with immune cell markers predicts the survival of glioma patientsAuthors: Ahmad Chaddad, Paul Daniel Mingli Zhang, Saima Rathore, Paul Sargos, Christian Desrosiers, Tamim NiaziJournal-ref: Neurocomputing, Volume 469, 16 January 2022, Pages 366-375Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM); Methodology (stat.ME)
- [325] arXiv:2206.04365 [pdf, other]
-
Title: CARLA-GeAR: a Dataset Generator for a Systematic Evaluation of Adversarial Robustness of Vision ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [326] arXiv:2206.04374 [pdf, other]
-
Title: Uncovering bias in the PlantVillage datasetAuthors: Mehmet Alican NoyanSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [327] arXiv:2206.04381 [pdf, other]
-
Title: STIP: A SpatioTemporal Information-Preserving and Perception-Augmented Model for High-Resolution Video PredictionComments: This journal paper is extended from our previous work accepted in CVPR2022 and has been submitted to IEEE Transactions on MultimediaSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [328] arXiv:2206.04382 [pdf, other]
-
Title: CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human MeshesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
- [329] arXiv:2206.04399 [pdf, ps, other]
-
Title: Depression Recognition using Remote Photoplethysmography from Facial VideosComments: 10 pages, 5 figures, 8 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
- [330] arXiv:2206.04401 [pdf, other]
-
Title: Cross-modal Local Shortest Path and Global Enhancement for Visible-Thermal Person Re-IdentificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [331] arXiv:2206.04403 [pdf, other]
-
Title: VITA: Video Instance Segmentation via Object Token AssociationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [332] arXiv:2206.04406 [pdf, other]
-
Title: Unsupervised Learning of the Total Variation FlowSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [333] arXiv:2206.04425 [pdf, other]
-
Title: Multiple Instance Learning for Digital Pathology: A Review on the State-of-the-Art, Limitations & Future PotentialSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [334] arXiv:2206.04449 [pdf, other]
-
Title: Segmentation Enhanced Lameness Detection in Dairy Cows from RGB and Depth VideoComments: Accepted at the CV4Animals workshop in CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [335] arXiv:2206.04452 [pdf, other]
-
Title: Draft-and-Revise: Effective Image Generation with Contextual RQ-TransformerComments: 20 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [336] arXiv:2206.04453 [pdf, other]
-
Title: The Missing Link: Finding label relations across datasetsComments: ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [337] arXiv:2206.04479 [pdf, ps, other]
-
Title: BSM loss: A superior way in modeling aleatory uncertainty of fine_grained classificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [338] arXiv:2206.04503 [pdf, other]
-
Title: cycle text2face: cycle text-to-face gan via transformersSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [339] arXiv:2206.04511 [pdf, other]
-
Title: Efficient Human Pose Estimation via 3D Event Point CloudComments: Accepted to 3DV 2022. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [340] arXiv:2206.04531 [pdf, other]
-
Title: ECLAD: Extracting Concepts with Local Aggregated DescriptorsComments: 34 pages, under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [341] arXiv:2206.04557 [pdf, other]
-
Title: SparseFormer: Attention-based Depth Completion NetworkComments: Accepted at CV4ARVR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [342] arXiv:2206.04558 [pdf, other]
-
Title: BFS-Net: Weakly Supervised Cell Instance Segmentation from Bright-Field Microscopy Z-StacksSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [343] arXiv:2206.04575 [pdf, other]
-
Title: Transformer based Urdu Handwritten Text Optical Character ReaderSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [344] arXiv:2206.04584 [pdf, other]
-
Title: Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel TransformerComments: Tech report. Work in progressSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [345] arXiv:2206.04590 [pdf, other]
-
Title: GASP: Gated Attention For Saliency PredictionComments: International Joint Conference on Artificial Intelligence (IJCAI-21)Journal-ref: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (2021) 584-591Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [346] arXiv:2206.04636 [pdf, other]
-
Title: Spatial Entropy as an Inductive Bias for Vision TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [347] arXiv:2206.04655 [pdf, other]
-
Title: Towards Layer-wise Image VectorizationAuthors: Xu Ma, Yuqian Zhou, Xingqian Xu, Bin Sun, Valerii Filev, Nikita Orlov, Yun Fu, Humphrey ShiComments: Accepted as Oral Presentation at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [348] arXiv:2206.04656 [pdf, other]
-
Title: Simple Cues Lead to a Strong Multi-Object TrackerComments: Accepted to CVPR2023!Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [349] arXiv:2206.04662 [pdf, other]
-
Title: DiSparse: Disentangled Sparsification for Multitask Model CompressionComments: Accepted at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [350] arXiv:2206.04664 [pdf, other]
-
Title: On Data Scaling in Masked Image ModelingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [351] arXiv:2206.04665 [pdf, other]
-
Title: AGConv: Adaptive Graph Convolution on 3D Point CloudsAuthors: Mingqiang Wei, Zeyong Wei, Haoran Zhou, Fei Hu, Huajian Si, Zhilei Chen, Zhe Zhu, Jingbo Qiu, Xuefeng Yan, Yanwen Guo, Jun Wang, Jing QinComments: arXiv admin note: substantial text overlap with arXiv:2108.08035Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [352] arXiv:2206.04667 [pdf, other]
-
Title: Extreme Masking for Learning Instance and Distributed Visual RepresentationsComments: Accepted in TMLRSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [353] arXiv:2206.04668 [pdf, other]
-
Title: GateHUB: Gated History Unit with Background Suppression for Online Action DetectionComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [354] arXiv:2206.04669 [pdf, other]
-
Title: Beyond RGB: Scene-Property Synthesis with Neural Radiance FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [355] arXiv:2206.04670 [pdf, other]
-
Title: PointNeXt: Revisiting PointNet++ with Improved Training and Scaling StrategiesAuthors: Guocheng Qian, Yuchen Li, Houwen Peng, Jinjie Mai, Hasan Abed Al Kader Hammoud, Mohamed Elhoseiny, Bernard GhanemComments: Accepted by NeurIPS'22. Code and models are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [356] arXiv:2206.04671 [pdf, other]
-
Title: Open Challenges in Deep Stereo: the Booster DatasetAuthors: Pierluigi Zama Ramirez, Fabio Tosi, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di StefanoComments: CVPR 2022, New Orleans. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [357] arXiv:2206.04673 [pdf, other]
-
Title: Neural Prompt SearchComments: Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [358] arXiv:2206.04674 [pdf, other]
-
Title: Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEsComments: Code shall be released at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [359] arXiv:2206.04783 [pdf, other]
-
Title: ReFace: Real-time Adversarial Attacks on Face Recognition SystemsAuthors: Shehzeen Hussain, Todd Huster, Chris Mesterharm, Paarth Neekhara, Kevin An, Malhar Jere, Harshvardhan Sikka, Farinaz KoushanfarSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [360] arXiv:2206.04785 [pdf, other]
-
Title: Building Spatio-temporal Transformers for Egocentric 3D Pose EstimationComments: 4 pages, Extended abstract, Joint International Workshop on Egocentric Perception, Interaction and Computing (EPIC) and Ego4D, IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [361] arXiv:2206.04790 [pdf, other]
-
Title: Learn2Augment: Learning to Composite Videos for Data Augmentation in Action RecognitionComments: Accepted to ECCV-2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [362] arXiv:2206.04797 [pdf, other]
-
Title: Memory-efficient model-based deep learning with convergence and robustness guaranteesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [363] arXiv:2206.04831 [pdf, other]
-
Title: R4D: Utilizing Reference Objects for Long-Range Distance EstimationComments: ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [364] arXiv:2206.04846 [pdf, other]
-
Title: Masked Autoencoders are Robust Data AugmentorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [365] arXiv:2206.04854 [pdf, other]
-
Title: Heterogeneous Face Recognition via Face Synthesis with Identity-Attribute DisentanglementComments: Accepted for publication in IEEE Transactions on Information Forensics and Security (TIFS)Journal-ref: IEEE Transactions on Information Forensics and Security, vol. 17, pp. 1344-1358, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [366] arXiv:2206.04863 [pdf, other]
-
Title: Symbolic image detection using scene and knowledge graphsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [367] arXiv:2206.04867 [pdf, other]
-
Title: The Gender Gap in Face Recognition Accuracy Is a Hairy ProblemSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [368] arXiv:2206.04874 [pdf, ps, other]
-
Title: The 1st Data Science for Pavements ChallengeAuthors: Ashkan Behzadian, Tanner Wambui Muturi, Tianjie Zhang, Hongmin Kim, Amanda Mullins, Yang Lu, Neema Jasika Owor, Yaw Adu-Gyamfi, William Buttlar, Majidifard Hamed, Armstrong Aboah, David Mensching, Spragg Robert, Matthew Corrigan, Jack Youtchef, Dave EshanSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [369] arXiv:2206.04879 [pdf, other]
-
Title: Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label DiffusionComments: IEEE Transactions on Image Processing 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [370] arXiv:2206.04901 [pdf, other]
-
Title: NeRF-In: Free-Form NeRF Inpainting with RGB-D PriorsComments: Hao-Kang Liu and I-Chao Shen contributed equally to the paper. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [371] arXiv:2206.04906 [pdf, other]
-
Title: Out of Sight, Out of Mind: A Source-View-Wise Feature Aggregation for Multi-View Image-Based RenderingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [372] arXiv:2206.04916 [pdf, other]
-
Title: PatchComplete: Learning Multi-Resolution Patch Priors for 3D Shape Completion on Unseen CategoriesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [373] arXiv:2206.04927 [pdf, other]
-
Title: Ego2HandsPose: A Dataset for Egocentric Two-hand 3D Global Pose EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [374] arXiv:2206.04942 [pdf, other]
-
Title: Neural Template: Topology-aware Reconstruction and Disentangled Generation of 3D MeshesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [375] arXiv:2206.04949 [pdf, other]
-
Title: Deep Multi-View Semi-Supervised Clustering with Sample Pairwise ConstraintsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [376] arXiv:2206.04958 [pdf, other]
-
Title: Self-Supervised Deep Subspace Clustering with Entropy-normSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [377] arXiv:2206.04975 [pdf, other]
-
Title: NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression RecognitionComments: 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [378] arXiv:2206.04979 [pdf, ps, other]
-
Title: Convolutional layers are equivariant to discrete shifts but not continuous translationsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [379] arXiv:2206.04981 [pdf, other]
-
Title: Positional Label for Self-Supervised Vision TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [380] arXiv:2206.05028 [pdf, other]
-
Title: Spatial Cross-Attention Improves Self-Supervised Visual Representation LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [381] arXiv:2206.05039 [pdf, other]
-
Title: Image Generation with Multimodal Priors using Denoising Diffusion Probabilistic ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [382] arXiv:2206.05099 [pdf, other]
-
Title: SimVP: Simpler yet Better Video PredictionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [383] arXiv:2206.05102 [pdf, other]
-
Title: Saccade Mechanisms for Image Classification, Object Detection and TrackingComments: 4 Pages, 6 figures, will be presented at CVPR2022-NeuroVision workshop as a Lightning talkSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [384] arXiv:2206.05127 [pdf, other]
-
Title: Globally-Optimal Contrast Maximisation for Event CamerasComments: arXiv admin note: substantial text overlap with arXiv:2203.03914Journal-ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [385] arXiv:2206.05128 [pdf, ps, other]
-
Title: Real-time Hyper-Dimensional Reconfiguration at the Edge using Hardware AcceleratorsAuthors: Indhumathi Kandaswamy, Saurabh Farkya, Zachary Daniels, Gooitzen van der Wal, Aswin Raghavan, Yuzheng Zhang, Jun Hu, Michael Lomnitz, Michael Isnardi, David Zhang, Michael PiacentinoComments: 9 pages, 15 figures. Will be presented in Embedded Vision Workshop at CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR)
- [386] arXiv:2206.05149 [pdf, other]
-
Title: Referring Image MattingComments: Accepted to CVPR2023. The dataset, code and models are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [387] arXiv:2206.05158 [pdf, other]
-
Title: MEAT: Maneuver Extraction from Agent TrajectoriesComments: Accepted at IEEE Intelligent Vehicles Symposium (IV) 2022 2nd Workshop on Autonomy@ScaleSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [388] arXiv:2206.05159 [pdf, ps, other]
-
Title: An Image Processing Pipeline for Camera Trap Time-Lapse RecordingsComments: 5 pages, 2 figures, presented at the CV4Animals workshop of CVIP2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [389] arXiv:2206.05184 [pdf, other]
-
Title: SERE: Exploring Feature Self-relation for Self-supervised TransformerComments: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)Journal-ref: 10.1109/TPAMI.2023.3309979Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [390] arXiv:2206.05194 [pdf, other]
-
Title: Learning the Space of Deep ModelsComments: Accepted at ICPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [391] arXiv:2206.05225 [pdf, other]
-
Title: ClamNet: Using contrastive learning with variable depth Unets for medical image segmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [392] arXiv:2206.05252 [pdf, other]
-
Title: Lost in Transmission: On the Impact of Networking Corruptions on Video Machine Learning ModelsComments: 12 pages, 12 figures (with supplemental: 34 pages)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [393] arXiv:2206.05253 [pdf, other]
-
Title: Rethinking Spatial Invariance of Convolutional Networks for Object CountingComments: Accepted to CVPR 2022, Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP)
- [394] arXiv:2206.05257 [pdf, other]
-
Title: Explaining Image Classifiers Using Contrastive Counterfactuals in Generative Latent SpacesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [395] arXiv:2206.05259 [pdf, other]
-
Title: Is Self-Supervised Learning More Robust Than Supervised Learning?Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [396] arXiv:2206.05260 [pdf, other]
-
Title: Balanced Product of Calibrated Experts for Long-Tailed RecognitionComments: Accepted at CVPR 2023, 19 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [397] arXiv:2206.05275 [pdf, other]
-
Title: Spatial-temporal Concept based Explanation of 3D ConvNetsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [398] arXiv:2206.05281 [pdf, other]
-
Title: Less Is More: Linear Layers on CLIP Features as Powerful VizWiz ModelComments: VizWiz Grand Challenge: Describing Images and Videos Taken by Blind People (CVPR Workshop 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [399] arXiv:2206.05282 [pdf, other]
-
Title: Learning to Estimate Shapley Values with Vision TransformersComments: ICLR 2023 camera-readySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [400] arXiv:2206.05291 [pdf, other]
-
Title: ProActive: Self-Attentive Temporal Point Process Flows for Activity SequencesComments: KDD 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [401] arXiv:2206.05309 [pdf, ps, other]
-
Title: EigenFairing: 3D Model Fairing using Image CoherenceComments: British Machine Vision Conference, BMVC 2004, Kingston, UK, September 7-9, 2004Journal-ref: Proceedings of the British Machine Conference, pages 1-10, BMVA Press, September 2004Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [402] arXiv:2206.05319 [pdf, other]
-
Title: Object Instance Identification in Dynamic EnvironmentsComments: Joint 1st Ego4D and 10th EPIC Workshop (EPIC@CVPR2022) Extended AbstractSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [403] arXiv:2206.05375 [pdf, other]
-
Title: Generalizable Neural Radiance Fields for Novel View Synthesis with TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [404] arXiv:2206.05377 [pdf, other]
-
Title: Fast building segmentation from satellite imagery and few local labelsAuthors: Caleb Robinson, Anthony Ortiz, Hogeun Park, Nancy Lozano Gracia, Jon Kher Kaw, Tina Sederholm, Rahul Dodhia, Juan M. Lavista FerresComments: Accepted at EarthVision 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [405] arXiv:2206.05379 [pdf, other]
-
Title: A Benchmark for Compositional Visual ReasoningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [406] arXiv:2206.05390 [pdf, other]
-
Title: Transformer-based Self-Supervised Fish Segmentation in Underwater VideosComments: 11 pages, 6 figures. Submitted to the journal, International Journal of Intelligent SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [407] arXiv:2206.05394 [pdf, other]
-
Title: Applications of Deep Learning in Fish Habitat Monitoring: A Tutorial and SurveyComments: 26 pages, 7 figures. Submitted to the journal, Expert Systems With ApplicationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [408] arXiv:2206.05398 [pdf, other]
-
Title: E2PN: Efficient SE(3)-Equivariant Point NetworkComments: CVPR 2023, 16 pages. See this https URL for codeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [409] arXiv:2206.05420 [pdf, other]
-
Title: VAC2: Visual Analysis of Combined Causality in Event SequencesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [410] arXiv:2206.05422 [pdf, other]
-
Title: Access Control of Semantic Segmentation Models Using Encrypted Feature MapsSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [411] arXiv:2206.05424 [pdf, other]
-
Title: Precise Affordance Annotation for Egocentric Action Video DatasetsComments: Technical report for CVPR 2022 EPIC-Ego4D WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [412] arXiv:2206.05431 [pdf, other]
-
Title: Learned reconstruction methods with convergence guaranteesAuthors: Subhadip Mukherjee, Andreas Hauptmann, Ozan Öktem, Marcelo Pereyra, Carola-Bibiane SchönliebSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [413] arXiv:2206.05432 [pdf, ps, other]
-
Title: Luminance-Guided Chrominance Image Enhancement for HEVC Intra CodingComments: ISCAS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [414] arXiv:2206.05488 [pdf, ps, other]
-
Title: Kaggle Kinship Recognition Challenge: Introduction of Convolution-Free Model to boost conventionalSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [415] arXiv:2206.05496 [pdf, other]
-
Title: An Evaluation of OCR on Egocentric DataComments: Extended Abstract, EPIC workshop at CVPR 22Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [416] arXiv:2206.05498 [pdf, other]
-
Title: A Review of Causality for Learning Algorithms in Medical Image AnalysisComments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) this https URL". ; Paper ID: 2022:028Journal-ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); General Literature (cs.GL)
- [417] arXiv:2206.05514 [pdf, other]
-
Title: Toward Real-world Single Image Deraining: A New Benchmark and BeyondSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [418] arXiv:2206.05520 [pdf, other]
-
Title: A Two-stage Method for Non-extreme Value Salt-and-Pepper Noise RemovalComments: UESTC course projectSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [419] arXiv:2206.05539 [pdf, other]
-
Title: A Simplified Un-Supervised Learning Based Approach for Ink Mismatch Detection in Handwritten Hyper-Spectral Document ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [420] arXiv:2206.05542 [pdf, other]
-
Title: Surround-View Cameras based Holistic Visual Perception for Automated DrivingAuthors: Varun Ravi KumarComments: Doctoral thesisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [421] arXiv:2206.05617 [pdf, other]
-
Title: Federated Learning with Research Prototypes for Multi-Center MRI-based Detection of Prostate Cancer with Diverse HistopathologyAuthors: Abhejit Rajagopal, Ekaterina Redekop, Anil Kemisetti, Rushi Kulkarni, Steven Raman, Kirti Magudia, Corey W. Arnold, Peder E. Z. LarsonComments: under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Tissues and Organs (q-bio.TO)
- [422] arXiv:2206.05619 [pdf, other]
-
Title: Deep Learning Models for Automated Classification of Dog Emotional States from Facial ExpressionsAuthors: Tali Boneh-Shitrit, Shir Amir, Annika Bremhorst, Daniel S. Mills, Stefanie Riemer, Dror Fried, Anna ZamanskySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [423] arXiv:2206.05641 [pdf, ps, other]
-
Title: An Unsupervised Deep-Learning Method for Bone Age AssessmentSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [424] arXiv:2206.05648 [pdf, other]
-
Title: Indirect-Instant Attention Optimization for Crowd Counting in Dense ScenesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [425] arXiv:2206.05651 [pdf, other]
-
Title: STD-NET: Search of Image Steganalytic Deep-learning Architecture via Hierarchical Tensor DecompositionComments: Submitted to IEEE T-DSCSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [426] arXiv:2206.05683 [pdf, other]
-
Title: APT-36K: A Large-scale Benchmark for Animal Pose Estimation and TrackingComments: Neurips 2022 dataset and benchmark trackSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [427] arXiv:2206.05707 [pdf, other]
-
Title: DPCN++: Differentiable Phase Correlation Network for Versatile Pose RegistrationAuthors: Zexi Chen, Yiyi Liao, Haozhe Du, Haodong Zhang, Xuecheng Xu, Haojian Lu, Rong Xiong, Yue WangSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [428] arXiv:2206.05708 [pdf, other]
-
Title: Narrowing the Gap: Improved Detector Training with Noisy Location AnnotationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [429] arXiv:2206.05712 [pdf, other]
-
Title: Graph-based Spatial Transformer with Memory Replay for Multi-future Pedestrian Trajectory PredictionComments: This paper has been accepted by CVPR 2022. Reference: Li, L., Pagnucco, M. and Song, Y., 2022. Graph-Based Spatial Transformer With Memory Replay for Multi-Future Pedestrian Trajectory Prediction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 2231-2241)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [430] arXiv:2206.05717 [pdf, other]
-
Title: Crowd Localization from Gaussian Mixture Scoped Knowledge and Scoped TeacherComments: Accepted by IEEE TIPSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [431] arXiv:2206.05730 [pdf, other]
-
Title: Object Occlusion of Adding New Categories in Objection DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [432] arXiv:2206.05737 [pdf, other]
-
Title: SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse ViewsComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [433] arXiv:2206.05741 [pdf, other]
-
Title: Bootstrapping Multi-view Representations for Fake News DetectionComments: Authors are from Fudan University, China. Under ReviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [434] arXiv:2206.05763 [pdf, other]
-
Title: SeATrans: Learning Segmentation-Assisted diagnosis model via TransformerAuthors: Junde Wu, Huihui Fang, Fangxin Shang, Dalu Yang, Zhaowei Wang, Jing Gao, Yehui Yang, Yanwu XuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [435] arXiv:2206.05765 [pdf, other]
-
Title: A Semantic Consistency Feature Alignment Object Detection Model Based on Mixed-Class Distribution MetricsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [436] arXiv:2206.05810 [pdf, other]
-
Title: Analysis of Branch Specialization and its Application in Image DecompositionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
- [437] arXiv:2206.05833 [pdf, other]
-
Title: COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion RecognitionAuthors: Mani Kumar Tellamekala, Shahin Amiriparian, Björn W. Schuller, Elisabeth André, Timo Giesbrecht, Michel ValstarComments: Accepted to IEEE Transactions on Pattern Analysis and Machine IntelligenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
- [438] arXiv:2206.05836 [pdf, other]
-
Title: GLIPv2: Unifying Localization and Vision-Language UnderstandingAuthors: Haotian Zhang, Pengchuan Zhang, Xiaowei Hu, Yen-Chun Chen, Liunian Harold Li, Xiyang Dai, Lijuan Wang, Lu Yuan, Jenq-Neng Hwang, Jianfeng GaoComments: NeurIPS 2022; updated with reviewers' comments addressed; Code is released at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
- [439] arXiv:2206.05837 [pdf, other]
-
Title: NeuralODF: Learning Omnidirectional Distance Fields for 3D Shape RepresentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [440] arXiv:2206.05842 [pdf, ps, other]
-
Title: Efficiency Comparison of AI classification algorithms for Image Detection and Recognition in Real-timeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [441] arXiv:2206.05844 [pdf, other]
-
Title: FisheyeEX: Polar Outpainting for Extending the FoV of Fisheye LensSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [442] arXiv:2206.05846 [pdf, other]
-
Title: InBiaseD: Inductive Bias Distillation to Improve Generalization and Robustness through Shape-awarenessComments: Accepted at 1st Conference on Lifelong Learning Agents (CoLLAs 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [443] arXiv:2206.05853 [pdf, other]
-
Title: Modeling Generalized Specialist Approach To Train Quality Resilient Snapshot EnsembleSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [444] arXiv:2206.05866 [pdf, other]
-
Title: TC-SfM: Robust Track-Community-Based Structure-from-MotionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [445] arXiv:2206.05896 [pdf, other]
-
Title: Improve Ranking Correlation of Super-net through Training Scheme from One-shot NAS to Few-shot NASSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [446] arXiv:2206.05897 [pdf, other]
-
Title: $\texttt{GradICON}$: Approximate Diffeomorphisms via Gradient Inverse ConsistencyAuthors: Lin Tian, Hastings Greer, François-Xavier Vialard, Roland Kwitt, Raúl San José Estépar, Richard Jarrett Rushmore, Nikolaos Makris, Sylvain Bouix, Marc NiethammerComments: 29 pages, 16 figures, CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [447] arXiv:2206.05898 [pdf, other]
-
Title: Pixel to Binary Embedding Towards Robustness for CNNsComments: Accepted to ICPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [448] arXiv:2206.05903 [pdf, other]
-
Title: Geometrically Guided Integrated GradientsComments: 19 pages, 23 figures, funding sources addedSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [449] arXiv:2206.05912 [pdf, other]
-
Title: INDIGO: Intrinsic Multimodality for Domain GeneralizationAuthors: Puneet Mangla, Shivam Chandhok, Milan Aggarwal, Vineeth N Balasubramanian, Balaji KrishnamurthyComments: Under SubmissionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [450] arXiv:2206.05927 [pdf, other]
-
Title: LinK3D: Linear Keypoints Representation for 3D LiDAR Point CloudSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [451] arXiv:2206.05962 [pdf, other]
-
Title: PRO-TIP: Phantom for RObust automatic ultrasound calibration by TIP detectionAuthors: Matteo Ronchetti, Julia Rackerseder, Maria Tirindelli, Mehrdad Salehi, Nassir Navab, Wolfgang Wein, Oliver ZettinigComments: This preprint was submitted to MICCAI 2022. The Version of Record of this contribution will be published in Springer LNCSSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [452] arXiv:2206.05963 [pdf, ps, other]
-
Title: ATDN vSLAM: An all-through Deep Learning-Based Solution for Visual Simultaneous Localization and MappingComments: Published in Periodica Polytechnica Electrical Engineering 11 pagesJournal-ref: Periodica Polytechnica Electrical Engineering and Computer Science, 66(3), pp. 236-247, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [453] arXiv:2206.05967 [pdf, other]
-
Title: GoToNet: Fast Monocular Scene Exposure and ExplorationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [454] arXiv:2206.05970 [pdf, other]
-
Title: Hypernetwork-Based Adaptive Image RestorationComments: 5 pages, 5 Figures, ICASSP 2023Journal-ref: ICASSP 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [455] arXiv:2206.05981 [pdf, other]
-
Title: Efficient Human-in-the-loop System for Guiding DNNs AttentionComments: 13 pages, 11 figures, proceeding of ACM IUI 2023, video this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [456] arXiv:2206.05982 [pdf, other]
-
Title: Learning Fashion Compatibility from In-the-wild ImagesComments: Accepted to ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [457] arXiv:2206.06014 [pdf, other]
-
Title: Exploring and Exploiting Hubness Priors for High-Quality GAN Latent SamplingComments: Accepted at ICML 2022. Our code is available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [458] arXiv:2206.06023 [pdf, other]
-
Title: Virtual embeddings and self-consistency for self-supervised learningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [459] arXiv:2206.06067 [pdf, other]
-
Title: Better Teacher Better Student: Dynamic Prior Knowledge for Knowledge DistillationComments: ICLR'23 acceptedSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [460] arXiv:2206.06079 [pdf, other]
-
Title: OHM: GPU Based Occupancy Map GenerationComments: Under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [461] arXiv:2206.06100 [pdf, other]
-
Title: AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance FieldsAuthors: Takuhiro KanekoComments: Accepted to CVPR 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [462] arXiv:2206.06103 [pdf, other]
-
Title: Learning Feature Disentanglement and Dynamic Fusion for Recaptured Image ForensicComments: Accepted by CVPR2022 workshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [463] arXiv:2206.06119 [pdf, other]
-
Title: Satellite-based high-resolution maps of cocoa planted area for Côte d'Ivoire and GhanaAuthors: Nikolai Kalischek, Nico Lang, Cécile Renier, Rodrigo Caye Daudt, Thomas Addoah, William Thompson, Wilma J. Blaser-Hart, Rachael Garrett, Konrad Schindler, Jan D. WegnerSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [464] arXiv:2206.06120 [pdf, ps, other]
-
Title: Brain tumour segmentation with incomplete imaging dataComments: 26 pages, 8 figures, 4 supplementary tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Tissues and Organs (q-bio.TO)
- [465] arXiv:2206.06122 [pdf, other]
-
Title: Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuningAuthors: Yanpeng Sun, Qiang Chen, Xiangyu He, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jian Cheng, Zechao Li, Jingdong WangComments: Accepted to NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [466] arXiv:2206.06168 [pdf, other]
-
Title: 2nd Place Solution for ICCV 2021 VIPriors Image Classification Challenge: An Attract-and-Repulse Learning ApproachComments: 2nd Place Solution for ICCV 2021 VIPriors Image Classification ChallengeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [467] arXiv:2206.06177 [pdf, other]
-
Title: Transductive CLIP with Class-Conditional Contrastive LearningComments: Published in IEEE ICASSP 2022Journal-ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [468] arXiv:2206.06214 [pdf, other]
-
Title: Real-World Light Field Image Super-Resolution via Degradation ModulationComments: 15 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [469] arXiv:2206.06219 [pdf, other]
-
Title: Making Sense of Dependence: Efficient Black-box Explanations Using Dependence MeasureComments: Accepted to NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML); Other Statistics (stat.OT)
- [470] arXiv:2206.06252 [pdf, other]
-
Title: Transformer Lesion TrackerComments: Accepted MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [471] arXiv:2206.06258 [pdf, other]
-
Title: Featurized Query R-CNNComments: Tech ReportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [472] arXiv:2206.06289 [pdf, other]
-
Title: Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object ManipulationComments: Accepted by ICLR 2022 Workshop on Generalizable Policy Learning in Physical World. Top-performing systems for both no interaction and no restriction tracks in SAPIEN ManiSkill Challenge 2021. The source code and model are publicly available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Robotics (cs.RO)
- [473] arXiv:2206.06291 [pdf, other]
-
Title: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction DetectionComments: CVPR 2022; Code is publicly available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [474] arXiv:2206.06292 [pdf, other]
-
Title: MLP-3D: A MLP-like 3D Architecture with Grouped Time MixingComments: CVPR 2022; Code is publicly available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [475] arXiv:2206.06293 [pdf, other]
-
Title: Learning Domain Adaptive Object Detection with Probabilistic TeacherAuthors: Meilin Chen, Weijie Chen, Shicai Yang, Jie Song, Xinchao Wang, Lei Zhang, Yunfeng Yan, Donglian Qi, Yueting Zhuang, Di Xie, Shiliang PuComments: To appear in ICML 2022. Code is coming soon: this https URLJournal-ref: International Conference on Machine Learning (ICML), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [476] arXiv:2206.06323 [pdf, other]
-
Title: Visual Transformer for Object DetectionAuthors: Michael YangComments: In preparation for short paper of conferences. I am using the name Michael YangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [477] arXiv:2206.06340 [pdf, other]
-
Title: SNeS: Learning Probably Symmetric Neural Surfaces from Incomplete DataComments: First two authors contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [478] arXiv:2206.06346 [pdf, ps, other]
-
Title: Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object TokensAuthors: Elad Ben-Avraham, Roei Herzig, Karttikeya Mangalam, Amir Bar, Anna Rohrbach, Leonid Karlinsky, Trevor Darrell, Amir GlobersonComments: Tech reportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [479] arXiv:2206.06359 [pdf, other]
-
Title: EnergyMatch: Energy-based Pseudo-Labeling for Semi-Supervised LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [480] arXiv:2206.06360 [pdf, other]
-
Title: ARF: Artistic Radiance FieldsComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [481] arXiv:2206.06363 [pdf, other]
-
Title: Discovering Object Masks with Transformers for Unsupervised Semantic SegmentationComments: Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [482] arXiv:2206.06404 [pdf, other]
-
Title: Compositional Mixture Representations for Vision and TextComments: Workshop on Learning with Limited Labelled Data for Image and Video Understanding (L3D-IVU), CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [483] arXiv:2206.06420 [pdf, other]
-
Title: GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose EstimationComments: Open SourcedSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [484] arXiv:2206.06427 [pdf, other]
-
Title: A Multi-purpose Realistic Haze Benchmark with Quantifiable Haze Levels and Ground TruthAuthors: Priya Narayanan, Xin Hu, Zhenyu Wu, Matthew D Thielke, John G Rogers, Andre V Harrison, John A D'Agostino, James D Brown, Long P Quang, James R Uplinger, Heesung Kwon, Zhangyang WangComments: This paper has been ACCEPTED for publication as a REGULAR paper in the IEEE Transactions on Image Processing (TIP)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [485] arXiv:2206.06430 [pdf, ps, other]
-
Title: A Training Method For VideoPose3D With Ideology of Action RecognitionAuthors: Hao BaiComments: Published by IEEE, on conference CONF-SPMLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [486] arXiv:2206.06435 [pdf, ps, other]
-
Title: ICP Algorithm: Theory, Practice And Its SLAM-oriented TaxonomyAuthors: Hao BaiComments: Accepted by CONF-CDS'22Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [487] arXiv:2206.06461 [pdf, other]
-
Title: Self-Supervised Representation Learning With MUlti-Segmental Informational Coding (MUSIC)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [488] arXiv:2206.06466 [pdf, other]
-
Title: Revisiting the Shape-Bias of Deep Learning for Dermoscopic Skin Lesion ClassificationAuthors: Adriano Lucieri, Fabian Schmeisser, Christoph Peter Balada, Shoaib Ahmed Siddiqui, Andreas Dengel, Sheraz AhmedComments: Submitted preprint accepted for MIUA 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [489] arXiv:2206.06481 [pdf, other]
-
Title: RigNeRF: Fully Controllable Neural 3D PortraitsComments: The project page can be found here: this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [490] arXiv:2206.06484 [pdf, other]
-
Title: On Image Segmentation With Noisy Labels: Characterization and Volume Properties of the Optimal Solutions to Accuracy and DiceSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [491] arXiv:2206.06487 [pdf, other]
-
Title: The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge DistillationComments: Accepted by ICLR 2023 (top-5%). The first three authors contribute equally. Project website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [492] arXiv:2206.06488 [pdf, other]
-
Title: Multimodal Learning with Transformers: A SurveyComments: This paper is accepted by IEEE TPAMISubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [493] arXiv:2206.06490 [pdf, other]
-
Title: Learning Task-Independent Game State Representations from Unlabeled ImagesComments: Conference on Games (CoG) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [494] arXiv:2206.06506 [pdf, other]
-
Title: Spiking Neural Networks for Frame-based and Event-based Single Object LocalizationComments: 21 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [495] arXiv:2206.06510 [pdf, other]
-
Title: Generalizable Method for Face Anti-Spoofing with Semi-Supervised LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [496] arXiv:2206.06518 [pdf, other]
-
Title: Estimating Pose from Pressure Data for Smart Beds with Deep Image-based Pose EstimatorsComments: The version of record of this article, first published in Applied Intelligence, is available online at Publisher's website this https URL arXiv admin note: substantial text overlap with arXiv:1908.08919Journal-ref: Applied Intelligence (2021): 1-15Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [497] arXiv:2206.06533 [pdf, other]
-
Title: 3D scene reconstruction from monocular spherical video with motion parallaxAuthors: Kenji TanakaComments: 13 pages, 18 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
- [498] arXiv:2206.06544 [pdf, ps, other]
-
Title: A Survey of Automated Data Augmentation Algorithms for Deep Learning-based Image Classification TasksComments: 68 pages, 9 figures. Submitted to Knowledge and Information Systems (KAIS)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [499] arXiv:2206.06607 [pdf, other]
-
Title: Plug-and-Play Pseudo Label Correction Network for Unsupervised Person Re-identificationComments: 19 pages,9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [500] arXiv:2206.06608 [pdf, other]
-
Title: Label Matching Semi-Supervised Object DetectionAuthors: Binbin Chen, Weijie Chen, Shicai Yang, Yunyi Xuan, Jie Song, Di Xie, Shiliang Pu, Mingli Song, Yueting ZhuangComments: To appear in CVPR 2022. Code is coming soon: this https URLJournal-ref: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [501] arXiv:2206.06619 [pdf, other]
-
Title: TransVG++: End-to-End Visual Grounding with Language Conditioned Vision TransformerAuthors: Jiajun Deng, Zhengyuan Yang, Daqing Liu, Tianlang Chen, Wengang Zhou, Yanyong Zhang, Houqiang Li, Wanli OuyangComments: arXiv admin note: text overlap with arXiv:2104.08541Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [502] arXiv:2206.06620 [pdf, other]
-
Title: Slimmable Domain AdaptationAuthors: Rang Meng, Weijie Chen, Shicai Yang, Jie Song, Luojun Lin, Di Xie, Shiliang Pu, Xinchao Wang, Mingli Song, Yueting ZhuangComments: To appear in CVPR 2022. Code is coming soon: this https URLJournal-ref: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [503] arXiv:2206.06637 [pdf, other]
-
Title: RF-Next: Efficient Receptive Field Search for Convolutional Neural NetworksComments: Accepted by TPAMI. This paper is a journal extension of our CVPR 2021 paper (arXiv:2101.00910)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [504] arXiv:2206.06640 [pdf, other]
-
Title: Confidence Score for Source-Free Unsupervised Domain AdaptationComments: ICML 2022 camera readySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [505] arXiv:2206.06665 [pdf, other]
-
Title: Online Easy Example Mining for Weakly-supervised Gland Segmentation from Histology ImagesComments: MICCAI 2022 AccepetedSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [506] arXiv:2206.06694 [pdf, other]
-
Title: ISLES 2022: A multi-center magnetic resonance imaging stroke lesion segmentation datasetAuthors: Moritz Roman Hernandez Petzsche, Ezequiel de la Rosa, Uta Hanning, Roland Wiest, Waldo Enrique Valenzuela Pinilla, Mauricio Reyes, Maria Ines Meyer, Sook-Lei Liew, Florian Kofler, Ivan Ezhov, David Robben, Alexander Hutton, Tassilo Friedrich, Teresa Zarth, Johannes Bürkle, The Anh Baran, Bjoern Menze, Gabriel Broocks, Lukas Meyer, Claus Zimmer, Tobias Boeckh-Behrens, Maria Berndt, Benno Ikenberg, Benedikt Wiestler, Jan S. KirschkeComments: 12 pages, 2 figuresJournal-ref: Scientific data 9.1 (2022): 762Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [507] arXiv:2206.06712 [pdf, other]
-
Title: Visual Radial Basis Q-NetworkComments: This paper has been accepted for publication at the 3rd International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2022. \c{opyright}Springer Nature 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [508] arXiv:2206.06714 [pdf, other]
-
Title: Interpretable Gait Recognition by Granger CausalityComments: Preprint. Full paper accepted at the IEEE/IAPR International Conference on Pattern Recognition (ICPR), Montreal, Canada, August 2022. 7 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [509] arXiv:2206.06715 [pdf, other]
-
Title: Semi-signed prioritized neural fitting for surface reconstruction from unoriented point cloudsAuthors: Runsong Zhu, Di Kang, Ka-Hei Hui, Yue Qian, Xuefei Zhe, Zhen Dong, Linchao Bao, Pheng-Ann Heng, Chi-Wing FuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [510] arXiv:2206.06731 [pdf, ps, other]
-
Title: Learning Dense Features for Point Cloud Registration Using a Graph Attention NetworkComments: 15 pages, 3 figuresJournal-ref: Applied Sciences 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [511] arXiv:2206.06741 [pdf, other]
-
Title: Recurrent Transformer Variational Autoencoders for Multi-Action Motion SynthesisComments: accepted at Transformers for Vision workshop at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [512] arXiv:2206.06743 [pdf, other]
-
Title: Weakly-Supervised Crack DetectionComments: Submitted to IEEE Transactions on Intelligent Transportation SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [513] arXiv:2206.06761 [pdf, other]
-
Title: Exploring Adversarial Attacks and Defenses in Vision Transformers trained with DINOComments: ICML 2022 Workshop paper accepted at AdvML FrontiersSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [514] arXiv:2206.06801 [pdf, other]
-
Title: Peripheral Vision TransformerComments: Accepted to NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [515] arXiv:2206.06803 [pdf, other]
-
Title: Asymmetric Dual-Decoder U-Net for Joint Rain and Haze RemovalComments: 12 pages, 35 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [516] arXiv:2206.06829 [pdf, other]
-
Title: Efficient Decoder-free Object Detection with TransformersAuthors: Peixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua ShenComments: Update metadata, 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [517] arXiv:2206.06922 [pdf, other]
-
Title: Object Scene Representation TransformerAuthors: Mehdi S. M. Sajjadi, Daniel Duckworth, Aravindh Mahendran, Sjoerd van Steenkiste, Filip Pavetić, Mario Lučić, Leonidas J. Guibas, Klaus Greff, Thomas KipfComments: Accepted at NeurIPS '22. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [518] arXiv:2206.06923 [pdf, ps, other]
-
Title: A Multi-task Framework for Infrared Small Target Detection and SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [519] arXiv:2206.06930 [pdf, other]
-
Title: Comprehending and Ordering Semantics for Image CaptioningComments: CVPR 2022; Code is publicly available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
- [520] arXiv:2206.06931 [pdf, other]
-
Title: Stand-Alone Inter-Frame Attention in Video ModelsComments: CVPR 2022; Code is publicly available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [521] arXiv:2206.06948 [pdf, other]
-
Title: Monitoring Urban Forests from Auto-Generated Segmentation MapsComments: accepted for presentation and publication at IGARSS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
- [522] arXiv:2206.06959 [pdf, other]
-
Title: AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled DataComments: CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [523] arXiv:2206.07011 [pdf, other]
-
Title: Consistent Video Instance Segmentation with Inter-Frame Recurrent AttentionComments: 11 pages, 5 figures, 4 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [524] arXiv:2206.07018 [pdf, other]
-
Title: Turning a Curse into a Blessing: Enabling In-Distribution-Data-Free Backdoor Removal via Stabilized Model InversionAuthors: Si Chen, Yi Zeng, Jiachen T.Wang, Won Park, Xun Chen, Lingjuan Lyu, Zhuoqing Mao, Ruoxi JiaComments: Because of an equation and author informational error, this paper has been withdrawn by the submitterSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [525] arXiv:2206.07028 [pdf, other]
-
Title: Learning 3D Object Shape and Layout without 3D SupervisionComments: CVPR 2022, project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [526] arXiv:2206.07036 [pdf, other]
-
Title: Accurate 3D Body Shape Regression using Metric and Semantic AttributesAuthors: Vasileios Choutas, Lea Muller, Chun-Hao P. Huang, Siyu Tang, Dimitrios Tzionas, Michael J. BlackComments: First two authors contributed equallyJournal-ref: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [527] arXiv:2206.07038 [pdf, other]
-
Title: AnimeSR: Learning Real-World Super-Resolution Models for Animation VideosComments: NeurIPS 2022. Codes and models are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [528] arXiv:2206.07045 [pdf, other]
-
Title: ReCo: Retrieve and Co-segment for Zero-shot TransferComments: Tech report. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [529] arXiv:2206.07047 [pdf, other]
-
Title: RGB-Multispectral Matching: Dataset, Learning Methodology, EvaluationAuthors: Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di StefanoComments: CVPR 2022, New Orleans. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [530] arXiv:2206.07117 [pdf, other]
-
Title: TriHorn-Net: A Model for Accurate Depth-Based 3D Hand Pose EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [531] arXiv:2206.07125 [pdf, other]
-
Title: Self-Supervised Pretraining for Differentially Private LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [532] arXiv:2206.07160 [pdf, other]
-
Title: LAVENDER: Unifying Video-Language Understanding as Masked Language ModelingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [533] arXiv:2206.07162 [pdf, other]
-
Title: Category-Agnostic 6D Pose Estimation with Conditional Neural ProcessesComments: Accepted at CVPR2022 workshop: Women in Computer Vision (WiCV)Journal-ref: CVPR2022 workshop: Women in Computer Vision (WiCV)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [534] arXiv:2206.07163 [pdf, other]
-
Title: DeepRecon: Joint 2D Cardiac Segmentation and 3D Volume Reconstruction via A Structure-Specific Generative MethodAuthors: Qi Chang, Zhennan Yan, Mu Zhou, Di Liu, Khalid Sawalha, Meng Ye, Qilong Zhangli, Mikael Kanski, Subhi Al Aref, Leon Axel, Dimitris MetaxasComments: MICCAI2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [535] arXiv:2206.07171 [pdf, other]
-
Title: Segmentation in large-scale cellular electron microscopy with deep learning: A literature surveySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [536] arXiv:2206.07198 [pdf, other]
-
Title: Surgical Phase Recognition in Laparoscopic CholecystectomyAuthors: Yunfan Li, Vinayak Shenoy, Prateek Prasanna, I.V. Ramakrishnan, Haibin Ling, Himanshu GuptaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [537] arXiv:2206.07207 [pdf, other]
-
Title: Beyond Grounding: Extracting Fine-Grained Event Hierarchies Across ModalitiesAuthors: Hammad A. Ayyubi, Christopher Thomas, Lovish Chum, Rahul Lokesh, Long Chen, Yulei Niu, Xudong Lin, Xuande Feng, Jaywon Koo, Sounak Ray, Shih-Fu ChangComments: AAAI 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [538] arXiv:2206.07240 [pdf, other]
-
Title: Test-Time Adaptation for Visual Document UnderstandingComments: Accepted at TMLR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [539] arXiv:2206.07255 [pdf, other]
-
Title: GRAM-HD: 3D-Consistent Image Generation at High Resolution with Generative Radiance ManifoldsComments: ICCV2023 camera ready version (more results and method comparisons). Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [540] arXiv:2206.07259 [pdf, other]
-
Title: Self-Supervised Learning of Image Scale and OrientationComments: Presented in BMVC 2021, code is available on this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [541] arXiv:2206.07267 [pdf, other]
-
Title: Rethinking Generalization in Few-Shot ClassificationComments: Accepted at NeurIPS 2022. Code available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [542] arXiv:2206.07272 [pdf, ps, other]
-
Title: Machine vision for vial positioning detection toward the safe automation of material synthesisAuthors: Leslie Ching Ow Tiong, Hyuk Jun Yoo, Na Yeon Kim, Kwan-Young Lee, Sang Soo Han, Donghun KimSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [543] arXiv:2206.07282 [pdf, other]
-
Title: Human Eyes Inspired Recurrent Neural Networks are More Robust Against Adversarial NoisesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [544] arXiv:2206.07298 [pdf, other]
-
Title: S$^2$-FPN: Scale-ware Strip Attention Guided Feature Pyramid Network for Real-time Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [545] arXiv:2206.07307 [pdf, other]
-
Title: VCT: A Video Compression TransformerAuthors: Fabian Mentzer, George Toderici, David Minnen, Sung-Jin Hwang, Sergi Caelles, Mario Lucic, Eirikur AgustssonComments: NeurIPS'22 Camera Ready Version. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [546] arXiv:2206.07326 [pdf, other]
-
Title: Recent Advances in Scene Image Representation and ClassificationComments: This paper is under review in Multimedia Tools and Applications (Springer) journal. This article may be deleted or updated based on the policies of the journalJournal-ref: Multimedia Tools and Applications, 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [547] arXiv:2206.07344 [pdf, other]
-
Title: Automatic Detection of Rice Disease in Images of Various Leaf SizesAuthors: Kantip Kiratiratanapruk, Pitchayagan Temniranrat, Wasin Sinthupinyo, Sanparith Marukatat, Sujin PatarapuwadolComments: 28 pages, 13 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [548] arXiv:2206.07348 [pdf, ps, other]
-
Title: Unsupervised multi-branch Capsule for Hyperspectral and LiDAR classificationComments: 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [549] arXiv:2206.07349 [pdf, other]
-
Title: XMorpher: Full Transformer for Deformable Medical Image Registration via Cross AttentionAuthors: Jiacheng Shi, Yuting He, Youyong Kong, Jean-Louis Coatrieux, Huazhong Shu, Guanyu Yang, Shuo LiComments: accepted by MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [550] arXiv:2206.07352 [pdf, ps, other]
-
Title: Robust SAR ATR on MSTAR with Deep Learning Models trained on Full Synthetic MOCEM dataSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Image and Video Processing (eess.IV)
- [551] arXiv:2206.07372 [pdf, other]
- [552] arXiv:2206.07389 [pdf, other]
-
Title: Ultra Fast Deep Lane Detection with Hybrid Anchor Driven Ordinal ClassificationComments: TPAMI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [553] arXiv:2206.07394 [pdf, other]
-
Title: Efficient Adaptive Ensembling for Image ClassificationJournal-ref: Expert Systems (2023)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [554] arXiv:2206.07423 [pdf, other]
-
Title: Zero-shot object goal visual navigationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [555] arXiv:2206.07431 [pdf, other]
-
Title: Physically-admissible polarimetric data augmentation for road-scene analysisAuthors: Cyprien Ruffino, Rachel Blin, Samia Ainouz, Gilles Gasso, Romain Hérault, Fabrice Meriaudeau, Stéphane CanuSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [556] arXiv:2206.07434 [pdf, other]
-
Title: Self-Supervised Implicit Attention: Guided Attention by The Model ItselfSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [557] arXiv:2206.07435 [pdf, other]
-
Title: Forecasting of depth and ego-motion with transformers and self-supervisionComments: Accepted in ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [558] arXiv:2206.07458 [pdf, other]
-
Title: VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature SelectionComments: Accepted by ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [559] arXiv:2206.07459 [pdf, other]
-
Title: READ: Aggregating Reconstruction Error into Out-of-distribution DetectionComments: Accepted to AAAI 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [560] arXiv:2206.07460 [pdf, other]
- [561] arXiv:2206.07468 [pdf, ps, other]
-
Title: PolyU-BPCoMa: A Dataset and Benchmark Towards Mobile Colorized Mapping Using a Backpack Multisensorial SystemComments: 11 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [562] arXiv:2206.07510 [pdf, other]
-
Title: Deep Multi-Task Networks For Occluded Pedestrian Pose EstimationAuthors: Arindam Das, Sudip Das, Ganesh Sistu, Jonathan Horgan, Ujjwal Bhattacharya, Edward Jones, Martin Glavin, Ciarán EisingComments: 4 pages, 5 tables, 2 figuresJournal-ref: Proceedings of the 2022 Irish Machine Vision and Image Processing ConferenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [563] arXiv:2206.07557 [pdf, other]
-
Title: How to Reduce Change Detection to Semantic SegmentationComments: Accepted by Pattern Recognition. Code is at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [564] arXiv:2206.07565 [pdf, other]
-
Title: A Meta-Analysis of Distributionally-Robust ModelsComments: To be presented at ICML Workshop on Principles of Distribution Shift 2022. Copyright 2022 by the author(s)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [565] arXiv:2206.07578 [src]
-
Title: E2V-SDE: From Asynchronous Events to Fast and Continuous Video Reconstruction via Neural Stochastic Differential EquationsComments: arXiv admin note: This submission has been withdrawn by arXiv administrators due to inappropriate text overlap with external sources. Additional information at this https URLJournal-ref: The IEEE / CVF Computer Vision and Pattern Recognition Conference 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [566] arXiv:2206.07580 [pdf, other]
-
Title: Evaluating object detector ensembles for improving the robustness of artifact detection in endoscopic video streamsAuthors: Pedro Esteban Chavarrias-Solano, Carlos Axel Garcia-Vega, Francisco Javier Lopez-Tiro, Gilberto Ochoa-Ruiz, Thomas Bazin, Dominique Lamarque, Christian DaulSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [567] arXiv:2206.07634 [pdf, other]
-
Title: Real3D-Aug: Point Cloud Augmentation by Placing Real Objects with Occlusion Handling for 3D Detection and SegmentationComments: Submitted on 15th June 2022 to IEEE RA-L journalJournal-ref: Computer Vision Winter Workshop 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [568] arXiv:2206.07643 [pdf, other]
-
Title: Coarse-to-Fine Vision-Language Pre-training with Fusion in the BackboneAuthors: Zi-Yi Dou, Aishwarya Kamath, Zhe Gan, Pengchuan Zhang, Jianfeng Wang, Linjie Li, Zicheng Liu, Ce Liu, Yann LeCun, Nanyun Peng, Jianfeng Gao, Lijuan WangComments: NeurIPS 2022. Project Website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [569] arXiv:2206.07662 [pdf, other]
-
Title: SP-ViT: Learning 2D Spatial Priors for Vision TransformersAuthors: Yuxuan Zhou, Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, Lei Zhang, Margret Keuper, Xiansheng HuaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [570] arXiv:2206.07669 [pdf, other]
-
Title: A Unified Sequence Interface for Vision TasksComments: The first three authors contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [571] arXiv:2206.07684 [pdf, other]
-
Title: AVATAR: Unconstrained Audiovisual Speech RecognitionAuthors: Valentin Gabeur, Paul Hongsuck Seo, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia SchmidSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [572] arXiv:2206.07687 [pdf, other]
-
Title: Structured Sparsity Learning for Efficient Video Super-ResolutionComments: Accepted by CVPR2023, code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [573] arXiv:2206.07689 [pdf, other]
-
Title: Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022Authors: Elad Ben-Avraham, Roei Herzig, Karttikeya Mangalam, Amir Bar, Anna Rohrbach, Leonid Karlinsky, Trevor Darrell, Amir GlobersonComments: Ego4D CVPR22 Object State Localization challenge. arXiv admin note: substantial text overlap with arXiv:2206.06346Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [574] arXiv:2206.07690 [pdf, other]
-
Title: ELUDE: Generating interpretable explanations via a decomposition into labelled and unlabelled featuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [575] arXiv:2206.07692 [pdf, other]
-
Title: A Simple Data Mixing Prior for Improving Self-Supervised LearningComments: CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [576] arXiv:2206.07695 [pdf, other]
-
Title: VoxGRAF: Fast 3D-Aware Image Synthesis with Sparse Voxel GridsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [577] arXiv:2206.07696 [pdf, other]
-
Title: Diffusion Models for Video Prediction and InfillingComments: Published in TMLR (11/2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [578] arXiv:2206.07698 [pdf, other]
-
Title: Neural Deformable Voxel Grid for Fast Optimization of Dynamic View SynthesisComments: Technical Report: 29 pages; project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [579] arXiv:2206.07699 [pdf, other]
-
Title: Write and Paint: Generative Vision-Language Models are Unified Modal LearnersComments: ICLR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [580] arXiv:2206.07700 [pdf, other]
-
Title: Masked Siamese ConvNetsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [581] arXiv:2206.07704 [pdf, other]
-
Title: Waymo Open Dataset: Panoramic Video Panoptic SegmentationAuthors: Jieru Mei, Alex Zihao Zhu, Xinchen Yan, Hang Yan, Siyuan Qiao, Yukun Zhu, Liang-Chieh Chen, Henrik Kretzschmar, Dragomir AnguelovComments: Our dataset can be found at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [582] arXiv:2206.07705 [pdf, other]
-
Title: LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D DetectionComments: Find the primary metrics for the 2022 Waymo Open Dataset 3D Camera-Only Detection Challenge at this https URL . Find the code at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [583] arXiv:2206.07706 [pdf, other]
-
Title: Masked Frequency Modeling for Self-Supervised Visual Pre-TrainingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [584] arXiv:2206.07707 [pdf, other]
-
Title: Variable Bitrate Neural FieldsAuthors: Towaki Takikawa, Alex Evans, Jonathan Tremblay, Thomas Müller, Morgan McGuire, Alec Jacobson, Sanja FidlerComments: SIGGRAPH 2022. Project Page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
- [585] arXiv:2206.07710 [pdf, other]
-
Title: PlanarRecon: Real-time 3D Plane Detection and Reconstruction from Posed Monocular VideosComments: CVPR 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [586] arXiv:2206.07764 [pdf, other]
-
Title: SAVi++: Towards End-to-End Object-Centric Learning from Real-World VideosAuthors: Gamaleldin F. Elsayed, Aravindh Mahendran, Sjoerd van Steenkiste, Klaus Greff, Michael C. Mozer, Thomas KipfComments: Project page at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [587] arXiv:2206.07771 [pdf, other]
-
Title: Discrete Contrastive Diffusion for Cross-Modal Music and Image GenerationComments: ICLR 2023. Project at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [588] arXiv:2206.07802 [pdf, other]
-
Title: Improving generalization by mimicking the human visual dietSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
- [589] arXiv:2206.07835 [pdf, other]
-
Title: Disentangling visual and written concepts in CLIPSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [590] arXiv:2206.07846 [pdf, ps, other]
-
Title: Action Spotting using Dense Detection Anchors Revisited: Submission to the SoccerNet Challenge 2022Comments: v2: a few more experiments, more detailed method descriptionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [591] arXiv:2206.07850 [pdf, other]
-
Title: HF-NeuS: Improved Surface Reconstruction Using High-Frequency DetailsComments: To appear in NeurIPS 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [592] arXiv:2206.07893 [pdf, other]
-
Title: PeQuENet: Perceptual Quality Enhancement of Compressed Video with Adaptation- and Attention-based NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
- [593] arXiv:2206.07897 [pdf, other]
-
Title: NCAGC: A Neighborhood Contrast Framework for Attributed Graph ClusteringJournal-ref: Neurocomputing, 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [594] arXiv:2206.07932 [pdf, other]
-
Title: Lifelong Wandering: A realistic few-shot online continual learning settingComments: CVPR 2022 Workshop on Continual LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [595] arXiv:2206.07934 [pdf, other]
-
Title: BANet: Motion Forecasting with Boundary Aware NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [596] arXiv:2206.07953 [pdf, other]
-
Title: Analysis and Extensions of Adversarial Training for Video ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [597] arXiv:2206.07959 [pdf, other]
-
Title: Simple-BEV: What Really Matters for Multi-Sensor BEV Perception?Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [598] arXiv:2206.07967 [pdf, other]
-
Title: DreamNet: A Deep Riemannian Network based on SPD Manifold Learning for Visual ClassificationComments: 9 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [599] arXiv:2206.07981 [pdf, other]
-
Title: Multi-scale Cooperative Multimodal Transformers for Multimodal Sentiment Analysis in VideosSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [600] arXiv:2206.07986 [pdf, other]
-
Title: Image Captioning based on Feature Refinement and Reflective DecodingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [601] arXiv:2206.07990 [pdf, other]
-
Title: Patch-level Representation Learning for Self-supervised Vision TransformersComments: Accepted to CVPR 2022 (Oral). Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [602] arXiv:2206.07994 [pdf, other]
-
Title: Joint Class-Affinity Loss Correction for Robust Medical Image Segmentation with Noisy LabelsComments: Accepted to MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [603] arXiv:2206.08009 [pdf, other]
-
Title: Balancing Discriminability and Transferability for Source-Free Domain AdaptationAuthors: Jogendra Nath Kundu, Akshay Kulkarni, Suvaansh Bhambri, Deepesh Mehta, Shreyas Kulkarni, Varun Jampani, R. Venkatesh BabuComments: ICML 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [604] arXiv:2206.08016 [pdf, other]
-
Title: Backbones-Review: Feature Extraction Networks for Deep Learning and Deep Reinforcement Learning ApproachesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [605] arXiv:2206.08026 [pdf, other]
-
Title: DeepFormableTag: End-to-end Generation and Recognition of Deformable Fiducial MarkersJournal-ref: ACM Transactions on Graphics 40, 4, Article 67 (August 2021)Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [606] arXiv:2206.08083 [pdf, other]
-
Title: CARLANE: A Lane Detection Benchmark for Unsupervised Domain Adaptation from Simulation to multiple Real-World DomainsComments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks, 22 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [607] arXiv:2206.08084 [pdf, other]
-
Title: An Improved Normed-Deformable Convolution for Crowd CountingJournal-ref: IEEE Signal Processing Letters 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [608] arXiv:2206.08105 [pdf, other]
-
Title: A Simple Baseline for Adversarial Domain Adaptation-based Unsupervised Flood ForecastingComments: Technical reportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [609] arXiv:2206.08126 [pdf, other]
-
Title: Channel Importance Matters in Few-Shot Image ClassificationComments: Accepted to ICML 2022; code available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [610] arXiv:2206.08129 [pdf, other]
-
Title: Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong BaselineComments: Accepted at NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [611] arXiv:2206.08150 [pdf, other]
-
Title: Self-Adaptive Label Augmentation for Semi-supervised Few-shot ClassificationComments: 9 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [612] arXiv:2206.08155 [pdf, other]
-
Title: Zero-Shot Video Question Answering via Frozen Bidirectional Language ModelsComments: NeurIPS 2022 Camera-Ready; Project Webpage: this https URL; 25 pages; 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [613] arXiv:2206.08158 [pdf, other]
-
Title: Volumetric Supervised Contrastive Learning for Seismic Semantic SegmentationJournal-ref: The International Meeting for Applied Geoscience & Energy (IMAGE) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
- [614] arXiv:2206.08171 [pdf, other]
-
Title: K-Radar: 4D Radar Object Detection for Autonomous Driving in Various Weather ConditionsComments: Accepted at NeurIPS 2022 Datasets and Benchmarks TrackJournal-ref: Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS Datasets and Benchmarks 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [615] arXiv:2206.08172 [pdf, other]
-
Title: RefCrowd: Grounding the Target in Crowd with Referring ExpressionsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [616] arXiv:2206.08176 [pdf, other]
-
Title: Level 2 Autonomous Driving on a Single Device: Diving into the Devils of OpenpilotAuthors: Li Chen, Tutian Tang, Zhitian Cai, Yang Li, Penghao Wu, Hongyang Li, Jianping Shi, Junchi Yan, Yu QiaoComments: Tech report. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [617] arXiv:2206.08182 [pdf, other]
-
Title: Nucleus Segmentation and Analysis in Breast Cancer with the MIScnn FrameworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [618] arXiv:2206.08186 [pdf, other]
-
Title: Asymptotic Soft Cluster Pruning for Deep Neural NetworksSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [619] arXiv:2206.08194 [pdf, other]
-
Title: Online Segmentation of LiDAR Sequences: Dataset and AlgorithmComments: Code and data are available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [620] arXiv:2206.08206 [pdf, other]
-
Title: Selective Multi-Scale Learning for Object DetectionComments: Accepted by ICANN2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [621] arXiv:2206.08219 [pdf, other]
-
Title: HaGRID - HAnd Gesture Recognition Image DatasetAuthors: Alexander Kapitanov, Karina Kvanchiani, Alexander Nagaev, Roman Kraynov, Andrei MakhliarchukComments: 12 pages, 5 figures, open-source dataset for computer visionJournal-ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024) 4572-4581Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [622] arXiv:2206.08222 [pdf, other]
-
Title: Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking ConsistencySubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [623] arXiv:2206.08224 [pdf, other]
-
Title: Multi scale Feature Extraction and Fusion for Online Knowledge DistillationComments: 12 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [624] arXiv:2206.08227 [pdf, other]
-
Title: Delving into the Scale Variance Problem in Object DetectionComments: Accepted by ICTAI2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [625] arXiv:2206.08229 [pdf, other]
-
Title: Open-Set Recognition with Gradient-Based RepresentationsComments: Published at IEEE International Conference on Image Processing (ICIP) 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [626] arXiv:2206.08236 [pdf, other]
-
Title: Simple and Efficient Architectures for Semantic SegmentationAuthors: Dushyant Mehta, Andrii Skliar, Haitam Ben Yahia, Shubhankar Borse, Fatih Porikli, Amirhossein Habibian, Tijmen BlankevoortComments: To be presented at Efficient Deep Learning for Computer Vision Workshop at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [627] arXiv:2206.08275 [pdf, other]
-
Title: Rank the triplets: A ranking-based multiple instance learning framework for detecting HPV infection in head and neck cancers using routine H&E imagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [628] arXiv:2206.08304 [pdf, other]
-
Title: Adversarial Patch Attacks and Defences in Vision-Based Tasks: A SurveyComments: A. Sharma and Y. Bian share equal contributionSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [629] arXiv:2206.08339 [pdf, other]
-
Title: iBoot: Image-bootstrapped Self-Supervised Video Representation LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [630] arXiv:2206.08343 [pdf, other]
-
Title: Realistic One-shot Mesh-based Head AvatarsSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [631] arXiv:2206.08345 [pdf, ps, other]
-
Title: Real-World Single Image Super-Resolution Under Rainy ConditionAuthors: Mohammad Shahab UddinSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [632] arXiv:2206.08347 [pdf, other]
-
Title: Beyond Supervised vs. Unsupervised: Representative Benchmarking and Analysis of Image Representation LearningComments: CVPR 2022, project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [633] arXiv:2206.08355 [pdf, other]
-
Title: FWD: Real-time Novel View Synthesis with Forward Warping and DepthComments: CVPR 2022. Project website this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [634] arXiv:2206.08356 [pdf, other]
-
Title: OmniMAE: Single Model Masked Pretraining on Images and VideosAuthors: Rohit Girdhar, Alaaeldin El-Nouby, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan MisraComments: CVPR 2023. Code/models: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [635] arXiv:2206.08357 [pdf, other]
-
Title: Spatially-Adaptive Multilayer Selection for GAN Inversion and EditingSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [636] arXiv:2206.08358 [pdf, other]
-
Title: MixGen: A New Multi-Modal Data AugmentationComments: First three authors contributed equally. Code are available at this https URL Oral presentation at WACV 2023 Pretraining Large Vision and Multimodal Models WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [637] arXiv:2206.08361 [pdf, other]
-
Title: Controllable 3D Face Synthesis with Conditional Generative Occupancy FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [638] arXiv:2206.08362 [pdf, other]
-
Title: Unified Fourier-based Kernel and Nonlinearity Design for Equivariant Networks on Homogeneous SpacesComments: Accepted at ICML2022 Thirty-ninth International Conference on Machine LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [639] arXiv:2206.08365 [pdf, other]
-
Title: Virtual Correspondence: Humans as a Cue for Extreme-View GeometryComments: CVPR 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [640] arXiv:2206.08367 [pdf, other]
-
Title: SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain AdaptationAuthors: Tao Sun, Mattia Segu, Janis Postels, Yuxuan Wang, Luc Van Gool, Bernt Schiele, Federico Tombari, Fisher YuComments: Published at IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [641] arXiv:2206.08368 [pdf, other]
-
Title: Unbiased 4D: Monocular 4D Reconstruction with a Neural Deformation ModelComments: 26 pages, 17 figures, 8 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [642] arXiv:2206.08405 [pdf, ps, other]
-
Title: Going Deeper than Tracking: a Survey of Computer-Vision Based Recognition of Animal Pain and Affective StatesAuthors: Sofia Broomé, Marcelo Feighelstein, Anna Zamansky, Gabriel Carreira Lencioni, Pia Haubro Andersen, Francisca Pessanha, Marwa Mahmoud, Hedvig Kjellström, Albert Ali SalahSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [643] arXiv:2206.08423 [pdf, other]
-
Title: IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering in Indoor ScenesComments: CVPR 22 camera ready version with supplementarySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [644] arXiv:2206.08427 [pdf, other]
-
Title: SATBench: Benchmarking the speed-accuracy tradeoff in object recognition by humans and dynamic neural networksAuthors: Ajay Subramanian, Sara Price, Omkar Kumbhar, Elena Sizikova, Najib J. Majaj, Denis G. PelliComments: 19 pages, 12 figures. Under Review at NeurIPS Datasets and Benchmarks Track 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [645] arXiv:2206.08428 [pdf, other]
-
Title: EyeNeRF: A Hybrid Representation for Photorealistic Synthesis, Animation and Relighting of Human EyesAuthors: Gengyan Li (1 and 2), Abhimitra Meka (1), Franziska Müller (1), Marcel C. Bühler (2), Otmar Hilliges (2), Thabo Beeler (1) ((1) Google Inc., (2) ETH Zürich)Comments: 16 pages, 16 figures, 1 table, to be published in ACM Transactions on Graphics (TOG) (Volume: 41, Issue: 4), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [646] arXiv:2206.08429 [pdf, other]
-
Title: Scalable Temporal Localization of Sensitive Activities in Movies and TV EpisodesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [647] arXiv:2206.08460 [pdf, other]
-
Title: TUSK: Task-Agnostic Unsupervised KeypointsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [648] arXiv:2206.08462 [pdf, other]
-
Title: Recursive Neural Programs: Variational Learning of Image Grammars and Part-Whole HierarchiesComments: 9 pages, 6 figures. fixed LaTeX typo for algorithm referenceSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [649] arXiv:2206.08477 [pdf, other]
-
Title: Backdoor Attacks on Vision TransformersAuthors: Akshayvarun Subramanya, Aniruddha Saha, Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Hamed PirsiavashSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [650] arXiv:2206.08488 [pdf, other]
-
Title: Controllable Image EnhancementSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [651] arXiv:2206.08500 [pdf, other]
-
Title: What do navigation agents learn about their environment?Comments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [652] arXiv:2206.08509 [pdf, other]
-
Title: Neural Architecture Adaptation for Object Detection by Searching Channel Dimensions and Mapping Pre-trained ParametersComments: Accepted to ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [653] arXiv:2206.08524 [pdf, other]
-
Title: CDNet: Contrastive Disentangled Network for Fine-Grained Image Categorization of Ocular B-Scan UltrasoundAuthors: Ruilong Dan, Yunxiang Li, Yijie Wang, Gangyong Jia, Ruiquan Ge, Juan Ye, Qun Jin, Yaqi WangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [654] arXiv:2206.08537 [pdf, ps, other]
-
Title: Large-Margin Representation Learning for Texture ClassificationAuthors: Jonathan de Matos, Luiz Eduardo Soares de Oliveira, Alceu de Souza Britto Junior, Alessandro Lameiras KoerichComments: 7 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [655] arXiv:2206.08547 [pdf, other]
-
Title: Texture Generation Using A Graph Generative Adversarial Network And Differentiable RenderingComments: The final publication is available at Springer via this http URLJournal-ref: Springer.13836.(2023)388-401Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [656] arXiv:2206.08549 [pdf, other]
-
Title: Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [657] arXiv:2206.08566 [pdf, other]
-
Title: Active Data Discovery: Mining Unknown Data using Submodular Information MeasuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [658] arXiv:2206.08567 [pdf, other]
-
Title: Rectify ViT Shortcut Learning by Visual SaliencyAuthors: Chong Ma, Lin Zhao, Yuzhong Chen, David Weizhong Liu, Xi Jiang, Tuo Zhang, Xintao Hu, Dinggang Shen, Dajiang Zhu, Tianming LiuComments: NeurIPS2022 Under ReviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [659] arXiv:2206.08568 [pdf, other]
-
Title: Multi-Contextual Predictions with Vision Transformer for Video Anomaly DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [660] arXiv:2206.08572 [pdf, other]
-
Title: Enhanced Bi-directional Motion Estimation for Video Frame InterpolationComments: Accepted by WACV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [661] arXiv:2206.08585 [pdf, other]
-
Title: HairFIT: Pose-Invariant Hairstyle Transfer via Flow-based Hair Alignment and Semantic-Region-Aware InpaintingAuthors: Chaeyeon Chung, Taewoo Kim, Hyelin Nam, Seunghwan Choi, Gyojung Gu, Sunghyun Park, Jaegul ChooComments: BMVC 2021 Oral PresentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [662] arXiv:2206.08605 [pdf, ps, other]
-
Title: On Efficient Real-Time Semantic Segmentation: A SurveyComments: 19 pages, 13 figures, 4 tables This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [663] arXiv:2206.08610 [pdf, other]
-
Title: Masked Autoencoders for Generic Event Boundary Detection CVPR'2022 Kinetics-GEBD ChallengeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [664] arXiv:2206.08614 [pdf, other]
-
Title: Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic AssessmentComments: Accepted to NeurIPS Track on Datasets and Benchmarks 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [665] arXiv:2206.08632 [pdf, other]
-
Title: Learning Using Privileged Information for Zero-Shot Action RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [666] arXiv:2206.08638 [pdf, ps, other]
- [667] arXiv:2206.08640 [pdf, other]
-
Title: Uncertainty-aware Evaluation of Time-Series Classification for Online Handwriting Recognition with Domain ShiftAuthors: Andreas Klaß, Sven M. Lorenz, Martin W. Lauer-Schmaltz, David Rügamer, Bernd Bischl, Christopher Mutschler, Felix OttSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [668] arXiv:2206.08641 [pdf, other]
-
Title: Diverse Multiple Trajectory Prediction Using a Two-stage Prediction Network Trained with Lane LossComments: RA-L acceptedJournal-ref: IEEE Robotics and Automation Letters (2022)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [669] arXiv:2206.08645 [pdf, other]
-
Title: Local Slot Attention for Vision-and-Language NavigationComments: ICMR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [670] arXiv:2206.08655 [pdf, other]
-
Title: Learning Implicit Feature Alignment Function for Semantic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [671] arXiv:2206.08657 [pdf, other]
-
Title: BridgeTower: Building Bridges Between Encoders in Vision-Language Representation LearningComments: Accepted by AAAI 2023, OralSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [672] arXiv:2206.08683 [pdf, other]
-
Title: AggNet: Learning to Aggregate Faces for Group Membership VerificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [673] arXiv:2206.08701 [pdf, ps, other]
-
Title: Towards Real-Time Visual Tracking with Graded Color-names FeaturesComments: 12 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [674] arXiv:2206.08712 [pdf, other]
-
Title: An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping FunctionsComments: Accepted to RAL2022, code at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [675] arXiv:2206.08748 [pdf, ps, other]
-
Title: ReViSe: Remote Vital Signs Measurement Using Smartphone CameraSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
- [676] arXiv:2206.08749 [pdf, other]
-
Title: From a few Accurate 2D Correspondences to 3D Point CloudsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [677] arXiv:2206.08751 [pdf, other]
-
Title: Perceptual Quality Assessment of Virtual Reality Videos in the WildComments: Accepted by IEEE Transactions on Circuits and Systems for Video TechnologySubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [678] arXiv:2206.08778 [pdf, other]
-
Title: CTooth: A Fully Annotated 3D Dataset and Benchmark for Tooth Volume Segmentation on Cone Beam Computed Tomography ImagesAuthors: Weiwei Cui, Yaqi Wang, Qianni Zhang, Huiyu Zhou, Dan Song, Xingyong Zuo, Gangyong Jia, Liaoyuan ZengSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [679] arXiv:2206.08789 [pdf, ps, other]
-
Title: Reconstructing vehicles from orthographic drawings using deep neural networksAuthors: Robin KlippertComments: 9 PagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [680] arXiv:2206.08791 [pdf, other]
-
Title: DU-Net based Unsupervised Contrastive Learning for Cancer Segmentation in Histology ImagesComments: arXiv admin note: text overlap with arXiv:2002.05709 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [681] arXiv:2206.08792 [pdf, other]
-
Title: FD-CAM: Improving Faithfulness and Discriminability of Visual Explanation for CNNsComments: Accepted by ICPR 2022 and also accepted by CVPR 2022 Explainable Artificial Intelligence for Computer Vision (XAI4CV) WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [682] arXiv:2206.08794 [pdf, other]
-
Title: The Importance of Background Information for Out of Distribution GeneralizationComments: 6 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [683] arXiv:2206.08801 [pdf, other]
-
Title: Video Shadow Detection via Spatio-Temporal Interpolation Consistency TrainingAuthors: Xiao Lu, Yihong Cao, Sheng Liu, Chengjiang Long, Zipei Chen, Xuanyu Zhou, Yimin Yang, Chunxia XiaoComments: Accepted in CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [684] arXiv:2206.08833 [pdf, ps, other]
-
Title: A Comparative Study of Confidence Calibration in Deep Learning: From Computer Vision to Medical ImagingAuthors: Riqiang Gao, Thomas Li, Yucheng Tang, Zhoubing Xu, Michael Kammer, Sanja L. Antic, Kim Sandler, Fabien Moldonado, Thomas A. Lasko, Bennett LandmanComments: 17 pages, 6 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [685] arXiv:2206.08861 [pdf, other]
-
Title: DGMIL: Distribution Guided Multiple Instance Learning for Whole Slide Image ClassificationComments: accepted by MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [686] arXiv:2206.08880 [pdf, other]
-
Title: Improving Generalization of Metric Learning via Listwise Self-distillationComments: 11 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [687] arXiv:2206.08883 [pdf, other]
-
Title: CtrlFormer: Learning Transferable State Representation for Visual Control via TransformerComments: ICML 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [688] arXiv:2206.08898 [pdf, other]
-
Title: SimA: Simple Softmax-free Attention for Vision TransformersComments: Code is available here: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [689] arXiv:2206.08903 [pdf, other]
-
Title: Colonoscopy 3D Video Dataset with Paired Depth from 2D-3D RegistrationAuthors: Taylor L. Bobrow, Mayank Golhar, Rohan Vijayan, Venkata S. Akshintala, Juan R. Garcia, Nicholas J. DurrSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [690] arXiv:2206.08916 [pdf, other]
-
Title: Unified-IO: A Unified Model for Vision, Language, and Multi-Modal TasksSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [691] arXiv:2206.08919 [pdf, other]
-
Title: VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMixSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [692] arXiv:2206.08920 [pdf, other]
-
Title: VectorMapNet: End-to-end Vectorized HD Map LearningComments: Accepted by ICML 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [693] arXiv:2206.08927 [pdf, other]
-
Title: Cross-task Attention Mechanism for Dense Multi-task LearningComments: 10 figures, 6 tables, 23 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [694] arXiv:2206.08929 [pdf, other]
-
Title: TAVA: Template-free Animatable Volumetric ActorsAuthors: Ruilong Li, Julian Tanke, Minh Vo, Michael Zollhofer, Jurgen Gall, Angjoo Kanazawa, Christoph LassnerSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [695] arXiv:2206.08948 [pdf, other]
-
Title: CMT-DeepLab: Clustering Mask Transformers for Panoptic SegmentationAuthors: Qihang Yu, Huiyu Wang, Dahun Kim, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh ChenComments: CVPR 2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [696] arXiv:2206.08954 [pdf, other]
-
Title: Bag of Image Patch Embedding Behind the Success of Self-Supervised LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [697] arXiv:2206.08970 [pdf, other]
-
Title: MultiEarth 2022 -- The Champion Solution for the Matrix Completion Challenge via Multimodal Regression and GenerationComments: CVPR 2022, MultiEarth 2022, Matrix Completion ChallengeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [698] arXiv:2206.08977 [pdf, ps, other]
-
Title: BN-HTRd: A Benchmark Dataset for Document Level Offline Bangla Handwritten Text Recognition (HTR) and Line SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [699] arXiv:2206.08990 [pdf, other]
-
Title: Shadows Shed Light on 3D ObjectsComments: 19 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [700] arXiv:2206.09027 [pdf, other]
-
Title: Landscape Learning for Neural Network InversionComments: 15 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [701] arXiv:2206.09038 [pdf, other]
-
Title: Validation of Vector Data using Oblique ImagesComments: In Proceedings of 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM GIS'08)Journal-ref: Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM GIS '08), pp. 1-10. 2008Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [702] arXiv:2206.09055 [src]
-
Title: Augmented Imagefication: A Data-driven Fault Detection Method for Aircraft Air Data SensorsComments: a crucial design defect to acquire flying data by simulationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [703] arXiv:2206.09061 [pdf, other]
-
Title: Design of Supervision-Scalable Learning Systems: Methodology and Performance BenchmarkingComments: 16 pages, 12 figures, 4 tables, under consideration at Pattern RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [704] arXiv:2206.09068 [pdf, other]
-
Title: Attention-based Dynamic Subspace Learners for Medical Image AnalysisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [705] arXiv:2206.09071 [pdf, other]
-
Title: Analysis & Computational Complexity Reduction of Monocular and Stereo Depth Estimation TechniquesSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [706] arXiv:2206.09082 [pdf, other]
-
Title: Context-aware Proposal Network for Temporal Action DetectionComments: First place winning solution for temporal action detection task in CVPR-2022 AcitivityNet Challenge. arXiv admin note: substantial text overlap with arXiv:2106.11812Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [707] arXiv:2206.09089 [pdf, ps, other]
-
Title: A Dynamic Data Driven Approach for Explainable Scene UnderstandingComments: Unpublished draft of book chapterSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [708] arXiv:2206.09106 [pdf, other]
-
Title: Embodied Scene-aware Human Pose EstimationComments: NeurIPS 2022. Project website: this https URL Zhengyi Luo and Shun Iwase contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [709] arXiv:2206.09111 [pdf, other]
-
Title: VReBERT: A Simple and Flexible Transformer for Visual Relationship DetectionComments: Published at International Conference on Pattern Recognition (ICPR) 2022, Montreal QuebecSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [710] arXiv:2206.09114 [pdf, other]
-
Title: Bear the Query in Mind: Visual Grounding with Query-conditioned ConvolutionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [711] arXiv:2206.09132 [pdf, other]
-
Title: Replacing Labeled Real-image Datasets with Auto-generated ContoursAuthors: Hirokatsu Kataoka, Ryo Hayamizu, Ryosuke Yamada, Kodai Nakashima, Sora Takashima, Xinyu Zhang, Edgar Josafat Martinez-Noriega, Nakamasa Inoue, Rio YokotaComments: Accepted to CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [712] arXiv:2206.09148 [pdf, other]
-
Title: Deep Compatible Learning for Partially-Supervised Medical Image SegmentationComments: 16 pages, 13 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [713] arXiv:2206.09178 [pdf, other]
-
Title: REVECA -- Rich Encoder-decoder framework for Video Event CAptionerComments: The IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR). LOng-form VidEo Understanding (LOVEU) workshopSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [714] arXiv:2206.09191 [pdf, other]
-
Title: Gender Artifacts in Visual DatasetsComments: ICCV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [715] arXiv:2206.09202 [pdf, other]
-
Title: Camera Adaptation for Fundus-Image-Based CVD Risk EstimationComments: This preprint has not undergone peer review (when applicable) or any post-submission improvements or corrections. The Version of Record of this contribution will be added soonSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [716] arXiv:2206.09221 [pdf, ps, other]
-
Title: 3D Face Parsing via Surface Parameterization and 2D Semantic Segmentation NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [717] arXiv:2206.09242 [pdf, other]
-
Title: GaLeNet: Multimodal Learning for Disaster Prediction, Management and ReliefAuthors: Rohit Saha, Mengyi Fang, Angeline Yasodhara, Kyryl Truskovskyi, Azin Asgarian, Daniel Homola, Raahil Shah, Frederik Dieleman, Jack Weatheritt, Thomas RogersComments: Accepted to CVPR 2022 Workshop on Multimodal Learning for Earth and EnvironmentSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [718] arXiv:2206.09243 [pdf, other]
-
Title: Structured Light with Redundancy CodesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [719] arXiv:2206.09244 [pdf, other]
-
Title: GAN2X: Non-Lambertian Inverse Rendering of Image GANsComments: Accepted to 3DV 2022. The video demo is available at the project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [720] arXiv:2206.09256 [pdf, other]
-
Title: Multistream Gaze Estimation with Anatomical Eye Region Isolation by Synthetic to Real Transfer LearningComments: 15 pages, 7 figures, 14 tables. This work has been accepted to the IEEE Transactions on Artificial Intelligence $\copyright$ 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other usesJournal-ref: IEEE Transactions on Artificial Intelligence, 2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [721] arXiv:2206.09265 [pdf, ps, other]
-
Title: SAViR-T: Spatially Attentive Visual Reasoning with TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [722] arXiv:2206.09293 [pdf, other]
-
Title: Rethinking Bayesian Deep Learning Methods for Semi-Supervised Volumetric Medical Image SegmentationComments: To appear at CVPR 2022, and the supplementary material can be found at the official site. The source codes are at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [723] arXiv:2206.09325 [pdf, other]
-
Title: EATFormer: Improving Vision Transformer Inspired by Evolutionary AlgorithmSubjects: Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET)
- [724] arXiv:2206.09358 [pdf, other]
-
Title: What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text InputsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [725] arXiv:2206.09362 [src]
-
Title: Towards Generalizable Person Re-identification with a Bi-stream Generative ModelComments: There is a mistake of equation 1Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [726] arXiv:2206.09365 [pdf, other]
-
Title: Semi-supervised Change Detection of Small Water Bodies Using RGB and Multispectral Images in Peruvian RainforestsAuthors: Kangning Cui, Seda Camalan, Ruoning Li, Victor P. Pauca, Sarra Alqahtani, Robert J. Plemmons, Miles Silman, Evan N. Dethier, David Lutz, Raymond H. ChanComments: 8 pages, 5 figures. Accepted to Proceedings of IEEE WHISPERS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
- [727] arXiv:2206.09372 [pdf, other]
-
Title: mvHOTA: A multi-view higher order tracking accuracy metric to measure spatial and temporal associations in multi-point detectionAuthors: Lalith Sharan, Halvar Kelm, Gabriele Romano, Matthias Karck, Raffaele De Simone, Sandy EngelhardtComments: 16 pages, 9 figuresJournal-ref: Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization (2022) 1-9Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [728] arXiv:2206.09410 [pdf, other]
-
Title: Low-Mid Adversarial Perturbation against Unauthorized Face Recognition SystemComments: published in Information SciencesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [729] arXiv:2206.09414 [pdf, other]
-
Title: Terrain Classification using Transfer Learning on Hyperspectral Images: A Comparative studySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [730] arXiv:2206.09420 [pdf, other]
-
Title: Agricultural Plantation Classification using Transfer Learning Approach based on CNNAuthors: Uphar Singh, Tushar Musale, Ranjana Vyas, O.P.Vyas (Indian Institute of Information Technology, Allahabad, India)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [731] arXiv:2206.09474 [pdf, other]
-
Title: 3D Object Detection for Autonomous Driving: A Comprehensive SurveyComments: Accepted to International Journal of Computer Vision (IJCV). Project page is at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [732] arXiv:2206.09479 [pdf, other]
-
Title: StudioGAN: A Taxonomy and Benchmark of GANs for Image SynthesisComments: 32 pages, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, 2023)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [733] arXiv:2206.09485 [pdf, other]
-
Title: Video frame interpolation for high dynamic range sequences captured with dual-exposure sensorsComments: 13 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [734] arXiv:2206.09500 [pdf, other]
-
Title: Unbiased Teacher v2: Semi-supervised Object Detection for Anchor-free and Anchor-based DetectorsComments: Project Page is at this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [735] arXiv:2206.09504 [pdf, other]
-
Title: A Parallel Implementation of Computing Mean Average PrecisionAuthors: Beinan WangSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [736] arXiv:2206.09509 [pdf, ps, other]
-
Title: Hybrid Facial Expression Recognition (FER2013) Model for Real-Time Emotion Classification and PredictionComments: 8 Pages, 8 Figures, 5 TablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Robotics (cs.RO)
- [737] arXiv:2206.09541 [pdf, other]
-
Title: DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited AnnotationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [738] arXiv:2206.09548 [pdf, other]
-
Title: Variational Distillation for Multi-View LearningAuthors: Xudong Tian, Zhizhong Zhang, Cong Wang, Wensheng Zhang, Yanyun Qu, Lizhuang Ma, Zongze Wu, Yuan Xie, Dacheng TaoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [739] arXiv:2206.09552 [pdf, other]
-
Title: Dynamic Message Propagation Network for RGB-D Salient Object DetectionComments: 12 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [740] arXiv:2206.09553 [pdf, other]
-
Title: Capturing and Inferring Dense Full-Body Human-Scene ContactAuthors: Chun-Hao P. Huang, Hongwei Yi, Markus Höschle, Matvey Safroshkin, Tsvetelina Alexiadis, Senya Polikovsky, Daniel Scharstein, Michael J. BlackComments: CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [741] arXiv:2206.09554 [pdf, other]
-
Title: Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic SegmentationComments: TMM2022, 11 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [742] arXiv:2206.09564 [pdf, other]
-
Title: A Novel Long-term Iterative Mining Scheme for Video Salient Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [743] arXiv:2206.09575 [pdf, other]
-
Title: C-SENN: Contrastive Self-Explaining Neural NetworkComments: 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [744] arXiv:2206.09581 [pdf, ps, other]
-
Title: Explicit and implicit models in infrared and visible image fusionComments: 8 pages, 5 figures, 2 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [745] arXiv:2206.09585 [pdf, other]
-
Title: 5th Place Solution for YouTube-VOS Challenge 2022: Video Object SegmentationComments: 5th Place Solution for Video Object Segmentation in the 4th Large-scale Video Object Segmentation Challenge, CVPR 2022 WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [746] arXiv:2206.09592 [pdf, other]
-
Title: DALL-E for Detection: Language-driven Compositional Image Synthesis for Object DetectionComments: v3(same as v2) version, update structure (add foreground generation, stable diffusion), add more experimentsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [747] arXiv:2206.09596 [pdf, other]
-
Title: Efficient and Flexible Sublabel-Accurate Energy MinimizationComments: To be published at ICPR 2022, Copyright 2022 IEEESubjects: Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
- [748] arXiv:2206.09597 [pdf, other]
-
Title: Winning the CVPR'2022 AQTC Challenge: A Two-stage Function-centric ApproachSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [749] arXiv:2206.09604 [pdf, other]
-
Title: Distortion-Aware Network Pruning and Feature Reuse for Real-time Video SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [750] arXiv:2206.09664 [pdf, other]
-
Title: What Can be Seen is What You Get: Structure Aware Point Cloud AugmentationComments: Published in IEEE IV 2022Journal-ref: 33rd IEEE Intelligent Vehicles Symposium, Aachen, Germany, June 5th - June 9th 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [751] arXiv:2206.09667 [pdf, other]
-
Title: MSANet: Multi-Similarity and Attention Guidance for Boosting Few-Shot SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [752] arXiv:2206.09683 [pdf, other]
-
Title: Distribution Regularized Self-Supervised Learning for Domain Adaptation of Semantic SegmentationComments: Accepted for publication at Image and Vision ComputingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [753] arXiv:2206.09731 [pdf, other]
-
Title: Semantic Labeling of High Resolution Images Using EfficientUNets and TransformersSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [754] arXiv:2206.09736 [pdf, other]
-
Title: Geo-NI: Geometry-aware Neural Interpolation for Light Field RenderingComments: 13 pages, 8 figures, 4 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [755] arXiv:2206.09742 [pdf, ps, other]
-
Title: Developing a Free and Open-source Automated Building Exterior Crack Inspection Software for Construction and Facility ManagersSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [756] arXiv:2206.09753 [pdf, other]
-
Title: Visualizing and Understanding Contrastive LearningComments: Accepted to IEEE Transactions on Image ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [757] arXiv:2206.09756 [pdf, other]
-
Title: Time Gated Convolutional Neural Networks for Crop ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [758] arXiv:2206.09769 [pdf, other]
-
Title: Test-time image-to-image translation ensembling improves out-of-distribution generalization in histopathologyComments: Accepted at MICCAI2022 ConferenceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [759] arXiv:2206.09770 [pdf, other]
-
Title: Real-time Full-stack Traffic Scene Perception for Autonomous Driving with Roadside CamerasAuthors: Zhengxia Zou, Rusheng Zhang, Shengyin Shen, Gaurav Pandey, Punarjay Chakravarty, Armin Parchami, Henry X. LiuComments: This paper is accepted and presented in ICRA 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [760] arXiv:2206.09796 [pdf, other]
-
Title: Knowledge Distillation for Oriented Object Detection on Aerial ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [761] arXiv:2206.09806 [pdf, other]
-
Title: Self-Supervised Consistent Quantization for Fully Unsupervised Image RetrievalComments: 10 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [762] arXiv:2206.09842 [pdf, other]
-
Title: Practical Deepfake Detection: Vulnerabilities in Global ContextsComments: 6 pages, 6 figures, presented as a workshop paper at Responsible AI @ ICLR 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [763] arXiv:2206.09843 [pdf, other]
-
Title: Contextual Squeeze-and-Excitation for Efficient Few-Shot Image ClassificationAuthors: Massimiliano Patacchiola, John Bronskill, Aliaksandra Shysheya, Katja Hofmann, Sebastian Nowozin, Richard E. TurnerComments: Advances in Neural Information Processing Systems (NeurIPS 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [764] arXiv:2206.09852 [pdf, other]
-
Title: M&M Mix: A Multimodal Multiview Transformer EnsembleComments: Technical report for Epic-Kitchens challenge 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [765] arXiv:2206.09853 [pdf, other]
-
Title: DisCoVQA: Temporal Distortion-Content Transformers for Video Quality AssessmentSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [766] arXiv:2206.09885 [pdf, other]
-
Title: KOLOMVERSE: KRISO open large-scale image dataset for object detection in the maritime universeComments: 13 Pages, 12 figures, submitted to NeurIPS 2022 Datasets and Benchmarks Track (Under Review)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [767] arXiv:2206.09900 [pdf, other]
-
Title: Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy AutoencodersComments: Accepted by TIVSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [768] arXiv:2206.09907 [pdf, other]
-
Title: ORFD: A Dataset and Benchmark for Off-Road Freespace DetectionComments: Accepted by ICRA2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [769] arXiv:2206.09959 [pdf, other]
-
Title: Global Context Vision TransformersComments: Accepted to ICML 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [770] arXiv:2206.10033 [pdf, other]
-
Title: Test Time Transform Prediction for Open Set Histopathological Image RecognitionAuthors: Adrian Galdran, Katherine J. Hewitt, Narmin L. Ghaffari, Jakob N. Kather, Gustavo Carneiro, Miguel A. González BallesterComments: Accepted to MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [771] arXiv:2206.10041 [pdf, other]
-
Title: MPA: MultiPath++ Based Architecture for Motion PredictionAuthors: Stepan KonevComments: CVPR 2022, Workshop on Autonomous DrivingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [772] arXiv:2206.10059 [pdf, other]
-
Title: Bypass Network for Semantics Driven Image Paragraph CaptioningComments: Under consideration at Computer Vision and Image UnderstandingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [773] arXiv:2206.10066 [pdf, other]
-
Title: RendNet: Unified 2D/3D Recognizer With Latent Space RenderingComments: CVPR 2022 OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [774] arXiv:2206.10075 [pdf, other]
-
Title: Counting Varying Density Crowds Through Density Guided Adaptive Selection CNN and Transformer EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [775] arXiv:2206.10080 [pdf, other]
-
Title: One-stage Action Detection TransformerSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [776] arXiv:2206.10082 [pdf, other]
-
Title: Optimally Controllable Perceptual Lossy CompressionComments: ICML 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [777] arXiv:2206.10090 [pdf, other]
-
Title: KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D CorrespondencesJournal-ref: Transaction on Circuits and Systems for Video Technology,2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [778] arXiv:2206.10092 [pdf, other]
-
Title: BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object DetectionAuthors: Yinhao Li, Zheng Ge, Guanyi Yu, Jinrong Yang, Zengran Wang, Yukang Shi, Jianjian Sun, Zeming LiComments: Accepted by AAAI2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [779] arXiv:2206.10095 [pdf, other]
-
Title: Pyramid Region-based Slot Attention Network for Temporal Action Proposal GenerationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [780] arXiv:2206.10096 [pdf, ps, other]
-
Title: Transformers Improve Breast Cancer Diagnosis from Unregistered Multi-View MammogramsAuthors: Xuxin Chen, Ke Zhang, Neman Abdoli, Patrik W. Gilley, Ximin Wang, Hong Liu, Bin Zheng, Yuchen QiuSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
- [781] arXiv:2206.10098 [pdf, other]
-
Title: Reconstruct from BEV: A 3D Lane Detection Approach based on Geometry Structure PriorComments: Proceedings of the CVPR 2022 Workshop of Autonomous DrivingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [782] arXiv:2206.10107 [pdf, other]
-
Title: Sensitivity of Average Precision to Bounding Box PerturbationsAuthors: Ali BorjiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [783] arXiv:2206.10118 [pdf, other]
-
Title: HOPE: Hierarchical Spatial-temporal Network for Occupancy Flow PredictionAuthors: Yihan Hu, Wenxin Shao, Bo Jiang, Jiajie Chen, Siqi Chai, Zhening Yang, Jingyu Qian, Helong Zhou, Qiang LiuComments: 1st Ranking Solution for the Occupancy and Flow Prediction of the Waymo Open Dataset Challenges 2022 (this http URL)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [784] arXiv:2206.10129 [pdf, other]
-
Title: Automatic Concept Extraction for Concept Bottleneck-based Video ClassificationAuthors: Jeya Vikranth Jeyakumar, Luke Dickens, Luis Garcia, Yu-Hsi Cheng, Diego Ramirez Echavarria, Joseph Noor, Alessandra Russo, Lance Kaplan, Erik Blasch, Mani SrivastavaComments: 10 pages, Appendix: 2 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [785] arXiv:2206.10131 [pdf, other]
-
Title: An Integrated Representation & Compression Scheme Based on Convolutional Autoencoders with 4D DCT Perceptual Encoding for High Dynamic Range Light FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [786] arXiv:2206.10137 [pdf, other]
-
Title: Few-Max: Few-Shot Domain Adaptation for Unsupervised Contrastive Representation LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [787] arXiv:2206.10145 [pdf, other]
- [788] arXiv:2206.10146 [pdf, other]
-
Title: KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute ParsingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [789] arXiv:2206.10155 [pdf, other]
-
Title: Review Neural Networks about Image Transformation Based on IGC Learning Framework with Annotated InformationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [790] arXiv:2206.10157 [pdf, other]
-
Title: Probing Visual-Audio Representation for Video Highlight Detection via Hard-Pairs Guided Contrastive LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [791] arXiv:2206.10177 [pdf, other]
-
Title: TCJA-SNN: Temporal-Channel Joint Attention for Spiking Neural NetworksComments: Accepted by IEEE Transactions on Neural Networks and Learning SystemsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [792] arXiv:2206.10186 [pdf, other]
-
Title: Improving Localization for Semi-Supervised Object DetectionJournal-ref: International Conference on Image Analysis and Processing. Springer, Cham, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [793] arXiv:2206.10192 [pdf, other]
-
Title: LDD: A Dataset for Grape Diseases Object Detection and Instance SegmentationJournal-ref: International Conference on Image Analysis and Processing. Springer, Cham, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [794] arXiv:2206.10207 [pdf, other]
-
Title: SemMAE: Semantic-Guided Masking for Learning Masked AutoencodersComments: Accepted by NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [795] arXiv:2206.10213 [pdf, other]
-
Title: Rethinking Unsupervised Neural Superpixel SegmentationComments: ICIP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [796] arXiv:2206.10225 [pdf, other]
-
Title: Broken News: Making Newspapers Accessible to Print-ImpairedJournal-ref: Extended Abstract at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [797] arXiv:2206.10241 [pdf, other]
-
Title: Deep Active Latent Surfaces for Medical GeometriesComments: 14 pages, 9 figures, submitted for reviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [798] arXiv:2206.10253 [pdf, other]
-
Title: Document Navigability: A Need for Print-ImpairedComments: Published at Accessibility, Vision, and Autonomy Meet, CVPR 2022 WorkshopJournal-ref: Extended Abstract for Poster Session at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [799] arXiv:2206.10254 [pdf, other]
-
Title: Towards Optimizing OCR for AccessibilityJournal-ref: Extended Abstract for Poster Session at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [800] arXiv:2206.10263 [pdf, other]
-
Title: Object Structural Points Representation for Graph-based Semantic Monocular Localization and MappingComments: submitted to IROS 2015 (rejected)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [801] arXiv:2206.10324 [pdf, other]
- [802] arXiv:2206.10329 [pdf, other]
-
Title: SVG Vector Font Generation for Chinese Characters with TransformerComments: Accepted to ICIP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [803] arXiv:2206.10360 [pdf, other]
-
Title: Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal LossComments: 5 pages, 3 figures; Accepted to ICIP2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [804] arXiv:2206.10375 [pdf, other]
-
Title: MEStereo-Du2CNN: A Novel Dual Channel CNN for Learning Robust Depth Estimates from Multi-exposure Stereo Images for HDR 3D ApplicationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [805] arXiv:2206.10411 [pdf, other]
-
Title: Audio-video fusion strategies for active speaker detection in meetingsAuthors: Lionel Pibre, Francisco Madrigal, Cyrille Equoy, Frédéric Lerasle, Thomas Pellegrini, Julien Pinquier, Isabelle FerranéSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [806] arXiv:2206.10436 [pdf, other]
-
Title: Transformer-Based Multi-modal Proposal and Re-Rank for Wikipedia Image-Caption MatchingComments: Accepted for publication at the Wiki-M3L workshop, co-located with ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [807] arXiv:2206.10457 [pdf, other]
-
Title: Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh RecoverySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [808] arXiv:2206.10465 [pdf, other]
-
Title: An Overview of Privacy-enhancing Technologies in Biometric RecognitionComments: 12 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [809] arXiv:2206.10491 [pdf, other]
-
Title: Bi-Calibration Networks for Weakly-Supervised Video Representation LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [810] arXiv:2206.10520 [pdf, other]
-
Title: SFace: Privacy-friendly and Accurate Face Recognition using Synthetic DataSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [811] arXiv:2206.10526 [pdf, other]
-
Title: QuantFace: Towards Lightweight Face Recognition by Synthetic Data Low-bit QuantizationComments: Accepted ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [812] arXiv:2206.10531 [pdf, other]
-
Title: Neural Transformers for Intraductal Papillary Mucosal Neoplasms (IPMN) Classification in MRI imagesAuthors: Federica Proietto Salanitri, Giovanni Bellitto, Simone Palazzo, Ismail Irmakci, Michael B. Wallace, Candice W. Bolan, Megan Engels, Sanne Hoogenboom, Marco Aldinucci, Ulas Bagci, Daniela Giordano, Concetto SpampinatoSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [813] arXiv:2206.10535 [pdf, other]
-
Title: EpiGRAF: Rethinking training of 3D GANsComments: NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [814] arXiv:2206.10536 [pdf, other]
-
Title: HealNet -- Self-Supervised Acute Wound Heal-Stage ClassificationAuthors: Héctor Carrión, Mohammad Jafari, Hsin-Ya Yang, Roslyn Rivkah Isseroff, Marco Rolandi, Marcella Gomez, Narges NorouziSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [815] arXiv:2206.10552 [pdf, other]
-
Title: Vicinity Vision TransformerAuthors: Weixuan Sun, Zhen Qin, Hui Deng, Jianyuan Wang, Yi Zhang, Kaihao Zhang, Nick Barnes, Stan Birchfield, Lingpeng Kong, Yiran ZhongComments: code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [816] arXiv:2206.10555 [pdf, other]
-
Title: LargeKernel3D: Scaling up Kernels in 3D Sparse CNNsComments: In CVPR 2023. Code is at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [817] arXiv:2206.10562 [pdf, other]
-
Title: Semantics-Depth-Symbiosis: Deeply Coupled Semi-Supervised Learning of Semantics and DepthSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [818] arXiv:2206.10571 [pdf, other]
-
Title: Toward Unpaired Multi-modal Medical Image Segmentation via Learning Structured Semantic ConsistencyComments: MIDL23Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [819] arXiv:2206.10573 [pdf, ps, other]
-
Title: H&E-based Computational Biomarker Enables Universal EGFR Screening for Lung AdenocarcinomaAuthors: Gabriele Campanella, David Ho, Ida Häggström, Anton S Becker, Jason Chang, Chad Vanderbilt, Thomas J FuchsSubjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [820] arXiv:2206.10587 [pdf, ps, other]
-
Title: Guiding Visual Attention in Deep Convolutional Neural Networks Based on Human Eye MovementsComments: 28 pages, 6 figures, 3 supplementary figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [821] arXiv:2206.10589 [pdf, other]
-
Title: EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision ApplicationsAuthors: Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman Khan, Syed Waqas Zamir, Rao Muhammad Anwer, Fahad Shahbaz KhanComments: Accepted at ECCVW 2022 (Oral, CADL: Computational Aspects of Deep Learning)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [822] arXiv:2206.10590 [pdf, other]
-
Title: Temporally Consistent Semantic Video EditingComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [823] arXiv:2206.10665 [pdf, other]
-
Title: BOSS: A Benchmark for Human Belief Prediction in Object-context ScenariosComments: 9 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [824] arXiv:2206.10673 [pdf, ps, other]
-
Title: Natural Backdoor DatasetsAuthors: Emily Wenger, Roma Bhattacharjee, Arjun Nitin Bhagoji, Josephine Passananti, Emilio Andere, Haitao Zheng, Ben Y. ZhaoComments: 18 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [825] arXiv:2206.10690 [pdf, other]
-
Title: Learning Continuous Rotation Canonicalization with Radial Beam SamplingSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [826] arXiv:2206.10692 [pdf, other]
-
Title: Multi-level Domain Adaptation for Lane DetectionComments: Proceedings of the CVPR 2022 Workshop of Autonomous DrivingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [827] arXiv:2206.10698 [pdf, other]
-
Title: TiCo: Transformation Invariance and Covariance Contrast for Self-Supervised Visual Representation LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [828] arXiv:2206.10711 [pdf, other]
-
Title: Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive LearningComments: Accepted to IEEE Transactions on Intelligent Transportation Systems (T-ITS). Extended version of arXiv:2103.00868. The project is at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
- [829] arXiv:2206.10737 [pdf, other]
-
Title: Deep Metric Color Embeddings for Splicing Localization in Severely Degraded ImagesComments: 14 pages, 13 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [830] arXiv:2206.10779 [pdf, other]
-
Title: Not Just Streaks: Towards Ground Truth for Single Image DerainingAuthors: Yunhao Ba, Howard Zhang, Ethan Yang, Akira Suzuki, Arnold Pfahnl, Chethan Chinder Chandrappa, Celso de Melo, Suya You, Stefano Soatto, Alex Wong, Achuta KadambiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [831] arXiv:2206.10789 [pdf, other]
-
Title: Scaling Autoregressive Models for Content-Rich Text-to-Image GenerationAuthors: Jiahui Yu, Yuanzhong Xu, Jing Yu Koh, Thang Luong, Gunjan Baid, Zirui Wang, Vijay Vasudevan, Alexander Ku, Yinfei Yang, Burcu Karagol Ayan, Ben Hutchinson, Wei Han, Zarana Parekh, Xin Li, Han Zhang, Jason Baldridge, Yonghui WuComments: PreprintSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [832] arXiv:2206.10809 [pdf, other]
-
Title: SSMI: How to Make Objects of Interest Disappear without Accessing Object Detectors?Comments: 6 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [833] arXiv:2206.10821 [pdf, other]
-
Title: Coupling Visual Semantics of Artificial Neural Networks and Human Brain Function via Synchronized ActivationsAuthors: Lin Zhao, Haixing Dai, Zihao Wu, Zhenxiang Xiao, Lu Zhang, David Weizhong Liu, Xintao Hu, Xi Jiang, Sheng Li, Dajiang Zhu, Tianming LiuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [834] arXiv:2206.10830 [pdf, other]
-
Title: A Feature Memory Rearrangement Network for Visual Inspection of Textured Surface Defects Toward Edge Intelligent ManufacturingComments: Revision to IEEE transactions on automation science and engineeringSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [835] arXiv:2206.10831 [pdf, other]
-
Title: MultiEarth 2022 Deforestation Challenge -- ForestGumpComments: CVPR 2022, MultiEarth 2022, Deforestation Estimation ChallengeSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [836] arXiv:2206.10845 [pdf, other]
-
Title: Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance SegmentationAuthors: Ming Li, Jie Wu, Jinhang Cai, Jie Qin, Yuxi Ren, Xuefeng Xiao, Min Zheng, Rui Wang, Xin PanComments: The solution of 1st Place in AVA Accessibility Vision and Autonomy Challenge on CVPR 2022 workshop. Website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [837] arXiv:2206.10861 [pdf, other]
-
Title: UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022Comments: 5 pages, 3 figures; technical report for AVA Challenge (see this https URL) at the International Challenge on Activity Recognition (ActivityNet), CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [838] arXiv:2206.10869 [pdf, other]
-
Title: NVIDIA-UNIBZ Submission for EPIC-KITCHENS-100 Action Anticipation Challenge 2022Authors: Tsung-Ming Tai, Oswald Lanz, Giuseppe Fiameni, Yi-Kwan Wong, Sze-Sen Poon, Cheng-Kuang Lee, Ka-Chun Cheung, Simon SeeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [839] arXiv:2206.10878 [pdf, other]
-
Title: Feature Re-calibration based Multiple Instance Learning for Whole Slide Image ClassificationAuthors: Philip Chikontwe, Soo Jeong Nam, Heounjeong Go, Meejeong Kim, Hyun Jung Sung, Sang Hyun ParkComments: MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [840] arXiv:2206.10879 [pdf, other]
-
Title: Symmetric Network with Spatial Relationship Modeling for Natural Language-based Vehicle RetrievalComments: 8 pages, 3 figures, publised to CVPRWJournal-ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022: 3226-3233Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [841] arXiv:2206.10885 [pdf, other]
-
Title: KiloNeuS: A Versatile Neural Implicit Surface Representation for Real-Time RenderingComments: 9 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
- [842] arXiv:2206.10886 [pdf, other]
-
Title: Optical Flow Regularization of Implicit Neural Representations for Video Frame InterpolationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [843] arXiv:2206.10892 [pdf, other]
-
Title: I^2R-Net: Intra- and Inter-Human Relation Network for Multi-Person Pose EstimationAuthors: Yiwei Ding, Wenjin Deng, Yinglin Zheng, Pengfei Liu, Meihong Wang, Xuan Cheng, Jianmin Bao, Dong Chen, Ming ZengComments: Accepected by IJCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [844] arXiv:2206.10902 [pdf, other]
-
Title: S2TNet: Spatio-Temporal Transformer Networks for Trajectory Prediction in Autonomous DrivingComments: Accepted by ACML2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [845] arXiv:2206.10903 [pdf, ps, other]
-
Title: UniUD-FBK-UB-UniBZ Submission to the EPIC-Kitchens-100 Multi-Instance Retrieval Challenge 2022Comments: Ranked joint 1st place in the Multi-Instance Action Retrieval Challenge organized at EPIC@CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [846] arXiv:2206.10910 [pdf, other]
-
Title: SpA-Former: Transformer image shadow detection and removal via spatial attentionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [847] arXiv:2206.10915 [pdf, other]
-
Title: Understanding the effect of sparsity on neural networks robustnessSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [848] arXiv:2206.10965 [pdf, other]
-
Title: Polar Parametrization for Vision-based Surround-View 3D DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [849] arXiv:2206.10969 [pdf, other]
-
Title: Single Morphing Attack Detection using Siamese Network and Few-shot LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [850] arXiv:2206.10988 [pdf, other]
-
Title: AdvSmo: Black-box Adversarial Attack by Smoothing Linear Structure of TextureComments: 6 pages,3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [851] arXiv:2206.10989 [pdf, other]
-
Title: Identity Documents Authentication based on Forgery Detection of Guilloche PatternSubjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
- [852] arXiv:2206.10996 [pdf, other]
-
Title: ProtoCLIP: Prototypical Contrastive Language Image PretrainingComments: Accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [853] arXiv:2206.11011 [pdf, other]
-
Title: Weakly-Supervised Temporal Action Localization by Progressive Complementary LearningAuthors: Jia-Run Du, Jia-Chang Feng, Kun-Yu Lin, Fa-Ting Hong, Xiao-Ming Wu, Zhongang Qi, Ying Shan, Wei-Shi ZhengSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [854] arXiv:2206.11053 [pdf, other]
-
Title: Surgical-VQA: Visual Question Answering in Surgical Scenes using TransformerComments: Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO); Image and Video Processing (eess.IV)
- [855] arXiv:2206.11080 [pdf, other]
-
Title: Motion Gait: Gait Recognition via Motion ExcitationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [856] arXiv:2206.11095 [pdf, other]
-
Title: A High Resolution Multi-exposure Stereoscopic Image & Video Database of Natural ScenesSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [857] arXiv:2206.11115 [pdf, other]
-
Title: ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition CanvasAuthors: Prathmesh Madhu, Tilman Marquart, Ronak Kosti, Dirk Suckow, Peter Bell, Andreas Maier, Vincent ChristleinSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [858] arXiv:2206.11134 [pdf, other]
-
Title: Open Vocabulary Object Detection with Proposal Mining and Prediction EqualizationAuthors: Peixian Chen, Kekai Sheng, Mengdan Zhang, Mingbao Lin, Yunhang Shen, Shaohui Lin, Bo Ren, Ke LiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [859] arXiv:2206.11180 [pdf, other]
-
Title: Optimal transport meets noisy label robust loss and MixUp regularization for domain adaptationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [860] arXiv:2206.11203 [pdf, other]
-
Title: Facke: a Survey on Generative Models for Face SwappingSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [861] arXiv:2206.11212 [pdf, other]
-
Title: VisFIS: Visual Feature Importance Supervision with Right-for-the-Right-Reason ObjectivesComments: NeurIPS 2022 (first two authors contributed equally)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [862] arXiv:2206.11215 [pdf, other]
-
Title: Certifiable 3D Object Pose Estimation: Foundations, Learning Models, and Self-TrainingSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [863] arXiv:2206.11250 [pdf, other]
-
Title: Depth-aware Glass Surface Detection with Cross-modal Context MiningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [864] arXiv:2206.11253 [pdf, other]
-
Title: Towards Robust Blind Face Restoration with Codebook Lookup TransformerComments: Accepted by NeurIPS 2022. Code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [865] arXiv:2206.11352 [pdf, ps, other]
-
Title: Doubly Reparameterized Importance Weighted Structure Learning for Scene Graph GenerationComments: arXiv admin note: substantial text overlap with arXiv:2205.07017Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [866] arXiv:2206.11358 [pdf, other]
-
Title: Monocular Spherical Depth Estimation with Explicitly Connected Weak Layout CuesComments: Project page at this https URLJournal-ref: ISPRS Journal of Photogrammetry and Remote Sensing, Volume 183, January 2022, Pages 269-285Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [867] arXiv:2206.11404 [pdf, other]
-
Title: The ArtBench Dataset: Benchmarking Generative Models with ArtworksComments: The first two authors contributed equally to this work. The code and data are available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [868] arXiv:2206.11428 [pdf, other]
-
Title: LidarMultiNet: Unifying LiDAR Semantic Segmentation, 3D Object Detection, and Panoptic Segmentation in a Single Multi-task NetworkComments: Official 1st Place Solution for the Waymo Open Dataset Challenges 2022 - 3D Semantic Segmentation. Official leaderboard: this https URL CVPR 2022 Workshop on Autonomous Driving: this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [869] arXiv:2206.11443 [pdf, other]
-
Title: Image-based Stability QuantificationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [870] arXiv:2206.11459 [pdf, other]
-
Title: Explore Spatio-temporal Aggregation for Insubstantial Object Detection: Benchmark Dataset and BaselineSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [871] arXiv:2206.11462 [pdf, ps, other]
-
Title: ICME 2022 Few-shot LOGO detection top 9 solutionSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [872] arXiv:2206.11473 [pdf, other]
-
Title: Complementary datasets to COCO for object detectionAuthors: Ali BorjiSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [873] arXiv:2206.11474 [pdf, other]
-
Title: Entropy-driven Sampling and Training Scheme for Conditional Diffusion GenerationComments: 24 pages, 8 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [874] arXiv:2206.11476 [pdf, other]
-
Title: Dynamic Scene Deblurring Based on Continuous Cross-Layer Attention TransmissionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [875] arXiv:2206.11493 [pdf, other]
-
Title: Learning to Refactor Action and Co-occurrence Features for Temporal Action LocalizationComments: Accepted by CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [876] arXiv:2206.11499 [pdf, other]
-
Title: Parallel Structure from Motion for UAV Images via Weighted Connected Dominating SetComments: 14 pages, 11 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [877] arXiv:2206.11502 [pdf, ps, other]
-
Title: A Review of Published Machine Learning Natural Language Processing Applications for Protocolling Radiology ImagingAuthors: Nihal Raju (5), Michael Woodburn (1 and 5), Stefan Kachel (2 and 3), Jack O'Shaughnessy (5), Laurence Sorace (5), Natalie Yang (2), Ruth P Lim (2 and 4) ((1) Harvard University, Extension School, Cambridge, MA, USA, (2) Department of Radiology, The University of Melbourne, Parkville, (3) Department of Radiology, Columbia University in the City of New York, (4) Department of Surgery, Austin, The University of Melbourne, (5) Austin Hospital, Austin Health, Melbourne, Australia)Comments: 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [878] arXiv:2206.11520 [pdf, other]
-
Title: ICOS Protein Expression Segmentation: Can Transformer Networks Give Better Results?Comments: Accepted MIUA conference (Abstract short paper)Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [879] arXiv:2206.11541 [pdf, other]
-
Title: A Neuromorphic Vision-Based Measurement for Robust Relative Localization in Future Space Exploration MissionsAuthors: Mohammed Salah, Mohammed Chehadah, Muhammed Humais, Mohammed Wahbah, Abdulla Ayyad, Rana Azzam, Lakmal Seneviratne, Yahya ZweiriSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [880] arXiv:2206.11589 [pdf, other]
-
Title: Learning Towards the Largest MarginsComments: ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [881] arXiv:2206.11610 [pdf, other]
-
Title: 1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)Comments: Winner of the 2nd RxR-Habitat Competition @ CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [882] arXiv:2206.11629 [pdf, other]
-
Title: Global Sensing and Measurements Reuse for Image Compressed SensingSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [883] arXiv:2206.11653 [pdf, other]
-
Title: Learning To Generate Scene Graph from Head to TailSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [884] arXiv:2206.11657 [pdf, other]
-
Title: Warped Convolutional Networks: Bridge Homography to sl(3) algebra by Group ConvolutionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [885] arXiv:2206.11678 [pdf, other]
-
Title: BlazePose GHUM Holistic: Real-time 3D Human Landmarks and Pose EstimationAuthors: Ivan Grishchenko, Valentin Bazarevsky, Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Zanfir, Richard Yee, Karthik Raveendran, Matsvei Zhdanovich, Matthias Grundmann, Cristian SminchisescuComments: 4 pages, 4 figures; CVPR Workshop on Computer Vision for Augmented and Virtual Reality, New Orleans, LA, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [886] arXiv:2206.11695 [pdf, other]
-
Title: NTIRE 2022 Challenge on Perceptual Image Quality AssessmentComments: This report has been published in CVPR 2022 NTIRE workshop. arXiv admin note: text overlap with arXiv:2105.03072Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [887] arXiv:2206.11723 [pdf, other]
-
Title: Self-Supervised Training with Autoencoders for Visual Anomaly DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [888] arXiv:2206.11736 [pdf, other]
-
Title: NovelCraft: A Dataset for Novelty Detection and Discovery in Open WorldsAuthors: Patrick Feeney, Sarah Schneider, Panagiotis Lymperopoulos, Li-Ping Liu, Matthias Scheutz, Michael C. HughesComments: Published in Transactions on Machine Learning Research (03/2023)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [889] arXiv:2206.11739 [pdf, other]
-
Title: Evidence fusion with contextual discounting for multi-modality medical image segmentationComments: MICCAI2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [890] arXiv:2206.11752 [pdf, other]
-
Title: CLAMP: Prompt-based Contrastive Learning for Connecting Language and Animal PoseComments: CVPR2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [891] arXiv:2206.11759 [pdf, other]
-
Title: What makes you, you? Analyzing Recognition by Swapping Face PartsComments: Accepted for publication at 26TH International Conference on Pattern Recognition (ICPR), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [892] arXiv:2206.11768 [pdf, other]
-
Title: FitGAN: Fit- and Shape-Realistic Generative Adversarial Networks for FashionComments: 26th International Conference on Pattern Recognition (ICPR) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [893] arXiv:2206.11804 [pdf, other]
-
Title: Rethinking Surgical Instrument Segmentation: A Background Image Can Be All You NeedComments: 10 pages, MICCAI2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [894] arXiv:2206.11808 [pdf, other]
-
Title: Unseen Object 6D Pose Estimation: A Benchmark and BaselinesSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [895] arXiv:2206.11825 [pdf, other]
-
Title: YOLOSA: Object detection based on 2D local feature superimposed self-attentionComments: This paper is under consideration at Pattern Recognition LettersSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [896] arXiv:2206.11826 [pdf, other]
- [897] arXiv:2206.11892 [pdf, other]
-
Title: DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Change DetectionComments: Code available at: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [898] arXiv:2206.11894 [pdf, other]
-
Title: MaskViT: Masked Visual Pre-Training for Video PredictionComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [899] arXiv:2206.11895 [pdf, other]
-
Title: Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D SpaceComments: NeurIPS 2022. Our code is at this https URL Our project page is at this https URL v3, v4 for minor updates on figures and visualizationsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [900] arXiv:2206.11896 [pdf, other]
-
Title: EventNeRF: Neural Radiance Fields from a Single Colour Event CameraComments: 19 pages, 21 figures, 3 tables; CVPR 2023Journal-ref: Computer Vision and Pattern Recognition (CVPR) 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [901] arXiv:2206.11920 [pdf, other]
-
Title: Agriculture-Vision Challenge 2022 -- The Runner-Up Solution for Agricultural Pattern Recognition via Transformer-based ModelsComments: CVPR 2022, Agriculture-Vision Challenge, Remote SensingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [902] arXiv:2206.11927 [pdf, other]
-
Title: Towards Galaxy Foundation Models with Hybrid Contrastive LearningComments: Accepted at the ICML 2022 Workshop on Machine Learning for Astrophysics. Data: www.github.com/mwalmsley/pytorch-galaxy-datasets. Please reach out to share your labelled data - all contributions will be credited in future workSubjects: Computer Vision and Pattern Recognition (cs.CV); Astrophysics of Galaxies (astro-ph.GA)
- [903] arXiv:2206.11952 [pdf, other]
-
Title: UNeRF: Time and Memory Conscious U-Shaped Network for Training Neural Radiance FieldsSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [904] arXiv:2206.12035 [pdf, other]
-
Title: The Second Place Solution for The 4th Large-scale Video Object Segmentation Challenge--Track 3: Referring Video Object SegmentationComments: 4 pages, 2 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [905] arXiv:2206.12043 [pdf, other]
-
Title: Protecting President Zelenskyy against Deep FakesSubjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [906] arXiv:2206.12046 [pdf, other]
-
Title: Bilateral Network with Channel Splitting Network and Transformer for Thermal Image Super-ResolutionComments: The second place solution for CVPR2022 PBVS-TISR challengeSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [907] arXiv:2206.12055 [pdf, other]
-
Title: SDF-StyleGAN: Implicit SDF-Based StyleGAN for 3D Shape GenerationComments: Accepted to Computer Graphics Forum (SGP), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [908] arXiv:2206.12063 [src]
-
Title: Mutual Information-guided Knowledge Transfer for Novel Class DiscoveryComments: The derivation of Mutual Information in the manuscript is wrongSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [909] arXiv:2206.12071 [pdf, other]
-
Title: Contrastive Learning of Features between Images and LiDARComments: accepted in CASE2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [910] arXiv:2206.12073 [pdf, other]
-
Title: MaskRange: A Mask-classification Model for Range-view based LiDAR SegmentationComments: Under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [911] arXiv:2206.12099 [pdf, ps, other]
-
Title: A novel approach for glaucoma classification by wavelet neural networks using graph-based, statisitcal features of qualitatively improved imagesComments: 25 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [912] arXiv:2206.12117 [pdf, other]
-
Title: Self Supervised Learning for Few Shot Hyperspectral Image ClassificationComments: Accepted in IGARSS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [913] arXiv:2206.12123 [pdf, ps, other]
-
Title: Some theoretical results on discrete contour treesAuthors: Yuqing SongComments: 5 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG)
- [914] arXiv:2206.12126 [pdf, other]
-
Title: Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive LearningComments: Accepted by CVPR 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [915] arXiv:2206.12128 [pdf, other]
-
Title: Excavating RoI Attention for Underwater Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [916] arXiv:2206.12216 [pdf, other]
-
Title: Optimized Views Photogrammetry: Precision Analysis and A Large-scale Case Study in QingdaoComments: 16 pages, 24 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [917] arXiv:2206.12351 [pdf, other]
-
Title: Megapixel Image Generation with Step-Unrolled Denoising AutoencodersComments: 17 pages + 9 appendix pages. 20 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [918] arXiv:2206.12356 [pdf, other]
-
Title: HM3D-ABO: A Photo-realistic Dataset for Object-centric Multi-view 3D ReconstructionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [919] arXiv:2206.12370 [pdf, other]
-
Title: Mixed Sample Augmentation for Online DistillationComments: 5 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [920] arXiv:2206.12372 [pdf, other]
-
Title: QReg: On Regularization Effects of QuantizationAuthors: MohammadHossein AskariHemmat, Reyhane Askari Hemmat, Alex Hoffman, Ivan Lazarevich, Ehsan Saboori, Olivier Mastropietro, Sudhakar Sah, Yvon Savaria, Jean-Pierre DavidSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [921] arXiv:2206.12381 [pdf, other]
-
Title: Defending Backdoor Attacks on Vision Transformer via Patch ProcessingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [922] arXiv:2206.12396 [pdf, other]
-
Title: Text-Driven Stylization of Video ObjectsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [923] arXiv:2206.12403 [pdf, other]
-
Title: ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal EmbeddingsComments: code: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [924] arXiv:2206.12455 [pdf, other]
-
Title: Ev-NeRF: Event Based Neural Radiance FieldComments: Accepted to WACV 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [925] arXiv:2206.12458 [pdf, other]
-
Title: Bag of Tricks for Long-Tail Visual Recognition of Animal Species in Camera-Trap ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [926] arXiv:2206.12464 [pdf, other]
-
Title: Motion Estimation for Large Displacements and DeformationsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [927] arXiv:2206.12480 [pdf, other]
-
Title: Attention-Guided Autoencoder for Automated Progression Prediction of Subjective Cognitive Decline with Structural MRIComments: 10 pages, 12 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [928] arXiv:2206.12498 [pdf, other]
-
Title: Optimal and Robust Category-level Perception: Object Pose and Shape Estimation from 2D and 3D Semantic KeypointsComments: arXiv admin note: text overlap with arXiv:2104.08383Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [929] arXiv:2206.12505 [pdf, other]
-
Title: Stain Based Contrastive Co-training for Histopathological Image AnalysisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [930] arXiv:2206.12533 [pdf, other]
-
Title: From Shallow to Deep: Compositional Reasoning over Graphs for Visual Question AnsweringAuthors: Zihao ZhuSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [931] arXiv:2206.12534 [pdf, other]
-
Title: SLIC: Self-Supervised Learning with Iterative Clustering for Human Action VideosComments: CVPR2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [932] arXiv:2206.12558 [pdf, other]
-
Title: FastBVP-Net: a lightweight pulse extraction network for measuring heart rhythm via facial videosComments: 9 pages, 2figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [933] arXiv:2206.12571 [pdf, other]
-
Title: CV 3315 Is All You Need : Semantic Segmentation CompetitionComments: arXiv admin note: text overlap with arXiv:2105.15203 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [934] arXiv:2206.12590 [pdf, other]
-
Title: RSTAM: An Effective Black-Box Impersonation Attack on Face Recognition using a Mobile and Compact PrinterSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [935] arXiv:2206.12592 [pdf, other]
-
Title: Asymmetric Transfer Hashing with Adaptive Bipartite Graph LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [936] arXiv:2206.12596 [pdf, ps, other]
-
Title: Non-iterative Coarse-to-fine Registration based on Single-pass Deep Cumulative LearningComments: Accepted at International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2022)Journal-ref: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 88-97, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [937] arXiv:2206.12612 [pdf, other]
-
Title: Learn to Predict How Humans Manipulate Large-sized Objects from Interactive MotionsAuthors: Weilin Wan, Lei Yang, Lingjie Liu, Zhuoying Zhang, Ruixing Jia, Yi-King Choi, Jia Pan, Christian Theobalt, Taku Komura, Wenping WangJournal-ref: IEEE Robotics and Automation Letters ( Volume: 7, Issue: 2, April 2022)Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [938] arXiv:2206.12614 [pdf, other]
-
Title: BokehMe: When Neural Rendering Meets Classical RenderingComments: Accepted by CVPR 2022 (Oral); Project: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [939] arXiv:2206.12622 [pdf, other]
-
Title: SAT: Self-adaptive training for fashion compatibility predictionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [940] arXiv:2206.12623 [pdf, other]
-
Title: Inverted Semantic-Index for Image RetrievalAuthors: Ying WangSubjects: Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB)
- [941] arXiv:2206.12634 [pdf, other]
-
Title: SC-Transformer++: Structured Context Transformer for Generic Event Boundary DetectionComments: winner method at LOVEU@CVPR'22 Generic Event Boundary Detection ChallengeSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [942] arXiv:2206.12648 [pdf, other]
-
Title: BIMS-PU: Bi-Directional and Multi-Scale Point Cloud UpsamplingComments: Accepted to RA-L 2022. in IEEE Robotics and Automation LettersJournal-ref: in IEEE Robotics and Automation Letters, vol. 7, no. 3, pp. 7447-7454, July 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [943] arXiv:2206.12650 [pdf, ps, other]
-
Title: Machine Learning-based Biological Ageing Estimation Technologies: A SurveyComments: in Recent Advances in AI-enabled Automated Medical Diagnosis this https URLJournal-ref: Recent Advances in AI-enabled Automated Medical Diagnosis, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [944] arXiv:2206.12651 [pdf, ps, other]
-
Title: Review on Social Behavior Analysis of Laboratory Animals: From Methodologies to ApplicationsComments: this https URLJournal-ref: Recent Advances in AI-enabled Automated Medical Diagnosis, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [945] arXiv:2206.12653 [pdf, ps, other]
-
Title: Diagnostic Communication and Visual System based on Vehicle UDS ProtocolSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [946] arXiv:2206.12657 [pdf, other]
-
Title: Enhanced Deep Animation Video InterpolationSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [947] arXiv:2206.12675 [pdf, other]
-
Title: Learning to Infer 3D Shape Programs with Differentiable RendererAuthors: Yichao LiangComments: Technical report written in 2020; 10 pages, 5 figures. arXiv admin note: substantial text overlap with arXiv:1901.02875 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [948] arXiv:2206.12681 [pdf, other]
-
Title: UltraMNIST Classification: A Benchmark to Train CNNs for Very Large ImagesAuthors: Deepak K. Gupta, Udbhav Bamba, Abhishek Thakur, Akash Gupta, Suraj Sharan, Ertugrul Demir, Dilip K. PrasadSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [949] arXiv:2206.12685 [pdf, ps, other]
-
Title: Defense against adversarial attacks on deep convolutional neural networks through nonlocal denoisingJournal-ref: IAES International Journal of Artificial Intelligence, Vol. 11, No. 3, September 2022, pp. 961~968, ISSN: 2252-8938Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [950] arXiv:2206.12694 [pdf, other]
-
Title: RandStainNA: Learning Stain-Agnostic Features from Histology Slides by Bridging Stain Augmentation and NormalizationComments: 12 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [951] arXiv:2206.12704 [pdf, other]
-
Title: Anatomy-Guided Weakly-Supervised Abnormality Localization in Chest X-raysComments: Accepted by MICCAI 20222Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [952] arXiv:2206.12714 [pdf, other]
-
Title: Defending Multimodal Fusion Models against Single-Source AdversariesComments: CVPR 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [953] arXiv:2206.12725 [pdf, other]
-
Title: Empirical Evaluation of Physical Adversarial Patch Attacks Against Overhead Object Detection ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [954] arXiv:2206.12738 [pdf, other]
-
Title: Self-Supervised 3D Monocular Object Detection by Recycling Bounding BoxesAuthors: Sugirtha T, Sridevi M, Khailash Santhakumar, Hao Liu, B Ravi Kiran, Thomas Gauthier, Senthil YogamaniComments: Published at ICCVW-SSLAD 2021. arXiv admin note: substantial text overlap with arXiv:2104.10786Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [955] arXiv:2206.12740 [pdf, other]
-
Title: Multi Visual Modality Fall Detection DatasetAuthors: Stefan Denkovski, Shehroz S. Khan, Brandon Malamis, Sae Young Moon, Bing Ye, Alex MihailidisSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [956] arXiv:2206.12745 [pdf, ps, other]
-
Title: Sequential image recovery using joint hierarchical Bayesian learningComments: 24 pages, 15 figuresJournal-ref: J Sci Comput 96, 4 (2023)Subjects: Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
- [957] arXiv:2206.12755 [pdf, other]
-
Title: Training Your Sparse Neural Network Better with Any MaskComments: Accepted by ICML 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [958] arXiv:2206.12772 [pdf, other]
-
Title: Exploiting Transformation Invariance and Equivariance for Self-supervised Sound LocalisationComments: Camera-ready Version for ACMMM 2022, Project page is this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [959] arXiv:2206.12788 [pdf, other]
-
Title: Representative Teacher Keys for Knowledge Distillation Model Compression Based on Attention Mechanism for Image ClassificationComments: eight pages, six figures, three tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [960] arXiv:2206.12794 [pdf, other]
-
Title: CTMQ: Cyclic Training of Convolutional Neural Networks with Multiple Quantization StepsComments: submitted to NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [961] arXiv:2206.12798 [pdf, other]
-
Title: Multiple Instance Learning with Mixed Supervision in Gleason GradingComments: Accepted by MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [962] arXiv:2206.12837 [pdf, other]
-
Title: Perceptual Conversational Head Generation with Regularized Driver and Enhanced RendererComments: Ailin and Zhewei contributed equally to this work. ACM MM22 workshop paperSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [963] arXiv:2206.12845 [pdf, other]
-
Title: RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video RetrievalComments: Preprint, under review in TCSVT JournalSubjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [964] arXiv:2206.12849 [pdf, other]
-
Title: Semantic Role Aware Correlation Transformer for Text to Video RetrievalComments: Camera-ready for ICIP 2021Journal-ref: IEEE International Conference on Image Processing (ICIP), 2021, pp. 1334-1338Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [965] arXiv:2206.12869 [pdf, other]
-
Title: Image Aesthetics Assessment Using Graph Attention NetworkComments: International Conference on Pattern Recognition (ICPR), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [966] arXiv:2206.12885 [pdf, ps, other]
-
Title: FingerGAN: A Constrained Fingerprint Generation Scheme for Latent Fingerprint EnhancementSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [967] arXiv:2206.12912 [pdf, other]
-
Title: Woodscape Fisheye Object Detection for Autonomous Driving -- CVPR 2022 OmniCV Workshop ChallengeAuthors: Saravanabalagi Ramachandran, Ganesh Sistu, Varun Ravi Kumar, John McDonald, Senthil YogamaniComments: Workshop on Omnidirectional Computer Vision (OmniCV) at Conference on Computer Vision and Pattern Recognition (CVPR) 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [968] arXiv:2206.12914 [pdf, other]
-
Title: Video Anomaly Detection via Prediction Network with Enhanced Spatio-Temporal Memory ExchangeComments: Accepted at ICASSP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [969] arXiv:2206.12921 [pdf, other]
-
Title: Non-Parametric Style TransferSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [970] arXiv:2206.12923 [pdf, other]
-
Title: Video Activity Localisation with Uncertainties in Temporal BoundarySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [971] arXiv:2206.12925 [pdf, other]
-
Title: Vision Transformer for Contrastive ClusteringSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [972] arXiv:2206.12930 [pdf, other]
-
Title: SVBR-NET: A Non-Blind Spatially Varying Defocus Blur Removal NetworkComments: Accepted to ICIP2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [973] arXiv:2206.12943 [pdf, other]
-
Title: Multi-view Feature Augmentation with Adaptive Class Activation MappingComments: An arxiv version of the paper published in Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI-21). See this https URLJournal-ref: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence. Main Track. 2021. Pages 678-684Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [974] arXiv:2206.12946 [pdf, other]
-
Title: AFT-VO: Asynchronous Fusion Transformers for Multi-View Visual Odometry EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [975] arXiv:2206.12952 [pdf, other]
-
Title: Nonwatertight Mesh ReconstructionAuthors: Partha GhoshComments: arXiv admin note: text overlap with arXiv:2106.03452 by other authorsSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [976] arXiv:2206.12958 [pdf, ps, other]
-
Title: Szloca: towards a framework for full 3D tracking through a single camera in context of interactive artsAuthors: Sahaj GargSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
- [977] arXiv:2206.12959 [pdf, other]
-
Title: Probabilistic PolarGMM: Unsupervised Cluster Learning of Very Noisy Projection Images of Unknown PoseComments: 13 pages, including appendicesSubjects: Computer Vision and Pattern Recognition (cs.CV); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
- [978] arXiv:2206.12963 [pdf, other]
-
Title: Self-Healing Robust Neural Networks via Closed-Loop ControlComments: 48 pages, 5 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [979] arXiv:2206.12972 [pdf, other]
-
Title: VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph CaptioningComments: accepted by The 29th IEEE International Conference on Image Processing (IEEE ICIP) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [980] arXiv:2206.12994 [pdf, other]
-
Title: Automatic Generation of Product-Image Sequence in E-commerceAuthors: Xiaochuan Fan, Chi Zhang, Yong Yang, Yue Shang, Xueying Zhang, Zhen He, Yun Xiao, Bo Long, Lingfei WuComments: Accepted by KDD 2022 ADSSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [981] arXiv:2206.13028 [pdf, other]
-
Title: Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action RecognitionComments: 10 pages, 4 figures, accepted by AAAI 2021Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [982] arXiv:2206.13042 [pdf, other]
-
Title: A Strategy Optimized Pix2pix Approach for SAR-to-Optical Image Translation TaskSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [983] arXiv:2206.13076 [pdf, other]
-
Title: SearchMorph:Multi-scale Correlation Iterative Network for Deformable RegistrationAuthors: Xiao Fan, Shuxin Zhuang, Zhemin Zhuang, Ye Yuan, Shunmin Qiu, Alex Noel Joseph Raj, Yibiao RongSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [984] arXiv:2206.13078 [pdf, other]
-
Title: Video2StyleGAN: Encoding Video in Latent Space for ManipulationSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [985] arXiv:2206.13079 [pdf, other]
-
Title: Dynamic Bank Learning for Semi-supervised Federated Image Diagnosis with Class ImbalanceComments: Early accepted by 25th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI'22)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [986] arXiv:2206.13082 [pdf, ps, other]
-
Title: PST: Plant segmentation transformer for 3D point clouds of rapeseed plants at the podding stageComments: 46 pages, 10 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [987] arXiv:2206.13115 [pdf, other]
-
Title: Lesion-Aware Contrastive Representation Learning for Histopathology Whole Slide Images AnalysisComments: accepted for MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [988] arXiv:2206.13117 [src]
-
Title: SARNet: Semantic Augmented Registration of Large-Scale Urban Point CloudsComments: Author information changesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [989] arXiv:2206.13142 [pdf, other]
-
Title: Representing motion as a sequence of latent primitives, a flexible approach for human motion modellingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [990] arXiv:2206.13155 [pdf, other]
-
Title: Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document UnderstandingComments: Under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
- [991] arXiv:2206.13156 [pdf, other]
-
Title: Kernel Attention Transformer (KAT) for Histopathology Whole Slide Image ClassificationComments: accepted for MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [992] arXiv:2206.13188 [pdf, other]
-
Title: Self-supervised Learning in Remote Sensing: A ReviewComments: Accepted by IEEE Geoscience and Remote Sensing Magazine. 32 pages, 22 content pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [993] arXiv:2206.13199 [pdf, other]
-
Title: MGNet: Monocular Geometric Scene Understanding for Autonomous DrivingJournal-ref: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 15784-15795Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [994] arXiv:2206.13263 [pdf, other]
-
Title: Learning with Weak Annotations for Robust Maritime Obstacle DetectionComments: Published in MDPI Sensors, 23 pages, 8 figuresJournal-ref: Sensors 2022, 22, 9139Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [995] arXiv:2206.13282 [pdf, other]
-
Title: Monocular Depth Decomposition of Semi-Transparent Volume RenderingsComments: accepted at IEEE TVCG 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [996] arXiv:2206.13294 [pdf, other]
-
Title: LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic SegmentationAuthors: Florent Bartoccioni, Éloi Zablocki, Andrei Bursuc, Patrick Pérez, Matthieu Cord, Karteek AlahariJournal-ref: CoRL 2022 https://openreview.net/forum?id=abd_D-iVjk0Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [997] arXiv:2206.13296 [pdf, other]
-
Title: Consistency-preserving Visual Question Answering in Medical ImagingComments: Appears in Medical Image Computing and Computer Assisted Interventions (MICCAI), 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [998] arXiv:2206.13304 [pdf, other]
-
Title: PARTICUL: Part Identification with Confidence measure using Unsupervised LearningAuthors: Romain Xu-Darme (LSL, MRIM ), Georges Quénot (MRIM ), Zakaria Chihani (LSL), Marie-Christine Rousset (SLIDE )Comments: Accepted at XAIE: 2nd Workshop on Explainable and Ethical AI -- ICPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [999] arXiv:2206.13317 [pdf, other]
-
Title: Automatic identification of segmentation errors for radiotherapy using geometric learningComments: Accepted in 25th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2022). This preprint has not undergone peer review or any post-submission improvements or correctionsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1000] arXiv:2206.13318 [pdf, other]
-
Title: Key-frame Guided Network for Thyroid Nodule Recognition using Ultrasound VideosSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1001] arXiv:2206.13329 [pdf, other]
-
Title: Prior-Guided One-shot Neural Architecture SearchComments: Official 3st Place Solution for the Second workshop Neural Architecture Search Second lightweight NAS Challenge 2022 - Track1 Supernet Track. Official leaderboard: this https URL CVPR 2022 Workshop: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1002] arXiv:2206.13342 [pdf, other]
-
Title: Open Set Classification of Untranscribed Handwritten DocumentsAuthors: José Ramón Prieto, Juan José Flores, Enrique Vidal, Alejandro H. Toselli, David Garrido, Carlos AlonsoSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [1003] arXiv:2206.13346 [pdf, other]
-
Title: Distributional Gaussian Processes Layers for Out-of-Distribution DetectionComments: Published in Journal of Machine Learning for Biomedical Imaging: Special Issue: Information Processing in Medical Imaging (IPMI) 2021Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [1004] arXiv:2206.13356 [pdf, other]
-
Title: iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and RecognitionComments: This is a technical report from the Chinese University of Hong KongSubjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1005] arXiv:2206.13381 [pdf, other]
-
Title: TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform MaskComments: This paper has been accepted by IEEE Transactions on MultimediaSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1006] arXiv:2206.13383 [pdf, ps, other]
-
Title: Mushroom image recognition and distance generation based on attention-mechanism model and genetic informationAuthors: Wenbin Liao, Jiewen Xiao, Chengbo Zhao, Yonggong Han, ZhiJie Geng, Jianxin Wang, Yihua YangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1007] arXiv:2206.13386 [pdf, other]
-
Title: Uncovering variability in human driving behavior through automatic extraction of similar traffic scenes from large naturalistic datasetsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1008] arXiv:2206.13388 [pdf, ps, other]
-
Title: Rotated Digit Recognition by Variational Autoencoders with Fixed Output DistributionsAuthors: David YevickSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [1009] arXiv:2206.13389 [pdf, other]
-
Title: UI Layers Merger: Merging UI layers via Visual Learning and Boundary PriorAuthors: Yun-nong Chen, Yan-kun Zhen, Chu-ning Shi, Jia-zhi Li, Liu-qing Chen, Ze-jian Li, Ling-yun Sun, Ting-ting Zhou, Yan-fang ChangComments: 15 pages, accepted to Frontiers of Information Technology & Electronic Engineering. This is a preprint version, the copyright belongs to the Springer Nature journalsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1010] arXiv:2206.13390 [pdf, other]
-
Title: A Comprehensive Survey on Video Saliency Detection with Auditory Information: the Audio-visual Consistency Perceptual is the Key!Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [1011] arXiv:2206.13391 [pdf, other]
-
Title: Deep reinforced active learning for multi-class image classificationComments: 10 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [1012] arXiv:2206.13392 [pdf, ps, other]
-
Title: Remote Sensing Image Classification using Transfer Learning and Attention Based Deep Neural NetworkSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [1013] arXiv:2206.13395 [pdf, other]
-
Title: Gait Cycle Reconstruction and Human Identification from Occluded SequencesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1014] arXiv:2206.13396 [pdf, other]
-
Title: A Simple Approach for Visual Rearrangement: 3D Mapping and Semantic SearchAuthors: Brandon Trabucco, Gunnar Sigurdsson, Robinson Piramuthu, Gaurav S. Sukhatme, Ruslan SalakhutdinovComments: Winner of the Rearrangement Challenge at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
- [1015] arXiv:2206.13397 [pdf, other]
-
Title: Generative Modelling With Inverse Heat DissipationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [1016] arXiv:2206.13398 [pdf, other]
-
Title: An Efficient Industrial Federated Learning Framework for AIoT: A Face Recognition ApplicationAuthors: Youlong Ding, Xueyang Wu, Zhitao Li, Zeheng Wu, Shengqi Tan, Qian Xu, Weike Pan, Qiang YangComments: FL-IJCAL'22 Accepted PaperSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1017] arXiv:2206.13413 [pdf, other]
-
Title: RES: A Robust Framework for Guiding Visual ExplanationComments: Published in KDD 2022Journal-ref: In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), August 14-18, 2022, Washington, DC, USASubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1018] arXiv:2206.13434 [pdf, other]
-
Title: ContraReg: Contrastive Learning of Multi-modality Unsupervised Deformable Image RegistrationComments: Accepted by MICCAI 2022. 13 pages, 6 figures, and 1 tableSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1019] arXiv:2206.13454 [pdf, other]
-
Title: Optimizing Video Prediction via Video Frame InterpolationComments: Accepted by the CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1020] arXiv:2206.13462 [pdf, other]
-
Title: Learn Fast, Segment Well: Fast Object Segmentation Learning on the iCub RobotAuthors: Federico Ceola, Elisa Maiettini, Giulia Pasquale, Giacomo Meanti, Lorenzo Rosasco, Lorenzo NataleComments: \copyright 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other worksSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1021] arXiv:2206.13500 [pdf, other]
-
Title: Neural Neural Textures Make Sim2Real ConsistentComments: 9 pages, 10 figures (without references or appendix); 16 pages, 16 figures (with appendix)Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
- [1022] arXiv:2206.13502 [pdf, other]
-
Title: Programmatic Concept Learning for Human Motion Description and SynthesisComments: CVPR 2022. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [1023] arXiv:2206.13559 [pdf, other]
-
Title: ST-Adapter: Parameter-Efficient Image-to-Video Transfer LearningComments: Accepted in NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1024] arXiv:2206.13577 [pdf, other]
-
Title: A View Independent Classification Framework for Yoga PosturesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [1025] arXiv:2206.13597 [pdf, other]
-
Title: NeuRIS: Neural Reconstruction of Indoor Scenes Using Normal PriorsAuthors: Jiepeng Wang, Peng Wang, Xiaoxiao Long, Christian Theobalt, Taku Komura, Lingjie Liu, Wenping WangSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1026] arXiv:2206.13608 [pdf, other]
-
Title: Reducing Annotation Need in Self-Explanatory Models for Lung Nodule DiagnosisComments: 10 pages, 4 figures, 2 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1027] arXiv:2206.13626 [pdf, other]
-
Title: Patch Selection for Melanoma ClassificationSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1028] arXiv:2206.13628 [pdf, other]
- [1029] arXiv:2206.13644 [pdf, other]
-
Title: Feature Refinement to Improve High Resolution Image InpaintingComments: 5 pages, 5 figures, Published in CVPR Workshop on Computer Vision for Augmented and Virtual Reality, New Orleans, LA, 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1030] arXiv:2206.13673 [pdf, other]
-
Title: How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying PixelsComments: 8 pagesJournal-ref: IEEE Robotics and Automation Letters 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [1031] arXiv:2206.13677 [pdf, other]
-
Title: Towards Global-Scale Crowd+AI Techniques to Map and Assess Sidewalks for People with DisabilitiesAuthors: Maryam Hosseini, Mikey Saugstad, Fabio Miranda, Andres Sevtsuk, Claudio T. Silva, Jon E. FroehlichComments: CVPR 2022 AVA (Accessibility, Vision, and Autonomy Meet) WorkshopSubjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [1032] arXiv:2206.13718 [pdf, other]
-
Title: The Third Place Solution for CVPR2022 AVA Accessibility Vision and Autonomy ChallengeComments: The third place solution for CVPR2022 AVA Accessibility Vision and Autonomy ChallengeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1033] arXiv:2206.13728 [pdf, ps, other]
-
Title: Boosting R-CNN: Reweighting R-CNN Samples by RPN's Error for Underwater Object DetectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1034] arXiv:2206.13732 [pdf, other]
-
Title: A Comprehensive Survey on Deep Gait Recognition: Algorithms, Datasets and ChallengesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1035] arXiv:2206.13737 [pdf, other]
-
Title: Adversarial Consistency for Single Domain Generalization in Medical Image SegmentationAuthors: Yanwu Xu, Shaoan Xie, Maxwell Reynolds, Matthew Ragoza, Mingming Gong, Kayhan BatmanghelichComments: MICCAI2022 accptedSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1036] arXiv:2206.13785 [pdf, other]
-
Title: 3D Multi-Object Tracking with Differentiable Pose EstimationComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1037] arXiv:2206.13803 [pdf, other]
-
Title: FedIIC: Towards Robust Federated Learning for Class-Imbalanced Medical Image ClassificationComments: This paper has been accepted by MICCAI 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1038] arXiv:2206.13829 [pdf, other]
-
Title: Cross-Forgery Analysis of Vision Transformers and CNNs for Deepfake Image DetectionAuthors: Davide Alessandro Coccomini, Roberto Caldelli, Fabrizio Falchi, Claudio Gennaro, Giuseppe AmatoSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1039] arXiv:2206.13850 [pdf, other]
-
Title: When the Sun Goes Down: Repairing Photometric Losses for All-Day Depth EstimationSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1040] arXiv:2206.13858 [pdf, other]
-
Title: Accurate and Real-time Pseudo Lidar Detection: Is Stereo Neural Network Really Necessary?Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1041] arXiv:2206.13887 [pdf, other]
-
Title: Generating near-infrared facial expression datasets with dimensional affect labelsSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1042] arXiv:2206.13951 [pdf, other]
-
Title: Robustifying Vision Transformer without Retraining from Scratch by Test-Time Class-Conditional Feature AlignmentComments: Accepted to IJCAI-ECAI2022. Code is available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [1043] arXiv:2206.13962 [src]
-
Title: Multi-Prior Learning via Neural Architecture Search for Blind Face RestorationComments: We found some problems with the article and need to withdrawal itSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1044] arXiv:2206.13963 [pdf, other]
-
Title: Primitive Graph Learning for Unified Vector MappingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1045] arXiv:2206.13964 [pdf, other]
-
Title: Learning Gait Representation from Massive Unlabelled Walking Videos: A BenchmarkSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1046] arXiv:2206.13996 [pdf, other]
-
Title: Detecting tiny objects in aerial images: A normalized Wasserstein distance and a new benchmarkComments: Accepted by ISPRS Journal of Photogrammetry and Remote SensingJournal-ref: ISPRS Journal of Photogrammetry and Remote Sensing (2022) 190:79-93Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1047] arXiv:2206.14009 [pdf, other]
-
Title: Show Me Your Face, And I'll Tell You How You SpeakSubjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
- [1048] arXiv:2206.14011 [pdf, ps, other]
-
Title: Taxonomy and evolution predicting using deep learning in imagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1049] arXiv:2206.14020 [pdf, other]
-
Title: Rethinking Adversarial Examples for Location Privacy ProtectionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1050] arXiv:2206.14116 [pdf, other]
-
Title: SSL-Lanes: Self-Supervised Learning for Motion Forecasting in Autonomous DrivingComments: Accepted to CoRL-2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
- [1051] arXiv:2206.14164 [pdf, ps, other]
-
Title: Visualizing and Alleviating the Effect of Radial Distortion on Camera Calibration Using Principal LinesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1052] arXiv:2206.14180 [pdf, other]
-
Title: High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled ConditionsComments: Accepted to ECCV 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1053] arXiv:2206.14195 [pdf, other]
-
Title: Pedestrian 3D Bounding Box PredictionComments: Accepted and published in hEART2022 (the 10th Symposium of the European Association for Research in Transportation): this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1054] arXiv:2206.14245 [pdf, other]
-
Title: SImProv: Scalable Image Provenance Framework for Robust Content AttributionAuthors: Alexander Black, Tu Bui, Simon Jenni, Zhifei Zhang, Viswanathan Swaminanthan, John CollomosseComments: Under consideration at Computer Vision and Image UnderstandingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1055] arXiv:2206.14263 [pdf, other]
-
Title: ZoDIAC: Zoneout Dropout Injection Attention CalculationComments: This work has been submitted to SN-AIRE journal and is currently under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1056] arXiv:2206.14302 [pdf, ps, other]
-
Title: Reinforcement Learning in Medical Image Analysis: Concepts, Applications, Challenges, and Future DirectionsComments: 30 pages, 13 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1057] arXiv:2206.14314 [pdf, other]
-
Title: Generative Neural Articulated Radiance FieldsAuthors: Alexander W. Bergman, Petr Kellnhofer, Wang Yifan, Eric R. Chan, David B. Lindell, Gordon WetzsteinComments: Project website: this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [1058] arXiv:2206.14344 [pdf, other]
-
Title: A New Adjacency Matrix Configuration in GCN-based Models for Skeleton-based Action RecognitionComments: 19 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1059] arXiv:2206.14350 [pdf, ps, other]
-
Title: Convolutional Neural Network Based Partial Face DetectionAuthors: Md. Towfiqul Islam, Tanzim Ahmed, A.B.M. Raihanur Rashid, Taminul Islam, Md. Sadekur Rahman, Md. Tarek HabibComments: Accepted in 7th International Conference for Convergence in Technology (I2CT), 2022, 6 pages, 7 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1060] arXiv:2206.14355 [pdf, other]
-
Title: EBMs vs. CL: Exploring Self-Supervised Visual Pretraining for Visual Question AnsweringSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [1061] arXiv:2206.14381 [pdf, other]
-
Title: Exploiting Semantic Role Contextualized Video Features for Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022Comments: Ranked joint 3rd place in the Multi-Instance Retrieval Challenge at EPIC@CVPR2022. (v2: ref error is corrected)Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [1062] arXiv:2206.14409 [pdf, ps, other]
-
Title: BATFormer: Towards Boundary-Aware Lightweight Transformer for Efficient Medical Image SegmentationComments: Accepted by IEEE Journal of Biomedical and Health Informatics The source code is publicly available at this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1063] arXiv:2206.14413 [pdf, other]
-
Title: The Lighter The Better: Rethinking Transformers in Medical Image Segmentation Through Adaptive PruningSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1064] arXiv:2206.14437 [pdf, other]
-
Title: MaNi: Maximizing Mutual Information for Nuclei Cross-Domain Unsupervised SegmentationComments: Accepted at MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1065] arXiv:2206.14451 [pdf, other]
-
Title: SRCN3D: Sparse R-CNN 3D for Compact Convolutional Multi-View 3D Object Detection and TrackingAuthors: Yining Shi, Jingyan Shen, Yifan Sun, Yunlong Wang, Jiaxin Li, Shiqi Sun, Kun Jiang, Diange YangComments: Accepted to Vision-centric Autonomous Driving(VCAD) Workshop at CVPR2023, For more details refer to this http URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1066] arXiv:2206.14467 [pdf, other]
-
Title: Single-domain Generalization in Medical Image Segmentation via Test-time Adaptation from Shape DictionaryComments: Accepted to AAAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1067] arXiv:2206.14475 [pdf, other]
-
Title: Siamese Contrastive Embedding Network for Compositional Zero-Shot LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1068] arXiv:2206.14538 [pdf, other]
-
Title: vMFNet: Compositionality Meets Domain-generalised SegmentationComments: Accepted by MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1069] arXiv:2206.14554 [pdf, other]
-
Title: Uncertainty-aware Panoptic SegmentationSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1070] arXiv:2206.14555 [pdf, other]
-
Title: Technical Report for CVPR 2022 LOVEU AQTC ChallengeComments: 4 pages, 3 figures, technical report for track3 of CVPR 2022 LOVEU challengeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1071] arXiv:2206.14651 [pdf, other]
-
Title: BoT-SORT: Robust Associations Multi-Pedestrian TrackingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1072] arXiv:2206.14702 [pdf, other]
-
Title: Interventional Contrastive Learning with Meta Semantic RegularizerComments: Accepted by ICML 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1073] arXiv:2206.14718 [pdf, other]
-
Title: LViT: Language meets Vision Transformer in Medical Image SegmentationAuthors: Zihan Li, Yunxiang Li, Qingde Li, Puyang Wang, Dazhou Guo, Le Lu, Dakai Jin, You Zhang, Qingqi HongComments: Accepted by IEEE Transactions on Medical Imaging (TMI)Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1074] arXiv:2206.14735 [pdf, other]
-
Title: GO-Surf: Neural Feature Grid Optimization for Fast, High-Fidelity RGB-D Surface ReconstructionComments: 3DV2022 (Oral), first two authors contributed equally. Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1075] arXiv:2206.14797 [pdf, other]
-
Title: 3D-Aware Video GenerationAuthors: Sherwin Bahmani, Jeong Joon Park, Despoina Paschalidou, Hao Tang, Gordon Wetzstein, Leonidas Guibas, Luc Van Gool, Radu TimofteComments: TMLR 2023; Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1076] arXiv:2206.14841 [pdf, other]
-
Title: Causality for Inherently Explainable Transformers: CAT-XPLAINComments: Accepted for spotlight presentation at the Explainable Artificial Intelligence for Computer Vision Workshop at CVPR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [1077] arXiv:2206.14892 [pdf, other]
-
Title: Semantic Unfolding of StyleGAN Latent SpaceComments: Accepted at ICIP22Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1078] arXiv:2206.14923 [pdf, other]
-
Title: On Non-Random Missing Labels in Semi-Supervised LearningJournal-ref: ICLR 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1079] arXiv:2206.14938 [pdf, other]
-
Title: Regularization of NeRFs using differential geometrySubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1080] arXiv:2206.14971 [pdf, other]
-
Title: Boosting 3D Object Detection by Simulating Multimodality on Point CloudsComments: Published in CVPR 2022 as OralSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1081] arXiv:2206.14973 [pdf, other]
-
Title: Benchmarking the Robustness of Deep Neural Networks to Common Corruptions in Digital PathologyComments: MICAAI2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1082] arXiv:2206.14989 [pdf, other]
-
Title: A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQASubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1083] arXiv:2206.14996 [pdf, other]
-
Title: Cross-domain Federated Object DetectionComments: ICME 2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1084] arXiv:2206.15002 [pdf, other]
-
Title: Spatial Transformer Network with Transfer Learning for Small-scale Fine-grained Skeleton-based Tai Chi Action RecognitionComments: 6 pages, 4 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1085] arXiv:2206.15015 [pdf, other]
-
Title: Exploring Temporally Dynamic Data Augmentation for Video RecognitionComments: Technical ReportSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1086] arXiv:2206.15031 [pdf, other]
-
Title: Timestamp-Supervised Action Segmentation with Graph Convolutional NetworksAuthors: Hamza Khan, Sanjay Haresh, Awais Ahmed, Shakeeb Siddiqui, Andrey Konin, M. Zeeshan Zia, Quoc-Huy TranComments: Accepted to IROS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1087] arXiv:2206.15083 [pdf, other]
-
Title: UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical Mask CalibrationComments: Accepted to CVPR2023Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1088] arXiv:2206.15085 [pdf, other]
-
Title: Skeleton-based Action Recognition via Adaptive Cross-Form LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [1089] arXiv:2206.15109 [pdf, ps, other]
-
Title: MKIoU Loss: Towards Accurate Oriented Object Detection in Aerial ImagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1090] arXiv:2206.15128 [pdf, other]
-
Title: Detecting and Recovering Adversarial Examples from Extracting Non-robust and Highly Predictive Adversarial PerturbationsComments: 10 pagesSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1091] arXiv:2206.15138 [pdf, other]
- [1092] arXiv:2206.15154 [pdf, other]
-
Title: BoxGraph: Semantic Place Recognition and Pose Estimation from 3D LiDARComments: Accepted for publication at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1093] arXiv:2206.15157 [pdf, other]
-
Title: HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object DetectionAuthors: Tim Broedermann (1), Christos Sakaridis (1), Dengxin Dai (2), Luc Van Gool (1 and 3) ((1) ETH Zurich, (2) MPI for Informatics, (3) KU Leuven)Comments: IEEE International Conference on Intelligent Transportation Systems (ITSC) 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1094] arXiv:2206.15186 [pdf, other]
-
Title: Out-of-Distribution Detection for Long-tailed and Fine-grained Skin Lesion ImagesComments: Accepted to MICCAI 2022 (top 13% paper; early accept)Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [1095] arXiv:2206.15189 [pdf, other]
-
Title: Multi-Granularity Regularized Re-Balancing for Class Incremental LearningSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1096] arXiv:2206.15248 [pdf, other]
-
Title: CTrGAN: Cycle Transformers GAN for Gait TransferSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1097] arXiv:2206.15255 [pdf, other]
-
Title: Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic SurgeryComments: 11 pages, 4 figures, conferenceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1098] arXiv:2206.15258 [pdf, other]
-
Title: Neural Surface Reconstruction of Dynamic Scenes with Monocular RGB-D CameraComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [1099] arXiv:2206.15268 [pdf, other]
-
Title: Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding ApproachComments: arXiv admin note: text overlap with arXiv:2112.04771Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1100] arXiv:2206.15275 [pdf, other]
-
Title: Multiclass-SGCN: Sparse Graph-based Trajectory Prediction with Agent Class EmbeddingSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1101] arXiv:2206.15282 [pdf, other]
-
Title: TINC: Temporally Informed Non-Contrastive Learning for Disease Progression Modeling in Retinal OCT VolumesAuthors: Taha Emre, Arunava Chakravarty, Antoine Rivail, Sophie Riedl, Ursula Schmidt-Erfurth, Hrvoje BogunovićComments: Accepted at MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1102] arXiv:2206.15296 [pdf, other]
-
Title: Self-SuperFlow: Self-supervised Scene Flow Prediction in Stereo SequencesComments: Accepted at ICIP 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1103] arXiv:2206.15328 [pdf, other]
-
Title: Neural Annotation Refinement: Development of a New 3D Dataset for Adrenal Gland AnalysisComments: MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [1104] arXiv:2206.15349 [pdf, other]
-
Title: Revisiting Competitive Coding Approach for Palmprint Recognition: A Linear Discriminant Analysis PerspectiveComments: 12 pages, 14 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1105] arXiv:2206.15351 [pdf, ps, other]
-
Title: Deep Learning to See: Towards New Foundations of Computer VisionSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1106] arXiv:2206.15353 [pdf, other]
-
Title: Learning Underrepresented Classes from Decentralized Partially Labeled Medical ImagesComments: Accepted by MICCAI 2022Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1107] arXiv:2206.15369 [pdf, other]
-
Title: No Reason for No Supervision: Improved Generalization in Supervised ModelsComments: Accepted to ICLR 2023 (spotlight)Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1108] arXiv:2206.15398 [pdf, other]
-
Title: PolarFormer: Multi-camera 3D Object Detection with Polar TransformerComments: Accepted to AAAI2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
- [1109] arXiv:2206.15415 [pdf, other]
-
Title: MEAD: A Multi-Armed Approach for Evaluation of Adversarial Examples DetectorsComments: This paper has been accepted to appear in the Proceedings of the 2022 European Conference on Machine Learning and Data Mining (ECML-PKDD), 19th to the 23rd of September, Grenoble, FranceSubjects: Computer Vision and Pattern Recognition (cs.CV)
- [1110] arXiv:2206.15436 [pdf, other]
-
Title: Category-Level 6D Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and A New DatasetComments: Project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1111] arXiv:2206.15462 [pdf, other]
-
Title: Improving Visual Grounding by Encouraging Consistent Gradient-based ExplanationsSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [1112] arXiv:2206.15472 [pdf, other]
-
Title: On-Device Training Under 256KB MemoryComments: NeurIPS 2022Subjects: Computer Vision and Pattern Recognition (cs.CV)
- [1113] arXiv:2206.00169 (cross-list from cs.LG) [pdf, other]
-
Title: Discovering the Hidden Vocabulary of DALLE-2Comments: 6 pages, 4 figuresSubjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1114] arXiv:2206.00266 (cross-list from cs.RO) [pdf, other]
-
Title: PaGO-LOAM: Robust Ground-Optimized LiDAR OdometryComments: 7 pages, 5 figures, conferenceSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1115] arXiv:2206.00380 (cross-list from cs.LG) [pdf, other]
-
Title: Strongly Augmented Contrastive ClusteringSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1116] arXiv:2206.00393 (cross-list from cs.SD) [pdf, other]
-
Title: Towards Generalisable Audio Representations for Audio-Visual NavigationComments: CVPR 2022 Embodied AI WorkshopSubjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO); Audio and Speech Processing (eess.AS)
- [1117] arXiv:2206.00432 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Evaluating Gaussian Grasp Maps for Generative Grasping ModelsComments: 9 pages, 6 figures, to be published in IJCNN 2022Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1118] arXiv:2206.00471 (cross-list from cs.LG) [pdf, other]
-
Title: Augmentation Component Analysis: Modeling Similarity via the Augmentation OverlapsComments: Accept to ICLR 2023Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1119] arXiv:2206.00606 (cross-list from cs.LG) [pdf, other]
-
Title: Topological Deep Learning: Going Beyond Graph DataAuthors: Mustafa Hajij, Ghada Zamzmi, Theodore Papamarkou, Nina Miolane, Aldo Guzmán-Sáenz, Karthikeyan Natesan Ramamurthy, Tolga Birdal, Tamal K. Dey, Soham Mukherjee, Shreyas N. Samaga, Neal Livesay, Robin Walters, Paul Rosen, Michael T. SchaubSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Social and Information Networks (cs.SI); Algebraic Topology (math.AT); Machine Learning (stat.ML)
- [1120] arXiv:2206.00621 (cross-list from cs.CL) [pdf, other]
-
Title: Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-trainingComments: ACL 2023Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1121] arXiv:2206.00719 (cross-list from cs.LG) [pdf, other]
-
Title: Dataset Distillation using Neural Feature RegressionComments: NeurIPS 2022 camera-ready versionSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1122] arXiv:2206.00785 (cross-list from cs.DL) [pdf, other]
-
Title: Delivering Document Conversion as a Cloud Service with High Throughput and ResponsivenessAuthors: Christoph Auer (1), Michele Dolfi (1), André Carvalho (2), Cesar Berrospi Ramis (1), Peter W. J. Staar (1) ((1) IBM Research, (2) SoftINSA Lda.)Comments: 11 pages, 7 figures, to be published in IEEE CLOUD 2022Subjects: Digital Libraries (cs.DL); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
- [1123] arXiv:2206.00809 (cross-list from cs.MM) [pdf, other]
-
Title: Distilling Knowledge from Object Classification to Aesthetics AssessmentSubjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
- [1124] arXiv:2206.00843 (cross-list from cs.LG) [pdf, other]
-
Title: DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural NetworksAuthors: Yonggan Fu, Haichuan Yang, Jiayi Yuan, Meng Li, Cheng Wan, Raghuraman Krishnamoorthi, Vikas Chandra, Yingyan LinComments: Accepted at ICML 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1125] arXiv:2206.00845 (cross-list from cs.LG) [pdf, other]
-
Title: Hyperspherical Consistency RegularizationComments: Accepted by CVPR 2022Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1126] arXiv:2206.00913 (cross-list from cs.LG) [pdf, other]
-
Title: Improving the Robustness and Generalization of Deep Neural Network with Confidence Threshold ReductionComments: Under reviewSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1127] arXiv:2206.00941 (cross-list from cs.LG) [pdf, other]
-
Title: Improving Diffusion Models for Inverse Problems using Manifold ConstraintsComments: NeurIPS 2022 camera-ready; 29 pages, 16 figuresSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1128] arXiv:2206.00944 (cross-list from cs.LG) [pdf, other]
-
Title: Feature Space Particle Inference for Neural Network EnsemblesComments: ICML2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1129] arXiv:2206.00991 (cross-list from cs.RO) [pdf, ps, other]
-
Title: StopNet: Scalable Trajectory and Occupancy Prediction for Urban Autonomous DrivingAuthors: Jinkyu Kim, Reza Mahjourian, Scott Ettinger, Mayank Bansal, Brandyn White, Ben Sapp, Dragomir AnguelovJournal-ref: IEEE International Conference on Robotics and Automation 2022Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1130] arXiv:2206.01002 (cross-list from cs.LG) [pdf, other]
-
Title: Introducing One Sided Margin Loss for Solving Classification Problems in Deep NetworksSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1131] arXiv:2206.01094 (cross-list from cs.MM) [pdf, ps, other]
-
Title: A DTCWT-SVD Based Video Watermarking resistant to frame rate conversionSubjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
- [1132] arXiv:2206.01178 (cross-list from cs.LG) [pdf, other]
-
Title: Discretization Invariant Networks for Learning Maps between Neural FieldsComments: Published in Transactions on Machine Learning Research 2023Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [1133] arXiv:2206.01197 (cross-list from cs.LG) [pdf, other]
-
Title: Hard Negative Sampling Strategies for Contrastive Representation LearningSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1134] arXiv:2206.01251 (cross-list from cs.LG) [pdf, other]
-
Title: Using Representation Expressiveness and Learnability to Evaluate Self-Supervised Learning MethodsJournal-ref: TMLR 2023 -- Transactions of Machine Learning Research, 11/2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1135] arXiv:2206.01366 (cross-list from cs.LG) [pdf, other]
-
Title: Supernet Training for Federated Image Classification under System HeterogeneityComments: Oral paper on ICML 22 Workshop: "Dynamic Neural Networks"; Under reviewSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1136] arXiv:2206.01382 (cross-list from cs.DS) [pdf, ps, other]
- [1137] arXiv:2206.01612 (cross-list from cs.LG) [pdf, other]
-
Title: OmniXAI: A Library for Explainable AIComments: Github repo: this https URLSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1138] arXiv:2206.01634 (cross-list from cs.LG) [pdf, other]
-
Title: Reinforcement Learning with Neural Radiance FieldsSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1139] arXiv:2206.01690 (cross-list from cs.LG) [pdf, other]
-
Title: Dynamic Kernel Selection for Improved Generalization and Memory Efficiency in Meta-learningComments: Published at CVPR 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1140] arXiv:2206.01829 (cross-list from cs.LG) [pdf, other]
-
Title: Drawing out of Distribution with Neuro-Symbolic Generative ModelsComments: Preprint. Under review. 25 pagesSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Symbolic Computation (cs.SC)
- [1141] arXiv:2206.01898 (cross-list from cs.LG) [pdf, other]
-
Title: Saliency Attack: Towards Imperceptible Black-box Adversarial AttackSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1142] arXiv:2206.02102 (cross-list from cs.LG) [pdf, other]
-
Title: AUTM Flow: Atomic Unrestricted Time Machine for Monotonic Normalizing FlowsComments: 20 pages, 3 figuresSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
- [1143] arXiv:2206.02131 (cross-list from cs.LG) [pdf, other]
-
Title: Federated Adversarial Training with TransformersSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1144] arXiv:2206.02183 (cross-list from cs.LG) [pdf, other]
-
Title: Functional Ensemble DistillationSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1145] arXiv:2206.02284 (cross-list from cs.SD) [pdf, other]
-
Title: Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous TranslatorAuthors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Jiachen Zhuo, Maureen Stone, Georges El Fakhri, Jonghye WooComments: MICCAI 2022 (early accept, Oral Presentation ~3%)Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
- [1146] arXiv:2206.02286 (cross-list from cs.LG) [pdf, other]
-
Title: AugLoss: A Robust Augmentation-based Fine Tuning MethodologyComments: 10 pages, 6 figures, 6 tablesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1147] arXiv:2206.02353 (cross-list from cs.LG) [pdf, other]
-
Title: Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal DataComments: 36 pages, 5 figures, 9 tables, Survey paperSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1148] arXiv:2206.02409 (cross-list from cs.AI) [pdf, other]
-
Title: Is More Data All You Need? A Causal ExplorationComments: 10 pagesSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [1149] arXiv:2206.02574 (cross-list from cs.LG) [pdf, other]
-
Title: On the duality between contrastive and non-contrastive self-supervised learningAuthors: Quentin Garrido (FAIR, LIGM), Yubei Chen (FAIR), Adrien Bardes (FAIR, WILLOW), Laurent Najman (LIGM), Yann Lecun (FAIR, CIMS)Comments: The Eleventh International Conference on Learning Representations, 2023, Kigali, RwandaSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1150] arXiv:2206.02659 (cross-list from cs.LG) [pdf, other]
-
Title: Robust Fine-Tuning of Deep Neural Networks with Hessian-based Generalization GuaranteesComments: 38 pages. Appeared in ICML 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Statistics Theory (math.ST); Machine Learning (stat.ML)
- [1151] arXiv:2206.02671 (cross-list from cs.SD) [pdf, ps, other]
-
Title: Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Audio-Visual Hearing AidsSubjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
- [1152] arXiv:2206.02792 (cross-list from cs.LG) [pdf, other]
-
Title: FIFA: Making Fairness More Generalizable in Classifiers Trained on Imbalanced DataSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (stat.ML)
- [1153] arXiv:2206.02840 (cross-list from cs.RO) [pdf, other]
-
Title: Spatial Acoustic Projection for 3D Imaging Sonar ReconstructionComments: PreprintJournal-ref: IEEE International Conference on Robotics and Automation (ICRA) 2022Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1154] arXiv:2206.02881 (cross-list from cs.RO) [pdf, other]
-
Title: Mesh-based Dynamics with Occlusion Reasoning for Cloth ManipulationComments: RSS 2022, $\href{this https URL}{\text{project website}}$Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1155] arXiv:2206.02916 (cross-list from cs.LG) [pdf, other]
-
Title: Remember the Past: Distilling Datasets into Addressable Memories for Neural NetworksSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1156] arXiv:2206.02958 (cross-list from cs.LG) [pdf, other]
-
Title: Saliency Cards: A Framework to Characterize and Compare Saliency MethodsComments: Published at FAccT 2023, 19 pages, 8 figures, 2 tablesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1157] arXiv:2206.03083 (cross-list from cs.RO) [pdf, other]
-
Title: Pushing the Limits of Learning-based Traversability Analysis for Autonomous Driving on CPUAuthors: Daniel Fusaro, Emilio Olivastri, Daniele Evangelista, Marco Imperoli, Emanuele Menegatti, Alberto PrettoComments: Accepted to 17th International Conference on Intelligent Autonomous Systems (IAS-17)Journal-ref: Proceedings of the 17th International Conference on Intelligent Autonomous Systems (IAS 2022)Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1158] arXiv:2206.03271 (cross-list from cs.LG) [pdf, other]
-
Title: On the Effectiveness of Fine-tuning Versus Meta-reinforcement LearningSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1159] arXiv:2206.03354 (cross-list from cs.CL) [pdf, other]
-
Title: cViL: Cross-Lingual Training of Vision-Language Models using Knowledge DistillationComments: Accepted at ICPR 2022; 9 pagesSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1160] arXiv:2206.03380 (cross-list from cs.GR) [pdf, other]
-
Title: Shape, Light, and Material Decomposition from Images using Monte Carlo Rendering and DenoisingComments: Project website: this https URLSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [1161] arXiv:2206.03382 (cross-list from cs.DC) [pdf, other]
-
Title: Tutel: Adaptive Mixture-of-Experts at ScaleAuthors: Changho Hwang, Wei Cui, Yifan Xiong, Ziyue Yang, Ze Liu, Han Hu, Zilong Wang, Rafael Salas, Jithin Jose, Prabhat Ram, Joe Chau, Peng Cheng, Fan Yang, Mao Yang, Yongqiang XiongSubjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1162] arXiv:2206.03398 (cross-list from cs.LG) [pdf, other]
-
Title: Towards a General Purpose CNN for Long Range Dependencies in $N$DAuthors: David W. Romero, David M. Knigge, Albert Gu, Erik J. Bekkers, Efstratios Gavves, Jakub M. Tomczak, Mark HoogendoornComments: First two authors contributed equally to this workSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1163] arXiv:2206.03430 (cross-list from cs.RO) [pdf, other]
-
Title: Robot Self-Calibration Using Actuated 3D SensorsAuthors: Arne PetersComments: 15 pages, 9 figuresSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1164] arXiv:2206.03491 (cross-list from cs.AI) [pdf, other]
-
Title: EiX-GNN : Concept-level eigencentrality explainer for graph neural networksSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1165] arXiv:2206.03583 (cross-list from cs.CR) [pdf, other]
-
Title: Contributor-Aware Defenses Against Adversarial Backdoor AttacksSubjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1166] arXiv:2206.03584 (cross-list from cs.CR) [pdf, ps, other]
-
Title: White-box Membership Attack Against Machine Learning Based Retinopathy ClassificationSubjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1167] arXiv:2206.03596 (cross-list from cs.LG) [pdf, other]
-
Title: Neural Network Compression via Effective Filter Analysis and Hierarchical PruningSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1168] arXiv:2206.03739 (cross-list from cs.AI) [pdf, other]
-
Title: Disentangled Ontology Embedding for Zero-shot LearningAuthors: Yuxia Geng, Jiaoyan Chen, Wen Zhang, Yajing Xu, Zhuo Chen, Jeff Z. Pan, Yufeng Huang, Feiyu Xiong, Huajun ChenComments: Accepted by KDD'22Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1169] arXiv:2206.03826 (cross-list from cs.LG) [pdf, other]
-
Title: Towards Understanding Why Mask-Reconstruction Pretraining Helps in Downstream TasksSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
- [1170] arXiv:2206.04006 (cross-list from cs.SD) [pdf, other]
-
Title: Few-Shot Audio-Visual Learning of Environment AcousticsComments: Accepted to NeurIPS 2022Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
- [1171] arXiv:2206.04016 (cross-list from cs.NE) [pdf, other]
-
Title: SYNERgy between SYNaptic consolidation and Experience Replay for general continual learningComments: Accepted at 1st Conference on Lifelong Learning Agents (CoLLAs 2022)Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1172] arXiv:2206.04129 (cross-list from cs.RO) [pdf, other]
-
Title: Receding Moving Object Segmentation in 3D LiDAR Data Using Sparse 4D ConvolutionsComments: Accepted for RA-LSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1173] arXiv:2206.04310 (cross-list from cs.LG) [pdf, other]
-
Title: GSmooth: Certified Robustness against Semantic Transformations via Generalized Randomized SmoothingSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1174] arXiv:2206.04318 (cross-list from cs.MM) [pdf, other]
-
Title: Blind Surveillance Image Quality Assessment via Deep Neural Network Combined with the Visual SaliencySubjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
- [1175] arXiv:2206.04363 (cross-list from cs.MM) [pdf, other]
-
Title: Deep Neural Network for Blind Visual Quality Assessment of 4K ContentAuthors: Wei Lu, Wei Sun, Xiongkuo Min, Wenhan Zhu, Quan Zhou, Jun He, Qiyuan Wang, Zicheng Zhang, Tao Wang, Guangtao ZhaiSubjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
- [1176] arXiv:2206.04459 (cross-list from cs.LG) [pdf, other]
-
Title: SDQ: Stochastic Differentiable Quantization with Mixed PrecisionAuthors: Xijie Huang, Zhiqiang Shen, Shichao Li, Zechun Liu, Xianghong Hu, Jeffry Wicaksana, Eric Xing, Kwang-Ting ChengComments: ICML 2022Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1177] arXiv:2206.04523 (cross-list from cs.CL) [pdf, other]
-
Title: Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of VideosAuthors: Alexander Waibel, Moritz Behr, Fevziye Irem Eyiokur, Dogucan Yaman, Tuan-Nam Nguyen, Carlos Mullov, Mehmet Arif Demirtas, Alperen Kantarcı, Stefan Constantin, Hazım Kemal EkenelSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
- [1178] arXiv:2206.04530 (cross-list from cs.LG) [pdf, other]
-
Title: DORA: Exploring Outlier Representations in Deep Neural NetworksComments: 24 pages, 18 figuresJournal-ref: Published in Transactions on Machine Learning Research (06/2023)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1179] arXiv:2206.04625 (cross-list from cs.LG) [pdf, other]
-
Title: AttX: Attentive Cross-Connections for Fusion of Wearable Signals in Emotion RecognitionComments: 13 pages, 8 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [1180] arXiv:2206.04676 (cross-list from cs.LG) [pdf, other]
-
Title: Extending Momentum Contrast with Cross Similarity Consistency RegularizationComments: IEEE Transactions on Circuits and Systems for Video TechnologySubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1181] arXiv:2206.04677 (cross-list from cs.CR) [pdf, other]
-
Title: On the Permanence of Backdoors in Evolving ModelsSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1182] arXiv:2206.04679 (cross-list from cs.LG) [pdf, other]
-
Title: POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution SamplesComments: Accepted at NeurIPS 2021 (First two authors contribute equally)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1183] arXiv:2206.04756 (cross-list from cs.LG) [pdf, other]
-
Title: An Empirical Study on Disentanglement of Negative-free Contrastive LearningComments: Accepted to NeurIPS 2022; 10 pages main text + 15 pages appendixSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1184] arXiv:2206.04776 (cross-list from cs.LG) [pdf, other]
-
Title: What should AI see? Using the Public's Opinion to Determine the Perception of an AIAuthors: Robin Chan, Radin Dardashti, Meike Osinski, Matthias Rottmann, Dominik Brüggemann, Cilia Rücker, Peter Schlicht, Fabian Hüger, Nikol Rummel, Hanno GottschalkComments: 26 pages, 12 figuresJournal-ref: AI and Ethics (2023)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [1185] arXiv:2206.04779 (cross-list from cs.LG) [pdf, other]
-
Title: Challenges and Opportunities in Offline Reinforcement Learning from Visual ObservationsAuthors: Cong Lu, Philip J. Ball, Tim G. J. Rudner, Jack Parker-Holder, Michael A. Osborne, Yee Whye TehComments: Published at TMLR, 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1186] arXiv:2206.04881 (cross-list from cs.CR) [pdf, other]
-
Title: Enhancing Clean Label Backdoor Attack with Two-phase Specific TriggersSubjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1187] arXiv:2206.04888 (cross-list from cs.MM) [pdf, other]
-
Title: AntPivot: Livestream Highlight Detection via Hierarchical Attention MechanismSubjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
- [1188] arXiv:2206.05008 (cross-list from cs.GR) [pdf, other]
-
Title: Subjective Quality Assessment for Images Generated by Computer GraphicsSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [1189] arXiv:2206.05093 (cross-list from cs.LG) [pdf, other]
-
Title: Federated Momentum Contrastive ClusteringComments: Originally submitted March 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1190] arXiv:2206.05263 (cross-list from cs.LG) [pdf, other]
-
Title: Causal Balancing for Domain GeneralizationComments: Published at ICLR 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1191] arXiv:2206.05266 (cross-list from cs.LG) [pdf, other]
-
Title: Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?Comments: NeurIPS 2022. Code for ELo-SACv3 is at this https URL and code for ELo-Rainbow is at this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1192] arXiv:2206.05323 (cross-list from cs.LG) [pdf, other]
-
Title: Memory Classifiers: Two-stage Classification for Robustness in Machine LearningSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1193] arXiv:2206.05344 (cross-list from cs.GR) [pdf, other]
-
Title: Differentiable Rendering of Neural SDFs through ReparameterizationAuthors: Sai Praveen Bangaru, Michaël Gharbi, Tzu-Mao Li, Fujun Luan, Kalyan Sunkavalli, Miloš Hašan, Sai Bi, Zexiang Xu, Gilbert Bernstein, Frédo DurandSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [1194] arXiv:2206.05365 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Object Detection, Recognition, Deep Learning, and the Universal Law of GeneralizationSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [1195] arXiv:2206.05400 (cross-list from cs.RO) [pdf, ps, other]
-
Title: High-Definition Map Generation Technologies For Autonomous DrivingComments: 25 pages, 17 figures, submitted to a journalSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1196] arXiv:2206.05555 (cross-list from cs.CL) [pdf, other]
-
Title: A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-trainingSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1197] arXiv:2206.05625 (cross-list from cs.AI) [pdf, ps, other]
-
Title: Exploring the Intersection between Neural Architecture Search and Continual LearningSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [1198] arXiv:2206.05649 (cross-list from cs.GR) [pdf, other]
-
Title: TileGen: Tileable, Controllable Material Generation and CaptureAuthors: Xilong Zhou, Miloš Hašan, Valentin Deschaintre, Paul Guerrero, Kalyan Sunkavalli, Nima KalantariComments: 18 pages, 19 figuresSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [1199] arXiv:2206.05687 (cross-list from cs.HC) [pdf, other]
-
Title: DRNet: Decomposition and Reconstruction Network for Remote Physiological MeasurementSubjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1200] arXiv:2206.05751 (cross-list from cs.LG) [pdf, other]
-
Title: Consistent Attack: Universal Adversarial Perturbation on Embodied Vision NavigationJournal-ref: Pattern Recognition Letters (PRL), 2023Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1201] arXiv:2206.05859 (cross-list from cs.LG) [pdf, ps, other]
-
Title: A Directed-Evolution Method for Sparsification and Compression of Neural Networks with Application to Object Identification and Segmentation and considerations of optimal quantization using small number of bitsAuthors: Luiz M Franca-NetoComments: 12 pages total, 5 figures, 2 appendicesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [1202] arXiv:2206.05893 (cross-list from cs.LG) [pdf, other]
-
Title: Deploying Convolutional Networks on Untrusted Platforms Using 2D Holographic Reduced RepresentationsComments: To appear in the Proceedings of the 39 th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1203] arXiv:2206.05930 (cross-list from cs.LG) [pdf, other]
-
Title: Faster Optimization-Based Meta-Learning Adaptation PhaseAuthors: Kostiantyn KhabarlakSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1204] arXiv:2206.06173 (cross-list from eess.SY) [pdf, other]
-
Title: LiVeR: Lightweight Vehicle Detection and Classification in Real-TimeSubjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV)
- [1205] arXiv:2206.06273 (cross-list from cs.CG) [pdf, other]
-
Title: Learning Joint Surface AtlasesSubjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
- [1206] arXiv:2206.06489 (cross-list from cs.AI) [pdf, other]
-
Title: BEHAVIOR in Habitat 2.0: Simulator-Independent Logical Task Description for Benchmarking Embodied AI AgentsSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1207] arXiv:2206.06522 (cross-list from cs.CL) [pdf, other]
-
Title: LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer LearningComments: NeurIPS 2022 (our code is available at: this https URL)Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1208] arXiv:2206.06553 (cross-list from cs.RO) [pdf, other]
-
Title: Safe Output Feedback Motion Planning from Images via Learned Perception Modules and Contraction TheoryComments: Workshop on the Algorithmic Foundations of Robotics (WAFR) XV, 2022, College Park, MD, USASubjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
- [1209] arXiv:2206.06577 (cross-list from cs.GR) [pdf, other]
-
Title: Physics Informed Neural Fields for Smoke Reconstruction with Sparse DataAuthors: Mengyu Chu, Lingjie Liu, Quan Zheng, Erik Franz, Hans-Peter Seidel, Christian Theobalt, Rhaleb ZayerComments: accepted to ACM Transactions On Graphics (SIGGRAPH 2022), further info:\url{this https URL}Journal-ref: ACM Trans. Graph.41, 4 (2022), 119:1-119:14Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1210] arXiv:2206.06662 (cross-list from cs.LG) [pdf, other]
-
Title: Learning Best Combination for Efficient N:M SparsityAuthors: Yuxin Zhang, Mingbao Lin, Zhihang Lin, Yiting Luo, Ke Li, Fei Chao, Yongjian Wu, Rongrong JiComments: Accepted by 36th Conference on Neural Information Processing Systems (NeurIPS 2022)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1211] arXiv:2206.06737 (cross-list from cs.LG) [pdf, other]
-
Title: Adversarial Vulnerability of Randomized EnsemblesComments: Published as a conference paper in ICML 2022Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1212] arXiv:2206.06854 (cross-list from cs.AI) [pdf, other]
-
Title: On the explainable properties of 1-Lipschitz Neural Networks: An Optimal Transport PerspectiveAuthors: Mathieu Serrurier (IRIT-ADRIA, UT), Franck Mamalet (UT), Thomas Fel (UT), Louis Béthune (UT3, UT, IRIT-ADRIA), Thibaut Boissin (UT)Journal-ref: Conference on Neural Information Processing Systems (NeurIPS), Neural Information Processing Systems Foundation, Dec 2023, New Orleans (Louisiana), United StatesSubjects: Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
- [1213] arXiv:2206.06994 (cross-list from cs.AI) [pdf, other]
-
Title: ProcTHOR: Large-Scale Embodied AI Using Procedural GenerationAuthors: Matt Deitke, Eli VanderBilt, Alvaro Herrasti, Luca Weihs, Jordi Salvador, Kiana Ehsani, Winson Han, Eric Kolve, Ali Farhadi, Aniruddha Kembhavi, Roozbeh MottaghiComments: ProcTHOR website: this https URLSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1214] arXiv:2206.07081 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Applications of Generative Adversarial Networks in Neuroimaging and Clinical NeuroscienceAuthors: Rongguang Wang, Vishnu Bashyam, Zhijian Yang, Fanyang Yu, Vasiliki Tassopoulou, Sai Spandana Chintapalli, Ioanna Skampardoni, Lasya P. Sreepada, Dushyant Sahoo, Konstantina Nikita, Ahmed Abdulkadir, Junhao Wen, Christos DavatzikosJournal-ref: NeuroImage 269:119898 (2023)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1215] arXiv:2206.07136 (cross-list from cs.LG) [pdf, other]
-
Title: Automatic Clipping: Differentially Private Deep Learning Made Easier and StrongerComments: accepted to NeurIPS 2023Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1216] arXiv:2206.07137 (cross-list from cs.LG) [pdf, other]
-
Title: Prioritized Training on Points that are Learnable, Worth Learning, and Not Yet LearntAuthors: Sören Mindermann, Jan Brauner, Muhammed Razzak, Mrinank Sharma, Andreas Kirsch, Winnie Xu, Benedikt Höltgen, Aidan N. Gomez, Adrien Morisot, Sebastian Farquhar, Yarin GalComments: ICML 2022Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1217] arXiv:2206.07148 (cross-list from cs.MM) [pdf, other]
-
Title: It's Time for Artistic Correspondence in Music and VideoComments: CVPR 2022Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
- [1218] arXiv:2206.07155 (cross-list from cs.LG) [pdf, other]
-
Title: Self-Supervision on Images and Text Reduces Reliance on Visual Shortcut FeaturesComments: 4 pages, 2 figures, spotlight talk at SCIS workshop, ICML 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1219] arXiv:2206.07173 (cross-list from cs.CY) [pdf, other]
-
Title: Measuring Representational Harms in Image CaptioningComments: ACM Conference on Fairness, Accountability, and Transparency (FAccT) 2022Subjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV)
- [1220] arXiv:2206.07179 (cross-list from cs.LG) [pdf, other]
-
Title: Proximal Splitting Adversarial Attacks for Semantic SegmentationComments: CVPR 2023. Code available at: this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1221] arXiv:2206.07260 (cross-list from cs.LG) [pdf, other]
-
Title: On Enforcing Better Conditioned Meta-Learning for Rapid Few-Shot AdaptationComments: Accepted at NeurIPS 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1222] arXiv:2206.07290 (cross-list from cs.LG) [pdf, other]
-
Title: Differentiable Top-k Classification LearningComments: Published at ICML 2022, Code @ this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1223] arXiv:2206.07387 (cross-list from cs.LG) [pdf, other]
-
Title: The Manifold Hypothesis for Gradient-Based ExplanationsSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1224] arXiv:2206.07538 (cross-list from cs.RO) [pdf, other]
-
Title: Body Gesture Recognition to Control a Social RobotSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [1225] arXiv:2206.07736 (cross-list from cs.LG) [pdf, other]
-
Title: Improving Diversity with Adversarially Learned Transformations for Domain GeneralizationAuthors: Tejas Gokhale, Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Chitta Baral, Yezhou YangComments: WACV 2023. Code: this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1226] arXiv:2206.07741 (cross-list from cs.LG) [pdf, other]
-
Title: Edge Inference with Fully Differentiable Quantized Mixed Precision Neural NetworksSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1227] arXiv:2206.07758 (cross-list from cs.LG) [pdf, other]
-
Title: Reconstructing Training Data from Trained Neural NetworksComments: Fixed a typo in the acknowledgementsSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
- [1228] arXiv:2206.07795 (cross-list from cs.LG) [pdf, other]
-
Title: On Calibrated Model Uncertainty in Deep LearningComments: The European Conference on Machine Learning (ECML PKDD 2020). arXiv admin note: text overlap with arXiv:2103.11214Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1229] arXiv:2206.07898 (cross-list from cs.AI) [pdf, other]
-
Title: Multimodal Dialogue State TrackingComments: Accepted at NAACL 2022 (Oral)Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1230] arXiv:2206.08010 (cross-list from cs.GR) [pdf, other]
-
Title: MoDi: Unconditional Motion Synthesis from Diverse DataAuthors: Sigal Raab, Inbal Leibovitch, Peizhuo Li, Kfir Aberman, Olga Sorkine-Hornung, Daniel Cohen-OrSubjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1231] arXiv:2206.08076 (cross-list from cs.HC) [pdf, other]
-
Title: Learning Effect of Lay People in Gesture-Based Locomotion in Virtual RealitySubjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
- [1232] arXiv:2206.08077 (cross-list from cs.RO) [pdf, other]
-
Title: Neural Scene Representation for Locomotion on Structured TerrainSubjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1233] arXiv:2206.08138 (cross-list from cs.LG) [pdf, other]
-
Title: Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classificationAuthors: Adrian El Baz, Ihsan Ullah, Edesio Alcobaça, André C. P. L. F. Carvalho, Hong Chen, Fabio Ferreira, Henry Gouk, Chaoyu Guan, Isabelle Guyon, Timothy Hospedales, Shell Hu, Mike Huisman, Frank Hutter, Zhengying Liu, Felix Mohr, Ekrem Öztürk, Jan N. van Rijn, Haozhe Sun, Xin Wang, Wenwu ZhuComments: version 2 is the correct version, including supplementary material at the endJournal-ref: NeurIPS 2021 Competition and Demonstration Track, Dec 2021, On-line, United StatesSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [1234] arXiv:2206.08213 (cross-list from cs.LG) [pdf, other]
-
Title: A Closer Look at Smoothness in Domain Adversarial TrainingComments: ICML 2022. Code: this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1235] arXiv:2206.08242 (cross-list from cs.LG) [pdf, other]
-
Title: Catastrophic overfitting can be induced with discriminative non-robust featuresAuthors: Guillermo Ortiz-Jiménez, Pau de Jorge, Amartya Sanyal, Adel Bibi, Puneet K. Dokania, Pascal Frossard, Gregory Rogéz, Philip H.S. TorrComments: Published in Transactions on Machine Learning Research (TMLR)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1236] arXiv:2206.08255 (cross-list from cs.LG) [pdf, other]
-
Title: Gradient-Based Adversarial and Out-of-Distribution DetectionComments: International Conference on Machine Learning (ICML) Workshop on New Frontiers in Adversarial Machine Learning, July 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1237] arXiv:2206.08312 (cross-list from cs.SD) [pdf, other]
-
Title: SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic LearningAuthors: Changan Chen, Carl Schissler, Sanchit Garg, Philip Kobernik, Alexander Clegg, Paul Calamia, Dhruv Batra, Philip W Robinson, Kristen GraumanSubjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
- [1238] arXiv:2206.08316 (cross-list from cs.LG) [pdf, other]
-
Title: Boosting the Adversarial Transferability of Surrogate Models with Dark KnowledgeComments: Accepted at 2023 International Conference on Tools with Artificial Intelligence (ICTAI)Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1239] arXiv:2206.08422 (cross-list from cs.GR) [pdf, ps, other]
-
Title: Real-time motion amplification on mobile devicesAuthors: Henning U. VossComments: Supplemental data at this https URL Changes to v1: Inclusion of offline video processingSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [1240] arXiv:2206.08476 (cross-list from cs.LG) [pdf, other]
-
Title: Zero-Shot AutoML with Pretrained ModelsAuthors: Ekrem Öztürk, Fabio Ferreira, Hadi S. Jomaa, Lars Schmidt-Thieme, Josif Grabocka, Frank HutterJournal-ref: International Conference on Machine Learning 2022Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1241] arXiv:2206.08497 (cross-list from cs.GR) [pdf, other]
-
Title: Unsupervised Kinematic Motion Detection for Part-segmented 3D Shape CollectionsComments: SIGGRAPH 2022Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [1242] arXiv:2206.08517 (cross-list from cs.RO) [pdf, other]
-
Title: ECTLO: Effective Continuous-time Odometry Using Range Image for LiDAR with Small FoVComments: 8 pages, 5 figures. Accepted for publication in the Proceedings of the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1243] arXiv:2206.08522 (cross-list from cs.RO) [pdf, other]
-
Title: VLMbench: A Compositional Benchmark for Vision-and-Language ManipulationSubjects: Robotics (cs.RO); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1244] arXiv:2206.08653 (cross-list from cs.LG) [pdf, other]
-
Title: All Mistakes Are Not Equal: Comprehensive Hierarchy Aware Multi-label Predictions (CHAMP)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1245] arXiv:2206.08684 (cross-list from cs.LG) [pdf, other]
-
Title: Sparse Double Descent: Where Network Pruning Aggravates OverfittingComments: ICML 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1246] arXiv:2206.08704 (cross-list from cs.LG) [pdf, other]
-
Title: Maximum Class Separation as Inductive Bias in One MatrixAuthors: Tejaswi Kasarla, Gertjan J. Burghouts, Max van Spengler, Elise van der Pol, Rita Cucchiara, Pascal MettesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1247] arXiv:2206.08802 (cross-list from cs.LG) [pdf, other]
-
Title: Open-Sampling: Exploring Out-of-Distribution data for Re-balancing Long-tailed datasetsComments: Accepted by ICML 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1248] arXiv:2206.08826 (cross-list from cs.LG) [pdf, other]
-
Title: Multimodal Attention-based Deep Learning for Alzheimer's Disease DiagnosisComments: 11 pages, 5 figuresJournal-ref: Journal of the American Medical Informatics Association, 2022; ocac168Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1249] arXiv:2206.08842 (cross-list from cs.MM) [pdf, other]
-
Title: Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product RetrievalAuthors: Xiao Dong, Xunlin Zhan, Yunchao Wei, Xiaoyong Wei, Yaowei Wang, Minlong Lu, Xiaochun Cao, Xiaodan LiangSubjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Information Retrieval (cs.IR)
- [1250] arXiv:2206.08853 (cross-list from cs.LG) [pdf, other]
-
Title: MineDojo: Building Open-Ended Embodied Agents with Internet-Scale KnowledgeAuthors: Linxi Fan, Guanzhi Wang, Yunfan Jiang, Ajay Mandlekar, Yuncong Yang, Haoyi Zhu, Andrew Tang, De-An Huang, Yuke Zhu, Anima AnandkumarComments: Outstanding Paper Award at NeurIPS 2022. Project website: this https URLSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1251] arXiv:2206.08869 (cross-list from cs.LG) [pdf, other]
-
Title: Fast Lossless Neural Compression with Integer-Only Discrete FlowsComments: Accepted as a conference paper at International Conference on Machine Learning (ICML) 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
- [1252] arXiv:2206.08882 (cross-list from cs.MA) [pdf, other]
-
Title: Edge-Aided Sensor Data Sharing in Vehicular Communication NetworksComments: Accepted for IEEE 95th Vehicular Technology Conference (VTC2022-Spring)Subjects: Multiagent Systems (cs.MA); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [1253] arXiv:2206.08890 (cross-list from cs.LG) [pdf, other]
-
Title: Disentangling Model Multiplicity in Deep LearningComments: 13 pages, 6 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1254] arXiv:2206.08965 (cross-list from cs.AI) [pdf, other]
-
Title: KitBit: A New AI Model for Solving Intelligence Tests and Numerical SeriesComments: 11 pagesJournal-ref: Corsino, V., Gilperez, J. M., & Herrera, L. (2023). "KitBit: A New AI Model for Solving Intelligence Tests and Numerical Series." IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(11), 13893-13903Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1255] arXiv:2206.09012 (cross-list from cs.LG) [pdf, other]
-
Title: Diffusion models as plug-and-play priorsComments: NeurIPS 2022; code: this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1256] arXiv:2206.09034 (cross-list from cs.LG) [pdf, other]
-
Title: Towards Better Selective ClassificationSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1257] arXiv:2206.09059 (cross-list from cs.CL) [pdf, other]
-
Title: CLiMB: A Continual Learning Benchmark for Vision-and-Language TasksAuthors: Tejas Srinivasan, Ting-Yun Chang, Leticia Leonor Pinto Alva, Georgios Chochlakis, Mohammad Rostami, Jesse ThomasonComments: Accepted to NeurIPS 2022 Datasets and Benchmarks trackSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1258] arXiv:2206.09203 (cross-list from cs.AI) [pdf, other]
-
Title: Interactive Visual Reasoning under UncertaintyComments: Accepted at NeurIPS 2023 (Datasets and Benchmarks)Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1259] arXiv:2206.09272 (cross-list from cs.CR) [pdf, other]
-
Title: DECK: Model Hardening for Defending Pervasive BackdoorsAuthors: Guanhong Tao, Yingqi Liu, Siyuan Cheng, Shengwei An, Zhuo Zhang, Qiuling Xu, Guangyu Shen, Xiangyu ZhangSubjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1260] arXiv:2206.09286 (cross-list from cs.GR) [pdf, other]
-
Title: From Universal Humanoid Control to Automatic Physically Valid Character CreationComments: Project page: this https URLSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [1261] arXiv:2206.09359 (cross-list from cs.LG) [pdf, other]
-
Title: Productive Reproducible Workflows for DNNs: A Case Study for Industrial Defect DetectionComments: 7 pages, 5 figures, AccML 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF); Software Engineering (cs.SE)
- [1262] arXiv:2206.09378 (cross-list from cs.CL) [pdf, ps, other]
-
Title: A Self-Guided Framework for Radiology Report GenerationComments: 11 pages, 3 figures, accepted by Medical Image Computing and Computer Assisted Intervention 2022(MICCAI 2022)Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [1263] arXiv:2206.09386 (cross-list from cs.LG) [pdf, other]
-
Title: Scalable Neural Data Server: A Data Recommender for Transfer LearningComments: Neurips 2021Journal-ref: Advances in Neural Information Processing Systems, Volume 34, pages 8984-8997, year 2021Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1264] arXiv:2206.09391 (cross-list from cs.LG) [pdf, other]
-
Title: Towards Adversarial Attack on Vision-Language Pre-training ModelsComments: Accepted by ACM MM2022. Code is available in GitHubSubjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [1265] arXiv:2206.09449 (cross-list from cs.NE) [pdf, other]
-
Title: SNN2ANN: A Fast and Memory-Efficient Training Framework for Spiking Neural NetworksSubjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1266] arXiv:2206.09570 (cross-list from cs.HC) [pdf, other]
-
Title: Guardian Angel: A Novel Walking Aid for the Visually ImpairedComments: 2 pages, 1 figureSubjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
- [1267] arXiv:2206.09616 (cross-list from cs.LG) [pdf, other]
-
Title: Revisiting lp-constrained Softmax Loss: A Comprehensive StudySubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1268] arXiv:2206.09628 (cross-list from cs.LG) [pdf, other]
-
Title: Diversified Adversarial Attacks based on Conjugate Gradient MethodAuthors: Keiichiro Yamamura, Haruki Sato, Nariaki Tateiwa, Nozomi Hata, Toru Mitsutake, Issa Oe, Hiroki Ishikura, Katsuki FujisawaComments: Proceedings of the 39th International Conference on Machine Learning (ICML 2022)Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1269] arXiv:2206.09699 (cross-list from cs.CG) [pdf, other]
-
Title: FoR$^2$M: Recognition and Repair of Foldings in Mesh Surfaces. Application to 3D Object DegradationSubjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [1270] arXiv:2206.09811 (cross-list from cs.LG) [pdf, other]
-
Title: Shapley-NAS: Discovering Operation Contribution for Neural Architecture SearchComments: Accepted to CVPR2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1271] arXiv:2206.09868 (cross-list from cs.LG) [pdf, other]
-
Title: Understanding Robust Learning through the Lens of Representation SimilaritiesAuthors: Christian Cianfarani, Arjun Nitin Bhagoji, Vikash Sehwag, Ben Y. Zhao, Prateek Mittal, Haitao ZhengComments: 35 pages, 29 figures; Accepted to Neurips 2022Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1272] arXiv:2206.09880 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Breaking Down Out-of-Distribution Detection: Many Methods Based on OOD Training Data Estimate a Combination of the Same Core QuantitiesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1273] arXiv:2206.09946 (cross-list from cs.CY) [pdf, ps, other]
-
Title: Short Video Uprising: How #BlackLivesMatter Content on TikTok Challenges the Protest ParadigmComments: Workshop Proceedings of the 16th International AAAI Conference on Web and Social MediaSubjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV)
- [1274] arXiv:2206.10011 (cross-list from cs.LG) [pdf, other]
-
Title: When Does Re-initialization Work?Authors: Sheheryar Zaidi, Tudor Berariu, Hyunjik Kim, Jörg Bornschein, Claudia Clopath, Yee Whye Teh, Razvan PascanuComments: Published in PMLR Volume 187; spotlight presentation at I Can't Believe It's Not Better Workshop at NeurIPS 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1275] arXiv:2206.10244 (cross-list from cs.RO) [pdf, other]
-
Title: Experimental Evaluation of Pose Initialization Methods for Relative Navigation Between Non-Cooperative SatellitesAuthors: Sebastiano Chiodini, Marco Pertile, Pierdomenico Fracchiolla, Andrea Valmorbida, Enrico Lorenzini, Stefano DebeiComments: To be presented at the 2022 IEEE INTERNATIONAL WORKSHOP ON Metrology for AeroSpaceSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1276] arXiv:2206.10249 (cross-list from cs.HC) [pdf, other]
-
Title: Incorporating Voice Instructions in Model-Based Reinforcement Learning for Self-Driving CarsComments: NeurIPS 2021 Workshop on Machine Learning for Autonomous DrivingSubjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [1277] arXiv:2206.10255 (cross-list from eess.SY) [pdf, other]
-
Title: GNN-PMB: A Simple but Effective Online 3D Multi-Object Tracker without Bells and WhistlesComments: accepted by IEEE Transactions on Intelligent VehiclesSubjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV)
- [1278] arXiv:2206.10274 (cross-list from cs.RO) [pdf, other]
-
Title: Attention-driven Active Vision for Efficient Reconstruction of Plants and Targeted Plant PartsSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1279] arXiv:2206.10326 (cross-list from cs.HC) [pdf, other]
-
Title: The Metaverse Data Deluge: What Can We Do About It?Authors: Beng Chin Ooi, Gang Chen, Mike Zheng Shou, Kian-Lee Tan, Anthony Tung, Xiaokui Xiao, James Wei Luen Yip, Meihui ZhangSubjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
- [1280] arXiv:2206.10352 (cross-list from cs.HC) [pdf, other]
-
Title: Psychologically-Inspired, Unsupervised Inference of Perceptual Groups of GUI Widgets from GUI ImagesComments: 12 Pages, accepted to ESEC/FSE '2022Journal-ref: In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2022)Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Software Engineering (cs.SE)
- [1281] arXiv:2206.10365 (cross-list from cs.LG) [pdf, other]
- [1282] arXiv:2206.10421 (cross-list from cs.SD) [pdf, other]
-
Title: Rethinking Audio-visual Synchronization for Active Speaker DetectionComments: Accepted by IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2022)Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
- [1283] arXiv:2206.10480 (cross-list from cs.LG) [pdf, other]
-
Title: Learning to Estimate and Refine Fluid Motion with Physical DynamicsComments: published at ICML 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
- [1284] arXiv:2206.10620 (cross-list from cs.LG) [pdf, other]
-
Title: CoCoPIE XGen: A Full-Stack AI-Oriented Optimizing FrameworkSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Programming Languages (cs.PL)
- [1285] arXiv:2206.10670 (cross-list from cs.RO) [pdf, other]
-
Title: SCIM: Simultaneous Clustering, Inference, and Mapping for Open-World Semantic Scene UnderstandingComments: accepted at ISRR 2022Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1286] arXiv:2206.10797 (cross-list from cs.LG) [pdf, other]
-
Title: Imitation Learning for Generalizable Self-driving Policy with Sim-to-real TransferComments: Accepted by ICLR 2022 Workshop on Generalizable Policy Learning in Physical World. Source code is available at: this https URLSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1287] arXiv:2206.10816 (cross-list from cs.LG) [pdf, other]
-
Title: Fighting Fire with Fire: Avoiding DNN Shortcuts through PrimingComments: 28 pages, 13 figures, ICML2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1288] arXiv:2206.10843 (cross-list from cs.LG) [pdf, other]
-
Title: Learning Debiased Classifier with Biased CommitteeComments: Conference on Neural Information Processing Systems (NeurIPS), New Orleans, 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1289] arXiv:2206.10935 (cross-list from cs.LG) [pdf, other]
-
Title: A Study on the Evaluation of Generative ModelsComments: 13 pagesSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1290] arXiv:2206.11073 (cross-list from cs.NE) [pdf, other]
-
Title: A Unified and Biologically-Plausible Relational Graph Representation of Vision TransformersAuthors: Yuzhong Chen, Yu Du, Zhenxiang Xiao, Lin Zhao, Lu Zhang, David Weizhong Liu, Dajiang Zhu, Tuo Zhang, Xintao Hu, Tianming Liu, Xi JiangComments: 11 pages,7 figures, submitted to 36th Conference on Neural Information Processing Systems (NeurIPS 2022)Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1291] arXiv:2206.11141 (cross-list from cs.RO) [pdf, other]
-
Title: Hybrid Physical Metric For 6-DoF Grasp Pose DetectionComments: 7 pages, 7 figures, accepted by ICRA 2022Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1292] arXiv:2206.11229 (cross-list from cs.IR) [pdf, other]
-
Title: Business Document Information Extraction: Towards Practical BenchmarksComments: Accepted to CLEF 2022Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1293] arXiv:2206.11251 (cross-list from cs.LG) [pdf, other]
-
Title: Behavior Transformers: Cloning $k$ modes with one stoneComments: Code and data available at this https URLSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1294] arXiv:2206.11260 (cross-list from cs.SD) [pdf, other]
-
Title: Few-shot Long-Tailed Bird Audio RecognitionComments: LifeCLEF2022 (best paper award)Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
- [1295] arXiv:2206.11376 (cross-list from cs.RO) [pdf, other]
-
Title: Real-Time Online Skeleton Extraction and Gesture Recognition on PepperSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1296] arXiv:2206.11461 (cross-list from cs.GR) [pdf, other]
-
Title: Towards Better User Studies in Computer Graphics and VisionComments: 18 pages of text, 6 pages of references, 3 figures, 1 tableJournal-ref: Foundations and Trends in Computer Graphics and Vision (2023). Vol. 15: No. 3, pp 201-252Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
- [1297] arXiv:2206.11481 (cross-list from cs.CG) [pdf, ps, other]
-
Title: A Novel Algorithm for Exact Concave Hull ExtractionSubjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
- [1298] arXiv:2206.11488 (cross-list from cs.LG) [pdf, other]
-
Title: On the Importance and Applicability of Pre-Training for Federated LearningComments: Accepted to ICLR 2023Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1299] arXiv:2206.11602 (cross-list from cs.LG) [pdf, other]
-
Title: Prototype-Anchored Learning for Learning with Imperfect AnnotationsComments: ICML 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1300] arXiv:2206.11623 (cross-list from cs.RO) [pdf, other]
-
Title: Waypoint Generation in Row-based Crops with Deep Learning and Contrastive ClusteringComments: Accepted at ECML PKDD 2022Journal-ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022. Lecture Notes in Computer Science(), vol 13718, SpringerSubjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [1301] arXiv:2206.11849 (cross-list from cs.LG) [pdf, other]
-
Title: Sample Condensation in Online Continual LearningComments: Accepted as a conference paper at 2022 International Joint Conference on Neural Networks (IJCNN 2022). Part of 2022 IEEE World Congress on Computational Intelligence (IEEE WCCI 2022)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1302] arXiv:2206.12139 (cross-list from cs.NI) [pdf, other]
-
Title: HARU: Haptic Augmented Reality-Assisted User-Centric Industrial Network PlanningSubjects: Networking and Internet Architecture (cs.NI); Computer Vision and Pattern Recognition (cs.CV)
- [1303] arXiv:2206.12145 (cross-list from cs.RO) [pdf, other]
-
Title: Efficient and Robust Training of Dense Object Nets for Multi-Object Robot ManipulationSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1304] arXiv:2206.12251 (cross-list from cs.CR) [pdf, other]
-
Title: Adversarial Zoom Lens: A Novel Physical-World Attack to DNNsSubjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1305] arXiv:2206.12292 (cross-list from cs.LG) [pdf, other]
-
Title: InfoAT: Improving Adversarial Training Using the Information Bottleneck PrincipleComments: Published in: IEEE Transactions on Neural Networks and Learning Systems ( Early Access )Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1306] arXiv:2206.12322 (cross-list from cs.LG) [pdf, other]
-
Title: How to train accurate BNNs for embedded systems?Journal-ref: Embedded Machine Learning for Cyber-Physical, IoT, and Edge Computing (2023)Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1307] arXiv:2206.12484 (cross-list from cs.LG) [pdf, other]
-
Title: An Intensity and Phase Stacked Analysis of Phase-OTDR System using Deep Transfer Learning and Recurrent Neural NetworksComments: 15 pages, 9 figures. Title updatedSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [1308] arXiv:2206.12649 (cross-list from cs.CL) [pdf, other]
-
Title: Sentiment Analysis with R: Natural Language Processing for Semi-Automated Assessments of Qualitative DataAuthors: Dennis KlinkhammerComments: 14 pages, 6 figuresSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
- [1309] arXiv:2206.12705 (cross-list from cs.LG) [pdf, other]
-
Title: p-Meta: Towards On-device Deep Model AdaptationComments: Published in SIGKDD 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1310] arXiv:2206.12753 (cross-list from cs.DB) [pdf, other]
-
Title: Spatiotemporal Data Mining: A SurveySubjects: Databases (cs.DB); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
- [1311] arXiv:2206.12941 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Object Detection and Tracking with Autonomous UAVSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1312] arXiv:2206.13043 (cross-list from cs.LG) [pdf, other]
-
Title: Automated Systems For Diagnosis of Dysgraphia in Children: A Survey and Novel FrameworkSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1313] arXiv:2206.13387 (cross-list from cs.AI) [pdf, other]
-
Title: ScePT: Scene-consistent, Policy-based Trajectory Predictions for PlanningSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
- [1314] arXiv:2206.13399 (cross-list from cs.LG) [pdf, other]
-
Title: Transfer Learning via Test-Time Neural Networks AggregationComments: 8 pagesJournal-ref: Proceedings of the 17th international joint conference on computer vision, imaging and computer graphics theory and applications, VISIGRAPP 2022, volume 5: VISAPP, online streaming, february 6-8, 2022, 2022, pp. 642-649Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1315] arXiv:2206.13406 (cross-list from cs.RO) [pdf, other]
-
Title: Explicitly incorporating spatial information to recurrent networks for agricultureSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1316] arXiv:2206.13491 (cross-list from cs.LG) [pdf, other]
-
Title: Effective training-time stacking for ensembling of deep neural networksSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1317] arXiv:2206.13497 (cross-list from cs.LG) [pdf, other]
-
Title: Robustness Implies Generalization via Data-Dependent Generalization BoundsComments: Accepted by ICML 2022, and selected for ICML long presentation (top 2% of submissions)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Probability (math.PR); Machine Learning (stat.ML)
- [1318] arXiv:2206.13498 (cross-list from cs.LG) [pdf, other]
-
Title: Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous BehaviorComments: Fixed backdoor localization results, made changes to abstract and introductionSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [1319] arXiv:2206.13499 (cross-list from cs.LG) [pdf, other]
-
Title: Prompting Decision Transformer for Few-Shot Policy GeneralizationComments: ICML 2022. Project page: this https URLSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1320] arXiv:2206.13630 (cross-list from cs.AI) [pdf, ps, other]
-
Title: Toward an ImageNet Library of Functions for Global Optimization BenchmarkingSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1321] arXiv:2206.13687 (cross-list from cs.LG) [pdf, other]
-
Title: POEM: Out-of-Distribution Detection with Posterior SamplingComments: ICML 2022 (Long Talk); First two authors contributed equallyJournal-ref: Thirty-ninth International Conference on Machine Learning (2022)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1322] arXiv:2206.13883 (cross-list from cs.RO) [pdf, other]
-
Title: Improving Worst Case Visual Localization Coverage via Place-specific Sub-selection in Multi-camera SystemsAuthors: Stephen Hausler, Ming Xu, Sourav Garg, Punarjay Chakravarty, Shubham Shrivastava, Ankit Vora, Michael MilfordComments: 8 pages, 5 figures, To be published in RA-L 2022Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1323] arXiv:2206.13932 (cross-list from cs.LG) [pdf, other]
-
Title: Discrete Morse Sandwich: Fast Computation of Persistence Diagrams for Scalar Data -- An Algorithm and A BenchmarkSubjects: Machine Learning (cs.LG); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
- [1324] arXiv:2206.13968 (cross-list from cs.LG) [pdf, other]
-
Title: Information Entropy Initialized Concrete Autoencoder for Optimal Sensor Placement and Reconstruction of Geophysical FieldsComments: 18 pages, 6 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Atmospheric and Oceanic Physics (physics.ao-ph)
- [1325] arXiv:2206.13991 (cross-list from cs.LG) [pdf, other]
-
Title: Increasing Confidence in Adversarial Robustness EvaluationsComments: Oral at CVPR 2022 Workshop (Art of Robustness). Project website this https URLSubjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
- [1326] arXiv:2206.14056 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Deep Neural Networks pruning via the Structured Perspective RegularizationSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
- [1327] arXiv:2206.14085 (cross-list from cs.LG) [pdf, other]
-
Title: Continual Learning with Transformers for Image ClassificationComments: Appeared in CVPR CLVision workshop. arXiv admin note: substantial text overlap with arXiv:2203.04640Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1328] arXiv:2206.14098 (cross-list from cs.LG) [pdf, other]
-
Title: RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid NetworkAuthors: Vitaliy Chiley, Vithursan Thangarasa, Abhay Gupta, Anshul Samar, Joel Hestness, Dennis DeCosteComments: Presented at MLSys 2023. Code available from Cerebras Systems: this https URLSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1329] arXiv:2206.14137 (cross-list from cs.NE) [pdf, ps, other]
-
Title: aSTDP: A More Biologically Plausible LearningAuthors: Shiyuan LiComments: 17 pages, 6 figures. arXiv admin note: text overlap with arXiv:1912.00009Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
- [1330] arXiv:2206.14244 (cross-list from cs.RO) [pdf, other]
-
Title: Masked World Models for Visual ControlAuthors: Younggyo Seo, Danijar Hafner, Hao Liu, Fangchen Liu, Stephen James, Kimin Lee, Pieter AbbeelComments: Project website: this https URL Accepted to CoRL 2022Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1331] arXiv:2206.14256 (cross-list from cs.LG) [pdf, other]
-
Title: GAN-based Intrinsic Exploration For Sample Efficient Reinforcement LearningAuthors: Doğay Kamar (1), Nazım Kemal Üre (1 and 2), Gözde Ünal (1 and 2) ((1) Faculty of Computer and Informatics, Istanbul Technical University (2) Artificial Intelligence and Data Science Research Center, Istanbul Technical University)Journal-ref: International Conference on Agents and Artificial Intelligence - ICAART, Volume 2, 264-272 (2022)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1332] arXiv:2206.14372 (cross-list from cs.RO) [pdf, other]
-
Title: Formalizing and Evaluating Requirements of Perception Systems for Automated Vehicles using Spatio-Temporal Perception LogicComments: 32 pages, 11 figures, 6 tables, 4 algorithms, 2 appendixesSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Formal Languages and Automata Theory (cs.FL)
- [1333] arXiv:2206.14486 (cross-list from cs.LG) [pdf, other]
-
Title: Beyond neural scaling laws: beating power law scaling via data pruningComments: Outstanding Paper Award @ NeurIPS 2022. Added github link to metric scoresSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
- [1334] arXiv:2206.14502 (cross-list from cs.LG) [pdf, other]
-
Title: RegMixup: Mixup as a Regularizer Can Surprisingly Improve Accuracy and Out Distribution RobustnessComments: 22 pages, 18 figuresSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1335] arXiv:2206.14528 (cross-list from cs.RO) [pdf, other]
-
Title: Procrustes Analysis with Deformations: A Closed-Form Solution by Eigenvalue DecompositionComments: Published on International journal of computer vision (IJCV) 2022Journal-ref: International Journal of Computer Vision 130, no. 2 (2022): 567-593Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
- [1336] arXiv:2206.14541 (cross-list from cs.LG) [pdf, other]
-
Title: Why patient data cannot be easily forgotten?Comments: Ruolin Su and Xiao Liu contributed equally. Accepted by MICCAI 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1337] arXiv:2206.14579 (cross-list from cs.CL) [pdf, other]
-
Title: Competence-based Multimodal Curriculum Learning for Medical Report GenerationComments: Accepted by ACL 2021 (Oral)Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1338] arXiv:2206.14581 (cross-list from cs.ET) [pdf, other]
-
Title: On-device Synaptic Memory Consolidation using Fowler-Nordheim Quantum-tunnelingSubjects: Emerging Technologies (cs.ET); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1339] arXiv:2206.14617 (cross-list from cs.GR) [pdf, other]
-
Title: Perspective (In)consistency of Paint by TextAuthors: Hany FaridSubjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
- [1340] arXiv:2206.14658 (cross-list from cs.LG) [pdf, other]
-
Title: Cut Inner Layers: A Structured Pruning Strategy for Efficient U-Net GANsComments: ICML Workshop on Hardware Aware Efficient Training, 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1341] arXiv:2206.14687 (cross-list from cs.LG) [pdf, other]
-
Title: Multi-scale Physical Representations for Approximating PDE Solutions with Graph Neural OperatorsComments: ICLR 2022 Workshop on Geometrical and Topological Representation LearningJournal-ref: ICLR 2022 Workshop on Geometrical and Topological Representation LearningSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1342] arXiv:2206.14709 (cross-list from cs.LG) [pdf, other]
-
Title: An extensible Benchmarking Graph-Mesh dataset for studying Steady-State Incompressible Navier-Stokes EquationsComments: ICLR 2022 Workshop on Geometrical and Topological Representation LearningJournal-ref: ICLR 2022 Workshop on Geometrical and Topological Representation LearningSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
- [1343] arXiv:2206.14854 (cross-list from cs.RO) [pdf, other]
-
Title: Neural Motion Fields: Encoding Grasp Trajectories as Implicit Value FunctionsAuthors: Yun-Chun Chen, Adithyavairavan Murali, Balakumar Sundaralingam, Wei Yang, Animesh Garg, Dieter FoxComments: RSS 2022 Workshop on Implicit Representations for Robotic ManipulationSubjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
- [1344] arXiv:2206.14868 (cross-list from cs.LG) [pdf, other]
-
Title: Teach me how to Interpolate a Myriad of EmbeddingsSubjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
- [1345] arXiv:2206.15007 (cross-list from cs.CL) [pdf, other]
-
Title: GSCLIP : A Framework for Explaining Distribution Shifts in Natural LanguageComments: Accepted by ICML 2022 DataPerfSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1346] arXiv:2206.15170 (cross-list from cs.AI) [pdf, other]
-
Title: LiDAR-as-Camera for End-to-End DrivingSubjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1347] arXiv:2206.15316 (cross-list from cs.LG) [pdf, other]
-
Title: Anomaly Detection in Echocardiograms with Dynamic Variational Trajectory ModelsJournal-ref: Proceedings of the 7th Machine Learning for Healthcare Conference, PMLR 182:425-458, 2022Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computation (stat.CO); Machine Learning (stat.ML)
- [1348] arXiv:2206.15469 (cross-list from cs.RO) [pdf, other]
-
Title: Watch and Match: Supercharging Imitation with Regularized Optimal TransportComments: Code and robot videos are available on this https URLSubjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1349] arXiv:2206.15470 (cross-list from cs.GR) [pdf, other]
-
Title: Dressing Avatars: Deep Photorealistic Appearance for Physically Simulated ClothingAuthors: Donglai Xiang, Timur Bagautdinov, Tuur Stuyck, Fabian Prada, Javier Romero, Weipeng Xu, Shunsuke Saito, Jingfan Guo, Breannan Smith, Takaaki Shiratori, Yaser Sheikh, Jessica Hodgins, Chenglei WuComments: SIGGRAPH Asia 2022 (ACM ToG) camera ready. The supplementary video can be found on this https URLSubjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
- [1350] arXiv:2206.00002 (cross-list from eess.IV) [pdf, other]
-
Title: Calibrated Bagging Deep Learning for Image Semantic Segmentation: A Case Study on COVID-19 Chest X-ray ImageSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1351] arXiv:2206.00041 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Characterization of 3D Printers and X-Ray Computerized TomographyComments: Total 13 Pages, 11 Figures, 5 Tables, 10 ReferencesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1352] arXiv:2206.00105 (cross-list from eess.IV) [pdf, other]
-
Title: Deep learning pipeline for image classification on mobile phonesComments: 20 pagesJournal-ref: 9th International Conference on Artificial Intelligence and Applications (AIAPP 2022)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1353] arXiv:2206.00305 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Supervised Denoising of Diffusion-Weighted Magnetic Resonance Images Using a Convolutional Neural Network and Transfer LearningAuthors: Jakub Jurek, Andrzej Materka, Kamil Ludwisiak, Agata Majos, Kamil Gorczewski, Kamil Cepuch, Agata ZawadzkaComments: Preprint submitted to NeuroImageSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1354] arXiv:2206.00338 (cross-list from eess.IV) [pdf, other]
-
Title: CellCentroidFormer: Combining Self-attention and Convolution for Cell DetectionComments: Accepted at MIUA 2022; Added experiments with CircleNets and extended figure captionsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1355] arXiv:2206.00356 (cross-list from eess.IV) [pdf, other]
-
Title: A Survey on Deep Learning for Skin Lesion SegmentationAuthors: Zahra Mirikharaji, Kumar Abhishek, Alceu Bissoto, Catarina Barata, Sandra Avila, Eduardo Valle, M. Emre Celebi, Ghassan HamarnehComments: Published in Medical Image Analysis (2023); 55 pages, 10 figures; Mirikharaji and Abhishek: Joint first authors; Celebi and Hamarneh: Joint senior authorsJournal-ref: Medical Image Analysis (2023): 102863Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1356] arXiv:2206.00389 (cross-list from eess.IV) [pdf, other]
-
Title: A comparative study between vision transformers and CNNs in digital pathologyAuthors: Luca Deininger, Bernhard Stimpel, Anil Yuce, Samaneh Abbasi-Sureshjani, Simon Schönenberger, Paolo Ocampo, Konstanty Korski, Fabien GaireComments: 8 pages, 2 figures, accepted for workshop T4Vision (CVPR 2022)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1357] arXiv:2206.00455 (cross-list from q-bio.QM) [pdf, ps, other]
-
Title: A robust and lightweight deep attention multiple instance learning algorithm for predicting genetic alterationsSubjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Genomics (q-bio.GN)
- [1358] arXiv:2206.00536 (cross-list from eess.IV) [pdf, other]
-
Title: Impact of loss function in Deep Learning methods for accurate retinal vessel segmentationComments: Paper submitted to MICAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1359] arXiv:2206.00566 (cross-list from eess.IV) [pdf, ps, other]
-
Title: The Fully Convolutional Transformer for Medical Image SegmentationJournal-ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023, pp. 3660-3669Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1360] arXiv:2206.00831 (cross-list from eess.IV) [pdf, other]
-
Title: Dynamic Cardiac MRI Reconstruction Using Combined Tensor Nuclear Norm and Casorati Matrix Nuclear Norm RegularizationsComments: 4 pages, 3 figures, 1 table, accepted in IEEE ISBI 2022Journal-ref: [C]//2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI). IEEE, 2022: 1-4Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1361] arXiv:2206.00850 (cross-list from eess.IV) [pdf, other]
-
Title: Dynamic MRI using Learned Transform-based Tensor Low-Rank Network (LT$^2$LR-Net)Comments: 4 pages, 2 figures, 1 tabel, accepted by IEEE ISBI 2023 ConferenceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1362] arXiv:2206.01088 (cross-list from eess.IV) [pdf, other]
-
Title: Machine Learning-based Lung and Colon Cancer Detection using Deep Feature Extraction and Ensemble LearningAuthors: Md. Alamin Talukder, Md. Manowarul Islam, Md Ashraf Uddin, Arnisha Akhter, Khondokar Fida Hasan, Mohammad Ali MoniComments: Accepted for publication in the Special Issue of Expert Systems with Applications (IF:6.954, Cite:12.70) How to Cite: Md. Alamin Talukder, Md. Manowarul Islam, Md Ashraf Uddin, Arnisha Akhter, Khondokar Fida Hasan, Mohammad Ali Moni. "Machine Learning-based Lung and Colon Cancer Detection using Deep Feature Extraction and Ensemble Learning", Expert Systems with Applications. 2022 Jun 1Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1363] arXiv:2206.01096 (cross-list from eess.IV) [pdf, ps, other]
-
Title: A Dual-fusion Semantic Segmentation Framework With GAN For SAR ImagesComments: 4 pages,4 figures, 2022 IEEE International Geoscience and Remote Sensing SymposiumSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1364] arXiv:2206.01103 (cross-list from eess.IV) [pdf, other]
-
Title: Noise2NoiseFlow: Realistic Camera Noise Modeling without Clean ImagesComments: CVPR 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1365] arXiv:2206.01118 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Comparing Conventional and Deep Feature Models for Classifying Fundus Photography of HemorrhagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1366] arXiv:2206.01344 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Detecting Pulmonary Embolism from Computed Tomography Using Convolutional Neural NetworkSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1367] arXiv:2206.01397 (cross-list from physics.optics) [pdf, other]
-
Title: Dynamic Structured Illumination Microscopy with a Neural Space-time ModelSubjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
- [1368] arXiv:2206.01430 (cross-list from eess.IV) [pdf, other]
-
Title: LenslessPiCam: A Hardware and Software Platform for Lensless Computational Imaging with a Raspberry PiSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1369] arXiv:2206.01644 (cross-list from quant-ph) [pdf, ps, other]
-
Title: Mirror modular cloning and fast quantum associative retrievalSubjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1370] arXiv:2206.01728 (cross-list from eess.IV) [pdf, ps, other]
-
Title: A review of machine learning approaches, challenges and prospects for computational tumor pathologySubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
- [1371] arXiv:2206.01731 (cross-list from eess.IV) [pdf, other]
-
Title: Empirical Study of Quality Image Assessment for Synthesis of Fetal Head Ultrasound Imaging with DCGANsAuthors: Thea Bautista, Jacqueline Matthew, Hamideh Kerdegari, Laura Peralta Pereira, Miguel XochicaleSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1372] arXiv:2206.01735 (cross-list from eess.IV) [pdf, other]
-
Title: Examining the behaviour of state-of-the-art convolutional neural networks for brain tumor detection with and without transfer learningSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1373] arXiv:2206.01736 (cross-list from eess.IV) [pdf, other]
-
Title: Adaptive Adversarial Training to Improve Adversarial Robustness of DNNs for Medical Image Segmentation and DetectionComments: 17 pagesSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1374] arXiv:2206.01737 (cross-list from eess.IV) [pdf, other]
-
Title: MaxStyle: Adversarial Style Composition for Robust Medical Image SegmentationComments: Early accepted by MICCAI 2022 (Camera-ready version)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [1375] arXiv:2206.01738 (cross-list from eess.IV) [pdf, other]
-
Title: RIDDLE: Lidar Data Compression with Range Image Deep Delta EncodingComments: 14 pages, 10 figures; CVPR 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1376] arXiv:2206.01739 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Mutual- and Self- Prototype Alignment for Semi-supervised Medical Image SegmentationComments: 11 pages, 3 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1377] arXiv:2206.01740 (cross-list from eess.IV) [pdf, other]
-
Title: Denoising Fast X-Ray Fluorescence Raster Scans of PaintingsAuthors: Henry Chopp, Alicia McGeachy, Matthias Alfeld, Oliver Cossairt, Marc Walton, Aggelos KatsaggelosSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1378] arXiv:2206.01741 (cross-list from eess.IV) [pdf, other]
-
Title: Patcher: Patch Transformers with Mixture of Experts for Precise Medical Image SegmentationAuthors: Yanglan Ou, Ye Yuan, Xiaolei Huang, Stephen T.C. Wong, John Volpi, James Z. Wang, Kelvin WongComments: MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1379] arXiv:2206.01742 (cross-list from eess.IV) [pdf, other]
-
Title: Learning Probabilistic Topological Representations Using Discrete Morse TheoryComments: 16 pages, 11 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1380] arXiv:2206.01743 (cross-list from eess.IV) [pdf, other]
-
Title: Orthogonal Transform based Generative Adversarial Network for Image DehazingComments: 12 pages, 14 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1381] arXiv:2206.01745 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Detection of Fibrosis in Cine Magnetic Resonance Images Using Artificial Intelligence TechniquesAuthors: Ariel. H. Curiale, Facundo Cabrera, Pablo Jimenez, Jorgelina Medus, GermÁn Mato, MatÍas E. CalandrelliSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1382] arXiv:2206.01746 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Automatic Quantification of Volumes and Biventricular Function in Cardiac Resonance. Validation of a New Artificial Intelligence ApproachAuthors: Ariel H. Curiale, MatÍas E. Calandrelli, Lucca Dellazoppa, Mariano Trevisan, Jorge Luis BociÁn, Juan Pablo Bonifacio, GermÁn MatoSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1383] arXiv:2206.01774 (cross-list from eess.IV) [pdf, other]
-
Title: Monkeypox Image Data collectionComments: This is the attempt of creating monkeypox image dataset collected from various sources and it will continue to update by collectiong samples from journals and other public access domainsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1384] arXiv:2206.01793 (cross-list from eess.IV) [pdf, ps, other]
-
Title: R2U++: A Multiscale Recurrent Residual U-Net with Dense Skip Connections for Medical Image SegmentationComments: Paper accepted in Neural Computing and Applications (2022). Please cite the final version available from Springer website this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1385] arXiv:2206.01826 (cross-list from stat.ME) [pdf, other]
-
Title: The Gamma Generalized Normal Distribution: A Descriptor of SAR ImageryComments: 21 pages, 6 figures, 6 tablesJournal-ref: Journal of Computational and Applied Mathematics, vol. 347, pages 257-272, February 2019Subjects: Methodology (stat.ME); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Statistics Theory (math.ST); Data Analysis, Statistics and Probability (physics.data-an)
- [1386] arXiv:2206.01856 (cross-list from eess.IV) [pdf, other]
-
Title: Poisson2Sparse: Self-Supervised Poisson Denoising From a Single ImageComments: Accepted to MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1387] arXiv:2206.01862 (cross-list from eess.IV) [pdf, other]
-
Title: Image Data collection and implementation of deep learning-based model in detecting Monkeypox disease using modified VGG16Authors: Md Manjurul Ahsan, Muhammad Ramiz Uddin, Mithila Farjana, Ahmed Nazmus Sakib, Khondhaker Al Momin, Shahana Akter LunaSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1388] arXiv:2206.01897 (cross-list from eess.IV) [pdf, other]
-
Title: Modeling of Textures to Predict Immune Cell Status and Survival of Brain Tumour PatientsSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM); Methodology (stat.ME)
- [1389] arXiv:2206.01903 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Radiomic Analysis for Predicting Coronavirus Disease 2019 in Computerized Tomography and X-ray ImagesJournal-ref: IEEE Trans Neural Netw Learn Syst. 2022 Jan;33(1):3-11Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [1390] arXiv:2206.02061 (cross-list from eess.SP) [pdf, other]
-
Title: Low Power Neuromorphic EMG Gesture ClassificationComments: 3 Pages, 5 figures, 1 tableSubjects: Signal Processing (eess.SP); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Neural and Evolutionary Computing (cs.NE)
- [1391] arXiv:2206.02225 (cross-list from eess.IV) [pdf, other]
-
Title: Physically Inspired Constraint for Unsupervised Regularized Ultrasound ElastographyComments: Accepted in MICCAI 2022Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
- [1392] arXiv:2206.02278 (cross-list from eess.IV) [pdf, other]
-
Title: Autoregressive Model for Multi-Pass SAR Change Detection Based on Image StacksAuthors: B. G. Palm, D. I. Alves, V. T. Vu, M. I. Pettersson, F. M. Bayer, R. J. Cintra, R. Machado, P. Dammert, H. HellstenComments: 9 pages, 10 figuresJournal-ref: Proceedings Volume 10789, Image and Signal Processing for Remote Sensing XXIV; 1078916 (2018)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP); Methodology (stat.ME)
- [1393] arXiv:2206.02358 (cross-list from eess.SP) [pdf, other]
-
Title: Implementation of a Modified U-Net for Medical Image Segmentation on Edge DevicesComments: Preprint of paper accepted in IEEE Transactions on Circuits and Systems II: Express BriefSubjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
- [1394] arXiv:2206.02425 (cross-list from eess.IV) [pdf, other]
-
Title: mmFormer: Multimodal Medical Transformer for Incomplete Multimodal Learning of Brain Tumor SegmentationAuthors: Yao Zhang, Nanjun He, Jiawei Yang, Yuexiang Li, Dong Wei, Yawen Huang, Yang Zhang, Zhiqiang He, Yefeng ZhengComments: Accepted to MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1395] arXiv:2206.02510 (cross-list from physics.optics) [pdf, other]
-
Title: Single pixel imaging at high pixel resolutionsComments: Paper accepted to Optics Express on 23/05/2022Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1396] arXiv:2206.02558 (cross-list from q-bio.NC) [pdf, other]
-
Title: Binding Dancers Into AttractorsSubjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1397] arXiv:2206.02748 (cross-list from eess.IV) [pdf, other]
-
Title: Compound Multi-branch Feature Fusion for Real Image RestorationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1398] arXiv:2206.02797 (cross-list from eess.AS) [pdf, ps, other]
-
Title: FedNST: Federated Noisy Student Training for Automatic Speech RecognitionComments: Accepted at Interspeech 2022Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
- [1399] arXiv:2206.02837 (cross-list from eess.IV) [pdf, other]
-
Title: EVAC+: Multi-scale V-net with Deep Feature CRF Layers for Brain ExtractionComments: Replaced with advancements in the model and resultsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1400] arXiv:2206.02838 (cross-list from eess.IV) [pdf, other]
-
Title: Invertible Sharpening Network for MRI Reconstruction EnhancementComments: Accepted by MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1401] arXiv:2206.02959 (cross-list from eess.IV) [pdf, other]
-
Title: HMRNet: High and Multi-Resolution Network with Bidirectional Feature Calibration for Brain Structure Segmentation in RadiotherapyAuthors: Hao Fu, Guotai Wang, Wenhui Lei, Wei Xu, Qianfei Zhao, Shichuan Zhang, Kang Li, Shaoting ZhangComments: 11 pages, 6 figures, Accepted by IEEE JBHISubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1402] arXiv:2206.03003 (cross-list from eess.IV) [pdf, other]
-
Title: Transformer-based Personalized Attention Mechanism for Medical Images with Clinical RecordsAuthors: Yusuke Takagi, Noriaki Hashimoto, Hiroki Masuda, Hiroaki Miyoshi, Koichi Ohshima, Hidekata Hontani, Ichiro TakeuchiJournal-ref: Takagi, Yusuke, et al. "Transformer-based personalized attention mechanism for medical images with clinical records." Journal of Pathology Informatics (2023): 100185Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1403] arXiv:2206.03009 (cross-list from eess.IV) [pdf, other]
-
Title: Self-Knowledge Distillation based Self-Supervised Learning for Covid-19 Detection from Chest X-Ray ImagesComments: Published as a conference paper at ICASSP 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1404] arXiv:2206.03043 (cross-list from eess.IV) [pdf, other]
-
Title: COVIDx CT-3: A Large-scale, Multinational, Open-Source Benchmark Dataset for Computer-aided COVID-19 Screening from Chest CT ImagesComments: 6 pages, MED-NeurIPS 2022 workshopSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1405] arXiv:2206.03049 (cross-list from eess.IV) [pdf, other]
-
Title: Siamese Encoder-based Spatial-Temporal Mixer for Growth Trend Prediction of Lung Nodules on CT ScansAuthors: Jiansheng Fang, Jingwen Wang, Anwei Li, Yuguang Yan, Yonghe Hou, Chao Song, Hongbo Liu, Jiang LiuComments: MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1406] arXiv:2206.03066 (cross-list from quant-ph) [pdf, other]
-
Title: Recent Advances for Quantum Neural Networks in Generative LearningAuthors: Jinkai Tian, Xiaoyu Sun, Yuxuan Du, Shanshan Zhao, Qing Liu, Kaining Zhang, Wei Yi, Wanrong Huang, Chaoyue Wang, Xingyao Wu, Min-Hsiu Hsieh, Tongliang Liu, Wenjing Yang, Dacheng TaoComments: The first two authors contributed equally to this workSubjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1407] arXiv:2206.03247 (cross-list from eess.IV) [pdf, other]
-
Title: Towards better Interpretable and Generalizable AD detection using Collective Artificial IntelligenceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1408] arXiv:2206.03336 (cross-list from eess.IV) [pdf, other]
- [1409] arXiv:2206.03359 (cross-list from eess.IV) [pdf, other]
-
Title: An efficient semi-supervised quality control system trained using physics-based MRI-artefact generators and adversarial trainingAuthors: Daniele Ravi (for the Alzheimer's Disease Neuroimaging Initiative), Frederik Barkhof, Daniel C. Alexander, Lemuel Puglisi, Geoffrey JM Parker, Arman EshaghiJournal-ref: Medical Image Analysis 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1410] arXiv:2206.03413 (cross-list from physics.med-ph) [pdf, ps, other]
-
Title: Deep Learning based Direct Segmentation Assisted by Deformable Image Registration for Cone-Beam CT based Auto-Segmentation for Adaptive RadiotherapySubjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
- [1411] arXiv:2206.03603 (cross-list from eess.IV) [pdf, ps, other]
-
Title: A new method incorporating deep learning with shape priors for left ventricular segmentation in myocardial perfusion SPECT imagesAuthors: Fubao Zhu, Jinyu Zhao, Chen Zhao, Shaojie Tang, Jiaofen Nan, Yanting Li, Zhongqiang Zhao, Jianzhou Shi, Zenghong Chen, Zhixin Jiang, Weihua ZhouComments: 21 pages, 14 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1412] arXiv:2206.03671 (cross-list from eess.IV) [pdf, other]
-
Title: COVIDx CXR-3: A Large-Scale, Open-Source Benchmark Dataset of Chest X-ray Images for Computer-Aided COVID-19 DiagnosticsComments: 5 pages, MED-NeurIPS 2022 workshopSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1413] arXiv:2206.03709 (cross-list from eess.IV) [pdf, other]
-
Title: Hypernetwork-based Personalized Federated Learning for Multi-Institutional CT ImagingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1414] arXiv:2206.03803 (cross-list from eess.IV) [pdf, other]
-
Title: Dual Windows Are Significant: Learning from Mediastinal Window and Focusing on Lung WindowSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1415] arXiv:2206.03830 (cross-list from eess.IV) [pdf, other]
-
Title: Generative Myocardial Motion Tracking via Latent Space Exploration with Biomechanics-informed PriorComments: Under reviewSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1416] arXiv:2206.03900 (cross-list from eess.IV) [pdf, other]
-
Title: Unsupervised Deformable Image Registration with Absent Correspondences in Pre-operative and Post-Recurrence Brain Tumor MRI ScansComments: Accepted by MICCAI2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1417] arXiv:2206.03935 (cross-list from eess.IV) [pdf, other]
-
Title: Dual-Distribution Discrepancy for Anomaly Detection in Chest X-RaysComments: Early Accepted to MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1418] arXiv:2206.03955 (cross-list from stat.ML) [pdf, other]
-
Title: Out-of-Distribution Detection with Class Ratio EstimationSubjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1419] arXiv:2206.04056 (cross-list from eess.IV) [pdf, ps, other]
-
Title: An Improved Deep Convolutional Neural Network by Using Hybrid Optimization Algorithms to Detect and Classify Brain Tumor Using Augmented MRI ImagesComments: Multimed Tools Appl (2022)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1420] arXiv:2206.04145 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Estimation of Speckle Statistics Parametric ImagesComments: Accepted in EMBC 2022Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1421] arXiv:2206.04238 (cross-list from eess.IV) [pdf, other]
-
Title: Cardiac Adipose Tissue Segmentation via Image-Level AnnotationsAuthors: Ziyi Huang, Yu Gan, Theresa Lye, Yanchen Liu, Haofeng Zhang, Andrew Laine, Elsa Angelini, Christine HendonSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1422] arXiv:2206.04272 (cross-list from cond-mat.mes-hall) [pdf, ps, other]
-
Title: STEM image analysis based on deep learning: identification of vacancy defects and polymorphs of ${MoS_2}$Authors: Kihyun Lee, Jinsub Park, Soyeon Choi, Yangjin Lee, Sol Lee, Joowon Jung, Jong-Young Lee, Farman Ullah, Zeeshan Tahir, Yong Soo Kim, Gwan-Hyoung Lee, Kwanpyo KimComments: 24 pages, 5 figuresJournal-ref: Nano Letters, 2022Subjects: Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Materials Science (cond-mat.mtrl-sci); Computer Vision and Pattern Recognition (cs.CV)
- [1423] arXiv:2206.04289 (cross-list from eess.IV) [pdf, other]
-
Title: A No-Reference Deep Learning Quality Assessment Method for Super-resolution Images Based on Frequency MapsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1424] arXiv:2206.04328 (cross-list from eess.IV) [pdf, other]
-
Title: Novel projection schemes for graph-based Light Field codingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [1425] arXiv:2206.04336 (cross-list from eess.IV) [pdf, other]
-
Title: Joint Modeling of Image and Label Statistics for Enhancing Model Generalizability of Medical Image SegmentationComments: MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1426] arXiv:2206.04341 (cross-list from eess.IV) [pdf, other]
-
Title: How Asynchronous Events Encode VideoComments: 6 pages, 4 figuresJournal-ref: 2021 55th Asilomar Conference on Signals, Systems, and ComputersSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1427] arXiv:2206.04431 (cross-list from eess.IV) [pdf, other]
-
Title: Cross-boosting of WNNM Image Denoising method by Directional Wavelet PacketsComments: 30 pages, 28 figures. arXiv admin note: substantial text overlap with arXiv:2008.11595. text overlap with arXiv:2001.04899Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1428] arXiv:2206.04514 (cross-list from eess.IV) [pdf, ps, other]
-
Title: SAR Despeckling using a Denoising Diffusion Probabilistic ModelAuthors: Malsha V. Perera, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. PatelComments: Our code is available at this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1429] arXiv:2206.04548 (cross-list from eess.IV) [pdf, other]
-
Title: Classification of COVID-19 in Chest X-ray Images Using Fusion of Deep Features and LightGBMAuthors: Hamid Nasiri, Ghazal Kheyroddin, Morteza Dorrigiv, Mona Esmaeili, Amir Raeisi Nafchi, Mohsen Haji Ghorbani, Payman Zarkesh-HaComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1430] arXiv:2206.04647 (cross-list from eess.IV) [pdf, other]
-
Title: VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-ResolutionAuthors: Zeyuan Chen, Yinbo Chen, Jingwen Liu, Xingqian Xu, Vidit Goel, Zhangyang Wang, Humphrey Shi, Xiaolong WangComments: Accepted to CVPR 2022. Project page: this http URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1431] arXiv:2206.04681 (cross-list from eess.IV) [pdf, other]
-
Title: Gaussian Fourier Pyramid for Local Laplacian FilterJournal-ref: IEEE Signal Processing Letters (SPL), vol. 29, pp. 11-15, 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [1432] arXiv:2206.04682 (cross-list from eess.IV) [pdf, other]
-
Title: RT-DNAS: Real-time Constrained Differentiable Neural Architecture Search for 3D Cardiac Cine MRI SegmentationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1433] arXiv:2206.04684 (cross-list from eess.IV) [pdf, other]
-
Title: Structure-consistent Restoration Network for Cataract Fundus Image EnhancementSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1434] arXiv:2206.04689 (cross-list from eess.IV) [pdf, ps, other]
-
Title: AI-based Clinical Assessment of Optic Nerve Head Robustness Superseding Biomechanical TestingAuthors: Fabian A. Braeu, Thanadet Chuangsuwanich, Tin A. Tun, Alexandre H. Thiery, Tin Aung, George Barbastathis, Michaël J.A. GirardSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1435] arXiv:2206.04732 (cross-list from eess.IV) [pdf, other]
-
Title: AI-MIA: COVID-19 Detection & Severity Analysis through Medical ImagingComments: arXiv admin note: substantial text overlap with arXiv:2106.07524Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1436] arXiv:2206.04877 (cross-list from eess.IV) [pdf, other]
-
Title: Efficient Per-Shot Convex Hull Prediction By Recurrent LearningSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1437] arXiv:2206.05047 (cross-list from eess.IV) [pdf, other]
-
Title: A GPU-Accelerated Light-field Super-resolution Framework Based on Mixed Noise Model and Weighted RegularizationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
- [1438] arXiv:2206.05049 (cross-list from eess.IV) [pdf, other]
-
Title: Denoising Generalized Expectation-Consistent Approximation for MR Image RecoverySubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
- [1439] arXiv:2206.05054 (cross-list from eess.IV) [pdf, other]
-
Title: A No-reference Quality Assessment Metric for Point Cloud Based on Captured Video SequencesComments: Accepted to IEEE 24th International Workshop on Multimedia Signal Processing, 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1440] arXiv:2206.05092 (cross-list from eess.IV) [pdf, other]
-
Title: Learning self-calibrated optic disc and cup segmentation from multi-rater annotationsAuthors: Junde Wu, Huihui Fang, Fangxin Shang, Zhaowei Wang, Dalu Yang, Wenshuo Zhou, Yehui Yang, Yanwu XuSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1441] arXiv:2206.05148 (cross-list from eess.IV) [pdf, other]
-
Title: Weakly-supervised segmentation using inherently-explainable classification models and their application to brain tumour classificationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1442] arXiv:2206.05236 (cross-list from physics.optics) [pdf, ps, other]
-
Title: Optical Diffraction Tomography based on 3D Physics-Inspired Neural Network (PINN)Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV)
- [1443] arXiv:2206.05277 (cross-list from eess.IV) [pdf, other]
-
Title: Superresolution and Segmentation of OCT scans using Multi-Stage adversarial Guided Attention TrainingAuthors: Paria Jeihouni, Omid Dehzangi, Annahita Amireskandari, Ali Dabouei, Ali Rezai, Nasser M. NasrabadiComments: 5 pages,conferenceSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1444] arXiv:2206.05278 (cross-list from eess.IV) [pdf, other]
-
Title: Dual-Branch Squeeze-Fusion-Excitation Module for Cross-Modality Registration of Cardiac SPECT and CTAuthors: Xiongchao Chen, Bo Zhou, Huidong Xie, Xueqi Guo, Jiazhen Zhang, Albert J. Sinusas, John A. Onofrey, Chi liuComments: 10 pages, 4 figures, accepted at MICCAI 2022Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1445] arXiv:2206.05279 (cross-list from eess.IV) [pdf, other]
-
Title: PILC: Practical Image Lossless Compression with an End-to-end GPU Oriented Neural FrameworkSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
- [1446] arXiv:2206.05283 (cross-list from eess.IV) [pdf, other]
-
Title: Poissonian Blurred Image Deconvolution by Framelet based Local Minimal PriorAuthors: Reza ParvazSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
- [1447] arXiv:2206.05284 (cross-list from eess.IV) [pdf, other]
-
Title: Decoupling Predictions in Distributed Learning for Multi-Center Left Atrial MRI SegmentationComments: Accepted by MICCAI 2022Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1448] arXiv:2206.05288 (cross-list from eess.IV) [pdf, other]
-
Title: From Labels to Priors in Capsule Endoscopy: A Prior Guided Approach for Improving Generalization with Few LabelsSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1449] arXiv:2206.05289 (cross-list from eess.IV) [pdf, other]
-
Title: Localized adversarial artifacts for compressed sensing MRIComments: 14 pages, 7 figuresJournal-ref: SIAM Journal on Imaging Sciences, 16(4):SC14-SC26, 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1450] arXiv:2206.05472 (cross-list from eess.IV) [pdf, other]
-
Title: Differentiable Projection from Optical Coherence Tomography B-Scan without Retinal Layer Segmentation SupervisionComments: ISBI2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1451] arXiv:2206.05516 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Learning-Based MR Image Re-parameterizationComments: A. Narang, A. Raj, M. Pop and M. Ebrahimi, "Deep Learning-Based MR Image Re-parameterization," 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE), Las Vegas, NV, USA, 2023, pp. 536-541, doi: 10.1109/CSCE60160.2023.00094Journal-ref: 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1452] arXiv:2206.05575 (cross-list from eess.IV) [pdf, ps, other]
-
Title: MammoFL: Mammographic Breast Density Estimation using Federated LearningAuthors: Ramya Muthukrishnan, Angelina Heyler, Keshava Katti, Sarthak Pati, Walter Mankowski, Aprupa Alahari, Michael Sanborn, Emily F. Conant, Christopher Scott, Stacey Winham, Celine Vachon, Pratik Chaudhari, Despina Kontos, Spyridon BakasComments: Deep learning, federated learning, mammography, breast density, risk assessmentSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
- [1453] arXiv:2206.05615 (cross-list from eess.IV) [pdf, other]
-
Title: Machine learning approaches for COVID-19 detection from chest X-ray imaging: A Systematic ReviewAuthors: Harold Brayan Arteaga-Arteaga (1), Melissa delaPava (1), Alejandro Mora-Rubio (1), Mario Alejandro Bravo-Ortíz (1), Jesus Alejandro Alzate-Grisales (1), Daniel Arias-Garzón (1), Luis Humberto López-Murillo (2), Felipe Buitrago-Carmona (3), Juan Pablo Villa-Pulgarín (1), Esteban Mercado-Ruiz (1), Simon Orozco-Arias (3 and 4), M. Hassaballah (5), Maria de la Iglesia-Vaya (6), Oscar Cardona-Morales (1), Reinel Tabares-Soto (1) ((1) Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Colombia, (2) Department of Chemical Engineering, Universidad Nacional de Colombia, Manizales, Colombia, (3) Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Colombia, (4) Department of Systems and informatics, Universidad de Caldas, Manizales, Colombia, (5) Faculty of Computers and Information, South Valley University, Qena, Egypt, (6) Unidad Mixta de Imagen Biomédica FISABIO-CIPF, Fundación para el Fomento de la Investigación Sanitario y Biomédica de la Comunidad Valenciana, Valencia, Spain)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1454] arXiv:2206.05618 (cross-list from physics.med-ph) [pdf, other]
-
Title: Synthetic PET via Domain Translation of 3D MRIAuthors: Abhejit Rajagopal, Yutaka Natsuaki, Kristen Wangerin, Mahdjoub Hamdi, Hongyu An, John J. Sunderland, Richard Laforest, Paul E. Kinahan, Peder E.Z. Larson, Thomas A.HopeComments: under reviewSubjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
- [1455] arXiv:2206.05647 (cross-list from eess.IV) [pdf, other]
-
Title: A Fast Alternating Minimization Algorithm for Coded Aperture Snapshot Spectral Imaging Based on Sparsity and Deep Image PriorsComments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibleSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
- [1456] arXiv:2206.05650 (cross-list from eess.IV) [pdf, other]
-
Title: Preprocessing Enhanced Image Compression for Machine VisionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1457] arXiv:2206.05695 (cross-list from eess.IV) [pdf, ps, other]
-
Title: PD-DWI: Predicting response to neoadjuvant chemotherapy in invasive breast cancer with Physiologically-Decomposed Diffusion-Weighted MRI machine-learning modelComments: Accepted to Medical Image Computing and Computer Assisted Intervention - MICCAI 2022 to be held during Sept 18-22 in SingaporeSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1458] arXiv:2206.05782 (cross-list from eess.IV) [pdf, other]
-
Title: DSCA: A Dual-Stream Network with Cross-Attention on Whole-Slide Image Pyramids for Cancer PrognosisComments: 12 pages, 6 figures, 7 tablesJournal-ref: Expert Systems with Applications, 120280 (2023)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1459] arXiv:2206.05935 (cross-list from eess.IV) [pdf, other]
-
Title: Fluorescence angiography classification in colorectal surgery -- A preliminary reportAuthors: Antonio S Soares, Sophia Bano, Neil T Clancy, Laurence B Lovat, Danail Stoyanov, Manish ChandSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1460] arXiv:2206.06065 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Deep ensemble learning for segmenting tuberculosis-consistent manifestations in chest radiographsComments: 13 pages, 6 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1461] arXiv:2206.06070 (cross-list from eess.IV) [pdf, other]
-
Title: Annular Computational Imaging: Capture Clear Panoramic Images through Simple LensComments: Accepted to IEEE Transactions on Computational Imaging (TCI). Code and datasets are publicly available at this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Optics (physics.optics)
- [1462] arXiv:2206.06127 (cross-list from eess.IV) [pdf, other]
-
Title: SyntheX: Scaling Up Learning-based X-ray Image Analysis Through In Silico ExperimentsAuthors: Cong Gao, Benjamin D. Killeen, Yicheng Hu, Robert B. Grupp, Russell H. Taylor, Mehran Armand, Mathias UnberathSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1463] arXiv:2206.06235 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Prostate Cancer Malignancy Detection and localization from mpMRI using auto-Deep Learning: One Step Closer to Clinical UtilizationAuthors: Weiwei Zong, Eric Carver, Simeng Zhu, Eric Schaff, Daniel Chapman, Joon Lee, Hassan Bagher Ebadian, Indrin Chetty, Benjamin Movsas, Winston Wen, Tarik Alafif, Xiangyun ZongComments: arXiv admin note: text overlap with arXiv:1903.12331Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1464] arXiv:2206.06253 (cross-list from eess.IV) [pdf, ps, other]
-
Title: RPLHR-CT Dataset and Transformer Baseline for Volumetric Super-Resolution from CT ScansComments: Accepted MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1465] arXiv:2206.06264 (cross-list from eess.IV) [pdf, other]
-
Title: Automatic Polyp Segmentation with Multiple Kernel Dilated Convolution NetworkJournal-ref: Published CBMS 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1466] arXiv:2206.06267 (cross-list from eess.IV) [pdf, other]
-
Title: MMMNA-Net for Overall Survival Time Prediction of Brain Tumor PatientsComments: Accepted EMBC 2022Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1467] arXiv:2206.06341 (cross-list from eess.IV) [pdf, other]
-
Title: Unsupervised inter-frame motion correction for whole-body dynamic PET using convolutional long short-term memory in a convolutional neural networkAuthors: Xueqi Guo, Bo Zhou, David Pigg, Bruce Spottiswoode, Michael E. Casey, Chi Liu, Nicha C. DvornekComments: Preprint submitted to Medical Image AnalysisSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP); Applications (stat.AP)
- [1468] arXiv:2206.06445 (cross-list from eess.IV) [pdf, other]
-
Title: Fitting Segmentation Networks on Varying Image Resolutions using SplattingAuthors: Mikael Brudfors, Yael Balbastre, John Ashburner, Geraint Rees, Parashkev Nachev, Sebastien Ourselin, M. Jorge CardosoComments: Accepted for MIUA 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1469] arXiv:2206.06448 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Assessing Privacy Leakage in Synthetic 3-D PET Imaging using Transversal GANAuthors: Robert V. Bergen, Jean-Francois Rajotte, Fereshteh Yousefirizi, Arman Rahmim, Raymond T. NgComments: arXiv admin note: text overlap with arXiv:2111.01866Subjects: Image and Video Processing (eess.IV); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1470] arXiv:2206.06541 (cross-list from eess.IV) [pdf, other]
-
Title: Pixel-by-pixel Mean Opinion Score (pMOS) for No-Reference Image Quality AssessmentSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [1471] arXiv:2206.06575 (cross-list from eess.IV) [pdf, other]
-
Title: Med-DANet: Dynamic Architecture Network for Efficient Medical Volumetric SegmentationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1472] arXiv:2206.06598 (cross-list from eess.IV) [pdf, other]
-
Title: CorticalFlow$^{++}$: Boosting Cortical Surface Reconstruction Accuracy, Regularity, and InteroperabilityAuthors: Rodrigo Santa Cruz, Léo Lebrat, Darren Fu, Pierrick Bourgeat, Jurgen Fripp, Clinton Fookes, Olivier SalvadoSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1473] arXiv:2206.06623 (cross-list from eess.IV) [pdf, other]
-
Title: ULTRA: Uncertainty-aware Label Distribution Learning for Breast Tumor Cellularity AssessmentComments: Paper accepted by MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1474] arXiv:2206.06654 (cross-list from eess.IV) [pdf, other]
-
Title: The Kidneys Are Not All Normal: Investigating the Speckle Distributions of Transplanted KidneysAuthors: Rohit Singla, Ricky Hu, Cailin Ringstrom, Victoria Lessoway, Janice Reid, Christopher Nguan, Robert RohlingComments: 25 pages, 2 figures, 3 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
- [1475] arXiv:2206.06657 (cross-list from eess.IV) [pdf, other]
-
Title: The Open Kidney Ultrasound Data SetAuthors: Rohit Singla, Cailin Ringstrom, Grace Hu, Victoria Lessoway, Janice Reid, Christopher Nguan, Robert RohlingComments: 21 pages, 1 figure, 5 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1476] arXiv:2206.06663 (cross-list from q-bio.QM) [pdf, ps, other]
-
Title: Quantitative Imaging Principles Improves Medical Image LearningAuthors: Lambert T. Leong, Michael C. Wong, Yannik Glaser, Thomas Wolfgruber, Steven B. Heymsfield, Peter Sadowski, John A. ShepherdSubjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
- [1477] arXiv:2206.06701 (cross-list from eess.IV) [pdf, other]
-
Title: CNN-based Classification Framework for Lung Tissues with Auxiliary InformationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1478] arXiv:2206.06725 (cross-list from eess.IV) [pdf, other]
-
Title: Automated SSIM Regression for Detection and Quantification of Motion Artefacts in Brain MR ImagesAuthors: Alessandro Sciarra, Soumick Chatterjee, Max Dünnwald, Giuseppe Placidi, Andreas Nürnberger, Oliver Speck, Steffen Oeltze-JafraSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1479] arXiv:2206.06730 (cross-list from eess.IV) [pdf, other]
-
Title: Automated Precision Localization of Peripherally Inserted Central Catheter Tip through Model-Agnostic Multi-Stage NetworksComments: Subin Park and Yoon Ki Cha have contributed equally to this work as the co-first author. Kyung-Su Kim (kskim.doc@gmail.com) and Myung Jin Chung (mj1.chung@samsung.com) have contributed equally to this work as the co-corresponding authorSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1480] arXiv:2206.06813 (cross-list from eess.IV) [pdf, other]
-
Title: Learning towards Synchronous Network Memorizability and Generalizability for Continual Segmentation across Multiple SitesAuthors: Jingyang Zhang, Peng Xue, Ran Gu, Yuning Gu, Mianxin Liu, Yongsheng Pan, Zhiming Cui, Jiawei Huang, Lei Ma, Dinggang ShenComments: Early accepted in MICCAI2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1481] arXiv:2206.06862 (cross-list from q-bio.QM) [pdf, other]
-
Title: Evaluating histopathology transfer learning with ChampKitAuthors: Jakub R. Kaczmarzyk, Tahsin M. Kurc, Shahira Abousamra, Rajarsi Gupta, Joel H. Saltz, Peter K. KooComments: Submitted to NeurIPS 2022 Track on Datasets and Benchmarks. Source code available at this https URLSubjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [1482] arXiv:2206.06947 (cross-list from eess.IV) [pdf, other]
-
Title: K-Space Transformer for Undersampled MRI ReconstructionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1483] arXiv:2206.07122 (cross-list from stat.ML) [pdf, other]
-
Title: Loss Functions for Classification using Structured EntropyAuthors: Brian LucenaSubjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (cs.LG)
- [1484] arXiv:2206.07156 (cross-list from eess.IV) [pdf, other]
-
Title: Federated Multi-organ Segmentation with Inconsistent LabelsComments: v1: 10 pages, 5 figures; v2: 14 pages, 5 figures, accepted by IEEE Transactions on Medical Imaging (TMI), published version available at this https URL, source code available at this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1485] arXiv:2206.07219 (cross-list from eess.IV) [pdf, ps, other]
-
Title: A Projection-Based K-space Transformer Network for Undersampled Radial MRI Reconstruction with Limited Training SubjectsComments: Accepted at MICCAI 2022Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1486] arXiv:2206.07280 (cross-list from eess.IV) [pdf, ps, other]
-
Title: ERNAS: An Evolutionary Neural Architecture Search for Magnetic Resonance Image ReconstructionsComments: 11 pages, 9 figures, and 4 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [1487] arXiv:2206.07281 (cross-list from physics.optics) [pdf, ps, other]
-
Title: Super-resolution image display using diffractive decodersAuthors: Cagatay Isil, Deniz Mengu, Yifan Zhao, Anika Tabassum, Jingxi Li, Yi Luo, Mona Jarrahi, Aydogan OzcanComments: 26 Pages, 9 FiguresJournal-ref: Science Advances (2022)Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Applied Physics (physics.app-ph)
- [1488] arXiv:2206.07364 (cross-list from eess.IV) [pdf, other]
-
Title: Seeking Common Ground While Reserving Differences: Multiple Anatomy Collaborative Framework for Undersampled MRI ReconstructionComments: submitted to an IEEE journalSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1489] arXiv:2206.07388 (cross-list from physics.geo-ph) [pdf, ps, other]
-
Title: Subsurface Depths Structure Maps Reconstruction with Generative Adversarial NetworksAuthors: Dmitry IvlevComments: 12 pages, 12 figures, 1 tableSubjects: Geophysics (physics.geo-ph); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1490] arXiv:2206.07417 (cross-list from eess.IV) [pdf, other]
-
Title: Interpretable differential diagnosis for Alzheimer's disease and Frontotemporal dementiaSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1491] arXiv:2206.07422 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Neural Network Pruning for Nuclei Instance Segmentation in Hematoxylin & Eosin-Stained Histological ImagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1492] arXiv:2206.07481 (cross-list from eess.SP) [pdf, ps, other]
-
Title: A Survey of Detection Methods for Die Attachment and Wire Bonding Defects in Integrated Circuit ManufacturingComments: 13 pages, 9 figures, 8 tablesSubjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1493] arXiv:2206.07494 (cross-list from cond-mat.stat-mech) [pdf, other]
-
Title: Counting Phases and Faces Using Bayesian Thermodynamic IntegrationComments: 20 pages, 9 figures, plus appendix with additional figuresSubjects: Statistical Mechanics (cond-mat.stat-mech); Disordered Systems and Neural Networks (cond-mat.dis-nn); Computer Vision and Pattern Recognition (cs.CV); Data Analysis, Statistics and Probability (physics.data-an)
- [1494] arXiv:2206.07542 (cross-list from q-bio.NC) [pdf, other]
-
Title: A Deep Generative Model of Neonatal Cortical Surface DevelopmentSubjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
- [1495] arXiv:2206.07595 (cross-list from eess.IV) [pdf, ps, other]
-
Title: BIO-CXRNET: A Robust Multimodal Stacking Machine Learning Technique for Mortality Risk Prediction of COVID-19 Patients using Chest X-Ray Images and Clinical DataAuthors: Tawsifur Rahman, Muhammad E. H. Chowdhury, Amith Khandakar, Zaid Bin Mahbub, Md Sakib Abrar Hossain, Abraham Alhatou, Eynas Abdalla, Sreekumar Muthiyal, Khandaker Farzana Islam, Saad Bin Abul Kashem, Muhammad Salman Khan, Susu M. Zughaier, Maqsud HossainComments: 25 pages, 8 Tables, 10 FiguresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1496] arXiv:2206.07599 (cross-list from eess.IV) [pdf, other]
-
Title: How GNNs Facilitate CNNs in Mining Geometric Information from Large-Scale Medical ImagesComments: 21 pagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1497] arXiv:2206.07664 (cross-list from eess.IV) [pdf, other]
-
Title: CRISP - Reliable Uncertainty Estimation for Medical Image SegmentationAuthors: Thierry Judge, Olivier Bernard, Mihaela Porumb, Agis Chartsias, Arian Beqiri, Pierre-Marc JodoinComments: 9 pagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1498] arXiv:2206.08019 (cross-list from eess.IV) [pdf, other]
-
Title: Multi-View Imputation and Cross-Attention Network Based on Incomplete Longitudinal and Multimodal Data for Conversion Prediction of Mild Cognitive ImpairmentSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1499] arXiv:2206.08023 (cross-list from eess.IV) [pdf, other]
-
Title: AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image SegmentationAuthors: Yuanfeng Ji, Haotian Bai, Jie Yang, Chongjian Ge, Ye Zhu, Ruimao Zhang, Zhen Li, Lingyan Zhang, Wanling Ma, Xiang Wan, Ping LuoSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1500] arXiv:2206.08078 (cross-list from eess.IV) [pdf, other]
-
Title: U-PET: MRI-based Dementia Detection with Joint Generation of Synthetic FDG-PET ImagesAuthors: Marcel Kollovieh, Matthias Keicher, Stephan Wunderlich, Hendrik Burwinkel, Thomas Wendler, Nassir NavabSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1501] arXiv:2206.08272 (cross-list from eess.IV) [pdf, other]
-
Title: Longitudinal detection of new MS lesions using Deep LearningComments: preprintSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1502] arXiv:2206.08277 (cross-list from astro-ph.EP) [pdf, other]
-
Title: A machine-generated catalogue of Charon's craters and implications for the Kuiper beltAuthors: Mohamad Ali-DibComments: 16 pages, 2 figures, accepted for publication in IcarusSubjects: Earth and Planetary Astrophysics (astro-ph.EP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1503] arXiv:2206.08298 (cross-list from eess.IV) [pdf, other]
-
Title: Video Capsule Endoscopy Classification using Focal Modulation Guided Convolutional Neural NetworkJournal-ref: CBMS 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1504] arXiv:2206.08308 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Deepfake histological images for enhancing digital pathologyAuthors: Kianoush Falahkheirkhah, Saumya Tiwari, Kevin Yeh, Sounak Gupta, Loren Herrera-Hernandez, Michael R. McCarthy, Rafael E. Jimenez, John C. Cheville, Rohit BhargavaSubjects: Image and Video Processing (eess.IV); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1505] arXiv:2206.08398 (cross-list from eess.IV) [pdf, other]
-
Title: Learning Generic Lung Ultrasound Biomarkers for Decoupling Feature Extraction from Downstream TasksAuthors: Gautam Rajendrakumar Gare, Tom Fox, Pete Lowery, Kevin Zamora, Hai V. Tran, Laura Hutchins, David Montgomery, Amita Krishnan, Deva Kannan Ramanan, Ricardo Luis Rodriguez, Bennett P deBoisblanc, John Michael GaleottiSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1506] arXiv:2206.08439 (cross-list from eess.IV) [pdf, other]
-
Title: OpenSRH: optimizing brain tumor surgery using intraoperative stimulated Raman histologyAuthors: Cheng Jiang, Asadur Chowdury, Xinhai Hou, Akhil Kondepudi, Christian W. Freudiger, Kyle Conway, Sandra Camelo-Piragua, Daniel A. Orringer, Honglak Lee, Todd C. HollonComments: Neural Information Processing Systems (NeurIPS) 2022 Datasets and Benchmarks TrackSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1507] arXiv:2206.08481 (cross-list from eess.IV) [pdf, other]
-
Title: Orientation-guided Graph Convolutional Network for Bone Surface SegmentationAuthors: Aimon Rahman, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Ilker Hacihaliloglu, Vishal M PatelComments: Accepted at MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1508] arXiv:2206.08543 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Multi-Classification of Brain Tumor Images Using Transfer Learning Based Deep Neural NetworkComments: 7 pages, 4 figures, 2 tables, International Virtual Conference on ARTIFICIAL INTELLIGENCE FOR SMART COMMUNITY, MalaysiaJournal-ref: Conference proceedings \c{opyright} 2023 International Conference on Artificial Intelligence for Smart CommunitySubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1509] arXiv:2206.08557 (cross-list from eess.IV) [pdf, other]
-
Title: COVID-19 Detection using Transfer Learning with Convolutional Neural NetworkComments: 4 pages, 4 figures, 2nd International Conference on Robotics, Electrical and Signal Processing Techniques (ICREST), DHAKA, BangladeshJournal-ref: 2nd International Conference on Robotics, Electrical and Signal Processing Techniques (ICREST), DHAKA, Bangladesh, 2021, pp. 429-432Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1510] arXiv:2206.08612 (cross-list from eess.IV) [pdf, other]
-
Title: OADAT: Experimental and Synthetic Clinical Optoacoustic Data for Standardized Image ProcessingComments: Accepted to TMLR. 32 pages, 24 figures, 9 tablesJournal-ref: Transactions on Machine Learning Research (2023) 2835-8856Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1511] arXiv:2206.08671 (cross-list from stat.ML) [pdf, other]
-
Title: FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image ClassificationAuthors: Aliaksandra Shysheya, John Bronskill, Massimiliano Patacchiola, Sebastian Nowozin, Richard E TurnerJournal-ref: The Eleventh International Conference on Learning Representations (ICLR 2023)Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1512] arXiv:2206.08787 (cross-list from eess.IV) [pdf, other]
-
Title: Leveraging Uncertainty in Deep Learning for Pancreatic Adenocarcinoma GradingComments: 26th UK Conference on Medical Image Understanding and Analysis; 27 - 29 July 2022; University of Cambridge, UK. arXiv admin note: text overlap with arXiv:2003.10769Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1513] arXiv:2206.08885 (cross-list from eess.IV) [pdf, other]
-
Title: Incorporating intratumoral heterogeneity into weakly-supervised deep learning models via variance poolingAuthors: Iain Carmichael, Andrew H. Song, Richard J. Chen, Drew F.K. Williamson, Tiffany Y. Chen, Faisal MahmoodComments: MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Methodology (stat.ME)
- [1514] arXiv:2206.08936 (cross-list from eess.IV) [pdf, other]
-
Title: Simultaneous Bone and Shadow Segmentation Network using Task Correspondence ConsistencyComments: Accepted at MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1515] arXiv:2206.08984 (cross-list from eess.IV) [pdf, other]
-
Title: Multi-scale Super-resolution Magnetic Resonance Spectroscopic Imaging with Adjustable SharpnessAuthors: Siyuan Dong, Gilbert Hangel, Wolfgang Bogner, Georg Widhalm, Karl Rössler, Siegfried Trattnig, Chenyu You, Robin de Graaf, John Onofrey, James DuncanComments: Accepted by MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1516] arXiv:2206.08985 (cross-list from eess.IV) [pdf, other]
-
Title: TransResU-Net: Transformer based ResU-Net for Real-Time Colonoscopy Polyp SegmentationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1517] arXiv:2206.08994 (cross-list from stat.ML) [pdf, other]
-
Title: Robust Group Synchronization via Quadratic ProgrammingComments: Accepted to ICML 2022Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Numerical Analysis (math.NA)
- [1518] arXiv:2206.09065 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Free-form Lesion Synthesis Using a Partial Convolution Generative Adversarial Network for Enhanced Deep Learning Liver Tumor SegmentationComments: The paper is under review by JACMP-Journal of Applied Medical PhysicsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1519] arXiv:2206.09128 (cross-list from eess.IV) [pdf, other]
-
Title: A Combined PCA-MLP Network for Early Breast Cancer DetectionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1520] arXiv:2206.09146 (cross-list from eess.IV) [pdf, other]
-
Title: A Perceptually Optimized and Self-Calibrated Tone Mapping OperatorComments: 15 pages,17 figuresSubjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1521] arXiv:2206.09193 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Multi-Modality Image Super-Resolution using Generative Adversarial NetworksComments: to be published in the Proceedings of 16th International Conference on Computer Graphics, Visualization, Computer Vision and Image Processing (CGVCVIP), Lisbon, Portugal, July 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1522] arXiv:2206.09210 (cross-list from eess.IV) [pdf, other]
-
Title: Multi-Modality Image Inpainting using Generative Adversarial NetworksComments: to be published in the Proceedings of 26th Int'l Conf on Image Processing, Computer Vision, & Pattern Recognition (IPCV), July 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1523] arXiv:2206.09309 (cross-list from eess.IV) [pdf, other]
-
Title: TBraTS: Trusted Brain Tumor SegmentationComments: 11 pages, 4 figures, Accepted by MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1524] arXiv:2206.09611 (cross-list from eess.IV) [pdf, other]
-
Title: SJ-HD^2R: Selective Joint High Dynamic Range and Denoising Imaging for Dynamic ScenesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1525] arXiv:2206.09867 (cross-list from eess.SP) [pdf, other]
-
Title: WiFi-based Spatiotemporal Human Action PerceptionSubjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
- [1526] arXiv:2206.10152 (cross-list from physics.optics) [pdf, ps, other]
-
Title: Diffractive Interconnects: All-Optical Permutation Operation Using Diffractive NetworksComments: 22 Pages, 6 FiguresJournal-ref: Nanophotonics (2022)Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
- [1527] arXiv:2206.10183 (cross-list from eess.IV) [pdf, ps, other]
-
Title: covEcho Resource constrained lung ultrasound image analysis tool for faster triaging and active learningAuthors: Jinu Joseph, Mahesh Raveendranatha Panicker, Yale Tung Chen, Kesavadas Chandrasekharan, Vimal Chacko Mondy, Anoop Ayyappan, Jineesh Valakkada, Kiran Vishnu NarayanComments: Submitted to Elsevier CMPBUP on Dec 1, 2021Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1528] arXiv:2206.10286 (cross-list from eess.IV) [pdf, other]
-
Title: Position-prior Clustering-based Self-attention Module for Knee Cartilage SegmentationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1529] arXiv:2206.10294 (cross-list from eess.IV) [pdf, other]
-
Title: Using the Polar Transform for Efficient Deep Learning-Based Aorta Segmentation in CTA ImagesComments: Accepted to 64th International Symposium ELMAR-2022, Zadar, CroatiaSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1530] arXiv:2206.10357 (cross-list from eess.IV) [pdf, other]
-
Title: Confidence-Guided Unsupervised Domain Adaptation for Cerebellum SegmentationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1531] arXiv:2206.10455 (cross-list from eess.IV) [src]
-
Title: Automated Coronary Calcium Scoring using U-Net Models through Semi-supervised Learning on Non-Gated CT ScansAuthors: Sanskriti SinghComments: There is no correlation between gated and non-gated CT scans causing the points used in the training and results to be flawed. It was inaccurately assumed that there was a correlation between the scansSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1532] arXiv:2206.10543 (cross-list from eess.IV) [pdf, other]
-
Title: Faster Diffusion Cardiac MRI with Deep Learning-based breath hold reductionAuthors: Michael Tanzer, Pedro Ferreira, Andrew Scott, Zohya Khalique, Maria Dwornik, Dudley Pennell, Guang Yang, Daniel Rueckert, Sonia Nielles-VallespinComments: 15 pages, 1 figures, 2 tables. To be published in MIUA22Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1533] arXiv:2206.10750 (cross-list from eess.SP) [pdf, other]
-
Title: Floor Map Reconstruction Through Radio Sensing and Learning By a Large Intelligent SurfaceAuthors: Cristian J. Vaca-Rubio, Roberto Pereira, Xavier Mestre, David Gregoratti, Zheng-Hua Tan, Elisabeth de Carvalho, Petar PopovskiSubjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
- [1534] arXiv:2206.10802 (cross-list from eess.IV) [pdf, other]
-
Title: SVoRT: Iterative Transformer for Slice-to-Volume Registration in Fetal Brain MRIAuthors: Junshen Xu, Daniel Moyer, P. Ellen Grant, Polina Golland, Juan Eugenio Iglesias, Elfar AdalsteinssonComments: Accepted by MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1535] arXiv:2206.10810 (cross-list from eess.IV) [pdf, other]
-
Title: A Simple Baseline for Video Restoration with Grouped Spatial-temporal ShiftAuthors: Dasong Li, Xiaoyu Shi, Yi Zhang, Ka Chun Cheung, Simon See, Xiaogang Wang, Hongwei Qin, Hongsheng LiComments: Accepted to CVPR2023Journal-ref: 2023 Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern RecognitionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1536] arXiv:2206.10911 (cross-list from eess.IV) [pdf, other]
-
Title: Influence of uncertainty estimation techniques on false-positive reduction in liver lesion detectionComments: Accepted for publication in the Journal of Machine Learning for Biomedical Imaging (MELBA)Journal-ref: https://www.melba-journal.org/papers/2022:030.htmlSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1537] arXiv:2206.10912 (cross-list from eess.IV) [pdf, ps, other]
-
Title: AI-based software for lung nodule detection in chest X-rays -- Time for a second reader approach?Authors: Susanne Ohlmann-Knafo, Naglis Ramanauskas, Sebastian Huettinger, Emil Johnson Jeyakumar, Darius Barušauskas, Neringa Bielskienė, Vytautas Naujalis, Jonas Bialopetravičius, Jonas Ražanskas, Artūras Samuilis, Jūratė Dementavičienė, Dirk PickuthComments: This paper is in submission process to the European Radiology journalSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1538] arXiv:2206.11048 (cross-list from eess.IV) [pdf, other]
-
Title: Automated GI tract segmentation using deep learningAuthors: Manhar SharmaComments: 8 pages, 9 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1539] arXiv:2206.11127 (cross-list from eess.IV) [pdf, ps, other]
-
Title: CNN-based fully automatic wrist cartilage volume quantification in MR ImageAuthors: Nikita Vladimirov, Ekaterina Brui, Anatoliy Levchuk, Vladimir Fokin, Aleksandr Efimtcev, David BendahanComments: 17 pages, 6 Figures, 6 Tables, 1 SuplementarySubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
- [1540] arXiv:2206.11458 (cross-list from eess.IV) [pdf, other]
-
Title: Weighted Concordance Index Loss-based Multimodal Survival Modeling for Radiation Encephalopathy Assessment in Nasopharyngeal Carcinoma RadiotherapyAuthors: Jiansheng Fang, Anwei Li, Pu-Yun OuYang, Jiajian Li, Jingwen Wang, Hongbo Liu, Fang-Yun Xie, Jiang LiuComments: 11 pages, 3 figures, MICCAI2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [1541] arXiv:2206.11501 (cross-list from eess.IV) [pdf, other]
-
Title: A novel adversarial learning strategy for medical image classificationAuthors: Zong Fan, Xiaohui Zhang, Jacob A. Gasienica, Jennifer Potts, Su Ruan, Wade Thorstad, Hiram Gay, Pengfei Song, Xiaowei Wang, Hua LiSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1542] arXiv:2206.11599 (cross-list from eess.IV) [pdf, other]
-
Title: Universal Learned Image Compression With Low Computational CostComments: 5 pagesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1543] arXiv:2206.11669 (cross-list from physics.ao-ph) [pdf, other]
-
Title: Short-range forecasts of global precipitation using deep learning-augmented numerical weather predictionAuthors: Manmeet Singh, Vaisakh S B, Nachiketa Acharya, Aditya Grover, Suryachandra A Rao, Bipin Kumar, Zong-Liang Yang, Dev NiyogiComments: Accepted at Tackling Climate Change with Machine Learning: workshop at NeurIPS 2022Subjects: Atmospheric and Oceanic Physics (physics.ao-ph); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [1544] arXiv:2206.11943 (cross-list from eess.IV) [pdf, other]
-
Title: TIAger: Tumor-Infiltrating Lymphocyte Scoring in Breast Cancer for the TiGER ChallengeAuthors: Adam Shephard, Mostafa Jahanifar, Ruoyu Wang, Muhammad Dawood, Simon Graham, Kastytis Sidlauskas, Syed Ali Khurram, Nasir Rajpoot, Shan E Ahmed RazaComments: TiGER Challenge entrySubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1545] arXiv:2206.12112 (cross-list from eess.IV) [pdf, other]
-
Title: Dissecting U-net for Seismic Application: An In-Depth Study on Deep Learning Multiple RemovalSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1546] arXiv:2206.12136 (cross-list from eess.IV) [pdf, other]
-
Title: Feature Representation Learning for Robust Retinal Disease Detection from Optical Coherence Tomography ImagesAuthors: Sharif Amit Kamran, Khondker Fariha Hossain, Alireza Tavakkoli, Stewart Lee Zuckerbrod, Salah A. BakerComments: Accepted to MICCAI2022 Ophthalmic Medical Image Analysis (OMIA) WorkshopSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1547] arXiv:2206.12300 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Automatic extraction of coronary arteries using deep learning in invasive coronary angiogramsComments: 22 pages,5 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1548] arXiv:2206.12344 (cross-list from eess.IV) [pdf, other]
-
Title: Segmentation-free PVC for Cardiac SPECT using a Densely-connected Multi-dimensional Dynamic NetworkAuthors: Huidong Xie, Zhao Liu, Luyao Shi, Kathleen Greco, Xiongchao Chen, Bo Zhou, Attila Feher, John C. Stendahl, Nabil Boutagy, Tassos C. Kyriakides, Ge Wang, Albert J. Sinusas, Chi LiuComments: 12 pages, 11 figures. Accepted for publication at IEEE Transactions on Medical ImagingSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1549] arXiv:2206.12407 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Independent evaluation of state-of-the-art deep networks for mammographyComments: 17 pages, 8 figures, 4 tablesSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
- [1550] arXiv:2206.12417 (cross-list from eess.IV) [pdf, other]
-
Title: Deep embedded clustering algorithm for clustering PACS repositoriesJournal-ref: Proceedings of the 2021 IEEE 34th International Symposium on Computer-Based Medical Systems (CBMS)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1551] arXiv:2206.12512 (cross-list from eess.IV) [pdf, other]
-
Title: Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge FindingsAuthors: Sophia Bano, Alessandro Casella, Francisco Vasconcelos, Abdul Qayyum, Abdesslam Benzinou, Moona Mazher, Fabrice Meriaudeau, Chiara Lena, Ilaria Anita Cintorrino, Gaia Romana De Paolis, Jessica Biagioli, Daria Grechishnikova, Jing Jiao, Bizhe Bai, Yanyan Qiao, Binod Bhattarai, Rebati Raman Gaire, Ronast Subedi, Eduard Vazquez, Szymon Płotka, Aneta Lisowska, Arkadiusz Sitek, George Attilakos, Ruwan Wimalasundera, Anna L David, Dario Paladini, Jan Deprest, Elena De Momi, Leonardo S Mattos, Sara Moccia, Danail StoyanovComments: Accepted at MedIA (Medical Image Analysis)Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1552] arXiv:2206.12809 (cross-list from eess.SP) [pdf, other]
-
Title: Role and Integration of Image Processing Systems in Maritime Target TrackingSubjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
- [1553] arXiv:2206.12815 (cross-list from eess.IV) [pdf, other]
-
Title: Breast Cancer Classification using Deep Learned Features Boosted with Handcrafted FeaturesJournal-ref: Biomedical Signal Processing and Control 2023Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1554] arXiv:2206.12980 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Detecting Schizophrenia with 3D Structural Brain MRI Using Deep LearningAuthors: Junhao Zhang, Vishwanatha M. Rao, Ye Tian, Yanting Yang, Nicolas Acosta, Zihan Wan, Pin-Yu Lee, Chloe Zhang, Lawrence S. Kegeles, Scott A. Small, Jia GuoComments: 13 pages, 6 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
- [1555] arXiv:2206.13086 (cross-list from stat.ML) [pdf, other]
-
Title: RankSEG: A Consistent Ranking-based Framework for SegmentationComments: 50 pagesJournal-ref: Journal of Machine Learning Research, 24(224), 1-50 (2023)Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Statistics Theory (math.ST)
- [1556] arXiv:2206.13123 (cross-list from eess.IV) [pdf, other]
-
Title: Unsupervised Domain Adaptation Using Feature Disentanglement And GCNs For Medical Image ClassificationAuthors: Dwarikanath MahapatraSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1557] arXiv:2206.13173 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Context-Aware Transformers For Spinal Cancer Detection and Radiological GradingComments: Pre-print of paper accepted to MICCAI 2022. 15 pages, 7 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1558] arXiv:2206.13295 (cross-list from eess.IV) [pdf, other]
-
Title: Diffusion Deformable Model for 4D Temporal Medical Image GenerationComments: Accepted for MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1559] arXiv:2206.13385 (cross-list from eess.IV) [pdf, other]
-
Title: 3D unsupervised anomaly detection and localization through virtual multi-view projection and reconstruction: Clinical validation on low-dose chest computed tomographyComments: Kyung-Su Kim and Seong Je Oh have contributed equally to this work as the co-first author. Kyung-Su Kim (kskim.doc@gmail.com) and Myung Jin Chung (mj1.chung@samsung.com) have contributed equally to this work as the co-corresponding authorSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1560] arXiv:2206.13393 (cross-list from eess.IV) [pdf, other]
-
Title: Cross-Modal Transformer GAN: A Brain Structure-Function Deep Fusing Framework for Alzheimer's DiseaseSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
- [1561] arXiv:2206.13394 (cross-list from eess.IV) [pdf, other]
-
Title: CS$^2$: A Controllable and Simultaneous Synthesizer of Images and Annotations with Minimal Human InterventionAuthors: Xiaodan Xing, Jiahao Huang, Yang Nan, Yinzhe Wu, Chengjia Wang, Zhifan Gao, Simon Walsh, Guang YangComments: 11 figures, Accepted by MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1562] arXiv:2206.13419 (cross-list from eess.IV) [pdf, other]
-
Title: DeStripe: A Self2Self Spatio-Spectral Graph Neural Network with Unfolded Hessian for Stripe Artifact Removal in Light-sheet MicroscopyComments: Accepted by 25th International Conference on Medical Image Computing and Computer Assisted InterventionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1563] arXiv:2206.13455 (cross-list from eess.IV) [pdf, other]
-
Title: IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic EnvironmentsComments: Accepted for publication in the Journal of Intelligent & Robotic SystemsJournal-ref: J Intell Robot Syst 106, 53 (2022)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [1564] arXiv:2206.13468 (cross-list from math.AG) [pdf, ps, other]
-
Title: An Atlas for the Pinhole CameraComments: 47 pages with references and appendices, final versionJournal-ref: JFoCM, 2022Subjects: Algebraic Geometry (math.AG); Computer Vision and Pattern Recognition (cs.CV); Commutative Algebra (math.AC)
- [1565] arXiv:2206.13504 (cross-list from eess.IV) [pdf, other]
-
Title: AI-based computer-aided diagnostic system of chest digital tomography synthesis: Demonstrating comparative advantage with X-ray-based AI systemsComments: Kyung-Su Kim, Ju Hwan Lee, and Seong Je Oh have contributed equally to this work as the co-first author. Kyung-Su Kim (kskim.doc@gmail.com) and Myung Jin Chung (mj1.chung@samsung.com) have contributed equally to this work as the co-corresponding authorSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1566] arXiv:2206.13505 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Learning-Based Defect Classification and Detection in SEM ImagesAuthors: Bappaditya Deya, Dipam Goswamif, Sandip Haldera, Kasem Khalilb, Philippe Leraya, Magdy A. BayoumiJournal-ref: In Metrology, Inspection, and Process Control XXXVI, SPIE (2022)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1567] arXiv:2206.13506 (cross-list from eess.IV) [pdf, other]
-
Title: Tensor Recovery Based on A Novel Non-convex Function Minimax Logarithmic Concave Penalty FunctionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1568] arXiv:2206.13613 (cross-list from eess.IV) [pdf, other]
-
Title: Flexible-Rate Learned Hierarchical Bi-Directional Video Compression With Motion Refinement and Frame-Level Bit AllocationComments: Accepted for publication in IEEE International Conference on Image Processing (ICIP 2022)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1569] arXiv:2206.13632 (cross-list from eess.IV) [pdf, other]
-
Title: Omni-Seg: A Scale-aware Dynamic Network for Renal Pathological Image SegmentationAuthors: Ruining Deng, Quan Liu, Can Cui, Tianyuan Yao, Jun Long, Zuhayr Asad, R. Michael Womick, Zheyu Zhu, Agnes B. Fogo, Shilin Zhao, Haichun Yang, Yuankai HuoSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1570] arXiv:2206.13740 (cross-list from eess.IV) [pdf, other]
-
Title: GAN-based Super-Resolution and Segmentation of Retinal Layers in Optical coherence tomography ScansComments: 5 pages,7 figuresSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1571] arXiv:2206.13872 (cross-list from stat.ML) [pdf, other]
-
Title: When are Post-hoc Conceptual Explanations Identifiable?Comments: v5: UAI2023 camera-ready including supplementary material. The first two authors contributed equallySubjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1572] arXiv:2206.13903 (cross-list from eess.IV) [pdf, other]
-
Title: AS-IntroVAE: Adversarial Similarity Distance Makes Robust IntroVAEComments: ACML conference paperSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1573] arXiv:2206.14305 (cross-list from eess.IV) [pdf, ps, other]
-
Title: Multistep Automated Data Labelling Procedure (MADLaP) for Thyroid Nodules on Ultrasound: An Artificial Intelligence Approach for Automating Image AnnotationSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
- [1574] arXiv:2206.14678 (cross-list from eess.IV) [pdf, other]
-
Title: BiometryNet: Landmark-based Fetal Biometry Estimation from Standard Ultrasound PlanesAuthors: Netanell Avisdris, Leo Joskowicz, Brian Dromey, Anna L. David, Donald M. Peebles, Danail Stoyanov, Dafna Ben Bashat, Sophia BanoComments: 13 pages, 6 figures, Accepted to MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1575] arXiv:2206.14713 (cross-list from eess.IV) [pdf, other]
-
Title: CONVIQT: Contrastive Video Quality EstimatorSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [1576] arXiv:2206.14746 (cross-list from eess.IV) [pdf, other]
-
Title: Placenta Segmentation in Ultrasound Imaging: Addressing Sources of Uncertainty and Limited Field-of-ViewAuthors: Veronika A. Zimmer, Alberto Gomez, Emily Skelton, Robert Wright, Gavin Wheeler, Shujie Deng, Nooshin Ghavami, Karen Lloyd, Jacqueline Matthew, Bernhard Kainz, Daniel Rueckert, Joseph V. Hajnal, Julia A. SchnabelComments: 21 pages (18 + appendix), 13 figures (9 + appendix)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1577] arXiv:2206.14820 (cross-list from astro-ph.CO) [pdf, other]
-
Title: Strong Lensing Source Reconstruction Using Continuous Neural FieldsComments: 9+2 pages, 3+2 figures, Spotlight at the Machine Learning for Astrophysics Workshop at ICML 2022; v2, references addedSubjects: Cosmology and Nongalactic Astrophysics (astro-ph.CO); Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1578] arXiv:2206.14847 (cross-list from eess.IV) [pdf, other]
-
Title: Deep Reinforcement Learning for Small Bowel Path Tracking using Different Types of AnnotationsComments: Accepted to MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1579] arXiv:2206.14861 (cross-list from eess.IV) [pdf, other]
-
Title: Two-Stage COVID19 Classification Using BERT FeaturesComments: arXiv admin note: text overlap with arXiv:2106.14403Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1580] arXiv:2206.14903 (cross-list from eess.IV) [pdf, other]
-
Title: CIRDataset: A large-scale Dataset for Clinically-Interpretable lung nodule Radiomics and malignancy predictionComments: MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1581] arXiv:2206.14919 (cross-list from eess.IV) [pdf, other]
-
Title: Identifying and Combating Bias in Segmentation Networks by leveraging multiple resolutionsSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1582] arXiv:2206.14951 (cross-list from eess.IV) [pdf, other]
-
Title: CLTS-GAN: Color-Lighting-Texture-Specular Reflection Augmentation for ColonoscopyComments: MICCAI 2022. **First two authors contributed equallySubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1583] arXiv:2206.15069 (cross-list from eess.IV) [pdf, other]
-
Title: PVT-COV19D: Pyramid Vision Transformer for COVID-19 DiagnosisAuthors: Lilang Zheng, Jiaxuan Fang, Xiaorun Tang, Hanzhang Li, Jiaxin Fan, Tianyi Wang, Rui Zhou, Zhaoyan YanComments: 8 pages,1 figureSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1584] arXiv:2206.15073 (cross-list from eess.IV) [pdf, other]
-
Title: COVID Detection and Severity Prediction with 3D-ConvNeXt and Custom PretrainingsComments: 17 pages, 3 figures, informations about challenge submissionSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1585] arXiv:2206.15134 (cross-list from eess.IV) [pdf, other]
-
Title: InsMix: Towards Realistic Generative Data Augmentation for Nuclei Instance SegmentationComments: Accepted by MICCAI 2022 (early accepted)Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1586] arXiv:2206.15179 (cross-list from eess.IV) [src]
-
Title: A Medical Image Fusion Method based on MDLatLRRv2Comments: There are some errors that need to be correctedSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [1587] arXiv:2206.15182 (cross-list from eess.IV) [pdf, other]
-
Title: The (de)biasing effect of GAN-based augmentation methods on skin lesion imagesComments: Accepted to MICCAI2022Journal-ref: In: Wang, L., Dou, Q., Fletcher, P.T., Speidel, S., Li, S. (eds) Medical Image Computing and Computer Assisted Intervention - MICCAI 2022. MICCAI 2022. Lecture Notes in Computer Science, vol 13438. Springer, ChamSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1588] arXiv:2206.15217 (cross-list from eess.IV) [pdf, other]
-
Title: Implicit U-Net for volumetric medical image segmentationComments: 11 pages, 4 figures, Accepted MIUA 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1589] arXiv:2206.15254 (cross-list from eess.IV) [pdf, other]
-
Title: Localizing the Recurrent Laryngeal Nerve via Ultrasound with a Bayesian Shape FrameworkAuthors: Haoran Dou, Luyi Han, Yushuang He, Jun Xu, Nishant Ravikumar, Ritse Mann, Alejandro F. Frangi, Pew-Thian Yap, Yunzhi HuangComments: Early Accepted by MICCAI 2022Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1590] arXiv:2206.15274 (cross-list from eess.IV) [pdf, other]
-
Title: Augment like there's no tomorrow: Consistently performing neural networks for medical imagingAuthors: Joona Pohjonen, Carolin Stürenberg, Atte Föhr, Reija Randen-Brady, Lassi Luomala, Jouni Lohi, Esa Pitkänen, Antti Rannikko, Tuomas MirttiComments: Code for the paper is available from this https URLSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
- [1591] arXiv:2206.15431 (cross-list from eess.IV) [pdf, other]
-
Title: Ensemble CNN models for Covid-19 Recognition and Severity Perdition From 3D CT-scanSubjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[ showing 1591 entries per page: fewer | more | all ]
Disable MathJax (What is MathJax?)
Links to: arXiv, form interface, find, cs, 2404, contact, help (Access key information)