
Robotics

New submissions

[ total of 33 entries: 1-33 ]

New submissions for Fri, 24 Jun 22

[1]  arXiv:2206.11319 [pdf, ps, other]
Title: Graph-Based Multi-Robot Path Finding and Planning
Authors: Hang Ma
Comments: This preprint has not undergone peer review (when applicable) or any post-submission improvements or corrections. The Version of Record of this article is published in Current Robotics Reports, and is available online at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)

Purpose of Review
Planning collision-free paths for multiple robots is important for real-world multi-robot systems and has been studied as an optimization problem on graphs, called Multi-Agent Path Finding (MAPF). This review surveys different categories of classic and state-of-the-art MAPF algorithms and different research attempts to tackle the challenges of generalizing MAPF techniques to real-world scenarios.
Recent Findings
Solving MAPF problems optimally is computationally challenging. Recent advances have resulted in MAPF algorithms that can compute collision-free paths for hundreds of robots and thousands of navigation tasks in seconds of runtime. Many variants of MAPF have been formalized to adapt MAPF techniques to different real-world requirements, such as considerations of robot kinematics, online optimization for real-time systems, and the integration of task assignment and path planning.
Summary
Algorithmic techniques for MAPF problems have addressed important aspects of several multi-robot applications, including automated warehouse fulfillment and sortation, automated train scheduling, and navigation of non-holonomic robots and quadcopters. This showcases their potential for real-world applications of large-scale multi-robot systems.
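
To make the notion of "collision-free" concrete, the sketch below checks a set of timed paths for the two conflict types commonly used in MAPF formulations: two robots on the same vertex at the same timestep, and two robots swapping along an edge. It is an illustrative sketch rather than anything taken from the review; the function name `find_conflicts`, the path encoding (one vertex per timestep), and the convention that a robot waits at its final vertex are all assumptions.

```python
def find_conflicts(paths):
    """Return vertex and edge (swap) conflicts between timed paths.

    paths: dict mapping a robot name to a list of vertices, one per timestep.
    Assumption: a robot that has finished its path waits at its last vertex.
    """
    horizon = max(len(p) for p in paths.values())

    def at(path, t):
        # Position at time t, holding the final vertex after the path ends.
        return path[min(t, len(path) - 1)]

    conflicts = []
    robots = sorted(paths)
    for t in range(horizon):
        for i, a in enumerate(robots):
            for b in robots[i + 1:]:
                pa, pb = paths[a], paths[b]
                if at(pa, t) == at(pb, t):
                    # Both robots occupy the same vertex at time t.
                    conflicts.append(("vertex", a, b, t))
                elif (t + 1 < horizon
                      and at(pa, t) == at(pb, t + 1)
                      and at(pa, t + 1) == at(pb, t)):
                    # The robots swap positions between t and t + 1.
                    conflicts.append(("edge", a, b, t))
    return conflicts


crossing = find_conflicts({"r1": ["A", "B", "C"], "r2": ["C", "B", "A"]})
swapping = find_conflicts({"r1": ["A", "B"], "r2": ["B", "A"]})
disjoint = find_conflicts({"r1": ["A", "B"], "r2": ["C", "D"]})
```

A set of paths is a valid MAPF solution in this simplified sense only when the returned list is empty; optimal solvers additionally minimise a cost such as makespan or sum of path lengths.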

[2]  arXiv:2206.11350 [pdf, other]
Title: Vision- and tactile-based continuous multimodal intention and attention recognition for safer physical human-robot interaction
Comments: 10 pages, 8 figures, preprint under review
Subjects: Robotics (cs.RO)

Employing skin-like tactile sensors on robots enhances both the safety and usability of collaborative robots by adding the capability to detect human contact. Unfortunately, simple binary tactile sensors alone cannot determine the context of the human contact -- whether it is a deliberate interaction or an unintended collision that requires safety manoeuvres. Many published methods classify discrete interactions using more advanced tactile sensors or by analysing joint torques. Instead, we propose to augment the intention recognition capabilities of simple binary tactile sensors by adding a robot-mounted camera for human posture analysis. Different interaction characteristics, including touch location, human pose, and gaze direction, are used to train a supervised machine learning algorithm to classify whether a touch is intentional or not with 92% accuracy. We demonstrate that multimodal intention recognition is significantly more accurate than monomodal analysis with the collaborative robot Baxter. Furthermore, our method can also continuously monitor interactions that fluidly change between intentional and unintentional by gauging the user's attention through gaze. If a user stops paying attention mid-task, the proposed intention and attention recognition algorithm can activate safety features to prevent unsafe interactions. In addition, the proposed method is agnostic to the robot and the touch sensor layout and is complementary to other methods.

[3]  arXiv:2206.11376 [pdf, other]
Title: Real-Time Online Skeleton Extraction and Gesture Recognition on Pepper
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)

We present a multi-stage pipeline for simple gesture recognition. The novelty of our approach is the combination of different technologies, resulting in the first real-time system to date to jointly extract skeletons and recognise gestures on a Pepper robot. For this task, Pepper has been augmented with an embedded GPU for running deep CNNs and a fish-eye camera to capture whole scene interaction. We show in this article that real-case scenarios are challenging, and that state-of-the-art approaches struggle with unknown human gestures. We present here a way to handle such cases.

[4]  arXiv:2206.11623 [pdf, other]
Title: Waypoint Generation in Row-based Crops with Deep Learning and Contrastive Clustering
Comments: Accepted at ECML PKDD 2022
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)

The development of precision agriculture has gradually introduced automation in the agricultural process to support and rationalize all the activities related to field management. In particular, service robotics plays a predominant role in this evolution by deploying autonomous agents able to navigate in fields while executing different tasks without the need for human intervention, such as monitoring, spraying and harvesting. In this context, global path planning is the first necessary step for every robotic mission and ensures that the navigation is performed efficiently and with complete field coverage. In this paper, we propose a learning-based approach to tackle waypoint generation for planning a navigation path for row-based crops, starting from a top-view map of the region-of-interest. We present a novel methodology for waypoint clustering based on a contrastive loss, able to project the points to a separable latent space. The proposed deep neural network can simultaneously predict the waypoint position and cluster assignment with two specialized heads in a single forward pass. The extensive experimentation on simulated and real-world images demonstrates that the proposed approach effectively solves the waypoint generation problem for both straight and curved row-based crops, overcoming the limitations of previous state-of-the-art methodologies.

[5]  arXiv:2206.11626 [pdf, other]
Title: Model-Based Disturbance Estimation for a Fiber-Reinforced Soft Manipulator using Orientation Sensing
Subjects: Robotics (cs.RO)

For soft robots to work effectively in human-centered environments, they need to be able to estimate their state and external interactions based on (proprioceptive) sensors. Estimating disturbances allows a soft robot to perform desirable force control. Even in the case of rigid manipulators, force estimation at the end-effector is seen as a non-trivial problem. And indeed, other current approaches to address this challenge have shortcomings that prevent their general application. They are often based on simplified soft dynamic models, such as the ones relying on a piece-wise constant curvature (PCC) approximation or matched rigid-body models that do not represent enough details of the problem. Thus, the applications needed for complex human-robot interaction cannot be built. Finite element methods (FEM) allow for predictions of soft robot dynamics in a more generic fashion. Here, using the soft robot modeling capabilities of the framework SOFA, we build a detailed FEM model of a multi-segment soft continuum robotic arm composed of compliant deformable materials and fiber-reinforced pressurized actuation chambers with a model for sensors that provide orientation output. This model is used to establish a state observer for the manipulator. Model parameters were calibrated to match imperfections of the manual fabrication process using physical experiments. We then solve a quadratic programming inverse dynamics problem to compute the components of external force that explain the pose error. Our experiments show an average force estimation error of around 1.2%. As the methods proposed are generic, these results are encouraging for the task of building soft robots exhibiting complex, reactive, sensor-based behavior that can be deployed in human-centered environments.

[6]  arXiv:2206.11693 [pdf, other]
Title: Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

Learning agile skills is one of the main challenges in robotics. To this end, reinforcement learning approaches have achieved impressive results. These methods require explicit task information in terms of a reward function or an expert that can be queried in simulation to provide a target control output, which limits their applicability. In this work, we propose a generative adversarial method for inferring reward functions from partial and potentially physically incompatible demonstrations for successful skill acquirement where reference or expert demonstrations are not easily accessible. Moreover, we show that by using a Wasserstein GAN formulation and transitions from demonstrations with rough and partial information as input, we are able to extract policies that are robust and capable of imitating demonstrated behaviors. Finally, the obtained skills such as a backflip are tested on an agile quadruped robot called Solo 8 and present faithful replication of hand-held human demonstrations.

[7]  arXiv:2206.11789 [pdf, other]
Title: Probabilistically Resilient Multi-Robot Informative Path Planning
Comments: 9 pages, 6 figures, submitted to IEEE Robotics and Automation Letters (RA-L)
Subjects: Robotics (cs.RO)

In this paper, we solve a multi-robot informative path planning (MIPP) task under the influence of uncertain communication and adversarial attackers. The goal is to create a multi-robot system that can learn and unify its knowledge of an unknown environment despite the presence of corrupted robots sharing malicious information. We use a Gaussian Process (GP) to model our unknown environment and define informativeness using the metric of mutual information. The objectives of our MIPP task are to maximize the amount of information collected by the team while maximizing the probability of resilience to attack. Unfortunately, these objectives are at odds, especially when exploring large environments, which necessitates disconnections between robots. As a result, we impose a probabilistic communication constraint that allows robots to meet intermittently and resiliently share information, and then act to maximize collected information during all other times. To solve our problem, we select meeting locations with the highest probability of resilience and use a sequential greedy algorithm to optimize paths for robots to explore. Finally, we show the validity of our results by comparing the learning ability of well-behaving robots applying resilient vs. non-resilient MIPP algorithms.

[8]  arXiv:2206.11884 [pdf, other]
Title: Augmenting differentiable physics with randomized smoothing
Subjects: Robotics (cs.RO)

In the past few years, following the differentiable programming paradigm, there has been a growing interest in computing the gradient information of physical processes (e.g., physical simulation, image rendering). However, such processes may be non-differentiable or yield uninformative gradients (i.e., null almost everywhere). When faced with these pitfalls, gradients estimated via analytical expression or numerical techniques such as automatic differentiation and finite differences make classical optimization schemes converge towards poor quality solutions. Thus, relying only on the local information provided by these gradients is often not sufficient to solve advanced optimization problems involving such physical processes, notably when they are subject to non-smoothness and non-convexity issues. In this work, inspired by the field of zeroth-order optimization, we leverage randomized smoothing to augment differentiable physics by estimating gradients in a neighborhood. Our experiments suggest that integrating this approach inside optimization algorithms may be fruitful for tasks as varied as mesh reconstruction from images or optimal control of robotic systems subject to contact and friction issues.
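
As a rough illustration of estimating gradients in a neighborhood, the sketch below applies a standard zeroth-order Gaussian-smoothing estimator to a step function, whose ordinary derivative is zero almost everywhere. This is not the paper's implementation: the estimator form, the function names `smoothed_gradient` and `step`, and the values of `sigma`, `n_samples` and `seed` are assumptions chosen for the example.

```python
import random


def smoothed_gradient(f, x, sigma=0.1, n_samples=2000, seed=0):
    """Monte Carlo gradient estimate of the Gaussian-smoothed version of f at x.

    Uses the zeroth-order estimator E[(f(x + sigma * u) - f(x)) * u] / sigma
    with u ~ N(0, I). Subtracting f(x) is a baseline that lowers the variance
    without changing the expectation.
    """
    rng = random.Random(seed)
    fx = f(x)
    grad = [0.0] * len(x)
    for _ in range(n_samples):
        u = [rng.gauss(0.0, 1.0) for _ in x]
        delta = f([xi + sigma * ui for xi, ui in zip(x, u)]) - fx
        for i, ui in enumerate(u):
            grad[i] += delta * ui / (sigma * n_samples)
    return grad


def step(x):
    # Piecewise-constant stand-in for a contact-like discontinuity.
    return 1.0 if x[0] > 0.0 else 0.0


# A central finite difference near x = 0.05 sees a flat function.
finite_diff = (step([0.05 + 1e-6]) - step([0.05 - 1e-6])) / 2e-6

# The smoothed estimate is clearly positive: moving right raises f on average.
g = smoothed_gradient(step, [0.05])
```

The finite-difference value is exactly zero here, while the smoothed estimate points towards the discontinuity; the price is sampling noise, which shrinks as `n_samples` grows and depends on the choice of `sigma`.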

Cross-lists for Fri, 24 Jun 22

[9]  arXiv:2206.11299 (cross-list from cs.LG) [pdf, other]
Title: Latent Policies for Adversarial Imitation Learning
Comments: 8 pages, 5 figures
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)

This paper considers learning robot locomotion and manipulation tasks from expert demonstrations. Generative adversarial imitation learning (GAIL) trains a discriminator that distinguishes expert from agent transitions, and in turn uses a reward defined by the discriminator output to optimize a policy generator for the agent. This generative adversarial training approach is very powerful but depends on a delicate balance between the discriminator and the generator training. In high-dimensional problems, the discriminator training may easily overfit or exploit associations with task-irrelevant features for transition classification. A key insight of this work is that performing imitation learning in a suitable latent task space makes the training process stable, even in challenging high-dimensional problems. We use an action encoder-decoder model to obtain a low-dimensional latent action space and train a LAtent Policy using Adversarial imitation Learning (LAPAL). The encoder-decoder model can be trained offline from state-action pairs to obtain a task-agnostic latent action representation or online, simultaneously with the discriminator and generator training, to obtain a task-aware latent action representation. We demonstrate that LAPAL training is stable, with near-monotonic performance improvement, and achieves expert performance in most locomotion and manipulation tasks, while a GAIL baseline converges more slowly and does not achieve expert performance in high-dimensional environments.

[10]  arXiv:2206.11354 (cross-list from cs.HC) [pdf, other]
Title: Continual Learning for Affective Robotics: A Proof of Concept for Wellbeing
Comments: 12 pages, 7 figures
Subjects: Human-Computer Interaction (cs.HC); Robotics (cs.RO)

Sustaining real-world human-robot interactions requires robots to be sensitive to human behavioural idiosyncrasies and adapt their perception and behaviour models to cater to these individual preferences. For affective robots, this entails learning to adapt to individual affective behaviour to offer a personalised interaction experience to each individual. Continual Learning (CL) has been shown to enable real-time adaptation in agents, allowing them to learn with incrementally acquired data while preserving past knowledge. In this work, we present a novel framework for real-world application of CL for modelling personalised human-robot interactions using a CL-based affect perception mechanism. To evaluate the proposed framework, we undertake a proof-of-concept user study with 20 participants interacting with the Pepper robot using three variants of interaction behaviour: static and scripted, using affect-based adaptation without personalisation, and using affect-based adaptation with continual personalisation. Our results demonstrate a clear preference among the participants for CL-based continual personalisation, with significant improvements observed in the robot's anthropomorphism, animacy and likeability ratings. The interactions are also rated significantly higher for warmth and comfort, as the robot is rated as significantly better at understanding how the participants feel.

[11]  arXiv:2206.11403 (cross-list from cs.LG) [pdf, other]
Title: Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)

It has been a long-standing dream to design artificial agents that explore their environment efficiently via intrinsic motivation, similar to how children perform curious free play. Despite recent advances in intrinsically motivated reinforcement learning (RL), sample-efficient exploration in object manipulation scenarios remains a significant challenge as most of the relevant information lies in the sparse agent-object and object-object interactions. In this paper, we propose to use structured world models to incorporate relational inductive biases in the control loop to achieve sample-efficient and interaction-rich exploration in compositional multi-object environments. By planning for future novelty inside structured world models, our method generates free-play behavior that starts to interact with objects early on and develops more complex behavior over time. Instead of using models only to compute intrinsic rewards, as commonly done, our method showcases that the self-reinforcing cycle between good models and good exploration also opens up another avenue: zero-shot generalization to downstream tasks via model-based planning. After the entirely intrinsic task-agnostic exploration phase, our method solves challenging downstream tasks such as stacking, flipping, pick & place, and throwing, and generalizes to unseen numbers and arrangements of objects without any additional training.

[12]  arXiv:2206.11421 (cross-list from cs.AI) [pdf, other]
Title: On Specifying for Trustworthiness
Comments: 12 pages, 2 figures
Subjects: Artificial Intelligence (cs.AI); Robotics (cs.RO)

As autonomous systems are becoming part of our daily lives, ensuring their trustworthiness is crucial. There are a number of techniques for demonstrating trustworthiness. Common to all these techniques is the need to articulate specifications. In this paper, we take a broad view of specification, concentrating on top-level requirements including but not limited to functionality, safety, security and other non-functional properties. The main contribution of this article is a set of high-level intellectual challenges for the autonomous systems community related to specifying for trustworthiness. We also describe unique specification challenges concerning a number of application domains for autonomous systems.

[13]  arXiv:2206.11610 (cross-list from cs.CV) [pdf, other]
Title: 1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)
Comments: Winner of the 2nd RxR-Habitat Competition @ CVPR2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)

This report presents the methods of the winning entry of the RxR-Habitat Competition in CVPR 2022. The competition addresses the problem of Vision-and-Language Navigation in Continuous Environments (VLN-CE), which requires an agent to follow step-by-step natural language instructions to reach a target. We present a modular plan-and-control approach for the task. Our model consists of three modules: the candidate waypoints predictor (CWP), the history-enhanced planner and the tryout controller. In each decision loop, CWP first predicts a set of candidate waypoints based on depth observations from multiple views. It can reduce the complexity of the action space and facilitate planning. Then, a history-enhanced planner is adopted to select one of the candidate waypoints as the subgoal. The planner additionally encodes historical memory to track the navigation progress, which is especially effective for long-horizon navigation. Finally, we propose a non-parametric heuristic controller named tryout to execute low-level actions to reach the planned subgoal. It is based on a trial-and-error mechanism, which can help the agent avoid obstacles and escape from getting stuck. All three modules work hierarchically until the agent stops. We further adopt several recent advances in Vision-and-Language Navigation (VLN) to improve the performance, such as pretraining on a large-scale synthetic in-domain dataset, environment-level data augmentation and snapshot model ensemble. Our model won the RxR-Habitat Competition 2022, with 48% and 90% relative improvements over existing methods on NDTW and SR metrics respectively.

[14]  arXiv:2206.11733 (cross-list from cs.LG) [pdf, other]
Title: Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)

Learning a diverse set of skills by interacting with an environment without any external supervision is an important challenge. In particular, obtaining a goal-conditioned agent that can reach any given state is useful in many applications. We propose a novel method for training such a goal-conditioned agent without any external rewards or any domain knowledge. We use random walks to train a reachability network that predicts the similarity between two states. This reachability network is then used in building a goal memory containing past observations that are diverse and well-balanced. Finally, we train a goal-conditioned policy network with goals sampled from the goal memory and reward it using the reachability network and the goal memory. All the components are kept updated throughout training as the agent discovers and learns new goals. We apply our method to continuous control navigation and robotic manipulation tasks.

[15]  arXiv:2206.11808 (cross-list from cs.CV) [pdf, other]
Title: Unseen Object 6D Pose Estimation: A Benchmark and Baselines
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)

Estimating the 6D pose for unseen objects is in great demand for many real-world applications. However, current state-of-the-art pose estimation methods can only handle objects that were seen during training. In this paper, we propose a new task that enables and facilitates algorithms to estimate the 6D pose of novel objects during testing. We collect a dataset with both real and synthetic images and up to 48 unseen objects in the test set. Meanwhile, we propose a new metric named Infimum ADD (IADD), which is an invariant measurement for objects with different types of pose ambiguity. A two-stage baseline solution for this task is also provided. By training an end-to-end 3D correspondences network, our method finds corresponding points between an unseen object and a partial view RGBD image accurately and efficiently. It then calculates the 6D pose from the correspondences using an algorithm robust to object symmetry. Extensive experiments show that our method outperforms several intuitive baselines, thus verifying its effectiveness. All the data, code and models will be made publicly available. Project page: www.graspnet.net/unseen6d

[16]  arXiv:2206.11857 (cross-list from math.OC) [pdf, other]
Title: Change of Optimal Values: A Pre-calculated Metric
Authors: Fang Bai
Comments: 6 pages on IEEE International Conference on Robotics and Automation (ICRA), 2020
Journal-ref: 2020 IEEE International Conference on Robotics and Automation (ICRA), 2020, pp. 8295-8301
Subjects: Optimization and Control (math.OC); Robotics (cs.RO)

A variety of optimization problems take the form of a minimum norm optimization. In this paper, we study the change of optimal values between two incrementally constructed least norm optimization problems, with new measurements included in the second one. We prove an exact equation to calculate the change of optimal values in the linear least norm optimization problem. With the result in this paper, the change of the optimal values can be pre-calculated as a metric to guide online decision making, without solving the second optimization problem, as long as the solution and covariance of the first optimization problem are available. The result can be extended to linear least distance optimization problems, and to nonlinear least distance optimization with (nonlinear) equality constraints through linearization. The derivation in this paper provides a theoretically sound explanation of the empirical observations shown in Bai et al., RA-L 2018. As an additional contribution, we propose another optimization problem, i.e. aligning two trajectories at given poses, to further demonstrate how to use the metric. The accuracy of the metric is validated with numerical examples and is quite satisfactory in general (see also the experiments in Bai et al., RA-L 2018), except in some extremely adverse scenarios. Last but not least, calculating the optimal value by the proposed metric is at least one order of magnitude faster than solving the corresponding optimization problems directly.

[17]  arXiv:2206.11894 (cross-list from cs.CV) [pdf, other]
Title: MaskViT: Masked Visual Pre-Training for Video Prediction
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)

The ability to predict future visual observations conditioned on past observations and motor commands can enable embodied agents to plan solutions to a variety of tasks in complex environments. This work shows that we can create good video prediction models by pre-training transformers via masked visual modeling. Our approach, named MaskViT, is based on two simple design decisions. First, for memory and training efficiency, we use two types of window attention: spatial and spatiotemporal. Second, during training, we mask a variable percentage of tokens instead of a fixed mask ratio. For inference, MaskViT generates all tokens via iterative refinement where we incrementally decrease the masking ratio following a mask scheduling function. On several datasets we demonstrate that MaskViT outperforms prior works in video prediction, is parameter efficient, and can generate high-resolution videos (256x256). Further, we demonstrate the benefits of inference speedup (up to 512x) due to iterative decoding by using MaskViT for planning on a real robot. Our work suggests that we can endow embodied agents with powerful predictive models by leveraging the general framework of masked visual modeling with minimal domain knowledge.
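
The iterative refinement described above can be pictured as a schedule that tells the decoder how many tokens remain masked after each step. The sketch below uses a cosine schedule, which is an assumption made for illustration (the abstract only says that a mask scheduling function is used); the names `mask_schedule` and `tokens_masked_per_step`, the token count and the number of steps are likewise hypothetical.

```python
import math


def mask_schedule(step, total_steps):
    """Assumed cosine schedule: fraction of tokens still masked after `step`."""
    return math.cos(0.5 * math.pi * step / total_steps)


def tokens_masked_per_step(num_tokens, total_steps):
    """Number of tokens left masked after each refinement step."""
    counts = []
    for step in range(1, total_steps + 1):
        ratio = mask_schedule(step, total_steps)
        counts.append(math.floor(ratio * num_tokens))
    return counts


# Hypothetical setting: 256 tokens decoded over 8 refinement steps.
counts = tokens_masked_per_step(num_tokens=256, total_steps=8)
```

With a schedule of this shape, only a few tokens are committed in the early steps and most are filled in towards the end, so all tokens are generated in a handful of passes rather than one pass per token.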

Replacements for Fri, 24 Jun 22

[18]  arXiv:2106.12111 (replaced) [pdf, other]
Title: Robust Task Scheduling for Heterogeneous Robot Teams under Capability Uncertainty
Comments: Video: this https URL
Subjects: Robotics (cs.RO); Multiagent Systems (cs.MA)
[19]  arXiv:2108.03807 (replaced) [pdf, other]
Title: Model-free online motion adaptation for energy efficient flights of multicopters
Comments: 11 pages + appendix
Subjects: Robotics (cs.RO)
[20]  arXiv:2112.03227 (replaced) [pdf, other]
Title: CALVIN: A Benchmark for Language-Conditioned Policy Learning for Long-horizon Robot Manipulation Tasks
Comments: Accepted for publication at IEEE Robotics and Automation Letters (RAL). Code, models and dataset available at this http URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[21]  arXiv:2202.12873 (replaced) [pdf, other]
Title: TerraPN: Unstructured Terrain Navigation using Online Self-Supervised Learning
Comments: 10 pages, 6 figures
Subjects: Robotics (cs.RO)
[22]  arXiv:2203.00538 (replaced) [pdf, other]
Title: Capability-based Frameworks for Industrial Robot Skills: a Survey
Comments: 8 pages, 4 figures, 1 table, accepted CASE
Subjects: Robotics (cs.RO)
[23]  arXiv:2203.04874 (replaced) [pdf, other]
Title: VGQ-CNN: Moving Beyond Fixed Cameras and Top-Grasps for Grasp Quality Prediction
Comments: Accepted for International Joint Conference on Neural Networks (IJCNN) 2022
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[24]  arXiv:2205.15670 (replaced) [src]
Title: REF: A Rapid Exploration Framework for Deploying Autonomous MAVs in Unknown Environments
Comments: More Experimental results are needed
Subjects: Robotics (cs.RO)
[25]  arXiv:2206.02881 (replaced) [pdf, other]
Title: Mesh-based Dynamics with Occlusion Reasoning for Cloth Manipulation
Comments: RSS 2022, project website: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[26]  arXiv:2206.03211 (replaced) [pdf, other]
Title: Variational Meta Reinforcement Learning for Social Robotics
Comments: 16 pages, 14 figures, submitted to Neural Networks
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI)
[27]  arXiv:2206.10397 (replaced) [pdf, other]
Title: Neural Moving Horizon Estimation for Robust Flight Control
Subjects: Robotics (cs.RO); Machine Learning (cs.LG); Systems and Control (eess.SY)
[28]  arXiv:2109.07047 (replaced) [pdf, other]
Title: Dataflow Accelerator Architecture for Autonomous Machine Computing
Comments: Please note that this may be a special case in that Professor Gao sadly passed away on September 12th, just as we had put the finishing touches on this submission
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[29]  arXiv:2110.05415 (replaced) [pdf, other]
Title: Safe Reinforcement Learning Using Robust Control Barrier Functions
Comments: Submitted to IEEE Robotics and Automation Letters (RA-L)
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[30]  arXiv:2110.08258 (replaced) [pdf, other]
Title: A Framework for Learning to Request Rich and Contextually Useful Information from Humans
Comments: Accepted to ICML 2022
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Robotics (cs.RO)
[31]  arXiv:2111.07395 (replaced) [pdf, other]
Title: Explicit Explore, Exploit, or Escape ($E^4$): near-optimal safety-constrained reinforcement learning in polynomial time
Comments: Accepted at Machine Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[32]  arXiv:2206.05266 (replaced) [pdf, other]
Title: Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[33]  arXiv:2206.11215 (replaced) [pdf, other]
Title: Correct and Certify: A New Approach to Self-Supervised 3D-Object Perception
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
