Publications

Avleen S. Bijral, Markus Breitenbach, and Greg Grudic. Mixture of watson distributions: A generative model for hyperspherical embeddings. In AI and Statistics 2007, 2007.

Greg Grudic, Jane Mulligan, Michael Otte, and Adam Bates. Online learning of multiple perceptual models for navigation in unknown terrain. In The 6th International Conference on Field and Service Robotics, 2007.

Michael Otte, Scott Richardson, Greg Grudic, and Jane Mulligan. Local path planning in image space for autonomous robot navigation in unstructured environments. In 2007 IEEE International Conference on Intelligent Robots and Systems, To Appear: 2007.

Michael J. Procopio, Jane Mulligan, and Greg Grudic. Long-term learning using multiple models for outdoor autonomous robot navigation. In 2007 IEEE International Conference on Intelligent Robots and Systems, To Appear: 2007.

Greg Grudic and Jane Mulligan. Outdoor path labeling using polynomial mahalanobis distance. In Robotics: Science and Systems Conference, 2006.


 

Human-to-Robot Skill Transfer

NSF Award Abstract - #0535269

Investigator

Gregory Z. Grudic (Principal Investigator current)
I. Jane Mulligan (Co-Principal Investigator current)

Abstract

Programming robots that function robustly in unstructured environments has met with limited success. In contrast, humans can often successfully teleoperate a robot to accomplish complex tasks in natural environments, using only the robot's sensors and actuators. The goal of this proposal is to exploit the knowledge encoded in the actions of human teleoperators to learn more robust controllers for autonomous robot tasks.

The first step is to refine the huge volumes of data available to the robot to a set of features or percepts, which are both tractable for our systems to process, and sufficient to perform the task at hand. One way to empirically satisfy these constraints is to demonstrate whether a human user can execute the task in a teleoperation setting, given only the displayed sensor features. We will identify task appropriate feature sets by analysis of teleoperator performance (success, speed, distance etc.) under displayed feature combinations.

The proposed approach to learning robust controllers can be summarized as: 1) a human demonstrates a task remotely; 2) recorded sensor sequences are used to learn compact low dimensional manifolds that represent regions of the feature space safe for a robot to traverse; 3) a reinforcement learning paradigm is employed to find task-optimal paths within the manifolds. The skill or task is considered mastered when the robot's performance equals or exceeds the human's. The learned controller autonomously directs the robot, and reinforcement reward is used to autonomously optimize it. However, whenever the robot runs into difficulties, the human operator can take over, generating new examples used to modify and refine the manifold.


Figure 1: Human-to-Robot Skill Transfer


Figure 2: Human task demonstrations mapped into Image Feature spaces. Safe regions in the feature space form low dimensional manifolds.

Original Image
Black Square is Sample Path…
Mahalanobis Seg.
(Poor Path Segmentation)
Poly Mahalanobis Segmentation.
Clear identification of Path

Figure 3: Segmentation of Paths Using the New Mahalanobis Distance Metric. The new metric clearly outperforms existing techniques in Path segmentation.

 

 

 

 

 
Home | People | Projects | Publications | Contact