Results 1 - 10
of
30
Plasticity in value systems and its role in adaptive behavior
- Adaptive Behavior
, 2000
"... On behalf of: ..."
Teachable Robots: Understanding Human Teaching Behavior to Build More Effective Robot Learners
, 2008
"... ..."
A Consideration of the Biological and Psychological Foundations of Autonomous Robotics
, 1998
"... The new wave of robotics aims to provide robots with the capacity to learn, develop and evolve in interaction with their environments using biologically inspired techniques. This work is placed in perspective by considering its biological and psychological basis with reference to some of the grand t ..."
Abstract
-
Cited by 20 (9 self)
- Add to MetaCart
The new wave of robotics aims to provide robots with the capacity to learn, develop and evolve in interaction with their environments using biologically inspired techniques. This work is placed in perspective by considering its biological and psychological basis with reference to some of the grand theorists of living systems. In particular, we examine what it means to have a body by outlining theories of the mechanisms of bodily integration in multicellular organisms and their means of solidarity with the environment. We consider the implications of not having a living body for current ideas on robot learning, evolution, and cognition and issue words of caution about wishful attributions that can smuggle more into observations of robot behaviour than is scientifically supportable. To round off the arguments we take an obligatory swipe at ungrounded artificial intelligence but quickly move on to assess physical grounding and embodiment in terms of the rooted cognition of the living.
From Baby Steps to Leapfrog: How “Less is More” in unsupervised dependency parsing
- IN NAACL-HLT
"... We present three approaches for unsupervised grammar induction that are sensitive to data complexity and apply them to Klein and Manning’s Dependency Model with Valence. The first, Baby Steps, bootstraps itself via iterated learning of increasingly longer sentences and requires no initialization. Th ..."
Abstract
-
Cited by 19 (5 self)
- Add to MetaCart
We present three approaches for unsupervised grammar induction that are sensitive to data complexity and apply them to Klein and Manning’s Dependency Model with Valence. The first, Baby Steps, bootstraps itself via iterated learning of increasingly longer sentences and requires no initialization. This method substantially exceeds Klein and Manning’s published scores and achieves 39.4 % accuracy on Section 23 (all sentences) of the Wall Street Journal corpus. The second, Less is More, uses a low-complexity subset of the available data: sentences up to length 15. Focusing on fewer but simpler examples trades off quantity against ambiguity; it attains 44.1% accuracy, using the standard linguisticallyinformed prior and batch training, beating state-of-the-art. Leapfrog, our third heuristic, combines Less is More with Baby Steps by mixing their models of shorter sentences, then rapidly ramping up exposure to the full training set, driving up accuracy to 45.0%. These trends generalize to the Brown corpus; awareness of data complexity may improve other parsing models and unsupervised algorithms.
Reinforcement learning and shaping: Encouraging intended behaviors
- Proceedings of the Nineteenth International Conference on Machine Learning
, 2002
"... We explore dynamic shaping to integrate our prior beliefs of the final policy into a conventional reinforcement learning system. Shaping provides a positive or negative artificial increment to the native task rewards in order to encourage or discourage behaviors. Previously, shaping functions have b ..."
Abstract
-
Cited by 7 (1 self)
- Add to MetaCart
We explore dynamic shaping to integrate our prior beliefs of the final policy into a conventional reinforcement learning system. Shaping provides a positive or negative artificial increment to the native task rewards in order to encourage or discourage behaviors. Previously, shaping functions have been static: the additional rewards do not vary with experience. But some prior knowledge cannot be expressed as static shaping. We take an explanation-based approach in which the specific shaping function emerges from initial experiences with the world. We compare no shaping, static shaping, and dynamic shaping in the task of learning bipedal-walking on a simulator. We empirically evaluate the convergence rate and final performance among these conditions while varying the accuracy of the prior knowledge. We conclude that in the appropriate context, dynamic shaping can greatly improve the learning of action policies. 1.
Task Transfer by a Developmental Robot
"... Abstract—Scaffolding is a process of transferring learned skills to new and more complex tasks through arranged experience in open-ended development. In this paper, we propose a developmental learning architecture that enables a robot to transfer skills acquired in early learning settings to later m ..."
Abstract
-
Cited by 7 (6 self)
- Add to MetaCart
Abstract—Scaffolding is a process of transferring learned skills to new and more complex tasks through arranged experience in open-ended development. In this paper, we propose a developmental learning architecture that enables a robot to transfer skills acquired in early learning settings to later more complex task settings. We show that a basic mechanism that enables this transfer is sequential priming combined with attention, which is also the driving mechanism for classical conditioning, secondary conditioning, and instrumental conditioning in animal learning. A major challenge of this work is that training and testing must be conducted in the same program operational mode through online, real-time interactions between the agent and the trainers. In contrast with former modeling studies, the proposed architecture does not require the programmer to know the tasks to be learned and the environment is uncontrolled. All possible perceptions and actions, including the actual number of classes, are not available until the programming is finished and the robot starts to learn in the real world. Thus, a predesigned task-specific symbolic representation is not suited for such an open-ended developmental process. Experimental results on a robot are reported in which the trainer shaped the behaviors of the agent interactively, continuously, and incrementally through verbal commands and other sensory signals so that the robot learns new and more complex sensorimotor tasks by transferring sensorimotor skills learned in earlier periods of open-ended development. Index Terms—Attention, classical conditioning, incremental learning, instrumental conditioning, mental architecture, mental development, multitask learning, online learning, scaffolding, skill transfer. I.
Behavior chaining: incremental behavioral integration for evolutionary robotics
- in Artificial Life XI: Proceedings of the Eleventh International Conference on the Simulation and Synthesis of Living Systems
, 2008
"... One of the open problems in autonomous robotics is how to consistently and scalably integrate new behaviors into a robot with an existing behavioral repertoire. In this work a new technique called behavior chaining is introduced, which allows for gradually expanding the behavioral repertoire of a dy ..."
Abstract
-
Cited by 6 (6 self)
- Add to MetaCart
One of the open problems in autonomous robotics is how to consistently and scalably integrate new behaviors into a robot with an existing behavioral repertoire. In this work a new technique called behavior chaining is introduced, which allows for gradually expanding the behavioral repertoire of a dynamically behaving robot. The approach relies heavily on scaffolding: gradually restructuring the robot’s environment such that selection pressure favors the incorporation of a new behavior. This method teaches a robot a compound behavior not yet reported in the literature: dynamic legged locomotion toward an object followed by grasping, lifting and holding of that object in a physically-realistic three-dimensional environment. The method assumes that success is dependent on the order in which behaviors are learned. This is justified by results which show that if a robot is forced to learn lifting first and then incorporate locomotion, it eventually succeeds at both more often than a robot forced to learn locomotion first and then lifting.
A real-world rational agent: Unifying old and new AI
, 2002
"... Explanations of cognitive processes provided by traditional artificial intelligence were based on the notion of the knowledge level. This perspective has been challenged by new AI that proposes an approach based on embodied systems that interact with the real world. We demonstrate that these two ..."
Abstract
-
Cited by 6 (0 self)
- Add to MetaCart
Explanations of cognitive processes provided by traditional artificial intelligence were based on the notion of the knowledge level. This perspective has been challenged by new AI that proposes an approach based on embodied systems that interact with the real world. We demonstrate that these two views can be unified. Our argument is based on the assumption that knowledge level explanations can be defined in the context of Bayesian theory while the goals of new AI are captured by using a well established robot based model of learning and problem solving, called Distributed Adaptive Control (DAC). In our analysis we consider random foraging and we prove that minor modifications of the DAC architecture renders a model that is equivalent to a Bayesian analysis of this task. Subsequently, we compare this enhanced, "rational", model to its, "non-rational", predecessor and a further control condition using both simulated and real robots, in a variety of environments. Our results show that the changes made to the DAC architecture, in order to unify the perspectives of old and new AI, also lead to a significant improvement in random foraging.
May We Have Your Attention: Analysis of a Selective Attention Task
"... In this paper we present a deeper analysis than has previously been carried out of a selective attention problem, and the evolution of continuous-time recurrent neural networks to solve it. We show that the task has a rich structure, and agents must solve a variety of subproblems to perform well. We ..."
Abstract
-
Cited by 5 (0 self)
- Add to MetaCart
In this paper we present a deeper analysis than has previously been carried out of a selective attention problem, and the evolution of continuous-time recurrent neural networks to solve it. We show that the task has a rich structure, and agents must solve a variety of subproblems to perform well. We consider the relationship between the complexity of an agent and the ease with which it can evolve behavior that generalizes well across subproblems, and demonstrate a shaping protocol that improves generalization. 1.

