Results 1 - 10
of
27
Graph Kernels and Gaussian Processes for Relational Reinforcement Learning
- Machine Learning
, 2003
"... Relational reinforcement learning is a Q-learning technique for relational state-action spaces. It aims to enable agents to learn how to act in an environment that has no natural representation as a tuple of constants. In this case, the learning algorithm used to approximate the mapping between stat ..."
Abstract
-
Cited by 34 (7 self)
- Add to MetaCart
Relational reinforcement learning is a Q-learning technique for relational state-action spaces. It aims to enable agents to learn how to act in an environment that has no natural representation as a tuple of constants. In this case, the learning algorithm used to approximate the mapping between state-action pairs and their so called Q(uality)-value has to be not only very reliable, but it also has to be able to handle the relational representation of state-action pairs. In this paper we investigate...
Relational reinforcement learning: An overview
- In Proceedings of the ICML’04 Workshop on Relational Reinforcement Learning
, 2004
"... Relational reinforcement learning (RRL) is both a young and an old eld. In this paper, we trace the history of the eld to related disciplines, outline some current work and promising new directions, and survey the research issues and opportunities that lie ahead. 1. ..."
Abstract
-
Cited by 23 (3 self)
- Add to MetaCart
Relational reinforcement learning (RRL) is both a young and an old eld. In this paper, we trace the history of the eld to related disciplines, outline some current work and promising new directions, and survey the research issues and opportunities that lie ahead. 1.
Integrating Experimentation and Guidance in Relational Reinforcement Learning (Extended Abstract)
, 2002
"... Q-learning [3] is a form of reinforcement learning where the optimal policy is learned implicitly in the form of a Q-function, which takes a state-action pair as input and outputs the quality of the action in that state. The optimal action in a given state is the action with the greatest Q-value. Wh ..."
Abstract
-
Cited by 20 (2 self)
- Add to MetaCart
Q-learning [3] is a form of reinforcement learning where the optimal policy is learned implicitly in the form of a Q-function, which takes a state-action pair as input and outputs the quality of the action in that state. The optimal action in a given state is the action with the greatest Q-value. When dealing with large state spaces Q-learning encounters two major problems.
Relational Reinforcement Learning
- Multi-Agent Systems and Applications, 9th ECCAI Advanced Course ACAI 2001 and Agent Link’s 3rd European Agent Systems Summer School (EASSS 2001), volume 2086 of Lecture Notes in Computer Science
, 2001
"... This paper presents an introduction to reinforcement learning and relational reinforcement learning at a level to be understood by students and researchers with different backgrounds. ..."
Abstract
-
Cited by 17 (1 self)
- Add to MetaCart
This paper presents an introduction to reinforcement learning and relational reinforcement learning at a level to be understood by students and researchers with different backgrounds.
Online learning and exploiting relational models in reinforcement learning
- In M. Veloso (Ed.), Proceedings of the 20th International Joint Conference on Artificial Intelligence (p
, 2007
"... In recent years, there has been a growing interest in using rich representations such as relational languages for reinforcement learning. However, while expressive languages have many advantages in terms of generalization and reasoning, extending existing approaches to such a relational setting is a ..."
Abstract
-
Cited by 13 (2 self)
- Add to MetaCart
In recent years, there has been a growing interest in using rich representations such as relational languages for reinforcement learning. However, while expressive languages have many advantages in terms of generalization and reasoning, extending existing approaches to such a relational setting is a non-trivial problem. In this paper, we present a first step towards the online learning and exploitation of relational models. We propose a representation for the transition and reward function that can be learned online and present a method that exploits these models by augmenting Relational Reinforcement Learning algorithms with planning techniques. The benefits and robustness of our approach are evaluated experimentally. 1
Non-Parametric Policy Gradients: A Unified Treatment of Propositional and Relational Domains
"... Policy gradient approaches are a powerful instrument for learning how to interact with the environment. Existing approaches have focused on propositional and continuous domains only. Without extensive feature engineering, it is difficult – if not impossible – to apply them within structured domains, ..."
Abstract
-
Cited by 9 (5 self)
- Add to MetaCart
Policy gradient approaches are a powerful instrument for learning how to interact with the environment. Existing approaches have focused on propositional and continuous domains only. Without extensive feature engineering, it is difficult – if not impossible – to apply them within structured domains, in which e.g. there is a varying number of objects and relations among them. In this paper, we describe a non-parametric policy gradient approach – called NPPG – that overcomes this limitation. The key idea is to apply Friedmann’s gradient boosting: policies are represented as a weighted sum of regression models grown in an stage-wise optimization. Employing off-the-shelf regression learners, NPPG can deal with propositional, continuous, and relational domains in a unified way. Our experimental results show that it can even improve on established results. 1.
Transfer learning in reinforcement learning problems through partial policy recycling
- In Proc. of The 18th European Conf. on Machine Learning
, 2007
"... Abstract. In this paper we investigate the relation between transfer learning in reinforcement learning with function approximation and supervised learning with concept drift. We present a new incremental relational regression tree algorithm that is capable of dealing with concept drift through tree ..."
Abstract
-
Cited by 8 (0 self)
- Add to MetaCart
Abstract. In this paper we investigate the relation between transfer learning in reinforcement learning with function approximation and supervised learning with concept drift. We present a new incremental relational regression tree algorithm that is capable of dealing with concept drift through tree restructuring and show that it enables a reinforcement learner, more precisely a Q-learner, to transfer knowledge from one task to another by recycling those parts of the generalized Q-function that still hold interesting information for the new task. We illustrate the performance of the algorithm in experiments with both supervised learning tasks with concept drift and reinforcement learning tasks that allow the transfer of knowledge from easier, related tasks.
On the numeric stability of gaussian processes regression for relational reinforcement learning
- In ICML-2004 Workshop on Relational Reinforcement Learning
, 2004
"... In this work we investigate the behavior of Gaussian processes as a regression technique for reinforcement learning. When confronted with too many mutually dependant learning examples, the matrix inversion needed for prediction of a new target value becomes numerically unstable. By paying attention ..."
Abstract
-
Cited by 8 (0 self)
- Add to MetaCart
In this work we investigate the behavior of Gaussian processes as a regression technique for reinforcement learning. When confronted with too many mutually dependant learning examples, the matrix inversion needed for prediction of a new target value becomes numerically unstable. By paying attention to using suitable numerical techniques and employing QR-factorization these instabilities can be avoided. This leads to better and more stable performance of the attached reinforcement learner. 1.
Learning relational options for inductive transfer in relational reinforcement learning
- In Proceedings of the Seventeenth Conference on Inductive Logic Programming
, 2007
"... Abstract. In reinforcement learning problems, an agent has the task of learning a good or optimal strategy from interaction with his environment. At the start of the learning task, the agent usually has very little information. Therefore, when faced with complex problems that have a large state spac ..."
Abstract
-
Cited by 8 (0 self)
- Add to MetaCart
Abstract. In reinforcement learning problems, an agent has the task of learning a good or optimal strategy from interaction with his environment. At the start of the learning task, the agent usually has very little information. Therefore, when faced with complex problems that have a large state space, learning a good strategy might be infeasible or too slow to work in practice. One way to overcome this problem, is the use of guidance to supply the agent with traces of “reasonable policies”. However, in a lot of cases it will be hard for the user to supply such a policy. In this paper, we will investigate the use of transfer learning in Relational Reinforcement Learning. The goal of transfer learning is to accelerate learning on a target task after training on a different, but related, source task. More specifically, we introduce an extension of the options framework to the relational setting and show how one can learn skills that can be transferred across similar, but different domains. We present experiments showing the possible benefits of using relational options for transfer learning.
Transfer learning for reinforcement learning through goal and policy parametrization
- In ICML Workshop on Structural Knowledge Transfer for Machine Learning
, 2006
"... Relational reinforcement learning has allowed results from reinforcement learning tasks to be re-used in other, closely related, tasks. This transfer of knowledge is made possible by the use of parameters in the representations of the task-description and the learned policy. In this paper, we will g ..."
Abstract
-
Cited by 7 (1 self)
- Add to MetaCart
Relational reinforcement learning has allowed results from reinforcement learning tasks to be re-used in other, closely related, tasks. This transfer of knowledge is made possible by the use of parameters in the representations of the task-description and the learned policy. In this paper, we will give a description of the current state of the art of transfer learning with relational reinforcement learning, make some observations about the usefulness and limitations of this current state and discuss some directions for future research. We also present a first small step along one of those directions. 1.

