• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Reinforcement Learning for Mapping Instructions to Actions

Cached

  • Download as a PDF

Download Links

  • [www.aclweb.org]
  • [aclweb.org]
  • [people.csail.mit.edu]
  • [www.cs.washington.edu]
  • [people.csail.mit.edu]
  • [www.sls.csail.mit.edu]
  • [people.csail.mit.edu]
  • [people.csail.mit.edu]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by S. R. K. Branavan , Harr Chen , Luke S. Zettlemoyer , Regina Barzilay
Citations:22 - 3 self
  • Summary
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@MISC{Branavan_reinforcementlearning,
    author = {S. R. K. Branavan and Harr Chen and Luke S. Zettlemoyer and Regina Barzilay},
    title = {Reinforcement Learning for Mapping Instructions to Actions},
    year = {}
}

Bookmark

citeulike Connotea Bibsonomy Del.icio.us Digg Reddit

OpenURL

 

Abstract

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function that defines the quality of the executed actions. During training, the learner repeatedly constructs action sequences for a set of documents, executes those actions, and observes the resulting reward. We use a policy gradient algorithm to estimate the parameters of a log-linear model for action selection. We apply our method to interpret instructions in two domains — Windows troubleshooting guides and game tutorials. Our results demonstrate that this method can rival supervised learning techniques while requiring few or no annotated training examples. 1 1

Citations

2827 Reinforcement Learning: An Introduction - Sutton, Barto - 1998
1548 BConditional random fields: Probabilistic models for segmenting and labeling sequence data - Lafferty, McCallum, et al.
464 Inducing Features of Random Fields - Pietra, Pietra, et al. - 1997
297 Understanding natural language - Winograd - 1972
262 Policy gradient methods for reinforcement learning with function approximation - Sutton, Mcallester, et al. - 2000
179 Learning the Semantic of Words and Pictures - Barnard, Forsyth - 2001
106 Autonomous helicopter flight via reinforcement learning - Ng, Kim, et al.
50 Understanding Natural Language Instructions: a Computational Approach to Purpose Clauses - Eugenio - 1993
35 2002. Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning - Scheffler, Young
32 Automatic optimization of dialogue management - Litman, Kearns, et al. - 2000
19 Learning to sportscast: A test of grounded language acquisition - Chen, Mooney - 2008
14 On the integration of grounding language and learning objects - Yu, Ballard - 2004
5 Intentional context in situated language learning - Fleischman, Roy - 2005
3 Learning to connect language and perception - Mooney - 2008
3 dialogue management using probabilistic reasoning - Spoken
1 Learning language from its perceptual context - 2008a
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University