• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Exploiting model equivalences for solving interactive dynamic influence diagrams. (2012)

by Y Zeng, P Doshi
Venue:JAIR,
Add To MetaCart

Tools

Sorted by:
Results 1 - 7 of 7

Incremental Clustering and Expansion for Faster Optimal Planning in Decentralized POMDPs

by Frans A. Oliehoek, Matthijs T. J. Spaan, Christopher Amato, Shimon Whiteson , 2013
"... This article presents the state-of-the-art in optimal solution methods for decentralized partially observable Markov decision processes (Dec-POMDPs), which are general models for collaborative multiagent planning under uncertainty. Building off the generalized multiagent A * (GMAA*) algorithm, which ..."
Abstract - Cited by 18 (12 self) - Add to MetaCart
This article presents the state-of-the-art in optimal solution methods for decentralized partially observable Markov decision processes (Dec-POMDPs), which are general models for collaborative multiagent planning under uncertainty. Building off the generalized multiagent A * (GMAA*) algorithm, which reduces the problem to a tree of one-shot collaborative Bayesian games (CBGs), we describe several advances that greatly expand the range of Dec-POMDPs that can be solved optimally. First, we introduce lossless incremental clustering of the CBGs solved by GMAA*, which achieves exponential speedups without sacrificing optimality. Second, we introduce incremental expansion of nodes in the GMAA * search tree, which avoids the need to expand all children, the number of which is in the worst case doubly exponential in the node’s depth. This is particularly beneficial when little clustering is possible. In addition, we introduce new hybrid heuristic representations that are more compact and thereby enable the solution of larger Dec-POMDPs. We provide theoretical guarantees that, when a suitable heuristic is used, both incremental clustering and incremental expansion yield algorithms that are both complete and search equivalent. Finally, we present extensive empirical results demonstrating that GMAA*-ICE, an algorithm that synthesizes these advances, can optimally solve Dec-POMDPs of unprecedented size.
(Show Context)

Citation Context

...pproach also resembles a number of methods that employ other equivalence notions. First, several approaches exploit the notion of behavioral equivalence (Pynadath & Marsella, 2007; Zeng et al., 2011; =-=Zeng & Doshi, 2012-=-). They consider, from the perspective of a protagonist agent i, the possible models of another agent j. Since j affects i only through its actions, i.e., its behavior, agent i can cluster together al...

Team Behavior in Interactive Dynamic Influence Diagrams with Applications to Ad Hoc Teams

by Prashant Doshi, Yifeng Zeng, Yingke Chen
"... Planning for ad hoc teamwork is challenging because it involves in-dividual agents collaborating with others without any prior coordi-nation. However, individual decision making in multiagent settings faces the task of having to reason about other agents ’ actions, which in turn involves reasoning a ..."
Abstract - Cited by 3 (3 self) - Add to MetaCart
Planning for ad hoc teamwork is challenging because it involves in-dividual agents collaborating with others without any prior coordi-nation. However, individual decision making in multiagent settings faces the task of having to reason about other agents ’ actions, which in turn involves reasoning about others. An established approxi-mation that operationalizes this approach is to bound the infinite nesting from below by introducing level 0 models, which results in suboptimal team solutions in cooperative settings. We demonstrate this limitation and mitigate it by integrating learning into planning. The augmented framework is applied to ad hoc teamwork.
(Show Context)

Citation Context

...rameworks such as interactive dynamic influence diagrams (I-DIDs) [1] are recognized to be suitable for ad hoc teamwork but their complexity is challenging. While recent advances on model equivalence =-=[4]-=- allow frameworks such as I-DIDs to scale, another significant challenge that merits attention is due to the finitely-nested modeling used in these frameworks, which assumes the presence of level 0 mo...

systems

by Z. Y. Zeng, F. Claro , 2001
"... Delocalization and conductance quantization in one-dimensional ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Delocalization and conductance quantization in one-dimensional
(Show Context)

Citation Context

...es the prediction on behavior of the modeling agent [3]. Another efficient way to reduce the model space is achieved by clustering models that are actionally equivalent [16]. Recently,Zeng and Doshi =-=[17]-=- compare various I-DID solutions and demonstrate their utilities in more problem domains. 7. DISCUSSION I-DIDs provide a graphical formalism for modeling the sequential decision making of an agent in ...

Team Behavior in Interactive Dynamic Influence Diagrams with Applications to Ad Hoc Teams (Extended Abstract)

by Muthukumaran Chandrasekaran , Prashant Doshi , Yifeng Zeng , Yingke Chen
"... ABSTRACT Planning for ad hoc teamwork is challenging because it involves individual agents collaborating with others without any prior coordination. However, individual decision making in multiagent settings faces the task of having to reason about other agents' actions, which in turn involves ..."
Abstract - Add to MetaCart
ABSTRACT Planning for ad hoc teamwork is challenging because it involves individual agents collaborating with others without any prior coordination. However, individual decision making in multiagent settings faces the task of having to reason about other agents' actions, which in turn involves reasoning about others. An established approximation that operationalizes this approach is to bound the infinite nesting from below by introducing level 0 models, which results in suboptimal team solutions in cooperative settings. We demonstrate this limitation and mitigate it by integrating learning into planning. The augmented framework is applied to ad hoc teamwork.
(Show Context)

Citation Context

...hallenging thereby making frameworks such as DEC-POMDPs unsuitable for ad hoc teamwork. Other approaches such as online planning in ad hoc teams (OPAT) [3] assume perfect observability of physical states and others’ actions, which often may not apply. Our focus is on how an inividual agent should behave online as an ad hoc teammate in partially observable settings with minimal prior assumptions. Frameworks such as interactive dynamic influence diagrams (I-DIDs) [1] are recognized to be suitable for ad hoc teamwork but their complexity is challenging. While recent advances on model equivalence [4] allow frameworks such as I-DIDs to scale, another significant challenge that merits attention is due to the finitely-nested modeling used in these frameworks, which assumes the presence of level 0 models that do Appears in: Alessio Lomuscio, Paul Scerri, Ana Bazzan, and Michael Huhns (eds.), Proceedings of the 13th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2014), May 5-9, 2014, Paris, France. Copyright c© 2014, International Foundation for Autonomous Agents and Multiagent Systems (www.ifaamas.org). All rights reserved. not explicitly reason about others. By a...

Iterative Online Planning in Multiagent Settings with Limited Model Spaces and PAC Guarantees

by Yingke Chen, Prashant Doshi, Yifeng Zeng
"... Methods for planning in multiagent settings often model other agents ’ possible behaviors. However, the space of these models – whether these are policy trees, finite-state controllers or inten-tional models – is very large and thus arbitrarily bounded. This may exclude the true model or the optimal ..."
Abstract - Add to MetaCart
Methods for planning in multiagent settings often model other agents ’ possible behaviors. However, the space of these models – whether these are policy trees, finite-state controllers or inten-tional models – is very large and thus arbitrarily bounded. This may exclude the true model or the optimal model. In this paper, we present a novel iterative algorithm for online planning that consid-ers a limited model space, updates it dynamically using data from interactions, and provides a provable and probabilistic bound on the approximation error. We ground this approach in the context of graphical models for planning in partially observable multiagent settings – interactive dynamic influence diagrams. We empirically demonstrate that the limited model space facilitates fast solutions and that the true model often enters the limited model space.
(Show Context)

Citation Context

...esentative models of j. Lines 7-15 implement the model update link in an I-DID. Finally, lines 17-18 solve the transformed I-DID using standard DID algorithms. Previous offline techniques such as DMU =-=[26]-=- and -BE [28] solve I-DIDs by exploiting equivalences between models. In particular,DMU exactly solves I-DIDs while - BE compromises the solution quality to achieve greater efficiency. I-DID EXACT(l...

4 Team Behavior in Interactive Dynamic Influence Diagrams with Applications to Ad Hoc Teams

by Prashant Doshi, Yifeng Zeng, Yingke Chen
"... ar ..."
Abstract - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

...lexity. Indeed, Albrecht and Ramamoorthy [2] note the suitability of these frameworks to the problem of ad hoc teamwork but find the complexity challenging. While recent advances on model equivalence =-=[25]-=- allow frameworks such as I-DIDs to scale, another significant challenge that merits attention is due to the finitely-nested modeling used in these frameworks, which assumes the presence of level 0 mo...

Fast Solving of Influence Diagrams for Multiagent Planning on GPU-enabled Architectures

by Fadel Adoe, Yingke Chen, Prashant Doshi
"... Abstract: Planning under uncertainty in multiagent settings is highly intractable because of history and plan space com-plexities. Probabilistic graphical models exploit the structure of the problem domain to mitigate the com-putational burden. In this paper, we introduce the first parallelization o ..."
Abstract - Add to MetaCart
Abstract: Planning under uncertainty in multiagent settings is highly intractable because of history and plan space com-plexities. Probabilistic graphical models exploit the structure of the problem domain to mitigate the com-putational burden. In this paper, we introduce the first parallelization of planning in multiagent settings on a CPU-GPU heterogeneous system. In particular, we focus on the algorithm for exactly solving interactive dynamic influence diagrams, which is a recognized graphical models for multiagent planning. Beyond paral-lelizing the standard Bayesian inference, the computation of decisions ’ expected utilities are parallelized. The GPU-based approach provides significant speedup on two benchmark problems. 1
(Show Context)

Citation Context

...mework (Chen et al., 2013), and for ad hoc teamwork (Chandrasekaran et al., 2014) motivate improved solutions of I-DIDs. While techniques exist for introducing further efficiency into solving I-DIDs (=-=Zeng and Doshi, 2012-=-), we may also explore parallelizing its solution algorithm on new high-performance computing architectures such as those utilizing graphic processing units (GPU). A GPU consists of an array of stream...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University