• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

An exponential lower bound for the latest deterministic strategy iteration algorithms (0)

by O Friedmann
Venue:Logical Methods in Computer Science
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 11
Next 10 →

A subexponential lower bound for Zadeh’s pivoting rule for solving linear programs and games

by Oliver Friedmann
"... ..."
Abstract - Cited by 15 (3 self) - Add to MetaCart
Abstract not found

A subexponential lower bound for the Least Recently Considered rule for solving linear programs and games

by Oliver Friedmann
"... The simplex algorithm is among the most widely used algorithms for solving linear programs in practice. Most pivoting rules are known, however, to need an exponential number of steps to solve some linear programs. No non-polynomial lower bounds were known, prior to this work, for Cunningham’s Least ..."
Abstract - Cited by 3 (1 self) - Add to MetaCart
The simplex algorithm is among the most widely used algorithms for solving linear programs in practice. Most pivoting rules are known, however, to need an exponential number of steps to solve some linear programs. No non-polynomial lower bounds were known, prior to this work, for Cunningham’s Least Recently Considered rule [5], which belongs to the family of history-based rules. Also known as the ROUND-ROBIN rule, Cunningham’s pivoting method fixes an initial ordering on all variables first, and then selects the improving variables in a round-robin fashion. We provide the first subexponential (i.e., of the form 2 Ω( √ n)) lower bound for this rule in a concrete setting. Our lower bound is obtained by utilizing connections between pivoting steps performed by simplex-based algorithms and improving switches performed by policy iteration algorithms for 1-player and 2-player games. We start by building 2-player parity games (PGs) on which the policy iteration with the ROUND-ROBIN rule performs a subexponential number of iterations. We then transform the parity games into 1-player Markov Decision Processes (MDPs) which correspond almost immediately to concrete linear programs. 1
(Show Context)

Citation Context

... of iterations when solved by policyOliver Friedmann 11 iteration. The lower bound result immediately transfers to mean payoff games, discounted payoff games and turned-based simple stochastic games =-=[11]-=-. Although now Zadeh’s well-known LEAST-ENTERED rule [27] and Cunningham’s ROUND-ROBIN rule are known to have subexponential lower bounds in concrete settings [14], there are still some historybased p...

Parity games and propositional proofs

by Arnold Beckmann, Neil Thapen - ACM Transactions on Computational Logic
"... A propositional proof system is weakly automatizable if there is a polynomial time algorithm which separates satisfiable formulas from formulas which have a short refutation in the system, with respect to a given length bound. We show that if the resolution proof system is weakly automatizable, then ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
A propositional proof system is weakly automatizable if there is a polynomial time algorithm which separates satisfiable formulas from formulas which have a short refutation in the system, with respect to a given length bound. We show that if the resolution proof system is weakly automatizable, then parity games can be decided in poly-nomial time. We give simple proofs that the same holds for depth-1 propositional calculus (where resolution has depth 0) with respect to mean payoff and simple stochastic games. We define a new type of combinatorial game and prove that resolution is weakly automatizable if and only if one can separate, by a set decidable in polynomial time, the games in which the first player has a positional winning strategy from the games in which the second player has a positional winning strategy. Our main technique is to show that a suitable weak bounded arith-metic theory proves that both players in a game cannot simultaneously have a winning strategy, and then to translate this proof into proposi-tional form. ∗This research was partially done while the authors were visiting fellows at the Isaac Newton Institute for the Mathematical Sciences in the programme “Semantics & Syntax”. †Partially supported by grant IAA100190902 of GA AV ČR
(Show Context)

Citation Context

...lems is known to be in P, despite intensive research work on developing algorithms for them. For several of the existing algorithms, exponential lower bounds on their runtime have been given recently =-=[15, 16]-=-. Automatizability is an important concept for automated theorem proving. Call a propositional proof system automatizable if there is an algorithm which, given a tautology, produces a proof in time po...

A Subexponential Lower Bound for Policy Iteration Based on Snare Memorization

by Oliver Friedmann , 2011
"... This paper presents a subexponential lower bound for the recently proposed snare memorization non-oblivious strategy iteration algorithm due to Fearnley. Snare memorization is a method to train policy iteration techniques to remember certain profitable substrategies and reapply them again. We show t ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
This paper presents a subexponential lower bound for the recently proposed snare memorization non-oblivious strategy iteration algorithm due to Fearnley. Snare memorization is a method to train policy iteration techniques to remember certain profitable substrategies and reapply them again. We show that there is not much hope to find a polynomial-time algorithm for solving parity games by applying such non-oblivious techniques.
(Show Context)

Citation Context

...stic improvement rule is the original locally improving rule that selects the best local improvement in every point [19]. However, we have shown that there is an exponential lower bound for this rule =-=[6, 7]-=-. The exponential lower bound is based on the implementation of a binary counter in the context of a parity game. Cyclic structures represent the bits of the counter with certain strategy settings cor...

Random-Facet and Random-Bland require subexponential time even for shortest paths

by Oliver Friedmann , Thomas Dueholm Hansen , Uri Zwick , 2014
"... ..."
Abstract - Cited by 2 (2 self) - Add to MetaCart
Abstract not found

The complexity of ergodic meanpayoff games

by Krishnendu Chatterjee, Rasmus Ibsen-jensen - In Automata, Languages, and Programming - 41st International Colloquium, ICALP 2014
"... ar ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

...mber of iterations for almost-sure ergodic games. Again our result is optimal, since even for turn-based stochastic reachability games the strategy-iteration algorithms require exponential iterations =-=[16, 14]-=-. We analyze the value-iteration algorithm for ergodic games and show that for all ǫ > 0, the value-iteration algorithm requires at most O(H ·W · ǫ−1 · log(ǫ−1)) iterations, where H is the upper bound...

Symmetric Strategy Improvement

by Sven Schewe, Ashutosh Trivedi, Thomas Varghese , 2015
"... Symmetry is inherent in the definition of most of the two-player zero-sum games, including parity, mean-payoff, and discounted-payoff games. It is therefore quite surprising that no symmetric analysis techniques for these games exist. We develop a novel symmetric strategy improvement algorithm where ..."
Abstract - Add to MetaCart
Symmetry is inherent in the definition of most of the two-player zero-sum games, including parity, mean-payoff, and discounted-payoff games. It is therefore quite surprising that no symmetric analysis techniques for these games exist. We develop a novel symmetric strategy improvement algorithm where, in each iteration, the strategies of both players are improved simultaneously. We show that symmetric strategy improvement defies Friedmann’s traps, which shook the belief in the potential of classic strategy improvement to be polynomial.
(Show Context)

Citation Context

...e will use parity games where every colour is unique, i.e., where is injective. The reason for this assumption is that we extended [19], which implements variants of [31], and the lower bounds from =-=[13, 12, 14]-=- refer to such games. All parity games can be translated into such games as discussed in [31]. For these games, we use a valuation function based on their progress measure. We define ⌘ as hc0, c1, . ....

Benchmarks for Parity Games

by Jeroen J. A. Keiren
"... ar ..."
Abstract - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

...her they have exponential lower bounds. We consider the games described by Jurdziński that shows the exponential lower bound for small progress measures [42], the ladder games described by Friedmann =-=[27]-=- with the variation of recursive ladder games that give a lower bound for the strategy improvement algorithms [75, 63], and model checker ladder games [26] for which the algorithm by Stevens and Stirl...

The mu-calculus and model-checking

by Julian Bradfield, Igor Walukiewicz
"... This chapter presents a part of the theory of the mu-calculus that is relevant to the, broadly understood, model-checking problem. The mu-calculus is one of the most important logics in model-checking. It is a logic with an exceptional balance between expressiveness and algorithmic properties. The ..."
Abstract - Add to MetaCart
This chapter presents a part of the theory of the mu-calculus that is relevant to the, broadly understood, model-checking problem. The mu-calculus is one of the most important logics in model-checking. It is a logic with an exceptional balance between expressiveness and algorithmic properties. The chapter describes in length the game characterization of the semantics of the mu-calculus. It discusses the theory of the mu-calculus starting with the tree model property, and bisimulation invariance. Then it develops the notion of modal automaton: an automaton-based model behind the mu-calculus. It gives a quite detailed explanation of the satisfiability algorithm, followed by the results on alternation hierarchy, proof systems, and interpolation. Finally, the chapter discusses the relations of the mu-calculus to monadic second-order logic as well as to some program and temporal logics. It also presents two extensions of the mu-calculus that allow us to address issues such as inverse modalities.
(Show Context)

Citation Context

...́ski [122]. Later Schewe [104] has proposed a modification of the strategy improvement policy. Even for this improvement, Friedmann gives examples of games requiring exponential numbers of iterations =-=[52]-=-. It is actually not that surprising that the quest for polynomial time algorithm for model-checking is still on. The problem is closely related to other stubborn questions of a similar type, as for e...

unknown title

by Oliver Friedmann, David Avis, David Avis, Oliver Friedmann
"... Noname manuscript No. (will be inserted by the editor) An exponential lower bound for Cunningham’s rule ..."
Abstract - Add to MetaCart
Noname manuscript No. (will be inserted by the editor) An exponential lower bound for Cunningham’s rule
(Show Context)

Citation Context

...G the sink existence property can be verified by standard graph algorithms. Given an initial strategy θ the sink seeking property can also be easily checked, as shown by the following lemma. Lemma 2 (=-=[8]-=-) Let G be a parity game with initial strategy θ fulfilling the sink existence property w.r.t. v∗. G is a sink game iff G is completely won by player 1 (i.e. W1 = V ) and for each node w it holds that...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University