Iteration

Results: 420



#Item
71

Strategy iteration is strongly polynomial for 2-player turn-based stochastic games with a constant discount factor Thomas Dueholm Hansen1,2 Peter Bro Miltersen1 Uri Zwick2 1

Add to Reading List

Source URL: cs.au.dk

Language: English - Date: 2013-04-08 17:05:14
    72Markov processes / Mathematics / Probability theory / Mathematical analysis / Dynamic programming / Markov decision process / Stochastic control / Markov chain / Mathematical optimization / Distribution

    Stat 260/CSLearning in Sequential Decision Problems. Peter Bartlett 1. Recall: MDPs. 2. Value iteration. 3. Policy iteration.

    Add to Reading List

    Source URL: www.stat.berkeley.edu

    Language: English - Date: 2014-11-25 12:45:38
    73

    The complexity of solving reachability games using value and strategy iteration ∗ Kristoffer Arnsfelt Hansen, Rasmus Ibsen-Jensen, and Peter Bro Miltersen Aarhus University {arnsfelt,rij,pbmiltersen}@cs.au.dk

    Add to Reading List

    Source URL: www.cs.au.dk

    Language: English - Date: 2015-08-11 05:53:51
      74Belief revision / Reinforcement learning / Dots per inch / Mathematical optimization / Rollout / Artificial intelligence / Applied mathematics / Learning

      - Classification-based Policy Iteration with a Critic 1 1

      Add to Reading List

      Source URL: victorgabillon.nfshost.com

      Language: English - Date: 2011-06-17 22:49:05
      75Dynamic programming / Markov decision process / Stochastic control / Probability theory / Probability / Statistics

      Classification-based Policy Iteration with a Critic V. Gabillon1 , A. Lazaric1 , M. Ghavamzadeh1 & B. Scherrer2 1 2 INRIA Lille - Nord Europe, Team Sequel,

      Add to Reading List

      Source URL: victorgabillon.nfshost.com

      Language: English - Date: 2011-06-30 11:49:57
      76

      CADGen Code Generator for Semi-Algebraic Iteration Sets Editionfor CADGen versionOctoberArmin Gr¨

      Add to Reading List

      Source URL: www.infosun.fim.uni-passau.de

      Language: English - Date: 2009-10-01 09:07:04
        77

        Exercises in functional iteration: the function f(x) = ln(2-exp(-x)) A selfstudy using formal powerseries and operator-matrices Gottfried Helmsupdate

        Add to Reading List

        Source URL: go.helms-net.de

        Language: English - Date: 2011-04-05 09:48:29
          78Mathematics / Mathematical analysis / Artificial intelligence / Backgammon / Rollout / Markov decision process / Multi-armed bandit / Reinforcement learning / Inverted pendulum / Pendulum / Prime-counting function / Valuation

          Rollout Allocation Strategies for Classification-based Policy Iteration Victor Gabillon Alessandro Lazaric

          Add to Reading List

          Source URL: victorgabillon.nfshost.com

          Language: English - Date: 2010-07-01 09:47:14
          79

          Hyperbolic Interpolation and Iteration towards a Zero File: 9Sep09 Version dated September 16, 2009 6:48 am

          Add to Reading List

          Source URL: http.cs.berkeley.edu

          Language: English - Date: 2009-09-16 09:49:40
            UPDATE