Stochastic games
game-theory
Stochastic games
A Stochastic game is defined by the following data:
- A number of players $n$,
- A set $S$ of states the game can be in (normally thought of as $n$ tuples),
- A set of actions $A_i$ for each player this may depend on the state $A_i(s) \subset A_i$ we refer to the joint set of actions as $A = A_1 \times A_2 \times \ldots \times A_n$,
- A model which determines for each set of states $s,s' \in S$ and action $a \in A$ what the transition probabilities there are from $s$ to $s'$ given action $a$, i.e. $T(s, a, s') = \mathbb{P}[s' \vert s, a]$,
- A reward function for each player $R_i: S \rightarrow \mathbb{R}$ for being at a given state. and
- A discount factor $\gamma$ for discounted rewards.
When provided with this we are looking for a strategy for each player $\pi_i: S \rightarrow A_i$ that determines what action we take in each state.