Games with payoffs are a type of graph game where the players have payoffs associated with each state. The payoff of a state is a number that represents the value of that state to the player.
In more details, we can actually compute the payoff from the sequence of weights from the play (sequence of nodes) in various ways.
A first approach is to consider the maximum weight from the sequence of weights , that we indicate as , which is an extension of the qualitative objective Reach(Win), since the latter corresponds to the quantitative objective Sup using the weight for Lose and for Win.
The dual objective of is to consider the smallest weight , which extends the qualitative objective Safe(Win) where the weight is used for Win and for Lose.
Similarly, we can extend:
- Buchi with
- CoBuchi with
Those games are uniformly positionally determined, meaning there exists an algorithm for computing the value function in polynomial time and space. For Sup and Inf the time complexity if , while it’s for LimSup and LimInf.
Mean Payoff Games
With mean payoff games we talk about taking the mean of the weights in the sequence (this is the natural approach to aggregate an infinite sequence of weights).
Since the summation of values in the sequence could not converge, we can take the superior limit or the inferior limit. In particular we can define: and
Note
Note that , where in we are taking the opposite of each weight. The two types of mean payoff are dual objectives
Parity games can be reduced to games with payoff with threshold 0 (meaning that a player wins if the mean payoff is greater than 0).
Mean payoff games are prefix independent, meaning that
Solving Mean Payoff Games
todo The computation of the mean payoff values runs in time, while we need to solve the mean payoff games, meaning determining wether a player can get a mean payoff greater than a threshold .
Note that:
- is the number of vertices
- is the number of edges
- is the maximum payoff.
- is the length of the play.
We can solve Mean Payoff Games using a Value Iteration paradigm.
Energy Games
Weights can also be modelled as Energy, meaning that we consider negative weights as energy consumptions and positive weights ad recharges. We define as the smallest initial budget such that the player energy can remain non-negative forever. This value can be computed in .
Note
Given a mean payoff graph , we say that it satisfies if and only if it satisfies
For the game to be solvable, we also need that doesn’t contain any negative cycles (a cycle where the sum of the weights is negative). The conditions:
- All cycles in are non-negative are all connected, meaning that any one of the three implies the other twos. #todo (see the proof of this that can be interesting for the project.)
Solving Energy Games
Using this definition, we can define another value iteration algorithm that solves energy games, and so that also solves mean payoff games.
Discounted Payoff Games
In discounted payoff games, we introduce a term:
Since , it will give more weight to the initial weights, and less to the following weights (because of the at the exponent, shrinks).
The more is near , the more the payoff is similar to the mean payoff.
tags: game-theory - graph-theory resources: