Swap regret

Last updated

Swap regret is a concept from online learning and game theory. It is a generalization of regret in a repeated, n-decision game.

Definition

In each round , the learner chooses decision with probability and the utility for decision is . A learner's swap-regret is defined to be the following:

Intuitively, it is how much a player could improve by switching each occurrence of decision i to the best decision j possible in hindsight. The swap regret is always nonnegative. Swap regret is useful for computing correlated equilibria.

References