In Stochastic game, Bayesian regret is the expected difference ("regret") between the utility of a given strategy and the utility of the best possible strategy in hindsight—i.e., the strategy that would have maximized expected payoff if the true underlying model or distribution were known. This notion of regret measures how much is lost, on average, due to uncertainty or imperfect information.
"In 1957, for example, a statistician named James Hanna called his theorem Bayesian Regret. He had been preceded by David Blackwell, also a statistician, who called his theorem Controlled Random Walks.Controlled random walks, D Blackwell, Proceedings of the International Congress of Mathematicians 3, 336-338 Other, later papers had titles like 'On Pseudo Games', 'How to Play an Unknown Game', 'Universal Coding' and 'Universal Portfolios'".
|
|