TY - JOUR
T1 - An axiomatization of learning rules when counterfactuals are not observed
AU - Mengel, F
AU - Rivas, J
PY - 2012
Y1 - 2012
N2 - In this paper we study learning procedures when counterfactuals (payoffs of not chosen actions) are not observed. The decision maker reasons in two steps: First, she updates her propensities for choosing each action after every payoff experience, where propensities can be interpreted as preferences. Then, she transforms these propensities into choice probabilities. We introduce a set of axioms on how propensities are updated and on how these propensities are translated into choices and study the decision marker's behavior when such axioms are in place. Our characterization includes the linear reinforcement learning rule from Roth and Erev (1995).
AB - In this paper we study learning procedures when counterfactuals (payoffs of not chosen actions) are not observed. The decision maker reasons in two steps: First, she updates her propensities for choosing each action after every payoff experience, where propensities can be interpreted as preferences. Then, she transforms these propensities into choice probabilities. We introduce a set of axioms on how propensities are updated and on how these propensities are translated into choices and study the decision marker's behavior when such axioms are in place. Our characterization includes the linear reinforcement learning rule from Roth and Erev (1995).
UR - http://dx.doi.org/10.1515/1935-1704.1828
U2 - 10.1515/1935-1704.1828
DO - 10.1515/1935-1704.1828
M3 - Article
SN - 1935-1704
VL - 12
SP - 1935
EP - 1704
JO - The B.E. Journal of Theoretical Economics
JF - The B.E. Journal of Theoretical Economics
IS - 1
ER -