Exploration through reward biasing: Reward-biased maximum likelihood estimation for stochastic multi-armed bandits

Xi Liu*, Ping Chun Hsieh, Yu Heng Hung, Anirban Bhattacharya, P. R. Kumar

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Fingerprint

Dive into the research topics of 'Exploration through reward biasing: Reward-biased maximum likelihood estimation for stochastic multi-armed bandits'. Together they form a unique fingerprint.

Keyphrases

Mathematics

Social Sciences