TY - GEN
T1 - Stay with me
T2 - 36th International Conference on Machine Learning, ICML 2019
AU - Hsieh, Ping Chun
AU - Liu, Xi
AU - Bhattacharya, Anirban
AU - Kumar, P. R.
N1 - Publisher Copyright:
© 36th International Conference on Machine Learning, ICML 2019. All rights reserved.
PY - 2019/6
Y1 - 2019/6
N2 - Sequential decision making for lifetime maximization is a critical problem in many real-world applications, such as medical treatment and portfolio selection. In these applications, a "reneging" phenomenon, where participants may disengage from future interactions after observing an unsatisfiable outcome, is rather prevalent. To address the above issue, this paper proposes a model of heteroscedastic linear bandits with reneging, which allows each participant to have a distinct "satisfaction level," with any interaction outcome falling short of that level resulting in that participant reneging. Moreover, it allows the variance of the outcome to be context-dependent. Based on this model, we develop a UCB-type policy, namely HR-UCB, and prove that it achieves 0(vT(log(T))3) regret. Finally, we validate the performance of HR-UCB via simulations.
AB - Sequential decision making for lifetime maximization is a critical problem in many real-world applications, such as medical treatment and portfolio selection. In these applications, a "reneging" phenomenon, where participants may disengage from future interactions after observing an unsatisfiable outcome, is rather prevalent. To address the above issue, this paper proposes a model of heteroscedastic linear bandits with reneging, which allows each participant to have a distinct "satisfaction level," with any interaction outcome falling short of that level resulting in that participant reneging. Moreover, it allows the variance of the outcome to be context-dependent. Based on this model, we develop a UCB-type policy, namely HR-UCB, and prove that it achieves 0(vT(log(T))3) regret. Finally, we validate the performance of HR-UCB via simulations.
UR - http://www.scopus.com/inward/record.url?scp=85078138157&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85078138157
T3 - 36th International Conference on Machine Learning, ICML 2019
SP - 4957
EP - 4966
BT - 36th International Conference on Machine Learning, ICML 2019
PB - International Machine Learning Society (IMLS)
Y2 - 9 June 2019 through 15 June 2019
ER -