related documents Incentivized Exploration for Multi-Armed Bandits under Reward Drift Conference Proceeding Optimal and Efficient Stochastic Motion Planning in Partially-Known Environments Conference Proceeding