Acronyms

The following acronyms are used throughout the project, listed with their full forms.

  • MAB: Multi-Armed Bandit

  • sMAB: Stochastic Multi-Armed Bandit

  • cMAB: Contextual Multi-Armed Bandit

  • MC: Monte Carlo

  • MCMC: Markov Chain Monte Carlo

  • MO: Multi-Objective

  • SO: Single-Objective

  • VI: Variational Inference

  • OPE: Offline Policy Evaluation