Acronyms
In the following, a list of acronyms used in the project is provided along with their full forms.
MAB: Multi-Armed Bandit
sMAB: Stochastic Multi-Armed Bandit
cMAB: Contextual Multi-Armed Bandit
MC: Monte Carlo
MCMC: Markov Chain Monte Carlo
MO: Multi-Objective
SO: Single-Objective
VI: Variational Inference
OPE: Offline Policy Evaluation