Bandit class that is used in Thompson sampling.
Attributes:
alpha: Alpha parameter of the beta distribution (number of positive examples).
beta: Beta parameter of the beta distribution (number of negative examples).
Methods:
update: Updates alpha and beta priors of the BetaBandit.
sample: Samples the BetaBandit's distribution n times.