Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards

Publication date: Available online 15 May 2020
Source: Statistics & Probability Letters
Author(s): Sakshi Arya, Yuhong Yang
Category: Statistics. Source type: research.