Asymptotically Efficient Allocation Rules for the Multiarmed Bandit Problem with Multiple Plays Part I: I.I.D. Rewards, Part II: Markovian Rewards

Venkat Anantharam and Pravin Varaiya and Jean Walrand

EECS Department, University of California, Berkeley

Technical Report No. UCB/ERL M86/62

, 1986

BibTeX citation:

@techreport{Anantharam:M86/62,
    Author= {Anantharam, Venkat and Varaiya, Pravin and Walrand, Jean},
    Title= {Asymptotically Efficient Allocation Rules for the Multiarmed Bandit Problem with Multiple Plays Part I: I.I.D. Rewards, Part II: Markovian Rewards},
    Year= {1986},
    Month= {Aug},
    Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/1986/724.html},
    Number= {UCB/ERL M86/62},
}

EndNote citation:

%0 Report
%A Anantharam, Venkat 
%A Varaiya, Pravin 
%A Walrand, Jean 
%T Asymptotically Efficient Allocation Rules for the Multiarmed Bandit Problem with Multiple Plays Part I: I.I.D. Rewards, Part II: Markovian Rewards
%I EECS Department, University of California, Berkeley
%D 1986
%@ UCB/ERL M86/62
%U http://www2.eecs.berkeley.edu/Pubs/TechRpts/1986/724.html
%F Anantharam:M86/62