We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Know Your Customer: Multi-armed Bandits with Capacity Constraints

Abstract: A wide range of resource allocation and platform operation settings exhibit the following two simultaneous challenges: (1) service resources are capacity constrained; and (2) clients' preferences are not perfectly known. To study this pair of challenges, we consider a service system with heterogeneous servers and clients. Server types are known and there is fixed capacity of servers of each type. Clients arrive over time, with types initially unknown and drawn from some distribution. Each client sequentially brings $N$ jobs before leaving. The system operator assigns each job to some server type, resulting in a payoff whose distribution depends on the client and server types.
Our main contribution is a complete characterization of the structure of the optimal policy for maximization of the rate of payoff accumulation. Such a policy must balance three goals: (i) earning immediate payoffs; (ii) learning client types to increase future payoffs; and (iii) satisfying the capacity constraints. We construct a policy that has provably optimal regret (to leading order as $N$ grows large). Our policy has an appealingly simple three-phase structure: a short type-"guessing" phase, a type-"confirmation" phase that balances payoffs with learning, and finally an "exploitation" phase that focuses on payoffs. Crucially, our approach employs the shadow prices of the capacity constraints in the assignment problem with known types as "externality prices" on the servers' capacity.
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Methodology (stat.ME); Machine Learning (stat.ML)
Cite as: arXiv:1603.04549 [cs.LG]
  (or arXiv:1603.04549v1 [cs.LG] for this version)

Submission history

From: Vijay Kamble [view email]
[v1] Tue, 15 Mar 2016 04:29:31 GMT (328kb,D)
[v2] Sun, 18 Jun 2017 00:11:06 GMT (148kb,D)
[v3] Mon, 1 Oct 2018 00:39:01 GMT (818kb,D)
[v4] Wed, 28 Nov 2018 21:36:16 GMT (818kb,D)
[v5] Sat, 7 Dec 2019 18:16:30 GMT (1326kb,D)
[v6] Thu, 23 Apr 2020 19:49:49 GMT (1350kb,D)
[v7] Wed, 5 Aug 2020 22:17:03 GMT (1351kb,D)

Link back to: arXiv, form interface, contact.