Generalized Linear Bandits With Safety Constraints

Sanae Amani, Mahnoosh Alizadeh, Christos Thrampoulidis

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 17:40

04 May 2020

The classical multi-armed bandit is a class of sequential decision making problems where selecting actions incurs costs that are sampled independently from an unknown underlying distribution. Bandit algorithms have many applications in safety critical systems, where several constraints must be respected during the run of the algorithm in spite of uncertainty about problem parameters. This paper formulates a generalized linear stochastic multi-armed bandit problem with generalized linear safety constraints that depend on an unknown parameter vector. In this setting, we propose a Safe UCB-GLM algorithm for which we provide general and problem-dependent regret bounds.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Value-Added Bundle(s) Including this Product

04 May 2020

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

26 Apr 2024

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

SPS

Members: $150.00
IEEE Members: $250.00
Non-members: $350.00

19 Apr 2024

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

SPS

Members: $150.00
IEEE Members: $250.00
Non-members: $350.00

16 Oct 2022

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

SPS

Members: $150.00
IEEE Members: $250.00
Non-members: $350.00