Multi Armed Bandit | Towards Data Science

The Multi-Armed Bandit Problem-A Beginner-Friendly Guide

Data Science

Understanding the exploitation-exploration trade-off with an example

Saankhya Mondal

December 23, 2024

6 min read

Optimizing Marketing Campaigns with Budgeted Multi-Armed Bandits

Data Science

With demos, our new solution, and a video

Vadim Arzamasov

August 16, 2024

10 min read

Handling Feedback Loops in Recommender Systems – Deep Bayesian Bandits

Machine Learning

Understanding fundamentals of exploration and Deep Bayesian Bandits to tackle feedback loops in recommender systems

Sachin Hosmani

July 31, 2024

13 min read

An Overview of Contextual Bandits

A dynamic approach to treatment personalization

Ugur Yildirim

February 2, 2024

23 min read

Dynamic Pricing with Multi-Armed Bandit: Learning by Doing

Artificial Intelligence

Applying Reinforcement Learning strategies to real-world use cases, especially in dynamic pricing, can reveal many…

Massimiliano Costacurta

August 16, 2023

19 min read

Beyond the Basics: Reinforcement Learning with Jax - Part II: Developing an exploitative…

Beyond the Basics: Reinforcement Learning with Jax – Part II: Developing an Exploitative Alternative to…

Lando L

June 2, 2023

23 min read

Confused robot observing three one-armed slot machines in Picasso style. Source: DALL-E 2.

Multi-armed bandits applied to order allocation among execution algorithms

Machine Learning

Finding the right balance between exploitation and exploration

Lars ter Braak

March 2, 2023

6 min read

Batched Bandit Problems

Programming

Multi-Armed Bandits with Delayed Rewards

Sean Smith

February 17, 2023

13 min read

Solving Multi-Armed Bandit Problems

A powerful and easy way to apply reinforcement learning.

Hennie de Harder

November 4, 2022

12 min read