Wouter van Heeswijk, PhD, Author at Towards Data Science

Instead of benchmarking optimization and machine learning algorithms against each other, we should consider how they can strengthen each other [Photo by Wedding Dreamz on Unsplash]

A Marriage of Machine Learning and Optimization Algorithms

Artificial Intelligence

How pattern detection and pattern exploitation might elevate each other to a new level

Wouter van Heeswijk, PhD

December 2, 2023

15 min read

Entropy-Regularized Reinforcement Learning Explained

Machine Learning

Learn more reliable, robust, and transferable policies by adding entropy bonuses to your algorithm

Wouter van Heeswijk, PhD

October 26, 2023

10 min read

And...action! [Photo by Jakob Owens on Unsplash]

Five Ways To Handle Large Action Spaces in Reinforcement Learning

Action spaces, particularly in combinatorial optimization problems, may grow unwieldy in size. This article discusses…

Wouter van Heeswijk, PhD

August 18, 2023

18 min read

Deep Deterministic Policy Gradients (DDPG) Explained

Artificial Intelligence

A gradient-based reinforcement learning algorithm to learn deterministic policies for continuous action spaces

Wouter van Heeswijk, PhD

April 5, 2023

12 min read

Solving The Taxi Environment With Q-Learning – A Tutorial

A Python implementation of Q-learning to solve the Taxi-v3 environment from OpenAI Gym in an…

Wouter van Heeswijk, PhD

March 20, 2023

8 min read

Rock-paper-scissors would be a dull affair with deterministic policies [Photo by Marcus Wallis on Unsplash]

When Stochastic Policies Are Better Than Deterministic Ones

Machine Learning

Why we let randomness dictate our action selection in Reinforcement Learning

Wouter van Heeswijk, PhD

February 18, 2023

7 min read

Three Fundamental Flaws In Common Reinforcement Learning Algorithms (And How To Fix Them)

Arm yourself against these shortcomings encountered in everyday RL algorithms

Wouter van Heeswijk, PhD

January 30, 2023

9 min read

Empirical performance, averaged over 57 Atari games. Rainbow DQN strongly outperforms individual DQN techniques, both in terms of rate of learning and final performance [source: DeepMind paper]

Rainbow DQN – The Best Reinforcement Learning Has to Offer?

Machine Learning

What happens if the most successful techniques in Deep Q-Learning are combined into a single…

Wouter van Heeswijk, PhD

December 8, 2022

14 min read

Proximal Policy Optimization (PPO) Explained

The journey from REINFORCE to the go-to algorithm in continuous control

Wouter van Heeswijk, PhD

November 29, 2022

16 min read

Author: Wouter van Heeswijk, PhD