Tags » Reinforcement Learning

So, what is Bayesian Bandits

This is your very first post. Click the Edit link to modify or delete it, or start a new post. If you like, use this post to tell readers why you started this blog and what you plan to do with it.

Reinforcement Learning

Dopamine: reward or aversion?

Everybody Loves Dopamine

Dopamine is love. Dopamine is reward. Dopamine is addiction. Neuroscientists have a love/hate relationship with how this monoamine neurotransmitter is portrayed in the popular press. 52 more words

Behavioral Economics

AlphaGo vs Lee Sedol - The new AI Challenge

On March 4, I was contacted by the Xinhua News Agency to comment on the upcoming Go match between Google DeepMind’s AlphaGo algorithm and the top-Go player Lee Sedol. 1,527 more words


5 easy pieces: How Deepmind mastered Go

Google Deepmind announced last week that it created an AI that can play professional-level Go. The game of Go has always been something of a holy grail for game AI, given its large branching factor and the difficulty of evaluating a position. 1,741 more words

Journal Club

Play Your Cards Right with Python

Peters, my morbidly objectionable Dutch lodger, has won every game of cards against me. He now owns the deeds to my house, the yellow car on the driveway and even the clothes on my back (if the fat fool could squeeze into them, that is). 655 more words


Useful Control Variates for Variance Reduction


For many problems in machine learning (ranging from Generative Models to Reinforcement Learning), we rely on Monte Carlo estimators of gradients for optimization. Often, the… 129 more words


NIPS 2015 - Deep RL Workshop

This is a brief summary of the first part of the Deep RL workshop at NIPS 2015. I couldn’t get a seat for the second half… 1,198 more words