Behind AlphaGo
- Deep Reinforcement Learning
Houston Machine Learning
Yan Xu
08/05/2017
Roadmap
• Introduction and Feature Engineering (2 lectures)
• Supervised Learning (4 lectures)
• Unsupervised Learning (3 lectures)
• Deep Learning series (4 lectures)
• Optimization in Deep Learning
• Behind AlphaGo
• Mastering the game of Go with deep neural networks and tree search
• Reinforcement learning 101 – Ravi
• Deep reinforcement learning
• Attention network
• CUDA Programming Hands-on - Martin
• Application of Deep Learning
• Object recognition
• Chatbot
Slides posted on:
http://www.slideshare.net/xuyangela
Outline
• Go game
• Deep learning recap
• Reinforcement learning 101
• AlphaGo system overview: Deep reinforcement learning
More depth into deep reinforcement learning in upcoming meetups
Go game
Why Go is hard for a computer to play
Deep Learning
+
Reinforcement Learning
=
Deep reinforcement learning
Deep Learning: Basic Component
Activation function
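To make the basic component concrete, here is a minimal sketch of a single artificial neuron: a weighted sum of the inputs plus a bias, passed through an activation function. The function names (`neuron`, `relu`, `sigmoid`) and the example numbers are illustrative assumptions, not taken from the slides.

```python
import math


def relu(z):
    # Rectified linear unit: passes positive values, zeroes out negatives.
    return max(0.0, z)


def sigmoid(z):
    # Squashes any real number into the range (0, 1).
    return 1.0 / (1.0 + math.exp(-z))


def neuron(inputs, weights, bias, activation):
    # Weighted sum of the inputs plus a bias, followed by the activation.
    z = sum(w * x for w, x in zip(weights, inputs)) + bias
    return activation(z)


# Hypothetical inputs and weights, chosen only for illustration.
out_relu = neuron([1.0, 2.0], [0.5, -1.0], 0.25, relu)       # z = -1.25
out_sigmoid = neuron([1.0, 2.0], [0.5, -1.0], 1.5, sigmoid)  # z = 0.0
```

Stacking many such units in layers, and learning the weights and biases from data, gives the deep networks discussed in the following slides.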
Deep Learning: Representation
Deep Learning: Optimization
Deep Learning: Architecture
Convolutional neural network
Feature map
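As a rough illustration of how a convolutional layer produces a feature map, the sketch below slides a small kernel over a tiny image and records one weighted sum per position. It assumes no padding and a stride of 1; the helper name `conv2d_valid`, the image, and the kernel are made up for this example.

```python
def conv2d_valid(image, kernel):
    # Slide the kernel over every position where it fits entirely inside
    # the image (no padding, stride 1) and take the weighted sum there.
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    feature_map = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            acc = 0
            for di in range(kh):
                for dj in range(kw):
                    acc += image[i + di][j + dj] * kernel[di][dj]
            row.append(acc)
        feature_map.append(row)
    return feature_map


# Hypothetical 3x4 image with a vertical edge between columns 1 and 2.
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
# Hypothetical 2x2 kernel that responds to dark-to-bright vertical edges.
kernel = [
    [-1, 1],
    [-1, 1],
]
feature_map = conv2d_valid(image, kernel)
```

The resulting feature map is largest exactly where the edge sits, which is the sense in which each kernel acts as a detector for one local pattern.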
Reinforcement Learning
Agent
Environment
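The agent and environment interact in a loop: the agent picks an action based on the current state, and the environment returns the next state and a reward. Below is a minimal sketch of that loop on a toy corridor world. The `CorridorEnv` class, the `always_right` policy, and the reward scheme are assumptions made for illustration and are not part of the talk.

```python
class CorridorEnv:
    """Toy environment: positions 0..length-1, with the goal at the right end."""

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # action is -1 (move left) or +1 (move right); walls clamp the position.
        self.state = min(max(self.state + action, 0), self.length - 1)
        done = self.state == self.length - 1
        reward = 1.0 if done else 0.0
        return self.state, reward, done


def always_right(state):
    # A fixed, hand-written policy; a learning agent would improve this mapping.
    return +1


def run_episode(env, policy, max_steps=20):
    # The agent-environment loop: observe state, act, receive reward, repeat.
    state = env.reset()
    total_reward = 0.0
    steps = 0
    for _ in range(max_steps):
        action = policy(state)
        state, reward, done = env.step(action)
        total_reward += reward
        steps += 1
        if done:
            break
    return total_reward, steps


total_reward, steps = run_episode(CorridorEnv(), always_right)
```

Reinforcement learning replaces the fixed policy with one that is adjusted from experience so that the total reward collected over an episode increases.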
https://www.youtube.com/watch?v=xWe58WGWmlk&t=64s
Deep Reinforcement Learning
AlphaGo: Deep Reinforcement Learning
http://techtalks.tv/talks/deep-reinforcement-learning/62360/
Mimic human experts
Play against self
Estimate wins
Policy Network
Value Network
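The two networks differ mainly in what they output. As a hedged sketch, assume the policy network ends in a softmax that turns one score per candidate move into a probability distribution, and the value network ends in a tanh that maps a position score to an estimate between -1 (loss) and +1 (win). The real networks are deep convolutional models over the board; the functions `policy_head` and `value_head` and the scores below are hypothetical stand-ins for only the final output step.

```python
import math


def softmax(scores):
    # Subtract the maximum before exponentiating for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def policy_head(move_scores):
    # Policy output: a probability for each candidate move.
    return softmax(move_scores)


def value_head(position_score):
    # Value output: a single number in (-1, 1) estimating who is winning.
    return math.tanh(position_score)


# Hypothetical raw scores for three candidate moves and for one position.
move_probs = policy_head([2.0, 1.0, 0.0])
best_move = max(range(len(move_probs)), key=move_probs.__getitem__)
value = value_head(0.0)
```

Here the first move receives the highest probability, and a position score of zero maps to a value of zero, meaning neither side is favored.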
Wide application
More coming up!
• Behind AlphaGo
• Mastering the game of Go with deep neural networks and tree search
• Reinforcement learning 101 – Ravi
• Deep reinforcement learning
• Attention network
• CUDA Programming Hands-on - Martin
• Application of Deep Learning
• Object recognition
• Chatbot
• Any proposal?
HML Speaker Hall of Fame
Recognize contribution to Houston Machine Learning Meetup
Thank you ~
Slides will be posted at:
http://www.slideshare.net/xuyangela
Please leave a group review.
