Secrets behind AlphaGo

Behind AlphaGo
- Deep Reinforcement Learning
Houston Machine Learning
Yan Xu
08/05/2017

Roadmap
• Introduction and Feature Engineering (2 lectures)
• Supervised Learning (4 lectures)
• Unsupervised Learning (3 lectures)
• Deep Learning series (4 lectures)
• Optimization in Deep learning
• Behind AlphaGo
• Mastering the game of Go with deep neural networks and tree search
• Reinforcement learning 101 – Ravi
• Deep reinforcement learning
• Attention network
• Cuda Programming Hands-on - Martin
• Application of Deep Learning
• Object recognition
• Chatbot
Slides posted on:
http://www.slideshare.net/xuyangela

Outline
• Go game
• Deep learning recap
• Reinforcement learning 101
• AlphaGo system overview: Deep reinforcement
learning
More depth into deep reinforcement learning in upcoming
meetups

Deep Learning
+
Reinforcement Learning
=
Deep reinforcement learning

Deep Learning: Basic Component
Activation
function

Deep Learning: Architecture
Convolutional neural network
feature
map
Feature map

Agent
Environment

https://www.youtube.com/watch?v=xWe58WGWmlk&t=64s

AlphaGo: Deep Reinforcement Learning
http://techtalks.tv/talks/deep-reinforcement-learning/62360/
Mimic human
experts
Play against self
Estimate wins
Policy Network
Value Network

http://techtalks.tv/talks/deep-reinforcement-learning/62360/

More coming up!
• Behind AlphaGo
• Mastering the game of Go with deep neural networks and
tree search
• Reinforcement learning 101 – Ravi
• Deep reinforcement learning
• Attention network
• Cuda Programming Hands-on - Martin
• Application of Deep Learning
• Object recognition
• Chatbot
• Any proposal?

HML Speaker Hall of Fame
Recognize contribution to Houston Machine Learning Meetup

Thank you ~
Slides will be posted at:
http://www.slideshare.net/xuyangela
Leave a
group
review
please 

Secrets behind AlphaGo

More Related Content

What's hot

Similar to Secrets behind AlphaGo

More from Yan Xu

Recently uploaded

Secrets behind AlphaGo