|
|
1. Approximation Intro.mp4
|
MP4
|
6.5 MB
|
|
|
1. Approximation Intro.srt
|
SRT
|
8 KB
|
|
|
1. Gridworld.mp4
|
MP4
|
3.4 MB
|
|
|
1. Gridworld.srt
|
SRT
|
4 KB
|
|
|
1. Intro to Dynamic Programming and Iterative Policy Evaluation.mp4
|
MP4
|
4.8 MB
|
|
|
1. Intro to Dynamic Programming and Iterative Policy Evaluation.srt
|
SRT
|
5.4 KB
|
|
|
1. Introduction.mp4
|
MP4
|
34.2 MB
|
|
|
1. Introduction.srt
|
SRT
|
4.2 KB
|
|
|
1. Monte Carlo Intro.mp4
|
MP4
|
5 MB
|
|
|
1. Monte Carlo Intro.srt
|
SRT
|
6 KB
|
|
|
1. Naive Solution to Tic-Tac-Toe.mp4
|
MP4
|
6.1 MB
|
|
|
1. Naive Solution to Tic-Tac-Toe.srt
|
SRT
|
7.2 KB
|
|
|
1. Problem Setup and The Explore-Exploit Dilemma.mp4
|
MP4
|
6.5 MB
|
|
|
1. Problem Setup and The Explore-Exploit Dilemma.srt
|
SRT
|
7.8 KB
|
|
|
1. Stock Trading Project Section Introduction.mp4
|
MP4
|
26.8 MB
|
|
|
1. Stock Trading Project Section Introduction.srt
|
SRT
|
6.8 KB
|
|
|
1. Temporal Difference Intro.mp4
|
MP4
|
2.7 MB
|
|
|
1. Temporal Difference Intro.srt
|
SRT
|
3.3 KB
|
|
|
1. What is Reinforcement Learning.mp4
|
MP4
|
54.6 MB
|
|
|
1. What is Reinforcement Learning.srt
|
SRT
|
10.9 KB
|
|
|
1. What is the Appendix.mp4
|
MP4
|
5.5 MB
|
|
|
1. What is the Appendix.srt
|
SRT
|
3.7 KB
|
|
|
10. Thompson Sampling vs. Epsilon-Greedy vs. Optimistic Initial Values vs. UCB1.mp4
|
MP4
|
10.6 MB
|
|
|
10. Thompson Sampling vs. Epsilon-Greedy vs. Optimistic Initial Values vs. UCB1.srt
|
SRT
|
6.1 KB
|
|
|
10. Tic Tac Toe Code Main Loop and Demo.mp4
|
MP4
|
9.4 MB
|
|
|
10. Tic Tac Toe Code Main Loop and Demo.srt
|
SRT
|
9.2 KB
|
|
|
10. Value Iteration in Code.mp4
|
MP4
|
4.9 MB
|
|
|
10. Value Iteration in Code.srt
|
SRT
|
3.3 KB
|
|
|
10. What order should I take your courses in (part 1).mp4
|
MP4
|
29.3 MB
|
|
|
10. What order should I take your courses in (part 1).srt
|
SRT
|
16 KB
|
|
|
11. Dynamic Programming Summary.mp4
|
MP4
|
8.3 MB
|
|
|
11. Dynamic Programming Summary.srt
|
SRT
|
9.4 KB
|
|
|
11. Nonstationary Bandits.mp4
|
MP4
|
7.5 MB
|
|
|
11. Nonstationary Bandits.srt
|
SRT
|
7.8 KB
|
|
|
11. Tic Tac Toe Summary.mp4
|
MP4
|
8.3 MB
|
|
|
11. Tic Tac Toe Summary.srt
|
SRT
|
10.2 KB
|
|
|
11. What order should I take your courses in (part 2).mp4
|
MP4
|
37.6 MB
|
|
|
11. What order should I take your courses in (part 2).srt
|
SRT
|
23 KB
|
|
|
12. BONUS Where to get discount coupons and FREE deep learning material.mp4
|
MP4
|
37.8 MB
|
|
|
12. BONUS Where to get discount coupons and FREE deep learning material.srt
|
SRT
|
7.9 KB
|
|
|
12. Bandit Summary, Real Data, and Online Learning.mp4
|
MP4
|
33.9 MB
|
|
|
12. Bandit Summary, Real Data, and Online Learning.srt
|
SRT
|
9.1 KB
|
|
|
12. Tic Tac Toe Exercise.mp4
|
MP4
|
19.8 MB
|
|
|
12. Tic Tac Toe Exercise.srt
|
SRT
|
4.6 KB
|
|
|
2. Applications of the Explore-Exploit Dilemma.mp4
|
MP4
|
51.2 MB
|
|
|
2. Applications of the Explore-Exploit Dilemma.srt
|
SRT
|
10.9 KB
|
|
|
2. Components of a Reinforcement Learning System.mp4
|
MP4
|
12.7 MB
|
|
|
2. Components of a Reinforcement Learning System.srt
|
SRT
|
14.8 KB
|
|
|
2. Data and Environment.mp4
|
MP4
|
52 MB
|
|
|
2. Data and Environment.srt
|
SRT
|
15.7 KB
|
|
|
2. Gridworld in Code.mp4
|
MP4
|
11.5 MB
|
|
|
2. Gridworld in Code.srt
|
SRT
|
11 KB
|
|
|
2. Linear Models for Reinforcement Learning.mp4
|
MP4
|
6.5 MB
|
|
|
2. Linear Models for Reinforcement Learning.srt
|
SRT
|
7.4 KB
|
|
|
2. Monte Carlo Policy Evaluation.mp4
|
MP4
|
8.8 MB
|
|
|
2. Monte Carlo Policy Evaluation.srt
|
SRT
|
10.8 KB
|
|
|
2. On Unusual or Unexpected Strategies of RL.mp4
|
MP4
|
37.1 MB
|
|
|
2. On Unusual or Unexpected Strategies of RL.srt
|
SRT
|
7.9 KB
|
|
|
2. TD(0) Prediction.mp4
|
MP4
|
5.8 MB
|
|
|
2. TD(0) Prediction.srt
|
SRT
|
6.4 KB
|
|
|
2. The Markov Property.mp4
|
MP4
|
7.2 MB
|
|
|
2. The Markov Property.srt
|
SRT
|
8.4 KB
|
|
|
2. Where to get the Code.mp4
|
MP4
|
4.4 MB
|
|
|
2. Where to get the Code.srt
|
SRT
|
5.4 KB
|
|
|
2. Windows-Focused Environment Setup 2018.mp4
|
MP4
|
186.4 MB
|
|
|
2. Windows-Focused Environment Setup 2018.srt
|
SRT
|
20.1 KB
|
|
|
3. Defining Some Terms.mp4
|
MP4
|
42.3 MB
|
|
|
3. Defining Some Terms.srt
|
SRT
|
9.1 KB
|
|
|
3. Defining and Formalizing the MDP.mp4
|
MP4
|
6.6 MB
|
|
|
3. Defining and Formalizing the MDP.srt
|
SRT
|
7.9 KB
|
|
|
3. Designing Your RL Program.mp4
|
MP4
|
22.3 MB
|
|
|
3. Designing Your RL Program.srt
|
SRT
|
6.6 KB
|
|
|
3. Epsilon-Greedy.mp4
|
MP4
|
2.8 MB
|
|
|
3. Epsilon-Greedy.srt
|
SRT
|
3.2 KB
|
|
|
3. Features.mp4
|
MP4
|
6.3 MB
|
|
|
3. Features.srt
|
SRT
|
6.9 KB
|
|
|
3. How to Model Q for Q-Learning.mp4
|
MP4
|
44.9 MB
|
|
|
3. How to Model Q for Q-Learning.srt
|
SRT
|
12 KB
|
|
|
3. How to install Numpy, Scipy, Matplotlib, Pandas, IPython, Theano, and TensorFlow.mp4
|
MP4
|
43.9 MB
|
|
|
3. How to install Numpy, Scipy, Matplotlib, Pandas, IPython, Theano, and TensorFlow.srt
|
SRT
|
18.3 KB
|
|
|
3. Monte Carlo Policy Evaluation in Code.mp4
|
MP4
|
7.9 MB
|
|
|
3. Monte Carlo Policy Evaluation in Code.srt
|
SRT
|
6.1 KB
|
|
|
3. Notes on Assigning Rewards.mp4
|
MP4
|
4.2 MB
|
|
|
3. Notes on Assigning Rewards.srt
|
SRT
|
4.9 KB
|
|
|
3. Strategy for Passing the Course.mp4
|
MP4
|
9.5 MB
|
|
|
3. Strategy for Passing the Course.srt
|
SRT
|
11.8 KB
|
|
|
3. TD(0) Prediction in Code.mp4
|
MP4
|
5.3 MB
|
|
|
3. TD(0) Prediction in Code.srt
|
SRT
|
4 KB
|
|
|
4. Course Outline.mp4
|
MP4
|
31 MB
|
|
|
4. Course Outline.srt
|
SRT
|
6.8 KB
|
|
|
4. Design of the Program.mp4
|
MP4
|
23.3 MB
|
|
|
4. Design of the Program.srt
|
SRT
|
8.5 KB
|
|
|
4. Future Rewards.mp4
|
MP4
|
5.2 MB
|
|
|
4. Future Rewards.srt
|
SRT
|
6 KB
|
|
|
4. How to Code by Yourself (part 1).mp4
|
MP4
|
24.5 MB
|
|
|
4. How to Code by Yourself (part 1).srt
|
SRT
|
30.2 KB
|
|
|
4. Iterative Policy Evaluation in Code.mp4
|
MP4
|
12.1 MB
|
|
|
4. Iterative Policy Evaluation in Code.srt
|
SRT
|
10.2 KB
|
|
|
4. Monte Carlo Prediction with Approximation.mp4
|
MP4
|
2.8 MB
|
|
|
4. Monte Carlo Prediction with Approximation.srt
|
SRT
|
2.3 KB
|
|
|
4. Policy Evaluation in Windy Gridworld.mp4
|
MP4
|
7.8 MB
|
|
|
4. Policy Evaluation in Windy Gridworld.srt
|
SRT
|
5.3 KB
|
|
|
4. SARSA.mp4
|
MP4
|
8.2 MB
|
|
|
4. SARSA.srt
|
SRT
|
9.7 KB
|
|
|
4. The Value Function and Your First Reinforcement Learning Algorithm.mp4
|
MP4
|
103.7 MB
|
|
|
4. The Value Function and Your First Reinforcement Learning Algorithm.srt
|
SRT
|
22.8 KB
|
|
|
4. Updating a Sample Mean.mp4
|
MP4
|
2.2 MB
|
|
|
4. Updating a Sample Mean.srt
|
SRT
|
2.2 KB
|
|
|
5. Code pt 1.mp4
|
MP4
|
49.7 MB
|
|
|
5. Code pt 1.srt
|
SRT
|
9.6 KB
|
|
|
5. Designing Your Bandit Program.mp4
|
MP4
|
24.5 MB
|
|
|
5. Designing Your Bandit Program.srt
|
SRT
|
5.6 KB
|
|
|
5. How to Code by Yourself (part 2).mp4
|
MP4
|
14.8 MB
|
|
|
5. How to Code by Yourself (part 2).srt
|
SRT
|
18.4 KB
|
|
|
5. Monte Carlo Control.mp4
|
MP4
|
9.3 MB
|
|
|
5. Monte Carlo Control.srt
|
SRT
|
10.2 KB
|
|
|
5. Monte Carlo Prediction with Approximation in Code.mp4
|
MP4
|
6.6 MB
|
|
|
5. Monte Carlo Prediction with Approximation in Code.srt
|
SRT
|
4 KB
|
|
|
5. Policy Improvement.mp4
|
MP4
|
4.5 MB
|
|
|
5. Policy Improvement.srt
|
SRT
|
5.2 KB
|
|
|
5. SARSA in Code.mp4
|
MP4
|
8.8 MB
|
|
|
5. SARSA in Code.srt
|
SRT
|
5.5 KB
|
|
|
5. Tic Tac Toe Code Outline.mp4
|
MP4
|
5 MB
|
|
|
5. Tic Tac Toe Code Outline.srt
|
SRT
|
6.4 KB
|
|
|
5. Value Function Introduction.mp4
|
MP4
|
19.7 MB
|
|
|
5. Value Function Introduction.srt
|
SRT
|
15.6 KB
|
|
|
6. Code pt 2.mp4
|
MP4
|
65.3 MB
|
|
|
6. Code pt 2.srt
|
SRT
|
11.8 KB
|
|
|
6. Comparing Different Epsilons.mp4
|
MP4
|
8 MB
|
|
|
6. Comparing Different Epsilons.srt
|
SRT
|
5.3 KB
|
|
|
6. How to Succeed in this Course (Long Version).mp4
|
MP4
|
18.3 MB
|
|
|
6. How to Succeed in this Course (Long Version).srt
|
SRT
|
14.5 KB
|
|
|
6. Monte Carlo Control in Code.mp4
|
MP4
|
10.2 MB
|
|
|
6. Monte Carlo Control in Code.srt
|
SRT
|
5.8 KB
|
|
|
6. Policy Iteration.mp4
|
MP4
|
3.1 MB
|
|
|
6. Policy Iteration.srt
|
SRT
|
3.5 KB
|
|
|
6. Q Learning.mp4
|
MP4
|
4.8 MB
|
|
|
6. Q Learning.srt
|
SRT
|
5.8 KB
|
|
|
6. TD(0) Semi-Gradient Prediction.mp4
|
MP4
|
8.4 MB
|
|
|
6. TD(0) Semi-Gradient Prediction.srt
|
SRT
|
6.4 KB
|
|
|
6. Tic Tac Toe Code Representing States.mp4
|
MP4
|
4.4 MB
|
|
|
6. Tic Tac Toe Code Representing States.srt
|
SRT
|
4.9 KB
|
|
|
6. Value Functions.mp4
|
MP4
|
8.3 MB
|
|
|
6. Value Functions.srt
|
SRT
|
11.8 KB
|
|
|
7. Bellman Examples.mp4
|
MP4
|
87.1 MB
|
|
|
7. Bellman Examples.srt
|
SRT
|
27.7 KB
|
|
|
7. Code pt 3.mp4
|
MP4
|
33.7 MB
|
|
|
7. Code pt 3.srt
|
SRT
|
5.4 KB
|
|
|
7. Is this for Beginners or Experts Academic or Practical Fast or slow-paced.mp4
|
MP4
|
39 MB
|
|
|
7. Is this for Beginners or Experts Academic or Practical Fast or slow-paced.srt
|
SRT
|
31.8 KB
|
|
|
7. Monte Carlo Control without Exploring Starts.mp4
|
MP4
|
4.6 MB
|
|
|
7. Monte Carlo Control without Exploring Starts.srt
|
SRT
|
5.5 KB
|
|
|
7. Optimistic Initial Values.mp4
|
MP4
|
15.8 MB
|
|
|
7. Optimistic Initial Values.srt
|
SRT
|
3.1 KB
|
|
|
7. Policy Iteration in Code.mp4
|
MP4
|
7.6 MB
|
|
|
7. Policy Iteration in Code.srt
|
SRT
|
6.1 KB
|
|
|
7. Q Learning in Code.mp4
|
MP4
|
5.4 MB
|
|
|
7. Q Learning in Code.srt
|
SRT
|
3.5 KB
|
|
|
7. Semi-Gradient SARSA.mp4
|
MP4
|
4.7 MB
|
|
|
7. Semi-Gradient SARSA.srt
|
SRT
|
5.5 KB
|
|
|
7. Tic Tac Toe Code Enumerating States Recursively.mp4
|
MP4
|
9.8 MB
|
|
|
7. Tic Tac Toe Code Enumerating States Recursively.srt
|
SRT
|
11.3 KB
|
|
|
8. Code pt 4.mp4
|
MP4
|
49.1 MB
|
|
|
8. Code pt 4.srt
|
SRT
|
8 KB
|
|
|
8. Monte Carlo Control without Exploring Starts in Code.mp4
|
MP4
|
8.1 MB
|
|
|
8. Monte Carlo Control without Exploring Starts in Code.srt
|
SRT
|
3.6 KB
|
|
|
8. Optimal Policy and Optimal Value Function.mp4
|
MP4
|
3.2 MB
|
|
|
8. Optimal Policy and Optimal Value Function.srt
|
SRT
|
5 KB
|
|
|
8. Policy Iteration in Windy Gridworld.mp4
|
MP4
|
9.1 MB
|
|
|
8. Policy Iteration in Windy Gridworld.srt
|
SRT
|
8.2 KB
|
|
|
8. Proof that using Jupyter Notebook is the same as not using it.mp4
|
MP4
|
78.3 MB
|
|
|
8. Proof that using Jupyter Notebook is the same as not using it.srt
|
SRT
|
14.1 KB
|
|
|
8. Semi-Gradient SARSA in Code.mp4
|
MP4
|
10.6 MB
|
|
|
8. Semi-Gradient SARSA in Code.srt
|
SRT
|
5.4 KB
|
|
|
8. TD Summary.mp4
|
MP4
|
3.9 MB
|
|
|
8. TD Summary.srt
|
SRT
|
4.7 KB
|
|
|
8. Tic Tac Toe Code The Environment.mp4
|
MP4
|
10 MB
|
|
|
8. Tic Tac Toe Code The Environment.srt
|
SRT
|
12 KB
|
|
|
8. UCB1.mp4
|
MP4
|
8.2 MB
|
|
|
8. UCB1.srt
|
SRT
|
8.1 KB
|
|
|
9. Bayesian Thompson Sampling.mp4
|
MP4
|
51.8 MB
|
|
|
9. Bayesian Thompson Sampling.srt
|
SRT
|
11.8 KB
|
|
|
9. Course Summary and Next Steps.mp4
|
MP4
|
13.2 MB
|
|
|
9. Course Summary and Next Steps.srt
|
SRT
|
16 KB
|
|
|
9. MDP Summary.mp4
|
MP4
|
5.7 MB
|
|
|
9. MDP Summary.srt
|
SRT
|
2 KB
|
|
|
9. Monte Carlo Summary.mp4
|
MP4
|
5.7 MB
|
|
|
9. Monte Carlo Summary.srt
|
SRT
|
7.1 KB
|
|
|
9. Python 2 vs Python 3.mp4
|
MP4
|
7.8 MB
|
|
|
9. Python 2 vs Python 3.srt
|
SRT
|
6.1 KB
|
|
|
9. Stock Trading Project Discussion.mp4
|
MP4
|
15.8 MB
|
|
|
9. Stock Trading Project Discussion.srt
|
SRT
|
4.3 KB
|
|
|
9. Tic Tac Toe Code The Agent.mp4
|
MP4
|
9 MB
|
|
|
9. Tic Tac Toe Code The Agent.srt
|
SRT
|
10.9 KB
|
|
|
9. Value Iteration.mp4
|
MP4
|
6.2 MB
|
|
|
9. Value Iteration.srt
|
SRT
|
7 KB
|
|
|
[GigaCourse.com].url
|
URL
|
0 B
|