|
|
1. Distributional Deep Q-Networks.html
|
HTML
|
102.4 B
|
|
|
1. Dueling Deep Q-Networks.html
|
HTML
|
102.4 B
|
|
|
1. Hyperparameter tuning with Optuna.mp4
|
MP4
|
32.4 MB
|
|
|
1. Hyperparameter tuning with Optuna.srt
|
SRT
|
11 KB
|
|
|
1. Introduction.mp4
|
MP4
|
32.4 MB
|
|
|
1. Introduction.mp4.jpg
|
JPG
|
174.8 KB
|
|
|
1. Maximization bias and Double Deep Q-Learning.mp4
|
MP4
|
13.8 MB
|
|
|
1. Module overview.mp4
|
MP4
|
1.3 MB
|
|
|
1. Module overview.srt
|
SRT
|
614.4 B
|
|
|
1. N-step Deep Q-Learning.html
|
HTML
|
102.4 B
|
|
|
1. Noisy Deep Q-Networks.html
|
HTML
|
102.4 B
|
|
|
1. Prioritized Experience Replay.html
|
HTML
|
102.4 B
|
|
|
1. PyTorch Lightning.mp4
|
MP4
|
32 MB
|
|
|
1. PyTorch Lightning.srt
|
SRT
|
10.5 KB
|
|
|
1.1 Advanced Reinforcement Learning in Python from DQN to SAC.html
|
HTML
|
102.4 B
|
|
|
1.2 Reinforcement Learning beginner to master.html
|
HTML
|
102.4 B
|
|
|
10. Bellman equations.mp4
|
MP4
|
12.4 MB
|
|
|
10. Bellman equations.srt
|
SRT
|
3.4 KB
|
|
|
10. Prepare the data loader and the optimizer.mp4
|
MP4
|
30.4 MB
|
|
|
10. Prepare the data loader and the optimizer.srt
|
SRT
|
4.9 KB
|
|
|
11. Define the train_step() method.mp4
|
MP4
|
49.8 MB
|
|
|
11. Define the train_step() method.srt
|
SRT
|
10.9 KB
|
|
|
11. Solving a Markov decision process.mp4
|
MP4
|
14.2 MB
|
|
|
11. Solving a Markov decision process.srt
|
SRT
|
3.6 KB
|
|
|
12. Define the train_epoch_end() method.mp4
|
MP4
|
32.2 MB
|
|
|
12. Define the train_epoch_end() method.srt
|
SRT
|
4.7 KB
|
|
|
13. Train the Deep Q-Learning algorithm.mp4
|
MP4
|
35.1 MB
|
|
|
13. Train the Deep Q-Learning algorithm.srt
|
SRT
|
7.5 KB
|
|
|
14. Explore the resulting agent.mp4
|
MP4
|
20.3 MB
|
|
|
14. Explore the resulting agent.srt
|
SRT
|
3.6 KB
|
|
|
2. Deep Q-Learning.mp4
|
MP4
|
16.2 MB
|
|
|
2. Deep Q-Learning.srt
|
SRT
|
3.4 KB
|
|
|
2. Elements common to all control tasks.mp4
|
MP4
|
38.7 MB
|
|
|
2. Elements common to all control tasks.srt
|
SRT
|
6.8 KB
|
|
|
2. Function approximators.mp4
|
MP4
|
36.3 MB
|
|
|
2. Function approximators.srt
|
SRT
|
9.8 KB
|
|
|
2. Link to the code notebook.html
|
HTML
|
204.8 B
|
|
|
2. Reinforcement Learning series.html
|
HTML
|
409.6 B
|
|
|
2. Temporal difference methods.mp4
|
MP4
|
12.6 MB
|
|
|
2. Temporal difference methods.srt
|
SRT
|
4.1 KB
|
|
|
2.1 Google colab.html
|
HTML
|
204.8 B
|
|
|
3. Artificial Neural Networks.mp4
|
MP4
|
24.3 MB
|
|
|
3. Artificial Neural Networks.srt
|
SRT
|
4.4 KB
|
|
|
3. Create the Double Deep Q-Learning algorithm.mp4
|
MP4
|
49.9 MB
|
|
|
3. Create the Double Deep Q-Learning algorithm.srt
|
SRT
|
8.5 KB
|
|
|
3. Create the dueling DQN.mp4
|
MP4
|
54.4 MB
|
|
|
3. Create the dueling DQN.srt
|
SRT
|
11.7 KB
|
|
|
3. DQN for visual inputs.mp4
|
MP4
|
69.1 MB
|
|
|
3. DQN for visual inputs.srt
|
SRT
|
15.1 KB
|
|
|
3. Experience replay.mp4
|
MP4
|
9 MB
|
|
|
3. Experience replay.srt
|
SRT
|
2.5 KB
|
|
|
3. Google Colab.mp4
|
MP4
|
5.8 MB
|
|
|
3. Google Colab.srt
|
SRT
|
2 KB
|
|
|
3. Introduction to PyTorch Lightning.mp4
|
MP4
|
30.9 MB
|
|
|
3. Introduction to PyTorch Lightning.srt
|
SRT
|
7 KB
|
|
|
3. Log average return.mp4
|
MP4
|
33.6 MB
|
|
|
3. Log average return.srt
|
SRT
|
5.6 KB
|
|
|
3. Solving control tasks with temporal difference method.mp4
|
MP4
|
14.5 MB
|
|
|
3. Solving control tasks with temporal difference method.srt
|
SRT
|
4.1 KB
|
|
|
3. The Markov decision process (MDP).mp4
|
MP4
|
25.1 MB
|
|
|
3. The Markov decision process (MDP).srt
|
SRT
|
6.4 KB
|
|
|
4. Artificial Neurons.mp4
|
MP4
|
25.6 MB
|
|
|
4. Artificial Neurons.srt
|
SRT
|
6.6 KB
|
|
|
4. Check the resulting agent.mp4
|
MP4
|
9.1 MB
|
|
|
4. Check the resulting agent.srt
|
SRT
|
1.7 KB
|
|
|
4. Create the Deep Q-Network.mp4
|
MP4
|
22.9 MB
|
|
|
4. Create the Deep Q-Network.srt
|
SRT
|
5.9 KB
|
|
|
4. Create the environment - Part 1.mp4
|
MP4
|
41.3 MB
|
|
|
4. Create the environment - Part 1.srt
|
SRT
|
9 KB
|
|
|
4. Define the objective function.mp4
|
MP4
|
29.8 MB
|
|
|
4. Define the objective function.srt
|
SRT
|
6.2 KB
|
|
|
4. Prioritized Experience Repay Buffer.mp4
|
MP4
|
63.6 MB
|
|
|
4. Prioritized Experience Repay Buffer.srt
|
SRT
|
15 KB
|
|
|
4. Q-Learning.mp4
|
MP4
|
11.1 MB
|
|
|
4. Q-Learning.srt
|
SRT
|
2.9 KB
|
|
|
4. Target Network.mp4
|
MP4
|
16.6 MB
|
|
|
4. Target Network.srt
|
SRT
|
4.6 KB
|
|
|
4. Types of Markov decision process.mp4
|
MP4
|
8.7 MB
|
|
|
4. Types of Markov decision process.srt
|
SRT
|
2.4 KB
|
|
|
4. Where to begin.mp4
|
MP4
|
4.6 MB
|
|
|
4. Where to begin.srt
|
SRT
|
2.1 KB
|
|
|
5. Advantages of temporal difference methods.mp4
|
MP4
|
3.7 MB
|
|
|
5. Advantages of temporal difference methods.srt
|
SRT
|
1.3 KB
|
|
|
5. Create and launch the hyperparameter tuning job.mp4
|
MP4
|
18.5 MB
|
|
|
5. Create and launch the hyperparameter tuning job.srt
|
SRT
|
3.2 KB
|
|
|
5. Create the environment - Part 2.mp4
|
MP4
|
36.6 MB
|
|
|
5. Create the environment - Part 2.srt
|
SRT
|
6.7 KB
|
|
|
5. Create the environment.mp4
|
MP4
|
62.6 MB
|
|
|
5. Create the environment.srt
|
SRT
|
14 KB
|
|
|
5. Create the policy.mp4
|
MP4
|
18 MB
|
|
|
5. Create the policy.srt
|
SRT
|
5.7 KB
|
|
|
5. How to represent a Neural Network.mp4
|
MP4
|
38.2 MB
|
|
|
5. How to represent a Neural Network.srt
|
SRT
|
8.2 KB
|
|
|
5. Trajectory vs episode.mp4
|
MP4
|
4.9 MB
|
|
|
5. Trajectory vs episode.srt
|
SRT
|
1.2 KB
|
|
|
6. Create the replay buffer.mp4
|
MP4
|
23 MB
|
|
|
6. Create the replay buffer.srt
|
SRT
|
6.6 KB
|
|
|
6. Explore the best trial.mp4
|
MP4
|
19.2 MB
|
|
|
6. Explore the best trial.srt
|
SRT
|
3.1 KB
|
|
|
6. Implement Deep Q-Learning.mp4
|
MP4
|
36.4 MB
|
|
|
6. Implement Deep Q-Learning.srt
|
SRT
|
6.7 KB
|
|
|
6. Implement the Deep Q-Learning algorithm with Prioritized Experience Replay.mp4
|
MP4
|
63.3 MB
|
|
|
6. Implement the Deep Q-Learning algorithm with Prioritized Experience Replay.srt
|
SRT
|
12.9 KB
|
|
|
6. Reward vs Return.mp4
|
MP4
|
5.3 MB
|
|
|
6. Reward vs Return.srt
|
SRT
|
1.8 KB
|
|
|
6. Stochastic Gradient Descent.mp4
|
MP4
|
49.9 MB
|
|
|
6. Stochastic Gradient Descent.srt
|
SRT
|
7.2 KB
|
|
|
7. Check the resulting agent.mp4
|
MP4
|
20.9 MB
|
|
|
7. Check the resulting agent.srt
|
SRT
|
2.7 KB
|
|
|
7. Create the environment.mp4
|
MP4
|
32.2 MB
|
|
|
7. Create the environment.srt
|
SRT
|
8.9 KB
|
|
|
7. Discount factor.mp4
|
MP4
|
14.8 MB
|
|
|
7. Discount factor.srt
|
SRT
|
4.6 KB
|
|
|
7. Launch the training process.mp4
|
MP4
|
42.5 MB
|
|
|
7. Launch the training process.srt
|
SRT
|
5.8 KB
|
|
|
7. Neural Network optimization.mp4
|
MP4
|
23.4 MB
|
|
|
7. Neural Network optimization.srt
|
SRT
|
5 KB
|
|
|
8. Check the resulting agent.mp4
|
MP4
|
16.8 MB
|
|
|
8. Check the resulting agent.srt
|
SRT
|
1.9 KB
|
|
|
8. Define the class for the Deep Q-Learning algorithm.mp4
|
MP4
|
54.5 MB
|
|
|
8. Define the class for the Deep Q-Learning algorithm.srt
|
SRT
|
13.6 KB
|
|
|
8. Policy.mp4
|
MP4
|
7.4 MB
|
|
|
8. Policy.srt
|
SRT
|
2.3 KB
|
|
|
9. Define the play_episode() function.mp4
|
MP4
|
29.1 MB
|
|
|
9. Define the play_episode() function.srt
|
SRT
|
5.5 KB
|
|
|
9. State values v(s) and action values q(s,a).mp4
|
MP4
|
4.3 MB
|
|
|
9. State values v(s) and action values q(s,a).srt
|
SRT
|
1.3 KB
|
|
|
Bonus Resources.txt
|
TXT
|
409.6 B
|
|
|
Get Bonus Downloads Here.url
|
URL
|
204.8 B
|