|
|
001 Continuous action spaces.mp4
|
MP4
|
29.6 MB
|
|
|
001 Continuous action spaces_en.vtt
|
VTT
|
6.8 KB
|
|
|
001 Hindsight Experience Replay (HER).mp4
|
MP4
|
17.1 MB
|
|
|
001 Hindsight Experience Replay (HER)_en.vtt
|
VTT
|
4.3 KB
|
|
|
001 Hyperparameter tuning with Optuna.mp4
|
MP4
|
32.4 MB
|
|
|
001 Hyperparameter tuning with Optuna_en.vtt
|
VTT
|
9.6 KB
|
|
|
001 Introduction.mp4
|
MP4
|
24.3 MB
|
|
|
001 Introduction_en.vtt
|
VTT
|
6.2 KB
|
|
|
001 Module Overview.mp4
|
MP4
|
2.6 MB
|
|
|
001 Module Overview_en.vtt
|
VTT
|
1 KB
|
|
|
001 Module overview.mp4
|
MP4
|
1.3 MB
|
|
|
001 Module overview_en.vtt
|
VTT
|
512 B
|
|
|
001 Next steps.mp4
|
MP4
|
17.3 MB
|
|
|
001 Next steps_en.vtt
|
VTT
|
2.2 KB
|
|
|
001 Policy gradient methods.mp4
|
MP4
|
21.7 MB
|
|
|
001 Policy gradient methods_en.vtt
|
VTT
|
4.8 KB
|
|
|
001 PyTorch Lightning.mp4
|
MP4
|
32 MB
|
|
|
001 PyTorch Lightning_en.vtt
|
VTT
|
9.2 KB
|
|
|
001 Soft Actor-Critic (SAC).mp4
|
MP4
|
24 MB
|
|
|
001 Soft Actor-Critic (SAC)_en.vtt
|
VTT
|
7.5 KB
|
|
|
001 The Brax Physics engine.mp4
|
MP4
|
20 MB
|
|
|
001 The Brax Physics engine_en.vtt
|
VTT
|
3.5 KB
|
|
|
001 Twin Delayed DDPG (TD3).mp4
|
MP4
|
34 MB
|
|
|
001 Twin Delayed DDPG (TD3)_en.vtt
|
VTT
|
11.4 KB
|
|
|
002 Deep Deterministic Policy Gradient (DDPG).mp4
|
MP4
|
32.3 MB
|
|
|
002 Deep Deterministic Policy Gradient (DDPG)_en.vtt
|
VTT
|
9.9 KB
|
|
|
002 Deep Q-Learning.mp4
|
MP4
|
16.2 MB
|
|
|
002 Deep Q-Learning_en.vtt
|
VTT
|
2.9 KB
|
|
|
002 Elements common to all control tasks.mp4
|
MP4
|
38.7 MB
|
|
|
002 Elements common to all control tasks_en.vtt
|
VTT
|
5.9 KB
|
|
|
002 Function approximators.mp4
|
MP4
|
36.3 MB
|
|
|
002 Function approximators_en.vtt
|
VTT
|
8.5 KB
|
|
|
002 Implement Hindsight Experience Replay (HER) - Part 1.mp4
|
MP4
|
34 MB
|
|
|
002 Implement Hindsight Experience Replay (HER) - Part 1_en.vtt
|
VTT
|
5.2 KB
|
|
|
002 Link to the code notebook.html
|
HTML
|
307.2 B
|
|
|
002 Next steps.html
|
HTML
|
512 B
|
|
|
002 Policy performance.mp4
|
MP4
|
8.5 MB
|
|
|
002 Policy performance_en.vtt
|
VTT
|
2.6 KB
|
|
|
002 Reinforcement Learning series.html
|
HTML
|
512 B
|
|
|
002 SAC pseudocode.mp4
|
MP4
|
9.5 MB
|
|
|
002 SAC pseudocode_en.vtt
|
VTT
|
2.1 KB
|
|
|
002 TD3 pseudocode.mp4
|
MP4
|
20 MB
|
|
|
002 TD3 pseudocode_en.vtt
|
VTT
|
4.3 KB
|
|
|
002 Temporal difference methods.mp4
|
MP4
|
12.6 MB
|
|
|
002 Temporal difference methods_en.vtt
|
VTT
|
3.5 KB
|
|
|
002 The advantage function.mp4
|
MP4
|
13.4 MB
|
|
|
002 The advantage function_en.vtt
|
VTT
|
4.8 KB
|
|
|
003 Artificial Neural Networks.mp4
|
MP4
|
24.3 MB
|
|
|
003 Artificial Neural Networks_en.vtt
|
VTT
|
3.8 KB
|
|
|
003 Create the robotics task.mp4
|
MP4
|
74 MB
|
|
|
003 Create the robotics task_en.vtt
|
VTT
|
11.4 KB
|
|
|
003 DDPG pseudocode.mp4
|
MP4
|
20.9 MB
|
|
|
003 DDPG pseudocode_en.vtt
|
VTT
|
3.9 KB
|
|
|
003 Experience Replay.mp4
|
MP4
|
9 MB
|
|
|
003 Experience Replay_en.vtt
|
VTT
|
2.2 KB
|
|
|
003 Google Colab.mp4
|
MP4
|
5.8 MB
|
|
|
003 Google Colab_en.vtt
|
VTT
|
1.8 KB
|
|
|
003 Implement Hindsight Experience Replay (HER) - Part 2.mp4
|
MP4
|
21.7 MB
|
|
|
003 Implement Hindsight Experience Replay (HER) - Part 2_en.vtt
|
VTT
|
2.9 KB
|
|
|
003 Introduction to PyTorch Lightning.mp4
|
MP4
|
30.9 MB
|
|
|
003 Introduction to PyTorch Lightning_en.vtt
|
VTT
|
6.2 KB
|
|
|
003 Link to code notebook.html
|
HTML
|
307.2 B
|
|
|
003 Log average return.mp4
|
MP4
|
33.6 MB
|
|
|
003 Log average return_en.vtt
|
VTT
|
4.8 KB
|
|
|
003 Normalized Advantage Function (NAF).mp4
|
MP4
|
10.1 MB
|
|
|
003 Normalized Advantage Function (NAF)_en.vtt
|
VTT
|
3.3 KB
|
|
|
003 Representing policies using neural networks.mp4
|
MP4
|
27.8 MB
|
|
|
003 Representing policies using neural networks_en.vtt
|
VTT
|
5.3 KB
|
|
|
003 Solving control tasks with temporal difference methods.mp4
|
MP4
|
14.5 MB
|
|
|
003 Solving control tasks with temporal difference methods_en.vtt
|
VTT
|
3.6 KB
|
|
|
003 The Markov decision process (MDP).mp4
|
MP4
|
25.1 MB
|
|
|
003 The Markov decision process (MDP)_en.vtt
|
VTT
|
5.7 KB
|
|
|
004 Artificial Neurons.mp4
|
MP4
|
25.6 MB
|
|
|
004 Artificial Neurons_en.vtt
|
VTT
|
5.8 KB
|
|
|
004 Create the Deep Q-Network.mp4
|
MP4
|
19 MB
|
|
|
004 Create the Deep Q-Network_en.vtt
|
VTT
|
3.5 KB
|
|
|
004 Define the objective function.mp4
|
MP4
|
29.8 MB
|
|
|
004 Define the objective function_en.vtt
|
VTT
|
5.3 KB
|
|
|
004 Implement Hindsight Experience Replay (HER) - Part 3.mp4
|
MP4
|
73.7 MB
|
|
|
004 Implement Hindsight Experience Replay (HER) - Part 3_en.vtt
|
VTT
|
9.9 KB
|
|
|
004 Link to the code notebook.html
|
HTML
|
307.2 B
|
|
|
004 Normalized Advantage Function pseudocode.mp4
|
MP4
|
23.2 MB
|
|
|
004 Normalized Advantage Function pseudocode_en.vtt
|
VTT
|
5.7 KB
|
|
|
004 Q-Learning.mp4
|
MP4
|
11.1 MB
|
|
|
004 Q-Learning_en.vtt
|
VTT
|
2.5 KB
|
|
|
004 Target Network.mp4
|
MP4
|
16.6 MB
|
|
|
004 Target Network_en.vtt
|
VTT
|
3.9 KB
|
|
|
004 The policy gradient theorem.mp4
|
MP4
|
15.9 MB
|
|
|
004 The policy gradient theorem_en.vtt
|
VTT
|
3.8 KB
|
|
|
004 Twin Delayed DDPG (TD3).mp4
|
MP4
|
19.9 MB
|
|
|
004 Twin Delayed DDPG (TD3)_en.vtt
|
VTT
|
3.2 KB
|
|
|
004 Types of Markov decision process.mp4
|
MP4
|
8.7 MB
|
|
|
004 Types of Markov decision process_en.vtt
|
VTT
|
2.2 KB
|
|
|
004 Where to begin.mp4
|
MP4
|
5.1 MB
|
|
|
004 Where to begin_en.vtt
|
VTT
|
2 KB
|
|
|
005 Advantages of temporal difference methods.mp4
|
MP4
|
3.7 MB
|
|
|
005 Advantages of temporal difference methods_en.vtt
|
VTT
|
1.2 KB
|
|
|
005 Check the results.mp4
|
MP4
|
7.4 MB
|
|
|
005 Check the results_en.vtt
|
VTT
|
1 KB
|
|
|
005 Clipped double Q-Learning.mp4
|
MP4
|
31.5 MB
|
|
|
005 Clipped double Q-Learning_en.vtt
|
VTT
|
3.9 KB
|
|
|
005 Create and launch the hyperparameter tuning job.mp4
|
MP4
|
18.5 MB
|
|
|
005 Create and launch the hyperparameter tuning job_en.vtt
|
VTT
|
2.6 KB
|
|
|
005 Create the gradient policy.mp4
|
MP4
|
53.8 MB
|
|
|
005 Create the gradient policy_en.vtt
|
VTT
|
12.6 KB
|
|
|
005 Create the policy.mp4
|
MP4
|
18 MB
|
|
|
005 Create the policy_en.vtt
|
VTT
|
5.1 KB
|
|
|
005 Deep Deterministic Policy Gradient (DDPG).mp4
|
MP4
|
31.8 MB
|
|
|
005 Deep Deterministic Policy Gradient (DDPG)_en.vtt
|
VTT
|
5.7 KB
|
|
|
005 Entropy Regularization.mp4
|
MP4
|
23.2 MB
|
|
|
005 Entropy Regularization_en.vtt
|
VTT
|
6.5 KB
|
|
|
005 How to represent a Neural Network.mp4
|
MP4
|
38.2 MB
|
|
|
005 How to represent a Neural Network_en.vtt
|
VTT
|
7.2 KB
|
|
|
005 Link to the code notebook.html
|
HTML
|
307.2 B
|
|
|
005 Trajectory vs episode.mp4
|
MP4
|
4.9 MB
|
|
|
005 Trajectory vs episode_en.vtt
|
VTT
|
1.1 KB
|
|
|
006 Create the gradient policy.mp4
|
MP4
|
43.4 MB
|
|
|
006 Create the gradient policy_en.vtt
|
VTT
|
9.7 KB
|
|
|
006 Create the replay buffer.mp4
|
MP4
|
23 MB
|
|
|
006 Create the replay buffer_en.vtt
|
VTT
|
5.6 KB
|
|
|
006 Delayed policy updates.mp4
|
MP4
|
12.1 MB
|
|
|
006 Delayed policy updates_en.vtt
|
VTT
|
2.1 KB
|
|
|
006 Explore the best trial.mp4
|
MP4
|
19.2 MB
|
|
|
006 Explore the best trial_en.vtt
|
VTT
|
2.6 KB
|
|
|
006 Hyperbolic tangent.mp4
|
MP4
|
4.7 MB
|
|
|
006 Hyperbolic tangent_en.vtt
|
VTT
|
1.6 KB
|
|
|
006 Implement the Soft Actor-Critic algorithm - Part 1.mp4
|
MP4
|
40.1 MB
|
|
|
006 Implement the Soft Actor-Critic algorithm - Part 1_en.vtt
|
VTT
|
7.1 KB
|
|
|
006 Reward vs Return.mp4
|
MP4
|
5.3 MB
|
|
|
006 Reward vs Return_en.vtt
|
VTT
|
1.6 KB
|
|
|
006 Stochastic Gradient Descent.mp4
|
MP4
|
49.9 MB
|
|
|
006 Stochastic Gradient Descent_en.vtt
|
VTT
|
6.3 KB
|
|
|
007 Create the Deep Q-Network.mp4
|
MP4
|
22.8 MB
|
|
|
007 Create the Deep Q-Network_en.vtt
|
VTT
|
4.3 KB
|
|
|
007 Create the environment.mp4
|
MP4
|
32.2 MB
|
|
|
007 Create the environment_en.vtt
|
VTT
|
7.5 KB
|
|
|
007 Creating the (NAF) Deep Q-Network 1.mp4
|
MP4
|
41.4 MB
|
|
|
007 Creating the (NAF) Deep Q-Network 1_en.vtt
|
VTT
|
7.5 KB
|
|
|
007 Discount factor.mp4
|
MP4
|
14.8 MB
|
|
|
007 Discount factor_en.vtt
|
VTT
|
4 KB
|
|
|
007 Implement the Soft Actor-Critic algorithm - Part 2.mp4
|
MP4
|
66.7 MB
|
|
|
007 Implement the Soft Actor-Critic algorithm - Part 2_en.vtt
|
VTT
|
9.2 KB
|
|
|
007 Neural Network optimization.mp4
|
MP4
|
23.4 MB
|
|
|
007 Neural Network optimization_en.vtt
|
VTT
|
4.4 KB
|
|
|
007 Target policy smoothing.mp4
|
MP4
|
31 MB
|
|
|
007 Target policy smoothing_en.vtt
|
VTT
|
4.1 KB
|
|
|
008 Check the resulting agent.mp4
|
MP4
|
31.1 MB
|
|
|
008 Check the resulting agent_en.vtt
|
VTT
|
2.2 KB
|
|
|
008 Check the results.mp4
|
MP4
|
12.1 MB
|
|
|
008 Check the results_en.vtt
|
VTT
|
2.1 KB
|
|
|
008 Create the DDPG class.mp4
|
MP4
|
38.9 MB
|
|
|
008 Create the DDPG class_en.vtt
|
VTT
|
7.3 KB
|
|
|
008 Creating the (NAF) Deep Q-Network 2.mp4
|
MP4
|
15 MB
|
|
|
008 Creating the (NAF) Deep Q-Network 2_en.vtt
|
VTT
|
3.3 KB
|
|
|
008 Define the class for the Deep Q-Learning algorithm.mp4
|
MP4
|
54.5 MB
|
|
|
008 Define the class for the Deep Q-Learning algorithm_en.vtt
|
VTT
|
11.6 KB
|
|
|
008 Policy.mp4
|
MP4
|
7.4 MB
|
|
|
008 Policy_en.vtt
|
VTT
|
2.2 KB
|
|
|
009 Creating the (NAF) Deep Q-Network 3.mp4
|
MP4
|
5.4 MB
|
|
|
009 Creating the (NAF) Deep Q-Network 3_en.vtt
|
VTT
|
1.1 KB
|
|
|
009 Define the play method.mp4
|
MP4
|
13.2 MB
|
|
|
009 Define the play method_en.vtt
|
VTT
|
2.2 KB
|
|
|
009 Define the play_episode() function.mp4
|
MP4
|
29.1 MB
|
|
|
009 Define the play_episode() function_en.vtt
|
VTT
|
4.9 KB
|
|
|
009 State values v(s) and action values q(s,a).mp4
|
MP4
|
4.3 MB
|
|
|
009 State values v(s) and action values q(s,a)_en.vtt
|
VTT
|
1.2 KB
|
|
|
010 Bellman equations.mp4
|
MP4
|
12.4 MB
|
|
|
010 Bellman equations_en.vtt
|
VTT
|
3 KB
|
|
|
010 Creating the (NAF) Deep Q-Network 4.mp4
|
MP4
|
47.9 MB
|
|
|
010 Creating the (NAF) Deep Q-Network 4_en.vtt
|
VTT
|
9.3 KB
|
|
|
010 Prepare the data loader and the optimizer.mp4
|
MP4
|
30.4 MB
|
|
|
010 Prepare the data loader and the optimizer_en.vtt
|
VTT
|
4.2 KB
|
|
|
010 Setup the optimizers and dataloader.mp4
|
MP4
|
22.3 MB
|
|
|
010 Setup the optimizers and dataloader_en.vtt
|
VTT
|
3.2 KB
|
|
|
011 Creating the policy.mp4
|
MP4
|
25 MB
|
|
|
011 Creating the policy_en.vtt
|
VTT
|
5.2 KB
|
|
|
011 Define the train_step() method.mp4
|
MP4
|
49.8 MB
|
|
|
011 Define the train_step() method_en.vtt
|
VTT
|
9.3 KB
|
|
|
011 Define the training step.mp4
|
MP4
|
57.9 MB
|
|
|
011 Define the training step_en.vtt
|
VTT
|
9.9 KB
|
|
|
011 Solving a Markov decision process.mp4
|
MP4
|
14.1 MB
|
|
|
011 Solving a Markov decision process_en.vtt
|
VTT
|
3.1 KB
|
|
|
012 Create the environment.mp4
|
MP4
|
22.5 MB
|
|
|
012 Create the environment_en.vtt
|
VTT
|
4.6 KB
|
|
|
012 Define the train_epoch_end() method.mp4
|
MP4
|
32.2 MB
|
|
|
012 Define the train_epoch_end() method_en.vtt
|
VTT
|
4 KB
|
|
|
012 Launch the training process.mp4
|
MP4
|
34.2 MB
|
|
|
012 Launch the training process_en.vtt
|
VTT
|
3.9 KB
|
|
|
013 Check the resulting agent.mp4
|
MP4
|
30.2 MB
|
|
|
013 Check the resulting agent_en.vtt
|
VTT
|
1.6 KB
|
|
|
013 Polyak averaging.mp4
|
MP4
|
4.8 MB
|
|
|
013 Polyak averaging_en.vtt
|
VTT
|
1.5 KB
|
|
|
013 [Important] Lecture correction.html
|
HTML
|
614.4 B
|
|
|
014 Implementing Polyak averaging.mp4
|
MP4
|
10.4 MB
|
|
|
014 Implementing Polyak averaging_en.vtt
|
VTT
|
2.2 KB
|
|
|
014 Train the Deep Q-Learning algorithm.mp4
|
MP4
|
35 MB
|
|
|
014 Train the Deep Q-Learning algorithm_en.vtt
|
VTT
|
6.5 KB
|
|
|
015 Create the (NAF) Deep Q-Learning algorithm.mp4
|
MP4
|
42.9 MB
|
|
|
015 Create the (NAF) Deep Q-Learning algorithm_en.vtt
|
VTT
|
7.9 KB
|
|
|
015 Explore the resulting agent.mp4
|
MP4
|
20.3 MB
|
|
|
015 Explore the resulting agent_en.vtt
|
VTT
|
2.8 KB
|
|
|
016 Implement the training step.mp4
|
MP4
|
13.3 MB
|
|
|
016 Implement the training step_en.vtt
|
VTT
|
2.4 KB
|
|
|
017 Implement the end-of-epoch logic.mp4
|
MP4
|
12.5 MB
|
|
|
017 Implement the end-of-epoch logic_en.vtt
|
VTT
|
2.2 KB
|
|
|
018 Debugging and launching the algorithm.mp4
|
MP4
|
20 MB
|
|
|
018 Debugging and launching the algorithm_en.vtt
|
VTT
|
2.9 KB
|
|
|
019 Checking the resulting agent.mp4
|
MP4
|
16.4 MB
|
|
|
019 Checking the resulting agent_en.vtt
|
VTT
|
2 KB
|
|
|
Bonus Resources.txt
|
TXT
|
409.6 B
|
|
|
Get Bonus Downloads Here.url
|
URL
|
204.8 B
|
|
|
external-assets-links.txt
|
TXT
|
102.4 B
|