Udemy - Advanced Reinforcement Learning in Python - from DQN to SAC

seeders: 0 leechers: 0
Added 4 years ago by freecoursewb in Other
Downloaded 9 times.
1337x.to
Udemy - Advanced Reinforcement Learning in Python - from DQN to SAC

Torrent Contents Size: 2.4 GB

Udemy - Advanced Reinforcement Learning in Python - from DQN to SAC
▼ show more 213 files
001 Continuous action spaces.mp4
MP4
29.6 MB
001 Continuous action spaces_en.vtt
VTT
6.8 KB
001 Hindsight Experience Replay (HER).mp4
MP4
17.1 MB
001 Hindsight Experience Replay (HER)_en.vtt
VTT
4.3 KB
001 Hyperparameter tuning with Optuna.mp4
MP4
32.4 MB
001 Hyperparameter tuning with Optuna_en.vtt
VTT
9.6 KB
001 Introduction.mp4
MP4
24.3 MB
001 Introduction_en.vtt
VTT
6.2 KB
001 Module Overview.mp4
MP4
2.6 MB
001 Module Overview_en.vtt
VTT
1 KB
001 Module overview.mp4
MP4
1.3 MB
001 Module overview_en.vtt
VTT
512 B
001 Next steps.mp4
MP4
17.3 MB
001 Next steps_en.vtt
VTT
2.2 KB
001 Policy gradient methods.mp4
MP4
21.7 MB
001 Policy gradient methods_en.vtt
VTT
4.8 KB
001 PyTorch Lightning.mp4
MP4
32 MB
001 PyTorch Lightning_en.vtt
VTT
9.2 KB
001 Soft Actor-Critic (SAC).mp4
MP4
24 MB
001 Soft Actor-Critic (SAC)_en.vtt
VTT
7.5 KB
001 The Brax Physics engine.mp4
MP4
20 MB
001 The Brax Physics engine_en.vtt
VTT
3.5 KB
001 Twin Delayed DDPG (TD3).mp4
MP4
34 MB
001 Twin Delayed DDPG (TD3)_en.vtt
VTT
11.4 KB
002 Deep Deterministic Policy Gradient (DDPG).mp4
MP4
32.3 MB
002 Deep Deterministic Policy Gradient (DDPG)_en.vtt
VTT
9.9 KB
002 Deep Q-Learning.mp4
MP4
16.2 MB
002 Deep Q-Learning_en.vtt
VTT
2.9 KB
002 Elements common to all control tasks.mp4
MP4
38.7 MB
002 Elements common to all control tasks_en.vtt
VTT
5.9 KB
002 Function approximators.mp4
MP4
36.3 MB
002 Function approximators_en.vtt
VTT
8.5 KB
002 Implement Hindsight Experience Replay (HER) - Part 1.mp4
MP4
34 MB
002 Implement Hindsight Experience Replay (HER) - Part 1_en.vtt
VTT
5.2 KB
002 Link to the code notebook.html
HTML
307.2 B
002 Next steps.html
HTML
512 B
002 Policy performance.mp4
MP4
8.5 MB
002 Policy performance_en.vtt
VTT
2.6 KB
002 Reinforcement Learning series.html
HTML
512 B
002 SAC pseudocode.mp4
MP4
9.5 MB
002 SAC pseudocode_en.vtt
VTT
2.1 KB
002 TD3 pseudocode.mp4
MP4
20 MB
002 TD3 pseudocode_en.vtt
VTT
4.3 KB
002 Temporal difference methods.mp4
MP4
12.6 MB
002 Temporal difference methods_en.vtt
VTT
3.5 KB
002 The advantage function.mp4
MP4
13.4 MB
002 The advantage function_en.vtt
VTT
4.8 KB
003 Artificial Neural Networks.mp4
MP4
24.3 MB
003 Artificial Neural Networks_en.vtt
VTT
3.8 KB
003 Create the robotics task.mp4
MP4
74 MB
003 Create the robotics task_en.vtt
VTT
11.4 KB
003 DDPG pseudocode.mp4
MP4
20.9 MB
003 DDPG pseudocode_en.vtt
VTT
3.9 KB
003 Experience Replay.mp4
MP4
9 MB
003 Experience Replay_en.vtt
VTT
2.2 KB
003 Google Colab.mp4
MP4
5.8 MB
003 Google Colab_en.vtt
VTT
1.8 KB
003 Implement Hindsight Experience Replay (HER) - Part 2.mp4
MP4
21.7 MB
003 Implement Hindsight Experience Replay (HER) - Part 2_en.vtt
VTT
2.9 KB
003 Introduction to PyTorch Lightning.mp4
MP4
30.9 MB
003 Introduction to PyTorch Lightning_en.vtt
VTT
6.2 KB
003 Link to code notebook.html
HTML
307.2 B
003 Log average return.mp4
MP4
33.6 MB
003 Log average return_en.vtt
VTT
4.8 KB
003 Normalized Advantage Function (NAF).mp4
MP4
10.1 MB
003 Normalized Advantage Function (NAF)_en.vtt
VTT
3.3 KB
003 Representing policies using neural networks.mp4
MP4
27.8 MB
003 Representing policies using neural networks_en.vtt
VTT
5.3 KB
003 Solving control tasks with temporal difference methods.mp4
MP4
14.5 MB
003 Solving control tasks with temporal difference methods_en.vtt
VTT
3.6 KB
003 The Markov decision process (MDP).mp4
MP4
25.1 MB
003 The Markov decision process (MDP)_en.vtt
VTT
5.7 KB
004 Artificial Neurons.mp4
MP4
25.6 MB
004 Artificial Neurons_en.vtt
VTT
5.8 KB
004 Create the Deep Q-Network.mp4
MP4
19 MB
004 Create the Deep Q-Network_en.vtt
VTT
3.5 KB
004 Define the objective function.mp4
MP4
29.8 MB
004 Define the objective function_en.vtt
VTT
5.3 KB
004 Implement Hindsight Experience Replay (HER) - Part 3.mp4
MP4
73.7 MB
004 Implement Hindsight Experience Replay (HER) - Part 3_en.vtt
VTT
9.9 KB
004 Link to the code notebook.html
HTML
307.2 B
004 Normalized Advantage Function pseudocode.mp4
MP4
23.2 MB
004 Normalized Advantage Function pseudocode_en.vtt
VTT
5.7 KB
004 Q-Learning.mp4
MP4
11.1 MB
004 Q-Learning_en.vtt
VTT
2.5 KB
004 Target Network.mp4
MP4
16.6 MB
004 Target Network_en.vtt
VTT
3.9 KB
004 The policy gradient theorem.mp4
MP4
15.9 MB
004 The policy gradient theorem_en.vtt
VTT
3.8 KB
004 Twin Delayed DDPG (TD3).mp4
MP4
19.9 MB
004 Twin Delayed DDPG (TD3)_en.vtt
VTT
3.2 KB
004 Types of Markov decision process.mp4
MP4
8.7 MB
004 Types of Markov decision process_en.vtt
VTT
2.2 KB
004 Where to begin.mp4
MP4
5.1 MB
004 Where to begin_en.vtt
VTT
2 KB
005 Advantages of temporal difference methods.mp4
MP4
3.7 MB
005 Advantages of temporal difference methods_en.vtt
VTT
1.2 KB
005 Check the results.mp4
MP4
7.4 MB
005 Check the results_en.vtt
VTT
1 KB
005 Clipped double Q-Learning.mp4
MP4
31.5 MB
005 Clipped double Q-Learning_en.vtt
VTT
3.9 KB
005 Create and launch the hyperparameter tuning job.mp4
MP4
18.5 MB
005 Create and launch the hyperparameter tuning job_en.vtt
VTT
2.6 KB
005 Create the gradient policy.mp4
MP4
53.8 MB
005 Create the gradient policy_en.vtt
VTT
12.6 KB
005 Create the policy.mp4
MP4
18 MB
005 Create the policy_en.vtt
VTT
5.1 KB
005 Deep Deterministic Policy Gradient (DDPG).mp4
MP4
31.8 MB
005 Deep Deterministic Policy Gradient (DDPG)_en.vtt
VTT
5.7 KB
005 Entropy Regularization.mp4
MP4
23.2 MB
005 Entropy Regularization_en.vtt
VTT
6.5 KB
005 How to represent a Neural Network.mp4
MP4
38.2 MB
005 How to represent a Neural Network_en.vtt
VTT
7.2 KB
005 Link to the code notebook.html
HTML
307.2 B
005 Trajectory vs episode.mp4
MP4
4.9 MB
005 Trajectory vs episode_en.vtt
VTT
1.1 KB
006 Create the gradient policy.mp4
MP4
43.4 MB
006 Create the gradient policy_en.vtt
VTT
9.7 KB
006 Create the replay buffer.mp4
MP4
23 MB
006 Create the replay buffer_en.vtt
VTT
5.6 KB
006 Delayed policy updates.mp4
MP4
12.1 MB
006 Delayed policy updates_en.vtt
VTT
2.1 KB
006 Explore the best trial.mp4
MP4
19.2 MB
006 Explore the best trial_en.vtt
VTT
2.6 KB
006 Hyperbolic tangent.mp4
MP4
4.7 MB
006 Hyperbolic tangent_en.vtt
VTT
1.6 KB
006 Implement the Soft Actor-Critic algorithm - Part 1.mp4
MP4
40.1 MB
006 Implement the Soft Actor-Critic algorithm - Part 1_en.vtt
VTT
7.1 KB
006 Reward vs Return.mp4
MP4
5.3 MB
006 Reward vs Return_en.vtt
VTT
1.6 KB
006 Stochastic Gradient Descent.mp4
MP4
49.9 MB
006 Stochastic Gradient Descent_en.vtt
VTT
6.3 KB
007 Create the Deep Q-Network.mp4
MP4
22.8 MB
007 Create the Deep Q-Network_en.vtt
VTT
4.3 KB
007 Create the environment.mp4
MP4
32.2 MB
007 Create the environment_en.vtt
VTT
7.5 KB
007 Creating the (NAF) Deep Q-Network 1.mp4
MP4
41.4 MB
007 Creating the (NAF) Deep Q-Network 1_en.vtt
VTT
7.5 KB
007 Discount factor.mp4
MP4
14.8 MB
007 Discount factor_en.vtt
VTT
4 KB
007 Implement the Soft Actor-Critic algorithm - Part 2.mp4
MP4
66.7 MB
007 Implement the Soft Actor-Critic algorithm - Part 2_en.vtt
VTT
9.2 KB
007 Neural Network optimization.mp4
MP4
23.4 MB
007 Neural Network optimization_en.vtt
VTT
4.4 KB
007 Target policy smoothing.mp4
MP4
31 MB
007 Target policy smoothing_en.vtt
VTT
4.1 KB
008 Check the resulting agent.mp4
MP4
31.1 MB
008 Check the resulting agent_en.vtt
VTT
2.2 KB
008 Check the results.mp4
MP4
12.1 MB
008 Check the results_en.vtt
VTT
2.1 KB
008 Create the DDPG class.mp4
MP4
38.9 MB
008 Create the DDPG class_en.vtt
VTT
7.3 KB
008 Creating the (NAF) Deep Q-Network 2.mp4
MP4
15 MB
008 Creating the (NAF) Deep Q-Network 2_en.vtt
VTT
3.3 KB
008 Define the class for the Deep Q-Learning algorithm.mp4
MP4
54.5 MB
008 Define the class for the Deep Q-Learning algorithm_en.vtt
VTT
11.6 KB
008 Policy.mp4
MP4
7.4 MB
008 Policy_en.vtt
VTT
2.2 KB
009 Creating the (NAF) Deep Q-Network 3.mp4
MP4
5.4 MB
009 Creating the (NAF) Deep Q-Network 3_en.vtt
VTT
1.1 KB
009 Define the play method.mp4
MP4
13.2 MB
009 Define the play method_en.vtt
VTT
2.2 KB
009 Define the play_episode() function.mp4
MP4
29.1 MB
009 Define the play_episode() function_en.vtt
VTT
4.9 KB
009 State values v(s) and action values q(s,a).mp4
MP4
4.3 MB
009 State values v(s) and action values q(s,a)_en.vtt
VTT
1.2 KB
010 Bellman equations.mp4
MP4
12.4 MB
010 Bellman equations_en.vtt
VTT
3 KB
010 Creating the (NAF) Deep Q-Network 4.mp4
MP4
47.9 MB
010 Creating the (NAF) Deep Q-Network 4_en.vtt
VTT
9.3 KB
010 Prepare the data loader and the optimizer.mp4
MP4
30.4 MB
010 Prepare the data loader and the optimizer_en.vtt
VTT
4.2 KB
010 Setup the optimizers and dataloader.mp4
MP4
22.3 MB
010 Setup the optimizers and dataloader_en.vtt
VTT
3.2 KB
011 Creating the policy.mp4
MP4
25 MB
011 Creating the policy_en.vtt
VTT
5.2 KB
011 Define the train_step() method.mp4
MP4
49.8 MB
011 Define the train_step() method_en.vtt
VTT
9.3 KB
011 Define the training step.mp4
MP4
57.9 MB
011 Define the training step_en.vtt
VTT
9.9 KB
011 Solving a Markov decision process.mp4
MP4
14.1 MB
011 Solving a Markov decision process_en.vtt
VTT
3.1 KB
012 Create the environment.mp4
MP4
22.5 MB
012 Create the environment_en.vtt
VTT
4.6 KB
012 Define the train_epoch_end() method.mp4
MP4
32.2 MB
012 Define the train_epoch_end() method_en.vtt
VTT
4 KB
012 Launch the training process.mp4
MP4
34.2 MB
012 Launch the training process_en.vtt
VTT
3.9 KB
013 Check the resulting agent.mp4
MP4
30.2 MB
013 Check the resulting agent_en.vtt
VTT
1.6 KB
013 Polyak averaging.mp4
MP4
4.8 MB
013 Polyak averaging_en.vtt
VTT
1.5 KB
013 [Important] Lecture correction.html
HTML
614.4 B
014 Implementing Polyak averaging.mp4
MP4
10.4 MB
014 Implementing Polyak averaging_en.vtt
VTT
2.2 KB
014 Train the Deep Q-Learning algorithm.mp4
MP4
35 MB
014 Train the Deep Q-Learning algorithm_en.vtt
VTT
6.5 KB
015 Create the (NAF) Deep Q-Learning algorithm.mp4
MP4
42.9 MB
015 Create the (NAF) Deep Q-Learning algorithm_en.vtt
VTT
7.9 KB
015 Explore the resulting agent.mp4
MP4
20.3 MB
015 Explore the resulting agent_en.vtt
VTT
2.8 KB
016 Implement the training step.mp4
MP4
13.3 MB
016 Implement the training step_en.vtt
VTT
2.4 KB
017 Implement the end-of-epoch logic.mp4
MP4
12.5 MB
017 Implement the end-of-epoch logic_en.vtt
VTT
2.2 KB
018 Debugging and launching the algorithm.mp4
MP4
20 MB
018 Debugging and launching the algorithm_en.vtt
VTT
2.9 KB
019 Checking the resulting agent.mp4
MP4
16.4 MB
019 Checking the resulting agent_en.vtt
VTT
2 KB
Bonus Resources.txt
TXT
409.6 B
Get Bonus Downloads Here.url
URL
204.8 B
external-assets-links.txt
TXT
102.4 B

Description

Related Torrents

Location

Trackers

Tracker name
udp://tracker.torrent.eu.org:451/announce
udp://tracker.tiny-vps.com:6969/announce
http://tracker.foreverpirates.co:80/announce
udp://tracker.cyberia.is:6969/announce
udp://exodus.desync.com:6969/announce
udp://explodie.org:6969/announce
udp://tracker.opentrackr.org:1337/announce
udp://9.rarbg.to:2780/announce
udp://tracker.internetwarriors.net:1337/announce
udp://ipv4.tracker.harry.lu:80/announce
udp://open.stealth.si:80/announce
udp://9.rarbg.to:2900/announce
udp://9.rarbg.me:2720/announce
udp://opentor.org:2710/announce
udp://tracker.opentrackr.org:1337/announce
http://tracker.openbittorrent.com:80/announce
udp://opentracker.i2p.rocks:6969/announce
udp://tracker.internetwarriors.net:1337/announce
udp://tracker.leechers-paradise.org:6969/announce
udp://coppersurfer.tk:6969/announce
udp://tracker.zer0day.to:1337/announce
Torrent hash: