| 001. Why should you care.mp4 | MP4 | 32.4 MB |
| 001. Why should you care.srt | SRT | 15.4 KB |
| 002. Reinforcement learning vs all.mp4 | MP4 | 10.8 MB |
| 002. Reinforcement learning vs all.srt | SRT | 4.9 KB |
| 003. Multi-armed bandit.mp4 | MP4 | 17.9 MB |
| 003. Multi-armed bandit.srt | SRT | 7.3 KB |
| 004. Decision process & applications.mp4 | MP4 | 23 MB |
| 004. Decision process & applications.srt | SRT | 9.7 KB |
| 005. Markov Decision Process.mp4 | MP4 | 18 MB |
| 005. Markov Decision Process.srt | SRT | 8.3 KB |
| 006. Crossentropy method.mp4 | MP4 | 36 MB |
| 006. Crossentropy method.srt | SRT | 15.5 KB |
| 007. Approximate crossentropy method.mp4 | MP4 | 19.3 MB |
| 007. Approximate crossentropy method.srt | SRT | 8.2 KB |
| 008. More on approximate crossentropy method.mp4 | MP4 | 22.9 MB |
| 008. More on approximate crossentropy method.srt | SRT | 10.5 KB |
| 009. Evolution strategies core idea.mp4 | MP4 | 20.9 MB |
| 009. Evolution strategies core idea.srt | SRT | 7.3 KB |
| 010. Evolution strategies math problems.mp4 | MP4 | 17.7 MB |
| 010. Evolution strategies math problems.srt | SRT | 8.6 KB |
| 011. Evolution strategies log-derivative trick.mp4 | MP4 | 27.8 MB |
| 011. Evolution strategies log-derivative trick.srt | SRT | 12.6 KB |
| 012. Evolution strategies duct tape.mp4 | MP4 | 21.2 MB |
| 012. Evolution strategies duct tape.srt | SRT | 9.7 KB |
| 013. Blackbox optimization drawbacks.mp4 | MP4 | 15.2 MB |
| 013. Blackbox optimization drawbacks.srt | SRT | 7.3 KB |
| 014. Reward design.mp4 | MP4 | 49.7 MB |
| 014. Reward design.srt | SRT | 23.2 KB |
| 015. State and Action Value Functions.mp4 | MP4 | 37.3 MB |
| 015. State and Action Value Functions.srt | SRT | 18.2 KB |
| 016. Measuring Policy Optimality.mp4 | MP4 | 18.1 MB |
| 016. Measuring Policy Optimality.srt | SRT | 8.5 KB |
| 017. Policy evaluation & improvement.mp4 | MP4 | 31.9 MB |
| 017. Policy evaluation & improvement.srt | SRT | 14.5 KB |
| 018. Policy and value iteration.mp4 | MP4 | 24.2 MB |
| 018. Policy and value iteration.srt | SRT | 12.1 KB |
| 019. Model-based vs model-free.mp4 | MP4 | 28.8 MB |
| 019. Model-based vs model-free.srt | SRT | 14.1 KB |
| 020. Monte-Carlo & Temporal Difference; Q-learning.mp4 | MP4 | 30.1 MB |
| 020. Monte-Carlo & Temporal Difference; Q-learning.srt | SRT | 14.5 KB |
| 021. Exploration vs Exploitation.mp4 | MP4 | 28.2 MB |
| 021. Exploration vs Exploitation.srt | SRT | 14 KB |
| 022. Footnote Monte-Carlo vs Temporal Difference.mp4 | MP4 | 10.3 MB |
| 022. Footnote Monte-Carlo vs Temporal Difference.srt | SRT | 4.8 KB |
| 023. Accounting for exploration. Expected Value SARSA..mp4 | MP4 | 37.7 MB |
| 023. Accounting for exploration. Expected Value SARSA..srt | SRT | 17.3 KB |
| 024. On-policy vs off-policy; Experience replay.mp4 | MP4 | 26.7 MB |
| 024. On-policy vs off-policy; Experience replay.srt | SRT | 11.2 KB |
| 025. Supervised & Reinforcement Learning.mp4 | MP4 | 50.6 MB |
| 025. Supervised & Reinforcement Learning.srt | SRT | 25.4 KB |
| 026. Loss functions in value based RL.mp4 | MP4 | 33.8 MB |
| 026. Loss functions in value based RL.srt | SRT | 15.2 KB |
| 027. Difficulties with Approximate Methods.mp4 | MP4 | 47 MB |
| 027. Difficulties with Approximate Methods.srt | SRT | 21.9 KB |
| 028. DQN bird's eye view.mp4 | MP4 | 27.8 MB |
| 028. DQN bird's eye view.srt | SRT | 11.4 KB |
| 029. DQN the internals.mp4 | MP4 | 29.6 MB |
| 029. DQN the internals.srt | SRT | 12.3 KB |
| 030. DQN statistical issues.mp4 | MP4 | 19.2 MB |
| 030. DQN statistical issues.srt | SRT | 9.2 KB |
| 031. Double Q-learning.mp4 | MP4 | 20.5 MB |
| 031. Double Q-learning.srt | SRT | 9.4 KB |
| 032. More DQN tricks.mp4 | MP4 | 33.9 MB |
| 032. More DQN tricks.srt | SRT | 16.4 KB |
| 033. Partial observability.mp4 | MP4 | 57.2 MB |
| 033. Partial observability.srt | SRT | 27.7 KB |
| 034. Intuition.mp4 | MP4 | 34.9 MB |
| 034. Intuition.srt | SRT | 15.6 KB |
| 035. All Kinds of Policies.mp4 | MP4 | 16 MB |
| 035. All Kinds of Policies.srt | SRT | 7.4 KB |
| 036. Policy gradient formalism.mp4 | MP4 | 31.6 MB |
| 036. Policy gradient formalism.srt | SRT | 13.3 KB |
| 037. The log-derivative trick.mp4 | MP4 | 13.3 MB |
| 037. The log-derivative trick.srt | SRT | 5.9 KB |
| 038. REINFORCE.mp4 | MP4 | 31.4 MB |
| 038. REINFORCE.srt | SRT | 14 KB |
| 039. Advantage actor-critic.mp4 | MP4 | 24.6 MB |
| 039. Advantage actor-critic.srt | SRT | 11.8 KB |
| 040. Duct tape zone.mp4 | MP4 | 17.5 MB |
| 040. Duct tape zone.srt | SRT | 7.8 KB |
| 041. Policy-based vs Value-based.mp4 | MP4 | 16.8 MB |
| 041. Policy-based vs Value-based.srt | SRT | 7.1 KB |
| 042. Case study A3C.mp4 | MP4 | 26.1 MB |
| 042. Case study A3C.srt | SRT | 11.1 KB |
| 043. A3C case study (2 2).mp4 | MP4 | 15 MB |
| 043. A3C case study (2 2).srt | SRT | 6 KB |
| 044. Combining supervised & reinforcement learning.mp4 | MP4 | 24 MB |
| 044. Combining supervised & reinforcement learning.srt | SRT | 11.9 KB |
| 045. Recap bandits.mp4 | MP4 | 24.7 MB |
| 045. Recap bandits.srt | SRT | 11.9 KB |
| 046. Regret measuring the quality of exploration.mp4 | MP4 | 21.3 MB |
| 046. Regret measuring the quality of exploration.srt | SRT | 10.2 KB |
| 047. The message just repeats. 'Regret, Regret, Regret.'.mp4 | MP4 | 18.4 MB |
| 047. The message just repeats. 'Regret, Regret, Regret.'.srt | SRT | 8.7 KB |
| 048. Intuitive explanation.mp4 | MP4 | 22.3 MB |
| 048. Intuitive explanation.srt | SRT | 10.9 KB |
| 049. Thompson Sampling.mp4 | MP4 | 17.1 MB |
| 049. Thompson Sampling.srt | SRT | 7.9 KB |
| 050. Optimism in face of uncertainty.mp4 | MP4 | 16.5 MB |
| 050. Optimism in face of uncertainty.srt | SRT | 7.9 KB |
| 051. UCB-1.mp4 | MP4 | 22.2 MB |
| 051. UCB-1.srt | SRT | 10.4 KB |
| 052. Bayesian UCB.mp4 | MP4 | 40.8 MB |
| 052. Bayesian UCB.srt | SRT | 19.3 KB |
| 053. Introduction to planning.mp4 | MP4 | 51.6 MB |
| 053. Introduction to planning.srt | SRT | 25.4 KB |
| 054. Monte Carlo Tree Search.mp4 | MP4 | 30.9 MB |
| 054. Monte Carlo Tree Search.srt | SRT | 14.8 KB |