Natasha Jaques
26.8K subscribers
1:26:07
3 - Personalized RLHF
Natasha Jaques
181 views • 7 days ago
1:21:57
2 - Deep RL and RL post-training intro
Natasha Jaques
247 views • 7 days ago
20:47
1 - Intro
Natasha Jaques
283 views • 8 days ago
15:28
Self Play for Safety - Online Multi-Agent Adversarial Training for Provably Robust LLMs
Natasha Jaques
2.6K views • 4 months ago
39:04
What Makes ChatGPT Chat? Modern AI for the layperson
Natasha Jaques
388K views • 5 months ago
33:10
Reinforcement Learning (RL) for LLMs
Natasha Jaques
10K views • 7 months ago
37:41
Social Reinforcement Learning talk at RLDM
Natasha Jaques
2.8K views • 1 year ago
1:40
Badly trained policy after 40000 steps
Natasha Jaques
1.2K views • 1 year ago
1:40
Multi-agent DQN training step 90000 trajectory video
Natasha Jaques
499 views • 1 year ago
1:40
Multi-agent DQN training step 0 trajectory video
Natasha Jaques
265 views • 1 year ago
1:14
Learning to grab with bell as reward
Natasha Jaques
3.4K views • 1 year ago
57:28
Intel Deep Learning Community of Practice talk
Natasha Jaques
5.3K views • 3 years ago
1:30:15
Natasha Jaques PhD Thesis Defense
Natasha Jaques
872K views • 3 years ago
1:53
Personalized Multi-task Learning for Predicting Tomorrow's Mood, Stress, and Health
Natasha Jaques
1.9K views • 4 years ago
0:27
VHRED Cornell baseline
Natasha Jaques
1.1K views • 6 years ago
0:34
Influence agent in Harvest game
Natasha Jaques
1K views • 7 years ago
0:34
A3C baseline in Harvest
Natasha Jaques
496 views • 7 years ago
0:32
Agent trained with intrinsic social influence reward - Tragedy of the Commons
Natasha Jaques
592 views • 7 years ago
0:13
Agent trained with intrinsic social influence reward
Natasha Jaques
343 views • 7 years ago
0:17
A3C will not free other agent trapped in a box
Natasha Jaques
296 views • 7 years ago
0:17
Influence agent frees compatriot trapped in a box
Natasha Jaques
388 views • 7 years ago
0:06
Note RNN
Natasha Jaques
3.5K views • 8 years ago
0:08
Q
Natasha Jaques
3.7K views • 8 years ago
0:09
G
Natasha Jaques
3.2K views • 8 years ago
0:08
Basic LSTM
Natasha Jaques
23K views • 8 years ago
0:08
Psi
Natasha Jaques
3.3K views • 8 years ago
0:08
RL Tuner
Natasha Jaques
35K views • 8 years ago
0:31
EDAExplorer PeakTutorial
Natasha Jaques
1K views • 9 years ago
0:29
EDAExplorer ArtifactTutorial
Natasha Jaques
727 views • 9 years ago
0:21
The Challenge
Natasha Jaques
3.4K views • 10 years ago
Load More