YouTube

Natasha Jaques

Natasha Jaques

20,500 Subscribers

What Makes ChatGPT Chat? Modern AI for the layperson

What Makes ChatGPT Chat? Modern AI for the layperson

Natasha Jaques

Reinforcement Learning (RL) for LLMs

Reinforcement Learning (RL) for LLMs

Natasha Jaques

Social Reinforcement Learning talk at RLDM

Social Reinforcement Learning talk at RLDM

Natasha Jaques

Badly trained policy after 40000 steps

Badly trained policy after 40000 steps

Natasha Jaques

Multi-agent DQN training step 0 trajectory video

Multi-agent DQN training step 0 trajectory video

Natasha Jaques

Multi-agent DQN training step 90000 trajectory video

Multi-agent DQN training step 90000 trajectory video

Natasha Jaques

Learning to grab with bell as reward

Learning to grab with bell as reward

Natasha Jaques

Intel Deep Learning Community of Practice talk

Intel Deep Learning Community of Practice talk

Natasha Jaques

Natasha Jaques PhD Thesis Defense

Natasha Jaques PhD Thesis Defense

Natasha Jaques

Personalized Multi-task Learning for Predicting Tomorrow's Mood, Stress, and Health

Personalized Multi-task Learning for Predicting Tomorrow&#39

Natasha Jaques

VHRED Cornell baseline

VHRED Cornell baseline

Natasha Jaques

Influence agent in Harvest game

Influence agent in Harvest game

Natasha Jaques

A3C baseline in Harvest

A3C baseline in Harvest

Natasha Jaques

Influence agent in Cleanup game

Influence agent in Cleanup game

Natasha Jaques

A3C baseline in Cleanup game

A3C baseline in Cleanup game

Natasha Jaques

Agent trained with intrinsic social influence reward - Tragedy of the Commons

Agent trained with intrinsic social influence reward - Trage

Natasha Jaques

Agent trained with intrinsic social influence reward

Agent trained with intrinsic social influence reward

Natasha Jaques

Influence reward in River with 1 influencer

Influence reward in River with 1 influencer

Natasha Jaques

A3C will not free other agent trapped in a box

A3C will not free other agent trapped in a box

Natasha Jaques

Influence agent frees compatriot trapped in a box

Influence agent frees compatriot trapped in a box

Natasha Jaques

Note RNN

Note RNN

Natasha Jaques

Q

Natasha Jaques

Basic LSTM

Basic LSTM

Natasha Jaques

RL Tuner

RL Tuner

Natasha Jaques

G

Natasha Jaques

Psi

Psi

Natasha Jaques

EDAExplorer PeakTutorial

EDAExplorer PeakTutorial

Natasha Jaques

EDAExplorer ArtifactTutorial

EDAExplorer ArtifactTutorial

Natasha Jaques

The Challenge

The Challenge

Natasha Jaques

Affective Computing - Spring 2015 Virtual Visit

Affective Computing - Spring 2015 Virtual Visit

Natasha Jaques

Eye gaze data

Eye gaze data

Natasha Jaques

5 Lego Robots Dancing to Gangnam Style

5 Lego Robots Dancing to Gangnam Style

Natasha Jaques

Lego Robot Gangnam Style

Lego Robot Gangnam Style

Natasha Jaques

Natasha Jaques

Natasha Jaques