Dark Web

Don't forget to turn on your firewall!

Reinforcement Learning

Offline Reinforcement Learning

Reinforcement Learning Upside Down: Don’t Predict Rewards - Just Map Them to Actionslink

RL as one big sequence modeling problem article Q-Transformer article Control-Oriented Learning for Dynamical Systems video

Imitation Learning