Latent-Variable Advantage-Weighted Policy Optimization for Offline RL
NeurIPS 2022, L-DOD Workshop at RSS 2022
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
ICLR 2022