Publications

(2024). Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial? In submission.

(2024). FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning. ICML 2024, ICLR AGI Workshop 2024.

(2023). A Survey on Transformers in Reinforcement Learning. TMLR.

(2022). Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery. AAAI 2023.

(2022). Latent-Variable Advantage-Weighted Policy Optimization for Offline RL. NeurIPS 2022, L-DOD Workshop at RSS 2022.

(2022). Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL. ICLR 2022.

(2021). Offline Reinforcement Learning with Reverse Model-based Imagination. NeurIPS 2021.

(2021). Estimating High Order Gradients of the Data Distribution by Denoising. NeurIPS 2021.

(2021). Tractable Computation of Expected Kernels. UAI 2021.
