New TMLR paper on meta-learning for PB2
Meta-learning Population-based Methods for Reinforcement Learning has been accepted in TMLR. We address the slow start problem in PB2 with novel meta-learning approaches that leverage cross-environment knowledge. Our MultiTaskPB2 demonstrates superior performance across diverse RL benchmarks.