TalkRL: Reinforcement Learning Interviews

Scott Fujimoto

Informações:

Sinopsis

Scott Fujimoto expounds on his TD3 and BCQ algorithms, DDPG, Benchmarking Batch RL, and more!