TalkRL: Reinforcement Learning Interviews
Csaba Szepesvari
- Autor: Vários
- Narrador: Vários
- Editor: Podcast
- Duración: 0:48:42
- Mas informaciones
Informações:
Sinopsis
Csaba Szepesvari of DeepMind shares his views on Bandits, Adversaries, PUCT in AlphaGo / AlphaZero / MuZero, AGI and RL, what is timeless, and more!