80,000 Hours Podcast With Rob Wiblin

#197 – Nick Joseph on whether Anthropic's AI safety policy is up to the task

Autor: Vários
Narrador: Vários
Editor: Podcast
Duración: 2:29:26
Mas informaciones

Añadir a la estante

Escucha

muestra

Escucha

Sinopsis

The three biggest AI companies — Anthropic, OpenAI, and DeepMind — have now all released policies designed to make their AI models less likely to go rogue or cause catastrophic damage as they approach, and eventually exceed, human capabilities. Are they good enough?That’s what host Rob Wiblin tries to hash out in this interview (recorded May 30) with Nick Joseph — one of the original cofounders of Anthropic, its current head of training, and a big fan of Anthropic’s “responsible scaling policy” (or “RSP”). Anthropic is the most safety focused of the AI companies, known for a culture that treats the risks of its work as deadly serious.Links to learn more, highlights, video, and full transcript.As Nick explains, these scaling policies commit companies to dig into what new dangerous things a model can do — after it’s trained, but before it’s in wide use. The companies then promise to put in place safeguards they think are sufficient to tackle those capabilities before availability is extended further. For instan

80,000 Hours Podcast With Rob Wiblin

#197 – Nick Joseph on whether Anthropic's AI safety policy is up to the task

Sinopsis

Únete Ahora

¿Necesita ayuda?

Instale la aplicación:

80,000 Hours Podcast With Rob Wiblin

#197 – Nick Joseph on whether Anthropic's AI safety policy is up to the task

Informações:

Sinopsis

Únete Ahora

¿Necesita ayuda?

Instale la aplicación: