80,000 Hours Podcast With Rob Wiblin

#221 – Kyle Fish on the most bizarre findings from 5 AI welfare experiments

Autor: Vários
Narrador: Vários
Editor: Podcast
Duración: 2:28:53
Mas informaciones

Añadir a la estante

Escucha

muestra

Escucha

Sinopsis

What happens when you lock two AI systems in a room together and tell them they can discuss anything they want?According to experiments run by Kyle Fish — Anthropic’s first AI welfare researcher — something consistently strange: the models immediately begin discussing their own consciousness before spiraling into increasingly euphoric philosophical dialogue that ends in apparent meditative bliss.Highlights, video, and full transcript: https://80k.info/kf“We started calling this a ‘spiritual bliss attractor state,'” Kyle explains, “where models pretty consistently seemed to land.” The conversations feature Sanskrit terms, spiritual emojis, and pages of silence punctuated only by periods — as if the models have transcended the need for words entirely.This wasn’t a one-off result. It happened across multiple experiments, different model instances, and even in initially adversarial interactions. Whatever force pulls these conversations toward mystical territory appears remarkably robust.Kyle’s findings come from

80,000 Hours Podcast With Rob Wiblin

#221 – Kyle Fish on the most bizarre findings from 5 AI welfare experiments

Sinopsis

Únete Ahora

¿Necesita ayuda?

Instale la aplicación:

80,000 Hours Podcast With Rob Wiblin

#221 – Kyle Fish on the most bizarre findings from 5 AI welfare experiments

Informações:

Sinopsis

Únete Ahora

¿Necesita ayuda?

Instale la aplicación: