What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate?
Are AI models just "glorified autocompletes", or is something more complicated going on? How do we even study these questions scientifically?
00:00 - Introduction
01:37 - The biology of AI models
06:43 - Scientific methods to open the black box
10:35 - Some surprising features inside Claude's mind
20:39 - Can we trust what a model claims it's thinking?
25:17 - Why do AI models hallucinate?
34:15 - AI models planning ahead
38:30 - Why interpretability matters
53:35 - The future of interpretability