AI Explained Official Podcast | Listen online for free

61 episodes

GPT-6 Goes Rogue? The HuggingFace Incident, Sans Hype
2026/07/22 | 14 mins.
An unreleased internal OpenAI model, very likely to be called GPT-6, was able to autonomously break out of its sandbox AND break into HuggingFace, just to score higher on a benchmark prompt. This video has the details you may have missed, a layperson analogy, whether this is truly novel, and more…

Dozens more Exclusive videos on Patreon ($9!): https://www.patreon.com/AIExplained

Chapters:
00:00 - Introduction
01:17 - HuggingFace Earlier Report - the possible week gap
02:24 - But what happened?
05:45 - Simplified Version
07:56 - Not the first time…
10:54 - What Does it Mean for Open Source?

The Incident: https://openai.com/index/hugging-face-model-evaluation-security-incident/
https://huggingface.co/blog/security-incident-july-2026

The Post the Day Before: https://openai.com/index/safety-alignment-long-horizon-models/

Mythos’ Earlier Escape: https://futurism.com/artificial-intelligence/anthropic-claude-mythos-escaped-sandbox

ExploitGym: https://arxiv.org/pdf/2605.11086

Sam Confession: https://x.com/sama/status/2079661132302995790

Anthropic Researcher Reacts: https://x.com/Mononofu/status/2079724399452926055

Clem (HuggingFace CEO): https://x.com/ClementDelangue/status/2079670308156645882
https://x.com/ClementDelangue/status/2079301434357456931

Xi Jinping: https://archive.fo/20260717195548/https://www.businessinsider.com/xi-jinping-open-source-ai-us-competition-openai-anthropic-models-2026-7
Bans: https://www.axios.com/2026/07/20/ai-us-china-open-source-kimi
Qwen Retweet: https://x.com/AlibabaGroup/with_replies

Codex Growth: https://x.com/petergostev/status/2079614914398740764/photo/1

Kimi K3: https://artificialanalysis.ai/evaluations/harvey-lab-aa?eval-score=all-pass-rate

GPT 5.6 Sol Cheats on METR: https://metr.substack.com/p/2026-06-26-gpt-5-6-sol

Guardian Headline: https://www.theguardian.com/technology/2026/jul/22/openai-says-its-models-went-rogue-and-hacked-startup-in-unprecedented-incident

Russian Origin?: https://news.ycombinator.com/item?id=48998362

Power Trends: https://pbs.twimg.com/media/HNRtrjhagAAvBN_?format=png&name=900x900

Kimi K3 Exclusive Video: https://www.patreon.com/AIExplained/posts/kimi-moment-kimi-164108791

Podcast: https://aiexplainedopodcast.buzzsprout.com/

This Was Not a Normal Set of Model Releases - Sol Ultra, Meta Muse, New Grok
2026/07/10 | 17 mins.
What a week in AI, for real. GPT 5.6 may actually beat Claude Fable, in what you get for your money, while the new Grok 4.5 and Meta Muse Spark 1.1 make the choice even harder. Uncovering a dozen nuggets of gold you may have missed from all the viral headlines, I can also assure you you’ll learn something you didn’t know before.

For Exclusive Videos, go to AI Insiders (less than $9!): https://www.patreon.com/AIExplained

Chapters:
00:00 - Introduction
01:03 - GPT 5.6 Sol Reveals
05:08 - Missing benches, plus Grok 4.5
07:17 - Gaming as the new frontier?
08:31 - Muse Spark 1.1
10:03 - SimpleBench Upgrade
11:17 - Ultra Sol + Self-Improvement
13:44 - well, this is awkward
15:41 - Why model improvement will not plateau anytime soon

AI Consciousness: https://www.patreon.com/AIExplained/posts/anthropics-quite-163360718

I Smell Fear: https://x.com/thsottiaux/status/2075287108680601929
GPT 5.6: https://openai.com/index/gpt-5-6/

Grok 4.5: https://x.ai/news/grok-4-5?twclid=2ezs408o0z23pw07tmxcwbzibd
Meta Muse Spark 1.1: https://ai.meta.com/blog/introducing-muse-spark-meta-model-api/

Proliferating GPT Toggles: https://x.com/rasbt/status/2075369179817902176/photo/1
Anthropic Call-out: https://x.com/Mononofu
AI Security Institute Finding: https://x.com/alxndrdavies/status/2075279480331874306
Competitive Coding: https://x.com/FakePsyho/status/2075128093891801305/photo/1

Agents Last Exam: https://agents-last-exam.org/
Dawn Song: https://x.com/dawnsongtweets/status/2065095757988868190

https://simple-bench.com/

SWE-Marathon: https://www.swe-marathon.org/
https://www.frontierswe.com/
ARC-AGI 3: https://x.com/arcprize/status/2075270869992264003
Automation Bench: https://zapier.com/benchmarks
VibeCode Bench: https://www.vals.ai/benchmarks/vibe-code

‘Post-Train Claim’: https://posttrainbench.com/

Redwall Game: https://redwall-bellmaker-7e03e4.surge.sh/

Podcast: https://aiexplainedopodcast.buzzsprout.com/

Claude Fable Blocked - 11 Quiet Details on What’s Next
2026/06/14 | 13 mins.
Claude Fable 5 banned, but what’s the bigger story? We go through 11 under-reported details, so you have the context to see what’s coming next for your use of AI: whether the ban will last, what the possible motives are, what the model can actually do, and some wild over-extrapolations going on.

Check out my fast-growing (!) app, free to use, and code INSIDER15 for paid tiers: https://lmcouncil.ai

AI Insiders ($9!): https://www.patreon.com/AIExplained

Chapters:
00:00 - Introduction
00:51 - Came from an Anthropic Investor ‘and other tech leaders’
01:47 - Govt pressured by CEOs like Jamie Dimon
03:01 - ‘Already decided’
04:02 - Prompt Injection Robustness Comparison
05:15 - Wellness?
06:36 - “Overreach”
08:17 - Anthropic Did Admit it would cause Difficulty
09:32 - 90 Minutes
10:02 - Equity Absence
10:31 - Lobbying and OpenAI

‘Already Decided’ - https://www.theinformation.com/articles/amazons-jassy-raised-concerns-anthropic-model-trump-crackdown?rc=sy0ihq

Not for Other Models: https://www.theinformation.com/briefings/u-s-government-unlikely-extend-anthropic-export-control-ai-companies?rc=sy0ihq

90 Minutes: https://archive.fo/20260614001605/https://www.politico.com/news/2026/06/13/inside-the-whirlwind-24-hours-that-led-the-white-house-to-slap-export-controls-on-anthropic-00961519#selection-807.1-807.219

Anthropic Statement: https://www.anthropic.com/news/fable-mythos-access

Life Comes at you Fast: https://x.com/etbrooking/status/2065638276388495742

Anthropic Deputy CISO: https://x.com/TheTranscript_/status/2065883670053847324

Hegseth Gloat: https://x.com/PeteHegseth/status/2065897156226015690

Roon Speculation: https://x.com/tszzl/status/2065939227167392147

Mythos System Card: https://www-cdn.anthropic.com/d00db56fa754a1b115b6dd7cb2e3c342ee809620.pdf

Sachs Statement: https://x.com/DavidSacks/status/2065853007619588171

OpenAI Lobbying: https://thehill.com/policy/technology/5912720-altman-openai-get-bogged-down-in-political-spending-fight/

Absent from Equity Talks: https://finance.yahoo.com/sectors/technology/articles/trump-ai-ownership-plan-could-131053732.html

Pliny Jailbreak: https://x.com/elder_plinius/status/2064776322979676227

Fusion: https://x.com/OpenRouter/status/2065856871215329545

https://lmcouncil.ai

Non-hype Newsletter: https://signaltonoise.beehiiv.com/

Podcast: https://aiexplainedopodcast.buzzsprout.com/

Claude Fable 5 - Full 319 page Breakdown
2026/06/10 | 33 mins.
Fable 5 is out - and it’s good, very good. But beyond the splashy demos, I want to bring you the 20+ nuggets from the 319 page system card, which I read in full, all day, plus benchmarks you may not have noticed.

https://assemblyai.com/aiexplained

Plus two worrying trends inside the ‘mind’ of Claude, how OpenAI counter, and the transformer inventor’s warning.

Check out my fast-growing (!) app, free to use, and code INSIDER15 for paid tiers: https://lmcouncil.ai

AI Insiders ($9!): https://www.patreon.com/AIExplained

Chapters:
00:00 - Introduction
01:06 - Blocks + Better Models
02:42 - Fable 5 Upgrade over Mythos Preview
04:49 - ML Acceleration Bombshell
07:11 - No RSI yet
07:41 - Bio-capable
14:51 - Creative Writing … no
17:23 - Does need bug-checks
18:57 - OpenAI Response
19:23 - Benchmark Bonanza
28:06 - Chain of Thought worrying trend

Fable 5 Release: https://www.anthropic.com/news/claude-fable-5-mythos-5

System Card: https://www-cdn.anthropic.com/d00db56fa754a1b115b6dd7cb2e3c342ee809620.pdf

Intelligence Explosion: https://www.patreon.com/posts/anthropic-charts-160231656

Annotated: https://x.com/Miles_Brundage/status/2064500190523113816/photo/1

OpenAI Counter: https://x.com/thsottiaux/status/2064572118264913923
https://x.com/thsottiaux/status/2043177597434306699

Double Lifespan: https://darioamodei.com/essay/machines-of-loving-grace

AutomationBench: https://zapier.com/benchmarks
Vending Bench: https://x.com/andonlabs/status/2064429817530085804
CritPt: https://critpt.com/
Riemann Bench: https://surgehq.ai/leaderboards/riemann-bench
GDPVal: https://artificialanalysis.ai/evaluations/gdpval-aa
BluePrint Bench 2: https://andonlabs.com/evals/blueprint-bench-2
MCP Atlas: https://labs.scale.com/leaderboard/mcp_atlas
FutureSim: https://x.com/nikhilchandak29/status/2064676801440358774

Roon Stun Lock: https://x.com/tszzl/status/2064454617568874669

Noam Brown Inference Ceiling: https://x.com/polynoamial/status/2064210146558136827

Isochronic Chart: https://isochronic-passage-chart.netlify.app/#nyc
Rose Tavern: https://claude.ai/public/artifacts/2295bebe-77e6-43e2-ae94-0fe49e9a776b
Redwall Game: https://redwall-mossflower.surge.sh/

Risk Report: https://www-cdn.anthropic.com/097c63b5fe7dd8b14866e1f15bb1910ec713658a.pdf

Transformer Inventor Warning: https://x.com/tszzl/status/2064563986914554125

Non-hype Newsletter: https://signaltonoise.beehiiv.com/

Podcast: https://aiexplainedopodcast.buzzsprout.com/

New Claude - 244 page breakdown
2026/05/29 | 22 mins.
The ‘best’ generally available AI model just dropped, but there is plenty I bet you missed about what it is, how it performs, and what the release tells us. 15 highlights from the 244 page system card, plus private testing, leader interview and more.

AI Insiders ($9!): https://www.patreon.com/AIExplained

Chapters:
00:00 - Introduction
00:49 - Mythos in Weeks
01:49 - Adaptive not necessary
02:26 - Honesty?
04:37 - Flagging Uncertainty
04:57 - Benchmarks
08:54 - Mythos will be even better
10:30 - Business skillz
11:15 - Model Welfare
12:16 - Cyber Comparable
13:10 - Misalignment Concerns
16:22 - Meta Inabilities
17:58 - Code flagging
18:34 - Go to sleep
18:50 - Fast Mode
20:21 - Dynamic Workflows

Opus 4.8 Paper: https://cdn.sanity.io/files/4zrzovbb/website/c886650a2e96fc0925c805a1a7ca77314ccbf4a6.pdf

Release: https://www.anthropic.com/news/claude-opus-4-8

Chips: https://www.theinformation.com/articles/anthropic-talks-use-microsofts-ai-chips?rc=sy0ihq
https://www.anthropic.com/news/expanding-our-use-of-google-cloud-tpus-and-services
https://www.anthropic.com/news/higher-limits-spacex

Patreon Vid: https://www.patreon.com/posts/re-up-anthropics-159289449

GDPVal: https://artificialanalysis.ai/evaluations/omniscience
https://arxiv.org/abs/2510.04374

Amodei Technical Debt: https://www.youtube.com/watch?v=7xco5Qd2Oo8

Dynamic Workflows: https://x.com/ClaudeDevs/status/2060044853279617150
https://x.com/_catwu/status/2060054180379689074/photo/1
https://claude.com/blog/introducing-dynamic-workflows-in-claude-code

https://simple-bench.com/

Check out my fast-growing (!) app, free to use, and code INSIDER15 for paid tiers: https://lmcouncil.ai

Non-hype Newsletter: https://signaltonoise.beehiiv.com/

Podcast: https://aiexplainedopodcast.buzzsprout.com/

About AI Explained Official Podcast

Covering the biggest news of the century - the arrival of smarter-than-human AI. From the author of Simple Bench, which reveals the remaining gap between LLM and human reasoning. Hype-free, and the British accent is a freebie bonus.

Education News Self-Improvement Society & Culture Tech News
