“Re: Recent Anthropic Safety Research” by Eliezer Yudkowsky

“Re: Recent Anthropic Safety Research” by Eliezer Yudkowsky

Released Tuesday, 12th August 2025
Good episode? Give it some love!
“Re: Recent Anthropic Safety Research” by Eliezer Yudkowsky

“Re: Recent Anthropic Safety Research” by Eliezer Yudkowsky

“Re: Recent Anthropic Safety Research” by Eliezer Yudkowsky

“Re: Recent Anthropic Safety Research” by Eliezer Yudkowsky

Tuesday, 12th August 2025
Good episode? Give it some love!
Rate Episode
List

A reporter asked me for my off-the-record take on recent safety research from Anthropic. After I drafted an off-the-record reply, I realized that I was actually fine with it being on the record, so:

Since I never expected any of the current alignment technology to work in the limit of superintelligence, the only news to me is about when and how early dangers begin to materialize. Even taking Anthropic's results completely at face value would change not at all my own sense of how dangerous machine superintelligence would be, because what Anthropic says they found was already very solidly predicted to appear at one future point or another. I suppose people who were previously performing great skepticism about how none of this had ever been seen in ~Real Life~, ought in principle to now obligingly update, though of course most people in the AI industry won't. Maybe political leaders [...]

---

First published:
August 6th, 2025

Source:
https://www.lesswrong.com/posts/oDX5vcDTEei8WuoBx/re-recent-anthropic-safety-research

---



Narrated by TYPE III AUDIO.

Show More
Rate
List

Join Podchaser to...

  • Rate podcasts and episodes
  • Follow podcasts and creators
  • Create podcast and episode lists
  • & much more
Do you host or manage this podcast?
Claim and edit this page to your liking.
,