Tag Archives: llm

AI #51: Altman’s Ambition

[Editor’s note: I forgot to post this to WordPress on Thursday. I’m posting it here now. Sorry about that.] Sam Altman is not playing around. He wants to build new chip factories in the decidedly unsafe and unfriendly UAE. He …

Posted in Uncategorized | 5 Comments

AI #47: Exponentials in Geometry

The biggest event of the week was the Sleeper Agents paper from Anthropic. I expect that to inform our thoughts for a while to come, and to lay the foundation for additional work. We also had the first third of the …

Posted in Uncategorized | 2 Comments

On Anthropic’s Sleeper Agents Paper

The recent paper from Anthropic is getting unusually high praise, much of it, I think, deserved. The title is: Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training. Scott Alexander also covers this, offering an excellent high-level explanation, …

Posted in Uncategorized | Leave a comment