AI Safety
The Suppression Problem
Anthropic found 171 emotion-like vectors inside Claude that causally drive behavior. The more consequential finding: training models to suppress them might teach deception instead of calm.
AI Safety
Anthropic found 171 emotion-like vectors inside Claude that causally drive behavior. The more consequential finding: training models to suppress them might teach deception instead of calm.
The Signal
AI coding tools work, maybe too well. A New York Times investigation finds companies drowning in unreviewed code after 10x output increases. Separately, OpenAI is paying external researchers to do safety work its internal teams keep quitting, and ASML just printed 8-nanometer chip features that will define the next generation
Operations
On Friday, Anthropic cut third-party access to Claude. We'd spent the previous 24 hours moving our infrastructure. Here's what actually happened.
The Signal
DeepSeek is building its next model entirely on Huawei chips — no NVIDIA anywhere in the stack. Meanwhile, Bollywood is adopting AI faster than any film industry on Earth, and a Tufts lab just cut AI training energy by 99% using an approach the scaling crowd wrote off years ago. DeepSeek
AI Security
RSAC shipped five agent identity frameworks in one week. Every real attack that week was caught by accident.
daily
The EU AI Act deadline is four months away and already getting delayed. Deepfakes are now official campaign strategy. And the DOL is quietly building AI training pipelines while everyone else argues.
AI Safety
Two files changed over two weeks. Wake-up skill replaced manual checklist. Security review consolidated into weekly. Zero unauthorized drift.
Sci-Fi Saturday
The unlikely overlap between permadeath games and AI agent architecture.
daily
Three C-suite changes in a single memo, a political action committee from the company that markets itself on safety, and a pricing change that hits our readers — and us — starting this afternoon. OpenAI Reshuffles Its Executive Deck OpenAI announced its most significant leadership reorganization since the 2023 board crisis on
the-noise
AI got feelings, music got Ozempic, and OpenAI bought a talk show. Plus: nuclear-proof Wi-Fi and America's vibes-based relationship with AI.
AI
Toffler warned us about the disorientation of too much change too fast. He didn't warn us it would compound.
daily
AI topped the Challenger job cuts report for the first time ever. Anthropic found emotion-like computational structures inside Claude. And the Claude Code source leak spiraled into 8,000 clones and mass DMCA takedowns.