AI learns to think in secret, Claude Opus spots evaluations, and Tesla robotaxis raise alarms
The Grok deepfake crisis, NIST's AI agent inquiry, and MATS Summer 2026
Genesis Mission, AI cyber-espionage, and alignment still unsolved
The end of OpenAI's nonprofit era, NIST probes Chinese models, and Anthropic finds introspection
Claude notices it's being tested, California moves on regulation, and competition worsens misalignment
Scheming evals under fire, Anthropic scans nuke queries, and Google faces safety accusations
GPT-5 reactions, the study that upended AI safety, and the Gemini CLI flaw
AI safety turns national-security-first, AlphaGenome, and robotics' "ChatGPT moment"
o3 resists shutdown commands, Anthropic builds for the military, and frontier models game reward systems
The Singapore Consensus, deceptive AI debate, and rolled-back safety measures
AI 2027, GATE, and the first AI Safety Talks event
Paris Summit regulation debates, new EU restrictions, and Eric Schmidt's $10M safety fund
DeepSeek shakes the race, U.S. AI policy shifts, and gradual disempowerment
Hinton's risk warning, o3's record, and Sora's launch
OpenAI o1, the Anthropic-Palantir partnership, and robot security flaws
AI Nobel prizes, White House security directive, and Claude controlling computers
OpenAI's for-profit shift, executive departures, and the Council of Europe AI treaty
SB1047, China's changing AI safety stance, and AI Safety Camp 2025
EU AI Act, GPT-4o voice cloning, and foundation model governance
ARENA, AGI nationalization debate, and AI's impact on work
AI Safety Türkiye meet-up, fast grants, and Epoch AI's glimpse into the future
Introduction to AI Governance, Pivotal fellowship, and "What if Dario Amodei is right?"