We use GPT-4 to automatically write explanations for the behavior of neurons in large language models and to score those explanations. We release a dataset of these (imperfect) explanations and scores for every neuron in GPT-2.
Originally published on
OpenAI News.
Latest Briefs
Fast updates from the latest stories.
NEWS
+1
New-Age Tech Stocks Rebound: FirstCry Leads Gains This Week
Mar 21, 2026
EXCLUSIVE
+4
How fusion power works and the startups pursuing it
Mar 21, 2026
COMPANIES
AI boom? OpenAI set to double its team by end of 2026; new hires to be deployed across these fields - Report
Mar 21, 2026
NEWS
NeuroPause Lab Introduces 'AI Action Firewall' for Enhanced AI Safety
Mar 21, 2026