We trained “critique-writing” models to describe flaws in summaries. Human evaluators find flaws in summaries much more often when shown our model’s critiques. Larger models are better at self-critiquing, with scale improving critique-writing more than summary-writing. This shows promise for using AI systems to assist human supervision of AI systems on difficult tasks.
Originally published on OpenAI News.