back
Get SIGNAL/NOISE in your inbox daily

METR blogpost | X | Github •
Summary
In the paper Measuring AI Ability to Complete Long Software Tasks (Kwa & West et al. 2025), METR defined an AI…