325
Audio & Video Production313
Software Development238
Automation & Workflow203
AI Infrastructure & MLOps146
Marketing & Growth185
Writing & Content Creation190
Data & Analytics115
Design & Creative148
Photography & Imaging140
Customer Support120
Voice & Speech130
Sales & Outreach109
Education & Learning120
Operations & Admin80
A popular METR chart tracks how long real work tasks AI can finish, showing capabilities have followed a steady doubling trend over the past six years.
In short: A widely watched METR chart suggests leading AI models are getting able to finish longer work tasks at a steady pace, roughly doubling every seven months.
METR, a nonprofit that studies AI, made a chart that has become a common reference point across the AI industry. Instead of looking at puzzle-like test scores, it tracks something simpler, how long a real task is, and whether an AI system can finish it.
METR starts by timing how long skilled human experts take to complete complex projects, such as building a web app or solving a hard programming problem. Then it gives the same tasks to AI “agents” (AI tools that can take steps on their own, like a junior assistant following instructions) and measures how well they do.
The chart includes “50%” and “80%” success time horizons. That is the task length where the AI is expected to succeed about half the time, or four out of five times. METR’s data shows an exponential pattern, meaning progress keeps compounding, with capability roughly doubling every seven months over the last six years.
METR’s examples show how fast this has moved. Around the launch of ChatGPT in late 2022, top models handled tasks that took humans about 30 seconds. By early 2026, some frontier models were reported to complete certain computing tasks that would take a human expert more than 14 hours. METR also reports current models are around a 50% success rate on tasks that take an expert human about one hour.
If this doubling trend continues, METR suggests AI could handle much longer projects by the end of the decade, possibly even month-long efforts. Still, the chart is focused on software and research tasks, not on whether AI can replace most jobs or match humans at everything.
Source: NYTimes