AI can handle tasks twice as complex every few months. What does this exponential growth mean for how we use it?

AIs can outperform humans easily on short tasks, but longer ones are the true hurdle to overcome before we can deem them to be truly intelligent systems.

an illustration of a line of robots working on computers
A new benchmark for AI performance could give us an idea of when to expect true generalist AI agents.
(Image credit: MASTER via Getty Images)

Scientists have devised a new way to measure how capable artificial intelligence (AI) systems are — how fast they can beat, or compete with, humans in challenging tasks.

While AIs can generally outperform humans in text prediction and knowledge tasks, when given more substantive projects to carry out, such as remote executive assistance, they are less effective.

Roland Moore-Colyer

Roland Moore-Colyer is a freelance writer for Live Science and managing editor at consumer tech publication TechRadar, running the Mobile Computing vertical. At TechRadar, one of the U.K. and U.S.’ largest consumer technology websites, he focuses on smartphones and tablets. But beyond that, he taps into more than a decade of writing experience to bring people stories that cover electric vehicles (EVs), the evolution and practical use of artificial intelligence (AI), mixed reality products and use cases, and the evolution of computing both on a macro level and from a consumer angle.

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.