Agents’ Last Exam reveals AI agents struggle with real work tasks, passing just 2.6% of the time

Comments

Join the discussion on this story.