5 November 2025

Research: AI's ability to complete long and complex software engineering tasks doubles every 6-7 months, but there is a "messiness tax" for real-world tasks (Boaz Barak/Windows On Theory)

Boaz Barak / Windows On Theory:
Research: AI's ability to complete long and complex software engineering tasks doubles every 6-7 months, but there is a “messiness tax” for real-world tasks  —  METR has had a very influential work by Kwa and West et al on measuring AI's ability to complete long tasks.

Posted from: this blog via Microsoft Power Automate.

Daily Deals