Boaz Barak / Windows On Theory:
Research: AI's ability to complete long and complex software engineering tasks doubles every 6-7 months, but there is a “messiness tax” for real-world tasks — METR has had a very influential work by Kwa and West et al on measuring AI's ability to complete long tasks.
Posted from: this blog via Microsoft Power Automate.