Pinned post

Google DeepMind unveils Gemini Robotics 1.5 and Robotics-ER 1.5, enabling robots to perform multi-step tasks like sorting laundry, including by using web search (Melissa Heikkilä/Financial Times)

Melissa Heikkilä / Financial Times : Google DeepMind unveils Gemini Robotics 1.5 and Robotics-ER 1.5, enabling robots to perform multi-st...

25 September 2025

OpenAI releases GDPval, a benchmark to test AI performance on "economically valuable, real-world tasks", and says Claude Opus 4.1 was the best performing model (Maxwell Zeff/TechCrunch)

Maxwell Zeff / TechCrunch:
OpenAI releases GDPval, a benchmark to test AI performance on “economically valuable, real-world tasks”, and says Claude Opus 4.1 was the best performing model  —  OpenAI released a new benchmark on Thursday that tests how its AI models perform compared to human professionals across a wide range of industries and jobs.

Posted from: this blog via Microsoft Power Automate.

Daily Deals