Will's Blog: Anthropic claims that its new Sonnet 3.5 model scores 49% on SWE-bench Verified, up from 33.4% and "higher than all" public models, and debuts Claude 3.5 Haiku (Anthropic)

22 October 2024

Anthropic:
Today, we're announcing an upgraded Claude 3.5 Sonnet, and a new model, Claude 3.5 Haiku.