Pinned post

Google says it hit a milestone of 1.3 quadrillion monthly tokens processed across its services this summer, up from 980T monthly tokens announced in July (Matthias Bastian/The Decoder)

Matthias Bastian / The Decoder : Google says it hit a milestone of 1.3 quadrillion monthly tokens processed across its services this summ...

10 October 2025

SemiAnalysis launches InferenceMAX, an open-source benchmark that automatically tracks LLM inference performance across AI models and frameworks every night (SemiAnalysis)

SemiAnalysis:
SemiAnalysis launches InferenceMAX, an open-source benchmark that automatically tracks LLM inference performance across AI models and frameworks every night  —  NVIDIA GB200 NVL72, AMD MI355X, Throughput Token per GPU, Latency Tok/s/user, Perf per Dollar, Cost per Million Tokens …

Posted from: this blog via Microsoft Power Automate.

Daily Deals