As inference splits into prefill and decode, Nvidia's Groq deal could enable a "Rubin SRAM" variant optimized for ultra-low latency agentic reasoning workloads (Gavin Baker/@gavinsbaker)
8 July 2024
Not to brag, but VB has the best reporters in the biz — get in the room with them at VB Transform - 2024-07-08 21:15:30Z
Summary: Don't miss your chance to meet some of our top reporters and editors at this year's VB Transform in San Francisco. Register now!