DeepSeek researchers detail a new mHC architecture they used to train 3B, 9B, and 27B models, finding it scaled without adding significant computational burden (Vincent Chow/South China Morning Post)

13 January 2025

The Biden admin says chip orders with collective computation power up to ~1,700 advanced GPUs need no license and don't count against country-specific chip caps (Karen Freifeld/Reuters)

Karen Freifeld / Reuters:
The U.S. government said on Monday it would issue a new regulation designed to control access to U.S.-designed artificial intelligence chips …
