Title: New Transformer architecture could enable powerful LLMs without GPUs
Summary: MatMul-free LM removes matrix multiplications from language model architectures to make them faster and much more memory-efficient.
Link: