Return to Article Details Attention-Free Transformers: State Space Models as Scalable Alternatives Download Download PDF