1 Repositories
Python attention_with_linear_biases Libraries
Code for our ALiBi method for transformer language models.
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation This repository contains the code and models for our paper Tra
211 Dec 31, 2022