Language Model Progression

Background

Training Parallelism in Large Language Models

Many applications of transformers