Introducing Tandem Transformers to enhance inference efficiency by combining small autoregressive models with large block mode models.