DenseFormer improves transformer models' efficiency and performance through Depth Weighted Averaging.