Program-aided Distillation (PaD) improves the reasoning ability of small models by distilling reasoning programs generated by large language models.
PaD uses these reasoning programs as the distillation data, with the aim of raising distillation quality for small models on reasoning tasks.
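
To make the idea of a "reasoning program" concrete, here is a minimal sketch of how program-form rationales could be checked before being used as distillation data. It assumes each program is a short Python snippet that stores its result in a variable named `answer`, and that a gold answer is available for each question. The helper names `run_program` and `keep_correct`, the sample questions, and the filtering rule are all hypothetical illustrations, not the authors' pipeline.

```python
from typing import Any, List, Optional, Tuple


def run_program(source: str) -> Optional[Any]:
    """Execute a candidate reasoning program and return its `answer` variable.

    Returns None if the program raises or never sets `answer`.
    Assumption: programs are trusted toy snippets; real LLM output would
    need sandboxing before being executed like this.
    """
    scope: dict = {}
    try:
        exec(source, scope)
    except Exception:
        return None
    return scope.get("answer")


def keep_correct(samples: List[Tuple[str, str, Any]]) -> List[Tuple[str, str]]:
    """Keep (question, program) pairs whose executed answer equals the gold label."""
    kept = []
    for question, program, gold in samples:
        if run_program(program) == gold:
            kept.append((question, program))
    return kept


# Hypothetical teacher outputs: (question, program, gold answer).
samples = [
    ("Tom has 3 boxes of 4 apples. How many apples?",
     "boxes = 3\nper_box = 4\nanswer = boxes * per_box", 12),
    ("Tom has 3 boxes of 4 apples. How many apples?",
     "boxes = 3\nper_box = 4\nanswer = boxes + per_box", 12),  # wrong logic
    ("A hypothetical question with a crashing program.",
     "answer = 1 / 0", 5),                                     # runtime error
]

# Pairs that survive the check would form the small model's training set.
distill_set = keep_correct(samples)
```

Because a program either executes to the expected answer or does not, faulty samples can be dropped mechanically; in this sketch only the first sample survives.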