Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment
Statistics
Result from numerical experiments: "Expanding responses yields more benefit than prompts."
Result from numerical experiments: "The empirical formulation of prompt diversity can establish a linear correlation with the final performance of LLMs."