CJEval, a comprehensive benchmark based on Chinese Junior High School exam data, is introduced to assess the capabilities of large language models across diverse educational tasks, including knowledge concept tagging, question difficulty prediction, question answering, and question generation.
The Edu-Values benchmark is designed to comprehensively evaluate the alignment of Chinese large language models with key educational values, including professional ideology, education laws and regulations, teachers' professional ethics, cultural literacy, basic competencies, educational knowledge and skills, and subject knowledge.
Hint generation is a critical component of intelligent tutoring systems that can facilitate self-learning. This survey article presents a comprehensive review of prior research on hint generation, aiming to bridge the gap between research in education and cognitive science, and research in AI and Natural Language Processing.