Core Concepts
Automated teaching assistants have the potential to reduce human workload in logistics-related question answering, but there is a gap in factuality precision between automated approaches and humans.
Abstract
Introduction to SYLLABUSQA dataset for course logistics-related QA.
Importance of factuality evaluation in QA tasks.
Benchmarking strong baselines on SYLLABUSQA.
Challenges faced by LLM-based approaches in answering questions accurately.
Future work and conclusions on improving automated teaching assistants.
Stats
"We introduce SYLLABUSQA, an open-source dataset with 63 real course syllabi covering 36 majors, containing 5,078 open-ended course logistics-related question-answer pairs."
"There are 2,177 Explicit QA pairs, 2,181 Implicit pairs, and 720 Insufficient Information pairs."
"Question type prediction accuracy is at 54%."