ELITR-Bench: A Benchmark for Evaluating Long-Context Language Models on Meeting Assistant Tasks
ELITR-Bench is a new benchmark for evaluating long-context language models on a practical meeting assistant scenario, featuring transcripts obtained by automatic speech recognition and a set of manually crafted questions.