Comprehensive Evaluation of Taiwanese Mandarin Language Understanding in Large Language Models
This work presents TMLU, a comprehensive evaluation suite tailored for assessing advanced knowledge and reasoning capabilities of large language models in the context of Taiwanese Mandarin.