LLMs show varying performance across different domains in the Xiezhi Benchmark.
LLMs performance evaluated using Xiezhi benchmark.