核心概念
(Chat)GPT performs worse than BERT in detecting short-term semantic changes but slightly lower in long-term changes.
統計
"Our results indicate that ChatGPT performs significantly worse than the foundational GPT version."
"ChatGPT achieves slightly lower performance than BERT in detecting long-term changes but performs significantly worse in detecting short-term changes."
引用
"Our results indicate that ChatGPT performs significantly worse than the foundational GPT version."
"ChatGPT achieves slightly lower performance than BERT in detecting long-term changes but performs significantly worse in detecting short-term changes."