Core Concepts
Psychologically plausible models outperform neural language models in predicting brain activation across diverse contexts, modalities, and languages, with embodied models exhibiting exceptional performance at the discourse level.
Abstract
The study conducted a comparative analysis of the encoding performance of neural language models (NLMs) and psychologically plausible models (PPMs) on eight multi-modal cognitive datasets in Chinese and English, covering both word and discourse levels.
Key findings:
PPMs significantly outperform NLMs in predicting brain activation across various brain networks and languages like English and Chinese, especially at the word level. At the discourse level, PPMs show superior average performance across most brain networks.
Among PPMs, the embodied-based model (EBM) emerges as particularly exceptional, demonstrating superior performance at both word and discourse levels, exhibiting robust prediction of brain activation across numerous regions in both English and Chinese.
In word-level encoding, network-topological and embodied properties in PPMs explain certain aspects of the brain's conceptual representation mechanism, while the lower performance of NLMs may be attributed to their inconsistency with the brain's language comprehension mechanisms.
At the discourse level, context-aware NLMs can obtain specific word meanings within their context, allowing them to encode brain activation for complex language units more effectively compared to the word level.
The shallow layers of context-aware NLMs excel in word-level brain activation, whereas middle layers are better at capturing discourse-level activation, suggesting distinct information encoding within different model layers.
The cortical encoding map reveals unique correlations between specific models and various brain regions, indicating distinct and exclusive information encoding within these models.
Stats
The fluffy white cloud in the sky resembled cotton candy.
The brain cortex encoding map reveals unique correlations between different models and various brain regions.
Context-aware models with more parameters capture additional word-level semantic information akin to the human brain.
Psychologically plausible models integrating embodied and network-topological information excel in word-level encoding, with embodied models outperforming others at the discourse level.
Quotes
"Psychologically plausible models prove to be more effective in predicting brain activation than neural language models."
"The embodied-based model demonstrates superior performance at both word and discourse levels, exhibiting robust prediction of brain activation across numerous regions in both English and Chinese."
"The shallow layers of context-aware neural language models excel in word-level brain activation, whereas middle layers are better at capturing discourse-level activation."