Large language models for Brazilian Portuguese based on open data show varying performance in downstream tasks.
PeLLE introduces large language models for Brazilian Portuguese based on open data and shows how model size and data curation affect performance on downstream NLP tasks.