Kun is a novel self-training method that leverages instruction back-translation and answer polishment (the method's own term for refining the answers recovered from raw text) to automatically generate large-scale, high-quality Chinese instruction-tuning datasets for LLMs, reducing reliance on manual annotation.
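As a rough illustration of this kind of pipeline, the sketch below first back-translates an instruction from an unlabeled document and then polishes the document into a direct answer. The `generate`, `back_translate`, `polish_answer`, and `build_pairs` helpers and the prompt wording are hypothetical placeholders, not Kun's actual implementation.

```python
# Hypothetical sketch of an instruction back-translation + answer polishment
# pipeline in the spirit of Kun; `generate` stands in for any LLM call and is
# not part of the paper's code.

def generate(prompt: str) -> str:
    """Placeholder for a call to an instruction-tuned LLM."""
    raise NotImplementedError

def back_translate(document: str) -> str:
    """Guess the instruction to which the unlabeled document would respond."""
    return generate(
        "Write the instruction to which the following text would be a good "
        f"response:\n\n{document}"
    )

def polish_answer(instruction: str, document: str) -> str:
    """Rewrite the raw document so it directly answers the instruction."""
    return generate(
        f"Instruction: {instruction}\n\nDraft answer:\n{document}\n\n"
        "Rewrite the draft so it fully and directly answers the instruction."
    )

def build_pairs(corpus: list[str]) -> list[dict]:
    """Turn unlabeled documents into (instruction, output) training pairs."""
    pairs = []
    for doc in corpus:
        instruction = back_translate(doc)
        answer = polish_answer(instruction, doc)
        pairs.append({"instruction": instruction, "output": answer})
    return pairs
```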
This paper introduces IDEA-MCTS, a novel framework that leverages Monte Carlo Tree Search (MCTS) and evaluation models to automatically generate high-quality, diverse, and complex instruction data for improving the instruction-following abilities of large language models (LLMs).
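Under similar assumptions, the sketch below shows how an MCTS loop could search over instruction rewrites with an evaluation model providing the reward signal; the `rewrite` and `evaluate` placeholders, the UCT constant, and the single-child expansion are illustrative choices rather than the IDEA-MCTS implementation.

```python
import math

# Illustrative MCTS over instruction rewrites: select a leaf by UCT, expand it
# with an LLM-generated variant, score the variant with an evaluation model,
# and backpropagate the reward. `rewrite` and `evaluate` are placeholders.

def rewrite(instruction: str) -> str:
    """Placeholder: ask an LLM for a more complex or more diverse variant."""
    raise NotImplementedError

def evaluate(instruction: str) -> float:
    """Placeholder: score instruction quality with an evaluation model."""
    raise NotImplementedError

class Node:
    def __init__(self, instruction: str, parent: "Node | None" = None):
        self.instruction = instruction
        self.parent = parent
        self.children: list["Node"] = []
        self.visits = 0
        self.value = 0.0

    def uct(self, c: float = 1.4) -> float:
        if self.visits == 0:
            return float("inf")
        return (self.value / self.visits
                + c * math.sqrt(math.log(self.parent.visits) / self.visits))

def mcts(seed_instruction: str, iterations: int = 50) -> str:
    root = Node(seed_instruction)
    for _ in range(iterations):
        # Selection: descend by UCT until reaching a leaf.
        node = root
        while node.children:
            node = max(node.children, key=Node.uct)
        # Expansion: add one rewritten variant of the leaf's instruction.
        child = Node(rewrite(node.instruction), parent=node)
        node.children.append(child)
        # Evaluation: score the new instruction with the evaluator.
        reward = evaluate(child.instruction)
        # Backpropagation: update visit counts and values up to the root.
        while child is not None:
            child.visits += 1
            child.value += reward
            child = child.parent
    # Return the most-visited first-level rewrite as the evolved instruction.
    best = max(root.children, key=lambda n: n.visits)
    return best.instruction
```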