Instruct model

← Back to Glossary

A base model fine-tuned with instruction-following data and/or RLHF to make it useful for conversation and following directions. Contrasted with a raw base/pretrained model which just predicts next tokens.

Chat and conversational models - Hugging Face

Referenced in

RLHF (Reinforcement Learning from Human Feedback)

GGUF

IQ (importance-matrix quantization)

Codex

Matt Oswalt

Title here

Instruct model

Referenced in