What is an instruction tuning?
A form of fine-tuning that improves a generative AI model's ability to follow instructions.
instruction tuning explained in plain English
A form of fine-tuning that improves a generative AI model's ability to follow instructions. Instruction tuning involves training a model on a series of instruction prompts, typically covering a wide variety of tasks. The resulting instruction-tuned model then tends to generate useful responses to zero-shot prompts across a variety of tasks. Compare and contrast with: - parameter-efficient tuning - prompt tuning
Example
Practitioners refer to instruction tuning when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.
People also read
- average precision at k
A metric for summarizing a model's performance on a single prompt that generates ranked results, such as a numbered list of book recommendations.
- black box model
A model whose "reasoning" is impossible or difficult for humans to understand.
- Chain-of-Thought Prompting
Asking an AI to show its reasoning step by step before giving a final answer, which often improves accuracy on complex tasks.
- conversational coding
An iterative dialog between you and a generative AI model for the purpose of creating software.
- deterministic
A system that always returns the same output for a given input.
- direct prompting
Synonym for zero-shot prompting.
- distillation
The process of reducing the size of one model (known as the teacher) into a smaller model (known as the student) that emulates the original model's predictions as faithfully as possible.
- emergent behavior
An LLM's ability to generate responses for prompts that it was not explicitly trained on.
- factuality
Within the ML world, a property describing a model whose output is based on reality.
- few-shot learning
A machine learning approach, often used for object classification, designed to train effective classification models from only a small number of training examples.