Question 1

What is model distillation?

Accepted Answer

Model distillation is a process where a small 'student' model learns to mimic the behaviour of a large 'teacher' model by training on the teacher's outputs (often its full predicted probabilities rather than only its final answers), capturing much of its knowledge at a fraction of the size.
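
As a rough illustration of "training on the teacher's outputs", the sketch below computes one common form of distillation loss: the KL divergence between temperature-softened teacher and student probabilities. The function names, the example logits, and the temperature of 2.0 are all assumptions made for illustration, and real training would use a deep learning framework rather than plain Python.

```python
import math

def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; a higher temperature gives a softer distribution."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened probabilities, scaled by temperature squared.

    The temperature-squared factor is a common convention that keeps the loss
    magnitude comparable across temperatures; it is an assumption of this sketch.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(t * math.log(t / s) for t, s in zip(p_teacher, p_student) if t > 0)
    return kl * temperature ** 2

# Hypothetical logits for a three-class problem.
teacher = [4.0, 1.0, 0.5]
close_student = [3.8, 1.1, 0.4]  # roughly agrees with the teacher
far_student = [0.5, 4.0, 1.0]    # prefers a different class

print(distillation_loss(close_student, teacher))
print(distillation_loss(far_student, teacher))
```

A student whose outputs track the teacher's gets a loss near zero, while one that disagrees gets a larger loss, and that is the signal training would minimise.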

Question 2

Why distill a model?

Accepted Answer

Smaller models are cheaper and faster to run. Distillation lets companies deploy capable AI at lower inference cost, on edge devices, or within strict latency requirements without retraining from scratch on raw data.

Question 3

Is a distilled model as good as the original?

Accepted Answer

Generally not quite as capable, but on specific tasks a distilled model can often retain 90% or more of the original's performance at a fraction of the parameter count. The gap tends to be wider on tasks outside those it was distilled for, so the right trade-off depends on the task requirements and budget.

Question 4

What is the relationship between distillation and quantization?

Accepted Answer

Both compress models, but differently. Distillation trains a new, structurally smaller model. Quantization keeps the existing model's structure and reduces the numerical precision of its weights, for example from 32-bit floating point to 8-bit integers. They are often combined for maximum efficiency.
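
To make the contrast concrete, here is a minimal sketch of symmetric 8-bit quantization applied to a handful of weights. The helper names and the sample weights are hypothetical, and production toolkits use more elaborate schemes (per-channel scales, calibration data), so treat this as an illustration of the idea only.

```python
def quantize_int8(weights):
    """Map floats to integers in [-127, 127] using one shared scale factor."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale

def dequantize(quantized, scale):
    """Recover approximate float weights from the stored integers."""
    return [q * scale for q in quantized]

# Hypothetical weights from an existing model.
weights = [0.82, -1.27, 0.003, 0.5]
quantized, scale = quantize_int8(weights)
restored = dequantize(quantized, scale)

print(quantized)  # small integers, each storable in one byte
print(restored)   # close to, but not exactly, the original weights
```

The number of weights is unchanged; only the storage per weight shrinks, at the cost of a small rounding error. Distillation, by contrast, reduces how many weights there are in the first place.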
