Language & LLMs

What Is Temperature Sampling?

Temperature sampling adjusts the sharpness of a language model's probability distribution before sampling a token. A low temperature makes the model favor high-probability tokens, producing focused text, while a high temperature increases randomness and diversity. It is a common control for creative versus deterministic output.

Further reading

Read more about temperature sampling — articles and blogs from around the web: