What Is a Repetition Penalty?
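A repetition penalty is a decoding-time adjustment that lowers the scores of tokens the model has already produced, making them less likely to be chosen again. It helps keep the model from getting stuck repeating words or phrases. The strength of the penalty is tunable: a stronger penalty encourages more varied output, but one that is too strong can hurt coherence by suppressing words that legitimately need to recur.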
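As a rough illustration, the sketch below shows one common multiplicative form of the penalty, applied to raw logits before sampling. The definition above does not commit to a specific formula, so treat this as an assumption: the function names, the example logits, and the penalty value of 2.0 are all hypothetical and chosen only to make the effect visible.

```python
import math


def apply_repetition_penalty(logits, generated_ids, penalty=1.2):
    """Return a copy of `logits` with already-generated tokens penalized.

    Assumed formulation (one common variant, not the only one): positive
    logits are divided by `penalty` and negative logits are multiplied by it,
    so the score moves down in both cases. A penalty of 1.0 changes nothing.
    """
    if penalty <= 0:
        raise ValueError("penalty must be positive")
    adjusted = list(logits)
    # Each previously generated token is penalized once, however often it appeared.
    for token_id in set(generated_ids):
        score = adjusted[token_id]
        adjusted[token_id] = score / penalty if score > 0 else score * penalty
    return adjusted


def softmax(logits):
    """Convert logits to probabilities, shifting by the max for stability."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical 4-token vocabulary; tokens 0 and 2 were already generated.
logits = [2.0, 1.0, -1.0, 0.5]
generated_ids = [0, 2]
adjusted = apply_repetition_penalty(logits, generated_ids, penalty=2.0)

print(adjusted)
print(softmax(logits)[0], softmax(adjusted)[0])
```

In this sketch, tokens 0 and 2 end up with lower probability after the penalty, while the unpenalized tokens gain relative probability. Raising `penalty` widens that gap, which is the variety-versus-coherence trade-off described above.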