
What Is TF-IDF?

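TF-IDF, short for Term Frequency-Inverse Document Frequency, is a numerical statistic that reflects how important a word is to a document within a collection (corpus). A word's weight increases with its frequency in the document and decreases with the number of documents in the collection that contain it, so words that appear almost everywhere, such as "the", receive weights near zero. It is widely used for ranking results in search and for extracting features from text.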
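
To make the definition concrete, here is a minimal sketch in Python. It assumes one common variant of the formula: term frequency is the term's count divided by the document's length, and inverse document frequency is log(N / df), where N is the number of documents and df is the number of documents containing the term. The function name `tf_idf` and the toy documents are illustrative only; library implementations often use smoothed or normalized variants, so their numbers will differ.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed variant: tf = count / document length, idf = log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: how many documents contain each term at least once.
    df = Counter(term for doc in docs for term in set(doc))

    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Toy corpus (hypothetical), tokenized by whitespace.
docs = [
    "the cat sat on the mat".split(),
    "the dog sat on the log".split(),
    "the cat chased the dog".split(),
]
scores = tf_idf(docs)

# Print the terms of the first document, highest weight first.
for term, weight in sorted(scores[0].items(), key=lambda kv: -kv[1]):
    print(f"{term}: {weight:.3f}")
```

In this toy corpus, "the" occurs in every document, so its idf is log(3/3) = 0 and its weight is zero despite being the most frequent word. "mat" occurs only in the first document, so it gets the highest weight there.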
