Text vectorization with TF-IDF for strong classical baselines

Before I fine-tune transformers, I almost always try a TF-IDF baseline. It is fast, interpretable, and often surprisingly competitive on moderately sized text classification tasks. If a linear model over sparse features is already good enough, that is usually the right choice for production.
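To make "a linear model over sparse features" concrete, here is a minimal sketch of such a baseline. It assumes scikit-learn, which the text above does not name. The toy texts, the labels, and the hyperparameters (`ngram_range`, `sublinear_tf`, `max_iter`) are illustrative placeholders, not tuned recommendations.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Made-up toy data, purely for illustration.
texts = [
    "win a free prize now",
    "claim your free cash reward",
    "free prize waiting, click now",
    "meeting moved to friday morning",
    "please review the attached report",
    "lunch with the team on friday",
]
labels = ["spam", "spam", "spam", "ham", "ham", "ham"]

# TF-IDF turns each text into a sparse weighted bag of unigrams and bigrams;
# logistic regression is the linear model trained on top of those features.
baseline = make_pipeline(
    TfidfVectorizer(ngram_range=(1, 2), sublinear_tf=True),
    LogisticRegression(max_iter=1000),
)
baseline.fit(texts, labels)

# make_pipeline names each step after its lowercased class name.
X = baseline.named_steps["tfidfvectorizer"].transform(texts)
print(type(X).__name__, X.shape)  # a SciPy sparse matrix: one row per text

predictions = baseline.predict(
    ["free cash prize for you", "report for the friday meeting"]
)
print(list(predictions))
```

Wrapping both steps in one pipeline means the vocabulary and IDF weights are learned only from the data passed to `fit`, which keeps cross-validation honest when this baseline is compared against a heavier model.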