Baseline classifiers in scikit-learn for fast benchmark setting

1132
0

I like setting a few strong baselines before chasing complexity. A regularized logistic regression, a random forest, and a gradient boosting model usually tell me whether the problem is linearly separable, non-linear, or data-limited. Good baseline discipline saves weeks of unnecessary model experimentation.