pipelines

Scaling and normalization choices for different model families

Not every model cares about scale, but enough of them do that I keep scaling explicit. Linear models, SVMs, neural nets, and distance-based methods all benefit from well-behaved inputs. I prefer putting scalers inside the pipeline so train and inferen

ColumnTransformer pipelines that keep preprocessing honest

I push nearly all preprocessing into a Pipeline so training and inference paths share exactly the same logic. ColumnTransformer is the workhorse here because real-world tables mix numeric, categorical, boolean, and text fields. It gives you reproducib

CI/CD pipeline with GitLab CI

GitLab CI/CD uses .gitlab-ci.yml for pipeline configuration. Pipelines consist of stages that run sequentially. Jobs within the same stage run in parallel. The image key sets the Docker image for each job. variables define global or job-level environm

CI/CD pipeline with GitHub Actions

GitHub Actions automates build, test, and deploy workflows. Workflows are defined in .github/workflows/ YAML files. The on key specifies triggers—push, pull_request, schedule, or workflow_dispatch. Jobs run on runners with configurable OS. steps execu