Advanced techniques to help you combine transformation and modeling parameters in a single grid search

Pipelines are extremely useful and versatile objects in the scikit-learn package. They can be nested and combined with other sklearn objects to create repeatable and easily customizable data transformation and modeling workflows.

One of the most useful things you can do with a Pipeline is to chain data transformation steps…

In this guide, I discuss the benefits of writing functions in Python, provide practical advice for beginners who are learning to write them, and walk through an example.

In programming, a function is a block of reusable code designed to perform a specific task or set of related tasks. They…

Consider which category you drop from a one-hot-encoded column, if you care about the interpretability of your model

Paint shade cards


As a student of data science, I recently learned how to model variable interactions using Ordinary Least Squares (OLS) linear regression. …

In this post, I’ll show you how to add annotations to your visualizations built using Matplotlib.

Annotations allow you to put text labels, boxes and arrows anywhere you like on your graph. …

Jessica Miles

Student of Data Science

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store