Table of Contents

Data Description

This dataset contains house sale prices for King County, which includes Seattle. It includes homes sold between May 2014 and May 2015.

Task: Try to estimate the price based on given features.

Imports

Important Scripts

Load the data

Log transform large values

Train-Test split after log transform

Making pipeline for various sklearn Regressors (with automatic scaling)

Hyperparameter Tuning

Grid Search for Gradient Boosting Regressor

Reference: https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.GradientBoostingRegressor.html

Grid search is extremely slow. It need to fit the model for all the values in the search space.

Hyper Parameter using hyperopt-sklearn for Gradient Boosting Regressor

References:

Scale data for hyperparameter tuning

For sklearn we used pipeline with scaling, now for tuning hyperparameter using hyperopt, we need to scale them.