Table of Contents

Data Description

This dataset contains house sale prices for King County, which includes Seattle. It covers homes sold between May 2014 and May 2015.

Task: Estimate the sale price of each home from the given features.

Imports

Important Scripts

Parameters

Load the data
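
A minimal sketch of the loading step, assuming the data arrives as a CSV with a `date` column stored as compact timestamps such as `20141013T000000`. The function name `load_house_data`, the column names, and the two sample rows are illustrative assumptions, not taken from the notebook.

```python
import io

import pandas as pd


def load_house_data(source):
    """Read the sales CSV and parse the sale date.

    `source` may be a file path or a file-like object. The `date` column
    name and its timestamp format are assumptions about the file.
    """
    df = pd.read_csv(source)
    df["date"] = pd.to_datetime(df["date"], format="%Y%m%dT%H%M%S")
    return df


# Tiny in-memory sample standing in for the real file (hypothetical rows).
sample_csv = io.StringIO(
    "id,date,price,zipcode\n"
    "1,20141013T000000,221900.0,98178\n"
    "2,20150225T000000,180000.0,98028\n"
)
df = load_house_data(sample_csv)
print(df.dtypes)
```

Parsing the date up front means later steps can use it as a time index or derive calendar features without re-parsing strings.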

Data Processing using featuretools

References

Remove target from train data
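
One way to keep the target out of automated feature generation is to split it off before building any derived features, so that no generated feature can leak the price. The sketch below assumes the target column is called `price`; the helper `split_off_target` and the sample rows are hypothetical.

```python
import pandas as pd


def split_off_target(df, target="price"):
    """Return (features, target) without modifying the input frame."""
    features = df.drop(columns=[target])
    y = df[target].copy()
    return features, y


# Hypothetical training rows for illustration only.
train = pd.DataFrame(
    {
        "id": [1, 2, 3],
        "sqft_living": [1180, 2570, 770],
        "price": [221900.0, 538000.0, 180000.0],
    }
)
X_train, y_train = split_off_target(train)
print(X_train.columns.tolist())
```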

Data types

from featuretools import variable_types as vtypes

Numeric, Ordinal, Categorical, Datetime, Boolean, ZIPCode, Text, TimeIndex, URL

Featuretools feature creation

Post process feature matrix

Impute extra features

Combine extra features

Add domain knowledge features
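
The notebook's actual domain features are not shown here, so the following is only a sketch of the kind of hand-crafted features that are common for house price data. It assumes columns named `date`, `yr_built`, `yr_renovated`, `sqft_living`, and `sqft_lot`, and that `yr_renovated` is 0 for homes that were never renovated. The function `add_domain_features` and all derived column names are hypothetical.

```python
import pandas as pd


def add_domain_features(df):
    """Add a few hand-crafted features; returns a new frame."""
    out = df.copy()
    sale_year = out["date"].dt.year

    # Age of the house at the time of sale.
    out["age_at_sale"] = sale_year - out["yr_built"]

    # Assumption: yr_renovated == 0 means the house was never renovated.
    out["was_renovated"] = (out["yr_renovated"] > 0).astype(int)

    # Years since the last major work (renovation if any, else construction).
    last_work = out["yr_renovated"].where(out["yr_renovated"] > 0, out["yr_built"])
    out["years_since_update"] = sale_year - last_work

    # Share of the lot covered by living space.
    out["living_to_lot_ratio"] = out["sqft_living"] / out["sqft_lot"]
    return out


# Hypothetical rows for illustration only.
homes = pd.DataFrame(
    {
        "date": pd.to_datetime(["2014-10-13", "2015-02-25"]),
        "yr_built": [1955, 1991],
        "yr_renovated": [0, 2005],
        "sqft_living": [1180, 2000],
        "sqft_lot": [5900, 8000],
    }
)
features = add_domain_features(homes)
print(features[["age_at_sale", "was_renovated", "years_since_update"]])
```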

Train target split

Scaling
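
The scaler used in the notebook is not shown, so this sketch assumes plain standardization (zero mean, unit variance), with statistics computed on the training rows only and then reused for the test rows. The helpers `fit_standardizer` and `apply_standardizer` and the sample values are hypothetical; a library scaler such as scikit-learn's `StandardScaler` would serve the same purpose.

```python
import pandas as pd


def fit_standardizer(train, columns):
    """Compute per-column mean and population standard deviation."""
    mean = train[columns].mean()
    # Replace zero spread with 1.0 so constant columns do not divide by zero.
    std = train[columns].std(ddof=0).replace(0, 1.0)
    return mean, std


def apply_standardizer(df, columns, mean, std):
    """Scale `columns` with previously fitted statistics; returns a new frame."""
    out = df.copy()
    out[columns] = (out[columns] - mean) / std
    return out


# Hypothetical rows for illustration only.
train = pd.DataFrame(
    {"sqft_living": [1000.0, 2000.0, 3000.0], "bedrooms": [2.0, 3.0, 4.0]}
)
test = pd.DataFrame({"sqft_living": [2000.0], "bedrooms": [5.0]})

cols = ["sqft_living", "bedrooms"]
mean, std = fit_standardizer(train, cols)
train_scaled = apply_standardizer(train, cols, mean, std)
test_scaled = apply_standardizer(test, cols, mean, std)
print(test_scaled)
```

Fitting on the training rows alone keeps information about the test distribution out of the model inputs.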

Modelling: CatBoost

Feature importance

HPO using hyperopt

Time Taken
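
A simple way to report total run time is to record a timestamp at the top of the notebook and format the difference at the end. The sketch below uses only the standard library; the helper `format_elapsed` and the variable `notebook_start` are hypothetical names.

```python
import time


def format_elapsed(seconds):
    """Format a duration in seconds as 'H hr M min S sec'."""
    seconds = int(round(seconds))
    hours, remainder = divmod(seconds, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours} hr {minutes} min {secs} sec"


# Recorded once, near the top of the notebook.
notebook_start = time.perf_counter()

# ... the notebook's cells would run here ...

elapsed = time.perf_counter() - notebook_start
print("Time taken:", format_elapsed(elapsed))
```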