Table of Contents

Data Description

This dataset contains house sale prices for King County, Washington, which includes Seattle. It covers homes sold between May 2014 and May 2015.

Task: Estimate the sale price of a house from the given features.

Imports

Important Scripts

Parameters

Load the data

Data Processing

Drop unwanted features

Log transform large values

Log transform the target of the training data (predictions must be transformed back to the price scale later)
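
A minimal sketch of the round trip this step implies, assuming a log(1 + x) transform and its inverse exp(x) - 1. The price values are made up for illustration, and the variable names are hypothetical rather than taken from the notebook.

```python
import math

# Hypothetical sale prices, for illustration only.
prices = [221900.0, 538000.0, 180000.0, 1225000.0]

# Forward transform: log(1 + price) compresses the long right tail of prices.
log_prices = [math.log1p(p) for p in prices]

# Inverse transform: exp(x) - 1 maps values on the log scale back to prices.
# The same inverse would be applied to model predictions made on the log scale.
recovered = [math.expm1(x) for x in log_prices]

# Largest round-trip error, which should be negligible floating-point noise.
max_abs_error = max(abs(a - b) for a, b in zip(prices, recovered))
```

Error metrics reported in price units should be computed after the inverse transform, not on the log scale.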

Create squared features
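
One way this step could look, as a sketch only: each selected numeric column gets a companion column holding its square. The helper `add_squared_features`, the `_squared` suffix, and the choice of columns are assumptions made for illustration.

```python
def add_squared_features(rows, columns):
    """Return copies of the row dicts with a '<name>_squared' entry per column."""
    out = []
    for row in rows:
        new_row = dict(row)  # copy so the input rows are left unchanged
        for col in columns:
            new_row[f"{col}_squared"] = row[col] ** 2
        out.append(new_row)
    return out


# Two made-up rows; the column names are assumed for this example.
rows = [
    {"bedrooms": 3, "sqft_living": 1180},
    {"bedrooms": 4, "sqft_living": 2570},
]
expanded = add_squared_features(rows, ["bedrooms", "sqft_living"])
```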

Feature selection

Train validation split
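
A sketch of a reproducible random split using only the standard library. The 80/20 ratio, the seed value, and the helper name `train_valid_split` are assumptions, not values taken from the notebook.

```python
import random


def train_valid_split(rows, valid_fraction=0.2, seed=100):
    """Shuffle row indices with a fixed seed and hold out a validation share."""
    indices = list(range(len(rows)))
    random.Random(seed).shuffle(indices)
    n_valid = int(round(len(rows) * valid_fraction))
    valid_idx = set(indices[:n_valid])
    # Preserve the original row order inside each part.
    train = [r for i, r in enumerate(rows) if i not in valid_idx]
    valid = [r for i, r in enumerate(rows) if i in valid_idx]
    return train, valid


rows = list(range(10))  # stand-in for ten data rows
train, valid = train_valid_split(rows)
```

Fixing the seed keeps the split identical across runs, so validation scores stay comparable between experiments.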

Scaling
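
A sketch of standardization (zero mean, unit variance), assuming that is the kind of scaling meant here. The statistics are fitted on the training part only and then reused for the validation part, so no validation information leaks into training. The function names and numbers are hypothetical.

```python
import statistics


def fit_standard_scaler(values):
    """Return the mean and population standard deviation of the training values."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    return mean, (std if std > 0 else 1.0)  # guard against constant columns


def apply_scaler(values, mean, std):
    """Standardize values with previously fitted statistics."""
    return [(v - mean) / std for v in values]


train_sqft = [1000.0, 2000.0, 3000.0]  # made-up training column
valid_sqft = [2500.0]                  # made-up validation column

mean, std = fit_standard_scaler(train_sqft)
train_scaled = apply_scaler(train_sqft, mean, std)
valid_scaled = apply_scaler(valid_sqft, mean, std)  # reuses the training statistics
```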

Create lgb Dataset

Modelling: lightgbm (lgb api)

Baseline model

Model convergence

Feature Importance

Cross validation
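
A sketch of the index bookkeeping behind k-fold cross validation: every row lands in the validation part exactly once, and the model is retrained on the remaining rows each time. This illustrates the idea only; it is not the lgb API call, and the helper `k_fold_indices` is hypothetical.

```python
def k_fold_indices(n_rows, n_folds=5):
    """Yield (train_indices, valid_indices) pairs for contiguous folds."""
    # Spread the remainder over the first folds so sizes differ by at most one.
    fold_sizes = [
        n_rows // n_folds + (1 if i < n_rows % n_folds else 0)
        for i in range(n_folds)
    ]
    start = 0
    for size in fold_sizes:
        valid_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_rows))
        yield train_idx, valid_idx
        start += size


folds = list(k_fold_indices(10, n_folds=3))
```

In practice the rows would be shuffled before folding, and the per-fold scores averaged to obtain the cross-validated estimate.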

Time Taken