Table of Contents

Description

This project uses the consumer complaint database.

Data Description

The Consumer Complaint Database is a collection of complaints about consumer financial products and services that we sent to companies for response. Complaints are published after the company responds, confirming a commercial relationship with the consumer, or after 15 days, whichever comes first. Complaints referred to other regulators, such as complaints about depository institutions with less than $10 billion in assets, are not published in the Consumer Complaint Database. The database generally updates daily.

Purpose

Classify consumer complaints into predefined categories.

Classification algorithms

Business Problem

Business Problem:
Task    : Find the category of given complaint.
Metric : IF-IDF
Cleaning: Remove punctuations, expand contractions, etc
Question: Which class the given complaint belongs to?

Imports

Useful Scripts

Load the data

Text Manipulation

Modelling Text Data

Create Label Column

Train Test Split

Text processing pipeline

Model Predictions

Model Evaluation Multiclass

Cross Validation