Modelling Fraud Detection using evalML

Go to Top

References

Load the libraries

Go to Top

Colab

Useful Scripts

Go to Top

Load the Data

Go to Top

Data Processing

Go to Top

Train Test Split

Go to Top

Modelling: evalML

Go to Top

Configure Cost of Fraud

Go to Top

To optimize the pipelines toward the specific business needs of this model, you can set your own assumptions for the cost of fraud. These parameters are

AutoML Search

Go to Top

AutoMLSearch(
    problem_type             = None,
    objective                = 'auto', # r2, log loss binary/multiclass
    max_iterations           = None,
    max_time                 = None, # inteter is seconds, strings: minutes hours
    patience                 = None, # default is no early stopping
    tolerance                = None,
    data_split               = None, # StratifiedKFold
    allowed_pipelines        = None,
    allowed_model_families   = None,
    start_iteration_callback = None,
    add_result_callback      = None,
    error_callback           = None,
    additional_objectives    = None,
    random_state             = 0,
    n_jobs                   = -1,
    tuner_class              = None,
    verbose                  = True,
    optimize_thresholds      = False,
    ensembling               = False,
    max_batches              = None,
    problem_configuration    = None,
    _pipelines_per_batch     = 5,
)


automl.search(X, y,
              data_checks='auto',
              show_iteration_plot=True)

Select the Ojbective: FraudCost

Go to Top

View rankings and select pipeline

Go to Top

Fit the Model

Go to Top

Model Evaluation

Go to Top

Time Taken

Go to Top