simonschoe / imbalanced-learning

This workshop is part of the "Machine Learning in R" graduate course held at University of Münster, School of Business and Economics (winter term 2020/21). :mortar_board:

Home Page:https://simonschoe.github.io/imbalanced-learning/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Imbalanced Learning

This workshop is part of the "Machine Learning in R" graduate course held at University of Münster, School of Business and Economics (winter term 2020/21). 🎓

Time: Monday, 2020-11-23, 16.00-18.00
Location: Zoom
Presenter: Simon Schölzel, M.Sc.
Slides: https://simonschoe.github.io/imbalanced-learning/

fb4-logo ftb-logo ipb-logo

Contents

This 1 hour workshop serves as a gentle introduction to the field of imbalanced learning. Participants will not only learn about the peculiarities and implications of working with imbalanced data sets, but also how to address class imbalance within the typical machine learning pipeline.

More specifically, after this workshop participants will

  • be able to identify an imbalanced data set and know about its implications for modeling,
  • carry a toolbox of techniques for addressing class imbalance at various stages in your machine learning pipeline (e.g., data collection, resampling, model estimation or model evaluation),
  • have internalized basic (random under- and oversampling) and more advanced techniques of resampling (SMOTE, Borderline SMOTE, NearMiss),
  • know how to distinguish alternative routes to handling class imbalance, such as imbalanced learning or cost-sensitive learning.

Agenda

1 Learning Objectives

2 Introduction to Imbalanced Learning

3 Techniques for Addressing Class Imbalance

3.1 Sampling Strategies

3.1.1 Random Oversampling
3.1.2 Synthetic Minority Over-Sampling Technique (SMOTE)
3.1.3 Borderline SMOTE
3.1.4 Random Undersampling
3.1.5 Informed NearMiss Undersampling

3.2 Case Weighting
3.3 Excursus: Evaluation of Classification Models
3.4 Tinkering with Classification Cutoffs

4 Cost-Sensitive Learning

About

This workshop is part of the "Machine Learning in R" graduate course held at University of Münster, School of Business and Economics (winter term 2020/21). :mortar_board:

https://simonschoe.github.io/imbalanced-learning/


Languages

Language:HTML 61.2%Language:R 22.8%Language:CSS 16.0%