LeszekBlazewski / data-science

All my work related to data science, machine learning, deep learning and similar.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

data-science

All my work related to data science, machine learning, deep learning and similar.

The datasets/models and other large files can be downloaded from my google-drive.

You can find a summary of each of the projects in their folders.

  1. Spam detection models evaluation

The task was to evaluate and measure different classification models in task of detecting spam e-mails based on data from SpamAssasin. The task was to tune classifiers in order to achieve desired recall and precision instead of accuracy. The data was also a little imbalance and many different approaches were used to conduct the sensitivity studies.

  1. Toxic text classification visualisations

Big data visualisations - multiclassification of 6 types of toxicity: ['toxic', 'severe_toxic', 'obscene', 'threat', 'insult', 'identity_hate'].

  1. K-NN evaluation and benchmarks

Benchmark K-NN classifier in supporting identification of myocardial infarction.

  1. ARQ protocol analysis in Matlab

Benchmark of ARQ protocol used in data correction during transmission.

  1. ETL and OLAP multidimensional data analysis

Multidimensional analysis with SSIS and SSAS of dean's office data.

About

All my work related to data science, machine learning, deep learning and similar.


Languages

Language:Jupyter Notebook 99.2%Language:MATLAB 0.8%