networkedsystemsIITB / Traffic_Classification

Identification and prioritization of multimedia traffic in wireless access points

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Traffic Classification & Prioritization

The overall goal of this project is to improve the quality of multimedia streaming. This becomes important when other users/applications are downloading large files etc., and it leads to the multimedia quality deterioration. So we first identify the multimedia flow (classification) and then prioritize it. Current Version is 1.0, and it is released on June 22, 2016.

Outline

Traffic classification is useful for traffic engineering and network security. Network administrators can use it to allocate, control and manage the network resources as per their requirements. Classification methods can be used to classify P2P traffic, encrypted traffic, web, streaming, download or any specific application.

Our classification model classifies traffic into two classes, i.e., multimedia and download. We used supervised machine learning algorithms (Decision Tree and K-NN) to build the classification model. This model is trained using pre-labeled training instances and later used to classify the traffic in real-time. We use packet level statistics (average packet size, average inter-arrival time, receiver's window size, flow duration etc.) as features for classification algorithms.

Prioritization module ensures that once the flow is identified as multimedia it will get higher priority over the download flows. We used HTB (Hierarchical Token Bucket Filter) for this purpose.

We have also developed heuristics that can automatically label the training data set with some manual inputs, i.e. labeling each flow in the data set as either multimedia or download. These heuristics look at URI of HTTP GET request and search for multimedia file formats in it, if found then it labels that flow as multimedia.

This project can be used to create a large training data, train the classifier and further classify the traffic. Someone may try to add few new features and change specific settings to analyze the classification behavior.

List of modules developed

  • Classification module (2 approaches, K-NN and decision tree)
  • Prioritization module (HTB)
  • Auto-labeling heuristics to create large training data
  • Configure laptop as AP
  • Configure the DHCP server

Directory Structure

  • doc: Contains project documentation
  • scripts: Contains necessary scripts for the setup, classification and prioritization.

Contents

  • A Detailed Report containing an explanation our work in detail.
  • Scripts for various setups, classification and prioritization.
  • A user guide containing the setup and installation instructions.
  • A developer guide which explains the structure of the scripts.

Authors

  • Hiren Patel, Master's student (2014-2016), Dept. of Computer Science and Engineering, IIT Bombay.
  • Vidya Sagar Kushwaha, Master's student (2014-2016), Dept. of Computer Science and Engineering, IIT Bombay.
  • Prof. Mythili Vutukuru, Dept. of Computer Science and Engineering, IIT Bombay.

Contact Us

  • Hiren Patel, hiren131292[AT]gmail.com
  • Vidya Sagar Kushwaha, vskushwaha21[AT]gmail.com
  • Prof. Mythili Vutukuru, mythili[AT]cse.iitb.ac.in

About

Identification and prioritization of multimedia traffic in wireless access points


Languages

Language:Python 70.7%Language:Shell 29.3%