JinYang88 / LogZip

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

logzip

An efficient compression tool specific for log files. It compresses log files by utilizing the inherent structures of raw log messages, and thereby achieves a high compression ratio.

Prerequisites

  • python3
  • pandas

Installation

Logzip can be directly execute through source code.

  1. Download and install python3 here.

  2. Install Pandas.

    $ pip3 install pandas

  3. Clone logzip.

    $ clone https://github.com/logpai/logzip.git

Data

We've conducted comprehensive experiments to evaluate the efficiency of logzip on five real-world datasets. All the datasets that we use are available at loghub.

Usage

A demo is uploaded to this repo (logzip/src/demo). We use a HDFS log file with 2k lines as a demo.

Compression

$ cd logzip/src/demo/
$ python3 zip_demo.py

Decompression

$ cd logzip/src/demo/
$ python3 unzip_demo.py

About

License:MIT License


Languages

Language:Makefile 58.9%Language:C++ 23.7%Language:Python 14.3%Language:Shell 2.0%Language:C 0.7%Language:Ruby 0.4%