chdd / OpenCompetition

A collection of algorithms for data science competition. Only focus on the tested and latest

The Open Competition Toolkit

Overview

This library collects useful code for data science competitions, along with some code for research purposes.

The library contains materials for Tabular Data Mining, NLP, CV and RL.

Most of the code is written in Python 3.7, with some parts in Cython, C++ and R. We do, however, provide a unified interface for Python.

Reference material for what will be implemented can be found at https://www.overleaf.com/read/xftrzgtcxkpd (registration needed). I will be hosting the details in the lecture on GeekBang (https://time.geekbang.org/).

Features of the Library

Environment

The environment includes the following:

  1. Anaconda with Python 3.7.
  2. PyTorch 1.4.
  3. TensorFlow 1.13.
  4. Other required packages (list still to be completed).
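
As a quick way to confirm that an installed environment matches the versions listed above, a minimal sketch follows. The helper names (`installed_version`, `matches`, `check_environment`) are hypothetical and not part of this library; the import names `torch` and `tensorflow` are the usual ones for PyTorch and TensorFlow and are assumed here.

```python
import importlib

# Versions taken from the Environment list; the module names are assumptions.
EXPECTED = {"torch": "1.4", "tensorflow": "1.13"}


def installed_version(module_name):
    """Return the module's __version__ string, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", "unknown")


def matches(installed, expected_prefix):
    """True if the installed version equals the expected one or extends it (1.4 -> 1.4.0)."""
    if installed is None:
        return False
    return installed == expected_prefix or installed.startswith(expected_prefix + ".")


def check_environment(expected=None):
    """Map each module name to a pair (installed_version, matches_expected)."""
    expected = EXPECTED if expected is None else expected
    report = {}
    for name, wanted in expected.items():
        found = installed_version(name)
        report[name] = (found, matches(found, wanted))
    return report
```

Calling `check_environment()` with no arguments checks the two frameworks listed above; passing a different dictionary checks other packages in the same way. A package that is not installed is reported as `(None, False)` rather than raising an error.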

Available contents

General

Tabular

  1. Category Encoders (some details still to be finished).
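
For readers unfamiliar with category encoding, here is a minimal pure-Python sketch of two common schemes: ordinal encoding and smoothed mean-target encoding. It is a generic illustration only; the function names and the `smoothing` parameter are assumptions and do not describe this library's interface.

```python
from collections import defaultdict


def fit_ordinal(values):
    """Assign each distinct category an integer code in order of first appearance."""
    mapping = {}
    for value in values:
        if value not in mapping:
            mapping[value] = len(mapping)
    return mapping


def transform_ordinal(values, mapping, unknown=-1):
    """Replace categories by their codes; unseen categories get `unknown`."""
    return [mapping.get(value, unknown) for value in values]


def fit_target_mean(values, targets, smoothing=1.0):
    """Smoothed mean-target encoding.

    Each category maps to
    (sum_of_targets + smoothing * global_mean) / (count + smoothing),
    which pulls rare categories toward the global mean.
    Assumes `targets` is non-empty.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for value, target in zip(values, targets):
        sums[value] += target
        counts[value] += 1
    global_mean = sum(targets) / len(targets)
    encoding = {
        value: (sums[value] + smoothing * global_mean) / (counts[value] + smoothing)
        for value in counts
    }
    return encoding, global_mean


# Toy data, invented for illustration.
colors = ["red", "blue", "red", "green"]
clicked = [1, 0, 1, 0]

codes = fit_ordinal(colors)
print(transform_ordinal(["blue", "red", "purple"], codes))  # [1, 0, -1]

encoding, prior = fit_target_mean(colors, clicked, smoothing=1.0)
print(encoding["blue"], prior)  # 0.25 0.5
```

In a competition setting the target encoding would normally be fitted on training folds only and applied to held-out rows, since fitting on the rows being encoded leaks the target. Unseen categories can fall back to the returned global mean.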

NLP

CV

RL

To Finish

General

Tabular

NLP

CV

RL

About

License: Apache License 2.0


Languages

Python 83.3%, Jupyter Notebook 16.3%, Shell 0.2%, CSS 0.1%, R 0.1%, JavaScript 0.0%, Dockerfile 0.0%