TuringAI-Team / data-cleaning

A tool to clean various datasets from Turing AI

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data-cleaning

A tool for cleaning datasets from turing ai

Usage guide

  1. Clone the repository
git clone https://github.com/TuringAI-Team/Data-cleaning.git
  1. Install the requirements
npm install
  1. Create folders, where you will store your datasets and results Create a folder in root called data Create a folder in root called steps
  2. Export your dataset from turing ai to the data folder and name it data.csv
  3. Run the CLI
npm run dev
  1. Follow the instructions in the CLI
  2. The results will be stored in the steps folder

To do

  • Fix bug where sometimes it answer with invalid json
  • Improve cleaning to not remove so much data

About

A tool to clean various datasets from Turing AI

License:MIT License


Languages

Language:JavaScript 51.4%Language:TypeScript 48.6%