A tool for cleaning datasets exported from Turing AI.
- Clone the repository
git clone https://github.com/TuringAI-Team/Data-cleaning.git
- Install the requirements
npm install
- Create the folders where you will store your datasets and results:
  - Create a folder in the root called data
  - Create a folder in the root called steps
- Export your dataset from Turing AI to the data folder and name it data.csv
- Run the CLI
npm run dev
- Follow the instructions in the CLI
- The results will be stored in the steps folder
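The folder setup in the steps above could also be scripted. The sketch below is only an illustration and is not part of the repository: `prepareWorkspace` is a hypothetical helper name, and it assumes the CLI reads `data/data.csv` and writes to `steps`, both relative to the repository root, as described above.

```typescript
import * as fs from "node:fs";
import * as path from "node:path";

// Hypothetical helper (not part of the repository): creates the data and
// steps folders under `root` and reports whether data/data.csv is present.
export function prepareWorkspace(root: string): { dataFile: string; hasDataset: boolean } {
  const dataDir = path.join(root, "data");
  const stepsDir = path.join(root, "steps");

  // recursive: true means no error is thrown if a folder already exists.
  fs.mkdirSync(dataDir, { recursive: true });
  fs.mkdirSync(stepsDir, { recursive: true });

  // The README asks for the exported dataset to be named data.csv.
  const dataFile = path.join(dataDir, "data.csv");
  return { dataFile, hasDataset: fs.existsSync(dataFile) };
}
```

Calling `prepareWorkspace(process.cwd())` from the repository root before `npm run dev` would create any missing folder, and `hasDataset` would be `false` until the export has been copied to `data/data.csv`.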
- Fix the bug where it sometimes answers with invalid JSON
- Improve the cleaning so that it removes less data