kapsali29 / ErenTransormer

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

EREN dataset translation scripts

This is a series of scripts for translating categorical values from spanish to english This repository contains the following files:

  1. DATA/: In this folder all the source files sould be placed
  2. requirements.txt: This file contains all the library requirements for running the translation
  3. settings.py: Contains all the value mappings from spanish to english
  4. transformer.py: This is the execution script of the EREN dataset translation

Prerequisites

  1. Have Python 3.7 installed
  2. Install python requirements pip install -r requirements.txt

Steps for using the script

  1. Create DATA repository and download the following files and put them in DATA folder:
  2. Put the EREN datasets to DATA/ directory
  3. Execute this command for installing python requirements: pip install -r requirements.txt
  4. Execute this script: python transformer.py

The translated datasets will be generated to the OUTPUT/ folder

About

License:MIT License


Languages

Language:Python 100.0%