shhdgit / sample-csv-files

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Sample CSV files

This repository contains sample Comma Separated Value (CSV) files. CSV is a generic flat file format used to store structured data. Datasets are split in 3 categories: Customers, Users and Organizations. For each, sample CSV files range from 100 to 2 millions records. Those CSV files can be used for testing purpose. They can be open by any application compatible with CSV files or with a CSV editor.

The datasets are generated using random values. Mosly using Python Faker package.

Customers CSV Sample

Customer Schema

  • Index
  • Customer Id
  • First Name
  • Last Name
  • Company
  • City
  • Country
  • Phone 1
  • Phone 2
  • Email
  • Subscription Date
  • Website

People CSV Samples

People Schema

  • Index
  • User Id
  • First Name
  • Last Name
  • Sex
  • Email
  • Phone
  • Date of birth
  • Job Title

Organizations CSV Samples

Organization Schema

  • Index
  • Organization Id
  • Name
  • Website
  • Country
  • Description
  • Founded
  • Industry
  • Number of employees

Local Set up to generate files

Python Environments

Create a Python virtual env:

python3 -m venv venv/sample-csv

Activate it

source venv/sample-csv/bin/activate

So you can install dependencies:

pip install -r requirements.txt

Run script

python src/main.py

About


Languages

Language:Python 100.0%