c-w-a / eng-intern-assessment-data

A Starter repo for Shopify Data Eng Intern Assessments

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data Engineer Intern Assessment

Welcome to the Data Engineer Intern assessment for Shopify! This assessment is designed to evaluate your skills in SQL, data manipulation, and problem-solving. Please follow the instructions below to complete the assessment.

Dataset

The assessment is based on a simulated dataset containing sales information from an e-commerce platform. The dataset is provided in the /data directory as a number of CSV files named <table-name>_data.csv. These dataset includes columns such as product_id, sales_amount, customer_id, etc.

Instructions

  1. Fork the Repository: Start by forking this repository to your local machine.
  2. Create a new Branch: Create a new branch to store your work in
  3. Data Understanding: Review the schema.sql file and data .csv files and understand its structure and columns.
  4. Write SQL Queries: Create SQL files (task1.sql, task2.sql, task3.sql) in the /sql directory to solve each task mentioned above.
  5. Submit your Work: Once completed, create a pull request with your changes to the Shopify main branch and submit the link to your PR

Notes

  • Ensure your SQL files contain clear and commented queries for each task.
  • Use the provided datasets (<table-name>_data.csv) for all tasks.
  • Feel free to ask any clarifying questions by creating an issue in this repository.

Good luck, and we look forward to reviewing your work!

About

A Starter repo for Shopify Data Eng Intern Assessments


Languages

Language:Python 100.0%