PrachpaveenY / PySpark-project

PySpark-project (Complete)

  • Project : PySpark Project
  • Tools : Python, PySpark, Pandas, Azure SQL Database

Detail :

  • Create files : CSV and TXT (PySpark_createData.ipynb).
  • PySpark and Pandas : manage the data (use.ipynb).
  • Microsoft Azure project : manage data in Azure SQL Database (Azure-sql-database-project.ipynb).
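
Example (sketch) : the block below illustrates the "create files, then manage data" flow with Pandas only. It is a minimal sketch, not the code from the notebooks. The column names and sample rows are hypothetical, because the real contents of HR01.csv are not shown here, and the files are written to a temporary folder so nothing in the repository is overwritten.

```python
import tempfile
from pathlib import Path

import pandas as pd

# Hypothetical sample rows; the real HR01.csv columns may differ.
rows = [
    {"emp_id": 1, "name": "Ann", "department": "HR", "salary": 50000},
    {"emp_id": 2, "name": "Bob", "department": "IT", "salary": 65000},
    {"emp_id": 3, "name": "Cara", "department": "IT", "salary": 70000},
]
df = pd.DataFrame(rows)

# Write to a temporary folder (assumption: keeps the repository files untouched).
out_dir = Path(tempfile.mkdtemp())
csv_path = out_dir / "HR01.csv"
txt_path = out_dir / "hr01.txt"

df.to_csv(csv_path, index=False)            # comma-separated CSV
df.to_csv(txt_path, index=False, sep="\t")  # tab-separated TXT

# Manage data: read the TXT file back and compute the mean salary per department.
loaded = pd.read_csv(txt_path, sep="\t")
avg_salary = loaded.groupby("department")["salary"].mean()
print(avg_salary)
```

The same steps can be done with PySpark (`spark.read.csv` and `DataFrame.groupBy`), which needs a local Spark and Java installation.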

Structure :

πŸ“ PySpark-project
└── πŸ“ all project
|   └── πŸ“ apache airflow
|   |
|   └── πŸ“ app
|   |   └── πŸ“ tables                           // All CSV Files
|   |   └── πŸ“„ use.ipynb                        // Main File of app ( use manage data )
|   |
|   └── πŸ“ google cloud platform
|   |
|   └── πŸ“ microsoft azure project
|   |   └── πŸ“„ Azure-sql-database-project.ipynb // Main File of microsoft azure project ( use manage data of Azure SQL Database )
|   |
└── πŸ“ img
└── πŸ“ tutorial
|   └── πŸ“„ HR01.csv                         // Use CSV of tutorial
|   └── πŸ“„ Pandas_Tutorial1.ipynb           // Pandas Tutorial
|   └── πŸ“„ PySpark_Tutorial1.ipynb          // PySpark Tutorial
|   └── πŸ“„ Python_Tutorial01.ipynb          // Python Tutorial
|   └── πŸ“„ python_tutorial01.py             // Python Tutorial
|
└── πŸ“„ HR01.csv
└── πŸ“„ hr01.txt
└── πŸ“„ PySpark_createData.ipynb           // Create File
└── πŸ“„ README.md

Steps to run the project :

  • Run use.ipynb (in all project/app).
  • Run PySpark_Tutorial1.ipynb (in tutorial).




Languages

Jupyter Notebook 94.0%, Python 6.0%