k00sharath / SQL-Project-for-Data-Analysis-part-1-7

Complete SQL Project for data analysis with source code.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

SQL Projects for Data Analysis using Oracle Database

๐—”๐—œ๐— :
In this SQL Project for Data Analysis, We will learn to efficiently leverage various analytical features and functions accessible through SQL in Oracle Database.

๐—ช๐—ต๐—ฎ๐˜ ๐—ถ๐˜€ ๐˜๐—ต๐—ฒ ๐—”๐—ด๐—ฒ๐—ป๐—ฑ๐—ฎ ๐—ผ๐—ณ ๐˜๐—ต๐—ฒ ๐—ฝ๐—ฟ๐—ผ๐—ท๐—ฒ๐—ฐ๐˜?
The projectโ€™s Agenda involves Analyzing the data using SQL on the Oracle Database Software. We first download the Oracle Database 21c edition from the Oracle website and understand the problem. Then tables are created in the database followed by data insertion into tables and exploration, i.e., noticing relationships between tables, walking through the columns, and seeing comments. And perform the following activties--

  1. Records are displayed in an ordered manner, handling NULL values, and selecting records based on patterns like Wildcard, Operators, etc. Then working on Data Manipulation commands(DML) for Data Analysis. Then taking Backup of the Table where migration is going on and use COMMIT and ROLLBACK commands.
  2. Understanding different types of Joins(Inner join, Left outer join, Right outer join, Full outer join, Self join), different types of Operators(Minus, Union, Union all, Intersect).
  3. data analysis using Sub-query, Group-by clause and Exists clause. It also consists of using inline view and aggregate functions(Min, Max, Count, Avg) to perform better analysis on data.
  4. data analysis using WITH clause, the difference between COUNT(*) and COUNT(column_name), Categorization using the CASE statement, and various real-life case studies/problem statements.
  5. data analysis using different SQL functions like ROW_NUMBER, RANK, DENSE_RANK, SUBSTR, INSTR, COALESCE and NVL. It also involves the use of some built-in functions like concat, upper, lower, initcap, rtrim, ltrim, length, lpad, rpad.
  6. introduction to Data Wrangling, operations on missing data, unwanted features and duplicated records. It also involves the use of the pivot and unpivot in SQL.
  7. understanding of the Online Shopping Database, and using this database to perform the following Data Wrangling activities-
    a. Split full name into the first name and last name.
    b. Correct phone numbers and emails which are not in a proper format and Correct contact number and remove full name.
    c. Read BLOB column and fetch attribute details from the regular tag, nested columns.
    d. Create separate tables for blob attributes.
    e. Remove invalid records from order_items where shipment_id is not mapped.
    f. Map missing first name and last name with email id credentials.

๐—ช๐—ต๐—ฎ๐˜ ๐—ถ๐˜€ ๐——๐—ฎ๐˜๐—ฎ๐˜€๐—ฒ๐˜ ๐—”๐—ป๐—ฎ๐—น๐˜†๐˜€๐—ถ๐˜€?
Dataset Analysis is defined as manipulating or processing unstructured data or raw data to draw valuable insights and conclusions that will help derive critical decisions that will add some business value. The dataset analysis process is followed by organizing the dataset, transforming the dataset, visualizing the dataset, and finally modeling the dataset to derive predictions for solving the business problems, making informed decisions, and effectively planning for the future.

๐——๐—ฎ๐˜๐—ฎ ๐—ฃ๐—ถ๐—ฝ๐—ฒ๐—น๐—ถ๐—ป๐—ฒ:
It refers to a system for moving data from one system to another. The data may or may not be transformed, and it may be processed in real-time (or streaming) instead of batches. A data pipeline is extracting or capturing data using various tools, storing raw data, cleaning, validating data, transforming data into a query-worthy format, visualization of KPIs including Orchestration of the above process is data pipeline.

๐—ข๐—ฟ๐—ฎ๐—ฐ๐—น๐—ฒ ๐—ฆ๐—ค๐—Ÿ ๐——๐—ฒ๐˜ƒ๐—ฒ๐—น๐—ผ๐—ฝ๐—ฒ๐—ฟ: Oracle SQL Developer is a free IDE that makes it easy to develop and operate Oracle Database in both traditional and cloud environments. SQL Developer is a complete end-to-end development of PL/SQL jobs, worksheets for running queries and scripts,and comprehensive data modeling output.

๐—ง๐—ฒ๐—ฐ๐—ต ๐˜€๐˜๐—ฎ๐—ฐ๐—ธ:
โ— SQL Programming language
โ— Oracle SQL Developer

๐—ž๐—ฒ๐˜† ๐—ง๐—ฎ๐—ธ๐—ฒ๐—ฎ๐˜„๐—ฎ๐˜†๐˜€:
โ— Understanding the project and how to use Oracle Database 21c.
โ— Understanding the basics of data analysis, SQL commands, and their application.
โ— Working on DML commands and listing employee details based on complex nested conditions.
โ— Introduction to Oracle SQL Developer.
โ— Usage of Oracle SQL Developer and connecting it to Oracle Database.
โ— Creating tables and Inserting data into them.
โ— Listing Employees and Departments based on some conditions.
โ— Displaying records in an ordered manner using DESC keyword and Handling NULL values.
โ— Selecting records based on some patterns like operators etc and Working on DML commands for analysis.
โ— Creating a backup of the table where migration is going on and Executing COMMIT and ROLLBACK commands.
โ— Listing DISTINCT & Renaming the column records for analysis.
โ— Listing down employee details based on complex nested conditions.
โ— Understanding different types of joins and operators.
โ— Understanding the difference between normal queries and ANSI queries.
โ— Joining multiple tables & with ANSI queries.
โ— Understanding the difference between UNION and UNION ALL operators.
โ— Understanding ambiguously defined error and resolving column ambiguoty.
โ— Understanding different types of aggregate functions(Min, Max, Count, Avg) & clauses.
โ— Data analysis using Sub-query and its background process.
โ— Understanding the inline view and Data analysis using Group-by clause.
โ— Combine different aggregate results in a single row.
โ— Understanding the difference between COUNT(*) and COUNT(column_name).
โ— Data analysis using WITH clause and Simplify query with WITH clause and View.
โ— Categorization using CASE statement and the use of the ROWNUM clause.
โ— Understanding the ROW_NUMBER function and SUBSTR and INSTR functions.
โ— Data analysis using the RANK function and Difference between RANK and DENSE_RANK functions.
โ— Data analysis using the built-in functions. Deal with NULL values using the NVL function.
โ— Understanding the use of COALESCE function. Change the date format.
โ— Understanding the concept of Data Wrangling. Remove unwanted features from data using SQL queries.
โ— Deal with missing data. How to remove missing data and how to impute missing data using SQL queries.
โ— Understanding Pivot and Unpivot functions in SQL.
โ— Pivoting rows to columns using SQL queries. Pivoting rows to columns with joins using SQL queries.
โ— Understanding the concept of Data Wrangling and Online Shopping database.
โ— Perform Data Wrangling activities on the data.

๐—œ๐—ป๐˜€๐˜๐—ฎ๐—น๐—น๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐˜€๐˜๐—ฒ๐—ฝ๐˜€ ๐—ต๐—ฒ๐—ฟ๐—ฒ:
https://github.com/Ajay026/Projects/tree/main/Installation%20%26%20Execution/Installation%20%26%20Execution

๐—ฆ๐—ผ๐—น๐˜‚๐˜๐—ถ๐—ผ๐—ป ๐˜„๐—ถ๐˜๐—ต ๐—ฆ๐—ผ๐˜‚๐—ฟ๐—ฐ๐—ฒ ๐—–๐—ผ๐—ฑ๐—ฒ:
https://github.com/Ajay026/Projects/tree/main/Codes/momo1606-sql_basic_latest-2ed0938ee08d

image

image

About

Complete SQL Project for data analysis with source code.