Chayan Shrang Raj 's repositories
Data-Pipeline-with-dbt-using-Airflow-on-GCP
This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data to Bigquery. There are different tools that have been used in this project such as Astro, DBT, GCP, Airflow, Metabase.
Python-ETL-pipeline-using-Airflow-on-AWS
This project demonstrates how to build and automate an ETL pipeline written in Python and schedule it using open source Apache Airflow orchestration tool on AWS EC2 instance.
LINHAC-2022-Data-Science-Student-Competition
Linköping Hockey Analytics Conference - LINHAC 2022 | Given the event data, generate findings/patterns related to sequences of events leading up to a particular outcome.
Youtube-video-data-analytics-using-AWS
This project aims to leverage Amazon Web Services to create trending Youtube videos analytics service. Project contains different data engineering, data analysis and data science parts.
Microsoft-Azure-Medallion-Data-pipeline
In this project we are going to create an end-to-end data platform right from Data Ingestion, Data Transformation, Data Loading and Reporting.
End-to-end-machine-learning
The idea of this project is to apply statistical methods learned in university lectures to find patterns in the data and use machine learning to solve a supervised classification problem
Advanced-Regression-Techniques-for-Ames-housing-data-prediction
Prediction of Ames house prices using advanced regression techniques and ML algorithms.
Neural-Networks-and-Learning-Systems
Solving problems using different machine learning algorithms. Machine learning, classification, pattern recognition and high-dimensional data analysis.
Snowflake-data-ingestion-hands-on-tutorial
This repo covers the two most widely used and recommended file based data ingestion approaches: COPY INTO and Snowpipe.
Vision-Transformer-Research
The purpose of this research project is to compare traditional CNNs to vision transformers, can transformers give a higher AUC when classifying Atypical Femoral Fracture / Normal Femoral Fracture?
CIFAR10-competition_model
Keras implementation of a model for the in-class competition in Neural Networks and Learning Systems course at Linköping Univerity, spring 2022.
Data-Science-Apache-Spark-Databricks-ETL-Project
Using Databricks community edition to build multiple end-to-end ETL pipelines using PySpark for different file formats such as CSV, Parquet, Delta table. Predictive modeling is performed using different machine learning algorithms.
dbt_learn_fundamentals
dbt is a SQL-first transformation workflow that lets teams quickly and collaboratively deploy analytics code following software engineering best practices like modularity, portability, CI/CD, and documentation.
Prediction-of-used-car-prices-using-various-regression-techniques
It is a work on a regression problem in which our objective is to predict the prices of used cars given a number of features/predictors about them
h3-py-notebooks
Jupyter notebooks for h3-py, a hierarchical hexagonal geospatial indexing system
text-mining
Notebooks for the course 732A92/TDDE16 Text Mining
Text-Mining-Research
Text Mining develops methods for accessing information in and extracting knowledge from large volumes of text. The purpose of this repository is to research on classification of fake/real news on a dataset obtained from kaggle
chayansraj
May the code be with you!
ML-Clustering-soft-drink-preferences
The aim is to perform cluster analysis to identify grouping by cases of data based on the similarity of shares.
SQL-Data-Analysis-Projects
The goal is to learn and perform hands-on SQL to analyze different types of datasets. All the SQL queries are written in Google BigQuery which is a serverless, distributed SQL Query engine. The projects go from Basic level to Advanced level.