Tejashri Rajendra Pathak's starred repositories

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:36544Issues:385Issues:67

LLM-As-Chatbot

LLM as a Chatbot Service

Language:PythonLicense:Apache-2.0Stargazers:3273Issues:53Issues:66

Data-Science-Interview-Preperation-Resources

Resoruce to help you to prepare for your comming data science interviews

Practical-Machine-Learning

Practical machine learning notebook & articles covers the machine learning end to end life cycle.

Language:Jupyter NotebookStargazers:914Issues:19Issues:0

Amazing-Feature-Engineering

Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.

Language:Jupyter NotebookStargazers:640Issues:15Issues:1

LLMRec

[WSDM'2024 Oral] "LLMRec: Large Language Models with Graph Augmentation for Recommendation"

Language:PythonLicense:Apache-2.0Stargazers:333Issues:4Issues:18

free-data-engineering-course-for-beginners

Learn the entire ETL process based on Spotify API data

e2e-data-engineering

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.

Data-Engineering-track-with-Python

All Data Engineering notebooks from Datacamp course

Language:Jupyter NotebookLicense:MITStargazers:113Issues:3Issues:0

Building-Recommender-Systems-with-Machine-Learning-and-AI

Building Recommender Systems with Machine Learning and AI, published by Packt

Language:Jupyter NotebookLicense:MITStargazers:96Issues:10Issues:1

aws-data-engineering

Resources for the free AWS Data Engineering course on youtube

data-engineering-nanodegree

notebooks produced throughout the Udacity's Nanodegree Data Engineering Course

Language:Jupyter NotebookStargazers:72Issues:1Issues:0

RedditDataEngineering

This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and services including Apache Airflow, Celery, PostgreSQL, Amazon S3, AWS Glue, Amazon Athena, and Amazon Redshift.

Language:PythonStargazers:53Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:45Issues:3Issues:2
Language:PythonLicense:GPL-3.0Stargazers:38Issues:4Issues:2

DSA-Together-HacktoberFest

DSA-Together is a collection of python solutions for problems from DSA-450 Sheet by Love Babbar , Leetcode Sheet by Fraz , important algorithms and concepts necessary for becoming an excellent programmer .

Language:PythonLicense:MITStargazers:37Issues:1Issues:50
Language:Jupyter NotebookLicense:MITStargazers:31Issues:0Issues:0

University-of-Minnesota-Recommender-System-Specialization

Repository for the Honor Track of Recommender Systems Specialization from University of Minnesota on Coursera

Language:HTMLStargazers:28Issues:2Issues:0

aws-data-engineering

Course Material Data Engineering on AWS Course

Language:PythonStargazers:26Issues:1Issues:0

data-pipeline-automation-with-github-actions-4503382

This repo is for LinkedIn Learning course: Data Pipeline Automation with GitHub Actions

Language:HTMLLicense:NOASSERTIONStargazers:23Issues:7Issues:12

sql-for-data-engineering-course

sql-for-data-engineering-course

Language:Jupyter NotebookStargazers:16Issues:0Issues:0
Language:PythonStargazers:15Issues:0Issues:0

IBM-DevOps-and-Software-Engineering

All assignments from the 13 courses in the "IBM DevOps and Software Engineering Professional Certificate" on Coursera.

Stargazers:9Issues:0Issues:0

Japan-visa-data-engineering

This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark clusters are set up within a Docker container on Azure.

Language:HTMLStargazers:8Issues:2Issues:0

YoutubeAnalytics

An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlDB. The processed analytics data is then sent to Telegram for real-time notifications.

Language:HTMLStargazers:6Issues:2Issues:0

llmrecsys

Experiments with LLMs for recommendation systems

Language:Jupyter NotebookStargazers:6Issues:1Issues:0

using-large-datasets-with-pandas-4467955

This repo is for linkedin learning course: Using Large Datasets with pandas

Language:PythonLicense:NOASSERTIONStargazers:5Issues:7Issues:0

llmusic

An LLM-based playlist generator connected to Spotify

Language:JavaScriptLicense:MITStargazers:4Issues:0Issues:0