Siddharth Patel (sid6389)

Siddharth Patel's starred repositories

Language: JavaScript, Stargazers: 9, Issues: 0

aws-genai-llm-chatbot

A modular and comprehensive solution to deploy a Multi-LLM and Multi-RAG powered chatbot (Amazon Bedrock, Anthropic, HuggingFace, OpenAI, Meta, AI21, Cohere, Mistral) using AWS CDK on AWS

Language: TypeScript, License: MIT-0, Stargazers: 997, Issues: 0

rasterframes

Geospatial Raster support for Spark DataFrames

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 241, Issues: 0

SQL-DQC

SQL-based data profiling and data quality checks that you can run against a SQL database at both the table and the database level.

Language: TSQL, Stargazers: 7, Issues: 0

deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Language: Scala, License: Apache-2.0, Stargazers: 3204, Issues: 0

sql-translator

SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.

Language: TypeScript, License: MIT, Stargazers: 4116, Issues: 0

data-engineer-roadmap

Roadmap to becoming a data engineer in 2021

Stargazers: 12247, Issues: 0

prefect

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Language: Python, License: Apache-2.0, Stargazers: 15561, Issues: 0

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

License: MIT, Stargazers: 26900, Issues: 0

Made-With-ML

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Language: Jupyter Notebook, License: MIT, Stargazers: 36741, Issues: 0

manifesto

The OpenTF Manifesto expresses concern over HashiCorp's switch of the Terraform license from open-source to the Business Source License (BSL) and calls for the tool's return to a truly open-source license.

Language: HTML, License: Apache-2.0, Stargazers: 36234, Issues: 0

serge

A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy-to-use API.

Language: Svelte, License: Apache-2.0, Stargazers: 5613, Issues: 0

ideas-for-projects-people-would-use

Every time I have an idea, I write it down. This is a collection of my top software ideas: problems that I think enough people have and that don't yet have solutions. I expect you can reach a decent userbase if they are marketed correctly, as I am surely not the only one with these problems.

License: MIT, Stargazers: 1337, Issues: 0

usaddress

🇺🇸 A Python library for parsing unstructured United States address strings into address components

Language: Python, License: MIT, Stargazers: 1505, Issues: 0

awesome-selfhosted

A list of Free Software network services and web applications which can be hosted on your own servers

License: NOASSERTION, Stargazers: 189848, Issues: 0

interview-guide

An opinionated, actionable guide for software engineering interviews.

Language: Astro, Stargazers: 2938, Issues: 0

DataProfiler

What's in your data? Extract schema, statistics and entities from datasets

Language: Python, License: Apache-2.0, Stargazers: 1396, Issues: 0

tutorial-great-expectations

A tutorial for the Great Expectations library.

Language: Jupyter Notebook, License: MIT, Stargazers: 64, Issues: 0

shopify_django_app

Get a Shopify app up and running with Django and Python Shopify API

Language: Python, License: MIT, Stargazers: 465, Issues: 0

kedro-great

The easiest way to integrate Kedro and Great Expectations

Language: Python, License: MIT, Stargazers: 52, Issues: 0

data-engineering-zoomcamp

Free Data Engineering course!

Language: Jupyter Notebook, Stargazers: 24034, Issues: 0

address-net

A package to structure Australian addresses

Language: Python, License: MIT, Stargazers: 193, Issues: 0

parserator

🔖 A toolkit for making domain-specific probabilistic parsers

Language: Python, License: MIT, Stargazers: 791, Issues: 0

Data-Engineering-HowTo

A list of useful resources to learn Data Engineering from scratch

Stargazers: 1, Issues: 0

libpostal

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

Language: C, License: MIT, Stargazers: 4014, Issues: 0

WebCrawlerForOnlineInflation

Price Crawler - Tracking Price Inflation

Language: Python, Stargazers: 180, Issues: 0

HashtagCashtag

My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on the lambda architecture that aggregates Twitter and US stock market data for user sentiment analysis using open source tools: Apache Kafka for data ingestion, Apache Spark and Spark Streaming for batch and real-time processing, Apache Cassandra for storage, and Flask, Bootstrap, and HighCharts for the frontend.

Language: Scala, Stargazers: 470, Issues: 0

TASK-Management-System

Spring Boot and Angular 7 web application for task management.

Language: Java, License: Unlicense, Stargazers: 90, Issues: 0

Language: Jupyter Notebook, License: MIT, Stargazers: 10284, Issues: 0