Siddharth Patel (sid6389)

sid6389

Geek Repo

Github PK Tool:Github PK Tool

Siddharth Patel's starred repositories

Language:JavaScriptStargazers:9Issues:0Issues:0

aws-genai-llm-chatbot

A modular and comprehensive solution to deploy a Multi-LLM and Multi-RAG powered chatbot (Amazon Bedrock, Anthropic, HuggingFace, OpenAI, Meta, AI21, Cohere, Mistral) using AWS CDK on AWS

Language:TypeScriptLicense:MIT-0Stargazers:955Issues:0Issues:0

rasterframes

Geospatial Raster support for Spark DataFrames

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:239Issues:0Issues:0

SQL-DQC

SQL based data profiling & data quality checks, which will help you to perform data profiling & data quality checks on SQL database at table & database level.

Language:TSQLStargazers:7Issues:0Issues:0

deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Language:ScalaLicense:Apache-2.0Stargazers:3165Issues:0Issues:0

sql-translator

SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.

Language:TypeScriptLicense:MITStargazers:4063Issues:0Issues:0

data-engineer-roadmap

Roadmap to becoming a data engineer in 2021

Stargazers:12172Issues:0Issues:0

prefect

Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

Language:PythonLicense:Apache-2.0Stargazers:15276Issues:0Issues:0

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

License:MITStargazers:26544Issues:0Issues:0

Made-With-ML

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Language:Jupyter NotebookLicense:MITStargazers:36536Issues:0Issues:0

manifesto

The OpenTF Manifesto expresses concern over HashiCorp's switch of the Terraform license from open-source to the Business Source License (BSL) and calls for the tool's return to a truly open-source license.

Language:HTMLLicense:Apache-2.0Stargazers:36253Issues:0Issues:0

serge

A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.

Language:SvelteLicense:Apache-2.0Stargazers:5587Issues:0Issues:0

ideas-for-projects-people-would-use

Every time I have an idea, I write it down. These are a collection of my top software ideas -- problems I think enough people have that don't have solutions. I expect you can reach a decent userbase if marketed correctly, as I am surely not the only one with these problems.

License:MITStargazers:1272Issues:0Issues:0

usaddress

:us: a python library for parsing unstructured United States address strings into address components

Language:PythonLicense:MITStargazers:1496Issues:0Issues:0

awesome-selfhosted

A list of Free Software network services and web applications which can be hosted on your own servers

License:NOASSERTIONStargazers:185086Issues:0Issues:0

interview-guide

An opinionated, actionable guide for software engineering interviews.

Language:AstroStargazers:2849Issues:0Issues:0

DataProfiler

What's in your data? Extract schema, statistics and entities from datasets

Language:PythonLicense:Apache-2.0Stargazers:1377Issues:0Issues:0

tutorial-great-expectations

A tutorial for the Great Expectations library.

Language:Jupyter NotebookLicense:MITStargazers:62Issues:0Issues:0

shopify_django_app

Get a Shopify app up and running with Django and Python Shopify API

Language:PythonLicense:MITStargazers:461Issues:0Issues:0

kedro-great

The easiest way to integrate Kedro and Great Expectations

Language:PythonLicense:MITStargazers:52Issues:0Issues:0

data-engineering-zoomcamp

Free Data Engineering course!

Language:Jupyter NotebookStargazers:23561Issues:0Issues:0

address-net

A package to structure Australian addresses

Language:PythonLicense:MITStargazers:193Issues:0Issues:0

parserator

:bookmark: A toolkit for making domain-specific probabilistic parsers

Language:PythonLicense:MITStargazers:789Issues:0Issues:0

Data-Engineering-HowTo

A list of useful resources to learn Data Engineering from scratch

Stargazers:1Issues:0Issues:0

libpostal

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

Language:CLicense:MITStargazers:3985Issues:0Issues:0

WebCrawlerForOnlineInflation

Price Crawler - Tracking Price Inflation

Language:PythonStargazers:175Issues:0Issues:0

HashtagCashtag

My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on ​lambda architecture​, that aggregates Twitter and US stock market data for user sentiment analysis using open source tools - ​Apache Kafka ​for data ingestions, Apache Spark ​& ​Spark Streaming ​for batch & real-time processing, ​Apache Cassandra f​ or storage, ​Flask​, ​Bootstrap and ​HighCharts f​ or frontend.

Language:ScalaStargazers:461Issues:0Issues:0

TASK-Management-System

Spring Boot and Angular 7 web application for task management .

Language:JavaLicense:UnlicenseStargazers:87Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:10249Issues:0Issues:0