Nathan Mauro's starred repositories

aws-glue-schema-registry

AWS Glue Schema Registry Client library provides serializers / de-serializers for applications to integrate with AWS Glue Schema Registry Service. The library currently supports Avro, JSON and Protobuf data formats. See https://docs.aws.amazon.com/glue/latest/dg/schema-registry.html to get started.

Language:JavaLicense:Apache-2.0Stargazers:122Issues:0Issues:0

aws-glue-etl-boilerplate

A complete example of an AWS Glue application that uses the Serverless Framework to deploy the infrastructure and DevContainers and/or Docker Compose to run the application locally with AWS Glue Libs, Spark, Jupyter Notebook, AWS CLI, among other tools. It provides jobs using Python Shell and PySpark.

Language:PythonStargazers:16Issues:0Issues:0

zed

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Language:RustLicense:NOASSERTIONStargazers:42776Issues:0Issues:0

pages-cms

A user-friendly CMS for static site generators.

Language:VueLicense:NOASSERTIONStargazers:1277Issues:0Issues:0

glue-devcontainer-template

VSCode Dev Container template for AWS Glue jobs development

Language:ShellLicense:MITStargazers:15Issues:0Issues:0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:35272Issues:0Issues:0

llm-viz

3D Visualization of an GPT-style LLM

Language:TypeScriptStargazers:3335Issues:0Issues:0

ActionWeaver

Make function calling with LLM easier

Language:PythonLicense:Apache-2.0Stargazers:293Issues:0Issues:0

fzf

:cherry_blossom: A command-line fuzzy finder

Language:GoLicense:MITStargazers:62709Issues:0Issues:0

logseq-plugin-gpt3-openai

A plugin for GPT-3 AI assisted note taking in Logseq

Language:TypeScriptLicense:MITStargazers:701Issues:0Issues:0

gpt4-pdf-chatbot-langchain

GPT4 & LangChain Chatbot for large PDF docs

Language:TypeScriptStargazers:14780Issues:0Issues:0

langchain-tutorials

Overview and tutorial of the LangChain Library

Language:Jupyter NotebookStargazers:6553Issues:0Issues:0

docker-intellij

Run IntelliJ IDEA inside a Docker container

Language:ShellLicense:Apache-2.0Stargazers:35Issues:0Issues:0

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonLicense:NOASSERTIONStargazers:14462Issues:0Issues:0

hadooponwindows

Hadoop 2.7.1 on windows

Language:ShellStargazers:85Issues:0Issues:0

awsglue-local-dev

A local development setup for AWS Glue, modified from https://github.com/big-data-europe/docker-spark

Language:ShellLicense:MITStargazers:3Issues:0Issues:0

Automated_ETL_Finance_Data_Pipeline_with_AWS_Lambda_Spark_Transformation_Job_Python

This project covers the implementation of building an automated ETL data pipeline using Python and AWS Services with Spark transformation job for financial stocks trade transactions. The ETL Data Pipeline is automated using AWS Lambda Function with a Trigger defined. Whenever a new file is ingested into the AWS S3 Bucket; then the AWS Lambda Function gets triggered and will implement the further action to execute the AWS Glue Crawler ETL Spark Transformation Job. The Spark Transformation Job implemented using Python PySpark transforms the trade transactions data stored in the AWS S3 Bucket; further to filter a sub-set of trade transactions for which the total number of shares transacted are less than or equal to 100. Tools & Technologies: Python, Boto3, PySpark, SDK, AWS CLI, AWS Virtual Private Cloud (VPC), AWS VPC Endpoint, AWS S3, AWS Glue, AWS Glue Crawler, AWS Glue Jobs, AWS Athena, AWS Lambda, Spark

Language:PythonStargazers:4Issues:0Issues:0

AWS_ETL_Pipeline_Project

A personal project to gain hands-on experience with AWS and how data flows in the cloud. I created a data pipeline using some of the most popular AWS tools: S3, Glue, Lambda, IAM, RedShift, EventBridge, and CloudWatch.

Language:PythonStargazers:3Issues:0Issues:0

BayMax

Tool to help ETL developers get started with AWS Glue

Language:PythonStargazers:1Issues:0Issues:0

aws-lambda-powertools-examples

This repo holds example projects demoing different types of utilities provided by aws-lambda-powertools project for different runtimes

License:MIT-0Stargazers:29Issues:0Issues:0
Language:PythonLicense:MIT-0Stargazers:8Issues:0Issues:0

gkeepapi

An unofficial client for the Google Keep API.

Language:PythonLicense:MITStargazers:1511Issues:0Issues:0

spark-sandbox

A playground for Spark jobs.

Language:ScalaStargazers:44Issues:0Issues:0

clean-architecture-dotnet

🕸 Yet Another .NET Clean Architecture, but for Microservices project. It uses Minimal Clean Architecture with DDD-lite, CQRS-lite, and just enough Cloud-native patterns apply on the simple eCommerce sample and run on Tye with Dapr extension 🍻

Language:C#License:MITStargazers:1195Issues:0Issues:0

computer-science

:mortar_board: Path to a free self-taught education in Computer Science!

License:MITStargazers:166751Issues:0Issues:0

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language:PythonLicense:Apache-2.0Stargazers:35690Issues:0Issues:0

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

License:MITStargazers:26870Issues:0Issues:0

deobfuscator

The real deal

Language:JavaLicense:Apache-2.0Stargazers:1550Issues:0Issues:0

yake

A Rake-like DSL for writing AWS Lambda handlers

Language:RubyLicense:MITStargazers:169Issues:0Issues:0

mern_shopping_list

Shopping List built with MERN and Redux

Language:TypeScriptStargazers:604Issues:0Issues:0