Josh Crosby's repositories
fastapi_async_db
An implementation for fastapi with Alembic and databases for Asynchronicity
machine_learning_examples
A collection of machine learning examples and tutorials.
admin-wrapper-docker
Convenient admin wrapper script for ad-hoc running of things within a container
athena-glue-service-logs
Glue scripts for converting AWS Service Logs for use in Athena
awesome-compose
Awesome Docker Compose samples
aws-data-wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
aws-glue-samples
AWS Glue code samples
bash-lambda-layer
Run Bash scripts in AWS Lambda via Layers
bookish-spork
This is a test repo for UNC class
dask-kubernetes
Native Kubernetes integration for dask
data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
docker-stacks
Ready-to-run Docker images containing Jupyter applications
FullStack
Full Stack prep
kafka-pocs
Apache Kafka and Related Projects
kafka-pyspark-postgres
kafka-pyspark-postgres
little-book-of-pipelines
This repository goes over how to handle massive variety in data engineering
project-one
UNC First Project
pygwalker
PyGWalker: Turn your pandas dataframe into a Tableau-style User Interface for visual analysis
solid-fishstick
Test repo for UNC Night 3
spark
Apache Spark
sync-buckets-state-machine
A sample AWS Step Functions (SFN) state machine, designed to one-way synchronize an Amazon S3 source bucket into another S3 destination bucket.
Zappa
Serverless Python