usr-av's repositories
amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
appsmith
A web framework to build admin panels and internal tools.
argo-client-python
Python client for Argo Workflows
argo-helm
ArgoProj Helm Charts
connectors
Connectors for Delta Lake
couler
Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.
CyFHIR
A Neo4j Plugin for Handling HL7 FHIR Data
datahub
A Generalized Metadata Search & Discovery Tool
datahub-helm
Repository of helm charts for deploying DataHub on a Kubernetes cluster
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
dev-setup
macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapReduce, AWS, Heroku, JavaScript web development, Android development, common data stores, and dev-based OS X defaults.
fastapi-course
This repository complements the Udemy FastAPI course
FHIR
The IBM® FHIR® Server and related projects
FHIR-from-Jupyter
Originally developed for FHIR DevDays 2020
handsonscala
Discussion and code examples for the book Hands-on Scala Programming
katacoda-scenarios
Katacoda Scenarios
OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
podman-scripts
Script examples - Bash, PowerShell, etc.
PublicScripts
Scripts that are public for anyone's use that I have created over time
Python
All Algorithms implemented in Python
python-cheatsheet
Comprehensive Python Cheatsheet
spark
Apache Spark - A unified analytics engine for large-scale data processing
spark-on-k8s-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
superset
Apache Superset is a Data Visualization and Data Exploration Platform
ToolJet
ToolJet is an open-source low-code platform for building and deploying internal tools with minimal engineering effort 🚀
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
trino-db2
Db2 JDBC connector for Trino
trino-event-stream
Stream events from Trino to a Kafka topic