Patrick Carlson's repositories
shiny-retirement-calc
Shiny app for performing retirement planning via Monte Carlo simulation, FIRE, etc.
docker-spark-datascience
Spark, JupyterLab, and other Data Science tooling via Docker Swarm
carlsonp.github.io
Personal website
bert-from-scratch
Trains an English-language BERT model on Yelp text data to fill in missing words in sentences.
chroma
The AI-native open-source embedding database
chromadb-photo-organizer
A photo similarity and organization tool leveraging the chromadb vector database.
ckan
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers datahub.io, catalog.data.gov and europeandataportal.eu/data/en/dataset among many other sites.
counter-processor
A processor for extracting dataset statistics from log files for the COUNTER Code of Practice (v5)
DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
dataverse
Open source research data repository software
dataverse-docker
Dataverse 5.8 on Docker is an "Archive in a box" package that can be used as both a demo and a production system and is easily integrated with other services.
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
docker-dask-datascience
Dask, Jupyter Notebook, and other Data Science tooling via Docker Swarm
document-api-python
Create and modify Tableau workbook and datasource files
elm-pure
Pure implementation of ELM (Extreme Learning Machine) in Python (using only NumPy)
great_expectations
Always know what to expect from your data.
hpelm
High performance implementation of Extreme Learning Machines (fast randomized neural networks).
mlflow-spark-summit-2019
MLflow Spark Summit 2019 Presentation
NLIT-presentation-2019
The "Viz Wars: Tableau vs. Shiny" example presentation material.
NLIT_2019_Sandia_Insights
The presentation, "Sandia Insights: Data Architecture and Framework" for NLIT 2019.
NLIT_2022_Sandia_Insights
The presentation, "Sandia Insights: A Data Sciences Architecture and Framework" for NLIT 2022.
NLIT_2024_Got_Data
The presentation, "Got Data? Discovery is the Key" for NLIT 2024.
R-Finance-Scripts
Various R Markdown and other scripts for finance-related calculations and graphing.
RallyRestToolkitForPython
Python Toolkit for Rally
shiny-covid-historical
Very simple Shiny app showing COVID-19 historical trends