Patrick Carlson's repositories
shiny-retirement-calc
Shiny app for performing retirement planning via Monte Carlo simulation, FIRE, etc.
docker-spark-datascience
Spark, JupyterLab, and other Data Science tooling via Docker Swarm
carlsonp.github.io
Personal website
bert-from-scratch
Trains an English-language BERT model on Yelp text data to fill in blanked-out words in sentences.
chroma
The AI-native open-source embedding database
chromadb-photo-organizer
A photo similarity and organization tool leveraging the chromadb vector database.
counter-processor
A processor for sorting out stats from log files for datasets for the COUNTER Code of Practice (v5)
DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
dataverse
Open source research data repository software
dataverse-docker
Dataverse 5.8 on Docker is an "Archive in a box" package that can be used as both a demo and a production system and is easily integrated with other services.
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
docker-dask-datascience
Dask, Jupyter Notebook, and other Data Science tooling via Docker Swarm
elm-pure
Pure implementation of ELM (Extreme Learning Machine) in Python (using only NumPy)
great_expectations
Always know what to expect from your data.
hpelm
High performance implementation of Extreme Learning Machines (fast randomized neural networks).
llm-find-jobs
A job posting classification model and LLM to help find open jobs. Also leverages CrewAI and LLMs to search for relevant positions.
mlflow-spark-summit-2019
MLflow Spark Summit 2019 presentation
NLIT-presentation-2019
The "Viz Wars: Tableau vs. Shiny" example presentation material.
NLIT_2019_Sandia_Insights
The presentation, "Sandia Insights: Data Architecture and Framework" for NLIT 2019.
NLIT_2022_Sandia_Insights
The presentation, "Sandia Insights: A Data Sciences Architecture and Framework" for NLIT 2022.
NLIT_2024_Got_Data
The presentation, "Got Data? Discovery is the Key" for NLIT 2024.
R-Finance-Scripts
Various R Markdown and other scripts for finance-related calculations and graphing.
resume-job-description-keyword-matcher
Parses a provided resume and job description for keyword matches and similarity.
shiny-covid-historical
Very simple Shiny app showing COVID-19 historical trends