Derek Willis's starred repositories
generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
newscatcher
Programmatically collect normalized news from (almost) any website.
secret-llama
Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3.
open-parse
Improved file parsing for LLM’s
news-please
news-please - an integrated web crawler and information extractor for news that just works
databonsai
clean & curate your data with LLMs.
flask-muck
🧹 Flask REST framework for generating CRUD APIs and OpenAPI specs in the SQLAlchemy, Marshmallow/Pydantic application stack.
DatawRappr
R-Package to connect to the Datawrapper-API
censusdis
censusdis is a package for discovering, loading, analyzing, and computing diversity, integration, and segregation metrics to U.S. Census demographic data. It is designed to be intuitive and Pythonic, but give users access to the full collection of data and maps the U.S. Census publishes via their APIs.
2024-openai-gpt-hiring-racial-discrimination
Data and materials to reproduce Bloomberg's investigation into racial and gender bias in OpenAI's GPT
whisper-audio-transcriber
Whisper Audio Transcriber: Streamlined tool for converting audio to text using the powerful Whisper ASR model. User-friendly and efficient.
course-materials
This is the course repository for the Spring 2024 iteration of MACS 30123 "Large-Scale Computing for the Social Sciences" at the University of Chicago.
plotting-county-election-results
🇺🇸🏁 Draw a beautiful county-level election results map with only a few lines of code
tulsa-1921-data
Data files associated with our story on the 1921 race massacre in Tulsa, Oklahoma.
stats-notes
Notes for teaching statistics
data-institute-2023
Materials used to teach the 2023 Data Institute.