Arengard's repositories
amazon-textract-code-samples
Amazon Textract Code Samples
ArcticDB
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
astro
astro_user
botasaurus
The All in One Framework to build Awesome Scrapers.
cds
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
dfply
dplyr-style piping operations for pandas dataframes
lazy-nvim-ide
My 💤 LazyVim IDE config for Neovim
lineapy
Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
opensanctions
An open database of international sanctions data, persons of interest and politically exposed persons
Pandas-to_sql-upsert
Extends the pandas to_sql function to perform multi-threaded, concurrent "insert or update" commands in memory
polars
Fast multi-threaded DataFrame library in Rust and Python
PowerQueryFunctional
Power Query utility library with a functional twist
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
mage-ai
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
nyoom.nvim
A Neovim framework and Doom Emacs alternative for the stubborn martian hacker. Powered by Fennel and the oxocarbon theme
polars_ds_extension
Polars extension for general data science use cases
public-gateway-checker
Checks which public gateways are online or not
python-diskcache
Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python.
PythonDataScienceHandbook
Python Data Science Handbook: full text in Jupyter Notebooks
yente
API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec.