Alex Furrier's repositories
deepfake-radio
T2S4FWF -> (Text to Speech for Fun With Friends)
data-science-utils
Various code to aid in data science projects for tasks involving data cleaning, ETL, EDA, NLP, viz, feature engineering, feature selection, etc.
Entity-Sentiment-Extraction
Pipeline for extracting sentiment towards entities
applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
python-collab-template
🛠 Python project template with unit tests, code coverage, linting, type checking, Makefile wrapper, and GitHub Actions.
rustchain-discord-bot
A Discord bot for LLM chain apps
Supervised-Learning-Reproducible-Analysis-Example
Example reproducible analysis project for a supervised learning data science problem
Unsupervised-Learning-Reproducible-Analysis-Example
Example reproducible analysis project utilizing unsupervised learning techniques (clustering, dimensionality reduction)
CalibreLibgenStore
A Libgen Fiction store plugin for Calibre
ClassMetrics
A module with boilerplate code for computing and plotting common classification metrics. Flexible to multiclass problems.
cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
data_science_toolbox
Various code to aid in data science projects for tasks involving data cleaning, ETL, EDA, NLP, viz, feature engineering, feature selection, model validation, etc.
default-data-science-project
A default project structure for data projects with a focus on reproducible research and build automation
famous-last-words
Fiddling with GPT2 and other NLP models for interesting corpus text generation
Google-Sheets-Scraping
Scraping the UA salary database with a Python script
lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
mlops-zoomcamp
Free MLOps course from DataTalks.Club
pandas_exercises
Practice your pandas skills!
Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
Rec-Center-Counts
Rec Center Count data for finding the optimal time to visit
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch, TensorFlow, and JAX.
UA-PTS-Parking-Tickets-15-16
21,947 reasons not to park illegally: Mapping the hot spots of parking tickets issued by PTS