Roman Pearah's starred repositories

polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Language: Rust | License: NOASSERTION | Stargazers: 26909 | Issues: 151 | Issues: 7579

umap

Uniform Manifold Approximation and Projection

Language: Python | License: BSD-3-Clause | Stargazers: 7075 | Issues: 127 | Issues: 773

imbalanced-learn

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

Language: Python | License: MIT | Stargazers: 6727 | Issues: 141 | Issues: 584

neuralforecast

Scalable and user-friendly neural 🧠 forecasting algorithms.

Language: Python | License: Apache-2.0 | Stargazers: 2573 | Issues: 33 | Issues: 471

tabnet

PyTorch implementation of the TabNet paper: https://arxiv.org/pdf/1908.07442.pdf

Language: Python | License: MIT | Stargazers: 2522 | Issues: 38 | Issues: 299

hamilton

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage and metadata. Runs and scales everywhere Python does.

Language: Jupyter Notebook | License: BSD-3-Clause-Clear | Stargazers: 1458 | Issues: 12 | Issues: 239

PDPbox

Python partial dependence plot toolbox

Language: Jupyter Notebook | License: MIT | Stargazers: 829 | Issues: 18 | Issues: 67

azure-storage-fuse

A virtual file system adapter for Azure Blob storage

Language: Go | License: NOASSERTION | Stargazers: 625 | Issues: 37 | Issues: 765

GPBoost

Combining tree-boosting with Gaussian process and mixed effects models

Language: C++ | License: NOASSERTION | Stargazers: 516 | Issues: 12 | Issues: 122

XGBoostLSS

An extension of XGBoost to probabilistic modelling

Language: Python | License: Apache-2.0 | Stargazers: 507 | Issues: 26 | Issues: 41

cookiecutter-fastapi

Cookiecutter template for FastAPI projects using machine learning, Poetry, GitHub Actions, and pytest

Language: Python | License: MIT | Stargazers: 424 | Issues: 7 | Issues: 8

lleaves

Compiler for LightGBM gradient-boosted trees, based on LLVM. Speeds up prediction by ≥10x.

Language: Python | License: MIT | Stargazers: 316 | Issues: 10 | Issues: 40

LightGBMLSS

An extension of LightGBM to probabilistic modelling

Language: Python | License: Apache-2.0 | Stargazers: 232 | Issues: 10 | Issues: 20

fastapi-health

Implement the Health Check API pattern on your FastAPI application! 🚀

Language: Python | License: MIT | Stargazers: 169 | Issues: 6 | Issues: 14

properscoring

Proper scoring rules in Python

Language: Python | License: Apache-2.0 | Stargazers: 153 | Issues: 60 | Issues: 9

GraXpert

GraXpert is an astronomical image processing program for extracting and removing gradients from the background of your astrophotos.

Language: Python | License: GPL-3.0 | Stargazers: 136 | Issues: 15 | Issues: 85

Transformer_Timeseries

PyTorch code for Google's Temporal Fusion Transformer

ML-API

Guide on creating an API for serving your ML model

Language: Jupyter Notebook | Stargazers: 66 | Issues: 4 | Issues: 1

MLAlchemy

Python library to convert YAML/JSON into SQLAlchemy SELECT queries

Language: Python | License: MIT | Stargazers: 41 | Issues: 4 | Issues: 2

aggregate

Tools for creating and working with aggregate probability distributions.

Language: Python | License: BSD-3-Clause | Stargazers: 38 | Issues: 4 | Issues: 0

autolgbm

LightGBM + Optuna: auto-train LightGBM models directly from CSV files, auto-tune them using Optuna, and auto-serve the best model using FastAPI. Inspired by Abhishek Thakur's AutoXGB.

Language: Python | License: Apache-2.0 | Stargazers: 31 | Issues: 1 | Issues: 2

genpipes

Library to write readable and reproducible data processing code using Python.

Language: Python | License: GPL-3.0 | Stargazers: 22 | Issues: 3 | Issues: 7

Py-BoostLSS

An extension of Py-Boost to probabilistic modelling

Language: Python | License: Apache-2.0 | Stargazers: 20 | Issues: 4 | Issues: 3

fastapi-mlflow

Deploy MLflow models as JSON APIs with minimal new code

Language: Python | License: Apache-2.0 | Stargazers: 19 | Issues: 5 | Issues: 12

nested-cross-validation-comparison

Experimenting with various implementations and methods of nested cross-validation in R and Python

Language: R | Stargazers: 16 | Issues: 1 | Issues: 0

mbbefd

Destruction rate modeling with the Maxwell-Boltzmann, Bose-Einstein, Fermi-Dirac (MBBEFD) distribution

pydwt

Modeling tool like DBT to use SQL Alchemy core with a DataFrame interface like

Language: Python | License: NOASSERTION | Stargazers: 10 | Issues: 2 | Issues: 6

double_lift

Double Lift Charts in Python

Language: Python | License: MPL-2.0 | Stargazers: 8 | Issues: 2 | Issues: 1

DeepTriangle

An attempt at the DeepTriangle model as presented by Kevin Kuo/Kasa AI. Hopefully, in time, connected to the ChainLadder package.

Language: Jupyter Notebook | Stargazers: 4 | Issues: 1 | Issues: 1