hbot's starred repositories

dotfiles

:wrench: .files, including ~/.macos — sensible hacker defaults for macOS

Language:ShellLicense:MITStargazers:30103Issues:679Issues:389

polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Language:RustLicense:NOASSERTIONStargazers:28680Issues:160Issues:8380

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language:PythonLicense:Apache-2.0Stargazers:27788Issues:247Issues:7019

headlessui

Completely unstyled, fully accessible UI components, designed to integrate beautifully with Tailwind CSS.

Language:TypeScriptLicense:MITStargazers:25463Issues:166Issues:1247

Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

Language:C++License:Apache-2.0Stargazers:21992Issues:716Issues:18223

rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Language:PythonLicense:Apache-2.0Stargazers:18500Issues:350Issues:6644

luigi

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Language:PythonLicense:Apache-2.0Stargazers:17623Issues:474Issues:987

doccano

Open source annotation tool for machine learning practitioners.

Language:PythonLicense:MITStargazers:9329Issues:132Issues:1517

OpenSearch

🔎 Open source distributed and RESTful search engine.

Language:JavaLicense:Apache-2.0Stargazers:9247Issues:141Issues:5452

datasette

An open source multi-tool for exploring and publishing data

Language:PythonLicense:Apache-2.0Stargazers:9189Issues:99Issues:1785

icecream

🍦 Never use print() to debug again.

Language:PythonLicense:MITStargazers:8828Issues:50Issues:130

vowpal_wabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

Language:C++License:NOASSERTIONStargazers:8449Issues:351Issues:1267

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8428Issues:130Issues:1060

gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Language:PythonLicense:MITStargazers:8193Issues:177Issues:137

metaflow

:rocket: Build and manage real-life ML, AI, and data science projects with ease!

Language:PythonLicense:Apache-2.0Stargazers:7949Issues:287Issues:644

joblib

Computing with Python functions.

Language:PythonLicense:BSD-3-ClauseStargazers:3795Issues:64Issues:862

KeyBERT

Minimal keyword extraction with BERT

Language:PythonLicense:MITStargazers:3388Issues:32Issues:200

ecco

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:1956Issues:24Issues:64

tools

The Standard Ebooks toolset for producing our ebook files.

Language:PythonLicense:NOASSERTIONStargazers:1413Issues:38Issues:237

pyjanitor

Clean APIs for data cleaning. Python implementation of R package Janitor

Language:PythonLicense:MITStargazers:1336Issues:18Issues:555

nbgrader

A system for assigning and grading notebooks

Language:PythonLicense:BSD-3-ClauseStargazers:1272Issues:43Issues:921

scikit-lego

Extra blocks for scikit-learn pipelines.

Language:PythonLicense:MITStargazers:1231Issues:27Issues:315

frangipanni

Program to convert lines of text into a tree structure.

Language:GoLicense:MITStargazers:1194Issues:12Issues:19

watermark

An IPython magic extension for printing date and time stamps, version numbers, and hardware information

Language:PythonLicense:NOASSERTIONStargazers:883Issues:13Issues:46

spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

Language:PythonLicense:MITStargazers:782Issues:18Issues:26

bigbird

Transformers for Longer Sequences

Language:PythonLicense:Apache-2.0Stargazers:562Issues:11Issues:33

mabwiser

[IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library

Language:PythonLicense:Apache-2.0Stargazers:206Issues:12Issues:24

capreolus

A toolkit for end-to-end neural ad hoc retrieval

Language:PythonLicense:Apache-2.0Stargazers:96Issues:6Issues:76

streamlit-d3-demo

D3 in React in Streamlit tech demo

Language:TypeScriptLicense:MITStargazers:77Issues:8Issues:2

lottery-ticket-experiments

Experiments on the lottery ticket hypothesis for finding sparse trainable neural networks

Language:PythonLicense:MITStargazers:9Issues:2Issues:1