Jesse Hoogland's starred repositories

pygments

Pygments is a generic syntax highlighter written in Python

Language: Python | License: BSD-2-Clause | Stargazers: 1740 | Issues: 0

polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Language: Rust | License: NOASSERTION | Stargazers: 27021 | Issues: 0

nanotron

Minimalistic large language model 3D-parallelism training

Language: Python | License: Apache-2.0 | Stargazers: 897 | Issues: 0

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Language: Python | License: MIT | Stargazers: 435 | Issues: 0

CircuitsVis

Mechanistic Interpretability Visualizations using React

Language: Jupyter Notebook | License: MIT | Stargazers: 145 | Issues: 0

arrow2

Transmute-free Rust library to work with the Arrow format

Language: Rust | License: Apache-2.0 | Stargazers: 1073 | Issues: 0

torcharrow

High performance model preprocessing library on PyTorch

Language: Python | License: BSD-3-Clause | Stargazers: 635 | Issues: 0

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language: Rust | License: Apache-2.0 | Stargazers: 8580 | Issues: 0

diskhash

Disk-based (persistent) hashtable

Language: C | License: NOASSERTION | Stargazers: 156 | Issues: 0

kneser-ney

Kneser-Ney implementation in Python

Language: Python | Stargazers: 81 | Issues: 0

kenlm

KenLM: Faster and Smaller Language Model Queries

Language: C++ | License: NOASSERTION | Stargazers: 2426 | Issues: 0

opt_einsum

⚡️ Optimizing einsum functions in NumPy, TensorFlow, Dask, and more with contraction order optimization.

Language: Python | License: MIT | Stargazers: 809 | Issues: 0

xla

Enabling PyTorch on XLA Devices (e.g. Google TPU)

Language: C++ | License: NOASSERTION | Stargazers: 2352 | Issues: 0

pyvene

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions

Language: Python | License: Apache-2.0 | Stargazers: 521 | Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 11487 | Issues: 0

algorithmica

A computer science textbook

Language: Jupyter Notebook | Stargazers: 3154 | Issues: 0

penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Language: Python | License: Apache-2.0 | Stargazers: 1496 | Issues: 0

promptsource

Toolkit for creating, sharing and using natural language prompts.

Language: Python | License: Apache-2.0 | Stargazers: 2550 | Issues: 0

kronfluence

Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature

Language: Python | License: Apache-2.0 | Stargazers: 71 | Issues: 0

slt-of-dd

Singular Learning of Double Descent

Language: Jupyter Notebook | Stargazers: 3 | Issues: 0

cookbook

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Language: Python | License: Apache-2.0 | Stargazers: 192 | Issues: 0

ml-engineering

Machine Learning Engineering Open Book

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 10052 | Issues: 0

mistral-inference

Official inference library for Mistral models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 8986 | Issues: 0

mistral

Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.

Language: Python | License: Apache-2.0 | Stargazers: 544 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 127284 | Issues: 0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 5476 | Issues: 0

anonymous_github

Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.

Language: TypeScript | License: GPL-3.0 | Stargazers: 1296 | Issues: 0

Language: HTML | Stargazers: 133 | Issues: 0

aider

aider is AI pair programming in your terminal

Language: Python | License: Apache-2.0 | Stargazers: 11030 | Issues: 0

gd2md-html

Convert a Google Doc to Markdown or HTML. This Docs add-on converts a Google Doc to simple Markdown and/or HTML.

Language: JavaScript | License: Apache-2.0 | Stargazers: 636 | Issues: 0