stanbiryukov

stanbiryukov

Geek Repo

Location:USA

Github PK Tool:Github PK Tool

stanbiryukov's starred repositories

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:48580Issues:542Issues:194

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:19199Issues:195Issues:101

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:16972Issues:204Issues:39

uv

An extremely fast Python package installer and resolver, written in Rust.

Language:RustLicense:Apache-2.0Stargazers:11999Issues:27Issues:1489

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:11827Issues:115Issues:475

rye

a Hassle-Free Python Experience

Language:RustLicense:MITStargazers:11653Issues:53Issues:488

transformers.js

State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!

Language:JavaScriptLicense:Apache-2.0Stargazers:7875Issues:60Issues:463

llrt

LLRT (Low Latency Runtime) is an experimental, lightweight JavaScript runtime designed to address the growing demand for fast and efficient Serverless applications.

Language:JavaScriptLicense:Apache-2.0Stargazers:7654Issues:51Issues:113

burn

Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.

Language:RustLicense:Apache-2.0Stargazers:7308Issues:54Issues:543

marimo

A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.

Language:PythonLicense:Apache-2.0Stargazers:4500Issues:22Issues:337

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:4079Issues:41Issues:153
Language:PythonLicense:NOASSERTIONStargazers:3789Issues:63Issues:0

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonLicense:BSD-3-ClauseStargazers:3256Issues:37Issues:288

dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

Language:PythonLicense:NOASSERTIONStargazers:2434Issues:38Issues:22

RAGatouille

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Language:PythonLicense:Apache-2.0Stargazers:2246Issues:21Issues:143

chronos-forecasting

Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting

Language:PythonLicense:Apache-2.0Stargazers:1849Issues:21Issues:32

testcontainers-python

Testcontainers is a Python library that providing a friendly API to run Docker container. It is designed to create runtime environment to use during your automatic tests.

Language:PythonLicense:Apache-2.0Stargazers:1365Issues:17Issues:212

maxtext

A simple, performant and scalable Jax LLM!

Language:PythonLicense:Apache-2.0Stargazers:1298Issues:23Issues:60
Language:PythonLicense:Apache-2.0Stargazers:1079Issues:17Issues:46

evo

DNA foundation modeling from molecular to genome scale

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:807Issues:18Issues:38

GLiNER

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 24

Language:PythonLicense:Apache-2.0Stargazers:696Issues:9Issues:60

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Language:PythonLicense:MITStargazers:377Issues:31Issues:77

GeneGPT

Code and data for GeneGPT.

Language:PythonLicense:NOASSERTIONStargazers:352Issues:16Issues:4

text-clustering

Easily embed, cluster and semantically label text datasets

Language:PythonLicense:Apache-2.0Stargazers:348Issues:35Issues:4

croissant

Croissant is a high-level format for machine learning datasets that brings together four rich layers.

Language:PythonLicense:Apache-2.0Stargazers:271Issues:21Issues:190

optimistix

Nonlinear optimisation (root-finding, least squares, ...) in JAX+Equinox. https://docs.kidger.site/optimistix/

Language:PythonLicense:Apache-2.0Stargazers:250Issues:7Issues:34

mandala

A powerful and easy to use Python framework for experiment tracking and incremental computing

Language:PythonLicense:Apache-2.0Stargazers:231Issues:2Issues:8

earth2mip

Earth-2 Model Intercomparison Project (MIP) is a python framework that enables climate researchers and scientists to inter-compare AI models for weather and climate.

Language:PythonLicense:Apache-2.0Stargazers:141Issues:5Issues:99
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:127Issues:6Issues:0

neuralgcm

Hybrid ML + physics model of the Earth's atmosphere

Language:PythonLicense:Apache-2.0Stargazers:83Issues:5Issues:24