Zhangir Azerbayev's repositories

ProofNet

Benchmark for undergraduate-level formal mathematics

Language:LeanLicense:MITStargazers:83Issues:6Issues:7

proof-pile

Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.

Language:Jupyter NotebookStargazers:16Issues:2Issues:0
Language:PythonLicense:MITStargazers:14Issues:4Issues:0
Language:PythonStargazers:10Issues:0Issues:0

repl

A simple REPL for Lean 4, returning information about errors and sorries.

Language:LeanStargazers:10Issues:0Issues:0

pySagredo

LM based automatic theorem prover that can write code, respond to error messages, and look up docs.

Language:PythonLicense:MITStargazers:7Issues:2Issues:0

mm-extract

Extracting human readable pre-training data from set.mm

Language:PythonStargazers:5Issues:2Issues:0

nn-generalization

Neural Network Generalization Reading List

License:MITStargazers:3Issues:1Issues:0

ETK

Code for "Explicit Knowledge Transfer for Weakly Supervised Code Generation"

Language:Jupyter NotebookLicense:MITStargazers:2Issues:2Issues:0

llemma_formal2formal

Llemma formal2formal (tactic prediction) theorem proving experiments

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

math_cc_net

Tools to download and cleanup Common Crawl data

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookLicense:MITStargazers:1Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

doremi

Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dotfiles

config files, symbolically linked to correct locations

Language:ShellStargazers:0Issues:1Issues:0

infinigen

Infinite Photorealistic Worlds using Procedural Generation

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

lean-chat-server

Server for lean-chat

Language:TypeScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

levanter

Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

llmstep

llmstep: [L]LM proofstep suggestions in Lean 4.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

mathematics_in_lean

The user home repository for the Mathematics in Lean tutorial.

Language:HTMLStargazers:0Issues:0Issues:0

mathport

Experimenting with porting proofnet and minif2f

Language:LeanLicense:Apache-2.0Stargazers:0Issues:0Issues:0

miniF2F

Adding Lean 4

Language:Objective-C++License:MITStargazers:0Issues:0Issues:0

mizar-mirror

storing mizar files here to make them easy to access through the REST api.

Language:PythonStargazers:0Issues:1Issues:0

nvim-config

nvim configuration

Language:LuaLicense:MITStargazers:0Issues:0Issues:0
Language:XSLTStargazers:0Issues:0Issues:0

seqax

seqax = sequence modeling + JAX

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

tiny-ml-projects

misc. machine learning projects monorepo

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:HTMLLicense:MITStargazers:0Issues:0Issues:0