Beast code in Giters

Alexey Zemtsov's starred repositories

MOSE

Multi-level Online Sequential Experts (MOSE) for the online continual learning problem. (CVPR 2024)

Language: Python | 1100

scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Language: Python | License: Apache-2.0 | 14400

Odysseus-Transformer

Language: Python | 1300

hindsight-experience-replay

This is a PyTorch implementation of Hindsight Experience Replay (HER), with experiments on all Fetch robotic environments.

Language: Python | License: MIT | 38000

schedule_free

Schedule-Free Optimization in PyTorch

Language: Python | License: Apache-2.0 | 148300

PufferLib

Simplifying reinforcement learning for complex game environments

Language: Python | License: MIT | 31300

rejax

Language: Python | License: Apache-2.0 | 7600

mintext

Minimal but scalable implementation of large language models in JAX

Language: Python | License: Apache-2.0 | 1200

euler-scheduler

My implementation of a Diffusers-like scheduler for performing the Euler method on Conditional Flow Matching models

Language: Jupyter Notebook | License: MIT | 600

Elastic-DT

[NeurIPS 2023] Implementation of Elastic Decision Transformer

Language: C | License: MIT | 2000

online-dt

Online Decision Transformer

Language: Python | License: NOASSERTION | 21800

CLAP

Contrastive Language-Audio Pretraining

Language: Python | License: CC0-1.0 | 121600

Reinformer

Official code for the ICML 2024 paper "Reinformer: Max-Return Sequence Modeling for Offline RL"

Language: Python | 1900

JAX-CORL

Clean single-file implementation of offline RL algorithms in JAX

Language: Python | License: MIT | 3500

long-range-arena

Long Range Arena for Benchmarking Efficient Transformers

Language: Python | License: Apache-2.0 | 69300

croc

Easily and securely send things from one computer to another 🐊 📦

Language: Go | License: MIT | 2666000

DVL

DVL (Dual-V Learning): a Dual-RL method for offline and online reinforcement learning

Language: Python | 700

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Language: Python | License: MIT | 60500

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLMs at various context lengths to measure accuracy

Language: Jupyter Notebook | License: NOASSERTION | 120500

tdmpc

Code for "Temporal Difference Learning for Model Predictive Control"

Language: Python | License: MIT | 29500

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language: Jupyter Notebook | License: Apache-2.0 | 1291000

tensorneat

GPU-accelerated NeuroEvolution of Augmenting Topologies (NEAT)

Language: Python | 3300

ThunderKittens

Tile primitives for speedy kernels

Language: Cuda | License: MIT | 128100

pvp

Official release of the code used in the paper "Learning from Active Human Involvement through Proxy Value Propagation" (NeurIPS 2023 Spotlight)

Language: Python | 1700

pgeon

Language: Jupyter Notebook | License: NOASSERTION | 600

Mava

🦁 A research-friendly codebase for fast experimentation with multi-agent reinforcement learning in JAX

Language: Python | License: Apache-2.0 | 66400

memory-maze

Evaluating long-term memory of reinforcement learning algorithms

Language: Python | License: MIT | 12500

s5-pytorch

PyTorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)

Language: Python | License: MPL-2.0 | 4800

Craftax_Baselines

Language: Python | 800

flash-attention-jax

Implementation of Flash Attention in JAX

Language: Python | License: MIT | 18100