Alexey Zemtsov's starred repositories

MOSE

Multi-level Online Sequential Experts (MOSE) for online continual learning problem. (CVPR2024)

Language:PythonStargazers:11Issues:0Issues:0

scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Language:PythonLicense:Apache-2.0Stargazers:144Issues:0Issues:0
Language:PythonStargazers:13Issues:0Issues:0

hindsight-experience-replay

This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.

Language:PythonLicense:MITStargazers:380Issues:0Issues:0

schedule_free

Schedule-Free Optimization in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:1483Issues:0Issues:0

PufferLib

Simplifying reinforcement learning for complex game environments

Language:PythonLicense:MITStargazers:313Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:76Issues:0Issues:0

mintext

Minimal but scalable implementation of large language models in JAX

Language:PythonLicense:Apache-2.0Stargazers:12Issues:0Issues:0

euler-scheduler

My implementation Diffusers-like Scheduler for performing Euler Method on Conditional Flow Matching models

Language:Jupyter NotebookLicense:MITStargazers:6Issues:0Issues:0

Elastic-DT

[NeurIPS 2023] Implementation of Elastic Decision Transformer

Language:CLicense:MITStargazers:20Issues:0Issues:0

online-dt

Online Decision Transformer

Language:PythonLicense:NOASSERTIONStargazers:218Issues:0Issues:0

CLAP

Contrastive Language-Audio Pretraining

Language:PythonLicense:CC0-1.0Stargazers:1216Issues:0Issues:0

Reinformer

Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL

Language:PythonStargazers:19Issues:0Issues:0

JAX-CORL

Clean single-file implementation of offline RL algorithms in JAX

Language:PythonLicense:MITStargazers:35Issues:0Issues:0

long-range-arena

Long Range Arena for Benchmarking Efficient Transformers

Language:PythonLicense:Apache-2.0Stargazers:693Issues:0Issues:0

croc

Easily and securely send things from one computer to another :crocodile: :package:

Language:GoLicense:MITStargazers:26660Issues:0Issues:0

DVL

A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning

Language:PythonStargazers:7Issues:0Issues:0

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language:PythonLicense:MITStargazers:605Issues:0Issues:0

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1205Issues:0Issues:0

tdmpc

Code for "Temporal Difference Learning for Model Predictive Control"

Language:PythonLicense:MITStargazers:295Issues:0Issues:0

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:12910Issues:0Issues:0

tensorneat

GPU-accelerated NeuroEvolution of Augmenting Topologies (NEAT)

Language:PythonStargazers:33Issues:0Issues:0

ThunderKittens

Tile primitives for speedy kernels

Language:CudaLicense:MITStargazers:1281Issues:0Issues:0

pvp

Official release for the code used in paper: Learning from Active Human Involvement through Proxy Value Propagation (NeurIPS 2023 Spotlight)

Language:PythonStargazers:17Issues:0Issues:0
Language:Jupyter NotebookLicense:NOASSERTIONStargazers:6Issues:0Issues:0

Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Language:PythonLicense:Apache-2.0Stargazers:664Issues:0Issues:0

memory-maze

Evaluating long-term memory of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:125Issues:0Issues:0

s5-pytorch

Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)

Language:PythonLicense:MPL-2.0Stargazers:48Issues:0Issues:0
Language:PythonStargazers:8Issues:0Issues:0

flash-attention-jax

Implementation of Flash Attention in Jax

Language:PythonLicense:MITStargazers:181Issues:0Issues:0