avnishn

Avnish Narayan's starred repositories

ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python · Apache-2.0 · 33715 472 18812

llm.c

LLM training in simple, raw C/CUDA

Language: Cuda · MIT · 24290 246 139

mujoco

Multi-Joint dynamics with Contact. A general purpose physics simulator.

Language: Jupyter Notebook · Apache-2.0 · 8150 106 1527

llama-dl

High-speed download of LLaMA, Facebook's 65B parameter GPT model

Language: Shell · GPL-3.0 · 4168 68 15

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language: Jupyter Notebook · Apache-2.0 · 2281 31 90

garage

A toolkit for reproducible reinforcement learning research.

Language: Python · MIT · 1877 56 1012

Metaworld

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Language: Python · MIT · 1264 29 216

deep_learning_curriculum

Language model alignment-focused deep learning curriculum

1262 17 1

ray-llm

RayLLM - LLMs on Ray

Language: Python · Apache-2.0 · 1231 20 89

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language: Jupyter Notebook · Apache-2.0 · 764 11 17

llmperf

LLMPerf is a library for validating and benchmarking LLMs

Language: Python · Apache-2.0 · 629 9 30

machine-learning-specialization-andrew-ng

A collection of notes and implementations of machine learning algorithms from Andrew Ng's machine learning specialization.

Language: Jupyter Notebook · MIT · 584 80

mtrl

Multi Task RL Baselines

Language: Python · MIT · 223 9 29

EasyReinforcementLearning

EasyRL: An easy-to-use and comprehensive reinforcement learning package.

Language: Python · Apache-2.0 · 211 17 3

MirageStock

Open-Source Implementations of Multi-Modal Diffusion Models Optimized for Highest Quality and Ease of Use

Language: Python · MIT · 190 11 1

hash-hop

Long context evaluation for large language models

Language: Python · MIT · 185 7 3

minimal-stable-PPO

A minimal and stable PPO.

Language: Python · 116 1 3

raylab

Reinforcement learning algorithms in RLlib

Language: Python · MIT · 56 4 5

sapg

Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)

Language: Jupyter Notebook · MIT · 41 5 3

Betha

Language: Python · 5 10

language-world

Language: Python · MIT · 5 10

soccerprojects

My adventures exploring soccer data analysis

1 20

hebi

Language: Julia · 1 20