Avnish Narayan (avnishn)

avnishn

Geek Repo

Company:nvidia

Location:Bay Area, CA

Github PK Tool:Github PK Tool


Organizations
ray-project
rlworkgroup

Avnish Narayan's starred repositories

ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonLicense:Apache-2.0Stargazers:33715Issues:472Issues:18812

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:24290Issues:246Issues:139

mujoco

Multi-Joint dynamics with Contact. A general purpose physics simulator.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8150Issues:106Issues:1527

llama-dl

High-speed download of LLaMA, Facebook's 65B parameter GPT model

Language:ShellLicense:GPL-3.0Stargazers:4168Issues:68Issues:15

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2281Issues:31Issues:90

garage

A toolkit for reproducible reinforcement learning research.

Language:PythonLicense:MITStargazers:1877Issues:56Issues:1012

Metaworld

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Language:PythonLicense:MITStargazers:1264Issues:29Issues:216

deep_learning_curriculum

Language model alignment-focused deep learning curriculum

ray-llm

RayLLM - LLMs on Ray

Language:PythonLicense:Apache-2.0Stargazers:1231Issues:20Issues:89

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:764Issues:11Issues:17

llmperf

LLMPerf is a library for validating and benchmarking LLMs

Language:PythonLicense:Apache-2.0Stargazers:629Issues:9Issues:30

machine-learning-specialization-andrew-ng

A collection of notes and implementations of machine learning algorithms from Andrew Ng's machine learning specialization.

Language:Jupyter NotebookLicense:MITStargazers:584Issues:8Issues:0

mtrl

Multi Task RL Baselines

Language:PythonLicense:MITStargazers:223Issues:9Issues:29

EasyReinforcementLearning

EasyRL: An easy-to-use and comprehensive reinforcement learning package.

Language:PythonLicense:Apache-2.0Stargazers:211Issues:17Issues:3

MirageStock

Open-Source Implementations of Multi-Modal Diffusion Models Optimized for Highest Quality and Ease of Use

Language:PythonLicense:MITStargazers:190Issues:11Issues:1

hash-hop

Long context evaluation for large language models

Language:PythonLicense:MITStargazers:185Issues:7Issues:3

minimal-stable-PPO

A minimal and stable PPO.

raylab

Reinforcement learning algorithms in RLlib

Language:PythonLicense:MITStargazers:56Issues:4Issues:5

sapg

Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)

Language:Jupyter NotebookLicense:MITStargazers:41Issues:5Issues:3
Language:PythonStargazers:5Issues:1Issues:0
Language:PythonLicense:MITStargazers:5Issues:1Issues:0

soccerprojects

My adventures exploring soccer data analysis

Language:JuliaStargazers:1Issues:2Issues:0