Abhinav Gupta (backpropper)

Company: MILA

Location: London, United Kingdom

Home Page: https://www.guabhinav.com

Twitter: @backpropper

Abhinav Gupta's repositories

academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

dm_robotics

Libraries, tools and tasks created and used at DeepMind Robotics.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

ede

Code for the paper "Uncertainty-Driven Exploration for Generalization in Reinforcement Learning".

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

llama

Inference code for LLaMA models

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

llama-recipes

Examples and recipes for Llama model

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

math

The MATH Dataset (NeurIPS 2021)

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

METER

METER: A Multimodal End-to-end TransformER Framework

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

mlx

MLX: An array framework for Apple silicon

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

mlx-examples

Examples in the MLX framework

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

mujoco_menagerie

A collection of high-quality models for the MuJoCo physics engine, curated by DeepMind.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

openai-cookbook

Examples and guides for using the OpenAI API

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

openai-quickstart-python

Python example app from the OpenAI API quickstart tutorial

Language: CSS | Stargazers: 0 | Issues: 1 | Issues: 0

optax

Optax is a gradient processing and optimization library for JAX.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

procgen

Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments

Language: C++ | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

PromptPG

Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

retro

Retro Games in Gym

Language: C | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

Stable-Alignment

Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

summarize-from-feedback

Code for "Learning to summarize from human feedback"

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0
License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

trax

Trax — Deep Learning with Clear Code and Speed

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0