Abhinav Gupta (backpropper)

backpropper

Geek Repo

Company:MILA

Location:London, United Kingdom

Home Page:https://www.guabhinav.com

Twitter:@backpropper

Github PK Tool:Github PK Tool

Abhinav Gupta's repositories

academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:1Issues:1Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dm_robotics

Libraries, tools and tasks created and used at DeepMind Robotics.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

ede

Code for the paper "Uncertainty-Driven Exploration for Generalization in Reinforcement Learning".

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

License:NOASSERTIONStargazers:0Issues:0Issues:0

human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

License:MITStargazers:0Issues:0Issues:0

inference

Reference implementations of MLPerf™ inference benchmarks

License:Apache-2.0Stargazers:0Issues:0Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

llama

Inference code for LLaMA models

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

llama-recipes

Examples and recipes for Llama model

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

License:Apache-2.0Stargazers:0Issues:0Issues:0

math

The MATH Dataset (NeurIPS 2021)

License:MITStargazers:0Issues:0Issues:0

mlx

MLX: An array framework for Apple silicon

Language:C++License:MITStargazers:0Issues:0Issues:0

mlx-examples

Examples in the MLX framework

License:MITStargazers:0Issues:0Issues:0

openai-cookbook

Examples and guides for using the OpenAI API

Language:PythonStargazers:0Issues:1Issues:0

openai-quickstart-python

Python example app from the OpenAI API quickstart tutorial

Language:CSSStargazers:0Issues:1Issues:0

OpenDevin

🐚 OpenDevin: Code Less, Make More

License:MITStargazers:0Issues:0Issues:0

optax

Optax is a gradient processing and optimization library for JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PromptPG

Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

reward-bench

RewardBench: the first evaluation tool for reward models.

License:Apache-2.0Stargazers:0Issues:0Issues:0

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Stable-Alignment

Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

License:NOASSERTIONStargazers:0Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

License:Apache-2.0Stargazers:0Issues:0Issues:0

summarize-from-feedback

Code for "Learning to summarize from human feedback"

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

training

Reference implementations of MLPerf™ training benchmarks

License:Apache-2.0Stargazers:0Issues:0Issues:0

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0