Shashank Gupta (shatu)


Company: Allen Institute for AI (AI2)

Location: Seattle, Washington

Home Page: https://shashankgupta.info/

Twitter: @shashank_bits

Organizations
CogComp

Shashank Gupta's repositories

adapter-transformers

Huggingface Transformers + Adapters = ❤️

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0
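
As a rough illustration of the adapter workflow, the sketch below assumes the AdapterHub fork's AutoModelWithHeads API; the model, adapter, and head names are placeholders rather than anything taken from this repository.

    from transformers import AutoModelWithHeads  # class provided by the adapter-transformers fork

    # Load a base model, attach a task adapter and a matching classification head,
    # then freeze everything except the adapter (and head) weights for training.
    model = AutoModelWithHeads.from_pretrained("bert-base-uncased")
    model.add_adapter("sst-2")                            # illustrative adapter name
    model.add_classification_head("sst-2", num_labels=2)
    model.train_adapter("sst-2")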

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0
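
The instruct-tuning recipe here is built around low-rank adapters; a minimal sketch of that idea using the Hugging Face peft library, with a placeholder checkpoint path and illustrative hyperparameters rather than the repository's exact settings:

    from transformers import AutoModelForCausalLM
    from peft import LoraConfig, get_peft_model

    # "path/to/llama-7b" is a placeholder; alpaca-lora targets LLaMA checkpoints.
    base = AutoModelForCausalLM.from_pretrained("path/to/llama-7b")

    lora = LoraConfig(
        r=8, lora_alpha=16, lora_dropout=0.05,
        target_modules=["q_proj", "v_proj"],  # attention projections to adapt
        task_type="CAUSAL_LM",
    )
    model = get_peft_model(base, lora)
    model.print_trainable_parameters()        # only the low-rank adapter weights stay trainable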

DialoGPT

Large-scale pretraining for dialogue

Language: Python · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0
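
The released checkpoints load through Hugging Face Transformers; a minimal generation sketch, assuming the microsoft/DialoGPT-medium checkpoint:

    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("microsoft/DialoGPT-medium")
    model = AutoModelForCausalLM.from_pretrained("microsoft/DialoGPT-medium")

    # DialoGPT expects an EOS token after each dialogue turn.
    input_ids = tokenizer.encode("Does money buy happiness?" + tokenizer.eos_token, return_tensors="pt")
    reply_ids = model.generate(input_ids, max_length=100, pad_token_id=tokenizer.eos_token_id)
    print(tokenizer.decode(reply_ids[0, input_ids.shape[-1]:], skip_special_tokens=True))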

awesome-system-design-resources

This repository contains system design resources that are useful when preparing for interviews and learning distributed systems

License: GPL-3.0 · Stargazers: 0 · Issues: 0 · Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Language: Python · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0
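
A rough sketch of the usual entry point, deepspeed.initialize, which wraps an existing PyTorch model in a training engine; the toy model and config values are illustrative, and a real run is typically launched with the deepspeed launcher on one or more GPUs.

    import torch
    import deepspeed

    model = torch.nn.Linear(10, 1)  # toy model standing in for a real network

    ds_config = {
        "train_batch_size": 8,
        "optimizer": {"type": "Adam", "params": {"lr": 1e-3}},
        "zero_optimization": {"stage": 2},  # ZeRO stage 2: partition optimizer state and gradients
    }

    engine, optimizer, _, _ = deepspeed.initialize(
        model=model,
        model_parameters=model.parameters(),
        config=ds_config,
    )

    x = torch.randn(8, 10).to(engine.device)
    loss = engine(x).mean()    # forward pass through the wrapped model
    engine.backward(loss)      # DeepSpeed handles scaling and gradient partitioning
    engine.step()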

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 0 · Issues: 2 · Issues: 0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

Generating_Text_Summary_With_GPT2

A simple approach to using GPT-2 medium (345M) to generate high-quality text summaries with minimal training.

Language: Jupyter Notebook · Stargazers: 0 · Issues: 1 · Issues: 0

gorilla

Gorilla: An API store for LLMs

License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

langchain

⚡ Building applications with LLMs through composability ⚡

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0
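
Composability here means wiring prompts, models, and other components into pipelines; a small sketch using the classic LLMChain interface, where the prompt is illustrative and an OpenAI API key is assumed to be configured:

    from langchain.llms import OpenAI
    from langchain.prompts import PromptTemplate
    from langchain.chains import LLMChain

    prompt = PromptTemplate(
        input_variables=["product"],
        template="Suggest one name for a company that makes {product}.",
    )
    chain = LLMChain(llm=OpenAI(temperature=0.7), prompt=prompt)  # prompt -> LLM pipeline
    print(chain.run("reusable water bottles"))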

NeuralDialog-CVAE

TensorFlow implementation of the Knowledge-Guided CVAE for dialog generation (ACL 2017), released by Tiancheng Zhao (Tony) from the Dialog Research Center, LTI, CMU

Language: OpenEdge ABL · License: Apache-2.0 · Stargazers: 0 · Issues: 1 · Issues: 0

nlp_tasks

Natural Language Processing Tasks and References

License: Apache-2.0 · Stargazers: 0 · Issues: 2 · Issues: 0

OpenNMT-tf

Neural machine translation and sequence learning using TensorFlow

Language: Python · License: MIT · Stargazers: 0 · Issues: 2 · Issues: 0

ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Language: Python · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

PyMarlin

Lightweight Deep Learning Model Training library based on PyTorch

Language: Python · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0

pytorch-pretrained-BERT

📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer-XL.

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 1 · Issues: 0

shatu.github.io

Code for the personal website

Language: HTML · Stargazers: 0 · Issues: 1 · Issues: 0

SimCSE

EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0
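
The released checkpoints can also be used directly through Hugging Face Transformers; a minimal similarity sketch, assuming the princeton-nlp/sup-simcse-bert-base-uncased checkpoint:

    import torch
    from transformers import AutoModel, AutoTokenizer

    name = "princeton-nlp/sup-simcse-bert-base-uncased"
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModel.from_pretrained(name)

    sentences = ["A man is playing a guitar.", "Someone is playing an instrument."]
    inputs = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        embeddings = model(**inputs).pooler_output  # sentence embeddings for the supervised checkpoints

    print(torch.cosine_similarity(embeddings[0], embeddings[1], dim=0).item())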

SpaceFusion

An implementation of the SpaceFusion model, https://arxiv.org/abs/1902.11205

Language: Python · Stargazers: 0 · Issues: 1 · Issues: 0

spaCy

💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython

Language: Python · License: MIT · Stargazers: 0 · Issues: 2 · Issues: 0
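
A typical pipeline call looks like the sketch below, assuming the small English model has been installed with python -m spacy download en_core_web_sm:

    import spacy

    nlp = spacy.load("en_core_web_sm")
    doc = nlp("Apple is looking at buying a U.K. startup for $1 billion.")

    # One pipeline pass yields tokens, part-of-speech tags, dependencies, and entities.
    for token in doc:
        print(token.text, token.pos_, token.dep_)
    for ent in doc.ents:
        print(ent.text, ent.label_)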

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

TheoremQA

The dataset and code for the paper "TheoremQA: A Theorem-driven Question Answering Dataset"

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

ThoughtSource

A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

tree-of-thought-llm

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Stargazers: 0 · Issues: 0 · Issues: 0

unify-parameter-efficient-tuning

Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0