Sourab Mangrulkar (pacman100)

Company: Amazon

Location: 🇮🇳

Home Page: https://pacman100.github.io/

Twitter: @sourab_m

Sourab Mangrulkar's starred repositories

t-few

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

Language: Python | License: MIT | Stargazers: 426 | Issues: 0

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), so it combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embedding.

Language: Python | License: Apache-2.0 | Stargazers: 12470 | Issues: 0

petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Language: Python | License: MIT | Stargazers: 9117 | Issues: 0

natural-instructions

Expanding natural instructions

Language: Python | License: Apache-2.0 | Stargazers: 950 | Issues: 0

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License: MIT | Stargazers: 1574 | Issues: 0

pandarallel

A simple and efficient tool to parallelize Pandas operations on all available CPUs

Language: Python | License: BSD-3-Clause | Stargazers: 3645 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 359 | Issues: 0

cc2dataset

Easily convert Common Crawl to a dataset of captions and documents: image/text, audio/text, video/text, ...

Language: Python | License: MIT | Stargazers: 304 | Issues: 0

annotated_deep_learning_paper_implementations

🧑‍🏫 60+ implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Python | License: MIT | Stargazers: 54467 | Issues: 0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 19629 | Issues: 0

deep-rl-class

This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.

Language: MDX | License: Apache-2.0 | Stargazers: 3837 | Issues: 0

P-tuning-v2

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

Language: Python | License: Apache-2.0 | Stargazers: 1968 | Issues: 0

Language: Python | Stargazers: 144 | Issues: 0

ml-stable-diffusion

Stable Diffusion with Core ML on Apple Silicon

Language: Python | License: MIT | Stargazers: 16735 | Issues: 0

diffusion-models-class

Materials for the Hugging Face Diffusion Models Course

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3559 | Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 13626 | Issues: 0

torchscale

Foundation Architecture for (M)LLMs

Language: Python | License: MIT | Stargazers: 3004 | Issues: 0

PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

Stargazers: 4058 | Issues: 0

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language: Python | License: MIT | Stargazers: 10432 | Issues: 0

electra

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

Language: Python | License: Apache-2.0 | Stargazers: 2326 | Issues: 0

DeBERTa

The implementation of DeBERTa

Language: Python | License: MIT | Stargazers: 1973 | Issues: 0

parallelformers

Parallelformers: An Efficient Model Parallelization Toolkit for Deployment

Language: Python | License: Apache-2.0 | Stargazers: 776 | Issues: 0

galai

Model API for GALACTICA

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2675 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 73 | Issues: 0

E.T.

Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal transformer that encodes language inputs and the full episode history of visual observations and actions.

Language: C | License: MIT | Stargazers: 85 | Issues: 0

teach

TEACh is a dataset of human-human interactive dialogues to complete tasks in a simulated household environment.

Language: Python | Stargazers: 134 | Issues: 0

text-generation-inference

Large Language Model Text Generation Inference

Language: Python | License: Apache-2.0 | Stargazers: 8855 | Issues: 0

Language: Python | Stargazers: 20 | Issues: 0

triton

Development repository for the Triton language and compiler

Language: C++ | License: MIT | Stargazers: 12919 | Issues: 0

pytorch_geometric

Graph Neural Network Library for PyTorch

Language: Python | License: MIT | Stargazers: 21081 | Issues: 0