Vasudev Gupta (thevasudevgupta)

thevasudevgupta

Geek Repo

Company:@Unbox-AI

Location:New Delhi, India

Home Page:https://thevasudevgupta.github.io

Twitter:@thevasudevgupta

Github PK Tool:Github PK Tool


Organizations
analytics-club-iitm
Unbox-AI

Vasudev Gupta's starred repositories

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:33337Issues:355Issues:297

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29027Issues:341Issues:267

dash

Data Apps & Dashboards for Python. No JavaScript Required.

Language:PythonLicense:MITStargazers:20721Issues:418Issues:1710

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:18312Issues:155Issues:467

plotly.py

The interactive graphing library for Python :sparkles: This project now includes Plotly Express!

Language:PythonLicense:MITStargazers:15516Issues:279Issues:2860

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:14579Issues:109Issues:929

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonLicense:MITStargazers:9509Issues:64Issues:102

open_clip

An open source implementation of CLIP.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:8876Issues:77Issues:441

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6712Issues:59Issues:137

AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language:PythonLicense:Apache-2.0Stargazers:4472Issues:82Issues:241

open_flamingo

An open-source framework for training large multimodal models.

Language:PythonLicense:MITStargazers:3519Issues:47Issues:170

diffusion-models-class

Materials for the Hugging Face Diffusion Models Course

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3281Issues:101Issues:21

torchscale

Foundation Architecture for (M)LLMs

Language:PythonLicense:MITStargazers:2952Issues:46Issues:75

galai

Model API for GALACTICA

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2658Issues:43Issues:71

safetensors

Simple, safe way to store and distribute tensors

Language:PythonLicense:Apache-2.0Stargazers:2545Issues:42Issues:159

ddim

Denoising Diffusion Implicit Models

Language:PythonLicense:MITStargazers:1267Issues:10Issues:33

streaming

A Data Streaming Library for Efficient Neural Network Training

Language:PythonLicense:Apache-2.0Stargazers:988Issues:20Issues:134

codecarbon

Track emissions from Compute and recommend ways to reduce their impact on the environment.

Language:PythonLicense:MITStargazers:976Issues:21Issues:269

datacomp

DataComp: In search of the next generation of multimodal datasets

Language:PythonLicense:NOASSERTIONStargazers:569Issues:17Issues:58

transformers-bloom-inference

Fast Inference Solutions for BLOOM

Language:PythonLicense:Apache-2.0Stargazers:550Issues:13Issues:64
Language:Jupyter NotebookLicense:MITStargazers:541Issues:8Issues:15

SimCTG

[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation

Language:PythonLicense:MITStargazers:447Issues:9Issues:26

ml-deployment-k8s-fastapi

This project shows how to serve an ONNX-optimized image classification model as a web service with FastAPI, Docker, and Kubernetes.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:183Issues:3Issues:11

olm-datasets

Pipeline for pulling and processing online language model pretraining data from the web

Language:PythonLicense:Apache-2.0Stargazers:169Issues:12Issues:5

jax-smi

JAX Synergistic Memory Inspector

Language:PythonLicense:CC0-1.0Stargazers:147Issues:5Issues:2

transformers_without_tears

Transformers without Tears: Improving the Normalization of Self-Attention

Language:PythonLicense:MITStargazers:128Issues:8Issues:4

gsoc-wav2vec2

GSoC'2021 | TensorFlow implementation of Wav2Vec2

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:89Issues:4Issues:22

streamlit-tensorboard

Streamlit component for TensorBoard, TensorFlow's visualization toolkit

Language:PythonLicense:MITStargazers:37Issues:2Issues:8

count-tokens-hf-datasets

This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Dataflow.

Language:PythonStargazers:22Issues:3Issues:0

gpu-programming

GPU Programming @ IIT Madras

Language:CudaLicense:MITStargazers:2Issues:1Issues:0