Varun Kumar (varunkumar-dev)

Company: Amazon

Location: USA

Home Page: http://varunkumar-dev.github.io/

Varun Kumar's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook, License: NOASSERTION, Stargazers: 67569, Issues: 559, Issues: 710

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python, License: MIT, Stargazers: 36276, Issues: 368, Issues: 315

comprehensive-rust

This is the Rust course used by the Android team at Google. It provides you the material to quickly teach Rust.

Language: Rust, License: Apache-2.0, Stargazers: 27525, Issues: 139, Issues: 285

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

tree-sitter

An incremental parsing system for programming tools

flash-attention

Fast and memory-efficient exact attention

Language: Python, License: BSD-3-Clause, Stargazers: 13438, Issues: 115, Issues: 1031

static-analysis

⚙️ A curated list of static analysis (SAST) tools and linters for all programming languages, config files, build tools, and more. The focus is on tools which improve code quality.

Language: Rust, License: MIT, Stargazers: 13207, Issues: 322, Issues: 575

semgrep

Lightweight static analysis for many languages. Find bug variants with patterns that look like source code.

Language: OCaml, License: LGPL-2.1, Stargazers: 10398, Issues: 103, Issues: 2967

amazon-sagemaker-examples

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 9985, Issues: 265, Issues: 1427

aws-doc-sdk-examples

Welcome to the AWS Code Examples Repository. This repo contains code examples used in the AWS documentation, AWS SDK Developer Guides, and more. For more information, see the Readme.md file below.

Language: Java, License: Apache-2.0, Stargazers: 9415, Issues: 204, Issues: 2261

trl

Train transformer language models with reinforcement learning.

Language: Python, License: Apache-2.0, Stargazers: 9311, Issues: 73, Issues: 1100

text-generation-inference

Large Language Model Text Generation Inference

Language: Python, License: Apache-2.0, Stargazers: 8785, Issues: 98, Issues: 1296

CodeGeeX

CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)

Language: Python, License: Apache-2.0, Stargazers: 8117, Issues: 85, Issues: 215

llama-recipes

Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Llama2 for WhatsApp & Messenger.

Language: Jupyter Notebook, License: NOASSERTION, Stargazers: 7850, Issues: 68, Issues: 227

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language: Python, License: MIT, Stargazers: 7671, Issues: 143, Issues: 46

GLM-130B

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Language: Python, License: Apache-2.0, Stargazers: 7653, Issues: 99, Issues: 198

latexify_py

A library to generate LaTeX expression from Python code.

Language: Python, License: Apache-2.0, Stargazers: 7159, Issues: 56, Issues: 82

AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language: Python, License: Apache-2.0, Stargazers: 4534, Issues: 82, Issues: 243

PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

open_flamingo

An open-source framework for training large multimodal models.

Language: Python, License: MIT, Stargazers: 3659, Issues: 47, Issues: 174

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License: MIT, Stargazers: 1563, Issues: 20, Issues: 0

diplomacy_cicero

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Language: Python, License: NOASSERTION, Stargazers: 1279, Issues: 23, Issues: 20

bigcode-evaluation-harness

A framework for the evaluation of autoregressive code generation language models.

Language: Python, License: Apache-2.0, Stargazers: 777, Issues: 12, Issues: 138

transformers-bloom-inference

Fast Inference Solutions for BLOOM

Language: Python, License: Apache-2.0, Stargazers: 556, Issues: 12, Issues: 64

python-linters-and-code-analysis

Python Linters and Code Analysis tools curated list

TOXIGEN

This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.

Language: Jupyter Notebook, License: NOASSERTION, Stargazers: 270, Issues: 8, Issues: 20

Language: Python, License: Apache-2.0, Stargazers: 96, Issues: 4, Issues: 2

recode

Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"

Language: Python, License: Apache-2.0, Stargazers: 46, Issues: 2, Issues: 3

prod-neural-materials

Background materials for the article "Productivity Assessment of Neural Code Completion"

Language: R, Stargazers: 9, Issues: 1, Issues: 0