Vik Paruchuri (VikParuchuri)

VikParuchuri

User data from Github https://github.com/VikParuchuri

Company:@dataquestio

Location:Brooklyn, NY

Home Page:https://www.vikas.sh

GitHub:@VikParuchuri

Twitter:@VikParuchuri


Organizations
datalab-to
dataquestio

Vik Paruchuri's repositories

zero_to_gpt

Go from no deep learning knowledge to implementing GPT.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1277Issues:34Issues:2

texify

Math OCR model that outputs LaTeX and markdown

Language:PythonLicense:GPL-3.0Stargazers:1093Issues:15Issues:13

apartment-finder

A Slack bot that helps you find an apartment.

Language:PythonLicense:MITStargazers:1065Issues:54Issues:15

tabled

Detect and extract tables to markdown and csv

Language:PythonLicense:GPL-3.0Stargazers:755Issues:7Issues:18

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language:PythonLicense:MITStargazers:506Issues:8Issues:6

libgen_to_txt

Convert all of libgen to high quality markdown

Language:PythonLicense:MITStargazers:253Issues:3Issues:3

researcher

Concise answers to search queries using Google and GPT-3. Includes citations.

Language:PythonLicense:MITStargazers:81Issues:1Issues:0

triton_tutorial

Tutorials for Triton, a language for writing gpu kernels

Language:Jupyter NotebookStargazers:56Issues:3Issues:0

classified

Score LLM pretraining data with classifiers

Language:PythonLicense:MITStargazers:55Issues:3Issues:0

scan

Score essays automatically with an easy web interface.

Language:PythonLicense:AGPL-3.0Stargazers:41Issues:5Issues:6
Language:PythonStargazers:10Issues:2Issues:0

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5Issues:0Issues:0

medicare-analysis

Analyze medicare data from the recent release.

Language:CSSStargazers:5Issues:3Issues:0

nyt-articles

Get articles from new york times API.

Language:PythonStargazers:5Issues:2Issues:0

streamlit-drawable-canvas

Do you like Quick, Draw? Well what if you could train/predict doodles drawn inside Streamlit? Also draws lines, circles and boxes over background images for annotation.

Language:TypeScriptLicense:MITStargazers:5Issues:2Issues:0
Language:SvelteStargazers:4Issues:2Issues:0

olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:C++License:NOASSERTIONStargazers:2Issues:1Issues:0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

accelerate

๐Ÿš€ A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

openedx-scorm-xblock

SCORM XBlock for Open edX

Language:PythonLicense:AGPL-3.0Stargazers:1Issues:1Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.

Language:C++License:MITStargazers:0Issues:0Issues:0

bigcode-evaluation-harness

A framework for the evaluation of autoregressive code generation language models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

bitsandbytes

8-bit CUDA functions for PyTorch

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

edx_xblock_scorm

XBlock to display SCORM content within the Open edX LMS. Editable within Open edx Studio. Will save student state and report scores to the progress tab of the course. Currently supports SCORM 1.2 and SCORM 2004 standard.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

optimum

๐Ÿš€ Accelerate training and inference of ๐Ÿค— Transformers and ๐Ÿค— Diffusers with easy to use hardware optimization tools

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:0Issues:1Issues:0