Vik Paruchuri (VikParuchuri)

VikParuchuri

Geek Repo

Company:@dataquestio

Location:Brooklyn, NY

Home Page:https://www.vikas.sh

Twitter:@VikParuchuri

Github PK Tool:Github PK Tool


Organizations
dataquestio

Vik Paruchuri's repositories

marker

Convert PDF to markdown + JSON quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:18690Issues:83Issues:284

surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:14708Issues:107Issues:172

zero_to_gpt

Go from no deep learning knowledge to implementing GPT.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1078Issues:37Issues:1

apartment-finder

A Slack bot that helps you find an apartment.

Language:PythonLicense:MITStargazers:1066Issues:54Issues:15

texify

Math OCR model that outputs LaTeX and markdown

Language:PythonLicense:GPL-3.0Stargazers:948Issues:15Issues:13

tabled

Detect and extract tables to markdown and csv

Language:PythonLicense:GPL-3.0Stargazers:689Issues:6Issues:18

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language:PythonLicense:MITStargazers:493Issues:9Issues:6

pdftext

Extract structured text from pdfs quickly

Language:PythonLicense:Apache-2.0Stargazers:364Issues:3Issues:4

libgen_to_txt

Convert all of libgen to high quality markdown

Language:PythonLicense:MITStargazers:243Issues:5Issues:3

researcher

Concise answers to search queries using Google and GPT-3. Includes citations.

Language:PythonLicense:MITStargazers:76Issues:4Issues:0

classified

Score LLM pretraining data with classifiers

Language:PythonLicense:MITStargazers:54Issues:3Issues:0
Language:PythonStargazers:10Issues:2Issues:0

triton_tutorial

Tutorials for Triton, a language for writing gpu kernels

Language:Jupyter NotebookStargazers:6Issues:2Issues:0

nyt-articles

Get articles from new york times API.

Language:PythonStargazers:5Issues:3Issues:0

streamlit-drawable-canvas

Do you like Quick, Draw? Well what if you could train/predict doodles drawn inside Streamlit? Also draws lines, circles and boxes over background images for annotation.

Language:TypeScriptLicense:MITStargazers:4Issues:2Issues:0
Language:SvelteStargazers:3Issues:2Issues:0

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:2Issues:0Issues:0

openedx-scorm-xblock

SCORM XBlock for Open edX

Language:PythonLicense:AGPL-3.0Stargazers:1Issues:2Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:C++License:NOASSERTIONStargazers:1Issues:2Issues:0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

accelerate

๐Ÿš€ A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.

Language:C++License:MITStargazers:0Issues:1Issues:0

bigcode-evaluation-harness

A framework for the evaluation of autoregressive code generation language models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

bitsandbytes

8-bit CUDA functions for PyTorch

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

edx_xblock_scorm

XBlock to display SCORM content within the Open edX LMS. Editable within Open edx Studio. Will save student state and report scores to the progress tab of the course. Currently supports SCORM 1.2 and SCORM 2004 standard.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0

optimum

๐Ÿš€ Accelerate training and inference of ๐Ÿค— Transformers and ๐Ÿค— Diffusers with easy to use hardware optimization tools

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:0Issues:1Issues:0