Ross Wightman (rwightman)

rwightman

Geek Repo

Location:Vancouver, BC

Home Page:rwightman.com

Twitter:@wightmanr

Github PK Tool:Github PK Tool

Ross Wightman's starred repositories

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:32394Issues:346Issues:292

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:22929Issues:187Issues:3572

StableLM

StableLM: Stability AI Language Models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:15853Issues:203Issues:76

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:6926Issues:64Issues:67

open_flamingo

An open-source framework for training large multimodal models.

Language:PythonLicense:MITStargazers:3501Issues:47Issues:168

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language:PythonLicense:MITStargazers:3468Issues:100Issues:159

doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Language:PythonLicense:Apache-2.0Stargazers:3137Issues:42Issues:335

musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Language:PythonLicense:MITStargazers:3037Issues:98Issues:52

EVA

EVA Series: Visual Representation Fantasies from BAAI

Language:PythonLicense:MITStargazers:2003Issues:29Issues:147

GLIP

Grounded Language-Image Pre-training

Language:PythonLicense:MITStargazers:1993Issues:45Issues:166

huggingface_hub

The official Python client for the Huggingface Hub.

Language:PythonLicense:Apache-2.0Stargazers:1725Issues:60Issues:806

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1389Issues:39Issues:61

ConvNeXt-V2

Code release for ConvNeXt V2 model

Language:PythonLicense:NOASSERTIONStargazers:1364Issues:7Issues:64

coyo-dataset

COYO-700M: Large-scale Image-Text Pair Dataset

torchdynamo

A Python-level JIT compiler designed to make unmodified PyTorch programs faster.

Language:PythonLicense:BSD-3-ClauseStargazers:971Issues:47Issues:567

phenaki-pytorch

Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch

Language:PythonLicense:MITStargazers:720Issues:39Issues:30

tensordict

TensorDict is a pytorch dedicated tensor container.

Language:PythonLicense:MITStargazers:605Issues:27Issues:85

datacomp

DataComp: In search of the next generation of multimodal datasets

Language:PythonLicense:NOASSERTIONStargazers:554Issues:17Issues:57
Language:PythonLicense:Apache-2.0Stargazers:553Issues:17Issues:22

maxvit

[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:421Issues:9Issues:20

robotic-transformer-pytorch

Implementation of RT1 (Robotic Transformer) in Pytorch

Language:PythonLicense:MITStargazers:343Issues:10Issues:4

cc2dataset

Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...

Language:PythonLicense:MITStargazers:294Issues:9Issues:33

pypdfium2

Python bindings to PDFium

mlx-llm

Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.

Language:PythonLicense:NOASSERTIONStargazers:257Issues:8Issues:1

chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Language:PythonLicense:Apache-2.0Stargazers:134Issues:10Issues:3

apted

Python APTED algorithm for the Tree Edit Distance

Language:PythonLicense:MITStargazers:83Issues:2Issues:7

timm-models-explorer

Timm model explorer

Language:PythonLicense:MITStargazers:34Issues:1Issues:0

posenet-pytorch

A PyTorch port of Google TensorFlow.js PoseNet (Real-time Human Pose Estimation)

Language:PythonLicense:Apache-2.0Stargazers:22Issues:1Issues:0

timm-lr-scheduler-explorer

A dashboard for exploring timm learning rate schedulers

Language:PythonLicense:MITStargazers:18Issues:2Issues:0