KathPra

Katharina Prasse's starred repositories

hungarian-algorithm

Python 3 implementation of the Hungarian Algorithm

Language:PythonMIT6900

umap

Uniform Manifold Approximation and Projection

Language:PythonBSD-3-Clause723600

moondream

tiny vision language model

Language:Jupyter NotebookApache-2.0456600

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonApache-2.03083200

imagenette

A smaller subset of 10 easily classified classes from Imagenet, and a little more French

Language:Jupyter NotebookApache-2.092700

bottom-up-attention.pytorch

A PyTorch reimplementation of bottom-up-attention models

Language:Jupyter NotebookApache-2.029100

ITIN

Multimodal Sentiment Analysis with Image-Text Interaction Network

Language:Python1000

pruneshift-public

Language:Python100

adv_mmsegmentation

Language:PythonApache-2.0100

MultiMax

This is the official implementation of our ICML 2024 paper "MultiMax: Sparse and Multi-Modal Attention Learning""

Apache-2.0400

isc2021

Code for the Image similarity challenge.

Language:PythonNOASSERTION19300

graph

Graphs and Graph Algorithms in C++, including Minimum Cost (Lifted) Multicuts

Language:C++23300

Thesis

Language:Python100

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonMIT346300

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language:PythonBSD-3-Clause211600

clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Language:Jupyter NotebookMIT227900

CLIP_benchmark

CLIP-like model evaluation

Language:Jupyter NotebookMIT54200

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter Notebook1052500

KathPra

Katharina Prasse's starred repositories

CLIP-based-NSFW-Detector

hungarian-algorithm

umap

moondream

pytorch-image-models

imagenette

bottom-up-attention.pytorch

ITIN

pruneshift-public

adv_mmsegmentation

MultiMax

isc2021

graph

Thesis

img2dataset

webdataset

clip-retrieval

CLIP_benchmark

llama-recipes

llama3

WaffleCLIP

MetaCLIP

open_clip

natural-adv-examples

vision_transformer

ConvNeXt-V2

imagenet-r

big_vision

vision-language-models-are-bows

grid-feats-vqa