Katharina Prasse (KathPra)

Location: Germany

Katharina Prasse's starred repositories

Language: Python | License: NOASSERTION | Stargazers: 293 | Issues: 0

hungarian-algorithm

Python 3 implementation of the Hungarian Algorithm

Language: Python | License: MIT | Stargazers: 69 | Issues: 0

umap

Uniform Manifold Approximation and Projection

Language: Python | License: BSD-3-Clause | Stargazers: 7236 | Issues: 0

moondream

tiny vision language model

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4566 | Issues: 0

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language: Python | License: Apache-2.0 | Stargazers: 30832 | Issues: 0

imagenette

A smaller subset of 10 easily classified classes from Imagenet, and a little more French

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 927 | Issues: 0

bottom-up-attention.pytorch

A PyTorch reimplementation of bottom-up-attention models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 291 | Issues: 0

ITIN

Multimodal Sentiment Analysis with Image-Text Interaction Network

Language: Python | Stargazers: 10 | Issues: 0

Language: Python | Stargazers: 1 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0

MultiMax

This is the official implementation of our ICML 2024 paper "MultiMax: Sparse and Multi-Modal Attention Learning"

License: Apache-2.0 | Stargazers: 4 | Issues: 0

isc2021

Code for the Image similarity challenge.

Language: Python | License: NOASSERTION | Stargazers: 193 | Issues: 0

graph

Graphs and Graph Algorithms in C++, including Minimum Cost (Lifted) Multicuts

Language: C++ | Stargazers: 233 | Issues: 0

Language: Python | Stargazers: 1 | Issues: 0

img2dataset

Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.

Language: Python | License: MIT | Stargazers: 3463 | Issues: 0

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language: Python | License: BSD-3-Clause | Stargazers: 2116 | Issues: 0

clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Language: Jupyter Notebook | License: MIT | Stargazers: 2279 | Issues: 0

CLIP_benchmark

CLIP-like model evaluation

Language: Jupyter Notebook | License: MIT | Stargazers: 542 | Issues: 0

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | Stargazers: 10525 | Issues: 0

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 23387 | Issues: 0

WaffleCLIP

Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts"

Language: Python | License: MIT | Stargazers: 49 | Issues: 0

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Language: Python | License: NOASSERTION | Stargazers: 1123 | Issues: 0

open_clip

An open source implementation of CLIP.

Language: Python | License: NOASSERTION | Stargazers: 9261 | Issues: 0

natural-adv-examples

A Harder ImageNet Test Set (CVPR 2021)

Language: Python | License: MIT | Stargazers: 579 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 9785 | Issues: 0

ConvNeXt-V2

Code release for ConvNeXt V2 model

Language: Python | License: NOASSERTION | Stargazers: 1420 | Issues: 0

imagenet-r

ImageNet-R(endition) and DeepAugment (ICCV 2021)

Language: Python | License: MIT | Stargazers: 248 | Issues: 0

big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2050 | Issues: 0

vision-language-models-are-bows

Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023

Language: Python | License: MIT | Stargazers: 217 | Issues: 0

grid-feats-vqa

Grid features pre-training code for visual question answering

Language: Python | License: Apache-2.0 | Stargazers: 268 | Issues: 0