Likun Cai's starred repositories

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language: Python · License: Apache-2.0 · Stars: 225

ALIP

[ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption

Language: Python · Stars: 87

clip-rocket

Code release for "Improved baselines for vision-language pre-training"

Language: Python · License: NOASSERTION · Stars: 53

textaugment

TextAugment: Text Augmentation Library

Language: Python · License: MIT · Stars: 387

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python · License: MIT · Stars: 8792
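
A minimal usage sketch of the kind of tokenizer minbpe provides. It assumes the repo's BasicTokenizer class with train/encode/decode methods, as shown in its README; verify the exact names against the repository before relying on them.

    # Hedged sketch: assumes minbpe's BasicTokenizer with train/encode/decode.
    from minbpe import BasicTokenizer

    text = open("corpus.txt", encoding="utf-8").read()   # any raw training text (placeholder path)
    tokenizer = BasicTokenizer()
    tokenizer.train(text, vocab_size=512)   # 256 raw byte tokens + 256 learned BPE merges

    ids = tokenizer.encode("hello world")   # text -> list of token ids
    assert tokenizer.decode(ids) == "hello world"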

llama3-from-scratch

llama3 implementation, one matrix multiplication at a time

Language: Jupyter Notebook · License: MIT · Stars: 11384

OpenAnnotate3D

OpenAnnotate3D: Open-Vocabulary Auto-Labeling System for Multi-modal Data

Language: Jupyter Notebook · Stars: 71

perceiver-pytorch

Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch

Language: Python · License: MIT · Stars: 1064
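
A hedged usage sketch for this implementation; the constructor arguments and the channels-last input layout follow the repo's README example and should be checked there, since defaults may have changed.

    # Sketch assuming the perceiver-pytorch README interface (channels-last image input).
    import torch
    from perceiver_pytorch import Perceiver

    model = Perceiver(
        input_channels=3,     # RGB image
        input_axis=2,         # two spatial axes
        num_freq_bands=6,     # Fourier positional-encoding bands
        max_freq=10.0,
        depth=6,              # number of cross-attend + latent-transformer blocks
        num_latents=256,
        latent_dim=512,
        num_classes=1000,
    )

    img = torch.randn(1, 224, 224, 3)   # note: channels last
    logits = model(img)                 # -> (1, 1000)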

perceiver-io

A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training

Language: Python · License: Apache-2.0 · Stars: 419

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model; a minimal prompted-inference sketch follows below.

Language: Jupyter Notebook · License: Apache-2.0 · Stars: 45734
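
A compact version of that inference flow: load a checkpoint, embed the image once, then prompt with points. The model type, checkpoint path, image file, and click coordinates below are placeholders.

    # SAM point-prompted inference; checkpoint path, image, and click are placeholders.
    import cv2
    import numpy as np
    from segment_anything import sam_model_registry, SamPredictor

    sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
    predictor = SamPredictor(sam)

    image = cv2.cvtColor(cv2.imread("example.jpg"), cv2.COLOR_BGR2RGB)
    predictor.set_image(image)                 # compute the image embedding once

    masks, scores, _ = predictor.predict(
        point_coords=np.array([[500, 375]]),   # one foreground click (x, y)
        point_labels=np.array([1]),            # 1 = foreground, 0 = background
        multimask_output=True,                 # return three candidate masks with scores
    )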

DeCLIP

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

Language: Python · Stars: 621

grok-1

Grok open release

Language: Python · License: Apache-2.0 · Stars: 49195

siamese-triplet

Siamese and triplet networks with online pair/triplet mining in PyTorch

Language: Python · License: BSD-3-Clause · Stars: 3079

onlinetripletmining

Fast Online Triplet mining in Pytorch

Language: Python · Stars: 8
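
The two repositories above revolve around online (in-batch) triplet mining. The snippet below is a generic batch-hard sketch of that idea in plain PyTorch, written here for illustration rather than taken from either repo.

    # Generic batch-hard online triplet mining (illustration, not code from either repo).
    import torch
    import torch.nn.functional as F

    def batch_hard_triplet_loss(embeddings, labels, margin=0.2):
        dist = torch.cdist(embeddings, embeddings, p=2)     # (B, B) pairwise distances
        same = labels.unsqueeze(0) == labels.unsqueeze(1)   # (B, B) same-label mask
        eye = torch.eye(len(labels), dtype=torch.bool, device=labels.device)

        # hardest positive: farthest same-label embedding (excluding self)
        hardest_pos = (dist * (same & ~eye).float()).max(dim=1).values
        # hardest negative: closest different-label embedding
        hardest_neg = dist.masked_fill(same, float("inf")).min(dim=1).values

        return F.relu(hardest_pos - hardest_neg + margin).mean()

    emb = F.normalize(torch.randn(32, 128), dim=1)   # stand-in batch of embeddings
    labels = torch.randint(0, 8, (32,))
    loss = batch_hard_triplet_loss(emb, labels)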

slot-attention

Implementation of Slot Attention from GoogleAI

Language: Python · License: MIT · Stars: 364
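
A hedged usage sketch following this repo's README: a SlotAttention module groups N input feature vectors into K slot vectors through a few iterations of attention. The argument names are taken from the README and should be verified there.

    # Sketch assuming the slot_attention package's SlotAttention(num_slots, dim, iters) interface.
    import torch
    from slot_attention import SlotAttention

    slot_attn = SlotAttention(num_slots=5, dim=512, iters=3)

    inputs = torch.randn(2, 1024, 512)   # (batch, num_inputs, feature_dim), e.g. flattened CNN features
    slots = slot_attn(inputs)            # -> (2, 5, 512): five slot vectors per sample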

detr

End-to-End Object Detection with Transformers

Language: Python · License: Apache-2.0 · Stars: 13167
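
DETR publishes Torch Hub entry points, so a minimal inference sketch needs no local clone; the model name and the 0.9 confidence threshold below are illustrative choices.

    # Minimal DETR inference via Torch Hub ('detr_resnet50' is one of the published entry points).
    import torch

    model = torch.hub.load("facebookresearch/detr", "detr_resnet50", pretrained=True).eval()

    img = torch.randn(1, 3, 800, 1066)   # stand-in for a normalized image batch
    with torch.no_grad():
        out = model(img)                 # dict with 'pred_logits' and 'pred_boxes'

    probs = out["pred_logits"].softmax(-1)[0, :, :-1]   # drop the "no object" class
    keep = probs.max(-1).values > 0.9                   # keep confident queries only
    boxes = out["pred_boxes"][0, keep]                  # (cx, cy, w, h), normalized to [0, 1]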

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language: Python · License: BSD-3-Clause · Stars: 2124
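
A minimal sketch of the tar-shard streaming pattern webdataset is built around; the shard URL pattern and the per-sample keys ("jpg", "cls") are placeholders for whatever a given dataset actually stores.

    # Sketch: stream (image, label) pairs from tar shards; shard pattern and keys are placeholders.
    import webdataset as wds
    import torchvision.transforms as T
    from torch.utils.data import DataLoader

    url = "data/shard-{000000..000099}.tar"   # brace-expanded list of tar shards
    preprocess = T.Compose([T.Resize(256), T.CenterCrop(224), T.ToTensor()])

    dataset = (
        wds.WebDataset(url)
        .shuffle(1000)                        # shuffle within a rolling in-memory buffer
        .decode("pil")                        # decode image bytes to PIL
        .to_tuple("jpg", "cls")               # pick the .jpg and .cls members of each sample
        .map_tuple(preprocess, lambda y: y)   # resize/crop images, leave labels untouched
    )

    loader = DataLoader(dataset, batch_size=64, num_workers=4)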

OTTER

This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described in the paper.

Language: Python · License: MIT · Stars: 64

img2dataset

Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine; a hedged usage sketch follows below.

Language: Python · License: MIT · Stars: 3471
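
The library exposes a Python entry point that mirrors its CLI; in the sketch below, the URL list, output folder, and sizes are placeholders.

    # Sketch of img2dataset's download() entry point; paths and sizes are placeholders.
    from img2dataset import download

    download(
        url_list="urls.txt",            # one image URL per line (csv/parquet also supported)
        output_folder="dataset",
        output_format="webdataset",     # write tar shards readable by webdataset
        image_size=256,                 # resize while downloading
        processes_count=8,
        thread_count=64,
    )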

DownloadConceptualCaptions

Reliably download millions of images efficiently

Language: Jupyter Notebook · License: MIT · Stars: 110

conceptual-captions

Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image captioning systems.

Language: Shell · License: NOASSERTION · Stars: 506

conceptual-12m

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

License: NOASSERTION · Stars: 345

CVPR-2023-24-Papers

CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!

Language: Python · License: MIT · Stars: 354

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language: Jupyter Notebook · License: Apache-2.0 · Stars: 8469
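
The released backbones load via Torch Hub; a minimal feature-extraction sketch ("dinov2_vits14" is one of the published variants, and input sides must be multiples of its 14-pixel patch size):

    # Minimal DINOv2 feature extraction via Torch Hub.
    import torch

    model = torch.hub.load("facebookresearch/dinov2", "dinov2_vits14").eval()

    img = torch.randn(1, 3, 224, 224)   # stand-in for a normalized image; 224 = 16 patches of 14 px
    with torch.no_grad():
        feats = model(img)              # global image embedding, (1, 384) for ViT-S/14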

dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Language: Python · License: Apache-2.0 · Stars: 6104

Awesome-Information-Bottleneck

A curated list for the Information Bottleneck Principle, in memory of Professor Naftali Tishby.

License: MIT · Stars: 294

Tree-Transformer

Implementation of the paper Tree Transformer

Language: Python · Stars: 208

UniCL

[CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"

Language: Python · License: MIT · Stars: 378